Skip to main content

Showing 1–50 of 84 results for author: Wan, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.12550  [pdf, other

    cs.LG

    UniTE: A Survey and Unified Pipeline for Pre-training ST Trajectory Embeddings

    Authors: Yan Lin, Zeyu Zhou, Yicheng Liu, Haochen Lv, Haomin Wen, Tianyi Li, Yushuai Li, Christian S. Jensen, Shengnan Guo, Youfang Lin, Huaiyu Wan

    Abstract: Spatio-temporal (ST) trajectories are sequences of timestamped locations, which enable a variety of analyses that in turn enable important real-world applications. It is common to map trajectories to vectors, called embeddings, before subsequent analyses. Thus, the qualities of embeddings are very important. Methods for pre-training embeddings, which leverage unlabeled trajectories for training un… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

  2. arXiv:2407.09096  [pdf, other

    cs.LG cs.AI

    STD-LLM: Understanding Both Spatial and Temporal Properties of Spatial-Temporal Data with LLMs

    Authors: Yiheng Huang, Xiaowei Mao, Shengnan Guo, Yubin Chen, Youfang Lin, Huaiyu Wan

    Abstract: Spatial-temporal forecasting and imputation are important for real-world dynamic systems such as intelligent transportation, urban planning, and public health. Most existing methods are tailored for individual forecasting or imputation tasks but are not designed for both. Additionally, they are less effective for zero-shot and few-shot learning. While large language models (LLMs) have exhibited st… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  3. arXiv:2406.20015  [pdf, other

    cs.CL cs.AI

    ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models

    Authors: Yuxiang Zhang, Jing Chen, Junjie Wang, Yaxin Liu, Cheng Yang, Chufan Shi, Xinyu Zhu, Zihao Lin, Hanwen Wan, Yujiu Yang, Tetsuya Sakai, Tian Feng, Hayato Yamana

    Abstract: Tool-augmented large language models (LLMs) are rapidly being integrated into real-world applications. Due to the lack of benchmarks, the community still needs to fully understand the hallucination issues within these models. To address this challenge, we introduce a comprehensive diagnostic benchmark, ToolBH. Specifically, we assess the LLM's hallucinations through two perspectives: depth and bre… ▽ More

    Submitted 28 June, 2024; originally announced June 2024.

  4. arXiv:2406.00734  [pdf, other

    cs.LG

    GLADformer: A Mixed Perspective for Graph-level Anomaly Detection

    Authors: Fan Xu, Nan Wang, Hao Wu, Xuezhi Wen, Dalin Zhang, Siyang Lu, Binyong Li, Wei Gong, Hai Wan, Xibin Zhao

    Abstract: Graph-Level Anomaly Detection (GLAD) aims to distinguish anomalous graphs within a graph dataset. However, current methods are constrained by their receptive fields, struggling to learn global features within the graphs. Moreover, most contemporary methods are based on spatial domain and lack exploration of spectral characteristics. In this paper, we propose a multi-perspective hybrid graph-level… ▽ More

    Submitted 3 July, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

  5. arXiv:2405.12459  [pdf, other

    cs.LG

    PLM4Traj: Cognizing Movement Patterns and Travel Purposes from Trajectories with Pre-trained Language Models

    Authors: Zeyu Zhou, Yan Lin, Haomin Wen, Shengnan Guo, Jilin Hu, Youfang Lin, Huaiyu Wan

    Abstract: Spatio-temporal trajectories play a vital role in various spatio-temporal data mining tasks. Developing a versatile trajectory learning approach that can adapt to different tasks while ensuring high accuracy is crucial. This requires effectively extracting movement patterns and travel purposes embedded in trajectories. However, this task is challenging due to limitations in the size and quality of… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  6. arXiv:2404.19141  [pdf, other

    cs.LG

    Micro-Macro Spatial-Temporal Graph-based Encoder-Decoder for Map-Constrained Trajectory Recovery

    Authors: Tonglong Wei, Youfang Lin, Yan Lin, Shengnan Guo, Lan Zhang, Huaiyu Wan

    Abstract: Recovering intermediate missing GPS points in a sparse trajectory, while adhering to the constraints of the road network, could offer deep insights into users' moving behaviors in intelligent transportation systems. Although recent studies have demonstrated the advantages of achieving map-constrained trajectory recovery via an end-to-end manner, they still face two significant challenges. Firstly,… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: This paper has been accepted as a regular paper at IEEE TKDE

  7. arXiv:2403.11432  [pdf, other

    cs.RO cs.AI cs.LG

    Demystifying the Physics of Deep Reinforcement Learning-Based Autonomous Vehicle Decision-Making

    Authors: Hanxi Wan, Pei Li, Arpan Kusari

    Abstract: With the advent of universal function approximators in the domain of reinforcement learning, the number of practical applications leveraging deep reinforcement learning (DRL) has exploded. Decision-making in autonomous vehicles (AVs) has emerged as a chief application among them, taking the sensor data or the higher-order kinematic variables as the input and providing a discrete choice or continuo… ▽ More

    Submitted 13 June, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

    Comments: Submitted for peer-review

  8. arXiv:2403.09733  [pdf, other

    cs.CL cs.AI

    OverleafCopilot: Empowering Academic Writing in Overleaf with Large Language Models

    Authors: Haomin Wen, Zhenjie Wei, Yan Lin, Jiyuan Wang, Yuxuan Liang, Huaiyu Wan

    Abstract: The rapid development of Large Language Models (LLMs) has facilitated a variety of applications from different domains. In this technical report, we explore the integration of LLMs and the popular academic writing tool, Overleaf, to enhance the efficiency and quality of academic writing. To achieve the above goal, there are three challenges: i) including seamless interaction between Overleaf and L… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  9. arXiv:2403.05268  [pdf, ps, other

    cs.CL cs.LG

    Deep Prompt Multi-task Network for Abuse Language Detection

    Authors: Jian Zhu, Yuping Ruan, Jingfei Chang, Wenhui Sun, Hui Wan, Jian Long, Cheng Luo

    Abstract: The detection of abusive language remains a long-standing challenge with the extensive use of social networks. The detection task of abusive language suffers from limited accuracy. We argue that the existing detection methods utilize the fine-tuning technique of the pre-trained language models (PLMs) to handle downstream tasks. Hence, these methods fail to stimulate the general knowledge of the PL… ▽ More

    Submitted 24 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: Accepted by the International Conference on Pattern Recognition (ICPR) 2024

  10. arXiv:2402.10426  [pdf, other

    cs.CL

    DELL: Generating Reactions and Explanations for LLM-Based Misinformation Detection

    Authors: Herun Wan, Shangbin Feng, Zhaoxuan Tan, Heng Wang, Yulia Tsvetkov, Minnan Luo

    Abstract: Large language models are limited by challenges in factuality and hallucinations to be directly employed off-the-shelf for judging the veracity of news articles, where factual accuracy is paramount. In this work, we propose DELL that identifies three key stages in misinformation detection where LLMs could be incorporated as part of the pipeline: 1) LLMs could \emph{generate news reactions} to repr… ▽ More

    Submitted 4 July, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  11. arXiv:2402.07369  [pdf, other

    cs.LG

    Diff-RNTraj: A Structure-aware Diffusion Model for Road Network-constrained Trajectory Generation

    Authors: Tonglong Wei, Youfang Lin, Shengnan Guo, Yan Lin, Yiheng Huang, Chenyang Xiang, Yuqing Bai, Menglu Ya, Huaiyu Wan

    Abstract: Trajectory data is essential for various applications as it records the movement of vehicles. However, publicly available trajectory datasets remain limited in scale due to privacy concerns, which hinders the development of trajectory data mining and trajectory-based applications. To address this issue, some methods for generating synthetic trajectories have been proposed to expand the scale of th… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

  12. arXiv:2402.07232  [pdf, other

    cs.LG

    UVTM: Universal Vehicle Trajectory Modeling with ST Feature Domain Generation

    Authors: Yan Lin, Jilin Hu, Shengnan Guo, Bin Yang, Christian S. Jensen, Youfang Lin, Huaiyu Wan

    Abstract: Vehicle movement is frequently captured in the form of trajectories, i.e., sequences of timestamped locations. Numerous methods exist that target different tasks involving trajectories such as travel-time estimation, trajectory recovery, and trajectory prediction. However, most methods target only one specific task and cannot be applied universally. Existing efforts to create a universal trajector… ▽ More

    Submitted 23 April, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

  13. arXiv:2402.04454  [pdf, other

    cs.NI

    Evolving Mobile Cloud Gaming with 5G Standalone Network Telemetry

    Authors: Haoran Wan, Kyle Jamieson

    Abstract: Mobile cloud gaming places the simultaneous demands of high capacity and low latency on the wireless network, demands that Private and Metropolitan-Area Standalone 5G networks are poised to meet. However, lacking introspection into the 5G Radio Access Network (RAN), cloud gaming servers are ill-poised to cope with the vagaries of the wireless last hop to a mobile client, while 5G network operators… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  14. arXiv:2402.00371  [pdf, other

    cs.CL

    What Does the Bot Say? Opportunities and Risks of Large Language Models in Social Media Bot Detection

    Authors: Shangbin Feng, Herun Wan, Ningnan Wang, Zhaoxuan Tan, Minnan Luo, Yulia Tsvetkov

    Abstract: Social media bot detection has always been an arms race between advancements in machine learning bot detectors and adversarial bot strategies to evade detection. In this work, we bring the arms race to the next level by investigating the opportunities and risks of state-of-the-art large language models (LLMs) in social bot detection. To investigate the opportunities, we design novel LLM-based bot… ▽ More

    Submitted 4 July, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: ACL 2024

  15. arXiv:2401.17188  [pdf, other

    cs.IT cs.AI

    Nested Construction of Polar Codes via Transformers

    Authors: Sravan Kumar Ankireddy, S Ashwin Hebbar, Heping Wan, Joonyoung Cho, Charlie Zhang

    Abstract: Tailoring polar code construction for decoding algorithms beyond successive cancellation has remained a topic of significant interest in the field. However, despite the inherent nested structure of polar codes, the use of sequence models in polar code construction is understudied. In this work, we propose using a sequence modeling framework to iteratively construct a polar code for any given lengt… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: 7 pages; 8 figures

  16. arXiv:2312.06441  [pdf, other

    cs.LG cs.AI cs.SI

    Revisiting Graph-Based Fraud Detection in Sight of Heterophily and Spectrum

    Authors: Fan Xu, Nan Wang, Hao Wu, Xuezhi Wen, Xibin Zhao, Hai Wan

    Abstract: Graph-based fraud detection (GFD) can be regarded as a challenging semi-supervised node binary classification task. In recent years, Graph Neural Networks (GNN) have been widely applied to GFD, characterizing the anomalous possibility of a node by aggregating neighbor information. However, fraud graphs are inherently heterophilic, thus most of GNNs perform poorly due to their assumption of homophi… ▽ More

    Submitted 8 July, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

  17. arXiv:2312.01601  [pdf, other

    cs.AI

    Local-Global History-aware Contrastive Learning for Temporal Knowledge Graph Reasoning

    Authors: Wei Chen, Huaiyu Wan, Yuting Wu, Shuyuan Zhao, Jiayaqi Cheng, Yuxin Li, Youfang Lin

    Abstract: Temporal knowledge graphs (TKGs) have been identified as a promising approach to represent the dynamics of facts along the timeline. The extrapolation of TKG is to predict unknowable facts happening in the future, holding significant practical value across diverse fields. Most extrapolation studies in TKGs focus on modeling global historical fact repeating and cyclic patterns, as well as local his… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

    Comments: 14 pages, Accept ICDE2024

  18. arXiv:2311.08705  [pdf, other

    cs.CL

    Evaluating Robustness of Dialogue Summarization Models in the Presence of Naturally Occurring Variations

    Authors: Ankita Gupta, Chulaka Gunasekara, Hui Wan, Jatin Ganhotra, Sachindra Joshi, Marina Danilevsky

    Abstract: Dialogue summarization task involves summarizing long conversations while preserving the most salient information. Real-life dialogues often involve naturally occurring variations (e.g., repetitions, hesitations) and existing dialogue summarization models suffer from performance drop on such conversations. In this study, we systematically investigate the impact of such variations on state-of-the-a… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  19. arXiv:2311.01759  [pdf, other

    cs.LG cs.AR

    TinyFormer: Efficient Transformer Design and Deployment on Tiny Devices

    Authors: Jianlei Yang, Jiacheng Liao, Fanding Lei, Meichen Liu, Junyi Chen, Lingkun Long, Han Wan, Bei Yu, Weisheng Zhao

    Abstract: Developing deep learning models on tiny devices (e.g. Microcontroller units, MCUs) has attracted much attention in various embedded IoT applications. However, it is challenging to efficiently design and deploy recent advanced models (e.g. transformers) on tiny devices due to their severe hardware resource constraints. In this work, we propose TinyFormer, a framework specifically designed to develo… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  20. arXiv:2310.13411  [pdf, other

    cs.CL

    Towards Enhancing Relational Rules for Knowledge Graph Link Prediction

    Authors: Shuhan Wu, Huaiyu Wan, Wei Chen, Yuting Wu, Junfeng Shen, Youfang Lin

    Abstract: Graph neural networks (GNNs) have shown promising performance for knowledge graph reasoning. A recent variant of GNN called progressive relational graph neural network (PRGNN), utilizes relational rules to infer missing knowledge in relational digraphs and achieves notable results. However, during reasoning with PRGNN, two important properties are often overlooked: (1) the sequentiality of relatio… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: Accepted at Findings of EMNLP2023

  21. arXiv:2309.11902  [pdf, other

    cs.NI cs.AR

    A Switch Architecture for Time-Triggered Transmission with Best-Effort Delivery

    Authors: Zonghui Li, Wenlin Zhu, Kang G. Shin, Hai Wan, Xiaoyu Song, Dong Yang, Bo Ai

    Abstract: In Time-Triggered (TT) or time-sensitive networks, the transmission of a TT frame is required to be scheduled at a precise time instant for industrial distributed real-time control systems. Other (or {\em best-effort} (BE)) frames are forwarded in a BE manner. Under this scheduling strategy, the transmission of a TT frame must wait until its scheduled instant even if it could have been transmitted… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: 14 pages

  22. arXiv:2309.04891  [pdf, other

    cs.CV cs.AI cs.IT

    How to Evaluate Semantic Communications for Images with ViTScore Metric?

    Authors: Tingting Zhu, Bo Peng, Jifan Liang, Tingchen Han, Hai Wan, Jingqiao Fu, Junjie Chen

    Abstract: Semantic communications (SC) have been expected to be a new paradigm shifting to catalyze the next generation communication, whose main concerns shift from accurate bit transmission to effective semantic information exchange in communications. However, the previous and widely-used metrics for images are not applicable to evaluate the image semantic similarity in SC. Classical metrics to measure th… ▽ More

    Submitted 20 April, 2024; v1 submitted 9 September, 2023; originally announced September 2023.

  23. arXiv:2309.01194  [pdf, other

    cs.AI

    A Survey on Service Route and Time Prediction in Instant Delivery: Taxonomy, Progress, and Prospects

    Authors: Haomin Wen, Youfang Lin, Lixia Wu, Xiaowei Mao, Tianyue Cai, Yunfeng Hou, Shengnan Guo, Yuxuan Liang, Guangyin Jin, Yiji Zhao, Roger Zimmermann, Jieping Ye, Huaiyu Wan

    Abstract: Instant delivery services, such as food delivery and package delivery, have achieved explosive growth in recent years by providing customers with daily-life convenience. An emerging research area within these services is service Route\&Time Prediction (RTP), which aims to estimate the future service route as well as the arrival time of a given worker. As one of the most crucial tasks in those serv… ▽ More

    Submitted 3 September, 2023; originally announced September 2023.

  24. arXiv:2308.13760  [pdf, other

    cs.AI cs.CL cs.IR

    How Can Context Help? Exploring Joint Retrieval of Passage and Personalized Context

    Authors: Hui Wan, Hongkang Li, Songtao Lu, Xiaodong Cui, Marina Danilevsky

    Abstract: The integration of external personalized context information into document-grounded conversational systems has significant potential business value, but has not been well-studied. Motivated by the concept of personalized context-aware document-grounded conversational systems, we introduce the task of context-aware passage retrieval. We also construct a dataset specifically curated for this purpose… ▽ More

    Submitted 26 August, 2023; originally announced August 2023.

  25. arXiv:2307.16246  [pdf, other

    cs.LG cs.AI

    DRL4Route: A Deep Reinforcement Learning Framework for Pick-up and Delivery Route Prediction

    Authors: Xiaowei Mao, Haomin Wen, Hengrui Zhang, Huaiyu Wan, Lixia Wu, Jianbin Zheng, Haoyuan Hu, Youfang Lin

    Abstract: Pick-up and Delivery Route Prediction (PDRP), which aims to estimate the future service route of a worker given his current task pool, has received rising attention in recent years. Deep neural networks based on supervised learning have emerged as the dominant model for the task because of their powerful ability to capture workers' behavior patterns from massive historical data. Though promising,… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: Accepted by KDD23

  26. arXiv:2307.03048  [pdf, other

    cs.LG

    Origin-Destination Travel Time Oracle for Map-based Services

    Authors: Yan Lin, Huaiyu Wan, Jilin Hu, Shengnan Guo, Bin Yang, Youfang Lin, Christian S. Jensen

    Abstract: Given an origin (O), a destination (D), and a departure time (T), an Origin-Destination (OD) travel time oracle~(ODT-Oracle) returns an estimate of the time it takes to travel from O to D when departing at T. ODT-Oracles serve important purposes in map-based services. To enable the construction of such oracles, we provide a travel-time estimation (TTE) solution that leverages historical trajectori… ▽ More

    Submitted 6 July, 2023; originally announced July 2023.

    Comments: 15 pages, 12 figures, accepted by SIGMOD International Conference on Management of Data 2024

  27. arXiv:2306.10675  [pdf, other

    cs.DB cs.AI

    LaDe: The First Comprehensive Last-mile Delivery Dataset from Industry

    Authors: Lixia Wu, Haomin Wen, Haoyuan Hu, Xiaowei Mao, Yutong Xia, Ergang Shan, Jianbin Zhen, Junhong Lou, Yuxuan Liang, Liuqing Yang, Roger Zimmermann, Youfang Lin, Huaiyu Wan

    Abstract: Real-world last-mile delivery datasets are crucial for research in logistics, supply chain management, and spatio-temporal data mining. Despite a plethora of algorithms developed to date, no widely accepted, publicly available last-mile delivery dataset exists to support research in this field. In this paper, we introduce \texttt{LaDe}, the first publicly available last-mile delivery dataset with… ▽ More

    Submitted 2 January, 2024; v1 submitted 18 June, 2023; originally announced June 2023.

  28. arXiv:2305.04750  [pdf, other

    cs.RO cs.AI cs.LG

    Sense, Imagine, Act: Multimodal Perception Improves Model-Based Reinforcement Learning for Head-to-Head Autonomous Racing

    Authors: Elena Shrestha, Chetan Reddy, Hanxi Wan, Yulun Zhuang, Ram Vasudevan

    Abstract: Model-based reinforcement learning (MBRL) techniques have recently yielded promising results for real-world autonomous racing using high-dimensional observations. MBRL agents, such as Dreamer, solve long-horizon tasks by building a world model and planning actions by latent imagination. This approach involves explicitly learning a model of the system dynamics and using it to learn the optimal poli… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

  29. arXiv:2304.12090  [pdf, other

    cs.AI

    Reinforcement Learning with Knowledge Representation and Reasoning: A Brief Survey

    Authors: Chao Yu, Xuejing Zheng, Hankz Hankui Zhuo, Hai Wan, Weilin Luo

    Abstract: Reinforcement Learning(RL) has achieved tremendous development in recent years, but still faces significant obstacles in addressing complex real-life problems due to the issues of poor system generalization, low sample efficiency as well as safety and interpretability concerns. The core reason underlying such dilemmas can be attributed to the fact that most of the work has focused on the computati… ▽ More

    Submitted 24 April, 2023; originally announced April 2023.

  30. arXiv:2303.03677  [pdf, other

    cs.CY cs.AI cs.LG

    Training Machine Learning Models to Characterize Temporal Evolution of Disadvantaged Communities

    Authors: Milan Jain, Narmadha Meenu Mohankumar, Heng Wan, Sumitrra Ganguly, Kyle D Wilson, David M Anderson

    Abstract: Disadvantaged communities (DAC), as defined by the Justice40 initiative of the Department of Energy (DOE), USA, identifies census tracts across the USA to determine where benefits of climate and energy investments are or are not currently accruing. The DAC status not only helps in determining the eligibility for future Justice40-related investments but is also critical for exploring ways to achiev… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

  31. arXiv:2302.00381  [pdf, other

    cs.SI

    BotPercent: Estimating Bot Populations in Twitter Communities

    Authors: Zhaoxuan Tan, Shangbin Feng, Melanie Sclar, Herun Wan, Minnan Luo, Yejin Choi, Yulia Tsvetkov

    Abstract: Twitter bot detection is vital in combating misinformation and safeguarding the integrity of social media discourse. While malicious bots are becoming more and more sophisticated and personalized, standard bot detection approaches are still agnostic to social environments (henceforth, communities) the bots operate at. In this work, we introduce community-specific bot detection, estimating the perc… ▽ More

    Submitted 18 October, 2023; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: Accepted to findings of EMNLP 2023

  32. arXiv:2301.13629  [pdf, other

    cs.LG

    DiffSTG: Probabilistic Spatio-Temporal Graph Forecasting with Denoising Diffusion Models

    Authors: Haomin Wen, Youfang Lin, Yutong Xia, Huaiyu Wan, Qingsong Wen, Roger Zimmermann, Yuxuan Liang

    Abstract: Spatio-temporal graph neural networks (STGNN) have emerged as the dominant model for spatio-temporal graph (STG) forecasting. Despite their success, they fail to model intrinsic uncertainties within STG data, which cripples their practicality in downstream tasks for decision-making. To this end, this paper focuses on probabilistic STG forecasting, which is challenging due to the difficulty in mode… ▽ More

    Submitted 9 March, 2024; v1 submitted 31 January, 2023; originally announced January 2023.

    Comments: Accepted to the 31st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems

  33. arXiv:2301.03128  [pdf, ps, other

    cs.IT

    Compress-and-Forward via Multilevel Coding and Trellis Coded Quantization

    Authors: Heping Wan, Anders Host-Madsen, Aria Nosratinia

    Abstract: Compress-forward (CF) relays can improve communication rates even when the relay cannot decode the source signal. Efficient implementation of CF is a topic of contemporary interest, in part because of its potential impact on wireless technologies such as cloud-RAN. There exists a gap between the performance of CF implementations in the high spectral efficiency regime and the corresponding informat… ▽ More

    Submitted 8 January, 2023; originally announced January 2023.

  34. arXiv:2301.01015  [pdf, other

    cs.CV cs.AI cs.CL

    Semi-Structured Object Sequence Encoders

    Authors: Rudra Murthy V, Riyaz Bhat, Chulaka Gunasekara, Siva Sankalp Patel, Hui Wan, Tejas Indulal Dhamecha, Danish Contractor, Marina Danilevsky

    Abstract: In this paper we explore the task of modeling semi-structured object sequences; in particular, we focus our attention on the problem of developing a structure-aware input representation for such sequences. Examples of such data include user activity on websites, machine logs, and many others. This type of data is often represented as a sequence of sets of key-value pairs over time and can present… ▽ More

    Submitted 22 May, 2023; v1 submitted 3 January, 2023; originally announced January 2023.

  35. arXiv:2212.00373  [pdf, other

    cs.AI

    A Noise-tolerant Differentiable Learning Approach for Single Occurrence Regular Expression with Interleaving

    Authors: Rongzhen Ye, Tianqu Zhuang, Hai Wan, Jianfeng Du, Weilin Luo, Pingjia Liang

    Abstract: We study the problem of learning a single occurrence regular expression with interleaving (SOIRE) from a set of text strings possibly with noise. SOIRE fully supports interleaving and covers a large portion of regular expressions used in practice. Learning SOIREs is challenging because it requires heavy computation and text strings usually contain noise in practice. Most of the previous studies on… ▽ More

    Submitted 11 January, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

  36. arXiv:2211.15666  [pdf, other

    cs.LG cs.AI cs.CV

    Learning Visual Planning Models from Partially Observed Images

    Authors: Kebing Jin, Zhanhao Xiao, Hankui Hankz Zhuo, Hai Wan, Jiaran Cai

    Abstract: There has been increasing attention on planning model learning in classical planning. Most existing approaches, however, focus on learning planning models from structured data in symbolic representations. It is often difficult to obtain such structured data in real-world scenarios. Although a number of approaches have been developed for learning planning models from fully observed unstructured dat… ▽ More

    Submitted 25 November, 2022; originally announced November 2022.

    Comments: 25 pages, 5 figures

  37. arXiv:2210.12415  [pdf, other

    cs.LG cs.DC

    ALT: Boosting Deep Learning Performance by Breaking the Wall between Graph and Operator Level Optimizations

    Authors: Zhiying Xu, Jiafan Xu, Hongding Peng, Wei Wang, Xiaoliang Wang, Haoran Wan, Haipeng Dai, Yixu Xu, Hao Cheng, Kun Wang, Guihai Chen

    Abstract: Deep learning models rely on highly optimized tensor libraries for efficient inference on heterogeneous hardware. Current deep compilers typically predetermine layouts of tensors and then optimize loops of operators. However, such unidirectional and one-off workflow strictly separates graph-level optimization and operator-level optimization into different system layers, missing opportunities for u… ▽ More

    Submitted 29 October, 2022; v1 submitted 22 October, 2022; originally announced October 2022.

  38. arXiv:2210.04174  [pdf, other

    cs.LG cs.CV

    Grow and Merge: A Unified Framework for Continuous Categories Discovery

    Authors: Xinwei Zhang, Jianwen Jiang, Yutong Feng, Zhi-Fan Wu, Xibin Zhao, Hai Wan, Mingqian Tang, Rong Jin, Yue Gao

    Abstract: Although a number of studies are devoted to novel category discovery, most of them assume a static setting where both labeled and unlabeled data are given at once for finding new categories. In this work, we focus on the application scenarios where unlabeled data are continuously fed into the category discovery system. We refer to it as the {\bf Continuous Category Discovery} ({\bf CCD}) problem,… ▽ More

    Submitted 9 October, 2022; originally announced October 2022.

    Comments: This paper has already been accepted by 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  39. arXiv:2209.15166  [pdf, other

    cs.IR cs.AI cs.LG

    Reward Shaping for User Satisfaction in a REINFORCE Recommender

    Authors: Konstantina Christakopoulou, Can Xu, Sai Zhang, Sriraj Badam, Trevor Potter, Daniel Li, Hao Wan, Xinyang Yi, Ya Le, Chris Berg, Eric Bencomo Dixon, Ed H. Chi, Minmin Chen

    Abstract: How might we design Reinforcement Learning (RL)-based recommenders that encourage aligning user trajectories with the underlying user satisfaction? Three research questions are key: (1) measuring user satisfaction, (2) combatting sparsity of satisfaction signals, and (3) adapting the training of the recommender agent to maximize satisfaction. For measurement, it has been found that surveys explici… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

    Comments: Accepted in Reinforcement Learning for Real Life (RL4RealLife) Workshop in the 38th International Conference on Machine Learning, 2021

  40. GraTO: Graph Neural Network Framework Tackling Over-smoothing with Neural Architecture Search

    Authors: Xinshun Feng, Herun Wan, Shangbin Feng, Hongrui Wang, Jun Zhou, Qinghua Zheng, Minnan Luo

    Abstract: Current Graph Neural Networks (GNNs) suffer from the over-smoothing problem, which results in indistinguishable node representations and low model performance with more GNN layers. Many methods have been put forward to tackle this problem in recent years. However, existing tackling over-smoothing methods emphasize model performance and neglect the over-smoothness of node representations. Additiona… ▽ More

    Submitted 22 October, 2022; v1 submitted 18 August, 2022; originally announced August 2022.

    Comments: accepted at CIKM2022

  41. arXiv:2208.08320  [pdf, other

    cs.AI

    BIC: Twitter Bot Detection with Text-Graph Interaction and Semantic Consistency

    Authors: Zhenyu Lei, Herun Wan, Wenqian Zhang, Shangbin Feng, Zilong Chen, Jundong Li, Qinghua Zheng, Minnan Luo

    Abstract: Twitter bots are automatic programs operated by malicious actors to manipulate public opinion and spread misinformation. Research efforts have been made to automatically identify bots based on texts and networks on social media. Existing methods only leverage texts or networks alone, and while few works explored the shallow combination of the two modalities, we hypothesize that the interaction and… ▽ More

    Submitted 17 February, 2023; v1 submitted 17 August, 2022; originally announced August 2022.

  42. arXiv:2207.14539  [pdf, other

    cs.CV cs.LG

    Pre-training General Trajectory Embeddings with Maximum Multi-view Entropy Coding

    Authors: Yan Lin, Huaiyu Wan, Shengnan Guo, Jilin Hu, Christian S. Jensen, Youfang Lin

    Abstract: Spatio-temporal trajectories provide valuable information about movement and travel behavior, enabling various downstream tasks that in turn power real-world applications. Learning trajectory embeddings can improve task performance but may incur high computational costs and face limited training data availability. Pre-training learns generic embeddings by means of specially constructed pretext tas… ▽ More

    Submitted 25 December, 2023; v1 submitted 29 July, 2022; originally announced July 2022.

    Comments: 15 pages, 7 figures, accepted by IEEE Trans. on Knowledge and Data Engineering

  43. arXiv:2206.04564  [pdf, other

    cs.SI cs.AI

    TwiBot-22: Towards Graph-Based Twitter Bot Detection

    Authors: Shangbin Feng, Zhaoxuan Tan, Herun Wan, Ningnan Wang, Zilong Chen, Binchi Zhang, Qinghua Zheng, Wenqian Zhang, Zhenyu Lei, Shujie Yang, Xinshun Feng, Qingyue Zhang, Hongrui Wang, Yuhan Liu, Yuyang Bai, Heng Wang, Zijian Cai, Yanbo Wang, Lijing Zheng, Zihan Ma, Jundong Li, Minnan Luo

    Abstract: Twitter bot detection has become an increasingly important task to combat misinformation, facilitate social media moderation, and preserve the integrity of the online discourse. State-of-the-art bot detection methods generally leverage the graph structure of the Twitter network, and they exhibit promising performance when confronting novel Twitter bots that traditional methods fail to detect. Howe… ▽ More

    Submitted 12 February, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022, Datasets and Benchmarks Track

  44. arXiv:2205.14748  [pdf, other

    cs.CL

    Learning as Conversation: Dialogue Systems Reinforced for Information Acquisition

    Authors: Pengshan Cai, Hui Wan, Fei Liu, Mo Yu, Hong Yu, Sachindra Joshi

    Abstract: We propose novel AI-empowered chat bots for learning as conversation where a user does not read a passage but gains information and knowledge through conversation with a teacher bot. Our information-acquisition-oriented dialogue system employs a novel adaptation of reinforced self-play so that the system can be transferred to various domains without in-domain dialogue data, and can carry out conve… ▽ More

    Submitted 29 May, 2022; originally announced May 2022.

    Comments: 10 pages, accepted by NAACL 2022

  45. arXiv:2205.14226  [pdf, other

    cs.IR cs.CL

    Fast and Light-Weight Answer Text Retrieval in Dialogue Systems

    Authors: Hui Wan, Siva Sankalp Patel, J. William Murdock, Saloni Potdar, Sachindra Joshi

    Abstract: Dialogue systems can benefit from being able to search through a corpus of text to find information relevant to user requests, especially when encountering a request for which no manually curated response is available. The state-of-the-art technology for neural dense retrieval or re-ranking involves deep learning models with hundreds of millions of parameters. However, it is difficult and expensiv… ▽ More

    Submitted 31 May, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: Accepted in NAACL-HLT 2022 Industry Track

  46. arXiv:2203.01414  [pdf, other

    cs.AR

    ICARUS: A Specialized Architecture for Neural Radiance Fields Rendering

    Authors: Chaolin Rao, Huangjie Yu, Haochuan Wan, Jindong Zhou, Yueyang Zheng, Yu Ma, Anpei Chen, Minye Wu, Binzhe Yuan, Pingqiang Zhou, Xin Lou, Jingyi Yu

    Abstract: The practical deployment of Neural Radiance Fields (NeRF) in rendering applications faces several challenges, with the most critical one being low rendering speed on even high-end graphic processing units (GPUs). In this paper, we present ICARUS, a specialized accelerator architecture tailored for NeRF rendering. Unlike GPUs using general purpose computing and memory architectures for NeRF, ICARUS… ▽ More

    Submitted 26 September, 2022; v1 submitted 28 February, 2022; originally announced March 2022.

  47. arXiv:2111.00207  [pdf, other

    cs.CV

    PatchFormer: An Efficient Point Transformer with Patch Attention

    Authors: Zhang Cheng, Haocheng Wan, Xinyi Shen, Zizhao Wu

    Abstract: The point cloud learning community witnesses a modeling shift from CNNs to Transformers, where pure Transformer architectures have achieved top accuracy on the major learning benchmarks. However, existing point Transformers are computationally expensive since they need to generate a large attention map, which has quadratic complexity (both in space and time) with respect to input size. To solve th… ▽ More

    Submitted 24 March, 2022; v1 submitted 30 October, 2021; originally announced November 2021.

    Comments: 10 pages

  48. Gradient-Based Mixed Planning with Symbolic and Numeric Action Parameters

    Authors: Kebing Jin, Hankz Hankui Zhuo, Zhanhao Xiao, Hai Wan, Subbarao Kambhampati

    Abstract: Dealing with planning problems with both logical relations and numeric changes in real-world dynamic environments is challenging. Existing numeric planning systems for the problem often discretize numeric variables or impose convex constraints on numeric variables, which harms the performance when solving problems. In this paper, we propose a novel algorithm framework to solve numeric planning pro… ▽ More

    Submitted 9 October, 2022; v1 submitted 19 October, 2021; originally announced October 2021.

    Comments: 41 pages, 22 figures. Accepted by Artificial Intelligence

  49. MultiDoc2Dial: Modeling Dialogues Grounded in Multiple Documents

    Authors: Song Feng, Siva Sankalp Patel, Hui Wan, Sachindra Joshi

    Abstract: We propose MultiDoc2Dial, a new task and dataset on modeling goal-oriented dialogues grounded in multiple documents. Most previous works treat document-grounded dialogue modeling as a machine reading comprehension task based on a single given document or passage. In this work, we aim to address more realistic scenarios where a goal-oriented information-seeking conversation involves multiple topics… ▽ More

    Submitted 26 September, 2021; originally announced September 2021.

    Journal ref: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing

  50. arXiv:2108.06076  [pdf, other

    cs.CV cs.AI cs.GR

    PVT: Point-Voxel Transformer for Point Cloud Learning

    Authors: Cheng Zhang, Haocheng Wan, Xinyi Shen, Zizhao Wu

    Abstract: The recently developed pure Transformer architectures have attained promising accuracy on point cloud learning benchmarks compared to convolutional neural networks. However, existing point cloud Transformers are computationally expensive since they waste a significant amount of time on structuring the irregular data. To solve this shortcoming, we present Sparse Window Attention (SWA) module to gat… ▽ More

    Submitted 25 May, 2022; v1 submitted 13 August, 2021; originally announced August 2021.

    Comments: 29 pages