
Showing 51–100 of 355 results for author: Xie, Q

  1. arXiv:2401.13884  [pdf, other]

    stat.ML cs.LG math.OC

    Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation

    Authors: Yixuan Zhang, Qiaomin Xie

    Abstract: Stochastic Approximation (SA) is a widely used algorithmic approach in various fields, including optimization and reinforcement learning (RL). Among RL algorithms, Q-learning is particularly popular due to its empirical success. In this paper, we study asynchronous Q-learning with constant stepsize, which is commonly used in practice for its fast convergence. By connecting the constant stepsize Q-…

    Submitted 24 January, 2024; originally announced January 2024.

    Comments: 41 pages, 3 figures

  2. EmoLLMs: A Series of Emotional Large Language Models and Annotation Tools for Comprehensive Affective Analysis

    Authors: Zhiwei Liu, Kailai Yang, Tianlin Zhang, Qianqian Xie, Sophia Ananiadou

    Abstract: Sentiment analysis and emotion detection are important research topics in natural language processing (NLP) and benefit many downstream tasks. With the widespread application of LLMs, researchers have started exploring the application of LLMs based on instruction-tuning in the field of sentiment analysis. However, these models only focus on single aspects of affective classification tasks (e.g. se…

    Submitted 17 June, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted by KDD 2024

  3. arXiv:2401.08022  [pdf, other]

    cs.RO

    Preprocessing-based Kinodynamic Motion Planning Framework for Intercepting Projectiles using a Robot Manipulator

    Authors: Ramkumar Natarajan, Hanlan Yang, Qintong Xie, Yash Oza, Manash Pratim Das, Fahad Islam, Muhammad Suhail Saleem, Howie Choset, Maxim Likhachev

    Abstract: We are interested in studying sports with robots and starting with the problem of intercepting a projectile moving toward a robot manipulator equipped with a shield. To successfully perform this task, the robot needs to (i) detect the incoming projectile, (ii) predict the projectile's future motion, (iii) plan a minimum-time rapid trajectory that can evade obstacles and intercept the projectile, a…

    Submitted 16 March, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

    Comments: Proceedings of the IEEE International Conference on Robotics and Automation (ICRA) 2024

  4. arXiv:2401.04335  [pdf]

    physics.optics physics.app-ph

    SiN-on-SOI Optical Phased Array LiDAR for Ultra-Wide Field of View and 4D Sensing

    Authors: Baisong Chen, Yingzhi Li, Qijie Xie, Quanxin Na, Min Tao, Ziming Wang, Zihao Zhi, Heming Hu, Xuetong Li, Huan Qu, Yafang He, Xiaolong Hu, Guoqiang Lo, Junfeng Song

    Abstract: Three-dimensional (3D) imaging techniques are facilitating the autonomous vehicles to build intelligent system. Optical phased arrays (OPAs) featured by all solid-state configurations are becoming a promising solution for 3D imaging. However, majority of state-of-art OPAs commonly suffer from severe power degradation at the edge of field of view (FoV), resulting in limited effective FoV and deteri…

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: 18 pages with 13 figures

    Journal ref: Laser Photonics Rev 2024, 2301360

  5. arXiv:2401.03804  [pdf, other]

    cs.CL cs.AI

    TeleChat Technical Report

    Authors: Zhongjiang He, Zihan Wang, Xinzhang Liu, Shixuan Liu, Yitong Yao, Yuyao Huang, Xuelong Li, Yongxiang Li, Zhonghao Che, Zhaoxi Zhang, Yan Wang, Xin Wang, Luwen Pu, Huinan Xu, Ruiyu Fang, Yu Zhao, Jie Zhang, Xiaomeng Huang, Zhilong Lu, Jiaxin Peng, Wenjun Zheng, Shiquan Wang, Bingkai Yang, Xuewei he, Zhuoru Jiang, et al. (11 additional authors not shown)

    Abstract: In this technical report, we present TeleChat, a collection of large language models (LLMs) with parameters of 3 billion, 7 billion and 12 billion. It includes pretrained language models as well as fine-tuned chat models that is aligned with human preferences. TeleChat is initially pretrained on an extensive corpus containing a diverse collection of texts from both English and Chinese languages, i…

    Submitted 1 April, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

    Comments: 28 pages, 2 figures

    ACM Class: I.2.7

  6. arXiv:2401.02901  [pdf, other]

    hep-ph hep-ex

    Charged-current non-standard neutrino interactions at Daya Bay

    Authors: Daya Bay collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, Y. C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng, X. Y. Ding, et al. (177 additional authors not shown)

    Abstract: The full data set of the Daya Bay reactor neutrino experiment is used to probe the effect of the charged current non-standard interactions (CC-NSI) on neutrino oscillation experiments. Two different approaches are applied and constraints on the corresponding CC-NSI parameters are obtained with the neutrino flux taken from the Huber-Mueller model with a $5\%$ uncertainty. For the quantum mechanics-…

    Submitted 19 March, 2024; v1 submitted 5 January, 2024; originally announced January 2024.

    Comments: 25 pages, 16 figures, 6 tables; 36 pages, format changed, references added

  7. arXiv:2401.01623  [pdf, other]

    cs.AI cs.CL

    Can AI Be as Creative as Humans?

    Authors: Haonan Wang, James Zou, Michael Mozer, Anirudh Goyal, Alex Lamb, Linjun Zhang, Weijie J Su, Zhun Deng, Michael Qizhe Xie, Hannah Brown, Kenji Kawaguchi

    Abstract: Creativity serves as a cornerstone for societal progress and innovation. With the rise of advanced generative AI models capable of tasks once reserved for human creativity, the study of AI's creative potential becomes imperative for its responsible development and application. In this paper, we prove in theory that AI can be as creative as humans under the condition that it can properly fit the da…

    Submitted 25 January, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

    Comments: The paper examines AI's creativity, introducing Relative and Statistical Creativity for theoretical and practical analysis, along with practical training guidelines. Project Page: ai-relative-creativity.github.io

  8. arXiv:2401.01369  [pdf, other]

    cs.IR cs.AI cs.LG

    RL-MPCA: A Reinforcement Learning Based Multi-Phase Computation Allocation Approach for Recommender Systems

    Authors: Jiahong Zhou, Shunhui Mao, Guoliang Yang, Bo Tang, Qianlong Xie, Lebin Lin, Xingxing Wang, Dong Wang

    Abstract: Recommender systems aim to recommend the most suitable items to users from a large number of candidates. Their computation cost grows as the number of user requests and the complexity of services (or models) increases. Under the limitation of computation resources (CRs), how to make a trade-off between computation cost and business revenue becomes an essential question. The existing studies focus…

    Submitted 27 December, 2023; originally announced January 2024.

    Comments: 11 pages, 7 figures, published to Proceedings of the ACM Web Conference 2023

  9. arXiv:2401.01059  [pdf, other]

    q-bio.QM

    Accelerating Discovery of Novel and Bioactive Ligands With Pharmacophore-Informed Generative Models

    Authors: Weixin Xie, Jianhang Zhang, Qin Xie, Chaojun Gong, Youjun Xu, Luhua Lai, Jianfeng Pei

    Abstract: Deep generative models have gained significant advancements to accelerate drug discovery by generating bioactive chemicals against desired targets. Nevertheless, most generated compounds that have been validated for potent bioactivity often exhibit structural novelty levels that fall short of satisfaction, thereby providing limited inspiration to human medicinal chemists. The challenge faced by ge…

    Submitted 2 January, 2024; originally announced January 2024.

  10. arXiv:2312.17503  [pdf, other]

    cs.LG cs.GT

    HiBid: A Cross-Channel Constrained Bidding System with Budget Allocation by Hierarchical Offline Deep Reinforcement Learning

    Authors: Hao Wang, Bo Tang, Chi Harold Liu, Shangqin Mao, Jiahong Zhou, Zipeng Dai, Yaqi Sun, Qianlong Xie, Xingxing Wang, Dong Wang

    Abstract: Online display advertising platforms service numerous advertisers by providing real-time bidding (RTB) for the scale of billions of ad requests every day. The bidding strategy handles ad requests cross multiple channels to maximize the number of clicks under the set financial constraints, i.e., total budget and cost-per-click (CPC), etc. Different from existing works mainly focusing on single chan…

    Submitted 20 August, 2024; v1 submitted 29 December, 2023; originally announced December 2023.

    Report number: 23-NX-HOIX

  11. arXiv:2312.15701  [pdf, other]

    eess.IV cs.CV cs.LG

    Rotation Equivariant Proximal Operator for Deep Unfolding Methods in Image Restoration

    Authors: Jiahong Fu, Qi Xie, Deyu Meng, Zongben Xu

    Abstract: The deep unfolding approach has attracted significant attention in computer vision tasks, which well connects conventional image processing modeling manners with more recent deep learning techniques. Specifically, by establishing a direct correspondence between algorithm operators at each implementation step and network modules within each layer, one can rationally construct an almost ``white box'…

    Submitted 25 December, 2023; originally announced December 2023.

  12. arXiv:2312.15268  [pdf, other]

    cs.CV

    MGDepth: Motion-Guided Cost Volume For Self-Supervised Monocular Depth In Dynamic Scenarios

    Authors: Kaichen Zhou, Jia-Xing Zhong, Jia-Wang Bian, Qian Xie, Jian-Qing Zheng, Niki Trigoni, Andrew Markham

    Abstract: Despite advancements in self-supervised monocular depth estimation, challenges persist in dynamic scenarios due to the dependence on assumptions about a static world. In this paper, we present MGDepth, a Motion-Guided Cost Volume Depth Net, to achieve precise depth estimation for both dynamic objects and static backgrounds, all while maintaining computational efficiency. To tackle the challenges p…

    Submitted 23 December, 2023; originally announced December 2023.

  13. arXiv:2312.10894  [pdf, other]

    stat.ML cs.LG stat.ME

    Effectiveness of Constant Stepsize in Markovian LSA and Statistical Inference

    Authors: Dongyan Huo, Yudong Chen, Qiaomin Xie

    Abstract: In this paper, we study the effectiveness of using a constant stepsize in statistical inference via linear stochastic approximation (LSA) algorithms with Markovian data. After establishing a Central Limit Theorem (CLT), we outline an inference procedure that uses averaged LSA iterates to construct confidence intervals (CIs). Our procedure leverages the fast mixing property of constant-stepsize LSA…

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: AAAI 2024

  14. arXiv:2312.08782  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis

    Authors: Yafei Hu, Quanting Xie, Vidhi Jain, Jonathan Francis, Jay Patrikar, Nikhil Keetha, Seungchan Kim, Yaqi Xie, Tianyi Zhang, Shibo Zhao, Yu Quan Chong, Chen Wang, Katia Sycara, Matthew Johnson-Roberson, Dhruv Batra, Xiaolong Wang, Sebastian Scherer, Zsolt Kira, Fei Xia, Yonatan Bisk

    Abstract: Building general-purpose robots that can operate seamlessly, in any environment, with any object, and utilizing various skills to complete diverse tasks has been a long-standing goal in Artificial Intelligence. Unfortunately, however, most existing robotic systems have been constrained - having been designed for specific tasks, trained on specific datasets, and deployed within specific environment…

    Submitted 15 December, 2023; v1 submitted 14 December, 2023; originally announced December 2023.

  15. arXiv:2312.01139  [pdf, other]

    hep-th

    Quantum corrected Q-ball dynamics

    Authors: Qi-Xin Xie, Paul M. Saffin, Anders Tranberg, Shuang-Yong Zhou

    Abstract: The physics of individual Q-balls and interactions between multiple Q-balls are well-studied in classical numerical simulations. Interesting properties and phenomena have been discovered, involving stability, forces, collisions and swapping of charge between different components of multi-Q-ball systems. We investigate these phenomena in quantum field theory, including quantum corrections to leadin…

    Submitted 2 December, 2023; originally announced December 2023.

    Comments: 29 pages, 18 figures

    Report number: USTC-ICTS/PCFT-23-37

  16. Optimal Attack and Defense for Reinforcement Learning

    Authors: Jeremy McMahan, Young Wu, Xiaojin Zhu, Qiaomin Xie

    Abstract: To ensure the usefulness of Reinforcement Learning (RL) in real systems, it is crucial to ensure they are robust to noise and adversarial attacks. In adversarial RL, an external attacker has the power to manipulate the victim agent's interaction with the environment. We study the full class of online manipulation attacks, which include (i) state attacks, (ii) observation attacks (which are a gener…

    Submitted 17 June, 2024; v1 submitted 30 November, 2023; originally announced December 2023.

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence, 38(13), 14332-14340. 2024

  17. arXiv:2311.17086  [pdf, other]

    cs.CV cs.CL

    PEA-Diffusion: Parameter-Efficient Adapter with Knowledge Distillation in non-English Text-to-Image Generation

    Authors: Jian Ma, Chen Chen, Qingsong Xie, Haonan Lu

    Abstract: Text-to-image diffusion models are well-known for their ability to generate realistic images based on textual prompts. However, the existing works have predominantly focused on English, lacking support for non-English text-to-image models. The most commonly used translation methods cannot solve the generation problem related to language culture, while training from scratch on a specific language d…

    Submitted 23 July, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: ECCV 2024

  18. arXiv:2311.00582  [pdf, other]

    cs.GT cs.AI

    Minimally Modifying a Markov Game to Achieve Any Nash Equilibrium and Value

    Authors: Young Wu, Jeremy McMahan, Yiding Chen, Yudong Chen, Xiaojin Zhu, Qiaomin Xie

    Abstract: We study the game modification problem, where a benevolent game designer or a malevolent adversary modifies the reward function of a zero-sum Markov game so that a target deterministic or stochastic policy profile becomes the unique Markov perfect Nash equilibrium and has a value within a target range, in a way that minimizes the modification cost. We characterize the set of policy profiles that c…

    Submitted 24 August, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: Accepted by ICML 2024 Conference

  19. arXiv:2311.00327  [pdf, other]

    cs.LG

    Multi-task Representation Learning for Pure Exploration in Bilinear Bandits

    Authors: Subhojyoti Mukherjee, Qiaomin Xie, Josiah P. Hanna, Robert Nowak

    Abstract: We study multi-task representation learning for the problem of pure exploration in bilinear bandits. In bilinear bandits, an action takes the form of a pair of arms from two different entity types and the reward is a bilinear function of the known feature vectors of the arms. In the multi-task bilinear bandit problem, we aim to find optimal actions for multiple tasks that share a common l…

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: Accepted in 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  20. arXiv:2310.20329  [pdf, other]

    cs.CL cs.SE

    InstructCoder: Instruction Tuning Large Language Models for Code Editing

    Authors: Kaixin Li, Qisheng Hu, Xu Zhao, Hui Chen, Yuxi Xie, Tiedong Liu, Qizhe Xie, Junxian He

    Abstract: Code editing encompasses a variety of pragmatic tasks that developers deal with daily. Despite its relevance and practical usefulness, automatic code editing remains an underexplored area in the evolution of deep learning models, partly due to data scarcity. In this work, we explore the use of Large Language Models (LLMs) to edit code based on user instructions. Evaluated on a novel human-written…

    Submitted 28 February, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

  21. arXiv:2310.16326  [pdf, other]

    cs.GT cs.LG

    Reinforcement Learning for SBM Graphon Games with Re-Sampling

    Authors: Peihan Huo, Oscar Peralta, Junyu Guo, Qiaomin Xie, Andreea Minca

    Abstract: The Mean-Field approximation is a tractable approach for studying large population dynamics. However, its assumption on homogeneity and universal connections among all agents limits its applicability in many real-world scenarios. Multi-Population Mean-Field Game (MP-MFG) models have been introduced in the literature to address these limitations. When the underlying Stochastic Block Model is known,…

    Submitted 24 October, 2023; originally announced October 2023.

  22. arXiv:2310.05620  [pdf, other]

    cs.CL

    LAiW: A Chinese Legal Large Language Models Benchmark

    Authors: Yongfu Dai, Duanyu Feng, Jimin Huang, Haochen Jia, Qianqian Xie, Yifang Zhang, Weiguang Han, Wei Tian, Hao Wang

    Abstract: General and legal domain LLMs have demonstrated strong performance in various tasks of LegalAI. However, the current evaluations of these LLMs in LegalAI are defined by the experts of computer science, lacking consistency with the logic of legal practice, making it difficult to judge their practical capabilities. To address this challenge, we are the first to build the Chinese legal LLMs benchmark…

    Submitted 18 February, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

  23. arXiv:2310.02174  [pdf, other]

    cs.CL cs.AI cs.LG

    Ask Again, Then Fail: Large Language Models' Vacillations in Judgment

    Authors: Qiming Xie, Zengzhi Wang, Yi Feng, Rui Xia

    Abstract: We observe that current conversational language models often waver in their judgments when faced with follow-up questions, even if the original judgment was correct. This wavering presents a significant challenge for generating reliable responses and building user trust. To comprehensively assess this issue, we introduce a Follow-up Questioning Mechanism along with two metrics to quantify…

    Submitted 11 June, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: Accepted by ACL 2024 main conference

  24. arXiv:2310.01074  [pdf, other]

    cs.CL cs.AI

    Back to the Future: Towards Explainable Temporal Reasoning with Large Language Models

    Authors: Chenhan Yuan, Qianqian Xie, Jimin Huang, Sophia Ananiadou

    Abstract: Temporal reasoning is a crucial NLP task, providing a nuanced understanding of time-sensitive contexts within textual data. Although recent advancements in LLMs have demonstrated their potential in temporal reasoning, the predominant focus has been on tasks such as temporal expression and temporal relation extraction. These tasks are primarily designed for the extraction of direct and past tempora…

    Submitted 8 October, 2023; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 14 pages, 5 figures, code and dataset: https://github.com/chenhan97/TimeLlama

  25. arXiv:2310.00566  [pdf, other]

    cs.LG cs.AI cs.CL cs.CY

    Empowering Many, Biasing a Few: Generalist Credit Scoring through Large Language Models

    Authors: Duanyu Feng, Yongfu Dai, Jimin Huang, Yifang Zhang, Qianqian Xie, Weiguang Han, Zhengyu Chen, Alejandro Lopez-Lira, Hao Wang

    Abstract: In the financial industry, credit scoring is a fundamental element, shaping access to credit and determining the terms of loans for individuals and businesses alike. Traditional credit scoring methods, however, often grapple with challenges such as narrow knowledge scope and isolated evaluation of credit tasks. Our work posits that Large Language Models (LLMs) have great potential for credit scori…

    Submitted 17 February, 2024; v1 submitted 30 September, 2023; originally announced October 2023.

  26. Overview of the BioLaySumm 2023 Shared Task on Lay Summarization of Biomedical Research Articles

    Authors: Tomas Goldsack, Zheheng Luo, Qianqian Xie, Carolina Scarton, Matthew Shardlow, Sophia Ananiadou, Chenghua Lin

    Abstract: This paper presents the results of the shared task on Lay Summarisation of Biomedical Research Articles (BioLaySumm), hosted at the BioNLP Workshop at ACL 2023. The goal of this shared task is to develop abstractive summarisation models capable of generating "lay summaries" (i.e., summaries that are comprehensible to non-technical audiences) in both a controllable and non-controllable setting. The…

    Submitted 25 October, 2023; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: Published at BioNLP@ACL2023

    Journal ref: The 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks (2023) 468-477

  27. arXiv:2309.15638  [pdf, other]

    eess.IV cs.CV cs.LG

    FRS-Nets: Fourier Parameterized Rotation and Scale Equivariant Networks for Retinal Vessel Segmentation

    Authors: Zihong Sun, Qi Xie, Deyu Meng

    Abstract: With translation equivariance, convolution neural networks (CNNs) have achieved great success in retinal vessel segmentation. However, some other symmetries of the vascular morphology are not characterized by CNNs, such as rotation and scale symmetries. To embed more equivariance into CNNs and achieve the accuracy requirement for retinal vessel segmentation, we construct a novel convolution operat…

    Submitted 27 September, 2023; originally announced September 2023.

  28. MentaLLaMA: Interpretable Mental Health Analysis on Social Media with Large Language Models

    Authors: Kailai Yang, Tianlin Zhang, Ziyan Kuang, Qianqian Xie, Jimin Huang, Sophia Ananiadou

    Abstract: With the development of web technology, social media texts are becoming a rich source for automatic mental health analysis. As traditional discriminative methods bear the problem of low interpretability, the recent large language models have been explored for interpretable mental health analysis on social media, which aims to provide detailed explanations along with predictions. The results show t…

    Submitted 3 February, 2024; v1 submitted 24 September, 2023; originally announced September 2023.

    Comments: Accepted by WWW 2024

  29. arXiv:2309.12455  [pdf, other]

    cs.CL cs.AI cs.LG

    LongDocFACTScore: Evaluating the Factuality of Long Document Abstractive Summarisation

    Authors: Jennifer A Bishop, Qianqian Xie, Sophia Ananiadou

    Abstract: Maintaining factual consistency is a critical issue in abstractive text summarisation, however, it cannot be assessed by traditional automatic metrics used for evaluating text summarisation, such as ROUGE scoring. Recent efforts have been devoted to developing improved metrics for measuring factual consistency using pre-trained language models, but these metrics have restrictive token limits, and…

    Submitted 28 May, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: This paper has been published in LREC-COLING 2024, Pages 10777-10789 and was presented as an oral presentation during the conference held in Turin, Italy. The published version is available at https://aclanthology.org/2024.lrec-main.941. 13 pages, 5 figures

    ACM Class: I.2.7

    Journal ref: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

  30. arXiv:2309.10103  [pdf, other]

    cs.RO cs.AI

    Reasoning about the Unseen for Efficient Outdoor Object Navigation

    Authors: Quanting Xie, Tianyi Zhang, Kedi Xu, Matthew Johnson-Roberson, Yonatan Bisk

    Abstract: Robots should exist anywhere humans do: indoors, outdoors, and even unmapped environments. In contrast, the focus of recent advancements in Object Goal Navigation (OGN) has targeted navigating in indoor environments by leveraging spatial and semantic cues that do not generalize outdoors. While these contributions provide valuable insights into indoor scenarios, the broader spectrum of real-world ro…

    Submitted 18 September, 2023; originally announced September 2023.

    Comments: 6 pages, 7 figures

  31. arXiv:2309.07726  [pdf, other]

    cs.RO

    GRID: Scene-Graph-based Instruction-driven Robotic Task Planning

    Authors: Zhe Ni, Xiaoxin Deng, Cong Tai, Xinyue Zhu, Qinghongbing Xie, Weihang Huang, Xiang Wu, Long Zeng

    Abstract: Recent works have shown that Large Language Models (LLMs) can facilitate the grounding of instructions for robotic task planning. Despite this progress, most existing works have primarily focused on utilizing raw images to aid LLMs in understanding environmental information. However, this approach not only limits the scope of observation but also typically necessitates extensive multimodal data co…

    Submitted 10 March, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: 8 pages, 10 figures

  32. arXiv:2309.06160  [pdf]

    cs.DL

    A comparison of citation-based clustering and topic modeling for science mapping

    Authors: Qianqian Xie, Ludo Waltman

    Abstract: Science mapping is an important tool to gain insight into scientific fields, to identify emerging research trends, and to support science policy. Understanding the different ways in which different science mapping approaches capture the structure of scientific fields is critical. This paper presents a comparative analysis of two commonly used approaches, topic modeling (TM) and citation-based clus…

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: 29 pages and 6 figures

  33. arXiv:2309.01142  [pdf, other]

    eess.AS cs.SD

    MSM-VC: High-fidelity Source Style Transfer for Non-Parallel Voice Conversion by Multi-scale Style Modeling

    Authors: Zhichao Wang, Xinsheng Wang, Qicong Xie, Tao Li, Lei Xie, Qiao Tian, Yuping Wang

    Abstract: In addition to conveying the linguistic content from source speech to converted speech, maintaining the speaking style of source speech also plays an important role in the voice conversion (VC) task, which is essential in many scenarios with highly expressive source speech, such as dubbing and data augmentation. Previous work generally took explicit prosodic features or fixed-length style embeddin…

    Submitted 3 September, 2023; originally announced September 2023.

    Comments: This work was submitted on April 10, 2022 and accepted on August 29, 2023

  34. arXiv:2308.02565  [pdf, other]

    cs.CL cs.AI

    SimTeG: A Frustratingly Simple Approach Improves Textual Graph Learning

    Authors: Keyu Duan, Qian Liu, Tat-Seng Chua, Shuicheng Yan, Wei Tsang Ooi, Qizhe Xie, Junxian He

    Abstract: Textual graphs (TGs) are graphs whose nodes correspond to text (sentences or documents), which are widely prevalent. The representation learning of TGs involves two stages: (i) unsupervised feature extraction and (ii) supervised graph representation learning. In recent years, extensive efforts have been devoted to the latter stage, where Graph Neural Networks (GNNs) have dominated. However, the fo…

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: 9 pages, 3 figures

  35. arXiv:2307.12571  [pdf, other]

    cs.CV

    MataDoc: Margin and Text Aware Document Dewarping for Arbitrary Boundary

    Authors: Beiya Dai, Xing li, Qunyi Xie, Yulin Li, Xiameng Qin, Chengquan Zhang, Kun Yao, Junyu Han

    Abstract: Document dewarping from a distorted camera-captured image is of great value for OCR and document understanding. The document boundary plays an important role which is more evident than the inner region in document dewarping. Current learning-based methods mainly focus on complete boundary cases, leading to poor document correction performance of documents with incomplete boundaries. In contrast to…

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: 12 pages

  36. arXiv:2307.09652  [pdf, other]

    cs.GT cs.AI cs.CR cs.MA eess.SY

    VISER: A Tractable Solution Concept for Games with Information Asymmetry

    Authors: Jeremy McMahan, Young Wu, Yudong Chen, Xiaojin Zhu, Qiaomin Xie

    Abstract: Many real-world games suffer from information asymmetry: one player is only aware of their own payoffs while the other player has the full game information. Examples include the critical domain of security games and adversarial multi-agent reinforcement learning. Information asymmetry renders traditional solution concepts such as Strong Stackelberg Equilibrium (SSE) and Robust-Optimization Equilib…

    Submitted 18 July, 2023; originally announced July 2023.

    Comments: 17 pages, 6 figures

    MSC Class: 91A27 (Primary); 93E20 (Secondary)

    ACM Class: F.2.1; G.3; I.2.8

  37. A scoping review on multimodal deep learning in biomedical images and texts

    Authors: Zhaoyi Sun, Mingquan Lin, Qingqing Zhu, Qianqian Xie, Fei Wang, Zhiyong Lu, Yifan Peng

    Abstract: Computer-assisted diagnostic and prognostic systems of the future should be capable of simultaneously processing multimodal data. Multimodal deep learning (MDL), which involves the integration of multiple sources of data, such as images and text, has the potential to revolutionize the analysis and interpretation of biomedical data. However, it only caught researchers' attention recently. To this e…

    Submitted 18 October, 2023; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: This paper has been accepted by the Journal of Biomedical Informatics

    Journal ref: Journal of Biomedical Informatics, Volume 146, October 2023, 104482

  38. arXiv:2307.07119  [pdf, other]

    cs.LG cs.AI cs.DB

    DataAssist: A Machine Learning Approach to Data Cleaning and Preparation

    Authors: Kartikay Goyle, Quin Xie, Vakul Goyle

    Abstract: Current automated machine learning (ML) tools are model-centric, focusing on model selection and parameter optimization. However, the majority of the time in data analysis is devoted to data cleaning and wrangling, for which limited tools are available. Here we present DataAssist, an automated data preparation and cleaning platform that enhances dataset quality using ML-informed methods. We show t…

    Submitted 17 July, 2023; v1 submitted 13 July, 2023; originally announced July 2023.

  39. arXiv:2307.02078  [pdf, other]

    cs.CL

    Graph Contrastive Topic Model

    Authors: Zheheng Luo, Lei Liu, Qianqian Xie, Sophia Ananiadou

    Abstract: Existing NTMs with contrastive learning suffer from the sample bias problem owing to the word frequency-based sampling strategy, which may result in false negative samples with similar semantics to the prototypes. In this paper, we aim to explore the efficient sampling strategy and contrastive learning in NTMs to address the aforementioned issue. We propose a new sampling assumption that negative…

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: 17 pages, 4 figures

  40. arXiv:2306.16956  [pdf, other]

    cs.CL cs.AI

    MEMD-ABSA: A Multi-Element Multi-Domain Dataset for Aspect-Based Sentiment Analysis

    Authors: Hongjie Cai, Nan Song, Zengzhi Wang, Qiming Xie, Qiankun Zhao, Ke Li, Siwei Wu, Shijie Liu, Jianfei Yu, Rui Xia

    Abstract: Aspect-based sentiment analysis is a long-standing research interest in the field of opinion mining, and in recent years, researchers have gradually shifted their focus from simple ABSA subtasks to end-to-end multi-element ABSA tasks. However, the datasets currently used in the research are limited to individual elements of specific tasks, usually focusing on in-domain settings, ignoring implicit…

    Submitted 29 June, 2023; originally announced June 2023.

  41. arXiv:2306.16502  [pdf, other]

    stat.ML cs.LG math.OC

    Stochastic Methods in Variational Inequalities: Ergodicity, Bias and Refinements

    Authors: Emmanouil-Vasileios Vlatakis-Gkaragkounis, Angeliki Giannou, Yudong Chen, Qiaomin Xie

    Abstract: For min-max optimization and variational inequalities problems (VIP) encountered in diverse machine learning tasks, Stochastic Extragradient (SEG) and Stochastic Gradient Descent Ascent (SGDA) have emerged as preeminent algorithms. Constant step-size variants of SEG/SGDA have gained popularity, with appealing benefits such as easy tuning and rapid forgiveness of initial conditions, but their conve…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: 37 pages, 6 main figures

  42. arXiv:2306.16394  [pdf, ps, other]

    cs.LG

    Sharper Model-free Reinforcement Learning for Average-reward Markov Decision Processes

    Authors: Zihan Zhang, Qiaomin Xie

    Abstract: We develop several provably efficient model-free reinforcement learning (RL) algorithms for infinite-horizon average-reward Markov Decision Processes (MDPs). We consider both online setting and the setting with access to a simulator. In the online setting, we propose model-free RL algorithms based on reference-advantage decomposition. Our algorithm achieves…

    Submitted 28 June, 2023; originally announced June 2023.

  43. arXiv:2306.08041  [pdf, ps, other]

    cs.MA cs.AI cs.CR cs.GT cs.LG

    Data Poisoning to Fake a Nash Equilibrium in Markov Games

    Authors: Young Wu, Jeremy McMahan, Xiaojin Zhu, Qiaomin Xie

    Abstract: We characterize offline data poisoning attacks on Multi-Agent Reinforcement Learning (MARL), where an attacker may change a data set in an attempt to install a (potentially fictitious) unique Markov-perfect Nash equilibrium for a two-player zero-sum Markov game. We propose the unique Nash set, namely the set of games, specified by their Q functions, with a specific joint policy being the unique Na…

    Submitted 18 June, 2024; v1 submitted 13 June, 2023; originally announced June 2023.

  44. arXiv:2306.07520  [pdf, other]

    cs.CV

    Instruct-ReID: A Multi-purpose Person Re-identification Task with Instructions

    Authors: Weizhen He, Yiheng Deng, Shixiang Tang, Qihao Chen, Qingsong Xie, Yizhou Wang, Lei Bai, Feng Zhu, Rui Zhao, Wanli Ouyang, Donglian Qi, Yunfeng Yan

    Abstract: Human intelligence can retrieve any person according to both visual and language descriptions. However, the current computer vision community studies specific person re-identification (ReID) tasks in different scenarios separately, which limits the applications in the real world. This paper strives to resolve this problem by proposing a new instruct-ReID task that requires the model to retrieve im…

    Submitted 31 December, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

  45. arXiv:2306.06779  [pdf, other]

    cs.CL

    Multi-Source Test-Time Adaptation as Dueling Bandits for Extractive Question Answering

    Authors: Hai Ye, Qizhe Xie, Hwee Tou Ng

    Abstract: In this work, we study multi-source test-time model adaptation from user feedback, where K distinct models are established for adaptation. To allow efficient adaptation, we cast the problem as a stochastic decision-making process, aiming to determine the best adapted model after adaptation. We discuss two frameworks: multi-armed bandit learning and multi-armed dueling bandits. Compared to multi-ar…

    Submitted 11 June, 2023; originally announced June 2023.

    Comments: Main conference of ACL 2023

  46. arXiv:2306.05443  [pdf, other]

    cs.CL cs.AI

    PIXIU: A Large Language Model, Instruction Data and Evaluation Benchmark for Finance

    Authors: Qianqian Xie, Weiguang Han, Xiao Zhang, Yanzhao Lai, Min Peng, Alejandro Lopez-Lira, Jimin Huang

    Abstract: Although large language models (LLMs) have shown great performance on natural language processing (NLP) in the financial domain, there are no publicly available finance-tailored LLMs, instruction tuning datasets, and evaluation benchmarks, which are critical for continually pushing forward the open-source development of financial artificial intelligence (AI). This paper introduces PIXIU, a compre…

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: 12 pages, 1 figure

  47. arXiv:2306.01896  [pdf, other]

    cs.LG

    Learning to Stabilize Online Reinforcement Learning in Unbounded State Spaces

    Authors: Brahma S. Pavse, Matthew Zurek, Yudong Chen, Qiaomin Xie, Josiah P. Hanna

    Abstract: In many reinforcement learning (RL) applications, we want policies that reach desired states and then keep the controlled system within an acceptable region around the desired states over an indefinite period of time. This latter objective is called stability and is especially important when the state space is unbounded, such that the states can be arbitrarily far from each other and the agent can…

    Submitted 26 May, 2024; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: Accepted to International Conference on Machine Learning (ICML) 2024

  48. arXiv:2306.01868  [pdf, other]

    gr-qc astro-ph.HE hep-ph hep-th

    Boson Star Superradiance

    Authors: He-Yu Gao, Paul M. Saffin, Yi-Jie Wang, Qi-Xin Xie, Shuang-Yong Zhou

    Abstract: Recently, it has been realized that in some systems internal space rotation can induce energy amplification for scattering waves, similar to rotation in real space. Particularly, it has been shown that energy extraction is possible for a Q-ball, a stationary non-topological soliton that is coherently rotating in its field space. In this paper, we generalize the analysis to the case of boson stars,…

    Submitted 29 June, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

    Comments: 10 pages, 6 figures; v2: refs added and clarified

    Report number: USTC-ICTS/PCFT-23-16

  49. arXiv:2306.00603  [pdf, other]

    cs.LG cs.AI cs.RO

    Safe Offline Reinforcement Learning with Real-Time Budget Constraints

    Authors: Qian Lin, Bo Tang, Zifan Wu, Chao Yu, Shangqin Mao, Qianlong Xie, Xingxing Wang, Dong Wang

    Abstract: Aiming at promoting the safe real-world deployment of Reinforcement Learning (RL), research on safe RL has made significant progress in recent years. However, most existing works in the literature still focus on the online setting where risky violations of the safety budget are likely to be incurred during training. Besides, in many real-world applications, the learned policy is required to respon…

    Submitted 4 March, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: We propose a method to handle the constraint problem with dynamically determined safety budgets under the offline setting

  50. arXiv:2306.00196  [pdf, other]

    cs.LG math.OC math.PR stat.ML

    Restless Bandits with Average Reward: Breaking the Uniform Global Attractor Assumption

    Authors: Yige Hong, Qiaomin Xie, Yudong Chen, Weina Wang

    Abstract: We study the infinite-horizon restless bandit problem with the average reward criterion, in both discrete-time and continuous-time settings. A fundamental goal is to efficiently compute policies that achieve a diminishing optimality gap as the number of arms, $N$, grows large. Existing results on asymptotic optimality all rely on the uniform global attractor property (UGAP), a complex and challeng…

    Submitted 16 January, 2024; v1 submitted 31 May, 2023; originally announced June 2023.

    Comments: NeurIPS 2023. 35 pages, 8 figures

    MSC Class: 90C40

    ACM Class: G.3; I.6