Showing 151–200 of 2,055 results for author: Ma, S

  1. arXiv:2403.19716  [pdf, other]

    cs.CL cs.AI cs.CV cs.IR

    Capability-aware Prompt Reformulation Learning for Text-to-Image Generation

    Authors: Jingtao Zhan, Qingyao Ai, Yiqun Liu, Jia Chen, Shaoping Ma

    Abstract: Text-to-image generation systems have emerged as revolutionary tools in the realm of artistic creation, offering unprecedented ease in transforming textual prompts into visual art. However, the efficacy of these systems is intricately linked to the quality of user-provided prompts, which often poses a challenge to users unfamiliar with prompt crafting. This paper addresses this challenge by levera…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: Accepted at SIGIR 2024

  2. arXiv:2403.19251  [pdf, other]

    quant-ph eess.SY

    Arbitrary State Transition of Open Qubit System Based on Switching Control

    Authors: Guangpu Wu, Shibei Xue, Shan Ma, Sen Kuang, Daoyi Dong, Ian R. Petersen

    Abstract: We present a switching control strategy based on Lyapunov control for arbitrary state transitions in open qubit systems. With coherent vector representation, we propose a switching control strategy, which can prevent the state of the qubit from entering invariant sets and singular value sets, effectively driving the system ultimately to a sufficiently small neighborhood of target states. In compar…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 12 pages, 7 figures

  3. arXiv:2403.18405  [pdf, other]

    cs.AI cs.IR

    Leveraging Large Language Models for Relevance Judgments in Legal Case Retrieval

    Authors: Shengjie Ma, Chong Chen, Qi Chu, Jiaxin Mao

    Abstract: Collecting relevant judgments for legal case retrieval is a challenging and time-consuming task. Accurately judging the relevance between two legal cases requires a considerable effort to read the lengthy text and a high level of domain expertise to extract Legal Facts and make juridical judgments. With the advent of advanced large language models, some recent studies have suggested that it is pro…

    Submitted 27 March, 2024; originally announced March 2024.

  4. arXiv:2403.17827  [pdf, other]

    cs.CV cs.AI cs.GR cs.LG

    DiffH2O: Diffusion-Based Synthesis of Hand-Object Interactions from Textual Descriptions

    Authors: Sammy Christen, Shreyas Hampali, Fadime Sener, Edoardo Remelli, Tomas Hodan, Eric Sauser, Shugao Ma, Bugra Tekin

    Abstract: Generating natural hand-object interactions in 3D is challenging as the resulting hand and object motions are expected to be physically plausible and semantically meaningful. Furthermore, generalization to unseen objects is hindered by the limited scale of available hand-object interaction datasets. We propose DiffH2O, a novel method to synthesize realistic, one or two-handed object interactions f…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Project Page: https://diffh2o.github.io/

  5. arXiv:2403.17188  [pdf, other]

    cs.CV cs.CR

    LOTUS: Evasive and Resilient Backdoor Attacks through Sub-Partitioning

    Authors: Siyuan Cheng, Guanhong Tao, Yingqi Liu, Guangyu Shen, Shengwei An, Shiwei Feng, Xiangzhe Xu, Kaiyuan Zhang, Shiqing Ma, Xiangyu Zhang

    Abstract: Backdoor attack poses a significant security threat to Deep Learning applications. Existing attacks are often not evasive to established backdoor detection techniques. This susceptibility primarily stems from the fact that these attacks typically leverage a universal trigger pattern or transformation function, such that the trigger can cause misclassification for any input. In response to this, re…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)

  6. arXiv:2403.17007  [pdf, other]

    cs.CV

    DreamLIP: Language-Image Pre-training with Long Captions

    Authors: Kecheng Zheng, Yifei Zhang, Wei Wu, Fan Lu, Shuailei Ma, Xin Jin, Wei Chen, Yujun Shen

    Abstract: Language-image pre-training largely relies on how precisely and thoroughly a text describes its paired image. In practice, however, the contents of an image can be so rich that well describing them requires lengthy captions (e.g., with 10 sentences), which are usually missing in existing datasets. Consequently, there are currently no clear evidences on whether and how language-image pre-training c…

    Submitted 25 March, 2024; originally announced March 2024.

  7. arXiv:2403.16812  [pdf, other]

    cs.HC cs.AI

    Towards Human-AI Deliberation: Design and Evaluation of LLM-Empowered Deliberative AI for AI-Assisted Decision-Making

    Authors: Shuai Ma, Qiaoyi Chen, Xinru Wang, Chengbo Zheng, Zhenhui Peng, Ming Yin, Xiaojuan Ma

    Abstract: In AI-assisted decision-making, humans often passively review AI's suggestion and decide whether to accept or reject it as a whole. In such a paradigm, humans are found to rarely trigger analytical thinking and face difficulties in communicating the nuances of conflicting opinions to the AI when disagreements occur. To tackle this challenge, we propose Human-AI Deliberation, a novel framework to p…

    Submitted 25 March, 2024; originally announced March 2024.

  8. arXiv:2403.15709  [pdf, other]

    cs.CV cs.AI

    Contact-aware Human Motion Generation from Textual Descriptions

    Authors: Sihan Ma, Qiong Cao, Jing Zhang, Dacheng Tao

    Abstract: This paper addresses the problem of generating 3D interactive human motion from text. Given a textual description depicting the actions of different body parts in contact with objects, we synthesize sequences of 3D body poses that are visually natural and physically plausible. Yet, this task poses a significant challenge due to the inadequate consideration of interactions by physical contacts in b…

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: Project page: https://xymsh.github.io/RICH-CAT/

  9. arXiv:2403.15285  [pdf, other]

    cs.NI cs.CR cs.HC cs.LG

    Blockchain-based Pseudonym Management for Vehicle Twin Migrations in Vehicular Edge Metaverse

    Authors: Jiawen Kang, Xiaofeng Luo, Jiangtian Nie, Tianhao Wu, Haibo Zhou, Yonghua Wang, Dusit Niyato, Shiwen Mao, Shengli Xie

    Abstract: Driven by the great advances in metaverse and edge computing technologies, vehicular edge metaverses are expected to disrupt the current paradigm of intelligent transportation systems. As highly computerized avatars of Vehicular Metaverse Users (VMUs), the Vehicle Twins (VTs) deployed in edge servers can provide valuable metaverse services to improve driving safety and on-board satisfaction for th…

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 14 pages, 9 figures

  10. Statistical Inference For Noisy Matrix Completion Incorporating Auxiliary Information

    Authors: Shujie Ma, Po-Yao Niu, Yichong Zhang, Yinchu Zhu

    Abstract: This paper investigates statistical inference for noisy matrix completion in a semi-supervised model when auxiliary covariates are available. The model consists of two parts. One part is a low-rank matrix induced by unobserved latent factors; the other part models the effects of the observed covariates through a coefficient matrix which is composed of high-dimensional column vectors. We model the…

    Submitted 21 March, 2024; originally announced March 2024.

  11. The Galactic latitude dependency of Faraday complexity in the S-PASS/ATCA RM catalogue

    Authors: S. Ranchod, S. A. Mao, R. Deane, S. S. Sridhar, A. Damas-Segovia, J. D. Livingston, Y. K. Ma

    Abstract: The S-band Polarisation All Sky Survey (SPASS/ATCA) rotation measure (RM) catalogue is the largest broadband RM catalogue to date, increasing the RM density in the sparse southern sky. Through analysis of this catalogue, we report a latitude dependency of the Faraday complexity of polarised sources in this catalogue within 10$^\circ$ of the Galactic plane towards the inner Galaxy. In this study, w…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 16 pages, 16 figures

    Journal ref: A&A 686, A104 (2024)

  12. arXiv:2403.12840  [pdf, other]

    gr-qc

    Optical properties of Euler-Heisenberg black hole in the Cold Dark Matter Halo

    Authors: Lei You, Rui-bo Wang, Shi-Jie Ma, Jian-Bo Deng, Xian-Ru Hu

    Abstract: The optical properties of Euler-Heisenberg (EH) black hole (BH) surrounded by Cold Dark Matter (CDM) halo are investigated. By changing BH's parameters, we found that the radius of horizon r_{h} and radius of photon sphere r_{ph} will transparently increase as CDM halo parameters R and ρ increase. To show the influence of CDM halo on the BH's optical characteristics, we took two sets of R and ρ with…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 42 pages, 16 figures, 4 tables

  13. arXiv:2403.11417  [pdf, ps, other]

    eess.SP

    Positioning Using Wireless Networks: Applications, Recent Progress and Future Challenges

    Authors: Yang Yang, Mingzhe Chen, Yufei Blankenship, Jemin Lee, Zabih Ghassemlooy, Julian Cheng, Shiwen Mao

    Abstract: Positioning has recently received considerable attention as a key enabler in emerging applications such as extended reality, unmanned aerial vehicles and smart environments. These applications require both data communication and high-precision positioning, and thus they are particularly well-suited to be offered in wireless networks (WNs). The purpose of this paper is to provide a comprehensive ov…

    Submitted 17 March, 2024; originally announced March 2024.

  14. arXiv:2403.11152  [pdf, other]

    cs.CL cs.AI

    Evaluation Ethics of LLMs in Legal Domain

    Authors: Ruizhe Zhang, Haitao Li, Yueyue Wu, Qingyao Ai, Yiqun Liu, Min Zhang, Shaoping Ma

    Abstract: In recent years, the utilization of large language models for natural language dialogue has gained momentum, leading to their widespread adoption across various domains. However, their universal competence in addressing challenges specific to specialized fields such as law remains a subject of scrutiny. The incorporation of legal ethics into the model has been overlooked by researchers. We asserts…

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 10 pages, in processing of ACL 2024

  15. arXiv:2403.11102  [pdf, other]

    cs.NI eess.SP

    Jointly Optimizing Terahertz based Sensing and Communications in Vehicular Networks: A Dynamic Graph Neural Network Approach

    Authors: Xuefei Li, Mingzhe Chen, Ye Hu, Zhilong Zhang, Danpu Liu, Shiwen Mao

    Abstract: In this paper, the problem of vehicle service mode selection (sensing, communication, or both) and vehicle connections within terahertz (THz) enabled joint sensing and communications over vehicular networks is studied. The considered network consists of several service provider vehicles (SPVs) that can provide: 1) only sensing service, 2) only communication service, and 3) both services, sensing s…

    Submitted 17 March, 2024; originally announced March 2024.

  16. arXiv:2403.10172  [pdf, other]

    cs.HC

    Unpacking ICT-supported Social Connections and Support of Late-life Migration: From the Lens of Social Convoys

    Authors: Ying Lei, Shuai Ma, Yuling Sun

    Abstract: Migration and aging-related dilemmas have limited the opportunities for late-life migrants to rebuild social connections and access support. While research on migrants has drawn increasing attention in HCI, limited attention has been paid to the increasing number of late-life migrants. This paper reports a qualitative study examining the social connections and support of late-life migrants. In par…

    Submitted 15 March, 2024; originally announced March 2024.

  17. arXiv:2403.09805  [pdf, other]

    cs.CV cs.LG

    On the Utility of 3D Hand Poses for Action Recognition

    Authors: Md Salman Shamil, Dibyadip Chatterjee, Fadime Sener, Shugao Ma, Angela Yao

    Abstract: 3D hand pose is an underexplored modality for action recognition. Poses are compact yet informative and can greatly benefit applications with limited compute budgets. However, poses alone offer an incomplete understanding of actions, as they cannot fully capture objects and environments with which humans interact. We propose HandFormer, a novel multimodal transformer, to efficiently model hand-obj…

    Submitted 14 August, 2024; v1 submitted 14 March, 2024; originally announced March 2024.

    Comments: ECCV 2024; https://s-shamil.github.io/HandFormer/

  18. arXiv:2403.09552  [pdf, other]

    cs.HC

    "Are You Really Sure?" Understanding the Effects of Human Self-Confidence Calibration in AI-Assisted Decision Making

    Authors: Shuai Ma, Xinru Wang, Ying Lei, Chuhan Shi, Ming Yin, Xiaojuan Ma

    Abstract: In AI-assisted decision-making, it is crucial but challenging for humans to achieve appropriate reliance on AI. This paper approaches this problem from a human-centered perspective, "human self-confidence calibration". We begin by proposing an analytical framework to highlight the importance of calibrated human self-confidence. In our first study, we explore the relationship between human self-con…

    Submitted 14 March, 2024; originally announced March 2024.

  19. arXiv:2403.07376  [pdf, other]

    cs.CV cs.AI cs.CL cs.RO

    NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning

    Authors: Bingqian Lin, Yunshuang Nie, Ziming Wei, Jiaqi Chen, Shikui Ma, Jianhua Han, Hang Xu, Xiaojun Chang, Xiaodan Liang

    Abstract: Vision-and-Language Navigation (VLN), as a crucial research problem of Embodied AI, requires an embodied agent to navigate through complex 3D environments following natural language instructions. Recent research has highlighted the promising capacity of large language models (LLMs) in VLN by improving navigational reasoning accuracy and interpretability. However, their predominant use in an offlin…

    Submitted 12 March, 2024; originally announced March 2024.

  20. arXiv:2403.07354  [pdf, other]

    cs.CV

    BID: Boundary-Interior Decoding for Unsupervised Temporal Action Localization Pre-Training

    Authors: Qihang Fang, Chengcheng Tang, Shugao Ma, Yanchao Yang

    Abstract: Skeleton-based motion representations are robust for action localization and understanding for their invariance to perspective, lighting, and occlusion, compared with images. Yet, they are often ambiguous and incomplete when taken out of context, even for human annotators. As infants discern gestures before associating them with words, actions can be conceptualized before being grounded with label…

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 18 pages, 8 figures

    MSC Class: 68T45 ACM Class: I.4.8

  21. arXiv:2403.07274  [pdf, other]

    cs.IT eess.SP

    Achievable Rate Analysis and Optimization of Double-RIS Assisted Spatially Correlated MIMO with Statistical CSI

    Authors: Kaizhe Xu, Jiajia Guo, Jun Zhang, Shi Jin, Shaodan Ma

    Abstract: Reconfigurable intelligent surface (RIS) is a novel meta-material which can form a smart radio environment by dynamically altering reflection directions of the impinging electromagnetic waves. In the prior literature, the inter-RIS links which also contribute to the performance of the whole system are usually neglected when multiple RISs are deployed. In this paper we investigate a general double-…

    Submitted 11 March, 2024; originally announced March 2024.

  22. arXiv:2403.06579  [pdf, other]

    eess.SY

    Edge Information Hub: Orchestrating Satellites, UAVs, MEC, Sensing and Communications for 6G Closed-Loop Controls

    Authors: Chengleyang Lei, Wei Feng, Peng Wei, Yunfei Chen, Ning Ge, Shiwen Mao

    Abstract: An increasing number of field robots would be used for mission-critical tasks in remote or post-disaster areas. Due to the limited individual abilities, these robots usually require an edge information hub (EIH), with not only communication but also sensing and computing functions. Such EIH could be deployed on a flexibly-dispatched unmanned aerial vehicle (UAV). Different from traditional aerial…

    Submitted 24 August, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: 16 pages, 11 figures

  23. arXiv:2403.06259  [pdf, other]

    cs.CL cs.AI cs.DB cs.IR cs.LG

    Editing Conceptual Knowledge for Large Language Models

    Authors: Xiaohan Wang, Shengyu Mao, Ningyu Zhang, Shumin Deng, Yunzhi Yao, Yue Shen, Lei Liang, Jinjie Gu, Huajun Chen

    Abstract: Recently, there has been a growing interest in knowledge editing for Large Language Models (LLMs). Current approaches and evaluations merely explore the instance-level editing, while whether LLMs possess the capability to modify concepts remains unclear. This paper pioneers the investigation of editing conceptual knowledge for LLMs, by constructing a novel benchmark dataset ConceptEdit and establi…

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: Work in progress. Code: https://github.com/zjunlp/EasyEdit Dataset: https://huggingface.co/datasets/zjunlp/ConceptEdit

  24. Imaginary gap-closed points and dynamics in a class of dissipative systems

    Authors: Shicheng Ma, Heng Lin, Jinghui Pi

    Abstract: We investigate imaginary gap-closed (IGC) points and their associated dynamics in dissipative systems. In a general non-Hermitian model, we derive the equation governing the IGC points of the energy spectrum, establishing that these points are only determined by the Hermitian part of the Hamiltonian. Focusing on a class of one-dimensional dissipative chains, we explore quantum walks across differe…

    Submitted 2 July, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

    Comments: 11 pages, 8 figures

    Journal ref: Phys. Rev. B 109, 214311 (2024)

  25. arXiv:2403.05987  [pdf, other]

    astro-ph.GA astro-ph.IM astro-ph.SR

    ROME/REA: Three-year, Tri-color Timeseries Photometry of the Galactic Bulge

    Authors: R. A. Street, E. Bachelet, Y. Tsapras, M. P. G. Hundertmark, V. Bozza, D. M. Bramich, A. Cassan, M. Dominik, R. Figuera Jaimes, K. Horne, S. Mao, A. Saha, J. Wambsganss, Weicheng Zang

    Abstract: The ROME/REA (Robotic Observations of Microlensing Events/Reactive Event Assessment) Survey was a Key Project at Las Cumbres Observatory (hereafter LCO) which continuously monitored 20 selected fields (3.76 sq.deg.) in the Galactic Bulge throughout their seasonal visibility window over a three-year period, between March 2017 and March 2020. Observations were made in three optical passbands (SDSS-g…

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: Accepted for publication in PASP

  26. arXiv:2403.05826  [pdf, other]

    cs.NI eess.SP

    Cached Model-as-a-Resource: Provisioning Large Language Model Agents for Edge Intelligence in Space-air-ground Integrated Networks

    Authors: Minrui Xu, Dusit Niyato, Hongliang Zhang, Jiawen Kang, Zehui Xiong, Shiwen Mao, Zhu Han

    Abstract: Edge intelligence in space-air-ground integrated networks (SAGINs) can enable worldwide network coverage beyond geographical limitations for users to access ubiquitous and low-latency intelligence services. Facing global coverage and complex environments in SAGINs, edge intelligence can provision approximate large language models (LLMs) agents for users via edge servers at ground base stations (BS…

    Submitted 31 May, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  27. arXiv:2403.05567  [pdf, other]

    cs.HC

    A Unified Framework for Underwater Metaverse with Optical Perception

    Authors: Jingyang Cao, Mu Zhou, Jiacheng Wang, Guangyuan Liu, Dusit Niyato, Shiwen Mao, Zhu Han, Jiawen Kang

    Abstract: With the advancement of AI technology and increasing attention to deep-sea exploration, the underwater Metaverse is gradually emerging. This paper explores the concept of underwater Metaverse, emerging virtual reality systems and services aimed at simulating and enhancing virtual experience of marine environments. First, we discuss potential applications of underwater Metaverse in underwater scien…

    Submitted 20 February, 2024; originally announced March 2024.

  28. arXiv:2403.04272  [pdf, other]

    cs.CV

    Active Generalized Category Discovery

    Authors: Shijie Ma, Fei Zhu, Zhun Zhong, Xu-Yao Zhang, Cheng-Lin Liu

    Abstract: Generalized Category Discovery (GCD) is a pragmatic and challenging open-world task, which endeavors to cluster unlabeled samples from both novel and old classes, leveraging some labeled data of old classes. Given that knowledge learned from old classes is not fully transferable to new classes, and that novel categories are fully unlabeled, GCD inherently faces intractable problems, including imba…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  29. arXiv:2403.04259  [pdf, other]

    math.OC cs.LG

    Decentralized and Equitable Optimal Transport

    Authors: Ivan Lau, Shiqian Ma, César A. Uribe

    Abstract: This paper considers the decentralized (discrete) optimal transport (D-OT) problem. In this setting, a network of agents seeks to design a transportation plan jointly, where the cost function is the sum of privately held costs for each agent. We reformulate the D-OT problem as a constraint-coupled optimization problem and propose a single-loop decentralized algorithm with an iteration complexity o…

    Submitted 12 March, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Comments: Accepted to ACC 2024

  30. arXiv:2403.03809  [pdf, ps, other]

    eess.SP

    Variational Bayesian Learning based Joint Localization and Path Loss Exponent with Distance-dependent Noise in Wireless Sensor Network

    Authors: Yunfei Li, Yiting Luo, Weiqiang Tan, Chunguo Li, Shaodan Ma, Guanghua Yang

    Abstract: This paper focuses on the challenge of jointly optimizing location and path loss exponent (PLE) in distance-dependent noise. Departing from the conventional independent noise model used in localization and path loss exponent estimation problems, we consider a more realistic model incorporating distance-dependent noise variance, as revealed in recent theoretical analyses and experimental results. T…

    Submitted 20 July, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

  31. arXiv:2403.03736  [pdf, other]

    cs.CV cs.LG eess.IV

    Unifying Generation and Compression: Ultra-low bitrate Image Coding Via Multi-stage Transformer

    Authors: Naifu Xue, Qi Mao, Zijian Wang, Yuan Zhang, Siwei Ma

    Abstract: Recent progress in generative compression technology has significantly improved the perceptual quality of compressed data. However, these advancements primarily focus on producing high-frequency details, often overlooking the ability of generative models to capture the prior distribution of image content, thus impeding further bitrate reduction in extreme compression scenarios (<0.05 bpp). Motivat…

    Submitted 6 March, 2024; originally announced March 2024.

  32. arXiv:2403.03145  [pdf, other]

    cs.CV cs.LG cs.MM cs.SD eess.AS

    Dual Mean-Teacher: An Unbiased Semi-Supervised Framework for Audio-Visual Source Localization

    Authors: Yuxin Guo, Shijie Ma, Hu Su, Zhiqing Wang, Yuhao Zhao, Wei Zou, Siyang Sun, Yun Zheng

    Abstract: Audio-Visual Source Localization (AVSL) aims to locate sounding objects within video frames given the paired audio clips. Existing methods predominantly rely on self-supervised contrastive learning of audio-visual correspondence. Without any bounding-box annotations, they struggle to achieve precise localization, especially for small objects, and suffer from blurry boundaries and false positives.…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted to NeurIPS2023

  33. arXiv:2403.03095  [pdf, other]

    cs.CV cs.MM cs.SD eess.AS

    Cross Pseudo-Labeling for Semi-Supervised Audio-Visual Source Localization

    Authors: Yuxin Guo, Shijie Ma, Yuhao Zhao, Hu Su, Wei Zou

    Abstract: Audio-Visual Source Localization (AVSL) is the task of identifying specific sounding objects in the scene given audio cues. In our work, we focus on semi-supervised AVSL with pseudo-labeling. To address the issues with vanilla hard pseudo-labels including bias accumulation, noise sensitivity, and instability, we propose a novel method named Cross Pseudo-Labeling (XPL), wherein two models learn fro…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted To ICASSP2024

  34. arXiv:2403.03004  [pdf, other]

    astro-ph.CO gr-qc hep-ph

    Ultralight vector dark matter search using data from the KAGRA O3GK run

    Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, et al. (1778 additional authors not shown)

    Abstract: Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we prese…

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: 20 pages, 5 figures

    Report number: LIGO-P2300250

  35. arXiv:2403.02437  [pdf, other]

    cs.LG cs.AI cs.DC

    SoK: Challenges and Opportunities in Federated Unlearning

    Authors: Hyejun Jeong, Shiqing Ma, Amir Houmansadr

    Abstract: Federated learning (FL), introduced in 2017, facilitates collaborative learning between non-trusting parties with no need for the parties to explicitly share their data among themselves. This allows training models on user data while respecting privacy regulations such as GDPR and CPRA. However, emerging privacy requirements may mandate model owners to be able to \emph{forget} some learned data, e…

    Submitted 5 June, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  36. arXiv:2403.01791  [pdf, other]

    cs.HC cs.AI

    Beyond Recommender: An Exploratory Study of the Effects of Different AI Roles in AI-Assisted Decision Making

    Authors: Shuai Ma, Chenyi Zhang, Xinru Wang, Xiaojuan Ma, Ming Yin

    Abstract: Artificial Intelligence (AI) is increasingly employed in various decision-making tasks, typically as a Recommender, providing recommendations that the AI deems correct. However, recent studies suggest this may diminish human analytical thinking and lead to humans' inappropriate reliance on AI, impairing the synergy in human-AI teams. In contrast, human advisors in group decision-making perform var…

    Submitted 4 March, 2024; originally announced March 2024.

  37. arXiv:2403.01759  [pdf, other]

    cs.LG cs.CV

    Open-world Machine Learning: A Review and New Outlooks

    Authors: Fei Zhu, Shijie Ma, Zhen Cheng, Xu-Yao Zhang, Zhaoxiang Zhang, Cheng-Lin Liu

    Abstract: Machine learning has achieved remarkable success in many applications. However, existing studies are largely based on the closed-world assumption, which assumes that the environment is stationary, and the model is fixed once deployed. In many real-world applications, this fundamental and rather naive assumption may not hold because an open environment is complex, dynamic, and full of unknowns. In…

    Submitted 14 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  38. arXiv:2403.01093  [pdf, other]

    eess.SP

    Variational Bayesian Learning Based Localization and Channel Reconstruction in RIS-aided Systems

    Authors: Yunfei Li, Yiting Luo, Xianda Wu, Zheng Shi, Shaodan Ma, Guanghua Yang

    Abstract: The emerging immersive and autonomous services have posed stringent requirements on both communications and localization. By considering the great potential of reconfigurable intelligent surface (RIS), this paper focuses on the joint channel estimation and localization for RIS-aided wireless systems. As opposed to existing works that treat channel estimation and localization independently, this pa…

    Submitted 1 March, 2024; originally announced March 2024.

  39. arXiv:2402.19193  [pdf, ps, other]

    hep-ph

    Magnetic catalysis and diamagnetism from pion fluctuations

    Authors: Jie Mei, Rui Wen, Shijun Mao, Mei Huang, Kun Xu

    Abstract: In the framework of Nambu--Jona-Lasinio model beyond mean field approximation, the effects of pion fluctuations on (inverse) magnetic catalysis and magnetic susceptibility are studied. The negative magnetic susceptibility at low temperature is observed when contributions from both neutral and charged pions are taken into account. In weak field approximation, it is observed that at finite temperatu…

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 14 pages, 8 figures

  40. arXiv:2402.17764  [pdf, other]

    cs.CL cs.LG

    The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

    Authors: Shuming Ma, Hongyu Wang, Lingxiao Ma, Lei Wang, Wenhui Wang, Shaohan Huang, Li Dong, Ruiping Wang, Jilong Xue, Furu Wei

    Abstract: Recent research, such as BitNet, is paving the way for a new era of 1-bit Large Language Models (LLMs). In this work, we introduce a 1-bit LLM variant, namely BitNet b1.58, in which every single parameter (or weight) of the LLM is ternary {-1, 0, 1}. It matches the full-precision (i.e., FP16 or BF16) Transformer LLM with the same model size and training tokens in terms of both perplexity and end-t…

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: Work in progress

  41. arXiv:2402.16661  [pdf, other]

    stat.ML cs.LG stat.ME

    Penalized Generative Variable Selection

    Authors: Tong Wang, Jian Huang, Shuangge Ma

    Abstract: Deep networks are increasingly applied to a wide variety of data, including data with high-dimensional predictors. In such analysis, variable selection can be needed along with estimation/model building. Many of the existing deep network studies that incorporate variable selection have been limited to methodological and numerical developments. In this study, we consider modeling/estimation using t…

    Submitted 26 February, 2024; originally announced February 2024.

  42. arXiv:2402.16366  [pdf, other]

    cs.CV cs.MM

    SPC-NeRF: Spatial Predictive Compression for Voxel Based Radiance Field

    Authors: Zetian Song, Wenhong Duan, Yuhuai Zhang, Shiqi Wang, Siwei Ma, Wen Gao

    Abstract: Representing the Neural Radiance Field (NeRF) with the explicit voxel grid (EVG) is a promising direction for improving NeRFs. However, the EVG representation is not efficient for storage and transmission because of the terrific memory cost. Current methods for compressing EVG mainly inherit the methods designed for neural network compression, such as pruning and quantization, which do not take fu…

    Submitted 26 February, 2024; originally announced February 2024.

  43. arXiv:2402.15713  [pdf, other]

    cs.CL cs.AI

    Making Pre-trained Language Models Better Continual Few-Shot Relation Extractors

    Authors: Shengkun Ma, Jiale Han, Yi Liang, Bo Cheng

    Abstract: Continual Few-shot Relation Extraction (CFRE) is a practical problem that requires the model to continuously learn novel relations while avoiding forgetting old ones with few labeled training data. The primary challenges are catastrophic forgetting and overfitting. This paper harnesses prompt learning to explore the implicit capabilities of pre-trained language models to address the above two chal…

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: Accepted at COLING 2024

  44. arXiv:2402.15690  [pdf, other]

    cs.CL cs.AI

    Foot In The Door: Understanding Large Language Model Jailbreaking via Cognitive Psychology

    Authors: Zhenhua Wang, Wei Xie, Baosheng Wang, Enze Wang, Zhiwen Gui, Shuoyoucheng Ma, Kai Chen

    Abstract: Large Language Models (LLMs) have gradually become the gateway for people to acquire new knowledge. However, attackers can break the model's security protection ("jail") to access restricted information, which is called "jailbreaking." Previous studies have shown the weakness of current LLMs when confronted with such jailbreaking attacks. Nevertheless, comprehension of the intrinsic decision-makin…

    Submitted 23 February, 2024; originally announced February 2024.

  45. arXiv:2402.13959  [pdf, other]

    cs.IR

    Retention Induced Biases in a Recommendation System with Heterogeneous Users

    Authors: Shichao Ma

    Abstract: I examine a conceptual model of a recommendation system (RS) with user inflow and churn dynamics. When inflow and churn balance out, the user distribution reaches a steady state. Changing the recommendation algorithm alters the steady state and creates a transition period. During this period, the RS behaves differently from its new steady state. In particular, A/B experiment metrics obtained in tr…

    Submitted 6 March, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

  46. arXiv:2402.12903  [pdf, ps, other]

    math.AP

    Inverse problems for semilinear Schrödinger equations at large frequency via polynomial resolvent estimates on manifolds

    Authors: Katya Krupchyk, Shiqi Ma, Suman Kumar Sahoo, Mikko Salo, Simon St-Amant

    Abstract: We study inverse boundary problems for semilinear Schrödinger equations on smooth compact Riemannian manifolds of dimensions $\ge 2$ with smooth boundary, at a large fixed frequency. We show that certain classes of cubic nonlinearities are determined uniquely from the knowledge of the nonlinear Dirichlet-to-Neumann map at a large fixed frequency on quite general Riemannian manifolds. In particul…

    Submitted 20 February, 2024; originally announced February 2024.

  47. arXiv:2402.12474  [pdf, other]

    astro-ph.GA

    CGOLS V: Disk-wide Stellar Feedback and Observational Implications of the Cholla Galactic Wind Model

    Authors: Evan E. Schneider, S. Alwin Mao

    Abstract: We present the fifth simulation in the CGOLS project -- a set of isolated starburst galaxy simulations modeled over large scales ($10\,\mathrm{kpc}$) at uniformly high resolution ($\Delta x \approx 5\,\mathrm{pc}$). Supernova feedback in this simulation is implemented as a disk-wide distribution of clusters, and we assess the impact of this geometry on several features of the resulting outflow, including radial profiles of…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 22 pages, 13 figures, accepted in ApJ

  48. arXiv:2402.11422  [pdf, other]

    cs.CL

    Mitigating Catastrophic Forgetting in Multi-domain Chinese Spelling Correction by Multi-stage Knowledge Transfer Framework

    Authors: Peng Xing, Yinghui Li, Shirong Ma, Xinnian Liang, Haojing Huang, Yangning Li, Hai-Tao Zheng, Wenhao Jiang, Ying Shen

    Abstract: Chinese Spelling Correction (CSC) aims to detect and correct spelling errors in given sentences. Recently, multi-domain CSC has gradually attracted the attention of researchers because it is more practicable. In this paper, we focus on the key flaw of the CSC model when adapting to multi-domain scenarios: the tendency to forget previously acquired knowledge upon learning new domain-specific knowle…

    Submitted 17 February, 2024; originally announced February 2024.

  49. arXiv:2402.11420  [pdf, other]

    cs.CL

    Rethinking the Roles of Large Language Models in Chinese Grammatical Error Correction

    Authors: Yinghui Li, Shang Qin, Jingheng Ye, Shirong Ma, Yangning Li, Libo Qin, Xuming Hu, Wenhao Jiang, Hai-Tao Zheng, Philip S. Yu

    Abstract: Recently, Large Language Models (LLMs) have been widely studied by researchers for their roles in various downstream NLP tasks. As a fundamental task in the NLP field, Chinese Grammatical Error Correction (CGEC) aims to correct all potential grammatical errors in the input sentences. Previous studies have shown that LLMs' performance as correctors on CGEC remains unsatisfactory due to its challeng…

    Submitted 17 February, 2024; originally announced February 2024.

  50. arXiv:2402.11100  [pdf, other]

    cs.CL

    When LLMs Meet Cunning Texts: A Fallacy Understanding Benchmark for Large Language Models

    Authors: Yinghui Li, Qingyu Zhou, Yuanzhen Luo, Shirong Ma, Yangning Li, Hai-Tao Zheng, Xuming Hu, Philip S. Yu

    Abstract: Recently, Large Language Models (LLMs) make remarkable evolutions in language understanding and generation. Following this, various benchmarks for measuring all kinds of capabilities of LLMs have sprung up. In this paper, we challenge the reasoning and understanding abilities of LLMs by proposing a FaLlacy Understanding Benchmark (FLUB) containing cunning texts that are easy for humans to understa…

    Submitted 9 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.