Showing 101–150 of 998 results for author: Wei, Z

  1. arXiv:2404.13267  [pdf, other]

    cs.SI

    Demystify Adult Learning: A Social Network and Large Language Model Assisted Approach

    Authors: Fang Liu, Bosheng Ding, Chong Guan, Zhang Wei, Dusit Niyato, Justina Tan

    Abstract: Adult learning is increasingly recognized as a crucial way for personal development and societal progress. It however is challenging, and adult learners face unique challenges such as balancing education with other life responsibilities. Collecting feedback from adult learners is effective in understanding their concerns and improving learning experiences, and social networks provide a rich source… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: 6 pages, 3 figures

  2. arXiv:2404.12705  [pdf, other]

    eess.SP

    Integrated Sensing and Communication enabled Multiple Base Stations Cooperative UAV Detection

    Authors: Xi Lu, Zhiqing Wei, Ruizhong Xu, Lin Wang, Bohao Lu, Jinghui Piao

    Abstract: Integrated sensing and communication (ISAC) exhibits notable potential for sensing the unmanned aerial vehicles (UAVs), facilitating real-time monitoring of UAVs for security insurance. Due to the low sensing accuracy of single base stations (BSs), a cooperative UAV sensing method by multi-BS is proposed in this paper to achieve high-accuracy sensing. Specifically, a multiple signal classification… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  3. arXiv:2404.12686  [pdf, other]

    nucl-ex nucl-th

    The MONUMENT Experiment: Ordinary Muon Capture studies for 0$νββ$ decay

    Authors: Dhanurdhar Bajpai, Laura Baudis, Viacheslav Belov, Elisabetta Bossio, Thomas E. Cocolios, Hiroyasu Ejiri, Evgenii Sushenok, Maria Fomina, Izyan H. Hashim, Michael Heines, Konstantin Gusev, Sergej Kazartsev, Andreas Knecht, Elizabeth Mondragon, Ng Zheng Wei, Faiznur Othman, Igor Ostrovskiy, Gabriela R. Araujo, Nadyia Rumyantseva, Mario Schwarz, Stefan Schoenert, Mark Shirchenko, Egor Shevchik, Yury Shitov, Jouni Suhonen, et al. (4 additional authors not shown)

    Abstract: The MONUMENT experiment measures ordinary muon capture (OMC) on isotopes relevant for neutrinoless double-beta (0$νββ$) decay and nuclear astrophysics. OMC is a particularly attractive tool for improving the theoretical description of 0$νββ$ decay. It involves similar momentum transfers and allows testing the virtual transitions involved in 0$νββ$ decay against experimental data. During the 2021 c… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

    Comments: 18 pages, 20 figures, submitted to EPJC

  4. arXiv:2404.10464  [pdf, other]

    cs.CL cs.AI

    DESTEIN: Navigating Detoxification of Language Models via Universal Steering Pairs and Head-wise Activation Fusion

    Authors: Yu Li, Han Jiang, Chuanyang Gong, Zhihua Wei

    Abstract: Despite the remarkable achievements of language models (LMs) across a broad spectrum of tasks, their propensity for generating toxic outputs remains a prevalent concern. Current solutions involving finetuning or auxiliary models usually require extensive computational resources, hindering their practicality in large language models (LLMs). In this paper, we propose DeStein, a novel method that det… ▽ More

    Submitted 10 August, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

  5. arXiv:2404.10343  [pdf, other]

    cs.CV eess.IV

    The Ninth NTIRE 2024 Efficient Super-Resolution Challenge Report

    Authors: Bin Ren, Yawei Li, Nancy Mehta, Radu Timofte, Hongyuan Yu, Cheng Wan, Yuxin Hong, Bingnan Han, Zhuoyuan Wu, Yajun Zou, Yuqing Liu, Jizhe Li, Keji He, Chao Fan, Heng Zhang, Xiaolin Zhang, Xuanwu Yin, Kunlong Zuo, Bohao Liao, Peizhe Xia, Long Peng, Zhibo Du, Xin Di, Wangkai Li, Yang Wang, et al. (109 additional authors not shown)

    Abstract: This paper provides a comprehensive review of the NTIRE 2024 challenge, focusing on efficient single-image super-resolution (ESR) solutions and their outcomes. The task of this challenge is to super-resolve an input image with a magnification factor of x4 based on pairs of low and corresponding high-resolution images. The primary objective is to develop networks that optimize various aspects such… ▽ More

    Submitted 25 June, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: The report paper of NTIRE2024 Efficient Super-resolution, accepted by CVPRW2024

  6. arXiv:2404.05603  [pdf, other]

    cs.CV cs.AI

    Self-Explainable Affordance Learning with Embodied Caption

    Authors: Zhipeng Zhang, Zhimin Wei, Guolei Sun, Peng Wang, Luc Van Gool

    Abstract: In the field of visual affordance learning, previous methods mainly used abundant images or videos that delineate human behavior patterns to identify action possibility regions for object manipulation, with a variety of applications in robotic tasks. However, they encounter a main challenge of action ambiguity, illustrated by the vagueness like whether to beat or carry a drum, and the complexities… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  7. arXiv:2404.04990  [pdf, other]

    cs.CL

    MLaKE: Multilingual Knowledge Editing Benchmark for Large Language Models

    Authors: Zihao Wei, Jingcheng Deng, Liang Pang, Hanxing Ding, Huawei Shen, Xueqi Cheng

    Abstract: The extensive utilization of large language models (LLMs) underscores the crucial necessity for precise and contemporary knowledge embedded within their intrinsic parameters. Existing research on knowledge editing primarily concentrates on monolingual scenarios, neglecting the complexities presented by multilingual contexts and multi-hop reasoning. To address these challenges, our study introduces… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  8. arXiv:2404.01994  [pdf, other]

    cs.CV cs.CL cs.LG

    DELAN: Dual-Level Alignment for Vision-and-Language Navigation by Cross-Modal Contrastive Learning

    Authors: Mengfei Du, Binhao Wu, Jiwen Zhang, Zhihao Fan, Zejun Li, Ruipu Luo, Xuanjing Huang, Zhongyu Wei

    Abstract: Vision-and-Language navigation (VLN) requires an agent to navigate in unseen environment by following natural language instruction. For task completion, the agent needs to align and integrate various navigation modalities, including instruction, observation and navigation history. Existing works primarily concentrate on cross-modal attention at the fusion stage to achieve this objective. Neverthel… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted by LREC-COLING 2024

  9. arXiv:2404.00046  [pdf, ps, other]

    math.OC math.PR

    Partial Backorder Inventory System: Asymptotic Optimality and Demand Learning

    Authors: Andrew E. B. Lim, Zhao-Xuan Wei, Hanqin Zhang

    Abstract: We develop a stochastic inventory system which accounts for the limited patience of backlogged customers. While limited patience is a feature that is closer to the nature of unmet demand, our model also unifies the classic backlogging and lost-sales inventory systems which are special cases of the one we propose. We establish the uniform (asymptotic) optimality of the base-stock policy when both d… ▽ More

    Submitted 25 March, 2024; originally announced April 2024.

  10. arXiv:2403.16189  [pdf, other]

    cs.NI

    Interference Management for Integrated Sensing and Communication Systems: A Survey

    Authors: Yangyang Niu, Zhiqing Wei, Lin Wang, Huici Wu, Zhiyong Feng

    Abstract: Emerging applications such as autonomous driving and Internet of things (IoT) services put forward the demand for simultaneous sensing and communication functions in the same system. Integrated sensing and communication (ISAC) has the potential to meet the demands of ubiquitous communication and high-precision sensing due to the advantages of spectrum and hardware resource sharing, as well as the m… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  11. arXiv:2403.15059  [pdf, other]

    cs.CV cs.AI

    MM-Diff: High-Fidelity Image Personalization via Multi-Modal Condition Integration

    Authors: Zhichao Wei, Qingkun Su, Long Qin, Weizhi Wang

    Abstract: Recent advances in tuning-free personalized image generation based on diffusion models are impressive. However, to improve subject fidelity, existing methods either retrain the diffusion model or infuse it with dense visual embeddings, both of which suffer from poor generalization and efficiency. Also, these methods falter in multi-subject image generation due to the unconstrained cross-attention… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

  12. arXiv:2403.14192  [pdf, ps, other]

    cs.IT eess.SP

    Fundamentals of Delay-Doppler Communications: Practical Implementation and Extensions to OTFS

    Authors: Shuangyang Li, Peter Jung, Weijie Yuan, Zhiqiang Wei, Jinhong Yuan, Baoming Bai, Giuseppe Caire

    Abstract: The recently proposed orthogonal time frequency space (OTFS) modulation, which is a typical Delay-Doppler (DD) communication scheme, has attracted significant attention thanks to its appealing performance over doubly-selective channels. In this paper, we present the fundamentals of general DD communications from the viewpoint of the Zak transform. We start our study by constructing DD domain basis… ▽ More

    Submitted 21 March, 2024; originally announced March 2024.

  13. arXiv:2403.13678  [pdf, other]

    cs.CV

    AUD-TGN: Advancing Action Unit Detection with Temporal Convolution and GPT-2 in Wild Audiovisual Contexts

    Authors: Jun Yu, Zerui Zhang, Zhihong Wei, Gongpeng Zhao, Zhongpeng Cai, Yongqi Wang, Guochen Xie, Jichao Zhu, Wangyuan Zhu

    Abstract: Leveraging the synergy of both audio data and visual data is essential for understanding human emotions and behaviors, especially in in-the-wild setting. Traditional methods for integrating such multimodal information often stumble, leading to less-than-ideal outcomes in the task of facial action unit detection. To overcome these shortcomings, we propose a novel approach utilizing audio-visual mul… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  14. arXiv:2403.12648  [pdf, other]

    cs.DS

    Revisiting Local Computation of PageRank: Simple and Optimal

    Authors: Hanzhi Wang, Zhewei Wei, Ji-Rong Wen, Mingji Yang

    Abstract: We revisit the classic local graph exploration algorithm ApproxContributions proposed by Andersen, Borgs, Chayes, Hopcroft, Mirrokni, and Teng (WAW '07, Internet Math. '08) for computing an $ε$-approximation of the PageRank contribution vector for a target node $t$ on a graph with $n$ nodes and $m$ edges. We give a worst-case complexity bound of ApproxContributions as… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 30 pages, 3 figures, full version of a STOC 2024 paper

  15. arXiv:2403.12425  [pdf, other]

    cs.CV cs.SD eess.AS

    Multimodal Fusion Method with Spatiotemporal Sequences and Relationship Learning for Valence-Arousal Estimation

    Authors: Jun Yu, Gongpeng Zhao, Yongqi Wang, Zhihong Wei, Yang Zheng, Zerui Zhang, Zhongpeng Cai, Guochen Xie, Jichao Zhu, Wangyuan Zhu

    Abstract: This paper presents our approach for the VA (Valence-Arousal) estimation task in the ABAW6 competition. We devised a comprehensive model by preprocessing video frames and audio segments to extract visual and audio features. Through the utilization of Temporal Convolutional Network (TCN) modules, we effectively captured the temporal and spatial correlations between these features. Subsequently, we… ▽ More

    Submitted 20 March, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: 8 pages, 3 figures

  16. arXiv:2403.11942  [pdf, other]

    cs.CV cs.AI

    Exploring Facial Expression Recognition through Semi-Supervised Pretraining and Temporal Modeling

    Authors: Jun Yu, Zhihong Wei, Zhongpeng Cai, Gongpeng Zhao, Zerui Zhang, Yongqi Wang, Guochen Xie, Jichao Zhu, Wangyuan Zhu

    Abstract: Facial Expression Recognition (FER) plays a crucial role in computer vision and finds extensive applications across various fields. This paper aims to present our approach for the upcoming 6th Affective Behavior Analysis in-the-Wild (ABAW) competition, scheduled to be held at CVPR2024. In the facial expression recognition task, the limited size of the FER dataset poses a challenge to the expressio… ▽ More

    Submitted 19 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  17. arXiv:2403.11114  [pdf, other]

    cs.LG cs.AI

    Phasic Diversity Optimization for Population-Based Reinforcement Learning

    Authors: Jingcheng Jiang, Haiyin Piao, Yu Fu, Yihang Hao, Chuanlu Jiang, Ziqi Wei, Xin Yang

    Abstract: Reviewing the previous work of diversity Reinforcement Learning, diversity is often obtained via an augmented loss function, which requires a balance between reward and diversity. Generally, diversity optimization algorithms use Multi-armed Bandits algorithms to select the coefficient in the pre-defined space. However, the dynamic distribution of reward signals for MABs or the conflict between qualit… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 7 pages, 4 figures

    MSC Class: 14J60 (Primary); ACM Class: I.2.9

  18. arXiv:2403.10815  [pdf, other]

    eess.IV cs.CV

    MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections

    Authors: Mude Hui, Zihao Wei, Hongru Zhu, Fei Xia, Yuyin Zhou

    Abstract: Volumetric optical microscopy using non-diffracting beams enables rapid imaging of 3D volumes by projecting them axially to 2D images but lacks crucial depth information. Addressing this, we introduce MicroDiffusion, a pioneering tool facilitating high-quality, depth-resolved 3D volume reconstruction from limited 2D projections. While existing Implicit Neural Representation (INR) models often yiel… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR2024

  19. arXiv:2403.09733  [pdf, other]

    cs.CL cs.AI

    OverleafCopilot: Empowering Academic Writing in Overleaf with Large Language Models

    Authors: Haomin Wen, Zhenjie Wei, Yan Lin, Jiyuan Wang, Yuxuan Liang, Huaiyu Wan

    Abstract: The rapid development of Large Language Models (LLMs) has facilitated a variety of applications from different domains. In this technical report, we explore the integration of LLMs and the popular academic writing tool, Overleaf, to enhance the efficiency and quality of academic writing. To achieve the above goal, there are three challenges: i) including seamless interaction between Overleaf and L… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  20. arXiv:2403.08010  [pdf, other]

    cs.CL

    Debatrix: Multi-dimensional Debate Judge with Iterative Chronological Analysis Based on LLM

    Authors: Jingcong Liang, Rong Ye, Meng Han, Ruofei Lai, Xinyu Zhang, Xuanjing Huang, Zhongyu Wei

    Abstract: How can we construct an automated debate judge to evaluate an extensive, vibrant, multi-turn debate? This task is challenging, as judging a debate involves grappling with lengthy texts, intricate argument relationships, and multi-dimensional assessments. At the same time, current research mainly focuses on short dialogues, rarely touching upon the evaluation of an entire debate. In this paper, by… ▽ More

    Submitted 19 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  21. arXiv:2403.07504  [pdf]

    cond-mat.supr-con cond-mat.mtrl-sci cond-mat.str-el

    Two-dimensional phase diagram of the charge density wave in doped CsV$_3$Sb$_5$

    Authors: Linwei Huai, Hongyu Li, Yulei Han, Yang Luo, Shuting Peng, Zhiyuan Wei, Jianchang Shen, Bingqian Wang, Yu Miao, Xiupeng Sun, Zhipeng Ou, Bo Liu, Xiaoxiao Yu, Ziji Xiang, Min-Quan Kuang, Zhenhua Qiao, Xianhui Chen, Junfeng He

    Abstract: Kagome superconductors AV$_3$Sb$_5$ (A = K, Rb and Cs) have attracted much recent attention due to the coexistence of multiple exotic orders. Among them, the charge density wave (CDW) order has been shown to host various unconventional behaviors. Here, we investigate the CDW order by a combination of both bulk and surface doping methods. While element substitutions in bulk doping change both carri… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 14 pages, 4 figures

    Journal ref: npj Quantum Mater. 9, 23 (2024)

  22. arXiv:2403.07376  [pdf, other]

    cs.CV cs.AI cs.CL cs.RO

    NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning

    Authors: Bingqian Lin, Yunshuang Nie, Ziming Wei, Jiaqi Chen, Shikui Ma, Jianhua Han, Hang Xu, Xiaojun Chang, Xiaodan Liang

    Abstract: Vision-and-Language Navigation (VLN), as a crucial research problem of Embodied AI, requires an embodied agent to navigate through complex 3D environments following natural language instructions. Recent research has highlighted the promising capacity of large language models (LLMs) in VLN by improving navigational reasoning accuracy and interpretability. However, their predominant use in an offlin… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  23. arXiv:2403.07246  [pdf, other]

    cs.CV

    Towards Zero-shot Human-Object Interaction Detection via Vision-Language Integration

    Authors: Weiying Xue, Qi Liu, Qiwei Xiong, Yuxiao Wang, Zhenao Wei, Xiaofen Xing, Xiangmin Xu

    Abstract: Human-object interaction (HOI) detection aims to locate human-object pairs and identify their interaction categories in images. Most existing methods primarily focus on supervised learning, which relies on extensive manual HOI annotations. In this paper, we propose a novel framework, termed Knowledge Integration to HOI (KI2HOI), that effectively integrates the knowledge of visual-language model to… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  24. arXiv:2403.06754  [pdf, other]

    cs.CL cs.AI cs.LG

    ALaRM: Align Language Models via Hierarchical Rewards Modeling

    Authors: Yuhang Lai, Siyuan Wang, Shujun Liu, Xuanjing Huang, Zhongyu Wei

    Abstract: We introduce ALaRM, the first framework modeling hierarchical rewards in reinforcement learning from human feedback (RLHF), which is designed to enhance the alignment of large language models (LLMs) with human preferences. The framework addresses the limitations of current alignment approaches, which often struggle with the inconsistency and sparsity of human supervision signals, by integrating ho… ▽ More

    Submitted 16 March, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: 15 pages, 6 figures

  25. Efficient Algorithms for Personalized PageRank Computation: A Survey

    Authors: Mingji Yang, Hanzhi Wang, Zhewei Wei, Sibo Wang, Ji-Rong Wen

    Abstract: Personalized PageRank (PPR) is a traditional measure for node proximity on large graphs. For a pair of nodes $s$ and $t$, the PPR value $π_s(t)$ equals the probability that an $α$-discounted random walk from $s$ terminates at $t$ and reflects the importance between $s$ and $t$ in a bidirectional way. As a generalization of Google's celebrated PageRank centrality, PPR has been extensively studied a… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 20 pages, "accepted version" of an article accepted by IEEE Transactions on Knowledge and Data Engineering (TKDE) for publication

  26. arXiv:2403.04204  [pdf, other]

    cs.AI cs.CL

    On the Essence and Prospect: An Investigation of Alignment Approaches for Big Models

    Authors: Xinpeng Wang, Shitong Duan, Xiaoyuan Yi, Jing Yao, Shanlin Zhou, Zhihua Wei, Peng Zhang, Dongkuan Xu, Maosong Sun, Xing Xie

    Abstract: Big models have achieved revolutionary breakthroughs in the field of AI, but they might also pose potential concerns. Addressing such concerns, alignment technologies were introduced to make these models conform to human preferences and values. Despite considerable advancements in the past year, various challenges lie in establishing the optimal alignment strategy, such as data cost and scalable o… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 23 pages, 7 figures

  27. arXiv:2403.04066  [pdf, ps, other]

    cs.CV

    LoDisc: Learning Global-Local Discriminative Features for Self-Supervised Fine-Grained Visual Recognition

    Authors: Jialu Shi, Zhiqiang Wei, Jie Nie, Lei Huang

    Abstract: Self-supervised contrastive learning strategy has attracted remarkable attention due to its exceptional ability in representation learning. However, current contrastive learning tends to learn global coarse-grained representations of the image that benefit generic object recognition, whereas such coarse-grained features are insufficient for fine-grained visual recognition. In this paper, we presen… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 11 pages, submitted

    MSC Class: 68U10; ACM Class: I.4

  28. arXiv:2403.02713  [pdf, other]

    cs.CL cs.CV cs.HC cs.LG

    Android in the Zoo: Chain-of-Action-Thought for GUI Agents

    Authors: Jiwen Zhang, Jihao Wu, Yihua Teng, Minghui Liao, Nuo Xu, Xiao Xiao, Zhongyu Wei, Duyu Tang

    Abstract: Large language model (LLM) leads to a surge of autonomous GUI agents for smartphone, which completes a task triggered by natural language through predicting a sequence of actions of API. Even though the task highly relies on past actions and visual observations, existing studies typically consider little semantic information carried out by intermediate screenshots and screen operations. To address… ▽ More

    Submitted 12 July, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: Dataset could be found in https://github.com/IMNearth/CoAT

  29. arXiv:2403.02565  [pdf, other]

    eess.SP

    Deep Cooperation in ISAC System: Resource, Node and Infrastructure Perspectives

    Authors: Zhiqing Wei, Haotian Liu, Zhiyong Feng, Huici Wu, Fan Liu, Qixun Zhang, Yucong Du

    Abstract: With the emerging Integrated Sensing and Communication (ISAC) technique, exploiting the mobile communication system with multi-domain resources, multiple network elements, and large-scale infrastructures to realize cooperative sensing is a crucial approach satisfying the requirements of high-accuracy and large-scale sensing in IoE. In this article, the deep cooperation in ISAC system including thr… ▽ More

    Submitted 29 May, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: 8 pages and 6 figures, Accepted by IEEE Internet of Things Magazine

  30. arXiv:2403.01840  [pdf, other]

    cs.CV cs.AI

    FreeA: Human-object Interaction Detection using Free Annotation Labels

    Authors: Yuxiao Wang, Zhenao Wei, Xinyu Jiang, Yu Lei, Weiying Xue, Jinxiu Liu, Qi Liu

    Abstract: Recent human-object interaction (HOI) detection approaches rely on high cost of manpower and require comprehensive annotated image datasets. In this paper, we propose a novel self-adaption language-driven HOI detection method, termed as FreeA, without labeling by leveraging the adaptability of CLIP to generate latent HOI labels. To be specific, FreeA matches image features of human-object pairs wi… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 11 pages, 7 figures, 6 tables

  31. arXiv:2403.00485  [pdf, other]

    cs.LG

    A Survey of Geometric Graph Neural Networks: Data Structures, Models and Applications

    Authors: Jiaqi Han, Jiacheng Cen, Liming Wu, Zongzhao Li, Xiangzhe Kong, Rui Jiao, Ziyang Yu, Tingyang Xu, Fandi Wu, Zihe Wang, Hongteng Xu, Zhewei Wei, Yang Liu, Yu Rong, Wenbing Huang

    Abstract: Geometric graph is a special kind of graph with geometric features, which is vital to model many scientific problems. Unlike generic graphs, geometric graphs often exhibit physical symmetries of translations, rotations, and reflections, making them ineffectively processed by current Graph Neural Networks (GNNs). To tackle this issue, researchers proposed a variety of Geometric Graph Neural Network… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  32. arXiv:2402.16333  [pdf, other]

    cs.CY cs.CL

    Unveiling the Truth and Facilitating Change: Towards Agent-based Large-scale Social Movement Simulation

    Authors: Xinyi Mou, Zhongyu Wei, Xuanjing Huang

    Abstract: Social media has emerged as a cornerstone of social movements, wielding significant influence in driving societal change. Simulating the response of the public and forecasting the potential impact has become increasingly important. However, existing methods for simulating such phenomena encounter challenges concerning their efficacy and efficiency in capturing the behaviors of social movement part… ▽ More

    Submitted 17 June, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: Accepted to findings of ACL 2024

  33. arXiv:2402.16026  [pdf]

    cs.LG

    Feature Selection Based on Orthogonal Constraints and Polygon Area

    Authors: Zhenxing Zhang, Jun Ge, Zheng Wei, Chunjie Zhou, Yilei Wang

    Abstract: The goal of feature selection is to choose the optimal subset of features for a recognition task by evaluating the importance of each feature, thereby achieving effective dimensionality reduction. Currently, proposed feature selection methods often overlook the discriminative dependencies between features and labels. To address this problem, this paper introduces a novel orthogonal regression mode… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

  34. arXiv:2402.15152  [pdf, other]

    cs.LG cs.AI cs.CR math.OC

    On the Duality Between Sharpness-Aware Minimization and Adversarial Training

    Authors: Yihao Zhang, Hangzhou He, Jingyu Zhu, Huanran Chen, Yifei Wang, Zeming Wei

    Abstract: Adversarial Training (AT), which adversarially perturbs the input samples during training, has been acknowledged as one of the most effective defenses against adversarial attacks, yet suffers from inevitably decreased clean accuracy. Instead of perturbing the samples, Sharpness-Aware Minimization (SAM) perturbs the model weights during training to find a more flat loss landscape and improve general… ▽ More

    Submitted 5 June, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

    Comments: ICML 2024

  35. arXiv:2402.13048  [pdf, other]

    cs.CL

    Stable Knowledge Editing in Large Language Models

    Authors: Zihao Wei, Liang Pang, Hanxing Ding, Jingcheng Deng, Huawei Shen, Xueqi Cheng

    Abstract: Efficient knowledge editing of large language models is crucial for replacing obsolete information or incorporating specialized knowledge on a large scale. However, previous methods implicitly assume that knowledge is localized and isolated within the model, an assumption that oversimplifies the interconnected nature of model knowledge. The premise of localization results in an incomplete knowledg… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  36. arXiv:2402.13022  [pdf, other]

    cs.CL cs.MM

    SoMeLVLM: A Large Vision Language Model for Social Media Processing

    Authors: Xinnong Zhang, Haoyu Kuang, Xinyi Mou, Hanjia Lyu, Kun Wu, Siming Chen, Jiebo Luo, Xuanjing Huang, Zhongyu Wei

    Abstract: The growth of social media, characterized by its multimodal nature, has led to the emergence of diverse phenomena and challenges, which calls for an effective approach to uniformly solve automated tasks. The powerful Large Vision Language Models make it possible to handle a variety of tasks simultaneously, but even with carefully designed prompting methods, the general domain models often fall sho… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  37. arXiv:2402.11443  [pdf, other]

    cs.CL

    Benchmark Self-Evolving: A Multi-Agent Framework for Dynamic LLM Evaluation

    Authors: Siyuan Wang, Zhuohan Long, Zhihao Fan, Zhongyu Wei, Xuanjing Huang

    Abstract: This paper presents a benchmark self-evolving framework to dynamically evaluate rapidly advancing Large Language Models (LLMs), aiming for a more accurate assessment of their capabilities and limitations. We utilize a multi-agent system to manipulate the context or question of original instances, reframing new evolving instances with high confidence that dynamically extend existing benchmarks. Tow… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

  38. arXiv:2402.11442  [pdf, other]

    cs.CL

    Can LLMs Reason with Rules? Logic Scaffolding for Stress-Testing and Improving LLMs

    Authors: Siyuan Wang, Zhongyu Wei, Yejin Choi, Xiang Ren

    Abstract: Large language models (LLMs) have achieved impressive human-like performance across various reasoning tasks. However, their mastery of underlying inferential rules still falls short of human capabilities. To investigate this, we propose a logic scaffolding inferential rule generation framework, to construct an inferential rule base, ULogic, comprising both primitive and compositional rules across… ▽ More

    Submitted 20 June, 2024; v1 submitted 17 February, 2024; originally announced February 2024.

    Comments: Accepted as a long paper to ACL 2024 Main

  39. arXiv:2402.10612  [pdf, other]

    cs.CL

    Retrieve Only When It Needs: Adaptive Retrieval Augmentation for Hallucination Mitigation in Large Language Models

    Authors: Hanxing Ding, Liang Pang, Zihao Wei, Huawei Shen, Xueqi Cheng

    Abstract: Hallucinations pose a significant challenge for the practical implementation of large language models (LLMs). The utilization of parametric knowledge in generating factual content is constrained by the limited knowledge of LLMs, potentially resulting in internal hallucinations. While incorporating external information can help fill knowledge gaps, it also introduces the risk of irrelevant informat… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  40. arXiv:2402.09742  [pdf, other]

    cs.CL

    AI Hospital: Benchmarking Large Language Models in a Multi-agent Medical Interaction Simulator

    Authors: Zhihao Fan, Jialong Tang, Wei Chen, Siyuan Wang, Zhongyu Wei, Jun Xi, Fei Huang, Jingren Zhou

    Abstract: Artificial intelligence has significantly advanced healthcare, particularly through large language models (LLMs) that excel in medical question answering benchmarks. However, their real-world clinical application remains limited due to the complexities of doctor-patient interactions. To address this, we introduce AI Hospital, a multi-agent framework simulating dynamic medical interactions… ▽ More

    Submitted 27 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

    Comments: https://github.com/LibertFan/AI_Hospital

  41. arXiv:2402.09694  [pdf, other]

    cs.CV

    Seed Optimization with Frozen Generator for Superior Zero-shot Low-light Enhancement

    Authors: Yuxuan Gu, Yi Jin, Ben Wang, Zhixiang Wei, Xiaoxiao Ma, Pengyang Ling, Haoxuan Wang, Huaian Chen, Enhong Chen

    Abstract: In this work, we observe that the generators, which are pre-trained on massive natural images, inherently hold the promising potential for superior low-light image enhancement against varying scenarios. Specifically, we embed a pre-trained generator to Retinex model to produce reflectance maps with enhanced detail and vividness, thereby recovering features degraded by low-light conditions. Taking on… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  42. arXiv:2402.08834  [pdf, other]

    cond-mat.dis-nn cond-mat.mtrl-sci

    Machine Learning Potential Powered Insights into the Mechanical Stability of Amorphous Li-Si Alloys

    Authors: Zixiong Wei, Nongnuch Artrith

    Abstract: Understanding the mechanical properties of solid-state materials at the atomic scale is crucial for developing novel materials. For example, amorphous LiSi alloys are attractive anode materials for solid-state Li-ion batteries but face mechanical instabilities due to significant volume variations with changing Li content. A fundamental grasp of the mechanical behavior in such systems is essential…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 22 pages, 12 figures

  43. arXiv:2402.08085  [pdf, other]

    cs.LG cs.AI cs.CG

    Message Detouring: A Simple Yet Effective Cycle Representation for Expressive Graph Learning

    Authors: Ziquan Wei, Tingting Dan, Guorong Wu

    Abstract: Graph learning is crucial in the fields of bioinformatics, social networks, and chemicals. Although high-order graphlets, such as cycles, are critical to achieving an informative graph representation for node classification, edge prediction, and graph recognition, modeling high-order topological characteristics poses significant computational challenges, restricting its widespread applications in…

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: 16 pages, 5 figures

  44. arXiv:2402.06663  [pdf, other]

    cs.CR cs.AI

    Explainable Adversarial Learning Framework on Physical Layer Secret Keys Combating Malicious Reconfigurable Intelligent Surface

    Authors: Zhuangkun Wei, Wenxiu Hu, Weisi Guo

    Abstract: The development of reconfigurable intelligent surfaces (RIS) is a double-edged sword to physical layer security (PLS). Whilst a legitimate RIS can yield beneficial impacts including increased channel randomness to enhance physical layer secret key generation (PL-SKG), malicious RIS can poison legitimate channels and crack most of existing PL-SKGs. In this work, we propose an adversarial learning f…

    Submitted 6 February, 2024; originally announced February 2024.

  45. arXiv:2402.06255  [pdf, other]

    cs.LG cs.AI cs.CL cs.CR

    Fight Back Against Jailbreaking via Prompt Adversarial Tuning

    Authors: Yichuan Mo, Yuji Wang, Zeming Wei, Yisen Wang

    Abstract: While Large Language Models (LLMs) have achieved tremendous success in various applications, they are also susceptible to jailbreak attacks. Several primary defense strategies have been proposed to protect LLMs from producing harmful information, mostly with a particular focus on harmful content filtering or heuristical defensive prompt designs. However, how to achieve intrinsic robustness through…

    Submitted 21 August, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

  46. arXiv:2402.05780  [pdf, other]

    quant-ph cond-mat.stat-mech hep-th math-ph

    Magic Class and the Convolution Group

    Authors: Kaifeng Bu, Arthur Jaffe, Zixia Wei

    Abstract: The classification of many-body quantum states plays a fundamental role in the study of quantum phases of matter. In this work, we propose an approach to classify quantum states by introducing the concept of magic class. In addition, we introduce an efficient coarse-graining procedure to extract the magic feature of states, which we call the "convolution group (CG)." We classify quantum states i…

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 6+2 pages

  47. arXiv:2402.05390  [pdf, other]

    cs.NI eess.SP

    Integrated Sensing and Communication Driven Digital Twin for Intelligent Machine Network

    Authors: Zhiqing Wei, Yucong Du, Qixun Zhang, Wangjun Jiang, Yanpeng Cui, Zeyang Meng, Huici Wu, Zhiyong Feng

    Abstract: Intelligent machines (IMs), including industrial machines, unmanned aerial vehicles (UAVs), and unmanned vehicles, etc., could perform effective cooperation in complex environment when they form IM network. The efficient environment sensing and communication are crucial for IM network, enabling the real-time and stable control of IMs. With the emergence of integrated sensing and communication (ISA…

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 9 pages, 5 figures, 1 Table

    ACM Class: C.2.1

  48. arXiv:2402.04485  [pdf, other]

    cs.LG cs.GT

    Incentivized Truthful Communication for Federated Bandits

    Authors: Zhepei Wei, Chuanhao Li, Tianze Ren, Haifeng Xu, Hongning Wang

    Abstract: To enhance the efficiency and practicality of federated bandit learning, recent advances have introduced incentives to motivate communication among clients, where a client participates only when the incentive offered by the server outweighs its participation cost. However, existing incentive mechanisms naively assume the clients are truthful: they all report their true cost and thus the higher cos…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 20 pages, 2 figures. Accepted at ICLR 2024

  49. arXiv:2402.02105  [pdf, other]

    cs.CV

    ParZC: Parametric Zero-Cost Proxies for Efficient NAS

    Authors: Peijie Dong, Lujun Li, Xinglin Pan, Zimian Wei, Xiang Liu, Qiang Wang, Xiaowen Chu

    Abstract: Recent advancements in Zero-shot Neural Architecture Search (NAS) highlight the efficacy of zero-cost proxies in various NAS benchmarks. Several studies propose the automated design of zero-cost proxies to achieve SOTA performance but require tedious searching progress. Furthermore, we identify a critical issue with current zero-cost proxies: they aggregate node-wise zero-cost statistics without c…

    Submitted 3 February, 2024; originally announced February 2024.

  50. Deep Semantic-Visual Alignment for Zero-Shot Remote Sensing Image Scene Classification

    Authors: Wenjia Xu, Jiuniu Wang, Zhiwei Wei, Mugen Peng, Yirong Wu

    Abstract: Deep neural networks have achieved promising progress in remote sensing (RS) image classification, for which the training process requires abundant samples for each class. However, it is time-consuming and unrealistic to annotate labels for each RS category, given the fact that the RS target database is increasing dynamically. Zero-shot learning (ZSL) allows for identifying novel classes that are…

    Submitted 3 February, 2024; originally announced February 2024.

    Comments: Published in ISPRS P&RS. The code is available at https://github.com/wenjiaXu/RS_Scene_ZSL

    Journal ref: ISPRS Journal of Photogrammetry and Remote Sensing, Volume 198, 2023, Pages 140-152