Showing 51–100 of 998 results for author: Wei, Z

  1. arXiv:2406.10839  [pdf, other]

    cs.CV cs.CL

    Reminding Multimodal Large Language Models of Object-aware Knowledge with Retrieved Tags

    Authors: Daiqing Qi, Handong Zhao, Zijun Wei, Sheng Li

    Abstract: Despite recent advances in the general visual instruction-following ability of Multimodal Large Language Models (MLLMs), they still struggle with critical problems when required to provide a precise and detailed response to a visual instruction: (1) failure to identify novel objects or entities, (2) mention of non-existent objects, and (3) neglect of object's attributed details. Intuitive solution…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 18 pages, 11 figures

  2. arXiv:2406.05756  [pdf, other]

    cs.AI cs.CL cs.CV cs.MM

    EmbSpatial-Bench: Benchmarking Spatial Understanding for Embodied Tasks with Large Vision-Language Models

    Authors: Mengfei Du, Binhao Wu, Zejun Li, Xuanjing Huang, Zhongyu Wei

    Abstract: The recent rapid development of Large Vision-Language Models (LVLMs) has indicated their potential for embodied tasks. However, the critical skill of spatial understanding in embodied environments has not been thoroughly evaluated, leaving the gap between current LVLMs and qualified embodied intelligence unknown. Therefore, we construct EmbSpatial-Bench, a benchmark for evaluating embodied spatial…

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Accepted by ACL 2024 Main

  3. arXiv:2406.05564  [pdf, other]

    cs.LG cs.AI cs.CL cs.FL

    Automata Extraction from Transformers

    Authors: Yihao Zhang, Zeming Wei, Meng Sun

    Abstract: In modern machine learning (ML) systems, Transformer-based architectures have achieved milestone success across a broad spectrum of tasks, yet understanding their operational mechanisms remains an open problem. To improve the transparency of ML systems, automata extraction methods, which interpret stateful ML models as automata typically through formal languages, have proven effective for explaini…

    Submitted 8 June, 2024; originally announced June 2024.

  4. arXiv:2406.03402  [pdf, other]

    cs.LG cs.AI

    Mixed-Precision Over-The-Air Federated Learning via Approximated Computing

    Authors: Jinsheng Yuan, Zhuangkun Wei, Weisi Guo

    Abstract: Over-the-Air Federated Learning (OTA-FL) has been extensively investigated as a privacy-preserving distributed learning mechanism. Realistic systems will see FL clients with diverse size, weight, and power configurations. A critical research gap in existing OTA-FL research is the assumption of homogeneous client computational bit precision. Indeed, many clients may exploit approximate computing (A…

    Submitted 4 June, 2024; originally announced June 2024.

  5. arXiv:2406.02430  [pdf, other]

    eess.AS cs.SD

    Seed-TTS: A Family of High-Quality Versatile Speech Generation Models

    Authors: Philip Anastassiou, Jiawei Chen, Jitong Chen, Yuanzhe Chen, Zhuo Chen, Ziyi Chen, Jian Cong, Lelai Deng, Chuang Ding, Lu Gao, Mingqing Gong, Peisong Huang, Qingqing Huang, Zhiying Huang, Yuanyuan Huo, Dongya Jia, Chumin Li, Feiya Li, Hui Li, Jiaxin Li, Xiaoyang Li, Xingxing Li, Lin Liu, Shouda Liu, Sichao Liu, et al. (21 additional authors not shown)

    Abstract: We introduce Seed-TTS, a family of large-scale autoregressive text-to-speech (TTS) models capable of generating speech that is virtually indistinguishable from human speech. Seed-TTS serves as a foundation model for speech generation and excels in speech in-context learning, achieving performance in speaker similarity and naturalness that matches ground truth human speech in both objective and sub…

    Submitted 4 June, 2024; originally announced June 2024.

  6. arXiv:2406.01195  [pdf, other]

    cs.RO

    C$^3$P-VoxelMap: Compact, Cumulative and Coalescible Probabilistic Voxel Mapping

    Authors: Xu Yang, Wenhao Li, Qijie Ge, Lulu Suo, Weijie Tang, Zhengyu Wei, Longxiang Huang, Bo Wang

    Abstract: This work presents a compact, cumulative and coalescible probabilistic voxel mapping method to enhance performance, accuracy and memory efficiency in LiDAR odometry. Probabilistic voxel mapping requires storing past point clouds and re-iterating on them to update the uncertainty every iteration, which consumes large memory space and CPU cycles. To solve this problem, we propose a two-folded strate…

    Submitted 3 June, 2024; originally announced June 2024.

  7. arXiv:2406.01027  [pdf, other]

    cs.DB cs.LG

    PRICE: A Pretrained Model for Cross-Database Cardinality Estimation

    Authors: Tianjing Zeng, Junwei Lan, Jiahong Ma, Wenqing Wei, Rong Zhu, Pengfei Li, Bolin Ding, Defu Lian, Zhewei Wei, Jingren Zhou

    Abstract: Cardinality estimation (CardEst) is essential for optimizing query execution plans. Recent ML-based CardEst methods achieve high accuracy but face deployment challenges due to high preparation costs and lack of transferability across databases. In this paper, we propose PRICE, a PRetrained multI-table CardEst model, which addresses these limitations. PRICE takes low-level but transferable features…

    Submitted 3 June, 2024; originally announced June 2024.

  8. arXiv:2405.20742  [pdf, ps, other]

    cond-mat.supr-con

    Terahertz emission from mutually synchronized standalone Bi2Sr2CaCu2O8+x intrinsic-Josephson-junction stacks

    Authors: Raphael Wieland, Olcay Kizilaslan, Nickolay Kinev, Eric Dorsch, Stefan Guénon, Ziyu Song, Zihan Wei, Huabing Wang, Peiheng Wu, Dieter Koelle, Valery P. Koshelets, Reinhold Kleiner

    Abstract: Suitably patterned single crystals made of the cuprate superconductor Bi$_2$Sr$_2$CaCu$_2$O$_{8+x}$ (BSCCO), intrinsically forming a stack of Josephson junctions, can generate electromagnetic radiation in the lower terahertz regime. Due to Joule heating the emission power of single stacks seems to be limited to values below 100 $μ$W. To increase the radiation power, mutually synchronized arrays si…

    Submitted 31 May, 2024; originally announced May 2024.

  9. arXiv:2405.19788  [pdf]

    cond-mat.mes-hall cond-mat.mtrl-sci cond-mat.supr-con

    Unidirectional charge orders induced by oxygen vacancies on SrTiO$_3$(001)

    Authors: Cui Ding, Wenfeng Dong, Xiaotong Jiao, Zhiyu Zhang, Guanming Gong, Zhongxu Wei, Lili Wang, Jin-Feng Jia, Qi-Kun Xue

    Abstract: The discovery of high-mobility two-dimensional electron gas and low carrier density superconductivity in multiple SrTiO$_3$-based heterostructures has stimulated intense interest in the surface properties of SrTiO$_3$. The recent discovery of high-T$_c$ superconductivity in the monolayer FeSe/SrTiO$_3$ aroused the upsurge and underscored the atomic precision probe of the surface structure. By perf…

    Submitted 30 May, 2024; originally announced May 2024.

  10. arXiv:2405.18731  [pdf, other]

    eess.SP cs.AI physics.comp-ph

    VBIM-Net: Variational Born Iterative Network for Inverse Scattering Problems

    Authors: Ziqing Xing, Zhaoyang Zhang, Zirui Chen, Yusong Wang, Haoran Ma, Zhun Wei, Gang Bao

    Abstract: Recently, studies have shown the potential of integrating field-type iterative methods with deep learning (DL) techniques in solving inverse scattering problems (ISPs). In this article, we propose a novel Variational Born Iterative Network, namely, VBIM-Net, to solve the full-wave ISPs with significantly improved flexibility and inversion quality. The proposed VBIM-Net emulates the alternating upd…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 14 pages, 21 figures

  11. Correctable Landmark Discovery via Large Models for Vision-Language Navigation

    Authors: Bingqian Lin, Yunshuang Nie, Ziming Wei, Yi Zhu, Hang Xu, Shikui Ma, Jianzhuang Liu, Xiaodan Liang

    Abstract: Vision-Language Navigation (VLN) requires the agent to follow language instructions to reach a target position. A key factor for successful navigation is to align the landmarks implied in the instruction with diverse visual observations. However, previous VLN agents fail to perform accurate modality alignment especially in unexplored scenes, since they learn from limited navigation data and lack s…

    Submitted 5 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted by TPAMI 2024

  12. arXiv:2405.18634  [pdf, other]

    cs.LG cs.CL stat.ML

    A Theoretical Understanding of Self-Correction through In-context Alignment

    Authors: Yifei Wang, Yuyang Wu, Zeming Wei, Stefanie Jegelka, Yisen Wang

    Abstract: Going beyond mimicking limited human experiences, recent studies show initial evidence that, like humans, large language models (LLMs) are capable of improving their abilities purely by self-correction, i.e., correcting previous responses through self-examination, in certain circumstances. Nevertheless, little is known about how such capabilities arise. In this work, based on a simplified setup ak…

    Submitted 28 May, 2024; originally announced May 2024.

  13. arXiv:2405.16919  [pdf, other]

    cs.CV cs.AI cs.CL

    VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models

    Authors: Zejun Li, Ruipu Luo, Jiwen Zhang, Minghui Qiu, Zhongyu Wei

    Abstract: While large multi-modal models (LMMs) have exhibited impressive capabilities across diverse tasks, their effectiveness in handling complex tasks has been limited by the prevailing single-step reasoning paradigm. To this end, this paper proposes VoCoT, a multi-step Visually grounded object-centric Chain-of-Thought reasoning framework tailored for inference with LMMs. VoCoT is characterized by two k…

    Submitted 28 May, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  14. arXiv:2405.16405  [pdf, other]

    cs.LG cs.AI

    Intruding with Words: Towards Understanding Graph Injection Attacks at the Text Level

    Authors: Runlin Lei, Yuwei Hu, Yuchen Ren, Zhewei Wei

    Abstract: Graph Neural Networks (GNNs) excel across various applications but remain vulnerable to adversarial attacks, particularly Graph Injection Attacks (GIAs), which inject malicious nodes into the original graph and pose realistic threats. Text-attributed graphs (TAGs), where nodes are associated with textual features, are crucial due to their prevalence in real-world applications and are commonly used…

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 29 pages

  15. arXiv:2405.16398  [pdf, other]

    eess.SP

    Networked Integrated Sensing and Communications for 6G Wireless Systems

    Authors: Jiapeng Li, Xiaodan Shao, Feng Chen, Shaohua Wan, Chang Liu, Zhiqiang Wei, Derrick Wing Kwan Ng

    Abstract: Integrated sensing and communication (ISAC) is envisioned as a key pillar for enabling the upcoming sixth generation (6G) communication systems, requiring not only reliable communication functionalities but also highly accurate environmental sensing capabilities. In this paper, we design a novel networked ISAC framework to explore the collaboration among multiple users for environmental sensing. S…

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: Received by IEEE Internet of Things Journal

  16. arXiv:2405.16357  [pdf, other]

    q-bio.NC

    Exploring the Enigma of Neural Dynamics Through A Scattering-Transform Mixer Landscape for Riemannian Manifold

    Authors: Tingting Dan, Ziquan Wei, Won Hwa Kim, Guorong Wu

    Abstract: The human brain is a complex inter-wired system in which spontaneous functional fluctuations emerge. In spite of tremendous success in the experimental neuroscience field, a system-level understanding of how brain anatomy supports various neural activities remains elusive. Capitalizing on the unprecedented amount of neuroimaging data, we present a physics-informed deep model to uncover the coupling m…

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 15 pages, 6 figures

    MSC Class: 51H30 ACM Class: I.3.5

  17. arXiv:2405.15349  [pdf, other]

    cs.CL

    UnKE: Unstructured Knowledge Editing in Large Language Models

    Authors: Jingcheng Deng, Zihao Wei, Liang Pang, Hanxing Ding, Huawei Shen, Xueqi Cheng

    Abstract: Recent knowledge editing methods have primarily focused on modifying structured knowledge in large language models, heavily relying on the assumption that structured knowledge is stored as key-value pairs locally in MLP layers or specific neurons. However, this task setting overlooks the fact that a significant portion of real-world knowledge is stored in an unstructured format, characterized by l…

    Submitted 24 May, 2024; originally announced May 2024.

  18. arXiv:2405.14195  [pdf, other]

    cs.CV cs.AI

    Enhanced Object Tracking by Self-Supervised Auxiliary Depth Estimation Learning

    Authors: Zhenyu Wei, Yujie He, Zhanchuan Cai

    Abstract: RGB-D tracking significantly improves the accuracy of object tracking. However, its dependency on real depth inputs and the complexity involved in multi-modal fusion limit its applicability across various scenarios. The utilization of depth information in RGB-D tracking inspired us to propose a new method, named MDETrack, which trains a tracking network with an additional capability to understand…

    Submitted 23 May, 2024; originally announced May 2024.

  19. arXiv:2405.14116  [pdf, other]

    cs.RO cs.HC cs.LG

    Learning Multimodal Confidence for Intention Recognition in Human-Robot Interaction

    Authors: Xiyuan Zhao, Huijun Li, Tianyuan Miao, Xianyi Zhu, Zhikai Wei, Aiguo Song

    Abstract: The rapid development of collaborative robotics has provided a new possibility of helping elderly people who have difficulties in daily life, allowing robots to operate according to specific intentions. However, efficient human-robot cooperation requires natural, accurate and reliable intention recognition in shared environments. The current paramount challenge for this is reducing the uncertainty of…

    Submitted 22 May, 2024; originally announced May 2024.

  20. arXiv:2405.11936  [pdf, other]

    cs.CV

    UAV-VisLoc: A Large-scale Dataset for UAV Visual Localization

    Authors: Wenjia Xu, Yaxuan Yao, Jiaqi Cao, Zhiwei Wei, Chunbo Liu, Jiuniu Wang, Mugen Peng

    Abstract: The application of unmanned aerial vehicles (UAV) has been widely extended recently. It is crucial to ensure accurate latitude and longitude coordinates for UAVs, especially when the global navigation satellite systems (GNSS) are disrupted and unreliable. Existing visual localization methods achieve autonomous visual localization without error accumulation by matching the ground-down view image of…

    Submitted 20 May, 2024; originally announced May 2024.

  21. arXiv:2405.10606  [pdf, other]

    eess.SP

    Carrier Aggregation Enabled MIMO-OFDM Integrated Sensing and Communication

    Authors: Haotian Liu, Zhiqing Wei, Jinghui Piao, Huici Wu, Xingwang Li, Zhiyong Feng

    Abstract: In the evolution towards the forthcoming era of sixth-generation (6G) mobile communication systems characterized by ubiquitous intelligence, integrated sensing and communication (ISAC) is in a phase of burgeoning development. However, the capabilities of communication and sensing within single frequency band fall short of meeting the escalating demands. To this end, this paper introduces a carrier…

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 13 pages, 9 figures, submitted to IEEE Transactions on Wireless Communications

  22. arXiv:2405.09179  [pdf, other]

    eess.SP

    Integrated Sensing and Communication Enabled Cooperative Passive Sensing Using Mobile Communication System

    Authors: Zhiqing Wei, Haotian Liu, Hujun Li, Wangjun Jiang, Zhiyong Feng, Huici Wu, Ping Zhang

    Abstract: Integrated sensing and communication (ISAC) is a potential technology of the sixth-generation (6G) mobile communication system, which enables communication base station (BS) with sensing capability. However, the performance of single-BS sensing is limited, which can be overcome by multi-BS cooperative sensing. There are three types of multi-BS cooperative sensing, including cooperative active sens…

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 16 pages, 11 figures, Submitted to IEEE Transactions on Mobile Computing

  23. Multi-Objective Optimization-based Transmit Beamforming for Multi-Target and Multi-User MIMO-ISAC Systems

    Authors: Chunwei Meng, Zhiqing Wei, Dingyou Ma, Wanli Ni, Liyan Su, Zhiyong Feng

    Abstract: Integrated sensing and communication (ISAC) is an enabling technology for the sixth-generation mobile communications, which equips the wireless communication networks with sensing capabilities. In this paper, we investigate transmit beamforming design for multiple-input and multiple-output (MIMO)-ISAC systems in scenarios with multiple radar targets and communication users. A general form of multi…

    Submitted 14 May, 2024; originally announced May 2024.

  24. arXiv:2405.08815  [pdf, other]

    cs.CV

    Efficient Vision-Language Pre-training by Cluster Masking

    Authors: Zihao Wei, Zixuan Pan, Andrew Owens

    Abstract: We propose a simple strategy for masking image patches during visual-language contrastive learning that improves the quality of the learned representations and the training speed. During each iteration of training, we randomly mask clusters of visually similar image patches, as measured by their raw pixel intensities. This provides an extra learning signal, beyond the contrastive training itself,…

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: CVPR 2024, Project page: https://zxp46.github.io/cluster-masking/ , Code: https://github.com/Zi-hao-Wei/Efficient-Vision-Language-Pre-training-by-Cluster-Masking

  25. arXiv:2405.07792  [pdf, other]

    cs.DB cs.DS cs.LG

    Optimal Matrix Sketching over Sliding Windows

    Authors: Hanyan Yin, Dongxie Wen, Jiajun Li, Zhewei Wei, Xiao Zhang, Zengfeng Huang, Feifei Li

    Abstract: Matrix sketching, aimed at approximating a matrix $\boldsymbol{A} \in \mathbb{R}^{N\times d}$ consisting of vector streams of length $N$ with a smaller sketching matrix $\boldsymbol{B} \in \mathbb{R}^{\ell\times d}, \ell \ll N$, has garnered increasing attention in fields such as large-scale data analytics and machine learning. A well-known deterministic matrix sketching method is the Frequent Dir…

    Submitted 13 May, 2024; originally announced May 2024.

  26. arXiv:2405.07668  [pdf, other]

    cs.SE cs.AI cs.CR

    CrossCert: A Cross-Checking Detection Approach to Patch Robustness Certification for Deep Learning Models

    Authors: Qilin Zhou, Zhengyuan Wei, Haipeng Wang, Bo Jiang, W. K. Chan

    Abstract: Patch robustness certification is an emerging kind of defense technique against adversarial patch attacks with provable guarantees. There are two research lines: certified recovery and certified detection. They aim to label malicious samples with provable guarantees correctly and issue warnings for malicious samples predicted to non-benign labels with provable guarantees, respectively. However, ex…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 23 pages, 2 figures, accepted by FSE 2024 (The ACM International Conference on the Foundations of Software Engineering)

  27. arXiv:2405.07469  [pdf, other]

    quant-ph physics.optics

    Phase coding semi-quantum key distribution system based on the Single-state protocol

    Authors: Qincheng Hou, Siying Huang, Naida Mo, Jindong Wang, Zhengjun Wei, Yafei Yu, Tianming Zhao, Zhiming Zhang

    Abstract: Semi-quantum key distribution (SQKD) allows sharing random keys between a quantum user and a classical user. However, implementing classical user operations is challenging, posing a hurdle to achieving the Single-state protocol. By using the "selective modulation" method, the feasibility of SQKD is verified in principle. The proposal of the selective modulation method enables the realization of ot…

    Submitted 13 May, 2024; originally announced May 2024.

  28. arXiv:2405.05663  [pdf, other]

    cs.CV

    RPBG: Towards Robust Neural Point-based Graphics in the Wild

    Authors: Qingtian Zhu, Zizhuang Wei, Zhongtian Zheng, Yifan Zhan, Zhuyu Yao, Jiawang Zhang, Kejian Wu, Yinqiang Zheng

    Abstract: Point-based representations have recently gained popularity in novel view synthesis, for their unique advantages, e.g., intuitive geometric representation, simple manipulation, and faster convergence. However, based on our observation, these point-based neural re-rendering methods are only expected to perform well under ideal conditions and suffer from noisy, patchy points and unbounded scenes, wh…

    Submitted 10 July, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

    Comments: ECCV 2024

  29. arXiv:2405.05486  [pdf, other]

    cond-mat.mes-hall cond-mat.str-el

    Quantum Hall interferometry at finite bias with multiple edge channels

    Authors: Zezhu Wei, D. E. Feldman, Bertrand I. Halperin

    Abstract: In a quantum Hall interferometer, the dependence of the signal on source-drain voltage is controlled by details of the edge physics, such as the velocities of edge modes and the interaction between them and with screening layers. Such dependence of the signal has been seen in recent experiments at various integer and fractional filling factors, including $ν=2$ and $ν=2/5$, where two edge modes are…

    Submitted 28 August, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: 26 pages, 13 figures, typos fixed and reference updated, published in PRB as an Editors' Suggestion

    Journal ref: Phys. Rev. B 110, 075306 (2024)

  30. arXiv:2405.03976  [pdf]

    cond-mat.mes-hall cond-mat.mtrl-sci

    Anomalous Gate-tunable Capacitance in Graphene Moiré Heterostructures

    Authors: Linshang Chen, Haoran Long, Heng Wu, Rui Mei, Zhengyu Su, Mengjie Feng, Jiang-Bin Wu, Kenji Watanabe, Takashi Taniguchi, Xuewei Cao, Zhongming Wei, Ping-Heng Tan, Yanmeng Shi

    Abstract: Interface engineered ferroelectricity in van der Waals heterostructures is of broad interest both fundamentally and technologically for the applications in neuromorphic computing and so on. In particular, the moiré ferroelectricity in graphene/hexagonal boron nitride (hBN) heterostructures driven by charge ordering instead of traditional lattice displacement has drawn considerable attention becaus…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 20 pages, 13 figures

  31. arXiv:2405.03755  [pdf, other]

    hep-th

    Holographic Dual of Crosscap Conformal Field Theory

    Authors: Zixia Wei

    Abstract: We propose a holographic dual for 2D CFT defined on closed non-orientable manifolds, such as the real projective plane $\mathbb{RP}^2$ and the Klein bottle $\mathbb{K}^2$. Such CFT can be constructed by introducing antipodally identified cuttings, i.e. crosscaps, to a sphere and hence called crosscap CFT (XCFT). The gravity dual is AdS$_3$ spacetime with dS$_2$ end-of-the-world branes. In particul…

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 15 pages + references, 4 figures

  32. arXiv:2405.02873  [pdf, other]

    eess.SP

    Target Localization with Macro and Micro Base Stations Cooperative Sensing

    Authors: Haotian Liu, Zhiqing Wei, Furong Yang, Huici Wu, Kaifeng Han, Zhiyong Feng

    Abstract: Addressing the communication and sensing demands of sixth-generation (6G) mobile communication system, integrated sensing and communication (ISAC) has garnered traction in academia and industry. With the sensing limitation of single base station (BS), multi-BS cooperative sensing is regarded as a promising solution. The coexistence and overlapped coverage of macro BS (MBS) and micro BS (MiBS) are…

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: 7 pages, 6 figures, submitted to 2024 IEEE GLOBECOM

  33. arXiv:2405.01229  [pdf, ps, other]

    cs.LG cs.AI cs.CL cs.CR math.OC

    Boosting Jailbreak Attack with Momentum

    Authors: Yihao Zhang, Zeming Wei

    Abstract: Large Language Models (LLMs) have achieved remarkable success across diverse tasks, yet they remain vulnerable to adversarial attacks, notably the well-documented jailbreak attack. Recently, the Greedy Coordinate Gradient (GCG) attack has demonstrated efficacy in exploiting this vulnerability by optimizing adversarial prompts through a combination of gradient heuristics and greedy search.…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: ICLR 2024 Workshop on Reliable and Responsible Foundation Models

  34. arXiv:2405.00820  [pdf, other]

    cs.AR cs.LG

    HLSFactory: A Framework Empowering High-Level Synthesis Datasets for Machine Learning and Beyond

    Authors: Stefan Abi-Karam, Rishov Sarkar, Allison Seigler, Sean Lowe, Zhigang Wei, Hanqiu Chen, Nanditha Rao, Lizy John, Aman Arora, Cong Hao

    Abstract: Machine learning (ML) techniques have been applied to high-level synthesis (HLS) flows for quality-of-result (QoR) prediction and design space exploration (DSE). Nevertheless, the scarcity of accessible high-quality HLS datasets and the complexity of building such datasets present challenges. Existing datasets have limitations in terms of benchmark coverage, design space enumeration, vendor extens…

    Submitted 17 May, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

    Comments: Edit to "Section V.E" for proper attribution of open-source HLSyn, AutoDSE, and the Merlin compiler

  35. arXiv:2405.00636  [pdf, other]

    physics.soc-ph cs.LG cs.SI physics.data-an

    Robustness of graph embedding methods for community detection

    Authors: Zhi-Feng Wei, Pablo Moriano, Ramakrishnan Kannan

    Abstract: This study investigates the robustness of graph embedding methods for community detection in the face of network perturbations, specifically edge deletions. Graph embedding techniques, which represent nodes as low-dimensional vectors, are widely used for various graph machine learning tasks due to their ability to capture structural properties of networks effectively. However, the impact of pertur…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 17 pages, 26 figures, 3 tables. Comments are welcome

  36. arXiv:2405.00527  [pdf, other]

    cs.DB

    ChatBI: Towards Natural Language to Complex Business Intelligence SQL

    Authors: Jinqing Lian, Xinyi Liu, Yingxia Shao, Yang Dong, Ming Wang, Zhang Wei, Tianqi Wan, Ming Dong, Hailin Yan

    Abstract: The Natural Language to SQL (NL2SQL) technology provides non-expert users who are unfamiliar with databases the opportunity to use SQL for data analysis. Converting Natural Language to Business Intelligence (NL2BI) is a popular practical scenario for NL2SQL in actual production systems. Compared to NL2SQL, NL2BI introduces more challenges. In this paper, we propose ChatBI, a comprehensive and eff…

    Submitted 1 May, 2024; originally announced May 2024.

  37. arXiv:2405.00417  [pdf, other]

    cs.LG stat.ME stat.ML

    Conformal Risk Control for Ordinal Classification

    Authors: Yunpeng Xu, Wenge Guo, Zhi Wei

    Abstract: As a natural extension to the standard conformal prediction method, several conformal risk control methods have been recently developed and applied to various learning problems. In this work, we seek to control the conformal risk in expectation for ordinal classification tasks, which have broad applications to many real problems. For this purpose, we firstly formulated the ordinal classification t…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 17 pages, 8 figures, 2 tables; 1 supplementary page

    Journal ref: In UAI 2023: The 39th Conference on Uncertainty in Artificial Intelligence

  38. arXiv:2404.18644  [pdf, other]

    quant-ph

    Low-Overhead Defect-Adaptive Surface Code with Bandage-Like Super-Stabilizers

    Authors: Zuolin Wei, Tan He, Yangsen Ye, Dachao Wu, Yiming Zhang, Youwei Zhao, Weiping Lin, He-Liang Huang, Xiaobo Zhu, Jian-Wei Pan

    Abstract: To make practical quantum algorithms work, large-scale quantum processors protected by error-correcting codes are required to resist noise and ensure reliable computational outcomes. However, a major challenge arises from defects in processor fabrication, as well as occasional losses or cosmic rays during the computing process, all of which can lead to qubit malfunctions and disrupt error-correcti…

    Submitted 29 April, 2024; originally announced April 2024.

  39. arXiv:2404.18211  [pdf, other]

    cs.LG cs.SI

    A survey of dynamic graph neural networks

    Authors: Yanping Zheng, Lu Yi, Zhewei Wei

    Abstract: Graph neural networks (GNNs) have emerged as a powerful tool for effectively mining and learning from graph-structured data, with applications spanning numerous domains. However, most research focuses on static graphs, neglecting the dynamic nature of real-world networks where topologies and attributes evolve over time. By integrating sequence modeling modules into traditional GNN architectures, d…

    Submitted 28 April, 2024; originally announced April 2024.

  40. arXiv:2404.18191  [pdf, other]

    cs.CL cs.AI cs.CR cs.LG math.OC

    Exploring the Robustness of In-Context Learning with Noisy Labels

    Authors: Chen Cheng, Xinzhi Yu, Haodong Wen, Jingsong Sun, Guanzhang Yue, Yihao Zhang, Zeming Wei

    Abstract: Recently, the mysterious In-Context Learning (ICL) ability exhibited by Transformer architectures, especially in large language models (LLMs), has sparked significant research interest. However, the resilience of Transformers' in-context learning capabilities in the presence of noisy samples, prevalent in both training corpora and prompt demonstrations, remains underexplored. In this paper, inspir…

    Submitted 1 May, 2024; v1 submitted 28 April, 2024; originally announced April 2024.

    Comments: ICLR 2024 Workshop on Reliable and Responsible Foundation Models

  41. arXiv:2404.18041  [pdf, other]

    quant-ph cs.LG math.OC

    Variational Optimization for Quantum Problems using Deep Generative Networks

    Authors: Lingxia Zhang, Xiaodie Lin, Peidong Wang, Kaiyan Yang, Xiao Zeng, Zhaohui Wei, Zizhu Wang

    Abstract: Optimization is one of the keystones of modern science and engineering. Its applications in quantum technology and machine learning helped nurture variational quantum algorithms and generative AI respectively. We propose a general approach to design variational optimization algorithms based on generative models: the Variational Generative Optimization Network (VGON). To demonstrate its broad appli…

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: 17 pages, 13 figures, comments welcome

  42. arXiv:2404.17769  [pdf, other]

    cs.IR stat.ME stat.ML

    Conformal Ranked Retrieval

    Authors: Yunpeng Xu, Wenge Guo, Zhi Wei

    Abstract: Given the wide adoption of ranked retrieval techniques in various information systems that significantly impact our daily lives, there is an increasing need to assess and address the uncertainty inherent in their predictions. This paper introduces a novel method using the conformal risk control framework to quantitatively measure and manage risks in the context of ranked retrieval problems. Our re…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: 14 pages, 6 figures, 1 table; 7 supplementary pages, 12 supplementary figures, 2 supplementary tables

  43. arXiv:2404.17462  [pdf, other]

    cs.NI

    Integrated Sensing and Communication Channel Modeling: A Survey

    Authors: Zhiqing Wei, Jinzhu Jia, Yangyang Niu, Lin Wang, Huici Wu, Heng Yang, Zhiyong Feng

    Abstract: Integrated sensing and communication (ISAC) is expected to play a crucial role in the sixth-generation (6G) mobile communication systems, offering potential applications in the scenarios of intelligent transportation, smart factories, etc. The performance of radar sensing in ISAC systems is closely related to the characteristics of radar sensing and communication channels. Therefore, ISAC channel…

    Submitted 26 April, 2024; originally announced April 2024.

  44. arXiv:2404.17343  [pdf, other]

    cs.CL cs.FL

    A Bionic Natural Language Parser Equivalent to a Pushdown Automaton

    Authors: Zhenghao Wei, Kehua Lin, Jianlin Feng

    Abstract: Assembly Calculus (AC), proposed by Papadimitriou et al., aims to reproduce advanced cognitive functions through simulating neural activities, with several applications based on AC having been developed, including a natural language parser proposed by Mitropolsky et al. However, this parser lacks the ability to handle Kleene closures, preventing it from parsing all regular languages and rendering…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: To be published in IJCNN 2024

  45. arXiv:2404.16275  [pdf]

    cs.NI cs.IT eess.SP

    Spectrum Sharing Policy in the Asia-Pacific Region

    Authors: Zhiyong Feng, Zhiqing Wei

    Abstract: In this chapter, we investigate the spectrum measurement results in the Asia-Pacific region. Then the spectrum sharing policy in the Asia-Pacific region is reviewed in detail, where the national projects and strategies on spectrum refarming and spectrum sharing in China, Japan, Singapore, India, Korea and Australia are investigated. Then we introduce the spectrum sharing test-bed that is developed in…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: 33 pages, 17 figures

  46. arXiv:2404.15805  [pdf, other]

    q-bio.BM cs.LG

    Beyond ESM2: Graph-Enhanced Protein Sequence Modeling with Efficient Clustering

    Authors: Shujian Jiao, Bingxuan Li, Lei Wang, Xiaojin Zhang, Wei Chen, Jiajie Peng, Zhongyu Wei

    Abstract: Proteins are essential to life's processes, underpinning evolution and diversity. Advances in sequencing technology have revealed millions of proteins, underscoring the need for sophisticated pre-trained protein models for biological analysis and AI development. Facebook's ESM2, the most advanced protein language model to date, leverages a masked prediction task for unsupervised learning, crafting…

    Submitted 24 April, 2024; originally announced April 2024.

  47. arXiv:2404.14862  [pdf, other]

    eess.SP

    Deep Learning Based Multi-Node ISAC 4D Environmental Reconstruction with Uplink-Downlink Cooperation

    Authors: Bohao Lu, Zhiqing Wei, Huici Wu, Xinrui Zeng, Lin Wang, Xi Lu, Dongyang Mei, Zhiyong Feng

    Abstract: Utilizing widely distributed communication nodes to achieve environmental reconstruction is one of the significant scenarios for Integrated Sensing and Communication (ISAC) and a crucial technology for 6G. To achieve this crucial functionality, we propose a deep learning based multi-node ISAC 4D environment reconstruction method with Uplink-Downlink (UL-DL) cooperation, which employs virtual apert…

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: 13 pages, 21 figures, 4 tables

  48. arXiv:2404.14444  [pdf, other]

    cs.LG cs.AI cs.ET

    Practical Battery Health Monitoring using Uncertainty-Aware Bayesian Neural Network

    Authors: Yunyi Zhao, Zhang Wei, Qingyu Yan, Man-Fai Ng, B. Sivaneasan, Cheng Xiang

    Abstract: Battery health monitoring and prediction are critically important in the era of electric mobility with a huge impact on safety, sustainability, and economic aspects. Existing research often focuses on prediction accuracy but tends to neglect practical factors that may hinder the technology's deployment in real-world applications. In this paper, we address these practical considerations and develop…

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: 6 pages

  49. arXiv:2404.13752  [pdf, other]

    cs.LG cs.AI cs.CL cs.CR math.OC

    Towards General Conceptual Model Editing via Adversarial Representation Engineering

    Authors: Yihao Zhang, Zeming Wei, Jun Sun, Meng Sun

    Abstract: Since the development of Large Language Models (LLMs) has achieved remarkable success, understanding and controlling their internal complex mechanisms has become an urgent problem. Recent research has attempted to interpret their behaviors through the lens of inner representation. However, developing practical and efficient methods for applying these representations for general and flexible model…

    Submitted 23 May, 2024; v1 submitted 21 April, 2024; originally announced April 2024.

  50. arXiv:2404.13603  [pdf, other]

    cs.IT eess.SP

    Beyond MMSE: Rank-1 Subspace Channel Estimator for Massive MIMO Systems

    Authors: Bin Li, Ziping Wei, Shaoshi Yang, Yang Zhang, Jun Zhang, Chenglin Zhao, Sheng Chen

    Abstract: To glean the benefits offered by massive multi-input multi-output (MIMO) systems, channel state information must be accurately acquired. Despite the high accuracy, the computational complexity of classical linear minimum mean squared error (MMSE) estimator becomes prohibitively high in the context of massive MIMO, while the other low-complexity methods degrade the estimation accuracy seriously. In…

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: 15 pages, 12 figures, accepted to appear in IEEE Transactions on Communications, Apr. 2024