Zum Hauptinhalt springen

Showing 1–50 of 232 results for author: Lv, C

.
  1. arXiv:2408.17073  [pdf, other

    eess.IV cs.CV

    Approximately Invertible Neural Network for Learned Image Compression

    Authors: Yanbo Gao, Meng Fu, Shuai Li, Chong Lv, Xun Cai, Hui Yuan, Mao Ye

    Abstract: Learned image compression have attracted considerable interests in recent years. It typically comprises an analysis transform, a synthesis transform, quantization and an entropy coding model. The analysis transform and synthesis transform are used to encode an image to latent feature and decode the quantized feature to reconstruct the image, and can be regarded as coupled transforms. However, the… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

  2. arXiv:2408.11558  [pdf, other

    cs.CV

    GSTran: Joint Geometric and Semantic Coherence for Point Cloud Segmentation

    Authors: Abiao Li, Chenlei Lv, Guofeng Mei, Yifan Zuo, Jian Zhang, Yuming Fang

    Abstract: Learning meaningful local and global information remains a challenge in point cloud segmentation tasks. When utilizing local information, prior studies indiscriminately aggregates neighbor information from different classes to update query points, potentially compromising the distinctive feature of query points. In parallel, inaccurate modeling of long-distance contextual dependencies when utilizi… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: ICPR 2024

  3. arXiv:2408.05132  [pdf, other

    quant-ph cond-mat.mes-hall cond-mat.quant-gas

    Hidden curved spaces in Bosonic Kitaev model

    Authors: Chenwei Lv, Qi Zhou

    Abstract: Quantum matter in curved spaces exhibits remarkable properties unattainable in flat spaces. To access curved spaces in laboratories, the conventional wisdom is that physical distortions need to be implemented into a system. In contrast to this belief, here, we show that two hyperbolic surfaces readily exist in bosonic Kitaev model in the absence of any physical distortions and give rise to a range… ▽ More

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: 7 pages, 3 figures

  4. arXiv:2408.03552  [pdf, other

    astro-ph.SR astro-ph.GA

    A deeper investigation of the Primordial Binary Cluster

    Authors: Qingshun Hu, Songmei Qin, Chunyan Li, Chenglong Lv, Yang Pan, Yangping Luo

    Abstract: We hereby reported a new physical binary cluster (ASCC~19 and ASCC~21) near the Orion star-forming complex based on the data in the literature. Analysis of the results shows that it is a primordial binary cluster. It is possible that this binary cluster is undergoing two-body relaxation by inspecting the radial velocity anomalies of its member stars. In addition, based on the analysis of its metal… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: 15 pages, 16 figures, 3 tables

  5. arXiv:2407.19208  [pdf, other

    cs.GR

    WindPoly: Polygonal Mesh Reconstruction via Winding Numbers

    Authors: Xin He, Chenlei Lv, Pengdi Huang, Hui Huang

    Abstract: Polygonal mesh reconstruction of a raw point cloud is a valuable topic in the field of computer graphics and 3D vision. Especially to 3D architectural models, polygonal mesh provides concise expressions for fundamental geometric structures while effectively reducing data volume. However, there are some limitations of traditional reconstruction methods: normal vector dependency, noisy points and de… ▽ More

    Submitted 27 July, 2024; originally announced July 2024.

    Comments: European Conference on Computer Vision (Proceedings of ECCV 2024)

  6. arXiv:2407.13480  [pdf

    cs.RO cs.AI

    Risk-Aware Vehicle Trajectory Prediction Under Safety-Critical Scenarios

    Authors: Qingfan Wang, Dongyang Xu, Gaoyuan Kuang, Chen Lv, Shengbo Eben Li, Bingbing Nie

    Abstract: Trajectory prediction is significant for intelligent vehicles to achieve high-level autonomous driving, and a lot of relevant research achievements have been made recently. Despite the rapid development, most existing studies solely focused on normal safe scenarios while largely neglecting safety-critical scenarios, particularly those involving imminent collisions. This oversight may result in aut… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  7. arXiv:2407.11585  [pdf, other

    cs.CV cs.AI

    QVD: Post-training Quantization for Video Diffusion Models

    Authors: Shilong Tian, Hong Chen, Chengtao Lv, Yu Liu, Jinyang Guo, Xianglong Liu, Shengxi Li, Hao Yang, Tao Xie

    Abstract: Recently, video diffusion models (VDMs) have garnered significant attention due to their notable advancements in generating coherent and realistic video content. However, processing multiple frame features concurrently, coupled with the considerable model size, results in high latency and extensive memory consumption, hindering their broader application. Post-training quantization (PTQ) is an effe… ▽ More

    Submitted 17 July, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

    Comments: accepted by ACMMM2024

  8. arXiv:2407.09359  [pdf, other

    cs.CV

    A Unified Anomaly Synthesis Strategy with Gradient Ascent for Industrial Anomaly Detection and Localization

    Authors: Qiyu Chen, Huiyuan Luo, Chengkan Lv, Zhengtao Zhang

    Abstract: Anomaly synthesis strategies can effectively enhance unsupervised anomaly detection. However, existing strategies have limitations in the coverage and controllability of anomaly synthesis, particularly for weak defects that are very similar to normal regions. In this paper, we propose Global and Local Anomaly co-Synthesis Strategy (GLASS), a novel unified framework designed to synthesize a broader… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  9. arXiv:2407.01219  [pdf, other

    cs.CL

    Searching for Best Practices in Retrieval-Augmented Generation

    Authors: Xiaohua Wang, Zhenghua Wang, Xuan Gao, Feiran Zhang, Yixin Wu, Zhibo Xu, Tianyuan Shi, Zhengyuan Wang, Shizheng Li, Qi Qian, Ruicheng Yin, Changze Lv, Xiaoqing Zheng, Xuanjing Huang

    Abstract: Retrieval-augmented generation (RAG) techniques have proven to be effective in integrating up-to-date information, mitigating hallucinations, and enhancing response quality, particularly in specialized domains. While many RAG approaches have been proposed to enhance large language models through query-dependent retrievals, these approaches still suffer from their complex implementation and prolong… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  10. arXiv:2406.19230  [pdf, other

    cs.NE cs.CL

    Spiking Convolutional Neural Networks for Text Classification

    Authors: Changze Lv, Jianhan Xu, Xiaoqing Zheng

    Abstract: Spiking neural networks (SNNs) offer a promising pathway to implement deep neural networks (DNNs) in a more energy-efficient manner since their neurons are sparsely activated and inferences are event-driven. However, there have been very few works that have demonstrated the efficacy of SNNs in language tasks partially because it is non-trivial to represent words in the forms of spikes and to deal… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  11. arXiv:2406.16062  [pdf, other

    cs.NE

    Towards Biologically Plausible Computing: A Comprehensive Comparison

    Authors: Changze Lv, Yufei Gu, Zhengkang Guo, Zhibo Xu, Yixin Wu, Feiran Zhang, Tianyuan Shi, Zhenghua Wang, Ruicheng Yin, Yu Shang, Siqi Zhong, Xiaohua Wang, Muling Wu, Wenhao Liu, Tianlong Li, Jianhao Zhu, Cenyuan Zhang, Zixuan Ling, Xiaoqing Zheng

    Abstract: Backpropagation is a cornerstone algorithm in training neural networks for supervised learning, which uses a gradient descent method to update network weights by minimizing the discrepancy between actual and desired outputs. Despite its pivotal role in propelling deep learning advancements, the biological plausibility of backpropagation is questioned due to its requirements for weight symmetry, gl… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  12. arXiv:2406.10976  [pdf, other

    cs.LG cs.CL cs.CR

    Promoting Data and Model Privacy in Federated Learning through Quantized LoRA

    Authors: JianHao Zhu, Changze Lv, Xiaohua Wang, Muling Wu, Wenhao Liu, Tianlong Li, Zixuan Ling, Cenyuan Zhang, Xiaoqing Zheng, Xuanjing Huang

    Abstract: Conventional federated learning primarily aims to secure the privacy of data distributed across multiple edge devices, with the global model dispatched to edge devices for parameter updates during the learning process. However, the development of large language models (LLMs) requires substantial data and computational resources, rendering them valuable intellectual properties for their developers… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  13. arXiv:2406.02993  [pdf, other

    physics.optics

    Dual-color Q-switched mode-locking in an Erbium-doped fiber laser

    Authors: Chenyue Lv, Baole Lu, Jintao Bai

    Abstract: Q-switched mode-locking (QML) has been widely observed in various lasers, but its generation mechanism in passive mode-locking remains unclear. In this paper, we build up a dual-color QML Erbium-doped fiber laser and find a bound-state-like envelope on the optical spectrum for the first time. Theoretically, the formation mechanism of QML is numerically investigated using the coupled Ginzburg-Landa… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  14. arXiv:2405.14362  [pdf, other

    cs.NE

    Advancing Spiking Neural Networks for Sequential Modeling with Central Pattern Generators

    Authors: Changze Lv, Dongqi Han, Yansen Wang, Xiaoqing Zheng, Xuanjing Huang, Dongsheng Li

    Abstract: Spiking neural networks (SNNs) represent a promising approach to developing artificial neural networks that are both energy-efficient and biologically plausible. However, applying SNNs to sequential tasks, such as text classification and time-series forecasting, has been hindered by the challenge of creating an effective and hardware-friendly spike-form positional encoding (PE) strategy. Drawing i… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  15. arXiv:2405.14226  [pdf, other

    cs.LG cs.AI

    Variational Delayed Policy Optimization

    Authors: Qingyuan Wu, Simon Sinong Zhan, Yixuan Wang, Yuhui Wang, Chung-Wei Lin, Chen Lv, Qi Zhu, Chao Huang

    Abstract: In environments with delayed observation, state augmentation by including actions within the delay window is adopted to retrieve Markovian property to enable reinforcement learning (RL). However, state-of-the-art (SOTA) RL techniques with Temporal-Difference (TD) learning frameworks often suffer from learning inefficiency, due to the significant expansion of the augmented state space with the dela… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  16. arXiv:2405.11186  [pdf, other

    physics.plasm-ph physics.acc-ph

    Compact Spin-Polarized Positron Acceleration in Multi-Layer Microhole Array Films

    Authors: Zhen-Ke Dou, Chong Lv, Yousef I. Salamin, Nan Zhang, Feng Wan, Zhong-Feng Xu, Jian-Xing Li

    Abstract: Compact spin-polarized positron accelerators play a major role in promoting significant positron application research, which typically require high acceleration gradients and polarization degree, both of which, however, are still great challenging. Here, we put forward a novel spin-polarized positron acceleration method which employs an ultrarelativistic high-density electron beam passing through… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

  17. arXiv:2405.10157  [pdf

    eess.SY

    Incorporating ESO into Deep Koopman Operator Modelling for Control of Autonomous Vehicles

    Authors: Hao Chen, Chen Lv

    Abstract: Koopman operator theory is a kind of data-driven modelling approach that accurately captures the nonlinearities of mechatronic systems such as vehicles against physics-based methods. However, the infinite-dimensional Koopman operator is impossible to implement in real-world applications. To approximate the infinite-dimensional Koopman operator through collection dataset rather than manual trial an… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  18. arXiv:2405.10145  [pdf

    eess.SY

    Deep Koopman Operator-Informed Safety Command Governor for Autonomous Vehicles

    Authors: Hao Chen, Xiangkun He, Shuo Cheng, Chen Lv

    Abstract: Modeling of nonlinear behaviors with physical-based models poses challenges. However, Koopman operator maps the original nonlinear system into an infinite-dimensional linear space to achieve global linearization of the nonlinear system through input and output data, which derives an absolute equivalent linear representation of the original state space. Due to the impossibility of implementing the… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  19. arXiv:2405.08263  [pdf, other

    cs.CV

    Palette-based Color Transfer between Images

    Authors: Chenlei Lv, Dan Zhang

    Abstract: As an important subtopic of image enhancement, color transfer aims to enhance the color scheme of a source image according to a reference one while preserving the semantic context. To implement color transfer, the palette-based color mapping framework was proposed. \textcolor{black}{It is a classical solution that does not depend on complex semantic analysis to generate a new color scheme. However… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  20. arXiv:2405.06426  [pdf, other

    physics.plasm-ph

    Generation of Ultra-Collimated Polarized Attosecond $γ-$Rays via Beam Instabilities

    Authors: Li-Jie Cui, Ke-Jia Wei, Chong Lv, Feng Wan, Yousef I. Salamin, Lei-Feng Cao, Jian-Xing Li

    Abstract: Polarized attosecond $γ-$rays may offer excitation and hyperfine tracking of reactions relevant to nuclear physics, astrophysics, high-energy physics, etc. However, unfortunately, generation of a feasible and easy-to-deploy source is still a great challenge. Here, we put forward a novel method for producing ultra-collimated high-brilliance polarized attosecond $γ-$rays via the interaction of an un… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  21. arXiv:2405.06001  [pdf, other

    cs.LG cs.AI cs.CL

    LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit

    Authors: Ruihao Gong, Yang Yong, Shiqiao Gu, Yushi Huang, Chentao Lv, Yunchen Zhang, Xianglong Liu, Dacheng Tao

    Abstract: Recent advancements in large language models (LLMs) are propelling us toward artificial general intelligence with their remarkable emergent abilities and reasoning capabilities. However, the substantial computational and memory requirements limit the widespread adoption. Quantization, a key compression technique, can effectively mitigate these demands by compressing and accelerating LLMs, albeit w… ▽ More

    Submitted 20 July, 2024; v1 submitted 9 May, 2024; originally announced May 2024.

  22. arXiv:2405.03144  [pdf, other

    cs.CV cs.LG

    PTQ4SAM: Post-Training Quantization for Segment Anything

    Authors: Chengtao Lv, Hong Chen, Jinyang Guo, Yifu Ding, Xianglong Liu

    Abstract: Segment Anything Model (SAM) has achieved impressive performance in many computer vision tasks. However, as a large-scale model, the immense memory and computation costs hinder its practical deployment. In this paper, we propose a post-training quantization (PTQ) framework for Segment Anything Model, namely PTQ4SAM. First, we investigate the inherent bottleneck of SAM quantization attributed to th… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: CVPR 2024

  23. arXiv:2404.15348  [pdf

    eess.SP physics.optics

    High-Linearity PAM-4 Silicon Micro-ring Transmitter Architecture with Electronic-Photonic Hybrid DAC

    Authors: Zheng Li, Chengyang Lv, Min Tan

    Abstract: This paper presents a high linearity PAM-4 transmitter (TX) architecture, consisting of a three-segment micro-ring modulator (MRM) and a matched CMOS driver. This architecture can drive a high-linearity 4-level pulse amplitude (PAM-4) modulation signal, thereby extending the tunable operating wavelength range for achieving linear PAM-4 output. We use the three-segment MRM to increase design flexib… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 14 pages, 11 figures

  24. arXiv:2404.14047  [pdf, other

    cs.LG

    An Empirical Study of LLaMA3 Quantization: From LLMs to MLLMs

    Authors: Wei Huang, Xingyu Zheng, Xudong Ma, Haotong Qin, Chengtao Lv, Hong Chen, Jie Luo, Xiaojuan Qi, Xianglong Liu, Michele Magno

    Abstract: The LLaMA family has become one of the most powerful open-source Large Language Models (LLMs) and the popular LLM backbones of Multimodal Large Language Models (MLLMs), widely applied in Computer Vision (CV) and Natural Language Understanding (NLU) tasks. Notably, LLaMA3 models have recently been released and achieve impressive performance across various with super-large scale pre-training on over… ▽ More

    Submitted 19 July, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

  25. GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting

    Authors: Hongyun Yu, Zhan Qu, Qihang Yu, Jianchuan Chen, Zhonghua Jiang, Zhiwen Chen, Shengyu Zhang, Jimin Xu, Fei Wu, Chengfei Lv, Gang Yu

    Abstract: Recent works on audio-driven talking head synthesis using Neural Radiance Fields (NeRF) have achieved impressive results. However, due to inadequate pose and expression control caused by NeRF implicit representation, these methods still have some limitations, such as unsynchronized or unnatural lip movements, and visual jitter and artifacts. In this paper, we propose GaussianTalker, a novel method… ▽ More

    Submitted 9 August, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: Accepted by ACM MM 2024. Project page: https://yuhongyun777.github.io/GaussianTalker/

  26. arXiv:2404.11593  [pdf, other

    cs.CV

    IntrinsicAnything: Learning Diffusion Priors for Inverse Rendering Under Unknown Illumination

    Authors: Xi Chen, Sida Peng, Dongchen Yang, Yuan Liu, Bowen Pan, Chengfei Lv, Xiaowei Zhou

    Abstract: This paper aims to recover object materials from posed images captured under an unknown static lighting condition. Recent methods solve this task by optimizing material parameters through differentiable physically based rendering. However, due to the coupling between object geometry, materials, and environment lighting, there is inherent ambiguity during the inverse rendering process, preventing p… ▽ More

    Submitted 22 April, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

    Comments: Project page: https://zju3dv.github.io/IntrinsicAnything

  27. arXiv:2404.02524  [pdf, other

    cs.RO

    Versatile Scene-Consistent Traffic Scenario Generation as Optimization with Diffusion

    Authors: Zhiyu Huang, Zixu Zhang, Ameya Vaidya, Yuxiao Chen, Chen Lv, Jaime Fernández Fisac

    Abstract: Generating realistic and controllable agent behaviors in traffic simulation is crucial for the development of autonomous vehicles. This problem is often formulated as imitation learning (IL) from real-world driving data by either directly predicting future trajectories or inferring cost functions with inverse optimal control. In this paper, we draw a conceptual connection between IL and diffusion-… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  28. arXiv:2403.17297  [pdf, other

    cs.CL cs.AI

    InternLM2 Technical Report

    Authors: Zheng Cai, Maosong Cao, Haojiong Chen, Kai Chen, Keyu Chen, Xin Chen, Xun Chen, Zehui Chen, Zhi Chen, Pei Chu, Xiaoyi Dong, Haodong Duan, Qi Fan, Zhaoye Fei, Yang Gao, Jiaye Ge, Chenya Gu, Yuzhe Gu, Tao Gui, Aijia Guo, Qipeng Guo, Conghui He, Yingfan Hu, Ting Huang, Tao Jiang , et al. (75 additional authors not shown)

    Abstract: The evolution of Large Language Models (LLMs) like ChatGPT and GPT-4 has sparked discussions on the advent of Artificial General Intelligence (AGI). However, replicating such advancements in open-source models has been challenging. This paper introduces InternLM2, an open-source LLM that outperforms its predecessors in comprehensive evaluations across 6 dimensions and 30 benchmarks, long-context m… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  29. arXiv:2403.12642  [pdf

    physics.app-ph cond-mat.mes-hall

    Interlayer Dzyaloshinskii-Moriya interaction in synthetic ferrimagnets

    Authors: Shen Li, Mouad Fattouhi, Tianxun Huang, Chen Lv, Mark C. H. de Jong, Pingzhi Li, Xiaoyang Lin, Felipe Garcia-Sanchez, Eduardo Martinez, Stéphane Mangin, Bert Koopmans, Weisheng Zhao, Reinoud Lavrijsen

    Abstract: The antisymmetric interlayer exchange interaction, i.e., interlayer Dzyaloshinskii-Moriya interaction (IL-DMI) has attracted significant interest since this long-range chiral spin interaction provides a new dimension for controlling spin textures and dynamics. However, the role of IL-DMI in the field induced and spin-orbit torque (SOT) induced switching of synthetic ferrimagnets (SFi) has not been… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: no

    MSC Class: no

  30. arXiv:2403.11183  [pdf, other

    cs.CL

    Decoding Continuous Character-based Language from Non-invasive Brain Recordings

    Authors: Cenyuan Zhang, Xiaoqing Zheng, Ruicheng Yin, Shujie Geng, Jianhan Xu, Xuan Gao, Changze Lv, Zixuan Ling, Xuanjing Huang, Miao Cao, Jianfeng Feng

    Abstract: Deciphering natural language from brain activity through non-invasive devices remains a formidable challenge. Previous non-invasive decoders either require multiple experiments with identical stimuli to pinpoint cortical regions and enhance signal-to-noise ratios in brain activity, or they are limited to discerning basic linguistic elements such as letters and words. We propose a novel approach to… ▽ More

    Submitted 19 March, 2024; v1 submitted 17 March, 2024; originally announced March 2024.

  31. arXiv:2403.10206  [pdf, other

    astro-ph.IM astro-ph.SR cs.CV physics.ins-det physics.optics

    A Data-Driven Approach for Mitigating Dark Current Noise and Bad Pixels in Complementary Metal Oxide Semiconductor Cameras for Space-based Telescopes

    Authors: Peng Jia, Chao Lv, Yushan Li, Yongyang Sun, Shu Niu, Zhuoxiao Wang

    Abstract: In recent years, there has been a gradual increase in the performance of Complementary Metal Oxide Semiconductor (CMOS) cameras. These cameras have gained popularity as a viable alternative to charge-coupled device (CCD) cameras in a wide range of applications. One particular application is the CMOS camera installed in small space telescopes. However, the limited power and spatial resources availa… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted by the AJ, comments are welcome. The complete code could be downloaded from: DOI: 10.12149/101387

  32. arXiv:2403.00436  [pdf, other

    cs.CV cs.AI

    Abductive Ego-View Accident Video Understanding for Safe Driving Perception

    Authors: Jianwu Fang, Lei-lei Li, Junfei Zhou, Junbin Xiao, Hongkai Yu, Chen Lv, Jianru Xue, Tat-Seng Chua

    Abstract: We present MM-AU, a novel dataset for Multi-Modal Accident video Understanding. MM-AU contains 11,727 in-the-wild ego-view accident videos, each with temporally aligned text descriptions. We annotate over 2.23 million object boxes and 58,650 pairs of video-based accident reasons, covering 58 accident categories. MM-AU supports various accident understanding tasks, particularly multimodal video dif… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR2024. This is not the camera-ready version. The Project page: http://www.lotvsmmau.net

  33. arXiv:2402.19013  [pdf, other

    eess.SY

    Ultraviolet Positioning via TDOA: Error Analysis and System Prototype

    Authors: Shihui Yu, Chubing Lv, Yueke Yang, Yuchen Pan, Lei Sun, Juliang Cao, Ruihang Yu, Chen Gong, Wenqi Wu, Zhengyuan Xu

    Abstract: This work performs the design, real-time hardware realization, and experimental evaluation of a positioning system by ultra-violet (UV) communication under photon-level signal detection. The positioning is based on time-difference of arrival (TDOA) principle. Time division-based transmission of synchronization sequence from three transmitters with known positions is applied. We investigate the pos… ▽ More

    Submitted 14 April, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

  34. arXiv:2402.15179  [pdf, other

    cs.LG cs.CL

    Advancing Parameter Efficiency in Fine-tuning via Representation Editing

    Authors: Muling Wu, Wenhao Liu, Xiaohua Wang, Tianlong Li, Changze Lv, Zixuan Ling, Jianhao Zhu, Cenyuan Zhang, Xiaoqing Zheng, Xuanjing Huang

    Abstract: Parameter Efficient Fine-Tuning (PEFT) techniques have drawn significant attention due to their ability to yield competitive results while updating only a small portion of the adjustable parameters. However, existing PEFT methods pose challenges in hyperparameter selection, such as choosing the rank for LoRA or Adapter, or specifying the length of soft prompts. To address these challenges, we prop… ▽ More

    Submitted 2 June, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

  35. arXiv:2402.11960  [pdf, other

    cs.LG cs.AI cs.CL

    DB-LLM: Accurate Dual-Binarization for Efficient LLMs

    Authors: Hong Chen, Chengtao Lv, Liang Ding, Haotong Qin, Xiabin Zhou, Yifu Ding, Xuebo Liu, Min Zhang, Jinyang Guo, Xianglong Liu, Dacheng Tao

    Abstract: Large language models (LLMs) have significantly advanced the field of natural language processing, while the expensive memory and computation consumption impede their practical deployment. Quantization emerges as one of the most effective methods for improving the computational efficiency of LLMs. However, existing ultra-low-bit quantization always causes severe accuracy drops. In this paper, we e… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  36. arXiv:2402.03141  [pdf, other

    cs.LG cs.AI eess.SY

    Boosting Reinforcement Learning with Strongly Delayed Feedback Through Auxiliary Short Delays

    Authors: Qingyuan Wu, Simon Sinong Zhan, Yixuan Wang, Yuhui Wang, Chung-Wei Lin, Chen Lv, Qi Zhu, Jürgen Schmidhuber, Chao Huang

    Abstract: Reinforcement learning (RL) is challenging in the common case of delays between events and their sensory perceptions. State-of-the-art (SOTA) state augmentation techniques either suffer from state space explosion or performance degeneration in stochastic environments. To address these challenges, we present a novel Auxiliary-Delayed Reinforcement Learning (AD-RL) method that leverages auxiliary ta… ▽ More

    Submitted 5 June, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: ICML 2024

  37. arXiv:2402.02426  [pdf, other

    cs.RO cs.CV cs.LG

    Hybrid-Prediction Integrated Planning for Autonomous Driving

    Authors: Haochen Liu, Zhiyu Huang, Wenhui Huang, Haohan Yang, Xiaoyu Mo, Chen Lv

    Abstract: Autonomous driving systems require the ability to fully understand and predict the surrounding environment to make informed decisions in complex scenarios. Recent advancements in learning-based systems have highlighted the importance of integrating prediction and planning modules. However, this integration has brought forth three major challenges: inherent trade-offs by sole prediction, consistenc… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

  38. arXiv:2402.01533  [pdf, other

    cs.NE

    Efficient and Effective Time-Series Forecasting with Spiking Neural Networks

    Authors: Changze Lv, Yansen Wang, Dongqi Han, Xiaoqing Zheng, Xuanjing Huang, Dongsheng Li

    Abstract: Spiking neural networks (SNNs), inspired by the spiking behavior of biological neurons, provide a unique pathway for capturing the intricacies of temporal data. However, applying SNNs to time-series forecasting is challenging due to difficulties in effective temporal alignment, complexities in encoding processes, and the absence of standardized guidelines for model selection. In this paper, we pro… ▽ More

    Submitted 29 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  39. arXiv:2401.15315  [pdf, other

    cs.RO

    Learning Online Belief Prediction for Efficient POMDP Planning in Autonomous Driving

    Authors: Zhiyu Huang, Chen Tang, Chen Lv, Masayoshi Tomizuka, Wei Zhan

    Abstract: Effective decision-making in autonomous driving relies on accurate inference of other traffic agents' future behaviors. To achieve this, we propose an online belief-update-based behavior prediction model and an efficient planner for Partially Observable Markov Decision Processes (POMDPs). We develop a Transformer-based prediction model, enhanced with a recurrent neural memory model, to dynamically… ▽ More

    Submitted 17 June, 2024; v1 submitted 27 January, 2024; originally announced January 2024.

    Comments: IEEE Robotics and Automation Letters

  40. arXiv:2401.14075  [pdf, other

    physics.plasm-ph

    Generation of High-Brilliance Polarized $γ$-Rays via Vacuum Dichroism-assisted Vacuum Birefringence

    Authors: Chong Lv, Feng Wan, Yousef I. Salamin, Qian Zhao, Mamutjan Ababekri, Ruirui Xu, Jian-Xing Li

    Abstract: We put forward a novel method to generate high-brilliance polarized $γ$-photon beams via vacuum dichroism (VD)-assisted vacuum birefringence (VB) effect. We split a linearly polarized (LP) laser pulse into two subpulses with the first one colliding with a dense unpolarized electron beam to generate LP $γ$ photons (via nonlinear Compton scattering), which then further collide with the second subpul… ▽ More

    Submitted 30 April, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

  41. arXiv:2401.06824  [pdf, other

    cs.CL cs.AI

    Rethinking Jailbreaking through the Lens of Representation Engineering

    Authors: Tianlong Li, Shihan Dou, Wenhao Liu, Muling Wu, Changze Lv, Rui Zheng, Xiaoqing Zheng, Xuanjing Huang

    Abstract: The recent surge in jailbreaking methods has revealed the vulnerability of Large Language Models (LLMs) to malicious inputs. While earlier research has primarily concentrated on increasing the success rates of jailbreaking attacks, the underlying mechanism for safeguarding LLMs remains underexplored. This study investigates the vulnerability of safety-aligned LLMs by uncovering specific activity p… ▽ More

    Submitted 6 August, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: 21 pages, 20 figures, 6 tables

  42. arXiv:2401.06439  [pdf, other

    cs.RO

    Ordering-Flexible Multi-Robot Coordination for MovingTarget Convoying Using Long-TermTask Execution

    Authors: Bin-Bin Hu, Yanxin Zhou, Henglai Wei, Yan Wang, Chen Lv

    Abstract: In this paper, we propose a cooperative long-term task execution (LTTE) algorithm for protecting a moving target into the interior of an ordering-flexible convex hull by a team of robots resiliently in the changing environments. Particularly, by designing target-approaching and sensing-neighbor collision-free subtasks, and incorporating these subtasks into the constraints rather than the tradition… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

    Journal ref: Published on Automatica, 2024

  43. arXiv:2401.05268  [pdf, other

    cs.CL cs.AI cs.HC cs.LG cs.MA

    AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning

    Authors: Shuofei Qiao, Ningyu Zhang, Runnan Fang, Yujie Luo, Wangchunshu Zhou, Yuchen Eleanor Jiang, Chengfei Lv, Huajun Chen

    Abstract: Language agents have achieved considerable performance on various complex question-answering tasks by planning with external tools. Despite the incessant exploration in this field, existing language agent systems still struggle with costly, non-reproducible data reliance and face the challenge of compelling a single model for multiple functions. To this end, we introduce AutoAct, an automatic agen… ▽ More

    Submitted 26 May, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: ACL 2024

  44. arXiv:2312.17589  [pdf

    physics.app-ph

    Improving photon number resolvability of a superconducting nanowire detector array using a level comparator circuit

    Authors: Jia Huang, Xingyu Zhang, Weijun Zhang, Chaomeng Ding, Yong Wang, Chaolin Lv, Guangzhao Xu, Xiaoyu Liu, Hao Li, Zhen Wang, Lixing You

    Abstract: Photon number resolving (PNR) capability is very important in many optical applications, including quantum information processing, fluorescence detection, and few-photon-level ranging and imaging. Superconducting nanowire single-photon detectors (SNSPDs) with a multipixel interleaved architecture give the array an excellent spatial PNR capability. However, the signal-to-noise ratio (SNR) of the ph… ▽ More

    Submitted 29 December, 2023; originally announced December 2023.

    Comments: 8 pages, 7 figures

  45. arXiv:2312.15997  [pdf, other

    cs.CL

    Aligning Large Language Models with Human Preferences through Representation Engineering

    Authors: Wenhao Liu, Xiaohua Wang, Muling Wu, Tianlong Li, Changze Lv, Zixuan Ling, Jianhao Zhu, Cenyuan Zhang, Xiaoqing Zheng, Xuanjing Huang

    Abstract: Aligning large language models (LLMs) with human preferences is crucial for enhancing their utility in terms of helpfulness, truthfulness, safety, harmlessness, and interestingness. Existing methods for achieving this alignment often involves employing reinforcement learning from human feedback (RLHF) to fine-tune LLMs based on human labels assessing the relative quality of model responses. Nevert… ▽ More

    Submitted 3 July, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

  46. arXiv:2312.12913  [pdf, other

    cs.CV

    Produce Once, Utilize Twice for Anomaly Detection

    Authors: Shuyuan Wang, Qi Li, Huiyuan Luo, Chengkan Lv, Zhengtao Zhang

    Abstract: Visual anomaly detection aims at classifying and locating the regions that deviate from the normal appearance. Embedding-based methods and reconstruction-based methods are two main approaches for this task. However, they are either not efficient or not precise enough for the industrial detection. To deal with this problem, we derive POUTA (Produce Once Utilize Twice for Anomaly detection), which i… ▽ More

    Submitted 20 December, 2023; originally announced December 2023.

  47. arXiv:2311.03012  [pdf, other

    physics.flu-dyn cond-mat.soft

    The role of viscosity on drop impact forces

    Authors: Vatsal Sanjay, Bin Zhang, Cunjing Lv, Detlef Lohse

    Abstract: A liquid drop impacting a rigid substrate undergoes deformation and spreading due to normal reaction forces, which are counteracted by surface tension. On a non-wetting substrate, the drop subsequently retracts and takes off. Our recent work (Zhang et al., \textit{Phys. Rev. Lett.}, vol. 129, 2022, 104501) revealed two peaks in the temporal evolution of the normal force $F(t)$--one at impact and a… ▽ More

    Submitted 30 April, 2024; v1 submitted 6 November, 2023; originally announced November 2023.

    Comments: version after first round of the review

  48. arXiv:2310.19559  [pdf, other

    cs.CV

    Disentangled Counterfactual Learning for Physical Audiovisual Commonsense Reasoning

    Authors: Changsheng Lv, Shuai Zhang, Yapeng Tian, Mengshi Qi, Huadong Ma

    Abstract: In this paper, we propose a Disentangled Counterfactual Learning~(DCL) approach for physical audiovisual commonsense reasoning. The task aims to infer objects' physics commonsense based on both video and audio input, with the main challenge is how to imitate the reasoning ability of humans. Most of the current methods fail to take full advantage of different characteristics in multi-modal data, an… ▽ More

    Submitted 1 November, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: To be published in 37th Conference on Neural Information Processing Systems

  49. arXiv:2310.17133  [pdf, other

    cs.CL cs.AI

    Incorporating Probing Signals into Multimodal Machine Translation via Visual Question-Answering Pairs

    Authors: Yuxin Zuo, Bei Li, Chuanhao Lv, Tong Zheng, Tong Xiao, Jingbo Zhu

    Abstract: This paper presents an in-depth study of multimodal machine translation (MMT), examining the prevailing understanding that MMT systems exhibit decreased sensitivity to visual information when text inputs are complete. Instead, we attribute this phenomenon to insufficient cross-modal interaction, rather than image information redundancy. A novel approach is proposed to generate parallel Visual Ques… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: Findings of EMNLP2023

  50. arXiv:2310.16582  [pdf, other

    cs.CL

    Tailoring Personality Traits in Large Language Models via Unsupervisedly-Built Personalized Lexicons

    Authors: Tianlong Li, Shihan Dou, Changze Lv, Wenhao Liu, Jianhan Xu, Muling Wu, Zixuan Ling, Xiaoqing Zheng, Xuanjing Huang

    Abstract: Personality plays a pivotal role in shaping human expression patterns, thus regulating the personality of large language models (LLMs) holds significant potential in enhancing the user experience of LLMs. Previous methods either relied on fine-tuning LLMs on specific corpora or necessitated manually crafted prompts to elicit specific personalities from LLMs. However, the former approach is ineffic… ▽ More

    Submitted 6 January, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: Work in progress