Zum Hauptinhalt springen

Showing 51–100 of 1,330 results for author: He, H

.
  1. arXiv:2406.03262  [pdf, other

    cs.CV

    ADer: A Comprehensive Benchmark for Multi-class Visual Anomaly Detection

    Authors: Jiangning Zhang, Haoyang He, Zhenye Gan, Qingdong He, Yuxuan Cai, Zhucun Xue, Yabiao Wang, Chengjie Wang, Lei Xie, Yong Liu

    Abstract: Visual anomaly detection aims to identify anomalous regions in images through unsupervised learning paradigms, with increasing application demand and value in fields such as industrial inspection and medical lesion detection. Despite significant progress in recent years, there is a lack of comprehensive benchmarks to adequately evaluate the performance of various mainstream methods across differen… ▽ More

    Submitted 6 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  2. arXiv:2406.03081  [pdf, other

    quant-ph

    A Quantum Neural Network-Based Approach to Power Quality Disturbances Detection and Recognition

    Authors: Guo-Dong Li, Hai-Yan He, Yue Li, Xin-Hao Li, Hao Liu, Qing-Le Wang, Long Cheng

    Abstract: Power quality disturbances (PQDs) significantly impact the stability and reliability of power systems, necessitating accurate and efficient detection and recognition methods. While numerous classical algorithms for PQDs detection and recognition have been extensively studied and applied, related work in the quantum domain is still in its infancy. In this paper, an improved quantum neural networks… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  3. arXiv:2406.02578  [pdf, other

    cs.LG

    Pretrained Mobility Transformer: A Foundation Model for Human Mobility

    Authors: Xinhua Wu, Haoyu He, Yanchao Wang, Qi Wang

    Abstract: Ubiquitous mobile devices are generating vast amounts of location-based service data that reveal how individuals navigate and utilize urban spaces in detail. In this study, we utilize these extensive, unlabeled sequences of user trajectories to develop a foundation model for understanding urban space and human mobility. We introduce the \textbf{P}retrained \textbf{M}obility \textbf{T}ransformer (P… ▽ More

    Submitted 28 May, 2024; originally announced June 2024.

  4. arXiv:2406.02213  [pdf, other

    cs.LG

    Rectifying Reinforcement Learning for Reward Matching

    Authors: Haoran He, Emmanuel Bengio, Qingpeng Cai, Ling Pan

    Abstract: The Generative Flow Network (GFlowNet) is a probabilistic framework in which an agent learns a stochastic policy and flow functions to sample objects with probability proportional to an unnormalized reward function. GFlowNets share a strong resemblance to reinforcement learning (RL), that typically aims to maximize reward, due to their sequential decision-making processes. Recent works have studie… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  5. arXiv:2406.01150  [pdf, other

    cs.LG

    Looking Backward: Retrospective Backward Synthesis for Goal-Conditioned GFlowNets

    Authors: Haoran He, Can Chang, Huazhe Xu, Ling Pan

    Abstract: Generative Flow Networks (GFlowNets) are amortized sampling methods for learning a stochastic policy to sequentially generate compositional objects with probabilities proportional to their rewards. GFlowNets exhibit a remarkable ability to generate diverse sets of high-reward objects, in contrast to standard return maximization reinforcement learning approaches, which often converge to a single op… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  6. arXiv:2406.00050  [pdf, other

    cs.CL cs.AI

    An Empirical Analysis on Large Language Models in Debate Evaluation

    Authors: Xinyi Liu, Pinxin Liu, Hangfeng He

    Abstract: In this study, we investigate the capabilities and inherent biases of advanced large language models (LLMs) such as GPT-3.5 and GPT-4 in the context of debate evaluation. We discover that LLM's performance exceeds humans and surpasses the performance of state-of-the-art methods fine-tuned on extensive datasets in debate evaluation. We additionally explore and analyze biases present in LLMs, includ… ▽ More

    Submitted 4 June, 2024; v1 submitted 28 May, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024 main

  7. arXiv:2405.20763  [pdf, other

    cs.LG math.OC stat.ML

    Improving Generalization and Convergence by Enhancing Implicit Regularization

    Authors: Mingze Wang, Jinbo Wang, Haotian He, Zilin Wang, Guanhua Huang, Feiyu Xiong, Zhiyu Li, Weinan E, Lei Wu

    Abstract: In this work, we propose an Implicit Regularization Enhancement (IRE) framework to accelerate the discovery of flat solutions in deep learning, thereby improving generalization and convergence. Specifically, IRE decouples the dynamics of flat and sharp directions, which boosts the sharpness reduction along flat directions while maintaining the training stability in sharp directions. We show that I… ▽ More

    Submitted 20 August, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: 35 pages

  8. arXiv:2405.20600  [pdf, other

    cs.AI

    Multi-label Class Incremental Emotion Decoding with Augmented Emotional Semantics Learning

    Authors: Kaicheng Fu, Changde Du, Xiaoyu Chen, Jie Peng, Huiguang He

    Abstract: Emotion decoding plays an important role in affective human-computer interaction. However, previous studies ignored the dynamic real-world scenario, where human experience a blend of multiple emotions which are incrementally integrated into the model, leading to the multi-label class incremental learning (MLCIL) problem. Existing methods have difficulty in solving MLCIL issue due to notorious cata… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  9. arXiv:2405.19678  [pdf, other

    cs.CV cs.AI

    View-Consistent Hierarchical 3D Segmentation Using Ultrametric Feature Fields

    Authors: Haodi He, Colton Stearns, Adam W. Harley, Leonidas J. Guibas

    Abstract: Large-scale vision foundation models such as Segment Anything (SAM) demonstrate impressive performance in zero-shot image segmentation at multiple levels of granularity. However, these zero-shot predictions are rarely 3D-consistent. As the camera viewpoint changes in a scene, so do the segmentation predictions, as well as the characterizations of "coarse" or "fine" granularity. In this work, we ad… ▽ More

    Submitted 17 July, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  10. arXiv:2405.19586  [pdf, other

    cs.CV cs.LG cs.RO

    SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation

    Authors: Junjie Zhang, Chenjia Bai, Haoran He, Wenke Xia, Zhigang Wang, Bin Zhao, Xiu Li, Xuelong Li

    Abstract: Acquiring a multi-task imitation policy in 3D manipulation poses challenges in terms of scene understanding and action prediction. Current methods employ both 3D representation and multi-view 2D representation to predict the poses of the robot's end-effector. However, they still require a considerable amount of high-quality robot trajectories, and suffer from limited generalization in unseen tasks… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: ICML 2024. Project page: https://sam-embodied.github.io

  11. arXiv:2405.18726  [pdf, other

    cs.SD cs.CV cs.MM eess.AS

    Reverse the auditory processing pathway: Coarse-to-fine audio reconstruction from fMRI

    Authors: Che Liu, Changde Du, Xiaoyu Chen, Huiguang He

    Abstract: Drawing inspiration from the hierarchical processing of the human auditory system, which transforms sound from low-level acoustic features to high-level semantic understanding, we introduce a novel coarse-to-fine audio reconstruction method. Leveraging non-invasive functional Magnetic Resonance Imaging (fMRI) data, our approach mimics the inverse pathway of auditory processing. Initially, we utili… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  12. arXiv:2405.18399  [pdf, ps, other

    math.NA

    A simple, randomized algorithm for diagonalizing normal matrices

    Authors: Haoze He, Daniel Kressner

    Abstract: We present and analyze a simple numerical method that diagonalizes a complex normal matrix A by diagonalizing the Hermitian matrix obtained from a random linear combination of the Hermitian and skew-Hermitian parts of A.

    Submitted 28 May, 2024; originally announced May 2024.

    MSC Class: 65F15; 15B57; 15A18

  13. arXiv:2405.17976  [pdf

    cs.AI cs.CL

    Yuan 2.0-M32: Mixture of Experts with Attention Router

    Authors: Shaohua Wu, Jiangang Luo, Xi Chen, Lingjun Li, Xudong Zhao, Tong Yu, Chao Wang, Yue Wang, Fei Wang, Weixu Qiao, Houbo He, Zeru Zhang, Zeyu Sun, Junxiong Mao, Chong Shen

    Abstract: Yuan 2.0-M32, with a similar base architecture as Yuan-2.0 2B, uses a mixture-of-experts architecture with 32 experts of which 2 experts are active. A new router network, Attention Router, is proposed and adopted for a more efficient selection of experts, which improves the accuracy compared to the model with classical router network. Yuan 2.0-M32 is trained with 2000B tokens from scratch, and the… ▽ More

    Submitted 29 May, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: 14 pages,3 figures, 7 tables

  14. arXiv:2405.17414  [pdf, other

    cs.CV cs.GR

    Collaborative Video Diffusion: Consistent Multi-video Generation with Camera Control

    Authors: Zhengfei Kuang, Shengqu Cai, Hao He, Yinghao Xu, Hongsheng Li, Leonidas Guibas, Gordon Wetzstein

    Abstract: Research on video generation has recently made tremendous progress, enabling high-quality videos to be generated from text prompts or images. Adding control to the video generation process is an important goal moving forward and recent approaches that condition video generation models on camera trajectories make strides towards it. Yet, it remains challenging to generate a video of the same scene… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  15. arXiv:2405.16730  [pdf, other

    cs.LG cs.AI stat.AP

    Latent Energy-Based Odyssey: Black-Box Optimization via Expanded Exploration in the Energy-Based Latent Space

    Authors: Peiyu Yu, Dinghuai Zhang, Hengzhi He, Xiaojian Ma, Ruiyao Miao, Yifan Lu, Yasi Zhang, Deqian Kong, Ruiqi Gao, Jianwen Xie, Guang Cheng, Ying Nian Wu

    Abstract: Offline Black-Box Optimization (BBO) aims at optimizing a black-box function using the knowledge from a pre-collected offline dataset of function values and corresponding input designs. However, the high-dimensional and highly-multimodal input design space of black-box function pose inherent challenges for most existing methods that model and operate directly upon input designs. These issues inclu… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  16. arXiv:2405.16289  [pdf

    physics.optics

    Intensity adaptive optics

    Authors: Zimo Zhao, Yifei Ma, Jacopo Antonello, Zipei Song, Jiahe Cui, Binguo Chen, Jingyu Wang, Bangshan Sun, Honghui He, Lin Luo, Julian A. J. Fells, Steve J. Elston, Martin J. Booth, Stephen M. Morris, Chao He

    Abstract: Adaptive optics (AO) is a powerful tool used in a wide range of research areas spanning from aerospace to microscopy. To date, AO has largely been applied to optical phase aberration correction, with recent advances extending to include the vectorial properties of light. However, intensity errors widely exist in optical systems, yet their associated correction methods are still very much in their… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  17. arXiv:2405.15525  [pdf, other

    cs.CL

    Sparse Matrix in Large Language Model Fine-tuning

    Authors: Haoze He, Juncheng Billy Li, Xuan Jiang, Heather Miller

    Abstract: LoRA and its variants have become popular parameter-efficient fine-tuning (PEFT) methods due to their ability to avoid excessive computational costs. However, an accuracy gap often exists between PEFT methods and full fine-tuning (FT), and this gap has yet to be systematically studied. In this work, we introduce a method for selecting sparse sub-matrices that aim to minimize the performance gap be… ▽ More

    Submitted 29 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: 14 pages

  18. arXiv:2405.15214  [pdf, other

    cs.CV

    PointRWKV: Efficient RWKV-Like Model for Hierarchical Point Cloud Learning

    Authors: Qingdong He, Jiangning Zhang, Jinlong Peng, Haoyang He, Yabiao Wang, Chengjie Wang

    Abstract: Transformers have revolutionized the point cloud learning task, but the quadratic complexity hinders its extension to long sequence and makes a burden on limited computational resources. The recent advent of RWKV, a fresh breed of deep sequence models, has shown immense potential for sequence modeling in NLP tasks. In this paper, we present PointRWKV, a model of linear complexity derived from the… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  19. arXiv:2405.14018  [pdf, other

    cs.CR cs.LG stat.AP

    Watermarking Generative Tabular Data

    Authors: Hengzhi He, Peiyu Yu, Junpeng Ren, Ying Nian Wu, Guang Cheng

    Abstract: In this paper, we introduce a simple yet effective tabular data watermarking mechanism with statistical guarantees. We show theoretically that the proposed watermark can be effectively detected, while faithfully preserving the data fidelity, and also demonstrates appealing robustness against additive noise attack. The general idea is to achieve the watermarking through a strategic embedding based… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  20. arXiv:2405.11826  [pdf, other

    astro-ph.IM hep-ex physics.ins-det

    Data quality control system and long-term performance monitor of the LHAASO-KM2A

    Authors: Zhen Cao, F. Aharonian, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, H. X. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen , et al. (263 additional authors not shown)

    Abstract: The KM2A is the largest sub-array of the Large High Altitude Air Shower Observatory (LHAASO). It consists of 5216 electromagnetic particle detectors (EDs) and 1188 muon detectors (MDs). The data recorded by the EDs and MDs are used to reconstruct primary information of cosmic ray and gamma-ray showers. This information is used for physical analysis in gamma-ray astronomy and cosmic ray physics. To… ▽ More

    Submitted 13 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: 15 pages, 9 figures

  21. arXiv:2405.11739  [pdf

    cs.LG cs.AI cs.CY

    What Radio Waves Tell Us about Sleep

    Authors: Hao He, Chao Li, Wolfgang Ganglberger, Kaileigh Gallagher, Rumen Hristov, Michail Ouroutzoglou, Haoqi Sun, Jimeng Sun, Brandon Westover, Dina Katabi

    Abstract: The ability to assess sleep at home, capture sleep stages, and detect the occurrence of apnea (without on-body sensors) simply by analyzing the radio waves bouncing off people's bodies while they sleep is quite powerful. Such a capability would allow for longitudinal data collection in patients' homes, informing our understanding of sleep and its interaction with various diseases and their therape… ▽ More

    Submitted 20 July, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

    Comments: The first two authors contributed equally to this work

  22. arXiv:2405.11389  [pdf, other

    cs.LG

    Adjacent Leader Decentralized Stochastic Gradient Descent

    Authors: Haoze He, Jing Wang, Anna Choromanska

    Abstract: This work focuses on the decentralized deep learning optimization framework. We propose Adjacent Leader Decentralized Gradient Descent (AL-DSGD), for improving final model performance, accelerating convergence, and reducing the communication overhead of decentralized deep learning optimizers. AL-DSGD relies on two main ideas. Firstly, to increase the influence of the strongest learners on the lear… ▽ More

    Submitted 19 August, 2024; v1 submitted 18 May, 2024; originally announced May 2024.

    Comments: 9 pages of main paper, and 12 pages of appendix

  23. arXiv:2405.11021  [pdf, other

    cs.CV

    Enhanced 3D Urban Scene Reconstruction and Point Cloud Densification using Gaussian Splatting and Google Earth Imagery

    Authors: Kyle Gao, Dening Lu, Hongjie He, Linlin Xu, Jonathan Li

    Abstract: 3D urban scene reconstruction and modelling is a crucial research area in remote sensing with numerous applications in academia, commerce, industry, and administration. Recent advancements in view synthesis models have facilitated photorealistic 3D reconstruction solely from 2D images. Leveraging Google Earth imagery, we construct a 3D Gaussian Splatting model of the Waterloo region centered on th… ▽ More

    Submitted 1 June, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

    ACM Class: I.4; I.3

  24. arXiv:2405.10895  [pdf, other

    astro-ph.HE astro-ph.GA

    The unluckiest star: A spectroscopically confirmed repeated partial tidal disruption event AT 2022dbl

    Authors: Zheyu Lin, Ning Jiang, Tinggui Wang, Xu Kong, Dongyue Li, Han He, Yibo Wang, Jiazheng Zhu, Wentao Li, Ji-an Jiang, Avinash Singh, Rishabh Singh Teja, D. K. Sahu, Chichuan Jin, Keiichi Maeda, Shifeng Huang

    Abstract: The unluckiest star orbits a supermassive black hole elliptically. Every time it reaches the pericenter, it shallowly enters the tidal radius and gets partially tidal disrupted, producing a series of flares. Confirmation of a repeated partial tidal disruption event (pTDE) requires not only evidence to rule out other types of transients, but also proof that only one star is involved, as TDEs from m… ▽ More

    Submitted 29 July, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

    Comments: 17 pages, 10 figures, accepted by ApJ Letters on 2024 July 15

  25. arXiv:2405.10775  [pdf, other

    astro-ph.HE astro-ph.CO

    A Novel Model for the MeV Emission Line in GRB 221009A

    Authors: Yu-Jia Wei, Jia Ren, Hao-Ning He, Yuan-Pei Yang, Da-Ming Wei, Zi-Gao Dai, B. Theodore Zhang

    Abstract: Gamma-ray bursts (GRBs) have long been considered potential sources of ultra-high-energy cosmic rays (UHECRs; with energy $\gtrsim 10^{18} {\rm~eV}$). In this work, we propose a novel model generating MeV emission lines in GRB, which can constrain the properties of heavy nuclei that potentially exist in GRB jets. Specifically, we find that relativistic hydrogen-like high-atomic-number ions origina… ▽ More

    Submitted 8 June, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

    Comments: 13 pages, 4 figures; Published in ApJL, https://doi.org/10.3847/2041-8213/ad4ce1

  26. arXiv:2405.10096  [pdf, other

    cs.LG cs.CR cs.DC

    The Effect of Quantization in Federated Learning: A Rényi Differential Privacy Perspective

    Authors: Tianqu Kang, Lumin Liu, Hengtao He, Jun Zhang, S. H. Song, Khaled B. Letaief

    Abstract: Federated Learning (FL) is an emerging paradigm that holds great promise for privacy-preserving machine learning using distributed data. To enhance privacy, FL can be combined with Differential Privacy (DP), which involves adding Gaussian noise to the model weights. However, FL faces a significant challenge in terms of large communication overhead when transmitting these model weights. To address… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 6 pages, 5 figures, submitted to 2024 IEEE MeditCom

  27. Stellar Chromospheric Activity Database of Solar-like Stars Based on the LAMOST Low-Resolution Spectroscopic Survey: II. the bolometric and photospheric calibration

    Authors: Weitao Zhang, Jun Zhang, Han He, Ali Luo, Haotong Zhang

    Abstract: The dependence of stellar magnetic activity on stellar parameters would be inspired by the chromospheric activity studies based on the large-scale spectroscopic surveys. The Ca II H and K lines are employed to construct indicators for assessing and studying the chromospheric activity of solar-like stars. We investigate the widely used bolometric and photospheric calibrated chromospheric activity i… ▽ More

    Submitted 22 May, 2024; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: 18 pages, 20 figures, accepted for publication in A&A

    Journal ref: A&A 688, A23 (2024)

  28. arXiv:2405.09548  [pdf, other

    eess.SP

    Efficient Bilevel Source Mask Optimization

    Authors: Guojin Chen, Hongquan He, Peng Xu, Hao Geng, Bei Yu

    Abstract: Resolution Enhancement Techniques (RETs) are critical to meet the demands of advanced technology nodes. Among RETs, Source Mask Optimization (SMO) is pivotal, concurrently optimizing both the source and the mask to expand the process window. Traditional SMO methods, however, are limited by sequential and alternating optimizations, leading to extended runtimes without performance guarantees. This p… ▽ More

    Submitted 7 March, 2024; originally announced May 2024.

    Comments: Accepted by Design Automation Conference (DAC) 2024

  29. arXiv:2405.09514  [pdf, other

    eess.SP cs.IT cs.LG

    Tackling Distribution Shifts in Task-Oriented Communication with Information Bottleneck

    Authors: Hongru Li, Jiawei Shao, Hengtao He, Shenghui Song, Jun Zhang, Khaled B. Letaief

    Abstract: Task-oriented communication aims to extract and transmit task-relevant information to significantly reduce the communication overhead and transmission latency. However, the unpredictable distribution shifts between training and test data, including domain shift and semantic shift, can dramatically undermine the system performance. In order to tackle these challenges, it is crucial to ensure that t… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 13 pages, 8 figures, submitted to IEEE for potential publication

  30. arXiv:2405.07840  [pdf, other

    cs.HC cs.CL

    Open-vocabulary Auditory Neural Decoding Using fMRI-prompted LLM

    Authors: Xiaoyu Chen, Changde Du, Che Liu, Yizhe Wang, Huiguang He

    Abstract: Decoding language information from brain signals represents a vital research area within brain-computer interfaces, particularly in the context of deciphering the semantic information from the fMRI signal. However, many existing efforts concentrate on decoding small vocabulary sets, leaving space for the exploration of open vocabulary continuous text decoding. In this paper, we introduce a novel m… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

  31. arXiv:2405.07691  [pdf, other

    astro-ph.HE

    Discovery of Very-high-energy Gamma-ray Emissions from the Low Luminosity AGN NGC 4278 by LHAASO

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: The first source catalog of Large High Altitude Air Shower Observatory reported the detection of a very-high-energy gamma ray source, 1LHAASO J1219+2915. In this paper a further detailed study of the spectral and temporal behavior of this point-like source have been carried. The best-fit position of the TeV source ($\rm{RA}=185.05^{\circ}\pm0.04^{\circ}$, $\rm{Dec}=29.25^{\circ}\pm0.03^{\circ}$) i… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 11 pages, 5 figures

  32. arXiv:2405.06707  [pdf, other

    cs.CL cs.AI

    Hypothesis Testing Prompting Improves Deductive Reasoning in Large Language Models

    Authors: Yitian Li, Jidong Tian, Hao He, Yaohui Jin

    Abstract: Combining different forms of prompts with pre-trained large language models has yielded remarkable results on reasoning tasks (e.g. Chain-of-Thought prompting). However, along with testing on more complex reasoning, these methods also expose problems such as invalid reasoning and fictional reasoning paths. In this paper, we develop \textit{Hypothesis Testing Prompting}, which adds conclusion assum… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  33. arXiv:2405.04872  [pdf, other

    cs.CL cs.AI cs.LO

    Logical Negation Augmenting and Debiasing for Prompt-based Methods

    Authors: Yitian Li, Jidong Tian, Hao He, Yaohui Jin

    Abstract: Prompt-based methods have gained increasing attention on NLP and shown validity on many downstream tasks. Many works have focused on mining these methods' potential for knowledge extraction, but few explore their ability to make logical reasoning. In this work, we focus on the effectiveness of the prompt-based methods on first-order logical reasoning and find that the bottleneck lies in logical ne… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  34. arXiv:2405.03324  [pdf

    astro-ph.HE astro-ph.GA

    Investigation of Galactic supernova remnants and their environment in 26.6° < l < 30.6°, $\vert b \vert \leq$ 1.25° using radio surveys

    Authors: Tian-Xian Luo, Ping Zhou, Hao-Ning He

    Abstract: The problem of missing Galactic supernova remnants (SNRs) refers to the issue that the currently known Galactic SNRs are significantly incomplete compared to the theoretical prediction. To expand the sample of Galactic SNRs, we use GLEAM and THOR+VGPS data across four wavebands ranging from 118 to 1420 MHz to drive a spectral index map covering the region within 26.6° < l < 30.6°,… ▽ More

    Submitted 30 June, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

    Comments: 13 pages, 6 figures; Published in AJ

  35. arXiv:2405.03280  [pdf, other

    cs.CV cs.AI

    Animate Your Thoughts: Decoupled Reconstruction of Dynamic Natural Vision from Slow Brain Activity

    Authors: Yizhuo Lu, Changde Du, Chong Wang, Xuanliu Zhu, Liuyun Jiang, Huiguang He

    Abstract: Reconstructing human dynamic vision from brain activity is a challenging task with great scientific significance. The difficulty stems from two primary issues: (1) vision-processing mechanisms in the brain are highly intricate and not fully revealed, making it challenging to directly learn a mapping between fMRI and video; (2) the temporal resolution of fMRI is significantly lower than that of nat… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  36. arXiv:2404.19733  [pdf, other

    cs.CL cs.AI

    Iterative Reasoning Preference Optimization

    Authors: Richard Yuanzhe Pang, Weizhe Yuan, Kyunghyun Cho, He He, Sainbayar Sukhbaatar, Jason Weston

    Abstract: Iterative preference optimization methods have recently been shown to perform well for general instruction tuning tasks, but typically make little improvement on reasoning tasks (Yuan et al., 2024, Chen et al., 2024). In this work we develop an iterative approach that optimizes the preference between competing generated Chain-of-Thought (CoT) candidates by optimizing for winning vs. losing reasoni… ▽ More

    Submitted 25 June, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

  37. arXiv:2404.16019  [pdf, other

    cs.CL

    The PRISM Alignment Project: What Participatory, Representative and Individualised Human Feedback Reveals About the Subjective and Multicultural Alignment of Large Language Models

    Authors: Hannah Rose Kirk, Alexander Whitefield, Paul Röttger, Andrew Bean, Katerina Margatina, Juan Ciro, Rafael Mosquera, Max Bartolo, Adina Williams, He He, Bertie Vidgen, Scott A. Hale

    Abstract: Human feedback plays a central role in the alignment of Large Language Models (LLMs). However, open questions remain about the methods (how), domains (where), people (who) and objectives (to what end) of human feedback collection. To navigate these questions, we introduce PRISM, a new dataset which maps the sociodemographics and stated preferences of 1,500 diverse participants from 75 countries, t… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

  38. Probing Neutral Triple Gauge Couplings via $\boldsymbol{Zγ\,(\ell^+\ell^-γ)}$ Production at $\boldsymbol{e^+e^-}$ Colliders

    Authors: Danning Liu, Rui-Qing Xiao, Shu Li, John Ellis, Hong-Jian He, Rui Yuan

    Abstract: Neutral triple gauge couplings (nTGCs) are absent in the Standard Model (SM) and at the dimension-6 level in the Standard Model Effective Field Theory (SMEFT), arising first from dimension-8 operators. As such, they provide a unique window for probing new physics beyond the SM. These dimension-8 operators can be mapped to nTGC form factors whose structure is consistent with the spontaneously-broke… ▽ More

    Submitted 1 July, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: Frontiers of Physics (in Press), 22 pages, 10 Figs and 10 Tables

    Report number: KCL-PH-TH/2024-18, CERN-TH-2024-046

    Journal ref: Frontiers of Physics 20 (2025) 15201, no.1 [Cover Article]

  39. Light and hyper nuclei formation at $\sqrt{s_{\text{NN}}} =$ 3 GeV Au+Au collisions using Wigner coalescence approach

    Authors: L. K. Liu, C. L. Hu, X. H. He, S. S. Shi, G. N. Xie

    Abstract: The production of light nuclei and hyper-nuclei in heavy-ion collisions, particularly at high baryon density, is crucial for understanding the dynamical evolution of the collision system and exploring the internal state of nuclear matter of compacted stellar object. Despite being a topic of ongoing debate, an improved theoretical understanding is necessary. In this work, production of light nuclei… ▽ More

    Submitted 22 July, 2024; v1 submitted 21 April, 2024; originally announced April 2024.

    Journal ref: Phys. Lett. B 855 (2024) 138853

  40. arXiv:2404.11209  [pdf, ps, other

    cs.AI cs.CV cs.MM

    Prompt-Guided Generation of Structured Chest X-Ray Report Using a Pre-trained LLM

    Authors: Hongzhao Li, Hongyu Wang, Xia Sun, Hua He, Jun Feng

    Abstract: Medical report generation automates radiology descriptions from images, easing the burden on physicians and minimizing errors. However, current methods lack structured outputs and physician interactivity for clear, clinically relevant reports. Our method introduces a prompt-guided approach to generate structured chest X-ray reports using a pre-trained large language model (LLM). First, we identify… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: Accepted by IEEE Conference on Multimedia Expo 2024

  41. arXiv:2404.10451  [pdf, other

    cond-mat.mtrl-sci

    Ultrahigh Stability of O-Sublattice in $β$-Ga$_2$O$_3$

    Authors: Ru He, Junlei Zhao, Jesper Byggmästar, Huan He, Flyura Djurabekova

    Abstract: Recently reported remarkably high radiation tolerance of $γ$/$β$-Ga$_2$O$_3$ double-polymorphic structure brings this ultrawide bandgap semiconductor to the frontiers of power electronics applications that are able to operate in challenging environments. Understanding the mechanism of radiation tolerance is crucial for further material modification and tailoring of the desired properties. In this… ▽ More

    Submitted 18 April, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

  42. arXiv:2404.09932  [pdf, other

    cs.LG cs.AI cs.CL cs.CY

    Foundational Challenges in Assuring Alignment and Safety of Large Language Models

    Authors: Usman Anwar, Abulhair Saparov, Javier Rando, Daniel Paleka, Miles Turpin, Peter Hase, Ekdeep Singh Lubana, Erik Jenner, Stephen Casper, Oliver Sourbut, Benjamin L. Edelman, Zhaowei Zhang, Mario Günther, Anton Korinek, Jose Hernandez-Orallo, Lewis Hammond, Eric Bigelow, Alexander Pan, Lauro Langosco, Tomasz Korbak, Heidi Zhang, Ruiqi Zhong, Seán Ó hÉigeartaigh, Gabriel Recchia, Giulio Corsi , et al. (13 additional authors not shown)

    Abstract: This work identifies 18 foundational challenges in assuring the alignment and safety of large language models (LLMs). These challenges are organized into three different categories: scientific understanding of LLMs, development and deployment methods, and sociotechnical challenges. Based on the identified challenges, we pose $200+$ concrete research questions.

    Submitted 15 April, 2024; originally announced April 2024.

  43. arXiv:2404.08206  [pdf

    physics.optics physics.app-ph

    Non-uniform wave momentum bandgap in biaxial anisotropic photonic time crystals

    Authors: Junhua Dong, Sihao Zhang, Huan He, Huanan Li, Jingjun Xu

    Abstract: Photonic time crystals (PTCs) host momentum bandgaps enabling intriguing non-resonant light amplification in propagating waves, but opening substantial bandgaps demands refractive index changes too extreme for conventional nonlinear optics. Here, we introduce momentum bandgaps for non-uniform waves, including evanescent and ghost types, by extending PTCs to biaxial anisotropic photonic time crysta… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  44. arXiv:2404.07443  [pdf

    physics.optics cs.ET cs.LG

    1-bit Quantized On-chip Hybrid Diffraction Neural Network Enabled by Authentic All-optical Fully-connected Architecture

    Authors: Yu Shao, Haiqi Gao, Yipeng Chen, Yujie liu, Junren Wen, Haidong He, Yuchuan Shao, Yueguang Zhang, Weidong Shen, Chenying Yang

    Abstract: Optical Diffraction Neural Networks (DNNs), a subset of Optical Neural Networks (ONNs), show promise in mirroring the prowess of electronic networks. This study introduces the Hybrid Diffraction Neural Network (HDNN), a novel architecture that incorporates matrix multiplication into DNNs, synergizing the benefits of conventional ONNs with those of DNNs to surmount the modulation limitations inhere… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  45. arXiv:2404.06687  [pdf, other

    cs.RO eess.SY

    Fast and Accurate Relative Motion Tracking for Dual Industrial Robots

    Authors: Honglu He, Chen-lung Lu, Glenn Saunders, Pinghai Yang, Jeffrey Schoonover, Leo Ajdelsztajn, John Wason, Santiago Paternain, Agung Julius, John T. Wen

    Abstract: Industrial robotic applications such as spraying, welding, and additive manufacturing frequently require fast, accurate, and uniform motion along a 3D spatial curve. To increase process throughput, some manufacturers propose a dual-robot setup to overcome the speed limitation of a single robot. Industrial robot motion is programmed through waypoints connected by motion primitives (Cartesian linear… ▽ More

    Submitted 14 August, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  46. arXiv:2404.06564  [pdf, other

    cs.CV

    MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection

    Authors: Haoyang He, Yuhu Bai, Jiangning Zhang, Qingdong He, Hongxu Chen, Zhenye Gan, Chengjie Wang, Xiangtai Li, Guanzhong Tian, Lei Xie

    Abstract: Recent advancements in anomaly detection have seen the efficacy of CNN- and transformer-based approaches. However, CNNs struggle with long-range dependencies, while transformers are burdened by quadratic computational complexity. Mamba-based models, with their superior long-range modeling and linear efficiency, have garnered substantial attention. This study pioneers the application of Mamba to mu… ▽ More

    Submitted 14 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  47. arXiv:2404.05412  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Valley edge states as bound states in the continuum

    Authors: Shunda Yin, Liping Ye, Hailong He, Xueqin Huang, Manzhu Ke, Weiyin Deng, Jiuyang Lu, Zhengyou Liu

    Abstract: Bound states in the continuum (BICs) are spatially localized states with energy embedded in the continuum spectrum of extended states. The combination of BICs physics and nontrivial band topology theory giving rise to topological BICs, which are robust against disorders and meanwhile of the merit of conventional BICs, is attracting wide attention recently. Here, we report valley edge states as top… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: A revised version has been accepted by Science Bulletin

  48. arXiv:2404.04920  [pdf, other

    cs.LG

    Regularized Conditional Diffusion Model for Multi-Task Preference Alignment

    Authors: Xudong Yu, Chenjia Bai, Haoran He, Changhong Wang, Xuelong Li

    Abstract: Sequential decision-making is desired to align with human intents and exhibit versatility across various tasks. Previous methods formulate it as a conditional generation process, utilizing return-conditioned diffusion models to directly model trajectory distributions. Nevertheless, the return-conditioned paradigm relies on pre-defined reward functions, facing challenges when applied in multi-task… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  49. arXiv:2404.04801  [pdf, ps, other

    astro-ph.IM astro-ph.HE

    LHAASO-KM2A detector simulation using Geant4

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (254 additional authors not shown)

    Abstract: KM2A is one of the main sub-arrays of LHAASO, working on gamma ray astronomy and cosmic ray physics at energies above 10 TeV. Detector simulation is the important foundation for estimating detector performance and data analysis. It is a big challenge to simulate the KM2A detector in the framework of Geant4 due to the need to track numerous photons from a large number of detector units (>6000) with… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  50. arXiv:2404.04555  [pdf, other

    astro-ph.GA

    Cloud-Scale Molecular Gas Properties of the Antennae Merger: A Comparative Study with PHANGS-ALMA Galaxies and NGC 3256

    Authors: Nathan Brunetti, Christine D. Wilson, Hao He, Jiayi Sun, Adam K. Leroy, Erik Rosolowsky, Ashley Bemis, Frank Bigiel, Brent Groves, Toshiki Saito, Eva Schinnerer

    Abstract: We present observations of the central 9 kpc of the Antennae merger (NGC 4038/9) at 55 pc resolution in the CO 2-1 line obtained with the Atacama Large Millimeter/submillimeter Array (ALMA). We use a pixel-based analysis to compare the gas properties in the Antennae to those in 70 nearby spiral galaxies from the PHANGS-ALMA survey, as well as the merger and nearest luminous infrared galaxy NGC 325… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: 16 pages, 8 figures, accepted to MNRAS