Zum Hauptinhalt springen

Showing 51–100 of 2,055 results for author: Ma, S

.
  1. arXiv:2407.03541  [pdf

    physics.optics nlin.CD

    Parallel fast random bit generation based on spectrotemporally uncorrelated Brillouin random fiber lasing oscillation

    Authors: Yuxi Pang, Shaonian Ma, Qiang Ji, Xian Zhao, Zengguang Qin, Zhaojun Liu, Ping Lu, Xiaoyi Bao, Yanping Xu

    Abstract: Correlations existing between spectral components in multi-wavelength lasers have been the key challenge that hinders these laser sources from being developed to chaotic comb entropy sources for parallel random bit generation. Herein, spectrotemporally uncorrelated multi-order Stokes/anti-Stokes emissions are achieved by cooperatively exploiting nonlinear optical processes including cascaded stimu… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  2. arXiv:2407.03390  [pdf, other

    cond-mat.mes-hall physics.optics

    Observation of Co-propagating Chiral Zero Modes in Magnetic Photonic Crystals

    Authors: Zhongfu Li, Shaojie Ma, Shuwei Li, Oubo you, Yachao Liu, Qingdong Yang, Yuanjiang Xiang, Peiheng Zhou, Shuang Zhang

    Abstract: Topological singularities, such as Weyl points and Dirac points, can give rise to unidirectional propagation channels known as chiral zero modes (CZMs) when subject to a magnetic field. These CZMs are responsible for intriguing phenomena like the chiral anomaly in quantum systems. The propagation direction of each CZM is determined by both the applied magnetic field and the topological charge of t… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 6 pages, 5 figures

  3. arXiv:2407.02805  [pdf, other

    cs.SE cs.AI

    Efficient DNN-Powered Software with Fair Sparse Models

    Authors: Xuanqi Gao, Weipeng Jiang, Juan Zhai, Shiqing Ma, Xiaoyu Zhang, Chao Shen

    Abstract: With the emergence of the Software 3.0 era, there is a growing trend of compressing and integrating large models into software systems, with significant societal implications. Regrettably, in numerous instances, model compression techniques impact the fairness performance of these models and thus the ethical behavior of DNN-powered software. One of the most notable example is the Lottery Ticket Hy… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  4. arXiv:2407.01896  [pdf, other

    cs.CL cs.IR

    LogEval: A Comprehensive Benchmark Suite for Large Language Models In Log Analysis

    Authors: Tianyu Cui, Shiyu Ma, Ziang Chen, Tong Xiao, Shimin Tao, Yilun Liu, Shenglin Zhang, Duoming Lin, Changchang Liu, Yuzhe Cai, Weibin Meng, Yongqian Sun, Dan Pei

    Abstract: Log analysis is crucial for ensuring the orderly and stable operation of information systems, particularly in the field of Artificial Intelligence for IT Operations (AIOps). Large Language Models (LLMs) have demonstrated significant potential in natural language processing tasks. In the AIOps domain, they excel in tasks such as anomaly detection, root cause analysis of faults, operations and maint… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  5. arXiv:2407.01537  [pdf, other

    cs.RO cs.CV

    WaveShot: A Compact Portable Unmanned Surface Vessel for Dynamic Water Surface Videography and Media Production

    Authors: Shijian Ma, Shicong Ma, Jianhao Jiao

    Abstract: This paper presents WaveShot, an innovative portable unmanned surface vessel that aims to transform water surface videography by offering a highly maneuverable, cost-effective, and safe alternative to traditional filming methods. WaveShot is designed for the modern demands of film production, advertising, documentaries, and visual arts, equipped with professional-grade waterproof cameras and advan… ▽ More

    Submitted 13 August, 2024; v1 submitted 12 March, 2024; originally announced July 2024.

  6. arXiv:2407.01349  [pdf, other

    cs.CV cs.RO

    PanopticRecon: Leverage Open-vocabulary Instance Segmentation for Zero-shot Panoptic Reconstruction

    Authors: Xuan Yu, Yili Liu, Chenrui Han, Sitong Mao, Shunbo Zhou, Rong Xiong, Yiyi Liao, Yue Wang

    Abstract: Panoptic reconstruction is a challenging task in 3D scene understanding. However, most existing methods heavily rely on pre-trained semantic segmentation models and known 3D object bounding boxes for 3D panoptic segmentation, which is not available for in-the-wild scenes. In this paper, we propose a novel zero-shot panoptic reconstruction method from RGB-D images of scenes. For zero-shot segmentat… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  7. arXiv:2407.01006  [pdf, other

    eess.SP

    Multi-Functional Beamforming Design for Integrated Sensing, Communication, and Computation

    Authors: Yapeng Zhao, Qingqing Wu, Wen Chen, Yong Zeng, Ruiqi Liu, Weidong Mei, Fen Hou, Shaodan Ma

    Abstract: Integrated sensing and communication (ISAC) systems may face a heavy computation burden since the sensory data needs to be further processed. This paper studies a novel system that integrates sensing, communication, and computation, aiming to provide services for different objectives efficiently. This system consists of a multi-antenna multi-functional base station (BS), an edge server, a target,… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  8. arXiv:2407.00466  [pdf, other

    cs.CL cs.AI

    BioKGBench: A Knowledge Graph Checking Benchmark of AI Agent for Biomedical Science

    Authors: Xinna Lin, Siqi Ma, Junjie Shan, Xiaojing Zhang, Shell Xu Hu, Tiannan Guo, Stan Z. Li, Kaicheng Yu

    Abstract: Pursuing artificial intelligence for biomedical science, a.k.a. AI Scientist, draws increasing attention, where one common approach is to build a copilot agent driven by Large Language Models (LLMs). However, to evaluate such systems, people either rely on direct Question-Answering (QA) to the LLM itself, or in a biomedical experimental manner. How to precisely benchmark biomedical agents from an… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  9. arXiv:2407.00247  [pdf, other

    cs.CV

    Prompt Refinement with Image Pivot for Text-to-Image Generation

    Authors: Jingtao Zhan, Qingyao Ai, Yiqun Liu, Yingwei Pan, Ting Yao, Jiaxin Mao, Shaoping Ma, Tao Mei

    Abstract: For text-to-image generation, automatically refining user-provided natural language prompts into the keyword-enriched prompts favored by systems is essential for the user experience. Such a prompt refinement process is analogous to translating the prompt from "user languages" into "system languages". However, the scarcity of such parallel corpora makes it difficult to train a prompt refinement mod… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

    Comments: Accepted by ACL 2024

  10. arXiv:2406.19581  [pdf, ps, other

    cs.HC cs.LG

    HarmonICA: Neural non-stationarity correction and source separation for motor neuron interfaces

    Authors: Alexander Kenneth Clarke, Agnese Grison, Irene Mendez Guerra, Pranav Mamidanna, Shihan Ma, Silvia Muceli, Dario Farina

    Abstract: A major outstanding problem when interfacing with spinal motor neurons is how to accurately compensate for non-stationary effects in the signal during source separation routines, particularly when they cannot be estimated in advance. This forces current systems to instead use undifferentiated bulk signal, which limits the potential degrees of freedom for control. In this study we propose a potenti… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  11. arXiv:2406.18025  [pdf, ps, other

    hep-ph

    Precise determination of the bottom-quark on-shell mass using its four-loop relation to the $\overline{\rm MS}$-scheme running mass

    Authors: Shun-Yue Ma, Xu-Dong Huang, Xu-Chang Zheng, Xing-Gang Wu

    Abstract: In this paper, we explore the properties of the bottom-quark on-shell mass ($M_b$) by using its relation to the $\overline{\rm MS}$ mass (${\overline m}_b$). At present, this $\overline{\rm MS}$-on-shell relation has been known up to four-loop QCD corrections, which however still has a $\sim 2\%$ scale uncertainty by taking the renormalization scale as ${\overline m}_b({\overline m}_b)$ and varyin… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 5 pages, 2 figures

  12. arXiv:2406.16457  [pdf, other

    cond-mat.mtrl-sci

    A hybrid FEM-NN optimization method to learn the physics-constrained constitutive relations from full-field data

    Authors: Xinxin Wu, Kaiqiang Sun, Shaohua Yang, Huan Wang, Ye Xu, Yin Zhang, Sheng Mao

    Abstract: Neural networks (NNs) have demonstrated strong capabilities of representing high-dimensional, complex functional relations, and hence have been widely used to characterize complex constitutive relations for various types of materials, such as polycrystals, polymers, etc. However, to construct a reliable NN-based constitutive model, a considerable amount of data, i.e. stress-strain states along dif… ▽ More

    Submitted 30 July, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: 14 pages,7 figures

  13. arXiv:2406.14367  [pdf, other

    cs.CV cs.AI

    PoseBench: Benchmarking the Robustness of Pose Estimation Models under Corruptions

    Authors: Sihan Ma, Jing Zhang, Qiong Cao, Dacheng Tao

    Abstract: Pose estimation aims to accurately identify anatomical keypoints in humans and animals using monocular images, which is crucial for various applications such as human-machine interaction, embodied AI, and autonomous driving. While current models show promising results, they are typically trained and tested on clean data, potentially overlooking the corruption during real-world deployment and thus… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Technical report. Project page: https://xymsh.github.io/PoseBench/

  14. arXiv:2406.13531  [pdf, ps, other

    hep-ph nucl-th

    LQCD constrained magnetic field dependent coupling constant in an effective model

    Authors: Shijun Mao

    Abstract: A magnetic field dependent coupling constant $G(eB)$ is investigated in the two-flavor magnetized NJL model. Based on LQCD results of the neutral (charged) pion mass spectra at vanishing temperature and finite magnetic field, we determine the $G(eB)=G^0(eB)$ ($G(eB)=G^+(eB)$) in the NJL model. $G^0(eB)$ and $G^+(eB)$ are both non-monotonic functions of magnetic fields, but they are different from… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 8 pages, 4 figures

  15. arXiv:2406.13117  [pdf, other

    cs.AI

    State-of-the-Art Review: The Use of Digital Twins to Support Artificial Intelligence-Guided Predictive Maintenance

    Authors: Sizhe Ma, Katherine A. Flanigan, Mario Bergés

    Abstract: In recent years, predictive maintenance (PMx) has gained prominence for its potential to enhance efficiency, automation, accuracy, and cost-effectiveness while reducing human involvement. Importantly, PMx has evolved in tandem with digital advancements, such as Big Data and the Internet of Things (IOT). These technological strides have enabled Artificial Intelligence (AI) to revolutionize PMx proc… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: This work has been submitted to Springer for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  16. arXiv:2406.12798  [pdf, other

    astro-ph.EP astro-ph.SR

    The Aligned Orbit of a Hot Jupiter around the M Dwarf TOI-4201

    Authors: Tianjun Gan, Sharon X. Wang, Fei Dai, Joshua N. Winn, Shude Mao, Siyi Xu, Enric Pallé, Jacob L. Bean, Madison Brady, Nina Brown, Cicero Lu, Rafael Luque, Teo Mocnik, Andreas Seifahrt, Guðmundur K. Stefánsson

    Abstract: Measuring the obliquities of stars hosting giant planets may shed light on the dynamical history of planetary systems. Significant efforts have been made to measure the obliquities of FGK stars with hot Jupiters, mainly based on observations of the Rossiter-McLaughlin effect. In contrast, M dwarfs with hot Jupiters have hardly been explored, because such systems are rare and often not favorable fo… ▽ More

    Submitted 19 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

    Comments: 12 pages, 5 figures, 3 tables, accepted to ApJL

  17. arXiv:2406.12196  [pdf, other

    cs.SE

    CITADEL: Context Similarity Based Deep Learning Framework Bug Finding

    Authors: Xiaoyu Zhang, Juan Zhai, Shiqing Ma, Shiwei Wang, Chao Shen

    Abstract: With deep learning (DL) technology becoming an integral part of the new intelligent software, tools of DL framework testing and bug-finding are in high demand. Existing DL framework testing tools have limited coverage on bug types. For example, they lack the capability of finding performance bugs, which are critical for DL model training and inference regarding performance, economics, and the envi… ▽ More

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: 12 pages, 10 figures

  18. arXiv:2406.11931  [pdf, other

    cs.SE cs.AI cs.LG

    DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

    Authors: DeepSeek-AI, Qihao Zhu, Daya Guo, Zhihong Shao, Dejian Yang, Peiyi Wang, Runxin Xu, Y. Wu, Yukun Li, Huazuo Gao, Shirong Ma, Wangding Zeng, Xiao Bi, Zihui Gu, Hanwei Xu, Damai Dai, Kai Dong, Liyue Zhang, Yishi Piao, Zhibin Gou, Zhenda Xie, Zhewen Hao, Bingxuan Wang, Junxiao Song, Deli Chen , et al. (15 additional authors not shown)

    Abstract: We present DeepSeek-Coder-V2, an open-source Mixture-of-Experts (MoE) code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks. Specifically, DeepSeek-Coder-V2 is further pre-trained from an intermediate checkpoint of DeepSeek-V2 with additional 6 trillion tokens. Through this continued pre-training, DeepSeek-Coder-V2 substantially enhances the coding and mathe… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  19. arXiv:2406.11698  [pdf, other

    cs.CL

    Meta Reasoning for Large Language Models

    Authors: Peizhong Gao, Ao Xie, Shaoguang Mao, Wenshan Wu, Yan Xia, Haipeng Mi, Furu Wei

    Abstract: We introduce Meta-Reasoning Prompting (MRP), a novel and efficient system prompting method for large language models (LLMs) inspired by human meta-reasoning. Traditional in-context learning-based reasoning techniques, such as Tree-of-Thoughts, show promise but lack consistent state-of-the-art performance across diverse tasks due to their specialized nature. MRP addresses this limitation by guiding… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  20. arXiv:2406.11633  [pdf, other

    cs.CV

    DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language Models

    Authors: Renqiu Xia, Song Mao, Xiangchao Yan, Hongbin Zhou, Bo Zhang, Haoyang Peng, Jiahao Pi, Daocheng Fu, Wenjie Wu, Hancheng Ye, Shiyang Feng, Bin Wang, Chao Xu, Conghui He, Pinlong Cai, Min Dou, Botian Shi, Sheng Zhou, Yongwei Wang, Bin Wang, Junchi Yan, Fei Wu, Yu Qiao

    Abstract: Scientific documents record research findings and valuable human knowledge, comprising a vast corpus of high-quality data. Leveraging multi-modality data extracted from these documents and assessing large models' abilities to handle scientific document-oriented tasks is therefore meaningful. Despite promising advancements, large models still perform poorly on multi-page scientific document extract… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Homepage of DocGenome: https://unimodal4reasoning.github.io/DocGenome_page 22 pages, 11 figures

  21. arXiv:2406.10104  [pdf, ps, other

    math.AG

    A moduli space of stable sheaves on a cubic threefold

    Authors: Shihao Ma, Song Yang

    Abstract: In this paper, we prove that the moduli space $\overline{M}_{X}(ν)$ of $H$-Gieseker semistable sheaves on a smooth cubic threefold $X$ with Chern character $ν=(4,-H,-\frac{5}{6}H^{2},\frac{1}{6}H^{3})$ is non-empty, smooth and irreducible of dimension $8$.

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 16 pages. Comments are very welcome

  22. arXiv:2406.09627  [pdf, other

    cs.CV cs.AI eess.IV

    RobustSAM: Segment Anything Robustly on Degraded Images

    Authors: Wei-Ting Chen, Yu-Jiet Vong, Sy-Yen Kuo, Sizhuo Ma, Jian Wang

    Abstract: Segment Anything Model (SAM) has emerged as a transformative approach in image segmentation, acclaimed for its robust zero-shot segmentation capabilities and flexible prompting system. Nonetheless, its performance is challenged by images with degraded quality. Addressing this limitation, we propose the Robust Segment Anything Model (RobustSAM), which enhances SAM's performance on low-quality image… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted by CVPR2024 (Highlight); Project Page: https://robustsam.github.io/

  23. arXiv:2406.09622  [pdf, other

    cs.CV cs.AI eess.IV

    DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer

    Authors: Wei-Ting Chen, Gurunandan Krishnan, Qiang Gao, Sy-Yen Kuo, Sizhuo Ma, Jian Wang

    Abstract: Generic Face Image Quality Assessment (GFIQA) evaluates the perceptual quality of facial images, which is crucial in improving image restoration algorithms and selecting high-quality face images for downstream tasks. We present a novel transformer-based method for GFIQA, which is aided by two unique mechanisms. First, a Dual-Set Degradation Representation Learning (DSL) mechanism uses facial image… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted by CVPR 2024, Project Page: https://dsl-fiqa.github.io/

  24. arXiv:2406.09389  [pdf, other

    eess.IV cs.CV

    Sagiri: Low Dynamic Range Image Enhancement with Generative Diffusion Prior

    Authors: Baiang Li, Sizhuo Ma, Yanhong Zeng, Xiaogang Xu, Youqing Fang, Zhao Zhang, Jian Wang, Kai Chen

    Abstract: Capturing High Dynamic Range (HDR) scenery using 8-bit cameras often suffers from over-/underexposure, loss of fine details due to low bit-depth compression, skewed color distributions, and strong noise in dark areas. Traditional LDR image enhancement methods primarily focus on color mapping, which enhances the visual representation by expanding the image's color range and adjusting the brightness… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: https://sagiri0208.github.io

  25. arXiv:2406.08887  [pdf, other

    eess.SP

    Low-Overhead Channel Estimation via 3D Extrapolation for TDD mmWave Massive MIMO Systems Under High-Mobility Scenarios

    Authors: Binggui Zhou, Xi Yang, Shaodan Ma, Feifei Gao, Guanghua Yang

    Abstract: In TDD mmWave massive MIMO systems, the downlink CSI can be attained through uplink channel estimation thanks to the uplink-downlink channel reciprocity. However, the channel aging issue is significant under high-mobility scenarios and thus necessitates frequent uplink channel estimation. In addition, large amounts of antennas and subcarriers lead to high-dimensional CSI matrices, aggravating the… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: 13 pages, 11 figures, 3 tables. This paper has been submitted to IEEE journal for possible publication

  26. arXiv:2406.08851  [pdf, other

    cs.LG

    Inverse Probability of Treatment Weighting with Deep Sequence Models Enables Accurate treatment effect Estimation from Electronic Health Records

    Authors: Junghwan Lee, Simin Ma, Nicoleta Serban, Shihao Yang

    Abstract: Observational data have been actively used to estimate treatment effect, driven by the growing availability of electronic health records (EHRs). However, EHRs typically consist of longitudinal records, often introducing time-dependent confoundings that hinder the unbiased estimation of treatment effect. Inverse probability of treatment weighting (IPTW) is a widely used propensity score method sinc… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  27. arXiv:2406.08239  [pdf, ps, other

    math-ph

    Infinite-dimensional Frobenius Manifolds Underlying the genus-zero Universal Whitham Hierarchy

    Authors: Shilin Ma

    Abstract: In this paper, we construct a new class of infinite-dimensional Frobenius manifolds on the spaces of pairs of meromorphic functions that are defined on specific regions of the Riemann sphere. We demonstrate that the principal hierarchy of these Frobenius manifolds serves as an extension of the genus-zero universal Whitham hierarchy.

    Submitted 12 June, 2024; originally announced June 2024.

  28. arXiv:2406.07411  [pdf, other

    cs.SE cs.CL

    VersiCode: Towards Version-controllable Code Generation

    Authors: Tongtong Wu, Weigang Wu, Xingyu Wang, Kang Xu, Suyu Ma, Bo Jiang, Ping Yang, Zhenchang Xing, Yuan-Fang Li, Gholamreza Haffari

    Abstract: Significant research has focused on improving the performance of large language model on code-related tasks due to their practical importance. Although performance is typically evaluated using public benchmark datasets, the existing datasets do not account for the concept of \emph{version}, which is crucial in professional software development. In this paper, we introduce VersiCode, the first comp… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  29. arXiv:2406.06383  [pdf, ps, other

    quant-ph

    Dual-cavity controllable quantum battery

    Authors: Dayang Zhang, Shuangquan Ma, Yunxiu Jiang, Youbin Yu, Guangri Jin, Aixi Chen

    Abstract: With the rapid development of quantum science and technology, quantum batteries have also emerged. However, there are still many unresolved issues in the field of quantum batteries. For example, how to improve battery space utilization, maximize battery energy storage, and how to increase and control the charging power of quantum batteries. A major challenge is how to achieve better charging power… ▽ More

    Submitted 18 July, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  30. arXiv:2406.06373  [pdf, other

    quant-ph

    Entanglement and steering in quantum batteries

    Authors: Dayang Zhang, Shuangquan Ma, Yunxiu Jiang, Youbin Yu, Guangri Jin, Aixi Chen

    Abstract: The advantage of quantum batteries is that quantum resources can be used to improve charging efficiency. The quantum resources that are known to be available are: quantum entanglement and quantum coherence. In this paper, we introduce quantum steering as a new quantum resource into batteries for the first time. We analyze the relationship between quantum steering, quantum entanglement, energy stor… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  31. arXiv:2406.06365  [pdf, ps, other

    math.CO

    A curious symmetric decomposition of the (des, exc)-Eulerian polynomials

    Authors: Shi-Mei Ma, Toufik Mansour, Yeong-Nan Yeh

    Abstract: One of the most central result in combinatorics says that the descent statistic and the excedance statistic are equidistribued over the symmetric group. As a continuation of the work of Shareshian-Wachs (Adv. Math., 225(6) (2010), 2921--2966), we provide a curious $t$-symmetric decomposition for the generating polynomial of the joint distribution of the descent and excedance statistics over the sy… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 7 pages

    MSC Class: 05A05

  32. arXiv:2406.05927  [pdf, other

    cs.CV cs.CR cs.LG

    MeanSparse: Post-Training Robustness Enhancement Through Mean-Centered Feature Sparsification

    Authors: Sajjad Amini, Mohammadreza Teymoorianfard, Shiqing Ma, Amir Houmansadr

    Abstract: We present a simple yet effective method to improve the robustness of Convolutional Neural Networks (CNNs) against adversarial examples by post-processing an adversarially trained model. Our technique, MeanSparse, cascades the activation functions of a trained model with novel operators that sparsify mean-centered feature vectors. This is equivalent to reducing feature variations around the mean,… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  33. arXiv:2406.05688  [pdf, other

    cs.CL cs.AI cs.LG

    Peer Review as A Multi-Turn and Long-Context Dialogue with Role-Based Interactions

    Authors: Cheng Tan, Dongxin Lyu, Siyuan Li, Zhangyang Gao, Jingxuan Wei, Siqi Ma, Zicheng Liu, Stan Z. Li

    Abstract: Large Language Models (LLMs) have demonstrated wide-ranging applications across various fields and have shown significant potential in the academic peer-review process. However, existing applications are primarily limited to static review generation based on submitted papers, which fail to capture the dynamic and iterative nature of real-world peer reviews. In this paper, we reformulate the peer-r… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: Under review

  34. arXiv:2406.04211  [pdf, ps, other

    math.CO

    Stirling permutation codes. II

    Authors: Shi-Mei Ma, Hao Qi, Jean Yeh, Yeong-Nan Yeh

    Abstract: In the context of Stirling polynomials, Gessel and Stanley introduced the definition of Stirling permutation, which has attracted extensive attention over the past decades. Recently, we introduced Stirling permutation code and provided numerous equidistribution results as applications. The purpose of the present work is to further analyse Stirling permutation code. First, we derive an expansion fo… ▽ More

    Submitted 8 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: 19 pages

    MSC Class: 05A19; 05E05

  35. arXiv:2406.01593  [pdf, other

    cs.CV

    Reconstructing and Simulating Dynamic 3D Objects with Mesh-adsorbed Gaussian Splatting

    Authors: Shaojie Ma, Yawei Luo, Yi Yang

    Abstract: 3D reconstruction and simulation, while interrelated, have distinct objectives: reconstruction demands a flexible 3D representation adaptable to diverse scenes, whereas simulation requires a structured representation to model motion principles effectively. This paper introduces the Mesh-adsorbed Gaussian Splatting (MaGS) method to resolve such a dilemma. MaGS constrains 3D Gaussians to hover on th… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Project Page: see https://wcwac.github.io/MaGS-page/

  36. arXiv:2406.01482  [pdf, other

    physics.atom-ph quant-ph

    Spectroscopy and modeling of $^{171}$Yb Rydberg states for high-fidelity two-qubit gates

    Authors: Michael Peper, Yiyi Li, Daniel Y. Knapp, Mila Bileska, Shuo Ma, Genyue Liu, Pai Peng, Bichen Zhang, Sebastian P. Horvath, Alex P. Burgers, Jeff D. Thompson

    Abstract: We present multichannel quantum defect (MQDT) models for highly excited $^{174}$Yb and $^{171}$Yb Rydberg states with $L \leq 2$. The models are developed using a combination of existing literature data and new, high-precision laser and microwave spectroscopy in an atomic beam, and validated by detailed comparison with experimentally measured Stark shifts and magnetic moments. We then use these mo… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  37. arXiv:2406.00699  [pdf, other

    cs.CV

    Towards General Robustness Verification of MaxPool-based Convolutional Neural Networks via Tightening Linear Approximation

    Authors: Yuan Xiao, Shiqing Ma, Juan Zhai, Chunrong Fang, Jinyuan Jia, Zhenyu Chen

    Abstract: The robustness of convolutional neural networks (CNNs) is vital to modern AI-driven systems. It can be quantified by formal verification by providing a certified lower bound, within which any perturbation does not alter the original input's classification result. It is challenging due to nonlinear components, such as MaxPool. At present, many verification methods are sound but risk losing some pre… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: Accepted to CVPR2024. Project page: https://github.com/xiaoyuanpigo/maxlin

  38. arXiv:2406.00602  [pdf, other

    cs.SE cs.PL

    From Effectiveness to Efficiency: Comparative Evaluation of Code Generated by LCGMs for Bilingual Programming Questions

    Authors: Weipeng Jiang, Xuanqi Gao, Juan Zhai, Shiqing Ma, Xiaoyu Zhang, Chao Shen

    Abstract: Large Code Generation Models (LCGMs) have garnered significant attention and achieved promising results across various programming tasks. However, concerns arise regarding performance when using non-English prompts, as these models are primarily trained on English-centric corpora, and most programming language tokens resemble English. Existing benchmarks often rely on English programming questions… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: 10 and a quarter pages, 6 figures

  39. arXiv:2405.20773  [pdf, other

    cs.CR cs.AI

    Visual-RolePlay: Universal Jailbreak Attack on MultiModal Large Language Models via Role-playing Image Character

    Authors: Siyuan Ma, Weidi Luo, Yu Wang, Xiaogeng Liu

    Abstract: With the advent and widespread deployment of Multimodal Large Language Models (MLLMs), ensuring their safety has become increasingly critical. To achieve this objective, it requires us to proactively discover the vulnerability of MLLMs by exploring the attack methods. Thus, structure-based jailbreak attacks, where harmful semantic content is embedded within images, have been proposed to mislead th… ▽ More

    Submitted 12 June, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

  40. arXiv:2405.20568  [pdf, other

    cs.LG cs.NI

    Generative AI for Deep Reinforcement Learning: Framework, Analysis, and Use Cases

    Authors: Geng Sun, Wenwen Xie, Dusit Niyato, Fang Mei, Jiawen Kang, Hongyang Du, Shiwen Mao

    Abstract: As a form of artificial intelligence (AI) technology based on interactive learning, deep reinforcement learning (DRL) has been widely applied across various fields and has achieved remarkable accomplishments. However, DRL faces certain limitations, including low sample efficiency and poor generalization. Therefore, we present how to leverage generative AI (GAI) to address these issues above and en… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  41. Correctable Landmark Discovery via Large Models for Vision-Language Navigation

    Authors: Bingqian Lin, Yunshuang Nie, Ziming Wei, Yi Zhu, Hang Xu, Shikui Ma, Jianzhuang Liu, Xiaodan Liang

    Abstract: Vision-Language Navigation (VLN) requires the agent to follow language instructions to reach a target position. A key factor for successful navigation is to align the landmarks implied in the instruction with diverse visual observations. However, previous VLN agents fail to perform accurate modality alignment especially in unexplored scenes, since they learn from limited navigation data and lack s… ▽ More

    Submitted 5 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted by TPAMI 2024

  42. arXiv:2405.18303  [pdf, other

    gr-qc

    Nonlinear effect of absorption on the ringdown of a spinning black hole

    Authors: Taillte May, Sizheng Ma, Justin L. Ripley, William E. East

    Abstract: The ringdown gravitational wave signal arising e.g., in the final stage of a black hole binary merger, contains important information about the properties of the remnant, and can potentially be used to perform clean tests of general relativity. However, interpreting the ringdown signal, in particular when it is the loudest, requires understanding the role of nonlinearities and their potential impa… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 29 pages, 16 figures

  43. arXiv:2405.18240  [pdf, other

    cs.CV

    MSPE: Multi-Scale Patch Embedding Prompts Vision Transformers to Any Resolution

    Authors: Wenzhuo Liu, Fei Zhu, Shijie Ma, Cheng-Lin Liu

    Abstract: Although Vision Transformers (ViTs) have recently advanced computer vision tasks significantly, an important real-world problem was overlooked: adapting to variable input resolutions. Typically, images are resized to a fixed resolution, such as 224x224, for efficiency during training and inference. However, uniform input size conflicts with real-world scenarios where images naturally vary in resol… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  44. arXiv:2405.18058  [pdf, other

    cs.IR

    ReChorus2.0: A Modular and Task-Flexible Recommendation Library

    Authors: Jiayu Li, Hanyu Li, Zhiyu He, Weizhi Ma, Peijie Sun, Min Zhang, Shaoping Ma

    Abstract: With the applications of recommendation systems rapidly expanding, an increasing number of studies have focused on every aspect of recommender systems with different data inputs, models, and task settings. Therefore, a flexible library is needed to help researchers implement the experimental strategies they require. Existing open libraries for recommendation scenarios have enabled reproducing vari… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 10 pages, 3 figures. Under review

  45. arXiv:2405.17792  [pdf, other

    hep-ex hep-ph

    JUNO Sensitivity to Invisible Decay Modes of Neutrons

    Authors: JUNO Collaboration, Angel Abusleme, Thomas Adam, Kai Adamowicz, Shakeel Ahmad, Rizwan Ahmed, Sebastiano Aiello, Fengpeng An, Qi An, Giuseppe Andronico, Nikolay Anfimov, Vito Antonelli, Tatiana Antoshkina, João Pedro Athayde Marcondes de André, Didier Auguste, Weidong Bai, Nikita Balashov, Wander Baldini, Andrea Barresi, Davide Basilico, Eric Baussan, Marco Bellato, Marco Beretta, Antonio Bergnoli, Daniel Bick , et al. (635 additional authors not shown)

    Abstract: We explore the bound neutrons decay into invisible particles (e.g., $n\rightarrow 3 ν$ or $nn \rightarrow 2 ν$) in the JUNO liquid scintillator detector. The invisible decay includes two decay modes: $ n \rightarrow { inv} $ and $ nn \rightarrow { inv} $. The invisible decays of $s$-shell neutrons in $^{12}{\rm C}$ will leave a highly excited residual nucleus. Subsequently, some de-excitation mode… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 28 pages, 7 figures, 4 tables

  46. arXiv:2405.15339  [pdf, other

    eess.SP

    Environment Sensing-aided Beam Prediction with Transfer Learning for Smart Factory

    Authors: Yuan Feng, Chuanbing Zhao, Feifei Gao, Yong Zhang, Shaodan Ma

    Abstract: In this paper, we propose an environment sensing-aided beam prediction model for smart factory that can be transferred from given environments to a new environment. In particular, we first design a pre-training model that predicts the optimal beam by sensing the present environmental information. When encountering a new environment, it generally requires collecting a large amount of new training d… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  47. arXiv:2405.15121  [pdf, other

    physics.optics

    Two-octave frequency combs from all-silica-fiber implementation

    Authors: Yanyan Zhang, Mingkun Li, Pan Zhang, Yueqing Du, Shibang Ma, Yuanshan Liu, Sida Xing, Shougang Zhang

    Abstract: Mid-infrared frequency comb spectroscopy enables measurement of molecular at megahertz spectral resolution, sub-hertz frequency accuracy and microsecond acquisition speed. However, the widespread adoption of this technique has been hindered by the complexity and alignment sensitivity of mid-infrared frequency comb sources. Leveraging the underexplored mid-infrared window of silica fibers presents… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  48. arXiv:2405.14866  [pdf, other

    cs.CV

    Tele-Aloha: A Low-budget and High-authenticity Telepresence System Using Sparse RGB Cameras

    Authors: Hanzhang Tu, Ruizhi Shao, Xue Dong, Shunyuan Zheng, Hao Zhang, Lili Chen, Meili Wang, Wenyu Li, Siyan Ma, Shengping Zhang, Boyao Zhou, Yebin Liu

    Abstract: In this paper, we present a low-budget and high-authenticity bidirectional telepresence system, Tele-Aloha, targeting peer-to-peer communication scenarios. Compared to previous systems, Tele-Aloha utilizes only four sparse RGB cameras, one consumer-grade GPU, and one autostereoscopic screen to achieve high-resolution (2048x2048), real-time (30 fps), low-latency (less than 150ms) and robust distant… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Paper accepted by SIGGRAPH 2024. Project page: http://118.178.32.38/c/Tele-Aloha/

  49. arXiv:2405.14672  [pdf, other

    cs.CV

    Towards Imperceptible Backdoor Attack in Self-supervised Learning

    Authors: Hanrong Zhang, Zhenting Wang, Tingxu Han, Mingyu Jin, Chenlu Zhan, Mengnan Du, Hongwei Wang, Shiqing Ma

    Abstract: Self-supervised learning models are vulnerable to backdoor attacks. Existing backdoor attacks that are effective in self-supervised learning often involve noticeable triggers, like colored patches, which are vulnerable to human inspection. In this paper, we propose an imperceptible and effective backdoor attack against self-supervised models. We first find that existing imperceptible triggers desi… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  50. arXiv:2405.14431  [pdf, other

    cs.CL cs.AI cs.IR

    RaFe: Ranking Feedback Improves Query Rewriting for RAG

    Authors: Shengyu Mao, Yong Jiang, Boli Chen, Xiao Li, Peng Wang, Xinyu Wang, Pengjun Xie, Fei Huang, Huajun Chen, Ningyu Zhang

    Abstract: As Large Language Models (LLMs) and Retrieval Augmentation Generation (RAG) techniques have evolved, query rewriting has been widely incorporated into the RAG system for downstream tasks like open-domain QA. Many works have attempted to utilize small models with reinforcement learning rather than costly LLMs to improve query rewriting. However, current methods require annotations (e.g., labeled re… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 16 pages