Zum Hauptinhalt springen

Showing 1–50 of 3,119 results for author: Sun, X

.
  1. arXiv:2408.17071  [pdf, other

    hep-ex

    Search for $h_c \to π^+π^-J/ψ$ via $ψ(3686)\to π^0h_c$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (653 additional authors not shown)

    Abstract: Using $(2712.4 \pm 14.3) \times 10^6~ψ$(3686) events collected with the BESIII detector operating at the BEPCII collider, we search for the hadronic transition $h_c \to π^+π^-J/ψ$ via $ψ(3686)\to π^0 h_c$. No significant signal is observed. We set the most stringent upper limits to date on the branching fractions $\mathcal{B}(ψ(3686)\to π^0 h_c)\times\mathcal{B}(h_c\toπ^+π^-J/ψ)$ and… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

  2. arXiv:2408.16706  [pdf, other

    cs.PL cs.SE

    Incremental Context-free Grammar Inference in Black Box Settings

    Authors: Feifei Li, Xiao Chen, Xi Xiao, Xiaoyu Sun, Chuan Chen, Shaohua Wang, Jitao Han

    Abstract: Black-box context-free grammar inference presents a significant challenge in many practical settings due to limited access to example programs. The state-of-the-art methods, Arvada and Treevada, employ heuristic approaches to generalize grammar rules, initiating from flat parse trees and exploring diverse generalization sequences. We have observed that these approaches suffer from low quality and… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

  3. arXiv:2408.16654  [pdf, other

    hep-ex

    Measurement of the Decay $Ξ^{0}\toΛγ$ with Entangled $Ξ^{0}\barΞ^{0}$ Pairs

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: In this Letter, a systematic study of the weak radiative hyperon decay $Ξ^{0}\toΛγ$ at an electron-positron collider using entangled $Ξ^{0}\barΞ^{0}$ pair events is presented. The absolute branching fraction for this decay has been measured for the first time, and is $\left(1.347 \pm 0.066_{\mathrm stat.}\pm0.054_{\mathrm syst.}\right)\times 10^{-3}$. The decay asymmetry parameter, which character… ▽ More

    Submitted 29 August, 2024; v1 submitted 29 August, 2024; originally announced August 2024.

    Comments: 10 pages, 3 figures

  4. arXiv:2408.16540  [pdf, other

    cs.CV

    GRPose: Learning Graph Relations for Human Image Generation with Pose Priors

    Authors: Xiangchen Yin, Donglin Di, Lei Fan, Hao Li, Chen Wei, Xiaofei Gou, Yang Song, Xiao Sun, Xun Yang

    Abstract: Recent methods using diffusion models have made significant progress in human image generation with various additional controls such as pose priors. However, existing approaches still struggle to generate high-quality images with consistent pose alignment, resulting in unsatisfactory outputs. In this paper, we propose a framework delving into the graph relations of pose priors to provide control i… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: The code will be released at https://github.com/XiangchenYin/GRPose

  5. arXiv:2408.16279  [pdf, ps, other

    hep-ex

    Model-independent determination of the strong-phase difference between $D^0$ and $\bar{D}^0 \to π^+π^-π^+π^-$ decays

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (647 additional authors not shown)

    Abstract: Measurements of the strong-phase difference between $D^0$ and $\bar{D}^0\toπ^+π^-π^+π^-$ are performed in bins of phase space. The study exploits a sample of quantum-correlated $D\bar{D}$ mesons collected by the BESIII experiment in $e^+e^-$ collisions at a center-of-mass energy of 3.773~GeV, corresponding to an integrated luminosity of 2.93~fb$^{-1}$. Here, $D$ denotes a neutral charm meson in a… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

  6. arXiv:2408.15915  [pdf, other

    cs.CV cs.AI cs.CL

    Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models

    Authors: Yuncheng Yang, Yulei Qin, Tong Wu, Zihan Xu, Gang Li, Pengcheng Guo, Hang Shao, Yucheng Shi, Ke Li, Xing Sun, Jie Yang, Yun Gu

    Abstract: The cultivation of expertise for large language models (LLMs) to solve tasks of specific areas often requires special-purpose tuning with calibrated behaviors on the expected stable outputs. To avoid huge cost brought by manual preparation of instruction datasets and training resources up to hundreds of hours, the exploitation of open knowledge including a wealth of low rank adaptation (LoRA) mode… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: 28 pages, 12 tables, 10 figures

  7. arXiv:2408.15738  [pdf

    physics.optics

    Narrow Linewidth Distributed Feedback Lasers Utilizing Distributed Phase Shift

    Authors: Yiming Sun, Bocheng Yuan, Xiao Sun, Simeng Zhu, Yizhe Fan, Mohanad Al-Rubaiee, John H. Marsh, Stephen J. Sweeney, Lianping Hou

    Abstract: This study proposes and experimentally demonstrates a distributed feedback (DFB) laser with a distributed phase shift (DPS) region at the center of the DFB cavity. By modeling the field intensity distribution in the cavity and the output spectrum, the DPS region length and phase shift values have been optimized. Experimental comparisons with lasers using traditional π-phase shifts confirm that DFB… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: 5 pages, 5 figures

    MSC Class: 78

  8. arXiv:2408.15664  [pdf, other

    cs.LG cs.CL

    Auxiliary-Loss-Free Load Balancing Strategy for Mixture-of-Experts

    Authors: Lean Wang, Huazuo Gao, Chenggang Zhao, Xu Sun, Damai Dai

    Abstract: For Mixture-of-Experts (MoE) models, an unbalanced expert load will lead to routing collapse or increased computational overhead. Existing methods commonly employ an auxiliary loss to encourage load balance, but a large auxiliary loss will introduce non-negligible interference gradients into training and thus impair the model performance. In order to control load balance while not producing undesi… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

  9. arXiv:2408.15511  [pdf, other

    cs.RO cs.AI

    AeroVerse: UAV-Agent Benchmark Suite for Simulating, Pre-training, Finetuning, and Evaluating Aerospace Embodied World Models

    Authors: Fanglong Yao, Yuanchang Yue, Youzhi Liu, Xian Sun, Kun Fu

    Abstract: Aerospace embodied intelligence aims to empower unmanned aerial vehicles (UAVs) and other aerospace platforms to achieve autonomous perception, cognition, and action, as well as egocentric active interaction with humans and the environment. The aerospace embodied world model serves as an effective means to realize the autonomous intelligence of UAVs and represents a necessary pathway toward aerosp… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

  10. Lepton flavour violation Signals of the singly charged scalar singlet at the ILC

    Authors: Chong-Xing Yue, Xiao-Chen Sun, Na-Qian Zhang, Yang-Yang Bu

    Abstract: The singly charged $SU(2)_L$ singlet scalar is one of the very interesting new particles, as it can generate neutrino masses at loop level, produce contributions to various flavour observables. We study the possibility of detecting this kind of scalar predicted by the singly-charged scalar model at ILC via the lepton flavour violation (LFV) process… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

    Journal ref: J. Phys. G: Nucl. Part. Phys. 51 (2024) 085004

  11. arXiv:2408.14803  [pdf, other

    math.NA

    Spherical quasi-interpolation using scaled zonal kernels

    Authors: Zhengjie Sun, Wenwu Gao, Xingping Sun

    Abstract: We propose and study a new quasi-interpolation method on spheres featuring the following two-phase construction and analysis. In Phase I, we analyze and characterize a large family of zonal kernels (e.g., the spherical version of Poisson kernel, Gaussian, compactly-supported radial kernels), so that the underlying spherical convolution operators (upon the introduction of a scaling parameter) attai… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

    MSC Class: 43A90; 41A25; 41A55; 65D12; 65D32

  12. arXiv:2408.14180  [pdf, other

    cs.CV cs.AI

    I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing

    Authors: Yiwei Ma, Jiayi Ji, Ke Ye, Weihuang Lin, Zhibin Wang, Yonghan Zheng, Qiang Zhou, Xiaoshuai Sun, Rongrong Ji

    Abstract: Significant progress has been made in the field of Instruction-based Image Editing (IIE). However, evaluating these models poses a significant challenge. A crucial requirement in this field is the establishment of a comprehensive evaluation benchmark for accurately assessing editing results and providing valuable insights for its further development. In response to this need, we propose I2EBench,… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: Tech report, 39 pages, 41 figures

  13. arXiv:2408.14158  [pdf, other

    cs.DC cs.AI

    Fire-Flyer AI-HPC: A Cost-Effective Software-Hardware Co-Design for Deep Learning

    Authors: Wei An, Xiao Bi, Guanting Chen, Shanhuang Chen, Chengqi Deng, Honghui Ding, Kai Dong, Qiushi Du, Wenjun Gao, Kang Guan, Jianzhong Guo, Yongqiang Guo, Zhe Fu, Ying He, Panpan Huang, Jiashi Li, Wenfeng Liang, Xiaodong Liu, Xin Liu, Yiyuan Liu, Yuxuan Liu, Shanghao Lu, Xuan Lu, Xiaotao Nie, Tian Pei , et al. (27 additional authors not shown)

    Abstract: The rapid progress in Deep Learning (DL) and Large Language Models (LLMs) has exponentially increased demands of computational power and bandwidth. This, combined with the high costs of faster computing chips and interconnects, has significantly inflated High Performance Computing (HPC) construction costs. To address these challenges, we introduce the Fire-Flyer AI-HPC architecture, a synergistic… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: This is the preprint version of the paper accepted for presentation at the 2024 International Conference for High Performance Computing, Networking, Storage, and Analysis (SC'24). \c{opyright} 2024 IEEE. Personal use of this material is permitted. For other uses, permission from IEEE must be obtained. Please refer to IEEE Xplore for the final published version

  14. arXiv:2408.13733  [pdf, other

    eess.IV cs.CV

    Anatomical Consistency Distillation and Inconsistency Synthesis for Brain Tumor Segmentation with Missing Modalities

    Authors: Zheyu Zhang, Xinzhao Liu, Zheng Chen, Yueyi Zhang, Huanjing Yue, Yunwei Ou, Xiaoyan Sun

    Abstract: Multi-modal Magnetic Resonance Imaging (MRI) is imperative for accurate brain tumor segmentation, offering indispensable complementary information. Nonetheless, the absence of modalities poses significant challenges in achieving precise segmentation. Recognizing the shared anatomical structures between mono-modal and multi-modal representations, it is noteworthy that mono-modal images typically ex… ▽ More

    Submitted 25 August, 2024; originally announced August 2024.

    Comments: Accepted Paper to European Conference on Artificial Intelligence (ECAI 2024)

  15. arXiv:2408.12497  [pdf

    physics.optics

    Long-Propagating Ghost Phonon Polaritons Enabled by Selective Mode Excitation

    Authors: Manuka P. Suriyage, Qingyi Zhou, Hao Qin, Xueqian Sun, Zhuoyuan Lu, Stefan A. Maier, Zongfu Yu, Yuerui Lu

    Abstract: The precise control of phonon polaritons(PhPs) is essential for advancements in nanophotonic applications like on-chip optical communication and quantum information processing. Ghost hyperbolic phonon polaritons (g-HPs), which have been recently discovered, feature in-plane hyperbolic dispersion and oblique wavefronts, enabling long-range propagation. Despite their potential, controlling the direc… ▽ More

    Submitted 25 August, 2024; v1 submitted 22 August, 2024; originally announced August 2024.

  16. arXiv:2408.11795  [pdf, other

    cs.CV

    EE-MLLM: A Data-Efficient and Compute-Efficient Multimodal Large Language Model

    Authors: Feipeng Ma, Yizhou Zhou, Hebei Li, Zilong He, Siying Wu, Fengyun Rao, Yueyi Zhang, Xiaoyan Sun

    Abstract: In the realm of multimodal research, numerous studies leverage substantial image-text pairs to conduct modal alignment learning, transforming Large Language Models (LLMs) into Multimodal LLMs and excelling in a variety of visual-language tasks. The prevailing methodologies primarily fall into two categories: self-attention-based and cross-attention-based methods. While self-attention-based methods… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

  17. arXiv:2408.11696  [pdf, other

    quant-ph

    M2CS: A Microwave Measurement and Control System for Large-scale Superconducting Quantum Processors

    Authors: Jiawei Zhang, Xuandong Sun, Zechen Guo, Yuefeng Yuan, Yubin Zhang, Ji Chu, Wenhui Huang, Yongqi Liang, Jiawei Qiu, Daxiong Sun, Ziyu Tao, Jiajian Zhang, Weijie Guo, Ji Jiang, Xiayu Linpeng, Yang Liu, Wenhui Ren, Jingjing Niu, Youpeng Zhong, Dapeng Yu

    Abstract: As superconducting quantum computing continues to advance at an unprecedented pace, there is a compelling demand for the innovation of specialized electronic instruments that act as crucial conduits between quantum processors and host computers. Here, we introduce a Microwave Measurement and Control System (M2CS) dedicated for large-scale superconducting quantum processors. M2CS features a compact… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

  18. arXiv:2408.11671  [pdf, other

    quant-ph

    In situ mixer calibration for superconducting quantum circuits

    Authors: Nan Wu, Jing Lin, Changrong Xie, Zechen Guo, Wenhui Huang, Libo Zhang, Yuxuan Zhou, Xuandong Sun, Jiawei Zhang, Weijie Guo, Xiayu Linpeng, Song Liu, Yang Liu, Wenhui Ren, Ziyu Tao, Ji Jiang, Ji Chu, Jingjing Niu, Youpeng Zhong, Dapeng Yu

    Abstract: Mixers play a crucial role in superconducting quantum computing, primarily by facilitating frequency conversion of signals to enable precise control and readout of quantum states. However, imperfections, particularly carrier leakage and unwanted sideband signal, can significantly compromise control fidelity. To mitigate these defects, regular and precise mixer calibrations are indispensable, yet t… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: 9 pages, 7 figures

  19. arXiv:2408.11182  [pdf, other

    cs.CR cs.AI

    Hide Your Malicious Goal Into Benign Narratives: Jailbreak Large Language Models through Neural Carrier Articles

    Authors: Zhilong Wang, Haizhou Wang, Nanqing Luo, Lan Zhang, Xiaoyan Sun, Yebo Cao, Peng Liu

    Abstract: Jailbreak attacks on Language Model Models (LLMs) entail crafting prompts aimed at exploiting the models to generate malicious content. This paper proposes a new type of jailbreak attacks which shift the attention of the LLM by inserting a prohibited query into a carrier article. The proposed attack leverage the knowledge graph and a composer LLM to automatically generating a carrier article that… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  20. arXiv:2408.10681  [pdf, other

    cs.CL cs.LG

    HMoE: Heterogeneous Mixture of Experts for Language Modeling

    Authors: An Wang, Xingwu Sun, Ruobing Xie, Shuaipeng Li, Jiaqi Zhu, Zhen Yang, Pinxue Zhao, J. N. Han, Zhanhui Kang, Di Wang, Naoaki Okazaki, Cheng-zhong Xu

    Abstract: Mixture of Experts (MoE) offers remarkable performance and computational efficiency by selectively activating subsets of model parameters. Traditionally, MoE models use homogeneous experts, each with identical capacity. However, varying complexity in input data necessitates experts with diverse capabilities, while homogeneous MoE hinders effective expert specialization and efficient parameter util… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  21. arXiv:2408.10505  [pdf, other

    quant-ph

    Quantum-Trajectory-Inspired Lindbladian Simulation

    Authors: Sirui Peng, Xiaoming Sun, Qi Zhao, Hongyi Zhou

    Abstract: Simulating the dynamics of open quantum systems is a crucial task in quantum computing, offering wide-ranging applications but remaining computationally challenging. In this paper, we propose two quantum algorithms for simulating the dynamics of open quantum systems governed by Lindbladians. We introduce a new approximation channel for short-time evolution, inspired by the quantum trajectory metho… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: 14 pages, 16 figures

  22. arXiv:2408.10378  [pdf, other

    math.OC eess.SY

    Finite-time input-to-state stability for infinite-dimensional systems

    Authors: Xiaorong Sun, Jun Zheng, Guchuan Zhu

    Abstract: In this paper, we extend the notion of finite-time input-to-state stability (FTISS) for finite-dimensional systems to infinite-dimensional systems. More specifically, we first prove an FTISS Lyapunov theorem for a class of infinite-dimensional systems, namely, the existence of an FTISS Lyapunov functional (FTISS-LF) implies the FTISS of the system, and then, provide a sufficient condition for ensu… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

  23. arXiv:2408.09859  [pdf, other

    cs.CV

    OccMamba: Semantic Occupancy Prediction with State Space Models

    Authors: Heng Li, Yuenan Hou, Xiaohan Xing, Xiao Sun, Yanyong Zhang

    Abstract: Training deep learning models for semantic occupancy prediction is challenging due to factors such as a large number of occupancy cells, severe occlusion, limited visual cues, complicated driving scenarios, etc. Recent methods often adopt transformer-based architectures given their strong capability in learning input-conditioned weights and long-range relationships. However, transformer-based netw… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: 9 pages, 4 figures

  24. arXiv:2408.09739  [pdf, other

    cs.CV

    TraDiffusion: Trajectory-Based Training-Free Image Generation

    Authors: Mingrui Wu, Oucheng Huang, Jiayi Ji, Jiale Li, Xinyue Cai, Huafeng Kuang, Jianzhuang Liu, Xiaoshuai Sun, Rongrong Ji

    Abstract: In this work, we propose a training-free, trajectory-based controllable T2I approach, termed TraDiffusion. This novel method allows users to effortlessly guide image generation via mouse trajectories. To achieve precise control, we design a distance awareness energy function to effectively guide latent variables, ensuring that the focus of generation is within the areas defined by the trajectory.… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: The code: https://github.com/och-mac/TraDiffusion

  25. arXiv:2408.09126  [pdf, other

    cs.CV

    Barbie: Text to Barbie-Style 3D Avatars

    Authors: Xiaokun Sun, Zhenyu Zhang, Ying Tai, Qian Wang, Hao Tang, Zili Yi, Jian Yang

    Abstract: Recent advances in text-guided 3D avatar generation have made substantial progress by distilling knowledge from diffusion models. Despite the plausible generated appearance, existing methods cannot achieve fine-grained disentanglement or high-fidelity modeling between inner body and outfit. In this paper, we propose Barbie, a novel framework for generating 3D avatars that can be dressed in diverse… ▽ More

    Submitted 27 August, 2024; v1 submitted 17 August, 2024; originally announced August 2024.

    Comments: 9 pages, 7 figures

  26. arXiv:2408.08930  [pdf, other

    cs.CR cs.AI cs.CL

    DePrompt: Desensitization and Evaluation of Personal Identifiable Information in Large Language Model Prompts

    Authors: Xiongtao Sun, Gan Liu, Zhipeng He, Hui Li, Xiaoguang Li

    Abstract: Prompt serves as a crucial link in interacting with large language models (LLMs), widely impacting the accuracy and interpretability of model outputs. However, acquiring accurate and high-quality responses necessitates precise prompts, which inevitably pose significant risks of personal identifiable information (PII) leakage. Therefore, this paper proposes DePrompt, a desensitization protection an… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

  27. arXiv:2408.08826  [pdf, other

    hep-ex

    Search for the rare decay $J/ψ\to γD^0+c.c.$ at BESIII

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (642 additional authors not shown)

    Abstract: Using $(10087\pm44)\times10^6J/ψ$ events collected with the BESIII detector, we search for the rare decay $J/ψ\to γD^0+c.c.$ for the first time. No obvious signal is observed and the upper limit on the branching fraction is determined to be ${\cal B}(J/ψ\to γD^{0}+c.c.)< 9.1 \times 10^{-8}$ at 90\% confidence level.

    Submitted 16 August, 2024; originally announced August 2024.

  28. arXiv:2408.08669  [pdf, other

    cs.SD eess.AS

    HSDreport: Heart Sound Diagnosis with Echocardiography Reports

    Authors: Zihan Zhao, Pingjie Wang, Liudan Zhao, Yuchen Yang, Ya Zhang, Kun Sun, Xin Sun, Xin Zhou, Yu Wang, Yanfeng Wang

    Abstract: Heart sound auscultation holds significant importance in the diagnosis of congenital heart disease. However, existing methods for Heart Sound Diagnosis (HSD) tasks are predominantly limited to a few fixed categories, framing the HSD task as a rigid classification problem that does not fully align with medical practice and offers only limited information to physicians. Besides, such methods do not… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

  29. arXiv:2408.08439  [pdf, other

    cs.DS

    Algebraic Vertex Ordering of a Sparse Graph for Adjacency Access Locality and Graph Compression

    Authors: Dimitris Floros, Nikos Pitsianis, Xiaobai Sun

    Abstract: In this work, we establish theoretical and practical connections between vertex indexing for sparse graph/network compression and matrix ordering for sparse matrix-vector multiplication and variable elimination. We present a fundamental analysis of adjacency access locality in vertex ordering from the perspective of graph composition of, or decomposition into, elementary compact graphs. We introdu… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: 7 pages, 5 figures, 1 table

  30. arXiv:2408.07605  [pdf, other

    cs.CV

    Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving

    Authors: Yuqing Wen, Yucheng Zhao, Yingfei Liu, Binyuan Huang, Fan Jia, Yanhui Wang, Chi Zhang, Tiancai Wang, Xiaoyan Sun, Xiangyu Zhang

    Abstract: The field of autonomous driving increasingly demands high-quality annotated video training data. In this paper, we propose Panacea+, a powerful and universally applicable framework for generating video data in driving scenes. Built upon the foundation of our previous work, Panacea, Panacea+ adopts a multi-view appearance noise prior mechanism and a super-resolution module for enhanced consistency… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: Project page: https://panacea-ad.github.io/. arXiv admin note: text overlap with arXiv:2311.16813

  31. arXiv:2408.07339  [pdf

    cond-mat.mes-hall

    Bilayer TeO2: The First Oxide Semiconductor with Symmetric Sub-5-nm NMOS and PMOS

    Authors: Linqiang Xu, Liya Zhao, Chit Siong Lau, Pan Zhang, Lianqiang Xu, Qiuhui Li, Shibo Fang, Yee Sin Ang, Xiaotian Sun, Jing Lu

    Abstract: Wide bandgap oxide semiconductors are very promising channel candidates for next-generation electronics due to their large-area manufacturing, high-quality dielectrics, low contact resistance, and low leakage current. However, the absence of ultra-short gate length (Lg) p-type transistors has restricted their application in future complementary metal-oxide-semiconductor (CMOS) integration. Inspire… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

  32. arXiv:2408.06884  [pdf, ps, other

    math.OC

    Tikhonov regularization of second-order plus first-order primal-dual dynamical systems for separable convex optimization

    Authors: Xiangkai Sun, Lijuan Zheng, Kok Lay Teo

    Abstract: This paper deals with a Tikhonov regularized second-order plus first-order primal-dual dynamical system with time scaling for separable convex optimization problems with linear equality constraints. This system consists of two second-order ordinary differential equations for the primal variables and one first-order ordinary differential equation for the dual variable.By utilizing the Lyapunov anal… ▽ More

    Submitted 13 August, 2024; originally announced August 2024.

    MSC Class: 90C25; 37N40; 34D05

  33. arXiv:2408.06677  [pdf, other

    hep-ex hep-ph

    Search for $η_c(2S)\toωω$ and $ωφ$ decays and measurements of $χ_{cJ}\toωω$ and $ωφ$ in $ψ(2S)$ radiative processes

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (643 additional authors not shown)

    Abstract: Using $(2712\pm 14)$ $\times$ 10$^{6}$ $ψ(2S)$ events collected with the BESIII detector at the BEPCII collider, we search for the decays $η_{c}(2S)\toωω$ and $η_{c}(2S)\toωφ$ via the process $ψ(2S)\toγη_{c}(2S)$. Evidence of $η_{c}(2S)\toωω$ is found with a statistical significance of $3.2σ$. The branching fraction is measured to be… ▽ More

    Submitted 13 August, 2024; originally announced August 2024.

  34. arXiv:2408.06527  [pdf, other

    cs.CL cs.AI

    Chain-of-Strategy Planning with LLMs: Aligning the Generation of Psychotherapy Dialogue with Strategy in Motivational Interviewing

    Authors: Xin Sun, Xiao Tang, Abdallah El Ali, Zhuying Li, Xiaoyu Shen, Pengjie Ren, Jan de Wit, Jiahuan Pei, Jos A. Bosch

    Abstract: Recent advancements in large language models (LLMs) have shown promise in generating psychotherapeutic dialogues, especially in Motivational Interviewing (MI). However, how to employ strategies, a set of motivational interviewing (MI) skills, to generate therapeutic-adherent conversations with explainability is underexplored. We propose an approach called strategy-aware dialogue generation with Ch… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

  35. arXiv:2408.06327  [pdf, other

    cs.AI cs.CL cs.CV

    VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents

    Authors: Xiao Liu, Tianjie Zhang, Yu Gu, Iat Long Iong, Yifan Xu, Xixuan Song, Shudan Zhang, Hanyu Lai, Xinyi Liu, Hanlin Zhao, Jiadai Sun, Xinyue Yang, Yu Yang, Zehan Qi, Shuntian Yao, Xueqiao Sun, Siyi Cheng, Qinkai Zheng, Hao Yu, Hanchen Zhang, Wenyi Hong, Ming Ding, Lihang Pan, Xiaotao Gu, Aohan Zeng , et al. (5 additional authors not shown)

    Abstract: Large Multimodal Models (LMMs) have ushered in a new era in artificial intelligence, merging capabilities in both language and vision to form highly capable Visual Foundation Agents. These agents are postulated to excel across a myriad of tasks, potentially approaching general artificial intelligence. However, existing benchmarks fail to sufficiently challenge or showcase the full potential of LMM… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

  36. arXiv:2408.06152  [pdf, other

    cs.MM cs.AI cs.CV cs.NI

    Palantir: Towards Efficient Super Resolution for Ultra-high-definition Live Streaming

    Authors: Xinqi Jin, Zhui Zhu, Xikai Sun, Fan Dang, Jiangchuan Liu, Jingao Xu, Kebin Liu, Xinlei Chen, Yunhao Liu

    Abstract: Neural enhancement through super-resolution deep neural networks opens up new possibilities for ultra-high-definition live streaming over existing encoding and networking infrastructure. Yet, the heavy SR DNN inference overhead leads to severe deployment challenges. To reduce the overhead, existing systems propose to apply DNN-based SR only on selected anchor frames while upscaling non-anchor fram… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

  37. arXiv:2408.05920  [pdf, other

    cs.AI cs.LG

    Urban Region Pre-training and Prompting: A Graph-based Approach

    Authors: Jiahui Jin, Yifan Song, Dong Kan, Haojia Zhu, Xiangguo Sun, Zhicheng Li, Xigang Sun, Jinghui Zhang

    Abstract: Urban region representation is crucial for various urban downstream tasks. However, despite the proliferation of methods and their success, acquiring general urban region knowledge and adapting to different tasks remains challenging. Previous work often neglects the spatial structures and functional layouts between entities, limiting their ability to capture transferable knowledge across regions.… ▽ More

    Submitted 26 August, 2024; v1 submitted 12 August, 2024; originally announced August 2024.

  38. arXiv:2408.05669  [pdf, other

    cs.CV cs.AI

    StealthDiffusion: Towards Evading Diffusion Forensic Detection through Diffusion Model

    Authors: Ziyin Zhou, Ke Sun, Zhongxi Chen, Huafeng Kuang, Xiaoshuai Sun, Rongrong Ji

    Abstract: The rapid progress in generative models has given rise to the critical task of AI-Generated Content Stealth (AIGC-S), which aims to create AI-generated images that can evade both forensic detectors and human inspection. This task is crucial for understanding the vulnerabilities of existing detection methods and developing more robust techniques. However, current adversarial attacks often introduce… ▽ More

    Submitted 10 August, 2024; originally announced August 2024.

  39. arXiv:2408.05472  [pdf, other

    cs.LG physics.ao-ph

    FuXi Weather: An end-to-end machine learning weather data assimilation and forecasting system

    Authors: Xiuyu Sun, Xiaohui Zhong, Xiaoze Xu, Yuanqing Huang, Hao Li, Jie Feng, Wei Han, Libo Wu, Yuan Qi

    Abstract: Operational numerical weather prediction systems consist of three fundamental components: the global observing system for data collection, data assimilation for generating initial conditions, and the forecasting model to predict future weather conditions. While NWP have undergone a quiet revolution, with forecast skills progressively improving over the past few decades, their advancement has slowe… ▽ More

    Submitted 10 August, 2024; originally announced August 2024.

    Comments: 34 pages, 4 figures

  40. arXiv:2408.05211  [pdf, other

    cs.CV cs.AI cs.CL

    VITA: Towards Open-Source Interactive Omni Multimodal LLM

    Authors: Chaoyou Fu, Haojia Lin, Zuwei Long, Yunhang Shen, Meng Zhao, Yifan Zhang, Xiong Wang, Di Yin, Long Ma, Xiawu Zheng, Ran He, Rongrong Ji, Yunsheng Wu, Caifeng Shan, Xing Sun

    Abstract: The remarkable multimodal capabilities and interactive experience of GPT-4o underscore their necessity in practical applications, yet open-source models rarely excel in both areas. In this paper, we introduce VITA, the first-ever open-source Multimodal Large Language Model (MLLM) adept at simultaneous processing and analysis of Video, Image, Text, and Audio modalities, and meanwhile has an advance… ▽ More

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: Project Page: https://vita-home.github.io

  41. arXiv:2408.04686  [pdf, other

    cs.CL cs.AI

    Multi-Turn Context Jailbreak Attack on Large Language Models From First Principles

    Authors: Xiongtao Sun, Deyue Zhang, Dongdong Yang, Quanchen Zou, Hui Li

    Abstract: Large language models (LLMs) have significantly enhanced the performance of numerous applications, from intelligent conversations to text generation. However, their inherent security vulnerabilities have become an increasingly significant challenge, especially with respect to jailbreak attacks. Attackers can circumvent the security mechanisms of these LLMs, breaching security constraints and causi… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

  42. arXiv:2408.04422  [pdf, other

    hep-ex

    Analysis of the dynamics of the decay $D^{+}\to K_{S}^{0} π^{0} e^{+}ν_{e}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (644 additional authors not shown)

    Abstract: The branching fraction of $D^+\to K_{S}^{0} π^{0}e^+ν_e$ is measured for the first time using $7.93~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at the center-of-mass energy $\sqrt{s}=3.773$~GeV with the BESIII detector operating at the BEPCII collider, and is determined to be ${\mathcal B}$($D^+\to K_S^0π^0e^+ν_e$) = $(0.881~\pm~0.017_{\rm stat.}~\pm~0.016_{\rm syst.})$\%. Based on a… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

  43. arXiv:2408.04400  [pdf, other

    cs.LG cs.AI

    DIVE: Subgraph Disagreement for Graph Out-of-Distribution Generalization

    Authors: Xin Sun, Liang Wang, Qiang Liu, Shu Wu, Zilei Wang, Liang Wang

    Abstract: This paper addresses the challenge of out-of-distribution (OOD) generalization in graph machine learning, a field rapidly advancing yet grappling with the discrepancy between source and target data distributions. Traditional graph learning algorithms, based on the assumption of uniform distribution between training and test data, falter in real-world scenarios where this assumption fails, resultin… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

  44. arXiv:2408.03822  [pdf, other

    cs.CV

    Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields

    Authors: Joo Chan Lee, Daniel Rho, Xiangyu Sun, Jong Hwan Ko, Eunbyung Park

    Abstract: 3D Gaussian splatting (3DGS) has recently emerged as an alternative representation that leverages a 3D Gaussian-based representation and introduces an approximated volumetric rendering, achieving very fast rendering speed and promising image quality. Furthermore, subsequent studies have successfully extended 3DGS to dynamic 3D scenes, demonstrating its wide range of applications. However, a signif… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: Project page: https://maincold2.github.io/c3dgs/

  45. arXiv:2408.03531  [pdf, other

    hep-ex

    Measurement of the Branching Fraction of \boldmath{$ψ(2S) \to γπ^0$}

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (644 additional authors not shown)

    Abstract: Based on $(2712.4\pm14.1)\times10^{6}~ψ(2S)$ events, 7.9 fb$^{-1}$ $ψ(3773)$ data, and 0.8 fb$^{-1}$ off-resonance data samples collected with the BESIII detector, we measure the branching fraction of $ψ(2S)\rightarrowγπ^{0}$ and $e^{+}e^{-}\rightarrowγπ^{0}$ form factor at momentum transfers $Q^{2}\sim13$ GeV$^{2}$. The $e^{+}e^{-}\rightarrowγπ^{0}$ cross section is fitted with considering the in… ▽ More

    Submitted 7 August, 2024; v1 submitted 7 August, 2024; originally announced August 2024.

  46. arXiv:2408.03499  [pdf, other

    cs.CV

    FacialPulse: An Efficient RNN-based Depression Detection via Temporal Facial Landmarks

    Authors: Ruiqi Wang, Jinyang Huang, Jie Zhang, Xin Liu, Xiang Zhang, Zhi Liu, Peng Zhao, Sigui Chen, Xiao Sun

    Abstract: Depression is a prevalent mental health disorder that significantly impacts individuals' lives and well-being. Early detection and intervention are crucial for effective treatment and management of depression. Recently, there are many end-to-end deep learning methods leveraging the facial expression features for automatic depression detection. However, most current methods overlook the temporal dy… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

  47. arXiv:2408.03238  [pdf, other

    cs.RO cs.CV

    LAC-Net: Linear-Fusion Attention-Guided Convolutional Network for Accurate Robotic Grasping Under the Occlusion

    Authors: Jinyu Zhang, Yongchong Gu, Jianxiong Gao, Haitao Lin, Qiang Sun, Xinwei Sun, Xiangyang Xue, Yanwei Fu

    Abstract: This paper addresses the challenge of perceiving complete object shapes through visual perception. While prior studies have demonstrated encouraging outcomes in segmenting the visible parts of objects within a scene, amodal segmentation, in particular, has the potential to allow robots to infer the occluded parts of objects. To this end, this paper introduces a new framework that explores amodal s… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

    Comments: accepted by IROS2024

  48. arXiv:2408.03205  [pdf, other

    hep-ex

    Measurement of $Σ^+$ transverse polarization in $e^+e^-$ collisions at $\sqrt{s} = 3.68-3.71$ GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: Using $e^+e^-$ collision data collected with the BESIII detector at seven energy points ranging from 3.68 to 3.71 GeV and corresponding to an integrated luminosity of $652.1~{\rm pb^{-1}}$, we present an energy-dependent measurement of the transverse polarization, relative phase and modulus ratio of the electromagnetic form factors of the $Σ^+$ hyperon in the $e^+e^- \to Σ^+ \barΣ^-$ reaction. The… ▽ More

    Submitted 7 August, 2024; v1 submitted 6 August, 2024; originally announced August 2024.

    Comments: 21 pages, 2 tables, 5 figures

  49. arXiv:2408.02984  [pdf

    cond-mat.mes-hall physics.app-ph

    Direct measurement of topological invariants through temporal adiabatic evolution of bulk states in the synthetic Brillouin zone

    Authors: Zhao-Xian Chen, Yuan-hong Zhang, Xiao-Chen Sun, Ruo-Yang Zhang, Jiang-Shan Tang, Xin Yang, Xue-Feng Zhu, Yan-Qing Lu

    Abstract: Mathematically, topological invariants arise from the parallel transport of eigenstates on the energy bands, which, in physics, correspond to the adiabatic dynamical evolution of transient states. It determines the presence of boundary states, while lacking direct measurements. Here, we develop time-varying programmable coupling circuits between acoustic cavities to mimic the Hamiltonians in the B… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

    Comments: 16 pages, 4 figures

  50. arXiv:2408.02976  [pdf, ps, other

    cs.CL cs.AI

    Empathy Level Alignment via Reinforcement Learning for Empathetic Response Generation

    Authors: Hui Ma, Bo Zhang, Bo Xu, Jian Wang, Hongfei Lin, Xiao Sun

    Abstract: Empathetic response generation, aiming at understanding the user's situation and feelings and respond empathically, is crucial in building human-like dialogue systems. Previous methods mainly focus on using maximum likelihood estimation as the optimization objective for training response generation models, without taking into account the empathy level alignment between generated responses and targ… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.