Showing 201–250 of 1,882 results for author: Dong, Y

  1. arXiv:2404.00599  [pdf, other]

    cs.CL cs.AI cs.SE

    EvoCodeBench: An Evolving Code Generation Benchmark Aligned with Real-World Code Repositories

    Authors: Jia Li, Ge Li, Xuanming Zhang, Yihong Dong, Zhi Jin

    Abstract: How to evaluate Large Language Models (LLMs) in code generation is an open question. Existing benchmarks demonstrate poor alignment with real-world code repositories and are insufficient to evaluate the coding abilities of LLMs. This paper proposes a new benchmark, EvoCodeBench, to address these problems; it has three primary advances. (1) EvoCodeBench aligns with real-world repositorie…

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: Data: https://github.com/seketeam/EvoCodeBench

  2. arXiv:2404.00540  [pdf, other]

    cs.CV cs.AI

    Embodied Active Defense: Leveraging Recurrent Feedback to Counter Adversarial Patches

    Authors: Lingxuan Wu, Xiao Yang, Yinpeng Dong, Liuwei Xie, Hang Su, Jun Zhu

    Abstract: The vulnerability of deep neural networks to adversarial patches has motivated numerous defense strategies for boosting model robustness. However, the prevailing defenses depend on a single observation or pre-established adversary information to counter adversarial patches, often failing when confronted with unseen or adaptive adversarial attacks and easily exhibiting unsatisfying performance in dy…

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 27 pages

  3. arXiv:2404.00385  [pdf, other]

    cs.CV cs.AI cs.LG

    Constrained Layout Generation with Factor Graphs

    Authors: Mohammed Haroon Dupty, Yanfei Dong, Sicong Leng, Guoji Fu, Yong Liang Goh, Wei Lu, Wee Sun Lee

    Abstract: This paper addresses the challenge of object-centric layout generation under spatial constraints, which arises in multiple domains, including floorplan design. The design process typically involves specifying a set of spatial constraints that include object attributes like size and inter-object relations such as relative positioning. Existing works, which typically represent objects as single nodes…

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: To be published at IEEE/CVF CVPR 2024

  4. arXiv:2403.19943  [pdf, other]

    cs.LG cs.AI eess.SP

    TDANet: A Novel Temporal Denoise Convolutional Neural Network With Attention for Fault Diagnosis

    Authors: Zhongzhi Li, Rong Fan, Jingqi Tu, Jinyi Ma, Jianliang Ai, Yiqun Dong

    Abstract: Fault diagnosis plays a crucial role in maintaining the operational integrity of mechanical systems, preventing significant losses due to unexpected failures. As intelligent manufacturing and data-driven approaches evolve, Deep Learning (DL) has emerged as a pivotal technique in fault diagnosis research, recognized for its ability to autonomously extract complex features. However, the practical ap…

    Submitted 28 March, 2024; originally announced March 2024.

  5. arXiv:2403.19256  [pdf, other]

    hep-ex

    Measurement of absolute branching fractions of $D_s^+$ hadronic decays

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H.-R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (632 additional authors not shown)

    Abstract: Using $e^+ e^-$ collision data collected at the BESIII detector at center-of-mass energies between 4.128 and 4.226 GeV, corresponding to an integrated luminosity of $7.33~{\rm fb}^{-1}$, we determine the absolute branching fractions of fifteen hadronic $D_s^{+}$ decays with a double-tag technique. In particular, we make precise measurements of the branching fractions…

    Submitted 30 May, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  6. arXiv:2403.19091  [pdf, other]

    hep-ex

    Observation of the semileptonic decays $D^0\rightarrow K_S^0π^-π^0 e^+ ν_e$ and $D^+\rightarrow K_S^0π^+π^- e^+ ν_e$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, et al. (600 additional authors not shown)

    Abstract: By analyzing $e^+e^-$ annihilation data corresponding to an integrated luminosity of 2.93 $\rm fb^{-1}$ collected at a center-of-mass energy of 3.773 GeV with the BESIII detector, the first observation of the semileptonic decays $D^0\rightarrow K_S^0π^-π^0 e^+ ν_e$ and $D^+\rightarrow K_S^0π^+π^- e^+ ν_e$ is reported. With a dominant hadronic contribution from $K_1(1270)$, the branching fra…

    Submitted 27 March, 2024; originally announced March 2024.

    Comments: 19 pages

  7. arXiv:2403.18167  [pdf, other]

    cs.CL cs.AI

    Mechanistic Understanding and Mitigation of Language Model Non-Factual Hallucinations

    Authors: Lei Yu, Meng Cao, Jackie Chi Kit Cheung, Yue Dong

    Abstract: State-of-the-art language models (LMs) sometimes generate non-factual hallucinations that misalign with world knowledge. To explore the mechanistic causes of these hallucinations, we create diagnostic datasets with subject-relation queries and adapt interpretability methods to trace hallucinations through internal model representations. We discover two general and distinct mechanistic causes of ha…

    Submitted 17 June, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  8. arXiv:2403.16811  [pdf, ps, other]

    hep-ex

    Cross section measurement of $e^+e^-\to ηψ(2S)$ and search for $e^+e^-\toη\tilde{X}(3872)$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H.-R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (639 additional authors not shown)

    Abstract: The energy-dependent cross section for $e^+e^-\to ηψ(2S)$ is measured at eighteen center of mass energies from 4.288 GeV to 4.951 GeV using the BESIII detector. Using the same data samples, we also perform the first search for the reaction $e^+e^-\toη\tilde{X}(3872)$, but no evidence is found for the $\tilde{X}(3872)$ in the $π^+π^- J/ψ$ mass distribution. At each of the eighteen center of mass en…

    Submitted 25 March, 2024; originally announced March 2024.

  9. arXiv:2403.16440  [pdf, other]

    cs.CV

    RCBEVDet: Radar-camera Fusion in Bird's Eye View for 3D Object Detection

    Authors: Zhiwei Lin, Zhe Liu, Zhongyu Xia, Xinhao Wang, Yongtao Wang, Shengxiang Qi, Yang Dong, Nan Dong, Le Zhang, Ce Zhu

    Abstract: Three-dimensional object detection is one of the key tasks in autonomous driving. To reduce costs in practice, low-cost multi-view cameras for 3D object detection are proposed to replace the expensive LiDAR sensors. However, it is difficult to achieve highly accurate and robust 3D object detection by relying solely on cameras. An effective solution to this issue is combining multi-view cameras with the…

    Submitted 25 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  10. arXiv:2403.16374  [pdf, other]

    cs.LG cs.CV cs.RO

    ProIn: Learning to Predict Trajectory Based on Progressive Interactions for Autonomous Driving

    Authors: Yinke Dong, Haifeng Yuan, Hongkun Liu, Wei Jing, Fangzhen Li, Hongmin Liu, Bin Fan

    Abstract: Accurate motion prediction of pedestrians, cyclists, and other surrounding vehicles (all called agents) is very important for autonomous driving. Most existing works capture map information through a one-stage interaction with the map via vector-based attention, to provide map constraints for social interaction and multi-modal differentiation. However, these methods have to encode all required map rul…

    Submitted 24 March, 2024; originally announced March 2024.

  11. arXiv:2403.16110  [pdf, other]

    cs.DB

    ByteCard: Enhancing ByteDance's Data Warehouse with Learned Cardinality Estimation

    Authors: Yuxing Han, Haoyu Wang, Lixiang Chen, Yifeng Dong, Xing Chen, Benquan Yu, Chengcheng Yang, Weining Qian

    Abstract: Cardinality estimation is a critical component and a longstanding challenge in modern data warehouses. ByteHouse, ByteDance's cloud-native engine for extensive data analysis in exabyte-scale environments, serves numerous internal decision-making business scenarios. With the increasing demand for ByteHouse, cardinality estimation becomes the bottleneck for efficiently processing queries. Specifical…

    Submitted 11 April, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

  12. arXiv:2403.15952  [pdf, other]

    cs.CV cs.CL

    IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language Models

    Authors: Haz Sameen Shahgir, Khondker Salman Sayeed, Abhik Bhattacharjee, Wasi Uddin Ahmad, Yue Dong, Rifat Shahriyar

    Abstract: The advent of Vision Language Models (VLMs) has allowed researchers to investigate the visual understanding of a neural network using natural language. Beyond object classification and detection, VLMs are capable of visual comprehension and common-sense reasoning. This naturally led to the question: How do VLMs respond when the image itself is inherently unreasonable? To this end, we present Illusi…

    Submitted 9 August, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

  13. arXiv:2403.15796  [pdf, other]

    cs.CL cs.AI cs.LG

    Understanding Emergent Abilities of Language Models from the Loss Perspective

    Authors: Zhengxiao Du, Aohan Zeng, Yuxiao Dong, Jie Tang

    Abstract: Recent studies have put into question the belief that emergent abilities in language models are exclusive to large models. This skepticism arises from two observations: 1) smaller models can also exhibit high performance on emergent abilities and 2) there is doubt about the discontinuous metrics used to measure these abilities. In this paper, we propose to study emergent abilities through the lens of pre-…

    Submitted 30 March, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

    Comments: 18 pages, 6 figures

  14. arXiv:2403.15559  [pdf, other]

    cs.CV cs.AI

    An Optimization Framework to Enforce Multi-View Consistency for Texturing 3D Meshes

    Authors: Zhengyi Zhao, Chen Song, Xiaodong Gu, Yuan Dong, Qi Zuo, Weihao Yuan, Liefeng Bo, Zilong Dong, Qixing Huang

    Abstract: A fundamental problem in the texturing of 3D meshes using pre-trained text-to-image models is to ensure multi-view consistency. State-of-the-art approaches typically use diffusion models to aggregate multi-view inputs, where common issues are the blurriness caused by the averaging operation in the aggregation step or inconsistencies in local features. This paper introduces an optimization framewor…

    Submitted 2 August, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

  15. arXiv:2403.14998  [pdf, other]

    hep-ex

    Precise measurement of the $e^+e^-\to D_s^+D_s^-$ cross sections at center-of-mass energies from threshold to 4.95 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H.-R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (638 additional authors not shown)

    Abstract: Using the $e^+e^-$ collision data collected with the BESIII detector operating at the BEPCII collider, at center-of-mass energies from the threshold to $4.95$~GeV, we present precise measurements of the cross sections for the process $e^+e^-\to D_s^+D_s^-$ using a single tag method. The resulting cross section lineshape exhibits several new structures, thereby offering an input for coupled channel…

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 9 pages, 4 figures, published in PRL

  16. arXiv:2403.14888  [pdf, other]

    cs.CL cs.AI

    AutoRE: Document-Level Relation Extraction with Large Language Models

    Authors: Lilong Xue, Dan Zhang, Yuxiao Dong, Jie Tang

    Abstract: Large Language Models (LLMs) have demonstrated exceptional abilities in comprehending and generating text, motivating numerous researchers to utilize them for Information Extraction (IE) purposes, including Relation Extraction (RE). Nonetheless, most existing methods are predominantly designed for Sentence-level Relation Extraction (SentRE) tasks, which typically encompass a restricted set of rela…

    Submitted 26 July, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: 10 pages, 4 figures

  17. arXiv:2403.13437  [pdf, other]

    hep-ex

    Search for $ΔS=2$ nonleptonic hyperon decays $Ω^-\toΣ^{0}π^{-}$ and $Ω^-\to nK^{-}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H.-R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (638 additional authors not shown)

    Abstract: Using $(27.12 \pm 0.14) \times 10^{8}$ $ψ(3686)$ events collected by the BESIII detector at the center-of-mass energy of $\sqrt{s} = 3.686$ GeV, we search for the first time for two nonleptonic hyperon decays that change strangeness by two units, $Ω^-\toΣ^{0}π^-$ and $Ω^-\to nK^{-}$. No significant signal is observed. The upper limits on their decay branching fractions are determined to be…

    Submitted 14 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  18. arXiv:2403.12966  [pdf, other]

    cs.CV

    Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models

    Authors: Zuyan Liu, Yuhao Dong, Yongming Rao, Jie Zhou, Jiwen Lu

    Abstract: In the realm of vision-language understanding, the proficiency of models in interpreting and reasoning over visual content has become a cornerstone for numerous applications. However, it is challenging for the visual encoder in Large Vision-Language Models (LVLMs) to extract useful features tailored to questions that aid the language model's response. Furthermore, a common practice among existing…

    Submitted 21 March, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: Project Page: https://sites.google.com/view/chain-of-spot/

  19. arXiv:2403.12010  [pdf, other]

    cs.CV cs.AI cs.GR

    VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model

    Authors: Qi Zuo, Xiaodong Gu, Lingteng Qiu, Yuan Dong, Zhengyi Zhao, Weihao Yuan, Rui Peng, Siyu Zhu, Zilong Dong, Liefeng Bo, Qixing Huang

    Abstract: Generating multi-view images based on text or single-image prompts is a critical capability for the creation of 3D content. Two fundamental questions on this topic are what data we use for training and how to ensure multi-view consistency. This paper introduces a novel framework that makes fundamental contributions to both questions. Unlike leveraging images from 2D diffusion models for training,…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Project page: aigc3d.github.io/VideoMV/

  20. arXiv:2403.11483  [pdf, other]

    cs.LG cs.AI cs.SI

    Open-World Semi-Supervised Learning for Node Classification

    Authors: Yanling Wang, Jing Zhang, Lingxi Zhang, Lixin Liu, Yuxiao Dong, Cuiping Li, Hong Chen, Hongzhi Yin

    Abstract: Open-world semi-supervised learning (Open-world SSL) for node classification, which classifies unlabeled nodes into seen classes or multiple novel classes, is a practical but under-explored problem in the graph community. As only seen classes have human labels, they are usually better learned than novel classes, and thus exhibit smaller intra-class variances within the embedding space (named as imb…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted by ICDE 2024

  21. arXiv:2403.10877  [pdf, ps, other]

    hep-ex hep-ph

    Test of lepton universality and measurement of the form factors of $D^0\to K^{*}(892)^-μ^+ν_μ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H.-R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (637 additional authors not shown)

    Abstract: We report a first study of the semileptonic decay $D^0\rightarrow K^-π^0μ^{+}ν_μ$ by analyzing an $e^+e^-$ annihilation data sample of $7.9~\mathrm{fb}^{-1}$ collected at the center-of-mass energy of 3.773 GeV with the BESIII detector. The absolute branching fraction of $D^0\to K^-π^0μ^{+}ν_μ$ is measured for the first time to be $(0.729 \pm 0.014_{\rm stat} \pm 0.011_{\rm syst})\%$. Based on an a…

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 9 pages, 3 figures

  22. arXiv:2403.07988  [pdf, other]

    eess.SY

    Configuration and EMT Simulation of the 240-bus MiniWECC System Integrating Offshore Wind Farms (OWFs)

    Authors: Buxin She, Hisham Mahmood, Marcelo Elizondo, Veronica Adetola, Yuqing Dong

    Abstract: As offshore wind farms (OWFs) become increasingly prevalent in Northern California and Southern Oregon, they introduce faster dynamics into the Western Electricity Coordinating Council (WECC) system, reshaping its dynamic behavior. Accordingly, electromagnetic transient (EMT) simulation is essential to assess high-frequency dynamics of the WECC system with integrated OWFs. Against this background,…

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 5 pages

  23. arXiv:2403.07317  [pdf, other]

    eess.SY

    GMPC: Geometric Model Predictive Control for Wheeled Mobile Robot Trajectory Tracking

    Authors: Jiawei Tang, Shuang Wu, Bo Lan, Yahui Dong, Yuqiang Jin, Guangjian Tian, Wen-An Zhang, Ling Shi

    Abstract: The configuration of most robotic systems lies in continuous transformation groups. However, in mobile robot trajectory tracking, many recent works still naively utilize optimization methods for elements in vector space without considering the manifold constraint of the robot configuration. In this letter, we propose a geometric model predictive control (MPC) framework for wheeled mobile robot tra…

    Submitted 12 March, 2024; originally announced March 2024.

  24. arXiv:2403.06766  [pdf, other]

    hep-ex

    Determination of the number of $ψ(3686)$ events taken at BESIII

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H.-R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (639 additional authors not shown)

    Abstract: The number of $ψ(3686)$ events collected by the BESIII detector during the 2021 run period is determined to be $(2259.3\pm 11.1)\times 10^6$ by counting inclusive $ψ(3686)$ hadronic events. The uncertainty is systematic and the statistical uncertainty is negligible. Meanwhile, the numbers of $ψ(3686)$ events collected during the 2009 and 2012 run periods are updated to be…

    Submitted 28 May, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  25. arXiv:2403.05810  [pdf, other]

    cs.CV cs.AI

    Recurrent Aligned Network for Generalized Pedestrian Trajectory Prediction

    Authors: Yonghao Dong, Le Wang, Sanping Zhou, Gang Hua, Changyin Sun

    Abstract: Pedestrian trajectory prediction is a crucial component in computer vision and robotics, but remains challenging due to the domain shift problem. Previous studies have tried to tackle this problem by leveraging a portion of the trajectory data from the target domain to adapt the model. However, such domain adaptation methods are impractical in real-world scenarios, as it is infeasible to collect t…

    Submitted 9 March, 2024; originally announced March 2024.

  26. arXiv:2403.05156  [pdf, other]

    cs.CR

    On Protecting the Data Privacy of Large Language Models (LLMs): A Survey

    Authors: Biwei Yan, Kun Li, Minghui Xu, Yueyan Dong, Yue Zhang, Zhaochun Ren, Xiuzhen Cheng

    Abstract: Large language models (LLMs) are complex artificial intelligence systems capable of understanding, generating and translating human language. They learn language patterns by analyzing large amounts of text data, allowing them to perform writing, conversation, summarizing and other language tasks. When LLMs process and generate large amounts of data, there is a risk of leaking sensitive information…

    Submitted 14 March, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

    Comments: 18 pages, 4 figures

  27. arXiv:2403.05121  [pdf, other]

    cs.CV

    CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion

    Authors: Wendi Zheng, Jiayan Teng, Zhuoyi Yang, Weihan Wang, Jidong Chen, Xiaotao Gu, Yuxiao Dong, Ming Ding, Jie Tang

    Abstract: Recent advancements in text-to-image generative systems have been largely driven by diffusion models. However, single-stage text-to-image diffusion models still face challenges in terms of computational efficiency and the refinement of image details. To tackle the issue, we propose CogView3, an innovative cascaded framework that enhances the performance of text-to-image diffusion. CogView3 is the…

    Submitted 8 March, 2024; originally announced March 2024.

  28. Dual-path Frequency Discriminators for Few-shot Anomaly Detection

    Authors: Yuhu Bai, Jiangning Zhang, Zhaofeng Chen, Yuhang Dong, Yunkang Cao, Guanzhong Tian

    Abstract: Few-shot anomaly detection (FSAD) plays a crucial role in industrial manufacturing. However, existing FSAD methods encounter difficulties leveraging a limited number of normal samples, frequently failing to detect and locate inconspicuous anomalies in the spatial domain. We have further discovered that these subtle anomalies would be more noticeable in the frequency domain. In this paper, we propo…

    Submitted 22 August, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: Accepted by KBS

  29. arXiv:2403.03500  [pdf, other]

    hep-ex

    Observation of the decay $h_{c}\to3(π^{+}π^{-})π^{0}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H.-R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (639 additional authors not shown)

    Abstract: Based on $(2712.4\pm14.1)\times10^{6}$ $ψ(3686)$ events collected with the BESIII detector, we study the decays $h_{c}\to3(π^{+}π^{-})π^{0}$, $h_{c}\to2(π^{+}π^{-})ω$, $h_{c}\to2(π^{+}π^{-})π^{0}η$, $h_{c}\to2(π^{+}π^{-})η$, and $h_{c}\to p\bar{p}$ via $ψ(3686)\toπ^{0}h_{c}$. The decay channel $h_{c}\to3(π^{+}π^{-})π^{0}$ is observed for the first time, and its branching fraction is determined to…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 11 pages, 3 figures

  30. arXiv:2403.01761  [pdf, other]

    hep-ex

    Observation of $ψ(3686)\to 3φ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H.-R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (645 additional authors not shown)

    Abstract: Using $(2.712\pm0.014)\times 10^9$ $ψ(3686)$ events collected by the BESIII detector operating at the BEPCII collider, we report the first observation of $ψ(3686)\to 3φ$ decay with a significance larger than 10$σ$. The branching fraction of this decay is determined to be $(1.46\pm0.05\pm0.17)\times10^{-5}$, where the first uncertainty is statistical and the second is systematic. No significant str…

    Submitted 4 March, 2024; originally announced March 2024.

  31. arXiv:2403.00046  [pdf, other]

    cs.SE cs.AI cs.CL

    SEED: Customize Large Language Models with Sample-Efficient Adaptation for Code Generation

    Authors: Xue Jiang, Yihong Dong, Zhi Jin, Ge Li

    Abstract: Although Large Language Models (LLMs) have made significant progress in code generation, they still struggle with code generation tasks in specific scenarios. These scenarios usually necessitate the adaptation of LLMs to fulfill specific needs, but the limited training samples available in practice lead to poor code generation performance. Therefore, how to effectively adapt LLMs to new scenarios…

    Submitted 23 March, 2024; v1 submitted 29 February, 2024; originally announced March 2024.

  32. arXiv:2402.18784  [pdf, other]

    cs.AI q-bio.NC

    Brain-inspired and Self-based Artificial Intelligence

    Authors: Yi Zeng, Feifei Zhao, Yuxuan Zhao, Dongcheng Zhao, Enmeng Lu, Qian Zhang, Yuwei Wang, Hui Feng, Zhuoya Zhao, Jihang Wang, Qingqun Kong, Yinqian Sun, Yang Li, Guobin Shen, Bing Han, Yiting Dong, Wenxuan Pan, Xiang He, Aorigele Bao, Jin Wang

    Abstract: The question "Can machines think?" and the Turing Test to assess whether machines could achieve human-level intelligence are among the roots of AI. With the philosophical argument "I think, therefore I am", this paper challenges the idea of a "thinking machine" supported by current AIs since there is no sense of self in them. Current artificial intelligence is only seemingly intelligent information…

    Submitted 28 February, 2024; originally announced February 2024.

  33. arXiv:2402.18104  [pdf, other]

    cs.CR cs.AI

    Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise and Reconstruction

    Authors: Tong Liu, Yingjie Zhang, Zhe Zhao, Yinpeng Dong, Guozhu Meng, Kai Chen

    Abstract: In recent years, large language models (LLMs) have demonstrated notable success across various tasks, but the trustworthiness of LLMs is still an open problem. One specific threat is the potential to generate toxic or harmful responses. Attackers can craft adversarial prompts that induce harmful responses from LLMs. In this work, we pioneer a theoretical foundation in LLM security by identifying…

    Submitted 10 June, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

  34. arXiv:2402.17253  [pdf, ps, other]

    math.DG math.AP

    Hardy type inequalities on manifolds with nonnegative Ricci curvature

    Authors: Yuxin Dong, Hezi Lin, Lingen Lu

    Abstract: We prove the Heisenberg-Pauli-Weyl inequality, Hardy-Sobolev inequality, and Caffarelli-Kohn-Nirenberg (CKN) inequality on manifolds with nonnegative Ricci curvature and Euclidean volume growth, of dimension $n \geq 3$.

    Submitted 3 March, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: All comments are welcome!

  35. arXiv:2402.17238  [pdf, other]

    cs.LG

    Does Negative Sampling Matter? A Review with Insights into its Theory and Applications

    Authors: Zhen Yang, Ming Ding, Tinglin Huang, Yukuo Cen, Junshuai Song, Bin Xu, Yuxiao Dong, Jie Tang

    Abstract: Negative sampling has swiftly risen to prominence as a focal point of research, with wide-ranging applications spanning machine learning, computer vision, natural language processing, data mining, and recommender systems. This growing interest raises several critical questions: Does negative sampling really matter? Is there a general framework that can incorporate all existing negative sampling me…

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 20 pages, 11 figures

  36. arXiv:2402.15938  [pdf, other]

    cs.CL cs.AI cs.CR cs.LG cs.SE

    Generalization or Memorization: Data Contamination and Trustworthy Evaluation for Large Language Models

    Authors: Yihong Dong, Xue Jiang, Huanyu Liu, Zhi Jin, Bin Gu, Mengfei Yang, Ge Li

    Abstract: Recent statements about the impressive capabilities of large language models (LLMs) are usually supported by evaluating on open-access benchmarks. Considering the vast size and wide-ranging sources of LLMs' training data, it could explicitly or implicitly include test data, leading to LLMs being more susceptible to data contamination. However, due to the opacity of training data, the black-box acc…

    Submitted 31 May, 2024; v1 submitted 24 February, 2024; originally announced February 2024.

    Comments: Accepted to ACL

  37. arXiv:2402.15810  [pdf, other]

    cs.DL cs.CL cs.LG

    OAG-Bench: A Human-Curated Benchmark for Academic Graph Mining

    Authors: Fanjin Zhang, Shijie Shi, Yifan Zhu, Bo Chen, Yukuo Cen, Jifan Yu, Yelin Chen, Lulu Wang, Qingfei Zhao, Yuqing Cheng, Tianyi Han, Yuwei An, Dan Zhang, Weng Lam Tam, Kun Cao, Yunhe Pang, Xinyu Guan, Huihui Yuan, Jian Song, Xiaoyan Li, Yuxiao Dong, Jie Tang

    Abstract: With the rapid proliferation of scientific literature, versatile academic knowledge services increasingly rely on comprehensive academic graph mining. Despite the availability of public academic graphs, benchmarks, and datasets, these resources often fall short in multi-aspect and fine-grained annotations, are constrained to specific task types and domains, or lack underlying real academic graphs.…

    Submitted 20 June, 2024; v1 submitted 24 February, 2024; originally announced February 2024.

    Comments: KDD'24, 9 pages, 5 appendix pages

    Journal ref: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD '24), August 25–29, 2024, Barcelona, Spain

  38. arXiv:2402.15535  [pdf, other]

    physics.app-ph physics.optics

    Combinatorial split-ring and spiral meta-resonator for efficient magnon-photon coupling

    Authors: Yuzan Xiong, Andrew Christy, Yun Dong, Andrew Comstock, Dali Sun, Yi Li, James F. Cahoon, Binbin Yang, Wei Zhang

    Abstract: Developing hybrid materials and structures for electromagnetic wave engineering has been a promising route towards novel functionalities and tunabilities in many modern applications and perspectives in new quantum technologies. Despite its established success in engineering optical light and terahertz waves, the implementation of meta-resonators operating at the microwave band is still emerging, e…

    Submitted 14 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: 11 pages, 7 figures

  39. arXiv:2402.15218  [pdf, other]

    cs.CR cs.CL cs.CV

    BSPA: Exploring Black-box Stealthy Prompt Attacks against Image Generators

    Authors: Yu Tian, Xiao Yang, Yinpeng Dong, Heming Yang, Hang Su, Jun Zhu

    Abstract: Extremely large image generators offer significant transformative potential across diverse sectors. They allow users to design specific prompts to generate realistic images through some black-box APIs. However, some studies reveal that image generators are notably susceptible to attacks and generate Not Suitable For Work (NSFW) content by manually designed toxin texts, especially imperceptible to…

    Submitted 23 February, 2024; originally announced February 2024.

  40. arXiv:2402.14672  [pdf, other]

    cs.CL cs.AI

    Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments

    Authors: Yu Gu, Yiheng Shu, Hao Yu, Xiao Liu, Yuxiao Dong, Jie Tang, Jayanth Srinivasa, Hugo Latapie, Yu Su

    Abstract: The applications of large language models (LLMs) have expanded well beyond the confines of text processing, signaling a new era where LLMs are envisioned as generalist language agents capable of operating within complex real-world environments. These environments are often highly expansive, making it impossible for the LLM to process them within its short-term memory. Motivated by recent research…

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: 16 pages, 8 figures, 4 tables

    ACM Class: I.2.7

  41. arXiv:2402.12729  [pdf, other]

    cs.LG cs.AI

    Scalable and reliable deep transfer learning for intelligent fault detection via multi-scale neural processes embedded with knowledge

    Authors: Zhongzhi Li, Jingqi Tu, Jiacheng Zhu, Jianliang Ai, Yiqun Dong

    Abstract: Deep transfer learning (DTL) is a fundamental method in the field of Intelligent Fault Detection (IFD). It aims to mitigate the degradation of method performance that arises from the discrepancies in data distribution between training set (source domain) and testing set (target domain). Considering the fact that fault data collection is challenging and certain faults are scarce, DTL-based methods…

    Submitted 20 February, 2024; originally announced February 2024.

  42. arXiv:2402.12160  [pdf, other]

    astro-ph.GA astro-ph.SR

    Probing the nature of rotation in the Pleiades, Alpha Persei, and Hyades clusters

    Authors: C. J. Hao, Y. Xu, L. G. Hou, S. B. Bian, Z. H. Lin, Y. J. Li, Y. W. Dong, D. J. Liu

    Abstract: Unraveling the internal kinematics of open clusters is crucial for understanding their formation and evolution. However, there is a dearth of research on this topic, primarily due to the lack of high-quality kinematic data. Using the exquisite-precision astrometric parameters and radial velocities provided by Gaia data release 3, we investigate the internal rotation in three of the most nearby and…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 23 pages, 17 figures, Accepted for publication in ApJ

  43. arXiv:2402.11940  [pdf, other]

    cs.CV cs.CR cs.LG

    AICAttack: Adversarial Image Captioning Attack with Attention-Based Optimization

    Authors: Jiyao Li, Mingze Ni, Yifei Dong, Tianqing Zhu, Wei Liu

    Abstract: Recent advances in deep learning research have shown remarkable achievements across many tasks in computer vision (CV) and natural language processing (NLP). At the intersection of CV and NLP is the problem of image captioning, where the related models' robustness against adversarial attacks has not been well studied. In this paper, we present a novel adversarial attack strategy, which we call AIC…

    Submitted 20 February, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  44. DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation

    Authors: Chong Zeng, Yue Dong, Pieter Peers, Youkang Kong, Hongzhi Wu, Xin Tong

    Abstract: This paper presents a novel method for exerting fine-grained lighting control during text-driven diffusion-based image generation. While existing diffusion models already have the ability to generate images under any lighting condition, without additional guidance these models tend to correlate image content and lighting. Moreover, text prompts lack the necessary expressional power to describe det…

    Submitted 27 May, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Accepted to SIGGRAPH 2024. Project page: https://dilightnet.github.io/

    Journal ref: ACM SIGGRAPH 2024 Conference Proceedings

  45. arXiv:2402.11855  [pdf, other]

    cs.IR

    TriSampler: A Better Negative Sampling Principle for Dense Retrieval

    Authors: Zhen Yang, Zhou Shao, Yuxiao Dong, Jie Tang

    Abstract: Negative sampling stands as a pivotal technique in dense retrieval, essential for training effective retrieval models and significantly impacting retrieval performance. While existing negative sampling methods have made commendable progress by leveraging hard negatives, a comprehensive guiding principle for constructing negative candidates and designing negative sampling distributions is still lac…

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 9 pages, 4 figures

  46. arXiv:2402.11561  [pdf, other]

    hep-ph hep-ex hep-lat nucl-th

    Transversity generalized parton distributions in spin-3/2 particles

    Authors: Dongyan Fu, Yubing Dong, S. Kumano

    Abstract: The definitions of the quark and gluon transversity generalized parton distributions (GPDs) in spin-3/2 particles are obtained in the light-cone gauge. It is found that they contain 16 independent components for each parton. Their even or odd property is found in terms of skewness variable, and the odd transversity GPDs vanish in the forward limit. There are 16 amplitudes with helicity flip of qua…

    Submitted 10 May, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: 22 pages, 2 figures

  47. arXiv:2402.11507  [pdf, other]

    cs.CV cs.RO

    MAL: Motion-Aware Loss with Temporal and Distillation Hints for Self-Supervised Depth Estimation

    Authors: Yue-Jiang Dong, Fang-Lue Zhang, Song-Hai Zhang

    Abstract: Depth perception is crucial for a wide range of robotic applications. Multi-frame self-supervised depth estimation methods have gained research interest due to their ability to leverage large-scale, unlabeled real-world data. However, the self-supervised methods often rely on the assumption of a static scene and their performance tends to degrade in dynamic environments. To address this issue, we…

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: Accepted by ICRA 2024; Project homepage: https://yuejiangdong.github.io/MotionAwareLoss/

  48. arXiv:2402.11207  [pdf, ps, other]

    hep-ex

    Search for the production of deuterons and antideuterons in $e^+e^-$ annihilation at center-of-mass energies between 4.13 and 4.70 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, R. Aliberti, A. Amoroso, M. R. An, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, et al. (593 additional authors not shown)

    Abstract: Using a data sample of $e^+e^-$ collision data corresponding to an integrated luminosity of 19 fb$^{-1}$ collected with the BESIII detector at the BEPCII collider, we search for the production of deuterons and antideuterons via $e^+e^-\to pp\pi^-\bar{d}+c.c.$ for the first time at center-of-mass energies between 4.13 and 4.70 GeV. No significant signal is observed and the upper limit of the…

    Submitted 17 February, 2024; originally announced February 2024.

  49. arXiv:2402.11034  [pdf, other]

    cs.CL

    PAT-Questions: A Self-Updating Benchmark for Present-Anchored Temporal Question-Answering

    Authors: Jannat Ara Meem, Muhammad Shihab Rashid, Yue Dong, Vagelis Hristidis

    Abstract: Existing work on Temporal Question Answering (TQA) has predominantly focused on questions anchored to specific timestamps or events (e.g. "Who was the US president in 1970?"). Little work has studied questions whose temporal context is relative to the present time (e.g. "Who was the previous US president?"). We refer to this problem as Present-Anchored Temporal QA (PATQA). PATQA poses unique chall…

    Submitted 3 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: Accepted to Findings of ACL '24

  50. arXiv:2402.10907  [pdf]

    physics.app-ph physics.ins-det physics.optics

    Optically Levitated Nanoparticles as Receiving Antennas for Low Frequency Wireless Communication

    Authors: Zhenhai Fu, Jinsheng Xu, Shaochong Zhu, Chaoxiong He, Xunming Zhu, Xiaowen Gao, Han Cai, Peitong He, Zhiming Chen, Yizhou Zhang, Nan Li, Xingfan Chen, Ying Dong, Shiyao Zhu, Cheng Liu, Huizhu Hu

    Abstract: Low-frequency (LF) wireless communications play a crucial role in ensuring anti-interference, long-range, and efficient communication across various environments. However, in conventional LF communication systems, their antenna size is required to be inversely proportional to the frequency, so that their mobility and flexibility are greatly limited. Here we introduce a novel prototype of LF recei…

    Submitted 10 January, 2024; originally announced February 2024.