
Showing 101–150 of 622 results for author: Wen, Y

  1. arXiv:2404.02033  [pdf, other]

    hep-ex hep-ph

    Search for $C$-even states decaying to $D_{s}^{\pm}D_{s}^{*\mp}$ with masses between $4.08$ and $4.32~\mathrm{GeV}/c^{2}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (638 additional authors not shown)

    Abstract: Six $C$-even states, denoted as $X$, with quantum numbers $J^{PC}=0^{-+}$, $1^{\pm+}$, or $2^{\pm+}$, are searched for via the $e^+e^-\toγD_{s}^{\pm}D_{s}^{*\mp}$ process using $(1667.39\pm8.84)~\mathrm{pb}^{-1}$ of $e^+e^-$ collision data collected with the BESIII detector operating at the BEPCII storage ring at center-of-mass energy of $\sqrt{s}=(4681.92\pm0.30)~\mathrm{MeV}$. No statistically s…

    Submitted 30 August, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Journal ref: Phys. Rev. D 110, 032017 (2024)

  2. arXiv:2404.01231  [pdf, other]

    cs.CR cs.LG

    Privacy Backdoors: Enhancing Membership Inference through Poisoning Pre-trained Models

    Authors: Yuxin Wen, Leo Marchyok, Sanghyun Hong, Jonas Geiping, Tom Goldstein, Nicholas Carlini

    Abstract: It is commonplace to produce application-specific models by fine-tuning large pre-trained models using a small bespoke dataset. The widespread availability of foundation model checkpoints on the web poses considerable risks, including the vulnerability to backdoor attacks. In this paper, we unveil a new vulnerability: the privacy backdoor attack. This black-box privacy attack aims to amplify the p…

    Submitted 1 April, 2024; originally announced April 2024.

  3. Multiple Joint Chance Constraints Approximation for Uncertainty Modeling in Dispatch Problems

    Authors: Yilin Wen, Yi Guo, Zechun Hu, Gabriela Hug

    Abstract: Uncertainty modeling has become increasingly important in power system decision-making. The widely used tractable uncertainty modeling method, chance constraints with Conditional Value at Risk (CVaR) approximation, can be overconservative and even turn an originally feasible problem into an infeasible one. This paper proposes a new approximation method for multiple joint chance constraints (JCCs) t…

    Submitted 1 April, 2024; originally announced April 2024.

  4. arXiv:2404.00368  [pdf, other]

    cs.CV

    Towards Variable and Coordinated Holistic Co-Speech Motion Generation

    Authors: Yifei Liu, Qiong Cao, Yandong Wen, Huaiguang Jiang, Changxing Ding

    Abstract: This paper addresses the problem of generating lifelike holistic co-speech motions for 3D avatars, focusing on two key aspects: variability and coordination. Variability allows the avatar to exhibit a wide range of motions even with similar speech content, while coordination ensures a harmonious alignment among facial expressions, hand gestures, and body poses. We aim to achieve both with ProbTalk…

    Submitted 15 April, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: CVPR 2024

  5. arXiv:2403.19866  [pdf, other]

    cs.CV cs.AI

    Is Synthetic Image Useful for Transfer Learning? An Investigation into Data Generation, Volume, and Utilization

    Authors: Yuhang Li, Xin Dong, Chen Chen, Jingtao Li, Yuxin Wen, Michael Spranger, Lingjuan Lyu

    Abstract: Synthetic image data generation represents a promising avenue for training deep learning models, particularly in the realm of transfer learning, where obtaining real images within a specific domain can be prohibitively expensive due to privacy and intellectual property considerations. This work delves into the generation and utilization of synthetic images derived from text-to-image generative mod…

    Submitted 2 April, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: ICLR24 Score 6865 https://openreview.net/forum?id=CjPt1AC6w0

  6. arXiv:2403.19438  [pdf, other]

    cs.CV cs.RO

    SubjectDrive: Scaling Generative Data in Autonomous Driving via Subject Control

    Authors: Binyuan Huang, Yuqing Wen, Yucheng Zhao, Yaosi Hu, Yingfei Liu, Fan Jia, Weixin Mao, Tiancai Wang, Chi Zhang, Chang Wen Chen, Zhenzhong Chen, Xiangyu Zhang

    Abstract: Autonomous driving progress relies on large-scale annotated datasets. In this work, we explore the potential of generative models to produce vast quantities of freely-labeled data for autonomous driving applications and present SubjectDrive, the first model proven to scale generative data production in a way that could continuously improve autonomous driving applications. We investigate the impact…

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: Project page: https://subjectdrive.github.io/

  7. arXiv:2403.19256  [pdf, other]

    hep-ex

    Measurement of absolute branching fractions of $D_s^+$ hadronic decays

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (632 additional authors not shown)

    Abstract: Using $e^+ e^-$ collision data collected at the BESIII detector at center-of-mass energies between 4.128 and 4.226 GeV, corresponding to an integrated luminosity of $7.33~{\rm fb}^{-1}$, we determine the absolute branching fractions of fifteen hadronic $D_s^{+}$ decays with a double-tag technique. In particular, we make precise measurements of the branching fractions…

    Submitted 30 May, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

  8. arXiv:2403.18209  [pdf, other]

    cs.LG cs.AI cs.RO

    Long and Short-Term Constraints Driven Safe Reinforcement Learning for Autonomous Driving

    Authors: Xuemin Hu, Pan Chen, Yijun Wen, Bo Tang, Long Chen

    Abstract: Reinforcement learning (RL) has been widely used in decision-making tasks, but it cannot guarantee the agent's safety in the training process due to the requirements of interaction with the environment, which seriously limits its industrial applications such as autonomous driving. Safe RL methods are developed to handle this issue by constraining the expected safety violation costs as a training o…

    Submitted 26 March, 2024; originally announced March 2024.

  9. arXiv:2403.17676  [pdf]

    physics.app-ph cs.ET

    Analysis on reservoir activation with the nonlinearity harnessed from solution-processed MoS2 devices

    Authors: Songwei Liu, Yang Liu, Yingyi Wen, Jingfang Pei, Pengyu Liu, Lekai Song, Xiaoyue Fan, Wenchen Yang, Danmei Pan, Teng Ma, Yue Lin, Gang Wang, Guohua Hu

    Abstract: Reservoir computing is a recurrent neural network that has been applied across various domains in machine learning. The implementation of reservoir computing, however, often demands heavy computations for activating the reservoir. Configuring physical reservoir networks and harnessing the nonlinearity from the underlying devices for activation is an emergent solution to address the computational c…

    Submitted 26 March, 2024; originally announced March 2024.

  10. arXiv:2403.16811  [pdf, ps, other]

    hep-ex

    Cross section measurement of $e^+e^-\to ηψ(2S)$ and search for $e^+e^-\toη\tilde{X}(3872)$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (639 additional authors not shown)

    Abstract: The energy-dependent cross section for $e^+e^-\to ηψ(2S)$ is measured at eighteen center of mass energies from 4.288 GeV to 4.951 GeV using the BESIII detector. Using the same data samples, we also perform the first search for the reaction $e^+e^-\toη\tilde{X}(3872)$, but no evidence is found for the $\tilde{X}(3872)$ in the $π^+π^- J/ψ$ mass distribution. At each of the eighteen center of mass en…

    Submitted 25 March, 2024; originally announced March 2024.

  11. arXiv:2403.14998  [pdf, other]

    hep-ex

    Precise measurement of the $e^+e^-\to D_s^+D_s^-$ cross sections at center-of-mass energies from threshold to 4.95 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (638 additional authors not shown)

    Abstract: Using the $e^+e^-$ collision data collected with the BESIII detector operating at the BEPCII collider, at center-of-mass energies from the threshold to $4.95$~GeV, we present precise measurements of the cross sections for the process $e^+e^-\to D_s^+D_s^-$ using a single tag method. The resulting cross section lineshape exhibits several new structures, thereby offering an input for coupled channel…

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 9 pages, 4 figures, published to PRL

  12. arXiv:2403.14409  [pdf, other]

    cs.CL cs.AI

    Locating and Mitigating Gender Bias in Large Language Models

    Authors: Yuchen Cai, Ding Cao, Rongxi Guo, Yaqin Wen, Guiquan Liu, Enhong Chen

    Abstract: Large language models (LLMs) are pre-trained on extensive corpora to learn facts and human cognition which contain human preferences. However, this process can inadvertently lead to these models acquiring biases and stereotypes prevalent in society. Prior research has typically tackled the issue of bias through a one-dimensional perspective, concentrating either on locating or mitigating it. This li…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: 23 pages, 5 figures

  13. arXiv:2403.14381  [pdf, other]

    cs.CL cs.AI

    Editing Knowledge Representation of Language Model via Rephrased Prefix Prompts

    Authors: Yuchen Cai, Ding Cao, Rongxi Guo, Yaqin Wen, Guiquan Liu, Enhong Chen

    Abstract: Neural language models (LMs) have been extensively trained on vast corpora to store factual knowledge about various aspects of the world described in texts. Current technologies typically employ knowledge editing methods or specific prompts to modify LM outputs. However, existing knowledge editing methods are costly and inefficient, struggling to produce appropriate text. Additionally, prompt engi…

    Submitted 11 May, 2024; v1 submitted 21 March, 2024; originally announced March 2024.

    Comments: 19 pages, 3 figures

  14. arXiv:2403.14091  [pdf, other]

    astro-ph.GA

    Evidence for an Outer Component in the Continuum Reverberation Mapping of Active Galactic Nuclei

    Authors: Yuanzhe Jiang, Xue-Bing Wu, Qinchun Ma, Huapeng Gu, Yuhan Wen

    Abstract: Continuum reverberation mapping is widely used in studying the accretion disks of active galactic nuclei (AGN). Some indirect evidence and simulations have indicated that the diffuse continuum, especially the strong Balmer continuum from the broad line region (BLR), may contribute to the continuum in the u/U band. Here, we present direct evidence for this contribution. In this work, we apply the I…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 17 pages, 10 figures, accepted for publication in the Astrophysical Journal

  15. arXiv:2403.13863  [pdf, other]

    cs.LG cs.AI cs.DB

    DiffImpute: Tabular Data Imputation With Denoising Diffusion Probabilistic Model

    Authors: Yizhu Wen, Kai Yi, Jing Ke, Yiqing Shen

    Abstract: Tabular data plays a crucial role in various domains but often suffers from missing values, thereby curtailing its potential utility. Traditional imputation techniques frequently yield suboptimal results and impose substantial computational burdens, leading to inaccuracies in subsequent modeling tasks. To address these challenges, we propose DiffImpute, a novel Denoising Diffusion Probabilistic Mo…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 26 pages, 6 figures

  16. arXiv:2403.13437  [pdf, other]

    hep-ex

    Search for $ΔS=2$ nonleptonic hyperon decays $Ω^-\toΣ^{0}π^{-}$ and $Ω^-\to nK^{-}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (638 additional authors not shown)

    Abstract: Using $(27.12 \pm 0.14) \times 10^{8}$ $ψ(3686)$ events collected by the BESIII detector at the center-of-mass energy of $\sqrt{s} = 3.686$ GeV, we search for the first time for two nonleptonic hyperon decays that change strangeness by two units, $Ω^-\toΣ^{0}π^-$ and $Ω^-\to nK^{-}$. No significant signal is observed. The upper limits on their decay branching fractions are determined to be…

    Submitted 14 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  17. arXiv:2403.10877  [pdf, ps, other]

    hep-ex hep-ph

    Test of lepton universality and measurement of the form factors of $D^0\to K^{*}(892)^-μ^+ν_μ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (637 additional authors not shown)

    Abstract: We report a first study of the semileptonic decay $D^0\rightarrow K^-π^0μ^{+}ν_μ$ by analyzing an $e^+e^-$ annihilation data sample of $7.9~\mathrm{fb}^{-1}$ collected at the center-of-mass energy of 3.773 GeV with the BESIII detector. The absolute branching fraction of $D^0\to K^-π^0μ^{+}ν_μ$ is measured for the first time to be $(0.729 \pm 0.014_{\rm stat} \pm 0.011_{\rm syst})\%$. Based on an a…

    Submitted 16 March, 2024; originally announced March 2024.

    Comments: 9 pages, 3 figures

  18. arXiv:2403.10223  [pdf, other]

    astro-ph.GA

    H$α$ Time Delays of AGNs from the Zwicky Transient Facility Broadband Photometry

    Authors: Qinchun Ma, Yuhan Wen, Xue-Bing Wu, Huapeng Gu, Yuming Fu

    Abstract: In our previous work on broadband photometric reverberation mapping (PRM), we proposed the ICCF-Cut process to obtain the time lags of H$α$ emission line from two broadband lightcurves via subtracting the continuum emission from the line band. Extending the work, we enlarge our sample to the Zwicky Transient Facility (ZTF) database. We adopt two criteria to select 123 type 1 AGNs with sufficient v…

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: 23 pages, 34 figures, accepted for publication in the Astrophysical Journal

  19. arXiv:2403.10127  [pdf, other]

    cs.CV

    TransLandSeg: A Transfer Learning Approach for Landslide Semantic Segmentation Based on Vision Foundation Model

    Authors: Changhong Hou, Junchuan Yu, Daqing Ge, Liu Yang, Laidian Xi, Yunxuan Pang, Yi Wen

    Abstract: Landslides are one of the most destructive natural disasters in the world, posing a serious threat to human life and safety. The development of foundation models has provided a new research paradigm for large-scale landslide detection. The Segment Anything Model (SAM) has garnered widespread attention in the field of image segmentation. However, our experiment found that SAM performed poorly in th…

    Submitted 15 March, 2024; originally announced March 2024.

  20. arXiv:2403.07648  [pdf, other]

    cs.DC cs.LG

    Characterization of Large Language Model Development in the Datacenter

    Authors: Qinghao Hu, Zhisheng Ye, Zerui Wang, Guoteng Wang, Meng Zhang, Qiaoling Chen, Peng Sun, Dahua Lin, Xiaolin Wang, Yingwei Luo, Yonggang Wen, Tianwei Zhang

    Abstract: Large Language Models (LLMs) have presented impressive performance across several transformative tasks. However, it is non-trivial to efficiently utilize large-scale cluster resources to develop LLMs, often riddled with numerous challenges such as frequent hardware failures, intricate parallelization strategies, and imbalanced resource utilization. In this paper, we present an in-depth characteriz…

    Submitted 3 April, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  21. arXiv:2403.07552  [pdf, ps, other]

    math.AG

    Springer correspondence and mirror symmetries for parabolic Hitchin systems

    Authors: Bin Wang, Xueqing Wen, Yaoxiong Wen

    Abstract: We prove the Strominger--Yau--Zaslow and topological mirror symmetries for parabolic Hitchin systems of types B and C. In contrast to type A, a geometric reinterpretation of Springer duality is necessary. Furthermore, unlike Hitchin's construction in the non-parabolic case, the map between generic fibers in type B and C needs more analysis due to the change of partitions of Springer dual nilpotent…

    Submitted 12 March, 2024; originally announced March 2024.

    Comments: 74 pages. Comments are welcome!

  22. arXiv:2403.06766  [pdf, other]

    hep-ex

    Determination of the number of $ψ(3686)$ events taken at BESIII

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (639 additional authors not shown)

    Abstract: The number of $ψ(3686)$ events collected by the BESIII detector during the 2021 run period is determined to be $(2259.3\pm 11.1)\times 10^6$ by counting inclusive $ψ(3686)$ hadronic events. The uncertainty is systematic and the statistical uncertainty is negligible. Meanwhile, the numbers of $ψ(3686)$ events collected during the 2009 and 2012 run periods are updated to be…

    Submitted 28 May, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

  23. arXiv:2403.06221  [pdf, other]

    cs.AI cs.CL cs.IR

    TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision

    Authors: Ruiwen Zhou, Yingxuan Yang, Muning Wen, Ying Wen, Wenhao Wang, Chunling Xi, Guoqiang Xu, Yong Yu, Weinan Zhang

    Abstract: Numerous large language model (LLM) agents have been built for different tasks like web navigation and online shopping due to LLM's wide knowledge and text-understanding ability. Among these works, many of them utilize in-context examples to achieve generalization without the need for fine-tuning, while few of them have considered the problem of how to select and effectively utilize these examples…

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: Codes available at: https://github.com/skyriver-2000/TRAD-Official

  24. arXiv:2403.05240  [pdf, ps, other]

    math.AG

    Seiberg-like duality for resolutions of determinantal varieties

    Authors: Nathan Priddis, Mark Shoemaker, Yaoxiong Wen

    Abstract: We study the genus-zero Gromov-Witten theory of two natural resolutions of determinantal varieties, termed the PAX and PAXY models. We realize each resolution as lying in a quiver bundle, and show that the respective quiver bundles are related by a quiver mutation. We prove that generating functions of genus-zero Gromov-Witten invariants for the two resolutions are related by a specific cluster ch…

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 38 pages. Comments are welcome!

    MSC Class: 14N35; 14M12

  25. arXiv:2403.03500  [pdf, other]

    hep-ex

    Observation of the decay $h_{c}\to3(π^{+}π^{-})π^{0}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (639 additional authors not shown)

    Abstract: Based on $(2712.4\pm14.1)\times10^{6}$ $ψ(3686)$ events collected with the BESIII detector, we study the decays $h_{c}\to3(π^{+}π^{-})π^{0}$, $h_{c}\to2(π^{+}π^{-})ω$, $h_{c}\to2(π^{+}π^{-})π^{0}η$, $h_{c}\to2(π^{+}π^{-})η$, and $h_{c}\to p\bar{p}$ via $ψ(3686)\toπ^{0}h_{c}$. The decay channel $h_{c}\to3(π^{+}π^{-})π^{0}$ is observed for the first time, and its branching fraction is determined to…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 11 pages, 3 figures

  26. arXiv:2403.02576  [pdf, other]

    cs.DL cs.LG cs.SI

    AceMap: Knowledge Discovery through Academic Graph

    Authors: Xinbing Wang, Luoyi Fu, Xiaoying Gan, Ying Wen, Guanjie Zheng, Jiaxin Ding, Liyao Xiang, Nanyang Ye, Meng Jin, Shiyu Liang, Bin Lu, Haiwen Wang, Yi Xu, Cheng Deng, Shao Zhang, Huquan Kang, Xingli Wang, Qi Li, Zhixin Guo, Jiexing Qi, Pan Liu, Yuyang Ren, Lyuwen Wu, Jungang Yang, Jianping Zhou, et al. (1 additional author not shown)

    Abstract: The exponential growth of scientific literature requires effective management and extraction of valuable insights. While existing scientific search engines excel at delivering search results based on relational databases, they often neglect the analysis of collaborations between scientific entities and the evolution of ideas, as well as the in-depth analysis of content within scientific publicatio…

    Submitted 14 April, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: Technical Report for AceMap (https://www.acemap.info)

  27. Observation of Seven Astrophysical Tau Neutrino Candidates with IceCube

    Authors: IceCube Collaboration, R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, G. Anton, C. Argüelles, Y. Ashida, S. Athanasiadou, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, C. Bellenghi, et al. (380 additional authors not shown)

    Abstract: We report on a measurement of astrophysical tau neutrinos with 9.7 years of IceCube data. Using convolutional neural networks trained on images derived from simulated events, seven candidate $ν_τ$ events were found with visible energies ranging from roughly 20 TeV to 1 PeV and a median expected parent $ν_τ$ energy of about 200 TeV. Considering backgrounds from astrophysical and atmospheric neutrin…

    Submitted 26 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: Accepted for publication in Physical Review Letters. This version includes full author list metadata

    Journal ref: Phys.Rev.Lett. 132 (2024) 15, 151001

  28. arXiv:2403.02470  [pdf, other]

    astro-ph.HE astro-ph.IM physics.data-an

    Improved modeling of in-ice particle showers for IceCube event reconstruction

    Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, G. Anton, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, et al. (394 additional authors not shown)

    Abstract: The IceCube Neutrino Observatory relies on an array of photomultiplier tubes to detect Cherenkov light produced by charged particles in the South Pole ice. IceCube data analyses depend on an in-depth characterization of the glacial ice, and on novel approaches in event reconstruction that utilize fast approximations of photoelectron yields. Here, a more accurate model is derived for event reconstr…

    Submitted 22 April, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: 28 pages, 18 figures, 1 table, submitted to JINST, updated to account for comments received

    Journal ref: 2024 JINST 19 P06026

  29. arXiv:2403.01761  [pdf, other]

    hep-ex

    Observation of $ψ(3686)\to 3φ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (645 additional authors not shown)

    Abstract: Using $(2.712\pm0.014)\times 10^9$ $ψ(3686)$ events collected by the BESIII detector operating at the BEPCII collider, we report the first observation of $ψ(3686)\to 3φ$ decay with a significance larger than 10$σ$. The branching fraction of this decay is determined to be $(1.46\pm0.05\pm0.17)\times10^{-5}$, where the first uncertainty is statistical and the second is systematic. No significant str…

    Submitted 4 March, 2024; originally announced March 2024.

  30. arXiv:2403.00841  [pdf, other]

    cs.MA cs.AI cs.GT cs.LG

    Offline Fictitious Self-Play for Competitive Games

    Authors: Jingxiao Chen, Weiji Xie, Weinan Zhang, Yong Yu, Ying Wen

    Abstract: Offline Reinforcement Learning (RL) has received significant interest due to its ability to improve policies in previously collected datasets without online interactions. Despite its success in the single-agent setting, offline multi-agent RL remains a challenge, especially in competitive games. Firstly, unaware of the game structure, it is impossible to interact with the opponents and conduct a m…

    Submitted 29 February, 2024; originally announced March 2024.

  31. arXiv:2403.00255  [pdf, other]

    cs.GT cs.MA

    Leveraging Team Correlation for Approximating Equilibrium in Two-Team Zero-Sum Games

    Authors: Naming Liu, Mingzhi Wang, Youzhi Zhang, Yaodong Yang, Bo An, Ying Wen

    Abstract: Two-team zero-sum games are one of the most important paradigms in game theory. In this paper, we focus on finding an unexploitable equilibrium in large team games. An unexploitable equilibrium is a worst-case policy, where members in the opponent team cannot increase their team reward by taking any policy, e.g., cooperatively changing to other joint policies. As an optimal unexploitable equilibri…

    Submitted 29 February, 2024; originally announced March 2024.

  32. arXiv:2403.00144  [pdf, other]

    cs.CL cs.AI cs.LG

    EBBS: An Ensemble with Bi-Level Beam Search for Zero-Shot Machine Translation

    Authors: Yuqiao Wen, Behzad Shayegh, Chenyang Huang, Yanshuai Cao, Lili Mou

    Abstract: The ability of zero-shot translation emerges when we train a multilingual model with certain translation directions; the model can then directly translate in unseen directions. Alternatively, zero-shot translation can be accomplished by pivoting through a third language (e.g., English). In our work, we observe that both direct and pivot translations are noisy and achieve less satisfactory performa…

    Submitted 29 February, 2024; originally announced March 2024.

    ACM Class: I.2.7; I.2.6; I.2.m; I.5.1; I.7.m

  33. arXiv:2403.00143  [pdf, other]

    cs.CL cs.AI cs.LG

    Ensemble-Based Unsupervised Discontinuous Constituency Parsing by Tree Averaging

    Authors: Behzad Shayegh, Yuqiao Wen, Lili Mou

    Abstract: We address unsupervised discontinuous constituency parsing, where we observe a high variance in the performance of the only previous model. We propose to build an ensemble of different runs of the existing discontinuous parser by averaging the predicted trees, to stabilize and boost performance. To begin with, we provide comprehensive computational complexity analysis (in terms of P and NP-complet…

    Submitted 29 February, 2024; originally announced March 2024.

  34. arXiv:2402.18591  [pdf, ps, other]

    cs.LG cs.GT math.ST

    Stochastic contextual bandits with graph feedback: from independence number to MAS number

    Authors: Yuxiao Wen, Yanjun Han, Zhengyuan Zhou

    Abstract: We consider contextual bandits with graph feedback, a class of interactive learning problems with richer structures than vanilla contextual bandits, where taking an action reveals the rewards for all neighboring actions in the feedback graph under all contexts. Unlike the multi-armed bandits setting where a growing literature has painted a near-complete understanding of graph feedback, much remain…

    Submitted 12 February, 2024; originally announced February 2024.

  35. arXiv:2402.18035  [pdf, other]

    astro-ph.HE astro-ph.GA

    A study of 10 Rotating Radio Transients using Parkes radio telescope

    Authors: Xinhui Ren, Jingbo Wang, Wenming Yan, Jintao Xie, Shuangqiang Wang, Yirong Wen, Yong Xia

    Abstract: Rotating Radio Transients (RRATs) are a relatively new subclass of pulsars that emit detectable radio bursts sporadically. We conducted an analysis of 10 RRATs observed using the Parkes telescope, with 8 of these observed via the Ultra-Wideband Receiver. We measured the burst rate and produced integrated profiles spanning multiple frequency bands for 3 RRATs. We also conducted a spectral analysis…

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 16 pages, 8 figures, RAA accepted

  36. arXiv:2402.18026  [pdf, other]

    astro-ph.HE astro-ph.IM

    Characterization of the Astrophysical Diffuse Neutrino Flux using Starting Track Events in IceCube

    Authors: R. Abbasi, M. Ackermann, J. Adams, S. K. Agarwalla, J. A. Aguilar, M. Ahlers, J. M. Alameddine, N. M. Amin, K. Andeen, G. Anton, C. Argüelles, Y. Ashida, S. Athanasiadou, L. Ausborm, S. N. Axani, X. Bai, A. Balagopal V., M. Baricevic, S. W. Barwick, S. Bash, V. Basu, R. Bay, J. J. Beatty, J. Becker Tjus, J. Beise, et al. (394 additional authors not shown)

    Abstract: A measurement of the diffuse astrophysical neutrino spectrum is presented using IceCube data collected from 2011-2022 (10.3 years). We developed novel detection techniques to search for events with a contained vertex and exiting track induced by muon neutrinos undergoing a charged-current interaction. Searching for these starting track events allows us to not only more effectively reject atmospher…

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 27 pages, 28 figures

    Journal ref: Phys. Rev. D 110, 022001 (2024)

  37. arXiv:2402.17453  [pdf, other]

    cs.LG

    DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning

    Authors: Siyuan Guo, Cheng Deng, Ying Wen, Hechang Chen, Yi Chang, Jun Wang

    Abstract: In this work, we investigate the potential of large language model (LLM) based agents to automate data science tasks, with the goal of comprehending task requirements, then building and training the best-fit machine learning models. Despite their widespread success, existing LLM agents are hindered by generating unreasonable experiment plans within this scenario. To this end, we present DS-Agent…

    Submitted 28 May, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted by ICML 2024

  38. arXiv:2402.15972  [pdf, other]

    cs.LG cs.NI

    Structural Knowledge-Driven Meta-Learning for Task Offloading in Vehicular Networks with Integrated Communications, Sensing and Computing

    Authors: Ruijin Sun, Yao Wen, Nan Cheng, Wei Wan, Rong Chai, Yilong Hui

    Abstract: Task offloading is a potential solution to satisfy the strict requirements of computation-intensive and latency-sensitive vehicular applications due to the limited onboard computing resources. However, the overwhelming upload traffic may lead to unacceptable uploading time. To tackle this issue, for tasks taking environmental data as input, the data perceived by roadside units (RSU) equipped with…

    Submitted 24 February, 2024; originally announced February 2024.

  39. arXiv:2402.14020  [pdf, other]

    cs.LG cs.CL cs.CR

    Coercing LLMs to do and reveal (almost) anything

    Authors: Jonas Geiping, Alex Stein, Manli Shu, Khalid Saifullah, Yuxin Wen, Tom Goldstein

    Abstract: It has recently been shown that adversarial attacks on large language models (LLMs) can "jailbreak" the model into making harmful statements. In this work, we argue that the spectrum of adversarial attacks on LLMs is much larger than merely jailbreaking. We provide a broad overview of possible attack surfaces and attack goals. Based on a series of concrete examples, we discuss, categorize and syst…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: 32 pages. Implementation available at https://github.com/JonasGeiping/carving

  40. arXiv:2402.12416  [pdf, other]

    cs.MA cs.AI

    Aligning Individual and Collective Objectives in Multi-Agent Cooperation

    Authors: Yang Li, Wenhao Zhang, Jianhong Wang, Shao Zhang, Yali Du, Ying Wen, Wei Pan

    Abstract: Among the research topics in multi-agent learning, mixed-motive cooperation is one of the most prominent challenges, primarily due to the mismatch between individual and collective goals. The cutting-edge research is focused on incorporating domain knowledge into rewards and introducing additional mechanisms to incentivize cooperation. However, these approaches often face shortcomings such as the…

    Submitted 22 May, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: 19 pages

  41. arXiv:2402.08552  [pdf, other]

    cs.LG cs.CV

    Confronting Reward Overoptimization for Diffusion Models: A Perspective of Inductive and Primacy Biases

    Authors: Ziyi Zhang, Sen Zhang, Yibing Zhan, Yong Luo, Yonggang Wen, Dacheng Tao

    Abstract: Bridging the gap between diffusion models and human preferences is crucial for their integration into practical generative workflows. While optimizing downstream reward models has emerged as a promising alignment strategy, concerns arise regarding the risk of excessive optimization with learned reward models, which potentially compromises ground-truth performance. In this work, we confront the rew…

    Submitted 5 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: Accepted to ICML 2024

  42. arXiv:2402.08073  [pdf, other]

    cs.LG cs.PL cs.SE

    Grounding Data Science Code Generation with Input-Output Specifications

    Authors: Yeming Wen, Pengcheng Yin, Kensen Shi, Henryk Michalewski, Swarat Chaudhuri, Alex Polozov

    Abstract: Large language models (LLMs) have recently demonstrated a remarkable ability to generate code from natural language (NL) prompts. However, in the real world, NL is often too ambiguous to capture the true intent behind programming problems, requiring additional input-output (I/O) specifications. Unfortunately, LLMs can have difficulty aligning their outputs with both the NL prompt and the I/O speci…

    Submitted 14 March, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  43. arXiv:2402.08010  [pdf, other]

    cs.LG cs.AI stat.ML

    Which Frequencies do CNNs Need? Emergent Bottleneck Structure in Feature Learning

    Authors: Yuxiao Wen, Arthur Jacot

    Abstract: We describe the emergence of a Convolution Bottleneck (CBN) structure in CNNs, where the network uses its first few layers to transform the input representation into a representation that is supported only along a few frequencies and channels, before using the last few layers to map back to the outputs. We define the CBN rank, which describes the number and type of frequencies that are kept inside…

    Submitted 12 February, 2024; originally announced February 2024.

  44. arXiv:2402.07792  [pdf, other]

    cs.LG cs.DC

    Empowering Federated Learning for Massive Models with NVIDIA FLARE

    Authors: Holger R. Roth, Ziyue Xu, Yuan-Ting Hsieh, Adithya Renduchintala, Isaac Yang, Zhihong Zhang, Yuhong Wen, Sean Yang, Kevin Lu, Kristopher Kersten, Camir Ricketts, Daguang Xu, Chester Chen, Yan Cheng, Andrew Feng

    Abstract: In the ever-evolving landscape of artificial intelligence (AI) and large language models (LLMs), handling and leveraging data effectively has become a critical challenge. Most state-of-the-art machine learning algorithms are data-centric. However, as the lifeblood of model performance, necessary data cannot always be centralized due to various factors such as privacy, regulation, geopolitics, copy…

    Submitted 12 February, 2024; originally announced February 2024.

  45. arXiv:2402.07157  [pdf, other]

    cs.CL cs.AI cs.LG

    Natural Language Reinforcement Learning

    Authors: Xidong Feng, Ziyu Wan, Mengyue Yang, Ziyan Wang, Girish A. Koushik, Yali Du, Ying Wen, Jun Wang

    Abstract: Reinforcement Learning (RL) has shown remarkable abilities in learning policies for decision-making tasks. However, RL is often hindered by issues such as low sample efficiency, lack of interpretability, and sparse supervision signals. To tackle these limitations, we take inspiration from the human learning process and introduce Natural Language Reinforcement Learning (NLRL), which innovatively co…

    Submitted 14 February, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

    Comments: Work in Progress

  46. arXiv:2402.06700  [pdf, other]

    cs.LG cs.AI

    Entropy-Regularized Token-Level Policy Optimization for Language Agent Reinforcement

    Authors: Muning Wen, Junwei Liao, Cheng Deng, Jun Wang, Weinan Zhang, Ying Wen

    Abstract: Large Language Models (LLMs) have shown promise as intelligent agents in interactive decision-making tasks. Traditional approaches often depend on meticulously designed prompts, high-quality examples, or additional reward models for in-context learning, supervised fine-tuning, or RLHF. Reinforcement learning (RL) presents a dynamic alternative for LLMs to overcome these dependencies by engaging di…

    Submitted 6 June, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

  47. arXiv:2402.01993  [pdf, other]

    hep-ex

    Measurement of the Electromagnetic Transition Form-factors in the decays $η'\rightarrowπ^+π^-l^+l^-$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (618 additional authors not shown)

    Abstract: With a sample of $(10087\pm44)\times10^{6}$ $J/ψ$ events accumulated with the BESIII detector, we analyze the decays $η'\rightarrowπ^+π^-l^+l^-(l=e,$ $μ)$ via the process $J/ψ\rightarrowγη'$. The branching fractions are measured to be $\mathcal{B}(η'\rightarrowπ^+π^-e^+e^-)=(2.45\pm0.02(\rm{stat.})\pm0.08(\rm{syst.})) \times10^{-3}$ and…

    Submitted 2 February, 2024; originally announced February 2024.

  48. Measurements of Normalized Differential Cross Sections of Inclusive $η$ Production in $e^{+}e^{-}$ Annihilation at Energy from 2.0000 to 3.6710 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, D. Anderle, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, et al. (641 additional authors not shown)

    Abstract: Using data samples collected with the BESIII detector operating at the BEPCII storage ring, the cross section of the inclusive process $e^{+}e^{-} \to η+ X$, normalized by the total cross section of $e^{+}e^{-} \to \text{hadrons}$, is measured at eight center-of-mass energy points from 2.0000 GeV to 3.6710 GeV. These are the first measurements with momentum dependence in this energy region. Our me…

    Submitted 15 July, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

    Comments: 9 pages, 2 figures

  49. arXiv:2401.16194  [pdf]

    cond-mat.mtrl-sci

    Macroscopic electro-optical modulation of solution-processed molybdenum disulfide

    Authors: Songwei Liu, Yingyi Wen, Jingfang Pei, Xiaoyue Fan, Yongheng Zhou, Yang Liu, Ling-Kiu Ng, Yue Lin, Teng Ma, Panpan Zhang, Xiaolong Chen, Gang Wang, Guohua Hu

    Abstract: Molybdenum disulfide (MoS2) has drawn great interest for tunable photonics and optoelectronics advancement. Its solution processing, though scalable, results in randomly networked ensembles of discrete nanosheets with compromised properties for tunable device fabrication. Here, we show via density-functional theory calculations that the electronic structure of the individual solution-processed nan…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: Manuscript 14 pages, 5 figures. Supplementary Materials 10 pages 5 figures

  50. arXiv:2401.13225  [pdf, ps, other]

    hep-ex

    A New Look at the Scalar Meson $f_0(500)$ via $D^+\to π^+π^-\ell^+ν_\ell$ Decays

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, R. Aliberti, A. Amoroso, Y. Bai, O. Bakina, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann, H. Cai, X. Cai, et al. (615 additional authors not shown)

    Abstract: Using $2.93~\mathrm{fb}^{-1}$ of $e^+e^-$ collision data collected with the BESIII detector at the center-of-mass energy of 3.773 GeV, we investigate the semileptonic decays $D^+\to π^+π^- \ell^+ν_\ell$ ($\ell=e$ and $μ$). The $D^+\to f_0(500)μ^+ν_μ$ decay is observed for the first time. By analyzing simultaneously the differential decay rates of $D^+\to f_0(500) μ^+ν_μ$ and…

    Submitted 4 February, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Comments: Supplemental Materials added in this version

    Report number: BAM-00660