
Showing 151–200 of 3,069 results for author: Chen, T

  1. arXiv:2406.08155  [pdf, other]

    cs.LG cs.AI cs.CL

    Examining Post-Training Quantization for Mixture-of-Experts: A Benchmark

    Authors: Pingzhi Li, Xiaolong Jin, Yu Cheng, Tianlong Chen

    Abstract: Large Language Models (LLMs) have become foundational in the realm of natural language processing, demonstrating performance improvements as model sizes increase. The Mixture-of-Experts (MoE) approach offers a promising way to scale LLMs more efficiently by using fewer computational FLOPs through sparse activation. However, it suffers from significant memory overheads, necessitating model compress…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Our code for reproducing all our experiments is provided at https://github.com/UNITES-Lab/moe-quantization

  2. arXiv:2406.07885  [pdf, other]

    cs.LG

    GENIU: A Restricted Data Access Unlearning for Imbalanced Data

    Authors: Chenhao Zhang, Shaofei Shen, Yawen Zhao, Weitong Tony Chen, Miao Xu

    Abstract: With the increasing emphasis on data privacy, the significance of machine unlearning has grown substantially. Class unlearning, which involves enabling a trained model to forget data belonging to a specific class learned before, is important as classification tasks account for the majority of today's machine learning as a service (MLaaS). Retraining the model on the original data, excluding the da…

    Submitted 12 June, 2024; originally announced June 2024.

  3. arXiv:2406.07842  [pdf, other]

    eess.AS cs.CL

    Dual-Pipeline with Low-Rank Adaptation for New Language Integration in Multilingual ASR

    Authors: Yerbolat Khassanov, Zhipeng Chen, Tianfeng Chen, Tze Yuang Chong, Wei Li, Jun Zhang, Lu Lu, Yuxuan Wang

    Abstract: This paper addresses challenges in integrating new languages into a pre-trained multilingual automatic speech recognition (mASR) system, particularly in scenarios where training data for existing languages is limited or unavailable. The proposed method employs a dual-pipeline with low-rank adaptation (LoRA). It maintains two data flow pipelines-one for existing languages and another for new langua…

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: 5 pages, 2 figures, 4 tables

  4. arXiv:2406.07177  [pdf, other]

    cs.LG

    TernaryLLM: Ternarized Large Language Model

    Authors: Tianqi Chen, Zhe Li, Weixiang Xu, Zeyu Zhu, Dong Li, Lu Tian, Emad Barsoum, Peisong Wang, Jian Cheng

    Abstract: Large language models (LLMs) have achieved remarkable performance on Natural Language Processing (NLP) tasks, but they are hindered by high computational costs and memory requirements. Ternarization, an extreme form of quantization, offers a solution by reducing memory usage and enabling energy-efficient floating-point additions. However, applying ternarization to LLMs faces challenges stemming fr…

    Submitted 11 June, 2024; originally announced June 2024.

  5. arXiv:2406.06807  [pdf]

    cond-mat.mtrl-sci

    Additive engineering for Sb$_2$S$_3$ indoor photovoltaics with efficiency exceeding 17%

    Authors: Xiao Chen, Xiaoxuan Shu, Jiangcheng Zhou, Lei Wan, Peng Xiao, Yuchen Fu, Junzhi Ye, Yi-Teng Huang, Bin Yan, Dingjiang Xue, Tao Chen, Jiejie Chen, Robert L. Z. Hoye, Ru Zhou

    Abstract: Indoor photovoltaics (IPVs) have attracted increasing attention for sustainably powering Internet of Things (IoT) electronics. Sb$_2$S$_3$ is a promising IPV candidate material with a bandgap of ~1.75 eV, which is near the optimal value for indoor energy harvesting. However, the performance of Sb$_2$S$_3$ solar cells is limited by nonradiative recombination, closely associated with the poor-qualit…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 28 pages, 6 figures

  6. arXiv:2406.06523  [pdf, other]

    cs.CV

    NaRCan: Natural Refined Canonical Image with Integration of Diffusion Prior for Video Editing

    Authors: Ting-Hsuan Chen, Jiewen Chan, Hau-Shiang Shiu, Shih-Han Yen, Chang-Han Yeh, Yu-Lun Liu

    Abstract: We propose a video editing framework, NaRCan, which integrates a hybrid deformation field and diffusion prior to generate high-quality natural canonical images to represent the input video. Our approach utilizes homography to model global motion and employs multi-layer perceptrons (MLPs) to capture local residual deformations, enhancing the model's ability to handle complex video dynamics. By intr…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: Project page: https://koi953215.github.io/NaRCan_page/

  7. arXiv:2406.06375  [pdf, other]

    cs.SD cs.AI eess.AS

    MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing

    Authors: Yu-Fen Huang, Nikki Moran, Simon Coleman, Jon Kelly, Shun-Hwa Wei, Po-Yin Chen, Yun-Hsin Huang, Tsung-Ping Chen, Yu-Chia Kuo, Yu-Chi Wei, Chih-Hsuan Li, Da-Yu Huang, Hsuan-Kai Kao, Ting-Wei Lin, Li Su

    Abstract: In cross-modal music processing, translation between visual, auditory, and semantic content opens up new possibilities as well as challenges. The construction of such a transformative scheme depends upon a benchmark corpus with a comprehensive data infrastructure. In particular, the assembly of a large-scale cross-modal dataset presents major challenges. In this paper, we present the MOSA (Music m…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024. 14 pages, 7 figures. Dataset is available on: https://github.com/yufenhuang/MOSA-Music-mOtion-and-Semantic-Annotation-dataset/tree/main and https://zenodo.org/records/11393449

  8. arXiv:2406.06118  [pdf, other]

    hep-ex

    Strong and weak $CP$ tests in sequential decays of polarized $Σ^0$ hyperons

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (644 additional authors not shown)

    Abstract: The $J/ψ, ψ(3686) \to Σ^0 \barΣ^{0}$ processes and subsequent decays are studied using the world's largest $J/ψ$ and $ψ(3686)$ data samples collected with the BESIII detector. The strong-$CP$ symmetry is tested in the decays of the $Σ^0$ hyperons for the first time by measuring the decay parameters, $α_{Σ^0} = -0.0017 \pm 0.0021 \pm 0.0018$ and $\barα_{Σ^0} = 0.0021 \pm 0.0020 \pm 0.0022$. The wea…

    Submitted 16 July, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

  9. arXiv:2406.05827  [pdf, ps, other]

    hep-ex

    Measurement of the integrated luminosity of the data collected at 3.773 GeV by BESIII from 2021 to 2024

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (634 additional authors not shown)

    Abstract: We present a measurement of the integrated luminosity of $e^+e^-$ collision data collected with the BESIII detector at the BEPCII collider at a center-of-mass energy of $E_{\rm cm} = 3.773$ GeV. The integrated luminosities of the data sets taken from December 2021 to June 2022, from November 2022 to June 2023, and from October 2023 to February 2024 are determined to be $4.995 \pm 0.019$ fb$^{-1}$,…

    Submitted 9 June, 2024; originally announced June 2024.

  10. arXiv:2406.05397  [pdf, other]

    cs.SE

    Metamorphic Relation Generation: State of the Art and Visions for Future Research

    Authors: Rui Li, Huai Liu, Pak-Lok Poon, Dave Towey, Chang-Ai Sun, Zheng Zheng, Zhi Quan Zhou, Tsong Yueh Chen

    Abstract: Metamorphic testing has become one mainstream technique to address the notorious oracle problem in software testing, thanks to its great successes in revealing real-life bugs in a wide variety of software systems. Metamorphic relations, the core component of metamorphic testing, have continuously attracted research interests from both academia and industry. In the last decade, a rapidly increasing…

    Submitted 10 June, 2024; v1 submitted 8 June, 2024; originally announced June 2024.

    Comments: Accepted by International Workshop on Software Engineering in 2030

  11. arXiv:2406.04713  [pdf, other]

    cs.LG cond-mat.mtrl-sci cs.AI physics.comp-ph stat.ML

    FlowMM: Generating Materials with Riemannian Flow Matching

    Authors: Benjamin Kurt Miller, Ricky T. Q. Chen, Anuroop Sriram, Brandon M Wood

    Abstract: Crystalline materials are a fundamental component in next-generation technologies, yet modeling their distribution presents unique computational challenges. Of the plausible arrangements of atoms in a periodic lattice only a vanishingly small percentage are thermodynamically stable, which is a key indicator of the materials that can be experimentally realized. Two fundamental tasks in this area ar…

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: https://github.com/facebookresearch/flowmm

    Journal ref: ICML 2024

  12. arXiv:2406.03566  [pdf, other]

    cond-mat.str-el cond-mat.supr-con

    Quasi-two-dimensional Antiferromagnetic Spin Fluctuations in the Spin-triplet Superconductor Candidate CeRh$_2$As$_2$

    Authors: Tong Chen, Hasan Siddiquee, Zack Rehfuss, Shiyuan Gao, Chris Lygouras, Jack Drouin, Vincent Morano, Keenan E. Avers, Christopher J. Schmitt, Andrey Podlesnyak, Sheng Ran, Yu Song, Collin Broholm

    Abstract: The tetragonal heavy-fermion superconductor CeRh$_2$As$_2$ ($T_{\rm c}=0.3$ K) exhibits an exceptionally high critical field of 14 T for $\textbf{B} \parallel \textbf{c}$. It undergoes a field-driven first-order phase transition between superconducting (SC) states, potentially transitioning from spin-singlet to spin-triplet superconductivity. To elucidate the underlying pairing mechanism, we probe…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 7+6 pages, 4+4 figures

  13. arXiv:2406.03131  [pdf]

    physics.app-ph

    Comprehensive Measurement of Three-Dimensional Thermal Conductivity Tensor Using a Beam-Offset Square-Pulsed Source (BO-SPS) Approach

    Authors: Tao Chen, Shangzhi Song, Puqing Jiang

    Abstract: Accurately measuring the three-dimensional thermal conductivity tensor is essential for understanding and engineering the thermal behavior of anisotropic materials. Existing methods often struggle to isolate individual tensor elements, leading to large measurement uncertainties and time-consuming iterative fitting procedures. In this study, we introduce the Beam-Offset Square-Pulsed Source (BO-SPS…

    Submitted 5 June, 2024; originally announced June 2024.

  14. arXiv:2406.03051  [pdf, other]

    cs.CV

    Adapter-X: A Novel General Parameter-Efficient Fine-Tuning Framework for Vision

    Authors: Minglei Li, Peng Ye, Yongqi Huang, Lin Zhang, Tao Chen, Tong He, Jiayuan Fan, Wanli Ouyang

    Abstract: Parameter-efficient fine-tuning (PEFT) has become increasingly important as foundation models continue to grow in both popularity and size. Adapter has been particularly well-received due to their potential for parameter reduction and adaptability across diverse tasks. However, striking a balance between high efficiency and robust generalization across tasks remains a challenge for adapter-based m…

    Submitted 5 June, 2024; v1 submitted 5 June, 2024; originally announced June 2024.

  15. Measurements of the branching fractions of the $P$-wave charmonium spin-singlet state $h_c(^1P_1) \to h^+ h^-π^0/η$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (643 additional authors not shown)

    Abstract: Based on $(2712.4\pm 14.3)\times10^{6}$ $ψ(3686)$ events, we investigate four hadronic decay modes of the $P$-wave charmonium spin-singlet state $h_c(^1P_1) \to h^+ h^- π^0/η$ ($h=π$ or $K$) via the process $ψ(3686) \to π^{0}h_c$ at BESIII. The $h_c \to π^+ π^- π^0$ decay is observed with a significance of 9.6$σ$ after taking into account systematic uncertainties. Evidences for…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 9 pages, 7 figures

  16. arXiv:2406.02518  [pdf, other]

    cs.CV eess.IV

    DDGS-CT: Direction-Disentangled Gaussian Splatting for Realistic Volume Rendering

    Authors: Zhongpai Gao, Benjamin Planche, Meng Zheng, Xiao Chen, Terrence Chen, Ziyan Wu

    Abstract: Digitally reconstructed radiographs (DRRs) are simulated 2D X-ray images generated from 3D CT volumes, widely used in preoperative settings but limited in intraoperative applications due to computational bottlenecks, especially for accurate but heavy physics-based Monte Carlo methods. While analytical DRR renderers offer greater efficiency, they overlook anisotropic X-ray image formation phenomena…

    Submitted 4 June, 2024; originally announced June 2024.

  17. arXiv:2406.02468  [pdf, other]

    cs.CV

    DL-KDD: Dual-Light Knowledge Distillation for Action Recognition in the Dark

    Authors: Chi-Jui Chang, Oscar Tai-Yuan Chen, Vincent S. Tseng

    Abstract: Human action recognition in dark videos is a challenging task for computer vision. Recent research focuses on applying dark enhancement methods to improve the visibility of the video. However, such video processing results in the loss of critical information in the original (un-enhanced) video. Conversely, traditional two-stream methods are capable of learning information from both original and pr…

    Submitted 4 June, 2024; originally announced June 2024.

  18. arXiv:2406.02260  [pdf]

    cond-mat.mes-hall cond-mat.mtrl-sci

    Near-Room-Temperature Field-Controllable Exchange Bias in 2D van der Waals Ferromagnet Fe3GaTe2

    Authors: Jifeng Shao, Xiaolong Yin, Chunhao Bao, Sirong Lu, Xiaoming Ma, Shu Guo, Le Wang, Xi Zhang, Zhiyue Li, Longxiang Li, Yue Zhao, Tingyong Chen

    Abstract: Exchange bias (EB) is a cornerstone of modern magnetic memory and sensing technologies. Its extension to the realm of two-dimensional (2D) van der Waals (vdW) magnets holds promise for revolutionary advancements in miniaturized and efficient atomic spintronic devices. However, the blocking temperature of EB in 2D vdW magnets is currently well below room temperature ~130 K. This study reports a rob…

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 14 pages, 5 figures

  19. arXiv:2406.01645  [pdf, other]

    cs.LG cs.AI

    FNP: Fourier Neural Processes for Arbitrary-Resolution Data Assimilation

    Authors: Kun Chen, Tao Chen, Peng Ye, Hao Chen, Kang Chen, Tao Han, Wanli Ouyang, Lei Bai

    Abstract: Data assimilation is a vital component in modern global medium-range weather forecasting systems to obtain the best estimation of the atmospheric state by combining the short-term forecast and observations. Recently, AI-based data assimilation approaches have attracted increasing attention for their significant advantages over traditional techniques in terms of computational consumption. However,…

    Submitted 3 June, 2024; originally announced June 2024.

  20. arXiv:2406.01586  [pdf, other]

    cs.RO cs.AI

    ManiCM: Real-time 3D Diffusion Policy via Consistency Model for Robotic Manipulation

    Authors: Guanxing Lu, Zifeng Gao, Tianxing Chen, Wenxun Dai, Ziwei Wang, Yansong Tang

    Abstract: Diffusion models have been verified to be effective in generating complex distributions from natural images to motion trajectories. Recent diffusion-based methods show impressive performance in 3D robotic manipulation tasks, whereas they suffer from severe runtime inefficiency due to multiple denoising steps, especially with high-dimensional observations. To this end, we propose a real-time roboti…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: https://manicm-fast.github.io/

  21. arXiv:2406.01332  [pdf, ps, other]

    hep-ex

    Measurements of the branching fractions of semileptonic $D^{+}_s$ decays via $e^+e^-\to D_s^{*+}D_s^{*-}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (638 additional authors not shown)

    Abstract: We measure the absolute branching fractions of semileptonic $D^+_s$ decays via the $e^+e^-\to D_s^{*+}D_s^{*-}$ process using $e^+e^-$ collision data corresponding to an integrated luminosity of $10.64~\mathrm{fb}^{-1}$ collected by the BESIII detector at center-of-mass energies between 4.237 and 4.699 GeV. The branching fractions are…

    Submitted 4 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: 14 pages, 3 figures

  22. arXiv:2406.01125  [pdf, other]

    cs.CV

    $Δ$-DiT: A Training-Free Acceleration Method Tailored for Diffusion Transformers

    Authors: Pengtao Chen, Mingzhu Shen, Peng Ye, Jianjian Cao, Chongjun Tu, Christos-Savvas Bouganis, Yiren Zhao, Tao Chen

    Abstract: Diffusion models are widely recognized for generating high-quality and diverse images, but their poor real-time performance has led to numerous acceleration works, primarily focusing on UNet-based structures. With the more successful results achieved by diffusion transformers (DiT), there is still a lack of exploration regarding the impact of DiT structure on generation, as well as the absence of…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 12 pages, 6 figures, 6 tables

  23. arXiv:2406.00681  [pdf, other]

    cs.LG

    Learning Multimodal Behaviors from Scratch with Diffusion Policy Gradient

    Authors: Zechu Li, Rickmer Krohn, Tao Chen, Anurag Ajay, Pulkit Agrawal, Georgia Chalvatzaki

    Abstract: Deep reinforcement learning (RL) algorithms typically parameterize the policy as a deep network that outputs either a deterministic action or a stochastic one modeled as a Gaussian distribution, hence restricting learning to a single behavioral mode. Meanwhile, diffusion models emerged as a powerful framework for multimodal learning. However, the use of diffusion policies in online RL is hindered…

    Submitted 2 June, 2024; originally announced June 2024.

  24. arXiv:2406.00632  [pdf, other]

    cs.CV

    Diff-Mosaic: Augmenting Realistic Representations in Infrared Small Target Detection via Diffusion Prior

    Authors: Yukai Shi, Yupei Lin, Pengxu Wei, Xiaoyu Xian, Tianshui Chen, Liang Lin

    Abstract: Recently, researchers have proposed various deep learning methods to accurately detect infrared targets with the characteristics of indistinct shape and texture. Due to the limited variety of infrared datasets, training deep learning models with good generalization poses a challenge. To augment the infrared dataset, researchers employ data augmentation techniques, which often involve generating ne…

    Submitted 2 June, 2024; originally announced June 2024.

  25. arXiv:2406.00288  [pdf, other]

    cs.LG stat.ML

    Neural Optimal Transport with Lagrangian Costs

    Authors: Aram-Alexandre Pooladian, Carles Domingo-Enrich, Ricky T. Q. Chen, Brandon Amos

    Abstract: We investigate the optimal transport problem between probability measures when the underlying cost function is understood to satisfy a least action principle, also known as a Lagrangian cost. These generalizations are useful when connecting observations from a physical system where the transport dynamics are influenced by the geometry of the system, such as obstacles (e.g., incorporating barrier f…

    Submitted 31 May, 2024; originally announced June 2024.

    Comments: UAI 2024

  26. arXiv:2406.00059  [pdf, other]

    cs.CL cs.DC cs.LG

    Conveyor: Efficient Tool-aware LLM Serving with Tool Partial Execution

    Authors: Yechen Xu, Xinhao Kong, Tingjun Chen, Danyang Zhuo

    Abstract: The complexity of large language model (LLM) serving workloads has substantially increased due to the integration with external tool invocations, such as ChatGPT plugins. In this paper, we identify a new opportunity for efficient LLM serving for requests that trigger tools: tool partial execution alongside LLM decoding. To this end, we design Conveyor, an efficient LLM serving system optimized for…

    Submitted 4 June, 2024; v1 submitted 29 May, 2024; originally announced June 2024.

    Comments: 11 pages, 8 figures

  27. arXiv:2406.00046  [pdf, other]

    cs.CL cs.LG

    Hate Speech Detection with Generalizable Target-aware Fairness

    Authors: Tong Chen, Danny Wang, Xurong Liang, Marten Risius, Gianluca Demartini, Hongzhi Yin

    Abstract: To counter the side effect brought by the proliferation of social media platforms, hate speech detection (HSD) plays a vital role in halting the dissemination of toxic online posts at an early stage. However, given the ubiquitous topical communities on social media, a trained HSD classifier easily becomes biased towards specific targeted groups (e.g., female and black people), where a high rate of…

    Submitted 11 June, 2024; v1 submitted 28 May, 2024; originally announced June 2024.

    Comments: To appear in KDD 2024

  28. Simultaneous Measurement of Thermal Conductivity and Heat Capacity Across Diverse Materials Using the Square-Pulsed Source (SPS) Technique

    Authors: Tao Chen, Shangzhi Song, Yang Shen, Kexin Zhang, Puqing Jiang

    Abstract: State-of-the-art techniques like dual-frequency Time-Domain Thermoreflectance (TDTR) and Frequency-Domain Thermoreflectance (FDTR) offer superb capability for simultaneous measurements of thermal conductivity and heat capacity with a spatial resolution on the order of 10 μm. However, their applicability is limited to highly conductive materials with an in-plane thermal conductivity exceeding 10 W/…

    Submitted 31 May, 2024; originally announced May 2024.

  29. arXiv:2405.20853  [pdf, other]

    cs.CV

    MeshXL: Neural Coordinate Field for Generative 3D Foundation Models

    Authors: Sijin Chen, Xin Chen, Anqi Pang, Xianfang Zeng, Wei Cheng, Yijun Fu, Fukun Yin, Yanru Wang, Zhibin Wang, Chi Zhang, Jingyi Yu, Gang Yu, Bin Fu, Tao Chen

    Abstract: The polygon mesh representation of 3D data exhibits great flexibility, fast rendering speed, and storage efficiency, which is widely preferred in various applications. However, given its unstructured graph representation, the direct generation of high-fidelity 3D meshes is challenging. Fortunately, with a pre-defined ordering strategy, 3D meshes can be represented as sequences, and the generation…

    Submitted 18 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

  30. arXiv:2405.20676  [pdf, other]

    hep-ex

    Search for $e^{+}e^{-}\toη'ψ(2S)$ at center-of-mass energies from 4.66 to 4.95 GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (638 additional authors not shown)

    Abstract: Using data samples with an integrated luminosity of $4.67~\mathrm{fb}^{-1}$ collected by the BESIII detector operating at the BEPCII collider, we search for the process $e^+e^- \rightarrow η' ψ(2S)$ at center-of-mass energies from $4.66$ to $4.95~\mathrm{GeV}$. No significant signal is observed, and upper limits for the Born cross sections $σ^B(e^+e^-\rightarrowη'ψ(2S))$ at the 90% confidence lev…

    Submitted 12 August, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

  31. Study of the decays $χ_{cJ} \rightarrow Λ\barΛφ$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, et al. (637 additional authors not shown)

    Abstract: Based on $(2712.4 \pm 14.3) \times 10^{6}$ $ e^{+}e^{-}\toψ(3686)$ events collected with the BESIII detector operating at the BEPCII collider, we report the first evidence of $χ_{c0}\to Λ\bar Λφ$ decays and the first observation of $χ_{c1,2}\to Λ\bar Λφ$ decays, with significances of $4.5σ$, $11.3σ$ and $13.0σ$, respectively. The decay branching fractions of $χ_{c0,1,2}\to Λ\bar Λφ$ are measured t…

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 10 pages, 9 figures

    Journal ref: Phys. Rev. D 110, 032016 (2024)

  32. arXiv:2405.20617  [pdf, other]

    eess.SP

    Large-scale Outdoor Cell-free mMIMO Channel Measurement in an Urban Scenario at 3.5 GHz

    Authors: Yuning Zhang, Thomas Choi, Zihang Cheng, Issei Kanno, Masaaki Ito, Jorge Gomez-Ponce, Hussein Hammoud, Bowei Wu, Ashwani Pradhan, Kelvin Arana, Pramod Krishna, Tianyi Yang, Tyler Chen, Ishita Vasishtha, Haoyu Xie, Linyu Sun, Andreas F. Molisch

    Abstract: The design of cell-free massive MIMO (CF-mMIMO) systems requires accurate, measurement-based channel models. This paper provides the first results from the by far most extensive outdoor measurement campaign for CF-mMIMO channels in an urban environment. We measured impulse responses between over 20,000 potential access point (AP) locations and 80 user equipments (UEs) at 3.5 GHz with 350 MHz bandw…

    Submitted 6 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: Submitted to: VTC 2024-Fall

  33. arXiv:2405.19804  [pdf]

    cs.LG

    Exploring Key Factors for Long-Term Vessel Incident Risk Prediction

    Authors: Tianyi Chen, Hua Wang, Yutong Cai, Maohan Liang, Qiang Meng

    Abstract: Factor analysis acts a pivotal role in enhancing maritime safety. Most previous studies conduct factor analysis within the framework of incident-related label prediction, where the developed models can be categorized into short-term and long-term prediction models. The long-term models offer a more strategic approach, enabling more proactive risk management, compared to the short-term ones. Nevert…

    Submitted 30 May, 2024; originally announced May 2024.

  34. arXiv:2405.19690  [pdf, other]

    cs.LG cs.AI

    Diffusion Policies creating a Trust Region for Offline Reinforcement Learning

    Authors: Tianyu Chen, Zhendong Wang, Mingyuan Zhou

    Abstract: Offline reinforcement learning (RL) leverages pre-collected datasets to train optimal policies. Diffusion Q-Learning (DQL), introducing diffusion models as a powerful and expressive policy class, significantly boosts the performance of offline RL. However, its reliance on iterative denoising sampling to generate actions slows down both training and inference. While several recent attempts have tri…

    Submitted 31 May, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

  35. arXiv:2405.19326  [pdf, other]

    cs.CV cs.GR cs.HC

    Reasoning3D -- Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation via Large Vision-Language Models

    Authors: Tianrun Chen, Chunan Yu, Jing Li, Jianqi Zhang, Lanyun Zhu, Deyi Ji, Yong Zhang, Ying Zang, Zejian Li, Lingyun Sun

    Abstract: In this paper, we introduce a new task: Zero-Shot 3D Reasoning Segmentation for parts searching and localization for objects, which is a new paradigm to 3D segmentation that transcends limitations for previous category-specific 3D semantic segmentation, 3D instance segmentation, and open-vocabulary 3D segmentation. We design a simple baseline method, Reasoning3D, with the capability to understand…

    Submitted 29 May, 2024; originally announced May 2024.

  36. arXiv:2405.17630  [pdf, other]

    astro-ph.HE

    HERMES: Gamma Ray Burst and Gravitational Wave counterpart hunter

    Authors: G. Ghirlanda, L. Nava, O. Salafia, F. Fiore, R. Campana, R. Salvaterra, A. Sanna, W. Leone, Y. Evangelista, G. Dilillo, S. Puccetti, A. Santangelo, M. Trenti, A. Guzmán, P. Hedderman, G. Amelino-Camelia, M. Barbera, G. Baroni, M. Bechini, P. Bellutti, G. Bertuccio, G. Borghi, A. Brandonisio, L. Burderi, C. Cabras, et al. (45 additional authors not shown)

    Abstract: Gamma Ray Bursts (GRBs) bridge relativistic astrophysics and multi-messenger astronomy. Space-based gamma/X-ray wide field detectors have proven essential to detect and localize the highly variable GRB prompt emission, which is also a counterpart of gravitational wave events. We study the capabilities to detect long and short GRBs by the High Energy Rapid Modular Ensemble of Satellites (HERMES) Pa…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 13 pages, 6 figures, 4 tables. Accepted for publication by Astronomy & Astrophysics

  37. arXiv:2405.17505  [pdf, other]

    cs.LG cs.CL

    Predicting Rental Price of Lane Houses in Shanghai with Machine Learning Methods and Large Language Models

    Authors: Tingting Chen, Shijing Si

    Abstract: Housing has emerged as a crucial concern among young individuals residing in major cities, including Shanghai. Given the unprecedented surge in property prices in this metropolis, young people have increasingly resorted to the rental market to address their housing needs. This study utilizes five traditional machine learning methods: multiple linear regression (MLR), ridge regression (RR), lasso r…

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 13 pages, 11 figures, 39 references

  38. arXiv:2405.17461  [pdf, other]

    cs.LG cs.CV

    EMR-Merging: Tuning-Free High-Performance Model Merging

    Authors: Chenyu Huang, Peng Ye, Tao Chen, Tong He, Xiangyu Yue, Wanli Ouyang

    Abstract: The success of pretrain-finetune paradigm brings about the release of numerous model weights. In this case, merging models finetuned on different tasks to enable a single model with multi-task capabilities is gaining increasing attention for its practicability. Existing model merging methods usually suffer from (1) significant performance degradation or (2) requiring tuning by additional data or t…

    Submitted 23 May, 2024; originally announced May 2024.

  39. arXiv:2405.17003  [pdf, other]

    cs.LG

    Graph Condensation for Open-World Graph Learning

    Authors: Xinyi Gao, Tong Chen, Wentao Zhang, Yayong Li, Xiangguo Sun, Hongzhi Yin

    Abstract: The burgeoning volume of graph data presents significant computational challenges in training graph neural networks (GNNs), critically impeding their efficiency in various applications. To tackle this challenge, graph condensation (GC) has emerged as a promising acceleration solution, focusing on the synthesis of a compact yet representative graph for efficiently training GNNs while retaining perf…

    Submitted 12 June, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

    Comments: Accepted by KDD 2024

  40. arXiv:2405.16749  [pdf, other]

    cs.LG cs.CV

    DMPlug: A Plug-in Method for Solving Inverse Problems with Diffusion Models

    Authors: Hengkang Wang, Xu Zhang, Taihui Li, Yuxiang Wan, Tiancong Chen, Ju Sun

    Abstract: Pretrained diffusion models (DMs) have recently been popularly used in solving inverse problems (IPs). The existing methods mostly interleave iterative steps in the reverse diffusion process and iterative steps to bring the iterates closer to satisfying the measurement constraint. However, such interleaving methods struggle to produce final results that look like natural objects of interest (i.e.,…

    Submitted 26 May, 2024; originally announced May 2024.

  41. arXiv:2405.16381  [pdf, other]

    cs.LG cs.AI stat.ML

    Trivialized Momentum Facilitates Diffusion Generative Modeling on Lie Groups

    Authors: Yuchen Zhu, Tianrong Chen, Lingkai Kong, Evangelos A. Theodorou, Molei Tao

    Abstract: The generative modeling of data on manifold is an important task, for which diffusion models in flat spaces typically need nontrivial adaptations. This article demonstrates how a technique called 'trivialization' can transfer the effectiveness of diffusion models in Euclidean spaces to Lie groups. In particular, an auxiliary momentum variable was algorithmically introduced to help transport the po…

    Submitted 25 May, 2024; originally announced May 2024.

  42. arXiv:2405.16263  [pdf, other]

    cs.CV cs.AI

    Assessing Image Inpainting via Re-Inpainting Self-Consistency Evaluation

    Authors: Tianyi Chen, Jianfu Zhang, Yan Hong, Yiyi Zhang, Liqing Zhang

    Abstract: Image inpainting, the task of reconstructing missing segments in corrupted images using available data, faces challenges in ensuring consistency and fidelity, especially under information-scarce conditions. Traditional evaluation methods, heavily dependent on the existence of unmasked reference images, inherently favor certain inpainting outcomes, introducing biases. Addressing this issue, we intr…

    Submitted 25 May, 2024; originally announced May 2024.

  43. arXiv:2405.16240  [pdf, other]

    cs.LG

    Analytic Federated Learning

    Authors: Huiping Zhuang, Run He, Kai Tong, Di Fang, Han Sun, Haoran Li, Tianyi Chen, Ziqian Zeng

    Abstract: In this paper, we introduce analytic federated learning (AFL), a new training paradigm that brings analytical (i.e., closed-form) solutions to the federated learning (FL) community. Our AFL draws inspiration from analytic learning -- a gradient-free technique that trains neural networks with analytical solutions in one epoch. In the local client training stage, the AFL facilitates a one-epoch trai…

    Submitted 25 May, 2024; originally announced May 2024.

  44. arXiv:2405.15920  [pdf, other]

    cs.LG stat.ML

    SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning

    Authors: Shuai Zhang, Heshan Devaka Fernando, Miao Liu, Keerthiram Murugesan, Songtao Lu, Pin-Yu Chen, Tianyi Chen, Meng Wang

    Abstract: This paper studies the transfer reinforcement learning (RL) problem where multiple RL problems have different reward functions but share the same underlying transition dynamics. In this setting, the Q-function of each RL problem (task) can be decomposed into a successor feature (SF) and a reward mapping: the former characterizes the transition dynamics, and the latter characterizes the task-specif…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2310.16173

  45. arXiv:2405.15008  [pdf, other]

    cs.SE cs.DB

    An Empirical Study on the Characteristics of Database Access Bugs in Java Applications

    Authors: Wei Liu, Shouvick Mondal, Tse-Hsun Chen

    Abstract: Database-backed applications rely on the database access code to interact with the underlying database management systems (DBMSs). Although many prior studies aim at database access issues like SQL anti-patterns or SQL code smells, there is a lack of study of database access bugs during the maintenance of database-backed applications. In this paper, we empirically investigate 423 database access b…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Accepted by the ACM Transactions on Software Engineering and Methodology (TOSEM)

  46. arXiv:2405.15005  [pdf, other]

    cs.RO

    ReachBot Field Tests in a Mojave Desert Lava Tube as a Martian Analog

    Authors: Tony G. Chen, Julia Di, Stephanie Newdick, Mathieu Lapotre, Marco Pavone, Mark R. Cutkosky

    Abstract: ReachBot is a robot concept for the planetary exploration of caves and lava tubes, which are often inaccessible with traditional robot locomotion methods. It uses extendable booms as appendages, with grippers mounted at the end, to grasp irregular rock surfaces and traverse these difficult terrains. We have built a partial ReachBot prototype consisting of a single boom and gripper, mounted on a tr…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: Accepted to the IEEE ICRA Workshop on Field Robotics 2024; 4 pages

  47. arXiv:2405.14260  [pdf, other]

    cs.LG cs.AI

    Graph Sparsification via Mixture of Graphs

    Authors: Guibin Zhang, Xiangguo Sun, Yanwei Yue, Kun Wang, Tianlong Chen, Shirui Pan

    Abstract: Graph Neural Networks (GNNs) have demonstrated superior performance across various graph learning tasks but face significant computational challenges when applied to large-scale graphs. One effective approach to mitigate these challenges is graph sparsification, which involves removing non-essential edges to reduce computational overhead. However, previous graph sparsification methods often rely o…

    Submitted 23 May, 2024; originally announced May 2024.

  48. arXiv:2405.13811  [pdf, other]

    cs.IR

    Diffusion-Based Cloud-Edge-Device Collaborative Learning for Next POI Recommendations

    Authors: Jing Long, Guanhua Ye, Tong Chen, Yang Wang, Meng Wang, Hongzhi Yin

    Abstract: The rapid expansion of Location-Based Social Networks (LBSNs) has highlighted the importance of effective next Point-of-Interest (POI) recommendations, which leverage historical check-in data to predict users' next POIs to visit. Traditional centralized deep neural networks (DNNs) offer impressive POI recommendation performance but face challenges due to privacy concerns and limited timeliness. In…

    Submitted 22 May, 2024; originally announced May 2024.

  49. arXiv:2405.13707  [pdf, other]

    cs.LG cs.AI

    Rethinking and Accelerating Graph Condensation: A Training-Free Approach with Class Partition

    Authors: Xinyi Gao, Tong Chen, Wentao Zhang, Junliang Yu, Guanhua Ye, Quoc Viet Hung Nguyen, Hongzhi Yin

    Abstract: The increasing prevalence of large-scale graphs poses a significant challenge for graph neural network training, attributed to their substantial computational requirements. In response, graph condensation (GC) emerges as a promising data-centric solution aiming to substitute the large graph with a small yet informative condensed graph to facilitate data-efficient GNN training. However, existing GC…

    Submitted 22 May, 2024; originally announced May 2024.

  50. arXiv:2405.13596  [pdf, other]

    astro-ph.HE astro-ph.SR

    SN 2023zaw: the low-energy explosion of an ultra-stripped star, with non-radioactive heating

    Authors: Thomas Moore, James Gillanders, Matt Nicholl, Mark Huber, Stephen Smartt, Shubham Srivastav, Heloise Stevance, Ting-Wan Chen, Kenneth Chambers, Joseph Anderson, Michael Fulton, Samantha Oates, Charlotte Angus, Giuliano Pignata, Nicolas Erasmus, Hua Gao, Joanna Bulger, Chien-Cheng Lin, Thomas Lowe, Eugene Magnier, Paloma Minguez, Chow-Choong Ngeow, Xinyue Sheng, Stuart A. Sim, Ken Smith, et al. (4 additional authors not shown)

    Abstract: Most stripped envelope supernova progenitors are formed through binary interaction, losing hydrogen and/or helium from their outer layers. An emerging class of supernovae with the highest degree of envelope-stripping are thought to be the product of stripping by a NS companion. However, relatively few examples are known and the outcomes of such systems can be diverse and are poorly understood at p…

    Submitted 22 May, 2024; originally announced May 2024.