Showing 101–150 of 5,538 results for author: Xue, J

  1. arXiv:2407.17033  [pdf, other]

    cs.LG cs.AI stat.ML

    Sparse Inducing Points in Deep Gaussian Processes: Enhancing Modeling with Denoising Diffusion Variational Inference

    Authors: Jian Xu, Delu Zeng, John Paisley

    Abstract: Deep Gaussian processes (DGPs) provide a robust paradigm for Bayesian deep learning. In DGPs, a set of sparse integration locations called inducing points are selected to approximate the posterior distribution of the model. This is done to reduce computational complexity and improve model efficiency. However, inferring the posterior distribution of inducing points is not straightforward. Tradition…

    Submitted 24 July, 2024; originally announced July 2024.

  2. arXiv:2407.16560  [pdf, other]

    cs.CV cs.DC

    COALA: A Practical and Vision-Centric Federated Learning Platform

    Authors: Weiming Zhuang, Jian Xu, Chen Chen, Jingtao Li, Lingjuan Lyu

    Abstract: We present COALA, a vision-centric Federated Learning (FL) platform, and a suite of benchmarks for practical FL scenarios, which we categorize into three levels: task, data, and model. At the task level, COALA extends support from simple classification to 15 computer vision tasks, including object detection, segmentation, pose estimation, and more. It also facilitates federated multiple-task learn…

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: ICML'24

  3. arXiv:2407.16266  [pdf, other]

    cs.CL

    Beyond Binary Gender: Evaluating Gender-Inclusive Machine Translation with Ambiguous Attitude Words

    Authors: Yijie Chen, Yijin Liu, Fandong Meng, Jinan Xu, Yufeng Chen, Jie Zhou

    Abstract: Gender bias has been a focal point in the study of bias in machine translation and language models. Existing machine translation gender bias evaluations are primarily focused on male and female genders, limiting the scope of the evaluation. To assess gender bias accurately, these studies often rely on calculating the accuracy of gender pronouns or the masculine and feminine attributes of grammatic…

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: The code is publicly available at https://github.com/pppa2019/ambGIMT

  4. arXiv:2407.15467  [pdf, other]

    astro-ph.GA

    FASHI: A blind survey of 21cm HI absorption galaxies with FAST

    Authors: Chuan-Peng Zhang, Ming Zhu, Peng Jiang, Cheng Cheng, Jin-Long Xu, Nai-Ping Yu, Xiao-Lan Liu, Bo Zhang

    Abstract: The FAST All Sky HI survey (FASHI) is broader in frequency band and deeper in detection sensitivity than most of previous HI surveys. FASHI is designed to cover the entire sky observable by the Five-hundred-meter Aperture Spherical radio Telescope (FAST). Based on the FASHI data, we perform a blind survey of 21cm HI absorption galaxies at redshift $z<0.09$ over an area of about 10000 square degree…

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: 32 pages, 7 figures, 2 tables, submitted to ApJS on 05/24/2024; comments are welcome

  5. arXiv:2407.15309  [pdf, other]

    cs.DC cs.LG

    vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving

    Authors: Jiale Xu, Rui Zhang, Cong Guo, Weiming Hu, Zihan Liu, Feiyang Wu, Yu Feng, Shixuan Sun, Changxu Shao, Yuhong Guo, Junping Zhao, Ke Zhang, Minyi Guo, Jingwen Leng

    Abstract: Large Language Models (LLMs) are widely used across various domains, processing millions of daily requests. This surge in demand poses significant challenges in optimizing throughput and latency while keeping costs manageable. The Key-Value (KV) cache, a standard method for retaining previous computations, makes LLM inference highly bounded by memory. While batching strategies can enhance performa…

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: 16 pages, 12 figures

  6. arXiv:2407.15171  [pdf, other]

    cs.CV

    Assessing Sample Quality via the Latent Space of Generative Models

    Authors: Jingyi Xu, Hieu Le, Dimitris Samaras

    Abstract: Advances in generative models increase the need for sample quality assessment. To do so, previous methods rely on a pre-trained feature extractor to embed the generated samples and real samples into a common space for comparison. However, different feature extractors might lead to inconsistent assessment outcomes. Moreover, these methods are not applicable for domains where a robust, universal fea…

    Submitted 21 July, 2024; originally announced July 2024.

    Comments: Accepted paper - ECCV 2024

  7. arXiv:2407.14697  [pdf, ps, other]

    nucl-ex

    Single-proton removal reaction in the IQMD+GEMINI model benchmarked by elemental fragmentation cross sections of $^{29-33}\mathrm{Si}$ on carbon at $\sim$230~MeV/nucleon

    Authors: Guang-Shuai Li, Jun Su, Satoru Terashima, Jian-Wei Zhao, Er-Xi Xiao, Ji-Chao Zhang, Liu-Chun He, Ge Guo, Wei-Ping Lin, Wen-Jian Lin, Chuan-Ye Liu, Chen-Gui Lu, Bo Mei, Dan-Yang Pang, Ye-Lei Sun, Zhi-Yu Sun, Meng Wang, Feng Wang, Jing Wang, Shi-Tao Wang, Xiu-Lin Wei, Xiao-Dong Xu, Jun-Yao Xu, Li-Hua Zhu, Yong Zheng , et al. (2 additional authors not shown)

    Abstract: We report on the first measurement of the elemental fragmentation cross sections (EFCSs) of $^{29-33}\mathrm{Si}$ on a carbon target at $\sim$230~MeV/nucleon. The experimental data covering charge changes of $ΔZ$ = 1-4 are reproduced well by the isospin-dependent quantum molecular dynamics (IQMD) coupled with the evaporation GEMINI (IQMD+GEMINI) model. We further explore the mechanisms underlying…

    Submitted 19 July, 2024; originally announced July 2024.

    Comments: 7 pages, 4 figures

  8. arXiv:2407.14497  [pdf, other]

    quant-ph

    Observable-Driven Speed-ups in Quantum Simulations

    Authors: Wenjun Yu, Jue Xu, Qi Zhao

    Abstract: As quantum technology advances, quantum simulation becomes increasingly promising, with significant implications for quantum many-body physics and quantum chemistry. Despite being one of the most accessible simulation methods, the product formula encounters challenges due to the pessimistic gate count estimation. In this work, we elucidate how observable knowledge can accelerate quantum simulation…

    Submitted 19 July, 2024; originally announced July 2024.

    Comments: 37 pages, 5 figures

  9. arXiv:2407.14301  [pdf, other]

    hep-ex

    Observation of exotic $J/ψφ$ resonances in diffractive processes in proton-proton collisions

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1068 additional authors not shown)

    Abstract: The first study of $J/ψφ$ production in diffractive processes in proton-proton collisions is presented. The study is based on an LHCb dataset recorded at centre-of-mass energy of 13 TeV, corresponding to an integrated luminosity of 5 fb$^{-1}$. The data disfavour a nonresonant $J/ψφ$ production but are consistent with a resonant model including several resonant states observed previously only in…

    Submitted 19 July, 2024; originally announced July 2024.

    Comments: All figures and tables, along with any supplementary material and additional information, are available at: https://lhcbproject.web.cern.ch/Publications/LHCbProjectPublic/LHCb-PAPER-2023-043.html (LHCb public pages)

    Report number: LHCb-PAPER-2023-043, CERN-EP-2024-149

  10. arXiv:2407.14261  [pdf, other]

    hep-ex

    Study of charmonium production via the decay to $p\bar{p}$ at $\sqrt{s} = 13 TeV$

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1060 additional authors not shown)

    Abstract: Charmonium production cross-section in proton-proton collisions is measured at the centre-of-mass energy $\sqrt{s}=13\,TeV$ using decays to $p\bar{p}$ final state. The study is performed using a data sample corresponding to an integrated luminosity of $2.2\,{fb}^{-1}$ collected in 2018 with the $LHCb$ detector. The production cross-section of the $η_c$ meson is measured in a rapidity range of…

    Submitted 19 July, 2024; originally announced July 2024.

    Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2024-004.html (LHCb public pages)

    Report number: LHCb-PAPER-2024-004, CERN-EP-2024-165

  11. arXiv:2407.14146  [pdf, other]

    cs.MM

    Fine-grained Knowledge Graph-driven Video-Language Learning for Action Recognition

    Authors: Rui Zhang, Yafen Lu, Pengli Ji, Junxiao Xue, Xiaoran Yan

    Abstract: Recent work has explored video action recognition as a video-text matching problem and several effective methods have been proposed based on large-scale pre-trained vision-language models. However, these approaches primarily operate at a coarse-grained level without the detailed and semantic understanding of action concepts by exploiting fine-grained semantic connections between actions and body m…

    Submitted 19 July, 2024; originally announced July 2024.

  12. arXiv:2407.13992  [pdf, other]

    eess.IV

    Semantic Communications for 3D Human Face Transmission with Neural Radiance Fields

    Authors: Guanlin Wu, Zhonghao Lyu, Juyong Zhang, Jie Xu

    Abstract: This paper investigates the transmission of three-dimensional (3D) human face content for immersive communication over a rate-constrained transmitter-receiver link. We propose a new framework named NeRF-SeCom, which leverages neural radiance fields (NeRF) and semantic communications to improve the quality of 3D visualizations while minimizing the communication overhead. In the NeRF-SeCom framework…

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: 6 pages, 4 figures. arXiv admin note: text overlap with arXiv:2405.12155

  13. arXiv:2407.13989  [pdf, other]

    cs.LG cs.AI

    Enhancing Data-Limited Graph Neural Networks by Actively Distilling Knowledge from Large Language Models

    Authors: Quan Li, Tianxiang Zhao, Lingwei Chen, Junjie Xu, Suhang Wang

    Abstract: Graphs are pervasive in the real-world, such as social network analysis, bioinformatics, and knowledge graphs. Graph neural networks (GNNs) have great ability in node classification, a fundamental task on graphs. Unfortunately, conventional GNNs still face challenges in scenarios with few labeled nodes, despite the prevalence of few-shot node classification tasks in real-world applications. To add…

    Submitted 28 August, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: 10 pages, 3 figures

  14. A New Tightly-Coupled Dual-VIO for a Mobile Manipulator With Dynamic Locomotion

    Authors: Jianxiang Xu, Soo Jeon

    Abstract: This paper introduces a new dual monocular visual-inertial odometry (dual-VIO) strategy for a mobile manipulator operating under dynamic locomotion, i.e. coordinated movement involving both the base platform and the manipulator arm. Our approach has been motivated by challenges arising from inaccurate estimation due to coupled excitation when the mobile manipulator is engaged in dynamic locomotion…

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: 8 pages

    Journal ref: IEEE/ASME Transactions on Mechatronics (2024)

  15. arXiv:2407.13687  [pdf]

    q-fin.TR cs.LG

    Dynamic Pricing in Securities Lending Market: Application in Revenue Optimization for an Agent Lender Portfolio

    Authors: Jing Xu, Yung-Cheng Hsu, William Biscarri

    Abstract: Securities lending is an important part of the financial market structure, where agent lenders help long term institutional investors to lend out their securities to short sellers in exchange for a lending fee. Agent lenders within the market seek to optimize revenue by lending out securities at the highest rate possible. Typically, this rate is set by hard-coded business rules or standard supervi…

    Submitted 19 July, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: 7 pages, 8 figures

  16. arXiv:2407.13675  [pdf, other]

    cs.CV

    MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis

    Authors: Ziming Zhong, Yanxu Xu, Jing Li, Jiale Xu, Zhengxin Li, Chaohui Yu, Shenghua Gao

    Abstract: We present MeshSegmenter, a simple yet effective framework designed for zero-shot 3D semantic segmentation. This model successfully extends the powerful capabilities of 2D segmentation models to 3D meshes, delivering accurate 3D segmentation across diverse meshes and segment descriptions. Specifically, our model leverages the Segment Anything Model (SAM) model to segment the target regions from im…

    Submitted 25 July, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: The paper was accepted by ECCV 2024

  17. arXiv:2407.13220  [pdf, other]

    eess.AS cs.SD

    MEDIC: Zero-shot Music Editing with Disentangled Inversion Control

    Authors: Huadai Liu, Jialei Wang, Rongjie Huang, Yang Liu, Jiayang Xu, Zhou Zhao

    Abstract: Text-guided diffusion models catalyze a paradigm shift in audio generation, facilitating the adaptability of source audio to conform to specific textual prompts. Recent advancements introduce inversion techniques, like DDIM inversion, to zero-shot editing, exploiting pre-trained diffusion models for audio modification. Nonetheless, our investigation exposes that DDIM inversion suffers from an accu…

    Submitted 20 August, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

  18. arXiv:2407.13195  [pdf, other]

    cs.LG cs.AI cs.HC cs.IT stat.ML

    Adaptive Foundation Models for Online Decisions: HyperAgent with Fast Incremental Uncertainty Estimation

    Authors: Yingru Li, Jiawei Xu, Zhi-Quan Luo

    Abstract: Foundation models often struggle with uncertainty when faced with new situations in online decision-making, necessitating scalable and efficient exploration to resolve this uncertainty. We introduce GPT-HyperAgent, an augmentation of GPT with HyperAgent for uncertainty-aware, scalable exploration in contextual bandits, a fundamental online decision problem involving natural language input. We prov…

    Submitted 21 July, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: 43 pages. Presentation at ICML 2024 Workshops: (1) Aligning Reinforcement Learning Experimentalists and Theorists; (2) Automated Reinforcement Learning: Exploring Meta-Learning, AutoML, and LLMs

  19. arXiv:2407.13193  [pdf, other]

    cs.CL

    Retrieval-Augmented Generation for Natural Language Processing: A Survey

    Authors: Shangyu Wu, Ying Xiong, Yufei Cui, Haolun Wu, Can Chen, Ye Yuan, Lianming Huang, Xue Liu, Tei-Wei Kuo, Nan Guan, Chun Jason Xue

    Abstract: Large language models (LLMs) have demonstrated great success in various fields, benefiting from their huge amount of parameters that store knowledge. However, LLMs still suffer from several key issues, such as hallucination problems, knowledge update issues, and lacking domain-specific expertise. The appearance of retrieval-augmented generation (RAG), which leverages an external knowledge database…

    Submitted 18 July, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

  20. arXiv:2407.13147  [pdf, other]

    cs.CV

    DFMSD: Dual Feature Masking Stage-wise Knowledge Distillation for Object Detection

    Authors: Zhourui Zhang, Jun Li, Zhijian Wu, Jifeng Shen, Jianhua Xu

    Abstract: In recent years, current mainstream feature masking distillation methods mainly function by reconstructing selectively masked regions of a student network from the feature maps of a teacher network. In these methods, attention mechanisms can help to identify spatially important regions and crucial object-aware channel clues, such that the reconstructed features are encoded with sufficient discrimi…

    Submitted 18 July, 2024; originally announced July 2024.

  21. arXiv:2407.12823  [pdf, other]

    cs.CL cs.AI

    WTU-EVAL: A Whether-or-Not Tool Usage Evaluation Benchmark for Large Language Models

    Authors: Kangyun Ning, Yisong Su, Xueqiang Lv, Yuanzhe Zhang, Jian Liu, Kang Liu, Jinan Xu

    Abstract: Although Large Language Models (LLMs) excel in NLP tasks, they still need external tools to extend their ability. Current research on tool learning with LLMs often assumes mandatory tool use, which does not always align with real-world situations, where the necessity for tools is uncertain, and incorrect or unnecessary use of tools can damage the general abilities of LLMs. Therefore, we propose to…

    Submitted 2 July, 2024; originally announced July 2024.

  22. arXiv:2407.12791  [pdf, other]

    cs.CL cs.AI

    TourLLM: Enhancing LLMs with Tourism Knowledge

    Authors: Qikai Wei, Mingzhi Yang, Jinqiang Wang, Wenwei Mao, Jiabo Xu, Huansheng Ning

    Abstract: Recently, large language models (LLMs) have demonstrated their effectiveness in various natural language processing (NLP) tasks. However, the lack of tourism knowledge limits the performance of LLMs in tourist attraction presentations and travel planning. To address this challenge, we constructed a supervised fine-tuning dataset for the culture and tourism domain, named Cultour. This dataset consi…

    Submitted 18 June, 2024; originally announced July 2024.

  23. arXiv:2407.12777  [pdf, other]

    cs.CV cs.GR

    Generalizable Human Gaussians for Sparse View Synthesis

    Authors: Youngjoong Kwon, Baole Fang, Yixing Lu, Haoye Dong, Cheng Zhang, Francisco Vicente Carrasco, Albert Mosella-Montoro, Jianjin Xu, Shingo Takagi, Daeil Kim, Aayush Prakash, Fernando De la Torre

    Abstract: Recent progress in neural rendering has brought forth pioneering methods, such as NeRF and Gaussian Splatting, which revolutionize view rendering across various domains like AR/VR, gaming, and content creation. While these methods excel at interpolating within the training data, the challenge of generalizing to new scenes and objects from very sparse views persists. Specifically, modeling 3D…

    Submitted 17 July, 2024; originally announced July 2024.

  24. arXiv:2407.12764  [pdf, other]

    cs.LG

    Jigsaw Game: Federated Clustering

    Authors: Jinxuan Xu, Hong-You Chen, Wei-Lun Chao, Yuqian Zhang

    Abstract: Federated learning has recently garnered significant attention, especially within the domain of supervised learning. However, despite the abundance of unlabeled data on end-users, unsupervised learning problems such as clustering in the federated setting remain underexplored. In this paper, we investigate the federated clustering problem, with a focus on federated k-means. We outline the challenge…

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: Accepted to TMLR

  25. arXiv:2407.12475  [pdf, other]

    hep-ex

    Amplitude analysis of $B^+ \to ψ(2S) K^+ π^+ π^-$ decays

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1092 additional authors not shown)

    Abstract: The first full amplitude analysis of $B^+ \to ψ(2S) K^+ π^+ π^-$ decays is performed using proton-proton collision data corresponding to an integrated luminosity of $9\,\text{fb}^{-1}$ recorded with the LHCb detector. The rich $K^+ π^+ π^-$ spectrum is studied and the branching fractions of the resonant substructure associated with the prominent $K_1(1270)^+$ contribution are measured. The data ca…

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2024-014.html (LHCb public pages)

    Report number: LHCb-PAPER-2024-014, CERN-EP-2024-177

  26. arXiv:2407.12342  [pdf, other]

    cs.CL

    Word Embedding Dimension Reduction via Weakly-Supervised Feature Selection

    Authors: Jintang Xue, Yun-Cheng Wang, Chengwei Wei, C. -C. Jay Kuo

    Abstract: As a fundamental task in natural language processing, word embedding converts each word into a representation in a vector space. A challenge with word embedding is that as the vocabulary grows, the vector space's dimension increases and it can lead to a vast model size. Storing and processing word vectors are resource-demanding, especially for mobile edge-devices applications. This paper explores…

    Submitted 17 July, 2024; originally announced July 2024.

  27. arXiv:2407.12270  [pdf, other]

    hep-ex

    Observation of $Λ_c^+ \to Λa_0(980)^+$ and Evidence for $Σ(1380)^+$ in $Λ_c^+ \to Λπ^+ η$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: Based on $6.1~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at center-of-mass energies from 4.600~GeV to 4.843~GeV with the BESIII detector at the BEPCII collider, a partial wave analysis of $Λ_c^+\toΛπ^+η$ is performed, and branching fractions and decay asymmetry parameters of intermediate processes are determined. The process $Λ_c^+\toΛa_0(980)^+$ is observed for the first time, and…

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 16 pages, 8 figures

  28. arXiv:2407.12117  [pdf, other]

    cs.LG cs.DC

    Efficiently Training 7B LLM with 1 Million Sequence Length on 8 GPUs

    Authors: Pinxue Zhao, Hailin Zhang, Fangcheng Fu, Xiaonan Nie, Qibin Liu, Fang Yang, Yuanbo Peng, Dian Jiao, Shuaipeng Li, Jinbao Xue, Yangyu Tao, Bin Cui

    Abstract: Nowadays, Large Language Models (LLMs) have been trained using extended context lengths to foster more creative applications. However, long context training poses great challenges considering the constraint of GPU memory. It not only leads to substantial activation memory consumption during training, but also incurs considerable memory fragmentation. To facilitate long context training, existing f…

    Submitted 16 July, 2024; originally announced July 2024.

  29. arXiv:2407.12023  [pdf, other]

    cs.CL cs.AI

    CMMaTH: A Chinese Multi-modal Math Skill Evaluation Benchmark for Foundation Models

    Authors: Zhong-Zhi Li, Ming-Liang Zhang, Fei Yin, Zhi-Long Ji, Jin-Feng Bai, Zhen-Ru Pan, Fan-Hu Zeng, Jian Xu, Jia-Xin Zhang, Cheng-Lin Liu

    Abstract: Due to the rapid advancements in multimodal large language models, evaluating their multimodal mathematical capabilities continues to receive wide attention. Despite the datasets like MathVista proposed benchmarks for assessing mathematical capabilities in multimodal scenarios, there is still a lack of corresponding evaluation tools and datasets for fine-grained assessment in the context of K12 ed…

    Submitted 27 June, 2024; originally announced July 2024.

  30. arXiv:2407.11727  [pdf, ps, other]

    hep-ex hep-ph

    Measurement of the branching fraction of $D^+_s\to \ell^+ν_\ell$ via $e^+e^-\to D^{*+}_{s} D^{*-}_{s}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (634 additional authors not shown)

    Abstract: Based on $10.64~\mathrm{fb}^{-1}$ of $e^+e^-$ collision data taken at center-of-mass energies between 4.237 and 4.699 GeV with the BESIII detector, we study the leptonic $D^+_s$ decays using the $e^+e^-\to D^{*+}_{s} D^{*-}_{s}$ process. The branching fractions of $D_s^+\to\ell^+ν_{\ell}\,(\ell=μ,τ)$ are measured to be $\mathcal{B}(D_s^+\toμ^+ν_μ)=(0.547\pm0.026_{\rm stat}\pm0.016_{\rm syst})\%$ a…

    Submitted 18 July, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

    Comments: 27 pages, 13 figures

  31. arXiv:2407.11716  [pdf, other]

    q-fin.TR cs.CE

    No Questions Asked: Effects of Transparency on Stablecoin Liquidity During the Collapse of Silicon Valley Bank

    Authors: Walter Hernandez Cruz, Jiahua Xu, Paolo Tasca, Carlo Campajola

    Abstract: Fiat-pegged stablecoins are by nature exposed to spillover effects during market turmoil in Traditional Finance (TradFi). We observe a difference in TradFi market shocks impact between various stablecoins, in particular, USD Coin (USDC) and Tether USDT (USDT), the former with a higher reporting frequency and transparency than the latter. We investigate this, using top USDC and USDT liquidity pools…

    Submitted 16 July, 2024; originally announced July 2024.

  32. arXiv:2407.11474  [pdf, other]

    hep-ex

    Search for the rare $Λ_c^+ \to p μ^+ μ^-$ decay

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1062 additional authors not shown)

    Abstract: A search for the nonresonant $Λ_c^+ \to p μ^+ μ^-$ decay is performed using proton-proton collision data recorded at a centre-of-mass energy of 13 TeV by the LHCb experiment, corresponding to an integrated luminosity of 5.4 fb$^{-1}$. No evidence for the decay is found in the dimuon invariant-mass regions where the expected contributions of resonances is subdominant. The upper limit on the branchi…

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: All figures and tables, along with any supplementary material and additional information, are available at https://cern.ch/lhcbproject/Publications/p/LHCb-PAPER-2024-005.html (LHCb public pages)

    Report number: LHCb-PAPER-2024-005, CERN-EP-2024-158

  33. arXiv:2407.11417  [pdf, other]

    cs.CL

    SPINACH: SPARQL-Based Information Navigation for Challenging Real-World Questions

    Authors: Shicheng Liu, Sina J. Semnani, Harold Triedman, Jialiang Xu, Isaac Dan Zhao, Monica S. Lam

    Abstract: Recent work integrating Large Language Models (LLMs) has led to significant improvements in the Knowledge Base Question Answering (KBQA) task. However, we posit that existing KBQA datasets that either have simple questions, use synthetically generated logical forms, or are based on small knowledge base (KB) schemas, do not capture the true complexity of KBQA tasks. To address this, we introduce…

    Submitted 16 July, 2024; originally announced July 2024.

  34. arXiv:2407.10990  [pdf]

    cs.CL cs.AI

    MedBench: A Comprehensive, Standardized, and Reliable Benchmarking System for Evaluating Chinese Medical Large Language Models

    Authors: Mianxin Liu, Jinru Ding, Jie Xu, Weiguo Hu, Xiaoyang Li, Lifeng Zhu, Zhian Bai, Xiaoming Shi, Benyou Wang, Haitao Song, Pengfei Liu, Xiaofan Zhang, Shanshan Wang, Kang Li, Haofen Wang, Tong Ruan, Xuanjing Huang, Xin Sun, Shaoting Zhang

    Abstract: Ensuring the general efficacy and goodness for human beings from medical large language models (LLM) before real-world deployment is crucial. However, a widely accepted and accessible evaluation process for medical LLM, especially in the Chinese context, remains to be established. In this work, we introduce "MedBench", a comprehensive, standardized, and reliable benchmarking system for Chinese med…

    Submitted 23 June, 2024; originally announced July 2024.

    Comments: 25 pages, 4 figures

  35. arXiv:2407.10759  [pdf, other]

    eess.AS cs.CL cs.LG

    Qwen2-Audio Technical Report

    Authors: Yunfei Chu, Jin Xu, Qian Yang, Haojie Wei, Xipin Wei, Zhifang Guo, Yichong Leng, Yuanjun Lv, Jinzheng He, Junyang Lin, Chang Zhou, Jingren Zhou

    Abstract: We introduce the latest progress of Qwen-Audio, a large-scale audio-language model called Qwen2-Audio, which is capable of accepting various audio signal inputs and performing audio analysis or direct textual responses with regard to speech instructions. In contrast to complex hierarchical tags, we have simplified the pre-training process by utilizing natural language prompts for different data an…

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: https://github.com/QwenLM/Qwen2-Audio. Checkpoints, code, and scripts will be open-sourced soon

  36. arXiv:2407.10727  [pdf]

    cond-mat.soft

    Edwards thermodynamic framework controls density segregation in cyclically sheared granular materials

    Authors: Haiyang Lu, Houfei Yuan, Shuyang Zhang, Zhikun Zeng, Yi Xing, Jiazhao Xu, Xin Wang, Yujie Wang

    Abstract: Using X-ray tomography, we experimentally investigate granular segregation phenomena in a mixture of particles with different densities under quasi-static cyclic shear. We quantitatively characterize their height distributions at steady states by minimizing effective free energy based on a segregation temperature that captures the competition between the mixing entropy and gravitational potential…

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 18 pages, 5 figures

  37. arXiv:2407.10687  [pdf, other]

    cs.CV cs.GR

    FRI-Net: Floorplan Reconstruction via Room-wise Implicit Representation

    Authors: Honghao Xu, Juzhan Xu, Zeyu Huang, Pengfei Xu, Hui Huang, Ruizhen Hu

    Abstract: In this paper, we introduce a novel method called FRI-Net for 2D floorplan reconstruction from 3D point cloud. Existing methods typically rely on corner regression or box regression, which lack consideration for the global shapes of rooms. To address these issues, we propose a novel approach using a room-wise implicit representation with structural regularization to characterize the shapes of room…

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  38. arXiv:2407.10671  [pdf, other]

    cs.CL cs.AI

    Qwen2 Technical Report

    Authors: An Yang, Baosong Yang, Binyuan Hui, Bo Zheng, Bowen Yu, Chang Zhou, Chengpeng Li, Chengyuan Li, Dayiheng Liu, Fei Huang, Guanting Dong, Haoran Wei, Huan Lin, Jialong Tang, Jialin Wang, Jian Yang, Jianhong Tu, Jianwei Zhang, Jianxin Ma, Jianxin Yang, Jin Xu, Jingren Zhou, Jinze Bai, Jinzheng He, Junyang Lin , et al. (37 additional authors not shown)

    Abstract: This report introduces the Qwen2 series, the latest addition to our large language models and large multimodal models. We release a comprehensive suite of foundational and instruction-tuned language models, encompassing a parameter range from 0.5 to 72 billion, featuring dense models and a Mixture-of-Experts model. Qwen2 surpasses most prior open-weight models, including its predecessor Qwen1.5, a…

    Submitted 17 July, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

    Comments: 25 pages, 1 figure

  39. arXiv:2407.10540  [pdf, other]

    astro-ph.HE

    Sudden polarization angle jumps of the repeating fast radio burst FRB 20201124A

    Authors: J. R. Niu, W. Y. Wang, J. C. Jiang, Y. Qu, D. J. Zhou, W. W. Zhu, K. J. Lee, J. L. Han, B. Zhang, D. Li, S. Cao, Z. Y. Fang, Y. Feng, Q. Y. Fu, P. Jiang, W. C. Jing, J. Li, Y. Li, R. Luo, L. Q. Meng, C. C. Miao, X. L. Miao, C. H. Niu, Y. C. Pan, B. J. Wang, et al. (19 additional authors not shown)

    Abstract: We report the first detection of polarization angle (PA) orthogonal jumps, a phenomenon previously only observed from radio pulsars, from a fast radio burst (FRB) source FRB 20201124A. We find three cases of orthogonal jumps in over two thousand bursts, all resembling those observed in pulsar single pulses. We propose that the jumps are due to the superposition of two orthogonal emission modes tha…

    Submitted 14 August, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

    Comments: 10 pages, 5 figures, accepted by ApJL

  40. arXiv:2407.10199  [pdf, other]

    nucl-ex nucl-th

    Charge radii of $^{11-16}$C, $^{13-17}$N and $^{15-18}$O determined from their charge-changing cross-sections and the mirror-difference charge radii

    Authors: J. W. Zhao, B.-H. Sun, I. Tanihata, J. Y. Xu, K. Y. Zhang, A. Prochazka, L. H. Zhu, S. Terashima, J. Meng, L. C. He, C. Y. Liu, G. S. Li, C. G. Lu, W. J. Lin, W. P. Lin, Z. Liu, P. P. Ren, Z. Y. Sun, F. Wang, J. Wang, M. Wang, S. T. Wang, X. L. Wei, X. D. Xu, J. C. Zhang, et al. (2 additional authors not shown)

    Abstract: Charge-changing cross-sections of $^{11-16}$C, $^{13-17}$N and $^{15-18}$O on a carbon target have been determined at energies around 300 MeV/nucleon. A nucleon separation energy dependent correction factor has been introduced to the Glauber model calculation for extracting the nuclear charge radii from the experimental CCCSs. The charge radii of $^{11}$C, $^{13,16}$N and $^{15}$O thus were determ…

    Submitted 4 August, 2024; v1 submitted 14 July, 2024; originally announced July 2024.

    Comments: 3 figures, submitted to Physics Letters B

  41. Towards Robust Recommendation via Decision Boundary-aware Graph Contrastive Learning

    Authors: Jiakai Tang, Sunhao Dai, Zexu Sun, Xu Chen, Jun Xu, Wenhui Yu, Lantao Hu, Peng Jiang, Han Li

    Abstract: In recent years, graph contrastive learning (GCL) has received increasing attention in recommender systems due to its effectiveness in reducing bias caused by data sparsity. However, most existing GCL models rely on heuristic approaches and usually assume entity independence when constructing contrastive views. We argue that these methods struggle to strike a balance between semantic invariance an…

    Submitted 21 July, 2024; v1 submitted 14 July, 2024; originally announced July 2024.

    Comments: KDD 2024

  42. arXiv:2407.09356  [pdf, other]

    cs.DS cs.CG

    Bipartizing (Pseudo-)Disk Graphs: Approximation with a Ratio Better than 3

    Authors: Daniel Lokshtanov, Fahad Panolan, Saket Saurabh, Jie Xue, Meirav Zehavi

    Abstract: In a disk graph, every vertex corresponds to a disk in $\mathbb{R}^2$ and two vertices are connected by an edge whenever the two corresponding disks intersect. Disk graphs form an important class of geometric intersection graphs, which generalizes both planar graphs and unit-disk graphs. We study a fundamental optimization problem in algorithmic graph theory, Bipartization (also known as Odd Cycle…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: In APPROX'24

  43. arXiv:2407.09121  [pdf, other]

    cs.CL cs.AI

    Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training

    Authors: Youliang Yuan, Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Jiahao Xu, Tian Liang, Pinjia He, Zhaopeng Tu

    Abstract: This study addresses a critical gap in safety tuning practices for Large Language Models (LLMs) by identifying and tackling a refusal position bias within safety tuning data, which compromises the models' ability to appropriately refuse generating unsafe content. We introduce a novel approach, Decoupled Refusal Training (DeRTa), designed to empower LLMs to refuse compliance to harmful prompts at a…

    Submitted 12 July, 2024; originally announced July 2024.

  44. arXiv:2407.09036  [pdf, ps, other]

    math.AG

    On the structure of the complement of skeleton

    Authors: Morgan Brown, Jiachang Xu, Muyuan Zhang

    Abstract: We study the higher dimensional geometry of Berkovich spaces using open unit disks, which are given by fibration of relative dimension $1$. Inspired by birational geometry, we conjecture that the Berkovich skeleton is the complement of the union of all open unit disks, and prove this conjecture for $\mathcal{X}$ admitting a strictly semistable model with semiample canonical class.

    Submitted 11 August, 2024; v1 submitted 12 July, 2024; originally announced July 2024.

    Comments: comments are welcome!

    MSC Class: 14G22

  45. arXiv:2407.08558  [pdf, other]

    cs.AI

    ST-Mamba: Spatial-Temporal Mamba for Traffic Flow Estimation Recovery using Limited Data

    Authors: Dongcheng Yuan, Jianzhe Xue, Jinshan Su, Wenchao Xu, Haibo Zhou

    Abstract: Traffic flow estimation (TFE) is crucial for urban intelligent traffic systems. While traditional on-road detectors are hindered by limited coverage and high costs, cloud computing and data mining of vehicular network data, such as driving speeds and GPS coordinates, present a promising and cost-effective alternative. Furthermore, minimizing data collection can significantly reduce overhead. Howev…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Accepted by 2024 IEEE/CIC International Conference on Communications in China (ICCC)

  46. arXiv:2407.08402  [pdf, ps, other]

    cond-mat.mtrl-sci

    Selective area epitaxy of in-plane HgTe nanostructures on CdTe(001) substrate

    Authors: Nicolas Chaize, Xavier Baudry, Pierre-Henri Jouneau, Eric Gautier, Jean-Luc Rouvière, Yves Deblock, Jimmy Xu, Maxime Berthe, Clément Barbot, Bruno Grandidier, Ludovic Desplanque, Hermann Sellier, Philippe Ballet

    Abstract: Semiconductor nanowires are believed to play a crucial role for future applications in electronics, spintronics and quantum technologies. A potential candidate is HgTe, but its sensitivity to nanofabrication processes restrains its development. A way to circumvent this obstacle is the selective area growth technique. Here, in-plane HgTe nanostructures are grown thanks to selective area molecular bea…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 18 pages and 8 figures. Submitted to Nanotechnology

  47. arXiv:2407.08386  [pdf, other]

    eess.SY

    Improved Model and Analysis for RIS-Assisted Indoor Terahertz Wireless Networks

    Authors: Zhi Chai, Jiajie Xu, Mohamed-Slim Alouini, Justin P. Coon

    Abstract: In this paper, we propose a new model for indoor THz communication assisted by RIS. We conduct a realistic modeling of indoor obstacles and analyze their impact on performance. Order statistics are applied to calculate the cumulative distribution functions (CDFs) of distances from the transmitter to the selected RIS, i.e., the nearest RIS in the bounded indoor environment to the transmitter, and f…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 11 pages, 11 figures, submitted to IEEE Transactions on Wireless Communications

  48. arXiv:2407.08047   

    cs.LG cs.AI

    Spatial-Temporal Attention Model for Traffic State Estimation with Sparse Internet of Vehicles

    Authors: Jianzhe Xue, Dongcheng Yuan, Yu Sun, Tianqi Zhang, Wenchao Xu, Haibo Zhou, Xuemin Shen

    Abstract: The growing number of connected vehicles offers an opportunity to leverage internet of vehicles (IoV) data for traffic state estimation (TSE) which plays a crucial role in intelligent transportation systems (ITS). By utilizing only a portion of IoV data instead of the entire dataset, the significant overheads associated with collecting and processing large amounts of data can be avoided. In this p…

    Submitted 14 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: need further improvement

  49. arXiv:2407.08034  [pdf, other]

    cs.AI

    Spatial-Temporal Generative AI for Traffic Flow Estimation with Sparse Data of Connected Vehicles

    Authors: Jianzhe Xue, Yunting Xu, Dongcheng Yuan, Caoyi Zha, Hongyang Du, Haibo Zhou, Dusit Niyato

    Abstract: Traffic flow estimation (TFE) is crucial for intelligent transportation systems. Traditional TFE methods rely on extensive road sensor networks and typically incur significant costs. Sparse mobile crowdsensing enables a cost-effective alternative by utilizing sparsely distributed probe vehicle data (PVD) provided by connected vehicles. However, as pointed out by the central limit theorem, the spar…

    Submitted 10 July, 2024; originally announced July 2024.

  50. arXiv:2407.08028  [pdf, other]

    cs.RO

    AutoMate: Specialist and Generalist Assembly Policies over Diverse Geometries

    Authors: Bingjie Tang, Iretiayo Akinola, Jie Xu, Bowen Wen, Ankur Handa, Karl Van Wyk, Dieter Fox, Gaurav S. Sukhatme, Fabio Ramos, Yashraj Narang

    Abstract: Robotic assembly for high-mixture settings requires adaptivity to diverse parts and poses, which is an open challenge. Meanwhile, in other areas of robotics, large models and sim-to-real have led to tremendous progress. Inspired by such work, we present AutoMate, a learning framework and system that consists of 4 parts: 1) a dataset of 100 assemblies compatible with simulation and the real world,…

    Submitted 31 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.