Zum Hauptinhalt springen

Showing 1–50 of 2,533 results for author: Zhu, H

.
  1. arXiv:2408.17081  [pdf, other

    cs.CV

    Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training

    Authors: Zizheng Huang, Haoxing Chen, Jiaqi Li, Jun Lan, Huijia Zhu, Weiqiang Wang, Limin Wang

    Abstract: Recent Vision Mamba models not only have much lower complexity for processing higher resolution images and longer videos but also the competitive performance with Vision Transformers (ViTs). However, they are stuck into overfitting and thus only present up to base size (about 80M). It is still unclear how vanilla Vision Mamba (Vim) can be efficiently scaled up to larger sizes, which is essentially… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

  2. arXiv:2408.17071  [pdf, other

    hep-ex

    Search for $h_c \to π^+π^-J/ψ$ via $ψ(3686)\to π^0h_c$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (653 additional authors not shown)

    Abstract: Using $(2712.4 \pm 14.3) \times 10^6~ψ$(3686) events collected with the BESIII detector operating at the BEPCII collider, we search for the hadronic transition $h_c \to π^+π^-J/ψ$ via $ψ(3686)\to π^0 h_c$. No significant signal is observed. We set the most stringent upper limits to date on the branching fractions $\mathcal{B}(ψ(3686)\to π^0 h_c)\times\mathcal{B}(h_c\toπ^+π^-J/ψ)$ and… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

  3. arXiv:2408.17039  [pdf, other

    astro-ph.HE

    The bicoherence analysis of type C quasi-periodic oscillations in Swift J1727.8-1613

    Authors: Haifan Zhu, Wei Wang, Ziyuan Zhu

    Abstract: We present the results of bicoherence analysis for Swift J1727.8-1613 during its 2023 outburst, using data from Insight-HXMT. Our analysis focused on observations with quasi-periodic oscillations (QPOs) of frequencies greater than 1 Hz, revealing that all of them belong to type C QPOs. We found a strong correlation between the QPO frequency and the hardness ratio, as well as a linear relationship… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

    Comments: 14 pages, 6 figures, ApJ in press

  4. arXiv:2408.16654  [pdf, other

    hep-ex

    Measurement of the Decay $Ξ^{0}\toΛγ$ with Entangled $Ξ^{0}\barΞ^{0}$ Pairs

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: In this Letter, a systematic study of the weak radiative hyperon decay $Ξ^{0}\toΛγ$ at an electron-positron collider using entangled $Ξ^{0}\barΞ^{0}$ pair events is presented. The absolute branching fraction for this decay has been measured for the first time, and is $\left(1.347 \pm 0.066_{\mathrm stat.}\pm0.054_{\mathrm syst.}\right)\times 10^{-3}$. The decay asymmetry parameter, which character… ▽ More

    Submitted 29 August, 2024; v1 submitted 29 August, 2024; originally announced August 2024.

    Comments: 10 pages, 3 figures

  5. arXiv:2408.16279  [pdf, ps, other

    hep-ex

    Model-independent determination of the strong-phase difference between $D^0$ and $\bar{D}^0 \to π^+π^-π^+π^-$ decays

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (647 additional authors not shown)

    Abstract: Measurements of the strong-phase difference between $D^0$ and $\bar{D}^0\toπ^+π^-π^+π^-$ are performed in bins of phase space. The study exploits a sample of quantum-correlated $D\bar{D}$ mesons collected by the BESIII experiment in $e^+e^-$ collisions at a center-of-mass energy of 3.773~GeV, corresponding to an integrated luminosity of 2.93~fb$^{-1}$. Here, $D$ denotes a neutral charm meson in a… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

  6. arXiv:2408.15995  [pdf, other

    cs.CV

    TEDRA: Text-based Editing of Dynamic and Photoreal Actors

    Authors: Basavaraj Sunagad, Heming Zhu, Mohit Mendiratta, Adam Kortylewski, Christian Theobalt, Marc Habermann

    Abstract: Over the past years, significant progress has been made in creating photorealistic and drivable 3D avatars solely from videos of real humans. However, a core remaining challenge is the fine-grained and user-friendly editing of clothing styles by means of textual descriptions. To this end, we present TEDRA, the first method allowing text-based edits of an avatar, which maintains the avatar's high f… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: For project page, see this https://vcai.mpi-inf.mpg.de/projects/Tedra

  7. arXiv:2408.15815  [pdf, other

    cs.SE

    MR-Adopt: Automatic Deduction of Input Transformation Function for Metamorphic Testing

    Authors: Congying Xu, Songqiang Chen, Jiarong Wu, Shing-Chi Cheung, Valerio Terragni, Hengcheng Zhu, Jialun Cao

    Abstract: While a recent study reveals that many developer-written test cases can encode a reusable Metamorphic Relation (MR), over 70% of them directly hard-code the source input and follow-up input in the encoded relation. Such encoded MRs, which do not contain an explicit input transformation to transform the source inputs to corresponding follow-up inputs, cannot be reused with new source inputs to enha… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: This paper is accepted to ASE 2024

  8. arXiv:2408.15051  [pdf, other

    physics.optics

    Optical Routing via High Efficiency Composite Acoustic Diffraction

    Authors: Yuxiang Zhao, Jiangyong Hu, Ruijuan Liu, Ruochen Gao, Yiming Li, Xiao Zhang, Huanfeng Zhu, Saijun Wu

    Abstract: Acousto-optical modulation (AOM) is a powerful and widely used technique for rapidly controlling the frequency, phase, intensity, and direction of light. Based on Bragg diffraction, AOMs typically exhibit moderate diffraction efficiency, often less than 90\% even for collimated inputs. In this work, we demonstrate that this efficiency can be significantly improved using a composite (CP) setup comp… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

    Comments: 11 pages, 5 figures

  9. arXiv:2408.14178  [pdf, other

    quant-ph

    Single-photon scattering in giant-atom topological-waveguide-QED systems

    Authors: Hai Zhu, Xian-Li Yin, Jie-Qiao Liao

    Abstract: The giant-atom topological-waveguide-QED systems have recently emerged as a promising platform for manipulating light-matter interactions. The combination of the multiple-point couplings and topological phase effect could lead to rich physical phenomena and effects. Here, we study single-photon scattering in a Su-Schrieffer-Heeger (SSH) waveguide coupled to either one or two two-level giant atoms.… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: 16 pages, 10 figures

  10. arXiv:2408.13705  [pdf, other

    cs.CL cs.SD eess.AS

    Cross-Modal Denoising: A Novel Training Paradigm for Enhancing Speech-Image Retrieval

    Authors: Lifeng Zhou, Yuke Li, Rui Deng, Yuting Yang, Haoqi Zhu

    Abstract: The success of speech-image retrieval relies on establishing an effective alignment between speech and image. Existing methods often model cross-modal interaction through simple cosine similarity of the global feature of each modality, which fall short in capturing fine-grained details within modalities. To address this issue, we introduce an effective framework and a novel learning task named cro… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2408.13119

  11. arXiv:2408.13470  [pdf, other

    eess.SP

    Performance Analysis of Photon-Limited Free-Space Optical Communications with Practical Photon-Counting Receivers

    Authors: Chen Wang, Zhiyong Xu, Jingyuan Wang, Jianhua Li, Weifeng Mou, Huatao Zhu, Jiyong Zhao, Yang Su, Yimin Wang, Ailin Qi

    Abstract: The non-perfect factors of practical photon-counting receiver are recognized as a significant challenge for long-distance photon-limited free-space optical (FSO) communication systems. This paper presents a comprehensive analytical framework for modeling the statistical properties of time-gated single-photon avalanche diode (TG-SPAD) based photon-counting receivers in presence of dead time, non-ph… ▽ More

    Submitted 24 August, 2024; originally announced August 2024.

  12. arXiv:2408.13395  [pdf, other

    cs.CV

    Task-Oriented Diffusion Inversion for High-Fidelity Text-based Editing

    Authors: Yangyang Xu, Wenqi Shao, Yong Du, Haiming Zhu, Yang Zhou, Ping Luo, Shengfeng He

    Abstract: Recent advancements in text-guided diffusion models have unlocked powerful image manipulation capabilities, yet balancing reconstruction fidelity and editability for real images remains a significant challenge. In this work, we introduce \textbf{T}ask-\textbf{O}riented \textbf{D}iffusion \textbf{I}nversion (\textbf{TODInv}), a novel framework that inverts and edits real images tailored to specific… ▽ More

    Submitted 23 August, 2024; originally announced August 2024.

  13. arXiv:2408.13000  [pdf, other

    stat.ME stat.CO

    Air-HOLP: Adaptive Regularized Feature Screening for High Dimensional Data

    Authors: Ibrahim Joudah, Samuel Muller, Houying Zhu

    Abstract: Handling high-dimensional datasets presents substantial computational challenges, particularly when the number of features far exceeds the number of observations and when features are highly correlated. A modern approach to mitigate these issues is feature screening. In this work, the High-dimensional Ordinary Least-squares Projection (HOLP) feature screening method is advanced by employing adapti… ▽ More

    Submitted 23 August, 2024; originally announced August 2024.

    MSC Class: 62J07 (Primary) 62H20; 62J05 (Secondary)

  14. arXiv:2408.12316  [pdf, other

    cs.CV

    Unrolled Decomposed Unpaired Learning for Controllable Low-Light Video Enhancement

    Authors: Lingyu Zhu, Wenhan Yang, Baoliang Chen, Hanwei Zhu, Zhangkai Ni, Qi Mao, Shiqi Wang

    Abstract: Obtaining pairs of low/normal-light videos, with motions, is more challenging than still images, which raises technical issues and poses the technical route of unpaired learning as a critical role. This paper makes endeavors in the direction of learning for low-light video enhancement without using paired ground truth. Compared to low-light image enhancement, enhancing low-light videos is more dif… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

  15. arXiv:2408.11871  [pdf, other

    cs.CL cs.AI

    MegaFake: A Theory-Driven Dataset of Fake News Generated by Large Language Models

    Authors: Lionel Z. Wang, Yiming Ma, Renfei Gao, Beichen Guo, Zhuoran Li, Han Zhu, Wenqi Fan, Zexin Lu, Ka Chung Ng

    Abstract: The advent of large language models (LLMs) has revolutionized online content creation, making it much easier to generate high-quality fake news. This misuse threatens the integrity of our digital environment and ethical standards. Therefore, understanding the motivations and mechanisms behind LLM-generated fake news is crucial. In this study, we analyze the creation of fake news from a social psyc… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

  16. arXiv:2408.11826  [pdf, other

    cs.CY cs.AI

    Generative Organizational Behavior Simulation using Large Language Model based Autonomous Agents: A Holacracy Perspective

    Authors: Chen Zhu, Yihang Cheng, Jingshuai Zhang, Yusheng Qiu, Sitao Xia, Hengshu Zhu

    Abstract: In this paper, we present the technical details and periodic findings of our project, CareerAgent, which aims to build a generative simulation framework for a Holacracy organization using Large Language Model-based Autonomous Agents. Specifically, the simulation framework includes three phases: construction, execution, and evaluation, and it incorporates basic characteristics of individuals, organ… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

  17. arXiv:2408.11312  [pdf, other

    cs.CV cs.AI

    Swarm Intelligence in Geo-Localization: A Multi-Agent Large Vision-Language Model Collaborative Framework

    Authors: Xiao Han, Chen Zhu, Xiangyu Zhao, Hengshu Zhu

    Abstract: Visual geo-localization demands in-depth knowledge and advanced reasoning skills to associate images with real-world geographic locations precisely. In general, traditional methods based on data-matching are hindered by the impracticality of storing adequate visual records of global landmarks. Recently, Large Vision-Language Models (LVLMs) have demonstrated the capability of geo-localization throu… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  18. arXiv:2408.10918  [pdf, other

    cs.CL

    CHECKWHY: Causal Fact Verification via Argument Structure

    Authors: Jiasheng Si, Yibo Zhao, Yingjie Zhu, Haiyang Zhu, Wenpeng Lu, Deyu Zhou

    Abstract: With the growing complexity of fact verification tasks, the concern with "thoughtful" reasoning capabilities is increasing. However, recent fact verification benchmarks mainly focus on checking a narrow scope of semantic factoids within claims and lack an explicit logical reasoning process. In this paper, we introduce CheckWhy, a challenging dataset tailored to a novel causal fact verification tas… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: Accepted by ACL2024; Awarded as Outstanding Paper Award and Area Chair Award

  19. arXiv:2408.10800  [pdf, other

    eess.SP

    A Novel Signal Detection Method for Photon-Counting Communications with Nonlinear Distortion Effects

    Authors: Chen Wang, Zhiyong Xu, Jingyuan Wang, Jianhua Li, Weifeng Mou, Huatao Zhu, Jiyong Zhao, Yang Su, Yimin Wang, Ailin Qi

    Abstract: This paper proposes a method for estimating and detecting optical signals in practical photon-counting receivers. There are two important aspects of non-perfect photon-counting receivers, namely, (i) dead time which results in blocking loss, and (ii) non-photon-number-resolving, which leads to counting loss during the gate-ON interval. These factors introduce nonlinear distortion to the detected p… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  20. arXiv:2408.10520  [pdf, other

    cs.IR

    Efficient and Deployable Knowledge Infusion for Open-World Recommendations via Large Language Models

    Authors: Yunjia Xi, Weiwen Liu, Jianghao Lin, Muyan Weng, Xiaoling Cai, Hong Zhu, Jieming Zhu, Bo Chen, Ruiming Tang, Yong Yu, Weinan Zhang

    Abstract: Recommender systems (RSs) play a pervasive role in today's online services, yet their closed-loop nature constrains their access to open-world knowledge. Recently, large language models (LLMs) have shown promise in bridging this gap. However, previous attempts to directly implement LLMs as recommenders fall short in meeting the requirements of industrial RSs, particularly in terms of online infere… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: arXiv admin note: text overlap with arXiv:2306.10933

  21. arXiv:2408.10155  [pdf, ps, other

    math.RA math.CO

    On Bott--Samelson rings for Coxeter groups

    Authors: Tao Gui, Lin Sun, Shihao Wang, Haoyu Zhu

    Abstract: We study the cohomology ring of the Bott--Samelson variety. We compute an explicit presentation of this ring via Soergel's result, which implies that it is a combinatorial invariant. We use the presentation to introduce the Bott--Samelson ring associated with a word in arbitrary Coxeter system by generators and relations. In general, it is a split quadratic complete intersection algebra with a tri… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: 18 pages, comments are welcome!

    MSC Class: 20F55; 14D07; 16S37; 14M15

  22. Revisiting Reciprocal Recommender Systems: Metrics, Formulation, and Method

    Authors: Chen Yang, Sunhao Dai, Yupeng Hou, Wayne Xin Zhao, Jun Xu, Yang Song, Hengshu Zhu

    Abstract: Reciprocal recommender systems~(RRS), conducting bilateral recommendations between two involved parties, have gained increasing attention for enhancing matching efficiency. However, the majority of existing methods in the literature still reuse conventional ranking metrics to separately assess the performance on each side of the recommendation process. These methods overlook the fact that the rank… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: KDD 2024

  23. arXiv:2408.09416  [pdf, ps, other

    cs.CL cs.AI

    Challenges and Responses in the Practice of Large Language Models

    Authors: Hongyin Zhu

    Abstract: This paper carefully summarizes extensive and profound questions from all walks of life, focusing on the current high-profile AI field, covering multiple dimensions such as industry trends, academic research, technological innovation and business applications. This paper meticulously curates questions that are both thought-provoking and practically relevant, providing nuanced and insightful answer… ▽ More

    Submitted 21 August, 2024; v1 submitted 18 August, 2024; originally announced August 2024.

  24. arXiv:2408.09205  [pdf, ps, other

    cs.CL cs.AI

    Architectural Foundations for the Large Language Model Infrastructures

    Authors: Hongyin Zhu

    Abstract: The development of a large language model (LLM) infrastructure is a pivotal undertaking in artificial intelligence. This paper explores the intricate landscape of LLM infrastructure, software, and data management. By analyzing these core components, we emphasize the pivotal considerations and safeguards crucial for successful LLM development. This work presents a concise synthesis of the challenge… ▽ More

    Submitted 21 August, 2024; v1 submitted 17 August, 2024; originally announced August 2024.

  25. arXiv:2408.08881  [pdf, other

    eess.IV cs.AI cs.CV

    U-MedSAM: Uncertainty-aware MedSAM for Medical Image Segmentation

    Authors: Xin Wang, Xiaoyu Liu, Peng Huang, Pu Huang, Shu Hu, Hongtu Zhu

    Abstract: Medical Image Foundation Models have proven to be powerful tools for mask prediction across various datasets. However, accurately assessing the uncertainty of their predictions remains a significant challenge. To address this, we propose a new model, U-MedSAM, which integrates the MedSAM model with an uncertainty-aware loss function and the Sharpness-Aware Minimization (SharpMin) optimizer. The un… ▽ More

    Submitted 3 August, 2024; originally announced August 2024.

    Comments: arXiv admin note: text overlap with arXiv:2405.17496

  26. arXiv:2408.08826  [pdf, other

    hep-ex

    Search for the rare decay $J/ψ\to γD^0+c.c.$ at BESIII

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (642 additional authors not shown)

    Abstract: Using $(10087\pm44)\times10^6J/ψ$ events collected with the BESIII detector, we search for the rare decay $J/ψ\to γD^0+c.c.$ for the first time. No obvious signal is observed and the upper limit on the branching fraction is determined to be ${\cal B}(J/ψ\to γD^{0}+c.c.)< 9.1 \times 10^{-8}$ at 90\% confidence level.

    Submitted 16 August, 2024; originally announced August 2024.

  27. arXiv:2408.08824  [pdf, other

    cs.LG

    LEVIS: Large Exact Verifiable Input Spaces for Neural Networks

    Authors: Mohamad Fares El Hajj Chehade, Brian Wesley Bell, Russell Bent, Hao Zhu, Wenting Li

    Abstract: The robustness of neural networks is paramount in safety-critical applications. While most current robustness verification methods assess the worst-case output under the assumption that the input space is known, identifying a verifiable input space $\mathcal{C}$, where no adversarial examples exist, is crucial for effective model selection, robustness evaluation, and the development of reliable co… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

  28. arXiv:2408.08812  [pdf, other

    cs.LG

    CAT: Caution Aware Transfer in Reinforcement Learning via Distributional Risk

    Authors: Mohamad Fares El Hajj Chehade, Amrit Singh Bedi, Amy Zhang, Hao Zhu

    Abstract: Transfer learning in reinforcement learning (RL) has become a pivotal strategy for improving data efficiency in new, unseen tasks by utilizing knowledge from previously learned tasks. This approach is especially beneficial in real-world deployment scenarios where computational resources are constrained and agents must adapt rapidly to novel environments. However, current state-of-the-art methods o… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

  29. arXiv:2408.08551  [pdf, other

    cs.CL

    Integrating Multi-view Analysis: Multi-view Mixture-of-Expert for Textual Personality Detection

    Authors: Haohao Zhu, Xiaokun Zhang, Junyu Lu, Liang Yang, Hongfei Lin

    Abstract: Textual personality detection aims to identify personality traits by analyzing user-generated content. To achieve this effectively, it is essential to thoroughly examine user-generated content from various perspectives. However, previous studies have struggled with automatically extracting and effectively integrating information from multiple perspectives, thereby limiting their performance on per… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

    Comments: Accepted by NLPCC 2024

  30. arXiv:2408.06677  [pdf, other

    hep-ex hep-ph

    Search for $η_c(2S)\toωω$ and $ωφ$ decays and measurements of $χ_{cJ}\toωω$ and $ωφ$ in $ψ(2S)$ radiative processes

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (643 additional authors not shown)

    Abstract: Using $(2712\pm 14)$ $\times$ 10$^{6}$ $ψ(2S)$ events collected with the BESIII detector at the BEPCII collider, we search for the decays $η_{c}(2S)\toωω$ and $η_{c}(2S)\toωφ$ via the process $ψ(2S)\toγη_{c}(2S)$. Evidence of $η_{c}(2S)\toωω$ is found with a statistical significance of $3.2σ$. The branching fraction is measured to be… ▽ More

    Submitted 13 August, 2024; originally announced August 2024.

  31. arXiv:2408.05920  [pdf, other

    cs.AI cs.LG

    Urban Region Pre-training and Prompting: A Graph-based Approach

    Authors: Jiahui Jin, Yifan Song, Dong Kan, Haojia Zhu, Xiangguo Sun, Zhicheng Li, Xigang Sun, Jinghui Zhang

    Abstract: Urban region representation is crucial for various urban downstream tasks. However, despite the proliferation of methods and their success, acquiring general urban region knowledge and adapting to different tasks remains challenging. Previous work often neglects the spatial structures and functional layouts between entities, limiting their ability to capture transferable knowledge across regions.… ▽ More

    Submitted 26 August, 2024; v1 submitted 12 August, 2024; originally announced August 2024.

  32. arXiv:2408.05815  [pdf, other

    cs.CV

    HySparK: Hybrid Sparse Masking for Large Scale Medical Image Pre-Training

    Authors: Fenghe Tang, Ronghao Xu, Qingsong Yao, Xueming Fu, Quan Quan, Heqin Zhu, Zaiyi Liu, S. Kevin Zhou

    Abstract: The generative self-supervised learning strategy exhibits remarkable learning representational capabilities. However, there is limited attention to end-to-end pre-training methods based on a hybrid architecture of CNN and Transformer, which can learn strong local and global representations simultaneously. To address this issue, we propose a generative pre-training strategy called Hybrid Sparse mas… ▽ More

    Submitted 11 August, 2024; originally announced August 2024.

    Comments: Early accept at MICCAI 2024

    ACM Class: I.4.10; I.4.6

  33. arXiv:2408.05342  [pdf, other

    econ.EM

    Optimal Treatment Allocation Strategies for A/B Testing in Partially Observable Time Series Experiments

    Authors: Ke Sun, Linglong Kong, Hongtu Zhu, Chengchun Shi

    Abstract: Time series experiments, in which experimental units receive a sequence of treatments over time, are frequently employed in many technological companies to evaluate the performance of a newly developed policy, product, or treatment relative to a baseline control. Many existing A/B testing solutions assume a fully observable experimental environment that satisfies the Markov condition, which often… ▽ More

    Submitted 9 August, 2024; originally announced August 2024.

  34. Towards High-resolution 3D Anomaly Detection via Group-Level Feature Contrastive Learning

    Authors: Hongze Zhu, Guoyang Xie, Chengbin Hou, Tao Dai, Can Gao, Jinbao Wang, Linlin Shen

    Abstract: High-resolution point clouds~(HRPCD) anomaly detection~(AD) plays a critical role in precision machining and high-end equipment manufacturing. Despite considerable 3D-AD methods that have been proposed recently, they still cannot meet the requirements of the HRPCD-AD task. There are several challenges: i) It is difficult to directly capture HRPCD information due to large amounts of points at the s… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: ACMMM24, 12 pages, 5 figures

  35. arXiv:2408.04422  [pdf, other

    hep-ex

    Analysis of the dynamics of the decay $D^{+}\to K_{S}^{0} π^{0} e^{+}ν_{e}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (644 additional authors not shown)

    Abstract: The branching fraction of $D^+\to K_{S}^{0} π^{0}e^+ν_e$ is measured for the first time using $7.93~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at the center-of-mass energy $\sqrt{s}=3.773$~GeV with the BESIII detector operating at the BEPCII collider, and is determined to be ${\mathcal B}$($D^+\to K_S^0π^0e^+ν_e$) = $(0.881~\pm~0.017_{\rm stat.}~\pm~0.016_{\rm syst.})$\%. Based on a… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

  36. arXiv:2408.03931  [pdf, ps, other

    nucl-th astro-ph.HE cond-mat.str-el cond-mat.supr-con

    Superfluid quantum criticality and the thermal evolution of neutron stars

    Authors: Hao-Fu Zhu, Guo-Zhu Liu, Jing-Rong Wang, Xufen Wu

    Abstract: The neutron star starts to cool down shortly after its birth by emitting neutrinos. When it is cold enough, the Cooper pairs of neutrons are formed, which triggers a superfluid transition. Previous works on neutron superfluidity are focused on the finite-temperature transition. Little attention is paid to the potentially important quantum critical phenomena associated with superfluidity. Here, we… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: 25 pages, 3 figure, comments welcome

  37. arXiv:2408.03857  [pdf, ps, other

    cond-mat.supr-con cond-mat.str-el

    Reply to "Comment on `Towards exact solutions of superconducting $T_c$ induced by electron-phonon interaction' "

    Authors: Guo-Zhu Liu, Zhao-Kun Yang, Xiao-Yin Pan, Jing-Rong Wang, Xin Li, Hao-Fu Zhu, Jie Huang

    Abstract: In a series of papers, we have proposed a non-perturbative field-theoretic approach to deal with strong electron-phonon and strong Coulomb interactions. The key ingredient of such an approach is to determine the full fermion-boson vertex corrections by solving a number of self-consistent Ward-Takahashi identities. Palle (see Phys. Rev. B 110, 026501 (2024), arXiv:2404.02918) argued that our Ward-T… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: Reply to arXiv:2404.02918, which is published as Phys. Rev. B 110, 026501 (2024), by Palle, commenting on arXiv:1911.05528

    Journal ref: Phys. Rev. B 110, 026502 (2024)

  38. arXiv:2408.03791  [pdf, other

    quant-ph cond-mat.mes-hall physics.app-ph physics.optics

    Microwave-optics entanglement via coupled opto- and magnomechanical microspheres

    Authors: Hao-Tian Li, Zhi-Yuan Fan, Huai-Bing Zhu, Simon Gröblacher, Jie Li

    Abstract: Microwave-optics entanglement plays a crucial role in building hybrid quantum networks with quantum nodes working in the microwave and optical frequency bands. However, there are limited efficient ways to produce such entanglement due to the large frequency mismatch between the two regimes. Here, we present a new mechanism to prepare microwave-optics entanglement based on a hybrid system of two co… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

  39. arXiv:2408.03531  [pdf, other

    hep-ex

    Measurement of the Branching Fraction of \boldmath{$ψ(2S) \to γπ^0$}

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (644 additional authors not shown)

    Abstract: Based on $(2712.4\pm14.1)\times10^{6}~ψ(2S)$ events, 7.9 fb$^{-1}$ $ψ(3773)$ data, and 0.8 fb$^{-1}$ off-resonance data samples collected with the BESIII detector, we measure the branching fraction of $ψ(2S)\rightarrowγπ^{0}$ and $e^{+}e^{-}\rightarrowγπ^{0}$ form factor at momentum transfers $Q^{2}\sim13$ GeV$^{2}$. The $e^{+}e^{-}\rightarrowγπ^{0}$ cross section is fitted with considering the in… ▽ More

    Submitted 7 August, 2024; v1 submitted 7 August, 2024; originally announced August 2024.

  40. arXiv:2408.03205  [pdf, other

    hep-ex

    Measurement of $Σ^+$ transverse polarization in $e^+e^-$ collisions at $\sqrt{s} = 3.68-3.71$ GeV

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: Using $e^+e^-$ collision data collected with the BESIII detector at seven energy points ranging from 3.68 to 3.71 GeV and corresponding to an integrated luminosity of $652.1~{\rm pb^{-1}}$, we present an energy-dependent measurement of the transverse polarization, relative phase and modulus ratio of the electromagnetic form factors of the $Σ^+$ hyperon in the $e^+e^- \to Σ^+ \barΣ^-$ reaction. The… ▽ More

    Submitted 7 August, 2024; v1 submitted 6 August, 2024; originally announced August 2024.

    Comments: 21 pages, 2 tables, 5 figures

  41. arXiv:2408.02940  [pdf, other

    hep-ex

    Observation of $η_{c}(2S) \to K^{+}K^{-}η$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

    Abstract: By analyzing $(27.12 \pm 0.14)\times10^{8}$ $ψ(3686)$ events accumulated with the BESIII detector, the decay $η_{c}(2S) \to K^{+} K^{-} η$ is observed for the first time with a significance of $6.2σ$ after considering systematic uncertainties. The product of the branching fractions of $ψ(3686) \to γη_{c}(2S)$ and $η_{c}(2S) \to K^{+} K^{-} η$ is measured to be… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

  42. arXiv:2408.01922  [pdf, ps, other

    math.KT

    Intersection of complete cotorsion pairs

    Authors: Qikai Wang, Haiyan Zhu

    Abstract: Given two (hereditary) complete cotorsion pairs $(\mathcal{X}_1,\mathcal{Y}_1)$ and $(\mathcal{X}_2,\mathcal{Y}_2)$ in an exact category with $\mathcal{X}_1\subseteq \mathcal{Y}_2$, we prove that $\left({\rm Smd}\langle \mathcal{X}_1,\mathcal{X}_2 \rangle,\mathcal{Y}_1\cap \mathcal{Y}_2\right)$ is also a (hereditary) complete cotorsion pair, where… ▽ More

    Submitted 4 August, 2024; originally announced August 2024.

    MSC Class: 18G25; 18G15; 16E30; 16E10

  43. arXiv:2408.01800  [pdf, other

    cs.CV

    MiniCPM-V: A GPT-4V Level MLLM on Your Phone

    Authors: Yuan Yao, Tianyu Yu, Ao Zhang, Chongyi Wang, Junbo Cui, Hongji Zhu, Tianchi Cai, Haoyu Li, Weilin Zhao, Zhihui He, Qianyu Chen, Huarong Zhou, Zhensheng Zou, Haoye Zhang, Shengding Hu, Zhi Zheng, Jie Zhou, Jie Cai, Xu Han, Guoyang Zeng, Dahai Li, Zhiyuan Liu, Maosong Sun

    Abstract: The recent surge of Multimodal Large Language Models (MLLMs) has fundamentally reshaped the landscape of AI research and industry, shedding light on a promising path toward the next AI milestone. However, significant challenges remain preventing MLLMs from being practical in real-world applications. The most notable challenge comes from the huge cost of running an MLLM with a massive number of par… ▽ More

    Submitted 3 August, 2024; originally announced August 2024.

    Comments: preprint

  44. arXiv:2408.01723  [pdf, other

    cs.CV cs.IR

    A Novel Evaluation Framework for Image2Text Generation

    Authors: Jia-Hong Huang, Hongyi Zhu, Yixian Shen, Stevan Rudinac, Alessio M. Pacces, Evangelos Kanoulas

    Abstract: Evaluating the quality of automatically generated image descriptions is challenging, requiring metrics that capture various aspects such as grammaticality, coverage, correctness, and truthfulness. While human evaluation offers valuable insights, its cost and time-consuming nature pose limitations. Existing automated metrics like BLEU, ROUGE, METEOR, and CIDEr aim to bridge this gap but often show… ▽ More

    Submitted 3 August, 2024; originally announced August 2024.

    Comments: The paper has been accepted for presentation at the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, specifically in the Large Language Model for Evaluation in IR (LLM4Eval) Workshop in 2024

  45. arXiv:2408.01597  [pdf, other

    hep-ex

    Search for $X(3872)\toπ^0π^0χ_{c1,2}$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

    Abstract: Using 10.1 fb$^{-1}$ of $e^+e^-$ collision data collected by the BESIII detector with center-of-mass energies between 4.15 GeV and 4.30 GeV, we search for the decays $X(3872)\toπ^0π^0χ_{c1,2}$, where the $X(3872)$ is produced in $e^+e^-\toγX(3872)$. No evidence above $3σ$ is found for either decay. Upper limits at the $90\%$ C.L. on the branching fractions of $X(3872)\toπ^0π^0χ_{c1,2}$ normalized… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

    Comments: 12 pages, 4 figures, 6 tables

  46. arXiv:2408.01323  [pdf, other

    cs.CL

    FANNO: Augmenting High-Quality Instruction Data with Open-Sourced LLMs Only

    Authors: He Zhu, Junyou Su, Tianle Lun, Yicheng Tao, Wenjia Zhang, Zipei Fan, Guanhua Chen

    Abstract: Instruction fine-tuning stands as a crucial advancement in leveraging large language models (LLMs) for enhanced task performance. However, the annotation of instruction datasets has traditionally been expensive and laborious, often relying on manual annotations or costly API calls of proprietary LLMs. To address these challenges, we introduce FANNO, a fully autonomous, open-sourced framework that… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

  47. arXiv:2408.01055  [pdf, other

    cs.SE cs.AI cs.CR

    LLM as Runtime Error Handler: A Promising Pathway to Adaptive Self-Healing of Software Systems

    Authors: Zhensu Sun, Haotian Zhu, Bowen Xu, Xiaoning Du, Li Li, David Lo

    Abstract: Unanticipated runtime errors, lacking predefined handlers, can abruptly terminate execution and lead to severe consequences, such as data loss or system crashes. Despite extensive efforts to identify potential errors during the development phase, such unanticipated errors remain a challenge to to be entirely eliminated, making the runtime mitigation measurements still indispensable to minimize the… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

  48. arXiv:2408.01038  [pdf, other

    cs.CL

    UNER: A Unified Prediction Head for Named Entity Recognition in Visually-rich Documents

    Authors: Yi Tu, Chong Zhang, Ya Guo, Huan Chen, Jinyang Tang, Huijia Zhu, Qi Zhang

    Abstract: The recognition of named entities in visually-rich documents (VrD-NER) plays a critical role in various real-world scenarios and applications. However, the research in VrD-NER faces three major challenges: complex document layouts, incorrect reading orders, and unsuitable task formulations. To address these challenges, we propose a query-aware entity extraction head, namely UNER, to collaborate wi… ▽ More

    Submitted 11 August, 2024; v1 submitted 2 August, 2024; originally announced August 2024.

    Comments: accepted by ACM Multimedia 2024

  49. arXiv:2408.00744  [pdf, other

    cs.CV

    Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation

    Authors: Siyu Jiao, Hongguang Zhu, Jiannan Huang, Yao Zhao, Yunchao Wei, Humphrey Shi

    Abstract: Pre-trained vision-language models, e.g. CLIP, have been increasingly used to address the challenging Open-Vocabulary Segmentation (OVS) task, benefiting from their well-aligned vision-text embedding space. Typical solutions involve either freezing CLIP during training to unilaterally maintain its zero-shot capability, or fine-tuning CLIP vision encoder to achieve perceptual sensitivity to local r… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: ECCV 2024

  50. arXiv:2408.00495  [pdf, other

    hep-ex

    Partial wave analysis of $ψ(3686)\toΛ\barΣ^0π^0+c.c.$

    Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (644 additional authors not shown)

    Abstract: Based on a sample of $(2712.4\pm14.3)\times10^6\;ψ(3686)$ events collected with the BESIII detector, a partial wave analysis of the decay $ψ(3686)\toΛ\barΣ^0π^0+c.c.$ is performed to investigate $Λ^*$ and $Σ^*$ resonances in the $π^0\barΣ^0$ and $π^0Λ$ invariant mass distributions. Significant contributions are found from the $Λ(1405)$, $Λ(1520)$, $Λ(1600)$, $Λ(1670)$, $Λ(1690)$, $Λ(1800)$,… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: 25 pages, 8 tables, 6 figures