Search | arXiv e-print repository

Search for $h_c \to π^+π^-J/ψ$ via $ψ(3686)\to π^0h_c$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere, A. Brueggemann , et al. (653 additional authors not shown)

Abstract: Using $(2712.4 \pm 14.3) \times 10^6~ψ$(3686) events collected with the BESIII detector operating at the BEPCII collider, we search for the hadronic transition $h_c \to π^+π^-J/ψ$ via $ψ(3686)\to π^0 h_c$. No significant signal is observed. We set the most stringent upper limits to date on the branching fractions $\mathcal{B}(ψ(3686)\to π^0 h_c)\times\mathcal{B}(h_c\toπ^+π^-J/ψ)$ and… ▽ More Using $(2712.4 \pm 14.3) \times 10^6~ψ$(3686) events collected with the BESIII detector operating at the BEPCII collider, we search for the hadronic transition $h_c \to π^+π^-J/ψ$ via $ψ(3686)\to π^0 h_c$. No significant signal is observed. We set the most stringent upper limits to date on the branching fractions $\mathcal{B}(ψ(3686)\to π^0 h_c)\times\mathcal{B}(h_c\toπ^+π^-J/ψ)$ and $\mathcal{B}(h_c \to π^+π^-J/ψ)$ at the 90$\%$ confidence level, which are determined to be $6.7\times 10^{-7}$ and $9.4 \times10^{-4}$, respectively. △ Less

Submitted 30 August, 2024; originally announced August 2024.

arXiv:2408.16997 [pdf, other]

doi 10.1103/PhysRevLett.133.090402

Experimental Verification of Demon-Involved Fluctuation Theorems

Authors: L. -L. Yan, J. -T. Bu, Q. Zeng, K. Zhang, K. -F. Cui, F. Zhou, S. -L. Su, L. Chen, J. Wang, Gang Chen, M. Feng

Abstract: The limit of energy saving in the control of small systems has recently attracted much interest due to the concept refinement of the Maxwell demon. Inspired by a newly proposed set of fluctuation theorems, we report the first experimental verification of these equalities and inequalities in a ultracold 40Ca ion system, confirming the intrinsic nonequilibrium in the system due to involvement of the… ▽ More The limit of energy saving in the control of small systems has recently attracted much interest due to the concept refinement of the Maxwell demon. Inspired by a newly proposed set of fluctuation theorems, we report the first experimental verification of these equalities and inequalities in a ultracold 40Ca ion system, confirming the intrinsic nonequilibrium in the system due to involvement of the demon. Based on elaborately designed demon-involved control protocols, such as the Szilard engine protocol, we provide experimentally quantitative evidence of the dissipative information, and observe tighter bounds of both the extracted work and the demon's efficacy than the limits predicted by the Sagawa-Ueda theorem. Our results substantiate a close connection between the physical nature of information and nonequilibrium processes at the microscale, which help further understanding the thermodynamic characteristics of information and the optimal design of nanoscale and smaller systems. △ Less

Submitted 30 August, 2024; originally announced August 2024.

Journal ref: Physical Review Letters 133, 090402 (2024)

arXiv:2408.16654 [pdf, other]

Measurement of the Decay $Ξ^{0}\toΛγ$ with Entangled $Ξ^{0}\barΞ^{0}$ Pairs

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

Abstract: In this Letter, a systematic study of the weak radiative hyperon decay $Ξ^{0}\toΛγ$ at an electron-positron collider using entangled $Ξ^{0}\barΞ^{0}$ pair events is presented. The absolute branching fraction for this decay has been measured for the first time, and is $\left(1.347 \pm 0.066_{\mathrm stat.}\pm0.054_{\mathrm syst.}\right)\times 10^{-3}$. The decay asymmetry parameter, which character… ▽ More In this Letter, a systematic study of the weak radiative hyperon decay $Ξ^{0}\toΛγ$ at an electron-positron collider using entangled $Ξ^{0}\barΞ^{0}$ pair events is presented. The absolute branching fraction for this decay has been measured for the first time, and is $\left(1.347 \pm 0.066_{\mathrm stat.}\pm0.054_{\mathrm syst.}\right)\times 10^{-3}$. The decay asymmetry parameter, which characterizes the effect of parity violation in the decay, is determined to be $-0.741 \pm 0.062_{\mathrm stat.}\pm 0.019_{\mathrm syst.}$. The obtained results are consistent with the world average values within the uncertainties, offering valuable insights into the underlying mechanism governing the weak radiative hyperon decays. The charge conjugation parity ($CP$) symmetries of branching fraction and decay asymmetry parameter in the decay are also studied. No statistically significant violation of charge conjugation parity symmetry is observed. △ Less

Submitted 29 August, 2024; v1 submitted 29 August, 2024; originally announced August 2024.

Comments: 10 pages, 3 figures

arXiv:2408.16279 [pdf, ps, other]

Model-independent determination of the strong-phase difference between $D^0$ and $\bar{D}^0 \to π^+π^-π^+π^-$ decays

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (647 additional authors not shown)

Abstract: Measurements of the strong-phase difference between $D^0$ and $\bar{D}^0\toπ^+π^-π^+π^-$ are performed in bins of phase space. The study exploits a sample of quantum-correlated $D\bar{D}$ mesons collected by the BESIII experiment in $e^+e^-$ collisions at a center-of-mass energy of 3.773~GeV, corresponding to an integrated luminosity of 2.93~fb$^{-1}$. Here, $D$ denotes a neutral charm meson in a… ▽ More Measurements of the strong-phase difference between $D^0$ and $\bar{D}^0\toπ^+π^-π^+π^-$ are performed in bins of phase space. The study exploits a sample of quantum-correlated $D\bar{D}$ mesons collected by the BESIII experiment in $e^+e^-$ collisions at a center-of-mass energy of 3.773~GeV, corresponding to an integrated luminosity of 2.93~fb$^{-1}$. Here, $D$ denotes a neutral charm meson in a superposition of flavor eigenstates. The reported results are valuable for measurements of the $C\!P$-violating phase $γ$ (also denoted $φ_3$) in $B^\pm \to DK^\pm$, $D \to π^+π^-π^+π^-$ decays, and the binning schemes are designed to provide good statistical sensitivity to this parameter. The expected uncertainty on $γ$ arising from the precision of the strong-phase measurements, when applied to very large samples of $B$-meson decays, is around $1.5^\circ$ or $2^\circ$, depending on the binning scheme. The binned strong-phase parameters are combined to give a value of $F_+^{4π} = 0.746 \pm 0.010 \pm 0.004$ for the $C\!P$-even fraction of $D^0 \to π^+π^-π^+π^-$ decays, which is around 30\% more precise than the previous best measurement of this quantity. △ Less

Submitted 29 August, 2024; originally announced August 2024.

arXiv:2408.15160 [pdf, other]

Flavor Nernst effects in quantum paramagnets

Authors: Bowen Lu, Bowen Ma, Yue Yu, Gang Chen

Abstract: Recent advances in spin transport research have highlighted the potential of quantum paramagnets as platforms for exploring novel phenomena and developing next-generation technologies. In this paper, we investigate the flavor Nernst effect (FNE) in quantum paramagnets, focusing on the Hall-type thermal spin transport of crystal electric field (CEF) excitations with spin-orbit couplings. As a proof… ▽ More Recent advances in spin transport research have highlighted the potential of quantum paramagnets as platforms for exploring novel phenomena and developing next-generation technologies. In this paper, we investigate the flavor Nernst effect (FNE) in quantum paramagnets, focusing on the Hall-type thermal spin transport of crystal electric field (CEF) excitations with spin-orbit couplings. As a proof of principle, we investigate the quantum paramagnetic ground state in an effective spin-1 Hamiltonian with Dzyaloshinskii-Moriya interactions and a large hard-axis anisotropy. We employ linear flavor-wave theory to analyze the low-energy excitations, and obtain the flavor Nernst coefficients from the linear response theory. We demonstrate the FNE in a 2D pyrochlore thin film with an all-in-all-out Ising axis configuration, and investigate their dependence on temperature, anisotropy, DM interaction, and external fields. Our results reveal the connection between the FNE and the Berry curvature of the CEF excitations, suggesting potential applications in manipulating thermal spin currents and exploring topological spin transport phenomena in quantum paramagnets. △ Less

Submitted 27 August, 2024; originally announced August 2024.

Comments: 12 pages, 10 figures

arXiv:2408.14158 [pdf, other]

Fire-Flyer AI-HPC: A Cost-Effective Software-Hardware Co-Design for Deep Learning

Authors: Wei An, Xiao Bi, Guanting Chen, Shanhuang Chen, Chengqi Deng, Honghui Ding, Kai Dong, Qiushi Du, Wenjun Gao, Kang Guan, Jianzhong Guo, Yongqiang Guo, Zhe Fu, Ying He, Panpan Huang, Jiashi Li, Wenfeng Liang, Xiaodong Liu, Xin Liu, Yiyuan Liu, Yuxuan Liu, Shanghao Lu, Xuan Lu, Xiaotao Nie, Tian Pei , et al. (27 additional authors not shown)

Abstract: The rapid progress in Deep Learning (DL) and Large Language Models (LLMs) has exponentially increased demands of computational power and bandwidth. This, combined with the high costs of faster computing chips and interconnects, has significantly inflated High Performance Computing (HPC) construction costs. To address these challenges, we introduce the Fire-Flyer AI-HPC architecture, a synergistic… ▽ More The rapid progress in Deep Learning (DL) and Large Language Models (LLMs) has exponentially increased demands of computational power and bandwidth. This, combined with the high costs of faster computing chips and interconnects, has significantly inflated High Performance Computing (HPC) construction costs. To address these challenges, we introduce the Fire-Flyer AI-HPC architecture, a synergistic hardware-software co-design framework and its best practices. For DL training, we deployed the Fire-Flyer 2 with 10,000 PCIe A100 GPUs, achieved performance approximating the DGX-A100 while reducing costs by half and energy consumption by 40%. We specifically engineered HFReduce to accelerate allreduce communication and implemented numerous measures to keep our Computation-Storage Integrated Network congestion-free. Through our software stack, including HaiScale, 3FS, and HAI-Platform, we achieved substantial scalability by overlapping computation and communication. Our system-oriented experience from DL training provides valuable insights to drive future advancements in AI-HPC. △ Less

Submitted 26 August, 2024; originally announced August 2024.

Comments: This is the preprint version of the paper accepted for presentation at the 2024 International Conference for High Performance Computing, Networking, Storage, and Analysis (SC'24). \c{opyright} 2024 IEEE. Personal use of this material is permitted. For other uses, permission from IEEE must be obtained. Please refer to IEEE Xplore for the final published version

arXiv:2408.13716 [pdf, other]

FreqINR: Frequency Consistency for Implicit Neural Representation with Adaptive DCT Frequency Loss

Authors: Meiyi Wei, Liu Xie, Ying Sun, Gang Chen

Abstract: Recent advancements in local Implicit Neural Representation (INR) demonstrate its exceptional capability in handling images at various resolutions. However, frequency discrepancies between high-resolution (HR) and ground-truth images, especially at larger scales, result in significant artifacts and blurring in HR images. This paper introduces Frequency Consistency for Implicit Neural Representatio… ▽ More Recent advancements in local Implicit Neural Representation (INR) demonstrate its exceptional capability in handling images at various resolutions. However, frequency discrepancies between high-resolution (HR) and ground-truth images, especially at larger scales, result in significant artifacts and blurring in HR images. This paper introduces Frequency Consistency for Implicit Neural Representation (FreqINR), an innovative Arbitrary-scale Super-resolution method aimed at enhancing detailed textures by ensuring spectral consistency throughout both training and inference. During training, we employ Adaptive Discrete Cosine Transform Frequency Loss (ADFL) to minimize the frequency gap between HR and ground-truth images, utilizing 2-Dimensional DCT bases and focusing dynamically on challenging frequencies. During inference, we extend the receptive field to preserve spectral coherence between low-resolution (LR) and ground-truth images, which is crucial for the model to generate high-frequency details from LR counterparts. Experimental results show that FreqINR, as a lightweight approach, achieves state-of-the-art performance compared to existing Arbitrary-scale Super-resolution methods and offers notable improvements in computational efficiency. The code for our method will be made publicly available. △ Less

Submitted 24 August, 2024; originally announced August 2024.

Comments: 9 pages, 7 figures

arXiv:2408.13385 [pdf, other]

MICM: Rethinking Unsupervised Pretraining for Enhanced Few-shot Learning

Authors: Zhenyu Zhang, Guangyao Chen, Yixiong Zou, Zhimeng Huang, Yuhua Li, Ruixuan Li

Abstract: Humans exhibit a remarkable ability to learn quickly from a limited number of labeled samples, a capability that starkly contrasts with that of current machine learning systems. Unsupervised Few-Shot Learning (U-FSL) seeks to bridge this divide by reducing reliance on annotated datasets during initial training phases. In this work, we first quantitatively assess the impacts of Masked Image Modelin… ▽ More Humans exhibit a remarkable ability to learn quickly from a limited number of labeled samples, a capability that starkly contrasts with that of current machine learning systems. Unsupervised Few-Shot Learning (U-FSL) seeks to bridge this divide by reducing reliance on annotated datasets during initial training phases. In this work, we first quantitatively assess the impacts of Masked Image Modeling (MIM) and Contrastive Learning (CL) on few-shot learning tasks. Our findings highlight the respective limitations of MIM and CL in terms of discriminative and generalization abilities, which contribute to their underperformance in U-FSL contexts. To address these trade-offs between generalization and discriminability in unsupervised pretraining, we introduce a novel paradigm named Masked Image Contrastive Modeling (MICM). MICM creatively combines the targeted object learning strength of CL with the generalized visual feature learning capability of MIM, significantly enhancing its efficacy in downstream few-shot learning inference. Extensive experimental analyses confirm the advantages of MICM, demonstrating significant improvements in both generalization and discrimination capabilities for few-shot learning. Our comprehensive quantitative evaluations further substantiate the superiority of MICM, showing that our two-stage U-FSL framework based on MICM markedly outperforms existing leading baselines. △ Less

Submitted 23 August, 2024; originally announced August 2024.

Comments: ACMMM 2024 (Oral)

arXiv:2408.13373 [pdf, other]

Learning Unknowns from Unknowns: Diversified Negative Prototypes Generator for Few-Shot Open-Set Recognition

Authors: Zhenyu Zhang, Guangyao Chen, Yixiong Zou, Yuhua Li, Ruixuan Li

Abstract: Few-shot open-set recognition (FSOR) is a challenging task that requires a model to recognize known classes and identify unknown classes with limited labeled data. Existing approaches, particularly Negative-Prototype-Based methods, generate negative prototypes based solely on known class data. However, as the unknown space is infinite while the known space is limited, these methods suffer from lim… ▽ More Few-shot open-set recognition (FSOR) is a challenging task that requires a model to recognize known classes and identify unknown classes with limited labeled data. Existing approaches, particularly Negative-Prototype-Based methods, generate negative prototypes based solely on known class data. However, as the unknown space is infinite while the known space is limited, these methods suffer from limited representation capability. To address this limitation, we propose a novel approach, termed \textbf{D}iversified \textbf{N}egative \textbf{P}rototypes \textbf{G}enerator (DNPG), which adopts the principle of "learning unknowns from unknowns." Our method leverages the unknown space information learned from base classes to generate more representative negative prototypes for novel classes. During the pre-training phase, we learn the unknown space representation of the base classes. This representation, along with inter-class relationships, is then utilized in the meta-learning process to construct negative prototypes for novel classes. To prevent prototype collapse and ensure adaptability to varying data compositions, we introduce the Swap Alignment (SA) module. Our DNPG model, by learning from the unknown space, generates negative prototypes that cover a broader unknown space, thereby achieving state-of-the-art performance on three standard FSOR datasets. △ Less

Submitted 23 August, 2024; originally announced August 2024.

Comments: ACMMM 2024

arXiv:2408.13358 [pdf, other]

Shape-Preserving Generation of Food Images for Automatic Dietary Assessment

Authors: Guangzong Chen, Zhi-Hong Mao, Mingui Sun, Kangni Liu, Wenyan Jia

Abstract: Traditional dietary assessment methods heavily rely on self-reporting, which is time-consuming and prone to bias. Recent advancements in Artificial Intelligence (AI) have revealed new possibilities for dietary assessment, particularly through analysis of food images. Recognizing foods and estimating food volumes from images are known as the key procedures for automatic dietary assessment. However,… ▽ More Traditional dietary assessment methods heavily rely on self-reporting, which is time-consuming and prone to bias. Recent advancements in Artificial Intelligence (AI) have revealed new possibilities for dietary assessment, particularly through analysis of food images. Recognizing foods and estimating food volumes from images are known as the key procedures for automatic dietary assessment. However, both procedures required large amounts of training images labeled with food names and volumes, which are currently unavailable. Alternatively, recent studies have indicated that training images can be artificially generated using Generative Adversarial Networks (GANs). Nonetheless, convenient generation of large amounts of food images with known volumes remain a challenge with the existing techniques. In this work, we present a simple GAN-based neural network architecture for conditional food image generation. The shapes of the food and container in the generated images closely resemble those in the reference input image. Our experiments demonstrate the realism of the generated images and shape-preserving capabilities of the proposed framework. △ Less

Submitted 23 August, 2024; originally announced August 2024.

arXiv:2408.12775 [pdf, other]

Intelligent OPC Engineer Assistant for Semiconductor Manufacturing

Authors: Guojin Chen, Haoyu Yang, Bei Yu, Haoxing Ren

Abstract: Advancements in chip design and manufacturing have enabled the processing of complex tasks such as deep learning and natural language processing, paving the way for the development of artificial general intelligence (AGI). AI, on the other hand, can be leveraged to innovate and streamline semiconductor technology from planning and implementation to manufacturing. In this paper, we present \textit{… ▽ More Advancements in chip design and manufacturing have enabled the processing of complex tasks such as deep learning and natural language processing, paving the way for the development of artificial general intelligence (AGI). AI, on the other hand, can be leveraged to innovate and streamline semiconductor technology from planning and implementation to manufacturing. In this paper, we present \textit{Intelligent OPC Engineer Assistant}, an AI/LLM-powered methodology designed to solve the core manufacturing-aware optimization problem known as optical proximity correction (OPC). The methodology involves a reinforcement learning-based OPC recipe search and a customized multi-modal agent system for recipe summarization. Experiments demonstrate that our methodology can efficiently build OPC recipes on various chip designs with specially handled design topologies, a task that typically requires the full-time effort of OPC engineers with years of experience. △ Less

Submitted 27 August, 2024; v1 submitted 22 August, 2024; originally announced August 2024.

arXiv:2408.11840 [pdf]

Joint PET-MRI Reconstruction with Diffusion Stochastic Differential Model

Authors: Taofeng Xie, Zhuoxu Cui, Congcong Liu, Chen Luo, Huayu Wang, Yuanzhi Zhang, Xuemei Wang, Yihang Zhou, Qiyu Jin, Guoqing Chen, Dong Liang, Haifeng Wang

Abstract: PET suffers from a low signal-to-noise ratio. Meanwhile, the k-space data acquisition process in MRI is time-consuming by PET-MRI systems. We aim to accelerate MRI and improve PET image quality. This paper proposed a novel joint reconstruction model by diffusion stochastic differential equations based on learning the joint probability distribution of PET and MRI. Compare the results underscore the… ▽ More PET suffers from a low signal-to-noise ratio. Meanwhile, the k-space data acquisition process in MRI is time-consuming by PET-MRI systems. We aim to accelerate MRI and improve PET image quality. This paper proposed a novel joint reconstruction model by diffusion stochastic differential equations based on learning the joint probability distribution of PET and MRI. Compare the results underscore the qualitative and quantitative improvements our model brings to PET and MRI reconstruction, surpassing the current state-of-the-art methodologies. Joint PET-MRI reconstruction is a challenge in the PET-MRI system. This studies focused on the relationship extends beyond edges. In this study, PET is generated from MRI by learning joint probability distribution as the relationship. △ Less

Submitted 7 August, 2024; originally announced August 2024.

Comments: Accepted as ISMRM 2024 Digital poster 6575. 04-09 May 2024 Singapore

Journal ref: ISMRM 2024 Digital poster 6575

arXiv:2408.11838 [pdf]

Cold plasma with zirconia nanoparticles for lung cancer via TGF-\b{eta} signaling pathway

Authors: Yueye Huang, Rui Zhang, Xiao Chen, Fei Cao, Qiujie Fang, Qingnan Xu, Shicong Huang, Yufan Wang, Guojun Chen, Zhitong Chen

Abstract: Despite advancements in lung cancer therapy, the prognosis for advanced or metastatic patients remains poor, yet many patients eventually develop resistance to standard treatments leading to disease progression and poor survival. Here, we described a combination of CAP and nanoparticles (ZrO2 NPs (zirconium oxide nanoparticle) and 3Y-TZP NPs (3% mol Yttria Tetragonal Zirconia Polycrystal Nanoparti… ▽ More Despite advancements in lung cancer therapy, the prognosis for advanced or metastatic patients remains poor, yet many patients eventually develop resistance to standard treatments leading to disease progression and poor survival. Here, we described a combination of CAP and nanoparticles (ZrO2 NPs (zirconium oxide nanoparticle) and 3Y-TZP NPs (3% mol Yttria Tetragonal Zirconia Polycrystal Nanoparticle)) for lung cancer therapy. We found that ZrO2 NPs caused obvious damage to the inside of the lung cancer cells. CAP and ZrO2 NPs mainly affected the mitochondria function, leading to a decrease in mitochondrial membrane potential and ATP levels, and causing endoplasmic reticulum stress and cell nucleus internal DNA damage, etc. CAP combined with ZrO2 NPs (CAP@ZrO2) induced lung cancer cell apoptosis by activating the TGF-\b{eta} pathway. CAP@ZrO2 offers a new therapy for the clinical treatment of lung cancer. △ Less

Submitted 6 August, 2024; originally announced August 2024.

Comments: 48 pages

arXiv:2408.10537 [pdf, other]

Subspace Prototype Guidance for Mitigating Class Imbalance in Point Cloud Semantic Segmentation

Authors: Jiawei Han, Kaiqi Liu, Wei Li, Guangzhi Chen

Abstract: Point cloud semantic segmentation can significantly enhance the perception of an intelligent agent. Nevertheless, the discriminative capability of the segmentation network is influenced by the quantity of samples available for different categories. To mitigate the cognitive bias induced by class imbalance, this paper introduces a novel method, namely subspace prototype guidance (\textbf{SPG}), to… ▽ More Point cloud semantic segmentation can significantly enhance the perception of an intelligent agent. Nevertheless, the discriminative capability of the segmentation network is influenced by the quantity of samples available for different categories. To mitigate the cognitive bias induced by class imbalance, this paper introduces a novel method, namely subspace prototype guidance (\textbf{SPG}), to guide the training of segmentation network. Specifically, the point cloud is initially separated into independent point sets by category to provide initial conditions for the generation of feature subspaces. The auxiliary branch which consists of an encoder and a projection head maps these point sets into separate feature subspaces. Subsequently, the feature prototypes which are extracted from the current separate subspaces and then combined with prototypes of historical subspaces guide the feature space of main branch to enhance the discriminability of features of minority categories. The prototypes derived from the feature space of main branch are also employed to guide the training of the auxiliary branch, forming a supervisory loop to maintain consistent convergence of the entire network. The experiments conducted on the large public benchmarks (i.e. S3DIS, ScanNet v2, ScanNet200, Toronto-3D) and collected real-world data illustrate that the proposed method significantly improves the segmentation performance and surpasses the state-of-the-art method. The code is available at \url{https://github.com/Javion11/PointLiBR.git}. △ Less

Submitted 20 August, 2024; originally announced August 2024.

arXiv:2408.09882 [pdf, other]

GINO-Q: Learning an Asymptotically Optimal Index Policy for Restless Multi-armed Bandits

Authors: Gongpu Chen, Soung Chang Liew, Deniz Gunduz

Abstract: The restless multi-armed bandit (RMAB) framework is a popular model with applications across a wide variety of fields. However, its solution is hindered by the exponentially growing state space (with respect to the number of arms) and the combinatorial action space, making traditional reinforcement learning methods infeasible for large-scale instances. In this paper, we propose GINO-Q, a three-tim… ▽ More The restless multi-armed bandit (RMAB) framework is a popular model with applications across a wide variety of fields. However, its solution is hindered by the exponentially growing state space (with respect to the number of arms) and the combinatorial action space, making traditional reinforcement learning methods infeasible for large-scale instances. In this paper, we propose GINO-Q, a three-timescale stochastic approximation algorithm designed to learn an asymptotically optimal index policy for RMABs. GINO-Q mitigates the curse of dimensionality by decomposing the RMAB into a series of subproblems, each with the same dimension as a single arm, ensuring that complexity increases linearly with the number of arms. Unlike recently developed Whittle-index-based algorithms, GINO-Q does not require RMABs to be indexable, enhancing its flexibility and applicability. Our experimental results demonstrate that GINO-Q consistently learns near-optimal policies, even for non-indexable RMABs where Whittle-index-based algorithms perform poorly, and it converges significantly faster than existing baselines. △ Less

Submitted 19 August, 2024; originally announced August 2024.

Comments: 9 pages, 11 figures

arXiv:2408.09815 [pdf, other]

A Population-to-individual Tuning Framework for Adapting Pretrained LM to On-device User Intent Prediction

Authors: Jiahui Gong, Jingtao Ding, Fanjin Meng, Guilong Chen, Hong Chen, Shen Zhao, Haisheng Lu, Yong Li

Abstract: Mobile devices, especially smartphones, can support rich functions and have developed into indispensable tools in daily life. With the rise of generative AI services, smartphones can potentially transform into personalized assistants, anticipating user needs and scheduling services accordingly. Predicting user intents on smartphones, and reflecting anticipated activities based on past interactions… ▽ More Mobile devices, especially smartphones, can support rich functions and have developed into indispensable tools in daily life. With the rise of generative AI services, smartphones can potentially transform into personalized assistants, anticipating user needs and scheduling services accordingly. Predicting user intents on smartphones, and reflecting anticipated activities based on past interactions and context, remains a pivotal step towards this vision. Existing research predominantly focuses on specific domains, neglecting the challenge of modeling diverse event sequences across dynamic contexts. Leveraging pre-trained language models (PLMs) offers a promising avenue, yet adapting PLMs to on-device user intent prediction presents significant challenges. To address these challenges, we propose PITuning, a Population-to-Individual Tuning framework. PITuning enhances common pattern extraction through dynamic event-to-intent transition modeling and addresses long-tailed preferences via adaptive unlearning strategies. Experimental results on real-world datasets demonstrate PITuning's superior intent prediction performance, highlighting its ability to capture long-tailed preferences and its practicality for on-device prediction scenarios. △ Less

Submitted 19 August, 2024; originally announced August 2024.

Comments: accepted by KDD 2024

arXiv:2408.09675 [pdf, other]

Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey

Authors: Ruiqi Zhang, Jing Hou, Florian Walter, Shangding Gu, Jiayi Guan, Florian Röhrbein, Yali Du, Panpan Cai, Guang Chen, Alois Knoll

Abstract: Reinforcement Learning (RL) is a potent tool for sequential decision-making and has achieved performance surpassing human capabilities across many challenging real-world tasks. As the extension of RL in the multi-agent system domain, multi-agent RL (MARL) not only need to learn the control policy but also requires consideration regarding interactions with all other agents in the environment, mutua… ▽ More Reinforcement Learning (RL) is a potent tool for sequential decision-making and has achieved performance surpassing human capabilities across many challenging real-world tasks. As the extension of RL in the multi-agent system domain, multi-agent RL (MARL) not only need to learn the control policy but also requires consideration regarding interactions with all other agents in the environment, mutual influences among different system components, and the distribution of computational resources. This augments the complexity of algorithmic design and poses higher requirements on computational resources. Simultaneously, simulators are crucial to obtain realistic data, which is the fundamentals of RL. In this paper, we first propose a series of metrics of simulators and summarize the features of existing benchmarks. Second, to ease comprehension, we recall the foundational knowledge and then synthesize the recently advanced studies of MARL-related autonomous driving and intelligent transportation systems. Specifically, we examine their environmental modeling, state representation, perception units, and algorithm design. Conclusively, we discuss open challenges as well as prospects and opportunities. We hope this paper can help the researchers integrate MARL technologies and trigger more insightful ideas toward the intelligent and autonomous driving. △ Less

Submitted 18 August, 2024; originally announced August 2024.

Comments: 23 pages, 6 figures and 2 tables. Submitted to IEEE Journal

arXiv:2408.09459 [pdf, other]

WPN: An Unlearning Method Based on N-pair Contrastive Learning in Language Models

Authors: Guitao Chen, Yunshen Wang, Hongye Sun, Guang Chen

Abstract: Generative language models (LMs) offer numerous advantages but may produce inappropriate or harmful outputs due to the harmful knowledge acquired during pre-training. This knowledge often manifests as undesirable correspondences, such as "harmful prompts" leading to "harmful outputs," which our research aims to mitigate through unlearning techniques.However, existing unlearning methods based on gr… ▽ More Generative language models (LMs) offer numerous advantages but may produce inappropriate or harmful outputs due to the harmful knowledge acquired during pre-training. This knowledge often manifests as undesirable correspondences, such as "harmful prompts" leading to "harmful outputs," which our research aims to mitigate through unlearning techniques.However, existing unlearning methods based on gradient ascent can significantly impair the performance of LMs. To address this issue, we propose a novel approach called Weighted Positional N-pair (WPN) Learning, which leverages position-weighted mean pooling within an n-pair contrastive learning framework. WPN is designed to modify the output distribution of LMs by eliminating specific harmful outputs (e.g., replacing toxic responses with neutral ones), thereby transforming the model's behavior from "harmful prompt-harmful output" to "harmful prompt-harmless response".Experiments on OPT and GPT-NEO LMs show that WPN effectively reduces the proportion of harmful responses, achieving a harmless rate of up to 95.8\% while maintaining stable performance on nine common benchmarks (with less than 2\% degradation on average). Moreover, we provide empirical evidence to demonstrate WPN's ability to weaken the harmful correspondences in terms of generalizability and robustness, as evaluated on out-of-distribution test sets and under adversarial attacks. △ Less

Submitted 18 August, 2024; originally announced August 2024.

Comments: ECAI 2024

arXiv:2408.09388 [pdf]

Crystal growth and characterization of Fe$_{1+δ}$Se$_{1-x}$Te$_x$ (0.5 $\leq$ $x$ $\leq$ 1) from LiCl/KCl flux

Authors: Qiaoyu Wang, Kexin Bi, Lewei Chen, Yunqing Shi, Junkun Yi, Yadong Gu, Menghu Zhou, Binbin Ruan, Xingye Lu, Mingwei Ma, Genfu Chen, Zhian Ren

Abstract: An eutectic LiCl/KCl flux method in a horizontal configuration has been used to grow a series of homogeneous Fe$_{1+δ}$Se$_{1-x}$Te$_x$ single crystals of high quality with 0.5 $\leq$ $x$ $\leq$ 1. Compared with previously used melt-growth method, the stable crystallization process in LiCl/KCl flux below their peritectic temperatures results in better homogeneity and crystalline perfection identif… ▽ More An eutectic LiCl/KCl flux method in a horizontal configuration has been used to grow a series of homogeneous Fe$_{1+δ}$Se$_{1-x}$Te$_x$ single crystals of high quality with 0.5 $\leq$ $x$ $\leq$ 1. Compared with previously used melt-growth method, the stable crystallization process in LiCl/KCl flux below their peritectic temperatures results in better homogeneity and crystalline perfection identified by energy dispersive spectrometer and x-ray diffraction. The interstitial Fe value $δ$ remains small within 0.5 $\leq$ $x$ $\leq$ 0.85 where the superconducting temperature $T_C$ is not sensitive to the Te content with sharp superconducting transition widths $Δ$$T_C$ < 1 K and a maximum of $T_C$ = 14.3 K at $x$ = 0.61. The value $δ$ starts to increase quickly accompanied by a deviation of linear behavior of crystal lattice parameters as well as the broadening of $Δ$$T_C$ = 2.1 K at $x$ = 0.91, then suddenly rises up to $δ$ > 0.1 followed by the disappearance of superconductivity and emergence of antiferromagnetic order at x $\geq$ 0.96. We also observed a metallic to semiconducting transition in the normal state resistivity of Fe$_{1+δ}$Se$_{1-x}$Te$_x$ with increasing Te content which is related to a localized electronic state induced by the interstitial Fe. The interstitial Fe value $δ$ might be a key physical parameter to understand various properties of Fe$_{1+δ}$Se$_{1-x}$Te$_x$ system. △ Less

Submitted 18 August, 2024; originally announced August 2024.

arXiv:2408.09333 [pdf, other]

SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama

Authors: Jing Tang, Quanlu Jia, Yuqiang Xie, Zeyu Gong, Xiang Wen, Jiayi Zhang, Yalong Guo, Guibin Chen, Jiangping Yang

Abstract: Generating high-quality shooting scripts containing information such as scene and shot language is essential for short drama script generation. We collect 6,660 popular short drama episodes from the Internet, each with an average of 100 short episodes, and the total number of short episodes is about 80,000, with a total duration of about 2,000 hours and totaling 10 terabytes (TB). We perform keyfr… ▽ More Generating high-quality shooting scripts containing information such as scene and shot language is essential for short drama script generation. We collect 6,660 popular short drama episodes from the Internet, each with an average of 100 short episodes, and the total number of short episodes is about 80,000, with a total duration of about 2,000 hours and totaling 10 terabytes (TB). We perform keyframe extraction and annotation on each episode to obtain about 10,000,000 shooting scripts. We perform 100 script restorations on the extracted shooting scripts based on our self-developed large short drama generation model SkyReels. This leads to a dataset containing 1,000,000,000 pairs of scripts and shooting scripts for short dramas, called SkyScript-100M. We compare SkyScript-100M with the existing dataset in detail and demonstrate some deeper insights that can be achieved based on SkyScript-100M. Based on SkyScript-100M, researchers can achieve several deeper and more far-reaching script optimization goals, which may drive a paradigm shift in the entire field of text-to-video and significantly advance the field of short drama video generation. The data and code are available at https://github.com/vaew/SkyScript-100M. △ Less

Submitted 28 August, 2024; v1 submitted 17 August, 2024; originally announced August 2024.

Comments: 18 pages, 12 figures

arXiv:2408.08969 [pdf, other]

Differentiable Edge-based OPC

Authors: Guojin Chen, Haoyu Yang, Haoxing Ren, Bei Yu, David Z. Pan

Abstract: Optical proximity correction (OPC) is crucial for pushing the boundaries of semiconductor manufacturing and enabling the continued scaling of integrated circuits. While pixel-based OPC, termed as inverse lithography technology (ILT), has gained research interest due to its flexibility and precision. Its complexity and intricate features can lead to challenges in mask writing, increased defects, an… ▽ More Optical proximity correction (OPC) is crucial for pushing the boundaries of semiconductor manufacturing and enabling the continued scaling of integrated circuits. While pixel-based OPC, termed as inverse lithography technology (ILT), has gained research interest due to its flexibility and precision. Its complexity and intricate features can lead to challenges in mask writing, increased defects, and higher costs, hence hindering widespread industrial adoption. In this paper, we propose DiffOPC, a differentiable OPC framework that enjoys the virtue of both edge-based OPC and ILT. By employing a mask rule-aware gradient-based optimization approach, DiffOPC efficiently guides mask edge segment movement during mask optimization, minimizing wafer error by propagating true gradients from the cost function back to the mask edges. Our approach achieves lower edge placement error while reducing manufacturing cost by half compared to state-of-the-art OPC techniques, bridging the gap between the high accuracy of pixel-based OPC and the practicality required for industrial adoption, thus offering a promising solution for advanced semiconductor manufacturing. △ Less

Submitted 29 August, 2024; v1 submitted 16 August, 2024; originally announced August 2024.

Comments: Accepted by ICCAD24

arXiv:2408.08826 [pdf, other]

Search for the rare decay $J/ψ\to γD^0+c.c.$ at BESIII

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (642 additional authors not shown)

Abstract: Using $(10087\pm44)\times10^6J/ψ$ events collected with the BESIII detector, we search for the rare decay $J/ψ\to γD^0+c.c.$ for the first time. No obvious signal is observed and the upper limit on the branching fraction is determined to be ${\cal B}(J/ψ\to γD^{0}+c.c.)< 9.1 \times 10^{-8}$ at 90\% confidence level. Using $(10087\pm44)\times10^6J/ψ$ events collected with the BESIII detector, we search for the rare decay $J/ψ\to γD^0+c.c.$ for the first time. No obvious signal is observed and the upper limit on the branching fraction is determined to be ${\cal B}(J/ψ\to γD^{0}+c.c.)< 9.1 \times 10^{-8}$ at 90\% confidence level. △ Less

Submitted 16 August, 2024; originally announced August 2024.

arXiv:2408.08342 [pdf, other]

CT4D: Consistent Text-to-4D Generation with Animatable Meshes

Authors: Ce Chen, Shaoli Huang, Xuelin Chen, Guangyi Chen, Xiaoguang Han, Kun Zhang, Mingming Gong

Abstract: Text-to-4D generation has recently been demonstrated viable by integrating a 2D image diffusion model with a video diffusion model. However, existing models tend to produce results with inconsistent motions and geometric structures over time. To this end, we present a novel framework, coined CT4D, which directly operates on animatable meshes for generating consistent 4D content from arbitrary user… ▽ More Text-to-4D generation has recently been demonstrated viable by integrating a 2D image diffusion model with a video diffusion model. However, existing models tend to produce results with inconsistent motions and geometric structures over time. To this end, we present a novel framework, coined CT4D, which directly operates on animatable meshes for generating consistent 4D content from arbitrary user-supplied prompts. The primary challenges of our mesh-based framework involve stably generating a mesh with details that align with the text prompt while directly driving it and maintaining surface continuity. Our CT4D framework incorporates a unique Generate-Refine-Animate (GRA) algorithm to enhance the creation of text-aligned meshes. To improve surface continuity, we divide a mesh into several smaller regions and implement a uniform driving function within each area. Additionally, we constrain the animating stage with a rigidity regulation to ensure cross-region continuity. Our experimental results, both qualitative and quantitative, demonstrate that our CT4D framework surpasses existing text-to-4D techniques in maintaining interframe consistency and preserving global geometry. Furthermore, we showcase that this enhanced representation inherently possesses the capability for combinational 4D generation and texture editing. △ Less

Submitted 15 August, 2024; originally announced August 2024.

arXiv:2408.08089 [pdf, other]

AgentCourt: Simulating Court with Adversarial Evolvable Lawyer Agents

Authors: Guhong Chen, Liyang Fan, Zihan Gong, Nan Xie, Zixuan Li, Ziqiang Liu, Chengming Li, Qiang Qu, Shiwen Ni, Min Yang

Abstract: In this paper, we present a simulation system called AgentCourt that simulates the entire courtroom process. The judge, plaintiff's lawyer, defense lawyer, and other participants are autonomous agents driven by large language models (LLMs). Our core goal is to enable lawyer agents to learn how to argue a case, as well as improving their overall legal skills, through courtroom process simulation. T… ▽ More In this paper, we present a simulation system called AgentCourt that simulates the entire courtroom process. The judge, plaintiff's lawyer, defense lawyer, and other participants are autonomous agents driven by large language models (LLMs). Our core goal is to enable lawyer agents to learn how to argue a case, as well as improving their overall legal skills, through courtroom process simulation. To achieve this goal, we propose an adversarial evolutionary approach for the lawyer-agent. Since AgentCourt can simulate the occurrence and development of court hearings based on a knowledge base and LLM, the lawyer agents can continuously learn and accumulate experience from real court cases. The simulation experiments show that after two lawyer-agents have engaged in a thousand adversarial legal cases in AgentCourt (which can take a decade for real-world lawyers), compared to their pre-evolutionary state, the evolved lawyer agents exhibit consistent improvement in their ability to handle legal tasks. To enhance the credibility of our experimental results, we enlisted a panel of professional lawyers to evaluate our simulations. The evaluation indicates that the evolved lawyer agents exhibit notable advancements in responsiveness, as well as expertise and logical rigor. This work paves the way for advancing LLM-driven agent technology in legal scenarios. Code is available at https://github.com/relic-yuexi/AgentCourt. △ Less

Submitted 15 August, 2024; originally announced August 2024.

arXiv:2408.07703 [pdf, other]

Knowledge Distillation with Refined Logits

Authors: Wujie Sun, Defang Chen, Siwei Lyu, Genlang Chen, Chun Chen, Can Wang

Abstract: Recent research on knowledge distillation has increasingly focused on logit distillation because of its simplicity, effectiveness, and versatility in model compression. In this paper, we introduce Refined Logit Distillation (RLD) to address the limitations of current logit distillation methods. Our approach is motivated by the observation that even high-performing teacher models can make incorrect… ▽ More Recent research on knowledge distillation has increasingly focused on logit distillation because of its simplicity, effectiveness, and versatility in model compression. In this paper, we introduce Refined Logit Distillation (RLD) to address the limitations of current logit distillation methods. Our approach is motivated by the observation that even high-performing teacher models can make incorrect predictions, creating a conflict between the standard distillation loss and the cross-entropy loss. This conflict can undermine the consistency of the student model's learning objectives. Previous attempts to use labels to empirically correct teacher predictions may undermine the class correlation. In contrast, our RLD employs labeling information to dynamically refine teacher logits. In this way, our method can effectively eliminate misleading information from the teacher while preserving crucial class correlations, thus enhancing the value and efficiency of distilled knowledge. Experimental results on CIFAR-100 and ImageNet demonstrate its superiority over existing methods. The code is provided at \text{https://github.com/zju-SWJ/RLD}. △ Less

Submitted 19 August, 2024; v1 submitted 14 August, 2024; originally announced August 2024.

Comments: 11 pages, 7 figures

arXiv:2408.06877 [pdf, other]

The Initial Stages of a Generic Singularity for a 2D Pressureless Gas

Authors: Alberto Bressan, Geng Chen, Shoujun Huang

Abstract: We consider the Cauchy problem for the equations of pressureless gases in two space dimensions. For a generic set of smooth initial data (density and velocity), it is known that the solution loses regularity at a finite time $t_0$, where both the the density and the velocity gradient become unbounded. Aim of this paper is to provide an asymptotic description of the solution beyond the time of sing… ▽ More We consider the Cauchy problem for the equations of pressureless gases in two space dimensions. For a generic set of smooth initial data (density and velocity), it is known that the solution loses regularity at a finite time $t_0$, where both the the density and the velocity gradient become unbounded. Aim of this paper is to provide an asymptotic description of the solution beyond the time of singularity formation. For $t>t_0$ we show that a singular curve is formed, where the mass has positive density w.r.t.1-dimensional Hausdorff measure. The system of equations describing the behavior of the singular curve is not hyperbolic. Working within a class of analytic data, local solutions can be constructed using a version of the Cauchy-Kovalevskaya theorem. For this purpose, by a suitable change of variables we rewrite the evolution equations as a first order system of Briot-Bouquet type, to which a general existence-uniqueness theorem can then be applied. △ Less

Submitted 13 August, 2024; originally announced August 2024.

Comments: This paper contains 5 figures

arXiv:2408.06828 [pdf, other]

Photometric Inverse Rendering: Shading Cues Modeling and Surface Reflectance Regularization

Authors: Jingzhi Bao, Guanying Chen, Shuguang Cui

Abstract: This paper addresses the problem of inverse rendering from photometric images. Existing approaches for this problem suffer from the effects of self-shadows, inter-reflections, and lack of constraints on the surface reflectance, leading to inaccurate decomposition of reflectance and illumination due to the ill-posed nature of inverse rendering. In this work, we propose a new method for neural inver… ▽ More This paper addresses the problem of inverse rendering from photometric images. Existing approaches for this problem suffer from the effects of self-shadows, inter-reflections, and lack of constraints on the surface reflectance, leading to inaccurate decomposition of reflectance and illumination due to the ill-posed nature of inverse rendering. In this work, we propose a new method for neural inverse rendering. Our method jointly optimizes the light source position to account for the self-shadows in images, and computes indirect illumination using a differentiable rendering layer and an importance sampling strategy. To enhance surface reflectance decomposition, we introduce a new regularization by distilling DINO features to foster accurate and consistent material decomposition. Extensive experiments on synthetic and real datasets demonstrate that our method outperforms the state-of-the-art methods in reflectance decomposition. △ Less

Submitted 13 August, 2024; originally announced August 2024.

Comments: Project page: https://jzbao03.site/projects/PIR/

arXiv:2408.06677 [pdf, other]

Search for $η_c(2S)\toωω$ and $ωφ$ decays and measurements of $χ_{cJ}\toωω$ and $ωφ$ in $ψ(2S)$ radiative processes

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (643 additional authors not shown)

Abstract: Using $(2712\pm 14)$ $\times$ 10$^{6}$ $ψ(2S)$ events collected with the BESIII detector at the BEPCII collider, we search for the decays $η_{c}(2S)\toωω$ and $η_{c}(2S)\toωφ$ via the process $ψ(2S)\toγη_{c}(2S)$. Evidence of $η_{c}(2S)\toωω$ is found with a statistical significance of $3.2σ$. The branching fraction is measured to be… ▽ More Using $(2712\pm 14)$ $\times$ 10$^{6}$ $ψ(2S)$ events collected with the BESIII detector at the BEPCII collider, we search for the decays $η_{c}(2S)\toωω$ and $η_{c}(2S)\toωφ$ via the process $ψ(2S)\toγη_{c}(2S)$. Evidence of $η_{c}(2S)\toωω$ is found with a statistical significance of $3.2σ$. The branching fraction is measured to be $\mathcal{B}(η_{c}(2S)\toωω)=(5.65\pm3.77(\rm stat.)\pm5.32(\rm syst.))\times10^{-4}$. No statistically significant signal is observed for the decay $η_{c}(2S)\toωφ$. The upper limit of the branching fraction at the 90\% confidence level is determined to be $\mathcal{B}(ψ(2S)\toγη_{c}(2S),η_{c}(2S)\toωφ)<2.24\times 10^{-7}$. We also update the branching fractions of $χ_{cJ}\to ωω$ and $χ_{cJ}\toωφ$ decays via the $ψ(2S)\toγχ_{cJ}$ transition. The branching fractions are determined to be $\mathcal{B}(χ_{c0}\toωω)=(10.63\pm0.11\pm0.46)\times 10^{-4}$, $\mathcal{B}(χ_{c1}\toωω)=(6.39\pm0.07\pm0.29)\times 10^{-4}$, $\mathcal{B}(χ_{c2}\toωω)=(8.50\pm0.08\pm0.38)\times 10^{-4}$, $\mathcal{B}(χ_{c0}\toωφ)=(1.18\pm0.03\pm0.05)\times 10^{-4}$, $\mathcal{B}(χ_{c1}\toωφ)=(2.03\pm0.15\pm0.12)\times 10^{-5}$, and $\mathcal{B}(χ_{c2}\toωφ)=(9.37\pm1.07\pm0.59)\times 10^{-6}$, where the first uncertainties are statistical and the second are systematic. △ Less

Submitted 13 August, 2024; originally announced August 2024.

arXiv:2408.06545 [pdf]

Optimal Preprocessing for Joint Detection and Classification of Wireless Communication Signals in Congested Spectrum Using Computer Vision Methods

Authors: Xiwen Kang, Hua-mei Chen, Genshe Chen, Kuo-Chu Chang, Thomas M. Clemons

Abstract: The joint detection and classification of RF signals has been a critical problem in the field of wideband RF spectrum sensing. Recent advancements in deep learning models have revolutionized this field, remarkably through the application of state-of-the-art computer vision algorithms such as YOLO (You Only Look Once) and DETR (Detection Transformer) to the spectrogram images. This paper focuses on… ▽ More The joint detection and classification of RF signals has been a critical problem in the field of wideband RF spectrum sensing. Recent advancements in deep learning models have revolutionized this field, remarkably through the application of state-of-the-art computer vision algorithms such as YOLO (You Only Look Once) and DETR (Detection Transformer) to the spectrogram images. This paper focuses on optimizing the preprocessing stage to enhance the performance of these computer vision models. Specifically, we investigated the generation of training spectrograms via the classical Short-Time Fourier Transform (STFT) approach, examining four classical STFT parameters: FFT size, window type, window length, and overlapping ratio. Our study aims to maximize the mean average precision (mAP) scores of YOLOv10 models in detecting and classifying various digital modulation signals within a congested spectrum environment. Firstly, our results reveal that additional zero padding in FFT does not enhance detection and classification accuracy and introduces unnecessary computational cost. Secondly, our results indicated that there exists an optimal window size that balances the trade-offs between and the time and frequency resolution, with performance losses of approximately 10% and 30% if the window size is four or eight times off from the optimal. Thirdly, regarding the choice of window functions, the Hamming window yields optimal performance, with non-optimal windows resulting in up to a 10% accuracy loss. Finally, we found a 10% accuracy score performance gap between using 10% and 90% overlap. These findings highlight the potential for significant performance improvements through optimized spectrogram parameters when applying computer vision models to the problem of wideband RF spectrum sensing. △ Less

Submitted 12 August, 2024; originally announced August 2024.

arXiv:2408.06338 [pdf, other]

Closeby Habitable Exoplanet Survey (CHES). II. An Observation Strategy for the Target Stars

Authors: Dongjie Tan, Jianghui Ji, Chunhui Bao, Xiumin Huang, Guo Chen, Su Wang, Yao Dong, Haitao Li, Junbo Zhang, Liang Fang, Dong Li, Lei Deng, Jiacheng Liu, Zi Zhu

Abstract: The Closeby Habitable Exoplanet Survey (CHES) constitutes a mission intricately designed to systematically survey approximately 100 solar-type stars located within the immediate proximity of the solar system, specifically within a range of 10 parsecs. The core objective of this mission is the detection and characterization of potentially habitable Earth-like planets or super-Earths within the habi… ▽ More The Closeby Habitable Exoplanet Survey (CHES) constitutes a mission intricately designed to systematically survey approximately 100 solar-type stars located within the immediate proximity of the solar system, specifically within a range of 10 parsecs. The core objective of this mission is the detection and characterization of potentially habitable Earth-like planets or super-Earths within the habitable zone of these stars. The CHES mission obtains high-precision astrometric measurements of planets orbiting the target stars by observing angular distance variations between the target star and reference stars. As a result, we surveyed the relevant parameters of both target and reference stars in detail, conducting a thorough analysis and calculation of the required observation accuracy, the number of observations, and the priority assigned to each target star. Observational emphasis will be concentrated on targets considered of higher priority, ensuring the effectiveness of their observation capabilities. Through this approach, we formulate a five-year observation strategy that will cover all the target stars within a six-month timeframe. The strategy not only fulfills the required observing capability but also exhibit high efficiency simultaneously, providing an executable program for future mission. Over the span of the mission's five-year duration, a cumulative observation time of 29,220 hours will be available. Approximately 86 percent of this, totaling 25,120 hours, is allocated for the observation of target stars. This allocation leaves approximately 4,100 hours for extended scientific observation programs. We have also performed simulated observations based on this strategy and verified its observational capability for exoplanets. △ Less

Submitted 12 August, 2024; originally announced August 2024.

Comments: 20 pages, 12 figures, accepted for publication in AJ

arXiv:2408.05881 [pdf, other]

Simulating the dynamics of NV^- formation in diamond in the presence of carbon self-interstitials

Authors: Guangzhao Chen, Joseph C. A. Prentice, Jason M. Smith

Abstract: This study utilises linear-scaling density functional theory (DFT) and develops a new machine-learning potential for carbon and nitrogen (GAP-CN), based on the carbon potential (GAP20), to investigate the interaction between carbon self-interstitials and nitrogen-vacancy (NV) centers in diamond, focusing on their excited states and diffusion behaviour. From the simulated excited states, 'Bright',… ▽ More This study utilises linear-scaling density functional theory (DFT) and develops a new machine-learning potential for carbon and nitrogen (GAP-CN), based on the carbon potential (GAP20), to investigate the interaction between carbon self-interstitials and nitrogen-vacancy (NV) centers in diamond, focusing on their excited states and diffusion behaviour. From the simulated excited states, 'Bright', 'Spike', and 'Dark' defect configurations are classified based on their absorption spectrum features. Furthermore, machine learning molecular dynamics simulation provides insight into the possible diffusion mechanism of Ci and NV, showing that Ci can diffuse away or recombine with NV. The study yields new insight into the formation of NV defects in diamond for quantum technology applications. △ Less

Submitted 11 August, 2024; originally announced August 2024.

arXiv:2408.05788 [pdf, other]

Continual Learning of Nonlinear Independent Representations

Authors: Boyang Sun, Ignavier Ng, Guangyi Chen, Yifan Shen, Qirong Ho, Kun Zhang

Abstract: Identifying the causal relations between interested variables plays a pivotal role in representation learning as it provides deep insights into the dataset. Identifiability, as the central theme of this approach, normally hinges on leveraging data from multiple distributions (intervention, distribution shift, time series, etc.). Despite the exciting development in this field, a practical but often… ▽ More Identifying the causal relations between interested variables plays a pivotal role in representation learning as it provides deep insights into the dataset. Identifiability, as the central theme of this approach, normally hinges on leveraging data from multiple distributions (intervention, distribution shift, time series, etc.). Despite the exciting development in this field, a practical but often overlooked problem is: what if those distribution shifts happen sequentially? In contrast, any intelligence possesses the capacity to abstract and refine learned knowledge sequentially -- lifelong learning. In this paper, with a particular focus on the nonlinear independent component analysis (ICA) framework, we move one step forward toward the question of enabling models to learn meaningful (identifiable) representations in a sequential manner, termed continual causal representation learning. We theoretically demonstrate that model identifiability progresses from a subspace level to a component-wise level as the number of distributions increases. Empirically, we show that our method achieves performance comparable to nonlinear ICA methods trained jointly on multiple offline distributions and, surprisingly, the incoming new distribution does not necessarily benefit the identification of all latent variables. △ Less

Submitted 11 August, 2024; originally announced August 2024.

Comments: 9 pages, 5 Figures

arXiv:2408.05545 [pdf, other]

Multi-layer Sequence Labeling-based Joint Biomedical Event Extraction

Authors: Gongchi Chen, Pengchao Wu, Jinghang Gu, Longhua Qian, Guodong Zhou

Abstract: In recent years, biomedical event extraction has been dominated by complicated pipeline and joint methods, which need to be simplified. In addition, existing work has not effectively utilized trigger word information explicitly. Hence, we propose MLSL, a method based on multi-layer sequence labeling for joint biomedical event extraction. MLSL does not introduce prior knowledge and complex structur… ▽ More In recent years, biomedical event extraction has been dominated by complicated pipeline and joint methods, which need to be simplified. In addition, existing work has not effectively utilized trigger word information explicitly. Hence, we propose MLSL, a method based on multi-layer sequence labeling for joint biomedical event extraction. MLSL does not introduce prior knowledge and complex structures. Moreover, it explicitly incorporates the information of candidate trigger words into the sequence labeling to learn the interaction relationships between trigger words and argument roles. Based on this, MLSL can learn well with just a simple workflow. Extensive experimentation demonstrates the superiority of MLSL in terms of extraction performance compared to other state-of-the-art methods. △ Less

Submitted 14 August, 2024; v1 submitted 10 August, 2024; originally announced August 2024.

Comments: 13 pages, 3 figures, accepted by NLPCC2024

arXiv:2408.05080 [pdf, other]

Cusp-transitive 4-manifolds with every cusp section

Authors: Jacopo Guoyi Chen, Edoardo Rizzi

Abstract: We realize every closed flat 3-manifold as a cusp section of a complete, finite-volume hyperbolic 4-manifold whose symmetry group acts transitively on the set of cusps. Moreover, for every such 3-manifold, a dense subset of its flat metrics can be realized as cusp sections of a cusp-transitive 4-manifold. Finally, we prove that there are a lot of 4-manifolds with pairwise isometric cusps, for any… ▽ More We realize every closed flat 3-manifold as a cusp section of a complete, finite-volume hyperbolic 4-manifold whose symmetry group acts transitively on the set of cusps. Moreover, for every such 3-manifold, a dense subset of its flat metrics can be realized as cusp sections of a cusp-transitive 4-manifold. Finally, we prove that there are a lot of 4-manifolds with pairwise isometric cusps, for any given cusp type. △ Less

Submitted 13 August, 2024; v1 submitted 9 August, 2024; originally announced August 2024.

Comments: 18 pages, 7 figures. Added acknowledgments

MSC Class: 57M50

arXiv:2408.04422 [pdf, other]

Analysis of the dynamics of the decay $D^{+}\to K_{S}^{0} π^{0} e^{+}ν_{e}$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (644 additional authors not shown)

Abstract: The branching fraction of $D^+\to K_{S}^{0} π^{0}e^+ν_e$ is measured for the first time using $7.93~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at the center-of-mass energy $\sqrt{s}=3.773$~GeV with the BESIII detector operating at the BEPCII collider, and is determined to be ${\mathcal B}$($D^+\to K_S^0π^0e^+ν_e$) = $(0.881~\pm~0.017_{\rm stat.}~\pm~0.016_{\rm syst.})$\%. Based on a… ▽ More The branching fraction of $D^+\to K_{S}^{0} π^{0}e^+ν_e$ is measured for the first time using $7.93~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at the center-of-mass energy $\sqrt{s}=3.773$~GeV with the BESIII detector operating at the BEPCII collider, and is determined to be ${\mathcal B}$($D^+\to K_S^0π^0e^+ν_e$) = $(0.881~\pm~0.017_{\rm stat.}~\pm~0.016_{\rm syst.})$\%. Based on an analysis of the $D^+\to K_S^0π^0e^+ν_e$ decay dynamics, we observe the $S\text{-}{\rm wave}$ and $P$-wave components with fractions of $f_{S\text{-}{\rm wave}}$ = $(6.13~\pm~0.27_{\rm stat.}~\pm ~0.30_{\rm syst.})\%$ and $f_{\bar K^{*}(892)^0}$ = $(93.88~\pm~0.27_{\rm stat.}~\pm~0.29_{\rm syst.})$\%, respectively. From these results, we obtain the branching fractions ${\mathcal B}$($D^+\to (K_S^0π^0)_{S\text{-}{\rm wave}}~e^+ν_e$) = $(5.41~\pm~0.35_{\rm stat.}~\pm~0.37_{\rm syst.})\times10^{-4}$ and ${\mathcal B}$($D^+\to \bar K^{*}(892)^0e^+ν_e$) = $(4.97~\pm~0.11_{\rm stat.}~\pm~0.12_{\rm syst.})$\%. In addition, the hadronic form-factor ratios of $D^{+} \to \bar {K}^{*}(892)^0e^+ν_e$ at $q^2=0$, assuming a single-pole dominance parameterization, are determined to be $r_V=\frac{V(0)}{A_1(0)}= 1.43~\pm~0.07_{\rm stat.}~\pm~0.03_{\rm syst.}$ and $r_2=\frac{A_2(0)}{A_1(0)}=0.72~\pm~0.06_{\rm stat.}~\pm~0.02_{\rm syst.}$. △ Less

Submitted 8 August, 2024; originally announced August 2024.

arXiv:2408.04030 [pdf, ps, other]

Variability regions for the $n$-th derivative of bounded analytic functions

Authors: Gangqiang Chen

Abstract: Let $\mathcal{H}$ be the class of all analytic self-maps of the open unit disk $\mathbb{D}$. Denote by $H^n f(z)$ the $n$-th order hyperbolic derivative of $f\in \mathcal H$ at $z\in \mathbb{D}$. For $z_0\in \mathbb{D}$ and $γ= (γ_0, γ_1 , \ldots , γ_{n-1}) \in {\mathbb D}^{n}$, let ${\mathcal H} (γ) = \{f \in {\mathcal H} : f (z_0) = γ_0,H^1f (z_0) = γ_1,\ldots ,H^{n-1}f (z_0) = γ_{n-1} \}$. In t… ▽ More Let $\mathcal{H}$ be the class of all analytic self-maps of the open unit disk $\mathbb{D}$. Denote by $H^n f(z)$ the $n$-th order hyperbolic derivative of $f\in \mathcal H$ at $z\in \mathbb{D}$. For $z_0\in \mathbb{D}$ and $γ= (γ_0, γ_1 , \ldots , γ_{n-1}) \in {\mathbb D}^{n}$, let ${\mathcal H} (γ) = \{f \in {\mathcal H} : f (z_0) = γ_0,H^1f (z_0) = γ_1,\ldots ,H^{n-1}f (z_0) = γ_{n-1} \}$. In this paper, we determine the variability region $V(z_0, γ) = \{ f^{(n)}(z_0) : f \in {\mathcal H} (γ) \}$, which can be called ``the generalized Schwarz-Pick Lemma of $n$-th derivative". We then apply the generalized Schwarz-Pick Lemma to establish a $n$-th order Dieudonné's Lemma, which provides an explicit description of the variability region $\{h^{(n)}(z_0): h\in \mathcal{H}, h(0)=0,h(z_0) =w_0, h'(z_0)=w_1,\ldots, h^{(n-1)}(z_0)=w_{n-1}\}$ for given $z_0$, $w_0$, $w_1,\dots,w_{n-1}$. Moreover, we determine the form of all extremal functions. △ Less

Submitted 7 August, 2024; originally announced August 2024.

Comments: 10 pages

MSC Class: Primary 30C80; Secondary 30F45

arXiv:2408.03615 [pdf, other]

Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks

Authors: Zaijing Li, Yuquan Xie, Rui Shao, Gongwei Chen, Dongmei Jiang, Liqiang Nie

Abstract: Building a general-purpose agent is a long-standing vision in the field of artificial intelligence. Existing agents have made remarkable progress in many domains, yet they still struggle to complete long-horizon tasks in an open world. We attribute this to the lack of necessary world knowledge and multimodal experience that can guide agents through a variety of long-horizon tasks. In this paper, w… ▽ More Building a general-purpose agent is a long-standing vision in the field of artificial intelligence. Existing agents have made remarkable progress in many domains, yet they still struggle to complete long-horizon tasks in an open world. We attribute this to the lack of necessary world knowledge and multimodal experience that can guide agents through a variety of long-horizon tasks. In this paper, we propose a Hybrid Multimodal Memory module to address the above challenges. It 1) transforms knowledge into Hierarchical Directed Knowledge Graph that allows agents to explicitly represent and learn world knowledge, and 2) summarises historical information into Abstracted Multimodal Experience Pool that provide agents with rich references for in-context learning. On top of the Hybrid Multimodal Memory module, a multimodal agent, Optimus-1, is constructed with dedicated Knowledge-guided Planner and Experience-Driven Reflector, contributing to a better planning and reflection in the face of long-horizon tasks in Minecraft. Extensive experimental results show that Optimus-1 significantly outperforms all existing agents on challenging long-horizon task benchmarks, and exhibits near human-level performance on many tasks. In addition, we introduce various Multimodal Large Language Models (MLLMs) as the backbone of Optimus-1. Experimental results show that Optimus-1 exhibits strong generalization with the help of the Hybrid Multimodal Memory module, outperforming the GPT-4V baseline on many tasks. △ Less

Submitted 7 August, 2024; originally announced August 2024.

Comments: 30 pages, 13 figures

arXiv:2408.03531 [pdf, other]

Measurement of the Branching Fraction of \boldmath{$ψ(2S) \to γπ^0$}

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (644 additional authors not shown)

Abstract: Based on $(2712.4\pm14.1)\times10^{6}~ψ(2S)$ events, 7.9 fb$^{-1}$ $ψ(3773)$ data, and 0.8 fb$^{-1}$ off-resonance data samples collected with the BESIII detector, we measure the branching fraction of $ψ(2S)\rightarrowγπ^{0}$ and $e^{+}e^{-}\rightarrowγπ^{0}$ form factor at momentum transfers $Q^{2}\sim13$ GeV$^{2}$. The $e^{+}e^{-}\rightarrowγπ^{0}$ cross section is fitted with considering the in… ▽ More Based on $(2712.4\pm14.1)\times10^{6}~ψ(2S)$ events, 7.9 fb$^{-1}$ $ψ(3773)$ data, and 0.8 fb$^{-1}$ off-resonance data samples collected with the BESIII detector, we measure the branching fraction of $ψ(2S)\rightarrowγπ^{0}$ and $e^{+}e^{-}\rightarrowγπ^{0}$ form factor at momentum transfers $Q^{2}\sim13$ GeV$^{2}$. The $e^{+}e^{-}\rightarrowγπ^{0}$ cross section is fitted with considering the interference between the $ψ(2S)$ and continuum amplitudes and two solutions are found, ${\cal B}=3.74\times10^{-7}$ with $φ=3.93$ rad and ${\cal B}=7.87\times10^{-7}$ with $φ=2.08$ rad. Here, ${\cal B}$ is the branching fraction of $ψ(2S)\rightarrowγπ^{0}$ and $φ$ is the relative phase angle between the $ψ(2S)$ and continuum amplitudes. Due to insufficient off-resonance data, the branching fraction ${\cal B}(ψ(2S)\rightarrowγπ^{0})$ is determined to be in the range $[2.7, 9.7]\times10^{-7}$ within one standard deviation of the contour region. △ Less

Submitted 7 August, 2024; v1 submitted 7 August, 2024; originally announced August 2024.

arXiv:2408.03272 [pdf, other]

Suppression of Edge Localized Modes in ITER Baseline Scenario in EAST using Edge Localized Magnetic Perturbations

Authors: P. Xie, Y. Sun, M. Jia, A. Loarte, Y. Q. Liu, C. Ye, S. Gu, H. Sheng, Y. Liang, Q. Ma, H. Yang, C. A. Paz-Soldan, G. Deng, S. Fu, G. Chen, K. He, T. Jia, D. Lu, B. Lv, J. Qian, H. H. Wang, S. Wang, D. Weisberg, X. Wu, W. Xu , et al. (9 additional authors not shown)

Abstract: We report the suppression of Type-I Edge Localized Modes (ELMs) in the EAST tokamak under ITER baseline conditions using $n = 4$ Resonant Magnetic Perturbations (RMPs), while maintaining energy confinement. Achieving RMP-ELM suppression requires a normalized plasma beta ($β_N$) exceeding 1.8 in a target plasma with $q_{95}\approx 3.1$ and tungsten divertors. Quasi-linear modeling shows high plasma… ▽ More We report the suppression of Type-I Edge Localized Modes (ELMs) in the EAST tokamak under ITER baseline conditions using $n = 4$ Resonant Magnetic Perturbations (RMPs), while maintaining energy confinement. Achieving RMP-ELM suppression requires a normalized plasma beta ($β_N$) exceeding 1.8 in a target plasma with $q_{95}\approx 3.1$ and tungsten divertors. Quasi-linear modeling shows high plasma beta enhances RMP-driven neoclassical toroidal viscosity torque, reducing field penetration thresholds. These findings demonstrate the feasibility and efficiency of high $n$ RMPs for ELM suppression in ITER. △ Less

Submitted 6 August, 2024; originally announced August 2024.

Comments: 6 pages, 4 figures

arXiv:2408.03205 [pdf, other]

Measurement of $Σ^+$ transverse polarization in $e^+e^-$ collisions at $\sqrt{s} = 3.68-3.71$ GeV

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

Abstract: Using $e^+e^-$ collision data collected with the BESIII detector at seven energy points ranging from 3.68 to 3.71 GeV and corresponding to an integrated luminosity of $652.1~{\rm pb^{-1}}$, we present an energy-dependent measurement of the transverse polarization, relative phase and modulus ratio of the electromagnetic form factors of the $Σ^+$ hyperon in the $e^+e^- \to Σ^+ \barΣ^-$ reaction. The… ▽ More Using $e^+e^-$ collision data collected with the BESIII detector at seven energy points ranging from 3.68 to 3.71 GeV and corresponding to an integrated luminosity of $652.1~{\rm pb^{-1}}$, we present an energy-dependent measurement of the transverse polarization, relative phase and modulus ratio of the electromagnetic form factors of the $Σ^+$ hyperon in the $e^+e^- \to Σ^+ \barΣ^-$ reaction. These results are helpful to understand the production mechanism of the $Σ^+$-$\barΣ^-$ pairs. △ Less

Submitted 7 August, 2024; v1 submitted 6 August, 2024; originally announced August 2024.

Comments: 21 pages, 2 tables, 5 figures

arXiv:2408.03097 [pdf, other]

Prototype Learning for Micro-gesture Classification

Authors: Guoliang Chen, Fei Wang, Kun Li, Zhiliang Wu, Hehe Fan, Yi Yang, Meng Wang, Dan Guo

Abstract: In this paper, we briefly introduce the solution developed by our team, HFUT-VUT, for the track of Micro-gesture Classification in the MiGA challenge at IJCAI 2024. The task of micro-gesture classification task involves recognizing the category of a given video clip, which focuses on more fine-grained and subtle body movements compared to typical action recognition tasks. Given the inherent comple… ▽ More In this paper, we briefly introduce the solution developed by our team, HFUT-VUT, for the track of Micro-gesture Classification in the MiGA challenge at IJCAI 2024. The task of micro-gesture classification task involves recognizing the category of a given video clip, which focuses on more fine-grained and subtle body movements compared to typical action recognition tasks. Given the inherent complexity of micro-gesture recognition, which includes large intra-class variability and minimal inter-class differences, we utilize two innovative modules, i.e., the cross-modal fusion module and prototypical refinement module, to improve the discriminative ability of MG features, thereby improving the classification accuracy. Our solution achieved significant success, ranking 1st in the track of Micro-gesture Classification. We surpassed the performance of last year's leading team by a substantial margin, improving Top-1 accuracy by 6.13%. △ Less

Submitted 6 August, 2024; originally announced August 2024.

Comments: 1st Place in Micro-gesture Classification in MiGA at IJCAI-2024

arXiv:2408.03013 [pdf, other]

NeurDB: On the Design and Implementation of an AI-powered Autonomous Database

Authors: Zhanhao Zhao, Shaofeng Cai, Haotian Gao, Hexiang Pan, Siqi Xiang, Naili Xing, Gang Chen, Beng Chin Ooi, Yanyan Shen, Yuncheng Wu, Meihui Zhang

Abstract: Databases are increasingly embracing AI to provide autonomous system optimization and intelligent in-database analytics, aiming to relieve end-user burdens across various industry sectors. Nonetheless, most existing approaches fail to account for the dynamic nature of databases, which renders them ineffective for real-world applications characterized by evolving data and workloads. This paper intr… ▽ More Databases are increasingly embracing AI to provide autonomous system optimization and intelligent in-database analytics, aiming to relieve end-user burdens across various industry sectors. Nonetheless, most existing approaches fail to account for the dynamic nature of databases, which renders them ineffective for real-world applications characterized by evolving data and workloads. This paper introduces NeurDB, an AI-powered autonomous database that deepens the fusion of AI and databases with adaptability to data and workload drift. NeurDB establishes a new in-database AI ecosystem that seamlessly integrates AI workflows within the database. This integration enables efficient and effective in-database AI analytics and fast-adaptive learned system components. Empirical evaluations demonstrate that NeurDB substantially outperforms existing solutions in managing AI analytics tasks, with the proposed learned components more effectively handling environmental dynamism than state-of-the-art approaches. △ Less

Submitted 6 August, 2024; originally announced August 2024.

arXiv:2408.02940 [pdf, other]

Observation of $η_{c}(2S) \to K^{+}K^{-}η$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (639 additional authors not shown)

Abstract: By analyzing $(27.12 \pm 0.14)\times10^{8}$ $ψ(3686)$ events accumulated with the BESIII detector, the decay $η_{c}(2S) \to K^{+} K^{-} η$ is observed for the first time with a significance of $6.2σ$ after considering systematic uncertainties. The product of the branching fractions of $ψ(3686) \to γη_{c}(2S)$ and $η_{c}(2S) \to K^{+} K^{-} η$ is measured to be… ▽ More By analyzing $(27.12 \pm 0.14)\times10^{8}$ $ψ(3686)$ events accumulated with the BESIII detector, the decay $η_{c}(2S) \to K^{+} K^{-} η$ is observed for the first time with a significance of $6.2σ$ after considering systematic uncertainties. The product of the branching fractions of $ψ(3686) \to γη_{c}(2S)$ and $η_{c}(2S) \to K^{+} K^{-} η$ is measured to be $\mathcal{B}(ψ(3686) \toγη_{c}(2S))\times \mathcal{B}(η_{c}(2S)\to K^{+} K^{-}η)=(2.39 \pm 0.32 \pm 0.34) \times 10^{-6}$, where the first uncertainty is statistical, and the second one is systematic. The branching fraction of $η_{c}(2S)\to K^{+} K^{-}η$ is determined to be $\mathcal{B}(η_{c}(2S)\to K^{+} K^{-}η) = (3.42 \pm 0.46 \pm 0.48 \pm 2.44) \times 10^{-3}$, where the third uncertainty is due to the branching fraction of $ψ(3686) \to γη_{c}(2S)$. Using a recent BESIII measurement of $\mathcal{B} (η_{c}(2S) \to K^{+} K^{-}π^{0})$, we also determine the ratio between the branching fractions of $η_{c}(2S) \to K^{+} K^{-}η$ and $η_{c}(2S) \to K^{+} K^{-}π^{0}$ to be $1.49 \pm 0.22 \pm 0.25$, which is consistent with the previous result of BaBar at a comparable precision level. △ Less

Submitted 5 August, 2024; originally announced August 2024.

arXiv:2408.02501 [pdf, ps, other]

Fair Resource Allocation For Hierarchical Federated Edge Learning in Space-Air-Ground Integrated Networks via Deep Reinforcement Learning with Hybrid Control

Authors: Chong Huang, Gaojie Chen, Pei Xiao, Jonathon A. Chambers, Wei Huang

Abstract: The space-air-ground integrated network (SAGIN) has become a crucial research direction in future wireless communications due to its ubiquitous coverage, rapid and flexible deployment, and multi-layer cooperation capabilities. However, integrating hierarchical federated learning (HFL) with edge computing and SAGINs remains a complex open issue to be resolved. This paper proposes a novel framework… ▽ More The space-air-ground integrated network (SAGIN) has become a crucial research direction in future wireless communications due to its ubiquitous coverage, rapid and flexible deployment, and multi-layer cooperation capabilities. However, integrating hierarchical federated learning (HFL) with edge computing and SAGINs remains a complex open issue to be resolved. This paper proposes a novel framework for applying HFL in SAGINs, utilizing aerial platforms and low Earth orbit (LEO) satellites as edge servers and cloud servers, respectively, to provide multi-layer aggregation capabilities for HFL. The proposed system also considers the presence of inter-satellite links (ISLs), enabling satellites to exchange federated learning models with each other. Furthermore, we consider multiple different computational tasks that need to be completed within a limited satellite service time. To maximize the convergence performance of all tasks while ensuring fairness, we propose the use of the distributional soft-actor-critic (DSAC) algorithm to optimize resource allocation in the SAGIN and aggregation weights in HFL. Moreover, we address the efficiency issue of hybrid action spaces in deep reinforcement learning (DRL) through a decoupling and recoupling approach, and design a new dynamic adjusting reward function to ensure fairness among multiple tasks in federated learning. Simulation results demonstrate the superiority of our proposed algorithm, consistently outperforming baseline approaches and offering a promising solution for addressing highly complex optimization problems in SAGINs. △ Less

Submitted 5 August, 2024; originally announced August 2024.

Comments: Accepted for publication in IEEE Journal on Selected Areas in Communications

arXiv:2408.01980 [pdf, other]

Measurement Induced Magic Resources

Authors: Gongchu Li, Lei Chen, Si-Qi Zhang, Xu-Song Hong, Huaqing Xu, Yuancheng Liu, You Zhou, Geng Chen, Chuan-Feng Li, Alioscia Hamma, Guang-Can Guo

Abstract: Magic states and magic gates are crucial for achieving universal computation, but some important questions about how magic resources should be implemented to attain quantum advantage have remained unexplored, for instance, in the context of Measurement-based Quantum Computation (MQC) with only single-qubit measurements. This work bridges the gap between MQC and the resource theory of magic by intr… ▽ More Magic states and magic gates are crucial for achieving universal computation, but some important questions about how magic resources should be implemented to attain quantum advantage have remained unexplored, for instance, in the context of Measurement-based Quantum Computation (MQC) with only single-qubit measurements. This work bridges the gap between MQC and the resource theory of magic by introducing the concept of ``invested'' and ``potential" magic resources. The former quantifies the magic cost associated with the MQC framework, serving both as a witness of magic resources and an upper bound for the realization of a desired unitary transformation. Potential magic resources represent the maximum achievable magic resource in a given graph structure defining the MQC. We utilize these concepts to analyze the magic resource requirements of the Quantum Fourier Transform (QFT) and provide a fresh perspective on the universality of MQC of different resource states, highlighting the crucial role of non-Pauli measurements for injecting magic. We demonstrate experimentally our theoretical predictions in a high-fidelity four-photon setup and demonstrate the efficiency of MQC in generating magic states, surpassing the limitations of conventional magic state injection methods. Our findings pave the way for future research exploring magic resource optimization and novel distillation schemes within the MQC framework, contributing to the advancement of fault-tolerant universal quantum computation. △ Less

Submitted 29 August, 2024; v1 submitted 4 August, 2024; originally announced August 2024.

Comments: 25 pages, 11 figures

arXiv:2408.01933 [pdf, other]

DiReCT: Diagnostic Reasoning for Clinical Notes via Large Language Models

Authors: Bowen Wang, Jiuyang Chang, Yiming Qian, Guoxin Chen, Junhao Chen, Zhouqiang Jiang, Jiahao Zhang, Yuta Nakashima, Hajime Nagahara

Abstract: Large language models (LLMs) have recently showcased remarkable capabilities, spanning a wide range of tasks and applications, including those in the medical domain. Models like GPT-4 excel in medical question answering but may face challenges in the lack of interpretability when handling complex tasks in real clinical settings. We thus introduce the diagnostic reasoning dataset for clinical notes… ▽ More Large language models (LLMs) have recently showcased remarkable capabilities, spanning a wide range of tasks and applications, including those in the medical domain. Models like GPT-4 excel in medical question answering but may face challenges in the lack of interpretability when handling complex tasks in real clinical settings. We thus introduce the diagnostic reasoning dataset for clinical notes (DiReCT), aiming at evaluating the reasoning ability and interpretability of LLMs compared to human doctors. It contains 511 clinical notes, each meticulously annotated by physicians, detailing the diagnostic reasoning process from observations in a clinical note to the final diagnosis. Additionally, a diagnostic knowledge graph is provided to offer essential knowledge for reasoning, which may not be covered in the training data of existing LLMs. Evaluations of leading LLMs on DiReCT bring out a significant gap between their reasoning ability and that of human doctors, highlighting the critical need for models that can reason effectively in real-world clinical scenarios. △ Less

Submitted 6 August, 2024; v1 submitted 4 August, 2024; originally announced August 2024.

Comments: 9 pages,6 figures

arXiv:2408.01899 [pdf, other]

Convergence Analysis of Weighted-Median Opinion Dynamics with Prejudice

Authors: Ruichang Zhang, Zhixin Liu, Ge Chen, Wenjun Mei

Abstract: The Friedkin-Johnsen (FJ) model introduces prejudice into the opinion evolution and has been successfully validated in many practical scenarios; however, due to its weighted average mechanism, only one prejudiced agent can always guide all unprejudiced agents synchronizing to its prejudice under the connected influence network, which may not be in line with some social realities. To fundamentally… ▽ More The Friedkin-Johnsen (FJ) model introduces prejudice into the opinion evolution and has been successfully validated in many practical scenarios; however, due to its weighted average mechanism, only one prejudiced agent can always guide all unprejudiced agents synchronizing to its prejudice under the connected influence network, which may not be in line with some social realities. To fundamentally address the limitation of the weighted average mechanism, a weighted-median opinion dynamics has been recently proposed; however, its theoretical analysis is challenging due to its nonlinear nature. This paper studies the weighted-median opinion dynamics with prejudice, and obtains the convergence and convergence rate when all agents have prejudice, and a necessary and sufficient condition for asymptotic consensus when a portion of agents have prejudice. These results are the first time to analyze the discrete-time and synchronous opinion dynamics with the weighted median mechanism, and address the phenomenon of the FJ model that connectivity leads to consensus when a few agents with the same prejudice join in an unprejudiced group. △ Less

Submitted 3 August, 2024; originally announced August 2024.

arXiv:2408.01597 [pdf, other]

Search for $X(3872)\toπ^0π^0χ_{c1,2}$

Authors: BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, O. Afedulidis, X. C. Ai, R. Aliberti, A. Amoroso, Q. An, Y. Bai, O. Bakina, I. Balossino, Y. Ban, H. -R. Bao, V. Batozskaya, K. Begzsuren, N. Berger, M. Berlowski, M. Bertani, D. Bettoni, F. Bianchi, E. Bianco, A. Bortone, I. Boyko, R. A. Briere , et al. (638 additional authors not shown)

Abstract: Using 10.1 fb$^{-1}$ of $e^+e^-$ collision data collected by the BESIII detector with center-of-mass energies between 4.15 GeV and 4.30 GeV, we search for the decays $X(3872)\toπ^0π^0χ_{c1,2}$, where the $X(3872)$ is produced in $e^+e^-\toγX(3872)$. No evidence above $3σ$ is found for either decay. Upper limits at the $90\%$ C.L. on the branching fractions of $X(3872)\toπ^0π^0χ_{c1,2}$ normalized… ▽ More Using 10.1 fb$^{-1}$ of $e^+e^-$ collision data collected by the BESIII detector with center-of-mass energies between 4.15 GeV and 4.30 GeV, we search for the decays $X(3872)\toπ^0π^0χ_{c1,2}$, where the $X(3872)$ is produced in $e^+e^-\toγX(3872)$. No evidence above $3σ$ is found for either decay. Upper limits at the $90\%$ C.L. on the branching fractions of $X(3872)\toπ^0π^0χ_{c1,2}$ normalized to the branching fraction of $X(3872)\toπ^+π^-J/ψ$ are set to be $\mathcal{B}(X(3872)\toπ^0π^0χ_{c1})/\mathcal{B}(X(3872)\toπ^+π^-J/ψ) < 1.1$ and $\mathcal{B}(X(3872)\toπ^0π^0χ_{c2})/\mathcal{B}(X(3872)\toπ^+π^-J/ψ) < 0.5$, taking into account both statistical and systematic uncertainties. △ Less

Submitted 2 August, 2024; originally announced August 2024.

Comments: 12 pages, 4 figures, 6 tables

arXiv:2408.01323 [pdf, other]

FANNO: Augmenting High-Quality Instruction Data with Open-Sourced LLMs Only

Authors: He Zhu, Junyou Su, Tianle Lun, Yicheng Tao, Wenjia Zhang, Zipei Fan, Guanhua Chen

Abstract: Instruction fine-tuning stands as a crucial advancement in leveraging large language models (LLMs) for enhanced task performance. However, the annotation of instruction datasets has traditionally been expensive and laborious, often relying on manual annotations or costly API calls of proprietary LLMs. To address these challenges, we introduce FANNO, a fully autonomous, open-sourced framework that… ▽ More Instruction fine-tuning stands as a crucial advancement in leveraging large language models (LLMs) for enhanced task performance. However, the annotation of instruction datasets has traditionally been expensive and laborious, often relying on manual annotations or costly API calls of proprietary LLMs. To address these challenges, we introduce FANNO, a fully autonomous, open-sourced framework that revolutionizes the annotation process without the need for pre-existing annotated data. Utilizing a Mistral-7b-instruct model, FANNO efficiently produces diverse and high-quality datasets through a structured process involving document pre-screening, instruction generation, and response generation. Experiments on Open LLM Leaderboard and AlpacaEval benchmark show that the FANNO can generate high-quality data with diversity and complexity for free, comparable to human-annotated or cleaned datasets like Alpaca-GPT4-Cleaned. △ Less

Submitted 2 August, 2024; originally announced August 2024.

arXiv:2408.01102 [pdf, other]

doi 10.1145/3654777.3676390

LessonPlanner: Assisting Novice Teachers to Prepare Pedagogy-Driven Lesson Plans with Large Language Models

Authors: Haoxiang Fan, Guanzheng Chen, Xingbo Wang, Zhenhui Peng

Abstract: Preparing a lesson plan, e.g., a detailed road map with strategies and materials for instructing a 90-minute class, is beneficial yet challenging for novice teachers. Large language models (LLMs) can ease this process by generating adaptive content for lesson plans, which would otherwise require teachers to create from scratch or search existing resources. In this work, we first conduct a formativ… ▽ More Preparing a lesson plan, e.g., a detailed road map with strategies and materials for instructing a 90-minute class, is beneficial yet challenging for novice teachers. Large language models (LLMs) can ease this process by generating adaptive content for lesson plans, which would otherwise require teachers to create from scratch or search existing resources. In this work, we first conduct a formative study with six novice teachers to understand their needs for support of preparing lesson plans with LLMs. Then, we develop LessonPlanner that assists users to interactively construct lesson plans with adaptive LLM-generated content based on Gagne's nine events. Our within-subjects study (N=12) shows that compared to the baseline ChatGPT interface, LessonPlanner can significantly improve the quality of outcome lesson plans and ease users' workload in the preparation process. Our expert interviews (N=6) further demonstrate LessonPlanner's usefulness in suggesting effective teaching strategies and meaningful educational resources. We discuss concerns on and design considerations for supporting teaching activities with LLMs. △ Less

Submitted 2 August, 2024; originally announced August 2024.

Comments: 20 pages

Showing 1–50 of 3,584 results for author: Chen, G