-
From COCO to COCO-FP: A Deep Dive into Background False Positives for COCO Detectors
Authors:
Longfei Liu,
Wen Guo,
Shihua Huang,
Cheng Li,
Xi Shen
Abstract:
Reducing false positives is essential for enhancing object detector performance, as reflected in the mean Average Precision (mAP) metric. Although object detectors have achieved notable improvements and high mAP scores on the COCO dataset, analysis reveals limited progress in addressing false positives caused by non-target visual clutter-background objects not included in the annotated categories.…
▽ More
Reducing false positives is essential for enhancing object detector performance, as reflected in the mean Average Precision (mAP) metric. Although object detectors have achieved notable improvements and high mAP scores on the COCO dataset, analysis reveals limited progress in addressing false positives caused by non-target visual clutter-background objects not included in the annotated categories. This issue is particularly critical in real-world applications, such as fire and smoke detection, where minimizing false alarms is crucial. In this study, we introduce COCO-FP, a new evaluation dataset derived from the ImageNet-1K dataset, designed to address this issue. By extending the original COCO validation dataset, COCO-FP specifically assesses object detectors' performance in mitigating background false positives. Our evaluation of both standard and advanced object detectors shows a significant number of false positives in both closed-set and open-set scenarios. For example, the AP50 metric for YOLOv9-E decreases from 72.8 to 65.7 when shifting from COCO to COCO-FP. The dataset is available at https://github.com/COCO-FP/COCO-FP.
△ Less
Submitted 12 September, 2024;
originally announced September 2024.
-
Measurements of the $CP$-even fractions of $D^0\toπ^{+}π^{-}π^{0}$ and $D^0\to K^{+}K^{-}π^{0}$ at BESIII
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (648 additional authors not shown)
Abstract:
The $CP$-even fractions ($F_{+}$) of the decays $D^0\toπ^{+}π^{-}π^{0}$ and $D^0\to K^{+}K^{-}π^{0}$ are measured with a quantum-correlated $ψ(3770)\to D\bar{D}$ data sample collected by the BESIII experiment corresponding to an integrated luminosity of 7.93 $\mathrm{fb}^{-1}$. The results are $F_{+}^{π^{+}π^{-}π^{0}}=0.9406\pm0.0036\pm0.0021$ and $F_{+}^{K^{+}K^{-}π^{0}}=0.631\pm0.014\pm0.011$, w…
▽ More
The $CP$-even fractions ($F_{+}$) of the decays $D^0\toπ^{+}π^{-}π^{0}$ and $D^0\to K^{+}K^{-}π^{0}$ are measured with a quantum-correlated $ψ(3770)\to D\bar{D}$ data sample collected by the BESIII experiment corresponding to an integrated luminosity of 7.93 $\mathrm{fb}^{-1}$. The results are $F_{+}^{π^{+}π^{-}π^{0}}=0.9406\pm0.0036\pm0.0021$ and $F_{+}^{K^{+}K^{-}π^{0}}=0.631\pm0.014\pm0.011$, where the first uncertainties are statistical and the second systematic. These measurements are consistent with the previous determinations, and the uncertainties for $F_{+}^{π^{+}π^{-}π^{0}}$ and $F_{+}^{K^{+}K^{-}π^{0}}$ are reduced by factors of 3.9 and 2.6, respectively. The reported results provide important inputs for the precise measurement of the angle $γ$ of the Cabibbo-Kobayashi-Maskawa matrix and indirect $CP$ violation in charm mixing.
△ Less
Submitted 11 September, 2024;
originally announced September 2024.
-
JCMT 850 $\micron$ continuum observations of density structures in the G35 molecular complex
Authors:
Xianjin Shen,
Hong-Li Liu,
Zhiyuan Ren,
Anandmayee Tej,
Di Li,
Hauyu Baobab Liu,
Gary A. Fuller,
Jinjin Xie,
Sihan Jiao,
Aiyuan Yang,
Patrick M. Koch,
Fengwei Xu,
Patricio Sanhueza,
Pham N. Diep,
Nicolas Peretto,
Ram K. Yadav,
Busaba H. Kramer,
Koichiro Sugiyama,
Mark Rawlings,
Chang Won Lee,
Ken'ichi Tatematsu,
Daniel Harsono,
David Eden,
Woojin Kwon,
Chao-Wei Tsai
, et al. (10 additional authors not shown)
Abstract:
Filaments are believed to play a key role in high-mass star formation. We present a systematic study of the filaments and their hosting clumps in the G35 molecular complex using JCMT SCUBA-2 850 $\micron$ continuum data. We identified five clouds in the complex and 91 filaments within them, some of which form 10 hub-filament systems (HFSs), each with at least 3 hub-composing filaments. We also com…
▽ More
Filaments are believed to play a key role in high-mass star formation. We present a systematic study of the filaments and their hosting clumps in the G35 molecular complex using JCMT SCUBA-2 850 $\micron$ continuum data. We identified five clouds in the complex and 91 filaments within them, some of which form 10 hub-filament systems (HFSs), each with at least 3 hub-composing filaments. We also compiled a catalogue of 350 dense clumps, 183 of which are associated with the filaments. We investigated the physical properties of the filaments and clumps, such as mass, density, and size, and their relation to star formation. We find that the global mass-length trend of the filaments is consistent with a turbulent origin, while the hub-composing filaments of high line masses ($m_{\rm l}\,>$\,230\,$\mathrm{M_{\odot}~pc^{-1}}$) in HFSs deviate from this relation, possibly due to feedback from massive star formation. We also find that the most massive and densest clumps (R\,$>$\,0.2\,pc, M\,$>35\,\mathrm{M_{\odot}}$, $\mathrmΣ>\,0.05\,\mathrm{g~cm^{-2}}$) are located in the filaments and in the hubs of HFS with the latter bearing a higher probability of occurrence of high-mass star-forming signatures, highlighting the preferential sites of HFSs for high-mass star formation. We do not find significant variation in the clump mass surface density across different evolutionary environments of the clouds, which may reflect the balance between mass accretion and stellar feedback.
△ Less
Submitted 9 September, 2024;
originally announced September 2024.
-
Transferable Selective Virtual Sensing Active Noise Control Technique Based on Metric Learning
Authors:
Boxiang Wang,
Dongyuan Shi,
Zhengding Luo,
Xiaoyi Shen,
Junwei Ji,
Woon-Seng Gan
Abstract:
Virtual sensing (VS) technology enables active noise control (ANC) systems to attenuate noise at virtual locations distant from the physical error microphones. Appropriate auxiliary filters (AF) can significantly enhance the effectiveness of VS approaches. The selection of appropriate AF for various types of noise can be automatically achieved using convolutional neural networks (CNNs). However, t…
▽ More
Virtual sensing (VS) technology enables active noise control (ANC) systems to attenuate noise at virtual locations distant from the physical error microphones. Appropriate auxiliary filters (AF) can significantly enhance the effectiveness of VS approaches. The selection of appropriate AF for various types of noise can be automatically achieved using convolutional neural networks (CNNs). However, training the CNN model for different ANC systems is often labour-intensive and time-consuming. To tackle this problem, we propose a novel method, Transferable Selective VS, by integrating metric-learning technology into CNN-based VS approaches. The Transferable Selective VS method allows a pre-trained CNN to be applied directly to new ANC systems without requiring retraining, and it can handle unseen noise types. Numerical simulations demonstrate the effectiveness of the proposed method in attenuating sudden-varying broadband noises and real-world noises.
△ Less
Submitted 9 September, 2024;
originally announced September 2024.
-
Study of the decay $D^0\rightarrow ρ(770)^-e^+ν_e$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (646 additional authors not shown)
Abstract:
We present a study of the semileptonic decay $D^0\rightarrow π^-π^0e^{+}ν_{e}$ using an $e^+e^-$ annihilation data sample of $7.93~\mathrm{fb}^{-1}$ collected at the center-of-mass energy of 3.773 GeV with the BESIII detector. The branching fraction of $D^0\to ρ(770)^-e^+ν_e$ is measured to be $(1.439 \pm 0.033(\rm stat.) \pm 0.027(\rm syst.)) \times10^{-3}$, which is a factor 1.6 more precise tha…
▽ More
We present a study of the semileptonic decay $D^0\rightarrow π^-π^0e^{+}ν_{e}$ using an $e^+e^-$ annihilation data sample of $7.93~\mathrm{fb}^{-1}$ collected at the center-of-mass energy of 3.773 GeV with the BESIII detector. The branching fraction of $D^0\to ρ(770)^-e^+ν_e$ is measured to be $(1.439 \pm 0.033(\rm stat.) \pm 0.027(\rm syst.)) \times10^{-3}$, which is a factor 1.6 more precise than previous measurements. By performing an amplitude analysis, we measure the hadronic form-factor ratios of $D^0\to ρ(770)^-e^+ν_e$ at $q^2=0$ assuming the single-pole-dominance parametrization: $r_{V}=V(0)/A_1(0)=1.548\pm0.079(\rm stat.)\pm0.041(\rm syst.)$ and $r_{2}=A_2(0)/A_1(0)=0.823\pm0.056(\rm stat.)\pm0.026(\rm syst.)$.
△ Less
Submitted 6 September, 2024;
originally announced September 2024.
-
From Calculation to Adjudication: Examining LLM judges on Mathematical Reasoning Tasks
Authors:
Andreas Stephan,
Dawei Zhu,
Matthias Aßenmacher,
Xiaoyu Shen,
Benjamin Roth
Abstract:
To reduce the need for human annotations, large language models (LLMs) have been proposed as judges of the quality of other candidate models. LLM judges are typically evaluated by measuring the correlation with human judgments on generation tasks such as summarization or machine translation. In contrast, we study LLM judges on mathematical reasoning tasks. These tasks require multi-step reasoning,…
▽ More
To reduce the need for human annotations, large language models (LLMs) have been proposed as judges of the quality of other candidate models. LLM judges are typically evaluated by measuring the correlation with human judgments on generation tasks such as summarization or machine translation. In contrast, we study LLM judges on mathematical reasoning tasks. These tasks require multi-step reasoning, and the correctness of their solutions is verifiable, enabling a more objective evaluation. We perform a detailed performance analysis and find that the used judges are mostly unable to improve task performance but are able to pick the better model. Our analysis uncovers a strong correlation between judgment performance and the candidate model task performance. We observe that judges tend to choose the model of higher quality even if its answer is incorrect. Further, we show that it is possible to use statistics, such as the task performances of the individual models, to predict judgment performance. In an ablation, we either swap or mask the candidate answers and observe that judges often keep the original judgment, providing evidence that judges incorporate writing style in their judgments. In summary, we find that regularities in the judgments are quantifiable using statistical measures and provide various angles on exploiting them.
△ Less
Submitted 6 September, 2024;
originally announced September 2024.
-
EigenSR: Eigenimage-Bridged Pre-Trained RGB Learners for Single Hyperspectral Image Super-Resolution
Authors:
Xi Su,
Xiangfei Shen,
Mingyang Wan,
Jing Nie,
Lihui Chen,
Haijun Liu,
Xichuan Zhou
Abstract:
Single hyperspectral image super-resolution (single-HSI-SR) aims to improve the resolution of a single input low-resolution HSI. Due to the bottleneck of data scarcity, the development of single-HSI-SR lags far behind that of RGB natural images. In recent years, research on RGB SR has shown that models pre-trained on large-scale benchmark datasets can greatly improve performance on unseen data, wh…
▽ More
Single hyperspectral image super-resolution (single-HSI-SR) aims to improve the resolution of a single input low-resolution HSI. Due to the bottleneck of data scarcity, the development of single-HSI-SR lags far behind that of RGB natural images. In recent years, research on RGB SR has shown that models pre-trained on large-scale benchmark datasets can greatly improve performance on unseen data, which may stand as a remedy for HSI. But how can we transfer the pre-trained RGB model to HSI, to overcome the data-scarcity bottleneck? Because of the significant difference in the channels between the pre-trained RGB model and the HSI, the model cannot focus on the correlation along the spectral dimension, thus limiting its ability to utilize on HSI. Inspired by the HSI spatial-spectral decoupling, we propose a new framework that first fine-tunes the pre-trained model with the spatial components (known as eigenimages), and then infers on unseen HSI using an iterative spectral regularization (ISR) to maintain the spectral correlation. The advantages of our method lie in: 1) we effectively inject the spatial texture processing capabilities of the pre-trained RGB model into HSI while keeping spectral fidelity, 2) learning in the spectral-decorrelated domain can improve the generalizability to spectral-agnostic data, and 3) our inference in the eigenimage domain naturally exploits the spectral low-rank property of HSI, thereby reducing the complexity. This work bridges the gap between pre-trained RGB models and HSI via eigenimages, addressing the issue of limited HSI training data, hence the name EigenSR. Extensive experiments show that EigenSR outperforms the state-of-the-art (SOTA) methods in both spatial and spectral metrics. Our code will be released.
△ Less
Submitted 6 September, 2024;
originally announced September 2024.
-
How DREAMS are made: Emulating Satellite Galaxy and Subhalo Populations with Diffusion Models and Point Clouds
Authors:
Tri Nguyen,
Francisco Villaescusa-Navarro,
Siddharth Mishra-Sharma,
Carolina Cuesta-Lazaro,
Paul Torrey,
Arya Farahi,
Alex M. Garcia,
Jonah C. Rose,
Stephanie O'Neil,
Mark Vogelsberger,
Xuejian Shen,
Cian Roche,
Daniel Anglés-Alcázar,
Nitya Kallivayalil,
Julian B. Muñoz,
Francis-Yan Cyr-Racine,
Sandip Roy,
Lina Necib,
Kassidy E. Kollmann
Abstract:
The connection between galaxies and their host dark matter (DM) halos is critical to our understanding of cosmology, galaxy formation, and DM physics. To maximize the return of upcoming cosmological surveys, we need an accurate way to model this complex relationship. Many techniques have been developed to model this connection, from Halo Occupation Distribution (HOD) to empirical and semi-analytic…
▽ More
The connection between galaxies and their host dark matter (DM) halos is critical to our understanding of cosmology, galaxy formation, and DM physics. To maximize the return of upcoming cosmological surveys, we need an accurate way to model this complex relationship. Many techniques have been developed to model this connection, from Halo Occupation Distribution (HOD) to empirical and semi-analytic models to hydrodynamic. Hydrodynamic simulations can incorporate more detailed astrophysical processes but are computationally expensive; HODs, on the other hand, are computationally cheap but have limited accuracy. In this work, we present NeHOD, a generative framework based on variational diffusion model and Transformer, for painting galaxies/subhalos on top of DM with an accuracy of hydrodynamic simulations but at a computational cost similar to HOD. By modeling galaxies/subhalos as point clouds, instead of binning or voxelization, we can resolve small spatial scales down to the resolution of the simulations. For each halo, NeHOD predicts the positions, velocities, masses, and concentrations of its central and satellite galaxies. We train NeHOD on the TNG-Warm DM suite of the DREAMS project, which consists of 1024 high-resolution zoom-in hydrodynamic simulations of Milky Way-mass halos with varying warm DM mass and astrophysical parameters. We show that our model captures the complex relationships between subhalo properties as a function of the simulation parameters, including the mass functions, stellar-halo mass relations, concentration-mass relations, and spatial clustering. Our method can be used for a large variety of downstream applications, from galaxy clustering to strong lensing studies.
△ Less
Submitted 4 September, 2024;
originally announced September 2024.
-
Searching for the massless dark photon in $c\to uγ'$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (648 additional authors not shown)
Abstract:
In the effective field theory, the massless dark photon $γ'$ can only couple with the Standard Model particle through operators of dimension higher than four, thereby offering a high sensitivity to the new physics energy scale. Using $7.9~\rm{fb^{-1}}$ of $e^+e^-$ collision data collected at $\sqrt{s}=3.773$ GeV with the BESIII detector at the BEPCII collider, we measure the effective flavor-chang…
▽ More
In the effective field theory, the massless dark photon $γ'$ can only couple with the Standard Model particle through operators of dimension higher than four, thereby offering a high sensitivity to the new physics energy scale. Using $7.9~\rm{fb^{-1}}$ of $e^+e^-$ collision data collected at $\sqrt{s}=3.773$ GeV with the BESIII detector at the BEPCII collider, we measure the effective flavor-changing neutral current coupling of $cuγ'$ in $D^0\toωγ'$ and $D^0\toγγ'$ processes to search for the massless dark photon. No significant signals are observed, and the upper limits at the 90% confidence level on the massless dark photon branching fraction are set to be $1.1\times10^{-5}$ and $2.0\times10^{-6}$ for $D^0\toωγ'$ and $D^0\toγγ'$, respectively. These results provide the most stringent constraint on the new physics energy scale associated with $cuγ'$ coupling in the world, with the new physics energy scale related parameter $|\mathbb{C}|^2+|\mathbb{C}_5|^2<8.2\times10^{-17}~\rm{GeV}^{-2}$ at the 90% confidence level, playing a unique role in the dark sector search with the charm sector.
△ Less
Submitted 4 September, 2024;
originally announced September 2024.
-
Interpreting and Improving Large Language Models in Arithmetic Calculation
Authors:
Wei Zhang,
Chaoqun Wan,
Yonggang Zhang,
Yiu-ming Cheung,
Xinmei Tian,
Xu Shen,
Jieping Ye
Abstract:
Large language models (LLMs) have demonstrated remarkable potential across numerous applications and have shown an emergent ability to tackle complex reasoning tasks, such as mathematical computations. However, even for the simplest arithmetic calculations, the intrinsic mechanisms behind LLMs remain mysterious, making it challenging to ensure reliability. In this work, we delve into uncovering a…
▽ More
Large language models (LLMs) have demonstrated remarkable potential across numerous applications and have shown an emergent ability to tackle complex reasoning tasks, such as mathematical computations. However, even for the simplest arithmetic calculations, the intrinsic mechanisms behind LLMs remain mysterious, making it challenging to ensure reliability. In this work, we delve into uncovering a specific mechanism by which LLMs execute calculations. Through comprehensive experiments, we find that LLMs frequently involve a small fraction (< 5%) of attention heads, which play a pivotal role in focusing on operands and operators during calculation processes. Subsequently, the information from these operands is processed through multi-layer perceptrons (MLPs), progressively leading to the final solution. These pivotal heads/MLPs, though identified on a specific dataset, exhibit transferability across different datasets and even distinct tasks. This insight prompted us to investigate the potential benefits of selectively fine-tuning these essential heads/MLPs to boost the LLMs' computational performance. We empirically find that such precise tuning can yield notable enhancements on mathematical prowess, without compromising the performance on non-mathematical tasks. Our work serves as a preliminary exploration into the arithmetic calculation abilities inherent in LLMs, laying a solid foundation to reveal more intricate mathematical tasks.
△ Less
Submitted 3 September, 2024;
originally announced September 2024.
-
From Yes-Men to Truth-Tellers: Addressing Sycophancy in Large Language Models with Pinpoint Tuning
Authors:
Wei Chen,
Zhen Huang,
Liang Xie,
Binbin Lin,
Houqiang Li,
Le Lu,
Xinmei Tian,
Deng Cai,
Yonggang Zhang,
Wenxiao Wan,
Xu Shen,
Jieping Ye
Abstract:
Large Language Models (LLMs) tend to prioritize adherence to user prompts over providing veracious responses, leading to the sycophancy issue. When challenged by users, LLMs tend to admit mistakes and provide inaccurate responses even if they initially provided the correct answer. Recent works propose to employ supervised fine-tuning (SFT) to mitigate the sycophancy issue, while it typically leads…
▽ More
Large Language Models (LLMs) tend to prioritize adherence to user prompts over providing veracious responses, leading to the sycophancy issue. When challenged by users, LLMs tend to admit mistakes and provide inaccurate responses even if they initially provided the correct answer. Recent works propose to employ supervised fine-tuning (SFT) to mitigate the sycophancy issue, while it typically leads to the degeneration of LLMs' general capability. To address the challenge, we propose a novel supervised pinpoint tuning (SPT), where the region-of-interest modules are tuned for a given objective. Specifically, SPT first reveals and verifies a small percentage (<5%) of the basic modules, which significantly affect a particular behavior of LLMs. i.e., sycophancy. Subsequently, SPT merely fine-tunes these identified modules while freezing the rest. To verify the effectiveness of the proposed SPT, we conduct comprehensive experiments, demonstrating that SPT significantly mitigates the sycophancy issue of LLMs (even better than SFT). Moreover, SPT introduces limited or even no side effects on the general capability of LLMs. Our results shed light on how to precisely, effectively, and efficiently explain and improve the targeted ability of LLMs.
△ Less
Submitted 3 September, 2024;
originally announced September 2024.
-
Study of $D^{+} \to K_{S}^{0}K^{*}(892)^{+}$ in $D^{+} \to K_{S}^{0} K_{S}^{0} π^{+}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (638 additional authors not shown)
Abstract:
Using a data sample of $e^+e^-$ collisions corresponding to an integrated luminosity of 7.93 $\rm fb^{-1}$ collected with the BESIII detector at the center-of-mass energy 3.773~GeV, we perform the first amplitude analysis of the decay $D^{+} \to K_{S}^{0} K_{S}^{0} π^{+}$. The absolute branching fraction of $D^{+} \to K_{S}^{0}K_{S}^{0} π^{+}$ is measured to be…
▽ More
Using a data sample of $e^+e^-$ collisions corresponding to an integrated luminosity of 7.93 $\rm fb^{-1}$ collected with the BESIII detector at the center-of-mass energy 3.773~GeV, we perform the first amplitude analysis of the decay $D^{+} \to K_{S}^{0} K_{S}^{0} π^{+}$. The absolute branching fraction of $D^{+} \to K_{S}^{0}K_{S}^{0} π^{+}$ is measured to be $(2.97 \pm 0.09_{\rm stat.} \pm 0.05_{\rm syst.})\times10^{-3}$. The dominant intermediate process is $D^{+} \to K_{S}^{0}K^{*}(892)^{+}$, whose branching fraction is determined to be $(8.72 \pm 0.28_{\rm stat.} \pm 0.15_{\rm syst.}) \times 10^{-3}$, including all the $K^*(892)^+$ decays.
△ Less
Submitted 2 September, 2024;
originally announced September 2024.
-
MoManifold: Learning to Measure 3D Human Motion via Decoupled Joint Acceleration Manifolds
Authors:
Ziqiang Dang,
Tianxing Fan,
Boming Zhao,
Xujie Shen,
Lei Wang,
Guofeng Zhang,
Zhaopeng Cui
Abstract:
Incorporating temporal information effectively is important for accurate 3D human motion estimation and generation which have wide applications from human-computer interaction to AR/VR. In this paper, we present MoManifold, a novel human motion prior, which models plausible human motion in continuous high-dimensional motion space. Different from existing mathematical or VAE-based methods, our repr…
▽ More
Incorporating temporal information effectively is important for accurate 3D human motion estimation and generation which have wide applications from human-computer interaction to AR/VR. In this paper, we present MoManifold, a novel human motion prior, which models plausible human motion in continuous high-dimensional motion space. Different from existing mathematical or VAE-based methods, our representation is designed based on the neural distance field, which makes human dynamics explicitly quantified to a score and thus can measure human motion plausibility. Specifically, we propose novel decoupled joint acceleration manifolds to model human dynamics from existing limited motion data. Moreover, we introduce a novel optimization method using the manifold distance as guidance, which facilitates a variety of motion-related tasks. Extensive experiments demonstrate that MoManifold outperforms existing SOTAs as a prior in several downstream tasks such as denoising real-world human mocap data, recovering human motion from partial 3D observations, mitigating jitters for SMPL-based pose estimators, and refining the results of motion in-betweening.
△ Less
Submitted 1 September, 2024;
originally announced September 2024.
-
Measurement of Born cross sections of $e^+e^-\toΞ^0\barΞ^0$ and search for charmonium(-like) states at $\sqrt{s}$ = 3.51-4.95 GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (648 additional authors not shown)
Abstract:
Using $e^+e^-$ collision data collected by the BESIII detector at BEPCII corresponding to an integrated luminosity of 30 $\rm fb^{-1}$, we measure Born cross sections and effective form factors for the process $e^+e^-\toΞ^0\barΞ^0$ at forty-five center-of-mass energies between 3.51 and 4.95 GeV. The dressed cross section is fitted, assuming a power-law function plus a charmonium(-like) state, i.e.…
▽ More
Using $e^+e^-$ collision data collected by the BESIII detector at BEPCII corresponding to an integrated luminosity of 30 $\rm fb^{-1}$, we measure Born cross sections and effective form factors for the process $e^+e^-\toΞ^0\barΞ^0$ at forty-five center-of-mass energies between 3.51 and 4.95 GeV. The dressed cross section is fitted, assuming a power-law function plus a charmonium(-like) state, i.e., $ψ(3770)$, $ψ(4040)$, $ψ(4160)$, $ψ(4230)$, $ψ(4360)$, $ψ(4415)$ or $ψ(4660)$. No significant charmonium(-like) state decaying into $Ξ^0\barΞ^0$ is observed. Upper limits at the 90% confidence level on the product of the branching fraction and the electronic partial width are provided for each decay. In addition, ratios of the Born cross sections and the effective form factors for $e^+e^-\toΞ^0\barΞ^0$ and $e^+e^-\toΞ^-\barΞ^+$ are also presented to test isospin symmetry and the vector meson dominance model.
△ Less
Submitted 31 August, 2024;
originally announced September 2024.
-
User-centric Service Provision for Edge-assisted Mobile AR: A Digital Twin-based Approach
Authors:
Conghao Zhou,
Jie Gao,
Yixiang Liu,
Shisheng Hu,
Nan Cheng,
Xuemin Shen
Abstract:
Future 6G networks are envisioned to support mobile augmented reality (MAR) applications and provide customized immersive experiences for users via advanced service provision. In this paper, we investigate user-centric service provision for edge-assisted MAR to support the timely camera frame uploading of an MAR device by optimizing the spectrum resource reservation. To address the challenge of no…
▽ More
Future 6G networks are envisioned to support mobile augmented reality (MAR) applications and provide customized immersive experiences for users via advanced service provision. In this paper, we investigate user-centric service provision for edge-assisted MAR to support the timely camera frame uploading of an MAR device by optimizing the spectrum resource reservation. To address the challenge of non-stationary data traffic due to uncertain user movement and the complex camera frame uploading mechanism, we develop a digital twin (DT)-based data-driven approach to user-centric service provision. Specifically, we first establish a hierarchical data model with well-defined data attributes to characterize the impact of the camera frame uploading mechanism on the user-specific data traffic. We then design an easy-to-use algorithm to adapt the data attributes used in traffic modeling to the non-stationary data traffic. We also derive a closed-form service provision solution tailored to data-driven traffic modeling with the consideration of potential modeling inaccuracies. Trace-driven simulation results demonstrate that our DT-based approach for user-centric service provision outperforms conventional approaches in terms of adaptivity and robustness.
△ Less
Submitted 30 August, 2024;
originally announced September 2024.
-
An Empirical Study on Context Length for Open-Domain Dialog Generation
Authors:
Xinyi Shen,
Zuoquan Lin
Abstract:
Transformer-based open-domain dialog models have become increasingly popular in recent years. These models typically represent context as a concatenation of a dialog history. However, there is no criterion to decide how many utterances should be kept adequate in a context. We try to figure out how the choice of context length affects the model. We experiment on three questions from coarse to fine:…
▽ More
Transformer-based open-domain dialog models have become increasingly popular in recent years. These models typically represent context as a concatenation of a dialog history. However, there is no criterion to decide how many utterances should be kept adequate in a context. We try to figure out how the choice of context length affects the model. We experiment on three questions from coarse to fine: (i) Does longer context help model training? (ii) Is it necessary to change the training context length when dealing with dialogs of different context lengths? (iii) Do different dialog samples have the same preference for context length? Our experimental results show that context length, an often overlooked setting, deserves attention when implementing Transformer-based dialog models.
△ Less
Submitted 30 August, 2024;
originally announced September 2024.
-
Search for $h_c \to π^+π^-J/ψ$ via $ψ(3686)\to π^0h_c$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere,
A. Brueggemann
, et al. (653 additional authors not shown)
Abstract:
Using $(2712.4 \pm 14.3) \times 10^6~ψ$(3686) events collected with the BESIII detector operating at the BEPCII collider, we search for the hadronic transition $h_c \to π^+π^-J/ψ$ via $ψ(3686)\to π^0 h_c$. No significant signal is observed. We set the most stringent upper limits to date on the branching fractions $\mathcal{B}(ψ(3686)\to π^0 h_c)\times\mathcal{B}(h_c\toπ^+π^-J/ψ)$ and…
▽ More
Using $(2712.4 \pm 14.3) \times 10^6~ψ$(3686) events collected with the BESIII detector operating at the BEPCII collider, we search for the hadronic transition $h_c \to π^+π^-J/ψ$ via $ψ(3686)\to π^0 h_c$. No significant signal is observed. We set the most stringent upper limits to date on the branching fractions $\mathcal{B}(ψ(3686)\to π^0 h_c)\times\mathcal{B}(h_c\toπ^+π^-J/ψ)$ and $\mathcal{B}(h_c \to π^+π^-J/ψ)$ at the 90$\%$ confidence level, which are determined to be $6.7\times 10^{-7}$ and $9.4 \times10^{-4}$, respectively.
△ Less
Submitted 30 August, 2024;
originally announced August 2024.
-
Measurement of the Decay $Ξ^{0}\toΛγ$ with Entangled $Ξ^{0}\barΞ^{0}$ Pairs
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (638 additional authors not shown)
Abstract:
In this Letter, a systematic study of the weak radiative hyperon decay $Ξ^{0}\toΛγ$ at an electron-positron collider using entangled $Ξ^{0}\barΞ^{0}$ pair events is presented. The absolute branching fraction for this decay has been measured for the first time, and is $\left(1.347 \pm 0.066_{\mathrm stat.}\pm0.054_{\mathrm syst.}\right)\times 10^{-3}$. The decay asymmetry parameter, which character…
▽ More
In this Letter, a systematic study of the weak radiative hyperon decay $Ξ^{0}\toΛγ$ at an electron-positron collider using entangled $Ξ^{0}\barΞ^{0}$ pair events is presented. The absolute branching fraction for this decay has been measured for the first time, and is $\left(1.347 \pm 0.066_{\mathrm stat.}\pm0.054_{\mathrm syst.}\right)\times 10^{-3}$. The decay asymmetry parameter, which characterizes the effect of parity violation in the decay, is determined to be $-0.741 \pm 0.062_{\mathrm stat.}\pm 0.019_{\mathrm syst.}$. The obtained results are consistent with the world average values within the uncertainties, offering valuable insights into the underlying mechanism governing the weak radiative hyperon decays. The charge conjugation parity ($CP$) symmetries of branching fraction and decay asymmetry parameter in the decay are also studied. No statistically significant violation of charge conjugation parity symmetry is observed.
△ Less
Submitted 29 August, 2024; v1 submitted 29 August, 2024;
originally announced August 2024.
-
Model-independent determination of the strong-phase difference between $D^0$ and $\bar{D}^0 \to π^+π^-π^+π^-$ decays
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (647 additional authors not shown)
Abstract:
Measurements of the strong-phase difference between $D^0$ and $\bar{D}^0\toπ^+π^-π^+π^-$ are performed in bins of phase space. The study exploits a sample of quantum-correlated $D\bar{D}$ mesons collected by the BESIII experiment in $e^+e^-$ collisions at a center-of-mass energy of 3.773~GeV, corresponding to an integrated luminosity of 2.93~fb$^{-1}$. Here, $D$ denotes a neutral charm meson in a…
▽ More
Measurements of the strong-phase difference between $D^0$ and $\bar{D}^0\toπ^+π^-π^+π^-$ are performed in bins of phase space. The study exploits a sample of quantum-correlated $D\bar{D}$ mesons collected by the BESIII experiment in $e^+e^-$ collisions at a center-of-mass energy of 3.773~GeV, corresponding to an integrated luminosity of 2.93~fb$^{-1}$. Here, $D$ denotes a neutral charm meson in a superposition of flavor eigenstates. The reported results are valuable for measurements of the $C\!P$-violating phase $γ$ (also denoted $φ_3$) in $B^\pm \to DK^\pm$, $D \to π^+π^-π^+π^-$ decays, and the binning schemes are designed to provide good statistical sensitivity to this parameter. The expected uncertainty on $γ$ arising from the precision of the strong-phase measurements, when applied to very large samples of $B$-meson decays, is around $1.5^\circ$ or $2^\circ$, depending on the binning scheme. The binned strong-phase parameters are combined to give a value of $F_+^{4π} = 0.746 \pm 0.010 \pm 0.004$ for the $C\!P$-even fraction of $D^0 \to π^+π^-π^+π^-$ decays, which is around 30\% more precise than the previous best measurement of this quantity.
△ Less
Submitted 29 August, 2024;
originally announced August 2024.
-
Ultrafast symmetry control in photoexcited quantum dots
Authors:
Burak Guzelturk,
Joshua Portner,
Justin Ondry,
Samira Ghanbarzadeh,
Mia Tarantola,
Ahhyun Jeong,
Thomas Field,
Alicia M. Chandler,
Eliza Wieman,
Thomas R. Hopper,
Nicolas E. Watkins,
Jin Yue,
Xinxin Cheng,
Ming-Fu Lin,
Duan Luo,
Patrick L. Kramer,
Xiaozhe Shen,
Alexander H. Reid,
Olaf Borkiewicz,
Uta Ruett,
Xiaoyi Zhang,
Aaron M. Lindenberg,
Jihong Ma,
Richard Schaller,
Dmitri V. Talapin
, et al. (1 additional authors not shown)
Abstract:
Symmetry control is essential for realizing unconventional properties, such as ferroelectricity, nonlinear optical responses, and complex topological order, thus it holds promise for the design of emerging quantum and photonic systems. Nevertheless, fast and reversible control of symmetry in materials remains a challenge, especially for nanoscale systems. Here, we unveil reversible symmetry change…
▽ More
Symmetry control is essential for realizing unconventional properties, such as ferroelectricity, nonlinear optical responses, and complex topological order, thus it holds promise for the design of emerging quantum and photonic systems. Nevertheless, fast and reversible control of symmetry in materials remains a challenge, especially for nanoscale systems. Here, we unveil reversible symmetry changes in colloidal lead chalcogenide quantum dots on picosecond timescales. Using a combination of ultrafast electron diffraction and total X-ray scattering, in conjunction with atomic-scale structural modeling and first-principles calculations, we reveal that symmetry-broken lead sulfide quantum dots restore to a centrosymmetric phase upon photoexcitation. The symmetry restoration is driven by photoexcited electronic carriers, which suppress lead off-centering for about 100 ps. Furthermore, the change in symmetry is closely correlated with the electronic properties as shown by transient optical measurements. Overall, this study elucidates reversible symmetry changes in colloidal quantum dots, and more broadly defines a new methodology to optically control symmetry in nanoscale systems on ultrafast timescales.
△ Less
Submitted 27 August, 2024;
originally announced August 2024.
-
Aggressively-Dissipative Dark Dwarfs: The Effects of Atomic Dark Matter on the Inner Densities of Isolated Dwarf Galaxies
Authors:
Sandip Roy,
Xuejian Shen,
Jared Barron,
Mariangela Lisanti,
David Curtin,
Norman Murray,
Philip F. Hopkins
Abstract:
We present the first suite of cosmological hydrodynamical zoom-in simulations of isolated dwarf galaxies for a dark sector that consists of Cold Dark Matter and a strongly-dissipative sub-component. The simulations are implemented in GIZMO and include standard baryons following the FIRE-2 galaxy formation physics model. The dissipative dark matter is modeled as Atomic Dark Matter (aDM), which form…
▽ More
We present the first suite of cosmological hydrodynamical zoom-in simulations of isolated dwarf galaxies for a dark sector that consists of Cold Dark Matter and a strongly-dissipative sub-component. The simulations are implemented in GIZMO and include standard baryons following the FIRE-2 galaxy formation physics model. The dissipative dark matter is modeled as Atomic Dark Matter (aDM), which forms a dark hydrogen gas that cools in direct analogy to the Standard Model. Our suite includes seven different simulations of $\sim 10^{10} M_{\odot}$ systems that vary over the aDM microphysics and the dwarf's evolutionary history. We identify a region of aDM parameter space where the cooling rate is aggressive and the resulting halo density profile is universal. In this regime, the aDM gas cools rapidly at high redshifts and only a small fraction survives in the form of a central dark gas disk; the majority collapses centrally into collisionless dark "clumps", which are clusters of sub-resolution dark compact objects. These dark clumps rapidly equilibrate in the inner galaxy, resulting in an approximately isothermal distribution that can be modeled with a simple fitting function. Even when only a small fraction ($\sim 5\%$) of the total dark matter is strongly dissipative, the central densities of classical dwarf galaxies can be enhanced by over an order of magnitude, providing a sharp prediction for observations.
△ Less
Submitted 27 August, 2024;
originally announced August 2024.
-
Evolvable Psychology Informed Neural Network for Memory Behavior Modeling
Authors:
Xiaoxuan Shen,
Zhihai Hu,
Qirong Chen,
Shengyingjie Liu,
Ruxia Liang,
Jianwen Sun
Abstract:
Memory behavior modeling is a core issue in cognitive psychology and education. Classical psychological theories typically use memory equations to describe memory behavior, which exhibits insufficient accuracy and controversy, while data-driven memory modeling methods often require large amounts of training data and lack interpretability. Knowledge-informed neural network models have shown excelle…
▽ More
Memory behavior modeling is a core issue in cognitive psychology and education. Classical psychological theories typically use memory equations to describe memory behavior, which exhibits insufficient accuracy and controversy, while data-driven memory modeling methods often require large amounts of training data and lack interpretability. Knowledge-informed neural network models have shown excellent performance in fields like physics, but there have been few attempts in the domain of behavior modeling. This paper proposed a psychology theory informed neural networks for memory behavior modeling named PsyINN, where it constructs a framework that combines neural network with differentiating sparse regression, achieving joint optimization. Specifically, to address the controversies and ambiguity of descriptors in memory equations, a descriptor evolution method based on differentiating operators is proposed to achieve precise characterization of descriptors and the evolution of memory theoretical equations. Additionally, a buffering mechanism for the sparse regression and a multi-module alternating iterative optimization method are proposed, effectively mitigating gradient instability and local optima issues. On four large-scale real-world memory behavior datasets, the proposed method surpasses the state-of-the-art methods in prediction accuracy. Ablation study demonstrates the effectiveness of the proposed refinements, and application experiments showcase its potential in inspiring psychological research.
△ Less
Submitted 22 August, 2024;
originally announced August 2024.
-
The Ultraviolet Slopes of Early Universe Galaxies: The Impact of Bursty Star Formation, Dust, and Nebular Continuum Emission
Authors:
Desika Narayanan,
Daniel P. Stark,
Steven L. Finkelstein,
Paul Torrey,
Qi Li,
Fergus Cullen,
Micheal W. Topping,
Federico Marinacci,
Laura V. Sales,
Xuejian Shen,
Mark Vogelsberger
Abstract:
JWST has enabled the detection of the UV continuum of galaxies at z>10, evidencing a population of extremely blue, potentially dust-free galaxies. Interpreting the UV spectra of galaxies as they redden is complicated by the well-known degeneracy between stellar ages, dust, and nebular continuum. The main goal of this paper is to develop a theoretical model for the relationship between galaxy UV sl…
▽ More
JWST has enabled the detection of the UV continuum of galaxies at z>10, evidencing a population of extremely blue, potentially dust-free galaxies. Interpreting the UV spectra of galaxies as they redden is complicated by the well-known degeneracy between stellar ages, dust, and nebular continuum. The main goal of this paper is to develop a theoretical model for the relationship between galaxy UV slopes, bursty star formation histories, dust evolution, and the contribution from nebular regions. We accomplish this via cosmological zoom-in simulations, and in specific, build a layered model where we simulate the UV slopes of galaxies with increasingly complex physics. Our main results follow. (i) Unattenuated stellar populations with no nebular emission exhibit a diverse range of intrinsic UV slopes, with values ranging from beta ~ -3 --> -2.2 due to long delays between bursts. This is manifested by an inverse correlation between the intrinsic UV slope and sSFR for early galaxies such that higher sSFR corresponds to bluer UV slopes. (ii) When including dust, our model galaxies demonstrate a rapid rise in dust obscuration between z ~ 8-10. This increase in dust mass is due to high grain-grain shattering rates, and enhanced growth per unit dust mass in very small grains, resulting in UV-detected galaxies at z ~ 12 descending into ALMA-detectable galaxies by z ~ 6. The rapid rise in dust content at z ~ 8-10 leads to a systematic reddening of the UV slopes during this redshift range. (iii) The inclusion of nebular continuum reddens the UV slope by a median factor Delta beta ~ 0.2-0.4. However, when including nebular continuum, our highest redshift galaxies (z~12) are insufficiently blue compared to observations; this may imply an evolving escape fraction from HII regions with redshift.
△ Less
Submitted 23 August, 2024;
originally announced August 2024.
-
Zeoformer: Coarse-Grained Periodic Graph Transformer for OSDA-Zeolite Affinity Prediction
Authors:
Xiangxiang Shen,
Zheng Wan,
Lingfeng Wen,
Licheng Sun,
Ou Yang Ming Jie,
Xuan Tang,
Xian Zeng,
Mingsong Chen,
Xiao He,
Xian Wei
Abstract:
To date, the International Zeolite Association Structure Commission (IZA-SC) has cataloged merely 255 distinct zeolite structures, with millions of theoretically possible structures yet to be discovered. The synthesis of a specific zeolite typically necessitates the use of an organic structure-directing agent (OSDA), since the selectivity for a particular zeolite is largely determined by the affin…
▽ More
To date, the International Zeolite Association Structure Commission (IZA-SC) has cataloged merely 255 distinct zeolite structures, with millions of theoretically possible structures yet to be discovered. The synthesis of a specific zeolite typically necessitates the use of an organic structure-directing agent (OSDA), since the selectivity for a particular zeolite is largely determined by the affinity between the OSDA and the zeolite. Therefore, finding the best affinity OSDA-zeolite pair is the key to the synthesis of targeted zeolite. However, OSDA-zeolite pairs frequently exhibit complex geometric structures, i.e., a complex crystal structure formed by a large number of atoms. Although some existing machine learning methods can represent the periodicity of crystals, they cannot accurately represent crystal structures with local variability. To address this issue, we propose a novel approach called Zeoformer, which can effectively represent coarse-grained crystal periodicity and fine-grained local variability. Zeoformer reconstructs the unit cell centered around each atom and encodes the pairwise distances between this central atom and other atoms within the reconstructed unit cell. The introduction of pairwise distances within the reconstructed unit cell more effectively represents the overall structure of the unit cell and the differences between different unit cells, enabling the model to more accurately and efficiently predict the properties of OSDA-zeolite pairs and general crystal structures. Through comprehensive evaluation, our Zeoformer model demonstrates the best performance on OSDA-zeolite pair datasets and two types of crystal material datasets.
△ Less
Submitted 25 August, 2024; v1 submitted 23 August, 2024;
originally announced August 2024.
-
Generative AI based Secure Wireless Sensing for ISAC Networks
Authors:
Jiacheng Wang,
Hongyang Du,
Yinqiu Liu,
Geng Sun,
Dusit Niyato,
Shiwen Mao,
Dong In Kim,
Xuemin Shen
Abstract:
Integrated sensing and communications (ISAC) is expected to be a key technology for 6G, and channel state information (CSI) based sensing is a key component of ISAC. However, current research on ISAC focuses mainly on improving sensing performance, overlooking security issues, particularly the unauthorized sensing of users. In this paper, we propose a secure sensing system (DFSS) based on two dist…
▽ More
Integrated sensing and communications (ISAC) is expected to be a key technology for 6G, and channel state information (CSI) based sensing is a key component of ISAC. However, current research on ISAC focuses mainly on improving sensing performance, overlooking security issues, particularly the unauthorized sensing of users. In this paper, we propose a secure sensing system (DFSS) based on two distinct diffusion models. Specifically, we first propose a discrete conditional diffusion model to generate graphs with nodes and edges, guiding the ISAC system to appropriately activate wireless links and nodes, which ensures the sensing performance while minimizing the operation cost. Using the activated links and nodes, DFSS then employs the continuous conditional diffusion model to generate safeguarding signals, which are next modulated onto the pilot at the transmitter to mask fluctuations caused by user activities. As such, only ISAC devices authorized with the safeguarding signals can extract the true CSI for sensing, while unauthorized devices are unable to achieve the same sensing. Experiment results demonstrate that DFSS can reduce the activity recognition accuracy of the unauthorized devices by approximately 70%, effectively shield the user from the unauthorized surveillance.
△ Less
Submitted 21 August, 2024;
originally announced August 2024.
-
Sliced Maximal Information Coefficient: A Training-Free Approach for Image Quality Assessment Enhancement
Authors:
Kang Xiao,
Xu Wang,
Yulin He,
Baoliang Chen,
Xuelin Shen
Abstract:
Full-reference image quality assessment (FR-IQA) models generally operate by measuring the visual differences between a degraded image and its reference. However, existing FR-IQA models including both the classical ones (eg, PSNR and SSIM) and deep-learning based measures (eg, LPIPS and DISTS) still exhibit limitations in capturing the full perception characteristics of the human visual system (HV…
▽ More
Full-reference image quality assessment (FR-IQA) models generally operate by measuring the visual differences between a degraded image and its reference. However, existing FR-IQA models including both the classical ones (eg, PSNR and SSIM) and deep-learning based measures (eg, LPIPS and DISTS) still exhibit limitations in capturing the full perception characteristics of the human visual system (HVS). In this paper, instead of designing a new FR-IQA measure, we aim to explore a generalized human visual attention estimation strategy to mimic the process of human quality rating and enhance existing IQA models. In particular, we model human attention generation by measuring the statistical dependency between the degraded image and the reference image. The dependency is captured in a training-free manner by our proposed sliced maximal information coefficient and exhibits surprising generalization in different IQA measures. Experimental results verify the performance of existing IQA models can be consistently improved when our attention module is incorporated. The source code is available at https://github.com/KANGX99/SMIC.
△ Less
Submitted 19 August, 2024;
originally announced August 2024.
-
PA-LLaVA: A Large Language-Vision Assistant for Human Pathology Image Understanding
Authors:
Dawei Dai,
Yuanhui Zhang,
Long Xu,
Qianlan Yang,
Xiaojing Shen,
Shuyin Xia,
Guoyin Wang
Abstract:
The previous advancements in pathology image understanding primarily involved developing models tailored to specific tasks. Recent studies has demonstrated that the large vision-language model can enhance the performance of various downstream tasks in medical image understanding. In this study, we developed a domain-specific large language-vision assistant (PA-LLaVA) for pathology image understand…
▽ More
The previous advancements in pathology image understanding primarily involved developing models tailored to specific tasks. Recent studies has demonstrated that the large vision-language model can enhance the performance of various downstream tasks in medical image understanding. In this study, we developed a domain-specific large language-vision assistant (PA-LLaVA) for pathology image understanding. Specifically, (1) we first construct a human pathology image-text dataset by cleaning the public medical image-text data for domain-specific alignment; (2) Using the proposed image-text data, we first train a pathology language-image pretraining (PLIP) model as the specialized visual encoder for pathology image, and then we developed scale-invariant connector to avoid the information loss caused by image scaling; (3) We adopt two-stage learning to train PA-LLaVA, first stage for domain alignment, and second stage for end to end visual question \& answering (VQA) task. In experiments, we evaluate our PA-LLaVA on both supervised and zero-shot VQA datasets, our model achieved the best overall performance among multimodal models of similar scale. The ablation experiments also confirmed the effectiveness of our design. We posit that our PA-LLaVA model and the datasets presented in this work can promote research in field of computational pathology. All codes are available at: https://github.com/ddw2AIGROUP2CQUPT/PA-LLaVA}{https://github.com/ddw2AIGROUP2CQUPT/PA-LLaVA
△ Less
Submitted 18 August, 2024;
originally announced August 2024.
-
Search for the rare decay $J/ψ\to γD^0+c.c.$ at BESIII
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (642 additional authors not shown)
Abstract:
Using $(10087\pm44)\times10^6J/ψ$ events collected with the BESIII detector, we search for the rare decay $J/ψ\to γD^0+c.c.$ for the first time. No obvious signal is observed and the upper limit on the branching fraction is determined to be ${\cal B}(J/ψ\to γD^{0}+c.c.)< 9.1 \times 10^{-8}$ at 90\% confidence level.
Using $(10087\pm44)\times10^6J/ψ$ events collected with the BESIII detector, we search for the rare decay $J/ψ\to γD^0+c.c.$ for the first time. No obvious signal is observed and the upper limit on the branching fraction is determined to be ${\cal B}(J/ψ\to γD^{0}+c.c.)< 9.1 \times 10^{-8}$ at 90\% confidence level.
△ Less
Submitted 16 August, 2024;
originally announced August 2024.
-
RadioDiff: An Effective Generative Diffusion Model for Sampling-Free Dynamic Radio Map Construction
Authors:
Xiucheng Wang,
Keda Tao,
Nan Cheng,
Zhisheng Yin,
Zan Li,
Yuan Zhang,
Xuemin Shen
Abstract:
Radio map (RM) is a promising technology that can obtain pathloss based on only location, which is significant for 6G network applications to reduce the communication costs for pathloss estimation. However, the construction of RM in traditional is either computationally intensive or depends on costly sampling-based pathloss measurements. Although the neural network (NN)-based method can efficientl…
▽ More
Radio map (RM) is a promising technology that can obtain pathloss based on only location, which is significant for 6G network applications to reduce the communication costs for pathloss estimation. However, the construction of RM in traditional is either computationally intensive or depends on costly sampling-based pathloss measurements. Although the neural network (NN)-based method can efficiently construct the RM without sampling, its performance is still suboptimal. This is primarily due to the misalignment between the generative characteristics of the RM construction problem and the discrimination modeling exploited by existing NN-based methods. Thus, to enhance RM construction performance, in this paper, the sampling-free RM construction is modeled as a conditional generative problem, where a denoised diffusion-based method, named RadioDiff, is proposed to achieve high-quality RM construction. In addition, to enhance the diffusion model's capability of extracting features from dynamic environments, an attention U-Net with an adaptive fast Fourier transform module is employed as the backbone network to improve the dynamic environmental features extracting capability. Meanwhile, the decoupled diffusion model is utilized to further enhance the construction performance of RMs. Moreover, a comprehensive theoretical analysis of why the RM construction is a generative problem is provided for the first time, from both perspectives of data features and NN training methods. Experimental results show that the proposed RadioDiff achieves state-of-the-art performance in all three metrics of accuracy, structural similarity, and peak signal-to-noise ratio. The code is available at https://github.com/UNIC-Lab/RadioDiff.
△ Less
Submitted 16 August, 2024;
originally announced August 2024.
-
Time correlations in the inverse-gamma polymer with flat initial condition
Authors:
Xiao Shen
Abstract:
Temporal correlations in the KPZ universality class have gained significant attention, following the conjectures in [Ferr-Spoh'16]. Building on prior work in the zero temperature setting [Basu-Gang-Zhan'21], we address the time correlation problem with flat initial conditions in the positive temperature regime. Our study focuses on the inverse-gamma polymer, where we establish an upper bound for t…
▽ More
Temporal correlations in the KPZ universality class have gained significant attention, following the conjectures in [Ferr-Spoh'16]. Building on prior work in the zero temperature setting [Basu-Gang-Zhan'21], we address the time correlation problem with flat initial conditions in the positive temperature regime. Our study focuses on the inverse-gamma polymer, where we establish an upper bound for the correlation between two free energies whose endpoints are far apart in time. In contrast to the previous work [Basu-Gang-Zhan'21], our work not only extends the result to positive temperatures but also eliminates the reliance on integrable probability inputs related to the Airy process. This advancement allows us to address local scales, where the short time remains fixed while the large time grows arbitrarily, a scenario beyond the reach of the Airy scaling limit.
△ Less
Submitted 15 August, 2024;
originally announced August 2024.
-
Fast and Accurate Algorithms to Calculate Expected Modularity in Probabilistic Networks
Authors:
Xin Shen,
Matteo Magnani,
Christian Rohner,
Fiona Skerman
Abstract:
Modularity maximization is a widely used community detection technique for deterministic networks. However, little research has been performed to develop efficient modularity calculation algorithms for probabilistic networks. Particularly, it is challenging to efficiently calculate expected modularity when all possible worlds are considered. To address this problem, we propose two algorithms, name…
▽ More
Modularity maximization is a widely used community detection technique for deterministic networks. However, little research has been performed to develop efficient modularity calculation algorithms for probabilistic networks. Particularly, it is challenging to efficiently calculate expected modularity when all possible worlds are considered. To address this problem, we propose two algorithms, namely $\mathrm{PWP}^{\mathrm{EMOD}}$ and $\mathrm{APWP}^{\mathrm{EMOD}}$, partitioning the possible worlds based on their modularities to significantly reduce the number of probability calculations. We evaluate the accuracy and time efficiency of our algorithms through comprehensive experiments.
△ Less
Submitted 13 August, 2024;
originally announced August 2024.
-
Search for $η_c(2S)\toωω$ and $ωφ$ decays and measurements of $χ_{cJ}\toωω$ and $ωφ$ in $ψ(2S)$ radiative processes
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (643 additional authors not shown)
Abstract:
Using $(2712\pm 14)$ $\times$ 10$^{6}$ $ψ(2S)$ events collected with the BESIII detector at the BEPCII collider, we search for the decays $η_{c}(2S)\toωω$ and $η_{c}(2S)\toωφ$ via the process $ψ(2S)\toγη_{c}(2S)$. Evidence of $η_{c}(2S)\toωω$ is found with a statistical significance of $3.2σ$. The branching fraction is measured to be…
▽ More
Using $(2712\pm 14)$ $\times$ 10$^{6}$ $ψ(2S)$ events collected with the BESIII detector at the BEPCII collider, we search for the decays $η_{c}(2S)\toωω$ and $η_{c}(2S)\toωφ$ via the process $ψ(2S)\toγη_{c}(2S)$. Evidence of $η_{c}(2S)\toωω$ is found with a statistical significance of $3.2σ$. The branching fraction is measured to be $\mathcal{B}(η_{c}(2S)\toωω)=(5.65\pm3.77(\rm stat.)\pm5.32(\rm syst.))\times10^{-4}$. No statistically significant signal is observed for the decay $η_{c}(2S)\toωφ$. The upper limit of the branching fraction at the 90\% confidence level is determined to be $\mathcal{B}(ψ(2S)\toγη_{c}(2S),η_{c}(2S)\toωφ)<2.24\times 10^{-7}$. We also update the branching fractions of $χ_{cJ}\to ωω$ and $χ_{cJ}\toωφ$ decays via the $ψ(2S)\toγχ_{cJ}$ transition. The branching fractions are determined to be $\mathcal{B}(χ_{c0}\toωω)=(10.63\pm0.11\pm0.46)\times 10^{-4}$, $\mathcal{B}(χ_{c1}\toωω)=(6.39\pm0.07\pm0.29)\times 10^{-4}$, $\mathcal{B}(χ_{c2}\toωω)=(8.50\pm0.08\pm0.38)\times 10^{-4}$, $\mathcal{B}(χ_{c0}\toωφ)=(1.18\pm0.03\pm0.05)\times 10^{-4}$, $\mathcal{B}(χ_{c1}\toωφ)=(2.03\pm0.15\pm0.12)\times 10^{-5}$, and $\mathcal{B}(χ_{c2}\toωφ)=(9.37\pm1.07\pm0.59)\times 10^{-6}$, where the first uncertainties are statistical and the second are systematic.
△ Less
Submitted 13 August, 2024;
originally announced August 2024.
-
Chain-of-Strategy Planning with LLMs: Aligning the Generation of Psychotherapy Dialogue with Strategy in Motivational Interviewing
Authors:
Xin Sun,
Xiao Tang,
Abdallah El Ali,
Zhuying Li,
Xiaoyu Shen,
Pengjie Ren,
Jan de Wit,
Jiahuan Pei,
Jos A. Bosch
Abstract:
Recent advancements in large language models (LLMs) have shown promise in generating psychotherapeutic dialogues, especially in Motivational Interviewing (MI). However, how to employ strategies, a set of motivational interviewing (MI) skills, to generate therapeutic-adherent conversations with explainability is underexplored. We propose an approach called strategy-aware dialogue generation with Ch…
▽ More
Recent advancements in large language models (LLMs) have shown promise in generating psychotherapeutic dialogues, especially in Motivational Interviewing (MI). However, how to employ strategies, a set of motivational interviewing (MI) skills, to generate therapeutic-adherent conversations with explainability is underexplored. We propose an approach called strategy-aware dialogue generation with Chain-of-Strategy (CoS) planning, which first predicts MI strategies as reasoning and utilizes these strategies to guide the subsequent dialogue generation. It brings the potential for controllable and explainable generation in psychotherapy by aligning the generated MI dialogues with therapeutic strategies. Extensive experiments including automatic and human evaluations are conducted to validate the effectiveness of the MI strategy. Our findings demonstrate the potential of LLMs in producing strategically aligned dialogues and suggest directions for practical applications in psychotherapeutic settings.
△ Less
Submitted 12 August, 2024;
originally announced August 2024.
-
AyE-Edge: Automated Deployment Space Search Empowering Accuracy yet Efficient Real-Time Object Detection on the Edge
Authors:
Chao Wu,
Yifan Gong,
Liangkai Liu,
Mengquan Li,
Yushu Wu,
Xuan Shen,
Zhimin Li,
Geng Yuan,
Weisong Shi,
Yanzhi Wang
Abstract:
Object detection on the edge (Edge-OD) is in growing demand thanks to its ever-broad application prospects. However, the development of this field is rigorously restricted by the deployment dilemma of simultaneously achieving high accuracy, excellent power efficiency, and meeting strict real-time requirements. To tackle this dilemma, we propose AyE-Edge, the first-of-this-kind development tool tha…
▽ More
Object detection on the edge (Edge-OD) is in growing demand thanks to its ever-broad application prospects. However, the development of this field is rigorously restricted by the deployment dilemma of simultaneously achieving high accuracy, excellent power efficiency, and meeting strict real-time requirements. To tackle this dilemma, we propose AyE-Edge, the first-of-this-kind development tool that explores automated algorithm-device deployment space search to realize Accurate yet power-Efficient real-time object detection on the Edge. Through a collaborative exploration of keyframe selection, CPU-GPU configuration, and DNN pruning strategy, AyE-Edge excels in extensive real-world experiments conducted on a mobile device. The results consistently demonstrate AyE-Edge's effectiveness, realizing outstanding real-time performance, detection accuracy, and notably, a remarkable 96.7% reduction in power consumption, compared to state-of-the-art (SOTA) competitors.
△ Less
Submitted 25 July, 2024;
originally announced August 2024.
-
Analysis of the dynamics of the decay $D^{+}\to K_{S}^{0} π^{0} e^{+}ν_{e}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (644 additional authors not shown)
Abstract:
The branching fraction of $D^+\to K_{S}^{0} π^{0}e^+ν_e$ is measured for the first time using $7.93~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at the center-of-mass energy $\sqrt{s}=3.773$~GeV with the BESIII detector operating at the BEPCII collider, and is determined to be ${\mathcal B}$($D^+\to K_S^0π^0e^+ν_e$) = $(0.881~\pm~0.017_{\rm stat.}~\pm~0.016_{\rm syst.})$\%. Based on a…
▽ More
The branching fraction of $D^+\to K_{S}^{0} π^{0}e^+ν_e$ is measured for the first time using $7.93~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at the center-of-mass energy $\sqrt{s}=3.773$~GeV with the BESIII detector operating at the BEPCII collider, and is determined to be ${\mathcal B}$($D^+\to K_S^0π^0e^+ν_e$) = $(0.881~\pm~0.017_{\rm stat.}~\pm~0.016_{\rm syst.})$\%. Based on an analysis of the $D^+\to K_S^0π^0e^+ν_e$ decay dynamics, we observe the $S\text{-}{\rm wave}$ and $P$-wave components with fractions of $f_{S\text{-}{\rm wave}}$ = $(6.13~\pm~0.27_{\rm stat.}~\pm ~0.30_{\rm syst.})\%$ and $f_{\bar K^{*}(892)^0}$ = $(93.88~\pm~0.27_{\rm stat.}~\pm~0.29_{\rm syst.})$\%, respectively. From these results, we obtain the branching fractions ${\mathcal B}$($D^+\to (K_S^0π^0)_{S\text{-}{\rm wave}}~e^+ν_e$) = $(5.41~\pm~0.35_{\rm stat.}~\pm~0.37_{\rm syst.})\times10^{-4}$ and ${\mathcal B}$($D^+\to \bar K^{*}(892)^0e^+ν_e$) = $(4.97~\pm~0.11_{\rm stat.}~\pm~0.12_{\rm syst.})$\%. In addition, the hadronic form-factor ratios of $D^{+} \to \bar {K}^{*}(892)^0e^+ν_e$ at $q^2=0$, assuming a single-pole dominance parameterization, are determined to be $r_V=\frac{V(0)}{A_1(0)}= 1.43~\pm~0.07_{\rm stat.}~\pm~0.03_{\rm syst.}$ and $r_2=\frac{A_2(0)}{A_1(0)}=0.72~\pm~0.06_{\rm stat.}~\pm~0.02_{\rm syst.}$.
△ Less
Submitted 8 August, 2024;
originally announced August 2024.
-
Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling
Authors:
Zilyu Ye,
Jinxiu Liu,
Ruotian Peng,
Jinjin Cao,
Zhiyang Chen,
Yiyang Zhang,
Ziwei Xuan,
Mingyuan Zhou,
Xiaoqian Shen,
Mohamed Elhoseiny,
Qi Liu,
Guo-Jun Qi
Abstract:
Recent image generation models excel at creating high-quality images from brief captions. However, they fail to maintain consistency of multiple instances across images when encountering lengthy contexts. This inconsistency is largely due to in existing training datasets the absence of granular instance feature labeling in existing training datasets. To tackle these issues, we introduce Openstory+…
▽ More
Recent image generation models excel at creating high-quality images from brief captions. However, they fail to maintain consistency of multiple instances across images when encountering lengthy contexts. This inconsistency is largely due to in existing training datasets the absence of granular instance feature labeling in existing training datasets. To tackle these issues, we introduce Openstory++, a large-scale dataset combining additional instance-level annotations with both images and text. Furthermore, we develop a training methodology that emphasizes entity-centric image-text generation, ensuring that the models learn to effectively interweave visual and textual information. Specifically, Openstory++ streamlines the process of keyframe extraction from open-domain videos, employing vision-language models to generate captions that are then polished by a large language model for narrative continuity. It surpasses previous datasets by offering a more expansive open-domain resource, which incorporates automated captioning, high-resolution imagery tailored for instance count, and extensive frame sequences for temporal consistency. Additionally, we present Cohere-Bench, a pioneering benchmark framework for evaluating the image generation tasks when long multimodal context is provided, including the ability to keep the background, style, instances in the given context coherent. Compared to existing benchmarks, our work fills critical gaps in multi-modal generation, propelling the development of models that can adeptly generate and interpret complex narratives in open-domain environments. Experiments conducted within Cohere-Bench confirm the superiority of Openstory++ in nurturing high-quality visual storytelling models, enhancing their ability to address open-domain generation tasks. More details can be found at https://openstorypp.github.io/
△ Less
Submitted 7 August, 2024;
originally announced August 2024.
-
Measurement of the Branching Fraction of \boldmath{$ψ(2S) \to γπ^0$}
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (644 additional authors not shown)
Abstract:
Based on $(2712.4\pm14.1)\times10^{6}~ψ(2S)$ events, 7.9 fb$^{-1}$ $ψ(3773)$ data, and 0.8 fb$^{-1}$ off-resonance data samples collected with the BESIII detector, we measure the branching fraction of $ψ(2S)\rightarrowγπ^{0}$ and $e^{+}e^{-}\rightarrowγπ^{0}$ form factor at momentum transfers $Q^{2}\sim13$ GeV$^{2}$. The $e^{+}e^{-}\rightarrowγπ^{0}$ cross section is fitted with considering the in…
▽ More
Based on $(2712.4\pm14.1)\times10^{6}~ψ(2S)$ events, 7.9 fb$^{-1}$ $ψ(3773)$ data, and 0.8 fb$^{-1}$ off-resonance data samples collected with the BESIII detector, we measure the branching fraction of $ψ(2S)\rightarrowγπ^{0}$ and $e^{+}e^{-}\rightarrowγπ^{0}$ form factor at momentum transfers $Q^{2}\sim13$ GeV$^{2}$. The $e^{+}e^{-}\rightarrowγπ^{0}$ cross section is fitted with considering the interference between the $ψ(2S)$ and continuum amplitudes and two solutions are found, ${\cal B}=3.74\times10^{-7}$ with $φ=3.93$ rad and ${\cal B}=7.87\times10^{-7}$ with $φ=2.08$ rad. Here, ${\cal B}$ is the branching fraction of $ψ(2S)\rightarrowγπ^{0}$ and $φ$ is the relative phase angle between the $ψ(2S)$ and continuum amplitudes. Due to insufficient off-resonance data, the branching fraction ${\cal B}(ψ(2S)\rightarrowγπ^{0})$ is determined to be in the range $[2.7, 9.7]\times10^{-7}$ within one standard deviation of the contour region.
△ Less
Submitted 7 August, 2024; v1 submitted 7 August, 2024;
originally announced August 2024.
-
Fractional Chern insulators in moiré flat bands with high Chern numbers
Authors:
Chonghao Wang,
Xiaoyang Shen,
Ruiping Guo,
Chong Wang,
Wenhui Duan,
Yong Xu
Abstract:
Recent discoveries of zero-field fractional Chern insulators in moiré materials have attracted intensive research interests. However, most current theoretical and experimental attempts focus on systems with low Chern number bands, in analogy to the Landau levels. Here we propose candidate material systems for realizing fractional Chern insulators with higher Chern numbers. The material setup invol…
▽ More
Recent discoveries of zero-field fractional Chern insulators in moiré materials have attracted intensive research interests. However, most current theoretical and experimental attempts focus on systems with low Chern number bands, in analogy to the Landau levels. Here we propose candidate material systems for realizing fractional Chern insulators with higher Chern numbers. The material setup involves $Γ$-valley twisted homobilayer transition metal dichalcogenides in proximity to a skyrmion lattice. The skyrmion exchange potential induces a flat band with a high Chern number $C = -2$. Using the momentum-space projected exact diagonalization method, we perform a comprehensive study at various filling factors, confirming the generalized Jain series. Our research provides theoretical guidance on realizing unconventional fractional Chern insulators beyond the Landau level picture.
△ Less
Submitted 6 August, 2024;
originally announced August 2024.
-
Measurement of $Σ^+$ transverse polarization in $e^+e^-$ collisions at $\sqrt{s} = 3.68-3.71$ GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (639 additional authors not shown)
Abstract:
Using $e^+e^-$ collision data collected with the BESIII detector at seven energy points ranging from 3.68 to 3.71 GeV and corresponding to an integrated luminosity of $652.1~{\rm pb^{-1}}$, we present an energy-dependent measurement of the transverse polarization, relative phase and modulus ratio of the electromagnetic form factors of the $Σ^+$ hyperon in the $e^+e^- \to Σ^+ \barΣ^-$ reaction. The…
▽ More
Using $e^+e^-$ collision data collected with the BESIII detector at seven energy points ranging from 3.68 to 3.71 GeV and corresponding to an integrated luminosity of $652.1~{\rm pb^{-1}}$, we present an energy-dependent measurement of the transverse polarization, relative phase and modulus ratio of the electromagnetic form factors of the $Σ^+$ hyperon in the $e^+e^- \to Σ^+ \barΣ^-$ reaction. These results are helpful to understand the production mechanism of the $Σ^+$-$\barΣ^-$ pairs.
△ Less
Submitted 7 August, 2024; v1 submitted 6 August, 2024;
originally announced August 2024.
-
Observation of $η_{c}(2S) \to K^{+}K^{-}η$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (639 additional authors not shown)
Abstract:
By analyzing $(27.12 \pm 0.14)\times10^{8}$ $ψ(3686)$ events accumulated with the BESIII detector, the decay $η_{c}(2S) \to K^{+} K^{-} η$ is observed for the first time with a significance of $6.2σ$ after considering systematic uncertainties. The product of the branching fractions of $ψ(3686) \to γη_{c}(2S)$ and $η_{c}(2S) \to K^{+} K^{-} η$ is measured to be…
▽ More
By analyzing $(27.12 \pm 0.14)\times10^{8}$ $ψ(3686)$ events accumulated with the BESIII detector, the decay $η_{c}(2S) \to K^{+} K^{-} η$ is observed for the first time with a significance of $6.2σ$ after considering systematic uncertainties. The product of the branching fractions of $ψ(3686) \to γη_{c}(2S)$ and $η_{c}(2S) \to K^{+} K^{-} η$ is measured to be $\mathcal{B}(ψ(3686) \toγη_{c}(2S))\times \mathcal{B}(η_{c}(2S)\to K^{+} K^{-}η)=(2.39 \pm 0.32 \pm 0.34) \times 10^{-6}$, where the first uncertainty is statistical, and the second one is systematic. The branching fraction of $η_{c}(2S)\to K^{+} K^{-}η$ is determined to be $\mathcal{B}(η_{c}(2S)\to K^{+} K^{-}η) = (3.42 \pm 0.46 \pm 0.48 \pm 2.44) \times 10^{-3}$, where the third uncertainty is due to the branching fraction of $ψ(3686) \to γη_{c}(2S)$. Using a recent BESIII measurement of $\mathcal{B} (η_{c}(2S) \to K^{+} K^{-}π^{0})$, we also determine the ratio between the branching fractions of $η_{c}(2S) \to K^{+} K^{-}η$ and $η_{c}(2S) \to K^{+} K^{-}π^{0}$ to be $1.49 \pm 0.22 \pm 0.25$, which is consistent with the previous result of BaBar at a comparable precision level.
△ Less
Submitted 5 August, 2024;
originally announced August 2024.
-
Search for $X(3872)\toπ^0π^0χ_{c1,2}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (638 additional authors not shown)
Abstract:
Using 10.1 fb$^{-1}$ of $e^+e^-$ collision data collected by the BESIII detector with center-of-mass energies between 4.15 GeV and 4.30 GeV, we search for the decays $X(3872)\toπ^0π^0χ_{c1,2}$, where the $X(3872)$ is produced in $e^+e^-\toγX(3872)$. No evidence above $3σ$ is found for either decay. Upper limits at the $90\%$ C.L. on the branching fractions of $X(3872)\toπ^0π^0χ_{c1,2}$ normalized…
▽ More
Using 10.1 fb$^{-1}$ of $e^+e^-$ collision data collected by the BESIII detector with center-of-mass energies between 4.15 GeV and 4.30 GeV, we search for the decays $X(3872)\toπ^0π^0χ_{c1,2}$, where the $X(3872)$ is produced in $e^+e^-\toγX(3872)$. No evidence above $3σ$ is found for either decay. Upper limits at the $90\%$ C.L. on the branching fractions of $X(3872)\toπ^0π^0χ_{c1,2}$ normalized to the branching fraction of $X(3872)\toπ^+π^-J/ψ$ are set to be $\mathcal{B}(X(3872)\toπ^0π^0χ_{c1})/\mathcal{B}(X(3872)\toπ^+π^-J/ψ) < 1.1$ and $\mathcal{B}(X(3872)\toπ^0π^0χ_{c2})/\mathcal{B}(X(3872)\toπ^+π^-J/ψ) < 0.5$, taking into account both statistical and systematic uncertainties.
△ Less
Submitted 2 August, 2024;
originally announced August 2024.
-
Game Theory Based Community-Aware Opinion Dynamics
Authors:
Shanfan Zhang,
Xiaoting Shen,
Zhan Bu
Abstract:
Examining the mechanisms underlying the formation and evolution of opinions within real-world social systems, which consist of numerous individuals, can provide valuable insights for effective social functioning and informed business decision making. The focus of our study is on the dynamics of opinions inside a networked multi-agent system. We provide a novel approach called the Game Theory Based…
▽ More
Examining the mechanisms underlying the formation and evolution of opinions within real-world social systems, which consist of numerous individuals, can provide valuable insights for effective social functioning and informed business decision making. The focus of our study is on the dynamics of opinions inside a networked multi-agent system. We provide a novel approach called the Game Theory Based Community-Aware Opinion Formation Process (GCAOFP) to accurately represent the co-evolutionary dynamics of communities and opinions in real-world social systems. The GCAOFP algorithm comprises two distinct steps in each iteration. 1) The Community Dynamics Process conceptualizes the process of community formation as a non-cooperative game involving a finite number of agents. Each individual agent aims to maximize their own utility by adopting a response that leads to the most favorable update of the community label. 2) The Opinion Formation Process involves the updating of an individual agent's opinion within a community-aware framework that incorporates bounded confidence. This process takes into account the updated matrix of community members and ensures that an agent's opinion aligns with the opinions of others within their community, within certain defined limits. The present study provides a theoretical proof that under any initial conditions, the aforementioned co-evolutionary dynamics process will ultimately reach an equilibrium state. In this state, both the opinion vector and community member matrix will stabilize after a finite number of iterations. In contrast to conventional opinion dynamics models, the guaranteed convergence of agent opinion within the same community ensures that the convergence of opinions takes place exclusively inside a given community.
△ Less
Submitted 2 August, 2024;
originally announced August 2024.
-
Partial wave analysis of $ψ(3686)\toΛ\barΣ^0π^0+c.c.$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (644 additional authors not shown)
Abstract:
Based on a sample of $(2712.4\pm14.3)\times10^6\;ψ(3686)$ events collected with the BESIII detector, a partial wave analysis of the decay $ψ(3686)\toΛ\barΣ^0π^0+c.c.$ is performed to investigate $Λ^*$ and $Σ^*$ resonances in the $π^0\barΣ^0$ and $π^0Λ$ invariant mass distributions. Significant contributions are found from the $Λ(1405)$, $Λ(1520)$, $Λ(1600)$, $Λ(1670)$, $Λ(1690)$, $Λ(1800)$,…
▽ More
Based on a sample of $(2712.4\pm14.3)\times10^6\;ψ(3686)$ events collected with the BESIII detector, a partial wave analysis of the decay $ψ(3686)\toΛ\barΣ^0π^0+c.c.$ is performed to investigate $Λ^*$ and $Σ^*$ resonances in the $π^0\barΣ^0$ and $π^0Λ$ invariant mass distributions. Significant contributions are found from the $Λ(1405)$, $Λ(1520)$, $Λ(1600)$, $Λ(1670)$, $Λ(1690)$, $Λ(1800)$, $Λ(1890)$, $Λ(2325)$, $Σ(1385)$, $Σ(1660)$, $Σ(1670)$, $Σ(1750)$, and $Σ(1910)$. The masses, widths, and production branching fractions for each component are determined. In addition, the branching fraction of $ψ(3686)\toΛ\barΣ^0π^0+c.c.$ is measured to be $(1.544\pm0.013\pm0.069)\times10^{-4}$ for the first time, where the first uncertainty is statistical and the second systematic.
△ Less
Submitted 1 August, 2024;
originally announced August 2024.
-
Supervised brain node and network construction under voxel-level functional imaging
Authors:
Wanwan Xu,
Selena Wang,
Chichun Tan,
Xilin Shen,
Wenjing Luo,
Todd Constable,
Tianxi Li,
Yize Zhao
Abstract:
Recent advancements in understanding the brain's functional organization related to behavior have been pivotal, particularly in the development of predictive models based on brain connectivity. Traditional methods in this domain often involve a two-step process by first constructing a connectivity matrix from predefined brain regions, and then linking these connections to behaviors or clinical out…
▽ More
Recent advancements in understanding the brain's functional organization related to behavior have been pivotal, particularly in the development of predictive models based on brain connectivity. Traditional methods in this domain often involve a two-step process by first constructing a connectivity matrix from predefined brain regions, and then linking these connections to behaviors or clinical outcomes. However, these approaches with unsupervised node partitions predict outcomes inefficiently with independently established connectivity. In this paper, we introduce the Supervised Brain Parcellation (SBP), a brain node parcellation scheme informed by the downstream predictive task. With voxel-level functional time courses generated under resting-state or cognitive tasks as input, our approach clusters voxels into nodes in a manner that maximizes the correlation between inter-node connections and the behavioral outcome, while also accommodating intra-node homogeneity. We rigorously evaluate the SBP approach using resting-state and task-based fMRI data from both the Adolescent Brain Cognitive Development (ABCD) study and the Human Connectome Project (HCP). Our analyses show that SBP significantly improves out-of-sample connectome-based predictive performance compared to conventional step-wise methods under various brain atlases. This advancement holds promise for enhancing our understanding of brain functional architectures with behavior and establishing more informative network neuromarkers for clinical applications.
△ Less
Submitted 30 July, 2024;
originally announced July 2024.
-
Observation of $D^0\to b_1(1235)^- e^+ν_e$ and evidence for $D^+\to b_1(1235)^0 e^+ν_e$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (647 additional authors not shown)
Abstract:
By analyzing a data sample of $e^+e^-$ collisions with center-of-mass energy $\sqrt{s}=3.773$ GeV, corresponding to an integrated luminosity of $7.9~\rm {fb}^{-1}$ collected with the BESIII detector operating at the BEPCII collider, we study semileptonic decays of the $D^{0(+)}$ mesons into the axial-vector meson $b_1(1235)$ via the decay $b_1(1235)\to ωπ$. The decay…
▽ More
By analyzing a data sample of $e^+e^-$ collisions with center-of-mass energy $\sqrt{s}=3.773$ GeV, corresponding to an integrated luminosity of $7.9~\rm {fb}^{-1}$ collected with the BESIII detector operating at the BEPCII collider, we study semileptonic decays of the $D^{0(+)}$ mesons into the axial-vector meson $b_1(1235)$ via the decay $b_1(1235)\to ωπ$. The decay $D^0\to b_1(1235)^-e^{+}ν_{e}$ is observed with a significance of 5.2$σ$ after considering systematic uncertainty, while evidence for the decay $D^+\to b_1(1235)^0 e^+ν_e$ is obtained with a 3.1$σ$ significance. The product branching fractions are determined to be ${\mathcal B}(D^0\to b_{1}(1235)^-e^{+}ν_{e})\times {\mathcal B} (b_1(1235)^-\to ωπ^-) = (0.72\pm0.18^{+0.06}_{-0.08})\times10^{-4}$ and ${\mathcal B}(D^+\to b_{1}(1235)^0e^{+}ν_{e})\times {\mathcal B} (b_1(1235)^0~\to ωπ^0) = (1.16\pm0.44\pm0.16)\times10^{-4}$, where the first uncertainties are statistical and the second systematic. The ratio of their partial decay widths is determined to be $\frac{Γ(D^0\to b_{1}(1235)^-e^{+}ν_{e})}{2Γ(D^+\to b_{1}(1235)^0e^{+}ν_{e})}=0.78\pm0.19^{+0.04}_{-0.05}$, which is consistent with unity, predicted by isospin invariance, within uncertainties.
△ Less
Submitted 30 July, 2024;
originally announced July 2024.
-
Measurement of the $\boldsymbol{e^{+}e^{-}\to K^+K^-ψ(2S)}$ Cross Section at Center-of-Mass Energies from 4.699 to 4.951 GeV and Search for $\boldsymbol{Z_{cs}^{\pm}}$ in the $\boldsymbol{Z_{cs}^\pm\to K^\pmψ(2S)}$ Decay
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (646 additional authors not shown)
Abstract:
We perform the first investigation of the process $e^{+}e^{-}\to K^+K^-ψ(2S)$ and report its Born cross sections over a range of center-of-mass energies from 4.699 to 4.951~GeV. The measurements are carried out using several partial reconstruction techniques using data samples collected by the BESIII detector with a total integrated luminosity of 2.5~fb$^{-1}$. We search for new tetraquark candida…
▽ More
We perform the first investigation of the process $e^{+}e^{-}\to K^+K^-ψ(2S)$ and report its Born cross sections over a range of center-of-mass energies from 4.699 to 4.951~GeV. The measurements are carried out using several partial reconstruction techniques using data samples collected by the BESIII detector with a total integrated luminosity of 2.5~fb$^{-1}$. We search for new tetraquark candidates $Z_{cs}^\pm$ in the decays $Z_{cs}^\pm\to K^\pmψ(2S)$. No significant $Z_{cs}^\pm$ signals are observed.
△ Less
Submitted 29 July, 2024;
originally announced July 2024.
-
Collaborative Evolving Strategy for Automatic Data-Centric Development
Authors:
Xu Yang,
Haotian Chen,
Wenjun Feng,
Haoxue Wang,
Zeqi Ye,
Xinjie Shen,
Xiao Yang,
Shizhao Sun,
Weiqing Liu,
Jiang Bian
Abstract:
Artificial Intelligence (AI) significantly influences many fields, largely thanks to the vast amounts of high-quality data for machine learning models. The emphasis is now on a data-centric AI strategy, prioritizing data development over model design progress. Automating this process is crucial. In this paper, we serve as the first work to introduce the automatic data-centric development (AD^2) ta…
▽ More
Artificial Intelligence (AI) significantly influences many fields, largely thanks to the vast amounts of high-quality data for machine learning models. The emphasis is now on a data-centric AI strategy, prioritizing data development over model design progress. Automating this process is crucial. In this paper, we serve as the first work to introduce the automatic data-centric development (AD^2) task and outline its core challenges, which require domain-experts-like task scheduling and implementation capability, largely unexplored by previous work.
By leveraging the strong complex problem-solving capabilities of large language models (LLMs), we propose an LLM-based autonomous agent, equipped with a strategy named Collaborative Knowledge-STudying-Enhanced Evolution by Retrieval (Co-STEER), to simultaneously address all the challenges. Specifically, our proposed Co-STEER agent enriches its domain knowledge through our proposed evolving strategy and develops both its scheduling and implementation skills by accumulating and retrieving domain-specific practical experience. With an improved schedule, the capability for implementation accelerates. Simultaneously, as implementation feedback becomes more thorough, the scheduling accuracy increases. These two capabilities evolve together through practical feedback, enabling a collaborative evolution process.
Extensive experimental results demonstrate that our Co-STEER agent breaks new ground in AD^2 research, possesses strong evolvable schedule and implementation ability, and demonstrates the significant effectiveness of its components. Our Co-STEER paves the way for AD^2 advancements.
△ Less
Submitted 26 July, 2024;
originally announced July 2024.
-
Search for $η_{c}(2S)\to K^+ K^- η^{\prime}$ decay
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (639 additional authors not shown)
Abstract:
Using $(2.712\pm0.014)\times10^{9}$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII, we find an evidence of the $η_{c}(2S)\to K^+ K^- η^{\prime}$ decay with a statistical significance of 3.1$σ$. Its decay branching fraction is measured to be $(12.24\pm4.60(\mathrm{stat.})\pm2.37(\mathrm{syst.})\pm4.68(\mathrm{extr.}))\times 10^{-4}$, where the first uncertainty is stati…
▽ More
Using $(2.712\pm0.014)\times10^{9}$ $ψ(3686)$ events collected with the BESIII detector operating at the BEPCII, we find an evidence of the $η_{c}(2S)\to K^+ K^- η^{\prime}$ decay with a statistical significance of 3.1$σ$. Its decay branching fraction is measured to be $(12.24\pm4.60(\mathrm{stat.})\pm2.37(\mathrm{syst.})\pm4.68(\mathrm{extr.}))\times 10^{-4}$, where the first uncertainty is statistical, the second is systematic, and the third uncertainty is from the branching fraction of the $ψ(3686)\toγη_{c}(2S)$ decay. The upper limit on the product branching fraction $B[ψ(3686)\toγη_{c}(2S)] \times$ $B[η_{c}(2S)\to K^+ K^- η^{\prime}]$ is set to be $1.14 \times 10^{-6}$ at $90\%$ confidence level. In addition, the branching fractions of $χ_{c1}\to K^+ K^- η^{\prime}$ and $χ_{c2}\to K^+ K^- η^{\prime}$ are updated to be $(8.47\pm0.09(\mathrm{stat.})\pm0.47(\mathrm{syst.}))\times 10^{-4}$ and $(1.53\pm0.04(\mathrm{stat.})\pm0.08(\mathrm{syst.}))\times 10^{-4}$, respectively. The precision is improved by twofold.
△ Less
Submitted 24 July, 2024;
originally announced July 2024.
-
Unveiling In-Context Learning: A Coordinate System to Understand Its Working Mechanism
Authors:
Anhao Zhao,
Fanghua Ye,
Jinlan Fu,
Xiaoyu Shen
Abstract:
Large language models (LLMs) exhibit remarkable in-context learning (ICL) capabilities. However, the underlying working mechanism of ICL remains poorly understood. Recent research presents two conflicting views on ICL: One attributes it to LLMs' inherent ability of task recognition, deeming label correctness and shot numbers of demonstrations as not crucial; the other emphasizes the impact of simi…
▽ More
Large language models (LLMs) exhibit remarkable in-context learning (ICL) capabilities. However, the underlying working mechanism of ICL remains poorly understood. Recent research presents two conflicting views on ICL: One attributes it to LLMs' inherent ability of task recognition, deeming label correctness and shot numbers of demonstrations as not crucial; the other emphasizes the impact of similar examples in the demonstrations, stressing the need for label correctness and more shots. In this work, we provide a Two-Dimensional Coordinate System that unifies both views into a systematic framework. The framework explains the behavior of ICL through two orthogonal variables: whether LLMs can recognize the task and whether similar examples are presented in the demonstrations. We propose the peak inverse rank metric to detect the task recognition ability of LLMs and study LLMs' reactions to different definitions of similarity. Based on these, we conduct extensive experiments to elucidate how ICL functions across each quadrant on multiple representative classification tasks. Finally, we extend our analyses to generation tasks, showing that our coordinate system can also be used to interpret ICL for generation tasks effectively.
△ Less
Submitted 24 July, 2024;
originally announced July 2024.
-
From Sands to Mansions: Enabling Automatic Full-Life-Cycle Cyberattack Construction with LLM
Authors:
Lingzhi Wang,
Jiahui Wang,
Kyle Jung,
Kedar Thiagarajan,
Emily Wei,
Xiangmin Shen,
Yan Chen,
Zhenyuan Li
Abstract:
The escalating battles between attackers and defenders in cybersecurity make it imperative to test and evaluate defense capabilities from the attackers' perspective. However, constructing full-life-cycle cyberattacks and performing red team emulations requires significant time and domain knowledge from security experts. Existing cyberattack simulation frameworks face challenges such as limited tec…
▽ More
The escalating battles between attackers and defenders in cybersecurity make it imperative to test and evaluate defense capabilities from the attackers' perspective. However, constructing full-life-cycle cyberattacks and performing red team emulations requires significant time and domain knowledge from security experts. Existing cyberattack simulation frameworks face challenges such as limited technical coverage, inability to conduct full-life-cycle attacks, and the need for manual infrastructure building. These limitations hinder the quality and diversity of the constructed attacks. In this paper, we leveraged the capabilities of Large Language Models (LLMs) in summarizing knowledge from existing attack intelligence and generating executable machine code based on human knowledge. we proposed AURORA, an automatic end-to-end cyberattack construction and emulation framework. AURORA can autonomously build multi-stage cyberattack plans based on Cyber Threat Intelligence (CTI) reports, construct the emulation infrastructures, and execute the attack procedures. We also developed an attack procedure knowledge graph to integrate knowledge about attack techniques throughout the full life cycle of advanced cyberattacks from various sources. We constructed and evaluated more than 20 full-life-cycle cyberattacks based on existing CTI reports. Compared to previous attack simulation frameworks, AURORA can construct multi-step attacks and the infrastructures in several minutes without human intervention. Furthermore, AURORA incorporates a wider range (40% more) of attack techniques into the constructed attacks in a more efficient way than the professional red teams. To benefit further research, we open-sourced the dataset containing the execution files and infrastructures of 20 emulated cyberattacks.
△ Less
Submitted 23 July, 2024;
originally announced July 2024.