-
Dynamical Vacuum Compressibility of Space
Authors:
Yu-Cun Xie,
Jen-Tsung Hsiang,
Bei-Lok Hu
Abstract:
This paper continues the investigation initiated in arXiv:2204.08634 into the quantum thermodynamic properties of space by deriving the vacuum compressibility of a variety of dynamical spacetimes containing massive and massless conformally coupled quantum fields. The quantum processes studied here include particle creation, Casimir effect, and the trace anomaly. The spaces include $S^2, S^3$, and…
▽ More
This paper continues the investigation initiated in arXiv:2204.08634 into the quantum thermodynamic properties of space by deriving the vacuum compressibility of a variety of dynamical spacetimes containing massive and massless conformally coupled quantum fields. The quantum processes studied here include particle creation, Casimir effect, and the trace anomaly. The spaces include $S^2, S^3$, and $T^3$ with prescribed time evolution and $S^1$, where the temporal developments are backreaction determined. Vacuum compressibility belongs to the same group of quantum thermodynamic / mechanical response functions as vacuum viscosity, a concept first proposed in 1970 by Zel'dovich for capturing the effects of vacuum particle production on the dynamics of the early universe, made precise by rigorous work of many authors in the following decade using quantum field theory in curved spacetime methodologies and semiclassical gravity theory for treating backreaction effects. Various subtleties in understanding the behavior of the vacuum energies of quantum field origins, negative pressures and novel complicated features of dynamical compressibility are discussed.
△ Less
Submitted 26 December, 2023; v1 submitted 14 December, 2023;
originally announced December 2023.
-
Connectivity Oracles for Predictable Vertex Failures
Authors:
Bingbing Hu,
Evangelos Kosinas,
Adam Polak
Abstract:
The problem of designing connectivity oracles supporting vertex failures is one of the basic data structures problems for undirected graphs. It is already well understood: previous works [Duan--Pettie STOC'10; Long--Saranurak FOCS'22] achieve query time linear in the number of failed vertices, and it is conditionally optimal as long as we require preprocessing time polynomial in the size of the gr…
▽ More
The problem of designing connectivity oracles supporting vertex failures is one of the basic data structures problems for undirected graphs. It is already well understood: previous works [Duan--Pettie STOC'10; Long--Saranurak FOCS'22] achieve query time linear in the number of failed vertices, and it is conditionally optimal as long as we require preprocessing time polynomial in the size of the graph and update time polynomial in the number of failed vertices.
We revisit this problem in the paradigm of algorithms with predictions: we ask if the query time can be improved if the set of failed vertices can be predicted beforehand up to a small number of errors. More specifically, we design a data structure that, given a graph $G=(V,E)$ and a set of vertices predicted to fail $\widehat{D} \subseteq V$ of size $d=|\widehat{D}|$, preprocesses it in time $\tilde{O}(d|E|)$ and then can receive an update given as the symmetric difference between the predicted and the actual set of failed vertices $\widehat{D} \triangle D = (\widehat{D} \setminus D) \cup (D \setminus \widehat{D})$ of size $η= |\widehat{D} \triangle D|$, process it in time $\tilde{O}(η^4)$, and after that answer connectivity queries in $G \setminus D$ in time $O(η)$. Viewed from another perspective, our data structure provides an improvement over the state of the art for the \emph{fully dynamic subgraph connectivity problem} in the \emph{sensitivity setting} [Henzinger--Neumann ESA'16].
We argue that the preprocessing time and query time of our data structure are conditionally optimal under standard fine-grained complexity assumptions.
△ Less
Submitted 1 July, 2024; v1 submitted 13 December, 2023;
originally announced December 2023.
-
Measurements of Born Cross Sections for $e^+e^-\to Λ_{c}^+ \barΛ_{c}(2595)^- + {\rm c.c.}$ and $e^+e^-\to Λ_{c}^+ \barΛ_{c}(2625)^- + {\rm c.c.}$ at $\sqrt{s}=$4918.0 and 4950.9 MeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (620 additional authors not shown)
Abstract:
Using $e^+e^-$ collision data collected with the BESIII detector operating at the BEPCII collider, the Born cross sections of $e^+e^-\to Λ_{c}^+ \barΛ_{c}(2595)^- + \rm{c.c.}$ and $e^+e^-\to Λ_{c}^+ \barΛ_{c}(2625)^- + \rm{c.c.}$ are measured for the first time at center-of-mass energies of $\sqrt{s}=4918.0$ and 4950.9 MeV. Non-zero cross sections are observed very close to the production threshol…
▽ More
Using $e^+e^-$ collision data collected with the BESIII detector operating at the BEPCII collider, the Born cross sections of $e^+e^-\to Λ_{c}^+ \barΛ_{c}(2595)^- + \rm{c.c.}$ and $e^+e^-\to Λ_{c}^+ \barΛ_{c}(2625)^- + \rm{c.c.}$ are measured for the first time at center-of-mass energies of $\sqrt{s}=4918.0$ and 4950.9 MeV. Non-zero cross sections are observed very close to the production threshold. The measured Born cross sections of $e^+e^-\to Λ_{c}^+ \barΛ_{c}(2625)^- + \rm{c.c.}$ are about $2\sim3$ times greater than those of $e^+e^-\to Λ_{c}^+ \barΛ_{c}(2595)^- + \rm{c.c.}$, thereby indicating that the exotic structure potentially exists in the excited charmed baryons. The Born cross sections are $15.6\pm3.1\pm0.9$ pb and $29.4\pm3.7\pm2.7$ pb for $e^+e^-\to Λ_{c}^+ \barΛ_{c}(2595)^- + \rm{c.c.}$, and are $43.4\pm4.0\pm4.1$ pb and $76.8\pm6.5\pm4.2$ pb for $e^+e^-\to Λ_{c}^+ \barΛ_{c}(2625)^- +\rm{c.c.}$ at $\sqrt s=4918.0$ and 4950.9 MeV, respectively. Based on the polar angle distributions of the $\barΛ_{c}(2625)^-$ and $Λ_{c}(2625)^+$, the form-factor ratios $\sqrt{|G_{E}|^2 + 3|G_{M}|^2}/|G_{C}|$ are determined for $e^+e^-\to Λ_{c}^+ \barΛ_{c}(2625)^- + \rm{c.c.}$ for the first time, which are $5.95\pm4.07\pm0.15$ and $0.94\pm0.32\pm0.02$ at $\sqrt s=4918.0$ and 4950.9 MeV, respectively. All of these first uncertainties are statistical and second systematic.
△ Less
Submitted 8 May, 2024; v1 submitted 13 December, 2023;
originally announced December 2023.
-
Search for $D^{0}\to K_{S}^{0} K^{-} e^{+}ν_{e}$, $D^{+}\to K_{S}^{0} K_{S}^{0} e^{+}ν_{e}$, and $D^{+}\to K^{+}K^{-} e^{+}ν_{e}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko
, et al. (604 additional authors not shown)
Abstract:
A search has been performed for the semileptonic decays $D^{0}\to K_{S}^{0} K^{-} e^{+}ν_{e}$, $D^{+}\to K_{S}^{0} K_{S}^{0} e^{+}ν_{e}$ and $D^{+}\to K^{+}K^{-} e^{+}ν_{e}$, using $7.9~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at the center-of-mass energy $\sqrt{s}=3.773$ GeV by the BESIII detector operating at the BEPCII collider. No significant signals are observed, and upper li…
▽ More
A search has been performed for the semileptonic decays $D^{0}\to K_{S}^{0} K^{-} e^{+}ν_{e}$, $D^{+}\to K_{S}^{0} K_{S}^{0} e^{+}ν_{e}$ and $D^{+}\to K^{+}K^{-} e^{+}ν_{e}$, using $7.9~\mathrm{fb}^{-1}$ of $e^+e^-$ annihilation data collected at the center-of-mass energy $\sqrt{s}=3.773$ GeV by the BESIII detector operating at the BEPCII collider. No significant signals are observed, and upper limits are set at the 90\% confidence level of $2.13\times10^{-5}$, $1.54\times10^{-5}$ and $2.10\times10^{-5}$ for the branching fractions of $D^{0}\to K_{S}^{0} K^{-} e^{+}ν_{e}$, $D^{+}\to K_{S}^{0} K_{S}^{0} e^{+}ν_{e}$ and $D^{+}\to K^{+}K^{-} e^{+}ν_{e}$, respectively.
△ Less
Submitted 10 December, 2023;
originally announced December 2023.
-
Determination of spin-parity quantum numbers of X(2370) as $0^{-+}$ from $J/ψ\rightarrowγK^{0}_{S}K^{0}_{S}η^{\prime}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (605 additional authors not shown)
Abstract:
Based on $(10087\pm44)\times10^{6}$ $J/ψ$ events collected with the BESIII detector, a partial wave analysis of the decay $J/ψ\rightarrowγK^{0}_{S}K^{0}_{S}η^{\prime}$ is performed. The mass and width of the $X(2370)$ are measured to be $2395 \pm 11 ({\rm stat})^{+26}_{-94}({\rm syst})\ \mathrm{MeV}/c^{2}$ and $188^{+18}_{-17}({\rm stat})^{+124}_{-33}({\rm syst})~\mathrm{MeV}$, respectively. The c…
▽ More
Based on $(10087\pm44)\times10^{6}$ $J/ψ$ events collected with the BESIII detector, a partial wave analysis of the decay $J/ψ\rightarrowγK^{0}_{S}K^{0}_{S}η^{\prime}$ is performed. The mass and width of the $X(2370)$ are measured to be $2395 \pm 11 ({\rm stat})^{+26}_{-94}({\rm syst})\ \mathrm{MeV}/c^{2}$ and $188^{+18}_{-17}({\rm stat})^{+124}_{-33}({\rm syst})~\mathrm{MeV}$, respectively. The corresponding product branching fraction is $\mathcal{B}[J/ψ\rightarrowγX(2370)] \times \mathcal{B}[X(2370) \rightarrow f_{0}(980)η^{\prime}] \times \mathcal{B}[f_{0}(980) \rightarrow K^{0}_{S}K^{0}_{S}] = \left( 1.31 \pm 0.22 ({\rm stat})^{+2.85}_{-0.84}({\rm syst}) \right) \times 10^{-5}$. The statistical significance of the $X(2370)$ is greater than $11.7σ$ and the spin-parity is determined to be $0^{-+}$ for the first time. The measured mass and spin-parity of the $X(2370)$ are consistent with the predictions of the lightest pseudoscalar glueball.
△ Less
Submitted 6 May, 2024; v1 submitted 8 December, 2023;
originally announced December 2023.
-
Making Large Language Models Better Knowledge Miners for Online Marketing with Progressive Prompting Augmentation
Authors:
Chunjing Gan,
Dan Yang,
Binbin Hu,
Ziqi Liu,
Yue Shen,
Zhiqiang Zhang,
Jinjie Gu,
Jun Zhou,
Guannan Zhang
Abstract:
Nowadays, the rapid development of mobile economy has promoted the flourishing of online marketing campaigns, whose success greatly hinges on the efficient matching between user preferences and desired marketing campaigns where a well-established Marketing-oriented Knowledge Graph (dubbed as MoKG) could serve as the critical "bridge" for preference propagation. In this paper, we seek to carefully…
▽ More
Nowadays, the rapid development of mobile economy has promoted the flourishing of online marketing campaigns, whose success greatly hinges on the efficient matching between user preferences and desired marketing campaigns where a well-established Marketing-oriented Knowledge Graph (dubbed as MoKG) could serve as the critical "bridge" for preference propagation. In this paper, we seek to carefully prompt a Large Language Model (LLM) with domain-level knowledge as a better marketing-oriented knowledge miner for marketing-oriented knowledge graph construction, which is however non-trivial, suffering from several inevitable issues in real-world marketing scenarios, i.e., uncontrollable relation generation of LLMs,insufficient prompting ability of a single prompt, the unaffordable deployment cost of LLMs. To this end, we propose PAIR, a novel Progressive prompting Augmented mIning fRamework for harvesting marketing-oriented knowledge graph with LLMs. In particular, we reduce the pure relation generation to an LLM based adaptive relation filtering process through the knowledge-empowered prompting technique. Next, we steer LLMs for entity expansion with progressive prompting augmentation,followed by a reliable aggregation with comprehensive consideration of both self-consistency and semantic relatedness. In terms of online serving, we specialize in a small and white-box PAIR (i.e.,LightPAIR),which is fine-tuned with a high-quality corpus provided by a strong teacher-LLM. Extensive experiments and practical applications in audience targeting verify the effectiveness of the proposed (Light)PAIR.
△ Less
Submitted 7 December, 2023;
originally announced December 2023.
-
Not All Negatives Are Worth Attending to: Meta-Bootstrapping Negative Sampling Framework for Link Prediction
Authors:
Yakun Wang,
Binbin Hu,
Shuo Yang,
Meiqi Zhu,
Zhiqiang Zhang,
Qiyang Zhang,
Jun Zhou,
Guo Ye,
Huimei He
Abstract:
The rapid development of graph neural networks (GNNs) encourages the rising of link prediction, achieving promising performance with various applications. Unfortunately, through a comprehensive analysis, we surprisingly find that current link predictors with dynamic negative samplers (DNSs) suffer from the migration phenomenon between "easy" and "hard" samples, which goes against the preference of…
▽ More
The rapid development of graph neural networks (GNNs) encourages the rising of link prediction, achieving promising performance with various applications. Unfortunately, through a comprehensive analysis, we surprisingly find that current link predictors with dynamic negative samplers (DNSs) suffer from the migration phenomenon between "easy" and "hard" samples, which goes against the preference of DNS of choosing "hard" negatives, thus severely hindering capability. Towards this end, we propose the MeBNS framework, serving as a general plugin that can potentially improve current negative sampling based link predictors. In particular, we elaborately devise a Meta-learning Supported Teacher-student GNN (MST-GNN) that is not only built upon teacher-student architecture for alleviating the migration between "easy" and "hard" samples but also equipped with a meta learning based sample re-weighting module for helping the student GNN distinguish "hard" samples in a fine-grained manner. To effectively guide the learning of MST-GNN, we prepare a Structure enhanced Training Data Generator (STD-Generator) and an Uncertainty based Meta Data Collector (UMD-Collector) for supporting the teacher and student GNN, respectively. Extensive experiments show that the MeBNS achieves remarkable performance across six link prediction benchmark datasets.
△ Less
Submitted 11 December, 2023; v1 submitted 7 December, 2023;
originally announced December 2023.
-
Amplitude Analysis of the Decays $D^0\toπ^+π^-π^+π^-$ and $π^+π^-π^0π^0$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (620 additional authors not shown)
Abstract:
Using $e^+e^-$ annihilation data corresponding to an integrated luminosity of 2.93 $\rm fb^{-1}$ taken at the center-of-mass energy $\sqrt{s}=3.773$~GeV with the BESIII detector, a joint amplitude analysis is performed on the decays $D^0\toπ^+π^-π^+π^-$ and $D^0\toπ^+π^-π^0π^0$(non-$η$). The fit fractions of individual components are obtained, and large interferences among the dominant components…
▽ More
Using $e^+e^-$ annihilation data corresponding to an integrated luminosity of 2.93 $\rm fb^{-1}$ taken at the center-of-mass energy $\sqrt{s}=3.773$~GeV with the BESIII detector, a joint amplitude analysis is performed on the decays $D^0\toπ^+π^-π^+π^-$ and $D^0\toπ^+π^-π^0π^0$(non-$η$). The fit fractions of individual components are obtained, and large interferences among the dominant components of $D^{0}\to a_{1}(1260)π$, $D^{0}\toπ(1300)π$, $D^{0}\toρ(770)ρ(770)$ and $D^{0}\to2(ππ)_{S}$ are found in both channels. With the obtained amplitude model, the $CP$-even fractions of $D^0\to π^+π^-π^+π^-$ and $D^0\toπ^+π^-π^0π^0$(non-$η$) are determined to be $(75.2\pm1.1_{\rm stat.}\pm1.5_{\rm syst.})\%$ and $(68.9\pm1.5_{\rm stat.}\pm 2.4_{\rm syst.})\%$, respectively. The branching fractions of $D^0\to π^+π^-π^+π^-$ and $D^0\toπ^+π^-π^0π^0$(non-$η$) are measured to be $(0.688\pm0.010_{\rm stat.}\pm 0.010_{\rm syst.})\%$ and $(0.951\pm0.025_{\rm stat.}\pm 0.021_{\rm syst.})\%$, respectively. The amplitude analysis provides an important model for binning strategy in the measurements of the strong phase parameters of $D^0 \to 4π$ when used to determine the CKM angle $γ(φ_{3})$ via the $B^{-}\to D K^{-}$ decay.
△ Less
Submitted 3 April, 2024; v1 submitted 5 December, 2023;
originally announced December 2023.
-
PEACE: Prototype lEarning Augmented transferable framework for Cross-domain rEcommendation
Authors:
Chunjing Gan,
Bo Huang,
Binbin Hu,
Jian Ma,
Ziqi Liu,
Zhiqiang Zhang,
Jun Zhou,
Guannan Zhang,
Wenliang Zhong
Abstract:
To help merchants/customers to provide/access a variety of services through miniapps, online service platforms have occupied a critical position in the effective content delivery, in which how to recommend items in the new domain launched by the service provider for customers has become more urgent. However, the non-negligible gap between the source and diversified target domains poses a considera…
▽ More
To help merchants/customers to provide/access a variety of services through miniapps, online service platforms have occupied a critical position in the effective content delivery, in which how to recommend items in the new domain launched by the service provider for customers has become more urgent. However, the non-negligible gap between the source and diversified target domains poses a considerable challenge to cross-domain recommendation systems, which often leads to performance bottlenecks in industrial settings. While entity graphs have the potential to serve as a bridge between domains, rudimentary utilization still fail to distill useful knowledge and even induce the negative transfer issue. To this end, we propose PEACE, a Prototype lEarning Augmented transferable framework for Cross-domain rEcommendation. For domain gap bridging, PEACE is built upon a multi-interest and entity-oriented pre-training architecture which could not only benefit the learning of generalized knowledge in a multi-granularity manner, but also help leverage more structural information in the entity graph. Then, we bring the prototype learning into the pre-training over source domains, so that representations of users and items are greatly improved by the contrastive prototype learning module and the prototype enhanced attention mechanism for adaptive knowledge utilization. To ease the pressure of online serving, PEACE is carefully deployed in a lightweight manner, and significant performance improvements are observed in both online and offline environments.
△ Less
Submitted 17 December, 2023; v1 submitted 4 December, 2023;
originally announced December 2023.
-
SANeRF-HQ: Segment Anything for NeRF in High Quality
Authors:
Yichen Liu,
Benran Hu,
Chi-Keung Tang,
Yu-Wing Tai
Abstract:
Recently, the Segment Anything Model (SAM) has showcased remarkable capabilities of zero-shot segmentation, while NeRF (Neural Radiance Fields) has gained popularity as a method for various 3D problems beyond novel view synthesis. Though there exist initial attempts to incorporate these two methods into 3D segmentation, they face the challenge of accurately and consistently segmenting objects in c…
▽ More
Recently, the Segment Anything Model (SAM) has showcased remarkable capabilities of zero-shot segmentation, while NeRF (Neural Radiance Fields) has gained popularity as a method for various 3D problems beyond novel view synthesis. Though there exist initial attempts to incorporate these two methods into 3D segmentation, they face the challenge of accurately and consistently segmenting objects in complex scenarios. In this paper, we introduce the Segment Anything for NeRF in High Quality (SANeRF-HQ) to achieve high-quality 3D segmentation of any target object in a given scene. SANeRF-HQ utilizes SAM for open-world object segmentation guided by user-supplied prompts, while leveraging NeRF to aggregate information from different viewpoints. To overcome the aforementioned challenges, we employ density field and RGB similarity to enhance the accuracy of segmentation boundary during the aggregation. Emphasizing on segmentation accuracy, we evaluate our method on multiple NeRF datasets where high-quality ground-truths are available or manually annotated. SANeRF-HQ shows a significant quality improvement over state-of-the-art methods in NeRF object segmentation, provides higher flexibility for object localization, and enables more consistent object segmentation across multiple views. Results and code are available at the project site: https://lyclyc52.github.io/SANeRF-HQ/.
△ Less
Submitted 6 April, 2024; v1 submitted 3 December, 2023;
originally announced December 2023.
-
Measurement of Branching Fractions for $Λ_{c}^{+} \rightarrow n K_{S}^{0} π^{+}$ and $Λ_{c}^{+} \rightarrow n K_{S}^{0} K^{+}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (603 additional authors not shown)
Abstract:
Based on 4.5 fb$^{-1}$ of $e^{+}e^{-}$ collision data accumulated at center-of-mass energies between $4.600\,\mathrm{GeV}$ and $4.699\,\mathrm{GeV}$ with the BESIII detector, we measure the absolute branching fraction of the Cabibbo-favored decay $Λ_{c}^{+} \rightarrow n K_{S}^{0} π^{+}$ with the precision improved by a factor of 2.8 and report the first evidence for the singly-Cabibbo-suppressed…
▽ More
Based on 4.5 fb$^{-1}$ of $e^{+}e^{-}$ collision data accumulated at center-of-mass energies between $4.600\,\mathrm{GeV}$ and $4.699\,\mathrm{GeV}$ with the BESIII detector, we measure the absolute branching fraction of the Cabibbo-favored decay $Λ_{c}^{+} \rightarrow n K_{S}^{0} π^{+}$ with the precision improved by a factor of 2.8 and report the first evidence for the singly-Cabibbo-suppressed decay $Λ_{c}^{+} \rightarrow n K_{S}^{0} K^{+}$. The branching fractions for $Λ_{c}^{+} \rightarrow n K_{S}^{0} π^{+}$ and $Λ_{c}^{+} \rightarrow n K_{S}^{0} K^{+}$ are determined to be $(1.86\pm0.08\pm0.04)\times10^{-2}$ and $\left(4.3^{+1.9}_{-1.5}\pm0.3\right)\times10^{-4}$, respectively, where the first uncertainties are statistical and the second ones are systematic.
△ Less
Submitted 28 November, 2023;
originally announced November 2023.
-
Droplet control based on pinning and substrate wettability
Authors:
Panagiotis E. Theodorakis,
Alidad Amirfazli,
Bin Hu,
Zhizhao Che
Abstract:
Pinning of liquid droplets on solid substrates is ubiquitous and plays an essential role in many applications, especially in various areas, such as microfluidics and biology. Although pinning can often reduce the efficiency of various applications, a deeper understanding of this phenomenon can actually offer possibilities for technological exploitation. Here, by means of molecular dynamics simulat…
▽ More
Pinning of liquid droplets on solid substrates is ubiquitous and plays an essential role in many applications, especially in various areas, such as microfluidics and biology. Although pinning can often reduce the efficiency of various applications, a deeper understanding of this phenomenon can actually offer possibilities for technological exploitation. Here, by means of molecular dynamics simulation, we identify the conditions that lead to droplet pinning or depinning and discuss the effects of key parameters in detail, such as the height of the physical pinning-barrier and the wettability of the substrates. Moreover, we describe the mechanism of the barrier crossing by the droplet upon depinning, identify the driving force of this process, and, also, elucidate the dynamics of the droplet. Not only does our work provide a detailed description of the pinning and depinning processes, but it also explicitly highlights how both processes can be exploited in nanotechnology applications to control droplet motion. Hence, we anticipate that our study will have significant implications for the nanoscale design of substrates in micro and nano-scale systems and will assist with assessing pinning effects in various applications.
△ Less
Submitted 27 November, 2023;
originally announced November 2023.
-
Towards Vision Enhancing LLMs: Empowering Multimodal Knowledge Storage and Sharing in LLMs
Authors:
Yunxin Li,
Baotian Hu,
Wei Wang,
Xiaochun Cao,
Min Zhang
Abstract:
Recent advancements in multimodal large language models (MLLMs) have achieved significant multimodal generation capabilities, akin to GPT-4. These models predominantly map visual information into language representation space, leveraging the vast knowledge and powerful text generation abilities of LLMs to produce multimodal instruction-following responses. We could term this method as LLMs for Vis…
▽ More
Recent advancements in multimodal large language models (MLLMs) have achieved significant multimodal generation capabilities, akin to GPT-4. These models predominantly map visual information into language representation space, leveraging the vast knowledge and powerful text generation abilities of LLMs to produce multimodal instruction-following responses. We could term this method as LLMs for Vision because of its employing LLMs for visual-language understanding, yet observe that these MLLMs neglect the potential of harnessing visual knowledge to enhance overall capabilities of LLMs, which could be regraded as Vision Enhancing LLMs. In this paper, we propose an approach called MKS2, aimed at enhancing LLMs through empowering Multimodal Knowledge Storage and Sharing in LLMs. Specifically, we introduce the Modular Visual Memory, a component integrated into the internal blocks of LLMs, designed to store open-world visual information efficiently. Additionally, we present a soft Mixtures-of-Multimodal Experts architecture in LLMs to invoke multimodal knowledge collaboration during generation. Our comprehensive experiments demonstrate that MKS2 substantially augments the reasoning capabilities of LLMs in contexts necessitating physical or commonsense knowledge. It also delivers competitive results on multimodal benchmarks.
△ Less
Submitted 27 November, 2023;
originally announced November 2023.
-
Which Matters Most in Making Fund Investment Decisions? A Multi-granularity Graph Disentangled Learning Framework
Authors:
Chunjing Gan,
Binbin Hu,
Bo Huang,
Tianyu Zhao,
Yingru Lin,
Wenliang Zhong,
Zhiqiang Zhang,
Jun Zhou,
Chuan Shi
Abstract:
In this paper, we highlight that both conformity and risk preference matter in making fund investment decisions beyond personal interest and seek to jointly characterize these aspects in a disentangled manner. Consequently, we develop a novel M ulti-granularity Graph Disentangled Learning framework named MGDL to effectively perform intelligent matching of fund investment products. Benefiting from…
▽ More
In this paper, we highlight that both conformity and risk preference matter in making fund investment decisions beyond personal interest and seek to jointly characterize these aspects in a disentangled manner. Consequently, we develop a novel M ulti-granularity Graph Disentangled Learning framework named MGDL to effectively perform intelligent matching of fund investment products. Benefiting from the well-established fund graph and the attention module, multi-granularity user representations are derived from historical behaviors to separately express personal interest, conformity and risk preference in a fine-grained way. To attain stronger disentangled representations with specific semantics, MGDL explicitly involve two self-supervised signals, i.e., fund type based contrasts and fund popularity. Extensive experiments in offline and online environments verify the effectiveness of MGDL.
△ Less
Submitted 23 November, 2023;
originally announced November 2023.
-
Large Language Model as a Policy Teacher for Training Reinforcement Learning Agents
Authors:
Zihao Zhou,
Bin Hu,
Chenyang Zhao,
Pu Zhang,
Bin Liu
Abstract:
Recent studies have uncovered the potential of Large Language Models (LLMs) in addressing complex sequential decision-making tasks through the provision of high-level instructions. However, LLM-based agents lack specialization in tackling specific target problems, particularly in real-time dynamic environments. Additionally, deploying an LLM-based agent in practical scenarios can be both costly an…
▽ More
Recent studies have uncovered the potential of Large Language Models (LLMs) in addressing complex sequential decision-making tasks through the provision of high-level instructions. However, LLM-based agents lack specialization in tackling specific target problems, particularly in real-time dynamic environments. Additionally, deploying an LLM-based agent in practical scenarios can be both costly and time-consuming. On the other hand, reinforcement learning (RL) approaches train agents that specialize in the target task but often suffer from low sampling efficiency and high exploration costs. In this paper, we introduce a novel framework that addresses these challenges by training a smaller, specialized student RL agent using instructions from an LLM-based teacher agent. By incorporating the guidance from the teacher agent, the student agent can distill the prior knowledge of the LLM into its own model. Consequently, the student agent can be trained with significantly less data. Moreover, through further training with environment feedback, the student agent surpasses the capabilities of its teacher for completing the target task. We conducted experiments on challenging MiniGrid and Habitat environments, specifically designed for embodied AI research, to evaluate the effectiveness of our framework. The results clearly demonstrate that our approach achieves superior performance compared to strong baseline methods. Our code is available at https://github.com/ZJLAB-AMMI/LLM4Teach.
△ Less
Submitted 27 May, 2024; v1 submitted 22 November, 2023;
originally announced November 2023.
-
Forecast of joint analysis of cosmic shear and supernovae magnification from CSST and LSST
Authors:
Ye Cao,
Bin Hu,
Ji Yao,
Hu Zhan
Abstract:
Cosmic shear and cosmic magnification reflect the same gravitational lensing field. Each of these two probes are affected by different systematics. We study the auto- and cross-correlations of the cosmic shear from the China Space Survey Telescope (CSST) and cosmic magnification of supernovae from Large Synoptic Survey Telescope (LSST). We want to answer, to what extent, by adding the magnificatio…
▽ More
Cosmic shear and cosmic magnification reflect the same gravitational lensing field. Each of these two probes are affected by different systematics. We study the auto- and cross-correlations of the cosmic shear from the China Space Survey Telescope (CSST) and cosmic magnification of supernovae from Large Synoptic Survey Telescope (LSST). We want to answer, to what extent, by adding the magnification data we can remove the systematic bias in cosmic shear measurement. We generate the mock shear/magnification maps based on the correlation between of different tomographic bins. After obtaining the corrected power spectra, we adopt the Markov Chain Monte Carlo (MCMC) technique to fit the theoretical models, and investigate the constraints on the cosmological and nuisance parameters. We find that the with only cosmic shear data, there are $1σ$ bias in $σ_8$ and intrinsic alignment model parameters. By adding the magnification data, we are able to remove these biases perfectly.
△ Less
Submitted 31 May, 2024; v1 submitted 22 November, 2023;
originally announced November 2023.
-
First observation of $Λ_c^+\rightarrowΛK^+π^0$ and evidence of $Λ_c^+\rightarrowΛK^+π^+π^-$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (608 additional authors not shown)
Abstract:
We present the first observation of the singly Cabibbo-suppressed decay $Λ_c^+ \rightarrow ΛK^+π^0$ with a significance of $5.7σ$ and the first evidence of $Λ_c^+ \rightarrow ΛK^+π^+π^-$ decay with a significance of $3.1σ$, based on $e^+e^-$ annihilation data recorded by the BESIII detector at the BEPCII collider. The data correspond to an integrated luminosity of $6.4~{\rm fb^{-1}}$, in the cente…
▽ More
We present the first observation of the singly Cabibbo-suppressed decay $Λ_c^+ \rightarrow ΛK^+π^0$ with a significance of $5.7σ$ and the first evidence of $Λ_c^+ \rightarrow ΛK^+π^+π^-$ decay with a significance of $3.1σ$, based on $e^+e^-$ annihilation data recorded by the BESIII detector at the BEPCII collider. The data correspond to an integrated luminosity of $6.4~{\rm fb^{-1}}$, in the center-of-mass energy range from $4.600~{\rm GeV}$ to $4.950~{\rm GeV}$. We determine the branching fractions of $Λ_c^+ \rightarrow ΛK^+π^0$ and $Λ_c^+ \rightarrow ΛK^+π^+π^-$ relative to their Cabibbo-favored counterparts to be $\frac{\mathcal{B}(Λ_c^+ \rightarrow ΛK^+π^0)}{\mathcal{B}(Λ_c^+ \rightarrow Λπ^+π^0)} = (2.09\pm0.39_{\mathrm{stat.}}\pm0.07_{\mathrm{syst.}}) \times 10^{-2}$ and $\frac{\mathcal{B}(Λ_c^+ \rightarrow ΛK^+π^+π^-)}{\mathcal{B}(Λ_c^+ \rightarrow Λπ^+π^+π^-)} = (1.13\pm0.41_{\mathrm{stat.}}\pm0.06_{\mathrm{syst.}}) \times 10^{-2}$, respectively. Moreover, by combining our measured result with the world average of $\mathcal{B}(Λ^+_c\to Λπ^+π^0)$, we obtain the branching fraction $\mathcal{B}(Λ_c^+ \to ΛK^+π^0) = (1.49\pm0.27_{\mathrm{stat.}}\pm0.05_{\mathrm{syst.}}\pm0.08_{\mathrm{ref.}}) \times 10^{-3}$. This result significantly departs from theoretical predictions based on quark $SU(3)$ flavor symmetry, which is underpinned by the presumption of meson pair $S$-wave amplitude dominance.
△ Less
Submitted 25 February, 2024; v1 submitted 21 November, 2023;
originally announced November 2023.
-
Improved measurement of the decays $η' \to π^{+}π^{-}π^{+(0)}π^{-(0)}$ and search for the rare decay $η' \to 4π^{0}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (606 additional authors not shown)
Abstract:
Using a sample of 10 billion $J/ψ$ events collected with the BESIII detector, the decays $η' \to π^{+}π^{-}π^{+}π^{-}$, $η' \to π^{+}π^{-}π^{0}π^{0}$ and $η' \to 4 π^{0}$ are studied via the process $J/ψ\toγη'$. The branching fractions of $η' \to π^{+}π^{-}π^{+}π^{-}$ and $η' \to π^{+}π^{-}π^{0}$ $π^{0}$ are measured to be $( 8.56 \pm 0.25({\rm stat.}) \pm 0.23({\rm syst.}) ) \times {10^{ - 5}}$ a…
▽ More
Using a sample of 10 billion $J/ψ$ events collected with the BESIII detector, the decays $η' \to π^{+}π^{-}π^{+}π^{-}$, $η' \to π^{+}π^{-}π^{0}π^{0}$ and $η' \to 4 π^{0}$ are studied via the process $J/ψ\toγη'$. The branching fractions of $η' \to π^{+}π^{-}π^{+}π^{-}$ and $η' \to π^{+}π^{-}π^{0}$ $π^{0}$ are measured to be $( 8.56 \pm 0.25({\rm stat.}) \pm 0.23({\rm syst.}) ) \times {10^{ - 5}}$ and $(2.12 \pm 0.12({\rm stat.}) \pm 0.10({\rm syst.})) \times {10^{ - 4}}$, respectively, which are consistent with previous measurements but with improved precision. No significant $η' \to 4 π^{0}$ signal is observed, and the upper limit on the branching fraction of this decay is determined to be less than $1.24 \times {10^{-5}}$ at the $90\%$ confidence level. In addition, an amplitude analysis of $η' \to π^{+}π^{-}π^{+}π^{-}$ is performed to extract the doubly virtual isovector form factor $α$ for the first time. The measured value of $α=1.22 \pm 0.33({\rm stat.}) \pm 0.04({\rm syst.})$, is in agreement with the prediction of the VMD model.
△ Less
Submitted 21 November, 2023;
originally announced November 2023.
-
An implementation of nDGP gravity in Pinocchio
Authors:
Yanling Song,
Bin Hu,
Chengzong Ruan,
Chiara Moretti,
Pierluigi Monaco
Abstract:
In this paper we investigate dark matter structure formation in the normal branch of the Dvali-Gabadadze-Porrati (nDGP) model using the PINOCCHIO algorithm. We first present 2nd order Lagrangian perturbation theory for the nDGP model, which shows that the 1st- and 2nd-order growth functions in nDGP are larger than those in ΛCDM. We then examine the dynamics of ellipsoidal collapse in nDGP, which i…
▽ More
In this paper we investigate dark matter structure formation in the normal branch of the Dvali-Gabadadze-Porrati (nDGP) model using the PINOCCHIO algorithm. We first present 2nd order Lagrangian perturbation theory for the nDGP model, which shows that the 1st- and 2nd-order growth functions in nDGP are larger than those in ΛCDM. We then examine the dynamics of ellipsoidal collapse in nDGP, which is accelerated compared to ΛCDM due to enhanced gravitational interactions. Running the nDGP-PINOCCHIO code with a box size of 512 Mpc/h and 1024*1024*1024 particles, we analyze the statistical properties of the output halo catalogs, including the halo power spectrum and halo mass function. The calibrated PINOCCHIO halo power spectrum agrees with N-body simulations within 5% in the comoving wavenumber range k < 0.3 (h/Mpc) at redshift z = 0. The agreement is extended to smaller scales for higher redshifts. For the cumulative halo mass function, the agreement between N-body and PINOCCHIO is also within the simulation scatter.
△ Less
Submitted 20 November, 2023;
originally announced November 2023.
-
CurriculumLoc: Enhancing Cross-Domain Geolocalization through Multi-Stage Refinement
Authors:
Boni Hu,
Lin Chen,
Runjian Chen,
Shuhui Bu,
Pengcheng Han,
Haowei Li
Abstract:
Visual geolocalization is a cost-effective and scalable task that involves matching one or more query images, taken at some unknown location, to a set of geo-tagged reference images. Existing methods, devoted to semantic features representation, evolving towards robustness to a wide variety between query and reference, including illumination and viewpoint changes, as well as scale and seasonal var…
▽ More
Visual geolocalization is a cost-effective and scalable task that involves matching one or more query images, taken at some unknown location, to a set of geo-tagged reference images. Existing methods, devoted to semantic features representation, evolving towards robustness to a wide variety between query and reference, including illumination and viewpoint changes, as well as scale and seasonal variations. However, practical visual geolocalization approaches need to be robust in appearance changing and extreme viewpoint variation conditions, while providing accurate global location estimates. Therefore, inspired by curriculum design, human learn general knowledge first and then delve into professional expertise. We first recognize semantic scene and then measure geometric structure. Our approach, termed CurriculumLoc, involves a delicate design of multi-stage refinement pipeline and a novel keypoint detection and description with global semantic awareness and local geometric verification. We rerank candidates and solve a particular cross-domain perspective-n-point (PnP) problem based on these keypoints and corresponding descriptors, position refinement occurs incrementally. The extensive experimental results on our collected dataset, TerraTrack and a benchmark dataset, ALTO, demonstrate that our approach results in the aforementioned desirable characteristics of a practical visual geolocalization solution. Additionally, we achieve new high recall@1 scores of 62.6% and 94.5% on ALTO, with two different distances metrics, respectively. Dataset, code and trained models are publicly available on https://github.com/npupilab/CurriculumLoc.
△ Less
Submitted 20 November, 2023;
originally announced November 2023.
-
Magnetic dipole transition in $^{48}$Ca
Authors:
B. Acharya,
B. S. Hu,
S. Bacca,
G. Hagen,
P. Navrátil,
T. Papenbrock
Abstract:
The magnetic dipole transition strength $B(M1)$ of $^{48}$Ca is dominated by a single resonant state at an excitation energy of 10.23 MeV. Experiments disagree about $B(M1)$ and this impacts our understanding of spin flips in nuclei. We performed ab initio computations based on chiral effective field theory and found that $B(M1:0^+\rightarrow1^+)$ lies in the range from $7.0$ to $10.2~μ_N^2$. This…
▽ More
The magnetic dipole transition strength $B(M1)$ of $^{48}$Ca is dominated by a single resonant state at an excitation energy of 10.23 MeV. Experiments disagree about $B(M1)$ and this impacts our understanding of spin flips in nuclei. We performed ab initio computations based on chiral effective field theory and found that $B(M1:0^+\rightarrow1^+)$ lies in the range from $7.0$ to $10.2~μ_N^2$. This is consistent with a $(γ,n)$ experiment but larger than results from $(e,e^\prime)$ and $(p,p')$ scattering. Two-body currents yield no quenching of the $B(M1)$ strength and continuum effects reduce it by about 10%. For a validation of our approach, we computed magnetic moments in $^{47,49}$Ca and performed benchmark calculations in light nuclei.
△ Less
Submitted 7 June, 2024; v1 submitted 19 November, 2023;
originally announced November 2023.
-
Segment Anything in Defect Detection
Authors:
Bozhen Hu,
Bin Gao,
Cheng Tan,
Tongle Wu,
Stan Z. Li
Abstract:
Defect detection plays a crucial role in infrared non-destructive testing systems, offering non-contact, safe, and efficient inspection capabilities. However, challenges such as low resolution, high noise, and uneven heating in infrared thermal images hinder comprehensive and accurate defect detection. In this study, we propose DefectSAM, a novel approach for segmenting defects on highly noisy the…
▽ More
Defect detection plays a crucial role in infrared non-destructive testing systems, offering non-contact, safe, and efficient inspection capabilities. However, challenges such as low resolution, high noise, and uneven heating in infrared thermal images hinder comprehensive and accurate defect detection. In this study, we propose DefectSAM, a novel approach for segmenting defects on highly noisy thermal images based on the widely adopted model, Segment Anything (SAM)\cite{kirillov2023segany}. Harnessing the power of a meticulously curated dataset generated through labor-intensive lab experiments and valuable prompts from experienced experts, DefectSAM surpasses existing state-of-the-art segmentation algorithms and achieves significant improvements in defect detection rates. Notably, DefectSAM excels in detecting weaker and smaller defects on complex and irregular surfaces, reducing the occurrence of missed detections and providing more accurate defect size estimations. Experimental studies conducted on various materials have validated the effectiveness of our solutions in defect detection, which hold significant potential to expedite the evolution of defect detection tools, enabling enhanced inspection capabilities and accuracy in defect identification.
△ Less
Submitted 16 November, 2023;
originally announced November 2023.
-
Temporal Knowledge Question Answering via Abstract Reasoning Induction
Authors:
Ziyang Chen,
Dongfang Li,
Xiang Zhao,
Baotian Hu,
Min Zhang
Abstract:
In this study, we address the challenge of enhancing temporal knowledge reasoning in Large Language Models (LLMs). LLMs often struggle with this task, leading to the generation of inaccurate or misleading responses. This issue mainly arises from their limited ability to handle evolving factual knowledge and complex temporal logic. To overcome these limitations, we propose Abstract Reasoning Induct…
▽ More
In this study, we address the challenge of enhancing temporal knowledge reasoning in Large Language Models (LLMs). LLMs often struggle with this task, leading to the generation of inaccurate or misleading responses. This issue mainly arises from their limited ability to handle evolving factual knowledge and complex temporal logic. To overcome these limitations, we propose Abstract Reasoning Induction (ARI) framework, which divides temporal reasoning into two distinct phases: Knowledge-agnostic and Knowledge-based. This framework offers factual knowledge support to LLMs while minimizing the incorporation of extraneous noisy data. Concurrently, informed by the principles of constructivism, ARI provides LLMs the capability to engage in proactive, self-directed learning from both correct and incorrect historical reasoning samples. By teaching LLMs to actively construct knowledge and methods, it can significantly boosting their temporal reasoning abilities. Our approach achieves remarkable improvements, with relative gains of 29.7% and 9.27% on two temporal QA datasets, underscoring its efficacy in advancing temporal reasoning in LLMs. The code can be found at https://github.com/czy1999/ARI-QA
△ Less
Submitted 16 May, 2024; v1 submitted 15 November, 2023;
originally announced November 2023.
-
Think-in-Memory: Recalling and Post-thinking Enable LLMs with Long-Term Memory
Authors:
Lei Liu,
Xiaoyan Yang,
Yue Shen,
Binbin Hu,
Zhiqiang Zhang,
Jinjie Gu,
Guannan Zhang
Abstract:
Memory-augmented Large Language Models (LLMs) have demonstrated remarkable performance in long-term human-machine interactions, which basically relies on iterative recalling and reasoning of history to generate high-quality responses. However, such repeated recall-reason steps easily produce biased thoughts, \textit{i.e.}, inconsistent reasoning results when recalling the same history for differen…
▽ More
Memory-augmented Large Language Models (LLMs) have demonstrated remarkable performance in long-term human-machine interactions, which basically relies on iterative recalling and reasoning of history to generate high-quality responses. However, such repeated recall-reason steps easily produce biased thoughts, \textit{i.e.}, inconsistent reasoning results when recalling the same history for different questions. On the contrary, humans can keep thoughts in the memory and recall them without repeated reasoning. Motivated by this human capability, we propose a novel memory mechanism called TiM (Think-in-Memory) that enables LLMs to maintain an evolved memory for storing historical thoughts along the conversation stream. The TiM framework consists of two crucial stages: (1) before generating a response, a LLM agent recalls relevant thoughts from memory, and (2) after generating a response, the LLM agent post-thinks and incorporates both historical and new thoughts to update the memory. Thus, TiM can eliminate the issue of repeated reasoning by saving the post-thinking thoughts as the history. Besides, we formulate the basic principles to organize the thoughts in memory based on the well-established operations, (\textit{i.e.}, insert, forget, and merge operations), allowing for dynamic updates and evolution of the thoughts. Furthermore, we introduce Locality-Sensitive Hashing into TiM to achieve efficient retrieval for the long-term conversations. We conduct qualitative and quantitative experiments on real-world and simulated dialogues covering a wide range of topics, demonstrating that equipping existing LLMs with TiM significantly enhances their performance in generating responses for long-term interactions.
△ Less
Submitted 15 November, 2023;
originally announced November 2023.
-
Towards Reasoning in Large Language Models via Multi-Agent Peer Review Collaboration
Authors:
Zhenran Xu,
Senbao Shi,
Baotian Hu,
Jindi Yu,
Dongfang Li,
Min Zhang,
Yuxiang Wu
Abstract:
Large Language Models (LLMs) have shown remarkable capabilities in general natural language processing tasks but often fall short in complex reasoning tasks. Recent studies have explored human-like problem-solving strategies, such as self-correct, to push further the boundary of single-model reasoning ability. In this work, we let a single model "step outside the box" by engaging multiple models t…
▽ More
Large Language Models (LLMs) have shown remarkable capabilities in general natural language processing tasks but often fall short in complex reasoning tasks. Recent studies have explored human-like problem-solving strategies, such as self-correct, to push further the boundary of single-model reasoning ability. In this work, we let a single model "step outside the box" by engaging multiple models to correct each other. We introduce a multi-agent collaboration strategy that emulates the academic peer review process. Each agent independently constructs its own solution, provides reviews on the solutions of others, and assigns confidence levels to its reviews. Upon receiving peer reviews, agents revise their initial solutions. Extensive experiments on three different types of reasoning tasks show that our collaboration approach delivers superior accuracy across all ten datasets compared to existing methods. Further study underscores the effectiveness of integrating confidence in reviews, demonstrates the superiority of feedback exchange over mere solution sharing, and highlights the role of capability and diversity in fostering successful collaboration.
△ Less
Submitted 17 December, 2023; v1 submitted 14 November, 2023;
originally announced November 2023.
-
A Comprehensive Evaluation of GPT-4V on Knowledge-Intensive Visual Question Answering
Authors:
Yunxin Li,
Longyue Wang,
Baotian Hu,
Xinyu Chen,
Wanqi Zhong,
Chenyang Lyu,
Wei Wang,
Min Zhang
Abstract:
The emergence of multimodal large models (MLMs) has significantly advanced the field of visual understanding, offering remarkable capabilities in the realm of visual question answering (VQA). Yet, the true challenge lies in the domain of knowledge-intensive VQA tasks, which necessitate not just recognition of visual elements, but also a deep comprehension of the visual information in conjunction w…
▽ More
The emergence of multimodal large models (MLMs) has significantly advanced the field of visual understanding, offering remarkable capabilities in the realm of visual question answering (VQA). Yet, the true challenge lies in the domain of knowledge-intensive VQA tasks, which necessitate not just recognition of visual elements, but also a deep comprehension of the visual information in conjunction with a vast repository of learned knowledge. To uncover such capabilities of MLMs, particularly the newly introduced GPT-4V and Gemini, we provide an in-depth evaluation from three perspectives: 1) Commonsense Knowledge, which assesses how well models can understand visual cues and connect to general knowledge; 2) Fine-grained World Knowledge, which tests the model's skill in reasoning out specific knowledge from images, showcasing their proficiency across various specialized fields; 3) Comprehensive Knowledge with Decision-making Rationales, which examines model's capability to provide logical explanations for its inference, facilitating a deeper analysis from the interpretability perspective. Additionally, we utilize a visual knowledge-enhanced training strategy and multimodal retrieval-augmented generation approach to enhance MLMs, highlighting the future need for advancements in this research direction. Extensive experiments indicate that: a) GPT-4V demonstrates enhanced explanation generation when using composite images as few-shots; b) GPT-4V and other MLMs produce severe hallucinations when dealing with world knowledge; c) Visual knowledge enhanced training and prompting technicals present potential to improve performance. Codes: https://github.com/HITsz-TMG/Cognitive-Visual-Language-Mapper
△ Less
Submitted 24 August, 2024; v1 submitted 13 November, 2023;
originally announced November 2023.
-
Study of the decay $J/ψ\to φπ^{0}η$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (604 additional authors not shown)
Abstract:
Based on $(10.09 \pm 0.04) \times 10^9$ $J/ψ$ events collected with the BESIII detector operating at the BEPCII collider, a partial wave analysis of the decay $J/ψ\to φπ^{0}η$ is performed. We observe for the first time two new structures on the $φη$ invariant mass distribution, with statistical significances of $24.0σ$ and $16.9σ$; the first with $J^{\rm PC}$ = $1^{+-}$, mass M = (1911 $\pm$ 6 (s…
▽ More
Based on $(10.09 \pm 0.04) \times 10^9$ $J/ψ$ events collected with the BESIII detector operating at the BEPCII collider, a partial wave analysis of the decay $J/ψ\to φπ^{0}η$ is performed. We observe for the first time two new structures on the $φη$ invariant mass distribution, with statistical significances of $24.0σ$ and $16.9σ$; the first with $J^{\rm PC}$ = $1^{+-}$, mass M = (1911 $\pm$ 6 (stat.) $\pm$ 14 (sys.))~MeV/$c^{2}$, and width $Γ= $ (149 $\pm$ 12 (stat.) $\pm$ 23 (sys.))~MeV, the second with $J^{\rm PC}$ = $1^{--}$, mass M = (1996 $\pm$ 11 (stat.) $\pm$ 30 (sys.))~MeV/$c^{2}$, and width $Γ$ = (148 $\pm$ 16 (stat.) $\pm$ 66 (sys.))~MeV. These measurements provide important input for the strangeonium spectrum. In addition, the $f_0(980)-a_0(980)^0$ mixing signal in $J/ψ\to φf_0(980) \to φa_0(980)^0$ and the corresponding electromagnetic decay $J/ψ\to φa_0(980)^0$ are measured with improved precision, providing crucial information to understand the nature of $a_0(980)^0$ and $f_0(980)$.
△ Less
Submitted 14 November, 2023; v1 submitted 12 November, 2023;
originally announced November 2023.
-
A Survey of Large Language Models Attribution
Authors:
Dongfang Li,
Zetian Sun,
Xinshuo Hu,
Zhenyu Liu,
Ziyang Chen,
Baotian Hu,
Aiguo Wu,
Min Zhang
Abstract:
Open-domain generative systems have gained significant attention in the field of conversational AI (e.g., generative search engines). This paper presents a comprehensive review of the attribution mechanisms employed by these systems, particularly large language models. Though attribution or citation improve the factuality and verifiability, issues like ambiguous knowledge reservoirs, inherent bias…
▽ More
Open-domain generative systems have gained significant attention in the field of conversational AI (e.g., generative search engines). This paper presents a comprehensive review of the attribution mechanisms employed by these systems, particularly large language models. Though attribution or citation improve the factuality and verifiability, issues like ambiguous knowledge reservoirs, inherent biases, and the drawbacks of excessive attribution can hinder the effectiveness of these systems. The aim of this survey is to provide valuable insights for researchers, aiding in the refinement of attribution methodologies to enhance the reliability and veracity of responses generated by open-domain generative systems. We believe that this field is still in its early stages; hence, we maintain a repository to keep track of ongoing studies at https://github.com/HITsz-TMG/awesome-llm-attributions.
△ Less
Submitted 14 December, 2023; v1 submitted 7 November, 2023;
originally announced November 2023.
-
Measurement of the absolute branching fraction of the three-body decay $Λ_{c}^+ \to Ξ^{0}K^{+}π^{0}$ and search for $Λ_{c}^+ \to nK^+π^0$, $Σ^{0}K^{+}π^{0}$ and $ΛK^{+}π^{0}$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (600 additional authors not shown)
Abstract:
The Cabbibo-favored decay $Λ_{c}^+ \to Ξ^{0}K^{+}π^{0}$ is studied for the first time using 6.1 fb$^{-1}$ of $e^+e^-$ collision data at center-of-mass energies between 4.600 and 4.840 GeV, collected with the BESIII detector at the BEPCII collider. With a double-tag method, the branching fraction of the three-body decay $Λ_{c}^+ \to Ξ^{0}K^{+}π^{0}$ is measured to be…
▽ More
The Cabbibo-favored decay $Λ_{c}^+ \to Ξ^{0}K^{+}π^{0}$ is studied for the first time using 6.1 fb$^{-1}$ of $e^+e^-$ collision data at center-of-mass energies between 4.600 and 4.840 GeV, collected with the BESIII detector at the BEPCII collider. With a double-tag method, the branching fraction of the three-body decay $Λ_{c}^+ \to Ξ^{0}K^{+}π^{0}$ is measured to be $(7.79 \pm 1.46 _{\rm} \pm0.71 _{\rm}) \times 10^{ - 3}$, where the first and second uncertainties are statistical and systematic, respectively. The branching fraction of the two-body decay $Λ_{c}^+ \to Ξ(1530)^{0}K^+$ is $(5.99\pm1.04\pm0.29)\times10^{-3}$, which is consistent with the previous result of $(5.02\pm0.99\pm0.31)\times 10^{-3}$. In addition, the upper limit on the branching fraction of the doubly Cabbibo-suppressed decay $Λ_{c}^+ \to nK^+π^0$ is $7.1 \times 10^{-4}$ at the 90$\%$ confidence level. The upper limits on the branching fractions of $Λ_{c}^+ \to Σ^{0}K^{+}π^{0}$ and $ΛK^{+}π^{0}$ are also determined to be $1.8\times 10^{-3}$ and $ 2.0 \times 10^{-3}$, respectively.
△ Less
Submitted 8 May, 2024; v1 submitted 4 November, 2023;
originally announced November 2023.
-
Search for a muonphilic scalar $X_{0}$ or vector $X_{1}$ via $J/ψ\toμ^+μ^-+\rm{invisible}$ decays at BESII
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (608 additional authors not shown)
Abstract:
A light scalar $X_{0}$ or vector $X_{1}$ particles have been introduced as a possible explanation for the $(g-2)_μ$ anomaly and dark matter phenomena.
Using $(8.998\pm 0.039)\times10^9$ $\jpsi $ events collected by the BESIII detector, we search for a light muon philic scalar $X_{0}$ or vector $X_{1}$ in the processes $J/ψ\toμ^+μ^- X_{0,1}$ with $X_{0,1}$ invisible decays. No obvious signal is f…
▽ More
A light scalar $X_{0}$ or vector $X_{1}$ particles have been introduced as a possible explanation for the $(g-2)_μ$ anomaly and dark matter phenomena.
Using $(8.998\pm 0.039)\times10^9$ $\jpsi $ events collected by the BESIII detector, we search for a light muon philic scalar $X_{0}$ or vector $X_{1}$ in the processes $J/ψ\toμ^+μ^- X_{0,1}$ with $X_{0,1}$ invisible decays. No obvious signal is found, and the upper limits on the coupling $g_{0,1}'$ between the muon and the $X_{0,1}$ particles are set to be between $1.1\times10^{-3}$ and $1.0\times10^{-2}$ for the $X_{0,1}$ mass in the range of $1<M(X_{0,1})<1000$ MeV$/c^2$ at 90$\%$ confidence level.
△ Less
Submitted 18 February, 2024; v1 submitted 2 November, 2023;
originally announced November 2023.
-
Cooperative Label-Free Moving Target Fencing for Second-Order Multi-Agent Systems with Rigid Formation
Authors:
Bin-Bin Hu,
Hai-Tao Zhang,
Yang Shi
Abstract:
This paper proposes a label-free controller for a second-order multi-agent system to cooperatively fence a moving target of variational velocity into a convex hull formed by the agents whereas maintaining a rigid formation. Therein, no label is predetermined for a specified agent. To attain a rigid formation with guaranteed collision avoidance, each controller consists of two terms: a dynamic regu…
▽ More
This paper proposes a label-free controller for a second-order multi-agent system to cooperatively fence a moving target of variational velocity into a convex hull formed by the agents whereas maintaining a rigid formation. Therein, no label is predetermined for a specified agent. To attain a rigid formation with guaranteed collision avoidance, each controller consists of two terms: a dynamic regulator with an internal model to drive agents towards the moving target merely by position information feedback, and a repulsive force between each pair of adjacent agents. Significantly, sufficient conditions are derived to guarantee the asymptotic stability of the closed-loop systems governed by the proposed fencing controller. Rigorous analysis is provided to eliminate the strong nonlinear couplings induced by the label-free property. Finally, the effectiveness of the controller is substantiated by numerical simulations.
△ Less
Submitted 1 November, 2023;
originally announced November 2023.
-
Spontaneous-Ordering Platoon Control for Multirobot Path Navigation Using Guiding Vector Fields
Authors:
Bin-Bin Hu,
Hai-Tao Zhang,
Weijia Yao,
Jianing Ding,
Ming Cao
Abstract:
In this paper, we propose a distributed guiding-vector-field (DGVF) algorithm for a team of robots to form a spontaneous-ordering platoon moving along a predefined desired path in the n-dimensional Euclidean space. Particularly, by adding a path parameter as an additional virtual coordinate to each robot, the DGVF algorithm can eliminate the singular points where the vector fields vanish, and gove…
▽ More
In this paper, we propose a distributed guiding-vector-field (DGVF) algorithm for a team of robots to form a spontaneous-ordering platoon moving along a predefined desired path in the n-dimensional Euclidean space. Particularly, by adding a path parameter as an additional virtual coordinate to each robot, the DGVF algorithm can eliminate the singular points where the vector fields vanish, and govern robots to approach a closed and even self-intersecting desired path. Then, the interactions among neighboring robots and a virtual target robot through their virtual coordinates enable the realization of the desired platoon; in particular, relative parametric displacements can be achieved with arbitrary ordering sequences. Rigorous analysis is provided to guarantee the global convergence to the spontaneous-ordering platoon on the common desired path from any initial positions. 2D experiments using three HUSTER-0.3 unmanned surface vessels (USVs) are conducted to validate the practical effectiveness of the proposed DGVF algorithm, and 3D numerical simulations are presented to demonstrate its effectiveness and robustness when tackling higher-dimensional multi-robot path-navigation missions and some robots breakdown.
△ Less
Submitted 1 November, 2023;
originally announced November 2023.
-
NiST: a non-localized spatial-temporal constitutive relation in rarefied gas dynamics
Authors:
Xiaoda Li,
Bin Hu,
Lei Wu
Abstract:
Although the mesoscopic Boltzmann equation describes the rarefied gas dynamics, finding its solutions in complicated engineering problems is challenging. Therefore, over the past one and a half centuries, many partial differential equations based on a few macroscopic variables are proposed. However, they not only have complicated forms, but also cannot make satisfactory prediction when the Knudsen…
▽ More
Although the mesoscopic Boltzmann equation describes the rarefied gas dynamics, finding its solutions in complicated engineering problems is challenging. Therefore, over the past one and a half centuries, many partial differential equations based on a few macroscopic variables are proposed. However, they not only have complicated forms, but also cannot make satisfactory prediction when the Knudsen number is large. Here, we propose a non-localized spatial-temporal (NiST) constitutive relation for rarefied gas dynamics, where the stress/heat flux at time $t$ and position $\bm x$ is determined by the velocity/temperature gradient in the nearby spatial-temporal coordinates, via convolution operators. By using the solutions of the Boltzmann equation for the Couette/Fourier flow and the spontaneous Rayleigh-Brillouin scattering, we extract the universal parameters of non-locality as functions of the spatial and temporal Knudsen numbers. Further tests in the sound propagation in rarefied gas show that the NiST constitutive relation can predict the rarefied gas flow over a wide range of Knudsen number.
△ Less
Submitted 1 November, 2023;
originally announced November 2023.
-
MOSEL: Inference Serving Using Dynamic Modality Selection
Authors:
Bodun Hu,
Le Xu,
Jeongyoon Moon,
Neeraja J. Yadwadkar,
Aditya Akella
Abstract:
Rapid advancements over the years have helped machine learning models reach previously hard-to-achieve goals, sometimes even exceeding human capabilities. However, to attain the desired accuracy, the model sizes and in turn their computational requirements have increased drastically. Thus, serving predictions from these models to meet any target latency and cost requirements of applications remain…
▽ More
Rapid advancements over the years have helped machine learning models reach previously hard-to-achieve goals, sometimes even exceeding human capabilities. However, to attain the desired accuracy, the model sizes and in turn their computational requirements have increased drastically. Thus, serving predictions from these models to meet any target latency and cost requirements of applications remains a key challenge, despite recent work in building inference-serving systems as well as algorithmic approaches that dynamically adapt models based on inputs. In this paper, we introduce a form of dynamism, modality selection, where we adaptively choose modalities from inference inputs while maintaining the model quality. We introduce MOSEL, an automated inference serving system for multi-modal ML models that carefully picks input modalities per request based on user-defined performance and accuracy requirements. MOSEL exploits modality configurations extensively, improving system throughput by 3.6$\times$ with an accuracy guarantee and shortening job completion times by 11$\times$.
△ Less
Submitted 27 October, 2023;
originally announced October 2023.
-
Does or did the supernova remnant Cassiopeia A operate as a PeVatron?
Authors:
Zhen Cao,
F. Aharonian,
Q. An,
Axikegu,
Y. X. Bai,
Y. W. Bao,
D. Bastieri,
X. J. Bi,
Y. J. Bi,
J. T. Cai,
Q. Cao,
W. Y. Cao,
Zhe Cao,
J. Chang,
J. F. Chang,
A. M. Chen,
E. S. Chen,
Liang Chen,
Lin Chen,
Long Chen,
M. J. Chen,
M. L. Chen,
Q. H. Chen,
S. H. Chen,
S. Z. Chen
, et al. (255 additional authors not shown)
Abstract:
For decades, supernova remnants (SNRs) have been considered the prime sources of Galactic Cosmic rays (CRs). But whether SNRs can accelerate CR protons to PeV energies and thus dominate CR flux up to the knee is currently under intensive theoretical and phenomenological debate. The direct test of the ability of SNRs to operate as CR PeVatrons can be provided by ultrahigh-energy (UHE;…
▽ More
For decades, supernova remnants (SNRs) have been considered the prime sources of Galactic Cosmic rays (CRs). But whether SNRs can accelerate CR protons to PeV energies and thus dominate CR flux up to the knee is currently under intensive theoretical and phenomenological debate. The direct test of the ability of SNRs to operate as CR PeVatrons can be provided by ultrahigh-energy (UHE; $E_γ\geq 100$~TeV) $γ$-rays. In this context, the historical SNR Cassiopeia A (Cas A) is considered one of the most promising target for UHE observations. This paper presents the observation of Cas A and its vicinity by the LHAASO KM2A detector. The exceptional sensitivity of LHAASO KM2A in the UHE band, combined with the young age of Cas A, enabled us to derive stringent model-independent limits on the energy budget of UHE protons and nuclei accelerated by Cas A at any epoch after the explosion. The results challenge the prevailing paradigm that Cas A-type SNRs are major suppliers of PeV CRs in the Milky Way.
△ Less
Submitted 25 October, 2023;
originally announced October 2023.
-
Expression Syntax Information Bottleneck for Math Word Problems
Authors:
Jing Xiong,
Chengming Li,
Min Yang,
Xiping Hu,
Bin Hu
Abstract:
Math Word Problems (MWP) aims to automatically solve mathematical questions given in texts. Previous studies tend to design complex models to capture additional information in the original text so as to enable the model to gain more comprehensive features. In this paper, we turn our attention in the opposite direction, and work on how to discard redundant features containing spurious correlations…
▽ More
Math Word Problems (MWP) aims to automatically solve mathematical questions given in texts. Previous studies tend to design complex models to capture additional information in the original text so as to enable the model to gain more comprehensive features. In this paper, we turn our attention in the opposite direction, and work on how to discard redundant features containing spurious correlations for MWP. To this end, we design an Expression Syntax Information Bottleneck method for MWP (called ESIB) based on variational information bottleneck, which extracts essential features of expression syntax tree while filtering latent-specific redundancy containing syntax-irrelevant features. The key idea of ESIB is to encourage multiple models to predict the same expression syntax tree for different problem representations of the same problem by mutual learning so as to capture consistent information of expression syntax tree and discard latent-specific redundancy. To improve the generalization ability of the model and generate more diverse expressions, we design a self-distillation loss to encourage the model to rely more on the expression syntax information in the latent space. Experimental results on two large-scale benchmarks show that our model not only achieves state-of-the-art results but also generates more diverse solutions. The code is available.
△ Less
Submitted 24 October, 2023;
originally announced October 2023.
-
Study of the doubly Cabibbo-suppressed decays $D^+_s\to K^+K^+π^-$ and $D^+_s\to K^+K^+π^-π^0$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko
, et al. (604 additional authors not shown)
Abstract:
Based on 7.33 fb$^{-1}$ of $e^+e^-$ collision data collected at center-of-mass energies between 4.128 and 4.226 GeV with the BESIII detector, the experimental studies of the doubly Cabibbo-suppressed decays $D^+_s\to K^+K^+π^-$ and $D^+_s\to K^+K^+π^-π^0$ are reported. We determine the absolute branching fraction of $D^+_s\to K^+K^+π^-$ to be (…
▽ More
Based on 7.33 fb$^{-1}$ of $e^+e^-$ collision data collected at center-of-mass energies between 4.128 and 4.226 GeV with the BESIII detector, the experimental studies of the doubly Cabibbo-suppressed decays $D^+_s\to K^+K^+π^-$ and $D^+_s\to K^+K^+π^-π^0$ are reported. We determine the absolute branching fraction of $D^+_s\to K^+K^+π^-$ to be (${1.23^{+0.28}_{-0.25}}({\rm stat})\pm0.06({\rm syst})$) $\times 10^{-4}$. No significant signal of $D^+_s\to K^+K^+π^-π^0$ is observed and the upper limit on its decay branching fraction at 90\% confidence level is set to be $1.7\times10^{-4}$.
△ Less
Submitted 24 October, 2023;
originally announced October 2023.
-
Trade-off relations of geometric coherence
Authors:
Bingyu Hu,
Ming-Jing Zhao
Abstract:
Quantum coherence is an important quantum resource and it is intimately related to various research fields. The geometric coherence is a coherence measure both operationally and geometrically. We study the trade-off relation of geometric coherence in qubit systems. We first derive an upper bound for the geometric coherence by the purity of quantum states. Based on this, a complementarity relation…
▽ More
Quantum coherence is an important quantum resource and it is intimately related to various research fields. The geometric coherence is a coherence measure both operationally and geometrically. We study the trade-off relation of geometric coherence in qubit systems. We first derive an upper bound for the geometric coherence by the purity of quantum states. Based on this, a complementarity relation between the quantum coherence and the mixedness is established. We then derive the quantum uncertainty relations of the geometric coherence on two and three general measurement bases in terms of the incompatibility respectively, which turn out to be state-independent for pure states. These trade-off relations provide the limit to the amount of quantum coherence. As a byproduct,the complementarity relation between the minimum error probability for discriminating a pure-states ensemble and the mixedness of quantum states is established.
△ Less
Submitted 23 October, 2023;
originally announced October 2023.
-
A Review of Prospects and Opportunities in Disassembly with Human-Robot Collaboration
Authors:
Meng-Lun Lee,
Xiao Liang,
Boyi Hu,
Gulcan Onel,
Sara Behdad,
Minghui Zheng
Abstract:
Product disassembly plays a crucial role in the recycling, remanufacturing, and reuse of end-of-use (EoU) products. However, the current manual disassembly process is inefficient due to the complexity and variation of EoU products. While fully automating disassembly is not economically viable given the intricate nature of the task, there is potential in using human-robot collaboration (HRC) to enh…
▽ More
Product disassembly plays a crucial role in the recycling, remanufacturing, and reuse of end-of-use (EoU) products. However, the current manual disassembly process is inefficient due to the complexity and variation of EoU products. While fully automating disassembly is not economically viable given the intricate nature of the task, there is potential in using human-robot collaboration (HRC) to enhance disassembly operations. HRC combines the flexibility and problem-solving abilities of humans with the precise repetition and handling of unsafe tasks by robots. Nevertheless, numerous challenges persist in technology, human workers, and remanufacturing work, that require comprehensive multidisciplinary research to bridge critical gaps. These challenges have motivated the authors to provide a detailed discussion on the opportunities and obstacles associated with introducing HRC to disassembly. In this regard, the authors have conducted a thorough review of the recent progress in HRC disassembly and present the insights gained from this analysis from three distinct perspectives: technology, workers, and work.
△ Less
Submitted 20 October, 2023;
originally announced October 2023.
-
A Read-and-Select Framework for Zero-shot Entity Linking
Authors:
Zhenran Xu,
Yulin Chen,
Baotian Hu,
Min Zhang
Abstract:
Zero-shot entity linking (EL) aims at aligning entity mentions to unseen entities to challenge the generalization ability. Previous methods largely focus on the candidate retrieval stage and ignore the essential candidate ranking stage, which disambiguates among entities and makes the final linking prediction. In this paper, we propose a read-and-select (ReS) framework by modeling the main compone…
▽ More
Zero-shot entity linking (EL) aims at aligning entity mentions to unseen entities to challenge the generalization ability. Previous methods largely focus on the candidate retrieval stage and ignore the essential candidate ranking stage, which disambiguates among entities and makes the final linking prediction. In this paper, we propose a read-and-select (ReS) framework by modeling the main components of entity disambiguation, i.e., mention-entity matching and cross-entity comparison. First, for each candidate, the reading module leverages mention context to output mention-aware entity representations, enabling mention-entity matching. Then, in the selecting module, we frame the choice of candidates as a sequence labeling problem, and all candidate representations are fused together to enable cross-entity comparison. Our method achieves the state-of-the-art performance on the established zero-shot EL dataset ZESHEL with a 2.55% micro-average accuracy gain, with no need for laborious multi-phase pre-training used in most of the previous work, showing the effectiveness of both mention-entity and cross-entity interaction.
△ Less
Submitted 29 October, 2023; v1 submitted 19 October, 2023;
originally announced October 2023.
-
Revisiting Sparse Retrieval for Few-shot Entity Linking
Authors:
Yulin Chen,
Zhenran Xu,
Baotian Hu,
Min Zhang
Abstract:
Entity linking aims to link ambiguous mentions to their corresponding entities in a knowledge base. One of the key challenges comes from insufficient labeled data for specific domains. Although dense retrievers have achieved excellent performance on several benchmarks, their performance decreases significantly when only a limited amount of in-domain labeled data is available. In such few-shot sett…
▽ More
Entity linking aims to link ambiguous mentions to their corresponding entities in a knowledge base. One of the key challenges comes from insufficient labeled data for specific domains. Although dense retrievers have achieved excellent performance on several benchmarks, their performance decreases significantly when only a limited amount of in-domain labeled data is available. In such few-shot setting, we revisit the sparse retrieval method, and propose an ELECTRA-based keyword extractor to denoise the mention context and construct a better query expression. For training the extractor, we propose a distant supervision method to automatically generate training data based on overlapping tokens between mention contexts and entity descriptions. Experimental results on the ZESHEL dataset demonstrate that the proposed method outperforms state-of-the-art models by a significant margin across all test domains, showing the effectiveness of keyword-enhanced sparse retrieval.
△ Less
Submitted 18 October, 2023;
originally announced October 2023.
-
Measurement of the cross sections for $e^+e^-\toηπ^+π^-$ at center-of-mass energies between 2.00 and 3.08 GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (605 additional authors not shown)
Abstract:
Using data samples collected at center-of-mass energies between 2.000 and 3.080 GeV with the BESIII detector operating at the BEPCII collider, a partial-wave analysis is performed on the process $e^+e^-\toηπ^+π^-$. In addition to the dominant $e^+e^-\toρη$ component, the $e^+e^-\to a_2(1320)π$ process is also sizeable, contributing up to 24% of the total reaction. The measured cross sections of th…
▽ More
Using data samples collected at center-of-mass energies between 2.000 and 3.080 GeV with the BESIII detector operating at the BEPCII collider, a partial-wave analysis is performed on the process $e^+e^-\toηπ^+π^-$. In addition to the dominant $e^+e^-\toρη$ component, the $e^+e^-\to a_2(1320)π$ process is also sizeable, contributing up to 24% of the total reaction. The measured cross sections of the process $e^+e^-\toηπ^+π^-$ are systematically higher than those of BaBar by more than $3σ$ at center-of-mass energies between 2.000 and 2.300 GeV. In the cross section lineshape for $e^+e^-\to a_2(1320)π$, a resonant structure is observed with a significance of $5.5σ$, with $M=(2044\pm31\pm4)$ MeV/$c^2$, $Γ=(163\pm69\pm24)$ MeV and $\mathcal{B_{R}}\cdotΓ_{e^+e^-}^{R}=(34.6\pm17.1\pm6.0)$ eV or $(137.1\pm73.3\pm2.1)$ eV. In the cross section lineshape for $e^+e^-\toρη$, an evidence of a dip structure around 2180 MeV/$c^2$ is observed with statistical significance of $3.0σ$.
△ Less
Submitted 28 November, 2023; v1 submitted 16 October, 2023;
originally announced October 2023.
-
Very high energy gamma-ray emission beyond 10 TeV from GRB 221009A
Authors:
Zhen Cao,
F. Aharonian,
Q. An,
A. Axikegu,
Y. X. Bai,
Y. W. Bao,
D. Bastieri,
X. J. Bi,
Y. J. Bi,
J. T. Cai,
Q. Cao,
W. Y. Cao,
Zhe Cao,
J. Chang,
J. F. Chang,
A. M. Chen,
E. S. Chen,
Liang Chen,
Lin Chen,
Long Chen,
M. J. Chen,
M. L. Chen,
Q. H. Chen,
S. H. Chen,
S. Z. Chen
, et al. (255 additional authors not shown)
Abstract:
The highest energy gamma-rays from gamma-ray bursts (GRBs) have important implications for their radiation mechanism. Here we report for the first time the detection of gamma-rays up to 13 TeV from the brightest GRB 221009A by the Large High Altitude Air-shower Observatory (LHAASO). The LHAASO-KM2A detector registered more than 140 gamma-rays with energies above 3 TeV during 230$-$900s after the t…
▽ More
The highest energy gamma-rays from gamma-ray bursts (GRBs) have important implications for their radiation mechanism. Here we report for the first time the detection of gamma-rays up to 13 TeV from the brightest GRB 221009A by the Large High Altitude Air-shower Observatory (LHAASO). The LHAASO-KM2A detector registered more than 140 gamma-rays with energies above 3 TeV during 230$-$900s after the trigger. The intrinsic energy spectrum of gamma-rays can be described by a power-law after correcting for extragalactic background light (EBL) absorption. Such a hard spectrum challenges the synchrotron self-Compton (SSC) scenario of relativistic electrons for the afterglow emission above several TeV. Observations of gamma-rays up to 13 TeV from a source with a measured redshift of z=0.151 hints more transparency in intergalactic space than previously expected. Alternatively, one may invoke new physics such as Lorentz Invariance Violation (LIV) or an axion origin of very high energy (VHE) signals.
△ Less
Submitted 22 November, 2023; v1 submitted 13 October, 2023;
originally announced October 2023.
-
Generate Coherent Rays Directly
Authors:
Fengqi Liu,
Zaonan Tan,
Weilai Xiang,
Chenhao Lu,
Dan Li,
Xu Gong,
Yulong Shi,
Songnan Shi,
Qilong Kou,
Bo Hu
Abstract:
The path tracing method generates incoherent rays by randomly sampling directions. This randomness makes it unsuitable for modern processor architectures that rely on coherence to achieve optimal performance. Many efforts have been made to address this issue by reordering rays based on their origin, end, or direction to enhance coherence. However, a drawback of reordering methods is the need to en…
▽ More
The path tracing method generates incoherent rays by randomly sampling directions. This randomness makes it unsuitable for modern processor architectures that rely on coherence to achieve optimal performance. Many efforts have been made to address this issue by reordering rays based on their origin, end, or direction to enhance coherence. However, a drawback of reordering methods is the need to encode and sort rays before tracing, introducing additional overhead. We propose a technique to generate coherent rays directly by reusing the direction. Additionally, we introduce an interleaved reuse domain partition method to mitigate the impact of sampling correlation resulting from direction reuse. We demonstrate the effectiveness of our approach across various scenes, establishing its superiority over reordering methods.
△ Less
Submitted 11 October, 2023;
originally announced October 2023.
-
Measurement of $e^{+}e^{-}\rightarrowηJ/ψ$ Cross Section from $\sqrt{s}=$ 3.808 GeV to 4.951 GeV
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (608 additional authors not shown)
Abstract:
Using data samples with an integrated luminosity of 22.42 fb$^{-1}$ collected by the BESIII detector operating at the BEPCII storage ring, we measure the cross sections of the $e^{+}e^{-}\rightarrow\etaJ/ψ$ process at center-of-mass energies from 3.808 to 4.951 GeV. Three structures are observed in the line shape of the measured cross sections. A maximum-likelihood fit with $ψ(4040)$, two addition…
▽ More
Using data samples with an integrated luminosity of 22.42 fb$^{-1}$ collected by the BESIII detector operating at the BEPCII storage ring, we measure the cross sections of the $e^{+}e^{-}\rightarrow\etaJ/ψ$ process at center-of-mass energies from 3.808 to 4.951 GeV. Three structures are observed in the line shape of the measured cross sections. A maximum-likelihood fit with $ψ(4040)$, two additional resonances, and a non-resonant component is performed. The mass and width of the first additional state are $(4219.7\pm2.5\pm4.5) \rm{MeV}/\rm{c}^2$ and $(80.7\pm4.4\pm1.4) \rm{MeV}$, respectively, consistent with the $ψ(4230)$. For the second state, the mass and width are $(4386\pm13\pm17) \rm{MeV}/\rm{c}^2$ and $(177\pm32\pm13) \rm{MeV}$, respectively, consistent with the $ψ(4360)$. The first uncertainties are statistical and the second ones are systematic. The statistical significance of $ψ(4040)$ is $8.0σ$ and those for $ψ(4230)$ and $ψ(4360)$ are more than $10.0σ$.
△ Less
Submitted 5 October, 2023;
originally announced October 2023.
-
Quark masses and low energy constants in the continuum from the tadpole improved clover ensembles
Authors:
Zhi-Cheng Hu,
Bo-Lun Hu,
Ji-Hao Wang,
Ming Gong,
Liuming Liu,
Peng Sun,
Wei Sun,
Wei Wang,
Yi-Bo Yang,
Dian-Jun Zhao
Abstract:
We present the light-flavor quark masses and low energy constants using the 2+1 flavor full-QCD ensembles with stout smeared clover fermion action and Symanzik gauge actions. Both the fermion and gauge actions are tadpole improved self-consistently. The simulations are performed on 11 ensembles at 3 lattice spacings $a\in[0.05,0.11]$ fm, 4 spatial sizes $L\in[2.5, 5.1]$ fm, 7 pion masses…
▽ More
We present the light-flavor quark masses and low energy constants using the 2+1 flavor full-QCD ensembles with stout smeared clover fermion action and Symanzik gauge actions. Both the fermion and gauge actions are tadpole improved self-consistently. The simulations are performed on 11 ensembles at 3 lattice spacings $a\in[0.05,0.11]$ fm, 4 spatial sizes $L\in[2.5, 5.1]$ fm, 7 pion masses $m_π\in[135,350]$ MeV, and several values of the strange quark mass. The quark mass is defined through the partially conserved axial current (PCAC) relation and renormalized to $\overline{\mathrm{MS}}$ 2 GeV through the intermediate regularization independent momentum subtraction (RI/MOM) scheme. The systematic uncertainty of using the symmetric momentum subtraction (SMOM) scheme is also included. Eventually, we predict $m_u=2.45(22)(20)$ MeV, $m_d=4.74(11)(09)$ MeV, and $m_s=98.8(2.9)(4.7)$ MeV with the systematic uncertainties from lattice spacing determination, continuum extrapolation and renormalization constant included. We also obtain the chiral condensate $Σ^{1/3}=268.6(3.6)(0.7)$ MeV and the pion decay constant $F=86.6(7)(1.4) $ MeV in the $N_f=2$ chiral limit, and the next-to-leading order low energy constants $\ell_3=2.43(54)(05)$ and $\ell_4=4.322(75)(96)$.
△ Less
Submitted 7 January, 2024; v1 submitted 1 October, 2023;
originally announced October 2023.
-
First measurement of $ΛN$ inelastic scattering with $Λ$ from $e^{+} e^{-} \rightarrow J/ψ\to Λ\barΛ$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
H. -R. Bao,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (626 additional authors not shown)
Abstract:
Using an $e^+ e^-$ collision data sample of $(10087 \pm 44)\times10^6 ~J/ψ$ events taken at the center-of-mass energy of $3.097~\rm{GeV}$ by the BESIII detector at the BEPCII collider, the process $Λ+N \rightarrow Σ^+ + X$ is studied for the first time employing a novel method. The $Σ^{+}$ hyperons are produced by the collisions of $Λ$ hyperons from $J/ψ$ decays with nuclei in the material of the…
▽ More
Using an $e^+ e^-$ collision data sample of $(10087 \pm 44)\times10^6 ~J/ψ$ events taken at the center-of-mass energy of $3.097~\rm{GeV}$ by the BESIII detector at the BEPCII collider, the process $Λ+N \rightarrow Σ^+ + X$ is studied for the first time employing a novel method. The $Σ^{+}$ hyperons are produced by the collisions of $Λ$ hyperons from $J/ψ$ decays with nuclei in the material of the BESIII detector. The total cross section of $Λ+ ^{9}{\rm Be} \rightarrow Σ^+ + X$ is measured to be $σ= (37.3 \pm 4.7 \pm 3.5)~{\rm mb}$ at $Λ$ beam momenta within $[1.057, 1.091]~{\rm GeV}/c$, where the uncertainties are statistical and systematic, respectively. This analysis is the first study of $Λ$-nucleon interactions at an $e^+ e^-$ collider, providing information and constraints relevant for the strong-interaction potential, the origin of color confinement, the unified model for baryon-baryon interactions, and the internal structure of neutron stars.
△ Less
Submitted 1 October, 2023;
originally announced October 2023.
-
Updated measurements of the M1 transition $ψ(3686) \to γη_{c}(2S)$ with $η_{c}(2S) \to K \bar{K} π$
Authors:
BESIII Collaboration,
M. Ablikim,
M. N. Achasov,
P. Adlarson,
O. Afedulidis,
X. C. Ai,
R. Aliberti,
A. Amoroso,
M. R. An,
Q. An,
Y. Bai,
O. Bakina,
I. Balossino,
Y. Ban,
V. Batozskaya,
K. Begzsuren,
N. Berger,
M. Berlowski,
M. Bertani,
D. Bettoni,
F. Bianchi,
E. Bianco,
A. Bortone,
I. Boyko,
R. A. Briere
, et al. (609 additional authors not shown)
Abstract:
Based on a data sample of $(27.08 \pm 0.14 ) \times 10^8~ψ(3686)$ events collected with the BESIII detector at the BEPCII collider, the M1 transition $ψ(3686) \to γη_{c}(2S)$ with $η_{c}(2S) \to K\bar{K}π$ is studied, where $K\bar{K}π$ is $K^{+} K^{-} π^{0}$ or $K_{S}^{0}K^{\pm}π^{\mp}$. The mass and width of the $η_{c}(2S)$ are measured to be $(3637.8 \pm 0.8 (\rm {stat}) \pm 0.2 (\rm {syst}))$ M…
▽ More
Based on a data sample of $(27.08 \pm 0.14 ) \times 10^8~ψ(3686)$ events collected with the BESIII detector at the BEPCII collider, the M1 transition $ψ(3686) \to γη_{c}(2S)$ with $η_{c}(2S) \to K\bar{K}π$ is studied, where $K\bar{K}π$ is $K^{+} K^{-} π^{0}$ or $K_{S}^{0}K^{\pm}π^{\mp}$. The mass and width of the $η_{c}(2S)$ are measured to be $(3637.8 \pm 0.8 (\rm {stat}) \pm 0.2 (\rm {syst}))$ MeV/$c^{2}$ and $(10.5 \pm 1.7 (\rm {stat}) \pm 3.5 (\rm {syst}))$ MeV, respectively. The product branching fraction $\mathcal{B}\left(ψ(3686) \rightarrow γη_{c}(2 S)\right) \times \mathcal{B}(η_{c}(2 S) \rightarrow K \bar{K} π)$ is determined to be $(0.97 \pm 0.06 (\rm {stat}) \pm 0.09 (\rm {syst})) \times 10^{-5}$. Using $\mathcal{BR}(η_{c}(2S)\to K\bar{K}π)=(1.86^{+0.68}_{-0.49})\%$, we obtain the branching fraction of the radiative transition to be $\mathcal{BR}(ψ(3686) \to γη_{c}(2S)) = (5.2 \pm 0.3 (\rm {stat}) \pm 0.5 (\rm {syst}) ^{+1.9}_{-1.4} (extr)) \times 10^{-4}$, where the third uncertainty is due to the quoted $\mathcal{BR}(η_{c}(2S) \to K\bar{K}π)$.
△ Less
Submitted 26 September, 2023;
originally announced September 2023.
-
Bad Actor, Good Advisor: Exploring the Role of Large Language Models in Fake News Detection
Authors:
Beizhe Hu,
Qiang Sheng,
Juan Cao,
Yuhui Shi,
Yang Li,
Danding Wang,
Peng Qi
Abstract:
Detecting fake news requires both a delicate sense of diverse clues and a profound understanding of the real-world background, which remains challenging for detectors based on small language models (SLMs) due to their knowledge and capability limitations. Recent advances in large language models (LLMs) have shown remarkable performance in various tasks, but whether and how LLMs could help with fak…
▽ More
Detecting fake news requires both a delicate sense of diverse clues and a profound understanding of the real-world background, which remains challenging for detectors based on small language models (SLMs) due to their knowledge and capability limitations. Recent advances in large language models (LLMs) have shown remarkable performance in various tasks, but whether and how LLMs could help with fake news detection remains underexplored. In this paper, we investigate the potential of LLMs in fake news detection. First, we conduct an empirical study and find that a sophisticated LLM such as GPT 3.5 could generally expose fake news and provide desirable multi-perspective rationales but still underperforms the basic SLM, fine-tuned BERT. Our subsequent analysis attributes such a gap to the LLM's inability to select and integrate rationales properly to conclude. Based on these findings, we propose that current LLMs may not substitute fine-tuned SLMs in fake news detection but can be a good advisor for SLMs by providing multi-perspective instructive rationales. To instantiate this proposal, we design an adaptive rationale guidance network for fake news detection (ARG), in which SLMs selectively acquire insights on news analysis from the LLMs' rationales. We further derive a rationale-free version of ARG by distillation, namely ARG-D, which services cost-sensitive scenarios without querying LLMs. Experiments on two real-world datasets demonstrate that ARG and ARG-D outperform three types of baseline methods, including SLM-based, LLM-based, and combinations of small and large language models.
△ Less
Submitted 22 January, 2024; v1 submitted 21 September, 2023;
originally announced September 2023.
-
Self-morphing of elastic bilayers induced by mismatch strain: deformation simulation and bio-inspired design
Authors:
Junjie Song,
Yixiong Feng,
Zhaoxi Hong,
Bingtao Hu,
Jianrong Tan,
Xiuju Song
Abstract:
The process of self-morphing in curved surfaces found in nature, such as with the growth of flowers and leaves, has generated interest in the study of self-morphing bilayers, which has been used in many soft robots or switchers. However, previous research has primarily focused on materials or bilayer fabrication technologies. The self-morphing mechanism and process have been rarely investigated, d…
▽ More
The process of self-morphing in curved surfaces found in nature, such as with the growth of flowers and leaves, has generated interest in the study of self-morphing bilayers, which has been used in many soft robots or switchers. However, previous research has primarily focused on materials or bilayer fabrication technologies. The self-morphing mechanism and process have been rarely investigated, despite their importance. This study proposed a new deformation simulation method for self-morphing bilayers based on a checkerboard-based discrete differential geometry approach. This new method achieved higher efficiency than traditional finite element methods while still maintaining accuracy. It was also effective in handling complex finite strain situations. Finally, the simulation model was used to design three self-morphing bilayers inspired by folding flowers, spiral grass, and conical seashells. These designs further prove the effectiveness of the proposed method. The results of this study propose a good method for predicting deformation and designing self-morphing bilayers and provide a useful viewpoint for using geometrical methods to solve mechanical problems.
△ Less
Submitted 18 September, 2023;
originally announced September 2023.