Search | arXiv e-print repository

Tetraquark nature of the $a_0(980)$ meson in hadronic $D$ decays

Authors: Hai-Yang Cheng, Cheng-Wei Chiang, Fanrong Xu

Abstract: The internal structure of the light scalar meson $a_0(980)$ is explored in the three-body $D$ decays of $D\to a_0(980)P\to P_1P_2P$ through the intermediate state $a_0(980)$, where $P$ denotes a pseudoscalar meson. The quasi-two-body $D\to a_0(980)^+P$ decays are governed by the external $W$-emission diagram in which $a_0(980)^+$ is emitted. The predicted branching fractions in the $q\bar q$ model… ▽ More The internal structure of the light scalar meson $a_0(980)$ is explored in the three-body $D$ decays of $D\to a_0(980)P\to P_1P_2P$ through the intermediate state $a_0(980)$, where $P$ denotes a pseudoscalar meson. The quasi-two-body $D\to a_0(980)^+P$ decays are governed by the external $W$-emission diagram in which $a_0(980)^+$ is emitted. The predicted branching fractions in the $q\bar q$ model of $a_0(980)$ are too small by one to two orders of magnitude compared to experiment as the amplitude is suppressed by the smallness of the $a_0(980)^+$ decay constant, while those for $D^+\to a_0(980)^0 P$ and $D^0\to a_0(980)^{-}P$ are usually too large. These discrepancies can be resolved provided that $a_0(980)$ is a tetraquark state. In this case, there exist two additional $T$-like topological amplitudes, denoted by $\overline{T}$ and $\tilde T$ which readily account for the discrepancies. An important implication of the tetraquark model is that the $D_s^+\to a_0(980)^+π^0+a_0(980)^0π^+$ decay is not a purely $W$-annihilation process as in the diquark model of $a_0(980)$; it receives dominant contributions from $\overline{T}$ newly noticed in this work. Therefore, measurements of $(D,D_s^+)\to a_0(980)P$ decays lend strong support to the tetraquark picture of $a_0(980)$. △ Less

Submitted 25 August, 2024; originally announced August 2024.

Comments: 14 pages, 1 figure. arXiv admin note: text overlap with arXiv:2201.00460

arXiv:2408.10444 [pdf, other]

In-Flight Performance of Spider's 280 GHz Receivers

Authors: Elle C. Shaw, P. A. R. Ade, S. Akers, M. Amiri, J. Austermann, J. Beall, D. T. Becker, S. J. Benton, A. S. Bergman, J. J. Bock, J. R. Bond, S. A. Bryan, H. C. Chiang, C. R. Contaldi, R. S. Domagalski, O. Doré, S. M. Duff, A. J. Duivenvoorden, H. K. Eriksen, M. Farhang, J. P. Filippini, L. M. Fissel, A. A. Fraisse, K. Freese, M. Galloway , et al. (62 additional authors not shown)

Abstract: SPIDER is a balloon-borne instrument designed to map the cosmic microwave background at degree-angular scales in the presence of Galactic foregrounds. SPIDER has mapped a large sky area in the Southern Hemisphere using more than 2000 transition-edge sensors (TESs) during two NASA Long Duration Balloon flights above the Antarctic continent. During its first flight in January 2015, SPIDER observed i… ▽ More SPIDER is a balloon-borne instrument designed to map the cosmic microwave background at degree-angular scales in the presence of Galactic foregrounds. SPIDER has mapped a large sky area in the Southern Hemisphere using more than 2000 transition-edge sensors (TESs) during two NASA Long Duration Balloon flights above the Antarctic continent. During its first flight in January 2015, SPIDER observed in the 95 GHz and 150 GHz frequency bands, setting constraints on the B-mode signature of primordial gravitational waves. Its second flight in the 2022-23 season added new receivers at 280 GHz, each using an array of TESs coupled to the sky through feedhorns formed from stacks of silicon wafers. These receivers are optimized to produce deep maps of polarized Galactic dust emission over a large sky area, providing a unique data set with lasting value to the field. In this work, we describe the instrument's performance during SPIDER's second flight. △ Less

Submitted 19 August, 2024; originally announced August 2024.

Comments: Submitted to SPIE Astronomical Telescopes + Instrumentation 2024, JATIS

arXiv:2407.20982 [pdf, other]

Analysis of Polarized Dust Emission from the First Flight of the SPIDER Balloon-Borne Telescope

Authors: SPIDER Collaboration, P. A. R. Ade, M. Amiri, S. J. Benton, A. S. Bergman, R. Bihary, J. J. Bock, J. R. Bond, J. A. Bonetti, S. A. Bryan, H. C. Chiang, C. R. Contaldi, O. Doré, A. J. Duivenvoorden, H. K. Eriksen, J. P. Filippini, A. A. Fraisse, K. Freese, M. Galloway, A. E. Gambrel, N. N. Gandilo, K. Ganga, S. Gourapura, R. Gualtieri, J. E. Gudmundsson , et al. (45 additional authors not shown)

Abstract: Using data from the first flight of SPIDER and from Planck HFI, we probe the properties of polarized emission from interstellar dust in the SPIDER observing region. Component separation algorithms operating in both the spatial and harmonic domains are applied to probe their consistency and to quantify modeling errors associated with their assumptions. Analyses spanning the full SPIDER region demon… ▽ More Using data from the first flight of SPIDER and from Planck HFI, we probe the properties of polarized emission from interstellar dust in the SPIDER observing region. Component separation algorithms operating in both the spatial and harmonic domains are applied to probe their consistency and to quantify modeling errors associated with their assumptions. Analyses spanning the full SPIDER region demonstrate that i) the spectral energy distribution of diffuse Galactic dust emission is broadly consistent with a modified-blackbody (MBB) model with a spectral index of $β_\mathrm{d}=1.45\pm0.05$ $(1.47\pm0.06)$ for $E$ ($B$)-mode polarization, slightly lower than that reported by Planck for the full sky; ii) its angular power spectrum is broadly consistent with a power law; and iii) there is no significant detection of line-of-sight decorrelation of the astrophysical polarization. The size of the SPIDER region further allows for a statistically meaningful analysis of the variation in foreground properties within it. Assuming a fixed dust temperature $T_\mathrm{d}=19.6$ K, an analysis of two independent sub-regions of that field results in inferred values of $β_\mathrm{d}=1.52\pm0.06$ and $β_\mathrm{d}=1.09\pm0.09$, which are inconsistent at the $3.9\,σ$ level. Furthermore, a joint analysis of SPIDER and Planck 217 and 353 GHz data within a subset of the SPIDER region is inconsistent with a simple MBB at more than $3\,σ$, assuming a common morphology of polarized dust emission over the full range of frequencies. These modeling uncertainties have a small--but non-negligible--impact on limits on the cosmological tensor-to-scalar ratio derived from the \spider dataset. The fidelity of the component separation approaches of future CMB polarization experiments may thus have a significant impact on their constraining power. △ Less

Submitted 30 July, 2024; originally announced July 2024.

Comments: 21 pages, 15 figures

arXiv:2407.12867 [pdf, other]

Swift-BAT GUANO follow-up of gravitational-wave triggers in the third LIGO-Virgo-KAGRA observing run

Authors: Gayathri Raman, Samuele Ronchini, James Delaunay, Aaron Tohuvavohu, Jamie A. Kennea, Tyler Parsotan, Elena Ambrosi, Maria Grazia Bernardini, Sergio Campana, Giancarlo Cusumano, Antonino D'Ai, Paolo D'Avanzo, Valerio D'Elia, Massimiliano De Pasquale, Simone Dichiara, Phil Evans, Dieter Hartmann, Paul Kuin, Andrea Melandri, Paul O'Brien, Julian P. Osborne, Kim Page, David M. Palmer, Boris Sbarufatti, Gianpiero Tagliaferri , et al. (1797 additional authors not shown)

Abstract: We present results from a search for X-ray/gamma-ray counterparts of gravitational-wave (GW) candidates from the third observing run (O3) of the LIGO-Virgo-KAGRA (LVK) network using the Swift Burst Alert Telescope (Swift-BAT). The search includes 636 GW candidates received in low latency, 86 of which have been confirmed by the offline analysis and included in the third cumulative Gravitational-Wav… ▽ More We present results from a search for X-ray/gamma-ray counterparts of gravitational-wave (GW) candidates from the third observing run (O3) of the LIGO-Virgo-KAGRA (LVK) network using the Swift Burst Alert Telescope (Swift-BAT). The search includes 636 GW candidates received in low latency, 86 of which have been confirmed by the offline analysis and included in the third cumulative Gravitational-Wave Transient Catalogs (GWTC-3). Targeted searches were carried out on the entire GW sample using the maximum--likelihood NITRATES pipeline on the BAT data made available via the GUANO infrastructure. We do not detect any significant electromagnetic emission that is temporally and spatially coincident with any of the GW candidates. We report flux upper limits in the 15-350 keV band as a function of sky position for all the catalog candidates. For GW candidates where the Swift-BAT false alarm rate is less than 10$^{-3}$ Hz, we compute the GW--BAT joint false alarm rate. Finally, the derived Swift-BAT upper limits are used to infer constraints on the putative electromagnetic emission associated with binary black hole mergers. △ Less

Submitted 13 July, 2024; originally announced July 2024.

Comments: 50 pages, 10 figures, 4 tables

arXiv:2407.05216 [pdf, other]

Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course

Authors: Cheng-Han Chiang, Wei-Chih Chen, Chun-Yi Kuan, Chienchou Yang, Hung-yi Lee

Abstract: Using large language models (LLMs) for automatic evaluation has become an important evaluation method in NLP research. However, it is unclear whether these LLM-based evaluators can be applied in real-world classrooms to assess student assignments. This empirical report shares how we use GPT-4 as an automatic assignment evaluator in a university course with 1,028 students. Based on student response… ▽ More Using large language models (LLMs) for automatic evaluation has become an important evaluation method in NLP research. However, it is unclear whether these LLM-based evaluators can be applied in real-world classrooms to assess student assignments. This empirical report shares how we use GPT-4 as an automatic assignment evaluator in a university course with 1,028 students. Based on student responses, we find that LLM-based assignment evaluators are generally acceptable to students when students have free access to these LLM-based evaluators. However, students also noted that the LLM sometimes fails to adhere to the evaluation instructions. Additionally, we observe that students can easily manipulate the LLM-based evaluator to output specific strings, allowing them to achieve high scores without meeting the assignment rubric. Based on student feedback and our experience, we provide several recommendations for integrating LLM-based evaluators into future classrooms. △ Less

Submitted 6 July, 2024; originally announced July 2024.

Comments: An empirical report of our course: Introduction to Generative AI 2024 Spring (https://speech.ee.ntu.edu.tw/~hylee/genai/2024-spring.php)

arXiv:2407.01096 [pdf, other]

Indirect detection constraints on semi-annihilation of inert scalar multiplets

Authors: Hugues Beauchesne, Cheng-Wei Chiang

Abstract: Certain models of inert multiplets allow for semi-annihilation processes, in which two dark matter candidates annihilate to a dark matter particle and a non-dark matter particle. The existence of these processes can alleviate certain constraints and substantially modify the indirect detection signal. In this paper, we study current indirect detection constraints on the semi-annihilation of inert s… ▽ More Certain models of inert multiplets allow for semi-annihilation processes, in which two dark matter candidates annihilate to a dark matter particle and a non-dark matter particle. The existence of these processes can alleviate certain constraints and substantially modify the indirect detection signal. In this paper, we study current indirect detection constraints on the semi-annihilation of inert scalar multiplets. We show that there exist gauge numbers for which dark matter can be thermally produced and be compatible with indirect detection constraints even for very cuspy galactic dark matter density profiles. △ Less

Submitted 1 July, 2024; originally announced July 2024.

Comments: 24 pages, 4 figures

arXiv:2407.00856 [pdf, other]

Drone-Based Antenna Beam Calibration in the High Arctic

Authors: Lawrence Herman, Christopher Barbarie, Mohan Agrawal, Vlad Calinescu, Simon Chen, H. Cynthia Chiang, Cherie K. Day, Eamon Egan, Stephen Fay, Kit Gerodias, Maya Goss, Michael Hétu, Daniel C. Jacobs, Marc-Olivier R. Lalonde, Francis McGee, Loïc Miara, John Orlowski-Scherer, Jonathan Sievers

Abstract: The development of low-frequency radio astronomy experiments for detecting 21-cm line emission from hydrogen presents new opportunities for creative solutions to the challenge of characterizing an antenna beam pattern. The Array of Long Baseline Antennas for Taking Radio Observations from the Seventy-ninth parallel (ALBATROS) is a new radio interferometer sited in the Canadian high Arctic that aim… ▽ More The development of low-frequency radio astronomy experiments for detecting 21-cm line emission from hydrogen presents new opportunities for creative solutions to the challenge of characterizing an antenna beam pattern. The Array of Long Baseline Antennas for Taking Radio Observations from the Seventy-ninth parallel (ALBATROS) is a new radio interferometer sited in the Canadian high Arctic that aims to map Galactic foregrounds at frequencies below $\sim$30 MHz. We present PteroSoar, a custom-built hexacopter outfitted with a transmitter, that will be used to characterize the beam patterns of ALBATROS and other experiments. The PteroSoar drone hardware is motivated by the need for user-servicing at remote sites and environmental factors that are unique to the high Arctic. In particular, magnetic heading is unreliable because the magnetic field lines near the north pole are almost vertical. We therefore implement moving baseline real time kinematic (RTK) positioning with two GPS units to obtain heading solutions with $\sim$1$^\circ$ accuracy. We present a preliminary beam map of an ALBATROS antenna, thus demonstrating successful PteroSoar operation in the high Arctic. △ Less

Submitted 30 June, 2024; originally announced July 2024.

arXiv:2406.19538 [pdf, other]

Context Matters: An Empirical Study of the Impact of Contextual Information in Temporal Question Answering Systems

Authors: Dan Schumacher, Fatemeh Haji, Tara Grey, Niharika Bandlamudi, Nupoor Karnik, Gagana Uday Kumar, Jason Cho-Yu Chiang, Paul Rad, Nishant Vishwamitra, Anthony Rios

Abstract: Large language models (LLMs) often struggle with temporal reasoning, crucial for tasks like historical event analysis and time-sensitive information retrieval. Despite advancements, state-of-the-art models falter in handling temporal information, especially when faced with irrelevant or noisy contexts. This paper addresses this gap by empirically examining the robustness of temporal question-answe… ▽ More Large language models (LLMs) often struggle with temporal reasoning, crucial for tasks like historical event analysis and time-sensitive information retrieval. Despite advancements, state-of-the-art models falter in handling temporal information, especially when faced with irrelevant or noisy contexts. This paper addresses this gap by empirically examining the robustness of temporal question-answering (TQA) systems trained on various context types, including relevant, irrelevant, slightly altered, and no context. Our findings indicate that training with a mix of these contexts enhances model robustness and accuracy. Additionally, we show that the position of context relative to the question significantly impacts performance, with question-first positioning yielding better results. We introduce two new context-rich TQA datasets, ContextAQA and ContextTQE, and provide comprehensive evaluations and guidelines for training robust TQA models. Our work lays the foundation for developing reliable and context-aware temporal QA systems, with broader implications for enhancing LLM robustness against diverse and potentially adversarial information. △ Less

Submitted 27 June, 2024; originally announced June 2024.

arXiv:2404.10528 [pdf, other]

AllTheDocks road safety dataset: A cyclist's perspective and experience

Authors: Chia-Yen Chiang, Ruikang Zhong, Jennifer Ding, Joseph Wood, Stephen Bee, Mona Jaber

Abstract: Active travel is an essential component in intelligent transportation systems. Cycling, as a form of active travel, shares the road space with motorised traffic which often affects the cyclists' safety and comfort and therefore peoples' propensity to uptake cycling instead of driving. This paper presents a unique dataset, collected by cyclists across London, that includes video footage, accelerome… ▽ More Active travel is an essential component in intelligent transportation systems. Cycling, as a form of active travel, shares the road space with motorised traffic which often affects the cyclists' safety and comfort and therefore peoples' propensity to uptake cycling instead of driving. This paper presents a unique dataset, collected by cyclists across London, that includes video footage, accelerometer, GPS, and gyroscope data. The dataset is then labelled by an independent group of London cyclists to rank the safety level of each frame and to identify objects in the cyclist's field of vision that might affect their experience. Furthermore, in this dataset, the quality of the road is measured by the international roughness index of the surface, which indicates the comfort of cycling on the road. The dataset will be made available for open access in the hope of motivating more research in this area to underpin the requirements for cyclists' safety and comfort and encourage more people to replace vehicle travel with cycling. △ Less

Submitted 16 April, 2024; originally announced April 2024.

arXiv:2403.17847 [pdf, other]

Climate Downscaling: A Deep-Learning Based Super-resolution Model of Precipitation Data with Attention Block and Skip Connections

Authors: Chia-Hao Chiang, Zheng-Han Huang, Liwen Liu, Hsin-Chien Liang, Yi-Chi Wang, Wan-Ling Tseng, Chao Wang, Che-Ta Chen, Ko-Chih Wang

Abstract: Human activities accelerate consumption of fossil fuels and produce greenhouse gases, resulting in urgent issues today: global warming and the climate change. These indirectly cause severe natural disasters, plenty of lives suffering and huge losses of agricultural properties. To mitigate impacts on our lands, scientists are developing renewable, reusable, and clean energies and climatologists are… ▽ More Human activities accelerate consumption of fossil fuels and produce greenhouse gases, resulting in urgent issues today: global warming and the climate change. These indirectly cause severe natural disasters, plenty of lives suffering and huge losses of agricultural properties. To mitigate impacts on our lands, scientists are developing renewable, reusable, and clean energies and climatologists are trying to predict the extremes. Meanwhile, governments are publicizing resource-saving policies for a more eco-friendly society and arousing environment awareness. One of the most influencing factors is the precipitation, bringing condensed water vapor onto lands. Water resources are the most significant but basic needs in society, not only supporting our livings, but also economics. In Taiwan, although the average annual precipitation is up to 2,500 millimeter (mm), the water allocation for each person is lower than the global average due to drastically geographical elevation changes and uneven distribution through the year. Thus, it is crucial to track and predict the rainfall to make the most use of it and to prevent the floods. However, climate models have limited resolution and require intensive computational power for local-scale use. Therefore, we proposed a deep convolutional neural network with skip connections, attention blocks, and auxiliary data concatenation, in order to downscale the low-resolution precipitation data into high-resolution one. Eventually, we compare with other climate downscaling methods and show better performance in metrics of Mean Absolute Error (MAE), Root Mean Square Error (RMSE), Pearson Correlation, structural similarity index (SSIM), and forecast indicators. △ Less

Submitted 26 March, 2024; originally announced March 2024.

arXiv:2403.03004 [pdf, other]

Ultralight vector dark matter search using data from the KAGRA O3GK run

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, I. Abouelfettouh, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi , et al. (1778 additional authors not shown)

Abstract: Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we prese… ▽ More Among the various candidates for dark matter (DM), ultralight vector DM can be probed by laser interferometric gravitational wave detectors through the measurement of oscillating length changes in the arm cavities. In this context, KAGRA has a unique feature due to differing compositions of its mirrors, enhancing the signal of vector DM in the length change in the auxiliary channels. Here we present the result of a search for $U(1)_{B-L}$ gauge boson DM using the KAGRA data from auxiliary length channels during the first joint observation run together with GEO600. By applying our search pipeline, which takes into account the stochastic nature of ultralight DM, upper bounds on the coupling strength between the $U(1)_{B-L}$ gauge boson and ordinary matter are obtained for a range of DM masses. While our constraints are less stringent than those derived from previous experiments, this study demonstrates the applicability of our method to the lower-mass vector DM search, which is made difficult in this measurement by the short observation time compared to the auto-correlation time scale of DM. △ Less

Submitted 5 March, 2024; originally announced March 2024.

Comments: 20 pages, 5 figures

Report number: LIGO-P2300250

arXiv:2403.02897 [pdf, other]

Rare $B$ and $K$ decays in a scotogenic model

Authors: Chuan-Hung Chen, Cheng-Wei Chiang

Abstract: A scotogenic model can radiatively generate the observed neutrino mass, provide a dark matter candidate, and lead to rare lepton flavor-violating processes. We aim to extend the model to establish a potential connection to the quark flavor-related processes within the framework of scotogenesis, enhancing the unexpectedly large branching ratio (BR) of $B^+\to K^+ ν\barν$, observed by Belle II Colla… ▽ More A scotogenic model can radiatively generate the observed neutrino mass, provide a dark matter candidate, and lead to rare lepton flavor-violating processes. We aim to extend the model to establish a potential connection to the quark flavor-related processes within the framework of scotogenesis, enhancing the unexpectedly large branching ratio (BR) of $B^+\to K^+ ν\barν$, observed by Belle II Collaboration. Meanwhile, the model can address tensions between some experimental measurements and standard model (SM) predictions in flavor physics, such as the muon $g-2$ excess and the higher BR of $B_s \to μ^- μ^+$. We introduce in the model the following dark particles: a neutral singlet Dirac-type lepton ($N$); two inert Higgs doublets ($η_{1,2}$), with one of which carrying a lepton number; a charged singlet dark scalar $(χ^+)$, and a singlet vector-like up-type dark quark ($T$). The first two entities are responsible for the radiative neutrino mass, and $χ^+$ couples to right-handed quarks and leptons and can resolve the tensions existing in muon $g-2$ and $B_s\to μ^- μ^+$. Furthermore, the BR of $B^+ \to K^+ ν\barν$ can be enhanced up to a factor of 2 compared to the SM prediction through the mediations of the dark $T$ and the charged scalars. In addition, we also study the impacts on the $K\to πν\barν$ decays. △ Less

Submitted 14 March, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

Comments: 34 pages, 6 figures, references added, text revised

arXiv:2403.02337 [pdf, other]

First Constraints on the Epoch of Reionization Using the non-Gaussianity of the Kinematic Sunyaev-Zel{'}dovich Effect from the South Pole Telescope and {\it Herschel}-SPIRE Observations

Authors: S. Raghunathan, P. A. R. Ade, A. J. Anderson, B. Ansarinejad, M. Archipley, J. E. Austermann, L. Balkenhol, J. A. Beall, K. Benabed, A. N. Bender, B. A. Benson, F. Bianchini, L. E. Bleem, J. Bock, F. R. Bouchet, L. Bryant, E. Camphuis, J. E. Carlstrom, T. W. Cecil, C. L. Chang, P. Chaubal, H. C. Chiang, P. M. Chichura, T. -L. Chou, R. Citron , et al. (99 additional authors not shown)

Abstract: We report results from an analysis aimed at detecting the trispectrum of the kinematic Sunyaev-Zel{'}dovich (kSZ) effect by combining data from the South Pole Telescope (SPT) and {\it Herschel}-SPIRE experiments over a 100 ${\rm deg}^{2}$ field. The SPT observations combine data from the previous and current surveys, namely SPTpol and SPT-3G, to achieve depths of 4.5, 3, and 16 $μ{\rm K-arcmin}$ i… ▽ More We report results from an analysis aimed at detecting the trispectrum of the kinematic Sunyaev-Zel{'}dovich (kSZ) effect by combining data from the South Pole Telescope (SPT) and {\it Herschel}-SPIRE experiments over a 100 ${\rm deg}^{2}$ field. The SPT observations combine data from the previous and current surveys, namely SPTpol and SPT-3G, to achieve depths of 4.5, 3, and 16 $μ{\rm K-arcmin}$ in bands centered at 95, 150, and 220 GHz. For SPIRE, we include data from the 600 and 857 GHz bands. We reconstruct the velocity-induced large-scale correlation of the small-scale kSZ signal with a quadratic estimator that uses two cosmic microwave background (CMB) temperature maps, constructed by optimally combining data from all the frequency bands. We reject the null hypothesis of a zero trispectrum at $10.3σ$ level. However, the measured trispectrum contains contributions from both the kSZ and other undesired components, such as CMB lensing and astrophysical foregrounds, with kSZ being sub-dominant. We use the \textsc{Agora} simulations to estimate the expected signal from CMB lensing and astrophysical foregrounds. After accounting for the contributions from CMB lensing and foreground signals, we do not detect an excess kSZ-only trispectrum and use this non-detection to set constraints on reionization. By applying a prior based on observations of the Gunn-Peterson trough, we obtain an upper limit on the duration of reionization of $Δz_{\rm re, 50} < 4.5$ (95\% C.L). We find these constraints are fairly robust to foregrounds assumptions. This trispectrum measurement is independent of, but consistent with, {\it Planck}'s optical depth measurement. This result is the first constraint on the epoch of reionization using the non-Gaussian nature of the kSZ signal. △ Less

Submitted 15 August, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

Comments: 15 pages, 5 figures (3 in main text and 2 in Appendix); Accepted for publication in PRL; Some texts have been moved to Appendix; Minor change in Fig. 2 to include nomalization; Data products and plotting scripts can be downloaded from https://github.com/sriniraghunathan/kSZ_4pt_SPT_SPIRE

arXiv:2403.01729 [pdf, other]

Dark matter semi-annihilation for inert scalar multiplets

Authors: Hugues Beauchesne, Cheng-Wei Chiang

Abstract: Dark matter semi-annihilation is a process through which two dark matter candidates annihilate to a single dark matter particle and a non-dark matter particle. Such processes are common when the symmetry stabilizing the dark matter differs from $\mathbb{Z}_2$ and can lead to qualitatively different phenomenology. In this work, we study the viability of semi-annihilation models including one or two… ▽ More Dark matter semi-annihilation is a process through which two dark matter candidates annihilate to a single dark matter particle and a non-dark matter particle. Such processes are common when the symmetry stabilizing the dark matter differs from $\mathbb{Z}_2$ and can lead to qualitatively different phenomenology. In this work, we study the viability of semi-annihilation models including one or two inert multiplets. For one multiplet, we show that there does not exist any viable model in which semi-annihilation is efficient. For two multiplets, semi-annihilation can be efficient, but the number of viable and technically natural models is limited. We then perform a detailed study of the most promising model, showing that the correct relic abundance can be obtained for a wide range of masses. △ Less

Submitted 27 June, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

Comments: 26 pages, 4 figures, references added, treatment of SE improved, matches published version

arXiv:2402.12786 [pdf, other]

Advancing Large Language Models to Capture Varied Speaking Styles and Respond Properly in Spoken Conversations

Authors: Guan-Ting Lin, Cheng-Han Chiang, Hung-yi Lee

Abstract: In spoken dialogue, even if two current turns are the same sentence, their responses might still differ when they are spoken in different styles. The spoken styles, containing paralinguistic and prosodic information, mark the most significant difference between text and speech modality. When using text-only LLMs to model spoken dialogue, text-only LLMs cannot give different responses based on the… ▽ More In spoken dialogue, even if two current turns are the same sentence, their responses might still differ when they are spoken in different styles. The spoken styles, containing paralinguistic and prosodic information, mark the most significant difference between text and speech modality. When using text-only LLMs to model spoken dialogue, text-only LLMs cannot give different responses based on the speaking style of the current turn. In this paper, we focus on enabling LLMs to listen to the speaking styles and respond properly. Our goal is to teach the LLM that "even if the sentences are identical if they are spoken in different styles, their corresponding responses might be different". Since there is no suitable dataset for achieving this goal, we collect a speech-to-speech dataset, StyleTalk, with the following desired characteristics: when two current speeches have the same content but are spoken in different styles, their responses will be different. To teach LLMs to understand and respond properly to the speaking styles, we propose the Spoken-LLM framework that can model the linguistic content and the speaking styles. We train Spoken-LLM using the StyleTalk dataset and devise a two-stage training pipeline to help the Spoken-LLM better learn the speaking styles. Based on extensive experiments, we show that Spoken-LLM outperforms text-only baselines and prior speech LLMs methods. △ Less

Submitted 30 May, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

Comments: Accepted by ACL 2024

arXiv:2402.05629 [pdf, other]

Merging Facts, Crafting Fallacies: Evaluating the Contradictory Nature of Aggregated Factual Claims in Long-Form Generations

Authors: Cheng-Han Chiang, Hung-yi Lee

Abstract: Long-form generations from large language models (LLMs) contain a mix of factual and non-factual claims, making evaluating factuality difficult. Prior works evaluate the factuality of a long paragraph by decomposing it into multiple facts, verifying those facts independently, and aggregating the results. Such methods assume that combining factual claims forms a factual paragraph. The above assumpt… ▽ More Long-form generations from large language models (LLMs) contain a mix of factual and non-factual claims, making evaluating factuality difficult. Prior works evaluate the factuality of a long paragraph by decomposing it into multiple facts, verifying those facts independently, and aggregating the results. Such methods assume that combining factual claims forms a factual paragraph. The above assumption can be violated: we show that strong open-source models like Llama-chat can generate paragraphs that contain verifiable facts, but the facts are combined into a non-factual paragraph due to entity ambiguity. We further reveal that existing factuality metrics, including FActScore and citation recall, cannot properly evaluate these non-factual paragraphs and overestimate their factuality. To address this, we introduce an enhanced metric, D-FActScore, specifically designed for content with ambiguous entities. We evaluate the D-FActScores of people biographies generated by retrieval-augmented LLMs. We show that D-FActScore can better assess the factuality of paragraphs with entity ambiguity than FActScore. We also find that four widely used open-source LLMs tend to mix information of distinct entities to form non-factual paragraphs, making their D-FActScore much lower than FActScore by over 10%. △ Less

Submitted 6 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

Comments: ACL 2024 Findings

arXiv:2402.03988 [pdf, other]

REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR

Authors: Liang-Hsuan Tseng, En-Pei Hu, Cheng-Han Chiang, Yuan Tseng, Hung-yi Lee, Lin-shan Lee, Shao-Hua Sun

Abstract: Unsupervised automatic speech recognition (ASR) aims to learn the mapping between the speech signal and its corresponding textual transcription without the supervision of paired speech-text data. A word/phoneme in the speech signal is represented by a segment of speech signal with variable length and unknown boundary, and this segmental structure makes learning the mapping between speech and text… ▽ More Unsupervised automatic speech recognition (ASR) aims to learn the mapping between the speech signal and its corresponding textual transcription without the supervision of paired speech-text data. A word/phoneme in the speech signal is represented by a segment of speech signal with variable length and unknown boundary, and this segmental structure makes learning the mapping between speech and text challenging, especially without paired data. In this paper, we propose REBORN,Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR. REBORN alternates between (1) training a segmentation model that predicts the boundaries of the segmental structures in speech signals and (2) training the phoneme prediction model, whose input is the speech feature segmented by the segmentation model, to predict a phoneme transcription. Since supervised data for training the segmentation model is not available, we use reinforcement learning to train the segmentation model to favor segmentations that yield phoneme sequence predictions with a lower perplexity. We conduct extensive experiments and find that under the same setting, REBORN outperforms all prior unsupervised ASR models on LibriSpeech, TIMIT, and five non-English languages in Multilingual LibriSpeech. We comprehensively analyze why the boundaries learned by REBORN improve the unsupervised ASR performance. △ Less

Submitted 28 May, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

arXiv:2402.01057 [pdf, other]

Expert Proximity as Surrogate Rewards for Single Demonstration Imitation Learning

Authors: Chia-Cheng Chiang, Li-Cheng Lan, Wei-Fang Sun, Chien Feng, Cho-Jui Hsieh, Chun-Yi Lee

Abstract: In this paper, we focus on single-demonstration imitation learning (IL), a practical approach for real-world applications where acquiring multiple expert demonstrations is costly or infeasible and the ground truth reward function is not available. In contrast to typical IL settings with multiple demonstrations, single-demonstration IL involves an agent having access to only one expert trajectory.… ▽ More In this paper, we focus on single-demonstration imitation learning (IL), a practical approach for real-world applications where acquiring multiple expert demonstrations is costly or infeasible and the ground truth reward function is not available. In contrast to typical IL settings with multiple demonstrations, single-demonstration IL involves an agent having access to only one expert trajectory. We highlight the issue of sparse reward signals in this setting and propose to mitigate this issue through our proposed Transition Discriminator-based IL (TDIL) method. TDIL is an IRL method designed to address reward sparsity by introducing a denser surrogate reward function that considers environmental dynamics. This surrogate reward function encourages the agent to navigate towards states that are proximal to expert states. In practice, TDIL trains a transition discriminator to differentiate between valid and non-valid transitions in a given environment to compute the surrogate rewards. The experiments demonstrate that TDIL outperforms existing IL approaches and achieves expert-level performance in the single-demonstration IL setting across five widely adopted MuJoCo benchmarks as well as the "Adroit Door" robotic environment. △ Less

Submitted 7 July, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

Comments: Published at ICML 2024. Code: https://github.com/stanl1y/tdil

arXiv:2401.14198 [pdf, other]

Deep Learning to Improve the Sensitivity of Di-Higgs Searches in the $4b$ Channel

Authors: Cheng-Wei Chiang, Feng-Yang Hsieh, Shih-Chieh Hsu, Ian Low

Abstract: The study of di-Higgs events, both resonant and non-resonant, plays a crucial role in understanding the fundamental interactions of the Higgs boson. In this work we consider di-Higgs events decaying into four $b$-quarks and propose to improve the experimental sensitivity by utilizing a novel machine learning algorithm known as Symmetry Preserving Attention Network (\textsc{Spa-Net}) -- a neural ne… ▽ More The study of di-Higgs events, both resonant and non-resonant, plays a crucial role in understanding the fundamental interactions of the Higgs boson. In this work we consider di-Higgs events decaying into four $b$-quarks and propose to improve the experimental sensitivity by utilizing a novel machine learning algorithm known as Symmetry Preserving Attention Network (\textsc{Spa-Net}) -- a neural network structure whose architecture is designed to incorporate the inherent symmetries in particle reconstruction tasks. We demonstrate that the \textsc{Spa-Net} can enhance the experimental reach over baseline methods such as the cut-based and the Deep Neural Networks (DNN)-based analyses. At the Large Hadron Collider, with a 14-TeV centre-of-mass energy and an integrated luminosity of 300 fb$^{-1}$, the \textsc{Spa-Net} allows us to establish 95\% C.L. upper limits in resonant production cross-sections that are 10\% to 45\% stronger than baseline methods. For non-resonant di-Higgs production, \textsc{Spa-Net} enables us to constrain the self-coupling that is 9\% more stringent than the baseline method. △ Less

Submitted 25 January, 2024; originally announced January 2024.

arXiv:2401.11467 [pdf, other]

Over-Reasoning and Redundant Calculation of Large Language Models

Authors: Cheng-Han Chiang, Hung-yi Lee

Abstract: Large language models (LLMs) can solve problems step-by-step. While this chain-of-thought (CoT) reasoning boosts LLMs' performance, it is unclear if LLMs \textit{know} when to use CoT and whether those CoT are always necessary to answer the question. This paper shows that LLMs tend to generate redundant calculations and reasoning on a manually constructed math QA dataset, GSM8K-Zero. GSM8K-Zero is… ▽ More Large language models (LLMs) can solve problems step-by-step. While this chain-of-thought (CoT) reasoning boosts LLMs' performance, it is unclear if LLMs \textit{know} when to use CoT and whether those CoT are always necessary to answer the question. This paper shows that LLMs tend to generate redundant calculations and reasoning on a manually constructed math QA dataset, GSM8K-Zero. GSM8K-Zero is constructed such that the questions can be answered without any calculations, but LLMs, including Llama-2 models and Claude-2, tend to generate lengthy and unnecessary calculations to answer the questions. We also conduct experiments to explain why LLMs generate redundant calculations and reasonings. GSM8K-Zero is publicly available at https://github.com/d223302/Over-Reasoning-of-LLMs and https://huggingface.co/datasets/dcml0714/GSM8K-Zero. △ Less

Submitted 20 March, 2024; v1 submitted 21 January, 2024; originally announced January 2024.

Comments: EACL 2024 main conference paper. Camera-ready version

arXiv:2401.06316 [pdf, ps, other]

Updated analysis of $D\to PP, V\!P$ and $VV$ decays: Implications for $K_S^0-K_L^0$ asymmetries and $D^0$-$\overline {D}^0$ mixing

Authors: Hai-Yang Cheng, Cheng-Wei Chiang

Abstract: An updated analysis of the two-body $D\to PP, V\!P$ and $VV$ decays within the framework of the topological diagram approach is performed. A global fit to the Cabibbo-favored (CF) modes in the $V\!P$ sector gives many solutions with similarly small local minima in $χ^2$. The solution degeneracy is lifted once we use them to predict for the singly Cabibbo-suppressed (SCS) modes. Topological amplitu… ▽ More An updated analysis of the two-body $D\to PP, V\!P$ and $VV$ decays within the framework of the topological diagram approach is performed. A global fit to the Cabibbo-favored (CF) modes in the $V\!P$ sector gives many solutions with similarly small local minima in $χ^2$. The solution degeneracy is lifted once we use them to predict for the singly Cabibbo-suppressed (SCS) modes. Topological amplitudes are extracted for the $η-η'$ mixing angles $φ=40.4^\circ$ and $43.5^\circ$. The $K_S^0-K_L^0$ asymmetries in $D\to K_{S,L}^0M$ decays denoted by $R(D,M)$ are studied. While the predicted $R(D^0,P)$ for $P=π^0, η$ and $η'$ agree with experiment, the calculated $R(D^+,π^+)$, $R(D_s^+, K^+)$, $R(D^0,ω)$ and $R(D^0,φ)$ deviate from the data. We conjecture that the relative phase between the topological amplitudes $(C+A)$ and $(T+C)$ should be slightly smaller than $90^\circ$ in order to explain the first two discrepancies and that additional singlet contributions due to the SU(3)-singlet nature of $ω$ and $φ$ are needed to account for the last two. For doubly Cabibbo-suppressed (DCS) $D\to V\!P$ decays, their topological amplitudes (double-primed) cannot be all the same as the corresponding ones in the CF modes. The assumption of $E_{V,P}''=E_{V,P}$ for the $W$-exchange amplitude leads to some inconsistencies with the experiment. Through the measured relative phases between CF and DCS channels, the relations of $E_{V,P}''$ with $E_{V,P}$ are determined. Long-distance contributions to the $D^0$-$\overline {D}^0$ mixing parameter $y$ are evaluated in the exclusive approach. In particular, we focus on $D\to PP$ and $V\!P$ decays where $y$ can be reliably estimated. We conclude that $y_{_{P\!P}}\sim (0.110\pm 0.011)\%$ and the lower bound on $y_{_{V\!P}}$ is $(0.220\pm 0.071)\%$. △ Less

Submitted 12 March, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

Comments: 33 pages, accepted by PRD

arXiv:2401.03657 [pdf, other]

Impact of non-thermal phase-space distributions on dark matter abundance in secluded sectors

Authors: Hugues Beauchesne, Cheng-Wei Chiang

Abstract: Many new physics models include secluded sectors that interact little with the Standard Model and whose internal interactions control the dark matter abundance. If these same interactions are responsible for maintaining kinematic equilibrium within the secluded sector, it is possible that the phase-space distributions will differ considerably from their thermal values during freeze-out. This can p… ▽ More Many new physics models include secluded sectors that interact little with the Standard Model and whose internal interactions control the dark matter abundance. If these same interactions are responsible for maintaining kinematic equilibrium within the secluded sector, it is possible that the phase-space distributions will differ considerably from their thermal values during freeze-out. This can potentially result in deviations of the dark matter abundance from that computed under the assumption of thermal distributions. In this paper, we revisit dark matter abundance computations for a benchmark secluded sector by numerically tracking the phase-space distributions. Namely, we show that the dark matter abundance can deviate considerably from standard results during the freeze-out process, but that a longer period of annihilation ultimately leaves only a slight excess. △ Less

Submitted 7 January, 2024; originally announced January 2024.

Comments: 14 pages, 2 figures

arXiv:2401.02075 [pdf, other]

SPT Clusters with DES and HST Weak Lensing. II. Cosmological Constraints from the Abundance of Massive Halos

Authors: S. Bocquet, S. Grandis, L. E. Bleem, M. Klein, J. J. Mohr, T. Schrabback, T. M. C. Abbott, P. A. R. Ade, M. Aguena, A. Alarcon, S. Allam, S. W. Allen, O. Alves, A. Amon, A. J. Anderson, J. Annis, B. Ansarinejad, J. E. Austermann, S. Avila, D. Bacon, M. Bayliss, J. A. Beall, K. Bechtol, M. R. Becker, A. N. Bender , et al. (171 additional authors not shown)

Abstract: We present cosmological constraints from the abundance of galaxy clusters selected via the thermal Sunyaev-Zel'dovich (SZ) effect in South Pole Telescope (SPT) data with a simultaneous mass calibration using weak gravitational lensing data from the Dark Energy Survey (DES) and the Hubble Space Telescope (HST). The cluster sample is constructed from the combined SPT-SZ, SPTpol ECS, and SPTpol 500d… ▽ More We present cosmological constraints from the abundance of galaxy clusters selected via the thermal Sunyaev-Zel'dovich (SZ) effect in South Pole Telescope (SPT) data with a simultaneous mass calibration using weak gravitational lensing data from the Dark Energy Survey (DES) and the Hubble Space Telescope (HST). The cluster sample is constructed from the combined SPT-SZ, SPTpol ECS, and SPTpol 500d surveys, and comprises 1,005 confirmed clusters in the redshift range $0.25-1.78$ over a total sky area of 5,200 deg$^2$. We use DES Year 3 weak-lensing data for 688 clusters with redshifts $z<0.95$ and HST weak-lensing data for 39 clusters with $0.6<z<1.7$. The weak-lensing measurements enable robust mass measurements of sample clusters and allow us to empirically constrain the SZ observable--mass relation. For a flat $Λ$CDM cosmology, and marginalizing over the sum of massive neutrinos, we measure $Ω_\mathrm{m}=0.286\pm0.032$, $σ_8=0.817\pm0.026$, and the parameter combination $σ_8\,(Ω_\mathrm{m}/0.3)^{0.25}=0.805\pm0.016$. Our measurement of $S_8\equivσ_8\,\sqrt{Ω_\mathrm{m}/0.3}=0.795\pm0.029$ and the constraint from Planck CMB anisotropies (2018 TT,TE,EE+lowE) differ by $1.1σ$. In combination with that Planck dataset, we place a 95% upper limit on the sum of neutrino masses $\sum m_ν<0.18$ eV. When additionally allowing the dark energy equation of state parameter $w$ to vary, we obtain $w=-1.45\pm0.31$ from our cluster-based analysis. In combination with Planck data, we measure $w=-1.34^{+0.22}_{-0.15}$, or a $2.2σ$ difference with a cosmological constant. We use the cluster abundance to measure $σ_8$ in five redshift bins between 0.25 and 1.8, and we find the results to be consistent with structure growth as predicted by the $Λ$CDM model fit to Planck primary CMB data. △ Less

Submitted 21 June, 2024; v1 submitted 4 January, 2024; originally announced January 2024.

Comments: Accepted for publication in Phys. Rev. D. arXiv v2 corresponds to published article

arXiv:2312.13239 [pdf, other]

doi 10.1103/PhysRevD.109.075043

A 95 GeV Higgs Boson in the Georgi-Machacek Model

Authors: Ting-Kuo Chen, Cheng-Wei Chiang, Sven Heinemeyer, Georg Weiglein

Abstract: CMS and ATLAS have reported small excesses in the search for low-mass Higgs bosons in the di-photon decay channel at exactly the same mass, $95.4~$GeV. These searches rely on improved analysis techniques, enhancing in particular the discrimination against the $Z \to e^+e^-$ background. In models beyond the Standard Model (SM) that extend the Higgs sector with triplets, doubly-charged Higgs bosons… ▽ More CMS and ATLAS have reported small excesses in the search for low-mass Higgs bosons in the di-photon decay channel at exactly the same mass, $95.4~$GeV. These searches rely on improved analysis techniques, enhancing in particular the discrimination against the $Z \to e^+e^-$ background. In models beyond the Standard Model (SM) that extend the Higgs sector with triplets, doubly-charged Higgs bosons are predicted which can contribute substantially to the di-photon decay rate of a light Higgs boson. The Georgi-Machacek (GM) Model is of particular interest in this context, since despite containing Higgs triplets it preserves the electroweak $ρ$-parameter to be$~$1 at the tree level. We show that within the GM model, a Higgs boson with a mass of $\sim 95~$GeV with a di-photon decay rate as observed by CMS and ATLAS can be well described. We discuss the di-photon excess in conjunction with an excess in the $b \bar b$ final state observed at LEP and an excess observed by CMS in the di-tau final state, which have been found at comparable masses with local significances of about $2σ$ and $3σ$, respectively. The presence of a Higgs boson at about $95~$GeV within the GM model would imply good prospects of the searches for additional light Higgs bosons. In particular, the observed excess in the di-photon channel would be expected to be correlated in the GM model with a light doubly-charged Higgs boson in the mass range between $100~$GeV and $200~$GeV, which motivates dedicated searches in upcoming LHC Runs. △ Less

Submitted 25 April, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

Comments: 26 pages, 7 figures, 1 table, matches the published version

Journal ref: Phys. Rev. D 109 (2024), 075043

arXiv:2312.06152 [pdf, other]

Improving the performance of weak supervision searches using transfer and meta-learning

Authors: Hugues Beauchesne, Zong-En Chen, Cheng-Wei Chiang

Abstract: Weak supervision searches have in principle the advantages of both being able to train on experimental data and being able to learn distinctive signal properties. However, the practical applicability of such searches is limited by the fact that successfully training a neural network via weak supervision can require a large amount of signal. In this work, we seek to create neural networks that can… ▽ More Weak supervision searches have in principle the advantages of both being able to train on experimental data and being able to learn distinctive signal properties. However, the practical applicability of such searches is limited by the fact that successfully training a neural network via weak supervision can require a large amount of signal. In this work, we seek to create neural networks that can learn from less experimental signal by using transfer and meta-learning. The general idea is to first train a neural network on simulations, thereby learning concepts that can be reused or becoming a more efficient learner. The neural network would then be trained on experimental data and should require less signal because of its previous training. We find that transfer and meta-learning can substantially improve the performance of weak supervision searches. △ Less

Submitted 1 March, 2024; v1 submitted 11 December, 2023; originally announced December 2023.

Comments: 20 pages, 7 figures, matches the published version

arXiv:2311.13753 [pdf, other]

Pseudo-Nambu-Goldstone Dark Matter in $SU(7)$ Grand Unification

Authors: Cheng-Wei Chiang, Koji Tsumura, Yoshiki Uchida, Naoki Yamatsu

Abstract: We propose a grand unified theory (GUT) pseudo-Nambu-Goldstone boson (pNGB) dark matter (DM) model based on $SU(7)$ gauge symmetry. In the GUT model, the Standard Model (SM) gauge symmetry $G_{\rm SM} := SU(3)_C\times SU(2)_L\times U(1)_Y$ and the ``dark'' gauge symmetry $SU(2)_D$ are unified, where the $SU(2)_D$ symmetry plays an important role in the stability of DM. The unification of SM fermio… ▽ More We propose a grand unified theory (GUT) pseudo-Nambu-Goldstone boson (pNGB) dark matter (DM) model based on $SU(7)$ gauge symmetry. In the GUT model, the Standard Model (SM) gauge symmetry $G_{\rm SM} := SU(3)_C\times SU(2)_L\times U(1)_Y$ and the ``dark'' gauge symmetry $SU(2)_D$ are unified, where the $SU(2)_D$ symmetry plays an important role in the stability of DM. The unification of SM fermions and dark sector fermions is partially realized. The gauge symmetry $SU(7)$ is spontaneously broken to $SU(5)\times SU(2)\times U(1)$ gauge symmetry at the GUT scale by the nonvanishing vacuum expectation values of an $SU(7)$ adjoint scalar field. The symmetry is further broken to $G_{\rm SM}\times SU(2)_D$ at an intermediate scale. Furthermore, the $SU(2)_D$ symmetry is broken by the $SU(2)_D$ doublet and triplet scalar fields at the TeV scale. In the pNGB DM model based on $G_{\rm SM}\times SU(2)_D$, the residual global $U(1)_V$ dark custodial symmetry guarantees DM stability. On the other hand, in the $SU(7)$ pNGB DM model, this global symmetry is explicitly broken by the Yukawa interaction and the effective Majorana mass terms. To maintain $U(1)_V$ symmetry and thus the DM stability, we need to tune Yukawa coupling constants and cubic scalar couplings at high accuracy. We find that the allowed DM mass region is quite restricted as the gauge coupling constant of $SU(2)_D$ is determined by the condition of the gauge coupling unification. To satisfy gauge coupling unification and the current experimental constraint on proton lifetime, we find that three generations of $SU(3)_C$ adjoint fermions and another three generations of $SU(2)_L$ adjoint fermions with the intermediate mass scale are required. We also find that there is no other solution to satisfy simultaneously the gauge coupling unification and the proton decay constraint if one assumes the other symmetry breaking schemes. △ Less

Submitted 25 March, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

Comments: 25 pages, 6 tables, 4 figures; some paragraphs added; typos corrected; accepted for publication in Physical Review D

Report number: KYUSHU-HET-272

arXiv:2311.10798 [pdf, other]

INSPECT: A Multimodal Dataset for Pulmonary Embolism Diagnosis and Prognosis

Authors: Shih-Cheng Huang, Zepeng Huo, Ethan Steinberg, Chia-Chun Chiang, Matthew P. Lungren, Curtis P. Langlotz, Serena Yeung, Nigam H. Shah, Jason A. Fries

Abstract: Synthesizing information from multiple data sources plays a crucial role in the practice of modern medicine. Current applications of artificial intelligence in medicine often focus on single-modality data due to a lack of publicly available, multimodal medical datasets. To address this limitation, we introduce INSPECT, which contains de-identified longitudinal records from a large cohort of patien… ▽ More Synthesizing information from multiple data sources plays a crucial role in the practice of modern medicine. Current applications of artificial intelligence in medicine often focus on single-modality data due to a lack of publicly available, multimodal medical datasets. To address this limitation, we introduce INSPECT, which contains de-identified longitudinal records from a large cohort of patients at risk for pulmonary embolism (PE), along with ground truth labels for multiple outcomes. INSPECT contains data from 19,402 patients, including CT images, radiology report impression sections, and structured electronic health record (EHR) data (i.e. demographics, diagnoses, procedures, vitals, and medications). Using INSPECT, we develop and release a benchmark for evaluating several baseline modeling approaches on a variety of important PE related tasks. We evaluate image-only, EHR-only, and multimodal fusion models. Trained models and the de-identified dataset are made available for non-commercial use under a data use agreement. To the best of our knowledge, INSPECT is the largest multimodal dataset integrating 3D medical imaging and EHR for reproducible methods evaluation and research. △ Less

Submitted 17 November, 2023; originally announced November 2023.

arXiv:2311.07512 [pdf, other]

doi 10.21105/astro.2311.07512

Galaxy Clusters Discovered via the Thermal Sunyaev-Zel'dovich Effect in the 500-square-degree SPTpol Survey

Authors: L. E. Bleem, M. Klein, T. M. C. Abbott, P. A. R. Ade, M. Aguena, O. Alves, A. J. Anderson, F. Andrade-Oliveira, B. Ansarinejad, M. Archipley, M. L. N. Ashby, J. E. Austermann, D. Bacon, J. A. Beall, A. N. Bender, B. A. Benson, F. Bianchini, S. Bocquet, D. Brooks, D. L. Burke, M. Calzadilla, J. E. Carlstrom, A. Carnero Rosell, J. Carretero, C. L. Chang , et al. (103 additional authors not shown)

Abstract: We present a catalog of 689 galaxy cluster candidates detected at significance $ξ>4$ via their thermal Sunyaev-Zel'dovich (SZ) effect signature in 95 and 150 GHz data from the 500-square-degree SPTpol survey. We use optical and infrared data from the Dark Energy Camera and the Wide-field Infrared Survey Explorer (WISE) and \spitzer \ satellites, to confirm 544 of these candidates as clusters with… ▽ More We present a catalog of 689 galaxy cluster candidates detected at significance $ξ>4$ via their thermal Sunyaev-Zel'dovich (SZ) effect signature in 95 and 150 GHz data from the 500-square-degree SPTpol survey. We use optical and infrared data from the Dark Energy Camera and the Wide-field Infrared Survey Explorer (WISE) and \spitzer \ satellites, to confirm 544 of these candidates as clusters with $\sim94\%$ purity. The sample has an approximately redshift-independent mass threshold at redshift $z>0.25$ and spans $1.5 \times 10^{14} < M_{500c} < 9.1 \times 10^{14}$ $M_\odot/h_{70}$ \ and $0.03<z\lesssim1.6$ in mass and redshift, respectively; 21\% of the confirmed clusters are at $z>1$. We use external radio data from the Sydney University Molonglo Sky Survey (SUMSS) to estimate contamination to the SZ signal from synchrotron sources. The contamination reduces the recovered $ξ$ by a median value of 0.032, or $\sim0.8\%$ of the $ξ=4$ threshold value, and $\sim7\%$ of candidates have a predicted contamination greater than $Δξ= 1$. With the exception of a small number of systems $(<1\%)$, an analysis of clusters detected in single-frequency 95 and 150 GHz data shows no significant contamination of the SZ signal by emission from dusty or synchrotron sources. This cluster sample will be a key component in upcoming astrophysical and cosmological analyses of clusters. The SPTpol millimeter-wave maps and associated data products used to produce this sample are available at https://pole.uchicago.edu/public/data/sptpol_500d_clusters/index.html, and the NASA LAMBDA website. An interactive sky server with the SPTpol maps and Dark Energy Survey data release 2 images is also available at NCSA https://skyviewer.ncsa.illinois.edu. △ Less

Submitted 8 February, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

Comments: Matches version accepted by OJA. 19 pages + references, 14 figures, cluster candidate table provided in Appendix. Data products available at https://pole.uchicago.edu/public/data/sptpol_500d_clusters/index.html and an interactive sky server at https://skyviewer.ncsa.illinois.edu

Journal ref: Open Journal of Astrophysics, Volume 7, 2024

arXiv:2310.16146 [pdf, other]

Clinfo.ai: An Open-Source Retrieval-Augmented Large Language Model System for Answering Medical Questions using Scientific Literature

Authors: Alejandro Lozano, Scott L Fleming, Chia-Chun Chiang, Nigam Shah

Abstract: The quickly-expanding nature of published medical literature makes it challenging for clinicians and researchers to keep up with and summarize recent, relevant findings in a timely manner. While several closed-source summarization tools based on large language models (LLMs) now exist, rigorous and systematic evaluations of their outputs are lacking. Furthermore, there is a paucity of high-quality… ▽ More The quickly-expanding nature of published medical literature makes it challenging for clinicians and researchers to keep up with and summarize recent, relevant findings in a timely manner. While several closed-source summarization tools based on large language models (LLMs) now exist, rigorous and systematic evaluations of their outputs are lacking. Furthermore, there is a paucity of high-quality datasets and appropriate benchmark tasks with which to evaluate these tools. We address these issues with four contributions: we release Clinfo.ai, an open-source WebApp that answers clinical questions based on dynamically retrieved scientific literature; we specify an information retrieval and abstractive summarization task to evaluate the performance of such retrieval-augmented LLM systems; we release a dataset of 200 questions and corresponding answers derived from published systematic reviews, which we name PubMed Retrieval and Synthesis (PubMedRS-200); and report benchmark results for Clinfo.ai and other publicly available OpenQA systems on PubMedRS-200. △ Less

Submitted 24 October, 2023; originally announced October 2023.

arXiv:2310.15211 [pdf, other]

Modeling Path Importance for Effective Alzheimer's Disease Drug Repurposing

Authors: Shunian Xiang, Patrick J. Lawrence, Bo Peng, ChienWei Chiang, Dokyoon Kim, Li Shen, Xia Ning

Abstract: Recently, drug repurposing has emerged as an effective and resource-efficient paradigm for AD drug discovery. Among various methods for drug repurposing, network-based methods have shown promising results as they are capable of leveraging complex networks that integrate multiple interaction types, such as protein-protein interactions, to more effectively identify candidate drugs. However, existing… ▽ More Recently, drug repurposing has emerged as an effective and resource-efficient paradigm for AD drug discovery. Among various methods for drug repurposing, network-based methods have shown promising results as they are capable of leveraging complex networks that integrate multiple interaction types, such as protein-protein interactions, to more effectively identify candidate drugs. However, existing approaches typically assume paths of the same length in the network have equal importance in identifying the therapeutic effect of drugs. Other domains have found that same length paths do not necessarily have the same importance. Thus, relying on this assumption may be deleterious to drug repurposing attempts. In this work, we propose MPI (Modeling Path Importance), a novel network-based method for AD drug repurposing. MPI is unique in that it prioritizes important paths via learned node embeddings, which can effectively capture a network's rich structural information. Thus, leveraging learned embeddings allows MPI to effectively differentiate the importance among paths. We evaluate MPI against a commonly used baseline method that identifies anti-AD drug candidates primarily based on the shortest paths between drugs and AD in the network. We observe that among the top-50 ranked drugs, MPI prioritizes 20.0% more drugs with anti-AD evidence compared to the baseline. Finally, Cox proportional-hazard models produced from insurance claims data aid us in identifying the use of etodolac, nicotine, and BBB-crossing ACE-INHs as having a reduced risk of AD, suggesting such drugs may be viable candidates for repurposing and should be explored further in future studies. △ Less

Submitted 27 October, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

Comments: 16 pages, 3 figures, 2 tables, 1 supplementary figure, 5 supplementary tables, Preprint of an article accepted for publication in Pacific Symposium on Biocomputing ©2023 World Scientific Publishing Co., Singapore, http://psb.stanford.edu/

arXiv:2310.07741 [pdf, other]

doi 10.3847/1538-4357/ad0f1b

Simulating the Detection of the Global 21 cm Signal with MIST for Different Models of the Soil and Beam Directivity

Authors: Raul A. Monsalve, Christian H. Bye, Jonathan L. Sievers, Vadym Bidula, Ricardo Bustos, H. Cynthia Chiang, Xinze Guo, Ian Hendricksen, Francis McGee, F. Patricio Mena, Garima Prabhakar, Oscar Restrepo, Nithyanandan Thyagarajan

Abstract: The Mapper of the IGM Spin Temperature (MIST) is a new ground-based, single-antenna, radio experiment attempting to detect the global 21 cm signal from the Dark Ages and Cosmic Dawn. A significant challenge in this measurement is the frequency-dependence, or chromaticity, of the antenna beam directivity. MIST observes with the antenna above the soil and without a metal ground plane, and the beam d… ▽ More The Mapper of the IGM Spin Temperature (MIST) is a new ground-based, single-antenna, radio experiment attempting to detect the global 21 cm signal from the Dark Ages and Cosmic Dawn. A significant challenge in this measurement is the frequency-dependence, or chromaticity, of the antenna beam directivity. MIST observes with the antenna above the soil and without a metal ground plane, and the beam directivity is sensitive to the electrical characteristics of the soil. In this paper, we use simulated observations with MIST to study how the detection of the global 21 cm signal from Cosmic Dawn is affected by the soil and the MIST beam directivity. We simulate observations using electromagnetic models of the directivity computed for single- and two-layer models of the soil. We test the recovery of the Cosmic Dawn signal with and without beam chromaticity correction applied to the simulated data. We find that our single-layer soil models enable a straightforward recovery of the signal even without chromaticity correction. Two-layer models increase the beam chromaticity and make the recovery more challenging. However, for the model in which the bottom soil layer has a lower electrical conductivity than the top layer, the signal can be recovered even without chromaticity correction. For the other two-layer models, chromaticity correction is necessary for the recovery of the signal and the accuracy requirements for the soil parameters vary between models. These results will be used as a guideline to select observation sites that are favorable for the detection of the Cosmic Dawn signal. △ Less

Submitted 23 May, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

Comments: Matches version published in The Astrophysical Journal. Journal reference and DOI provided

Journal ref: 2024, ApJ, 961, 56

arXiv:2310.05657 [pdf, other]

A Closer Look into Automatic Evaluation Using Large Language Models

Authors: Cheng-Han Chiang, Hung-yi Lee

Abstract: Using large language models (LLMs) to evaluate text quality has recently gained popularity. Some prior works explore the idea of using LLMs for evaluation, while they differ in some details of the evaluation process. In this paper, we analyze LLM evaluation (Chiang and Lee, 2023) and G-Eval (Liu et al., 2023), and we discuss how those details in the evaluation process change how well the ratings g… ▽ More Using large language models (LLMs) to evaluate text quality has recently gained popularity. Some prior works explore the idea of using LLMs for evaluation, while they differ in some details of the evaluation process. In this paper, we analyze LLM evaluation (Chiang and Lee, 2023) and G-Eval (Liu et al., 2023), and we discuss how those details in the evaluation process change how well the ratings given by LLMs correlate with human ratings. We find that the auto Chain-of-Thought (CoT) used in G-Eval does not always make G-Eval more aligned with human ratings. We also show that forcing the LLM to output only a numeric rating, as in G-Eval, is suboptimal. Last, we reveal that asking the LLM to explain its own ratings consistently improves the correlation between the ChatGPT and human ratings and pushes state-of-the-art (SoTA) correlations on two meta-evaluation datasets. △ Less

Submitted 9 October, 2023; originally announced October 2023.

Comments: EMNLP 2023 findings (short paper). Code: https://github.com/d223302/A-Closer-Look-To-LLM-Evaluation/

arXiv:2309.14774 [pdf, other]

BLIP-Adapter: Parameter-Efficient Transfer Learning for Mobile Screenshot Captioning

Authors: Ching-Yu Chiang, I-Hua Chang, Shih-Wei Liao

Abstract: This study aims to explore efficient tuning methods for the screenshot captioning task. Recently, image captioning has seen significant advancements, but research in captioning tasks for mobile screens remains relatively scarce. Current datasets and use cases describing user behaviors within product screenshots are notably limited. Consequently, we sought to fine-tune pre-existing models for the s… ▽ More This study aims to explore efficient tuning methods for the screenshot captioning task. Recently, image captioning has seen significant advancements, but research in captioning tasks for mobile screens remains relatively scarce. Current datasets and use cases describing user behaviors within product screenshots are notably limited. Consequently, we sought to fine-tune pre-existing models for the screenshot captioning task. However, fine-tuning large pre-trained models can be resource-intensive, requiring considerable time, computational power, and storage due to the vast number of parameters in image captioning models. To tackle this challenge, this study proposes a combination of adapter methods, which necessitates tuning only the additional modules on the model. These methods are originally designed for vision or language tasks, and our intention is to apply them to address similar challenges in screenshot captioning. By freezing the parameters of the image caption models and training only the weights associated with the methods, performance comparable to fine-tuning the entire model can be achieved, while significantly reducing the number of parameters. This study represents the first comprehensive investigation into the effectiveness of combining adapters within the context of the screenshot captioning task. Through our experiments and analyses, this study aims to provide valuable insights into the application of adapters in vision-language models and contribute to the development of efficient tuning techniques for the screenshot captioning task. Our study is available at https://github.com/RainYuGG/BLIP-Adapter △ Less

Submitted 26 September, 2023; originally announced September 2023.

arXiv:2309.12904 [pdf, other]

Flavor anomalies in leptoquark model with gauged $U(1)_{L_μ-L_τ}$

Authors: Chuan-Hung Chen, Cheng-Wei Chiang

Abstract: Leptoquarks (LQs) have been extensively studied in the context of $B$ anomalies. When $U(1)_{L_μ-L_τ}$ is introduced to a scalar LQ model with the LQ $S_1$ charged under the new symmetry, $S_1$ primarily couples to the third-generation leptons while its couplings to first and second-generation leptons are naturally suppressed. Furthermore, only $S_1$ in the scalar LQ models has the feature that do… ▽ More Leptoquarks (LQs) have been extensively studied in the context of $B$ anomalies. When $U(1)_{L_μ-L_τ}$ is introduced to a scalar LQ model with the LQ $S_1$ charged under the new symmetry, $S_1$ primarily couples to the third-generation leptons while its couplings to first and second-generation leptons are naturally suppressed. Furthermore, only $S_1$ in the scalar LQ models has the feature that down-type quarks merely couple to neutrinos but not the charged leptons, avoiding strict restrictions from $b\to s μ^+ μ^-$. With this distinctive characteristic of $S_1$, we investigate its impact on rare processes involving the $d_i \to d_j ν\barν$ transitions. Under the dominant constraints from $ΔF=2$ processes, we find that the $S_1$ contributions to the branching ratios (BRs) of $B\to K(K^*) ν\barν$ and $K_L \to π^0 ν\barν$ can be factorized into the same multiplicative factor multiplying the standard model predictions. Enhancement in the BRs can possibly exceed a factor of 2. In particular, ${\cal B}(K^+\to π^+ ν\barν)$ can reach the upper $1σ$ error of the experimental value, i.e., $\simeq 15.4 \times 10^{-11}$. We also show that the model can fit the new world averages of $R(D)$ and $R(D^*)$. △ Less

Submitted 4 October, 2023; v1 submitted 22 September, 2023; originally announced September 2023.

Comments: 14 pages, 3 figures, figures updated, typos corrected, references added

arXiv:2309.08216 [pdf, other]

Unified Risk Analysis for Weakly Supervised Learning

Authors: Chao-Kai Chiang, Masashi Sugiyama

Abstract: Among the flourishing research of weakly supervised learning (WSL), we recognize the lack of a unified interpretation of the mechanism behind the weakly supervised scenarios, let alone a systematic treatment of the risk rewrite problem, a crucial step in the empirical risk minimization approach. In this paper, we introduce a framework providing a comprehensive understanding and a unified methodolo… ▽ More Among the flourishing research of weakly supervised learning (WSL), we recognize the lack of a unified interpretation of the mechanism behind the weakly supervised scenarios, let alone a systematic treatment of the risk rewrite problem, a crucial step in the empirical risk minimization approach. In this paper, we introduce a framework providing a comprehensive understanding and a unified methodology for WSL. The formulation component of the framework, leveraging a contamination perspective, provides a unified interpretation of how weak supervision is formed and subsumes fifteen existing WSL settings. The induced reduction graphs offer comprehensive connections over WSLs. The analysis component of the framework, viewed as a decontamination process, provides a systematic method of conducting risk rewrite. In addition to the conventional inverse matrix approach, we devise a novel strategy called marginal chain aiming to decontaminate distributions. We justify the feasibility of the proposed framework by recovering existing rewrites reported in the literature. △ Less

Submitted 15 September, 2023; originally announced September 2023.

arXiv:2309.02996 [pdf, other]

doi 10.1093/mnras/stae1138

Mapper of the IGM spin temperature: instrument overview

Authors: R. A. Monsalve, C. Altamirano, V. Bidula, R. Bustos, C. H. Bye, H. C. Chiang, M. Diaz, B. Fernandez, X. Guo, I. Hendricksen, E. Hornecker, F. Lucero, H. Mani, F. McGee, F. P. Mena, M. Pessoa, G. Prabhakar, O. Restrepo, J. L. Sievers, N. Thyagarajan

Abstract: The observation of the global 21 cm signal produced by neutral hydrogen gas in the intergalactic medium (IGM) during the Dark Ages, Cosmic Dawn, and Epoch of Reionization requires measurements with extremely well-calibrated wideband radiometers. We describe the design and characterization of the Mapper of the IGM Spin Temperature (MIST), which is a new ground-based, single-antenna, global 21 cm ex… ▽ More The observation of the global 21 cm signal produced by neutral hydrogen gas in the intergalactic medium (IGM) during the Dark Ages, Cosmic Dawn, and Epoch of Reionization requires measurements with extremely well-calibrated wideband radiometers. We describe the design and characterization of the Mapper of the IGM Spin Temperature (MIST), which is a new ground-based, single-antenna, global 21 cm experiment. The design of MIST was guided by the objectives of avoiding systematics from an antenna ground plane and cables around the antenna, as well as maximizing the instrument's on-sky efficiency and portability for operations at remote sites. We have built two MIST instruments, which observe in the range 25-105 MHz. For the 21 cm signal, this frequency range approximately corresponds to redshifts 55.5 > z > 12.5, encompassing the Dark Ages and Cosmic Dawn. The MIST antenna is a horizontal blade dipole of 2.42 m in length, 60 cm in width, and 52 cm in height above the ground. This antenna operates without a metal ground plane. The instruments run on 12 V batteries and have a maximum power consumption of 17 W. The batteries and electronics are contained in a single receiver box located under the antenna. We present the characterization of the instruments using electromagnetic simulations and lab measurements. We also show sample sky measurements from recent observations at remote sites in California, Nevada, and the Canadian High Arctic. These measurements indicate that the instruments perform as expected. Detailed analyses of the sky measurements are left for future work. △ Less

Submitted 23 May, 2024; v1 submitted 6 September, 2023; originally announced September 2023.

Comments: Matches version published in MNRAS

Journal ref: Monthly Notices of the Royal Astronomical Society, Volume 530, Issue 4, June 2024, Pages 4125-4147

arXiv:2308.14763 [pdf, other]

VoiceBank-2023: A Multi-Speaker Mandarin Speech Corpus for Constructing Personalized TTS Systems for the Speech Impaired

Authors: Jia-Jyu Su, Pang-Chen Liao, Yen-Ting Lin, Wu-Hao Li, Guan-Ting Liou, Cheng-Che Kao, Wei-Cheng Chen, Jen-Chieh Chiang, Wen-Yang Chang, Pin-Han Lin, Chen-Yu Chiang

Abstract: Services of personalized TTS systems for the Mandarin-speaking speech impaired are rarely mentioned. Taiwan started the VoiceBanking project in 2020, aiming to build a complete set of services to deliver personalized Mandarin TTS systems to amyotrophic lateral sclerosis patients. This paper reports the corpus design, corpus recording, data purging and correction for the corpus, and evaluations of… ▽ More Services of personalized TTS systems for the Mandarin-speaking speech impaired are rarely mentioned. Taiwan started the VoiceBanking project in 2020, aiming to build a complete set of services to deliver personalized Mandarin TTS systems to amyotrophic lateral sclerosis patients. This paper reports the corpus design, corpus recording, data purging and correction for the corpus, and evaluations of the developed personalized TTS systems, for the VoiceBanking project. The developed corpus is named after the VoiceBank-2023 speech corpus because of its release year. The corpus contains 29.78 hours of utterances with prompts of short paragraphs and common phrases spoken by 111 native Mandarin speakers. The corpus is labeled with information about gender, degree of speech impairment, types of users, transcription, SNRs, and speaking rates. The VoiceBank-2023 is available by request for non-commercial use and welcomes all parties to join the VoiceBanking project to improve the services for the speech impaired. △ Less

Submitted 27 August, 2023; originally announced August 2023.

Comments: submitted to 26th International Conference of the ORIENTAL-COCOSDA

arXiv:2308.14089 [pdf, other]

MedAlign: A Clinician-Generated Dataset for Instruction Following with Electronic Medical Records

Authors: Scott L. Fleming, Alejandro Lozano, William J. Haberkorn, Jenelle A. Jindal, Eduardo P. Reis, Rahul Thapa, Louis Blankemeier, Julian Z. Genkins, Ethan Steinberg, Ashwin Nayak, Birju S. Patel, Chia-Chun Chiang, Alison Callahan, Zepeng Huo, Sergios Gatidis, Scott J. Adams, Oluseyi Fayanju, Shreya J. Shah, Thomas Savage, Ethan Goh, Akshay S. Chaudhari, Nima Aghaeepour, Christopher Sharp, Michael A. Pfeffer, Percy Liang , et al. (5 additional authors not shown)

Abstract: The ability of large language models (LLMs) to follow natural language instructions with human-level fluency suggests many opportunities in healthcare to reduce administrative burden and improve quality of care. However, evaluating LLMs on realistic text generation tasks for healthcare remains challenging. Existing question answering datasets for electronic health record (EHR) data fail to capture… ▽ More The ability of large language models (LLMs) to follow natural language instructions with human-level fluency suggests many opportunities in healthcare to reduce administrative burden and improve quality of care. However, evaluating LLMs on realistic text generation tasks for healthcare remains challenging. Existing question answering datasets for electronic health record (EHR) data fail to capture the complexity of information needs and documentation burdens experienced by clinicians. To address these challenges, we introduce MedAlign, a benchmark dataset of 983 natural language instructions for EHR data. MedAlign is curated by 15 clinicians (7 specialities), includes clinician-written reference responses for 303 instructions, and provides 276 longitudinal EHRs for grounding instruction-response pairs. We used MedAlign to evaluate 6 general domain LLMs, having clinicians rank the accuracy and quality of each LLM response. We found high error rates, ranging from 35% (GPT-4) to 68% (MPT-7B-Instruct), and an 8.3% drop in accuracy moving from 32k to 2k context lengths for GPT-4. Finally, we report correlations between clinician rankings and automated natural language generation metrics as a way to rank LLMs without human review. We make MedAlign available under a research data use agreement to enable LLM evaluations on tasks aligned with clinician needs and preferences. △ Less

Submitted 24 December, 2023; v1 submitted 27 August, 2023; originally announced August 2023.

arXiv:2308.13666 [pdf, other]

A Joint Fermi-GBM and Swift-BAT Analysis of Gravitational-Wave Candidates from the Third Gravitational-wave Observing Run

Authors: C. Fletcher, J. Wood, R. Hamburg, P. Veres, C. M. Hui, E. Bissaldi, M. S. Briggs, E. Burns, W. H. Cleveland, M. M. Giles, A. Goldstein, B. A. Hristov, D. Kocevski, S. Lesage, B. Mailyan, C. Malacaria, S. Poolakkil, A. von Kienlin, C. A. Wilson-Hodge, The Fermi Gamma-ray Burst Monitor Team, M. Crnogorčević, J. DeLaunay, A. Tohuvavohu, R. Caputo, S. B. Cenko , et al. (1674 additional authors not shown)

Abstract: We present Fermi Gamma-ray Burst Monitor (Fermi-GBM) and Swift Burst Alert Telescope (Swift-BAT) searches for gamma-ray/X-ray counterparts to gravitational wave (GW) candidate events identified during the third observing run of the Advanced LIGO and Advanced Virgo detectors. Using Fermi-GBM on-board triggers and sub-threshold gamma-ray burst (GRB) candidates found in the Fermi-GBM ground analyses,… ▽ More We present Fermi Gamma-ray Burst Monitor (Fermi-GBM) and Swift Burst Alert Telescope (Swift-BAT) searches for gamma-ray/X-ray counterparts to gravitational wave (GW) candidate events identified during the third observing run of the Advanced LIGO and Advanced Virgo detectors. Using Fermi-GBM on-board triggers and sub-threshold gamma-ray burst (GRB) candidates found in the Fermi-GBM ground analyses, the Targeted Search and the Untargeted Search, we investigate whether there are any coincident GRBs associated with the GWs. We also search the Swift-BAT rate data around the GW times to determine whether a GRB counterpart is present. No counterparts are found. Using both the Fermi-GBM Targeted Search and the Swift-BAT search, we calculate flux upper limits and present joint upper limits on the gamma-ray luminosity of each GW. Given these limits, we constrain theoretical models for the emission of gamma-rays from binary black hole mergers. △ Less

Submitted 25 August, 2023; originally announced August 2023.

arXiv:2308.13229 [pdf, other]

ReST: A Reconfigurable Spatial-Temporal Graph Model for Multi-Camera Multi-Object Tracking

Authors: Cheng-Che Cheng, Min-Xuan Qiu, Chen-Kuo Chiang, Shang-Hong Lai

Abstract: Multi-Camera Multi-Object Tracking (MC-MOT) utilizes information from multiple views to better handle problems with occlusion and crowded scenes. Recently, the use of graph-based approaches to solve tracking problems has become very popular. However, many current graph-based methods do not effectively utilize information regarding spatial and temporal consistency. Instead, they rely on single-came… ▽ More Multi-Camera Multi-Object Tracking (MC-MOT) utilizes information from multiple views to better handle problems with occlusion and crowded scenes. Recently, the use of graph-based approaches to solve tracking problems has become very popular. However, many current graph-based methods do not effectively utilize information regarding spatial and temporal consistency. Instead, they rely on single-camera trackers as input, which are prone to fragmentation and ID switch errors. In this paper, we propose a novel reconfigurable graph model that first associates all detected objects across cameras spatially before reconfiguring it into a temporal graph for Temporal Association. This two-stage association approach enables us to extract robust spatial and temporal-aware features and address the problem with fragmented tracklets. Furthermore, our model is designed for online tracking, making it suitable for real-world applications. Experimental results show that the proposed graph model is able to extract more discriminating features for object tracking, and our model achieves state-of-the-art performance on several public datasets. △ Less

Submitted 25 August, 2023; originally announced August 2023.

Comments: Accepted by ICCV2023

arXiv:2308.06901 [pdf, other]

Contributions of inert electroweak multiplets to Higgs properties

Authors: Hugues Beauchesne, Cheng-Wei Chiang

Abstract: New physics could manifest itself in the form of electroweak multiplets that interact at tree level with the Higgs boson but do not mix with Standard Model fields or acquire expectation values. In this paper, we study the potential contributions of such inert multiplets to several crucial Higgs properties, namely, the branching ratio of the Higgs to a $Z$ boson and a photon (or massless dark photo… ▽ More New physics could manifest itself in the form of electroweak multiplets that interact at tree level with the Higgs boson but do not mix with Standard Model fields or acquire expectation values. In this paper, we study the potential contributions of such inert multiplets to several crucial Higgs properties, namely, the branching ratio of the Higgs to a $Z$ boson and a photon (or massless dark photon) and the triple Higgs coupling. Constraints from the Higgs signal strengths, oblique parameters and unitarity are taken into account. △ Less

Submitted 18 January, 2024; v1 submitted 13 August, 2023; originally announced August 2023.

Comments: 25 pages, 4 figures, matches published version

arXiv:2308.03822 [pdf, other]

Search for Eccentric Black Hole Coalescences during the Third Observing Run of LIGO and Virgo

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Abac, R. Abbott, H. Abe, F. Acernese, K. Ackley, C. Adamcewicz, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, I. Aguilar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi , et al. (1750 additional authors not shown)

Abstract: Despite the growing number of confident binary black hole coalescences observed through gravitational waves so far, the astrophysical origin of these binaries remains uncertain. Orbital eccentricity is one of the clearest tracers of binary formation channels. Identifying binary eccentricity, however, remains challenging due to the limited availability of gravitational waveforms that include effect… ▽ More Despite the growing number of confident binary black hole coalescences observed through gravitational waves so far, the astrophysical origin of these binaries remains uncertain. Orbital eccentricity is one of the clearest tracers of binary formation channels. Identifying binary eccentricity, however, remains challenging due to the limited availability of gravitational waveforms that include effects of eccentricity. Here, we present observational results for a waveform-independent search sensitive to eccentric black hole coalescences, covering the third observing run (O3) of the LIGO and Virgo detectors. We identified no new high-significance candidates beyond those that were already identified with searches focusing on quasi-circular binaries. We determine the sensitivity of our search to high-mass (total mass $M>70$ $M_\odot$) binaries covering eccentricities up to 0.3 at 15 Hz orbital frequency, and use this to compare model predictions to search results. Assuming all detections are indeed quasi-circular, for our fiducial population model, we place an upper limit for the merger rate density of high-mass binaries with eccentricities $0 < e \leq 0.3$ at $0.33$ Gpc$^{-3}$ yr$^{-1}$ at 90\% confidence level. △ Less

Submitted 7 August, 2023; originally announced August 2023.

Comments: 24 pages, 5 figures

Report number: LIGO-P2300080

arXiv:2307.00746 [pdf, other]

Clockwork axion footprint on nano-hertz stochastic gravitational wave background

Authors: Bo-Qiang Lu, Cheng-Wei Chiang, Tianjun Li

Abstract: The recent Pulsar Timing Arrays (PTAs) nano-Hz gravitational wave (GW) background signal can be naturally induced by the annihilation of domain walls (DWs) formed at a symmetry-breaking scale $f\simeq 200$~TeV in the clockwork axion framework. Based on our first successful and precise prediction, we for the first time suggest that the recent PTA observations strongly support the novel mechanism of… ▽ More The recent Pulsar Timing Arrays (PTAs) nano-Hz gravitational wave (GW) background signal can be naturally induced by the annihilation of domain walls (DWs) formed at a symmetry-breaking scale $f\simeq 200$~TeV in the clockwork axion framework. Based on our first successful and precise prediction, we for the first time suggest that the recent PTA observations strongly support the novel mechanism of the QCD instanton-induced DW annihilation in the clockwork axion framework. We also for the first time discover a novel correlation between dark matter (DM) relic abundance and nano-Hz GW background, which in turn indicates a natural connection between the axion decay constant and the symmetry-breaking scale in the clockwork framework. We find that the GW signal has a peak $h^2Ω_{\rm GW}\simeq 10^{-6.6}-10^{-6.1}$ at about 50~nHz, which is definite and testable for future PTA data at frequencies $\gtrsim 25$~nHz and CMB-S4 experiment. We also propose various phenomena that may appear in PTAs and future GW interferometers. △ Less

Submitted 4 April, 2024; v1 submitted 3 July, 2023; originally announced July 2023.

Comments: 5+8 pages, 3+3 figures, 3 tables, accepted for publication in PRD Lett

arXiv:2306.05083 [pdf, other]

Revealing the Blind Spot of Sentence Encoder Evaluation by HEROS

Authors: Cheng-Han Chiang, Yung-Sung Chuang, James Glass, Hung-yi Lee

Abstract: Existing sentence textual similarity benchmark datasets only use a single number to summarize how similar the sentence encoder's decision is to humans'. However, it is unclear what kind of sentence pairs a sentence encoder (SE) would consider similar. Moreover, existing SE benchmarks mainly consider sentence pairs with low lexical overlap, so it is unclear how the SEs behave when two sentences hav… ▽ More Existing sentence textual similarity benchmark datasets only use a single number to summarize how similar the sentence encoder's decision is to humans'. However, it is unclear what kind of sentence pairs a sentence encoder (SE) would consider similar. Moreover, existing SE benchmarks mainly consider sentence pairs with low lexical overlap, so it is unclear how the SEs behave when two sentences have high lexical overlap. We introduce a high-quality SE diagnostic dataset, HEROS. HEROS is constructed by transforming an original sentence into a new sentence based on certain rules to form a \textit{minimal pair}, and the minimal pair has high lexical overlaps. The rules include replacing a word with a synonym, an antonym, a typo, a random word, and converting the original sentence into its negation. Different rules yield different subsets of HEROS. By systematically comparing the performance of over 60 supervised and unsupervised SEs on HEROS, we reveal that most unsupervised sentence encoders are insensitive to negation. We find the datasets used to train the SE are the main determinants of what kind of sentence pairs an SE considers similar. We also show that even if two SEs have similar performance on STS benchmarks, they can have very different behavior on HEROS. Our result reveals the blind spot of traditional STS benchmarks when evaluating SEs. △ Less

Submitted 13 June, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

Comments: ACL 2023 repl4nlp (representation learning for NLP) workshop poster paper. Dataset at https://huggingface.co/datasets/dcml0714/Heros

arXiv:2306.02044 [pdf, other]

Why We Should Report the Details in Subjective Evaluation of TTS More Rigorously

Authors: Cheng-Han Chiang, Wei-Ping Huang, Hung-yi Lee

Abstract: This paper emphasizes the importance of reporting experiment details in subjective evaluations and demonstrates how such details can significantly impact evaluation results in the field of speech synthesis. Through an analysis of 80 papers presented at INTERSPEECH 2022, we find a lack of thorough reporting on critical details such as evaluator recruitment and filtering, instructions and payments,… ▽ More This paper emphasizes the importance of reporting experiment details in subjective evaluations and demonstrates how such details can significantly impact evaluation results in the field of speech synthesis. Through an analysis of 80 papers presented at INTERSPEECH 2022, we find a lack of thorough reporting on critical details such as evaluator recruitment and filtering, instructions and payments, and the geographic and linguistic backgrounds of evaluators. To illustrate the effect of these details on evaluation outcomes, we conducted mean opinion score (MOS) tests on three well-known TTS systems under different evaluation settings and we obtain at least three distinct rankings of TTS models. We urge the community to report experiment details in subjective evaluations to improve the reliability and interpretability of experimental results. △ Less

Submitted 3 June, 2023; originally announced June 2023.

Comments: Interspeech 2023 camera-ready version

arXiv:2305.09256 [pdf, other]

doi 10.1103/PhysRevD.109.055038

Phenomenological study of a gauged ${L_μ-L_τ}$ model with a scalar leptoquark

Authors: Chuan-Hung Chen, Cheng-Wei Chiang, Chun-Wei Su

Abstract: A $Z'$ gauge boson with sub-GeV mass has acquired a significant interest in phenomenology, particularly in view of the muon $g-2$ anomaly and coherent elastic neutrino-nucleon scattering. The latter is challenged by the nuclear recoil energy of a few tens of keV but has been observed by the COHERENT experiment. To further reconcile the observed excesses in $R(D^{(*)})$ from semileptonic charmful… ▽ More A $Z'$ gauge boson with sub-GeV mass has acquired a significant interest in phenomenology, particularly in view of the muon $g-2$ anomaly and coherent elastic neutrino-nucleon scattering. The latter is challenged by the nuclear recoil energy of a few tens of keV but has been observed by the COHERENT experiment. To further reconcile the observed excesses in $R(D^{(*)})$ from semileptonic charmful $B$ decays and in the $W$ boson mass, we investigate a model with a gauged $U(1)_{L_μ-L_τ}$ symmetry and a scalar leptoquark. In contrast to the mechanism that involves kinetic mixing between the gauge bosons of $U(1)_{\rm em}$ and $U(1)_{L_μ-L_τ}$, we adopt a dynamical symmetry breaking of $U(1)_{L_μ-L_τ}$ by incorporating an additional Higgs doublet. Through mixing with the $U(1)_{L_μ-L_τ}$-charged Higgs doublet, new Higgs decay channels $h\to Z_1 Z_1/Z_1 Z_2$ occur at percent-level branching ratios, which could be accessible at the LHC. The $W$-mass anomaly observed by CDF II can be potentially resolved through the enhancement in the oblique parameter $T$. Due to the flavored gauge symmetry, the introduced scalar leptoquark $S^{\frac{1}{3}}=(\bar{3},1,2/3)$ exhibits a unique coupling to the $τ$-lepton, offering an explanation for the excesses observed in $R(D^{(*)})$. Moreover, $τ\to μ(Z_1\to ) e^- e^+$ via the resonant light gauge boson decay can reach the sensitivity of Belle II at an integrated luminosity of 50 ab$^{-1}$. △ Less

Submitted 23 March, 2024; v1 submitted 16 May, 2023; originally announced May 2023.

Comments: 50 pages, 10 figures, typos corrected and references added

Journal ref: Phys.Rev.D 109 (2024) 5, 055038

arXiv:2305.01937 [pdf, other]

Can Large Language Models Be an Alternative to Human Evaluations?

Authors: Cheng-Han Chiang, Hung-yi Lee

Abstract: Human evaluation is indispensable and inevitable for assessing the quality of texts generated by machine learning models or written by humans. However, human evaluation is very difficult to reproduce and its quality is notoriously unstable, hindering fair comparisons among different natural language processing (NLP) models and algorithms. Recently, large language models (LLMs) have demonstrated ex… ▽ More Human evaluation is indispensable and inevitable for assessing the quality of texts generated by machine learning models or written by humans. However, human evaluation is very difficult to reproduce and its quality is notoriously unstable, hindering fair comparisons among different natural language processing (NLP) models and algorithms. Recently, large language models (LLMs) have demonstrated exceptional performance on unseen tasks when only the task instructions are provided. In this paper, we explore if such an ability of the LLMs can be used as an alternative to human evaluation. We present the LLMs with the exact same instructions, samples to be evaluated, and questions used to conduct human evaluation, and then ask the LLMs to generate responses to those questions; we dub this LLM evaluation. We use human evaluation and LLM evaluation to evaluate the texts in two NLP tasks: open-ended story generation and adversarial attacks. We show that the result of LLM evaluation is consistent with the results obtained by expert human evaluation: the texts rated higher by human experts are also rated higher by the LLMs. We also find that the results of LLM evaluation are stable over different formatting of the task instructions and the sampling algorithm used to generate the answer. We are the first to show the potential of using LLMs to assess the quality of texts and discuss the limitations and ethical considerations of LLM evaluation. △ Less

Submitted 3 May, 2023; originally announced May 2023.

Comments: ACL 2023 main conference paper. Main content: 10 pages (including limitations). Appendix: 13 pages

arXiv:2304.08393 [pdf, other]

Search for gravitational-lensing signatures in the full third observing run of the LIGO-Virgo network

Authors: The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, R. Abbott, H. Abe, F. Acernese, K. Ackley, S. Adhicary, N. Adhikari, R. X. Adhikari, V. K. Adkins, V. B. Adya, C. Affeldt, D. Agarwal, M. Agathos, O. D. Aguiar, L. Aiello, A. Ain, P. Ajith, T. Akutsu, S. Albanesi, R. A. Alfaidi, C. Alléné, A. Allocca, P. A. Altin , et al. (1670 additional authors not shown)

Abstract: Gravitational lensing by massive objects along the line of sight to the source causes distortions of gravitational wave-signals; such distortions may reveal information about fundamental physics, cosmology and astrophysics. In this work, we have extended the search for lensing signatures to all binary black hole events from the third observing run of the LIGO--Virgo network. We search for repeated… ▽ More Gravitational lensing by massive objects along the line of sight to the source causes distortions of gravitational wave-signals; such distortions may reveal information about fundamental physics, cosmology and astrophysics. In this work, we have extended the search for lensing signatures to all binary black hole events from the third observing run of the LIGO--Virgo network. We search for repeated signals from strong lensing by 1) performing targeted searches for subthreshold signals, 2) calculating the degree of overlap amongst the intrinsic parameters and sky location of pairs of signals, 3) comparing the similarities of the spectrograms amongst pairs of signals, and 4) performing dual-signal Bayesian analysis that takes into account selection effects and astrophysical knowledge. We also search for distortions to the gravitational waveform caused by 1) frequency-independent phase shifts in strongly lensed images, and 2) frequency-dependent modulation of the amplitude and phase due to point masses. None of these searches yields significant evidence for lensing. Finally, we use the non-detection of gravitational-wave lensing to constrain the lensing rate based on the latest merger-rate estimates and the fraction of dark matter composed of compact objects. △ Less

Submitted 17 April, 2023; originally announced April 2023.

Comments: 28 pages, 11 figures

Report number: LIGO-P2200031

arXiv:2304.04165 [pdf, other]

doi 10.1103/PhysRevD.108.015018

Observability of the Higgs boson decay to a photon and a dark photon

Authors: Hugues Beauchesne, Cheng-Wei Chiang

Abstract: Many collider searches have attempted to detect the Higgs boson decaying to a photon and an invisible massless dark photon. For the branching ratio to this channel to be realistically observable at the LHC, there must exist new mediators that interact with both the standard model and the dark photon. In this paper, we study experimental and theoretical constraints on an extensive set of mediator m… ▽ More Many collider searches have attempted to detect the Higgs boson decaying to a photon and an invisible massless dark photon. For the branching ratio to this channel to be realistically observable at the LHC, there must exist new mediators that interact with both the standard model and the dark photon. In this paper, we study experimental and theoretical constraints on an extensive set of mediator models. We show that these constraints limit the Higgs branching ratio to a photon and a dark photon to be far smaller than the current sensitivity of collider searches. △ Less

Submitted 29 August, 2023; v1 submitted 9 April, 2023; originally announced April 2023.

Comments: 17 pages, 5 figures, matches published version

arXiv:2303.12053 [pdf]

Tomography Scan of Charge Density Wave in NbSe2

Authors: Jyun-Yu Wu, Yung-Ting Lee, Guan-Hao Chen, Zheng-Hong Li, Chang-Tsan Lee, Jie-Yu Hsu, Chia-Nung Kuo, Juhn-Jong Lin, Wen-Hao Chang, Chin-Shan Lue, Po-Tuan Cheng, Cheng-Tien Chiang, Chien-Cheng Kuo, Chien-Te Wu, Chi-Cheng Lee, Ming-Chiang Chung, Hung-Chung Hsueh, Chun-Liang Lin

Abstract: Charge density wave (CDW) resulted from a small distortion in the lattice is able to create new orders beyond the original lattice. In 2H-NbSe2, one of the layered transition metal dichalcogenides (TMD), the 3x3 charge order appears in two-dimensional (2D) layers. Although CDW is usually described by a sine wave, the spatial distribution within a 2D layer has never been systematically visualized.… ▽ More Charge density wave (CDW) resulted from a small distortion in the lattice is able to create new orders beyond the original lattice. In 2H-NbSe2, one of the layered transition metal dichalcogenides (TMD), the 3x3 charge order appears in two-dimensional (2D) layers. Although CDW is usually described by a sine wave, the spatial distribution within a 2D layer has never been systematically visualized. Here by using scanning tunneling microscopy (STM) and density functional theory (DFT), we have monitored the evolution of 3x3 CDW along c-axis and realized a nearly tomography scan of CDW of the topmost layer. The results show that the strength of 3x3 charge order varies while increasing the tunneling current. The 3x3 charge order is relatively strong at the outermost Se level and decreases while probing in between Se and Nb levels. Interestingly, the 3x3 charge order gets strong again as reaching Nb level but along with a phase shift. We further calculated the orbital charge distributions and found that both CDW intensity modulation and phase shift are strongly correlated with the distribution of Se p orbitals and Nb d orbitals. △ Less

Submitted 21 March, 2023; originally announced March 2023.

Comments: 12 pages, 4 figures

Showing 1–50 of 635 results for author: Chiang, C