-
PLANTS: A Novel Problem and Dataset for Summarization of Planning-Like (PL) Tasks
Authors:
Vishal Pallagani,
Biplav Srivastava,
Nitin Gupta
Abstract:
Text summarization is a well-studied problem that deals with deriving insights from unstructured text consumed by humans, and it has found extensive business applications. However, many real-life tasks involve generating a series of actions to achieve specific goals, such as workflows, recipes, dialogs, and travel plans. We refer to them as planning-like (PL) tasks noting that the main commonality…
▽ More
Text summarization is a well-studied problem that deals with deriving insights from unstructured text consumed by humans, and it has found extensive business applications. However, many real-life tasks involve generating a series of actions to achieve specific goals, such as workflows, recipes, dialogs, and travel plans. We refer to them as planning-like (PL) tasks noting that the main commonality they share is control flow information. which may be partially specified. Their structure presents an opportunity to create more practical summaries to help users make quick decisions. We investigate this observation by introducing a novel plan summarization problem, presenting a dataset, and providing a baseline method for generating PL summaries. Using quantitative metrics and qualitative user studies to establish baselines, we evaluate the plan summaries from our method and large language models. We believe the novel problem and dataset can reinvigorate research in summarization, which some consider as a solved problem.
△ Less
Submitted 18 July, 2024;
originally announced July 2024.
-
Neuro-Symbolic Fusion of Wi-Fi Sensing Data for Passive Radar with Inter-Modal Knowledge Transfer
Authors:
Marco Cominelli,
Francesco Gringoli,
Lance M. Kaplan,
Mani B. Srivastava,
Trevor Bihl,
Erik P. Blasch,
Nandini Iyer,
Federico Cerutti
Abstract:
Wi-Fi devices, akin to passive radars, can discern human activities within indoor settings due to the human body's interaction with electromagnetic signals. Current Wi-Fi sensing applications predominantly employ data-driven learning techniques to associate the fluctuations in the physical properties of the communication channel with the human activity causing them. However, these techniques often…
▽ More
Wi-Fi devices, akin to passive radars, can discern human activities within indoor settings due to the human body's interaction with electromagnetic signals. Current Wi-Fi sensing applications predominantly employ data-driven learning techniques to associate the fluctuations in the physical properties of the communication channel with the human activity causing them. However, these techniques often lack the desired flexibility and transparency. This paper introduces DeepProbHAR, a neuro-symbolic architecture for Wi-Fi sensing, providing initial evidence that Wi-Fi signals can differentiate between simple movements, such as leg or arm movements, which are integral to human activities like running or walking. The neuro-symbolic approach affords gathering such evidence without needing additional specialised data collection or labelling. The training of DeepProbHAR is facilitated by declarative domain knowledge obtained from a camera feed and by fusing signals from various antennas of the Wi-Fi receivers. DeepProbHAR achieves results comparable to the state-of-the-art in human activity recognition. Moreover, as a by-product of the learning process, DeepProbHAR generates specialised classifiers for simple movements that match the accuracy of models trained on finely labelled datasets, which would be particularly costly.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
Accurate Passive Radar via an Uncertainty-Aware Fusion of Wi-Fi Sensing Data
Authors:
Marco Cominelli,
Francesco Gringoli,
Lance M. Kaplan,
Mani B. Srivastava,
Federico Cerutti
Abstract:
Wi-Fi devices can effectively be used as passive radar systems that sense what happens in the surroundings and can even discern human activity. We propose, for the first time, a principled architecture which employs Variational Auto-Encoders for estimating a latent distribution responsible for generating the data, and Evidential Deep Learning for its ability to sense out-of-distribution activities…
▽ More
Wi-Fi devices can effectively be used as passive radar systems that sense what happens in the surroundings and can even discern human activity. We propose, for the first time, a principled architecture which employs Variational Auto-Encoders for estimating a latent distribution responsible for generating the data, and Evidential Deep Learning for its ability to sense out-of-distribution activities. We verify that the fused data processed by different antennas of the same Wi-Fi receiver results in increased accuracy of human activity recognition compared with the most recent benchmarks, while still being informative when facing out-of-distribution samples and enabling semantic interpretation of latent variables in terms of physical phenomena. The results of this paper are a first contribution toward the ultimate goal of providing a flexible, semantic characterisation of black-swan events, i.e., events for which we have limited to no training data.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
BEACON: Balancing Convenience and Nutrition in Meals With Long-Term Group Recommendations and Reasoning on Multimodal Recipes
Authors:
Vansh Nagpal,
Siva Likitha Valluru,
Kausik Lakkaraju,
Biplav Srivastava
Abstract:
A common, yet regular, decision made by people, whether healthy or with any health condition, is to decide what to have in meals like breakfast, lunch, and dinner, consisting of a combination of foods for appetizer, main course, side dishes, desserts, and beverages. However, often this decision is seen as a trade-off between nutritious choices (e.g., low salt and sugar) or convenience (e.g., inexp…
▽ More
A common, yet regular, decision made by people, whether healthy or with any health condition, is to decide what to have in meals like breakfast, lunch, and dinner, consisting of a combination of foods for appetizer, main course, side dishes, desserts, and beverages. However, often this decision is seen as a trade-off between nutritious choices (e.g., low salt and sugar) or convenience (e.g., inexpensive, fast to prepare/obtain, taste better). In this preliminary work, we present a data-driven approach for the novel meal recommendation problem that can explore and balance choices for both considerations while also reasoning about a food's constituents and cooking process. Beyond the problem formulation, our contributions also include a goodness measure, a recipe conversion method from text to the recently introduced multimodal rich recipe representation (R3) format, and learning methods using contextual bandits that show promising results.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Rating Multi-Modal Time-Series Forecasting Models (MM-TSFM) for Robustness Through a Causal Lens
Authors:
Kausik Lakkaraju,
Rachneet Kaur,
Zhen Zeng,
Parisa Zehtabi,
Sunandita Patra,
Biplav Srivastava,
Marco Valtorta
Abstract:
AI systems are notorious for their fragility; minor input changes can potentially cause major output swings. When such systems are deployed in critical areas like finance, the consequences of their uncertain behavior could be severe. In this paper, we focus on multi-modal time-series forecasting, where imprecision due to noisy or incorrect data can lead to erroneous predictions, impacting stakehol…
▽ More
AI systems are notorious for their fragility; minor input changes can potentially cause major output swings. When such systems are deployed in critical areas like finance, the consequences of their uncertain behavior could be severe. In this paper, we focus on multi-modal time-series forecasting, where imprecision due to noisy or incorrect data can lead to erroneous predictions, impacting stakeholders such as analysts, investors, and traders. Recently, it has been shown that beyond numeric data, graphical transformations can be used with advanced visual models to achieve better performance. In this context, we introduce a rating methodology to assess the robustness of Multi-Modal Time-Series Forecasting Models (MM-TSFM) through causal analysis, which helps us understand and quantify the isolated impact of various attributes on the forecasting accuracy of MM-TSFM. We apply our novel rating method on a variety of numeric and multi-modal forecasting models in a large experimental setup (six input settings of control and perturbations, ten data distributions, time series from six leading stocks in three industries over a year of data, and five time-series forecasters) to draw insights on robust forecasting models and the context of their strengths. Within the scope of our study, our main result is that multi-modal (numeric + visual) forecasting, which was found to be more accurate than numeric forecasting in previous studies, can also be more robust in diverse settings. Our work will help different stakeholders of time-series forecasting understand the models` behaviors along trust (robustness) and accuracy dimensions to select an appropriate model for forecasting using our rating method, leading to improved decision-making.
△ Less
Submitted 12 June, 2024;
originally announced June 2024.
-
The Case for Developing a Foundation Model for Planning-like Tasks from Scratch
Authors:
Biplav Srivastava,
Vishal Pallagani
Abstract:
Foundation Models (FMs) have revolutionized many areas of computing, including Automated Planning and Scheduling (APS). For example, a recent study found them useful for planning problems: plan generation, language translation, model construction, multi-agent planning, interactive planning, heuristics optimization, tool integration, and brain-inspired planning. Besides APS, there are many seemingl…
▽ More
Foundation Models (FMs) have revolutionized many areas of computing, including Automated Planning and Scheduling (APS). For example, a recent study found them useful for planning problems: plan generation, language translation, model construction, multi-agent planning, interactive planning, heuristics optimization, tool integration, and brain-inspired planning. Besides APS, there are many seemingly related tasks involving the generation of a series of actions with varying guarantees of their executability to achieve intended goals, which we collectively call planning-like (PL) tasks like business processes, programs, workflows, and guidelines, where researchers have considered using FMs. However, previous works have primarily focused on pre-trained, off-the-shelf FMs and optionally fine-tuned them. This paper discusses the need for a comprehensive FM for PL tasks from scratch and explores its design considerations. We argue that such an FM will open new and efficient avenues for PL problem-solving, just like LLMs are creating for APS.
△ Less
Submitted 6 April, 2024;
originally announced April 2024.
-
From Cloud to Edge: Rethinking Generative AI for Low-Resource Design Challenges
Authors:
Sai Krishna Revanth Vuruma,
Ashley Margetts,
Jianhai Su,
Faez Ahmed,
Biplav Srivastava
Abstract:
Generative Artificial Intelligence (AI) has shown tremendous prospects in all aspects of technology, including design. However, due to its heavy demand on resources, it is usually trained on large computing infrastructure and often made available as a cloud-based service. In this position paper, we consider the potential, challenges, and promising approaches for generative AI for design on the edg…
▽ More
Generative Artificial Intelligence (AI) has shown tremendous prospects in all aspects of technology, including design. However, due to its heavy demand on resources, it is usually trained on large computing infrastructure and often made available as a cloud-based service. In this position paper, we consider the potential, challenges, and promising approaches for generative AI for design on the edge, i.e., in resource-constrained settings where memory, compute, energy (battery) and network connectivity may be limited. Adapting generative AI for such settings involves overcoming significant hurdles, primarily in how to streamline complex models to function efficiently in low-resource environments. This necessitates innovative approaches in model compression, efficient algorithmic design, and perhaps even leveraging edge computing. The objective is to harness the power of generative AI in creating bespoke solutions for design problems, such as medical interventions, farm equipment maintenance, and educational material design, tailored to the unique constraints and needs of remote areas. These efforts could democratize access to advanced technology and foster sustainable development, ensuring universal accessibility and environmental consideration of AI-driven design benefits.
△ Less
Submitted 25 February, 2024; v1 submitted 19 February, 2024;
originally announced February 2024.
-
An Empirical Evaluation of Neural and Neuro-symbolic Approaches to Real-time Multimodal Complex Event Detection
Authors:
Liying Han,
Mani B. Srivastava
Abstract:
Robots and autonomous systems require an understanding of complex events (CEs) from sensor data to interact with their environments and humans effectively. Traditional end-to-end neural architectures, despite processing sensor data efficiently, struggle with long-duration events due to limited context sizes and reasoning capabilities. Recent advances in neuro-symbolic methods, which integrate neur…
▽ More
Robots and autonomous systems require an understanding of complex events (CEs) from sensor data to interact with their environments and humans effectively. Traditional end-to-end neural architectures, despite processing sensor data efficiently, struggle with long-duration events due to limited context sizes and reasoning capabilities. Recent advances in neuro-symbolic methods, which integrate neural and symbolic models leveraging human knowledge, promise improved performance with less data. This study addresses the gap in understanding these approaches' effectiveness in complex event detection (CED), especially in temporal reasoning. We investigate neural and neuro-symbolic architectures' performance in a multimodal CED task, analyzing IMU and acoustic data streams to recognize CE patterns. Our methodology includes (i) end-to-end neural architectures for direct CE detection from sensor embeddings, (ii) two-stage concept-based neural models mapping sensor embeddings to atomic events (AEs) before CE detection, and (iii) a neuro-symbolic approach using a symbolic finite-state machine for CE detection from AEs. Empirically, the neuro-symbolic architecture significantly surpasses purely neural models, demonstrating superior performance in CE recognition, even with extensive training data and ample temporal context for neural approaches.
△ Less
Submitted 3 March, 2024; v1 submitted 17 February, 2024;
originally announced February 2024.
-
Trust and ethical considerations in a multi-modal, explainable AI-driven chatbot tutoring system: The case of collaboratively solving Rubik's Cube
Authors:
Kausik Lakkaraju,
Vedant Khandelwal,
Biplav Srivastava,
Forest Agostinelli,
Hengtao Tang,
Prathamjeet Singh,
Dezhi Wu,
Matt Irvin,
Ashish Kundu
Abstract:
Artificial intelligence (AI) has the potential to transform education with its power of uncovering insights from massive data about student learning patterns. However, ethical and trustworthy concerns of AI have been raised but are unsolved. Prominent ethical issues in high school AI education include data privacy, information leakage, abusive language, and fairness. This paper describes technolog…
▽ More
Artificial intelligence (AI) has the potential to transform education with its power of uncovering insights from massive data about student learning patterns. However, ethical and trustworthy concerns of AI have been raised but are unsolved. Prominent ethical issues in high school AI education include data privacy, information leakage, abusive language, and fairness. This paper describes technological components that were built to address ethical and trustworthy concerns in a multi-modal collaborative platform (called ALLURE chatbot) for high school students to collaborate with AI to solve the Rubik's cube. In data privacy, we want to ensure that the informed consent of children, parents, and teachers, is at the center of any data that is managed. Since children are involved, language, whether textual, audio, or visual, is acceptable both from users and AI and the system can steer interaction away from dangerous situations. In information management, we also want to ensure that the system, while learning to improve over time, does not leak information about users from one group to another.
△ Less
Submitted 27 August, 2024; v1 submitted 30 January, 2024;
originally announced February 2024.
-
The Effect of Human v/s Synthetic Test Data and Round-tripping on Assessment of Sentiment Analysis Systems for Bias
Authors:
Kausik Lakkaraju,
Aniket Gupta,
Biplav Srivastava,
Marco Valtorta,
Dezhi Wu
Abstract:
Sentiment Analysis Systems (SASs) are data-driven Artificial Intelligence (AI) systems that output polarity and emotional intensity when given a piece of text as input. Like other AIs, SASs are also known to have unstable behavior when subjected to changes in data which can make it problematic to trust out of concerns like bias when AI works with humans and data has protected attributes like gende…
▽ More
Sentiment Analysis Systems (SASs) are data-driven Artificial Intelligence (AI) systems that output polarity and emotional intensity when given a piece of text as input. Like other AIs, SASs are also known to have unstable behavior when subjected to changes in data which can make it problematic to trust out of concerns like bias when AI works with humans and data has protected attributes like gender, race, and age. Recently, an approach was introduced to assess SASs in a blackbox setting without training data or code, and rating them for bias using synthetic English data. We augment it by introducing two human-generated chatbot datasets and also consider a round-trip setting of translating the data from one language to the same through an intermediate language. We find that these settings show SASs performance in a more realistic light. Specifically, we find that rating SASs on the chatbot data showed more bias compared to the synthetic data, and round-tripping using Spanish and Danish as intermediate languages reduces the bias (up to 68% reduction) in human-generated data while, in synthetic data, it takes a surprising turn by increasing the bias! Our findings will help researchers and practitioners refine their SAS testing strategies and foster trust as SASs are considered part of more mission-critical applications for global use.
△ Less
Submitted 15 January, 2024;
originally announced January 2024.
-
Exact results on finite size corrections for surface codes tailored to biased noise
Authors:
Yinzi Xiao,
Basudha Srivastava,
Mats Granath
Abstract:
The code-capacity threshold of a scalable quantum error correcting stabilizer code can be expressed as a thermodynamic phase transition of a corresponding random-bond Ising model. Here we study the XY and XZZX surface codes under phase-biased noise, $p_x=p_y=p_z/(2η)$, with $η\geq 1/2$, and total error rate $p=p_x+p_y+p_z$. By appropriately formulating the boundary conditions, in the rotated code…
▽ More
The code-capacity threshold of a scalable quantum error correcting stabilizer code can be expressed as a thermodynamic phase transition of a corresponding random-bond Ising model. Here we study the XY and XZZX surface codes under phase-biased noise, $p_x=p_y=p_z/(2η)$, with $η\geq 1/2$, and total error rate $p=p_x+p_y+p_z$. By appropriately formulating the boundary conditions, in the rotated code geometry, we find exact solutions at a special disordered point, $p=\frac{1+η^{-1}}{2+η^{-1}}\gtrsim 0.5$, for arbitrary odd code distance $d$, where the codes reduce to one-dimensional Ising models. The total logical failure rate is given by $P_{f}=\frac{3}{4}-\frac{1}{4}e^{-2d_Z\,\text{artanh}(1/2η)}$, where $d_{Z}=d^2$ and $d$ for the two codes respectively, is the effective code distance for pure phase-flip noise. As a consequence, for code distances $d\ll η$, and error rates near the threshold, the XZZX code is effectively equivalent to the phase-flip correcting repetition code over $d$ qubits. The large finite size corrections for $d_Z<η$ also make threshold extractions, from numerical calculations at moderate code distances, unreliable. We show that calculating thresholds based not only on the total logical failure rate, but also independently on the phase- and bit-flip logical failure rates, gives a more confident estimate. Using this method for the XZZX code with a tensor-network based decoder and code distances up to $d\approx 100$, we find that the thresholds converge to a single value at moderate bias ($η=30, 100$), at an error rate above the hashing bound.
△ Less
Submitted 5 September, 2024; v1 submitted 8 January, 2024;
originally announced January 2024.
-
On the Prospects of Incorporating Large Language Models (LLMs) in Automated Planning and Scheduling (APS)
Authors:
Vishal Pallagani,
Kaushik Roy,
Bharath Muppasani,
Francesco Fabiano,
Andrea Loreggia,
Keerthiram Murugesan,
Biplav Srivastava,
Francesca Rossi,
Lior Horesh,
Amit Sheth
Abstract:
Automated Planning and Scheduling is among the growing areas in Artificial Intelligence (AI) where mention of LLMs has gained popularity. Based on a comprehensive review of 126 papers, this paper investigates eight categories based on the unique applications of LLMs in addressing various aspects of planning problems: language translation, plan generation, model construction, multi-agent planning,…
▽ More
Automated Planning and Scheduling is among the growing areas in Artificial Intelligence (AI) where mention of LLMs has gained popularity. Based on a comprehensive review of 126 papers, this paper investigates eight categories based on the unique applications of LLMs in addressing various aspects of planning problems: language translation, plan generation, model construction, multi-agent planning, interactive planning, heuristics optimization, tool integration, and brain-inspired planning. For each category, we articulate the issues considered and existing gaps. A critical insight resulting from our review is that the true potential of LLMs unfolds when they are integrated with traditional symbolic planners, pointing towards a promising neuro-symbolic approach. This approach effectively combines the generative aspects of LLMs with the precision of classical planning methods. By synthesizing insights from existing literature, we underline the potential of this integration to address complex planning challenges. Our goal is to encourage the ICAPS community to recognize the complementary strengths of LLMs and symbolic planners, advocating for a direction in automated planning that leverages these synergistic capabilities to develop more advanced and intelligent planning systems.
△ Less
Submitted 20 January, 2024; v1 submitted 4 January, 2024;
originally announced January 2024.
-
Measurements of charged-particle multiplicity dependence of higher-order net-proton cumulants in $p$+$p$ collisions at $\sqrt{s} =$ 200 GeV from STAR at RHIC
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
C. Broodo,
X. Z. Cai
, et al. (338 additional authors not shown)
Abstract:
We report on the charged-particle multiplicity dependence of net-proton cumulant ratios up to sixth order from $\sqrt{s}=200$ GeV $p$+$p$ collisions at the Relativistic Heavy Ion Collider (RHIC). The measured ratios $C_{4}/C_{2}$, $C_{5}/C_{1}$, and $C_{6}/C_{2}$ decrease with increased charged-particle multiplicity and rapidity acceptance. Neither the Skellam baselines nor PYTHIA8 calculations ac…
▽ More
We report on the charged-particle multiplicity dependence of net-proton cumulant ratios up to sixth order from $\sqrt{s}=200$ GeV $p$+$p$ collisions at the Relativistic Heavy Ion Collider (RHIC). The measured ratios $C_{4}/C_{2}$, $C_{5}/C_{1}$, and $C_{6}/C_{2}$ decrease with increased charged-particle multiplicity and rapidity acceptance. Neither the Skellam baselines nor PYTHIA8 calculations account for the observed multiplicity dependence. In addition, the ratios $C_{5}/C_{1}$ and $C_{6}/C_{2}$ approach negative values in the highest-multiplicity events, which implies that thermalized QCD matter may be formed in $p$+$p$ collisions.
△ Less
Submitted 4 September, 2024; v1 submitted 1 November, 2023;
originally announced November 2023.
-
Estimate of Background Baseline and Upper Limit on the Chiral Magnetic Effect in Isobar Collisions at $\sqrt{s_{\text{NN}}}=200$ GeV at the Relativistic Heavy-Ion Collider
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
J. R. Adams,
G. Agakishiev,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
A. Aitbaev,
I. Alekseev,
E. Alpatov,
A. Aparin,
S. Aslam,
J. Atchison,
G. S. Averichev,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
I. G. Bordyuzhin,
J. D. Brandenburg
, et al. (333 additional authors not shown)
Abstract:
For the search of the chiral magnetic effect (CME), STAR previously presented the results from isobar collisions (${^{96}_{44}\text{Ru}}+{^{96}_{44}\text{Ru}}$, ${^{96}_{40}\text{Zr}}+{^{96}_{40}\text{Zr}}$) obtained through a blind analysis. The ratio of results in Ru+Ru to Zr+Zr collisions for the CME-sensitive charge-dependent azimuthal correlator ($Δγ$), normalized by elliptic anisotropy (…
▽ More
For the search of the chiral magnetic effect (CME), STAR previously presented the results from isobar collisions (${^{96}_{44}\text{Ru}}+{^{96}_{44}\text{Ru}}$, ${^{96}_{40}\text{Zr}}+{^{96}_{40}\text{Zr}}$) obtained through a blind analysis. The ratio of results in Ru+Ru to Zr+Zr collisions for the CME-sensitive charge-dependent azimuthal correlator ($Δγ$), normalized by elliptic anisotropy ($v_{2}$), was observed to be close to but systematically larger than the inverse multiplicity ratio. The background baseline for the isobar ratio, $Y = \frac{(Δγ/v_{2})^{\text{Ru}}}{(Δγ/v_{2})^{\text{Zr}}}$, is naively expected to be $\frac{(1/N)^{\text{Ru}}}{(1/N)^{\text{Zr}}}$; however, genuine two- and three-particle correlations are expected to alter it. We estimate the contributions to $Y$ from those correlations, utilizing both the isobar data and HIJING simulations. After including those contributions, we arrive at a final background baseline for $Y$, which is consistent with the isobar data. We extract an upper limit for the CME fraction in the $Δγ$ measurement of approximately $10\%$ at a $95\%$ confidence level on in isobar collisions at $\sqrt{s_{\text{NN}}} = 200$ GeV, with an expected $15\%$ difference in their squared magnetic fields.
△ Less
Submitted 17 July, 2024; v1 submitted 19 October, 2023;
originally announced October 2023.
-
Observation of the Antimatter Hypernucleus $^4_{\barΛ}\overline{\hbox{H}}$
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
C. Broodo,
X. Z. Cai
, et al. (342 additional authors not shown)
Abstract:
At the origin of the Universe, asymmetry between the amount of created matter and antimatter led to the matter-dominated Universe as we know today. The origins of this asymmetry remain not completely understood yet. High-energy nuclear collisions create conditions similar to the Universe microseconds after the Big Bang, with comparable amounts of matter and antimatter. Much of the created antimatt…
▽ More
At the origin of the Universe, asymmetry between the amount of created matter and antimatter led to the matter-dominated Universe as we know today. The origins of this asymmetry remain not completely understood yet. High-energy nuclear collisions create conditions similar to the Universe microseconds after the Big Bang, with comparable amounts of matter and antimatter. Much of the created antimatter escapes the rapidly expanding fireball without annihilating, making such collisions an effective experimental tool to create heavy antimatter nuclear objects and study their properties, hoping to shed some light on existing questions on the asymmetry between matter and antimatter. Here we report the first observation of the antimatter hypernucleus \hbox{$^4_{\barΛ}\overline{\hbox{H}}$}, composed of a $\barΛ$ , an antiproton and two antineutrons. The discovery was made through its two-body decay after production in ultrarelativistic heavy-ion collisions by the STAR experiment at the Relativistic Heavy Ion Collider. In total, 15.6 candidate \hbox{$^4_{\barΛ}\overline{\hbox{H}}$} antimatter hypernuclei are obtained with an estimated background count of 6.4. The lifetimes of the antihypernuclei \hbox{$^3_{\barΛ}\overline{\hbox{H}}$} and \hbox{$^4_{\barΛ}\overline{\hbox{H}}$} are measured and compared with the lifetimes of their corresponding hypernuclei, testing the symmetry between matter and antimatter. Various production yield ratios among (anti)hypernuclei and (anti)nuclei are also measured and compared with theoretical model predictions, shedding light on their production mechanisms.
△ Less
Submitted 8 June, 2024; v1 submitted 19 October, 2023;
originally announced October 2023.
-
On Learning with LAD
Authors:
C. A. Jothishwaran,
Biplav Srivastava,
Jitin Singla,
Sugata Gangopadhyay
Abstract:
The logical analysis of data, LAD, is a technique that yields two-class classifiers based on Boolean functions having disjunctive normal form (DNF) representation. Although LAD algorithms employ optimization techniques, the resulting binary classifiers or binary rules do not lead to overfitting. We propose a theoretical justification for the absence of overfitting by estimating the Vapnik-Chervone…
▽ More
The logical analysis of data, LAD, is a technique that yields two-class classifiers based on Boolean functions having disjunctive normal form (DNF) representation. Although LAD algorithms employ optimization techniques, the resulting binary classifiers or binary rules do not lead to overfitting. We propose a theoretical justification for the absence of overfitting by estimating the Vapnik-Chervonenkis dimension (VC dimension) for LAD models where hypothesis sets consist of DNFs with a small number of cubic monomials. We illustrate and confirm our observations empirically.
△ Less
Submitted 28 September, 2023;
originally announced September 2023.
-
Results on Elastic Cross Sections in Proton-Proton Collisions at $\sqrt{s} = 510$ GeV with the STAR Detector at RHIC
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
C. Broodo,
X. Z. Cai
, et al. (343 additional authors not shown)
Abstract:
We report results on an elastic cross section measurement in proton-proton collisions at a center-of-mass energy $\sqrt{s}=510$ GeV, obtained with the Roman Pot setup of the STAR experiment at the Relativistic Heavy Ion Collider (RHIC). The elastic differential cross section is measured in the four-momentum transfer squared range $0.23 \leq -t \leq 0.67$ GeV$^2$. We find that a constant slope $B$…
▽ More
We report results on an elastic cross section measurement in proton-proton collisions at a center-of-mass energy $\sqrt{s}=510$ GeV, obtained with the Roman Pot setup of the STAR experiment at the Relativistic Heavy Ion Collider (RHIC). The elastic differential cross section is measured in the four-momentum transfer squared range $0.23 \leq -t \leq 0.67$ GeV$^2$. We find that a constant slope $B$ does not fit the data in the aforementioned $t$ range, and we obtain a much better fit using a second-order polynomial for $B(t)$. The $t$ dependence of $B$ is determined using six subintervals of $t$ in the STAR measured $t$ range, and is in good agreement with the phenomenological models. The measured elastic differential cross section $\mathrm{d}σ/\mathrm{dt}$ agrees well with the results obtained at $\sqrt{s} = 546$ GeV for proton--antiproton collisions by the UA4 experiment. We also determine that the integrated elastic cross section within the STAR $t$-range is $σ^\mathrm{fid}_\mathrm{el} = 462.1 \pm 0.9 (\mathrm{stat.}) \pm 1.1 (\mathrm {syst.}) \pm 11.6 (\mathrm {scale})$~$μ\mathrm{b}$.
△ Less
Submitted 6 May, 2024; v1 submitted 28 September, 2023;
originally announced September 2023.
-
Longitudinal and transverse spin transfer to $Λ$ and $\overlineΛ$ hyperons in polarized $p$+$p$ collisions at $\sqrt{s} = 200$ GeV
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
D. M. Anderson,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
W. Baker,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
X. Z. Cai
, et al. (357 additional authors not shown)
Abstract:
The longitudinal and transverse spin transfers to $Λ$ ($\overlineΛ$) hyperons in polarized proton-proton collisions are expected to be sensitive to the helicity and transversity distributions, respectively, of (anti-)strange quarks in the proton, and to the corresponding polarized fragmentation functions. We report improved measurements of the longitudinal spin transfer coefficient, $D_{LL}$, and…
▽ More
The longitudinal and transverse spin transfers to $Λ$ ($\overlineΛ$) hyperons in polarized proton-proton collisions are expected to be sensitive to the helicity and transversity distributions, respectively, of (anti-)strange quarks in the proton, and to the corresponding polarized fragmentation functions. We report improved measurements of the longitudinal spin transfer coefficient, $D_{LL}$, and the transverse spin transfer coefficient, $D_{TT}$, to $Λ$ and $\overlineΛ$ in polarized proton-proton collisions at $\sqrt{s}$ = 200 GeV by the STAR experiment at RHIC. The data set includes longitudinally polarized proton-proton collisions with an integrated luminosity of 52 pb$^{-1}$, and transversely polarized proton-proton collisions with a similar integrated luminosity. Both data sets have about twice the statistics of previous results and cover a kinematic range of $|η_{Λ(\overlineΛ)}|$ $<$ 1.2 and transverse momentum $p_{T,{Λ(\overlineΛ)}}$ up to 8 GeV/$c$. We also report the first measurements of the hyperon spin transfer coefficients $D_{LL}$ and $D_{TT}$ as a function of the fractional jet momentum $z$ carried by the hyperon, which can provide more direct constraints on the polarized fragmentation functions.
△ Less
Submitted 7 December, 2023; v1 submitted 25 September, 2023;
originally announced September 2023.
-
Reaction plane correlated triangular flow in Au+Au collisions at $\sqrt{s_{NN}}=3$ GeV
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
C. Broodo,
X. Z. Cai
, et al. (341 additional authors not shown)
Abstract:
We measure triangular flow relative to the reaction plane at 3 GeV center-of-mass energy in Au+Au collisions at the BNL Relativistic Heavy Ion Collider. A significant $v_3$ signal for protons is observed, which increases for higher rapidity, higher transverse momentum, and more peripheral collisions. The triangular flow is essentially rapidity-odd with a slope at mid-rapidity, $dv_3/dy|_{(y=0)}$,…
▽ More
We measure triangular flow relative to the reaction plane at 3 GeV center-of-mass energy in Au+Au collisions at the BNL Relativistic Heavy Ion Collider. A significant $v_3$ signal for protons is observed, which increases for higher rapidity, higher transverse momentum, and more peripheral collisions. The triangular flow is essentially rapidity-odd with a slope at mid-rapidity, $dv_3/dy|_{(y=0)}$, opposite in sign compared to the slope for directed flow. No significant $v_3$ signal is observed for charged pions and kaons. Comparisons with models suggest that a mean field potential is required to describe these results, and that the triangular shape of the participant nucleons is the result of stopping and nuclear geometry.
△ Less
Submitted 19 April, 2024; v1 submitted 21 September, 2023;
originally announced September 2023.
-
Promoting Research Collaboration with Open Data Driven Team Recommendation in Response to Call for Proposals
Authors:
Siva Likitha Valluru,
Biplav Srivastava,
Sai Teja Paladi,
Siwen Yan,
Sriraam Natarajan
Abstract:
Building teams and promoting collaboration are two very common business activities. An example of these are seen in the TeamingForFunding problem, where research institutions and researchers are interested to identify collaborative opportunities when applying to funding agencies in response to latter's calls for proposals. We describe a novel system to recommend teams using a variety of AI methods…
▽ More
Building teams and promoting collaboration are two very common business activities. An example of these are seen in the TeamingForFunding problem, where research institutions and researchers are interested to identify collaborative opportunities when applying to funding agencies in response to latter's calls for proposals. We describe a novel system to recommend teams using a variety of AI methods, such that (1) each team achieves the highest possible skill coverage that is demanded by the opportunity, and (2) the workload of distributing the opportunities is balanced amongst the candidate members. We address these questions by extracting skills latent in open data of proposal calls (demand) and researcher profiles (supply), normalizing them using taxonomies, and creating efficient algorithms that match demand to supply. We create teams to maximize goodness along a novel metric balancing short- and long-term objectives. We validate the success of our algorithms (1) quantitatively, by evaluating the recommended teams using a goodness score and find that more informed methods lead to recommendations of smaller number of teams but higher goodness, and (2) qualitatively, by conducting a large-scale user study at a college-wide level, and demonstrate that users overall found the tool very useful and relevant. Lastly, we evaluate our system in two diverse settings in US and India (of researchers and proposal calls) to establish generality of our approach, and deploy it at a major US university for routine use.
△ Less
Submitted 25 January, 2024; v1 submitted 17 September, 2023;
originally announced September 2023.
-
Evaluating Chatbots to Promote Users' Trust -- Practices and Open Problems
Authors:
Biplav Srivastava,
Kausik Lakkaraju,
Tarmo Koppel,
Vignesh Narayanan,
Ashish Kundu,
Sachindra Joshi
Abstract:
Chatbots, the common moniker for collaborative assistants, are Artificial Intelligence (AI) software that enables people to naturally interact with them to get tasks done. Although chatbots have been studied since the dawn of AI, they have particularly caught the imagination of the public and businesses since the launch of easy-to-use and general-purpose Large Language Model-based chatbots like Ch…
▽ More
Chatbots, the common moniker for collaborative assistants, are Artificial Intelligence (AI) software that enables people to naturally interact with them to get tasks done. Although chatbots have been studied since the dawn of AI, they have particularly caught the imagination of the public and businesses since the launch of easy-to-use and general-purpose Large Language Model-based chatbots like ChatGPT. As businesses look towards chatbots as a potential technology to engage users, who may be end customers, suppliers, or even their own employees, proper testing of chatbots is important to address and mitigate issues of trust related to service or product performance, user satisfaction and long-term unintended consequences for society. This paper reviews current practices for chatbot testing, identifies gaps as open problems in pursuit of user trust, and outlines a path forward.
△ Less
Submitted 13 September, 2023; v1 submitted 9 September, 2023;
originally announced September 2023.
-
Upper Limit on the Chiral Magnetic Effect in Isobar Collisions at the Relativistic Heavy-Ion Collider
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
J. R. Adams,
G. Agakishiev,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
A. Aitbaev,
I. Alekseev,
E. Alpatov,
A. Aparin,
S. Aslam,
J. Atchison,
G. S. Averichev,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
I. G. Bordyuzhin,
J. D. Brandenburg
, et al. (333 additional authors not shown)
Abstract:
The chiral magnetic effect (CME) is a phenomenon that arises from the QCD anomaly in the presence of an external magnetic field. The experimental search for its evidence has been one of the key goals of the physics program of the Relativistic Heavy-Ion Collider. The STAR collaboration has previously presented the results of a blind analysis of isobar collisions (…
▽ More
The chiral magnetic effect (CME) is a phenomenon that arises from the QCD anomaly in the presence of an external magnetic field. The experimental search for its evidence has been one of the key goals of the physics program of the Relativistic Heavy-Ion Collider. The STAR collaboration has previously presented the results of a blind analysis of isobar collisions (${^{96}_{44}\text{Ru}}+{^{96}_{44}\text{Ru}}$, ${^{96}_{40}\text{Zr}}+{^{96}_{40}\text{Zr}}$) in the search for the CME. The isobar ratio ($Y$) of CME-sensitive observable, charge separation scaled by elliptic anisotropy, is close to but systematically larger than the inverse multiplicity ratio, the naive background baseline. This indicates the potential existence of a CME signal and the presence of remaining nonflow background due to two- and three-particle correlations, which are different between the isobars. In this post-blind analysis, we estimate the contributions from those nonflow correlations as a background baseline to $Y$, utilizing the isobar data as well as Heavy Ion Jet Interaction Generator simulations. This baseline is found consistent with the isobar ratio measurement, and an upper limit of 10% at 95% confidence level is extracted for the CME fraction in the charge separation measurement in isobar collisions at $\sqrt{s_{\rm NN}}=200$ GeV.
△ Less
Submitted 17 July, 2024; v1 submitted 31 August, 2023;
originally announced August 2023.
-
Jet-hadron correlations with respect to the event plane in $\sqrt{s_{\mathrm{NN}}}$ = 200 GeV Au+Au collisions in STAR
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
X. Z. Cai,
H. Caines
, et al. (340 additional authors not shown)
Abstract:
Angular distributions of charged particles relative to jet axes are studied in $\sqrt{s_{\mathrm{NN}}}$ = 200 GeV Au+Au collisions as a function of the jet orientation with respect to the event plane. This differential study tests the expected path-length dependence of energy loss experienced by a hard-scattered parton as it traverses the hot and dense medium formed in heavy-ion collisions. A seco…
▽ More
Angular distributions of charged particles relative to jet axes are studied in $\sqrt{s_{\mathrm{NN}}}$ = 200 GeV Au+Au collisions as a function of the jet orientation with respect to the event plane. This differential study tests the expected path-length dependence of energy loss experienced by a hard-scattered parton as it traverses the hot and dense medium formed in heavy-ion collisions. A second-order event plane is used in the analysis as an experimental estimate of the reaction plane formed by the collision impact parameter and the beam direction. Charged-particle jets with $15 < p_{\rm T, jet} <$ 20 and $20 < p_{\rm T, jet} <$ 40 GeV/$c$ were reconstructed with the anti-$k_{\rm T}$ algorithm with radius parameter setting of (R=0.4) in the 20-50\% centrality bin to maximize the initial-state eccentricity of the interaction region. The reaction plane fit method is implemented to remove the flow-modulated background with better precision than prior methods. Yields and widths of jet-associated charged-hadron distributions are extracted in three angular bins between the jet axis and the event plane. The event-plane (EP) dependence is further quantified by ratios of the associated yields in different EP bins. No dependence on orientation of the jet axis with respect to the event plane is seen within the uncertainties in the kinematic regime studied. This finding is consistent with a similar experimental observation by ALICE in $\sqrt{s_{\mathrm{NN}}}$ = 2.76 TeV Pb+Pb collision data.
△ Less
Submitted 20 March, 2024; v1 submitted 25 July, 2023;
originally announced July 2023.
-
On Solving the Rubik's Cube with Domain-Independent Planners Using Standard Representations
Authors:
Bharath Muppasani,
Vishal Pallagani,
Biplav Srivastava,
Forest Agostinelli
Abstract:
Rubik's Cube (RC) is a well-known and computationally challenging puzzle that has motivated AI researchers to explore efficient alternative representations and problem-solving methods. The ideal situation for planning here is that a problem be solved optimally and efficiently represented in a standard notation using a general-purpose solver and heuristics. The fastest solver today for RC is DeepCu…
▽ More
Rubik's Cube (RC) is a well-known and computationally challenging puzzle that has motivated AI researchers to explore efficient alternative representations and problem-solving methods. The ideal situation for planning here is that a problem be solved optimally and efficiently represented in a standard notation using a general-purpose solver and heuristics. The fastest solver today for RC is DeepCubeA with a custom representation, and another approach is with Scorpion planner with State-Action-Space+ (SAS+) representation. In this paper, we present the first RC representation in the popular PDDL language so that the domain becomes more accessible to PDDL planners, competitions, and knowledge engineering tools, and is more human-readable. We then bridge across existing approaches and compare performance. We find that in one comparable experiment, DeepCubeA (trained with 12 RC actions) solves all problems with varying complexities, albeit only 78.5% are optimal plans. For the same problem set, Scorpion with SAS+ representation and pattern database heuristics solves 61.50% problems optimally, while FastDownward with PDDL representation and FF heuristic solves 56.50% problems, out of which 79.64% of the plans generated were optimal. Our study provides valuable insights into the trade-offs between representational choice and plan optimality that can help researchers design future strategies for challenging domains combining general-purpose solving methods (planning, reinforcement learning), heuristics, and representations (standard or custom).
△ Less
Submitted 21 August, 2023; v1 submitted 25 July, 2023;
originally announced July 2023.
-
A Planning Ontology to Represent and Exploit Planning Knowledge for Performance Efficiency
Authors:
Bharath Muppasani,
Vishal Pallagani,
Biplav Srivastava,
Raghava Mutharaju,
Michael N. Huhns,
Vignesh Narayanan
Abstract:
Ontologies are known for their ability to organize rich metadata, support the identification of novel insights via semantic queries, and promote reuse. In this paper, we consider the problem of automated planning, where the objective is to find a sequence of actions that will move an agent from an initial state of the world to a desired goal state. We hypothesize that given a large number of avail…
▽ More
Ontologies are known for their ability to organize rich metadata, support the identification of novel insights via semantic queries, and promote reuse. In this paper, we consider the problem of automated planning, where the objective is to find a sequence of actions that will move an agent from an initial state of the world to a desired goal state. We hypothesize that given a large number of available planners and diverse planning domains; they carry essential information that can be leveraged to identify suitable planners and improve their performance for a domain. We use data on planning domains and planners from the International Planning Competition (IPC) to construct a planning ontology and demonstrate via experiments in two use cases that the ontology can lead to the selection of promising planners and improving their performance using macros - a form of action ordering constraints extracted from planning ontology. We also make the planning ontology and associated resources available to the community to promote further research.
△ Less
Submitted 8 July, 2024; v1 submitted 25 July, 2023;
originally announced July 2023.
-
Value-based Fast and Slow AI Nudging
Authors:
Marianna B. Ganapini,
Francesco Fabiano,
Lior Horesh,
Andrea Loreggia,
Nicholas Mattei,
Keerthiram Murugesan,
Vishal Pallagani,
Francesca Rossi,
Biplav Srivastava,
Brent Venable
Abstract:
Nudging is a behavioral strategy aimed at influencing people's thoughts and actions. Nudging techniques can be found in many situations in our daily lives, and these nudging techniques can targeted at human fast and unconscious thinking, e.g., by using images to generate fear or the more careful and effortful slow thinking, e.g., by releasing information that makes us reflect on our choices. In th…
▽ More
Nudging is a behavioral strategy aimed at influencing people's thoughts and actions. Nudging techniques can be found in many situations in our daily lives, and these nudging techniques can targeted at human fast and unconscious thinking, e.g., by using images to generate fear or the more careful and effortful slow thinking, e.g., by releasing information that makes us reflect on our choices. In this paper, we propose and discuss a value-based AI-human collaborative framework where AI systems nudge humans by proposing decision recommendations. Three different nudging modalities, based on when recommendations are presented to the human, are intended to stimulate human fast thinking, slow thinking, or meta-cognition. Values that are relevant to a specific decision scenario are used to decide when and how to use each of these nudging modalities. Examples of values are decision quality, speed, human upskilling and learning, human agency, and privacy. Several values can be present at the same time, and their priorities can vary over time. The framework treats values as parameters to be instantiated in a specific decision environment.
△ Less
Submitted 14 July, 2023;
originally announced July 2023.
-
Can LLMs be Good Financial Advisors?: An Initial Study in Personal Decision Making for Optimized Outcomes
Authors:
Kausik Lakkaraju,
Sai Krishna Revanth Vuruma,
Vishal Pallagani,
Bharath Muppasani,
Biplav Srivastava
Abstract:
Increasingly powerful Large Language Model (LLM) based chatbots, like ChatGPT and Bard, are becoming available to users that have the potential to revolutionize the quality of decision-making achieved by the public. In this context, we set out to investigate how such systems perform in the personal finance domain, where financial inclusion has been an overarching stated aim of banks for decades. W…
▽ More
Increasingly powerful Large Language Model (LLM) based chatbots, like ChatGPT and Bard, are becoming available to users that have the potential to revolutionize the quality of decision-making achieved by the public. In this context, we set out to investigate how such systems perform in the personal finance domain, where financial inclusion has been an overarching stated aim of banks for decades. We asked 13 questions representing banking products in personal finance: bank account, credit card, and certificate of deposits and their inter-product interactions, and decisions related to high-value purchases, payment of bank dues, and investment advice, and in different dialects and languages (English, African American Vernacular English, and Telugu). We find that although the outputs of the chatbots are fluent and plausible, there are still critical gaps in providing accurate and reliable financial information using LLM-based chatbots.
△ Less
Submitted 8 July, 2023;
originally announced July 2023.
-
Data-driven decoding of quantum error correcting codes using graph neural networks
Authors:
Moritz Lange,
Pontus Havström,
Basudha Srivastava,
Valdemar Bergentall,
Karl Hammar,
Olivia Heuts,
Evert van Nieuwenburg,
Mats Granath
Abstract:
To leverage the full potential of quantum error-correcting stabilizer codes it is crucial to have an efficient and accurate decoder. Accurate, maximum likelihood, decoders are computationally very expensive whereas decoders based on more efficient algorithms give sub-optimal performance. In addition, the accuracy will depend on the quality of models and estimates of error rates for idling qubits,…
▽ More
To leverage the full potential of quantum error-correcting stabilizer codes it is crucial to have an efficient and accurate decoder. Accurate, maximum likelihood, decoders are computationally very expensive whereas decoders based on more efficient algorithms give sub-optimal performance. In addition, the accuracy will depend on the quality of models and estimates of error rates for idling qubits, gates, measurements, and resets, and will typically assume symmetric error channels. In this work, instead, we explore a model-free, data-driven, approach to decoding, using a graph neural network (GNN). The decoding problem is formulated as a graph classification task in which a set of stabilizer measurements is mapped to an annotated detector graph for which the neural network predicts the most likely logical error class. We show that the GNN-based decoder can outperform a matching decoder for circuit level noise on the surface code given only simulated experimental data, even if the matching decoder is given full information of the underlying error model. Although training is computationally demanding, inference is fast and scales approximately linearly with the space-time volume of the code. We also find that we can use large, but more limited, datasets of real experimental data [Google Quantum AI, Nature {\bf 614}, 676 (2023)] for the repetition code, giving decoding accuracies that are on par with minimum weight perfect matching. The results show that a purely data-driven approach to decoding may be a viable future option for practical quantum error correction, which is competitive in terms of speed, accuracy, and versatility.
△ Less
Submitted 3 July, 2023;
originally announced July 2023.
-
Understanding the Capabilities of Large Language Models for Automated Planning
Authors:
Vishal Pallagani,
Bharath Muppasani,
Keerthiram Murugesan,
Francesca Rossi,
Biplav Srivastava,
Lior Horesh,
Francesco Fabiano,
Andrea Loreggia
Abstract:
Automated planning is concerned with developing efficient algorithms to generate plans or sequences of actions to achieve a specific goal in a given environment. Emerging Large Language Models (LLMs) can answer questions, write high-quality programming code, and predict protein folding, showcasing their versatility in solving various tasks beyond language-based problems. In this paper, we aim to e…
▽ More
Automated planning is concerned with developing efficient algorithms to generate plans or sequences of actions to achieve a specific goal in a given environment. Emerging Large Language Models (LLMs) can answer questions, write high-quality programming code, and predict protein folding, showcasing their versatility in solving various tasks beyond language-based problems. In this paper, we aim to explore how LLMs can also be used for automated planning. To do so, we seek to answer four key questions. Firstly, we want to understand the extent to which LLMs can be used for plan generation. Secondly, we aim to identify which pre-training data is most effective in facilitating plan generation. Thirdly, we investigate whether fine-tuning or prompting is a more effective approach for plan generation. Finally, we explore whether LLMs are capable of plan generalization. By answering these questions, the study seeks to shed light on the capabilities of LLMs in solving complex planning problems and provide insights into the most effective approaches for using LLMs in this context.
△ Less
Submitted 25 May, 2023;
originally announced May 2023.
-
Towards Explainable and Safe Conversational Agents for Mental Health: A Survey
Authors:
Surjodeep Sarkar,
Manas Gaur,
L. Chen,
Muskan Garg,
Biplav Srivastava,
Bhaktee Dongaonkar
Abstract:
Virtual Mental Health Assistants (VMHAs) are seeing continual advancements to support the overburdened global healthcare system that gets 60 million primary care visits, and 6 million Emergency Room (ER) visits annually. These systems are built by clinical psychologists, psychiatrists, and Artificial Intelligence (AI) researchers for Cognitive Behavioral Therapy (CBT). At present, the role of VMHA…
▽ More
Virtual Mental Health Assistants (VMHAs) are seeing continual advancements to support the overburdened global healthcare system that gets 60 million primary care visits, and 6 million Emergency Room (ER) visits annually. These systems are built by clinical psychologists, psychiatrists, and Artificial Intelligence (AI) researchers for Cognitive Behavioral Therapy (CBT). At present, the role of VMHAs is to provide emotional support through information, focusing less on developing a reflective conversation with the patient. A more comprehensive, safe and explainable approach is required to build responsible VMHAs to ask follow-up questions or provide a well-informed response. This survey offers a systematic critical review of the existing conversational agents in mental health, followed by new insights into the improvements of VMHAs with contextual knowledge, datasets, and their emerging role in clinical decision support. We also provide new directions toward enriching the user experience of VMHAs with explainability, safety, and wholesome trustworthiness. Finally, we provide evaluation metrics and practical considerations for VMHAs beyond the current literature to build trust between VMHAs and patients in active communications.
△ Less
Submitted 25 April, 2023;
originally announced April 2023.
-
Collision-energy Dependence of Deuteron Cumulants and Proton-deuteron Correlations in Au+Au collisions at RHIC
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
C. Broodo,
X. Z. Cai
, et al. (343 additional authors not shown)
Abstract:
We report the first measurements of cumulants, up to $4^{th}$ order, of deuteron number distributions and proton-deuteron correlations in Au+Au collisions recorded by the STAR experiment in phase-I of Beam Energy Scan (BES) program at the Relativistic Heavy Ion Collider. Deuteron cumulants, their ratios, and proton-deuteron mixed cumulants are presented for different collision centralities coverin…
▽ More
We report the first measurements of cumulants, up to $4^{th}$ order, of deuteron number distributions and proton-deuteron correlations in Au+Au collisions recorded by the STAR experiment in phase-I of Beam Energy Scan (BES) program at the Relativistic Heavy Ion Collider. Deuteron cumulants, their ratios, and proton-deuteron mixed cumulants are presented for different collision centralities covering a range of center-of-mass energy per nucleon pair $\sqrt{s_{NN}}$~=~7.7 to 200~GeV. It is found that the cumulant ratios at lower collision energies favor a canonical ensemble over a grand canonical ensemble in thermal models. An anti-correlation between proton and deuteron multiplicity is observed across all collision energies and centralities, consistent with the expectation from global baryon number conservation. The UrQMD model coupled with a phase-space coalescence mechanism qualitatively reproduces the collision-energy dependence of cumulant ratios and proton-deuteron correlations.
△ Less
Submitted 28 June, 2024; v1 submitted 21 April, 2023;
originally announced April 2023.
-
Event-by-event correlations between $Λ$ ($\barΛ$) hyperon global polarization and handedness with charged hadron azimuthal separation in Au+Au collisions at $\sqrt{s_{\text{NN}}} = 27 \text{ GeV}$ from STAR
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
J. R. Adams,
G. Agakishiev,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
A. Aitbaev,
I. Alekseev,
D. M. Anderson,
A. Aparin,
S. Aslam,
J. Atchison,
G. S. Averichev,
V. Bairathi,
W. Baker,
J. G. Ball Cap,
K. Barish,
P. Bhagat,
A. Bhasin,
S. Bhatta,
I. G. Bordyuzhin,
J. D. Brandenburg
, et al. (333 additional authors not shown)
Abstract:
Global polarizations ($P$) of $Λ$ ($\barΛ$) hyperons have been observed in non-central heavy-ion collisions. The strong magnetic field primarily created by the spectator protons in such collisions would split the $Λ$ and $\barΛ$ global polarizations ($ΔP = P_Λ - P_{\barΛ} < 0$). Additionally, quantum chromodynamics (QCD) predicts topological charge fluctuations in vacuum, resulting in a chirality…
▽ More
Global polarizations ($P$) of $Λ$ ($\barΛ$) hyperons have been observed in non-central heavy-ion collisions. The strong magnetic field primarily created by the spectator protons in such collisions would split the $Λ$ and $\barΛ$ global polarizations ($ΔP = P_Λ - P_{\barΛ} < 0$). Additionally, quantum chromodynamics (QCD) predicts topological charge fluctuations in vacuum, resulting in a chirality imbalance or parity violation in a local domain. This would give rise to an imbalance ($Δn = \frac{N_{\text{L}} - N_{\text{R}}}{\langle N_{\text{L}} + N_{\text{R}} \rangle} \neq 0$) between left- and right-handed $Λ$ ($\barΛ$) as well as a charge separation along the magnetic field, referred to as the chiral magnetic effect (CME). This charge separation can be characterized by the parity-even azimuthal correlator ($Δγ$) and parity-odd azimuthal harmonic observable ($Δa_{1}$). Measurements of $ΔP$, $Δγ$, and $Δa_{1}$ have not led to definitive conclusions concerning the CME or the magnetic field, and $Δn$ has not been measured previously. Correlations among these observables may reveal new insights. This paper reports measurements of correlation between $Δn$ and $Δa_{1}$, which is sensitive to chirality fluctuations, and correlation between $ΔP$ and $Δγ$ sensitive to magnetic field in Au+Au collisions at 27 GeV. For both measurements, no correlations have been observed beyond statistical fluctuations.
△ Less
Submitted 22 July, 2023; v1 submitted 19 April, 2023;
originally announced April 2023.
-
Observation of the electromagnetic field effect via charge-dependent directed flow in heavy-ion collisions at the Relativistic Heavy Ion Collider
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
J. R. Adams,
G. Agakishiev,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
A. Aitbaev,
I. Alekseev,
E. Alpatov,
A. Aparin,
S. Aslam,
J. Atchison,
G. S. Averichev,
V. Bairathi,
J. G. Ball Cap,
K. Barish,
P. Bhagat,
A. Bhasin,
S. Bhatta,
S. R. Bhosale,
I. G. Bordyuzhin,
J. D. Brandenburg
, et al. (331 additional authors not shown)
Abstract:
The deconfined quark-gluon plasma (QGP) created in relativistic heavy-ion collisions enables the exploration of the fundamental properties of matter under extreme conditions. Non-central collisions can produce strong magnetic fields on the order of $10^{18}$ Gauss, which offers a probe into the electrical conductivity of the QGP. In particular, quarks and anti-quarks carry opposite charges and rec…
▽ More
The deconfined quark-gluon plasma (QGP) created in relativistic heavy-ion collisions enables the exploration of the fundamental properties of matter under extreme conditions. Non-central collisions can produce strong magnetic fields on the order of $10^{18}$ Gauss, which offers a probe into the electrical conductivity of the QGP. In particular, quarks and anti-quarks carry opposite charges and receive contrary electromagnetic forces that alter their momenta. This phenomenon can be manifested in the collective motion of final-state particles, specifically in the rapidity-odd directed flow, denoted as $v_1(\mathsf{y})$. Here we present the charge-dependent measurements of $dv_1/d\mathsf{y}$ near midrapidities for $π^{\pm}$, $K^{\pm}$, and $p(\bar{p})$ in Au+Au and isobar ($_{44}^{96}$Ru+$_{44}^{96}$Ru and $_{40}^{96}$Zr+$_{40}^{96}$Zr) collisions at $\sqrt{s_{\rm NN}}=$ 200 GeV, and in Au+Au collisions at 27 GeV, recorded by the STAR detector at the Relativistic Heavy Ion Collider. The combined dependence of the $v_1$ signal on collision system, particle species, and collision centrality can be qualitatively and semi-quantitatively understood as several effects on constituent quarks. While the results in central events can be explained by the $u$ and $d$ quarks transported from initial-state nuclei, those in peripheral events reveal the impacts of the electromagnetic field on the QGP. Our data put valuable constraints on the electrical conductivity of the QGP in theoretical calculations.
△ Less
Submitted 22 February, 2024; v1 submitted 6 April, 2023;
originally announced April 2023.
-
Hyperon polarization along the beam direction relative to the second and third harmonic event planes in isobar collisions at $\sqrt{s_{NN}}$ = 200 GeV
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
J. R. Adams,
G. Agakishiev,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
A. Aitbaev,
I. Alekseev,
D. M. Anderson,
A. Aparin,
S. Aslam,
J. Atchison,
G. S. Averichev,
V. Bairathi,
W. Baker,
J. G. Ball Cap,
K. Barish,
P. Bhagat,
A. Bhasin,
S. Bhatta,
I. G. Bordyuzhin,
J. D. Brandenburg
, et al. (338 additional authors not shown)
Abstract:
The polarization of $Λ$ and $\barΛ$ hyperons along the beam direction has been measured relative to the second and third harmonic event planes in isobar Ru+Ru and Zr+Zr collisions at $\sqrt{s_{NN}}$ = 200 GeV. This is the first experimental evidence of the hyperon polarization by the triangular flow originating from the initial density fluctuations. The amplitudes of the sine modulation for the se…
▽ More
The polarization of $Λ$ and $\barΛ$ hyperons along the beam direction has been measured relative to the second and third harmonic event planes in isobar Ru+Ru and Zr+Zr collisions at $\sqrt{s_{NN}}$ = 200 GeV. This is the first experimental evidence of the hyperon polarization by the triangular flow originating from the initial density fluctuations. The amplitudes of the sine modulation for the second and third harmonic results are comparable in magnitude, increase from central to peripheral collisions, and show a mild $p_T$ dependence. The azimuthal angle dependence of the polarization follows the vorticity pattern expected due to elliptic and triangular anisotropic flow, and qualitatively disagree with most hydrodynamic model calculations based on thermal vorticity and shear induced contributions. The model results based on one of existing implementations of the shear contribution lead to a correct azimuthal angle dependence, but predict centrality and $p_T$ dependence that still disagree with experimental measurements. Thus, our results provide stringent constraints on the thermal vorticity and shear-induced contributions to hyperon polarization. Comparison to previous measurements at RHIC and the LHC for the second-order harmonic results shows little dependence on the collision system size and collision energy.
△ Less
Submitted 16 November, 2023; v1 submitted 16 March, 2023;
originally announced March 2023.
-
Measurement of electrons from open heavy-flavor hadron decays in Au+Au collisions at $\sqrt{s_{\rm NN}}=200$ GeV with the STAR detector
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
D. M. Anderson,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
W. Baker,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
X. Z. Cai
, et al. (350 additional authors not shown)
Abstract:
We report a new measurement of the production of electrons from open heavy-flavor hadron decays (HFEs) at mid-rapidity ($|y|<$ 0.7) in Au+Au collisions at $\sqrt{s_{\rm NN}}=200$ GeV. Invariant yields of HFEs are measured for the transverse momentum range of $3.5 < p_{\rm T} < 9$ GeV/$c$ in various configurations of the collision geometry. The HFE yields in head-on Au+Au collisions are suppressed…
▽ More
We report a new measurement of the production of electrons from open heavy-flavor hadron decays (HFEs) at mid-rapidity ($|y|<$ 0.7) in Au+Au collisions at $\sqrt{s_{\rm NN}}=200$ GeV. Invariant yields of HFEs are measured for the transverse momentum range of $3.5 < p_{\rm T} < 9$ GeV/$c$ in various configurations of the collision geometry. The HFE yields in head-on Au+Au collisions are suppressed by approximately a factor of 2 compared to that in $p$+$p$ collisions scaled by the average number of binary collisions, indicating strong interactions between heavy quarks and the hot and dense medium created in heavy-ion collisions. Comparison of these results with models provides additional tests of theoretical calculations of heavy quark energy loss in the quark-gluon plasma.
△ Less
Submitted 28 June, 2023; v1 submitted 12 March, 2023;
originally announced March 2023.
-
Fast and Slow Planning
Authors:
Francesco Fabiano,
Vishal Pallagani,
Marianna Bergamaschi Ganapini,
Lior Horesh,
Andrea Loreggia,
Keerthiram Murugesan,
Francesca Rossi,
Biplav Srivastava
Abstract:
The concept of Artificial Intelligence has gained a lot of attention over the last decade. In particular, AI-based tools have been employed in several scenarios and are, by now, pervading our everyday life. Nonetheless, most of these systems lack many capabilities that we would naturally consider to be included in a notion of "intelligence". In this work, we present an architecture that, inspired…
▽ More
The concept of Artificial Intelligence has gained a lot of attention over the last decade. In particular, AI-based tools have been employed in several scenarios and are, by now, pervading our everyday life. Nonetheless, most of these systems lack many capabilities that we would naturally consider to be included in a notion of "intelligence". In this work, we present an architecture that, inspired by the cognitive theory known as Thinking Fast and Slow by D. Kahneman, is tasked with solving planning problems in different settings, specifically: classical and multi-agent epistemic. The system proposed is an instance of a more general AI paradigm, referred to as SOFAI (for Slow and Fast AI). SOFAI exploits multiple solving approaches, with different capabilities that characterize them as either fast or slow, and a metacognitive module to regulate them. This combination of components, which roughly reflects the human reasoning process according to D. Kahneman, allowed us to enhance the reasoning process that, in this case, is concerned with planning in two different settings. The behavior of this system is then compared to state-of-the-art solvers, showing that the newly introduced system presents better results in terms of generality, solving a wider set of problems with an acceptable trade-off between solving times and solution accuracy.
△ Less
Submitted 7 March, 2023;
originally announced March 2023.
-
Elliptic Flow of Heavy-Flavor Decay Electrons in Au+Au Collisions at $\sqrt{s_{_{\rm NN}}}$ = 27 and 54.4 GeV at RHIC
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
D. M. Anderson,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
W. Baker,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
X. Z. Cai
, et al. (350 additional authors not shown)
Abstract:
We report on new measurements of elliptic flow ($v_2$) of electrons from heavy-flavor hadron decays at mid-rapidity ($|y|<0.8$) in Au+Au collisions at $\sqrt{s_{_{\rm NN}}}$ = 27 and 54.4 GeV from the STAR experiment. Heavy-flavor decay electrons ($e^{\rm HF}$) in Au+Au collisions at $\sqrt{s_{_{\rm NN}}}$ = 54.4 GeV exhibit a non-zero $v_2$ in the transverse momentum ($p_{\rm T}$) region of…
▽ More
We report on new measurements of elliptic flow ($v_2$) of electrons from heavy-flavor hadron decays at mid-rapidity ($|y|<0.8$) in Au+Au collisions at $\sqrt{s_{_{\rm NN}}}$ = 27 and 54.4 GeV from the STAR experiment. Heavy-flavor decay electrons ($e^{\rm HF}$) in Au+Au collisions at $\sqrt{s_{_{\rm NN}}}$ = 54.4 GeV exhibit a non-zero $v_2$ in the transverse momentum ($p_{\rm T}$) region of $p_{\rm T}<$ 2 GeV/$c$ with the magnitude comparable to that at $\sqrt{s_{_{\rm NN}}}=200$ GeV. The measured $e^{\rm HF}$ $v_2$ at 54.4 GeV is also consistent with the expectation of their parent charm hadron $v_2$ following number-of-constituent-quark scaling as other light and strange flavor hadrons at this energy. These suggest that charm quarks gain significant collectivity through the evolution of the QCD medium and may reach local thermal equilibrium in Au+Au collisions at $\sqrt{s_{_{\rm NN}}}=54.4$ GeV. The measured $e^{\rm HF}$ $v_2$ in Au+Au collisions at $\sqrt{s_{_{\rm NN}}}=$ 27 GeV is consistent with zero within large uncertainties. The energy dependence of $v_2$ for different flavor particles ($π,φ,D^{0}/e^{\rm HF}$) shows an indication of quark mass hierarchy in reaching thermalization in high-energy nuclear collisions.
△ Less
Submitted 3 August, 2023; v1 submitted 6 March, 2023;
originally announced March 2023.
-
Advances in Automatically Rating the Trustworthiness of Text Processing Services
Authors:
Biplav Srivastava,
Kausik Lakkaraju,
Mariana Bernagozzi,
Marco Valtorta
Abstract:
AI services are known to have unstable behavior when subjected to changes in data, models or users. Such behaviors, whether triggered by omission or commission, lead to trust issues when AI works with humans. The current approach of assessing AI services in a black box setting, where the consumer does not have access to the AI's source code or training data, is limited. The consumer has to rely on…
▽ More
AI services are known to have unstable behavior when subjected to changes in data, models or users. Such behaviors, whether triggered by omission or commission, lead to trust issues when AI works with humans. The current approach of assessing AI services in a black box setting, where the consumer does not have access to the AI's source code or training data, is limited. The consumer has to rely on the AI developer's documentation and trust that the system has been built as stated. Further, if the AI consumer reuses the service to build other services which they sell to their customers, the consumer is at the risk of the service providers (both data and model providers). Our approach, in this context, is inspired by the success of nutritional labeling in food industry to promote health and seeks to assess and rate AI services for trust from the perspective of an independent stakeholder. The ratings become a means to communicate the behavior of AI systems so that the consumer is informed about the risks and can make an informed decision. In this paper, we will first describe recent progress in developing rating methods for text-based machine translator AI services that have been found promising with user studies. Then, we will outline challenges and vision for a principled, multi-modal, causality-based rating methodologies and its implication for decision-support in real-world scenarios like health and food recommendation.
△ Less
Submitted 4 February, 2023;
originally announced February 2023.
-
Rating Sentiment Analysis Systems for Bias through a Causal Lens
Authors:
Kausik Lakkaraju,
Biplav Srivastava,
Marco Valtorta
Abstract:
Sentiment Analysis Systems (SASs) are data-driven Artificial Intelligence (AI) systems that, given a piece of text, assign one or more numbers conveying the polarity and emotional intensity expressed in the input. Like other automatic machine learning systems, they have also been known to exhibit model uncertainty where a (small) change in the input leads to drastic swings in the output. This can…
▽ More
Sentiment Analysis Systems (SASs) are data-driven Artificial Intelligence (AI) systems that, given a piece of text, assign one or more numbers conveying the polarity and emotional intensity expressed in the input. Like other automatic machine learning systems, they have also been known to exhibit model uncertainty where a (small) change in the input leads to drastic swings in the output. This can be especially problematic when inputs are related to protected features like gender or race since such behavior can be perceived as a lack of fairness, i.e., bias. We introduce a novel method to assess and rate SASs where inputs are perturbed in a controlled causal setting to test if the output sentiment is sensitive to protected variables even when other components of the textual input, e.g., chosen emotion words, are fixed. We then use the result to assign labels (ratings) at fine-grained and overall levels to convey the robustness of the SAS to input changes. The ratings serve as a principled basis to compare SASs and choose among them based on behavior. It benefits all users, especially developers who reuse off-the-shelf SASs to build larger AI systems but do not have access to their code or training data to compare.
△ Less
Submitted 3 February, 2023;
originally announced February 2023.
-
Energy Dependence of Intermittency for Charged Hadrons in Au+Au Collisions at RHIC
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
D. M. Anderson,
E. C. Aschenauer,
S. Aslam,
J. Atchison,
V. Bairathi,
W. Baker,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
X. Z. Cai
, et al. (359 additional authors not shown)
Abstract:
Density fluctuations near the QCD critical point can be probed via an intermittency analysis in relativistic heavy-ion collisions. We report the first measurement of intermittency in Au$+$Au collisions at $\sqrt{s_\mathrm{_{NN}}}$ = 7.7-200 GeV measured by the STAR experiment at the Relativistic Heavy Ion Collider (RHIC). The scaled factorial moments of identified charged hadrons are analyzed at m…
▽ More
Density fluctuations near the QCD critical point can be probed via an intermittency analysis in relativistic heavy-ion collisions. We report the first measurement of intermittency in Au$+$Au collisions at $\sqrt{s_\mathrm{_{NN}}}$ = 7.7-200 GeV measured by the STAR experiment at the Relativistic Heavy Ion Collider (RHIC). The scaled factorial moments of identified charged hadrons are analyzed at mid-rapidity and within the transverse momentum phase space. We observe a power-law behavior of scaled factorial moments in Au$+$Au collisions and a decrease in the extracted scaling exponent ($ν$) from peripheral to central collisions. The $ν$ is consistent with a constant for different collisions energies in the mid-central (10-40\%) collisions. Moreover, the $ν$ in the 0-5\% most central Au$+$Au collisions exhibits a non-monotonic energy dependence that reaches a possible minimum around $\sqrt{s_\mathrm{_{NN}}}$ = 27 GeV. The physics implications on the QCD phase structure are discussed.
△ Less
Submitted 19 September, 2023; v1 submitted 26 January, 2023;
originally announced January 2023.
-
On Safe and Usable Chatbots for Promoting Voter Participation
Authors:
Bharath Muppasani,
Vishal Pallagani,
Kausik Lakkaraju,
Shuge Lei,
Biplav Srivastava,
Brett Robertson,
Andrea Hickerson,
Vignesh Narayanan
Abstract:
Chatbots, or bots for short, are multi-modal collaborative assistants that can help people complete useful tasks. Usually, when chatbots are referenced in connection with elections, they often draw negative reactions due to the fear of mis-information and hacking. Instead, in this paper, we explore how chatbots may be used to promote voter participation in vulnerable segments of society like senio…
▽ More
Chatbots, or bots for short, are multi-modal collaborative assistants that can help people complete useful tasks. Usually, when chatbots are referenced in connection with elections, they often draw negative reactions due to the fear of mis-information and hacking. Instead, in this paper, we explore how chatbots may be used to promote voter participation in vulnerable segments of society like senior citizens and first-time voters. In particular, we build a system that amplifies official information while personalizing it to users' unique needs transparently. We discuss its design, build prototypes with frequently asked questions (FAQ) election information for two US states that are low on an ease-of-voting scale, and report on its initial evaluation in a focus group. Our approach can be a win-win for voters, election agencies trying to fulfill their mandate and democracy at large.
△ Less
Submitted 28 December, 2022; v1 submitted 16 December, 2022;
originally announced December 2022.
-
Plansformer: Generating Symbolic Plans using Transformers
Authors:
Vishal Pallagani,
Bharath Muppasani,
Keerthiram Murugesan,
Francesca Rossi,
Lior Horesh,
Biplav Srivastava,
Francesco Fabiano,
Andrea Loreggia
Abstract:
Large Language Models (LLMs) have been the subject of active research, significantly advancing the field of Natural Language Processing (NLP). From BERT to BLOOM, LLMs have surpassed state-of-the-art results in various natural language tasks such as question answering, summarization, and text generation. Many ongoing efforts focus on understanding LLMs' capabilities, including their knowledge of t…
▽ More
Large Language Models (LLMs) have been the subject of active research, significantly advancing the field of Natural Language Processing (NLP). From BERT to BLOOM, LLMs have surpassed state-of-the-art results in various natural language tasks such as question answering, summarization, and text generation. Many ongoing efforts focus on understanding LLMs' capabilities, including their knowledge of the world, syntax, and semantics. However, extending the textual prowess of LLMs to symbolic reasoning has been slow and predominantly focused on tackling problems related to the mathematical field. In this paper, we explore the use of LLMs for automated planning - a branch of AI concerned with the realization of action sequences (plans) to achieve a goal, typically executed by intelligent agents, autonomous robots, and unmanned vehicles. We introduce Plansformer; an LLM fine-tuned on planning problems and capable of generating plans with favorable behavior in terms of correctness and length with reduced knowledge-engineering efforts. We also demonstrate the adaptability of Plansformer in solving different planning domains with varying complexities, owing to the transfer learning abilities of LLMs. For one configuration of Plansformer, we achieve ~97% valid plans, out of which ~95% are optimal for Towers of Hanoi - a puzzle-solving domain.
△ Less
Submitted 16 December, 2022;
originally announced December 2022.
-
Observation of Directed Flow of Hypernuclei $^3_Λ$H and $^4_Λ$H in $\sqrt{s_{\rm NN}}$ = 3 GeV Au+Au Collisions at RHIC
Authors:
STAR Collaboration,
B. E. Aboona,
J. Adam,
J. R. Adams,
G. Agakishiev,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
A. Aitbaev,
I. Alekseev,
D. M. Anderson,
A. Aparin,
J. Atchison,
G. S. Averichev,
V. Bairathi,
W. Baker,
J. G. Ball Cap,
K. Barish,
P. Bhagat,
A. Bhasin,
S. Bhatta,
I. G. Bordyuzhin,
J. D. Brandenburg,
A. V. Brandin,
X. Z. Cai
, et al. (330 additional authors not shown)
Abstract:
We report here the first observation of directed flow ($v_1$) of the hypernuclei $^3_Λ$H and $^4_Λ$H in mid-central Au+Au collisions at $\sqrt{s_{\rm NN}}$ = 3 GeV at RHIC. These data are taken as part of the beam energy scan program carried out by the STAR experiment. From 165 $\times$ 10$^{6}$ events in 5%-40% centrality, about 8400 $^3_Λ$H and 5200 $^4_Λ$H candidates are reconstructed through t…
▽ More
We report here the first observation of directed flow ($v_1$) of the hypernuclei $^3_Λ$H and $^4_Λ$H in mid-central Au+Au collisions at $\sqrt{s_{\rm NN}}$ = 3 GeV at RHIC. These data are taken as part of the beam energy scan program carried out by the STAR experiment. From 165 $\times$ 10$^{6}$ events in 5%-40% centrality, about 8400 $^3_Λ$H and 5200 $^4_Λ$H candidates are reconstructed through two- and three-body decay channels. We observe that these hypernuclei exhibit significant directed flow. Comparing to that of light nuclei, it is found that the midrapidity $v_1$ slopes of $^3_Λ$H and $^4_Λ$H follow baryon number scaling, implying that the coalescence is the dominant mechanism for these hypernuclei production in such collisions.
△ Less
Submitted 7 June, 2023; v1 submitted 30 November, 2022;
originally announced November 2022.
-
Learnings from Technological Interventions in a Low Resource Language: Enhancing Information Access in Gondi
Authors:
Devansh Mehta,
Harshita Diddee,
Ananya Saxena,
Anurag Shukla,
Sebastin Santy,
Ramaravind Kommiya Mothilal,
Brij Mohan Lal Srivastava,
Alok Sharma,
Vishnu Prasad,
Venkanna U,
Kalika Bali
Abstract:
The primary obstacle to developing technologies for low-resource languages is the lack of representative, usable data. In this paper, we report the deployment of technology-driven data collection methods for creating a corpus of more than 60,000 translations from Hindi to Gondi, a low-resource vulnerable language spoken by around 2.3 million tribal people in south and central India. During this pr…
▽ More
The primary obstacle to developing technologies for low-resource languages is the lack of representative, usable data. In this paper, we report the deployment of technology-driven data collection methods for creating a corpus of more than 60,000 translations from Hindi to Gondi, a low-resource vulnerable language spoken by around 2.3 million tribal people in south and central India. During this process, we help expand information access in Gondi across 2 different dimensions (a) The creation of linguistic resources that can be used by the community, such as a dictionary, children's stories, Gondi translations from multiple sources and an Interactive Voice Response (IVR) based mass awareness platform; (b) Enabling its use in the digital domain by developing a Hindi-Gondi machine translation model, which is compressed by nearly 4 times to enable it's edge deployment on low-resource edge devices and in areas of little to no internet connectivity. We also present preliminary evaluations of utilizing the developed machine translation model to provide assistance to volunteers who are involved in collecting more data for the target language. Through these interventions, we not only created a refined and evaluated corpus of 26,240 Hindi-Gondi translations that was used for building the translation model but also engaged nearly 850 community members who can help take Gondi onto the internet.
△ Less
Submitted 29 November, 2022;
originally announced November 2022.
-
Beam energy dependence of the linear and mode-coupled flow harmonics in Au+Au collisions
Authors:
STAR Collaboration,
B. E. Aboona,
J. Adam,
J. R. Adams,
G. Agakishiev,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
A. Aitbaev,
I. Alekseev,
D. M. Anderson,
A. Aparin,
J. Atchison,
G. S. Averichev,
V. Bairathi,
W. Baker,
J. G. Ball Cap,
K. Barish,
P. Bhagat,
A. Bhasin,
S. Bhatta,
I. G. Bordyuzhin,
J. D. Brandenburg,
A. V. Brandin,
X. Z. Cai
, et al. (333 additional authors not shown)
Abstract:
The linear and mode-coupled contributions to higher-order anisotropic flow are presented for Au+Au collisions at $\sqrt{s_{\mathrm{NN}}}$ = 27, 39, 54.4, and 200 GeV and compared to similar measurements for Pb+Pb collisions at the Large Hadron Collider (LHC). The coefficients and the flow harmonics' correlations, which characterize the linear and mode-coupled response to the lower-order anisotropi…
▽ More
The linear and mode-coupled contributions to higher-order anisotropic flow are presented for Au+Au collisions at $\sqrt{s_{\mathrm{NN}}}$ = 27, 39, 54.4, and 200 GeV and compared to similar measurements for Pb+Pb collisions at the Large Hadron Collider (LHC). The coefficients and the flow harmonics' correlations, which characterize the linear and mode-coupled response to the lower-order anisotropies, indicate a beam energy dependence consistent with an influence from the specific shear viscosity ($η/s$). In contrast, the dimensionless coefficients, mode-coupled response coefficients, and normalized symmetric cumulants are approximately beam-energy independent, consistent with a significant role from initial-state effects. These measurements could provide unique supplemental constraints to (i) distinguish between different initial-state models and (ii) delineate the temperature ($T$) and baryon chemical potential ($μ_{B}$) dependence of the specific shear viscosity $\fracη{s} (T, μ_B)$.
△ Less
Submitted 20 February, 2023; v1 submitted 21 November, 2022;
originally announced November 2022.
-
Measurements of the elliptic and triangular azimuthal anisotropies in central $^{3}$He+Au, $d$+Au and $p$+Au collisions at $\mbox{$\sqrt{s_{\mathrm{NN}}}$}$ = 200 GeV
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
J. R. Adams,
G. Agakishiev,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
A. Aitbaev,
I. Alekseev,
D. M. Anderson,
A. Aparin,
S. Aslam,
J. Atchison,
G. S. Averichev,
V. Bairathi,
W. Baker,
J. G. Ball Cap,
K. Barish,
P. Bhagat,
A. Bhasin,
S. Bhatta,
I. G. Bordyuzhin,
J. D. Brandenburg
, et al. (334 additional authors not shown)
Abstract:
The elliptic ($v_2$) and triangular ($v_3$) azimuthal anisotropy coefficients in central $^{3}$He+Au, $d$+Au, and $p$+Au collisions at $\mbox{$\sqrt{s_{\mathrm{NN}}}$}$ = 200 GeV are measured as a function of transverse momentum ($p_{\mathrm{T}}$) at mid-rapidity ($|η|<$0.9), via the azimuthal angular correlation between two particles both at $|η|<$0.9. While the $v_2(p_{\mathrm{T}})$ values depen…
▽ More
The elliptic ($v_2$) and triangular ($v_3$) azimuthal anisotropy coefficients in central $^{3}$He+Au, $d$+Au, and $p$+Au collisions at $\mbox{$\sqrt{s_{\mathrm{NN}}}$}$ = 200 GeV are measured as a function of transverse momentum ($p_{\mathrm{T}}$) at mid-rapidity ($|η|<$0.9), via the azimuthal angular correlation between two particles both at $|η|<$0.9. While the $v_2(p_{\mathrm{T}})$ values depend on the colliding systems, the $v_3(p_{\mathrm{T}})$ values are system-independent within the uncertainties, suggesting an influence on eccentricity from sub-nucleonic fluctuations in these small-sized systems. These results also provide stringent constraints for the hydrodynamic modeling of these systems.
△ Less
Submitted 6 June, 2023; v1 submitted 20 October, 2022;
originally announced October 2022.
-
$K^{*0}$ production in Au+Au collisions at $\sqrt{s_{\rm NN}}$ = 7.7, 11.5, 14.5, 19.6, 27 and 39 GeV from RHIC beam energy scan
Authors:
STAR Collaboration,
M. S. Abdallah,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
J. K. Adkins,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
D. M. Anderson,
E. C. Aschenauer,
J. Atchison,
V. Bairathi,
W. Baker,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
X. Z. Cai
, et al. (350 additional authors not shown)
Abstract:
We report the measurement of $K^{*0}$ meson at midrapidity ($|y|<$ 1.0) in Au+Au collisions at $\sqrt{s_{\rm NN}}$~=~7.7, 11.5, 14.5, 19.6, 27 and 39 GeV collected by the STAR experiment during the RHIC beam energy scan (BES) program. The transverse momentum spectra, yield, and average transverse momentum of $K^{*0}$ are presented as functions of collision centrality and beam energy. The…
▽ More
We report the measurement of $K^{*0}$ meson at midrapidity ($|y|<$ 1.0) in Au+Au collisions at $\sqrt{s_{\rm NN}}$~=~7.7, 11.5, 14.5, 19.6, 27 and 39 GeV collected by the STAR experiment during the RHIC beam energy scan (BES) program. The transverse momentum spectra, yield, and average transverse momentum of $K^{*0}$ are presented as functions of collision centrality and beam energy. The $K^{*0}/K$ yield ratios are presented for different collision centrality intervals and beam energies. The $K^{*0}/K$ ratio in heavy-ion collisions are observed to be smaller than that in small system collisions (e+e and p+p). The $K^{*0}/K$ ratio follows a similar centrality dependence to that observed in previous RHIC and LHC measurements. The data favor the scenario of the dominance of hadronic re-scattering over regeneration for $K^{*0}$ production in the hadronic phase of the medium.
△ Less
Submitted 5 April, 2023; v1 submitted 6 October, 2022;
originally announced October 2022.
-
Higher-Order Cumulants and Correlation Functions of Proton Multiplicity Distributions in $\sqrt{s_{\mathrm{NN}}}$ = 3 GeV Au+Au Collisions at the RHIC STAR Experiment
Authors:
STAR Collaboration,
M. S. Abdallah,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
J. K. Adkins,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
D. M. Anderson,
E. C. Aschenauer,
J. Atchison,
V. Bairathi,
W. Baker,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
X. Z. Cai
, et al. (349 additional authors not shown)
Abstract:
We report a measurement of cumulants and correlation functions of event-by-event proton multiplicity distributions from fixed-target Au+Au collisions at $\sqrt{s_{\rm NN}}$ = 3 GeV measured by the STAR experiment. Protons are identified within the rapidity ($y$) and transverse momentum ($p_{\rm T}$) region $-0.9 < y<0$ and $0.4 < p_{\rm T} <2.0 $ GeV/$c$ in the center-of-mass frame. A systematic a…
▽ More
We report a measurement of cumulants and correlation functions of event-by-event proton multiplicity distributions from fixed-target Au+Au collisions at $\sqrt{s_{\rm NN}}$ = 3 GeV measured by the STAR experiment. Protons are identified within the rapidity ($y$) and transverse momentum ($p_{\rm T}$) region $-0.9 < y<0$ and $0.4 < p_{\rm T} <2.0 $ GeV/$c$ in the center-of-mass frame. A systematic analysis of the proton cumulants and correlation functions up to sixth-order as well as the corresponding ratios as a function of the collision centrality, $p_{\rm T}$, and $y$ are presented. The effect of pileup and initial volume fluctuations on these observables and the respective corrections are discussed in detail. The results are compared to calculations from the hadronic transport UrQMD model as well as a hydrodynamic model. In the most central 5\% collisions, the value of proton cumulant ratio $C_4/C_2$ is negative, drastically different from the values observed in Au+Au collisions at higher energies. Compared to model calculations including Lattice QCD, a hadronic transport model, and a hydrodynamic model, the strong suppression in the ratio of $C_4/C_2$ at 3 GeV Au+Au collisions indicates an energy regime dominated by hadronic interactions.
△ Less
Submitted 22 February, 2023; v1 submitted 24 September, 2022;
originally announced September 2022.
-
Beam Energy Dependence of Triton Production and Yield Ratio ($\mathrm{N}_t \times \mathrm{N}_p/\mathrm{N}_d^2$) in Au+Au Collisions at RHIC
Authors:
STAR Collaboration,
M. I. Abdulhamid,
B. E. Aboona,
J. Adam,
J. R. Adams,
G. Agakishiev,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
A. Aitbaev,
I. Alekseev,
D. M. Anderson,
A. Aparin,
S. Aslam,
J. Atchison,
G. S. Averichev,
V. Bairathi,
W. Baker,
J. G. Ball Cap,
K. Barish,
P. Bhagat,
A. Bhasin,
S. Bhatta,
I. G. Bordyuzhin,
J. D. Brandenburg
, et al. (333 additional authors not shown)
Abstract:
We report the triton ($t$) production in mid-rapidity ($|y| <$ 0.5) Au+Au collisions at $\sqrt{s_\mathrm{NN}}$= 7.7--200 GeV measured by the STAR experiment from the first phase of the beam energy scan at the Relativistic Heavy Ion Collider (RHIC). The nuclear compound yield ratio ($\mathrm{N}_t \times \mathrm{N}_p/\mathrm{N}_d^2$), which is predicted to be sensitive to the fluctuation of local ne…
▽ More
We report the triton ($t$) production in mid-rapidity ($|y| <$ 0.5) Au+Au collisions at $\sqrt{s_\mathrm{NN}}$= 7.7--200 GeV measured by the STAR experiment from the first phase of the beam energy scan at the Relativistic Heavy Ion Collider (RHIC). The nuclear compound yield ratio ($\mathrm{N}_t \times \mathrm{N}_p/\mathrm{N}_d^2$), which is predicted to be sensitive to the fluctuation of local neutron density, is observed to decrease monotonically with increasing charged-particle multiplicity ($dN_{ch}/dη$) and follows a scaling behavior. The $dN_{ch}/dη$ dependence of the yield ratio is compared to calculations from coalescence and thermal models. Enhancements in the yield ratios relative to the coalescence baseline are observed in the 0\%-10\% most central collisions at 19.6 and 27 GeV, with a significance of 2.3$σ$ and 3.4$σ$, respectively, giving a combined significance of 4.1$σ$. The enhancements are not observed in peripheral collisions or model calculations without critical fluctuation, and decreases with a smaller $p_{T}$ acceptance. The physics implications of these results on the QCD phase structure and the production mechanism of light nuclei in heavy-ion collisions are discussed.
△ Less
Submitted 18 May, 2023; v1 submitted 16 September, 2022;
originally announced September 2022.
-
Search for the Chiral Magnetic Effect in Au+Au collisions at $\sqrt{s_{_{\rm{NN}}}}=27$ GeV with the STAR forward Event Plane Detectors
Authors:
STAR Collaboration,
B. E. Aboona,
J. Adam,
L. Adamczyk,
J. R. Adams,
I. Aggarwal,
M. M. Aggarwal,
Z. Ahammed,
D. M. Anderson,
E. C. Aschenauer,
J. Atchison,
V. Bairathi,
W. Baker,
J. G. Ball Cap,
K. Barish,
R. Bellwied,
P. Bhagat,
A. Bhasin,
S. Bhatta,
J. Bielcik,
J. Bielcikova,
J. D. Brandenburg,
X. Z. Cai,
H. Caines,
M. Calderón de la Barca Sánchez
, et al. (347 additional authors not shown)
Abstract:
A decisive experimental test of the Chiral Magnetic Effect (CME) is considered one of the major scientific goals at the Relativistic Heavy-Ion Collider (RHIC) towards understanding the nontrivial topological fluctuations of the Quantum Chromodynamics vacuum. In heavy-ion collisions, the CME is expected to result in a charge separation phenomenon across the reaction plane, whose strength could be s…
▽ More
A decisive experimental test of the Chiral Magnetic Effect (CME) is considered one of the major scientific goals at the Relativistic Heavy-Ion Collider (RHIC) towards understanding the nontrivial topological fluctuations of the Quantum Chromodynamics vacuum. In heavy-ion collisions, the CME is expected to result in a charge separation phenomenon across the reaction plane, whose strength could be strongly energy dependent. The previous CME searches have been focused on top RHIC energy collisions. In this Letter, we present a low energy search for the CME in Au+Au collisions at $\sqrt{s_{_{\rm{NN}}}}=27$ GeV. We measure elliptic flow scaled charge-dependent correlators relative to the event planes that are defined at both mid-rapidity $|η|<1.0$ and at forward rapidity $2.1 < |η|<5.1$. We compare the results based on the directed flow plane ($Ψ_1$) at forward rapidity and the elliptic flow plane ($Ψ_2$) at both central and forward rapidity. The CME scenario is expected to result in a larger correlation relative to $Ψ_1$ than to $Ψ_2$, while a flow driven background scenario would lead to a consistent result for both event planes. In 10-50\% centrality, results using three different event planes are found to be consistent within experimental uncertainties, suggesting a flow driven background scenario dominating the measurement. We obtain an upper limit on the deviation from a flow driven background scenario at the 95\% confidence level. This work opens up a possible road map towards future CME search with the high statistics data from the RHIC Beam Energy Scan Phase-II.
△ Less
Submitted 19 April, 2023; v1 submitted 7 September, 2022;
originally announced September 2022.