-
Differential rotation in convecting spherical shells with non-uniform viscosity and entropy diffusivity
Authors:
Parag Gupta,
David MacTaggart,
Radostin D. Simitev
Abstract:
Contemporary three-dimensional physics-based simulations of the solar convection zone disagree with observations. They feature differential rotation substantially different from the true rotation inferred by solar helioseismology and exhibit a conveyor belt of convective "Busse" columns not found in observations. To help unravel this so-called "convection conundrum", we use a three-dimensional pse…
▽ More
Contemporary three-dimensional physics-based simulations of the solar convection zone disagree with observations. They feature differential rotation substantially different from the true rotation inferred by solar helioseismology and exhibit a conveyor belt of convective "Busse" columns not found in observations. To help unravel this so-called "convection conundrum", we use a three-dimensional pseudospectral simulation code to investigate how radially non-uniform viscosity and entropy diffusivity affect differential rotation and convective flow patterns in density-stratified rotating spherical fluid shells. We find that radial non-uniformity in fluid properties enhances polar convection, which, in turn, induces non-negligible lateral entropy gradients that lead to large deviations from differential rotation geostrophy due to thermal wind balance. We report simulations wherein this mechanism maintains differential rotation patterns very similar to the true solar profile outside the tangent cylinder, although discrepancies remain at high latitudes. This is significant because differential rotation plays a key role in sustaining solar-like cyclic dipolar dynamos.
△ Less
Submitted 21 November, 2023;
originally announced November 2023.
-
Optimal Fidelity Selection for Improved Performance in Human-in-the-Loop Queues for Underwater Search
Authors:
Piyush Gupta,
Vaibhav Srivastava
Abstract:
In the context of human-supervised autonomy, we study the problem of optimal fidelity selection for a human operator performing an underwater visual search task. Human performance depends on various cognitive factors such as workload and fatigue. We perform human experiments in which participants perform two tasks simultaneously: a primary task, which is subject to evaluation, and a secondary task…
▽ More
In the context of human-supervised autonomy, we study the problem of optimal fidelity selection for a human operator performing an underwater visual search task. Human performance depends on various cognitive factors such as workload and fatigue. We perform human experiments in which participants perform two tasks simultaneously: a primary task, which is subject to evaluation, and a secondary task to estimate their workload. The primary task requires participants to search for underwater mines in videos, while the secondary task involves a simple visual test where they respond when a green light displayed on the side of their screens turns red. Videos arrive as a Poisson process and are stacked in a queue to be serviced by the human operator. The operator can choose to watch the video with either normal or high fidelity, with normal fidelity videos playing at three times the speed of high fidelity ones. Participants receive rewards for their accuracy in mine detection for each primary task and penalties based on the number of videos waiting in the queue. We consider the workload of the operator as a hidden state and model the workload dynamics as an Input-Output Hidden Markov Model (IOHMM). We use a Partially Observable Markov Decision Process (POMDP) to learn an optimal fidelity selection policy, where the objective is to maximize total rewards. Our results demonstrate improved performance when videos are serviced based on the optimal fidelity policy compared to a baseline where humans choose the fidelity level themselves.
△ Less
Submitted 10 November, 2023;
originally announced November 2023.
-
Fostering Human Learning in Sequential Decision-Making: Understanding the Role of Evaluative Feedback
Authors:
Piyush Gupta,
Subir Biswas,
Vaibhav Srivastava
Abstract:
Cognitive rehabilitation, STEM (science, technology, engineering, and math) skill acquisition, and coaching games such as chess often require tutoring decision-making strategies. The advancement of AI-driven tutoring systems for facilitating human learning requires an understanding of the impact of evaluative feedback on human decision-making and skill development. To this end, we conduct human ex…
▽ More
Cognitive rehabilitation, STEM (science, technology, engineering, and math) skill acquisition, and coaching games such as chess often require tutoring decision-making strategies. The advancement of AI-driven tutoring systems for facilitating human learning requires an understanding of the impact of evaluative feedback on human decision-making and skill development. To this end, we conduct human experiments using Amazon Mechanical Turk to study the influence of evaluative feedback on human decision-making in sequential tasks. In these experiments, participants solve the Tower of Hanoi puzzle and receive AI-generated feedback while solving it. We examine how this feedback affects their learning and skill transfer to related tasks. Additionally, treating humans as noisy optimal agents, we employ maximum entropy inverse reinforcement learning to analyze the effect of feedback on the implicit human reward structure that guides their decision making. Lastly, we explore various computational models to understand how people incorporate evaluative feedback into their decision-making processes. Our findings underscore that humans perceive evaluative feedback as indicative of their long-term strategic success, thus aiding in skill acquisition and transfer in sequential decision-making tasks. Moreover, we demonstrate that evaluative feedback fosters a more structured and organized learning experience compared to learning without feedback. Furthermore, our results indicate that providing intermediate goals alone does not significantly enhance human learning outcomes.
△ Less
Submitted 4 May, 2024; v1 submitted 6 November, 2023;
originally announced November 2023.
-
AI-based, automated chamber volumetry from gated, non-contrast CT
Authors:
Athira J Jacob,
Ola Abdelkarim,
Salma Zook,
Kristian Hay Kragholm,
Prantik Gupta,
Myra Cocker,
Juan Ramirez Giraldo,
Jim O Doherty,
Max Schoebinger,
Chris Schwemmer,
Mehmet A Gulsun,
Saikiran Rapaka,
Puneet Sharma,
Su-Min Chang
Abstract:
Background: Accurate chamber volumetry from gated, non-contrast cardiac CT (NCCT) scans can be useful for potential screening of heart failure.
Objectives: To validate a new, fully automated, AI-based method for cardiac volume and myocardial mass quantification from NCCT scans compared to contrasted CT Angiography (CCTA).
Methods: Of a retrospectively collected cohort of 1051 consecutive patie…
▽ More
Background: Accurate chamber volumetry from gated, non-contrast cardiac CT (NCCT) scans can be useful for potential screening of heart failure.
Objectives: To validate a new, fully automated, AI-based method for cardiac volume and myocardial mass quantification from NCCT scans compared to contrasted CT Angiography (CCTA).
Methods: Of a retrospectively collected cohort of 1051 consecutive patients, 420 patients had both NCCT and CCTA scans at mid-diastolic phase, excluding patients with cardiac devices. Ground truth values were obtained from the CCTA scans.
Results: The NCCT volume computation shows good agreement with ground truth values. Volume differences [95% CI ] and correlation coefficients were: -9.6 [-45; 26] mL, r = 0.98 for LV Total, -5.4 [-24; 13] mL, r = 0.95 for LA, -8.7 [-45; 28] mL, r = 0.94 for RV, -5.2 [-27; 17] mL, r = 0.92 for RA, -3.2 [-42; 36] mL, r = 0.91 for LV blood pool, and -6.7 [-39; 26] g, r = 0.94 for LV wall mass, respectively. Mean relative volume errors of less than 7% were obtained for all chambers.
Conclusions: Fully automated assessment of chamber volumes from NCCT scans is feasible and correlates well with volumes obtained from contrast study.
△ Less
Submitted 25 October, 2023;
originally announced November 2023.
-
Waveform Modelling for the Laser Interferometer Space Antenna
Authors:
LISA Consortium Waveform Working Group,
Niayesh Afshordi,
Sarp Akçay,
Pau Amaro Seoane,
Andrea Antonelli,
Josu C. Aurrekoetxea,
Leor Barack,
Enrico Barausse,
Robert Benkel,
Laura Bernard,
Sebastiano Bernuzzi,
Emanuele Berti,
Matteo Bonetti,
Béatrice Bonga,
Gabriele Bozzola,
Richard Brito,
Alessandra Buonanno,
Alejandro Cárdenas-Avendaño,
Marc Casals,
David F. Chernoff,
Alvin J. K. Chua,
Katy Clough,
Marta Colleoni,
Mekhi Dhesi,
Adrien Druart
, et al. (121 additional authors not shown)
Abstract:
LISA, the Laser Interferometer Space Antenna, will usher in a new era in gravitational-wave astronomy. As the first anticipated space-based gravitational-wave detector, it will expand our view to the millihertz gravitational-wave sky, where a spectacular variety of interesting new sources abound: from millions of ultra-compact binaries in our Galaxy, to mergers of massive black holes at cosmologic…
▽ More
LISA, the Laser Interferometer Space Antenna, will usher in a new era in gravitational-wave astronomy. As the first anticipated space-based gravitational-wave detector, it will expand our view to the millihertz gravitational-wave sky, where a spectacular variety of interesting new sources abound: from millions of ultra-compact binaries in our Galaxy, to mergers of massive black holes at cosmological distances; from the beginnings of inspirals that will venture into the ground-based detectors' view to the death spiral of compact objects into massive black holes, and many sources in between. Central to realising LISA's discovery potential are waveform models, the theoretical and phenomenological predictions of the pattern of gravitational waves that these sources emit. This white paper is presented on behalf of the Waveform Working Group for the LISA Consortium. It provides a review of the current state of waveform models for LISA sources, and describes the significant challenges that must yet be overcome.
△ Less
Submitted 20 December, 2023; v1 submitted 2 November, 2023;
originally announced November 2023.
-
Dynamics of an SIR epidemic model with limited medical resources, revisited and corrected
Authors:
Rim Adenane,
Florin Avram,
Mohamed El Fatini,
R. P. Gupta
Abstract:
This paper generalizes and corrects a famous paper (more than 200 citations) concerning Hopf and Bogdanov-Takens bifurcations due to L. Zhou and M. Fan, "Dynamics of an SIR epidemic model with limited medical resources revisited", in which we discovered a significant numerical error. Importantly, unlike the paper of Zhou and Fan and several other papers that followed them, we offer a notebook wher…
▽ More
This paper generalizes and corrects a famous paper (more than 200 citations) concerning Hopf and Bogdanov-Takens bifurcations due to L. Zhou and M. Fan, "Dynamics of an SIR epidemic model with limited medical resources revisited", in which we discovered a significant numerical error. Importantly, unlike the paper of Zhou and Fan and several other papers that followed them, we offer a notebook where the reader may recover all the results and modify them for analyzing similar models. Our calculations lead to the introduction of some interesting symbolic objects, "Groebner eliminated traces and determinants" - see (4.5), (4.6), which seem to have appeared here for the first time and which might be of independent interest. We hope our paper might serve as yet another alarm bell regarding the importance of accompanying papers involving complicated hand computations by electronic notebooks.
△ Less
Submitted 30 October, 2023;
originally announced October 2023.
-
TST$^\mathrm{R}$: Target Similarity Tuning Meets the Real World
Authors:
Anirudh Khatry,
Sumit Gulwani,
Priyanshu Gupta,
Vu Le,
Ananya Singha,
Mukul Singh,
Gust Verbruggen
Abstract:
Target similarity tuning (TST) is a method of selecting relevant examples in natural language (NL) to code generation through large language models (LLMs) to improve performance. Its goal is to adapt a sentence embedding model to have the similarity between two NL inputs match the similarity between their associated code outputs. In this paper, we propose different methods to apply and improve TST…
▽ More
Target similarity tuning (TST) is a method of selecting relevant examples in natural language (NL) to code generation through large language models (LLMs) to improve performance. Its goal is to adapt a sentence embedding model to have the similarity between two NL inputs match the similarity between their associated code outputs. In this paper, we propose different methods to apply and improve TST in the real world. First, we replace the sentence transformer with embeddings from a larger model, which reduces sensitivity to the language distribution and thus provides more flexibility in synthetic generation of examples, and we train a tiny model that transforms these embeddings to a space where embedding similarity matches code similarity, which allows the model to remain a black box and only requires a few matrix multiplications at inference time. Second, we show how to efficiently select a smaller number of training examples to train the TST model. Third, we introduce a ranking-based evaluation for TST that does not require end-to-end code generation experiments, which can be expensive to perform.
△ Less
Submitted 28 October, 2023; v1 submitted 26 October, 2023;
originally announced October 2023.
-
Observed Trends in FRB Population and Bi-modality in the Luminosity Density Distribution
Authors:
Nidhi Saini,
Patrick Das Gupta
Abstract:
Fast radio bursts (FRBs) are radio transients of extragalactic origin lasting for about a few to several milli-seconds. We have analyzed both non-CHIME and CHIME FRB data. To circumvent the absence of measured fluence and flux density of FRBs belonging to the CHIME catalog, we have devised a novel approach that utilizes the ratio of the lower limits of the flux density $S_{ν_O}$ to the fluence…
▽ More
Fast radio bursts (FRBs) are radio transients of extragalactic origin lasting for about a few to several milli-seconds. We have analyzed both non-CHIME and CHIME FRB data. To circumvent the absence of measured fluence and flux density of FRBs belonging to the CHIME catalog, we have devised a novel approach that utilizes the ratio of the lower limits of the flux density $S_{ν_O}$ to the fluence $F_{ν_O}$ of individual FRB events to construct several parameters to investigate the presence of underlying trends in the FRB population drawn from both CHIME and non-CHIME data sets. One of these parameters involves true brightness temperature as well as energy density, despite not knowing the actual size of the FRB emission region. Our first robust conclusion is that the non-CHIME FRBs fall under two broad categories - those with luminosity density less than about $4\times 10^{33} $ erg/s/Hz at the frequency 300 MHz and those having larger luminosity density values than this. Assuming that FRBs are caused by magnetar glitches, we have discussed in this paper a simple physical model, incorporating an abrupt change in the light cylinder radius of an oblique rotator, to address the existence of these two categories.
△ Less
Submitted 21 April, 2024; v1 submitted 18 October, 2023;
originally announced October 2023.
-
Mori-Zwanzig latent space Koopman closure for nonlinear autoencoder
Authors:
Priyam Gupta,
Peter J. Schmid,
Denis Sipp,
Taraneh Sayadi,
Georgios Rigas
Abstract:
The Koopman operator presents an attractive approach to achieve global linearization of nonlinear systems, making it a valuable method for simplifying the understanding of complex dynamics. While data-driven methodologies have exhibited promise in approximating finite Koopman operators, they grapple with various challenges, such as the judicious selection of observables, dimensionality reduction,…
▽ More
The Koopman operator presents an attractive approach to achieve global linearization of nonlinear systems, making it a valuable method for simplifying the understanding of complex dynamics. While data-driven methodologies have exhibited promise in approximating finite Koopman operators, they grapple with various challenges, such as the judicious selection of observables, dimensionality reduction, and the ability to predict complex system behaviors accurately. This study presents a novel approach termed Mori-Zwanzig autoencoder (MZ-AE) to robustly approximate the Koopman operator in low-dimensional spaces. The proposed method leverages a nonlinear autoencoder to extract key observables for approximating a finite invariant Koopman subspace and integrates a non-Markovian correction mechanism using the Mori-Zwanzig formalism. Consequently, this approach yields a closed representation of dynamics within the latent manifold of the nonlinear autoencoder, thereby enhancing the precision and stability of the Koopman operator approximation. Demonstrations showcase the technique's ability to capture regime transitions in the flow around a cylinder. It also provides a low dimensional approximation for Kuramoto-Sivashinsky with promising short-term predictability and robust long-term statistical performance. By bridging the gap between data-driven techniques and the mathematical foundations of Koopman theory, MZ-AE offers a promising avenue for improved understanding and prediction of complex nonlinear dynamics.
△ Less
Submitted 16 April, 2024; v1 submitted 16 October, 2023;
originally announced October 2023.
-
Cost-Driven Hardware-Software Co-Optimization of Machine Learning Pipelines
Authors:
Ravit Sharma,
Wojciech Romaszkan,
Feiqian Zhu,
Puneet Gupta,
Ankur Mehta
Abstract:
Researchers have long touted a vision of the future enabled by a proliferation of internet-of-things devices, including smart sensors, homes, and cities. Increasingly, embedding intelligence in such devices involves the use of deep neural networks. However, their storage and processing requirements make them prohibitive for cheap, off-the-shelf platforms. Overcoming those requirements is necessary…
▽ More
Researchers have long touted a vision of the future enabled by a proliferation of internet-of-things devices, including smart sensors, homes, and cities. Increasingly, embedding intelligence in such devices involves the use of deep neural networks. However, their storage and processing requirements make them prohibitive for cheap, off-the-shelf platforms. Overcoming those requirements is necessary for enabling widely-applicable smart devices. While many ways of making models smaller and more efficient have been developed, there is a lack of understanding of which ones are best suited for particular scenarios. More importantly for edge platforms, those choices cannot be analyzed in isolation from cost and user experience. In this work, we holistically explore how quantization, model scaling, and multi-modality interact with system components such as memory, sensors, and processors. We perform this hardware/software co-design from the cost, latency, and user-experience perspective, and develop a set of guidelines for optimal system design and model deployment for the most cost-constrained platforms. We demonstrate our approach using an end-to-end, on-device, biometric user authentication system using a $20 ESP-EYE board.
△ Less
Submitted 19 October, 2023; v1 submitted 11 October, 2023;
originally announced October 2023.
-
Self-induced inverse spin Hall effect in La$_{0.67}$Sr$_{0.33}$MnO$_{3}$ films
Authors:
Pushpendra Gupta,
In Jun Park,
Anupama Swain,
Abhisek Mishra,
Vivek P. Amin,
Subhankar Bedanta
Abstract:
The efficient generation of spin currents and spin torques via spin-orbit coupling is an important goal of spintronics research. One crucial metric for spin current generation is the spin Hall angle, which is the ratio of the spin Hall current to the transversely flowing charge current. A typical approach to measure the spin Hall angle in nonmagnetic materials is to generate spin currents via spin…
▽ More
The efficient generation of spin currents and spin torques via spin-orbit coupling is an important goal of spintronics research. One crucial metric for spin current generation is the spin Hall angle, which is the ratio of the spin Hall current to the transversely flowing charge current. A typical approach to measure the spin Hall angle in nonmagnetic materials is to generate spin currents via spin pumping in an adjacent ferromagnetic layer and measure the transverse voltage from the inverse spin Hall effect in the nonmagnetic layer. However, given that the spin Hall effect also occurs in ferromagnets, single ferromagnetic layers could generate a self-induced transverse voltage during spin pumping as well. Here we show that manganite based La$_{0.67}$Sr$_{0.33}$MnO$_{3}$ (LSMO) films deposited by pulsed laser deposition exhibit a significant self-induced inverse spin Hall voltage while undergoing spin pumping. We observe efficient spin to charge conversion in the LSMO films via the inverse spin Hall effect. A spin pumping voltage of 1.86 $μ$V is observed in the LSMO (12 nm) film. Using density functional theory and the Kubo formalism, we calculate the intrinsic spin current conductivities of these films and show that they are in reasonable agreement with the experimental measurements.
△ Less
Submitted 10 October, 2023;
originally announced October 2023.
-
Augmented Embeddings for Custom Retrievals
Authors:
Anirudh Khatry,
Yasharth Bajpai,
Priyanshu Gupta,
Sumit Gulwani,
Ashish Tiwari
Abstract:
Information retrieval involves selecting artifacts from a corpus that are most relevant to a given search query. The flavor of retrieval typically used in classical applications can be termed as homogeneous and relaxed, where queries and corpus elements are both natural language (NL) utterances (homogeneous) and the goal is to pick most relevant elements from the corpus in the Top-K, where K is la…
▽ More
Information retrieval involves selecting artifacts from a corpus that are most relevant to a given search query. The flavor of retrieval typically used in classical applications can be termed as homogeneous and relaxed, where queries and corpus elements are both natural language (NL) utterances (homogeneous) and the goal is to pick most relevant elements from the corpus in the Top-K, where K is large, such as 10, 25, 50 or even 100 (relaxed). Recently, retrieval is being used extensively in preparing prompts for large language models (LLMs) to enable LLMs to perform targeted tasks. These new applications of retrieval are often heterogeneous and strict -- the queries and the corpus contain different kinds of entities, such as NL and code, and there is a need for improving retrieval at Top-K for small values of K, such as K=1 or 3 or 5. Current dense retrieval techniques based on pretrained embeddings provide a general-purpose and powerful approach for retrieval, but they are oblivious to task-specific notions of similarity of heterogeneous artifacts. We introduce Adapted Dense Retrieval, a mechanism to transform embeddings to enable improved task-specific, heterogeneous and strict retrieval. Adapted Dense Retrieval works by learning a low-rank residual adaptation of the pretrained black-box embedding. We empirically validate our approach by showing improvements over the state-of-the-art general-purpose embeddings-based baseline.
△ Less
Submitted 8 October, 2023;
originally announced October 2023.
-
Hypersurface Convexity and Extension of Kähler Forms
Authors:
Blake J. Boudreaux,
Purvi Gupta,
Rasul Shafikov
Abstract:
The following generalization of a result of S. Nemirovski is proved: if $X$ is either a projective or a Stein manifold and $K\subset X$ is a compact sublevel set of a strictly plurisubharmonic function $\varphi$ defined in a neighborhood of $K$, then $X\setminus K$ is a union of positive divisors if and only if $dd^c\varphi$ extends to a Hodge form on $X$. For an arbitrary compact subset…
▽ More
The following generalization of a result of S. Nemirovski is proved: if $X$ is either a projective or a Stein manifold and $K\subset X$ is a compact sublevel set of a strictly plurisubharmonic function $\varphi$ defined in a neighborhood of $K$, then $X\setminus K$ is a union of positive divisors if and only if $dd^c\varphi$ extends to a Hodge form on $X$. For an arbitrary compact subset $K\subsetneq X$, this gives that $X\setminus K$ is a union of positive divisors if and only if $K$ admits a neighbourhood basis of sublevel sets of strictly plurisubharmonic functions with the $dd^c$-extension property.
△ Less
Submitted 27 October, 2023; v1 submitted 3 October, 2023;
originally announced October 2023.
-
Portfolio Choice In Dynamic Thin Markets: Merton Meets Cournot
Authors:
Puru Gupta,
Saul D. Jacka
Abstract:
We consider an augmented version of Merton's portfolio choice problem, where trading by large investors influences the price of underlying financial asset leading to strategic interaction among investors, with investors deciding their trading rates independently and simultaneously at each instant, in the spirit of dynamic Cournot competition, modelled here as a non-zero sum singular stochastic dif…
▽ More
We consider an augmented version of Merton's portfolio choice problem, where trading by large investors influences the price of underlying financial asset leading to strategic interaction among investors, with investors deciding their trading rates independently and simultaneously at each instant, in the spirit of dynamic Cournot competition, modelled here as a non-zero sum singular stochastic differential game. We establish an equivalence result for the value functions of an investor's best-response problem, which is a singular stochastic optimal control problem, and an auxiliary classical stochastic optimal control problem by exploiting the invariance of the value functions with respect to a diffeomorphic integral flow associated with the drift coefficient of the best-response problem. Under certain regularity conditions, we show that the optimal trajectories of the two control problems coincide, which permits analytical characterization of Markov-Nash equilibrium portfolios. For the special case when asset price volatility is constant, we show that the unique Nash equilibrium is deterministic, and provide a closed-form solution which illuminates the role of imperfect competition in explaining the excessive trade puzzle.
△ Less
Submitted 27 September, 2023;
originally announced September 2023.
-
Quantum-Enhanced Parameter Estimation Without Entanglement
Authors:
Pragati Gupta
Abstract:
Entanglement is generally considered necessary for achieving the Heisenberg limit in quantum metrology. We construct analogues of Dicke and GHZ states on a single $N+1$ dimensional qudit that achieve precision equivalent to symmetrically entangled states on $N$ qubits, showing that entanglement is not necessary for going beyond the standard quantum limit. We define a measure of non-classicality ba…
▽ More
Entanglement is generally considered necessary for achieving the Heisenberg limit in quantum metrology. We construct analogues of Dicke and GHZ states on a single $N+1$ dimensional qudit that achieve precision equivalent to symmetrically entangled states on $N$ qubits, showing that entanglement is not necessary for going beyond the standard quantum limit. We define a measure of non-classicality based on quantum Fisher information and estimate the achievable precision, suggesting a close relationship between non-classical states and metrological power of qudits. Our work offers an exponential reduction in the physical resources required for quantum-enhanced parameter estimation, making it accessible on any quantum system with a high-dimensional Hilbert space.
△ Less
Submitted 25 September, 2023;
originally announced September 2023.
-
Chaotic von Zeipel-Lidov-Kozai Oscillations of Binary System around Rotating Supermassive Black Hole
Authors:
Kei-ichi Maeda,
Priti Gupta,
Hirotada Okawa
Abstract:
In this paper, we investigate the dynamics of a binary system that orbits a rotating supermassive black hole. Our approach employs Fermi-Walker transport to construct a local inertial reference frame, and to set up a Newtonian binary system. We consider a scenario in which a circular geodesic observer is positioned around a Kerr black hole, and thereby derive the equations of motion governing the…
▽ More
In this paper, we investigate the dynamics of a binary system that orbits a rotating supermassive black hole. Our approach employs Fermi-Walker transport to construct a local inertial reference frame, and to set up a Newtonian binary system. We consider a scenario in which a circular geodesic observer is positioned around a Kerr black hole, and thereby derive the equations of motion governing the binary system. To eliminate the interaction terms between the center of mass (CM) of the binary and its relative coordinates, we introduce a small acceleration for the observer. This adjustment leads to the CM closely following the observer's orbit, deviating from a circular geodesic. Here, we first focus on elucidating the stability conditions in a hierarchical triple system. Subsequently, we discuss the phenomenon of von Zeipel-Lidov-Kozai (vZLK) oscillations, which manifest when the binary system is compact and the initial inclination exceeds a critical angle. In hard binary systems, these oscillations exhibit regular behavior, while in soft binary systems, they exhibit a chaotic character, characterized by irregular periods and amplitudes, albeit remaining stable. Additionally, we observe an orbital flip under circumstances of large initial inclination. As for the motion of the CM, we observe deviations from a purely circular orbit that transform into stable yet chaotic oscillations characterized by minute amplitude variations.
△ Less
Submitted 3 October, 2023; v1 submitted 21 September, 2023;
originally announced September 2023.
-
Gall Bladder Cancer Detection from US Images with Only Image Level Labels
Authors:
Soumen Basu,
Ashish Papanai,
Mayank Gupta,
Pankaj Gupta,
Chetan Arora
Abstract:
Automated detection of Gallbladder Cancer (GBC) from Ultrasound (US) images is an important problem, which has drawn increased interest from researchers. However, most of these works use difficult-to-acquire information such as bounding box annotations or additional US videos. In this paper, we focus on GBC detection using only image-level labels. Such annotation is usually available based on the…
▽ More
Automated detection of Gallbladder Cancer (GBC) from Ultrasound (US) images is an important problem, which has drawn increased interest from researchers. However, most of these works use difficult-to-acquire information such as bounding box annotations or additional US videos. In this paper, we focus on GBC detection using only image-level labels. Such annotation is usually available based on the diagnostic report of a patient, and do not require additional annotation effort from the physicians. However, our analysis reveals that it is difficult to train a standard image classification model for GBC detection. This is due to the low inter-class variance (a malignant region usually occupies only a small portion of a US image), high intra-class variance (due to the US sensor capturing a 2D slice of a 3D object leading to large viewpoint variations), and low training data availability. We posit that even when we have only the image level label, still formulating the problem as object detection (with bounding box output) helps a deep neural network (DNN) model focus on the relevant region of interest. Since no bounding box annotations is available for training, we pose the problem as weakly supervised object detection (WSOD). Motivated by the recent success of transformer models in object detection, we train one such model, DETR, using multi-instance-learning (MIL) with self-supervised instance selection to suit the WSOD task. Our proposed method demonstrates an improvement of AP and detection sensitivity over the SOTA transformer-based and CNN-based WSOD methods. Project page is at https://gbc-iitd.github.io/wsod-gbc
△ Less
Submitted 11 September, 2023;
originally announced September 2023.
-
A Joint Fermi-GBM and Swift-BAT Analysis of Gravitational-Wave Candidates from the Third Gravitational-wave Observing Run
Authors:
C. Fletcher,
J. Wood,
R. Hamburg,
P. Veres,
C. M. Hui,
E. Bissaldi,
M. S. Briggs,
E. Burns,
W. H. Cleveland,
M. M. Giles,
A. Goldstein,
B. A. Hristov,
D. Kocevski,
S. Lesage,
B. Mailyan,
C. Malacaria,
S. Poolakkil,
A. von Kienlin,
C. A. Wilson-Hodge,
The Fermi Gamma-ray Burst Monitor Team,
M. Crnogorčević,
J. DeLaunay,
A. Tohuvavohu,
R. Caputo,
S. B. Cenko
, et al. (1674 additional authors not shown)
Abstract:
We present Fermi Gamma-ray Burst Monitor (Fermi-GBM) and Swift Burst Alert Telescope (Swift-BAT) searches for gamma-ray/X-ray counterparts to gravitational wave (GW) candidate events identified during the third observing run of the Advanced LIGO and Advanced Virgo detectors. Using Fermi-GBM on-board triggers and sub-threshold gamma-ray burst (GRB) candidates found in the Fermi-GBM ground analyses,…
▽ More
We present Fermi Gamma-ray Burst Monitor (Fermi-GBM) and Swift Burst Alert Telescope (Swift-BAT) searches for gamma-ray/X-ray counterparts to gravitational wave (GW) candidate events identified during the third observing run of the Advanced LIGO and Advanced Virgo detectors. Using Fermi-GBM on-board triggers and sub-threshold gamma-ray burst (GRB) candidates found in the Fermi-GBM ground analyses, the Targeted Search and the Untargeted Search, we investigate whether there are any coincident GRBs associated with the GWs. We also search the Swift-BAT rate data around the GW times to determine whether a GRB counterpart is present. No counterparts are found. Using both the Fermi-GBM Targeted Search and the Swift-BAT search, we calculate flux upper limits and present joint upper limits on the gamma-ray luminosity of each GW. Given these limits, we constrain theoretical models for the emission of gamma-rays from binary black hole mergers.
△ Less
Submitted 25 August, 2023;
originally announced August 2023.
-
Project Aria: A New Tool for Egocentric Multi-Modal AI Research
Authors:
Jakob Engel,
Kiran Somasundaram,
Michael Goesele,
Albert Sun,
Alexander Gamino,
Andrew Turner,
Arjang Talattof,
Arnie Yuan,
Bilal Souti,
Brighid Meredith,
Cheng Peng,
Chris Sweeney,
Cole Wilson,
Dan Barnes,
Daniel DeTone,
David Caruso,
Derek Valleroy,
Dinesh Ginjupalli,
Duncan Frost,
Edward Miller,
Elias Mueggler,
Evgeniy Oleinik,
Fan Zhang,
Guruprasad Somasundaram,
Gustavo Solaira
, et al. (49 additional authors not shown)
Abstract:
Egocentric, multi-modal data as available on future augmented reality (AR) devices provides unique challenges and opportunities for machine perception. These future devices will need to be all-day wearable in a socially acceptable form-factor to support always available, context-aware and personalized AI applications. Our team at Meta Reality Labs Research built the Aria device, an egocentric, mul…
▽ More
Egocentric, multi-modal data as available on future augmented reality (AR) devices provides unique challenges and opportunities for machine perception. These future devices will need to be all-day wearable in a socially acceptable form-factor to support always available, context-aware and personalized AI applications. Our team at Meta Reality Labs Research built the Aria device, an egocentric, multi-modal data recording and streaming device with the goal to foster and accelerate research in this area. In this paper, we describe the Aria device hardware including its sensor configuration and the corresponding software tools that enable recording and processing of such data.
△ Less
Submitted 1 October, 2023; v1 submitted 24 August, 2023;
originally announced August 2023.
-
EgoBlur: Responsible Innovation in Aria
Authors:
Nikhil Raina,
Guruprasad Somasundaram,
Kang Zheng,
Sagar Miglani,
Steve Saarinen,
Jeff Meissner,
Mark Schwesinger,
Luis Pesqueira,
Ishita Prasad,
Edward Miller,
Prince Gupta,
Mingfei Yan,
Richard Newcombe,
Carl Ren,
Omkar M Parkhi
Abstract:
Project Aria pushes the frontiers of Egocentric AI with large-scale real-world data collection using purposely designed glasses with privacy first approach. To protect the privacy of bystanders being recorded by the glasses, our research protocols are designed to ensure recorded video is processed by an AI anonymization model that removes bystander faces and vehicle license plates. Detected face a…
▽ More
Project Aria pushes the frontiers of Egocentric AI with large-scale real-world data collection using purposely designed glasses with privacy first approach. To protect the privacy of bystanders being recorded by the glasses, our research protocols are designed to ensure recorded video is processed by an AI anonymization model that removes bystander faces and vehicle license plates. Detected face and license plate regions are processed with a Gaussian blur such that these personal identification information (PII) regions are obscured. This process helps to ensure that anonymized versions of the video is retained for research purposes. In Project Aria, we have developed a state-of-the-art anonymization system EgoBlur. In this paper, we present extensive analysis of EgoBlur on challenging datasets comparing its performance with other state-of-the-art systems from industry and academia including extensive Responsible AI analysis on recently released Casual Conversations V2 dataset.
△ Less
Submitted 6 September, 2023; v1 submitted 24 August, 2023;
originally announced August 2023.
-
Learning Representations on Logs for AIOps
Authors:
Pranjal Gupta,
Harshit Kumar,
Debanjana Kar,
Karan Bhukar,
Pooja Aggarwal,
Prateeti Mohapatra
Abstract:
AI for IT Operations (AIOps) is a powerful platform that Site Reliability Engineers (SREs) use to automate and streamline operational workflows with minimal human intervention. Automated log analysis is a critical task in AIOps as it provides key insights for SREs to identify and address ongoing faults. Tasks such as log format detection, log classification, and log parsing are key components of a…
▽ More
AI for IT Operations (AIOps) is a powerful platform that Site Reliability Engineers (SREs) use to automate and streamline operational workflows with minimal human intervention. Automated log analysis is a critical task in AIOps as it provides key insights for SREs to identify and address ongoing faults. Tasks such as log format detection, log classification, and log parsing are key components of automated log analysis. Most of these tasks require supervised learning; however, there are multiple challenges due to limited labelled log data and the diverse nature of log data. Large Language Models (LLMs) such as BERT and GPT3 are trained using self-supervision on a vast amount of unlabeled data. These models provide generalized representations that can be effectively used for various downstream tasks with limited labelled data. Motivated by the success of LLMs in specific domains like science and biology, this paper introduces a LLM for log data which is trained on public and proprietary log data. The results of our experiments demonstrate that the proposed LLM outperforms existing models on multiple downstream tasks. In summary, AIOps powered by LLMs offers an efficient and effective solution for automating log analysis tasks and enabling SREs to focus on higher-level tasks. Our proposed LLM, trained on public and proprietary log data, offers superior performance on multiple downstream tasks, making it a valuable addition to the AIOps platform.
△ Less
Submitted 18 August, 2023;
originally announced August 2023.
-
AffectEcho: Speaker Independent and Language-Agnostic Emotion and Affect Transfer for Speech Synthesis
Authors:
Hrishikesh Viswanath,
Aneesh Bhattacharya,
Pascal Jutras-Dubé,
Prerit Gupta,
Mridu Prashanth,
Yashvardhan Khaitan,
Aniket Bera
Abstract:
Affect is an emotional characteristic encompassing valence, arousal, and intensity, and is a crucial attribute for enabling authentic conversations. While existing text-to-speech (TTS) and speech-to-speech systems rely on strength embedding vectors and global style tokens to capture emotions, these models represent emotions as a component of style or represent them in discrete categories. We propo…
▽ More
Affect is an emotional characteristic encompassing valence, arousal, and intensity, and is a crucial attribute for enabling authentic conversations. While existing text-to-speech (TTS) and speech-to-speech systems rely on strength embedding vectors and global style tokens to capture emotions, these models represent emotions as a component of style or represent them in discrete categories. We propose AffectEcho, an emotion translation model, that uses a Vector Quantized codebook to model emotions within a quantized space featuring five levels of affect intensity to capture complex nuances and subtle differences in the same emotion. The quantized emotional embeddings are implicitly derived from spoken speech samples, eliminating the need for one-hot vectors or explicit strength embeddings. Experimental results demonstrate the effectiveness of our approach in controlling the emotions of generated speech while preserving identity, style, and emotional cadence unique to each speaker. We showcase the language-independent emotion modeling capability of the quantized emotional embeddings learned from a bilingual (English and Chinese) speech corpus with an emotion transfer task from a reference speech to a target speech. We achieve state-of-art results on both qualitative and quantitative metrics.
△ Less
Submitted 16 August, 2023;
originally announced August 2023.
-
Safeguarding Learning-based Control for Smart Energy Systems with Sampling Specifications
Authors:
Chih-Hong Cheng,
Venkatesh Prasad Venkataramanan,
Pragya Kirti Gupta,
Yun-Fei Hsu,
Simon Burton
Abstract:
We study challenges using reinforcement learning in controlling energy systems, where apart from performance requirements, one has additional safety requirements such as avoiding blackouts. We detail how these safety requirements in real-time temporal logic can be strengthened via discretization into linear temporal logic (LTL), such that the satisfaction of the LTL formulae implies the satisfacti…
▽ More
We study challenges using reinforcement learning in controlling energy systems, where apart from performance requirements, one has additional safety requirements such as avoiding blackouts. We detail how these safety requirements in real-time temporal logic can be strengthened via discretization into linear temporal logic (LTL), such that the satisfaction of the LTL formulae implies the satisfaction of the original safety requirements. The discretization enables advanced engineering methods such as synthesizing shields for safe reinforcement learning as well as formal verification, where for statistical model checking, the probabilistic guarantee acquired by LTL model checking forms a lower bound for the satisfaction of the original real-time safety requirements.
△ Less
Submitted 11 August, 2023;
originally announced August 2023.
-
Search for Eccentric Black Hole Coalescences during the Third Observing Run of LIGO and Virgo
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
A. G. Abac,
R. Abbott,
H. Abe,
F. Acernese,
K. Ackley,
C. Adamcewicz,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
O. D. Aguiar,
I. Aguilar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi
, et al. (1750 additional authors not shown)
Abstract:
Despite the growing number of confident binary black hole coalescences observed through gravitational waves so far, the astrophysical origin of these binaries remains uncertain. Orbital eccentricity is one of the clearest tracers of binary formation channels. Identifying binary eccentricity, however, remains challenging due to the limited availability of gravitational waveforms that include effect…
▽ More
Despite the growing number of confident binary black hole coalescences observed through gravitational waves so far, the astrophysical origin of these binaries remains uncertain. Orbital eccentricity is one of the clearest tracers of binary formation channels. Identifying binary eccentricity, however, remains challenging due to the limited availability of gravitational waveforms that include effects of eccentricity. Here, we present observational results for a waveform-independent search sensitive to eccentric black hole coalescences, covering the third observing run (O3) of the LIGO and Virgo detectors. We identified no new high-significance candidates beyond those that were already identified with searches focusing on quasi-circular binaries. We determine the sensitivity of our search to high-mass (total mass $M>70$ $M_\odot$) binaries covering eccentricities up to 0.3 at 15 Hz orbital frequency, and use this to compare model predictions to search results. Assuming all detections are indeed quasi-circular, for our fiducial population model, we place an upper limit for the merger rate density of high-mass binaries with eccentricities $0 < e \leq 0.3$ at $0.33$ Gpc$^{-3}$ yr$^{-1}$ at 90\% confidence level.
△ Less
Submitted 7 August, 2023;
originally announced August 2023.
-
Magnetic Proximity induced efficient charge-to-spin conversion in large area PtSe$_{2}$/Ni$_{80}$Fe$_{20}$ heterostructures
Authors:
Richa Mudgal,
Alka Jakhar,
Pankhuri Gupta,
Ram Singh Yadav,
B. Biswal,
P. Sahu,
Himanshu Bangar,
Akash Kumar,
Niru Chowdhury,
Biswarup Satpati,
B. R. K. Nanda,
S. Satpathy,
Samaresh Das,
P. K. Muduli
Abstract:
As a topological Dirac semimetal with controllable spin-orbit coupling and conductivity, PtSe$_2$, a transition-metal dichalcogenide, is a promising material for several applications from optoelectric to sensors. However, its potential for spintronics applications is yet to be explored. In this work, we demonstrate that PtSe$_{2}$/Ni$_{80}$Fe$_{20}$ heterostructure can generate a large damping-lik…
▽ More
As a topological Dirac semimetal with controllable spin-orbit coupling and conductivity, PtSe$_2$, a transition-metal dichalcogenide, is a promising material for several applications from optoelectric to sensors. However, its potential for spintronics applications is yet to be explored. In this work, we demonstrate that PtSe$_{2}$/Ni$_{80}$Fe$_{20}$ heterostructure can generate a large damping-like current-induced spin-orbit torques (SOT), despite the absence of spin-splitting in bulk PtSe$_{2}$. The efficiency of charge-to-spin conversion is found to be $(-0.1 \pm 0.02)$~nm$^{-1}$ in PtSe$_{2}$/Ni$_{80}$Fe$_{20}$, which is three times that of the control sample, Ni$_{80}$Fe$_{20}$/Pt. Our band structure calculations show that the SOT due to the PtSe$_2$ arises from an unexpectedly large spin splitting in the interfacial region of PtSe$_2$ introduced by the proximity magnetic field of the Ni$_{80}$Fe$_{20}$ layer. Our results open up the possibilities of using large-area PtSe$_{2}$ for energy-efficient nanoscale devices by utilizing the proximity-induced SOT.
△ Less
Submitted 21 July, 2023;
originally announced July 2023.
-
Adversarial Likelihood Estimation With One-Way Flows
Authors:
Omri Ben-Dov,
Pravir Singh Gupta,
Victoria Abrevaya,
Michael J. Black,
Partha Ghosh
Abstract:
Generative Adversarial Networks (GANs) can produce high-quality samples, but do not provide an estimate of the probability density around the samples. However, it has been noted that maximizing the log-likelihood within an energy-based setting can lead to an adversarial framework where the discriminator provides unnormalized density (often called energy). We further develop this perspective, incor…
▽ More
Generative Adversarial Networks (GANs) can produce high-quality samples, but do not provide an estimate of the probability density around the samples. However, it has been noted that maximizing the log-likelihood within an energy-based setting can lead to an adversarial framework where the discriminator provides unnormalized density (often called energy). We further develop this perspective, incorporate importance sampling, and show that 1) Wasserstein GAN performs a biased estimate of the partition function, and we propose instead to use an unbiased estimator; and 2) when optimizing for likelihood, one must maximize generator entropy. This is hypothesized to provide a better mode coverage. Different from previous works, we explicitly compute the density of the generated samples. This is the key enabler to designing an unbiased estimator of the partition function and computation of the generator entropy term. The generator density is obtained via a new type of flow network, called one-way flow network, that is less constrained in terms of architecture, as it does not require a tractable inverse function. Our experimental results show that our method converges faster, produces comparable sample quality to GANs with similar architecture, successfully avoids over-fitting to commonly used datasets and produces smooth low-dimensional latent representations of the training data.
△ Less
Submitted 2 October, 2023; v1 submitted 19 July, 2023;
originally announced July 2023.
-
Search Me Knot, Render Me Knot: Embedding Search and Differentiable Rendering of Knots in 3D
Authors:
Aalok Gangopadhyay,
Paras Gupta,
Tarun Sharma,
Prajwal Singh,
Shanmuganathan Raman
Abstract:
We introduce the problem of knot-based inverse perceptual art. Given multiple target images and their corresponding viewing configurations, the objective is to find a 3D knot-based tubular structure whose appearance resembles the target images when viewed from the specified viewing configurations. To solve this problem, we first design a differentiable rendering algorithm for rendering tubular kno…
▽ More
We introduce the problem of knot-based inverse perceptual art. Given multiple target images and their corresponding viewing configurations, the objective is to find a 3D knot-based tubular structure whose appearance resembles the target images when viewed from the specified viewing configurations. To solve this problem, we first design a differentiable rendering algorithm for rendering tubular knots embedded in 3D for arbitrary perspective camera configurations. Utilizing this differentiable rendering algorithm, we search over the space of knot configurations to find the ideal knot embedding. We represent the knot embeddings via homeomorphisms of the desired template knot, where the homeomorphisms are parametrized by the weights of an invertible neural network. Our approach is fully differentiable, making it possible to find the ideal 3D tubular structure for the desired perceptual art using gradient-based optimization. We propose several loss functions that impose additional physical constraints, enforcing that the tube is free of self-intersection, lies within a predefined region in space, satisfies the physical bending limits of the tube material and the material cost is within a specified budget. We demonstrate through results that our knot representation is highly expressive and gives impressive results even for challenging target images in both single view as well as multiple view constraints. Through extensive ablation study we show that each of the proposed loss function is effective in ensuring physical realizability. We construct a real world 3D-printed object to demonstrate the practical utility of our approach. To the best of our knowledge, we are the first to propose a fully differentiable optimization framework for knot-based inverse perceptual art.
△ Less
Submitted 19 August, 2023; v1 submitted 17 July, 2023;
originally announced July 2023.
-
AI For Global Climate Cooperation 2023 Competition Proceedings
Authors:
Yoshua Bengio,
Prateek Gupta,
Lu Li,
Soham Phade,
Sunil Srinivasa,
Andrew Williams,
Tianyu Zhang,
Yang Zhang,
Stephan Zheng
Abstract:
The international community must collaborate to mitigate climate change and sustain economic growth. However, collaboration is hard to achieve, partly because no global authority can ensure compliance with international climate agreements. Combining AI with climate-economic simulations offers a promising solution to design international frameworks, including negotiation protocols and climate agree…
▽ More
The international community must collaborate to mitigate climate change and sustain economic growth. However, collaboration is hard to achieve, partly because no global authority can ensure compliance with international climate agreements. Combining AI with climate-economic simulations offers a promising solution to design international frameworks, including negotiation protocols and climate agreements, that promote and incentivize collaboration. In addition, these frameworks should also have policy goals fulfillment, and sustained commitment, taking into account climate-economic dynamics and strategic behaviors. These challenges require an interdisciplinary approach across machine learning, economics, climate science, law, policy, ethics, and other fields.
Towards this objective, we organized AI for Global Climate Cooperation, a Mila competition in which teams submitted proposals and analyses of international frameworks, based on (modifications of) RICE-N, an AI-driven integrated assessment model (IAM). In particular, RICE-N supports modeling regional decision-making using AI agents. Furthermore, the IAM then models the climate-economic impact of those decisions into the future.
Whereas the first track focused only on performance metrics, the proposals submitted to the second track were evaluated both quantitatively and qualitatively. The quantitative evaluation focused on a combination of (i) the degree of mitigation of global temperature rise and (ii) the increase in economic productivity. On the other hand, an interdisciplinary panel of human experts in law, policy, sociology, economics and environmental science, evaluated the solutions qualitatively. In particular, the panel considered the effectiveness, simplicity, feasibility, ethics, and notions of climate justice of the protocols. In the third track, the participants were asked to critique and improve RICE-N.
△ Less
Submitted 10 July, 2023;
originally announced July 2023.
-
A Dataset of Inertial Measurement Units for Handwritten English Alphabets
Authors:
Hari Prabhat Gupta,
Rahul Mishra
Abstract:
This paper presents an end-to-end methodology for collecting datasets to recognize handwritten English alphabets by utilizing Inertial Measurement Units (IMUs) and leveraging the diversity present in the Indian writing style. The IMUs are utilized to capture the dynamic movement patterns associated with handwriting, enabling more accurate recognition of alphabets. The Indian context introduces var…
▽ More
This paper presents an end-to-end methodology for collecting datasets to recognize handwritten English alphabets by utilizing Inertial Measurement Units (IMUs) and leveraging the diversity present in the Indian writing style. The IMUs are utilized to capture the dynamic movement patterns associated with handwriting, enabling more accurate recognition of alphabets. The Indian context introduces various challenges due to the heterogeneity in writing styles across different regions and languages. By leveraging this diversity, the collected dataset and the collection system aim to achieve higher recognition accuracy. Some preliminary experimental results demonstrate the effectiveness of the dataset in accurately recognizing handwritten English alphabet in the Indian context. This research can be extended and contributes to the field of pattern recognition and offers valuable insights for developing improved systems for handwriting recognition, particularly in diverse linguistic and cultural contexts.
△ Less
Submitted 5 July, 2023;
originally announced July 2023.
-
Asymmetric magnetism at the interfaces of MgO/FeCoB bilayers by exchanging the order of MgO and FeCoB
Authors:
Md. Shahid Jamal,
Sadhana Singh,
Arun Singh Dev,
Neha Gupta,
Pooja Gupta,
Mukul Gupta,
Olaf Leupold,
Ilya Sergueev,
V. R. Reddy,
Dileep Kumar
Abstract:
Interfaces in FeCoB/MgO/FeCoB magnetic tunnel junction play a vital role in controlling their magnetic and transport properties for various applications in spintronics and magnetic recording media. In this work, interface structures of a few nm thick FeCoB layers in FeCoB/MgO and MgO/FeCoB bilayers are comprehensively studied using x-ray standing waves (XSW) generated by depositing bilayers betwee…
▽ More
Interfaces in FeCoB/MgO/FeCoB magnetic tunnel junction play a vital role in controlling their magnetic and transport properties for various applications in spintronics and magnetic recording media. In this work, interface structures of a few nm thick FeCoB layers in FeCoB/MgO and MgO/FeCoB bilayers are comprehensively studied using x-ray standing waves (XSW) generated by depositing bilayers between Pt waveguide structures. High interface selectivity of nuclear resonance scattering (NRS) under the XSW technique allowed measuring structure and magnetism at the two interfaces, namely FeCoB-on-MgO and MgO-on-FeCoB, yielding an interesting result that electron density and hyperfine fields are not symmetric at both interfaces. The formation of a high-density FeCoB layer at the MgO/FeCoB (FeCoB-on-MgO) interface with an increased hyperfine field (~34.65 T) is attributed to the increasing volume of FeCo at the interface due to boron diffusion from 57FeCoB to the MgO layer. Furthermore, it caused unusual angular-dependent magnetic properties in MgO/FeCoB bilayer, whereas FeCoB/MgO is magnetically isotropic. In contrast to the literature, where the unusual angular dependent in FeCoB based system is explained in terms of in-plane magnetic anisotropy, present findings attributed the same to the interlayer exchange coupling between bulk and interface layer within the FeCoB layer.
△ Less
Submitted 1 July, 2023;
originally announced July 2023.
-
SARC: Soft Actor Retrospective Critic
Authors:
Sukriti Verma,
Ayush Chopra,
Jayakumar Subramanian,
Mausoom Sarkar,
Nikaash Puri,
Piyush Gupta,
Balaji Krishnamurthy
Abstract:
The two-time scale nature of SAC, which is an actor-critic algorithm, is characterised by the fact that the critic estimate has not converged for the actor at any given time, but since the critic learns faster than the actor, it ensures eventual consistency between the two. Various strategies have been introduced in literature to learn better gradient estimates to help achieve better convergence.…
▽ More
The two-time scale nature of SAC, which is an actor-critic algorithm, is characterised by the fact that the critic estimate has not converged for the actor at any given time, but since the critic learns faster than the actor, it ensures eventual consistency between the two. Various strategies have been introduced in literature to learn better gradient estimates to help achieve better convergence. Since gradient estimates depend upon the critic, we posit that improving the critic can provide a better gradient estimate for the actor at each time. Utilizing this, we propose Soft Actor Retrospective Critic (SARC), where we augment the SAC critic loss with another loss term - retrospective loss - leading to faster critic convergence and consequently, better policy gradient estimates for the actor. An existing implementation of SAC can be easily adapted to SARC with minimal modifications. Through extensive experimentation and analysis, we show that SARC provides consistent improvement over SAC on benchmark environments. We plan to open-source the code and all experiment data at: https://github.com/sukritiverma1996/SARC.
△ Less
Submitted 28 June, 2023;
originally announced June 2023.
-
Black Hole Menagerie, Charged/Dyonic BHs and Radiation from Interacting Dyonic BH Pairs
Authors:
Patrick Das Gupta,
Mohd. Sirtaz
Abstract:
We describe charged BHs, Penrose process for energy extraction from Kerr BHs and Wald's proposal concerning a Kerr BH slowly becoming a Kerr-Newman BH in the presence of a uniform magnetic field. In the context of BHs bearing magnetic charge, we discuss both magnetic monopoles as well as dyons, and their emergence from various models like string theory, GUTs and electroweak theories, etc. In the l…
▽ More
We describe charged BHs, Penrose process for energy extraction from Kerr BHs and Wald's proposal concerning a Kerr BH slowly becoming a Kerr-Newman BH in the presence of a uniform magnetic field. In the context of BHs bearing magnetic charge, we discuss both magnetic monopoles as well as dyons, and their emergence from various models like string theory, GUTs and electroweak theories, etc. In the later portions, we concentrate on our recent research work pertaining to the non-relativistic dynamics of dyon-dyon interaction that includes mutual gravitational attraction. From the derived classical equations of motion, we obtain not only the well known Schwinger-Zwanziger quantization condition for dyons using Saha's argument based on quantized angular momentum of electromagnetic field but also a scalar virial theorem for an astrophysical system consisting of point particles, some of which carry both electric and magnetic charges. In the final sections, we obtain expressions for the generated electromagnetic wave as well as gravitational wave amplitudes, and the corresponding luminosities due to dyon-dyon interactions. Lastly, we discuss the results after computing these quantities using a range of values for the mass, electric and magnetic charges, etc. of the dyonic BHs.
△ Less
Submitted 26 June, 2023;
originally announced June 2023.
-
Highly depleted alkali metals in Jupiter's deep atmosphere
Authors:
Ananyo Bhattacharya,
Cheng Li,
Sushil K. Atreya,
Paul G. Steffes,
Steven M. Levin,
Scott J. Bolton,
Tristan Guillot,
Pranika Gupta,
Andrew P. Ingersoll,
Jonathan I. Lunine,
Glenn S. Orton,
Fabiano A. Oyafuso,
J. Hunter Waite,
Amadeo Belloti,
Michael H. Wong
Abstract:
Water and ammonia vapors are known to be the major sources of spectral absorption at pressure levels observed by the microwave radiometer (MWR) on Juno. However, the brightness temperatures and limb darkening observed by the MWR at its longest wavelength channel of 50 cm (600 MHz) in the first 9 perijove passes indicate the existence of an additional source of opacity in the deep atmosphere of Jup…
▽ More
Water and ammonia vapors are known to be the major sources of spectral absorption at pressure levels observed by the microwave radiometer (MWR) on Juno. However, the brightness temperatures and limb darkening observed by the MWR at its longest wavelength channel of 50 cm (600 MHz) in the first 9 perijove passes indicate the existence of an additional source of opacity in the deep atmosphere of Jupiter (pressures beyond 100 bar). The absorption properties of ammonia and water vapor, and their relative abundances in Jupiter's atmosphere do not provide sufficient opacity in deep atmosphere to explain the 600 MHz channel observation. Here we show that free electrons due to the ionization of alkali metals, i.e. sodium, and potassium, with sub-solar metallicity [M/H] (log based 10 relative concentration to solar) in the range of [M/H] = -2 to [M/H] = -5 can provide the missing source of opacity in the deep atmosphere. If the alkali metals are not the source of additional opacity in the MWR data, then their metallicity at 1000 bars can only be even lower. The upper bound of -2 on the metallicity of the alkali metals contrasts with the other heavy elements -- C, N, S, Ar, Kr, and Xe -- which are all enriched relative to their solar abundances having a metallicity of approximately +0.5.
△ Less
Submitted 21 June, 2023;
originally announced June 2023.
-
Resource Aware Clustering for Tackling the Heterogeneity of Participants in Federated Learning
Authors:
Rahul Mishra,
Hari Prabhat Gupta,
Garvit Banga
Abstract:
Federated Learning is a training framework that enables multiple participants to collaboratively train a shared model while preserving data privacy and minimizing communication overhead. The heterogeneity of devices and networking resources of the participants delay the training and aggregation in federated learning. This paper proposes a federated learning approach to manoeuvre the heterogeneity…
▽ More
Federated Learning is a training framework that enables multiple participants to collaboratively train a shared model while preserving data privacy and minimizing communication overhead. The heterogeneity of devices and networking resources of the participants delay the training and aggregation in federated learning. This paper proposes a federated learning approach to manoeuvre the heterogeneity among the participants using resource aware clustering. The approach begins with the server gathering information about the devices and networking resources of participants, after which resource aware clustering is performed to determine the optimal number of clusters using Dunn Indices. The mechanism of participant assignment is then introduced, and the expression of communication rounds required for model convergence in each cluster is mathematically derived. Furthermore, a master-slave technique is introduced to improve the performance of the lightweight models in the clusters using knowledge distillation. Finally, experimental evaluations are conducted to verify the feasibility and effectiveness of the approach and to compare it with state-of-the-art techniques.
△ Less
Submitted 7 June, 2023;
originally announced June 2023.
-
Elucidating the role of hydrogen bonding in the optical spectroscopy of the solvated green fluorescent protein chromophore: using machine learning to establish the importance of high-level electronic structure
Authors:
Michael S. Chen,
Yuezhi Mao,
Andrew Snider,
Prachi Gupta,
Andrés Montoya-Castillo,
Tim J. Zuehlsdorff,
Christine M. Isborn,
Thomas E. Markland
Abstract:
Hydrogen bonding interactions with chromophores in chemical and biological environments play a key role in determining their electronic absorption and relaxation processes, which are manifested in their linear and multidimensional optical spectra. For chromophores in the condensed phase, the large number of atoms needed to simulate the environment has traditionally prohibited the use of high-level…
▽ More
Hydrogen bonding interactions with chromophores in chemical and biological environments play a key role in determining their electronic absorption and relaxation processes, which are manifested in their linear and multidimensional optical spectra. For chromophores in the condensed phase, the large number of atoms needed to simulate the environment has traditionally prohibited the use of high-level excited-state electronic structure methods. By leveraging transfer learning, we show how to construct machine-learned models to accurately predict high-level excitation energies of a chromophore in solution from only 400 high-level calculations. We show that when the electronic excitations of the green fluorescent protein chromophore in water are treated using EOM-CCSD embedded in a DFT description of the solvent, the optical spectrum is correctly captured and that this improvement arises from correctly treating the coupling of the electronic transition to electric fields, which leads to a larger response upon hydrogen bonding between the chromophore and water.
△ Less
Submitted 26 May, 2023;
originally announced May 2023.
-
Mitigating Exploitation Bias in Learning to Rank with an Uncertainty-aware Empirical Bayes Approach
Authors:
Tao Yang,
Cuize Han,
Chen Luo,
Parth Gupta,
Jeff M. Phillips,
Qingyao Ai
Abstract:
Ranking is at the core of many artificial intelligence (AI) applications, including search engines, recommender systems, etc. Modern ranking systems are often constructed with learning-to-rank (LTR) models built from user behavior signals. While previous studies have demonstrated the effectiveness of using user behavior signals (e.g., clicks) as both features and labels of LTR algorithms, we argue…
▽ More
Ranking is at the core of many artificial intelligence (AI) applications, including search engines, recommender systems, etc. Modern ranking systems are often constructed with learning-to-rank (LTR) models built from user behavior signals. While previous studies have demonstrated the effectiveness of using user behavior signals (e.g., clicks) as both features and labels of LTR algorithms, we argue that existing LTR algorithms that indiscriminately treat behavior and non-behavior signals in input features could lead to suboptimal performance in practice. Particularly because user behavior signals often have strong correlations with the ranking objective and can only be collected on items that have already been shown to users, directly using behavior signals in LTR could create an exploitation bias that hurts the system performance in the long run.
To address the exploitation bias, we propose EBRank, an empirical Bayes-based uncertainty-aware ranking algorithm. Specifically, to overcome exploitation bias brought by behavior features in ranking models, EBRank uses a sole non-behavior feature based prior model to get a prior estimation of relevance. In the dynamic training and serving of ranking systems, EBRank uses the observed user behaviors to update posterior relevance estimation instead of concatenating behaviors as features in ranking models. Besides, EBRank additionally applies an uncertainty-aware exploration strategy to explore actively, collect user behaviors for empirical Bayesian modeling and improve ranking performance. Experiments on three public datasets show that EBRank is effective, practical and significantly outperforms state-of-the-art ranking algorithms.
△ Less
Submitted 25 May, 2023;
originally announced May 2023.
-
USB: A Unified Summarization Benchmark Across Tasks and Domains
Authors:
Kundan Krishna,
Prakhar Gupta,
Sanjana Ramprasad,
Byron C. Wallace,
Jeffrey P. Bigham,
Zachary C. Lipton
Abstract:
While the NLP community has produced numerous summarization benchmarks, none provide the rich annotations required to simultaneously address many important problems related to control and reliability. We introduce a Wikipedia-derived benchmark, complemented by a rich set of crowd-sourced annotations, that supports $8$ interrelated tasks: (i) extractive summarization; (ii) abstractive summarization…
▽ More
While the NLP community has produced numerous summarization benchmarks, none provide the rich annotations required to simultaneously address many important problems related to control and reliability. We introduce a Wikipedia-derived benchmark, complemented by a rich set of crowd-sourced annotations, that supports $8$ interrelated tasks: (i) extractive summarization; (ii) abstractive summarization; (iii) topic-based summarization; (iv) compressing selected sentences into a one-line summary; (v) surfacing evidence for a summary sentence; (vi) predicting the factual accuracy of a summary sentence; (vii) identifying unsubstantiated spans in a summary sentence; (viii) correcting factual errors in summaries. We compare various methods on this benchmark and discover that on multiple tasks, moderately-sized fine-tuned models consistently outperform much larger few-shot prompted language models. For factuality-related tasks, we also evaluate existing heuristics to create training data and find that training on them results in worse performance than training on $20\times$ less human-labeled data. Our articles draw from $6$ domains, facilitating cross-domain analysis. On some tasks, the amount of training data matters more than the domain where it comes from, while for other tasks training specifically on data from the target domain, even if limited, is more beneficial.
△ Less
Submitted 4 December, 2023; v1 submitted 23 May, 2023;
originally announced May 2023.
-
GrACE: Generation using Associated Code Edits
Authors:
Priyanshu Gupta,
Avishree Khare,
Yasharth Bajpai,
Saikat Chakraborty,
Sumit Gulwani,
Aditya Kanade,
Arjun Radhakrishna,
Gustavo Soares,
Ashish Tiwari
Abstract:
Developers expend a significant amount of time in editing code for a variety of reasons such as bug fixing or adding new features. Designing effective methods to predict code edits has been an active yet challenging area of research due to the diversity of code edits and the difficulty of capturing the developer intent. In this work, we address these challenges by endowing pre-trained large langua…
▽ More
Developers expend a significant amount of time in editing code for a variety of reasons such as bug fixing or adding new features. Designing effective methods to predict code edits has been an active yet challenging area of research due to the diversity of code edits and the difficulty of capturing the developer intent. In this work, we address these challenges by endowing pre-trained large language models (LLMs) of code with the knowledge of prior, relevant edits. The generative capability of the LLMs helps address the diversity in code changes and conditioning code generation on prior edits helps capture the latent developer intent. We evaluate two well-known LLMs, Codex and CodeT5, in zero-shot and fine-tuning settings respectively. In our experiments with two datasets, the knowledge of prior edits boosts the performance of the LLMs significantly and enables them to generate 29% and 54% more correctly edited code in top-1 suggestions relative to the current state-of-the-art symbolic and neural approaches, respectively.
△ Less
Submitted 20 September, 2023; v1 submitted 23 May, 2023;
originally announced May 2023.
-
Quantum transduction of superconducting qubit in electro-optomechanical and electro-optomagnonical system
Authors:
Roson Nongthombam,
Pooja Kumari Gupta,
Amarendra K. Sarma
Abstract:
We study the quantum transduction of a superconducting qubit to an optical photon in electro-optomechanical and electro-optomagnonical systems. The electro-optomechanical system comprises a flux-tunable transmon qubit coupled to a suspended mechanical beam, which then couples to an optical cavity. Similarly, in an electro-optomagnonical system, a flux-tunable transmon qubit is coupled to an optica…
▽ More
We study the quantum transduction of a superconducting qubit to an optical photon in electro-optomechanical and electro-optomagnonical systems. The electro-optomechanical system comprises a flux-tunable transmon qubit coupled to a suspended mechanical beam, which then couples to an optical cavity. Similarly, in an electro-optomagnonical system, a flux-tunable transmon qubit is coupled to an optical whispering gallery mode via a magnon excitation in a YIG ferromagnetic sphere. In both systems, the transduction process is done in sequence. In the first sequence, the qubit states are encoded in coherent excitations of phonon/magnon modes through the phonon/magnon-qubit interaction, which is non-demolition in the qubit part. We then measure the phonon/magnon excitations, which reveal the qubit states, by counting the average number of photons in the optical cavities. The measurement of the phonon/magnon excitations can be performed at a regular intervals of time.
△ Less
Submitted 19 May, 2023;
originally announced May 2023.
-
Quantum interference induced magnon blockade and antibunching in a hybrid quantum system
Authors:
Pooja Kumari Gupta,
Sampreet Kalita,
Amarendra K. Sarma
Abstract:
In this work, we study the phenomena of quantum interference assisted magnon blockade and magnon antibunching in a weakly interacting hybrid ferromagnet-superconductor system. The magnon excitations in two yttrium iron garnet spheres are indirectly coupled to a superconducting qubit through microwave cavity modes of two mutually perpendicular cavities. We find that when one of the magnon mode is d…
▽ More
In this work, we study the phenomena of quantum interference assisted magnon blockade and magnon antibunching in a weakly interacting hybrid ferromagnet-superconductor system. The magnon excitations in two yttrium iron garnet spheres are indirectly coupled to a superconducting qubit through microwave cavity modes of two mutually perpendicular cavities. We find that when one of the magnon mode is driven by a weak optical field, the destructive interference between more than two distinct transition pathways restricts simultaneous excitation of two magnons. We analyze the magnon correlations in the driven magnon mode for the case of zero detunings as well as finite detunings of the magnon modes and the qubit. We show that the magnon antibunching can be tuned by changing the magnon-qubit coupling strength ratio and the driving detuning. Our work proposes a possible scheme which have significant role in the construction of single magnon generating devices.
△ Less
Submitted 15 May, 2023;
originally announced May 2023.
-
Emolysis: A Multimodal Open-Source Group Emotion Analysis and Visualization Toolkit
Authors:
Shreya Ghosh,
Zhixi Cai,
Parul Gupta,
Garima Sharma,
Abhinav Dhall,
Munawar Hayat,
Tom Gedeon
Abstract:
Automatic group emotion recognition plays an important role in understanding complex human-human interaction. This paper introduces, Emolysis, a Python-based, standalone open-source group emotion analysis toolkit for use in different social situations upon getting consent from the users. Given any input video, Emolysis processes synchronized multimodal input and maps it to group level emotion, val…
▽ More
Automatic group emotion recognition plays an important role in understanding complex human-human interaction. This paper introduces, Emolysis, a Python-based, standalone open-source group emotion analysis toolkit for use in different social situations upon getting consent from the users. Given any input video, Emolysis processes synchronized multimodal input and maps it to group level emotion, valence and arousal. Additionally, the toolkit supports major mobile and desktop platforms (Android, iOS, Windows). The Emolysis platform also comes with an intuitive graphical user interface that allows users to select different modalities and target persons for more fine-grained emotion analysis. Emolysis is freely available for academic research and encourages application developers to extend it to application specific environments on top of the existing system. We believe that the extension mechanism is quite straightforward. Our code models and interface are available at https://github.com/ControlNet/emolysis.
△ Less
Submitted 6 August, 2024; v1 submitted 9 May, 2023;
originally announced May 2023.
-
Interfacial origin of unconventional spin-orbit torque in Py/$γ-$IrMn$_{3}$
Authors:
Akash Kumar,
Pankhuri Gupta,
Niru Chowdhury,
Kacho Imtiyaz Ali Khan,
Utkarsh Shashank,
Surbhi Gupta,
Yasuhiro Fukuma,
Sujeet Chaudhary,
Pranaba Kishor Muduli
Abstract:
Angle-resolved spin-torque ferromagnetic resonance measurements are carried out in heterostructures consisting of Py (Ni$_{81}$Fe$_{19}$) and a noncollinear antiferromagnetic quantum material $γ-$IrMn$_{3}$. The structural characterization reveals that $γ-$IrMn$_{3}$ is polycrystalline in nature. A large exchange bias of 158~Oe is found in Py/$γ-$IrMn$_{3}$ at room temperature, while $γ-$IrMn…
▽ More
Angle-resolved spin-torque ferromagnetic resonance measurements are carried out in heterostructures consisting of Py (Ni$_{81}$Fe$_{19}$) and a noncollinear antiferromagnetic quantum material $γ-$IrMn$_{3}$. The structural characterization reveals that $γ-$IrMn$_{3}$ is polycrystalline in nature. A large exchange bias of 158~Oe is found in Py/$γ-$IrMn$_{3}$ at room temperature, while $γ-$IrMn$_{3}$/Py and Py/Cu/$γ-$IrMn$_{3}$ exhibited no exchange bias. Regardless of the exchange bias and stacking sequence, we observe a substantial unconventional out-of-plane anti-damping torque when $γ-$IrMn$_{3}$ is in direct contact with Py. The magnitude of the out-of-plane spin-orbit torque efficiency is found to be twice as large as the in-plane spin-orbit torque efficiency. The unconventional spin-orbit torque vanishes when a Cu spacer is introduced between Py and $γ-$IrMn$_{3}$, indicating that the unconventional spin-orbit torque in this system originates at the interface. These findings are important for realizing efficient antiferromagnet-based spintronic devices via interfacial engineering.
△ Less
Submitted 8 May, 2023;
originally announced May 2023.
-
Placing of the recently observed bottom strange state $B_{sJ}(6063)$ and $B_{sJ}(6114)$ in bottom spectra
Authors:
Ritu Garg,
Pallavi Gupta,
A. Upadhyay
Abstract:
We have employed HQET to give the spin-parity quantum numbers for recently observed bottom strange states $B_{sJ}(6063)$ and $B_{sJ}(6114)$ by LHCb collaborations. By exploring flavour independent parameters $ Δ_{F}^{(c)} =Δ_{F}^{(b)}$ and $ λ_{F}^{(c)} = λ_{F}^{(b)}$, we calculated masses of experimentally missing bottom strange meson states $2S, 1P, 1D$. We have also analyzed these bottom strang…
▽ More
We have employed HQET to give the spin-parity quantum numbers for recently observed bottom strange states $B_{sJ}(6063)$ and $B_{sJ}(6114)$ by LHCb collaborations. By exploring flavour independent parameters $ Δ_{F}^{(c)} =Δ_{F}^{(b)}$ and $ λ_{F}^{(c)} = λ_{F}^{(b)}$, we calculated masses of experimentally missing bottom strange meson states $2S, 1P, 1D$. We have also analyzed these bottom strange masses by taking ${1/m_Q}$ corrections which lead modifications of parameter terms as $ Δ_{F}^{(b)} =Δ_{F}^{(c)} + δΔ_F$ and $ λ_{F}^{(b)} = λ_{F}^{(c)}δλ_F$. Further, we have analyzed their two-body decays, couplings, and branching ratios via the emission of light pseudoscalar mesons. Based on predicted masses and decay widths, we tentatively identified the states $B_{sJ}(6063)$ as $2^3S_1$ and $B_{sJ}(6114)$ as $1^3D_1$. Our predictions provide crucial information for future experimental studies.
△ Less
Submitted 4 May, 2023;
originally announced May 2023.
-
Robust Macroscopic Schrödinger's Cat on a Nucleus
Authors:
Pragati Gupta,
Arjen Vaartjes,
Xi Yu,
Andrea Morello,
Barry C. Sanders
Abstract:
We propose a scheme to generate spin cat states, i.e., superpositions of maximally separated quasiclassical states on a single high-dimensional nuclear spin in a solid-state device. We exploit a strong quadrupolar nonlinearity to drive the nucleus significantly faster than usual gate sequences, achieving collapses and revivals two orders of magnitude faster than the dephasing timescale. Furthermor…
▽ More
We propose a scheme to generate spin cat states, i.e., superpositions of maximally separated quasiclassical states on a single high-dimensional nuclear spin in a solid-state device. We exploit a strong quadrupolar nonlinearity to drive the nucleus significantly faster than usual gate sequences, achieving collapses and revivals two orders of magnitude faster than the dephasing timescale. Furthermore, these states are engineered without entanglement with an ancilla, hence, are robust against error propagation. With our multitone control, we can realize arbitrary high-spin rotations within an experimentally feasible regime, as well as transform a spin coherent state to a spin cat state using only phase modulation, opening the possibility of storing and manipulating high-fidelity cat states.
△ Less
Submitted 29 January, 2024; v1 submitted 26 April, 2023;
originally announced April 2023.
-
Baroclinic interaction of forced shock waves with random thermal gradients
Authors:
Joaquim P. Jossy,
Prateek Gupta
Abstract:
Density gradients aligned at an angle to pressure gradients result in baroclinic torque in fluid flows, generating vorticity. In this work, we study the vorticity generated by the baroclinic torque exerted by the interaction of pressure jumps across random two-dimensional shock waves with density gradients. A field of random two-dimensional shock waves has acoustic spectral energy scaling as ε^{2/…
▽ More
Density gradients aligned at an angle to pressure gradients result in baroclinic torque in fluid flows, generating vorticity. In this work, we study the vorticity generated by the baroclinic torque exerted by the interaction of pressure jumps across random two-dimensional shock waves with density gradients. A field of random two-dimensional shock waves has acoustic spectral energy scaling as ε^{2/3}{\ell}^{-1/3}k^{-2} where k is the wavenumber, ε is the energy dissipation, and \ell is the integral length scale of the field. Since the acoustic energy is broadband, pressure and velocity gradients exist in a wide range of length scales. We study the interaction of these broadband gradients with isobaric thermal gradients localized at a length scale in the spectral space. We show that the method of generating shock waves or injection of wave energy in the system governs the baroclinic interactions. For stochastically forced shock waves, baroclinic termsare negligible. Broadband vorticity with energy at least two orders of magnitude smaller is generated due to continuous variation in curvature of shock waves caused by stochastic forcing. On the other hand, shock waves maintained by energy rescaling result in the generation of coherent vorticity. We also discuss the relative magnitude of the baroclinic torque generated due to total density gradients compared to the one generated due to non-isentropic density gradients within the shock waves interacting with the pressure gradients.
△ Less
Submitted 21 April, 2023;
originally announced April 2023.
-
Search for gravitational-lensing signatures in the full third observing run of the LIGO-Virgo network
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
R. Abbott,
H. Abe,
F. Acernese,
K. Ackley,
S. Adhicary,
N. Adhikari,
R. X. Adhikari,
V. K. Adkins,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
R. A. Alfaidi,
C. Alléné,
A. Allocca,
P. A. Altin
, et al. (1670 additional authors not shown)
Abstract:
Gravitational lensing by massive objects along the line of sight to the source causes distortions of gravitational wave-signals; such distortions may reveal information about fundamental physics, cosmology and astrophysics. In this work, we have extended the search for lensing signatures to all binary black hole events from the third observing run of the LIGO--Virgo network. We search for repeated…
▽ More
Gravitational lensing by massive objects along the line of sight to the source causes distortions of gravitational wave-signals; such distortions may reveal information about fundamental physics, cosmology and astrophysics. In this work, we have extended the search for lensing signatures to all binary black hole events from the third observing run of the LIGO--Virgo network. We search for repeated signals from strong lensing by 1) performing targeted searches for subthreshold signals, 2) calculating the degree of overlap amongst the intrinsic parameters and sky location of pairs of signals, 3) comparing the similarities of the spectrograms amongst pairs of signals, and 4) performing dual-signal Bayesian analysis that takes into account selection effects and astrophysical knowledge. We also search for distortions to the gravitational waveform caused by 1) frequency-independent phase shifts in strongly lensed images, and 2) frequency-dependent modulation of the amplitude and phase due to point masses. None of these searches yields significant evidence for lensing. Finally, we use the non-detection of gravitational-wave lensing to constrain the lensing rate based on the latest merger-rate estimates and the fraction of dark matter composed of compact objects.
△ Less
Submitted 17 April, 2023;
originally announced April 2023.
-
Training Neural Networks for Execution on Approximate Hardware
Authors:
Tianmu Li,
Shurui Li,
Puneet Gupta
Abstract:
Approximate computing methods have shown great potential for deep learning. Due to the reduced hardware costs, these methods are especially suitable for inference tasks on battery-operated devices that are constrained by their power budget. However, approximate computing hasn't reached its full potential due to the lack of work on training methods. In this work, we discuss training methods for app…
▽ More
Approximate computing methods have shown great potential for deep learning. Due to the reduced hardware costs, these methods are especially suitable for inference tasks on battery-operated devices that are constrained by their power budget. However, approximate computing hasn't reached its full potential due to the lack of work on training methods. In this work, we discuss training methods for approximate hardware. We demonstrate how training needs to be specialized for approximate hardware, and propose methods to speed up the training process by up to 18X.
△ Less
Submitted 8 April, 2023;
originally announced April 2023.
-
Self-Refine: Iterative Refinement with Self-Feedback
Authors:
Aman Madaan,
Niket Tandon,
Prakhar Gupta,
Skyler Hallinan,
Luyu Gao,
Sarah Wiegreffe,
Uri Alon,
Nouha Dziri,
Shrimai Prabhumoye,
Yiming Yang,
Shashank Gupta,
Bodhisattwa Prasad Majumder,
Katherine Hermann,
Sean Welleck,
Amir Yazdanbakhsh,
Peter Clark
Abstract:
Like humans, large language models (LLMs) do not always generate the best output on their first try. Motivated by how humans refine their written text, we introduce Self-Refine, an approach for improving initial outputs from LLMs through iterative feedback and refinement. The main idea is to generate an initial output using an LLMs; then, the same LLMs provides feedback for its output and uses it…
▽ More
Like humans, large language models (LLMs) do not always generate the best output on their first try. Motivated by how humans refine their written text, we introduce Self-Refine, an approach for improving initial outputs from LLMs through iterative feedback and refinement. The main idea is to generate an initial output using an LLMs; then, the same LLMs provides feedback for its output and uses it to refine itself, iteratively. Self-Refine does not require any supervised training data, additional training, or reinforcement learning, and instead uses a single LLM as the generator, refiner, and feedback provider. We evaluate Self-Refine across 7 diverse tasks, ranging from dialog response generation to mathematical reasoning, using state-of-the-art (GPT-3.5, ChatGPT, and GPT-4) LLMs. Across all evaluated tasks, outputs generated with Self-Refine are preferred by humans and automatic metrics over those generated with the same LLM using conventional one-step generation, improving by ~20% absolute on average in task performance. Our work demonstrates that even state-of-the-art LLMs like GPT-4 can be further improved at test time using our simple, standalone approach.
△ Less
Submitted 25 May, 2023; v1 submitted 30 March, 2023;
originally announced March 2023.
-
Dynamics of Binary System around Supermassive Black Hole
Authors:
Kei-ichi Maeda,
Priti Gupta,
Hirotada Okawa
Abstract:
We discuss motion of a binary system around a supermassive black hole. Using Fermi-Walker transport, we construct a local inertial reference frame and set up a Newtonian binary system. Assuming a circular geodesic observer around a Schwarzschild black hole, we write down the equations of motion of a binary. Introducing a small acceleration of the observer, we remove the interaction terms between t…
▽ More
We discuss motion of a binary system around a supermassive black hole. Using Fermi-Walker transport, we construct a local inertial reference frame and set up a Newtonian binary system. Assuming a circular geodesic observer around a Schwarzschild black hole, we write down the equations of motion of a binary. Introducing a small acceleration of the observer, we remove the interaction terms between the center of mass (CM) of a binary and its relative coordinates. The CM follows the observer's orbit, but its motion deviates from an exact circular geodesic. We first solve the relative motion of a binary system, and then find the motion of the CM by the perturbation equations with the small acceleration.
We show that there appears the Kozai-Lidov (KL) oscillations when a binary is compact and the initial inclination is larger than a critical angle. In a hard binary system, KL oscillations are regular, whereas in a soft binary system, oscillations are irregular both in period and in amplitude, although stable. We find an orbital flip when the initial inclination is large. As for the motion of the CM, the radial deviations from a circular orbit become stable oscillations with very small amplitude.
△ Less
Submitted 4 April, 2023; v1 submitted 29 March, 2023;
originally announced March 2023.
-
GNN-Assisted Phase Space Integration with Application to Atomistics
Authors:
Shashank Saxena,
Jan-Hendrik Bastek,
Miguel Spinola,
Prateek Gupta,
Dennis M. Kochmann
Abstract:
Overcoming the time scale limitations of atomistics can be achieved by switching from the state-space representation of Molecular Dynamics (MD) to a statistical-mechanics-based representation in phase space, where approximations such as maximum-entropy or Gaussian phase packets (GPP) evolve the atomistic ensemble in a time-coarsened fashion. In practice, this requires the computation of expensive…
▽ More
Overcoming the time scale limitations of atomistics can be achieved by switching from the state-space representation of Molecular Dynamics (MD) to a statistical-mechanics-based representation in phase space, where approximations such as maximum-entropy or Gaussian phase packets (GPP) evolve the atomistic ensemble in a time-coarsened fashion. In practice, this requires the computation of expensive high-dimensional integrals over all of phase space of an atomistic ensemble. This, in turn, is commonly accomplished efficiently by low-order numerical quadrature. We show that numerical quadrature in this context, unfortunately, comes with a set of inherent problems, which corrupt the accuracy of simulations -- especially when dealing with crystal lattices with imperfections. As a remedy, we demonstrate that Graph Neural Networks, trained on Monte-Carlo data, can serve as a replacement for commonly used numerical quadrature rules, overcoming their deficiencies and significantly improving the accuracy. This is showcased by three benchmarks: the thermal expansion of copper, the martensitic phase transition of iron, and the energy of grain boundaries. We illustrate the benefits of the proposed technique over classically used third- and fifth-order Gaussian quadrature, we highlight the impact on time-coarsened atomistic predictions, and we discuss the computational efficiency. The latter is of general importance when performing frequent evaluation of phase space or other high-dimensional integrals, which is why the proposed framework promises applications beyond the scope of atomistics.
△ Less
Submitted 20 March, 2023;
originally announced March 2023.