-
Obtaining Better Static Word Embeddings Using Contextual Embedding Models
Authors:
Prakhar Gupta,
Martin Jaggi
Abstract:
The advent of contextual word embeddings -- representations of words which incorporate semantic and syntactic information from their context -- has led to tremendous improvements on a wide variety of NLP tasks. However, recent contextual models have prohibitively high computational cost in many use-cases and are often hard to interpret. In this work, we demonstrate that our proposed distillation m…
▽ More
The advent of contextual word embeddings -- representations of words which incorporate semantic and syntactic information from their context -- has led to tremendous improvements on a wide variety of NLP tasks. However, recent contextual models have prohibitively high computational cost in many use-cases and are often hard to interpret. In this work, we demonstrate that our proposed distillation method, which is a simple extension of CBOW-based training, allows to significantly improve computational efficiency of NLP applications, while outperforming the quality of existing static embeddings trained from scratch as well as those distilled from previously proposed methods. As a side-effect, our approach also allows a fair comparison of both contextual and static embeddings via standard lexical evaluation tasks.
△ Less
Submitted 8 June, 2021;
originally announced June 2021.
-
Rawlsian Fair Adaptation of Deep Learning Classifiers
Authors:
Kulin Shah,
Pooja Gupta,
Amit Deshpande,
Chiranjib Bhattacharyya
Abstract:
Group-fairness in classification aims for equality of a predictive utility across different sensitive sub-populations, e.g., race or gender. Equality or near-equality constraints in group-fairness often worsen not only the aggregate utility but also the utility for the least advantaged sub-population. In this paper, we apply the principles of Pareto-efficiency and least-difference to the utility b…
▽ More
Group-fairness in classification aims for equality of a predictive utility across different sensitive sub-populations, e.g., race or gender. Equality or near-equality constraints in group-fairness often worsen not only the aggregate utility but also the utility for the least advantaged sub-population. In this paper, we apply the principles of Pareto-efficiency and least-difference to the utility being accuracy, as an illustrative example, and arrive at the Rawls classifier that minimizes the error rate on the worst-off sensitive sub-population. Our mathematical characterization shows that the Rawls classifier uniformly applies a threshold to an ideal score of features, in the spirit of fair equality of opportunity. In practice, such a score or a feature representation is often computed by a black-box model that has been useful but unfair. Our second contribution is practical Rawlsian fair adaptation of any given black-box deep learning model, without changing the score or feature representation it computes. Given any score function or feature representation and only its second-order statistics on the sensitive sub-populations, we seek a threshold classifier on the given score or a linear threshold classifier on the given feature representation that achieves the Rawls error rate restricted to this hypothesis class. Our technical contribution is to formulate the above problems using ambiguous chance constraints, and to provide efficient algorithms for Rawlsian fair adaptation, along with provable upper bounds on the Rawls error rate. Our empirical results show significant improvement over state-of-the-art group-fair algorithms, even without retraining for fairness.
△ Less
Submitted 31 May, 2021;
originally announced May 2021.
-
Foveal-pit inspired filtering of DVS spike response
Authors:
Shriya T. P. Gupta,
Pablo Linares-Serrano,
Basabdatta Sen Bhattacharya,
Teresa Serrano-Gotarredona
Abstract:
In this paper, we present results of processing Dynamic Vision Sensor (DVS) recordings of visual patterns with a retinal model based on foveal-pit inspired Difference of Gaussian (DoG) filters. A DVS sensor was stimulated with varying number of vertical white and black bars of different spatial frequencies moving horizontally at a constant velocity. The output spikes generated by the DVS sensor we…
▽ More
In this paper, we present results of processing Dynamic Vision Sensor (DVS) recordings of visual patterns with a retinal model based on foveal-pit inspired Difference of Gaussian (DoG) filters. A DVS sensor was stimulated with varying number of vertical white and black bars of different spatial frequencies moving horizontally at a constant velocity. The output spikes generated by the DVS sensor were applied as input to a set of DoG filters inspired by the receptive field structure of the primate visual pathway. In particular, these filters mimic the receptive fields of the midget and parasol ganglion cells (spiking neurons of the retina) that sub-serve the photo-receptors of the foveal-pit. The features extracted with the foveal-pit model are used for further classification using a spiking convolutional neural network trained with a backpropagation variant adapted for spiking neural networks.
△ Less
Submitted 29 May, 2021;
originally announced May 2021.
-
Implementing a foveal-pit inspired filter in a Spiking Convolutional Neural Network: a preliminary study
Authors:
Shriya T. P. Gupta,
Basabdatta Sen Bhattacharya
Abstract:
We have presented a Spiking Convolutional Neural Network (SCNN) that incorporates retinal foveal-pit inspired Difference of Gaussian filters and rank-order encoding. The model is trained using a variant of the backpropagation algorithm adapted to work with spiking neurons, as implemented in the Nengo library. We have evaluated the performance of our model on two publicly available datasets - one f…
▽ More
We have presented a Spiking Convolutional Neural Network (SCNN) that incorporates retinal foveal-pit inspired Difference of Gaussian filters and rank-order encoding. The model is trained using a variant of the backpropagation algorithm adapted to work with spiking neurons, as implemented in the Nengo library. We have evaluated the performance of our model on two publicly available datasets - one for digit recognition task, and the other for vehicle recognition task. The network has achieved up to 90% accuracy, where loss is calculated using the cross-entropy function. This is an improvement over around 57% accuracy obtained with the alternate approach of performing the classification without any kind of neural filtering. Overall, our proof-of-concept study indicates that introducing biologically plausible filtering in existing SCNN architecture will work well with noisy input images such as those in our vehicle recognition task. Based on our results, we plan to enhance our SCNN by integrating lateral inhibition-based redundancy reduction prior to rank-ordering, which will further improve the classification accuracy by the network.
△ Less
Submitted 29 May, 2021;
originally announced May 2021.
-
Lightweight Cross-Lingual Sentence Representation Learning
Authors:
Zhuoyuan Mao,
Prakhar Gupta,
Pei Wang,
Chenhui Chu,
Martin Jaggi,
Sadao Kurohashi
Abstract:
Large-scale models for learning fixed-dimensional cross-lingual sentence representations like LASER (Artetxe and Schwenk, 2019b) lead to significant improvement in performance on downstream tasks. However, further increases and modifications based on such large-scale models are usually impractical due to memory limitations. In this work, we introduce a lightweight dual-transformer architecture wit…
▽ More
Large-scale models for learning fixed-dimensional cross-lingual sentence representations like LASER (Artetxe and Schwenk, 2019b) lead to significant improvement in performance on downstream tasks. However, further increases and modifications based on such large-scale models are usually impractical due to memory limitations. In this work, we introduce a lightweight dual-transformer architecture with just 2 layers for generating memory-efficient cross-lingual sentence representations. We explore different training tasks and observe that current cross-lingual training tasks leave a lot to be desired for this shallow architecture. To ameliorate this, we propose a novel cross-lingual language model, which combines the existing single-word masked language model with the newly proposed cross-lingual token-level reconstruction task. We further augment the training task by the introduction of two computationally-lite sentence-level contrastive learning tasks to enhance the alignment of cross-lingual sentence representation space, which compensates for the learning bottleneck of the lightweight transformer for generative tasks. Our comparisons with competing models on cross-lingual sentence retrieval and multilingual document classification confirm the effectiveness of the newly proposed training tasks for a shallow model.
△ Less
Submitted 27 May, 2022; v1 submitted 28 May, 2021;
originally announced May 2021.
-
Constraints on dark photon dark matter using data from LIGO's and Virgo's third observing run
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
R. Abbott,
T. D. Abbott,
F. Acernese,
K. Ackley,
C. Adams,
N. Adhikari,
R. X. Adhikari,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
K. Agatsuma,
N. Aggarwal,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
S. Albanesi,
A. Allocca,
P. A. Altin,
A. Amato
, et al. (1605 additional authors not shown)
Abstract:
We present a search for dark photon dark matter that could couple to gravitational-wave interferometers using data from Advanced LIGO and Virgo's third observing run. To perform this analysis, we use two methods, one based on cross-correlation of the strain channels in the two nearly aligned LIGO detectors, and one that looks for excess power in the strain channels of the LIGO and Virgo detectors.…
▽ More
We present a search for dark photon dark matter that could couple to gravitational-wave interferometers using data from Advanced LIGO and Virgo's third observing run. To perform this analysis, we use two methods, one based on cross-correlation of the strain channels in the two nearly aligned LIGO detectors, and one that looks for excess power in the strain channels of the LIGO and Virgo detectors. The excess power method optimizes the Fourier Transform coherence time as a function of frequency, to account for the expected signal width due to Doppler modulations. We do not find any evidence of dark photon dark matter with a mass between $m_{\rm A} \sim 10^{-14}-10^{-11}$ eV/$c^2$, which corresponds to frequencies between 10-2000 Hz, and therefore provide upper limits on the square of the minimum coupling of dark photons to baryons, i.e. $U(1)_{\rm B}$ dark matter. For the cross-correlation method, the best median constraint on the squared coupling is $\sim2.65\times10^{-46}$ at $m_{\rm A}\sim4.31\times10^{-13}$ eV/$c^2$; for the other analysis, the best constraint is $\sim 2.4\times 10^{-47}$ at $m_{\rm A}\sim 5.7\times 10^{-13}$ eV/$c^2$. These limits improve upon those obtained in direct dark matter detection experiments by a factor of $\sim100$ for $m_{\rm A}\sim [2-4]\times 10^{-13}$ eV/$c^2$, and are, in absolute terms, the most stringent constraint so far in a large mass range $m_A\sim$ $2\times 10^{-13}-8\times 10^{-12}$ eV/$c^2$.
△ Less
Submitted 6 May, 2024; v1 submitted 27 May, 2021;
originally announced May 2021.
-
Searches for continuous gravitational waves from young supernova remnants in the early third observing run of Advanced LIGO and Virgo
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
R. Abbott,
T. D. Abbott,
S. Abraham,
F. Acernese,
K. Ackley,
A. Adams,
C. Adams,
R. X. Adhikari,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
K. Agatsuma,
N. Aggarwal,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
K. M. Aleman,
G. Allen,
A. Allocca
, et al. (1567 additional authors not shown)
Abstract:
We present results of three wide-band directed searches for continuous gravitational waves from 15 young supernova remnants in the first half of the third Advanced LIGO and Virgo observing run. We use three search pipelines with distinct signal models and methods of identifying noise artifacts. Without ephemerides of these sources, the searches are conducted over a frequency band spanning from 10~…
▽ More
We present results of three wide-band directed searches for continuous gravitational waves from 15 young supernova remnants in the first half of the third Advanced LIGO and Virgo observing run. We use three search pipelines with distinct signal models and methods of identifying noise artifacts. Without ephemerides of these sources, the searches are conducted over a frequency band spanning from 10~Hz to 2~kHz. We find no evidence of continuous gravitational radiation from these sources. We set upper limits on the intrinsic signal strain at 95\% confidence level in sample sub-bands, estimate the sensitivity in the full band, and derive the corresponding constraints on the fiducial neutron star ellipticity and $r$-mode amplitude. The best 95\% confidence constraints placed on the signal strain are $7.7\times 10^{-26}$ and $7.8\times 10^{-26}$ near 200~Hz for the supernova remnants G39.2--0.3 and G65.7+1.2, respectively. The most stringent constraints on the ellipticity and $r$-mode amplitude reach $\lesssim 10^{-7}$ and $ \lesssim 10^{-5}$, respectively, at frequencies above $\sim 400$~Hz for the closest supernova remnant G266.2--1.2/Vela Jr.
△ Less
Submitted 14 July, 2021; v1 submitted 24 May, 2021;
originally announced May 2021.
-
On Cosymplectic Conformal Connections
Authors:
Punam Gupta
Abstract:
The aim of this paper is to introduce a cosymplectic analouge of conformal connection in a cosymplectic manifold and proved that if cosymplectic manifold M admits a cosymplectic conformal connection which is of zero curvature, then the Bochner curvature tensor of M vanishes.
The aim of this paper is to introduce a cosymplectic analouge of conformal connection in a cosymplectic manifold and proved that if cosymplectic manifold M admits a cosymplectic conformal connection which is of zero curvature, then the Bochner curvature tensor of M vanishes.
△ Less
Submitted 19 May, 2021;
originally announced May 2021.
-
Comprehensive quasi-Einstein spacetime with application to general relativity
Authors:
Punam Gupta,
Sanjay Kumar Singh
Abstract:
The aim of this paper is to extend the notion of all known quasi-Einstein manifolds like generalized quasi-Einstein, mixed generalized quasi-Einstein manifold, pseudo generalized quasi-Einstein manifold and many more and name it comprehensive quasi Einstein manifold C(QE)$_{n}$. We investigate some geometric and physical properties of the comprehensive quasi Einstein manifolds C(QE)$_{n}$ under ce…
▽ More
The aim of this paper is to extend the notion of all known quasi-Einstein manifolds like generalized quasi-Einstein, mixed generalized quasi-Einstein manifold, pseudo generalized quasi-Einstein manifold and many more and name it comprehensive quasi Einstein manifold C(QE)$_{n}$. We investigate some geometric and physical properties of the comprehensive quasi Einstein manifolds C(QE)$_{n}$ under certain conditions. We study the conformal and conharmonic mappings between C(QE)$_{n}$ manifolds. Then we examine the C(QE)$_{n}$ with harmonic Weyl tensor. We investigate geometric and physical properties of the comprehensive quasi Einstein manifolds C(QE)$_{n}$ under certain conditions. We define the manifold of comprehensive quasi-constant curvature and proved that conformally flat C(QE)$_{n}$ is manifold of comprehensive quasi-constant curvature and vice versa. We study the general two viscous fluid spacetime C(QE)$_{4}$ and find out some important consequences about C(QE)$_{4}$. We study C(QE)$_{n}$ with vanishing space matter tensor. Finally, we prove the existence of such manifolds by constructing non-trivial example.
△ Less
Submitted 3 September, 2021; v1 submitted 8 May, 2021;
originally announced May 2021.
-
Information-theoretic Evolution of Model Agnostic Global Explanations
Authors:
Sukriti Verma,
Nikaash Puri,
Piyush Gupta,
Balaji Krishnamurthy
Abstract:
Explaining the behavior of black box machine learning models through human interpretable rules is an important research area. Recent work has focused on explaining model behavior locally i.e. for specific predictions as well as globally across the fields of vision, natural language, reinforcement learning and data science. We present a novel model-agnostic approach that derives rules to globally e…
▽ More
Explaining the behavior of black box machine learning models through human interpretable rules is an important research area. Recent work has focused on explaining model behavior locally i.e. for specific predictions as well as globally across the fields of vision, natural language, reinforcement learning and data science. We present a novel model-agnostic approach that derives rules to globally explain the behavior of classification models trained on numerical and/or categorical data. Our approach builds on top of existing local model explanation methods to extract conditions important for explaining model behavior for specific instances followed by an evolutionary algorithm that optimizes an information theory based fitness function to construct rules that explain global model behavior. We show how our approach outperforms existing approaches on a variety of datasets. Further, we introduce a parameter to evaluate the quality of interpretation under the scenario of distributional shift. This parameter evaluates how well the interpretation can predict model behavior for previously unseen data distributions. We show how existing approaches for interpreting models globally lack distributional robustness. Finally, we show how the quality of the interpretation can be improved under the scenario of distributional shift by adding out of distribution samples to the dataset used to learn the interpretation and thereby, increase robustness. All of the datasets used in our paper are open and publicly available. Our approach has been deployed in a leading digital marketing suite of products.
△ Less
Submitted 14 May, 2021;
originally announced May 2021.
-
Search for lensing signatures in the gravitational-wave observations from the first half of LIGO-Virgo's third observing run
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
R. Abbott,
T. D. Abbott,
S. Abraham,
F. Acernese,
K. Ackley,
A. Adams,
C. Adams,
R. X. Adhikari,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
K. Agatsuma,
N. Aggarwal,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
K. M. Aleman,
G. Allen,
A. Allocca,
P. A. Altin,
A. Amato
, et al. (1356 additional authors not shown)
Abstract:
We search for signatures of gravitational lensing in the gravitational-wave signals from compact binary coalescences detected by Advanced LIGO and Advanced Virgo during O3a, the first half of their third observing run. We study: 1) the expected rate of lensing at current detector sensitivity and the implications of a non-observation of strong lensing or a stochastic gravitational-wave background o…
▽ More
We search for signatures of gravitational lensing in the gravitational-wave signals from compact binary coalescences detected by Advanced LIGO and Advanced Virgo during O3a, the first half of their third observing run. We study: 1) the expected rate of lensing at current detector sensitivity and the implications of a non-observation of strong lensing or a stochastic gravitational-wave background on the merger-rate density at high redshift; 2) how the interpretation of individual high-mass events would change if they were found to be lensed; 3) the possibility of multiple images due to strong lensing by galaxies or galaxy clusters; and 4) possible wave-optics effects due to point-mass microlenses. Several pairs of signals in the multiple-image analysis show similar parameters and, in this sense, are nominally consistent with the strong lensing hypothesis. However, taking into account population priors, selection effects, and the prior odds against lensing, these events do not provide sufficient evidence for lensing. Overall, we find no compelling evidence for lensing in the observed gravitational-wave signals from any of these analyses.
△ Less
Submitted 30 November, 2021; v1 submitted 13 May, 2021;
originally announced May 2021.
-
Testing GR with the Gravitational Wave Inspiral Signal GW170817
Authors:
Andrey A. Shoom,
Pawan K. Gupta,
Badri Krishnan,
Alex B. Nielsen,
Collin D. Capano
Abstract:
Observations of gravitational waves from compact binary mergers have enabled unique tests of general relativity in the dynamical and non-linear regimes. One of the most important such tests are constraints on the post-Newtonian (PN) corrections to the phase of the gravitational wave signal. The values of these PN coefficients can be calculated within standard general relativity, and these values a…
▽ More
Observations of gravitational waves from compact binary mergers have enabled unique tests of general relativity in the dynamical and non-linear regimes. One of the most important such tests are constraints on the post-Newtonian (PN) corrections to the phase of the gravitational wave signal. The values of these PN coefficients can be calculated within standard general relativity, and these values are different in many alternate theories of gravity. It is clearly of great interest to constrain these deviations based on gravitational wave observations. In the majority of such tests which have been carried out, and which yield by far the most stringent constraints, it is common to vary these PN coefficients individually. While this might in principle be useful for detecting certain deviations from standard general relativity, it is a serious limitation. For example, we would expect alternate theories of gravity to generically have additional parameters. The corrections to the PN coefficients would be expected to depend on these additional non-GR parameters whence, we expect that the various PN coefficients to be highly correlated. We present an alternate analysis here using data from the binary neutron star coalescence GW170817. Our analysis uses an appropriate linear combination of non-GR parameters that represent absolute deviations from the corresponding post-Newtonian inspiral coefficients in the TaylorF2 approximant phase. These combinations represent uncorrelated non-GR parameters which correspond to principal directions of their covariance matrix in the parameter subspace. Our results illustrate good agreement with GR. In particular, the integral non-GR phase is $Ψ_{\mbox{non-GR}} = (0.447\pm253)\times10^{-1}$ and the deviation from GR percentile is $p^{\mbox{Dev-GR}}_{n}=25.85\%$.
△ Less
Submitted 5 May, 2021;
originally announced May 2021.
-
Constraints from LIGO O3 data on gravitational-wave emission due to r-modes in the glitching pulsar PSR J0537-6910
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
R. Abbott,
T. D. Abbott,
S. Abraham,
F. Acernese,
K. Ackley,
A. Adams,
C. Adams,
R. X. Adhikari,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
K. Agatsuma,
N. Aggarwal,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
K. M. Aleman,
G. Allen,
A. Allocca
, et al. (1574 additional authors not shown)
Abstract:
We present a search for continuous gravitational-wave emission due to r-modes in the pulsar PSR J0537-6910 using data from the LIGO-Virgo Collaboration observing run O3. PSR J0537-6910 is a young energetic X-ray pulsar and is the most frequent glitcher known. The inter-glitch braking index of the pulsar suggests that gravitational-wave emission due to r-mode oscillations may play an important role…
▽ More
We present a search for continuous gravitational-wave emission due to r-modes in the pulsar PSR J0537-6910 using data from the LIGO-Virgo Collaboration observing run O3. PSR J0537-6910 is a young energetic X-ray pulsar and is the most frequent glitcher known. The inter-glitch braking index of the pulsar suggests that gravitational-wave emission due to r-mode oscillations may play an important role in the spin evolution of this pulsar. Theoretical models confirm this possibility and predict emission at a level that can be probed by ground-based detectors. In order to explore this scenario, we search for r-mode emission in the epochs between glitches by using a contemporaneous timing ephemeris obtained from NICER data. We do not detect any signals in the theoretically expected band of 86-97 Hz, and report upper limits on the amplitude of the gravitational waves. Our results improve on previous amplitude upper limits from r-modes in J0537-6910 by a factor of up to 3 and place stringent constraints on theoretical models for r-mode driven spin-down in PSR J0537-6910, especially for higher frequencies at which our results reach below the spin-down limit defined by energy conservation.
△ Less
Submitted 7 January, 2022; v1 submitted 29 April, 2021;
originally announced April 2021.
-
Multi-source Neural Topic Modeling in Multi-view Embedding Spaces
Authors:
Pankaj Gupta,
Yatin Chaudhary,
Hinrich Schütze
Abstract:
Though word embeddings and topics are complementary representations, several past works have only used pretrained word embeddings in (neural) topic modeling to address data sparsity in short-text or small collection of documents. This work presents a novel neural topic modeling framework using multi-view embedding spaces: (1) pretrained topic-embeddings, and (2) pretrained word-embeddings (context…
▽ More
Though word embeddings and topics are complementary representations, several past works have only used pretrained word embeddings in (neural) topic modeling to address data sparsity in short-text or small collection of documents. This work presents a novel neural topic modeling framework using multi-view embedding spaces: (1) pretrained topic-embeddings, and (2) pretrained word-embeddings (context insensitive from Glove and context-sensitive from BERT models) jointly from one or many sources to improve topic quality and better deal with polysemy. In doing so, we first build respective pools of pretrained topic (i.e., TopicPool) and word embeddings (i.e., WordPool). We then identify one or more relevant source domain(s) and transfer knowledge to guide meaningful learning in the sparse target domain. Within neural topic modeling, we quantify the quality of topics and document representations via generalization (perplexity), interpretability (topic coherence) and information retrieval (IR) using short-text, long-text, small and large document collections from news and medical domains. Introducing the multi-source multi-view embedding spaces, we have shown state-of-the-art neural topic modeling using 6 source (high-resource) and 5 target (low-resource) corpora.
△ Less
Submitted 17 April, 2021;
originally announced April 2021.
-
Importance of tidal resonances in extreme-mass-ratio inspirals
Authors:
Priti Gupta,
Béatrice Bonga,
Alvin J. K. Chua,
Takahiro Tanaka
Abstract:
Extreme mass ratio inspirals (EMRIs) will be important sources for future space-based gravitational-wave detectors. In recent work, tidal resonances in binary orbital evolution induced by the tidal field of nearby stars or black holes have been identified as being potentially significant in the context of extreme mass-ratio inspirals. These resonances occur when the three orbital frequencies descr…
▽ More
Extreme mass ratio inspirals (EMRIs) will be important sources for future space-based gravitational-wave detectors. In recent work, tidal resonances in binary orbital evolution induced by the tidal field of nearby stars or black holes have been identified as being potentially significant in the context of extreme mass-ratio inspirals. These resonances occur when the three orbital frequencies describing the orbit are commensurate. During the resonance, the orbital parameters of the small body experience a jump leading to a shift in the phase of the gravitational waveform. In this paper, we treat the tidal perturber as stationary and restricted to the equatorial plane, and present a first study of how common and important such resonances are over the entire orbital parameter space. We find that a large proportion of inspirals encounter a low-order resonance in the observationally important regime. While the instantaneous effect of a tidal resonance is small, its effect on the accumulated phase of the gravitational waveform of an EMRI system can be significant due to its many cycles in band; we estimate that the effect is detectable for a significant fraction of sources. We also provide fitting formulae for the induced change in the constants of motion of the orbit due to the tidal resonance for several low-order resonances.
△ Less
Submitted 27 July, 2021; v1 submitted 7 April, 2021;
originally announced April 2021.
-
Prism: Private Verifiable Set Computation over Multi-Owner Outsourced Databases
Authors:
Yin Li,
Dhrubajyoti Ghosh,
Peeyush Gupta,
Sharad Mehrotra,
Nisha Panwar,
Shantanu Sharma
Abstract:
This paper proposes Prism, a secret sharing based approach to compute private set operations (i.e., intersection and union), as well as aggregates over outsourced databases belonging to multiple owners. Prism enables data owners to pre-load the data onto non-colluding servers and exploits the additive and multiplicative properties of secret-shares to compute the above-listed operations in (at most…
▽ More
This paper proposes Prism, a secret sharing based approach to compute private set operations (i.e., intersection and union), as well as aggregates over outsourced databases belonging to multiple owners. Prism enables data owners to pre-load the data onto non-colluding servers and exploits the additive and multiplicative properties of secret-shares to compute the above-listed operations in (at most) two rounds of communication between the servers (storing the secret-shares) and the querier, resulting in a very efficient implementation. Also, Prism does not require communication among the servers and supports result verification techniques for each operation to detect malicious adversaries. Experimental results show that Prism scales both in terms of the number of data owners and database sizes, to which prior approaches do not scale.
△ Less
Submitted 7 April, 2021;
originally announced April 2021.
-
An active inference model of collective intelligence
Authors:
Rafael Kaufmann,
Pranav Gupta,
Jacob Taylor
Abstract:
To date, formal models of collective intelligence have lacked a plausible mathematical description of the relationship between local-scale interactions between highly autonomous sub-system components (individuals) and global-scale behavior of the composite system (the collective). In this paper we use the Active Inference Formulation (AIF), a framework for explaining the behavior of any non-equili…
▽ More
To date, formal models of collective intelligence have lacked a plausible mathematical description of the relationship between local-scale interactions between highly autonomous sub-system components (individuals) and global-scale behavior of the composite system (the collective). In this paper we use the Active Inference Formulation (AIF), a framework for explaining the behavior of any non-equilibrium steady state system at any scale, to posit a minimal agent-based model that simulates the relationship between local individual-level interaction and collective intelligence (operationalized as system-level performance). We explore the effects of providing baseline AIF agents (Model 1) with specific cognitive capabilities: Theory of Mind (Model 2); Goal Alignment (Model 3), and Theory of Mind with Goal Alignment (Model 4). These stepwise transitions in sophistication of cognitive ability are motivated by the types of advancements plausibly required for an AIF agent to persist and flourish in an environment populated by other AIF agents, and have also recently been shown to map naturally to canonical steps in human cognitive ability. Illustrative results show that stepwise cognitive transitions increase system performance by providing complementary mechanisms for alignment between agents' local and global optima. Alignment emerges endogenously from the dynamics of interacting AIF agents themselves, rather than being imposed exogenously by incentives to agents' behaviors (contra existing computational models of collective intelligence) or top-down priors for collective behavior (contra existing multiscale simulations of AIF). These results shed light on the types of generic information-theoretic patterns conducive to collective intelligence in human and other complex adaptive systems.
△ Less
Submitted 2 April, 2021;
originally announced April 2021.
-
Geometric properties of a domain with cusps
Authors:
Shweta Gandhi,
Prachi Gupta,
Sumit Nagpal,
V. Ravichandran
Abstract:
For $n\geq 4$ (even), the function $\varphi_{n\mathcal{L}}(z)=1+nz/(n+1)+z^n/(n+1)$ maps the unit disk $\mathbb{D}$ onto a domain bounded by an epicycloid with $n-1$ cusps. In this paper, the class $\mathcal{S}^*_{n\mathcal{L}} = \mathcal{S}^*(\varphi_{n\mathcal{L}})$ is studied and various inclusion relations are established with other subclasses of starlike functions. The bounds on initial coeff…
▽ More
For $n\geq 4$ (even), the function $\varphi_{n\mathcal{L}}(z)=1+nz/(n+1)+z^n/(n+1)$ maps the unit disk $\mathbb{D}$ onto a domain bounded by an epicycloid with $n-1$ cusps. In this paper, the class $\mathcal{S}^*_{n\mathcal{L}} = \mathcal{S}^*(\varphi_{n\mathcal{L}})$ is studied and various inclusion relations are established with other subclasses of starlike functions. The bounds on initial coefficients is also computed. Various radii problems are also solved for the class $\mathcal{S}^*_{n\mathcal{L}}$.
△ Less
Submitted 2 April, 2021;
originally announced April 2021.
-
Detecting over/under-translation errors for determining adequacy in human translations
Authors:
Prabhakar Gupta,
Ridha Juneja,
Anil Nelakanti,
Tamojit Chatterjee
Abstract:
We present a novel approach to detecting over and under translations (OT/UT) as part of adequacy error checks in translation evaluation. We do not restrict ourselves to machine translation (MT) outputs and specifically target applications with human generated translation pipeline. The goal of our system is to identify OT/UT errors from human translated video subtitles with high error recall. We ac…
▽ More
We present a novel approach to detecting over and under translations (OT/UT) as part of adequacy error checks in translation evaluation. We do not restrict ourselves to machine translation (MT) outputs and specifically target applications with human generated translation pipeline. The goal of our system is to identify OT/UT errors from human translated video subtitles with high error recall. We achieve this without reference translations by learning a model on synthesized training data. We compare various classification networks that we trained on embeddings from pre-trained language model with our best hybrid network of GRU + CNN achieving 89.3% accuracy on high-quality human-annotated evaluation data in 8 languages.
△ Less
Submitted 1 April, 2021;
originally announced April 2021.
-
Marx-Strohhäcker theorem for Multivalent Functions
Authors:
Prachi Gupta,
Sumit Nagpal,
V. Ravichandran
Abstract:
Some differential implications of classical Marx-Strohhäcker theorem are extended for multivalent functions. These results are also generalized for functions with fixed second coefficient by using the theory of first order differential subordination which in turn, corrects the results of Selvaraj and Stelin [On multivalent functions associated with fixed second coefficient and the principle of sub…
▽ More
Some differential implications of classical Marx-Strohhäcker theorem are extended for multivalent functions. These results are also generalized for functions with fixed second coefficient by using the theory of first order differential subordination which in turn, corrects the results of Selvaraj and Stelin [On multivalent functions associated with fixed second coefficient and the principle of subordination, Int. J. Math. Anal. {\bf 9} (2015), no.~18, 883--895].
△ Less
Submitted 22 March, 2021;
originally announced March 2021.
-
Monocular Multi-Layer Layout Estimation for Warehouse Racks
Authors:
Meher Shashwat Nigam,
Avinash Prabhu,
Anurag Sahu,
Puru Gupta,
Tanvi Karandikar,
N. Sai Shankar,
Ravi Kiran Sarvadevabhatla,
K. Madhava Krishna
Abstract:
Given a monocular colour image of a warehouse rack, we aim to predict the bird's-eye view layout for each shelf in the rack, which we term as multi-layer layout prediction. To this end, we present RackLay, a deep neural network for real-time shelf layout estimation from a single image. Unlike previous layout estimation methods, which provide a single layout for the dominant ground plane alone, Rac…
▽ More
Given a monocular colour image of a warehouse rack, we aim to predict the bird's-eye view layout for each shelf in the rack, which we term as multi-layer layout prediction. To this end, we present RackLay, a deep neural network for real-time shelf layout estimation from a single image. Unlike previous layout estimation methods, which provide a single layout for the dominant ground plane alone, RackLay estimates the top-view and front-view layout for each shelf in the considered rack populated with objects. RackLay's architecture and its variants are versatile and estimate accurate layouts for diverse scenes characterized by varying number of visible shelves in an image, large range in shelf occupancy factor and varied background clutter. Given the extreme paucity of datasets in this space and the difficulty involved in acquiring real data from warehouses, we additionally release a flexible synthetic dataset generation pipeline WareSynth which allows users to control the generation process and tailor the dataset according to contingent application. The ablations across architectural variants and comparison with strong prior baselines vindicate the efficacy of RackLay as an apt architecture for the novel problem of multi-layered layout estimation. We also show that fusing the top-view and front-view enables 3D reasoning applications such as metric free space estimation for the considered rack.
△ Less
Submitted 28 October, 2021; v1 submitted 16 March, 2021;
originally announced March 2021.
-
Search for anisotropic gravitational-wave backgrounds using data from Advanced LIGO and Advanced Virgo's first three observing runs
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
R. Abbott,
T. D. Abbott,
S. Abraham,
F. Acernese,
K. Ackley,
A. Adams,
C. Adams,
R. X. Adhikari,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
K. Agatsuma,
N. Aggarwal,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
K. M. Aleman,
G. Allen,
A. Allocca
, et al. (1568 additional authors not shown)
Abstract:
We report results from searches for anisotropic stochastic gravitational-wave backgrounds using data from the first three observing runs of the Advanced LIGO and Advanced Virgo detectors. For the first time, we include Virgo data in our analysis and run our search with a new efficient pipeline called {\tt PyStoch} on data folded over one sidereal day. We use gravitational-wave radiometry (broadban…
▽ More
We report results from searches for anisotropic stochastic gravitational-wave backgrounds using data from the first three observing runs of the Advanced LIGO and Advanced Virgo detectors. For the first time, we include Virgo data in our analysis and run our search with a new efficient pipeline called {\tt PyStoch} on data folded over one sidereal day. We use gravitational-wave radiometry (broadband and narrow band) to produce sky maps of stochastic gravitational-wave backgrounds and to search for gravitational waves from point sources. A spherical harmonic decomposition method is employed to look for gravitational-wave emission from spatially-extended sources. Neither technique found evidence of gravitational-wave signals. Hence we derive 95\% confidence-level upper limit sky maps on the gravitational-wave energy flux from broadband point sources, ranging from $F_{α, Θ} < {\rm (0.013 - 7.6)} \times 10^{-8} {\rm erg \, cm^{-2} \, s^{-1} \, Hz^{-1}},$ and on the (normalized) gravitational-wave energy density spectrum from extended sources, ranging from $Ω_{α, Θ} < {\rm (0.57 - 9.3)} \times 10^{-9} \, {\rm sr^{-1}}$, depending on direction ($Θ$) and spectral index ($α$). These limits improve upon previous limits by factors of $2.9 - 3.5$. We also set 95\% confidence level upper limits on the frequency-dependent strain amplitudes of quasimonochromatic gravitational waves coming from three interesting targets, Scorpius X-1, SN 1987A and the Galactic Center, with best upper limits range from $h_0 < {\rm (1.7-2.1)} \times 10^{-25},$ a factor of $\geq 2.0$ improvement compared to previous stochastic radiometer searches.
△ Less
Submitted 2 February, 2022; v1 submitted 15 March, 2021;
originally announced March 2021.
-
Supremacy of optimal beam energy for synthesis of superheavy elements
Authors:
H. C. Manjunatha,
N. Sowmya,
P. S. Damodara Gupta,
L. Seenappa,
T. Nandi
Abstract:
Besides right choice of entrance channel, selection of optimal beam energies for synthesis of superheavy elements plays a crucial role. A thorough investigation with the advanced statistical and dinuclear system models on all the experiments performed for the synthesis of the successful superheavy elements Z=104-118 and failed superheavy elements Z=119-120 leads us to infer that improper choice of…
▽ More
Besides right choice of entrance channel, selection of optimal beam energies for synthesis of superheavy elements plays a crucial role. A thorough investigation with the advanced statistical and dinuclear system models on all the experiments performed for the synthesis of the successful superheavy elements Z=104-118 and failed superheavy elements Z=119-120 leads us to infer that improper choice of the beam energies may be responsible for too low production cross sections to measure and thus the cause for the debacle. We have predicted the optimal beam energies to obtain the maximum production cross sections for all the reactions used for the superheavy elements Z=104-120. Hope exploitation of these predictions may be on the cards soon to extend the periodic table for the eighth period
△ Less
Submitted 12 March, 2021;
originally announced March 2021.
-
On the timescale of quasi fission and Coulomb fission
Authors:
T. Nandi,
H. C. Manjunatha,
P. S. Damodara Gupta,
N. Sowmya,
N. Manjunatha,
K. N. Sridhara,
L. Seenappa
Abstract:
Coulomb fission mechanism may take place if the maximum Coulomb-excitation energy transfer in a reaction exceeds the fission barrier of either the projectile or target. This condition is satisfied by all the reactions used for the earlier blocking measurements except one reaction 208 Pb + Natural Ge crystal, where the measured timescale was below the measuring limit of the blocking measurements <…
▽ More
Coulomb fission mechanism may take place if the maximum Coulomb-excitation energy transfer in a reaction exceeds the fission barrier of either the projectile or target. This condition is satisfied by all the reactions used for the earlier blocking measurements except one reaction 208 Pb + Natural Ge crystal, where the measured timescale was below the measuring limit of the blocking measurements < 1 as. Hence, inclusion of the Coulomb fission in the data analysis of the blocking experiments leads us to interpret that the measured time longer than a few attoseconds (about 2-2.5 as) is nothing but belonging to the Coulomb fission timescale and shorter than 1 as are due to the quasifission. Consequently, this finding resolves the critical discrepancies between the fission timescale measurements using the nuclear and blocking techniques. This, in turn, validates the fact that the quasifission timescale is indeed of the order of zeptoseconds in accordance with the nuclear experiments and theories. It thus provides a radical input in understanding the reaction mechanism for heavy element formation via fusion evaporation processes
△ Less
Submitted 11 March, 2021;
originally announced March 2021.
-
Search for a viable nucleus-nucleus potential for heavy-ion nuclear reactions
Authors:
T. Nandi,
D. K. Swami,
P. S. Damodara Gupta,
Yash Kumar,
S. Chakraborty,
H. C. Manjunatha
Abstract:
We have constructed an empirical formulae for the fusion and interaction barriers using experimental values available till date. The fusion barriers so obtained have been compared with different model predictions based on the proximity, Woods-Saxon and double folding potentials along with several empirical formulas, time dependent Hartree-Fock theories, and the experimental results. The comparison…
▽ More
We have constructed an empirical formulae for the fusion and interaction barriers using experimental values available till date. The fusion barriers so obtained have been compared with different model predictions based on the proximity, Woods-Saxon and double folding potentials along with several empirical formulas, time dependent Hartree-Fock theories, and the experimental results. The comparison allows us to find the best model, which is nothing but the present empirical formula only. Most remarkably, the fusion barrier and radius show excellent consonance with the experimental findings for the reactions meant for synthesis of the superheavy elements also. Furthermore, it is seen that substitution of the predicted fusion barrier and radius in classic Wong formula [C. Wong, Phys. Rev. Lett. {31}, 766 (1973)] for the total fusion cross sections satisfies very well with the experiments. Similarly, current interaction barrier predictions have also been compared well with a few experimental results available and Bass potential model meant for the interaction barrier predictions. Importantly, the present formulae for the fusion as well as interaction barrier will have practical implications in carrying out the physics research near the Coulomb barrier energies. Furthermore, present fusion barrier and radius provide us a good nucleus-nucleus potential useful for numerous theoretical applications.
△ Less
Submitted 11 March, 2021;
originally announced March 2021.
-
Columnar Storage and List-based Processing for Graph Database Management Systems
Authors:
Pranjal Gupta,
Amine Mhedhbi,
Semih Salihoglu
Abstract:
We revisit column-oriented storage and query processing techniques in the context of contemporary graph database management systems (GDBMSs). Similar to column-oriented RDBMSs, GDBMSs support read-heavy analytical workloads that however have fundamentally different data access patterns than traditional analytical workloads. We first derive a set of desiderata for optimizing storage and query proce…
▽ More
We revisit column-oriented storage and query processing techniques in the context of contemporary graph database management systems (GDBMSs). Similar to column-oriented RDBMSs, GDBMSs support read-heavy analytical workloads that however have fundamentally different data access patterns than traditional analytical workloads. We first derive a set of desiderata for optimizing storage and query processors of GDBMS based on their access patterns. We then present the design of columnar storage, compression, and query processing techniques based on these desiderata. In addition to showing direct integration of existing techniques from columnar RDBMSs, we also propose novel ones that are optimized for GDBMSs. These include a novel list-based query processor, which avoids expensive data copies of traditional block-based processors under many-to-many joins, a new data structure we call single-indexed edge property pages and an accompanying edge ID scheme, and a new application of Jacobson's bit vector index for compressing NULL values and empty lists. We integrated our techniques into the GraphflowDB in-memory GDBMS. Through extensive experiments, we demonstrate the scalability and query performance benefits of our techniques.
△ Less
Submitted 27 October, 2021; v1 submitted 3 March, 2021;
originally announced March 2021.
-
SWIS -- Shared Weight bIt Sparsity for Efficient Neural Network Acceleration
Authors:
Shurui Li,
Wojciech Romaszkan,
Alexander Graening,
Puneet Gupta
Abstract:
Quantization is spearheading the increase in performance and efficiency of neural network computing systems making headway into commodity hardware. We present SWIS - Shared Weight bIt Sparsity, a quantization framework for efficient neural network inference acceleration delivering improved performance and storage compression through an offline weight decomposition and scheduling algorithm. SWIS ca…
▽ More
Quantization is spearheading the increase in performance and efficiency of neural network computing systems making headway into commodity hardware. We present SWIS - Shared Weight bIt Sparsity, a quantization framework for efficient neural network inference acceleration delivering improved performance and storage compression through an offline weight decomposition and scheduling algorithm. SWIS can achieve up to 54.3% (19.8%) point accuracy improvement compared to weight truncation when quantizing MobileNet-v2 to 4 (2) bits post-training (with retraining) showing the strength of leveraging shared bit-sparsity in weights. SWIS accelerator gives up to 6x speedup and 1.9x energy improvement overstate of the art bit-serial architectures.
△ Less
Submitted 2 March, 2021; v1 submitted 1 March, 2021;
originally announced March 2021.
-
Concealer: SGX-based Secure, Volume Hiding, and Verifiable Processing of Spatial Time-Series Datasets
Authors:
Peeyush Gupta,
Sharad Mehrotra,
Shantanu Sharma,
Nalini Venkatasubramanian,
Guoxi Wang
Abstract:
This paper proposes a system, entitled Concealer that allows sharing time-varying spatial data (e.g., as produced by sensors) in encrypted form to an untrusted third-party service provider to provide location-based applications (involving aggregation queries over selected regions over time windows) to users. Concealer exploits carefully selected encryption techniques to use indexes supported by da…
▽ More
This paper proposes a system, entitled Concealer that allows sharing time-varying spatial data (e.g., as produced by sensors) in encrypted form to an untrusted third-party service provider to provide location-based applications (involving aggregation queries over selected regions over time windows) to users. Concealer exploits carefully selected encryption techniques to use indexes supported by database systems and combines ways to add fake tuples in order to realize an efficient system that protects against leakage based on output-size. Thus, the design of Concealer overcomes two limitations of existing symmetric searchable encryption (SSE) techniques: (i) it avoids the need of specialized data structures that limit usability/practicality of SSE in large scale deployments, and (ii) it avoids information leakages based on the output-size, which may leak data distributions. Experimental results validate the efficiency of the proposed algorithms over a spatial time-series dataset (collected from a smart space) and TPC-H datasets, each of 136 Million rows, the size of which prior approaches have not scaled to.
△ Less
Submitted 9 February, 2021;
originally announced February 2021.
-
Spin pumping and inverse spin Hall effect in CoFeB/IrMn heterostructures
Authors:
Koustuv Roy,
Abhisek Mishra,
Pushpendra Gupta,
Shaktiranjan Mohanty,
Braj Bhusan Singh,
Subhankar Bedanta
Abstract:
High spin to charge conversion efficiency is the requirement for the spintronics devices which is governed by spin pumping and inverse spin Hall effect (ISHE). In last one decade, ISHE and spin pumping are heavily investigated in ferromagnet/ heavy metal (HM) heterostructures. Recently antiferromagnetic (AFM) materials are found to be good replacement of HMs because AFMs exhibit terahertz spin dyn…
▽ More
High spin to charge conversion efficiency is the requirement for the spintronics devices which is governed by spin pumping and inverse spin Hall effect (ISHE). In last one decade, ISHE and spin pumping are heavily investigated in ferromagnet/ heavy metal (HM) heterostructures. Recently antiferromagnetic (AFM) materials are found to be good replacement of HMs because AFMs exhibit terahertz spin dynamics, high spin-orbit coupling, and absence of stray field. In this context we have performed the ISHE in CoFeB/ IrMn heterostructures. Spin pumping study is carried out for $Co_{40}Fe_{40}B_{20} (12\ nm)/ Cu (3\ nm)/ Ir_{50}Mn_{50} (t\ nm)/ AlO_{x} (3\ nm)$ samples where \textit{t} value varies from 0 to 10 nm. Damping of all the samples are higher than the single layer CoFeB which indicates that spin pumping due to IrMn is the underneath mechanism. Further the spin pumping in the samples are confirmed by angle dependent ISHE measurements. We have also disentangled other spin rectifications effects and found that the spin pumping is dominant in all the samples. From the ISHE analysis the real part of spin mixing conductance (\textit{$g_{r}^{\uparrow \downarrow}$}) is found to be 0.704 $\pm$ 0.003 $\times$ $10^{18}$ $m^{-2}$.
△ Less
Submitted 23 June, 2021; v1 submitted 6 February, 2021;
originally announced February 2021.
-
A few remarks on Pimsner-Popa bases and regular subfactors of depth 2
Authors:
Keshab Chandra Bakshi,
Ved Prakash Gupta
Abstract:
We prove that a finite index regular inclusion of $II_1$-factors with commutative first relative commutant is always a crossed product subfactor with respect to a minimal action of a biconnected weak Kac algebra. Prior to this, we prove that every finite index inclusion of $II_1$-factors which is of depth $2$ and has simple first relative commutant (respectively, is regular and has commutative or…
▽ More
We prove that a finite index regular inclusion of $II_1$-factors with commutative first relative commutant is always a crossed product subfactor with respect to a minimal action of a biconnected weak Kac algebra. Prior to this, we prove that every finite index inclusion of $II_1$-factors which is of depth $2$ and has simple first relative commutant (respectively, is regular and has commutative or simple first relative commutant) admits a two-sided Pimsner-Popa basis (respectively, a unitary orthonormal basis)
△ Less
Submitted 24 December, 2021; v1 submitted 2 February, 2021;
originally announced February 2021.
-
Constraints on cosmic strings using data from the third Advanced LIGO-Virgo observing run
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
R. Abbott,
T. D. Abbott,
S. Abraham,
F. Acernese,
K. Ackley,
A. Adams,
C. Adams,
R. X. Adhikari,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
K. Agatsuma,
N. Aggarwal,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
K. M. Aleman,
G. Allen,
A. Allocca
, et al. (1565 additional authors not shown)
Abstract:
We search for gravitational-wave signals produced by cosmic strings in the Advanced LIGO and Virgo full O3 data set. Search results are presented for gravitational waves produced by cosmic string loop features such as cusps, kinks and, for the first time, kink-kink collisions.cA template-based search for short-duration transient signals does not yield a detection. We also use the stochastic gravit…
▽ More
We search for gravitational-wave signals produced by cosmic strings in the Advanced LIGO and Virgo full O3 data set. Search results are presented for gravitational waves produced by cosmic string loop features such as cusps, kinks and, for the first time, kink-kink collisions.cA template-based search for short-duration transient signals does not yield a detection. We also use the stochastic gravitational-wave background energy density upper limits derived from the O3 data to constrain the cosmic string tension, $Gμ$, as a function of the number of kinks, or the number of cusps, for two cosmic string loop distribution models.cAdditionally, we develop and test a third model which interpolates between these two models. Our results improve upon the previous LIGO-Virgo constraints on $Gμ$ by one to two orders of magnitude depending on the model which is tested. In particular, for one loop distribution model, we set the most competitive constraints to date, $Gμ\lesssim 4\times 10^{-15}$.
△ Less
Submitted 28 January, 2021;
originally announced January 2021.
-
Upper Limits on the Isotropic Gravitational-Wave Background from Advanced LIGO's and Advanced Virgo's Third Observing Run
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
R. Abbott,
T. D. Abbott,
S. Abraham,
F. Acernese,
K. Ackley,
A. Adams,
C. Adams,
R. X. Adhikari,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
K. Agatsuma,
N. Aggarwal,
O. D. Aguiar,
L. Aiello,
A. Ain,
T. Akutsu,
K. M. Aleman,
G. Allen,
A. Allocca,
P. A. Altin
, et al. (1566 additional authors not shown)
Abstract:
We report results of a search for an isotropic gravitational-wave background (GWB) using data from Advanced LIGO's and Advanced Virgo's third observing run (O3) combined with upper limits from the earlier O1 and O2 runs. Unlike in previous observing runs in the advanced detector era, we include Virgo in the search for the GWB. The results are consistent with uncorrelated noise, and therefore we pl…
▽ More
We report results of a search for an isotropic gravitational-wave background (GWB) using data from Advanced LIGO's and Advanced Virgo's third observing run (O3) combined with upper limits from the earlier O1 and O2 runs. Unlike in previous observing runs in the advanced detector era, we include Virgo in the search for the GWB. The results are consistent with uncorrelated noise, and therefore we place upper limits on the strength of the GWB. We find that the dimensionless energy density $Ω_{\rm GW}\leq 5.8\times 10^{-9}$ at the 95% credible level for a flat (frequency-independent) GWB, using a prior which is uniform in the log of the strength of the GWB, with 99% of the sensitivity coming from the band 20-76.6 Hz; $\leq 3.4 \times 10^{-9}$ at 25 Hz for a power-law GWB with a spectral index of 2/3 (consistent with expectations for compact binary coalescences), in the band 20-90.6 Hz; and $\leq 3.9 \times 10^{-10}$ at 25 Hz for a spectral index of 3, in the band 20-291.6 Hz. These upper limits improve over our previous results by a factor of 6.0 for a flat GWB. We also search for a GWB arising from scalar and vector modes, which are predicted by alternative theories of gravity; we place upper limits on the strength of GWBs with these polarizations. We demonstrate that there is no evidence of correlated noise of magnetic origin by performing a Bayesian analysis that allows for the presence of both a GWB and an effective magnetic background arising from geophysical Schumann resonances. We compare our upper limits to a fiducial model for the GWB from the merger of compact binaries. Finally, we combine our results with observations of individual mergers andshow that, at design sensitivity, this joint approach may yield stronger constraints on the merger rate of binary black holes at $z \lesssim 2$ than can be achieved with individually resolved mergers alone. [abridged]
△ Less
Submitted 28 January, 2021;
originally announced January 2021.
-
Syntactically Guided Generative Embeddings for Zero-Shot Skeleton Action Recognition
Authors:
Pranay Gupta,
Divyanshu Sharma,
Ravi Kiran Sarvadevabhatla
Abstract:
We introduce SynSE, a novel syntactically guided generative approach for Zero-Shot Learning (ZSL). Our end-to-end approach learns progressively refined generative embedding spaces constrained within and across the involved modalities (visual, language). The inter-modal constraints are defined between action sequence embedding and embeddings of Parts of Speech (PoS) tagged words in the correspondin…
▽ More
We introduce SynSE, a novel syntactically guided generative approach for Zero-Shot Learning (ZSL). Our end-to-end approach learns progressively refined generative embedding spaces constrained within and across the involved modalities (visual, language). The inter-modal constraints are defined between action sequence embedding and embeddings of Parts of Speech (PoS) tagged words in the corresponding action description. We deploy SynSE for the task of skeleton-based action sequence recognition. Our design choices enable SynSE to generalize compositionally, i.e., recognize sequences whose action descriptions contain words not encountered during training. We also extend our approach to the more challenging Generalized Zero-Shot Learning (GZSL) problem via a confidence-based gating mechanism. We are the first to present zero-shot skeleton action recognition results on the large-scale NTU-60 and NTU-120 skeleton action datasets with multiple splits. Our results demonstrate SynSE's state of the art performance in both ZSL and GZSL settings compared to strong baselines on the NTU-60 and NTU-120 datasets. The code and pretrained models are available at https://github.com/skelemoa/synse-zsl
△ Less
Submitted 28 June, 2021; v1 submitted 27 January, 2021;
originally announced January 2021.
-
Nonequilibrium thermomechanics of Gaussian phase packet crystals: application to the quasistatic quasicontinuum method
Authors:
Prateek Gupta,
Michael Ortiz,
Dennis M. Kochmann
Abstract:
The quasicontinuum method was originally introduced to bridge across length scales -- from atomistics to significantly larger continuum scales -- thus overcoming a key limitation of classical atomic-scale simulation techniques while solely relying on atomic-scale input (in the form of interatomic potentials). An associated challenge lies in bridging across time scales to overcome the time scale li…
▽ More
The quasicontinuum method was originally introduced to bridge across length scales -- from atomistics to significantly larger continuum scales -- thus overcoming a key limitation of classical atomic-scale simulation techniques while solely relying on atomic-scale input (in the form of interatomic potentials). An associated challenge lies in bridging across time scales to overcome the time scale limitations of atomistics. To address the biggest challenge, bridging across both length and time scales, only a few techniques exist, and most of those are limited to conditions of constant temperature. Here, we present a new strategy for the space-time coarsening of an atomistic ensemble, which introduces thermomechanical coupling. We investigate the quasistatics and dynamics of a crystalline solid described as a lattice of lumped correlated Gaussian phase packets occupying atomic lattice sites. By definition, phase packets account for the dynamics of crystalline lattices at finite temperature through the statistical variances of atomic momenta and positions. We show that momentum-space correlation allows for an exchange between potential and kinetic contributions to the crystal's Hamiltonian. Consequently, local adiabatic heating due to atomic site motion is captured. Moreover, within the quasistatic approximation the governing equations reduce to the minimization of thermodynamic potentials such as Helmholtz free energy (depending on the fixed variables), and they yield the local equation of state. We further discuss opportunities for describing atomic-level thermal transport using the correlated Gaussian phase packet formulation and the importance of interatomic correlations. Such a formulation offers a promising avenue for a finite-temperature non-equilibrium quasicontinuum method that may be combined with thermal transport models.
△ Less
Submitted 14 April, 2021; v1 submitted 25 January, 2021;
originally announced January 2021.
-
Challenges in the application of a mortality prediction model for COVID-19 patients on an Indian cohort
Authors:
Yukti Makhija,
Samarth Bhatia,
Shalendra Singh,
Sneha Kumar Jayaswal,
Prabhat Singh Malik,
Pallavi Gupta,
Shreyas N. Samaga,
Shreya Johri,
Sri Krishna Venigalla,
Rabi Narayan Hota,
Surinder Singh Bhatia,
Ishaan Gupta
Abstract:
Many countries are now experiencing the third wave of the COVID-19 pandemic straining the healthcare resources with an acute shortage of hospital beds and ventilators for the critically ill patients. This situation is especially worse in India with the second largest load of COVID-19 cases and a relatively resource-scarce medical infrastructure. Therefore, it becomes essential to triage the patien…
▽ More
Many countries are now experiencing the third wave of the COVID-19 pandemic straining the healthcare resources with an acute shortage of hospital beds and ventilators for the critically ill patients. This situation is especially worse in India with the second largest load of COVID-19 cases and a relatively resource-scarce medical infrastructure. Therefore, it becomes essential to triage the patients based on the severity of their disease and devote resources towards critically ill patients. Yan et al. 1 have published a very pertinent research that uses Machine learning (ML) methods to predict the outcome of COVID-19 patients based on their clinical parameters at the day of admission. They used the XGBoost algorithm, a type of ensemble model, to build the mortality prediction model. The final classifier is built through the sequential addition of multiple weak classifiers. The clinically operable decision rule was obtained from a 'single-tree XGBoost' and used lactic dehydrogenase (LDH), lymphocyte and high-sensitivity C-reactive protein (hs-CRP) values. This decision tree achieved a 100% survival prediction and 81% mortality prediction. However, these models have several technical challenges and do not provide an out of the box solution that can be deployed for other populations as has been reported in the "Matters Arising" section of Yan et al. Here, we show the limitations of this model by deploying it on one of the largest datasets of COVID-19 patients containing detailed clinical parameters collected from India.
△ Less
Submitted 15 January, 2021;
originally announced January 2021.
-
Analysis of E-commerce Ranking Signals via Signal Temporal Logic
Authors:
Tommaso Dreossi,
Giorgio Ballardin,
Parth Gupta,
Jan Bakus,
Yu-Hsiang Lin,
Vamsi Salaka
Abstract:
The timed position of documents retrieved by learning to rank models can be seen as signals. Signals carry useful information such as drop or rise of documents over time or user behaviors. In this work, we propose to use the logic formalism called Signal Temporal Logic (STL) to characterize document behaviors in ranking accordingly to the specified formulas. Our analysis shows that interesting doc…
▽ More
The timed position of documents retrieved by learning to rank models can be seen as signals. Signals carry useful information such as drop or rise of documents over time or user behaviors. In this work, we propose to use the logic formalism called Signal Temporal Logic (STL) to characterize document behaviors in ranking accordingly to the specified formulas. Our analysis shows that interesting document behaviors can be easily formalized and detected thanks to STL formulas. We validate our idea on a dataset of 100K product signals. Through the presented framework, we uncover interesting patterns, such as cold start, warm start, spikes, and inspect how they affect our learning to ranks models.
△ Less
Submitted 13 January, 2021;
originally announced January 2021.
-
FakeBuster: A DeepFakes Detection Tool for Video Conferencing Scenarios
Authors:
Vineet Mehta,
Parul Gupta,
Ramanathan Subramanian,
Abhinav Dhall
Abstract:
This paper proposes a new DeepFake detector FakeBuster for detecting impostors during video conferencing and manipulated faces on social media. FakeBuster is a standalone deep learning based solution, which enables a user to detect if another person's video is manipulated or spoofed during a video conferencing based meeting. This tool is independent of video conferencing solutions and has been tes…
▽ More
This paper proposes a new DeepFake detector FakeBuster for detecting impostors during video conferencing and manipulated faces on social media. FakeBuster is a standalone deep learning based solution, which enables a user to detect if another person's video is manipulated or spoofed during a video conferencing based meeting. This tool is independent of video conferencing solutions and has been tested with Zoom and Skype applications. It uses a 3D convolutional neural network for predicting video segment-wise fakeness scores. The network is trained on a combination of datasets such as Deeperforensics, DFDC, VoxCeleb, and deepfake videos created using locally captured (for video conferencing scenarios) images. This leads to different environments and perturbations in the dataset, which improves the generalization of the deepfake network.
△ Less
Submitted 9 January, 2021;
originally announced January 2021.
-
Four-dimensional quadratic forms over $\mathbb C(\!(t)\!)(X)$
Authors:
Parul Gupta
Abstract:
For quadratic forms in $4$ variables defined over the rational function field in one variable over $\mathbb C(\!(t)\!)$, the validity of the local-global principle for isotropy with respect to different sets of discrete valuations is examined.
For quadratic forms in $4$ variables defined over the rational function field in one variable over $\mathbb C(\!(t)\!)$, the validity of the local-global principle for isotropy with respect to different sets of discrete valuations is examined.
△ Less
Submitted 6 January, 2021;
originally announced January 2021.
-
Inclusion relations and radius problems for a subclass of starlike functions
Authors:
Prachi Gupta,
Sumit Nagpal,
V. Ravichandran
Abstract:
By considering the polynomial function $φ_{car}(z)=1+z+z^2/2,$ we define the class $\Scar$ consisting of normalized analytic functions $f$ such that $zf'/f$ is subordinate to $φ_{car}$ in the unit disk. The inclusion relations and various radii constants associated with the class $\Scar$ and its connection with several well-known subclasses of starlike functions is established. As an application,…
▽ More
By considering the polynomial function $φ_{car}(z)=1+z+z^2/2,$ we define the class $\Scar$ consisting of normalized analytic functions $f$ such that $zf'/f$ is subordinate to $φ_{car}$ in the unit disk. The inclusion relations and various radii constants associated with the class $\Scar$ and its connection with several well-known subclasses of starlike functions is established. As an application, the obtained results are applied to derive the properties of the partial sums and convolution.
△ Less
Submitted 24 December, 2020;
originally announced December 2020.
-
Diving below the spin-down limit: Constraints on gravitational waves from the energetic young pulsar PSR J0537-6910
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
the KAGRA Collaboration,
R. Abbott,
T. D. Abbott,
S. Abraham,
F. Acernese,
K. Ackley,
A. Adams,
C. Adams,
R. X. Adhikari,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
K. Agatsuma,
N. Aggarwal,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
T. Akutsu,
K. M. Aleman,
G. Allen,
A. Allocca
, et al. (1568 additional authors not shown)
Abstract:
We present a search for continuous gravitational-wave signals from the young, energetic X-ray pulsar PSR J0537-6910 using data from the second and third observing runs of LIGO and Virgo. The search is enabled by a contemporaneous timing ephemeris obtained using NICER data. The NICER ephemeris has also been extended through 2020 October and includes three new glitches. PSR J0537-6910 has the larges…
▽ More
We present a search for continuous gravitational-wave signals from the young, energetic X-ray pulsar PSR J0537-6910 using data from the second and third observing runs of LIGO and Virgo. The search is enabled by a contemporaneous timing ephemeris obtained using NICER data. The NICER ephemeris has also been extended through 2020 October and includes three new glitches. PSR J0537-6910 has the largest spin-down luminosity of any pulsar and is highly active with regards to glitches. Analyses of its long-term and inter-glitch braking indices provided intriguing evidence that its spin-down energy budget may include gravitational-wave emission from a time-varying mass quadrupole moment. Its 62 Hz rotation frequency also puts its possible gravitational-wave emission in the most sensitive band of LIGO/Virgo detectors. Motivated by these considerations, we search for gravitational-wave emission at both once and twice the rotation frequency. We find no signal, however, and report our upper limits. Assuming a rigidly rotating triaxial star, our constraints reach below the gravitational-wave spin-down limit for this star for the first time by more than a factor of two and limit gravitational waves from the $l=m=2$ mode to account for less than 14% of the spin-down energy budget. The fiducial equatorial ellipticity is limited to less than about 3e-5, which is the third best constraint for any young pulsar.
△ Less
Submitted 10 June, 2021; v1 submitted 23 December, 2020;
originally announced December 2020.
-
All-sky search in early O3 LIGO data for continuous gravitational-wave signals from unknown neutron stars in binary systems
Authors:
The LIGO Scientific Collaboration,
the Virgo Collaboration,
R. Abbott,
T. D. Abbott,
S. Abraham,
F. Acernese,
K. Ackley,
A. Adams,
C. Adams,
R. X. Adhikari,
V. B. Adya,
C. Affeldt,
D. Agarwal,
M. Agathos,
K. Agatsuma,
N. Aggarwal,
O. D. Aguiar,
L. Aiello,
A. Ain,
P. Ajith,
K. M. Aleman,
G. Allen,
A. Allocca,
P. A. Altin,
A. Amato
, et al. (1347 additional authors not shown)
Abstract:
Rapidly spinning neutron stars are promising sources of persistent, continuous gravitational waves. Detecting such a signal would allow probing of the physical properties of matter under extreme conditions. A significant fraction of the known pulsar population belongs to binary systems. Searching for unknown neutron stars in binary systems requires specialized algorithms to address unknown orbital…
▽ More
Rapidly spinning neutron stars are promising sources of persistent, continuous gravitational waves. Detecting such a signal would allow probing of the physical properties of matter under extreme conditions. A significant fraction of the known pulsar population belongs to binary systems. Searching for unknown neutron stars in binary systems requires specialized algorithms to address unknown orbital frequency modulations. We present a search for continuous gravitational waves emitted by neutron stars in binary systems in early data from the third observing run of the Advanced LIGO and Advanced Virgo detectors using the semicoherent, GPU-accelerated, BinarySkyHough pipeline. The search analyzes the most sensitive frequency band of the LIGO detectors, 50 - 300 Hz. Binary orbital parameters are split into four regions, comprising orbital periods of 3 - 45 days and projected semimajor axes of 2 - 40 light-seconds. No detections are reported. We estimate the sensitivity of the search using simulated continuous wave signals, achieving the most sensitive results to date across the analyzed parameter space.
△ Less
Submitted 19 March, 2021; v1 submitted 22 December, 2020;
originally announced December 2020.
-
Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation
Authors:
Diksha Garg,
Priyanka Gupta,
Pankaj Malhotra,
Lovekesh Vig,
Gautam Shroff
Abstract:
Most of the existing deep reinforcement learning (RL) approaches for session-based recommendations either rely on costly online interactions with real users, or rely on potentially biased rule-based or data-driven user-behavior models for learning. In this work, we instead focus on learning recommendation policies in the pure batch or offline setting, i.e. learning policies solely from offline his…
▽ More
Most of the existing deep reinforcement learning (RL) approaches for session-based recommendations either rely on costly online interactions with real users, or rely on potentially biased rule-based or data-driven user-behavior models for learning. In this work, we instead focus on learning recommendation policies in the pure batch or offline setting, i.e. learning policies solely from offline historical interaction logs or batch data generated from an unknown and sub-optimal behavior policy, without further access to data from the real-world or user-behavior models. We propose BCD4Rec: Batch-Constrained Distributional RL for Session-based Recommendations. BCD4Rec builds upon the recent advances in batch (offline) RL and distributional RL to learn from offline logs while dealing with the intrinsically stochastic nature of rewards from the users due to varied latent interest preferences (environments). We demonstrate that BCD4Rec significantly improves upon the behavior policy as well as strong RL and non-RL baselines in the batch setting in terms of standard performance metrics like Click Through Rates or Buy Rates. Other useful properties of BCD4Rec include: i. recommending items from the correct latent categories indicating better value estimates despite large action space (of the order of number of items), and ii. overcoming popularity bias in clicked or bought items typically present in the offline logs.
△ Less
Submitted 16 December, 2020;
originally announced December 2020.
-
Optimal quantum simulation of open quantum systems
Authors:
Pragati Gupta,
C. M. Chandrashekar
Abstract:
Digital quantum simulation on quantum systems require algorithms that can be implemented using finite quantum resources. Recent studies have demonstrated digital quantum simulation of open quantum systems on Noisy Intermediate-Scale Quantum (NISQ) devices. In this work, we develop quantum circuits for optimal simulation of Markovian and Non-Markovian open quantum systems. The circuits use ancilla…
▽ More
Digital quantum simulation on quantum systems require algorithms that can be implemented using finite quantum resources. Recent studies have demonstrated digital quantum simulation of open quantum systems on Noisy Intermediate-Scale Quantum (NISQ) devices. In this work, we develop quantum circuits for optimal simulation of Markovian and Non-Markovian open quantum systems. The circuits use ancilla qubits to simulate the environment, and memory effects are induced by storing information about the system on extra qubits. We simulate the amplitude damping channel and dephasing channel as examples of the framework and infer (Non-)Markovianity from the (non-)monotonic behaviour of the dynamics. Further, we develop a method to optimize simulations by decomposing complex open quantum dynamics into smaller parts, that can be simulated using a small number of qubits. We show that this optimization reduces quantum space complexity from $O(l)$ to $O(1)$ for simulating the environment.
△ Less
Submitted 14 December, 2020;
originally announced December 2020.
-
Effects of shell thickness on cross-helicity generation in convection-driven spherical dynamos
Authors:
Luis Silva,
Parag Gupta,
David MacTaggart,
Radostin D. Simitev
Abstract:
The relative importance of the helicity and cross-helicity electromotive dynamo effects for self-sustained magnetic field generation by chaotic thermal convection in rotating spherical shells is investigated as a function of shell thickness. Two distinct branches of dynamo solutions are found to coexist in direct numerical simulations for shell aspect ratios between 0.25 and 0.6 - a mean-field dip…
▽ More
The relative importance of the helicity and cross-helicity electromotive dynamo effects for self-sustained magnetic field generation by chaotic thermal convection in rotating spherical shells is investigated as a function of shell thickness. Two distinct branches of dynamo solutions are found to coexist in direct numerical simulations for shell aspect ratios between 0.25 and 0.6 - a mean-field dipolar regime and a fluctuating dipolar regime. The properties characterising the coexisting dynamo attractors are compared and contrasted, including differences in temporal behavior and spatial structures of both the magnetic field and rotating thermal convection. The helicity $α$-effect and the cross-helicity $γ$-effect are found to be comparable in intensity within the fluctuating dipolar dynamo regime, where their ratio does not vary significantly with the shell thickness. In contrast, within the mean-field dipolar dynamo regime the helicity $α$-effect dominates by approximately two orders of magnitude and becomes stronger with decreasing shell thickness.
△ Less
Submitted 12 December, 2020;
originally announced December 2020.
-
A decentralized approach towards secure firmware updates and testing over commercial IoT Devices
Authors:
Projjal Gupta
Abstract:
Internet technologies have made a paradigm shift in the fields of computing and data science and one such paradigm defining change is the Internet of Things or IoT. Nowadays, thousands of household appliances use integrated smart devices which allow remote monitoring and control and also allow intensive computational work such as high end AI-integrated smart security systems with sustained alerts…
▽ More
Internet technologies have made a paradigm shift in the fields of computing and data science and one such paradigm defining change is the Internet of Things or IoT. Nowadays, thousands of household appliances use integrated smart devices which allow remote monitoring and control and also allow intensive computational work such as high end AI-integrated smart security systems with sustained alerts for the user. The update process of these IoT devices usually lack the ability of checking the security of centralized servers, which may be compromised and host malicious firmware files as it is presumed that the servers are secure during deployment. The solution for this problem can be solved using a decentralized database to hold the hashes and the firmware. This paper discusses the possible implications of insecure servers used to host the firmwares of commercial IoT products, and aims to provide a blockchain based decentralized solution to host firmware files with the property of immutability, and controlled access to the firmware upload functions so as to stop unauthorized use. The paper sheds light over possible hardware implementations and the use of cryptographically secure components in such secure architecture models.
△ Less
Submitted 24 November, 2020;
originally announced November 2020.
-
Modeling Functional Similarity in Source Code with Graph-Based Siamese Networks
Authors:
Nikita Mehrotra,
Navdha Agarwal,
Piyush Gupta,
Saket Anand,
David Lo,
Rahul Purandare
Abstract:
Code clones are duplicate code fragments that share (nearly) similar syntax or semantics. Code clone detection plays an important role in software maintenance, code refactoring, and reuse. A substantial amount of research has been conducted in the past to detect clones. A majority of these approaches use lexical and syntactic information to detect clones. However, only a few of them target semanti…
▽ More
Code clones are duplicate code fragments that share (nearly) similar syntax or semantics. Code clone detection plays an important role in software maintenance, code refactoring, and reuse. A substantial amount of research has been conducted in the past to detect clones. A majority of these approaches use lexical and syntactic information to detect clones. However, only a few of them target semantic clones. Recently, motivated by the success of deep learning models in other fields, including natural language processing and computer vision, researchers have attempted to adopt deep learning techniques to detect code clones. These approaches use lexical information (tokens) and(or) syntactic structures like abstract syntax trees (ASTs) to detect code clones. However, they do not make sufficient use of the available structural and semantic information hence, limiting their capabilities.
This paper addresses the problem of semantic code clone detection using program dependency graphs and geometric neural networks, leveraging the structured syntactic and semantic information. We have developed a prototype tool HOLMES, based on our novel approach, and empirically evaluated it on popular code clone benchmarks. Our results show that HOLMES performs considerably better than the other state-of-the-art tool, TBCCD. We also evaluated HOLMES on unseen projects and performed cross dataset experiments to assess the generalizability of HOLMES. Our results affirm that HOLMES outperforms TBCCD since most of the pairs that HOLMES detected were either undetected or suboptimally reported by TBCCD.
△ Less
Submitted 25 November, 2020; v1 submitted 23 November, 2020;
originally announced November 2020.
-
Using Convolutional Variational Autoencoders to Predict Post-Trauma Health Outcomes from Actigraphy Data
Authors:
Ayse S. Cakmak,
Nina Thigpen,
Garrett Honke,
Erick Perez Alday,
Ali Bahrami Rad,
Rebecca Adaimi,
Chia Jung Chang,
Qiao Li,
Pramod Gupta,
Thomas Neylan,
Samuel A. McLean,
Gari D. Clifford
Abstract:
Depression and post-traumatic stress disorder (PTSD) are psychiatric conditions commonly associated with experiencing a traumatic event. Estimating mental health status through non-invasive techniques such as activity-based algorithms can help to identify successful early interventions. In this work, we used locomotor activity captured from 1113 individuals who wore a research grade smartwatch pos…
▽ More
Depression and post-traumatic stress disorder (PTSD) are psychiatric conditions commonly associated with experiencing a traumatic event. Estimating mental health status through non-invasive techniques such as activity-based algorithms can help to identify successful early interventions. In this work, we used locomotor activity captured from 1113 individuals who wore a research grade smartwatch post-trauma. A convolutional variational autoencoder (VAE) architecture was used for unsupervised feature extraction from four weeks of actigraphy data. By using VAE latent variables and the participant's pre-trauma physical health status as features, a logistic regression classifier achieved an area under the receiver operating characteristic curve (AUC) of 0.64 to estimate mental health outcomes. The results indicate that the VAE model is a promising approach for actigraphy data analysis for mental health outcomes in long-term studies.
△ Less
Submitted 19 November, 2020; v1 submitted 14 November, 2020;
originally announced November 2020.
-
Channel Tiling for Improved Performance and Accuracy of Optical Neural Network Accelerators
Authors:
Shurui Li,
Mario Miscuglio,
Volker J. Sorger,
Puneet Gupta
Abstract:
Low latency, high throughput inference on Convolution Neural Networks (CNNs) remains a challenge, especially for applications requiring large input or large kernel sizes. 4F optics provides a solution to accelerate CNNs by converting convolutions into Fourier-domain point-wise multiplications that are computationally 'free' in optical domain. However, existing 4F CNN systems suffer from the all-po…
▽ More
Low latency, high throughput inference on Convolution Neural Networks (CNNs) remains a challenge, especially for applications requiring large input or large kernel sizes. 4F optics provides a solution to accelerate CNNs by converting convolutions into Fourier-domain point-wise multiplications that are computationally 'free' in optical domain. However, existing 4F CNN systems suffer from the all-positive sensor readout issue which makes the implementation of a multi-channel, multi-layer CNN not scalable or even impractical. In this paper we propose a simple channel tiling scheme for 4F CNN systems that utilizes the high resolution of 4F system to perform channel summation inherently in optical domain before sensor detection, so the outputs of different channels can be correctly accumulated. Compared to state of the art, channel tiling gives similar accuracy, significantly better robustness to sensing quantization (33\% improvement in required sensing precision) error and noise (10dB reduction in tolerable sensing noise), 0.5X total filters required, 10-50X+ throughput improvement and as much as 3X reduction in required output camera resolution/bandwidth. Not requiring any additional optical hardware, the proposed channel tiling approach addresses an important throughput and precision bottleneck of high-speed, massively-parallel optical 4F computing systems.
△ Less
Submitted 14 January, 2021; v1 submitted 14 November, 2020;
originally announced November 2020.
-
Square-reflexive polynomials
Authors:
Karim Johannes Becher,
Parul Gupta
Abstract:
For a field $E$ of characteristic different from $2$ and cohomological $2$-dimension one, quadratic forms over the rational function field $E(X)$ are studied. A characterisation in terms of polynomials in $E[X]$ is obtained for having that quadratic forms over $E(X)$ satisfy a local-global principle with respect to discrete valuations that are trivial on $E$. In this way new elementary proofs for…
▽ More
For a field $E$ of characteristic different from $2$ and cohomological $2$-dimension one, quadratic forms over the rational function field $E(X)$ are studied. A characterisation in terms of polynomials in $E[X]$ is obtained for having that quadratic forms over $E(X)$ satisfy a local-global principle with respect to discrete valuations that are trivial on $E$. In this way new elementary proofs for the local-global principle are achieved in the cases where $E$ is finite or pseudo-algebraically closed. The study is complemented by various examples.
△ Less
Submitted 15 July, 2021; v1 submitted 10 November, 2020;
originally announced November 2020.
-
SuperDeConFuse: A Supervised Deep Convolutional Transform based Fusion Framework for Financial Trading Systems
Authors:
Pooja Gupta,
Angshul Majumdar,
Emilie Chouzenoux,
Giovanni Chierchia
Abstract:
This work proposes a supervised multi-channel time-series learning framework for financial stock trading. Although many deep learning models have recently been proposed in this domain, most of them treat the stock trading time-series data as 2-D image data, whereas its true nature is 1-D time-series data. Since the stock trading systems are multi-channel data, many existing techniques treating the…
▽ More
This work proposes a supervised multi-channel time-series learning framework for financial stock trading. Although many deep learning models have recently been proposed in this domain, most of them treat the stock trading time-series data as 2-D image data, whereas its true nature is 1-D time-series data. Since the stock trading systems are multi-channel data, many existing techniques treating them as 1-D time-series data are not suggestive of any technique to effectively fusion the information carried by the multiple channels. To contribute towards both of these shortcomings, we propose an end-to-end supervised learning framework inspired by the previously established (unsupervised) convolution transform learning framework. Our approach consists of processing the data channels through separate 1-D convolution layers, then fusing the outputs with a series of fully-connected layers, and finally applying a softmax classification layer. The peculiarity of our framework - SuperDeConFuse (SDCF), is that we remove the nonlinear activation located between the multi-channel convolution layers and the fully-connected layers, as well as the one located between the latter and the output layer. We compensate for this removal by introducing a suitable regularization on the aforementioned layer outputs and filters during the training phase. Specifically, we apply a logarithm determinant regularization on the layer filters to break symmetry and force diversity in the learnt transforms, whereas we enforce the non-negativity constraint on the layer outputs to mitigate the issue of dead neurons. This results in the effective learning of a richer set of features and filters with respect to a standard convolutional neural network. Numerical experiments confirm that the proposed model yields considerably better results than state-of-the-art deep learning techniques for real-world problem of stock trading.
△ Less
Submitted 9 November, 2020;
originally announced November 2020.