Search | arXiv e-print repository

On the optimal prediction of extreme events in heavy-tailed time series with applications to solar flare forecasting

Authors: Victor Verma, Stilian Stoev, Yang Chen

Abstract: The prediction of extreme events in time series is a fundamental problem arising in many financial, scientific, engineering, and other applications. We begin by establishing a general Neyman-Pearson-type characterization of optimal extreme event predictors in terms of density ratios. This yields new insights and several closed-form optimal extreme event predictors for additive models. These result… ▽ More The prediction of extreme events in time series is a fundamental problem arising in many financial, scientific, engineering, and other applications. We begin by establishing a general Neyman-Pearson-type characterization of optimal extreme event predictors in terms of density ratios. This yields new insights and several closed-form optimal extreme event predictors for additive models. These results naturally extend to time series, where we study optimal extreme event prediction for heavy-tailed autoregressive and moving average models. Using a uniform law of large numbers for ergodic time series, we establish the asymptotic optimality of an empirical version of the optimal predictor for autoregressive models. Using multivariate regular variation, we also obtain expressions for the optimal extremal precision in heavy-tailed infinite moving averages, which provide theoretical bounds on the ability to predict extremes in this general class of models. The developed theory and methodology is applied to the important problem of solar flare prediction based on the state-of-the-art GOES satellite flux measurements of the Sun. Our results demonstrate the success and limitations of long-memory autoregressive as well as long-range dependent heavy-tailed FARIMA models for the prediction of extreme solar flares. △ Less

Submitted 16 July, 2024; originally announced July 2024.

Comments: 57 pages, 5 figures

MSC Class: 62G32 (Primary) 62G20; 62M10; 62M20 (Secondary)

arXiv:2406.04661 [pdf, other]

doi 10.1038/s41467-022-29376-4

Quantum channel correction outperforming direct transmission

Authors: Sergei Slussarenko, Morgan M. Weston, Lynden K. Shalm, Varun B. Verma, Sae-Woo Nam, Sacha Kocsis, Timothy C. Ralph, Geoff J. Pryde

Abstract: Long-distance optical quantum channels are necessarily lossy, leading to errors in transmitted quantum information, entanglement degradation and, ultimately, poor protocol performance. Quantum states carrying information in the channel can be probabilistically amplified to compensate for loss, but are destroyed when amplification fails. Quantum correction of the channel itself is therefore require… ▽ More Long-distance optical quantum channels are necessarily lossy, leading to errors in transmitted quantum information, entanglement degradation and, ultimately, poor protocol performance. Quantum states carrying information in the channel can be probabilistically amplified to compensate for loss, but are destroyed when amplification fails. Quantum correction of the channel itself is therefore required, but break-even performance -- where arbitrary states can be better transmitted through a corrected channel than an uncorrected one -- has so far remained out of reach. Here we perform distillation by heralded amplification to improve a noisy entanglement channel. We subsequently employ entanglement swapping to demonstrate that arbitrary quantum information transmission is unconditionally improved -- i.e. without relying on postselection or post-processing of data -- compared to the uncorrected channel. In this way, it represents realisation of a genuine quantum relay. Our channel correction for single-mode quantum states will find use in quantum repeater, communication and metrology applications. △ Less

Submitted 7 June, 2024; originally announced June 2024.

Comments: 11 pages, 6 figures, supplementary included

Journal ref: Nature Communications 13, 1832 (2022)

arXiv:2405.07166 [pdf, other]

Resource Efficient Perception for Vision Systems

Authors: A V Subramanyam, Niyati Singal, Vinay K Verma

Abstract: Despite the rapid advancement in the field of image recognition, the processing of high-resolution imagery remains a computational challenge. However, this processing is pivotal for extracting detailed object insights in areas ranging from autonomous vehicle navigation to medical imaging analyses. Our study introduces a framework aimed at mitigating these challenges by leveraging memory efficient… ▽ More Despite the rapid advancement in the field of image recognition, the processing of high-resolution imagery remains a computational challenge. However, this processing is pivotal for extracting detailed object insights in areas ranging from autonomous vehicle navigation to medical imaging analyses. Our study introduces a framework aimed at mitigating these challenges by leveraging memory efficient patch based processing for high resolution images. It incorporates a global context representation alongside local patch information, enabling a comprehensive understanding of the image content. In contrast to traditional training methods which are limited by memory constraints, our method enables training of ultra high resolution images. We demonstrate the effectiveness of our method through superior performance on 7 different benchmarks across classification, object detection, and segmentation. Notably, the proposed method achieves strong performance even on resource-constrained devices like Jetson Nano. Our code is available at https://github.com/Visual-Conception-Group/Localized-Perception-Constrained-Vision-Systems. △ Less

Submitted 12 May, 2024; originally announced May 2024.

arXiv:2404.19341 [pdf, other]

Reliable or Deceptive? Investigating Gated Features for Smooth Visual Explanations in CNNs

Authors: Soham Mitra, Atri Sukul, Swalpa Kumar Roy, Pravendra Singh, Vinay Verma

Abstract: Deep learning models have achieved remarkable success across diverse domains. However, the intricate nature of these models often impedes a clear understanding of their decision-making processes. This is where Explainable AI (XAI) becomes indispensable, offering intuitive explanations for model decisions. In this work, we propose a simple yet highly effective approach, ScoreCAM++, which introduces… ▽ More Deep learning models have achieved remarkable success across diverse domains. However, the intricate nature of these models often impedes a clear understanding of their decision-making processes. This is where Explainable AI (XAI) becomes indispensable, offering intuitive explanations for model decisions. In this work, we propose a simple yet highly effective approach, ScoreCAM++, which introduces modifications to enhance the promising ScoreCAM method for visual explainability. Our proposed approach involves altering the normalization function within the activation layer utilized in ScoreCAM, resulting in significantly improved results compared to previous efforts. Additionally, we apply an activation function to the upsampled activation layers to enhance interpretability. This improvement is achieved by selectively gating lower-priority values within the activation layer. Through extensive experiments and qualitative comparisons, we demonstrate that ScoreCAM++ consistently achieves notably superior performance and fairness in interpreting the decision-making process compared to both ScoreCAM and previous methods. △ Less

Submitted 30 April, 2024; originally announced April 2024.

arXiv:2404.16106 [pdf, other]

A robust approach for time-bin encoded photonic quantum information protocols

Authors: Simon J. U. White, Emanuele Polino, Farzad Ghafari, Dominick J. Joch, Luis Villegas-Aguilar, Lynden K. Shalm, Varun B. Verma, Marcus Huber, Nora Tischler

Abstract: Quantum states encoded in the time-bin degree of freedom of photons represent a fundamental resource for quantum information protocols. Traditional methods for generating and measuring time-bin encoded quantum states face severe challenges due to optical instabilities, complex setups, and timing resolution requirements. Here, we leverage a robust approach based on Hong-Ou-Mandel interference that… ▽ More Quantum states encoded in the time-bin degree of freedom of photons represent a fundamental resource for quantum information protocols. Traditional methods for generating and measuring time-bin encoded quantum states face severe challenges due to optical instabilities, complex setups, and timing resolution requirements. Here, we leverage a robust approach based on Hong-Ou-Mandel interference that allows us to circumvent these issues. First, we perform high-fidelity quantum state tomographies of time-bin qubits with a short temporal separation. Then, we certify intrasystem polarization-time entanglement of single photons through a nonclassicality test. Finally, we propose a robust and scalable protocol to generate and measure high-dimensional time-bin quantum states in a single spatial mode. The protocol promises to enable access to high-dimensional states and tasks that are practically inaccessible with standard schemes, thereby advancing fundamental quantum information science and opening applications in quantum communication. △ Less

Submitted 24 April, 2024; originally announced April 2024.

arXiv:2403.20317 [pdf, other]

Convolutional Prompting meets Language Models for Continual Learning

Authors: Anurag Roy, Riddhiman Moulick, Vinay K. Verma, Saptarshi Ghosh, Abir Das

Abstract: Continual Learning (CL) enables machine learning models to learn from continuously shifting new training data in absence of data from old tasks. Recently, pretrained vision transformers combined with prompt tuning have shown promise for overcoming catastrophic forgetting in CL. These approaches rely on a pool of learnable prompts which can be inefficient in sharing knowledge across tasks leading t… ▽ More Continual Learning (CL) enables machine learning models to learn from continuously shifting new training data in absence of data from old tasks. Recently, pretrained vision transformers combined with prompt tuning have shown promise for overcoming catastrophic forgetting in CL. These approaches rely on a pool of learnable prompts which can be inefficient in sharing knowledge across tasks leading to inferior performance. In addition, the lack of fine-grained layer specific prompts does not allow these to fully express the strength of the prompts for CL. We address these limitations by proposing ConvPrompt, a novel convolutional prompt creation mechanism that maintains layer-wise shared embeddings, enabling both layer-specific learning and better concept transfer across tasks. The intelligent use of convolution enables us to maintain a low parameter overhead without compromising performance. We further leverage Large Language Models to generate fine-grained text descriptions of each category which are used to get task similarity and dynamically decide the number of prompts to be learned. Extensive experiments demonstrate the superiority of ConvPrompt and improves SOTA by ~3% with significantly less parameter overhead. We also perform strong ablation over various modules to disentangle the importance of different components. △ Less

Submitted 29 March, 2024; originally announced March 2024.

Comments: CVPR 2024 Camera Ready

arXiv:2402.18173 [pdf]

Harnessing the Duality of Magnetism and Conductivity: A Review of Oxide based Dilute Magnetic Semiconductors

Authors: Pankaj Bhardwaj, Jarnail Singh, Vikram Verma, Ravi Kumar

Abstract: Over the last two decades, the new branch of spintronics, i.e., semiconductor spintronics, has gained more attention because it integrates the characteristics of conventional semiconductors, such as optical bandgap and charge carriers, helpful for processing and computing pieces of information combined with magnets for data storage applications in a single device. Likewise, substituting transition… ▽ More Over the last two decades, the new branch of spintronics, i.e., semiconductor spintronics, has gained more attention because it integrates the characteristics of conventional semiconductors, such as optical bandgap and charge carriers, helpful for processing and computing pieces of information combined with magnets for data storage applications in a single device. Likewise, substituting transition metal (TM) ions to induce magnetic qualities into semiconductors or oxides creates dilute magnetic semiconductors (DMSs) or oxides (DMOs) with high electronic, photonic, and magnetic functionality. This review article discusses the historical outline of magnetic semiconductors with their origin and mechanism. It also includes a concise overview of various DMO systems based on their conductivity (p-type and n-type) to elucidate the synthesis, origin, and control mechanisms and further evoke the prepared spintronics devices. The occurrence of RTFM with transparency and conductivity can be helpful in spintronics device fabrications, which was assumed to be governed by the formation of intrinsic defects, charge carriers, morphology, and the induced exchange interactions between ions. The DMOs-based spintronics devices, such as magneto-optical devices, transparent ferromagnets, and spin-based solar cells, exploit both semiconducting and magnetic properties, which have also been discussed in this review article with outlook and perspectives. △ Less

Submitted 6 May, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

Comments: 49 pages, 2 tables, 31 figures

arXiv:2401.11932 [pdf, other]

Accelerating Causal Algorithms for Industrial-scale Data: A Distributed Computing Approach with Ray Framework

Authors: Vishal Verma, Vinod Reddy, Jaiprakash Ravi

Abstract: The increasing need for causal analysis in large-scale industrial datasets necessitates the development of efficient and scalable causal algorithms for real-world applications. This paper addresses the challenge of scaling causal algorithms in the context of conducting causal analysis on extensive datasets commonly encountered in industrial settings. Our proposed solution involves enhancing the sc… ▽ More The increasing need for causal analysis in large-scale industrial datasets necessitates the development of efficient and scalable causal algorithms for real-world applications. This paper addresses the challenge of scaling causal algorithms in the context of conducting causal analysis on extensive datasets commonly encountered in industrial settings. Our proposed solution involves enhancing the scalability of causal algorithm libraries, such as EconML, by leveraging the parallelism capabilities offered by the distributed computing framework Ray. We explore the potential of parallelizing key iterative steps within causal algorithms to significantly reduce overall runtime, supported by a case study that examines the impact on estimation times and costs. Through this approach, we aim to provide a more effective solution for implementing causal analysis in large-scale industrial applications. △ Less

Submitted 22 January, 2024; originally announced January 2024.

ACM Class: C.4; E.2; I.2.1

arXiv:2401.07465 [pdf, other]

Power Flow Analysis Using Deep Neural Networks in Three-Phase Unbalanced Smart Distribution Grids

Authors: Deepak Tiwari, Mehdi Jabbari Zideh, Veeru Talreja, Vishal Verma, Sarika K. Solanki, Jignesh Solanki

Abstract: Most power systems' approaches are currently tending towards stochastic and probabilistic methods due to the high variability of renewable sources and the stochastic nature of loads. Conventional power flow (PF) approaches such as forward-backward sweep (FBS) and Newton-Raphson require a high number of iterations to solve non-linear PF equations making them computationally very intensive. PF is th… ▽ More Most power systems' approaches are currently tending towards stochastic and probabilistic methods due to the high variability of renewable sources and the stochastic nature of loads. Conventional power flow (PF) approaches such as forward-backward sweep (FBS) and Newton-Raphson require a high number of iterations to solve non-linear PF equations making them computationally very intensive. PF is the most important study performed by utility, required in all stages of the power system, especially in operations and planning. This paper discusses the applications of deep learning (DL) to predict PF solutions for three-phase unbalanced power distribution grids. Three deep neural networks (DNNs); Radial Basis Function Network (RBFnet), Multi-Layer Perceptron (MLP), and Convolutional Neural Network (CNN), are proposed in this paper to predict PF solutions. The PF problem is formulated as a multi-output regression model where two or more output values are predicted based on the inputs. The training and testing data are generated through the OpenDSS-MATLAB COM interface. These methods are completely data-driven where the training relies on reducing the mismatch at each node without the need for the knowledge of the system. The novelty of the proposed methodology is that the models can accurately predict the PF solutions for the unbalanced distribution grids with mutual coupling and are robust to different R/X ratios, topology changes as well as generation and load variability introduced by the integration of distributed energy resources (DERs) and electric vehicles (EVs). To test the efficacy of the DNN models, they are applied to IEEE 4-node and 123-node test cases, and the American Electric Power (AEP) feeder model. The PF results for RBFnet, MLP, and CNN models are discussed in this paper demonstrating that all three DNN models provide highly accurate results in predicting PF solutions. △ Less

Submitted 14 January, 2024; originally announced January 2024.

arXiv:2312.11805 [pdf, other]

Gemini: A Family of Highly Capable Multimodal Models

Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultra model advances the state of the art in 30 of 32 of these benchmarks - notably being the first model to achieve human-expert performance on the well-studied exam benchmark MMLU, and improving the state of the art in every one of the 20 multimodal benchmarks we examined. We believe that the new capabilities of the Gemini family in cross-modal reasoning and language understanding will enable a wide variety of use cases. We discuss our approach toward post-training and deploying Gemini models responsibly to users through services including Gemini, Gemini Advanced, Google AI Studio, and Cloud Vertex AI. △ Less

Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

arXiv:2312.01188 [pdf, other]

Efficient Expansion and Gradient Based Task Inference for Replay Free Incremental Learning

Authors: Soumya Roy, Vinay K Verma, Deepak Gupta

Abstract: This paper proposes a simple but highly efficient expansion-based model for continual learning. The recent feature transformation, masking and factorization-based methods are efficient, but they grow the model only over the global or shared parameter. Therefore, these approaches do not fully utilize the previously learned information because the same task-specific parameter forgets the earlier kno… ▽ More This paper proposes a simple but highly efficient expansion-based model for continual learning. The recent feature transformation, masking and factorization-based methods are efficient, but they grow the model only over the global or shared parameter. Therefore, these approaches do not fully utilize the previously learned information because the same task-specific parameter forgets the earlier knowledge. Thus, these approaches show limited transfer learning ability. Moreover, most of these models have constant parameter growth for all tasks, irrespective of the task complexity. Our work proposes a simple filter and channel expansion based method that grows the model over the previous task parameters and not just over the global parameter. Therefore, it fully utilizes all the previously learned information without forgetting, which results in better knowledge transfer. The growth rate in our proposed model is a function of task complexity; therefore for a simple task, the model has a smaller parameter growth while for complex tasks, the model requires more parameters to adapt to the current task. Recent expansion based models show promising results for task incremental learning (TIL). However, for class incremental learning (CIL), prediction of task id is a crucial challenge; hence, their results degrade rapidly as the number of tasks increase. In this work, we propose a robust task prediction method that leverages entropy weighted data augmentations and the models gradient using pseudo labels. We evaluate our model on various datasets and architectures in the TIL, CIL and generative continual learning settings. The proposed approach shows state-of-the-art results in all these settings. Our extensive ablation studies show the efficacy of the proposed components. △ Less

Submitted 2 December, 2023; originally announced December 2023.

Comments: To be Appeared in WACV, 2024

arXiv:2312.01167 [pdf, other]

Meta-Learned Attribute Self-Interaction Network for Continual and Generalized Zero-Shot Learning

Authors: Vinay K Verma, Nikhil Mehta, Kevin J Liang, Aakansha Mishra, Lawrence Carin

Abstract: Zero-shot learning (ZSL) is a promising approach to generalizing a model to categories unseen during training by leveraging class attributes, but challenges remain. Recently, methods using generative models to combat bias towards classes seen during training have pushed state of the art, but these generative models can be slow or computationally expensive to train. Also, these generative models as… ▽ More Zero-shot learning (ZSL) is a promising approach to generalizing a model to categories unseen during training by leveraging class attributes, but challenges remain. Recently, methods using generative models to combat bias towards classes seen during training have pushed state of the art, but these generative models can be slow or computationally expensive to train. Also, these generative models assume that the attribute vector of each unseen class is available a priori at training, which is not always practical. Additionally, while many previous ZSL methods assume a one-time adaptation to unseen classes, in reality, the world is always changing, necessitating a constant adjustment of deployed models. Models unprepared to handle a sequential stream of data are likely to experience catastrophic forgetting. We propose a Meta-learned Attribute self-Interaction Network (MAIN) for continual ZSL. By pairing attribute self-interaction trained using meta-learning with inverse regularization of the attribute encoder, we are able to outperform state-of-the-art results without leveraging the unseen class attributes while also being able to train our models substantially faster (>100x) than expensive generative-based approaches. We demonstrate this with experiments on five standard ZSL datasets (CUB, aPY, AWA1, AWA2, and SUN) in the generalized zero-shot learning and continual (fixed/dynamic) zero-shot learning settings. Extensive ablations and analyses demonstrate the efficacy of various components proposed. △ Less

Submitted 2 December, 2023; originally announced December 2023.

Comments: Accepted in IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024. arXiv admin note: substantial text overlap with arXiv:2102.11856

arXiv:2311.16496 [pdf, other]

DPOD: Domain-Specific Prompt Tuning for Multimodal Fake News Detection

Authors: Debarshi Brahma, Amartya Bhattacharya, Suraj Nagaje Mahadev, Anmol Asati, Vikas Verma, Soma Biswas

Abstract: The spread of fake news using out-of-context images has become widespread and is a relevant problem in this era of information overload. Such out-of-context fake news may arise across different domains like politics, sports, entertainment, etc. In practical scenarios, an inherent problem of imbalance exists among news articles from such widely varying domains, resulting in a few domains with abund… ▽ More The spread of fake news using out-of-context images has become widespread and is a relevant problem in this era of information overload. Such out-of-context fake news may arise across different domains like politics, sports, entertainment, etc. In practical scenarios, an inherent problem of imbalance exists among news articles from such widely varying domains, resulting in a few domains with abundant data, while the rest containing very limited data. Under such circumstances, it is imperative to develop methods which can work in such varying amounts of data setting. In this work, we explore whether out-of-domain data can help to improve out-of-context misinformation detection (termed here as multi-modal fake news detection) of a desired domain, to address this challenging problem. Towards this goal, we propose a novel framework termed DPOD (Domain-specific Prompt-tuning using Out-of-Domain data). First, to compute generalizable features, we modify the Vision-Language Model, CLIP to extract features that helps to align the representations of the images and corresponding text captions of both the in-domain and out-of-domain data in a label-aware manner. Further, we propose a domain-specific prompt learning technique which leverages the training samples of all the available domains based on the extent they can be useful to the desired domain. Extensive experiments on a large-scale benchmark dataset, namely NewsCLIPpings demonstrate that the proposed framework achieves state of-the-art performance, significantly surpassing the existing approaches for this challenging task. Code will be released on acceptance. △ Less

Submitted 12 March, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

arXiv:2310.05651 [pdf, other]

FENCE: Fairplay Ensuring Network Chain Entity for Real-Time Multiple ID Detection at Scale In Fantasy Sports

Authors: Akriti Upreti, Kartavya Kothari, Utkarsh Thukral, Vishal Verma

Abstract: Dream11 takes pride in being a unique platform that enables over 190 million fantasy sports users to demonstrate their skills and connect deeper with their favorite sports. While managing such a scale, one issue we are faced with is duplicate/multiple account creation in the system. This is done by some users with the intent of abusing the platform, typically for bonus offers. The challenge is to… ▽ More Dream11 takes pride in being a unique platform that enables over 190 million fantasy sports users to demonstrate their skills and connect deeper with their favorite sports. While managing such a scale, one issue we are faced with is duplicate/multiple account creation in the system. This is done by some users with the intent of abusing the platform, typically for bonus offers. The challenge is to detect these multiple accounts before it is too late. We propose a graph-based solution to solve this problem in which we first predict edges/associations between users. Using the edge information we highlight clusters of colluding multiple accounts. In this paper, we talk about our distributed ML system which is deployed to serve and support the inferences from our detection models. The challenge is to do this in real-time in order to take corrective actions. A core part of this setup also involves human-in-the-loop components for validation, feedback, and ground-truth labeling. △ Less

Submitted 9 October, 2023; originally announced October 2023.

Comments: 7 pages, 7 figures, accepted in AIML Systems 2023

ACM Class: I.2.1

arXiv:2309.16890 [pdf, other]

doi 10.1063/5.0178931

A 64-pixel mid-infrared single-photon imager based on superconducting nanowire detectors

Authors: Benedikt Hampel, Richard P. Mirin, Sae Woo Nam, Varun B. Verma

Abstract: A large-format mid-infrared single-photon imager with very low dark count rates would enable a broad range of applications in fields like astronomy and chemistry. Superconducting nanowire single-photon detectors (SNSPDs) are a mature photon-counting technology as demonstrated by their figures of merit. However, scaling SNSPDs to large array sizes for mid-infrared applications requires sophisticate… ▽ More A large-format mid-infrared single-photon imager with very low dark count rates would enable a broad range of applications in fields like astronomy and chemistry. Superconducting nanowire single-photon detectors (SNSPDs) are a mature photon-counting technology as demonstrated by their figures of merit. However, scaling SNSPDs to large array sizes for mid-infrared applications requires sophisticated readout architectures in addition to superconducting materials development. In this work, an SNSPD array design that combines a thermally coupled row-column multiplexing architecture with a thermally coupled time-of-flight transmission line was developed for mid-infrared applications. The design requires only six cables and can be scaled to larger array sizes. The demonstration of a 64-pixel array shows promising results for wavelengths between $\mathrm{3.4\,μm}$ and $\mathrm{10\,μm}$, which will enable the use of this single-photon detector technology for a broad range of new applications. △ Less

Submitted 28 September, 2023; originally announced September 2023.

Comments: 7 pages, 3 figures, 1 page supplementary material. The following article has been submitted to Applied Physics Letters

Journal ref: Appl. Phys. Lett. 124, 042602 (2024)

arXiv:2309.08227 [pdf, other]

VERSE: Virtual-Gradient Aware Streaming Lifelong Learning with Anytime Inference

Authors: Soumya Banerjee, Vinay K. Verma, Avideep Mukherjee, Deepak Gupta, Vinay P. Namboodiri, Piyush Rai

Abstract: Lifelong learning or continual learning is the problem of training an AI agent continuously while also preventing it from forgetting its previously acquired knowledge. Streaming lifelong learning is a challenging setting of lifelong learning with the goal of continuous learning in a dynamic non-stationary environment without forgetting. We introduce a novel approach to lifelong learning, which is… ▽ More Lifelong learning or continual learning is the problem of training an AI agent continuously while also preventing it from forgetting its previously acquired knowledge. Streaming lifelong learning is a challenging setting of lifelong learning with the goal of continuous learning in a dynamic non-stationary environment without forgetting. We introduce a novel approach to lifelong learning, which is streaming (observes each training example only once), requires a single pass over the data, can learn in a class-incremental manner, and can be evaluated on-the-fly (anytime inference). To accomplish these, we propose a novel \emph{virtual gradients} based approach for continual representation learning which adapts to each new example while also generalizing well on past data to prevent catastrophic forgetting. Our approach also leverages an exponential-moving-average-based semantic memory to further enhance performance. Experiments on diverse datasets with temporally correlated observations demonstrate our method's efficacy and superior performance over existing methods. △ Less

Submitted 19 February, 2024; v1 submitted 15 September, 2023; originally announced September 2023.

arXiv:2308.16295 [pdf]

Disposable face masks: a direct source for inhalation of microplastics

Authors: Andres F. Prada, Avram Distler, Shyuan Cheng, John W. Scott, Leonardo P. Chamorro, Ganesh Subramanian, Vishal Verma, Andrew Turner

Abstract: Surgical masks have played a crucial role in healthcare facilities to protect against respiratory and infectious diseases, particularly during the COVID-19 pandemic. However, the synthetic fibers, mainly made of polypropylene, used in their production may adversely affect the environment and human health. Recent studies have confirmed the presence of microplastics and fibers in human lungs and hav… ▽ More Surgical masks have played a crucial role in healthcare facilities to protect against respiratory and infectious diseases, particularly during the COVID-19 pandemic. However, the synthetic fibers, mainly made of polypropylene, used in their production may adversely affect the environment and human health. Recent studies have confirmed the presence of microplastics and fibers in human lungs and have related these synthetic particles with the occurrence of pulmonary ground glass nodules. Using a piston system to simulate human breathing, this study investigates the role of surgical masks as a direct source of inhalation of microplastics. Results reveal the release of particles of sizes ranging from nanometers (300 nm) to millimeters (~2 mm) during normal breathing conditions, raising concerns about the potential health risks. Notably, large visible particles (> 1 mm) were observed to be ejected from masks with limited wear after only a few breathing cycles. Given the widespread use of masks by healthcare workers and the potential future need for mask usage by the general population during seasonal infectious diseases or new pandemics, developing face masks using safe materials for both users and the environment is imperative. △ Less

Submitted 30 August, 2023; originally announced August 2023.

Comments: 11 pages, 3 figures

arXiv:2308.11357 [pdf, other]

Exemplar-Free Continual Transformer with Convolutions

Authors: Anurag Roy, Vinay Kumar Verma, Sravan Voonna, Kripabandhu Ghosh, Saptarshi Ghosh, Abir Das

Abstract: Continual Learning (CL) involves training a machine learning model in a sequential manner to learn new information while retaining previously learned tasks without the presence of previous training data. Although there has been significant interest in CL, most recent CL approaches in computer vision have focused on convolutional architectures only. However, with the recent success of vision transf… ▽ More Continual Learning (CL) involves training a machine learning model in a sequential manner to learn new information while retaining previously learned tasks without the presence of previous training data. Although there has been significant interest in CL, most recent CL approaches in computer vision have focused on convolutional architectures only. However, with the recent success of vision transformers, there is a need to explore their potential for CL. Although there have been some recent CL approaches for vision transformers, they either store training instances of previous tasks or require a task identifier during test time, which can be limiting. This paper proposes a new exemplar-free approach for class/task incremental learning called ConTraCon, which does not require task-id to be explicitly present during inference and avoids the need for storing previous training instances. The proposed approach leverages the transformer architecture and involves re-weighting the key, query, and value weights of the multi-head self-attention layers of a transformer trained on a similar task. The re-weighting is done using convolution, which enables the approach to maintain low parameter requirements per task. Additionally, an image augmentation-based entropic task identification approach is used to predict tasks without requiring task-ids during inference. Experiments on four benchmark datasets demonstrate that the proposed approach outperforms several competitive approaches while requiring fewer parameters. △ Less

Submitted 22 August, 2023; originally announced August 2023.

Comments: Accepted in ICCV 2023

arXiv:2305.15047 [pdf, other]

Ghostbuster: Detecting Text Ghostwritten by Large Language Models

Authors: Vivek Verma, Eve Fleisig, Nicholas Tomlin, Dan Klein

Abstract: We introduce Ghostbuster, a state-of-the-art system for detecting AI-generated text. Our method works by passing documents through a series of weaker language models, running a structured search over possible combinations of their features, and then training a classifier on the selected features to predict whether documents are AI-generated. Crucially, Ghostbuster does not require access to token… ▽ More We introduce Ghostbuster, a state-of-the-art system for detecting AI-generated text. Our method works by passing documents through a series of weaker language models, running a structured search over possible combinations of their features, and then training a classifier on the selected features to predict whether documents are AI-generated. Crucially, Ghostbuster does not require access to token probabilities from the target model, making it useful for detecting text generated by black-box models or unknown model versions. In conjunction with our model, we release three new datasets of human- and AI-generated text as detection benchmarks in the domains of student essays, creative writing, and news articles. We compare Ghostbuster to a variety of existing detectors, including DetectGPT and GPTZero, as well as a new RoBERTa baseline. Ghostbuster achieves 99.0 F1 when evaluated across domains, which is 5.9 F1 higher than the best preexisting model. It also outperforms all previous approaches in generalization across writing domains (+7.5 F1), prompting strategies (+2.1 F1), and language models (+4.4 F1). We also analyze the robustness of our system to a variety of perturbations and paraphrasing attacks and evaluate its performance on documents written by non-native English speakers. △ Less

Submitted 5 April, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

Comments: NAACL 2024

arXiv:2305.12084 [pdf, other]

Revisiting Entropy Rate Constancy in Text

Authors: Vivek Verma, Nicholas Tomlin, Dan Klein

Abstract: The uniform information density (UID) hypothesis states that humans tend to distribute information roughly evenly across an utterance or discourse. Early evidence in support of the UID hypothesis came from Genzel & Charniak (2002), which proposed an entropy rate constancy principle based on the probability of English text under n-gram language models. We re-evaluate the claims of Genzel & Charniak… ▽ More The uniform information density (UID) hypothesis states that humans tend to distribute information roughly evenly across an utterance or discourse. Early evidence in support of the UID hypothesis came from Genzel & Charniak (2002), which proposed an entropy rate constancy principle based on the probability of English text under n-gram language models. We re-evaluate the claims of Genzel & Charniak (2002) with neural language models, failing to find clear evidence in support of entropy rate constancy. We conduct a range of experiments across datasets, model sizes, and languages and discuss implications for the uniform information density hypothesis and linguistic theories of efficient communication more broadly. △ Less

Submitted 17 October, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

Comments: Findings of EMNLP 2023

arXiv:2303.10739 [pdf, other]

doi 10.1063/5.0150282

Large active-area superconducting microwire detector array with single-photon sensitivity in the near-infrared

Authors: Jamie S. Luskin, Ekkehart Schmidt, Boris Korzh, Andrew D. Beyer, Bruce Bumble, Jason P. Allmaras, Alexander B. Walter, Emma E. Wollman, Lautaro Narváez, Varun B. Verma, Sae Woo Nam, Ilya Charaev, Marco Colangelo, Karl K. Berggren, Cristián Peña, Maria Spiropulu, Maurice Garcia-Sciveres, Stephen Derenzo, Matthew D. Shaw

Abstract: Superconducting nanowire single photon detectors (SNSPDs) are the highest-performing technology for time-resolved single-photon counting from the UV to the near-infrared. The recent discovery of single-photon sensitivity in micrometer-scale superconducting wires is a promising pathway to explore for large active area devices with application to dark matter searches and fundamental physics experime… ▽ More Superconducting nanowire single photon detectors (SNSPDs) are the highest-performing technology for time-resolved single-photon counting from the UV to the near-infrared. The recent discovery of single-photon sensitivity in micrometer-scale superconducting wires is a promising pathway to explore for large active area devices with application to dark matter searches and fundamental physics experiments. We present 8-pixel $1 mm^2$ superconducting microwire single photon detectors (SMSPDs) with $1\,\mathrm{μm}$-wide wires fabricated from WSi and MoSi films of various stoichiometries using electron-beam and optical lithography. Devices made from all materials and fabrication techniques show saturated internal detection efficiency at 1064 nm in at least one pixel, and the best performing device made from silicon-rich WSi shows single-photon sensitivity in all 8 pixels and saturated internal detection efficiency in 6/8 pixels. This detector is the largest reported active-area SMSPD or SNSPD with near-IR sensitivity published to date, and the first report of an SMSPD array. By further optimizing the photolithography techniques presented in this work, a viable pathway exists to realize larger devices with $cm^2$-scale active area and beyond. △ Less

Submitted 19 March, 2023; originally announced March 2023.

arXiv:2302.01462 [pdf, other]

doi 10.1063/5.0145077

Trap-Integrated Superconducting Nanowire Single-Photon Detectors with Improved RF Tolerance for Trapped-Ion Qubit State Readout

Authors: Benedikt Hampel, Daniel H. Slichter, Dietrich Leibfried, Richard P. Mirin, Sae Woo Nam, Varun B. Verma

Abstract: State readout of trapped-ion qubits with trap-integrated detectors can address important challenges for scalable quantum computing, but the strong rf electric fields used for trapping can impact detector performance. Here, we report on NbTiN superconducting nanowire single-photon detectors (SNSPDs) employing grounded aluminum mirrors as electrical shielding that are integrated into linear surface-… ▽ More State readout of trapped-ion qubits with trap-integrated detectors can address important challenges for scalable quantum computing, but the strong rf electric fields used for trapping can impact detector performance. Here, we report on NbTiN superconducting nanowire single-photon detectors (SNSPDs) employing grounded aluminum mirrors as electrical shielding that are integrated into linear surface-electrode rf ion traps. The shielded SNSPDs can be successfully operated at applied rf trapping potentials of up to $\mathrm{54\,V_{peak}}$ at $\mathrm{70\,MHz}$ and temperatures of up to $\mathrm{6\,K}$, with a maximum system detection efficiency of $\mathrm{68\,\%}$. This performance should be sufficient to enable parallel high-fidelity state readout of a wide range of trapped ion species in typical cryogenic apparatus. △ Less

Submitted 2 February, 2023; originally announced February 2023.

Comments: 6 pages, 4 figures. The following article has been submitted to Applied Physics Letters

Journal ref: Appl. Phys. Lett. 122, 174001 (2023)

arXiv:2301.11892 [pdf, other]

Streaming LifeLong Learning With Any-Time Inference

Authors: Soumya Banerjee, Vinay Kumar Verma, Vinay P. Namboodiri

Abstract: Despite rapid advancements in lifelong learning (LLL) research, a large body of research mainly focuses on improving the performance in the existing \textit{static} continual learning (CL) setups. These methods lack the ability to succeed in a rapidly changing \textit{dynamic} environment, where an AI agent needs to quickly learn new instances in a `single pass' from the non-i.i.d (also possibly t… ▽ More Despite rapid advancements in lifelong learning (LLL) research, a large body of research mainly focuses on improving the performance in the existing \textit{static} continual learning (CL) setups. These methods lack the ability to succeed in a rapidly changing \textit{dynamic} environment, where an AI agent needs to quickly learn new instances in a `single pass' from the non-i.i.d (also possibly temporally contiguous/coherent) data streams without suffering from catastrophic forgetting. For practical applicability, we propose a novel lifelong learning approach, which is streaming, i.e., a single input sample arrives in each time step, single pass, class-incremental, and subject to be evaluated at any moment. To address this challenging setup and various evaluation protocols, we propose a Bayesian framework, that enables fast parameter update, given a single training example, and enables any-time inference. We additionally propose an implicit regularizer in the form of snap-shot self-distillation, which effectively minimizes the forgetting further. We further propose an effective method that efficiently selects a subset of samples for online memory rehearsal and employs a new replay buffer management scheme that significantly boosts the overall performance. Our empirical evaluations and ablations demonstrate that the proposed method outperforms the prior works by large margins. △ Less

Submitted 27 January, 2023; originally announced January 2023.

Comments: arXiv admin note: substantial text overlap with arXiv:2110.10741

arXiv:2212.13381 [pdf, other]

MixupE: Understanding and Improving Mixup from Directional Derivative Perspective

Authors: Yingtian Zou, Vikas Verma, Sarthak Mittal, Wai Hoh Tang, Hieu Pham, Juho Kannala, Yoshua Bengio, Arno Solin, Kenji Kawaguchi

Abstract: Mixup is a popular data augmentation technique for training deep neural networks where additional samples are generated by linearly interpolating pairs of inputs and their labels. This technique is known to improve the generalization performance in many learning paradigms and applications. In this work, we first analyze Mixup and show that it implicitly regularizes infinitely many directional deri… ▽ More Mixup is a popular data augmentation technique for training deep neural networks where additional samples are generated by linearly interpolating pairs of inputs and their labels. This technique is known to improve the generalization performance in many learning paradigms and applications. In this work, we first analyze Mixup and show that it implicitly regularizes infinitely many directional derivatives of all orders. Based on this new insight, we propose an improved version of Mixup, theoretically justified to deliver better generalization performance than the vanilla Mixup. To demonstrate the effectiveness of the proposed method, we conduct experiments across various domains such as images, tabular data, speech, and graphs. Our results show that the proposed method improves Mixup across multiple datasets using a variety of architectures, for instance, exhibiting an improvement over Mixup by 0.8% in ImageNet top-1 accuracy. △ Less

Submitted 15 October, 2023; v1 submitted 27 December, 2022; originally announced December 2022.

Comments: 16 pages, Best Student Paper Award at UAI 2023

arXiv:2210.12818 [pdf, other]

Pushing the Efficiency Limit Using Structured Sparse Convolutions

Authors: Vinay Kumar Verma, Nikhil Mehta, Shijing Si, Ricardo Henao, Lawrence Carin

Abstract: Weight pruning is among the most popular approaches for compressing deep convolutional neural networks. Recent work suggests that in a randomly initialized deep neural network, there exist sparse subnetworks that achieve performance comparable to the original network. Unfortunately, finding these subnetworks involves iterative stages of training and pruning, which can be computationally expensive.… ▽ More Weight pruning is among the most popular approaches for compressing deep convolutional neural networks. Recent work suggests that in a randomly initialized deep neural network, there exist sparse subnetworks that achieve performance comparable to the original network. Unfortunately, finding these subnetworks involves iterative stages of training and pruning, which can be computationally expensive. We propose Structured Sparse Convolution (SSC), which leverages the inherent structure in images to reduce the parameters in the convolutional filter. This leads to improved efficiency of convolutional architectures compared to existing methods that perform pruning at initialization. We show that SSC is a generalization of commonly used layers (depthwise, groupwise and pointwise convolution) in ``efficient architectures.'' Extensive experiments on well-known CNN models and datasets show the effectiveness of the proposed method. Architectures based on SSC achieve state-of-the-art performance compared to baselines on CIFAR-10, CIFAR-100, Tiny-ImageNet, and ImageNet classification benchmarks. △ Less

Submitted 23 October, 2022; originally announced October 2022.

Comments: Accepted at the IEEE Winter Conference on Applications of Computer Vision, WACV 2023

arXiv:2210.09505 [pdf, other]

CNT (Conditioning on Noisy Targets): A new Algorithm for Leveraging Top-Down Feedback

Authors: Alexia Jolicoeur-Martineau, Alex Lamb, Vikas Verma, Aniket Didolkar

Abstract: We propose a novel regularizer for supervised learning called Conditioning on Noisy Targets (CNT). This approach consists in conditioning the model on a noisy version of the target(s) (e.g., actions in imitation learning or labels in classification) at a random noise level (from small to large noise). At inference time, since we do not know the target, we run the network with only noise in place o… ▽ More We propose a novel regularizer for supervised learning called Conditioning on Noisy Targets (CNT). This approach consists in conditioning the model on a noisy version of the target(s) (e.g., actions in imitation learning or labels in classification) at a random noise level (from small to large noise). At inference time, since we do not know the target, we run the network with only noise in place of the noisy target. CNT provides hints through the noisy label (with less noise, we can more easily infer the true target). This give two main benefits: 1) the top-down feedback allows the model to focus on simpler and more digestible sub-problems and 2) rather than learning to solve the task from scratch, the model will first learn to master easy examples (with less noise), while slowly progressing toward harder examples (with more noise). △ Less

Submitted 26 October, 2022; v1 submitted 17 October, 2022; originally announced October 2022.

arXiv:2209.10676 [pdf, other]

doi 10.1016/j.ocemod.2022.102136

Lagrangian surface signatures reveal upper-ocean vertical displacement conduits near oceanic density fronts

Authors: H. M. Aravind, Vicky Verma, Sutanu Sarkar, Mara A. Freilich, Amala Mahadevan, Patrick J. Haley, Pierre F. J. Lermusiaux, Michael R. Allshouse

Abstract: Vertical transport in the ocean plays a critical role in the exchange of freshwater, heat, nutrients, and other biogeochemical tracers. While there are situations where vertical fluxes are important, studying the vertical transport and displacement of material requires analysis over a finite interval of time. One such example is the subduction of fluid from the mixed layer into the pycnocline, whi… ▽ More Vertical transport in the ocean plays a critical role in the exchange of freshwater, heat, nutrients, and other biogeochemical tracers. While there are situations where vertical fluxes are important, studying the vertical transport and displacement of material requires analysis over a finite interval of time. One such example is the subduction of fluid from the mixed layer into the pycnocline, which is known to occur near density fronts. Divergence has been used to estimate vertical velocities indicating that surface measurements, where observational data is most widely available, can be used to locate these vertical transport conduits. We evaluate the correlation between surface signatures derived from Eulerian (horizontal divergence, density gradient, and vertical velocity) and Lagrangian (dilation rate and finite time Lyapunov exponent) metrics and vertical displacement conduits. Two submesoscale resolving models of density fronts and a data-assimilative model of the western Mediterranean were analyzed. The Lagrangian surface signatures locate significantly more of the strongest displacement features and the difference in the expected displacements relative to Eulerian ones increases with the length of the time interval considered. Ensemble analysis of forecasts from the Mediterranean model demonstrates that the Lagrangian surface signatures can be used to identify regions of strongest downward vertical displacement even without knowledge of the true ocean state. △ Less

Submitted 21 September, 2022; originally announced September 2022.

Comments: 30 pages, 8 figures. Submitted to Ocean Modelling

arXiv:2204.02042 [pdf]

doi 10.1016/j.physb.2022.414129

Cr doping-induced ferromagnetism in the spin-glass Cd1-xMnxTe studied by x-ray magnetic circular dichroism

Authors: V. K. Verma, S. Sakamoto, K. Ishikawa, V. R. Singh, K. Ishigami, G. Shibata, T. Kadono, T. Koide, S. Kuroda, A. Fujimori

Abstract: The prototypical diluted magnetic semiconductor Cd1-xMnxTe is a spin glass (x<0.6) or an antiferromagnet (x>0.6), but becomes ferromagnetic upon doping with a small amount of Cr atoms substituting for Mn. In order to investigate the origin of the ferromagnetism in Cd1-x-yMnxCryTe, we have studied its element specific magnetic properties by x-ray absorption spectroscopy (XAS) and x-ray magnetic cir… ▽ More The prototypical diluted magnetic semiconductor Cd1-xMnxTe is a spin glass (x<0.6) or an antiferromagnet (x>0.6), but becomes ferromagnetic upon doping with a small amount of Cr atoms substituting for Mn. In order to investigate the origin of the ferromagnetism in Cd1-x-yMnxCryTe, we have studied its element specific magnetic properties by x-ray absorption spectroscopy (XAS) and x-ray magnetic circular dichroism (XMCD) at the Cr and Mn L2,3 edges. Thin films were grown by molecular beam epitaxy with a fixed Mn content of x = 0.2 and varying Cr content in the range of y = 0 - 0.04. Measured XAS and XMCD spectra indicate that both Cr and Mn atoms are divalent and that the ferromagnetic or superparamagnetic components of Cr and Mn are aligned in the same directions. The magnetization of Mn increases with increasing Cr content. These results can be explained if ferromagnetic interaction exists between neighboring Mn and Cr ions although interaction between Mn atoms is largely antiferromagnetic. We conclude that each ferromagnetic or superparamagnetic cluster consists of ferromagnetically coupled several Cr and a much larger number of Mn ions. △ Less

Submitted 5 April, 2022; originally announced April 2022.

Comments: 13 pages, 5 figures

arXiv:2202.05942 [pdf, other]

doi 10.1063/5.0088007

Broadband polarization insensitivity and high detection efficiency in high-fill-factor superconducting microwire single-photon detectors

Authors: Dileep V. Reddy, Negar Otrooshi, Sae Woo Nam, Richard P. Mirin, Varun B. Verma

Abstract: Single-photon detection via absorption in current-biased nanoscale superconducting structures has become a preferred technology in quantum optics and related fields. Single-mode fiber packaged devices have seen new records set in detection efficiency, timing jitter, recovery times, and largest sustainable count rates. The popular approaches to decreasing polarization sensitivity have thus far been… ▽ More Single-photon detection via absorption in current-biased nanoscale superconducting structures has become a preferred technology in quantum optics and related fields. Single-mode fiber packaged devices have seen new records set in detection efficiency, timing jitter, recovery times, and largest sustainable count rates. The popular approaches to decreasing polarization sensitivity have thus far been limited to introduction of geometrically symmetric nanowire meanders, such as spirals and fractals, in the active area. The constraints on bending radii, and by extension, fill factors, in such designs limits their maximum efficiency. The discovery of single-photon sensitivity in micrometer-scale superconducting wires enables novel meander patterns with no effective upper limit on fill factor. This work demonstrates simultaneous low-polarization sensitivity ($1.02\pm 0.008$) and high detection efficiency ($> 91.8\%$ with $67\%$ confidence at $2\times10^5$ counts per second) across a $40$ nm bandwidth centered at 1550 nm in 0.51 $μ\text{m}$ wide microwire devices made of silicon-rich tungsten silicide, with a $0.91$ fill factor in the active area. These devices boasted efficiencies of $96.5-96.9\% \pm 0.5\%$ at $1\times10^5$ counts per second for 1550 nm light. △ Less

Submitted 2 March, 2022; v1 submitted 11 February, 2022; originally announced February 2022.

Comments: 18 pages, 13 figures (including supplementary document) Added citations in replacement version

Journal ref: APL Photonics 7, 051302 (2022)

arXiv:2112.07976 [pdf, other]

doi 10.1088/1361-6668/ac5338

Laser-lithographically written micron-wide superconducting nanowire single-photon detectors

Authors: Maximilian Protte, Varun B. Verma, Jan Philipp Höpker, Richard P. Mirin, Sae Woo Nam, Tim J. Bartley

Abstract: We demonstrate the fabrication of micron-wide tungsten silicide superconducting nanowire single-photon detectors on a silicon substrate using laser lithography. We show saturated internal detection efficiencies with wire widths from 0.59$μ$m to 1.43$μ$m under illumination at 1550nm. We demonstrate both straight wires, as well as meandered structures. Single-photon sensitivity is shown in devices u… ▽ More We demonstrate the fabrication of micron-wide tungsten silicide superconducting nanowire single-photon detectors on a silicon substrate using laser lithography. We show saturated internal detection efficiencies with wire widths from 0.59$μ$m to 1.43$μ$m under illumination at 1550nm. We demonstrate both straight wires, as well as meandered structures. Single-photon sensitivity is shown in devices up to 4mm in length. Laser-lithographically written devices allow for fast and easy structuring of large areas while maintaining a saturated internal efficiency for wire width around 1$μ$m. △ Less

Submitted 15 December, 2021; originally announced December 2021.

Comments: 5 pages, 7 figures

arXiv:2111.15126 [pdf]

Tunable and Sensitive Detection of Cortisol using Anisotropic Phosphorene with a Surface Plasmon Resonance Technique: Numerical Investigation

Authors: Vipin Kumar Verma, Sarika Pal, Conrad Rizal Yogendra Kumar Prajapati

Abstract: Tunable and ultrasensitive surface plasmon resonance (SPR) sensors are highly desirable for monitoring stress hormones such as cortisol, a steroid hormone formed in the adrenal glands in the human body. This paper describes the detection of cortisol using a bimetallic SPR sensor based on highly anisotropic two-dimensional material, i.e., phosphorene. Thicknesses of bi-metal layers, such as copper… ▽ More Tunable and ultrasensitive surface plasmon resonance (SPR) sensors are highly desirable for monitoring stress hormones such as cortisol, a steroid hormone formed in the adrenal glands in the human body. This paper describes the detection of cortisol using a bimetallic SPR sensor based on highly anisotropic two-dimensional material, i.e., phosphorene. Thicknesses of bi-metal layers, such as copper (Cu) and nickel (Ni), is optimized to achieve strong SPR excitation. The proposed sensor is rotated in-plane with a rotation angle around the z-axis to obtain phosphorene anisotropic behavior. The performance parameters of the sensor are demonstrated in terms of higher sensitivity (347.78 degree/RIU), maximum angular figure of merit (1780.3), and finer limit of detection of 0.026 ng/ml. Furthermore, a significant penetration depth (203 nm) is achieved for the proposed sensor. The obtained results of the above parameters indicate that the proposed sensor outperforms the previously reported papers in the literature on cortisol detection using the SPR technique. △ Less

Submitted 29 November, 2021; originally announced November 2021.

Comments: Under Review

arXiv:2111.10970 [pdf, other]

doi 10.1109/AERO53065.2022.9843352

Operations for Autonomous Spacecraft

Authors: Rebecca Castano, Tiago Vaquero, Federico Rossi, Vandi Verma, Ellen Van Wyk, Dan Allard, Bennett Huffmann, Erin M. Murphy, Nihal Dhamani, Robert A. Hewitt, Scott Davidoff, Rashied Amini, Anthony Barrett, Julie Castillo-Rogez, Steve A. Chien, Mathieu Choukroun, Alain Dadaian, Raymond Francis, Benjamin Gorr, Mark Hofstadter, Mitch Ingham, Cristina Sorice, Iain Tierney

Abstract: Onboard autonomy technologies such as planning and scheduling, identification of scientific targets, and content-based data summarization, will lead to exciting new space science missions. However, the challenge of operating missions with such onboard autonomous capabilities has not been studied to a level of detail sufficient for consideration in mission concepts. These autonomy capabilities will… ▽ More Onboard autonomy technologies such as planning and scheduling, identification of scientific targets, and content-based data summarization, will lead to exciting new space science missions. However, the challenge of operating missions with such onboard autonomous capabilities has not been studied to a level of detail sufficient for consideration in mission concepts. These autonomy capabilities will require changes to current operations processes, practices, and tools. We have developed a case study to assess the changes needed to enable operators and scientists to operate an autonomous spacecraft by facilitating a common model between the ground personnel and the onboard algorithms. We assess the new operations tools and workflows necessary to enable operators and scientists to convey their desired intent to the spacecraft, and to be able to reconstruct and explain the decisions made onboard and the state of the spacecraft. Mock-ups of these tools were used in a user study to understand the effectiveness of the processes and tools in enabling a shared framework of understanding, and in the ability of the operators and scientists to effectively achieve mission science objectives. △ Less

Submitted 21 November, 2021; originally announced November 2021.

Comments: 16 pages, 18 Figures, 1 Table, to be published in IEEE Aerospace 2022 (AeroConf 2022)

Journal ref: Proceedings of the 2022 IEEE Aerospace Conference (IEEE AERO 2022), 1-20

arXiv:2110.10741 [pdf, other]

Class Incremental Online Streaming Learning

Authors: Soumya Banerjee, Vinay Kumar Verma, Toufiq Parag, Maneesh Singh, Vinay P. Namboodiri

Abstract: A wide variety of methods have been developed to enable lifelong learning in conventional deep neural networks. However, to succeed, these methods require a `batch' of samples to be available and visited multiple times during training. While this works well in a static setting, these methods continue to suffer in a more realistic situation where data arrives in \emph{online streaming manner}. We e… ▽ More A wide variety of methods have been developed to enable lifelong learning in conventional deep neural networks. However, to succeed, these methods require a `batch' of samples to be available and visited multiple times during training. While this works well in a static setting, these methods continue to suffer in a more realistic situation where data arrives in \emph{online streaming manner}. We empirically demonstrate that the performance of current approaches degrades if the input is obtained as a stream of data with the following restrictions: $(i)$ each instance comes one at a time and can be seen only once, and $(ii)$ the input data violates the i.i.d assumption, i.e., there can be a class-based correlation. We propose a novel approach (CIOSL) for the class-incremental learning in an \emph{online streaming setting} to address these challenges. The proposed approach leverages implicit and explicit dual weight regularization and experience replay. The implicit regularization is leveraged via the knowledge distillation, while the explicit regularization incorporates a novel approach for parameter regularization by learning the joint distribution of the buffer replay and the current sample. Also, we propose an efficient online memory replay and replacement buffer strategy that significantly boosts the model's performance. Extensive experiments and ablation on challenging datasets show the efficacy of the proposed method. △ Less

Submitted 20 October, 2021; originally announced October 2021.

arXiv:2110.01856 [pdf, other]

Hypernetworks for Continual Semi-Supervised Learning

Authors: Dhanajit Brahma, Vinay Kumar Verma, Piyush Rai

Abstract: Learning from data sequentially arriving, possibly in a non i.i.d. way, with changing task distribution over time is called continual learning. Much of the work thus far in continual learning focuses on supervised learning and some recent works on unsupervised learning. In many domains, each task contains a mix of labelled (typically very few) and unlabelled (typically plenty) training examples, w… ▽ More Learning from data sequentially arriving, possibly in a non i.i.d. way, with changing task distribution over time is called continual learning. Much of the work thus far in continual learning focuses on supervised learning and some recent works on unsupervised learning. In many domains, each task contains a mix of labelled (typically very few) and unlabelled (typically plenty) training examples, which necessitates a semi-supervised learning approach. To address this in a continual learning setting, we propose a framework for semi-supervised continual learning called Meta-Consolidation for Continual Semi-Supervised Learning (MCSSL). Our framework has a hypernetwork that learns the meta-distribution that generates the weights of a semi-supervised auxiliary classifier generative adversarial network $(\textit{Semi-ACGAN})$ as the base network. We consolidate the knowledge of sequential tasks in the hypernetwork, and the base network learns the semi-supervised learning task. Further, we present $\textit{Semi-Split CIFAR-10}$, a new benchmark for continual semi-supervised learning, obtained by modifying the $\textit{Split CIFAR-10}$ dataset, in which the tasks with labelled and unlabelled data arrive sequentially. Our proposed model yields significant improvements in the continual semi-supervised learning setting. We compare the performance of several existing continual learning approaches on the proposed continual semi-supervised learning benchmark of the Semi-Split CIFAR-10 dataset. △ Less

Submitted 5 October, 2021; originally announced October 2021.

Comments: Accepted to CSSL workshop at IJCAI 2021 (Best Student Paper Award)

arXiv:2107.12169 [pdf]

doi 10.1142/S0217732322500201

Two-way quantum communication using four-qubit cluster state: mutual exchange of quantum information

Authors: Vikram Verma, Mitali Sisodia

Abstract: In the present study, we have proposed a scheme for two-way quantum communication in which the two legitimate participants mutually exchange their quantum information to each other by using a four-qubit cluster state as the quantum channel. Recently, by utilizing a four-qubit cluster state as the quantum channel, Kazemikhah et al. [Int. J. Theor. Phys., 60 (2021) 378] tried to design a scheme for… ▽ More In the present study, we have proposed a scheme for two-way quantum communication in which the two legitimate participants mutually exchange their quantum information to each other by using a four-qubit cluster state as the quantum channel. Recently, by utilizing a four-qubit cluster state as the quantum channel, Kazemikhah et al. [Int. J. Theor. Phys., 60 (2021) 378] tried to design a scheme for the mutual exchange of quantum information between two legitimate participants. However, in the present study, it has been shown that in their scheme the transmission of quantum information cannot be realized because the two participants are not entangled to each other due to a trivial conceptual mistake made by Kazemikhah et al. in the description of the quantum channel. Here, we have shown that two legitimate participants can teleport quantum information states to each other by using a four-qubit cluster state as the quantum channel, provided they co-operate with each other and perform non-local controlled phase gate operation. If both participants do not co-operate with each other, then no one can reconstruct the information sent to them, and therefore the exchange of information is possible only when both participants are honest to each other. △ Less

Submitted 28 July, 2021; v1 submitted 22 July, 2021; originally announced July 2021.

Comments: 8 pages, 2 tables

arXiv:2106.06795 [pdf, other]

Knowledge Consolidation based Class Incremental Online Learning with Limited Data

Authors: Mohammed Asad Karim, Vinay Kumar Verma, Pravendra Singh, Vinay Namboodiri, Piyush Rai

Abstract: We propose a novel approach for class incremental online learning in a limited data setting. This problem setting is challenging because of the following constraints: (1) Classes are given incrementally, which necessitates a class incremental learning approach; (2) Data for each class is given in an online fashion, i.e., each training example is seen only once during training; (3) Each class has v… ▽ More We propose a novel approach for class incremental online learning in a limited data setting. This problem setting is challenging because of the following constraints: (1) Classes are given incrementally, which necessitates a class incremental learning approach; (2) Data for each class is given in an online fashion, i.e., each training example is seen only once during training; (3) Each class has very few training examples; and (4) We do not use or assume access to any replay/memory to store data from previous classes. Therefore, in this setting, we have to handle twofold problems of catastrophic forgetting and overfitting. In our approach, we learn robust representations that are generalizable across tasks without suffering from the problems of catastrophic forgetting and overfitting to accommodate future classes with limited samples. Our proposed method leverages the meta-learning framework with knowledge consolidation. The meta-learning framework helps the model for rapid learning when samples appear in an online fashion. Simultaneously, knowledge consolidation helps to learn a robust representation against forgetting under online updates to facilitate future learning. Our approach significantly outperforms other methods on several benchmarks. △ Less

Submitted 12 June, 2021; originally announced June 2021.

Comments: International Joint Conference on Artificial Intelligence (IJCAI-2021)

arXiv:2105.06133 [pdf]

doi 10.1038/s42005-021-00602-7

Size dependent nature of the magnetic-field driven superconductor-to-insulator quantum-phase transitions

Authors: Xiaofu Zhang, Adriana E. Lita, Huanlong Liu, Varun B. Verma, Qiang Zhou, Sae Woo Nam, Andreas Schilling

Abstract: The nature of the magnetic-field driven superconductor-to-insulator quantum-phase transition in two-dimensional systems at zero temperature has been under debate since the 1980s, and became even more controversial after the observation of a quantum-Griffiths singularity. Whether it is induced by quantum fluctuations of the superconducting phase and the localization of Cooper pairs, or is directly… ▽ More The nature of the magnetic-field driven superconductor-to-insulator quantum-phase transition in two-dimensional systems at zero temperature has been under debate since the 1980s, and became even more controversial after the observation of a quantum-Griffiths singularity. Whether it is induced by quantum fluctuations of the superconducting phase and the localization of Cooper pairs, or is directly driven by depairing of these pairs, remains an open question. We herein experimentally demonstrate that in weakly-pinning systems and in the limit of infinitely wide films, a sequential superconductor-to-Bose insulator-to-Fermi insulator quantum-phase transition takes place. By limiting their size smaller than the effective penetration depth, however, the vortex interaction alters, and the superconducting state re-enters the Bose-insulating state. As a consequence, one observes a direct superconductor-to-Fermi insulator in the zero-temperature limit. In narrow films, the associated critical-exponent products diverge along the corresponding phase boundaries with increasing magnetic field, which is a hallmark of the quantum-Griffiths singularity. △ Less

Submitted 13 May, 2021; originally announced May 2021.

Comments: 6 main figures, 11 supplementary Figures, 2 supplementary Movies. This is a preprint of an article published in Communications Physics. The final authenticated open-access version with updated figures and text is available for free from May 14, 2021 at https://doi.org/10.1038/s42005-021-00602-7

arXiv:2104.12500 [pdf, other]

doi 10.1088/2515-7647/ac105b

Integrated superconducting nanowire single-photon detectors on titanium in-diffused lithium niobate waveguides

Authors: Jan Philipp Höpker, Varun B. Verma, Maximilian Protte, Raimund Ricken, Viktor Quiring, Christof Eigner, Lena Ebers, Manfred Hammer, Jens Foerstner, Christine Silberhorn, Richard P. Mirin, Sae Woo Nam, Tim J. Bartley

Abstract: We demonstrate the integration of amorphous tungsten silicide superconducting nanowire single-photon detectors on titanium in-diffused lithium niobate waveguides. We show proof-of-principle detection of evanescently-coupled photons of 1550nm wavelength using bidirectional waveguide coupling for two orthogonal polarization directions. We investigate the internal detection efficiency as well as dete… ▽ More We demonstrate the integration of amorphous tungsten silicide superconducting nanowire single-photon detectors on titanium in-diffused lithium niobate waveguides. We show proof-of-principle detection of evanescently-coupled photons of 1550nm wavelength using bidirectional waveguide coupling for two orthogonal polarization directions. We investigate the internal detection efficiency as well as detector absorption using coupling-independent characterization measurements. Furthermore, we describe strategies to improve the yield and efficiency of these devices. △ Less

Submitted 26 April, 2021; originally announced April 2021.

Comments: 7 pages, 6 figures

Journal ref: J. Phys. Photonics 3 034022 (2021)

arXiv:2104.04765 [pdf, other]

Q-matrix Unaware Double JPEG Detection using DCT-Domain Deep BiLSTM Network

Authors: Vinay Verma, Deepak Singh, Nitin Khanna

Abstract: The double JPEG compression detection has received much attention in recent years due to its applicability as a forensic tool for the most widely used JPEG file format. Existing state-of-the-art CNN-based methods either use histograms of all the frequencies or rely on heuristics to select histograms of specific low frequencies to classify single and double compressed images. However, even amidst l… ▽ More The double JPEG compression detection has received much attention in recent years due to its applicability as a forensic tool for the most widely used JPEG file format. Existing state-of-the-art CNN-based methods either use histograms of all the frequencies or rely on heuristics to select histograms of specific low frequencies to classify single and double compressed images. However, even amidst lower frequencies of double compressed images/patches, histograms of all the frequencies do not have distinguishable features to separate them from single compressed images. This paper directly extracts the quantized DCT coefficients from the JPEG images without decompressing them in the pixel domain, obtains all AC frequencies' histograms, uses a module based on $1\times 1$ depth-wise convolutions to learn the inherent relation between each histogram and corresponding q-factor, and utilizes a tailor-made BiLSTM network for selectively encoding these feature vector sequences. The proposed system outperforms several baseline methods on a relatively large and diverse publicly available dataset of single and double compressed patches. Another essential aspect of any single vs. double JPEG compression detection system is handling the scenario where test patches are compressed with entirely different quantization matrices (Q-matrices) than those used while training; different camera manufacturers and image processing software generally utilize their customized quantization matrices. A set of extensive experiments shows that the proposed system trained on a single dataset generalizes well on other datasets compressed with completely unseen quantization matrices and outperforms the state-of-the-art methods in both seen and unseen quantization matrices scenarios. △ Less

Submitted 10 April, 2021; originally announced April 2021.

arXiv:2103.13558 [pdf, other]

Efficient Feature Transformations for Discriminative and Generative Continual Learning

Authors: Vinay Kumar Verma, Kevin J Liang, Nikhil Mehta, Piyush Rai, Lawrence Carin

Abstract: As neural networks are increasingly being applied to real-world applications, mechanisms to address distributional shift and sequential task learning without forgetting are critical. Methods incorporating network expansion have shown promise by naturally adding model capacity for learning new tasks while simultaneously avoiding catastrophic forgetting. However, the growth in the number of addition… ▽ More As neural networks are increasingly being applied to real-world applications, mechanisms to address distributional shift and sequential task learning without forgetting are critical. Methods incorporating network expansion have shown promise by naturally adding model capacity for learning new tasks while simultaneously avoiding catastrophic forgetting. However, the growth in the number of additional parameters of many of these types of methods can be computationally expensive at larger scales, at times prohibitively so. Instead, we propose a simple task-specific feature map transformation strategy for continual learning, which we call Efficient Feature Transformations (EFTs). These EFTs provide powerful flexibility for learning new tasks, achieved with minimal parameters added to the base architecture. We further propose a feature distance maximization strategy, which significantly improves task prediction in class incremental settings, without needing expensive generative models. We demonstrate the efficacy and efficiency of our method with an extensive set of experiments in discriminative (CIFAR-100 and ImageNet-1K) and generative (LSUN, CUB-200, Cats) sequences of tasks. Even with low single-digit parameter growth rates, EFTs can outperform many other continual learning methods in a wide range of settings. △ Less

Submitted 24 March, 2021; originally announced March 2021.

Comments: Accepted in CVPR 2021

arXiv:2103.04032 [pdf, other]

CAM-GAN: Continual Adaptation Modules for Generative Adversarial Networks

Authors: Sakshi Varshney, Vinay Kumar Verma, Srijith P K, Lawrence Carin, Piyush Rai

Abstract: We present a continual learning approach for generative adversarial networks (GANs), by designing and leveraging parameter-efficient feature map transformations. Our approach is based on learning a set of global and task-specific parameters. The global parameters are fixed across tasks whereas the task-specific parameters act as local adapters for each task, and help in efficiently obtaining task-… ▽ More We present a continual learning approach for generative adversarial networks (GANs), by designing and leveraging parameter-efficient feature map transformations. Our approach is based on learning a set of global and task-specific parameters. The global parameters are fixed across tasks whereas the task-specific parameters act as local adapters for each task, and help in efficiently obtaining task-specific feature maps. Moreover, we propose an element-wise addition of residual bias in the transformed feature space, which further helps stabilize GAN training in such settings. Our approach also leverages task similarity information based on the Fisher information matrix. Leveraging this knowledge from previous tasks significantly improves the model performance. In addition, the similarity measure also helps reduce the parameter growth in continual adaptation and helps to learn a compact model. In contrast to the recent approaches for continually-learned GANs, the proposed approach provides a memory-efficient way to perform effective continual data generation. Through extensive experiments on challenging and diverse datasets, we show that the feature-map-transformation approach outperforms state-of-the-art methods for continually-learned GANs, with substantially fewer parameters. The proposed method generates high-quality samples that can also improve the generative-replay-based continual learning for discriminative tasks. △ Less

Submitted 30 July, 2021; v1 submitted 6 March, 2021; originally announced March 2021.

Comments: Under Submission

arXiv:2102.11856 [pdf, other]

Meta-Learned Attribute Self-Gating for Continual Generalized Zero-Shot Learning

Authors: Vinay Kumar Verma, Kevin Liang, Nikhil Mehta, Lawrence Carin

Abstract: Zero-shot learning (ZSL) has been shown to be a promising approach to generalizing a model to categories unseen during training by leveraging class attributes, but challenges still remain. Recently, methods using generative models to combat bias towards classes seen during training have pushed the state of the art of ZSL, but these generative models can be slow or computationally expensive to trai… ▽ More Zero-shot learning (ZSL) has been shown to be a promising approach to generalizing a model to categories unseen during training by leveraging class attributes, but challenges still remain. Recently, methods using generative models to combat bias towards classes seen during training have pushed the state of the art of ZSL, but these generative models can be slow or computationally expensive to train. Additionally, while many previous ZSL methods assume a one-time adaptation to unseen classes, in reality, the world is always changing, necessitating a constant adjustment for deployed models. Models unprepared to handle a sequential stream of data are likely to experience catastrophic forgetting. We propose a meta-continual zero-shot learning (MCZSL) approach to address both these issues. In particular, by pairing self-gating of attributes and scaled class normalization with meta-learning based training, we are able to outperform state-of-the-art results while being able to train our models substantially faster ($>100\times$) than expensive generative-based approaches. We demonstrate this by performing experiments on five standard ZSL datasets (CUB, aPY, AWA1, AWA2 and SUN) in both generalized zero-shot learning and generalized continual zero-shot learning settings. △ Less

Submitted 23 February, 2021; originally announced February 2021.

Comments: Under Review

arXiv:2012.15281 [pdf, other]

Automated Crater Detection from Co-registered Optical Images, Elevation Maps and Slope Maps using Deep Learning

Authors: Atal Tewari, Vinay Verma, Pradeep Srivastava, Vikrant Jain, Nitin Khanna

Abstract: Impact craters are formed as a result of continuous impacts on the surface of planetary bodies. This paper proposes a novel way of simultaneously utilizing optical images, digital elevation maps (DEMs), and slope maps for automatic crater detection on the lunar surface. Mask R-CNN, tuned for the crater detection task, is utilized in this paper. Two catalogs, namely, Head-LROC and Robbins, are used… ▽ More Impact craters are formed as a result of continuous impacts on the surface of planetary bodies. This paper proposes a novel way of simultaneously utilizing optical images, digital elevation maps (DEMs), and slope maps for automatic crater detection on the lunar surface. Mask R-CNN, tuned for the crater detection task, is utilized in this paper. Two catalogs, namely, Head-LROC and Robbins, are used for the performance evaluation. Exhaustive analysis of the detection results on the lunar surface has been performed with respect to both Head-LROC and Robbins catalog. With the Head-LROC catalog, which has relatively strict crater markings and larger possibility of missing craters, recall value of 94.28\% has been obtained as compared to 88.03\% for the baseline method. However, with respect to a manually marked exhaustive crater catalog based on relatively liberal marking, significant precision and recall values are obtained for different crater size ranges. The generalization capability of the proposed method in terms of crater detection on a different terrain with different input data type is also evaluated. We show that the proposed model trained on the lunar surface with optical images, DEMs and corresponding slope maps can be used to detect craters on the Martian surface even with entirely different input data type, such as thermal IR images from the Martian surface. △ Less

Submitted 30 December, 2020; originally announced December 2020.

arXiv:2012.09979 [pdf, other]

doi 10.1063/5.0048049

Single-photon detection in the mid-infrared up to 10 micron wavelength using tungsten silicide superconducting nanowire detectors

Authors: V. B. Verma, B. Korzh, A. B. Walter, A. E. Lita, R. M. Briggs, M. Colangelo, Y. Zhai, E. E. Wollman, A. D. Beyer, J. P. Allmaras, B. Bumble, H. Vora, D. Zhu, E. Schmidt, K. K. Berggren, R. P. Mirin, S. W. Nam, M. D. Shaw

Abstract: We developed superconducting nanowire single-photon detectors (SNSPDs) based on tungsten silicide (WSi) that show saturated internal detection efficiency up to a wavelength of 10 um. These detectors are promising for applications in the mid-infrared requiring ultra-high gain stability, low dark counts, and high efficiency such as chemical sensing, LIDAR, dark matter searches and exoplanet spectros… ▽ More We developed superconducting nanowire single-photon detectors (SNSPDs) based on tungsten silicide (WSi) that show saturated internal detection efficiency up to a wavelength of 10 um. These detectors are promising for applications in the mid-infrared requiring ultra-high gain stability, low dark counts, and high efficiency such as chemical sensing, LIDAR, dark matter searches and exoplanet spectroscopy. △ Less

Submitted 17 December, 2020; originally announced December 2020.

arXiv:2011.07279 [pdf, other]

Towards Zero-Shot Learning with Fewer Seen Class Examples

Authors: Vinay Kumar Verma, Ashish Mishra, Anubha Pandey, Hema A. Murthy, Piyush Rai

Abstract: We present a meta-learning based generative model for zero-shot learning (ZSL) towards a challenging setting when the number of training examples from each \emph{seen} class is very few. This setup contrasts with the conventional ZSL approaches, where training typically assumes the availability of a sufficiently large number of training examples from each of the seen classes. The proposed approach… ▽ More We present a meta-learning based generative model for zero-shot learning (ZSL) towards a challenging setting when the number of training examples from each \emph{seen} class is very few. This setup contrasts with the conventional ZSL approaches, where training typically assumes the availability of a sufficiently large number of training examples from each of the seen classes. The proposed approach leverages meta-learning to train a deep generative model that integrates variational autoencoder and generative adversarial networks. We propose a novel task distribution where meta-train and meta-validation classes are disjoint to simulate the ZSL behaviour in training. Once trained, the model can generate synthetic examples from seen and unseen classes. Synthesize samples can then be used to train the ZSL framework in a supervised manner. The meta-learner enables our model to generates high-fidelity samples using only a small number of training examples from seen classes. We conduct extensive experiments and ablation studies on four benchmark datasets of ZSL and observe that the proposed model outperforms state-of-the-art approaches by a significant margin when the number of examples per seen class is very small. △ Less

Submitted 14 November, 2020; originally announced November 2020.

Comments: Accepted in WACV 2021

arXiv:2011.04419 [pdf, other]

Towards Domain-Agnostic Contrastive Learning

Authors: Vikas Verma, Minh-Thang Luong, Kenji Kawaguchi, Hieu Pham, Quoc V. Le

Abstract: Despite recent success, most contrastive self-supervised learning methods are domain-specific, relying heavily on data augmentation techniques that require knowledge about a particular domain, such as image cropping and rotation. To overcome such limitation, we propose a novel domain-agnostic approach to contrastive learning, named DACL, that is applicable to domains where invariances, and thus, d… ▽ More Despite recent success, most contrastive self-supervised learning methods are domain-specific, relying heavily on data augmentation techniques that require knowledge about a particular domain, such as image cropping and rotation. To overcome such limitation, we propose a novel domain-agnostic approach to contrastive learning, named DACL, that is applicable to domains where invariances, and thus, data augmentation techniques, are not readily available. Key to our approach is the use of Mixup noise to create similar and dissimilar examples by mixing data samples differently either at the input or hidden-state levels. To demonstrate the effectiveness of DACL, we conduct experiments across various domains such as tabular data, images, and graphs. Our results show that DACL not only outperforms other domain-agnostic noising methods, such as Gaussian-noise, but also combines well with domain-specific methods, such as SimCLR, to improve self-supervised visual representation learning. Finally, we theoretically analyze our method and show advantages over the Gaussian-noise based contrastive learning approach. △ Less

Submitted 19 July, 2021; v1 submitted 9 November, 2020; originally announced November 2020.

Comments: Published in ICML 2021

arXiv:2011.02095 [pdf, other]

doi 10.1016/j.ocemod.2021.101844

Lagrangian study of dispersion and transport by submesoscale currents at an upper-ocean front

Authors: Vicky Verma, Sutanu Sarkar

Abstract: The three-dimensional transport pathways, the time scales of vertical transport, and the dispersion characteristics of submesoscale currents at an upper-ocean front are investigated using material points (tracer particles) that advect with the local fluid velocity. Coherent submesoscale vortex filaments and eddies which dominate submesoscale (0.1 - 10 km) dynamics are found to play a crucial role… ▽ More The three-dimensional transport pathways, the time scales of vertical transport, and the dispersion characteristics of submesoscale currents at an upper-ocean front are investigated using material points (tracer particles) that advect with the local fluid velocity. Coherent submesoscale vortex filaments and eddies which dominate submesoscale (0.1 - 10 km) dynamics are found to play a crucial role which is quantified here. These coherent structures are generated and sustained through nonlinear evolution of baroclinic instability. The collective motion of particles helps identify common features of transport at the front. It is found that the particles in the central region organize into inclined lobes, each associated with an eddy, and the filaments associated with the heavy- and light-edges of the front transfer edge particles to the lobes. This flux of new particles into the lobe causes local particles to adjust, which leads to slumping of the front. The particle motion in the vertical shows multiple time scales -- a fast time scale with O(10) m vertical displacement within an hour and a slower near-inertial time scale, comparable to the intrinsic time scale of the growing instability. The fast time scale motions typically occur in the vortex filaments. The overall slumping process is slower than what one might anticipate from the large magnitude of vertical velocity in the filaments and requires a sustained correlation over time between the lateral and the vertical motion. By tracking clouds of particles, we show that their centers of mass downwell/upwell over 1-2 inertial time periods, after which an adjustment follows with a sub-inertial time scale. The dispersion characteristics of the submesoscale turbulent currents using single- and pair-particle statistics have been investigated. The shape change in clusters of four particles reveals deformation into thin, needle-like structures. △ Less

Submitted 3 November, 2020; originally announced November 2020.

arXiv:2009.03918 [pdf, other]

doi 10.1038/s41534-022-00531-5

Quantum steering with vector vortex photon states with the detection loophole closed

Authors: Sergei Slussarenko, Dominick J. Joch, Nora Tischler, Farzad Ghafari, Lynden K. Shalm, Varun B. Verma, Sae Woo Nam, Geoff J. Pryde

Abstract: Violating a nonlocality inequality enables the most powerful remote quantum information tasks and fundamental tests of quantum physics. Loophole-free photonic verification of nonlocality has been achieved with polarization-entangled photon pairs, but not with states entangled in other degrees of freedom. Here we demonstrate completion of the quantum steering nonlocality task, with the detection lo… ▽ More Violating a nonlocality inequality enables the most powerful remote quantum information tasks and fundamental tests of quantum physics. Loophole-free photonic verification of nonlocality has been achieved with polarization-entangled photon pairs, but not with states entangled in other degrees of freedom. Here we demonstrate completion of the quantum steering nonlocality task, with the detection loophole closed, when entanglement is distributed by transmitting a photon in an optical vector vortex state, formed by optical orbital angular momentum (OAM) and polarization. As well as opening up a high-efficiency encoding beyond polarization, the critically-important demonstration of vector vortex steering opens the door to new free-space and satellite-based secure quantum communication devices and device-independent protocols. △ Less

Submitted 11 March, 2022; v1 submitted 8 September, 2020; originally announced September 2020.

Comments: 7 pages, 3 figures

Journal ref: NPJ Quantum Inf. 8, 20 (2022)

arXiv:2008.00065 [pdf, other]

doi 10.1103/PhysRevLett.126.010501

State Readout of a Trapped Ion Qubit Using a Trap-Integrated Superconducting Photon Detector

Authors: S. L. Todaro, V. B. Verma, K. C. McCormick, D. T. C. Allcock, R. P. Mirin, D. J. Wineland, S. W. Nam, A. C. Wilson, D. Leibfried, D. H. Slichter

Abstract: We report high-fidelity state readout of a trapped ion qubit using a trap-integrated photon detector. We determine the hyperfine qubit state of a single $^9$Be$^+$ ion held in a surface-electrode rf ion trap by counting state-dependent ion fluorescence photons with a superconducting nanowire single-photon detector (SNSPD) fabricated into the trap structure. The average readout fidelity is 0.9991(1… ▽ More We report high-fidelity state readout of a trapped ion qubit using a trap-integrated photon detector. We determine the hyperfine qubit state of a single $^9$Be$^+$ ion held in a surface-electrode rf ion trap by counting state-dependent ion fluorescence photons with a superconducting nanowire single-photon detector (SNSPD) fabricated into the trap structure. The average readout fidelity is 0.9991(1), with a mean readout duration of 46 $μ$s, and is limited by the polarization impurity of the readout laser beam and by off-resonant optical pumping. Because there are no intervening optical elements between the ion and the detector, we can use the ion fluorescence as a self-calibrated photon source to determine the detector quantum efficiency and its dependence on photon incidence angle and polarization. △ Less

Submitted 31 July, 2020; originally announced August 2020.

Comments: 15 pages, 11 figures, including supplemental material

Journal ref: Phys. Rev. Lett. 126, 010501 (2021)

arXiv:2007.12212 [pdf, other]

ZSCRGAN: A GAN-based Expectation Maximization Model for Zero-Shot Retrieval of Images from Textual Descriptions

Authors: Anurag Roy, Vinay Kumar Verma, Kripabandhu Ghosh, Saptarshi Ghosh

Abstract: Most existing algorithms for cross-modal Information Retrieval are based on a supervised train-test setup, where a model learns to align the mode of the query (e.g., text) to the mode of the documents (e.g., images) from a given training set. Such a setup assumes that the training set contains an exhaustive representation of all possible classes of queries. In reality, a retrieval model may need t… ▽ More Most existing algorithms for cross-modal Information Retrieval are based on a supervised train-test setup, where a model learns to align the mode of the query (e.g., text) to the mode of the documents (e.g., images) from a given training set. Such a setup assumes that the training set contains an exhaustive representation of all possible classes of queries. In reality, a retrieval model may need to be deployed on previously unseen classes, which implies a zero-shot IR setup. In this paper, we propose a novel GAN-based model for zero-shot text to image retrieval. When given a textual description as the query, our model can retrieve relevant images in a zero-shot setup. The proposed model is trained using an Expectation-Maximization framework. Experiments on multiple benchmark datasets show that our proposed model comfortably outperforms several state-of-the-art zero-shot text to image retrieval models, as well as zero-shot classification and hashing models suitably used for retrieval. △ Less

Submitted 23 September, 2020; v1 submitted 23 July, 2020; originally announced July 2020.

Comments: Accepted in CIKM-2020

Showing 1–50 of 168 results for author: Verma, V