Showing 1–50 of 54 results for author: Kumar, M

Searching in archive eess. Search in all archives.
  1. arXiv:2401.13784  [pdf, other]

    eess.SY

    On the Predictive Capability of Dynamic Mode Decomposition for Nonlinear Periodic Systems with Focus on Orbital Mechanics

    Authors: Sriram Narayanan, Mohamed Naveed Gul Mohamed, Indranil Nayak, Suman Chakravorty, Mrinal Kumar

    Abstract: This paper discusses the predictive capability of Dynamic Mode Decomposition (DMD) in the context of orbital mechanics. The focus is specifically on the Hankel variant of DMD which uses a stacked set of time-delayed observations for system identification and subsequent prediction. A theory on the minimum number of time delays required for accurate reconstruction of periodic trajectories of nonline…

    Submitted 24 January, 2024; originally announced January 2024.

  2. arXiv:2310.03884  [pdf, other]

    cs.IT cs.LG eess.SP math.DG stat.ML

    Information Geometry for the Working Information Theorist

    Authors: Kumar Vijay Mishra, M. Ashok Kumar, Ting-Kam Leonard Wong

    Abstract: Information geometry is a study of statistical manifolds, that is, spaces of probability distributions from a geometric perspective. Its classical information-theoretic applications relate to statistical concepts such as Fisher information, sufficient statistics, and efficient estimators. Today, information geometry has emerged as an interdisciplinary field that finds applications in diverse areas…

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: 12 pages, 3 figures, 1 table

  3. arXiv:2306.06127  [pdf, ps, other]

    eess.SP math-ph math.FA quant-ph

    A framework of windowed octonion linear canonical transform

    Authors: Manish Kumar, Bhawna

    Abstract: The uncertainty principle is a fundamental principle in theoretical physics, such as quantum mechanics and classical mechanics. It plays a prime role in signal processing, including optics, where a signal is to be analyzed simultaneously in both domains; for instance, in harmonic analysis, both time and frequency domains, and in quantum mechanics, both time and momentum. On the other hand, many ma…

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: 1 figure

    MSC Class: 46F12; 53D22

  4. arXiv:2303.13656  [pdf, other]

    eess.SP

    Non-Linear Signal Processing methods for UAV detections from a Multi-function X-band Radar

    Authors: Mohit Kumar, Keith Kelly

    Abstract: This article develops the applicability of non-linear processing techniques such as Compressed Sensing (CS), Principal Component Analysis (PCA), Iterative Adaptive Approach (IAA) and Multiple-input-multiple-output (MIMO) for the purpose of enhanced UAV detections using portable radar systems. The combined scheme has many advantages and the potential for better detection and classification accuracy…

    Submitted 12 March, 2023; originally announced March 2023.

  5. arXiv:2302.13191  [pdf]

    cs.RO cs.AI cs.LG cs.NE eess.SY

    DeepCPG Policies for Robot Locomotion

    Authors: Aditya M. Deshpande, Eric Hurd, Ali A. Minai, Manish Kumar

    Abstract: Central Pattern Generators (CPGs) form the neural basis of the observed rhythmic behaviors for locomotion in legged animals. The CPG dynamics organized into networks allow the emergence of complex locomotor behaviors. In this work, we take this inspiration for developing walking behaviors in multi-legged robots. We present novel DeepCPG policies that embed CPGs as a layer in a larger neural networ…

    Submitted 25 February, 2023; originally announced February 2023.

    Comments: Preprint of paper accepted for publication in IEEE Transaction On Cognitive and Developmental Systems

  6. arXiv:2302.01606  [pdf, ps, other]

    eess.SY

    Design of generalized fuzzy multiple deferred state (GFMDS) sampling plan for attributes

    Authors: Julia Thampy Thomas, Mahesh Kumar

    Abstract: A sampling plan is a pilot tool for a supply and demand chain quality check strategy. These plans proved to be economically viable for the quality inspection processes but the uncertainty in the plan parameters challenged the reliability of the application of traditional acceptance sampling plans. This study proposes a generalized fuzzy multiple deferred state (GFMDS) sampling plan for attribute…

    Submitted 3 February, 2023; originally announced February 2023.

  7. arXiv:2212.11410  [pdf, other]

    cs.RO eess.SY

    Modelling Controllers for Cyber Physical Systems Using Neural Networks

    Authors: Aravindakumar Vijayasri Mohan Kumar

    Abstract: Model Predictive Controllers (MPC) are widely used for controlling cyber-physical systems. It is an iterative process of optimizing the prediction of the future states of a robot over a fixed time horizon. MPCs are effective in practice, but because they are computationally expensive and slow, they are not well suited for use in real-time applications. Overcoming the flaw can be accomplished by ap…

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: 6 pages, 8 figures, 1 table, project report

  8. arXiv:2210.07508  [pdf, other]

    cs.SD cs.LG eess.AS

    Hierarchical Diffusion Models for Singing Voice Neural Vocoder

    Authors: Naoya Takahashi, Mayank Kumar Singh, Yuki Mitsufuji

    Abstract: Recent progress in deep generative models has improved the quality of neural vocoders in the speech domain. However, generating a high-quality singing voice remains challenging due to a wider variety of musical expressions in pitch, loudness, and pronunciations. In this work, we propose a hierarchical diffusion model for singing voice neural vocoders. The proposed method consists of multiple diffusion…

    Submitted 17 October, 2022; v1 submitted 14 October, 2022; originally announced October 2022.

  9. arXiv:2206.09133  [pdf, other]

    cs.CR eess.SY

    Efficacy of Asynchronous GPS Spoofing Against High Volume Consumer GNSS Receivers

    Authors: M. Surendra Kumar, Gaurav S. Kasbekar, Arnab Maity

    Abstract: The vulnerability of the Global Positioning System (GPS) against spoofing has been known for quite some time. Also, the positioning and navigation of most semi-autonomous and autonomous drones are dependent on Global Navigation Satellite System (GNSS) signals. In prior work, simplistic or asynchronous GPS spoofing was found to be a simple, efficient, and effective cyber attack against L1 GPS or GNSS dep…

    Submitted 18 June, 2022; originally announced June 2022.

    Comments: 10 pages

  10. arXiv:2111.10047  [pdf, other]

    eess.AS cs.CL cs.SD

    Semi-supervised transfer learning for language expansion of end-to-end speech recognition models to low-resource languages

    Authors: Jiyeon Kim, Mehul Kumar, Dhananjaya Gowda, Abhinav Garg, Chanwoo Kim

    Abstract: In this paper, we propose a three-stage training methodology to improve the speech recognition accuracy of low-resource languages. We explore and propose an effective combination of techniques such as transfer learning, encoder freezing, data augmentation using Text-To-Speech (TTS), and Semi-Supervised Learning (SSL). To improve the accuracy of a low-resource Italian ASR, we leverage a well-traine…

    Submitted 19 November, 2021; originally announced November 2021.

    Comments: Accepted as a conference paper at ASRU 2021

  11. arXiv:2111.10043  [pdf, other]

    eess.AS cs.SD

    A comparison of streaming models and data augmentation methods for robust speech recognition

    Authors: Jiyeon Kim, Mehul Kumar, Dhananjaya Gowda, Abhinav Garg, Chanwoo Kim

    Abstract: In this paper, we present a comparative study on the robustness of two different online streaming speech recognition models: Monotonic Chunkwise Attention (MoChA) and Recurrent Neural Network-Transducer (RNN-T). We explore three recently proposed data augmentation techniques, namely, multi-conditioned training using an acoustic simulator, Vocal Tract Length Perturbation (VTLP) for speaker variabil…

    Submitted 18 November, 2021; originally announced November 2021.

    Comments: Accepted as a conference paper at ASRU 2021

  12. arXiv:2111.03915  [pdf, other]

    cs.RO cs.AI cs.LG eess.SY math.OC

    Robust Deep Reinforcement Learning for Quadcopter Control

    Authors: Aditya M. Deshpande, Ali A. Minai, Manish Kumar

    Abstract: Deep reinforcement learning (RL) has made it possible to solve complex robotics problems using neural networks as function approximators. However, the policies trained on stationary environments suffer in terms of generalization when transferred from one environment to another. In this work, we use Robust Markov Decision Processes (RMDP) to train the drone control policy, which combines ideas from…

    Submitted 6 November, 2021; originally announced November 2021.

    Comments: 6 pages; 3 Figures; Accepted in https://mecc2021.a2c2.org/

  13. arXiv:2108.08467  [pdf, other]

    eess.IV cs.CV cs.LG

    Medical Image Segmentation with 3D Convolutional Neural Networks: A Survey

    Authors: S Niyas, S J Pawan, M Anand Kumar, Jeny Rajan

    Abstract: Computer-aided medical image analysis plays a significant role in assisting medical practitioners for expert clinical diagnosis and deciding the optimal treatment plan. At present, convolutional neural networks (CNN) are the preferred choice for medical image analysis. In addition, with the rapid advancements in three-dimensional (3D) imaging systems and the availability of excellent hardware and…

    Submitted 28 April, 2022; v1 submitted 18 August, 2021; originally announced August 2021.

    Comments: 17 pages, 4 figures

    MSC Class: 68T07 (Primary) 68T45 (Secondary) ACM Class: I.2.10

  14. arXiv:2108.00640  [pdf, ps, other]

    cs.LG eess.SP

    Few-shot calibration of low-cost air pollution (PM2.5) sensors using meta-learning

    Authors: Kalpit Yadav, Vipul Arora, Sonu Kumar Jha, Mohit Kumar, Sachchida Nand Tripathi

    Abstract: Low-cost particulate matter sensors are transforming air quality monitoring because they have lower costs and greater mobility as compared to reference monitors. Calibration of these low-cost sensors requires training data from co-deployed reference monitors. Machine Learning based calibration gives better performance than conventional techniques, but requires a large amount of training data from…

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: 3+1 pages, submitted to IEEE sensors conference 2021

  15. arXiv:2107.06072  [pdf]

    eess.SY stat.AP

    Fragility curves for power transmission towers in Odisha, India, based on observed damage during 2019 Cyclone Fani

    Authors: Surender V Raj, Manish Kumar, Udit Bhatia

    Abstract: Lifeline infrastructure systems such as a power transmission network in coastal regions are vulnerable to strong winds generated during tropical cyclones. Understanding the fragility of individual towers is helpful in improving the resilience of such systems. Fragility curves have been developed in the past for some regions, but without considering relevant epistemic uncertainties. Further, risk a…

    Submitted 26 June, 2021; originally announced July 2021.

  16. arXiv:2106.01400  [pdf, other]

    eess.AS cs.LG cs.SD

    Dual Script E2E framework for Multilingual and Code-Switching ASR

    Authors: Mari Ganesh Kumar, Jom Kuriakose, Anand Thyagachandran, Arun Kumar A, Ashish Seth, Lodagala Durga Prasad, Saish Jaiswal, Anusha Prakash, Hema Murthy

    Abstract: India is home to multiple languages, and training automatic speech recognition (ASR) systems for languages is challenging. Over time, each language has adopted words from other languages, such as English, leading to code-mixing. Most Indian languages also have their own unique scripts, which poses a major limitation in training multilingual and code-switching ASR systems. Inspired by results in…

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: Accepted for publication at Interspeech 2021

  17. arXiv:2105.10727  [pdf, other]

    eess.SY

    Reduction in Circulating Current with Improved Secondary Side Modulation in Isolated Current-Fed Half Bridge AC-DC Converter

    Authors: Manish Kumar, Sumit Pramanick, B K Panigrahi

    Abstract: Current-fed half bridge converter with bidirectional switches on ac side and a full bridge converter on dc side of a high frequency transformer is an optimal topology for single stage galvanically isolated ac-dc converter for onboard vehicle charging application. AC side switches are actively commutated to achieve zero current switching (ZCS) using single phase shift modulation (SPSM) and disconti…

    Submitted 22 May, 2021; originally announced May 2021.

    Comments: This article has been submitted to IEEE Transactions on Power Electronics for review

  18. arXiv:2104.12859  [pdf, other]

    eess.SP

    A MIMO approach for Weather Radars

    Authors: Mohit Kumar, V Chandrasekar, P Keith Kelly

    Abstract: This article develops the multiple-input multiple-output (MIMO) technology for weather radar sensing. There are ample advantages of MIMO that have been highlighted that can improve the spatial resolution of the observations and also the accuracy of the radar variables. These concepts have been introduced here pertaining to weather radar observations with supporting simulations demonstrating improv…

    Submitted 2 April, 2021; originally announced April 2021.

  19. arXiv:2104.11267  [pdf, other]

    eess.SY

    Integrated Framework of Vehicle Dynamics, Instabilities, Energy Models, and Sparse Flow Smoothing Controllers

    Authors: Jonathan W. Lee, George Gunter, Rabie Ramadan, Sulaiman Almatrudi, Paige Arnold, John Aquino, William Barbour, Rahul Bhadani, Joy Carpio, Fang-Chieh Chou, Marsalis Gibson, Xiaoqian Gong, Amaury Hayat, Nour Khoudari, Abdul Rahman Kreidieh, Maya Kumar, Nathan Lichtlé, Sean McQuade, Brian Nguyen, Megan Ross, Sydney Truong, Eugene Vinitsky, Yibo Zhao, Jonathan Sprinkle, Benedetto Piccoli, et al. (3 additional authors not shown)

    Abstract: This work presents an integrated framework of: vehicle dynamics models, with a particular attention to instabilities and traffic waves; vehicle energy models, with particular attention to accurate energy values for strongly unsteady driving profiles; and sparse Lagrangian controls via automated vehicles, with a focus on controls that can be executed via existing technology such as adaptive cruise…

    Submitted 22 April, 2021; originally announced April 2021.

  20. arXiv:2104.01061  [pdf, other]

    cs.IT cs.LG eess.SP math.DG stat.ML

    Information Geometry and Classical Cramér-Rao Type Inequalities

    Authors: Kumar Vijay Mishra, M. Ashok Kumar

    Abstract: We examine the role of information geometry in the context of classical Cramér-Rao (CR) type inequalities. In particular, we focus on Eguchi's theory of obtaining dualistic geometric structures from a divergence function and then applying Amari-Nagaoka's theory to obtain a CR type inequality. The classical deterministic CR inequality is derived from Kullback-Leibler (KL)-divergence. We show that t…

    Submitted 21 August, 2021; v1 submitted 2 April, 2021; originally announced April 2021.

    Comments: 34 pages, 2 figures, 1 table, book chapter. arXiv admin note: text overlap with arXiv:2001.04769

  21. arXiv:2011.10885  [pdf, other]

    eess.SY

    A Comprehensive Survey on Real-Time Voltage Stability Assessment for Power Systems

    Authors: Gourav Wadhwa, Amandeep Kharb, Satyam Mishra, Mohit Kumar, Shreyansh Srivastav

    Abstract: Accurate real-time assessment of power system voltage stability has been an active area of research in the past few decades. In the past decade, after the development of phasor measurement units (PMU), there has been much discussion of phasor measurement techniques for real-time voltage stability. The fundamental idea behind these methods is to find the Thevenin equivalents of the system, an…

    Submitted 21 November, 2020; originally announced November 2020.

    Comments: Accepted at IEEE International Conference on Industrial and Information Systems (ICIIS), 2020

  22. arXiv:2011.10527  [pdf, other]

    eess.AS

    Multi-Scale Speaker Diarization With Neural Affinity Score Fusion

    Authors: Tae Jin Park, Manoj Kumar, Shrikanth Narayanan

    Abstract: Identifying the identity of the speaker of short segments in human dialogue has been considered one of the most challenging problems in speech signal processing. Speaker representations of short speech segments tend to be unreliable, resulting in poor fidelity of speaker representations in tasks requiring speaker recognition. In this paper, we propose an unconventional method that tackles the trad…

    Submitted 20 November, 2020; originally announced November 2020.

    Comments: Submitted to ICASSP 2021

  23. arXiv:2009.04983  [pdf, other]

    eess.AS cs.SD

    Exploration of End-to-end Synthesisers for Zero Resource Speech Challenge 2020

    Authors: Karthik Pandia D S, Anusha Prakash, Mano Ranjith Kumar, Hema A Murthy

    Abstract: A spoken dialogue system for an unseen language is referred to as Zero resource speech. It is especially beneficial for developing applications for languages that have low digital resources. Zero resource speech synthesis is the task of building text-to-speech (TTS) models in the absence of transcriptions. In this work, speech is modelled as a sequence of transient and steady-state acoustic units,…

    Submitted 10 September, 2020; originally announced September 2020.

    Comments: Accepted for publication in Interspeech 2020

  24. arXiv:2008.11121  [pdf, other]

    eess.SP

    Use of adaptive filtering techniques and deconvolution to obtain low range sidelobe samples

    Authors: Mohit Kumar, V. Chandrasekar

    Abstract: In this paper the use of adaptive filtering techniques to obtain better peak sidelobe suppression and integrated sidelobe energy will be discussed with regard to weather radars and obtaining better sensitivity with this technique. The performance of these new coefficient sets obtained with adaptive filter (using RLS optimization) will be discussed and presented. They will also be compared with the…

    Submitted 9 August, 2020; originally announced August 2020.

    Comments: Presented at 38th Conference on Radar Meteorology, Chicago 2017

  25. arXiv:2007.16196  [pdf, other]

    eess.AS cs.SD

    Designing Neural Speaker Embeddings with Meta Learning

    Authors: Manoj Kumar, Tae Jin Park, Somer Bishop, Shrikanth Narayanan

    Abstract: Neural speaker embeddings trained using classification objectives have demonstrated state-of-the-art performance in multiple applications. Typically, such embeddings are trained on an out-of-domain corpus on a single task e.g., speaker classification, albeit with a large number of classes (speakers). In this work, we reformulate embedding training under the meta-learning paradigm. We redistribute…

    Submitted 31 July, 2020; originally announced July 2020.

  26. Evidence of Task-Independent Person-Specific Signatures in EEG using Subspace Techniques

    Authors: Mari Ganesh Kumar, Shrikanth Narayanan, Mriganka Sur, Hema A Murthy

    Abstract: Electroencephalography (EEG) signals are promising as alternatives to other biometrics owing to their protection against spoofing. Previous studies have focused on capturing individual variability by analyzing task/condition-specific EEG. This work attempts to model biometric signatures independent of task/condition by normalizing the associated variance. Toward this goal, the paper extends ideas…

    Submitted 25 March, 2021; v1 submitted 27 July, 2020; originally announced July 2020.

    Comments: ©2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

    Journal ref: IEEE Transactions on Information Forensics and Security, 2021

  27. arXiv:2007.09635  [pdf, other]

    eess.AS cs.SD

    Meta-learning with Latent Space Clustering in Generative Adversarial Network for Speaker Diarization

    Authors: Monisankha Pal, Manoj Kumar, Raghuveer Peri, Tae Jin Park, So Hyun Kim, Catherine Lord, Somer Bishop, Shrikanth Narayanan

    Abstract: The performance of most speaker diarization systems with x-vector embeddings is both vulnerable to noisy environments and lacks domain robustness. Earlier work on speaker diarization using generative adversarial network (GAN) with an encoder network (ClusterGAN) to project input x-vectors into a latent space has shown promising performance on meeting data. In this paper, we extend the ClusterGAN n…

    Submitted 19 July, 2020; originally announced July 2020.

    Comments: Submitted to IEEE/ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING

  28. arXiv:2007.07793  [pdf, other]

    cs.RO cs.AI cs.LG eess.SY

    Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV with Thrust Vectoring Rotors

    Authors: Aditya M. Deshpande, Rumit Kumar, Ali A. Minai, Manish Kumar

    Abstract: In this paper, we present a novel developmental reinforcement learning-based controller for a quadcopter with thrust vectoring capabilities. This multirotor UAV design has tilt-enabled rotors. It utilizes the rotor force magnitude and direction to achieve the desired state during flight. The control policy of this robot is learned using the policy transfer from the learned controller of the quadco…

    Submitted 15 July, 2020; originally announced July 2020.

    Comments: 10 pages, 8 figures, Accepted in Dynamic Systems and Control Conference (https://event.asme.org/DSCC)

  29. arXiv:2006.15686  [pdf, other]

    cs.RO eess.SY math.OC

    Quaternion Feedback Based Autonomous Control of a Quadcopter UAV with Thrust Vectoring Rotors

    Authors: Rumit Kumar, Mahathi Bhargavapuri, Aditya M. Deshpande, Siddharth Sridhar, Kelly Cohen, Manish Kumar

    Abstract: In this paper, we present an autonomous flight controller for a quadcopter with thrust vectoring capabilities. This UAV falls in the category of multirotors with tilt-motion enabled rotors. Since the vehicle considered is over-actuated in nature, the dynamics and control allocation have to be analysed carefully. Moreover, the possibility of hovering at large attitude maneuvers of this novel vehicl…

    Submitted 28 June, 2020; originally announced June 2020.

    Comments: Accepted for publication in American Controls Conference 2020, 6-Pages, 10 figures

  30. Computer Vision Toolkit for Non-invasive Monitoring of Factory Floor Artifacts

    Authors: Aditya M. Deshpande, Anil Kumar Telikicherla, Vinay Jakkali, David A. Wickelhaus, Manish Kumar, Sam Anand

    Abstract: Digitization has made smart, connected technologies an integral part of businesses, governments and communities. For manufacturing digitization, there has been active research and development with a focus on Cloud Manufacturing (CM) and the Industrial Internet of Things (IIoT). This work presents a computer vision toolkit (CV Toolkit) for non-invasive digitization of the factory floor in line…

    Submitted 12 May, 2020; originally announced May 2020.

    Comments: Accepted for publication in 48th SME North American Manufacturing Research Conference (NAMRC48)

    Journal ref: Procedia Manufacturing 48 (2020) 1020-1028

  31. arXiv:2005.05815  [pdf, other]

    cs.CV cs.AI cs.LG eess.IV

    One-Shot Recognition of Manufacturing Defects in Steel Surfaces

    Authors: Aditya M. Deshpande, Ali A. Minai, Manish Kumar

    Abstract: Quality control is an essential process in manufacturing to make the product defect-free as well as to meet customer needs. The automation of this process is important to maintain high quality along with the high manufacturing throughput. With recent developments in deep learning and computer vision technologies, it has become possible to detect various features from the images with near-human acc…

    Submitted 12 May, 2020; originally announced May 2020.

    Comments: Accepted for publication in NAMRC 48

    Journal ref: Procedia Manufacturing 48 (2020) 1064-1071

  32. arXiv:2004.12920  [pdf, other]

    cs.RO eess.SY

    Flight Control of Sliding Arm Quadcopter with Dynamic Structural Parameters

    Authors: Rumit Kumar, Aditya M. Deshpande, James Z. Wells, Manish Kumar

    Abstract: The conceptual design and flight controller of a novel kind of quadcopter are presented. This design is capable of morphing the shape of the UAV during flight to achieve position and attitude control. We consider a dynamic center of gravity (CoG) which causes continuous variation in the moment of inertia (MoI) parameters of the UAV in this design. These dynamic structural parameters play a vital rol…

    Submitted 27 April, 2020; originally announced April 2020.

    Comments: 6 Pages

  33. arXiv:2003.13387  [pdf, other]

    eess.SP

    Intermediate frequency Upgrade design features of NASA D3R Weather Radar System

    Authors: Mohit Kumar, Shashank Joshil, Manuel Vega, Robert Beauchamp, V Chandrasekar

    Abstract: The NASA dual-frequency, dual-polarization, Doppler radar (D3R) is an important ground validation tool for the global precipitation measurement (GPM) mission dual-frequency precipitation radar (DPR). The D3R has undergone extensive field trials starting in 2011 and continues to provide observations that enhance our scientific knowledge. To further enhance its capabilities, the Intermediate frequen…

    Submitted 25 June, 2020; v1 submitted 29 February, 2020; originally announced March 2020.

  34. Inter Pulse Frequency Diversity System for Second Trip Suppression and Retrieval in a Weather Radar

    Authors: V Chandrasekar, Mohit Kumar

    Abstract: This paper develops the use of Inter-pulse frequency diversity scheme for a weather radar system. It establishes the performance of frequency diversity technique comparing it with other inter-pulse schemes for weather radar systems. Inter-pulse coding is widely used for second trip suppression or cross-polarization isolation. Here, a new inter-pulse scheme is discussed taking advantage of frequenc…

    Submitted 22 February, 2020; originally announced March 2020.

  35. arXiv:2003.05352  [pdf, other]

    eess.SP

    Coding schemes and Applications for Weather Radars

    Authors: Mohit Kumar, V. Chandrasekar, Shashank Joshil

    Abstract: In this paper, we describe the evolution of a pair of polyphase coded waveforms for use in second trip suppression in weather radar. The polyphase codes were designed and tested on NASA weather radar. The NASA dual-frequency, dual-polarization Doppler radar (D3R) was developed primarily as a ground validation tool for the GPM satellite dual-frequency radar. Recently, the D3R radar was upgraded with…

    Submitted 20 February, 2020; originally announced March 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1912.00041

  36. Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap

    Authors: Tae Jin Park, Kyu J. Han, Manoj Kumar, Shrikanth Narayanan

    Abstract: In this study, we propose a new spectral clustering framework that can auto-tune the parameters of the clustering algorithm in the context of speaker diarization. The proposed framework uses normalized maximum eigengap (NME) values to estimate the number of clusters and the parameters for the threshold of the elements of each row in an affinity matrix during spectral clustering, without the use of…

    Submitted 4 March, 2020; originally announced March 2020.

    Comments: in IEEE Signal Processing Letters, 2020

  37. arXiv:2002.04732  [pdf, other]

    cs.IT eess.SP math.ST stat.ML

    Generalized Bayesian Cramér-Rao Inequality via Information Geometry of Relative $α$-Entropy

    Authors: Kumar Vijay Mishra, M. Ashok Kumar

    Abstract: The relative $α$-entropy is the Rényi analog of relative entropy and arises prominently in information-theoretic problems. Recent information geometric investigations on this quantity have enabled the generalization of the Cramér-Rao inequality, which provides a lower bound for the variance of an estimator of an escort of the underlying parametric probability distribution. However, this framework…

    Submitted 11 February, 2020; originally announced February 2020.

    Comments: 6 pages

  38. arXiv:2001.04769  [pdf, ps, other]

    cs.IT eess.SP math.ST stat.ML

    Cramér-Rao Lower Bounds Arising from Generalized Csiszár Divergences

    Authors: M. Ashok Kumar, Kumar Vijay Mishra

    Abstract: We study the geometry of probability distributions with respect to a generalized family of Csiszár $f$-divergences. A member of this family is the relative $α$-entropy which is also a Rényi analog of relative entropy in information theory and known as logarithmic or projective power divergence in statistics. We apply Eguchi's theory to derive the Fisher information metric and the dual affine conne…

    Submitted 24 May, 2020; v1 submitted 14 January, 2020; originally announced January 2020.

    Comments: 25 pages

  39. arXiv:1912.12384  [pdf, other]

    eess.AS cs.LG cs.SD eess.SP stat.ML

    Improved Multi-Stage Training of Online Attention-based Encoder-Decoder Models

    Authors: Abhinav Garg, Dhananjaya Gowda, Ankur Kumar, Kwangyoun Kim, Mehul Kumar, Chanwoo Kim

    Abstract: In this paper, we propose a refined multi-stage multi-task training strategy to improve the performance of online attention-based encoder-decoder (AED) models. A three-stage training based on three levels of architectural granularity namely, character encoder, byte pair encoding (BPE) based encoder, and attention decoder, is proposed. Also, multi-task learning based on two-levels of linguistic gra…

    Submitted 27 December, 2019; originally announced December 2019.

    Comments: Accepted and presented at the ASRU 2019 conference

  40. arXiv:1912.11041  [pdf, ps, other]

    eess.AS cs.LG cs.SD stat.ML

    power-law nonlinearity with maximally uniform distribution criterion for improved neural network training in automatic speech recognition

    Authors: Chanwoo Kim, Mehul Kumar, Kwangyoun Kim, Dhananjaya Gowda

    Abstract: In this paper, we describe the Maximum Uniformity of Distribution (MUD) algorithm with the power-law nonlinearity. In this approach, we hypothesize that neural network training will become more stable if the feature distribution is not too skewed. We propose two different types of MUD approaches: power function-based MUD and histogram-based MUD. In these approaches, we first obtain the mel filter…

    Submitted 21 December, 2019; originally announced December 2019.

    Comments: Accepted and presented at the ASRU 2019 conference

  41. arXiv:1912.11040  [pdf, ps, other]

    eess.AS cs.LG cs.SD eess.SP stat.ML

    end-to-end training of a large vocabulary end-to-end speech recognition system

    Authors: Chanwoo Kim, Sungsoo Kim, Kwangyoun Kim, Mehul Kumar, Jiyeon Kim, Kyungmin Lee, Changwoo Han, Abhinav Garg, Eunhyang Kim, Minkyoo Shin, Shatrughan Singh, Larry Heck, Dhananjaya Gowda

    Abstract: In this paper, we present an end-to-end training framework for building state-of-the-art end-to-end speech recognition systems. Our training system utilizes a cluster of Central Processing Units (CPUs) and Graphics Processing Units (GPUs). The entire data reading, large scale data augmentation, and neural network parameter updates are all performed "on-the-fly". We use vocal tract length perturbation […

    Submitted 21 December, 2019; originally announced December 2019.

    Comments: Accepted and presented at the ASRU 2019 conference

  42. arXiv:1912.09463  [pdf, other]

    eess.SP

    Finite State Markov Modeling of Fading Channels Towards Decoding of LDPC Codes

    Authors: Mohit Kumar

    Abstract: We propose two decoding strategies for low-density parity-check (LDPC) codes over Markov noise channels with bit-flipping noise. The sum-product algorithm used for decoding LDPC codes over memoryless channels is extended to include channel estimation, and the gain obtained by doing so is simulated and verified. LDPC codes have been studied for years over memoryless channels and are…

    Submitted 19 October, 2019; originally announced December 2019.

  43. arXiv:1912.09215  [pdf]

    eess.SP

    Receive signal path design for Active phased array radars

    Authors: Mohit Kumar, Dileep, K. Sreenivasulu, D. Seshagiri, Durga Srinivas, S. Narasimhan

    Abstract: Modern active phased array radar systems with a large number of T/R modules, multi-channel receiver down converters and distributed power distribution networks make the design and analysis of the receive signal path more complex. In this paper, the receive signal path design of a typical fully distributed active phased array radar based on 1000 T/R modules is discussed in detail, including the gain, Spurio…

    Submitted 19 October, 2019; originally announced December 2019.

    Comments: Presented at International Radar symposium of India, 2013

  44. Intra-Pulse Polyphase Coding System for Second Trip Suppression in a Weather Radar

    Authors: Mohit Kumar, V Chandrasekar

    Abstract: This paper describes the design and implementation of intra-pulse polyphase codes for a weather radar system. Algorithms to generate codes with good correlation properties are discussed. Thereafter, a new design framework is described, which optimizes the polyphase code and corresponding mismatched filter, using a cost/error function, especially for weather radars. It establishes the performance o…

    Submitted 29 November, 2019; originally announced December 2019.

    Comments: To be published in IEEE Transactions on Geoscience and Remote Sensing

  45. arXiv:1910.11472  [pdf, other]

    eess.AS cs.LG cs.SD

    Learning Domain Invariant Representations for Child-Adult Classification from Speech

    Authors: Rimita Lahiri, Manoj Kumar, Somer Bishop, Shrikanth Narayanan

    Abstract: Diagnostic procedures for ASD (autism spectrum disorder) involve semi-naturalistic interactions between the child and a clinician. Computational methods to analyze these sessions require an end-to-end speech and language processing pipeline that goes from raw audio to clinically meaningful behavioral features. An important component of this pipeline is the ability to automatically detect who is spea…

    Submitted 24 October, 2019; originally announced October 2019.

    Comments: Submitted to ICASSP 2020

  46. arXiv:1910.11416  [pdf, ps, other]

    eess.AS cs.SD

    A study of semi-supervised speaker diarization system using gan mixture model

    Authors: Monisankha Pal, Manoj Kumar, Raghuveer Peri, Shrikanth Narayanan

    Abstract: We propose a new speaker diarization system based on a recently introduced unsupervised clustering technique, namely the generative adversarial network mixture model (GANMM). The proposed system uses x-vectors as front-end representation. Spectral embedding is used for dimensionality reduction followed by k-means initialization during GANMM pre-training. GANMM performs unsupervised speaker clustering…

    Submitted 24 October, 2019; originally announced October 2019.

    Comments: Submitted to ICASSP 2020

  47. arXiv:1910.11400  [pdf, other]

    eess.AS cs.SD

    Meta-learning for robust child-adult classification from speech

    Authors: Nithin Rao Koluguri, Manoj Kumar, So Hyun Kim, Catherine Lord, Shrikanth Narayanan

    Abstract: Computational modeling of naturalistic conversations in clinical applications has seen growing interest in the past decade. An important use-case involves child-adult interactions within the autism diagnosis and intervention domain. In this paper, we address a specific sub-problem of speaker diarization, namely child-adult speaker classification in such dyadic conversations with specified roles. T…

    Submitted 28 October, 2019; v1 submitted 24 October, 2019; originally announced October 2019.

  48. arXiv:1910.11398  [pdf, ps, other]

    eess.AS cs.SD

    Speaker diarization using latent space clustering in generative adversarial network

    Authors: Monisankha Pal, Manoj Kumar, Raghuveer Peri, Tae Jin Park, So Hyun Kim, Catherine Lord, Somer Bishop, Shrikanth Narayanan

    Abstract: In this work, we propose deep latent space clustering for speaker diarization using generative adversarial network (GAN) backprojection with the help of an encoder network. The proposed diarization system is trained jointly with GAN loss, latent variable recovery loss, and a clustering-specific loss. It uses x-vector speaker embeddings at the input, while the latent variables are sampled from a co…

    Submitted 24 October, 2019; originally announced October 2019.

    Comments: Submitted to ICASSP 2020

  49. arXiv:1910.08861  [pdf]

    eess.SP

    A Novel Scheme of Digital Instantaneous Automatic Gain Control (DIAGC) for Pulse Radars

    Authors: Sumanta Pal, Nirmala Shanmugam, Mohit Kumar, P Radhakrishna

    Abstract: Several schemes for gain control are used for preventing saturation of the receiver, and overloading of the data processor, tracker or display in pulse radars. The use of digital processing techniques opens the door to a variety of digital automatic gain control schemes for analyzing digitized return signals and controlling receiver gain only at saturating clutter zones without affecting the detection at o…

    Submitted 19 October, 2019; originally announced October 2019.

    Comments: Presented at International Symposium of India 2011

  50. arXiv:1910.08860  [pdf]

    eess.SP

    Distributed High Speed Optical Network for Digital Radar Systems

    Authors: Vishal Maheshwari, K. Sreenivasulu, Mohit Kumar, Dr. Vengada Rajan, Sumant Pal, Mohana Kumari

    Abstract: Modern digital radar systems with multiple digital beamforming capability are built from a large number of receivers and require high-speed data interface links for transmission of receiver baseband data to processor units. High data throughput (>250 Mbyte/sec) from typical eight-channel receivers will be transmitted to the digital beamformer over high-speed serial interface links over an optical channe…

    Submitted 19 October, 2019; originally announced October 2019.

    Comments: Presented at International Radar symposium of India 2013