
Showing 1–50 of 94 results for author: Rao, M

Searching in archive cs.
  1. arXiv:2408.14419  [pdf, other]

    cs.AI cs.CL cs.CV

    CHARTOM: A Visual Theory-of-Mind Benchmark for Multimodal Large Language Models

    Authors: Shubham Bharti, Shiyun Cheng, Jihyun Rho, Martina Rao, Xiaojin Zhu

    Abstract: We introduce CHARTOM, a visual theory-of-mind benchmark for multimodal large language models. CHARTOM consists of specially designed data visualizing charts. Given a chart, a language model needs to not only correctly comprehend the chart (the FACT question) but also judge if the chart will be misleading to a human reader (the MIND question). Both questions have significant societal benefits. We d…

    Submitted 26 August, 2024; originally announced August 2024.

  2. arXiv:2407.04208  [pdf, other]

    cs.CV

    AMD: Automatic Multi-step Distillation of Large-scale Vision Models

    Authors: Cheng Han, Qifan Wang, Sohail A. Dianat, Majid Rabbani, Raghuveer M. Rao, Yi Fang, Qiang Guan, Lifu Huang, Dongfang Liu

    Abstract: Transformer-based architectures have become the de-facto standard models for diverse vision tasks owing to their superior performance. As the size of the models continues to scale up, model distillation becomes extremely important in various real applications, particularly on devices limited by computational resources. However, prevailing knowledge distillation methods exhibit diminished efficacy…

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 19 pages, 5 figures

  3. arXiv:2406.01559  [pdf, other]

    cs.CV

    Prototypical Transformer as Unified Motion Learners

    Authors: Cheng Han, Yawen Lu, Guohao Sun, James C. Liang, Zhiwen Cao, Qifan Wang, Qiang Guan, Sohail A. Dianat, Raghuveer M. Rao, Tong Geng, Zhiqiang Tao, Dongfang Liu

    Abstract: In this work, we introduce the Prototypical Transformer (ProtoFormer), a general and unified framework that approaches various motion tasks from a prototype perspective. ProtoFormer seamlessly integrates prototype learning with Transformer by thoughtfully considering motion dynamics, introducing two innovative designs. First, Cross-Attention Prototyping discovers prototypes based on signature moti…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 21 pages, 10 figures

  4. arXiv:2406.00314  [pdf, other]

    cs.CL cs.AI cs.LG

    CASE: Efficient Curricular Data Pre-training for Building Assistive Psychology Expert Models

    Authors: Sarthak Harne, Monjoy Narayan Choudhury, Madhav Rao, TK Srikanth, Seema Mehrotra, Apoorva Vashisht, Aarushi Basu, Manjit Sodhi

    Abstract: The limited availability of psychologists necessitates efficient identification of individuals requiring urgent mental healthcare. This study explores the use of Natural Language Processing (NLP) pipelines to analyze text data from online mental health forums used for consultations. By analyzing forum posts, these pipelines can flag users who may require immediate professional attention. A crucial…

    Submitted 16 June, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

  5. arXiv:2403.19786  [pdf, other]

    cs.CV

    Zero-shot Prompt-based Video Encoder for Surgical Gesture Recognition

    Authors: Mingxing Rao, Yinhong Qin, Soheil Kolouri, Jie Ying Wu, Daniel Moyer

    Abstract: Purpose: In order to produce a surgical gesture recognition system that can support a wide variety of procedures, either a very large annotated dataset must be acquired, or fitted models must generalize to new labels (so-called "zero-shot" capability). In this paper we investigate the feasibility of the latter option. Methods: Leveraging the Bridge-Prompt framework, we prompt-tune a pre-trained vision…

    Submitted 21 August, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: 17 pages, 4 figures, 7 tables, IPCAI 2024 & IJCARS

  6. arXiv:2312.08267  [pdf, other]

    eess.IV cs.CV q-bio.QM

    TABSurfer: a Hybrid Deep Learning Architecture for Subcortical Segmentation

    Authors: Aaron Cao, Vishwanatha M. Rao, Kejia Liu, Xinru Liu, Andrew F. Laine, Jia Guo

    Abstract: Subcortical segmentation remains challenging despite its important applications in quantitative structural analysis of brain MRI scans. The most accurate method, manual segmentation, is highly labor intensive, so automated tools like FreeSurfer have been adopted to handle this task. However, these traditional pipelines are slow and inefficient for processing large datasets. In this study, we propo…

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: 5 pages, 3 figures, 2 tables

  7. arXiv:2311.17705  [pdf, other]

    cs.SE

    Q-PAC: Automated Detection of Quantum Bug-Fix Patterns

    Authors: Pranav K. Nayak, Krishn V. Kher, M. Bharat Chandra, M. V. Panduranga Rao, Lei Zhang

    Abstract: Context: Bug-fix pattern detection has been investigated in the past in the context of classical software. However, while quantum software is developing rapidly, the literature still lacks automated methods and tools to identify, analyze, and detect bug-fix patterns. To the best of our knowledge, our work previously published in SEKE'23 was the first to leverage classical techniques to detect bug-…

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: 16 pages, 2 figures

  8. arXiv:2311.15072  [pdf, other]

    cs.CV cs.AI

    Introducing SSBD+ Dataset with a Convolutional Pipeline for detecting Self-Stimulatory Behaviours in Children using raw videos

    Authors: Vaibhavi Lokegaonkar, Vijay Jaisankar, Pon Deepika, Madhav Rao, T K Srikanth, Sarbani Mallick, Manjit Sodhi

    Abstract: Conventionally, evaluation for the diagnosis of Autism spectrum disorder is done by a trained specialist through questionnaire-based formal assessments and by observation of behavioral cues under various settings to capture the early warning signs of autism. These evaluation techniques are highly subjective and their accuracy relies on the experience of the specialist. In this regard, machine lear…

    Submitted 25 November, 2023; originally announced November 2023.

    Comments: Copyright 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  9. arXiv:2311.12310  [pdf, other]

    cs.AI cs.LG

    IEKM: A Model Incorporating External Keyword Matrices

    Authors: Cheng Luo, Qin Li, Zhao Yan, Mengliang Rao, Yunbo Cao

    Abstract: A customer service platform system with a core text semantic similarity (STS) task faces two urgent challenges: Firstly, one platform system needs to adapt to different domains of customers, i.e., different domains adaptation (DDA). Secondly, it is difficult for the model of the platform system to distinguish sentence pairs that are literally close but semantically different, i.e., hard negative s…

    Submitted 20 November, 2023; originally announced November 2023.

  10. arXiv:2311.06261  [pdf, other]

    cs.CY cs.AI

    With ChatGPT, do we have to rewrite our learning objectives -- CASE study in Cybersecurity

    Authors: Peter Jamieson, Suman Bhunia, Dhananjai M. Rao

    Abstract: With the emergence of Artificial Intelligence chatbot tools such as ChatGPT and code writing AI tools such as GitHub Copilot, educators need to question what and how we should teach our courses and curricula in the future. In reality, automated tools may result in certain academic fields being deeply reduced in the number of employable people. In this work, we make a case study of cybersecurity und…

    Submitted 26 September, 2023; originally announced November 2023.

  11. An Effective Deep Learning Based Multi-Class Classification of DoS and DDoS Attack Detection

    Authors: Arun Kumar Silivery, Kovvur Ram Mohan Rao, L K Suresh Kumar

    Abstract: In the past few years, cybersecurity has become very important due to the rise in internet users. Internet attacks such as Denial of Service (DoS) and Distributed Denial of Service (DDoS) attacks severely harm a website or server and make them unavailable to other users. Network monitoring and control systems have found it challenging to identify the many classes of DoS and DDoS attacks since…

    Submitted 17 August, 2023; originally announced August 2023.

  12. arXiv:2308.02013  [pdf, other]

    cs.SD cs.CL cs.LG eess.AS

    Federated Representation Learning for Automatic Speech Recognition

    Authors: Guruprasad V Ramesh, Gopinath Chennupati, Milind Rao, Anit Kumar Sahu, Ariya Rastrow, Jasha Droppo

    Abstract: Federated Learning (FL) is a privacy-preserving paradigm, allowing edge devices to learn collaboratively without sharing data. Edge devices like Alexa and Siri are prospective sources of unlabeled audio data that can be tapped to learn robust audio representations. In this work, we bring Self-supervised Learning (SSL) and FL together to learn representations for Automatic Speech Recognition respec…

    Submitted 7 August, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

    Comments: Accepted at ISCA SPSC Symposium 3rd Symposium on Security and Privacy in Speech Communication, 2023

  13. arXiv:2307.03968  [pdf, other]

    cs.CE math.NA

    Multi-Level Power Series Solution for Large Surface and Volume Electric Field Integral Equation

    Authors: Y. K. Negi, N. Balakrishnan, S. M. Rao

    Abstract: In this paper, we propose a new multilevel power series solution method for solving a large surface and volume electric field integral equation based H-Matrix. The proposed solution method converges in a fixed number of iterations and is solved at each level of the H-Matrix computation. The solution method avoids the computation of a full matrix, as it can be solved independently at each level, sta…

    Submitted 8 July, 2023; originally announced July 2023.

    Comments: 8 pages. The Applied Computational Electromagnetics Society Journal (ACES) 2023

  14. arXiv:2306.12015  [pdf, other]

    eess.AS cs.SD

    Federated Self-Learning with Weak Supervision for Speech Recognition

    Authors: Milind Rao, Gopinath Chennupati, Gautam Tiwari, Anit Kumar Sahu, Anirudh Raju, Ariya Rastrow, Jasha Droppo

    Abstract: Automatic speech recognition (ASR) models with low-footprint are increasingly being deployed on edge devices for conversational agents, which enhances privacy. We study the problem of federated continual incremental learning for recurrent neural network-transducer (RNN-T) ASR models in the privacy-enhancing scheme of learning on-device, without access to ground truth human transcripts or machine t…

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: Proceedings of ICASSP 2023

  15. Learning When to Trust Which Teacher for Weakly Supervised ASR

    Authors: Aakriti Agrawal, Milind Rao, Anit Kumar Sahu, Gopinath Chennupati, Andreas Stolcke

    Abstract: Automatic speech recognition (ASR) training can utilize multiple experts as teacher models, each trained on a specific domain or accent. Teacher models may be opaque in nature since their architecture may not be known or their training cadence is different from that of the student ASR model. Still, the student models are updated incrementally using the pseudo-labels generated independently by t…

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: Proceedings of INTERSPEECH 2023

    Journal ref: Proc. Interspeech, Aug. 2023, pp. 381-385

  16. arXiv:2305.19444  [pdf]

    cs.HC

    Pixelated Interactions: Exploring Pixel Art for Graphical Primitives on a Tactile Display

    Authors: Tigmanshu Bhatnagar, Vikas Upadhyay, Anchal Sharma, P V Madhusudhan Rao, Mark Miodownik, Nicolai Marquardt, Catherine Holloway

    Abstract: Two-dimensional pin array tactile displays enable access to tactile graphics that are important for the education of students with visual impairments. Due to their prohibitive cost, limited access, and limited research within HCI, the rules to design graphical primitives on these low-resolution tactile displays are unclear. In this paper, eight tactile readers with visual impairments qualitatively…

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: 25 pages, 10 figures. To appear in DIS'23 Designing Interactive Systems Conference, July 10 to 14, 2023, Pittsburgh, PA, USA

  17. arXiv:2303.02043  [pdf, other]

    cs.RO eess.SY

    An Integrated Real-time UAV Trajectory Optimization with Potential Field Approach for Dynamic Collision Avoidance

    Authors: D. M. K. K. Venkateswara Rao, Hamed Habibi, Jose Luis Sanchez-Lopez, Holger Voos

    Abstract: This paper presents an integrated approach that combines trajectory optimization and Artificial Potential Field (APF) method for real-time optimal Unmanned Aerial Vehicle (UAV) trajectory planning and dynamic collision avoidance. A minimum-time trajectory optimization problem is formulated with initial and final positions as boundary conditions and collision avoidance as constraints. It is transcr…

    Submitted 3 March, 2023; originally announced March 2023.

  18. Iterative RNDOP-Optimal Anchor Placement for Beyond Convex Hull ToA-based Localization: Performance Bounds and Heuristic Algorithms

    Authors: Raghunandan M. Rao, Don-Roberts Emenonye

    Abstract: Localizing targets outside the anchors' convex hull is an understudied but prevalent scenario in vehicle-centric, UAV-based, and self-localization applications. Considering such scenarios, this paper studies the optimal anchor placement problem for Time-of-Arrival (ToA)-based localization schemes such that the worst-case Dilution of Precision (DOP) is minimized. Building on prior results on DOP sc…

    Submitted 17 February, 2024; v1 submitted 16 December, 2022; originally announced December 2022.

    Comments: 16 pages. To appear in a future issue of the IEEE Transactions on Vehicular Technology

  19. arXiv:2212.07112  [pdf, other]

    cs.CL cs.AI cs.IR

    DialogQAE: N-to-N Question Answer Pair Extraction from Customer Service Chatlog

    Authors: Xin Zheng, Tianyu Liu, Haoran Meng, Xu Wang, Yufan Jiang, Mengliang Rao, Binghuai Lin, Zhifang Sui, Yunbo Cao

    Abstract: Harvesting question-answer (QA) pairs from customer service chatlog in the wild is an efficient way to enrich the knowledge base for customer service chatbots in the cold start or continuous integration scenarios. Prior work attempts to obtain 1-to-1 QA pairs from growing customer service chatlog, which fails to integrate the incomplete utterances from the dialog context for composite QA retrieval…

    Submitted 14 December, 2022; originally announced December 2022.

    Comments: Preprint version; The first three authors contribute equally

  20. arXiv:2210.12689  [pdf, other]

    cs.CV

    Face Emotion Recognization Using Dataset Augmentation Based on Neural Network

    Authors: Mengyu Rao, Ruyi Bao, Liangshun Dong

    Abstract: Facial expression is one of the most external indications of a person's feelings and emotions. In daily conversation, according to the psychologist, only 7% and 38% of information is communicated through words and sounds respectively, while up to 55% is through facial expression. It plays an important role in coordinating interpersonal relationships. Ekman and Friesen recognized six essential emotio…

    Submitted 21 November, 2022; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: 5 pages, 8 figures, 3 tables

  21. ILASR: Privacy-Preserving Incremental Learning for Automatic Speech Recognition at Production Scale

    Authors: Gopinath Chennupati, Milind Rao, Gurpreet Chadha, Aaron Eakin, Anirudh Raju, Gautam Tiwari, Anit Kumar Sahu, Ariya Rastrow, Jasha Droppo, Andy Oberlin, Buddha Nandanoor, Prahalad Venkataramanan, Zheng Wu, Pankaj Sitpure

    Abstract: Incremental learning is one paradigm to enable model building and updating at scale with streaming data. For end-to-end automatic speech recognition (ASR) tasks, the absence of human annotated labels along with the need for privacy preserving policies for model building makes it a daunting challenge. Motivated by these challenges, in this paper we use a cloud based framework for production systems…

    Submitted 22 July, 2022; v1 submitted 19 July, 2022; originally announced July 2022.

    Comments: 9 pages

  22. arXiv:2207.07033  [pdf, other]

    cs.AI cs.CY

    Developing a Series of AI Challenges for the United States Department of the Air Force

    Authors: Vijay Gadepally, Gregory Angelides, Andrei Barbu, Andrew Bowne, Laura J. Brattain, Tamara Broderick, Armando Cabrera, Glenn Carl, Ronisha Carter, Miriam Cha, Emilie Cowen, Jesse Cummings, Bill Freeman, James Glass, Sam Goldberg, Mark Hamilton, Thomas Heldt, Kuan Wei Huang, Phillip Isola, Boris Katz, Jamie Koerner, Yen-Chen Lin, David Mayo, Kyle McAlpin, Taylor Perron, et al. (17 additional authors not shown)

    Abstract: Through a series of federal initiatives and orders, the U.S. Government has been making a concerted effort to ensure American leadership in AI. These broad strategy documents have influenced organizations such as the United States Department of the Air Force (DAF). The DAF-MIT AI Accelerator is an initiative between the DAF and MIT to bridge the gap between AI researchers and DAF mission requireme…

    Submitted 14 July, 2022; originally announced July 2022.

  23. arXiv:2206.12980  [pdf]

    eess.IV cs.CV q-bio.QM

    Detecting Schizophrenia with 3D Structural Brain MRI Using Deep Learning

    Authors: Junhao Zhang, Vishwanatha M. Rao, Ye Tian, Yanting Yang, Nicolas Acosta, Zihan Wan, Pin-Yu Lee, Chloe Zhang, Lawrence S. Kegeles, Scott A. Small, Jia Guo

    Abstract: Schizophrenia is a chronic neuropsychiatric disorder that causes distinct structural alterations within the brain. We hypothesize that deep learning applied to a structural neuroimaging dataset could detect disease-related alteration and improve classification and diagnostic accuracy. We tested this hypothesis using a single, widely available, and conventional T1-weighted MRI scan, from which we e…

    Submitted 7 July, 2022; v1 submitted 26 June, 2022; originally announced June 2022.

    Comments: 13 pages, 6 figures

  24. arXiv:2204.08811  [pdf, other]

    cs.CL cs.AI

    SmartSales: Sales Script Extraction and Analysis from Sales Chatlog

    Authors: Hua Liang, Tianyu Liu, Peiyi Wang, Mengliang Rao, Yunbo Cao

    Abstract: In modern sales applications, automatic script extraction and management greatly decrease the need for human labor to collect the winning sales scripts, which largely boost the success rate for sales and can be shared across the sales teams. In this work, we present the SmartSales system to serve both the sales representatives and managers to attain the sales insights from the large-scale sales ch…

    Submitted 19 April, 2022; originally announced April 2022.

    Comments: Work in progress. The first two authors contribute equally

  25. arXiv:2203.06583  [pdf]

    cs.SD cs.AI eess.AS

    Bi-Sampling Approach to Classify Music Mood leveraging Raga-Rasa Association in Indian Classical Music

    Authors: Mohan Rao B C, Vinayak Arkachaari, Harsha M N, Sushmitha M N, Gayathri Ramesh K K, Ullas M S, Pathi Mohan Rao, Sudha G, Narayana Darapaneni

    Abstract: The impact of Music on the mood or emotion of the listener is a well-researched area in human psychology and behavioral science. In Indian classical music, ragas are the melodic structure that defines the various styles and forms of the music. Each raga has been found to evoke a specific emotion in the listener. With the advent of advanced capabilities of audio signal processing and the applicatio…

    Submitted 13 March, 2022; originally announced March 2022.

  26. Improving Across-Dataset Brain Tissue Segmentation Using Transformer

    Authors: Vishwanatha M. Rao, Zihan Wan, Soroush Arabshahi, David J. Ma, Pin-Yu Lee, Ye Tian, Xuzhe Zhang, Andrew F. Laine, Jia Guo

    Abstract: Brain tissue segmentation has demonstrated great utility in quantifying MRI data through Voxel-Based Morphometry and highlighting subtle structural changes associated with various conditions within the brain. However, manual segmentation is highly labor-intensive, and automated approaches have struggled due to properties inherent to MRI acquisition, leaving a great need for an effective segmentati…

    Submitted 31 January, 2023; v1 submitted 21 January, 2022; originally announced January 2022.

    ACM Class: I.4.6

  27. arXiv:2112.03259  [pdf]

    q-bio.QM cs.CV eess.IV

    Novel Local Radiomic Bayesian Classifiers for Non-Invasive Prediction of MGMT Methylation Status in Glioblastoma

    Authors: Mihir Rao

    Abstract: Glioblastoma, an aggressive brain cancer, is amongst the most lethal of all cancers. Expression of the O6-methylguanine-DNA-methyltransferase (MGMT) gene in glioblastoma tumor tissue is of clinical importance as it has a significant effect on the efficacy of Temozolomide, the primary chemotherapy treatment administered to glioblastoma patients. Currently, MGMT methylation is determined through an…

    Submitted 29 November, 2021; originally announced December 2021.

  28. arXiv:2106.15919  [pdf, other]

    cs.CL cs.SD eess.AS

    On joint training with interfaces for spoken language understanding

    Authors: Anirudh Raju, Milind Rao, Gautam Tiwari, Pranav Dheram, Bryan Anderson, Zhe Zhang, Chul Lee, Bach Bui, Ariya Rastrow

    Abstract: Spoken language understanding (SLU) systems extract both text transcripts and semantics associated with intents and slots from input speech utterances. SLU systems usually consist of (1) an automatic speech recognition (ASR) module, (2) an interface module that exposes relevant outputs from ASR, and (3) a natural language understanding (NLU) module. Interfaces in SLU systems carry information on t…

    Submitted 25 July, 2022; v1 submitted 30 June, 2021; originally announced June 2021.

    Comments: Proc. Interspeech 2022

  29. arXiv:2106.02357  [pdf, ps, other]

    cs.LG quant-ph stat.ML

    Adiabatic Quantum Feature Selection for Sparse Linear Regression

    Authors: Surya Sai Teja Desu, P. K. Srijith, M. V. Panduranga Rao, Naveen Sivadasan

    Abstract: Linear regression is a popular machine learning approach to learn and predict real-valued outputs or dependent variables from independent variables or features. In many real-world problems, it is beneficial to perform sparse linear regression to identify important features helpful in predicting the dependent variable. It not only helps in getting interpretable results but also avoids overfitting whe…

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: 8 pages, 2 tables

  30. Listen with Intent: Improving Speech Recognition with Audio-to-Intent Front-End

    Authors: Swayambhu Nath Ray, Minhua Wu, Anirudh Raju, Pegah Ghahremani, Raghavendra Bilgi, Milind Rao, Harish Arsikere, Ariya Rastrow, Andreas Stolcke, Jasha Droppo

    Abstract: Comprehending the overall intent of an utterance helps a listener recognize the individual words spoken. Inspired by this fact, we perform a novel study of the impact of explicitly incorporating intent representations as additional information to improve a recurrent neural network-transducer (RNN-T) based automatic speech recognition (ASR) system. An audio-to-intent (A2I) model encodes the intent…

    Submitted 16 June, 2021; v1 submitted 14 May, 2021; originally announced May 2021.

    Comments: To appear in Interspeech 2021

    Journal ref: Proc. Interspeech, Sept. 2021, pp. 3455-3459

  31. arXiv:2103.11890  [pdf, other]

    eess.SP cs.IT

    Coexistence of Communications and Cognitive MIMO Radar: Waveform Design and Prototype

    Authors: Mohammad Alaee-Kerahroodi, Ehsan Raei, Sumit Kumar, Bhavani Shankar Mysore Rama Rao

    Abstract: The new generation of radar systems will need to coexist with other radio frequency (RF) systems, anticipating their behavior and reacting appropriately to avoid interference. In light of this requirement, this paper designs, implements, and evaluates the performance of phase-only sequences (with constant power) for intelligent spectrum utilization using the custom-built cognitive Multiple Input Multi…

    Submitted 8 March, 2021; originally announced March 2021.

    Comments: 13 pages, 17 figures

  32. arXiv:2102.06750  [pdf, other]

    cs.CL eess.AS

    Do as I mean, not as I say: Sequence Loss Training for Spoken Language Understanding

    Authors: Milind Rao, Pranav Dheram, Gautam Tiwari, Anirudh Raju, Jasha Droppo, Ariya Rastrow, Andreas Stolcke

    Abstract: Spoken language understanding (SLU) systems extract transcriptions, as well as semantics of intent or named entities from speech, and are essential components of voice activated systems. SLU models, which either directly extract semantics from audio or are composed of pipelined automatic speech recognition (ASR) and natural language understanding (NLU) models, are typically trained via differentia…

    Submitted 12 February, 2021; originally announced February 2021.

    Comments: Proc. IEEE ICASSP 2021

  33. arXiv:2101.02573  [pdf, other]

    cs.CR cs.LG

    RANK: AI-assisted End-to-End Architecture for Detecting Persistent Attacks in Enterprise Networks

    Authors: Hazem M. Soliman, Geoff Salmon, Dušan Sovilj, Mohan Rao

    Abstract: Advanced Persistent Threats (APTs) are sophisticated multi-step attacks, planned and executed by skilled adversaries targeting modern government and enterprise networks. Intrusion Detection Systems (IDSs) and User and Entity Behavior Analytics (UEBA) are commonly employed to aid a security analyst in the detection of APTs. The prolonged nature of APTs, combined with the granular focus of UEBA and…

    Submitted 6 January, 2021; originally announced January 2021.

  34. arXiv:2011.06455  [pdf]

    cs.GT physics.soc-ph q-bio.PE

    Optimal governance and implementation of vaccination programmes to contain the COVID-19 pandemic

    Authors: Mahendra Piraveenan, Shailendra Sawleshwarkar, Michael Walsh, Iryna Zablotska, Samit Bhattacharyya, Habib Hassan Farooqui, Tarun Bhatnagar, Anup Karan, Manoj Murhekar, Sanjay Zodpey, K. S. Mallikarjuna Rao, Philippa Pattison, Albert Zomaya, Matjaz Perc

    Abstract: Since the recent introduction of several viable vaccines for SARS-CoV-2, vaccination uptake has become the key factor that will determine our success in containing the COVID-19 pandemic. We argue that game theory and social network models should be used to guide decisions pertaining to vaccination programmes for the best possible results. In the months following the introduction of vaccines, their…

    Submitted 9 June, 2021; v1 submitted 12 November, 2020; originally announced November 2020.

    Comments: 15 pages, 1 figure; published in Royal Society Open Science

    Journal ref: R. Soc. Open Sci. 8, 210429 (2021)

  35. arXiv:2010.11692  [pdf, other]

    cs.CV cs.AI

    Conversion and Implementation of State-of-the-Art Deep Learning Algorithms for the Classification of Diabetic Retinopathy

    Authors: Mihir Rao, Michelle Zhu, Tianyang Wang

    Abstract: Diabetic retinopathy (DR) is a retinal microvascular condition that emerges in diabetic patients. DR will continue to be a leading cause of blindness worldwide, with a predicted 191.0 million globally diagnosed patients in 2030. Microaneurysms, hemorrhages, exudates, and cotton wool spots are common signs of DR. However, they can be small and hard for human eyes to detect. Early detection of DR is…

    Submitted 7 October, 2020; originally announced October 2020.

    Comments: Pre-print version (in-review)

  36. arXiv:2010.04777  [pdf, other]

    cs.LG cs.NI

    A Graph Neural Network Approach for Scalable and Dynamic IP Similarity in Enterprise Networks

    Authors: Hazem M. Soliman, Geoff Salmon, Dusan Sovilij, Mohan Rao

    Abstract: Measuring similarity between IP addresses is an important task in the daily operations of any enterprise network. Applications that depend on an IP similarity measure include measuring correlation between security alerts, building baselines for behavioral modelling, debugging network failures and tracking persistent attacks. However, IPs do not have a natural similarity measure by definition. Deep…

    Submitted 9 October, 2020; originally announced October 2020.

  37. Speech To Semantics: Improve ASR and NLU Jointly via All-Neural Interfaces

    Authors: Milind Rao, Anirudh Raju, Pranav Dheram, Bach Bui, Ariya Rastrow

    Abstract: We consider the problem of spoken language understanding (SLU) of extracting natural language intents and associated slot arguments or named entities from speech that is primarily directed at voice assistants. Such a system subsumes both automatic speech recognition (ASR) as well as natural language understanding (NLU). An end-to-end joint SLU model can be built to a required specification opening…

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: Proceedings of INTERSPEECH

    ACM Class: I.2.7

    Journal ref: Proc. Interspeech 2020, 876-880 (2020)

  38. Underlay Radar-Massive MIMO Spectrum Sharing: Modeling Fundamentals and Performance Analysis

    Authors: Raghunandan M. Rao, Harpreet S. Dhillon, Vuk Marojevic, Jeffrey H. Reed

    Abstract: In this work, we study underlay radar-massive MIMO cellular coexistence in LoS/near-LoS channels, where both systems have 3D beamforming capabilities. Using mathematical tools from stochastic geometry, we derive an upper bound on the average interference power at the radar due to the 3D massive MIMO cellular downlink under the worst-case 'cell-edge beamforming' conditions. To overcome the technica…

    Submitted 16 May, 2021; v1 submitted 3 August, 2020; originally announced August 2020.

    Comments: This arXiv manuscript subsumes the contents of the conference paper presented at the 2019 IEEE Global Communications Conference (Globecom), Waikoloa, HI. The conference version is available at arXiv:1907.09536

  39. Semi-Blind Post-Equalizer SINR Estimation and Dual CSI Feedback for Radar-Cellular Coexistence

    Authors: Raghunandan M. Rao, Vuk Marojevic, Jeffrey H. Reed

    Abstract: Current cellular systems use pilot-aided statistical-channel state information (S-CSI) estimation and limited feedback schemes to aid in link adaptation and scheduling decisions. However, in the presence of pulsed radar signals, pilot-aided S-CSI is inaccurate since interference statistics on pilot and non-pilot resources can be different. Moreover, the channel will be bimodal as a result of the p…

    Submitted 1 June, 2020; originally announced June 2020.

    Comments: 33 pages, 26 figures

  40. arXiv:2005.00122  [pdf, other]

    cs.NI eess.SP

    Probability of Pilot Interference in Pulsed Radar-Cellular Coexistence: Fundamental Insights on Demodulation and Limited CSI Feedback

    Authors: Raghunandan M. Rao, Vuk Marojevic, Jeffrey H. Reed

    Abstract: This paper considers an underlay pulsed radar-cellular spectrum sharing scenario, where the cellular system uses pilot-aided demodulation, statistical channel state information (S-CSI) estimation and limited feedback schemes. Under a realistic system model, upper and lower bounds are derived on the probability that at least a specified number of pilot signals are interfered by a radar pulse train…

    Submitted 30 April, 2020; originally announced May 2020.

    Comments: 13 pages, 5 figures

  41. arXiv:2002.04638  [pdf, other]

    cs.DS cs.DC

    A polynomial time parallel algorithm for graph isomorphism using a quasipolynomial number of processors

    Authors: Duc Hung Pham, Krishna V. Palem, M. V. Panduranga Rao

    Abstract: The Graph Isomorphism (GI) problem is a theoretically interesting problem because it has been proven neither to be in P nor to be NP-complete. Babai made a breakthrough in 2015 when announcing a quasipolynomial time algorithm for the GI problem. Babai's work gives the most theoretically efficient algorithm for GI, as well as strong evidence favoring the idea that class GI $\ne$ NP and thus P $\ne$ NP. B…

    Submitted 11 February, 2020; originally announced February 2020.

    Comments: ICALP conference submission preprint

  42. arXiv:1911.10478  [pdf, other]

    cs.LO cs.FL

    The Bouquet Algorithm for Model Checking Unbounded Until

    Authors: Shiraj Arora, M. V. Panduranga Rao

    Abstract: The problem of verifying the "Unbounded Until" fragment in temporal logic formulas has been studied extensively in the past, especially in the context of statistical model checking. Statistical model checking, a computationally inexpensive sampling based alternative to the more expensive numerical model checking technique, presents the following decision dilemma -- what length of the sample is eno…

    Submitted 24 November, 2019; originally announced November 2019.

  43. Planar graphs without normally adjacent short cycles

    Authors: Fangyao Lu, Mengjiao Rao, Qianqian Wang, Tao Wang

    Abstract: Let $\mathscr{G}$ be the class of plane graphs without triangles normally adjacent to $8^{-}$-cycles, without $4$-cycles normally adjacent to $6^{-}$-cycles, and without normally adjacent $5$-cycles. In this paper, it is shown that every graph in $\mathscr{G}$ is $3$-choosable. Instead of proving this result, we directly prove a stronger result in the form of ``weakly'' DP-$3$-coloring. The main t…

    Submitted 10 June, 2022; v1 submitted 13 August, 2019; originally announced August 2019.

    Comments: 17 pages, 3 figures

    MSC Class: 05C15

    Journal ref: Discrete Mathematics, 345 (2022) 112986

  44. arXiv:1907.09536  [pdf, other]

    cs.NI eess.SP

    Analysis of Worst-Case Interference in Underlay Radar-Massive MIMO Spectrum Sharing Scenarios

    Authors: Raghunandan M. Rao, Harpreet S. Dhillon, Vuk Marojevic, Jeffrey H. Reed

    Abstract: In this paper, we consider an underlay radar-massive MIMO spectrum sharing scenario in which massive MIMO base stations (BSs) are allowed to operate outside a circular exclusion zone centered at the radar. Modeling the locations of the massive MIMO BSs as a homogeneous Poisson point process (PPP), we derive an analytical expression for a tight upper bound on the average interference at the radar d…

    Submitted 22 July, 2019; originally announced July 2019.

    Comments: 6 pages, 3 figures

  45. arXiv:1904.09984  [pdf, other]

    cs.DC cs.OS

    IOArbiter: Dynamic Provisioning of Backend Block Storage in the Cloud

    Authors: Moo-Ryong Ra, Hee Won Lee

    Abstract: With the advent of virtualization technology, cloud computing realizes on-demand computing. The capability of dynamic resource provisioning is a fundamental driving factor for users to adopt cloud technology. This aspect is also important for cloud service providers seeking to optimize the cost of running the infrastructure. Despite many technological advances in related areas, however, it is st…

    Submitted 23 April, 2019; originally announced April 2019.

    Comments: 7 pages, 3 figures

  46. arXiv:1904.03710  [pdf, other]

    cs.CV eess.IV

    Planar Geometry and Image Recovery from Motion-Blur

    Authors: Kuldeep Purohit, Subeesh Vasu, M. Purnachandra Rao, A. N. Rajagopalan

    Abstract: Existing works on motion deblurring either ignore the effects of depth-dependent blur or work with the assumption of a multi-layered scene wherein each layer is modeled as a fronto-parallel plane. In this work, we consider the case of 3D scenes with piecewise planar structure, i.e., a scene that can be modeled as a combination of multiple planes with arbitrary orientations. We first propo…

    Submitted 6 February, 2022; v1 submitted 7 April, 2019; originally announced April 2019.

  47. arXiv:1902.04067  [pdf, other]

    cs.NI cs.MM cs.PF

    Multi-tier Caching Analysis in CDN-based Over-the-top Video Streaming Systems

    Authors: Abubakr O. Al-Abbasi, Vaneet Aggarwal, Moo-Ryong Ra

    Abstract: Internet video traffic has been rapidly increasing and is expected to increase further with emerging 5G applications such as higher definition videos, IoT and augmented/virtual reality applications. As end-users consume video in massive amounts and in an increasing number of ways, the content distribution network (CDN) should be efficiently managed to improve the system efficiency. The st…

    Submitted 10 February, 2019; originally announced February 2019.

    Comments: Accepted to IEEE/ACM TON, 2019. arXiv admin note: substantial text overlap with arXiv:1807.01147

  48. arXiv:1901.02574  [pdf, other]

    cs.NI

    Analysis of Non-Pilot Interference on Link Adaptation and Latency in Cellular Networks

    Authors: Raghunandan M. Rao, Vuk Marojevic, Jeffrey H. Reed

    Abstract: Modern wireless systems such as the Long-Term Evolution (LTE) and 5G New Radio (5G NR) use pilot-aided SINR estimates to adapt the transmission mode and the modulation and coding scheme (MCS) of data transmissions, maximizing the utility of the wireless channel capacity. However, when interference is localized exclusively on non-pilot resources, pilot-aided SINR estimates become inaccurate. We sho…

    Submitted 8 January, 2019; originally announced January 2019.

    Comments: 6 pages, 9 figures, accepted for publication at the 89th IEEE Vehicular Technology Conference (IEEE VTC Spring 2019)

  49. The 2-domination and Roman domination numbers of grid graphs

    Authors: Michaël Rao, Alexandre Talon

    Abstract: We investigate the 2-domination number for grid graphs, that is, the size of a smallest set $D$ of vertices of the grid such that each vertex of the grid belongs to $D$ or has at least two neighbours in $D$. We give a closed formula giving the 2-domination number of any $n \!\times\! m$ grid, thereby confirming the results found by Lu and Xu, and Shaheen et al. for $n \leq 4$ and slightly correct th…

    Submitted 17 May, 2019; v1 submitted 30 October, 2018; originally announced October 2018.

    Comments: 11 pages, 5 figures, presented at ICGT 2018. The program that led to the results is included in the Source directory (see Other formats). Accepted in DMTCS vol 21. Journal version with their template

    Journal ref: Discrete Mathematics & Theoretical Computer Science, vol. 21 no. 1, ICGT 2018 (May 23, 2019) dmtcs:4952

  50. arXiv:1810.12457  [pdf, ps, other]

    cs.DC cs.LG cs.MA eess.SP

    Distributed Convex Optimization With Limited Communications

    Authors: Milind Rao, Stefano Rini, Andrea Goldsmith

    Abstract: In this paper, a distributed convex optimization algorithm, termed \emph{distributed coordinate dual averaging} (DCDA) algorithm, is proposed. The DCDA algorithm addresses the scenario of a large distributed optimization problem with limited communication among nodes in the network. Currently known distributed subgradient methods, such as the distributed dual averaging or the distributed alternati…

    Submitted 29 October, 2018; originally announced October 2018.

    Comments: Extended version of submission to IEEE ICASSP 2019