Zum Hauptinhalt springen

Showing 1–50 of 123 results for author: Rao, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.20685  [pdf, other

    cs.ET cs.CL

    CultureVo: The Serious Game of Utilizing Gen AI for Enhancing Cultural Intelligence

    Authors: Ajita Agarwala, Anupam Purwar, Viswanadhasai Rao

    Abstract: CultureVo, Inc. has developed the Integrated Culture Learning Suite (ICLS) to deliver foundational knowledge of world cultures through a combination of interactive lessons and gamified experiences. This paper explores how Generative AI powered by open source Large Langauge Models are utilized within the ICLS to enhance cultural intelligence. The suite employs Generative AI techniques to automate t… ▽ More

    Submitted 1 August, 2024; v1 submitted 30 July, 2024; originally announced July 2024.

    Comments: Fourth International Conference on AI-ML Systems, 8-11 October, 2024, Louisiana, USA

  2. arXiv:2407.18919  [pdf

    cs.LG q-bio.QM

    Accelerating Drug Safety Assessment using Bidirectional-LSTM for SMILES Data

    Authors: K. Venkateswara Rao, Kunjam Nageswara Rao, G. Sita Ratnam

    Abstract: Computational methods are useful in accelerating the pace of drug discovery. Drug discovery carries several steps such as target identification and validation, lead discovery, and lead optimisation etc., In the phase of lead optimisation, the absorption, distribution, metabolism, excretion, and toxicity properties of lead compounds are assessed. To address the issue of predicting toxicity and solu… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 10 pages

  3. arXiv:2407.11149  [pdf

    cs.NE

    BMR and BWR: Two simple metaphor-free optimization algorithms for solving real-life non-convex constrained and unconstrained problems

    Authors: Ravipudi Venkata Rao, Ravikumar shah

    Abstract: This paper presents two simple yet powerful optimization algorithms named Best-Mean-Random (BMR) and Best-Worst-Randam (BWR) algorithms to handle both constrained and unconstrained optimization problems. These algorithms are free of metaphors and algorithm-specific parameters. The BMR algorithm is based on the best, mean, and random solutions of the population generated for solving a given problem… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 28 pages, 5 figures, original paper

    ACM Class: C.1.3; I.2.6; I.5

  4. arXiv:2406.19150  [pdf, other

    cs.CV cs.AI cs.IR

    RAVEN: Multitask Retrieval Augmented Vision-Language Learning

    Authors: Varun Nagaraj Rao, Siddharth Choudhary, Aditya Deshpande, Ravi Kumar Satzoda, Srikar Appalaraju

    Abstract: The scaling of large language models to encode all the world's knowledge in model parameters is unsustainable and has exacerbated resource barriers. Retrieval-Augmented Generation (RAG) presents a potential solution, yet its application to vision-language models (VLMs) is under explored. Existing methods focus on models designed for single tasks. Furthermore, they're limited by the need for resour… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  5. arXiv:2406.10768  [pdf, other

    cs.CY cs.HC

    Rideshare Transparency: Translating Gig Worker Insights on AI Platform Design to Policy

    Authors: Varun Nagaraj Rao, Samantha Dalal, Eesha Agarwal, Dana Calacci, Andrés Monroy-Hernández

    Abstract: Rideshare platforms exert significant control over workers through algorithmic systems that can result in financial, emotional, and physical harm. What steps can platforms, designers, and practitioners take to mitigate these negative impacts and meet worker needs? In this paper, through a novel mixed methods study combining a LLM-based analysis of over 1 million comments posted to online platform… ▽ More

    Submitted 19 June, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

  6. arXiv:2405.18322  [pdf, other

    cs.CV cs.AI

    SCE-MAE: Selective Correspondence Enhancement with Masked Autoencoder for Self-Supervised Landmark Estimation

    Authors: Kejia Yin, Varshanth R. Rao, Ruowei Jiang, Xudong Liu, Parham Aarabi, David B. Lindell

    Abstract: Self-supervised landmark estimation is a challenging task that demands the formation of locally distinct feature representations to identify sparse facial landmarks in the absence of annotated data. To tackle this task, existing state-of-the-art (SOTA) methods (1) extract coarse features from backbones that are trained with instance-level self-supervised learning (SSL) paradigms, which neglect the… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted at CVPR 2024

  7. arXiv:2405.05345  [pdf, other

    cs.CL cs.HC

    QuaLLM: An LLM-based Framework to Extract Quantitative Insights from Online Forums

    Authors: Varun Nagaraj Rao, Eesha Agarwal, Samantha Dalal, Dan Calacci, Andrés Monroy-Hernández

    Abstract: Online discussion forums provide crucial data to understand the concerns of a wide range of real-world communities. However, the typical qualitative and quantitative methods used to analyze those data, such as thematic analysis and topic modeling, are infeasible to scale or require significant human effort to translate outputs to human readable forms. This study introduces QuaLLM, a novel LLM-base… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: Accepted to CHI LLM as Research Tools Workshop (2024)

  8. arXiv:2404.10274  [pdf

    cs.AI cs.LG

    Sparse Attention Regression Network Based Soil Fertility Prediction With Ummaso

    Authors: R V Raghavendra Rao, U Srinivasulu Reddy

    Abstract: The challenge of imbalanced soil nutrient datasets significantly hampers accurate predictions of soil fertility. To tackle this, a new method is suggested in this research, combining Uniform Manifold Approximation and Projection (UMAP) with Least Absolute Shrinkage and Selection Operator (LASSO). The main aim is to counter the impact of uneven data distribution and improve soil fertility models' p… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  9. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1110 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 8 August, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  10. arXiv:2401.05683  [pdf, other

    cs.GT cs.AI

    Deep Learning Meets Mechanism Design: Key Results and Some Novel Applications

    Authors: V. Udaya Sankar, Vishisht Srihari Rao, Y. Narahari

    Abstract: Mechanism design is essentially reverse engineering of games and involves inducing a game among strategic agents in a way that the induced game satisfies a set of desired properties in an equilibrium of the game. Desirable properties for a mechanism include incentive compatibility, individual rationality, welfare maximisation, revenue maximisation (or cost minimisation), fairness of allocation, et… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  11. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  12. arXiv:2312.08267  [pdf, other

    eess.IV cs.CV q-bio.QM

    TABSurfer: a Hybrid Deep Learning Architecture for Subcortical Segmentation

    Authors: Aaron Cao, Vishwanatha M. Rao, Kejia Liu, Xinru Liu, Andrew F. Laine, Jia Guo

    Abstract: Subcortical segmentation remains challenging despite its important applications in quantitative structural analysis of brain MRI scans. The most accurate method, manual segmentation, is highly labor intensive, so automated tools like FreeSurfer have been adopted to handle this task. However, these traditional pipelines are slow and inefficient for processing large datasets. In this study, we propo… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

    Comments: 5 pages, 3 figures, 2 tables

  13. arXiv:2311.17705  [pdf, other

    cs.SE

    Q-PAC: Automated Detection of Quantum Bug-Fix Patterns

    Authors: Pranav K. Nayak, Krishn V. Kher, M. Bharat Chandra, M. V. Panduranga Rao, Lei Zhang

    Abstract: Context: Bug-fix pattern detection has been investigated in the past in the context of classical software. However, while quantum software is developing rapidly, the literature still lacks automated methods and tools to identify, analyze, and detect bug-fix patterns. To the best of our knowledge, our work previously published in SEKE'23 was the first to leverage classical techniques to detect bug-… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: 16 pages, 2 figures

  14. Exploding AI Power Use: an Opportunity to Rethink Grid Planning and Management

    Authors: Liuzixuan Lin, Rajini Wijayawardana, Varsha Rao, Hai Nguyen, Wedan Emmanuel Gnibga, Andrew A. Chien

    Abstract: The unprecedented rapid growth of computing demand for AI is projected to increase global annual datacenter (DC) growth from 7.2% to 11.3%. We project the 5-year AI DC demand for several power grids and assess whether they will allow desired AI growth (resource adequacy). If not, several "desperate measures" -- grid policies that enable more load growth and maintain grid reliability by sacrificing… ▽ More

    Submitted 30 April, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: Accepted by ACM e-Energy '24: the 15th ACM International Conference on Future and Sustainable Energy Systems

  15. arXiv:2310.05972  [pdf, other

    cs.ET

    Normality of I-V Measurements Using ML

    Authors: Anees Al-Najjar, Nageswara S. V. Rao, Craig A. Bridges, Sheng Dai

    Abstract: Electrochemistry ecosystems are promising for accelerating the design and discovery of electrochemical systems for energy storage and conversion, by automating significant parts of workflows that combine synthesis and characterization experiments with computations. They require the integration of flow controllers, solvent containers, pumps, fraction collectors, and potentiostats, all connected to… ▽ More

    Submitted 28 September, 2023; originally announced October 2023.

    Comments: published at eScience 2023

    Journal ref: in 2023 IEEE 19th International Conference on e-Science (e-Science), Limassol, Cyprus, 2023 pp. 1-2

  16. arXiv:2309.17147  [pdf, other

    cs.CL cs.AI econ.GN

    Using Large Language Models for Qualitative Analysis can Introduce Serious Bias

    Authors: Julian Ashwin, Aditya Chhabra, Vijayendra Rao

    Abstract: Large Language Models (LLMs) are quickly becoming ubiquitous, but the implications for social science research are not yet well understood. This paper asks whether LLMs can help us analyse large-N qualitative data from open-ended interviews, with an application to transcripts of interviews with Rohingya refugees in Cox's Bazaar, Bangladesh. We find that a great deal of caution is needed in using L… ▽ More

    Submitted 5 October, 2023; v1 submitted 29 September, 2023; originally announced September 2023.

  17. arXiv:2307.06883  [pdf, other

    cs.OH physics.ins-det

    Cyber Framework for Steering and Measurements Collection Over Instrument-Computing Ecosystems

    Authors: Anees Al-Najjar, Nageswara S. V. Rao, Ramanan Sankaran, Helia Zandi, Debangshu Mukherjee, Maxim Ziatdinov, Craig Bridges

    Abstract: We propose a framework to develop cyber solutions to support the remote steering of science instruments and measurements collection over instrument-computing ecosystems. It is based on provisioning separate data and control connections at the network level, and developing software modules consisting of Python wrappers for instrument commands and Pyro server-client codes that make them available ac… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: Paper accepted for presentation at IEEE SMARTCOMP 2023

  18. Discrimination through Image Selection by Job Advertisers on Facebook

    Authors: Varun Nagaraj Rao, Aleksandra Korolova

    Abstract: Targeted advertising platforms are widely used by job advertisers to reach potential employees; thus issues of discrimination due to targeting that have surfaced have received widespread attention. Advertisers could misuse targeting tools to exclude people based on gender, race, location and other protected attributes from seeing their job ads. In response to legal actions, Facebook disabled the a… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: Published in FAccT 2023

  19. arXiv:2306.04733  [pdf, other

    physics.soc-ph cs.SI

    Epidemic spreading in group-structured populations

    Authors: Siddharth Patwardhan, Varun K. Rao, Santo Fortunato, Filippo Radicchi

    Abstract: Individuals involved in common group activities/settings -- e.g., college students that are enrolled in the same class and/or live in the same dorm -- are exposed to recurrent contacts of physical proximity. These contacts are known to mediate the spread of an infectious disease, however, it is not obvious how the properties of the spreading process are determined by the structure of and the inter… ▽ More

    Submitted 20 December, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: 10 pages, 4 figures + Supplemental Material

    Journal ref: Phys. Rev. X 13, 041054 (2023)

  20. arXiv:2305.19444  [pdf

    cs.HC

    Pixelated Interactions: Exploring Pixel Art for Graphical Primitives on a Tactile Display

    Authors: Tigmanshu Bhatnagar, Vikas Upadhyay, Anchal Sharma, P V Madhusudhan Rao, Mark Miodownik, Nicolai Marquardt, Catherine Holloway

    Abstract: Two-dimensional pin array tactile displays enable access to tactile graphics that are important for the education of students with visual impairments. Due to their prohibitive cost, limited access, and limited research within HCI, the rules to design graphical primitives on these low-resolution tactile displays are unclear. In this paper, eight tactile readers with visual impairments qualitatively… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: 25 pages, 10 figures. To appear in DIS'23 Designing Interactive Systems Conference, July 10 to 14, 2023, Pittsburgh, PA, USA

  21. arXiv:2303.02043  [pdf, other

    cs.RO eess.SY

    An Integrated Real-time UAV Trajectory Optimization with Potential Field Approach for Dynamic Collision Avoidance

    Authors: D. M. K. K. Venkateswara Rao, Hamed Habibi, Jose Luis Sanchez-Lopez, Holger Voos

    Abstract: This paper presents an integrated approach that combines trajectory optimization and Artificial Potential Field (APF) method for real-time optimal Unmanned Aerial Vehicle (UAV) trajectory planning and dynamic collision avoidance. A minimum-time trajectory optimization problem is formulated with initial and final positions as boundary conditions and collision avoidance as constraints. It is transcr… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

  22. arXiv:2211.07092  [pdf, ps, other

    stat.ML cs.LG math.ST

    Offline Estimation of Controlled Markov Chains: Minimaxity and Sample Complexity

    Authors: Imon Banerjee, Harsha Honnappa, Vinayak Rao

    Abstract: In this work, we study a natural nonparametric estimator of the transition probability matrices of a finite controlled Markov chain. We consider an offline setting with a fixed dataset, collected using a so-called logging policy. We develop sample complexity bounds for the estimator and establish conditions for minimaxity. Our statistical bounds depend on the logging policy through its mixing prop… ▽ More

    Submitted 26 January, 2024; v1 submitted 13 November, 2022; originally announced November 2022.

    Comments: 71 pages, 23 main

  23. arXiv:2210.09791  [pdf, other

    cs.DC

    Enabling Autonomous Electron Microscopy for Networked Computation and Steering

    Authors: Anees Al-Najjar, Nageswara S. V. Rao, Ramanan Sankaran, Maxim Ziatdinov, Debangshu Mukherjee, Olga Ovchinnikova, Kevin Roccapriore, Andrew R. Lupini, Sergei V. Kalinin

    Abstract: Advanced electron microscopy workflows require an ecosystem of microscope instruments and computing systems possibly located at different sites to conduct remotely steered and automated experiments. Current workflow executions involve manual operations for steering and measurement tasks, which are typically performed from control workstations co-located with microscopes; consequently, their operat… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: 11 pages, 16 figures, accepted at IEEE eScience 2022 conference

  24. arXiv:2207.13310  [pdf, other

    cs.LG math.DS math.PR physics.data-an stat.AP

    Learning the Evolution of Correlated Stochastic Power System Dynamics

    Authors: Tyler E. Maltba, Vishwas Rao, Daniel Adrian Maldonado

    Abstract: A machine learning technique is proposed for quantifying uncertainty in power system dynamics with spatiotemporally correlated stochastic forcing. We learn one-dimensional linear partial differential equations for the probability density functions of real-valued quantities of interest. The method is suitable for high-dimensional systems and helps to alleviate the curse of dimensionality.

    Submitted 27 July, 2022; originally announced July 2022.

    Comments: 5 pages, 2 figures, Accepted to 2022 IEEE PES GM

    Journal ref: IEEE Power & Energy Society General Meeting (2022) 01-05

  25. arXiv:2206.12980  [pdf

    eess.IV cs.CV q-bio.QM

    Detecting Schizophrenia with 3D Structural Brain MRI Using Deep Learning

    Authors: Junhao Zhang, Vishwanatha M. Rao, Ye Tian, Yanting Yang, Nicolas Acosta, Zihan Wan, Pin-Yu Lee, Chloe Zhang, Lawrence S. Kegeles, Scott A. Small, Jia Guo

    Abstract: Schizophrenia is a chronic neuropsychiatric disorder that causes distinct structural alterations within the brain. We hypothesize that deep learning applied to a structural neuroimaging dataset could detect disease-related alteration and improve classification and diagnostic accuracy. We tested this hypothesis using a single, widely available, and conventional T1-weighted MRI scan, from which we e… ▽ More

    Submitted 7 July, 2022; v1 submitted 26 June, 2022; originally announced June 2022.

    Comments: 13 pages, 6 figures

  26. arXiv:2202.11820  [pdf

    cs.LG cs.CE

    Nowcasting the Financial Time Series with Streaming Data Analytics under Apache Spark

    Authors: Mohammad Arafat Ali Khan, Chandra Bhushan, Vadlamani Ravi, Vangala Sarveswara Rao, Shiva Shankar Orsu

    Abstract: This paper proposes nowcasting of high-frequency financial datasets in real-time with a 5-minute interval using the streaming analytics feature of Apache Spark. The proposed 2 stage method consists of modelling chaos in the first stage and then using a sliding window approach for training with machine learning algorithms namely Lasso Regression, Ridge Regression, Generalised Linear Model, Gradient… ▽ More

    Submitted 23 February, 2022; originally announced February 2022.

    Comments: 26 pages; 7 Tables and 11 Figures

    MSC Class: 37M10; 62M10; 91B84 ACM Class: I.2.11; J.4

  27. Improving Across-Dataset Brain Tissue Segmentation Using Transformer

    Authors: Vishwanatha M. Rao, Zihan Wan, Soroush Arabshahi, David J. Ma, Pin-Yu Lee, Ye Tian, Xuzhe Zhang, Andrew F. Laine, Jia Guo

    Abstract: Brain tissue segmentation has demonstrated great utility in quantifying MRI data through Voxel-Based Morphometry and highlighting subtle structural changes associated with various conditions within the brain. However, manual segmentation is highly labor-intensive, and automated approaches have struggled due to properties inherent to MRI acquisition, leaving a great need for an effective segmentati… ▽ More

    Submitted 31 January, 2023; v1 submitted 21 January, 2022; originally announced January 2022.

    ACM Class: I.4.6

  28. arXiv:2112.11547  [pdf, other

    cs.CV cs.AI cs.LG cs.MM

    Decompose the Sounds and Pixels, Recompose the Events

    Authors: Varshanth R. Rao, Md Ibrahim Khalil, Haoda Li, Peng Dai, Juwei Lu

    Abstract: In this paper, we propose a framework centering around a novel architecture called the Event Decomposition Recomposition Network (EDRNet) to tackle the Audio-Visual Event (AVE) localization problem in the supervised and weakly supervised settings. AVEs in the real world exhibit common unravelling patterns (termed as Event Progress Checkpoints (EPC)), which humans can perceive through the cooperati… ▽ More

    Submitted 21 December, 2021; originally announced December 2021.

    Comments: Accepted at AAAI 2022

  29. arXiv:2112.09752  [pdf, other

    cs.LG

    Set Twister for Single-hop Node Classification

    Authors: Yangze Zhou, Vinayak Rao, Bruno Ribeiro

    Abstract: Node classification is a central task in relational learning, with the current state-of-the-art hinging on two key principles: (i) predictions are permutation-invariant to the ordering of a node's neighbors, and (ii) predictions are a function of the node's $r$-hop neighborhood topology and attributes, $r \geq 2$. Both graph neural networks and collective inference methods (e.g., belief propagatio… ▽ More

    Submitted 17 December, 2021; originally announced December 2021.

    Comments: Accepted for presentation at the 2nd GCLR workshop in conjunction with AAAI 2022

  30. arXiv:2111.07552  [pdf, other

    eess.SY cs.RO

    Dynamic Placement of Rapidly Deployable Mobile Sensor Robots Using Machine Learning and Expected Value of Information

    Authors: Alice Agogino, Hae Young Jang, Vivek Rao, Ritik Batra, Felicity Liao, Rohan Sood, Irving Fang, R. Lily Hu, Emerson Shoichet-Bartus, John Matranga

    Abstract: Although the Industrial Internet of Things has increased the number of sensors permanently installed in industrial plants, there will be gaps in coverage due to broken sensors or sparse density in very large plants, such as in the petrochemical industry. Modern emergency response operations are beginning to use Small Unmanned Aerial Systems (sUAS) that have the ability to drop sensor robots to pre… ▽ More

    Submitted 15 November, 2021; originally announced November 2021.

    Comments: 14 pages, 11 figures, IMECE2021

  31. arXiv:2111.03808  [pdf

    cs.LG

    Contextual Unsupervised Outlier Detection in Sequences

    Authors: Mohamed A. Zahran, Leonardo Teixeira, Vinayak Rao, Bruno Ribeiro

    Abstract: This work proposes an unsupervised learning framework for trajectory (sequence) outlier detection that combines ranking tests with user sequence models. The overall framework identifies sequence outliers at a desired false positive rate (FPR), in an otherwise parameter-free manner. We evaluate our methodology on a collection of real and simulated datasets based on user actions at the websites last… ▽ More

    Submitted 6 November, 2021; originally announced November 2021.

    Comments: 11 pages

  32. arXiv:2111.01634  [pdf, other

    cs.NI

    Towards Enabling High-Five Over WiFi

    Authors: Vineet Gokhale, Mohamad Eid, Kees Kroep, R. Venkatesha Prasad, Vijay Rao

    Abstract: The next frontier for immersive applications is enabling sentience over the Internet. Tactile Internet (TI) envisages transporting skills by providing Ultra-Low Latency (ULL) communications for transporting touch senses. In this work, we focus our study on the first/last mile communication, where the future generation WiFi-7 is pitched as the front-runner for ULL applications. We discuss a few can… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

  33. arXiv:2110.11489  [pdf, ps, other

    cs.AR cs.LG

    Supporting Massive DLRM Inference Through Software Defined Memory

    Authors: Ehsan K. Ardestani, Changkyu Kim, Seung Jae Lee, Luoshang Pan, Valmiki Rampersad, Jens Axboe, Banit Agrawal, Fuxun Yu, Ansha Yu, Trung Le, Hector Yuen, Shishir Juluri, Akshat Nanda, Manoj Wodekar, Dheevatsa Mudigere, Krishnakumar Nair, Maxim Naumov, Chris Peterson, Mikhail Smelyanskiy, Vijay Rao

    Abstract: Deep Learning Recommendation Models (DLRM) are widespread, account for a considerable data center footprint, and grow by more than 1.5x per year. With model size soon to be in terabytes range, leveraging Storage ClassMemory (SCM) for inference enables lower power consumption and cost. This paper evaluates the major challenges in extending the memory hierarchy to SCM for DLRM, and presents differen… ▽ More

    Submitted 8 November, 2021; v1 submitted 21 October, 2021; originally announced October 2021.

    Comments: 14 pages, 5 figures

  34. arXiv:2108.10971  [pdf

    cs.CV

    An Effective Pixel-Wise Approach for Skin Colour Segmentation Using Pixel Neighbourhood Technique

    Authors: Tejas Dastane, Varun Rao, Kartik Shenoy, Devendra Vyavaharkar

    Abstract: This paper presents a novel technique for skin colour segmentation that overcomes the limitations faced by existing techniques such as Colour Range Thresholding. Skin colour segmentation is affected by the varied skin colours and surrounding lighting conditions, leading to poorskin segmentation for many techniques. We propose a new two stage Pixel Neighbourhood technique that classifies any pixel… ▽ More

    Submitted 24 August, 2021; originally announced August 2021.

    Comments: 5 pages

    Journal ref: International Journal on Recent and Innovation Trends in Computing and Communication 2018, Volume: 6, Issue: 3, pp. 182-186

  35. arXiv:2108.10970  [pdf

    cs.CV cs.HC

    Real-time Indian Sign Language (ISL) Recognition

    Authors: Kartik Shenoy, Tejas Dastane, Varun Rao, Devendra Vyavaharkar

    Abstract: This paper presents a system which can recognise hand poses & gestures from the Indian Sign Language (ISL) in real-time using grid-based features. This system attempts to bridge the communication gap between the hearing and speech impaired and the rest of the society. The existing solutions either provide relatively low accuracy or do not work in real-time. This system provides good results on bot… ▽ More

    Submitted 24 August, 2021; originally announced August 2021.

    Comments: 9 pages

    Journal ref: 9th International Conference on Communication and Network Technology 2018

  36. arXiv:2108.10168  [pdf

    cs.AI

    CGEMs: A Metric Model for Automatic Code Generation using GPT-3

    Authors: Aishwarya Narasimhan, Krishna Prasad Agara Venkatesha Rao, Veena M B

    Abstract: Today, AI technology is showing its strengths in almost every industry and walks of life. From text generation, text summarization, chatbots, NLP is being used widely. One such paradigm is automatic code generation. An AI could be generating anything; hence the output space is unconstrained. A self-driving car is driven for 100 million miles to validate its safety, but tests cannot be written to m… ▽ More

    Submitted 23 August, 2021; originally announced August 2021.

    Comments: 11 pages, 6 figures, 2 tables

  37. arXiv:2108.00965  [pdf, other

    cs.CR cs.CY stat.CO

    Privacy-Aware Rejection Sampling

    Authors: Jordan Awan, Vinayak Rao

    Abstract: Differential privacy (DP) offers strong theoretical privacy guarantees, but implementations of DP mechanisms may be vulnerable to side-channel attacks, such as timing attacks. When sampling methods such as MCMC or rejection sampling are used to implement a mechanism, the runtime can leak private information. We characterize the additional privacy cost due to the runtime of a rejection sampler in t… ▽ More

    Submitted 29 September, 2022; v1 submitted 2 August, 2021; originally announced August 2021.

    Comments: 25 pages + references, 4 figures

  38. arXiv:2107.04140  [pdf, other

    cs.AR

    First-Generation Inference Accelerator Deployment at Facebook

    Authors: Michael Anderson, Benny Chen, Stephen Chen, Summer Deng, Jordan Fix, Michael Gschwind, Aravind Kalaiah, Changkyu Kim, Jaewon Lee, Jason Liang, Haixin Liu, Yinghai Lu, Jack Montgomery, Arun Moorthy, Satish Nadathur, Sam Naghshineh, Avinash Nayak, Jongsoo Park, Chris Petersen, Martin Schatz, Narayanan Sundaram, Bangsheng Tang, Peter Tang, Amy Yang, Jiecao Yu , et al. (90 additional authors not shown)

    Abstract: In this paper, we provide a deep dive into the deployment of inference accelerators at Facebook. Many of our ML workloads have unique characteristics, such as sparse memory accesses, large model sizes, as well as high compute, memory and network bandwidth requirements. We co-designed a high-performance, energy-efficient inference accelerator platform based on these requirements. We describe the in… ▽ More

    Submitted 4 August, 2021; v1 submitted 8 July, 2021; originally announced July 2021.

  39. arXiv:2106.12199  [pdf, other

    math.ST cs.LG math.OC stat.ME

    Bayesian Joint Chance Constrained Optimization: Approximations and Statistical Consistency

    Authors: Prateek Jaiswal, Harsha Honnappa, Vinayak A. Rao

    Abstract: This paper considers data-driven chance-constrained stochastic optimization problems in a Bayesian framework. Bayesian posteriors afford a principled mechanism to incorporate data and prior knowledge into stochastic optimization problems. However, the computation of Bayesian posteriors is typically an intractable problem, and has spawned a large literature on approximate Bayesian computation. Here… ▽ More

    Submitted 30 September, 2022; v1 submitted 23 June, 2021; originally announced June 2021.

  40. arXiv:2106.02357  [pdf, ps, other

    cs.LG quant-ph stat.ML

    Adiabatic Quantum Feature Selection for Sparse Linear Regression

    Authors: Surya Sai Teja Desu, P. K. Srijith, M. V. Panduranga Rao, Naveen Sivadasan

    Abstract: Linear regression is a popular machine learning approach to learn and predict real valued outputs or dependent variables from independent variables or features. In many real world problems, its beneficial to perform sparse linear regression to identify important features helpful in predicting the dependent variable. It not only helps in getting interpretable results but also avoids overfitting whe… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: 8 pages, 2 tables

  41. arXiv:2105.02626  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    A First Look: Towards Explainable TextVQA Models via Visual and Textual Explanations

    Authors: Varun Nagaraj Rao, Xingjian Zhen, Karen Hovsepian, Mingwei Shen

    Abstract: Explainable deep learning models are advantageous in many situations. Prior work mostly provide unimodal explanations through post-hoc approaches not part of the original system design. Explanation mechanisms also ignore useful textual information present in images. In this paper, we propose MTXNet, an end-to-end trainable multimodal architecture to generate multimodal explanations, which focuses… ▽ More

    Submitted 28 April, 2021; originally announced May 2021.

    Comments: This paper is done when Xingjian was an intern in Amazon PARS group, summer 2020. This paper is accepted by NAACL-MAI-Workshop, 2021

  42. arXiv:2104.14538  [pdf, other

    cs.LG stat.ML

    Distributed Multigrid Neural Solvers on Megavoxel Domains

    Authors: Aditya Balu, Sergio Botelho, Biswajit Khara, Vinay Rao, Chinmay Hegde, Soumik Sarkar, Santi Adavani, Adarsh Krishnamurthy, Baskar Ganapathysubramanian

    Abstract: We consider the distributed training of large-scale neural networks that serve as PDE solvers producing full field outputs. We specifically consider neural solvers for the generalized 3D Poisson equation over megavoxel domains. A scalable framework is presented that integrates two distinct advances. First, we accelerate training a large model via a method analogous to the multigrid technique used… ▽ More

    Submitted 29 April, 2021; originally announced April 2021.

  43. arXiv:2104.05158  [pdf, other

    cs.DC cs.AI cs.LG cs.PF

    Software-Hardware Co-design for Fast and Scalable Training of Deep Learning Recommendation Models

    Authors: Dheevatsa Mudigere, Yuchen Hao, Jianyu Huang, Zhihao Jia, Andrew Tulloch, Srinivas Sridharan, Xing Liu, Mustafa Ozdal, Jade Nie, Jongsoo Park, Liang Luo, Jie Amy Yang, Leon Gao, Dmytro Ivchenko, Aarti Basant, Yuxi Hu, Jiyan Yang, Ehsan K. Ardestani, Xiaodong Wang, Rakesh Komuravelli, Ching-Hsiang Chu, Serhat Yilmaz, Huayu Li, Jiyuan Qian, Zhuobo Feng , et al. (28 additional authors not shown)

    Abstract: Deep learning recommendation models (DLRMs) are used across many business-critical services at Facebook and are the single largest AI application in terms of infrastructure demand in its data-centers. In this paper we discuss the SW/HW co-designed solution for high-performance distributed training of large-scale DLRMs. We introduce a high-performance scalable software stack based on PyTorch and pa… ▽ More

    Submitted 26 February, 2023; v1 submitted 11 April, 2021; originally announced April 2021.

  44. Machine-learning based methodologies for 3d x-ray measurement, characterization and optimization for buried structures in advanced ic packages

    Authors: Ramanpreet S Pahwa, Soon Wee Ho, Ren Qin, Richard Chang, Oo Zaw Min, Wang Jie, Vempati Srinivasa Rao, Tin Lay Nwe, Yanjing Yang, Jens Timo Neumann, Ramani Pichumani, Thomas Gregorich

    Abstract: For over 40 years lithographic silicon scaling has driven circuit integration and performance improvement in the semiconductor industry. As silicon scaling slows down, the industry is increasingly dependent on IC package technologies to contribute to further circuit integration and performance improvements. This is a paradigm shift and requires the IC package industry to reduce the size and increa… ▽ More

    Submitted 19 May, 2021; v1 submitted 8 March, 2021; originally announced March 2021.

    Comments: 7 pages, 9 figures

    Journal ref: International Wafer-Level Packaging Conference (IWLPC) 2020

  45. arXiv:2102.09681  [pdf, other

    cs.CL cs.IR

    WebRED: Effective Pretraining And Finetuning For Relation Extraction On The Web

    Authors: Robert Ormandi, Mohammad Saleh, Erin Winter, Vinay Rao

    Abstract: Relation extraction is used to populate knowledge bases that are important to many applications. Prior datasets used to train relation extraction models either suffer from noisy labels due to distant supervision, are limited to certain domains or are too small to train high-capacity models. This constrains downstream applications of relation extraction. We therefore introduce: WebRED (Web Relation… ▽ More

    Submitted 18 February, 2021; originally announced February 2021.

  46. arXiv:2101.02184  [pdf, other

    cs.NI

    VFSIE -- Development and Testing Framework for Federated Science Instruments

    Authors: Anees Al-Najjar, Nageswara S. V. Rao, Neena Imam, Thomas Naughton, Seth Hitefield, Lawrence Sorrillo, James Kohl, Wael Elwasif, Jean-Christophe Bilheux, Hassina Bilheux, Swen Boehm, Jason Kincl

    Abstract: Recent developments in softwarization of networked infrastructures combined with containerization of computing workflows promise unprecedented compute anywhere and everywhere capabilities for federations of edge and remote computing systems and science instruments. The development and testing of software stacks that implement these capabilities over physical production federations, however, is not… ▽ More

    Submitted 2 February, 2021; v1 submitted 6 January, 2021; originally announced January 2021.

    Comments: Earlier Version of VFSIE framework for emulating science workflows at a single site

  47. arXiv:2012.11171  [pdf, other

    eess.SY cs.LG

    A Comprehensive Survey of Machine Learning Based Localization with Wireless Signals

    Authors: Daoud Burghal, Ashwin T. Ravi, Varun Rao, Abdullah A. Alghafis, Andreas F. Molisch

    Abstract: The last few decades have witnessed a growing interest in location-based services. Using localization systems based on Radio Frequency (RF) signals has proven its efficacy for both indoor and outdoor applications. However, challenges remain with respect to both complexity and accuracy of such systems. Machine Learning (ML) is one of the most promising methods for mitigating these problems, as ML (… ▽ More

    Submitted 21 December, 2020; originally announced December 2020.

  48. arXiv:2010.10687  [pdf, other

    cs.LG cs.NE

    Is Batch Norm unique? An empirical investigation and prescription to emulate the best properties of common normalizers without batch dependence

    Authors: Vinay Rao, Jascha Sohl-Dickstein

    Abstract: We perform an extensive empirical study of the statistical properties of Batch Norm and other common normalizers. This includes an examination of the correlation between representations of minibatches, gradient norms, and Hessian spectra both at initialization and over the course of training. Through this analysis, we identify several statistical properties which appear linked to Batch Norm's supe… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.

  49. arXiv:2010.01385  [pdf, other

    cs.CC

    Limitations of Sums of Bounded-Read Formulas

    Authors: Purnata Ghosal, B. V. Raghavendra Rao

    Abstract: Proving super polynomial size lower bounds for various classes of arithmetic circuits computing explicit polynomials is a very important and challenging task in algebraic complexity theory. We study representation of polynomials as sums of weaker models such as read once formulas (ROFs) and read once oblivious algebraic branching programs (ROABPs). We prove: (1) An exponential separation between… ▽ More

    Submitted 3 October, 2020; originally announced October 2020.

    Comments: 20 pages, 3 figures

  50. arXiv:2007.01189  [pdf, other

    cs.CL

    Sequential Domain Adaptation through Elastic Weight Consolidation for Sentiment Analysis

    Authors: Avinash Madasu, Vijjini Anvesh Rao

    Abstract: Elastic Weight Consolidation (EWC) is a technique used in overcoming catastrophic forgetting between successive tasks trained on a neural network. We use this phenomenon of information sharing between tasks for domain adaptation. Training data for tasks such as sentiment analysis (SA) may not be fairly represented across multiple domains. Domain Adaptation (DA) aims to build algorithms that levera… ▽ More

    Submitted 19 July, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

    Comments: Accepted at 25th International Conference on Pattern Recognition, January 2021, Milan, Italy