Zum Hauptinhalt springen

Showing 1–50 of 129 results for author: Rao, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.04948  [pdf, other

    cs.CL cs.LG q-fin.ST stat.AP stat.ML

    HybridRAG: Integrating Knowledge Graphs and Vector Retrieval Augmented Generation for Efficient Information Extraction

    Authors: Bhaskarjit Sarmah, Benika Hall, Rohan Rao, Sunil Patel, Stefano Pasquali, Dhagash Mehta

    Abstract: Extraction and interpretation of intricate information from unstructured text data arising in financial applications, such as earnings call transcripts, present substantial challenges to large language models (LLMs) even using the current best practices to use Retrieval Augmented Generation (RAG) (referred to as VectorRAG techniques which utilize vector databases for information retrieval) due to… ▽ More

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: 9 pages, 2 figures, 5 tables

  2. arXiv:2407.11149  [pdf

    cs.NE

    BMR and BWR: Two simple metaphor-free optimization algorithms for solving real-life non-convex constrained and unconstrained problems

    Authors: Ravipudi Venkata Rao, Ravikumar shah

    Abstract: This paper presents two simple yet powerful optimization algorithms named Best-Mean-Random (BMR) and Best-Worst-Randam (BWR) algorithms to handle both constrained and unconstrained optimization problems. These algorithms are free of metaphors and algorithm-specific parameters. The BMR algorithm is based on the best, mean, and random solutions of the population generated for solving a given problem… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: 28 pages, 5 figures, original paper

    ACM Class: C.1.3; I.2.6; I.5

  3. arXiv:2407.04208  [pdf, other

    cs.CV

    AMD: Automatic Multi-step Distillation of Large-scale Vision Models

    Authors: Cheng Han, Qifan Wang, Sohail A. Dianat, Majid Rabbani, Raghuveer M. Rao, Yi Fang, Qiang Guan, Lifu Huang, Dongfang Liu

    Abstract: Transformer-based architectures have become the de-facto standard models for diverse vision tasks owing to their superior performance. As the size of the models continues to scale up, model distillation becomes extremely important in various real applications, particularly on devices limited by computational resources. However, prevailing knowledge distillation methods exhibit diminished efficacy… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 19 pages, 5 figures

  4. arXiv:2406.17576  [pdf, other

    cs.CR cs.AI cs.LG

    Leveraging Reinforcement Learning in Red Teaming for Advanced Ransomware Attack Simulations

    Authors: Cheng Wang, Christopher Redino, Ryan Clark, Abdul Rahman, Sal Aguinaga, Sathvik Murli, Dhruv Nandakumar, Roland Rao, Lanxiao Huang, Daniel Radke, Edward Bowen

    Abstract: Ransomware presents a significant and increasing threat to individuals and organizations by encrypting their systems and not releasing them until a large fee has been extracted. To bolster preparedness against potential attacks, organizations commonly conduct red teaming exercises, which involve simulated attacks to assess existing security measures. This paper proposes a novel approach utilizing… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  5. arXiv:2406.01559  [pdf, other

    cs.CV

    Prototypical Transformer as Unified Motion Learners

    Authors: Cheng Han, Yawen Lu, Guohao Sun, James C. Liang, Zhiwen Cao, Qifan Wang, Qiang Guan, Sohail A. Dianat, Raghuveer M. Rao, Tong Geng, Zhiqiang Tao, Dongfang Liu

    Abstract: In this work, we introduce the Prototypical Transformer (ProtoFormer), a general and unified framework that approaches various motion tasks from a prototype perspective. ProtoFormer seamlessly integrates prototype learning with Transformer by thoughtfully considering motion dynamics, introducing two innovative designs. First, Cross-Attention Prototyping discovers prototypes based on signature moti… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: 21 pages, 10 figures

  6. arXiv:2405.18322  [pdf, other

    cs.CV cs.AI

    SCE-MAE: Selective Correspondence Enhancement with Masked Autoencoder for Self-Supervised Landmark Estimation

    Authors: Kejia Yin, Varshanth R. Rao, Ruowei Jiang, Xudong Liu, Parham Aarabi, David B. Lindell

    Abstract: Self-supervised landmark estimation is a challenging task that demands the formation of locally distinct feature representations to identify sparse facial landmarks in the absence of annotated data. To tackle this task, existing state-of-the-art (SOTA) methods (1) extract coarse features from backbones that are trained with instance-level self-supervised learning (SSL) paradigms, which neglect the… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted at CVPR 2024

  7. arXiv:2405.08019  [pdf, other

    cs.LG cs.AI

    AdaKD: Dynamic Knowledge Distillation of ASR models using Adaptive Loss Weighting

    Authors: Shreyan Ganguly, Roshan Nayak, Rakshith Rao, Ujan Deb, Prathosh AP

    Abstract: Knowledge distillation, a widely used model compression technique, works on the basis of transferring knowledge from a cumbersome teacher model to a lightweight student model. The technique involves jointly optimizing the task specific and knowledge distillation losses with a weight assigned to them. Despite these weights playing a crucial role in the performance of the distillation process, curre… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

  8. arXiv:2404.17212  [pdf

    cs.ET cs.CV

    Scrutinizing Data from Sky: An Examination of Its Veracity in Area Based Traffic Contexts

    Authors: Yawar Ali, Krishnan K N, Debashis Ray Sarkar, K. Ramachandra Rao, Niladri Chatterjee, Ashish Bhaskar

    Abstract: Traffic data collection has been an overwhelming task for researchers as well as authorities over the years. With the advancement in technology and introduction of various tools for processing and extracting traffic data the task has been made significantly convenient. Data from Sky (DFS) is one such tool, based on image processing and artificial intelligence (AI), that provides output for macrosc… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  9. arXiv:2404.10274  [pdf

    cs.AI cs.LG

    Sparse Attention Regression Network Based Soil Fertility Prediction With Ummaso

    Authors: R V Raghavendra Rao, U Srinivasulu Reddy

    Abstract: The challenge of imbalanced soil nutrient datasets significantly hampers accurate predictions of soil fertility. To tackle this, a new method is suggested in this research, combining Uniform Manifold Approximation and Projection (UMAP) with Least Absolute Shrinkage and Selection Operator (LASSO). The main aim is to counter the impact of uneven data distribution and improve soil fertility models' p… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  10. arXiv:2403.17998  [pdf, other

    cs.CV

    Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval

    Authors: Jiamian Wang, Guohao Sun, Pichao Wang, Dongfang Liu, Sohail Dianat, Majid Rabbani, Raghuveer Rao, Zhiqiang Tao

    Abstract: The increasing prevalence of video clips has sparked growing interest in text-video retrieval. Recent advances focus on establishing a joint embedding space for text and video, relying on consistent embedding representations to compute similarity. However, the text content in existing datasets is generally short and concise, making it hard to fully describe the redundant semantics of a video. Corr… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024, code and model are available at https://github.com/Jiamian-Wang/T-MASS-text-video-retrieval

  11. arXiv:2403.00975  [pdf, other

    cs.LG cs.AI math.FA stat.AP

    Equipment Health Assessment: Time Series Analysis for Wind Turbine Performance

    Authors: Jana Backhus, Aniruddha Rajendra Rao, Chandrasekar Venkatraman, Abhishek Padmanabhan, A. Vinoth Kumar, Chetan Gupta

    Abstract: In this study, we leverage SCADA data from diverse wind turbines to predict power output, employing advanced time series methods, specifically Functional Neural Networks (FNN) and Long Short-Term Memory (LSTM) networks. A key innovation lies in the ensemble of FNN and LSTM models, capitalizing on their collective learning. This ensemble approach outperforms individual models, ensuring stable and a… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

    Comments: 19 Pages, 17 Figures, 3 Tables, Submitted at Applied Sciences (MDPI)

  12. arXiv:2401.14498  [pdf, other

    cs.LG eess.SY stat.AP stat.ML

    Predictive Analysis for Optimizing Port Operations

    Authors: Aniruddha Rajendra Rao, Haiyan Wang, Chetan Gupta

    Abstract: Maritime transport is a pivotal logistics mode for the long-distance and bulk transportation of goods. However, the intricate planning involved in this mode is often hindered by uncertainties, including weather conditions, cargo diversity, and port dynamics, leading to increased costs. Consequently, accurately estimating vessel total (stay) time at port and potential delays becomes imperative for… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 13 pages, 9 figures, 4 Tables. Submitted at IEEE IJCNN 2024

  13. arXiv:2401.12340  [pdf, other

    cs.CV cs.AI cs.LG eess.IV stat.ML

    Contrastive Learning and Cycle Consistency-based Transductive Transfer Learning for Target Annotation

    Authors: Shoaib Meraj Sami, Md Mahedi Hasan, Nasser M. Nasrabadi, Raghuveer Rao

    Abstract: Annotating automatic target recognition (ATR) is a highly challenging task, primarily due to the unavailability of labeled data in the target domain. Hence, it is essential to construct an optimal target domain classifier by utilizing the labeled information of the source domain images. The transductive transfer learning (TTL) method that incorporates a CycleGAN-based unpaired domain translation n… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: This Paper is Accepted in IEEE TRANSACTIONS ON AEROSPACE AND ELECTRONIC SYSTEMS. This Arxiv version is an older version than the reviewed version

  14. arXiv:2401.09742  [pdf, other

    cs.CV

    Image Translation as Diffusion Visual Programmers

    Authors: Cheng Han, James C. Liang, Qifan Wang, Majid Rabbani, Sohail Dianat, Raghuveer Rao, Ying Nian Wu, Dongfang Liu

    Abstract: We introduce the novel Diffusion Visual Programmer (DVP), a neuro-symbolic image translation framework. Our proposed DVP seamlessly embeds a condition-flexible diffusion model within the GPT architecture, orchestrating a coherent sequence of visual programs (i.e., computer vision models) for various pro-symbolic steps, which span RoI identification, style transfer, and position manipulation, facil… ▽ More

    Submitted 30 January, 2024; v1 submitted 18 January, 2024; originally announced January 2024.

    Comments: 25 pages, 20 figures

  15. arXiv:2312.17479  [pdf, other

    cs.AI cs.CY cs.HC cs.LG

    Culturally-Attuned Moral Machines: Implicit Learning of Human Value Systems by AI through Inverse Reinforcement Learning

    Authors: Nigini Oliveira, Jasmine Li, Koosha Khalvati, Rodolfo Cortes Barragan, Katharina Reinecke, Andrew N. Meltzoff, Rajesh P. N. Rao

    Abstract: Constructing a universal moral code for artificial intelligence (AI) is difficult or even impossible, given that different human cultures have different definitions of morality and different societal norms. We therefore argue that the value system of an AI should be culturally attuned: just as a child raised in a particular culture learns the specific values and norms of that culture, we propose t… ▽ More

    Submitted 29 December, 2023; originally announced December 2023.

  16. arXiv:2311.02535   

    cs.CV

    TokenMotion: Motion-Guided Vision Transformer for Video Camouflaged Object Detection Via Learnable Token Selection

    Authors: Zifan Yu, Erfan Bank Tavakoli, Meida Chen, Suya You, Raghuveer Rao, Sanjeev Agarwal, Fengbo Ren

    Abstract: The area of Video Camouflaged Object Detection (VCOD) presents unique challenges in the field of computer vision due to texture similarities between target objects and their surroundings, as well as irregular motion patterns caused by both objects and camera movement. In this paper, we introduce TokenMotion (TMNet), which employs a transformer-based model to enhance VCOD by extracting motion-guide… ▽ More

    Submitted 1 February, 2024; v1 submitted 4 November, 2023; originally announced November 2023.

    Comments: Revising Needed

  17. arXiv:2308.11809  [pdf, other

    q-bio.NC cs.AI cs.NE

    Expressive probabilistic sampling in recurrent neural networks

    Authors: Shirui Chen, Linxing Preston Jiang, Rajesh P. N. Rao, Eric Shea-Brown

    Abstract: In sampling-based Bayesian models of brain function, neural activities are assumed to be samples from probability distributions that the brain uses for probabilistic computation. However, a comprehensive understanding of how mechanistic models of neural dynamics can sample from arbitrary distributions is still lacking. We use tools from functional analysis and stochastic differential equations to… ▽ More

    Submitted 14 November, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

  18. An Effective Deep Learning Based Multi-Class Classification of DoS and DDoS Attack Detection

    Authors: Arun Kumar Silivery, Kovvur Ram Mohan Rao, L K Suresh Kumar

    Abstract: In the past few years, cybersecurity is becoming very important due to the rise in internet users. The internet attacks such as Denial of service (DoS) and Distributed Denial of Service (DDoS) attacks severely harm a website or server and make them unavailable to other users. Network Monitoring and control systems have found it challenging to identify the many classes of DoS and DDoS attacks since… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

  19. arXiv:2308.07870  [pdf, other

    cs.AI cs.LG cs.NE

    Brain-Inspired Computational Intelligence via Predictive Coding

    Authors: Tommaso Salvatori, Ankur Mali, Christopher L. Buckley, Thomas Lukasiewicz, Rajesh P. N. Rao, Karl Friston, Alexander Ororbia

    Abstract: Artificial intelligence (AI) is rapidly becoming one of the key technologies of this century. The majority of results in AI thus far have been achieved using deep neural networks trained with the error backpropagation learning algorithm. However, the ubiquitous adoption of this approach has highlighted some important limitations such as substantial computational cost, difficulty in quantifying unc… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: 37 Pages, 9 Figures

  20. arXiv:2306.17316  [pdf, other

    q-fin.MF cs.GT

    Triangle Fees

    Authors: Rithvik Rao, Nihar Shah

    Abstract: Triangle fees are a novel fee structure for AMMs, in which marginal fees are decreasing in a trade's size. That decline is proportional to the movement in the AMM's implied price, i.e. for every basis point the trade moves the ratio of assets, the marginal fee declines by a basis point. These fees create incentives that protect against price staleness, while still allowing the AMM to earn meaningf… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

    Comments: 7 pages

  21. arXiv:2306.09239  [pdf, ps, other

    q-bio.NC cs.LG eess.IV

    Exploiting the Brain's Network Structure for Automatic Identification of ADHD Subjects

    Authors: Soumyabrata Dey, Ravishankar Rao, Mubarak Shah

    Abstract: Attention Deficit Hyperactive Disorder (ADHD) is a common behavioral problem affecting children. In this work, we investigate the automatic classification of ADHD subjects using the resting state Functional Magnetic Resonance Imaging (fMRI) sequences of the brain. We show that the brain can be modeled as a functional network, and certain properties of the networks differ in ADHD subjects from cont… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

  22. arXiv:2305.13886  [pdf, other

    cs.CV cs.AI

    Deep Transductive Transfer Learning for Automatic Target Recognition

    Authors: Shoaib M. Sami, Nasser M. Nasrabadi, Raghuveer Rao

    Abstract: One of the major obstacles in designing an automatic target recognition (ATR) algorithm, is that there are often labeled images in one domain (i.e., infrared source domain) but no annotated images in the other target domains (i.e., visible, SAR, LIDAR). Therefore, automatically annotating these images is essential to build a robust classifier in the target domain based on the labeled images of the… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

    Comments: 10 pages, 5 figures

    Journal ref: SPIE Defense & Commercial Sensing 2023, Conference 12521, Automatic target recognition XXXIII, Orlando, Florida

  23. arXiv:2305.05532  [pdf, other

    eess.SP cs.AI cs.LG stat.AP stat.ML

    An ensemble of convolution-based methods for fault detection using vibration signals

    Authors: Xian Yeow Lee, Aman Kumar, Lasitha Vidyaratne, Aniruddha Rajendra Rao, Ahmed Farahat, Chetan Gupta

    Abstract: This paper focuses on solving a fault detection problem using multivariate time series of vibration signals collected from planetary gearboxes in a test rig. Various traditional machine learning and deep learning methods have been proposed for multivariate time-series classification, including distance-based, functional data-oriented, feature-driven, and convolution kernel-based methods. Recent st… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: 12 Pages, 9 Figures, 2 Tables. Accepted at ICPHM 2023

    Journal ref: 2023 IEEE International Conference on Prognostics and Health Management (ICPHM)

  24. PIKS: A Technique to Identify Actionable Trends for Policy-Makers Through Open Healthcare Data

    Authors: A. Ravishankar Rao, Subrata Garai, Soumyabrata Dey, Hang Peng

    Abstract: With calls for increasing transparency, governments are releasing greater amounts of data in multiple domains including finance, education and healthcare. The efficient exploratory analysis of healthcare data constitutes a significant challenge. Key concerns in public health include the quick identification and analysis of trends, and the detection of outliers. This allows policies to be rapidly a… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

    Journal ref: SN COMPUT. SCI. 2, 477 (2021)

  25. Building predictive models of healthcare costs with open healthcare data

    Authors: A. Ravishankar Rao, Subrata Garai, Soumyabrata Dey, Hang Peng

    Abstract: Due to rapidly rising healthcare costs worldwide, there is significant interest in controlling them. An important aspect concerns price transparency, as preliminary efforts have demonstrated that patients will shop for lower costs, driving efficiency. This requires the data to be made available, and models that can predict healthcare costs for a wide range of patient demographics and conditions. W… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: 2020 IEEE International Conference on Healthcare Informatics (ICHI)

  26. A system for exploring big data: an iterative k-means searchlight for outlier detection on open health data

    Authors: A. Ravishankar Rao, Daniel Clarke, Subrata Garai, Soumyabrata Dey

    Abstract: The interactive exploration of large and evolving datasets is challenging as relationships between underlying variables may not be fully understood. There may be hidden trends and patterns in the data that are worthy of further exploration and analysis. We present a system that methodically explores multiple combinations of variables using a searchlight technique and identifies outliers. An iterat… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: 2018 International Joint Conference on Neural Networks (IJCNN)

  27. arXiv:2303.00915  [pdf, other

    cs.CV cs.CL

    BiomedCLIP: a multimodal biomedical foundation model pretrained from fifteen million scientific image-text pairs

    Authors: Sheng Zhang, Yanbo Xu, Naoto Usuyama, Hanwen Xu, Jaspreet Bagga, Robert Tinn, Sam Preston, Rajesh Rao, Mu Wei, Naveen Valluri, Cliff Wong, Andrea Tupini, Yu Wang, Matt Mazzola, Swadheen Shukla, Lars Liden, Jianfeng Gao, Matthew P. Lungren, Tristan Naumann, Sheng Wang, Hoifung Poon

    Abstract: Biomedical data is inherently multimodal, comprising physical measurements and natural language narratives. A generalist biomedical AI model needs to simultaneously process different modalities of data, including text and images. Therefore, training an effective generalist biomedical model requires high-quality multimodal data, such as parallel image-text pairs. Here, we present PMC-15M, a novel d… ▽ More

    Submitted 16 January, 2024; v1 submitted 1 March, 2023; originally announced March 2023.

    Comments: The models are released at https://aka.ms/biomedclip

  28. arXiv:2302.08594  [pdf, other

    cs.CV

    TransUPR: A Transformer-based Uncertain Point Refiner for LiDAR Point Cloud Semantic Segmentation

    Authors: Zifan Yu, Meida Chen, Zhikang Zhang, Suya You, Raghuveer Rao, Sanjeev Agarwal, Fengbo Ren

    Abstract: Common image-based LiDAR point cloud semantic segmentation (LiDAR PCSS) approaches have bottlenecks resulting from the boundary-blurring problem of convolution neural networks (CNNs) and quantitation loss of spherical projection. In this work, we propose a transformer-based plug-and-play uncertain point refiner, i.e., TransUPR, to refine selected uncertain points in a learnable manner, which leads… ▽ More

    Submitted 12 October, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

    Comments: 6 pages; Accepted by 2023 IROS

  29. A Functional approach for Two Way Dimension Reduction in Time Series

    Authors: Aniruddha Rajendra Rao, Haiyan Wang, Chetan Gupta

    Abstract: The rise in data has led to the need for dimension reduction techniques, especially in the area of non-scalar variables, including time series, natural language processing, and computer vision. In this paper, we specifically investigate dimension reduction for time series through functional data analysis. Current methods for dimension reduction in functional data are functional principal component… ▽ More

    Submitted 1 January, 2023; originally announced January 2023.

    Comments: 10 pages, 4 figures, 4 tables

    Journal ref: IEEE BigData 2022

  30. Iterative RNDOP-Optimal Anchor Placement for Beyond Convex Hull ToA-based Localization: Performance Bounds and Heuristic Algorithms

    Authors: Raghunandan M. Rao, Don-Roberts Emenonye

    Abstract: Localizing targets outside the anchors' convex hull is an understudied but prevalent scenario in vehicle-centric, UAV-based, and self-localization applications. Considering such scenarios, this paper studies the optimal anchor placement problem for Time-of-Arrival (ToA)-based localization schemes such that the worst-case Dilution of Precision (DOP) is minimized. Building on prior results on DOP sc… ▽ More

    Submitted 17 February, 2024; v1 submitted 16 December, 2022; originally announced December 2022.

    Comments: 16 pages. To appear in a future issue of the IEEE Transactions on Vehicular Technology

  31. arXiv:2211.03932  [pdf, other

    cs.CV cs.MM

    Enhanced Low-resolution LiDAR-Camera Calibration Via Depth Interpolation and Supervised Contrastive Learning

    Authors: Zhikang Zhang, Zifan Yu, Suya You, Raghuveer Rao, Sanjeev Agarwal, Fengbo Ren

    Abstract: Motivated by the increasing application of low-resolution LiDAR recently, we target the problem of low-resolution LiDAR-camera calibration in this work. The main challenges are two-fold: sparsity and noise in point clouds. To address the problem, we propose to apply depth interpolation to increase the point density and supervised contrastive learning to learn noise-resistant features. The experime… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

  32. arXiv:2211.03365  [pdf, other

    cs.CC

    Polynomial Kernels for Generalized Domination Problems

    Authors: Pradeesha Ashok, Rajath Rao, Avi Tomar

    Abstract: In this paper, we study the parameterized complexity of a generalized domination problem called the [$σ, ρ$] Dominating Set problem. This problem generalizes a large number of problems including the Minimum Dominating Set problem and its many variants. The parameterized complexity of the [$σ, ρ$] Dominating Set problem parameterized by treewidth is well studied. Here the properties of the sets… ▽ More

    Submitted 9 November, 2022; v1 submitted 7 November, 2022; originally announced November 2022.

    Comments: 19 pages, 6 figures

  33. arXiv:2211.00519  [pdf, other

    cs.GR cs.CV

    Learning Neural Implicit Representations with Surface Signal Parameterizations

    Authors: Yanran Guan, Andrei Chubarau, Ruby Rao, Derek Nowrouzezahrai

    Abstract: Neural implicit surface representations have recently emerged as popular alternative to explicit 3D object encodings, such as polygonal meshes, tabulated points, or voxels. While significant work has improved the geometric fidelity of these representations, much less attention is given to their final appearance. Traditional explicit object representations commonly couple the 3D shape data with aux… ▽ More

    Submitted 25 June, 2023; v1 submitted 1 November, 2022; originally announced November 2022.

  34. arXiv:2210.13461  [pdf, other

    cs.LG cs.AI cs.CV cs.NE q-bio.NC

    Active Predictive Coding: A Unified Neural Framework for Learning Hierarchical World Models for Perception and Planning

    Authors: Rajesh P. N. Rao, Dimitrios C. Gklezakos, Vishwas Sathish

    Abstract: Predictive coding has emerged as a prominent model of how the brain learns through predictions, anticipating the importance accorded to predictive learning in recent AI architectures such as transformers. Here we propose a new framework for predictive coding called active predictive coding which can learn hierarchical world models and solve two radically different open problems in AI: (1) how do w… ▽ More

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: 15 pages, 10 figures, 2 supplementary figures

  35. arXiv:2210.11478  [pdf, other

    q-bio.NC cs.AI

    Neural Co-Processors for Restoring Brain Function: Results from a Cortical Model of Grasping

    Authors: Matthew J. Bryan, Linxing Preston Jiang, Rajesh P N Rao

    Abstract: Objective: A major challenge in designing closed-loop brain-computer interfaces is finding optimal stimulation patterns as a function of ongoing neural activity for different subjects and objectives. Approach: To achieve goal-directed closed-loop neurostimulation, we propose "neural co-processors" which use artificial neural networks and deep learning to learn optimal closed-loop stimulation polic… ▽ More

    Submitted 20 March, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: 45 pages, 19 figures. Submitted the IOP Journal of Neural Engineering

  36. Reflections on Software Failure Analysis

    Authors: Paschal C. Amusuo, Aishwarya Sharma, Siddharth R. Rao, Abbey Vincent, James C. Davis

    Abstract: Failure studies are important in revealing the root causes, behaviors, and life cycle of defects in software systems. These studies either focus on understanding the characteristics of defects in specific classes of systems or the characteristics of a specific type of defect in the systems it manifests in. Failure studies have influenced various software engineering research directions, especially… ▽ More

    Submitted 21 September, 2022; v1 submitted 7 September, 2022; originally announced September 2022.

    Comments: 6 pages, 4 figures To be published in: Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE '22)

  37. arXiv:2207.03593  [pdf, other

    cs.LG

    Hyper-Universal Policy Approximation: Learning to Generate Actions from a Single Image using Hypernets

    Authors: Dimitrios C. Gklezakos, Rishi Jha, Rajesh P. N. Rao

    Abstract: Inspired by Gibson's notion of object affordances in human vision, we ask the question: how can an agent learn to predict an entire action policy for a novel object or environment given only a single glimpse? To tackle this problem, we introduce the concept of Universal Policy Functions (UPFs) which are state-to-action mappings that generalize not only to new goals but most importantly to novel, u… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

  38. arXiv:2207.00559  [pdf, other

    cs.LG hep-ex physics.ins-det stat.ML

    Ultra-low latency recurrent neural network inference on FPGAs for physics applications with hls4ml

    Authors: Elham E Khoda, Dylan Rankin, Rafael Teixeira de Lima, Philip Harris, Scott Hauck, Shih-Chieh Hsu, Michael Kagan, Vladimir Loncar, Chaitanya Paikara, Richa Rao, Sioni Summers, Caterina Vernieri, Aaron Wang

    Abstract: Recurrent neural networks have been shown to be effective architectures for many tasks in high energy physics, and thus have been widely adopted. Their use in low-latency environments has, however, been limited as a result of the difficulties of implementing recurrent architectures on field-programmable gate arrays (FPGAs). In this paper we present an implementation of two types of recurrent neura… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

    Comments: 12 pages, 6 figures, 5 tables

  39. arXiv:2206.08462  [pdf, other

    cs.CV cs.LG

    Recursive Neural Programs: Variational Learning of Image Grammars and Part-Whole Hierarchies

    Authors: Ares Fisher, Rajesh P. N. Rao

    Abstract: Human vision involves parsing and representing objects and scenes using structured representations based on part-whole hierarchies. Computer vision and machine learning researchers have recently sought to emulate this capability using capsule networks, reference frames and active predictive coding, but a generative model formulation has been lacking. We introduce Recursive Neural Programs (RNPs),… ▽ More

    Submitted 25 June, 2022; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: 9 pages, 6 figures. fixed LaTeX typo for algorithm reference

  40. arXiv:2204.03055  [pdf

    econ.GN cs.CY

    To Participate Or Not To Participate: An Investigation Of Strategic Participation In Standards

    Authors: Paras Bhatt, Claire Vishik, Govind Hariharan, H. Raghav Rao

    Abstract: Essential functionality in the ICT (Information and Communication Technology) space draws from standards such as HTTP (IETF RFC 2616, Bluetooth (IEEE 802.15) and various telecommunication standards (4G, 5G). They have fuelled rapid growth of ICT sector in the last decades by ensuring interoperability and consistency in computing environment. Research shows that firms that backed ICT standards and… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

    Comments: https://www.researchgate.net/publication/359415466_To_Participate_Or_Not_To_Participate_An_Investigation_Of_Strategic_Participation_In_Standards

    Journal ref: 11th International Conference on Standardisation and Innovation in Information Technology (SIIT) - The Past, Present and Future of ICT Standardisation, Sep, 2021

  41. Repairing Brain-Computer Interfaces with Fault-Based Data Acquisition

    Authors: Cailin Winston, Caleb Winston, Chloe N Winston, Claris Winston, Cleah Winston, Rajesh PN Rao, René Just

    Abstract: Brain-computer interfaces (BCIs) decode recorded neural signals from the brain and/or stimulate the brain with encoded neural signals. BCIs span both hardware and software and have a wide range of applications in restorative medicine, from restoring movement through prostheses and robotic limbs to restoring sensation and communication through spellers. BCIs also have applications in diagnostic med… ▽ More

    Submitted 20 March, 2022; originally announced March 2022.

    Comments: Accepted at International Conference on Software Engineering (ICSE-2022)

  42. arXiv:2203.10442  [pdf, other

    cs.CL cs.LG

    Towards Structuring Real-World Data at Scale: Deep Learning for Extracting Key Oncology Information from Clinical Text with Patient-Level Supervision

    Authors: Sam Preston, Mu Wei, Rajesh Rao, Robert Tinn, Naoto Usuyama, Michael Lucas, Roshanthi Weerasinghe, Soohee Lee, Brian Piening, Paul Tittel, Naveen Valluri, Tristan Naumann, Carlo Bifulco, Hoifung Poon

    Abstract: Objective: The majority of detailed patient information in real-world data (RWD) is only consistently available in free-text clinical documents. Manual curation is expensive and time-consuming. Developing natural language processing (NLP) methods for structuring RWD is thus essential for scaling real-world evidence generation. Materials and Methods: Traditional rule-based systems are vulnerable… ▽ More

    Submitted 19 March, 2022; originally announced March 2022.

  43. arXiv:2203.07664  [pdf, other

    cs.CV cs.AI

    Can you even tell left from right? Presenting a new challenge for VQA

    Authors: Sai Raam Venkatraman, Rishi Rao, S. Balasubramanian, Chandra Sekhar Vorugunti, R. Raghunatha Sarma

    Abstract: Visual Question Answering (VQA) needs a means of evaluating the strengths and weaknesses of models. One aspect of such an evaluation is the evaluation of compositional generalisation, or the ability of a model to answer well on scenes whose scene-setups are different from the training set. Therefore, for this purpose, we need datasets whose train and test sets differ significantly in composition.… ▽ More

    Submitted 15 March, 2022; originally announced March 2022.

  44. arXiv:2201.08813  [pdf, other

    cs.CV cs.AI cs.LG

    Active Predictive Coding Networks: A Neural Solution to the Problem of Learning Reference Frames and Part-Whole Hierarchies

    Authors: Dimitrios C. Gklezakos, Rajesh P. N. Rao

    Abstract: We introduce Active Predictive Coding Networks (APCNs), a new class of neural networks that solve a major problem posed by Hinton and others in the fields of artificial intelligence and brain modeling: how can neural networks learn intrinsic reference frames for objects and parse visual scenes into part-whole hierarchies by dynamically allocating nodes in a parse tree? APCNs address this problem b… ▽ More

    Submitted 14 January, 2022; originally announced January 2022.

  45. arXiv:2112.11547  [pdf, other

    cs.CV cs.AI cs.LG cs.MM

    Decompose the Sounds and Pixels, Recompose the Events

    Authors: Varshanth R. Rao, Md Ibrahim Khalil, Haoda Li, Peng Dai, Juwei Lu

    Abstract: In this paper, we propose a framework centering around a novel architecture called the Event Decomposition Recomposition Network (EDRNet) to tackle the Audio-Visual Event (AVE) localization problem in the supervised and weakly supervised settings. AVEs in the real world exhibit common unravelling patterns (termed as Event Progress Checkpoints (EPC)), which humans can perceive through the cooperati… ▽ More

    Submitted 21 December, 2021; originally announced December 2021.

    Comments: Accepted at AAAI 2022

  46. arXiv:2110.08429  [pdf, other

    cs.CV cs.AI

    TorchEsegeta: Framework for Interpretability and Explainability of Image-based Deep Learning Models

    Authors: Soumick Chatterjee, Arnab Das, Chirag Mandal, Budhaditya Mukhopadhyay, Manish Vipinraj, Aniruddh Shukla, Rajatha Nagaraja Rao, Chompunuch Sarasaen, Oliver Speck, Andreas Nürnberger

    Abstract: Clinicians are often very sceptical about applying automatic image processing approaches, especially deep learning based methods, in practice. One main reason for this is the black-box nature of these approaches and the inherent problem of missing insights of the automatically derived decisions. In order to increase trust in these methods, this paper presents approaches that help to interpret and… ▽ More

    Submitted 7 February, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

  47. arXiv:2109.12434  [pdf, other

    q-bio.NC cs.AI cs.LG cs.NE eess.SY

    Emergent behavior and neural dynamics in artificial agents tracking turbulent plumes

    Authors: Satpreet Harcharan Singh, Floris van Breugel, Rajesh P. N. Rao, Bingni Wen Brunton

    Abstract: Tracking a turbulent plume to locate its source is a complex control problem because it requires multi-sensory integration and must be robust to intermittent odors, changing wind direction, and variable plume statistics. This task is routinely performed by flying insects, often over long distances, in pursuit of food or mates. Several aspects of this remarkable behavior have been studied in detail… ▽ More

    Submitted 17 December, 2021; v1 submitted 25 September, 2021; originally announced September 2021.

    ACM Class: I.2.6; I.2.0; I.5.1

  48. arXiv:2108.03936  [pdf, other

    cs.RO cs.CV

    3D Human Reconstruction in the Wild with Collaborative Aerial Cameras

    Authors: Cherie Ho, Andrew Jong, Harry Freeman, Rohan Rao, Rogerio Bonatti, Sebastian Scherer

    Abstract: Aerial vehicles are revolutionizing applications that require capturing the 3D structure of dynamic targets in the wild, such as sports, medicine, and entertainment. The core challenges in developing a motion-capture system that operates in outdoors environments are: (1) 3D inference requires multiple simultaneous viewpoints of the target, (2) occlusion caused by obstacles is frequent when trackin… ▽ More

    Submitted 9 August, 2021; originally announced August 2021.

    Comments: 7 pages, 11 figures, IROS 2021

  49. arXiv:2107.14151  [pdf, other

    stat.ME cs.AI cs.LG stat.ML

    Modern Non-Linear Function-on-Function Regression

    Authors: Aniruddha Rajendra Rao, Matthew Reimherr

    Abstract: We introduce a new class of non-linear function-on-function regression models for functional data using neural networks. We propose a framework using a hidden layer consisting of continuous neurons, called a continuous hidden layer, for functional response modeling and give two model fitting strategies, Functional Direct Neural Network (FDNN) and Functional Basis Neural Network (FBNN). Both are de… ▽ More

    Submitted 7 October, 2023; v1 submitted 29 July, 2021; originally announced July 2021.

    Comments: 6 figures, 6 tables (including supplementary material), 16 pages (including supplementary material). arXiv admin note: text overlap with arXiv:2104.09371

    Journal ref: Statistics and Computing 2023

  50. arXiv:2106.12033  [pdf, other

    cs.CR cs.GT

    Strategic Liquidity Provision in Uniswap v3

    Authors: Zhou Fan, Francisco Marmolejo-Cossío, Daniel J. Moroz, Michael Neuder, Rithvik Rao, David C. Parkes

    Abstract: Uniswap v3 is the largest decentralized exchange for digital currencies. A novelty of its design is that it allows a liquidity provider (LP) to allocate liquidity to one or more closed intervals of the price of an asset instead of the full range of possible prices. An LP earns fee rewards proportional to the amount of its liquidity allocation when prices move in this interval. This induces the pro… ▽ More

    Submitted 16 August, 2024; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: In Proceedings of AFT '23