Zum Hauptinhalt springen

Showing 1–50 of 145 results for author: Trivedi, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.14090  [pdf, other

    cs.DC cs.AI cs.AR cs.NI cs.PF

    Exploring GPU-to-GPU Communication: Insights into Supercomputer Interconnects

    Authors: Daniele De Sensi, Lorenzo Pichetti, Flavio Vella, Tiziano De Matteis, Zebin Ren, Luigi Fusco, Matteo Turisini, Daniele Cesarini, Kurt Lust, Animesh Trivedi, Duncan Roweth, Filippo Spiga, Salvatore Di Girolamo, Torsten Hoefler

    Abstract: Multi-GPU nodes are increasingly common in the rapidly evolving landscape of exascale supercomputers. On these systems, GPUs on the same node are connected through dedicated networks, with bandwidths up to a few terabits per second. However, gauging performance expectations and maximizing system efficiency is challenging due to different technologies, design options, and software layers. This pape… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

    ACM Class: C.2.4; C.5.1; C.2.1; C.4

    Journal ref: Published in Proceedings of The International Conference for High Performance Computing Networking, Storage, and Analysis (SC '24) (2024)

  2. arXiv:2408.02999  [pdf, other

    cs.FL cs.AI

    LLMs as Probabilistic Minimally Adequate Teachers for DFA Learning

    Authors: Lekai Chen, Ashutosh Trivedi, Alvaro Velasquez

    Abstract: The emergence of intelligence in large language models (LLMs) has inspired investigations into their integration into automata learning. This paper introduces the probabilistic Minimally Adequate Teacher (pMAT) formulation, which leverages a probabilistic oracle that could give persistent errors randomly during answering the membership queries for deterministic finite automata (DFA) learning. Give… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

  3. arXiv:2407.14793  [pdf, other

    cs.DC eess.SY

    QoS Aware Mixed-Criticality Task Scheduling in Vehicular Edge Cloud System

    Authors: Suvarthi Sarkar, Aditya Trivedi, Ritish Bansal, Aryabartta Sahu

    Abstract: Modern-day cars are equipped with numerous cameras and sensors, typically integrated with advanced decision-control systems that enable the vehicle to perceive its surroundings and navigate autonomously. Efficient processing of data from sensors, lidars, radars and cameras is quite computationally intensive and can not be done with good accuracy using less capable onboard resources. In order to de… ▽ More

    Submitted 20 July, 2024; originally announced July 2024.

  4. arXiv:2406.07833  [pdf, other

    cs.CV cs.AI

    Sense Less, Generate More: Pre-training LiDAR Perception with Masked Autoencoders for Ultra-Efficient 3D Sensing

    Authors: Sina Tayebati, Theja Tulabandhula, Amit R. Trivedi

    Abstract: In this work, we propose a disruptively frugal LiDAR perception dataflow that generates rather than senses parts of the environment that are either predictable based on the extensive training of the environment or have limited consequence to the overall prediction accuracy. Therefore, the proposed methodology trades off sensing energy with training data for low-power robotics and autonomous naviga… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  5. arXiv:2405.13735  [pdf, other

    eess.SY cs.AI cs.LG

    Transfer of Safety Controllers Through Learning Deep Inverse Dynamics Model

    Authors: Alireza Nadali, Ashutosh Trivedi, Majid Zamani

    Abstract: Control barrier certificates have proven effective in formally guaranteeing the safety of the control systems. However, designing a control barrier certificate is a time-consuming and computationally expensive endeavor that requires expert input in the form of domain knowledge and mathematical maturity. Additionally, when a system undergoes slight changes, the new controller and its correctness ce… ▽ More

    Submitted 24 May, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

    Comments: Extended Version, submitted to ADHS 2024

  6. arXiv:2405.10725  [pdf, other

    cs.CL cs.IR

    INDUS: Effective and Efficient Language Models for Scientific Applications

    Authors: Bishwaranjan Bhattacharjee, Aashka Trivedi, Masayasu Muraoka, Muthukumaran Ramasubramanian, Takuma Udagawa, Iksha Gurung, Rong Zhang, Bharath Dandala, Rahul Ramachandran, Manil Maskey, Kaylin Bugbee, Mike Little, Elizabeth Fancher, Lauren Sanders, Sylvain Costes, Sergi Blanco-Cuaresma, Kelly Lockhart, Thomas Allen, Felix Grezes, Megan Ansdell, Alberto Accomazzi, Yousef El-Kurdi, Davis Wertheimer, Birgit Pfitzmann, Cesar Berrospi Ramis , et al. (9 additional authors not shown)

    Abstract: Large language models (LLMs) trained on general domain corpora showed remarkable results on natural language processing (NLP) tasks. However, previous research demonstrated LLMs trained using domain-focused corpora perform better on specialized tasks. Inspired by this pivotal insight, we developed INDUS, a comprehensive suite of LLMs tailored for the Earth science, biology, physics, heliophysics,… ▽ More

    Submitted 20 May, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

  7. arXiv:2405.04979  [pdf, other

    cs.RO

    Predictive Mapping of Spectral Signatures from RGB Imagery for Off-Road Terrain Analysis

    Authors: Sarvesh Prajapati, Ananya Trivedi, Bruce Maxwell, Taskin Padir

    Abstract: Accurate identification of complex terrain characteristics, such as soil composition and coefficient of friction, is essential for model-based planning and control of mobile robots in off-road environments. Spectral signatures leverage distinct patterns of light absorption and reflection to identify various materials, enabling precise characterization of their inherent properties. Recent research… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: 5 Pages, for ICRA Workshop

  8. arXiv:2404.19100  [pdf, other

    cs.SE cs.AI cs.CY cs.LG

    Predicting Fairness of ML Software Configurations

    Authors: Salvador Robles Herrera, Verya Monjezi, Vladik Kreinovich, Ashutosh Trivedi, Saeid Tizpaz-Niari

    Abstract: This paper investigates the relationships between hyperparameters of machine learning and fairness. Data-driven solutions are increasingly used in critical socio-technical applications where ensuring fairness is important. Rather than explicitly encoding decision logic via control and data structures, the ML developers provide input data, perform some pre-processing, choose ML algorithms, and tune… ▽ More

    Submitted 1 July, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: To Appear in the 20th International Conference on Predictive Models and Data Analytics in Software Engineering (PROMISE'24)

  9. arXiv:2404.02872  [pdf, other

    cs.AI

    Integrating Explanations in Learning LTL Specifications from Demonstrations

    Authors: Ashutosh Gupta, John Komp, Abhay Singh Rajput, Krishna Shankaranarayanan, Ashutosh Trivedi, Namrita Varshney

    Abstract: This paper investigates whether recent advances in Large Language Models (LLMs) can assist in translating human explanations into a format that can robustly support learning Linear Temporal Logic (LTL) from demonstrations. Both LLMs and optimization-based methods can extract LTL specifications from demonstrations; however, they have distinct limitations. LLMs can quickly generate solutions and inc… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 21 Pages, 13 Page Appendix

    ACM Class: I.2.8

  10. arXiv:2403.06009  [pdf, other

    cs.LG

    Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations

    Authors: Swapnaja Achintalwar, Adriana Alvarado Garcia, Ateret Anaby-Tavor, Ioana Baldini, Sara E. Berger, Bishwaranjan Bhattacharjee, Djallel Bouneffouf, Subhajit Chaudhury, Pin-Yu Chen, Lamogha Chiazor, Elizabeth M. Daly, Kirushikesh DB, Rogério Abreu de Paula, Pierre Dognin, Eitan Farchi, Soumya Ghosh, Michael Hind, Raya Horesh, George Kour, Ja Young Lee, Nishtha Madaan, Sameep Mehta, Erik Miehling, Keerthiram Murugesan, Manish Nagireddy , et al. (13 additional authors not shown)

    Abstract: Large language models (LLMs) are susceptible to a variety of risks, from non-faithful output to biased and toxic generations. Due to several limiting factors surrounding LLMs (training cost, API access, data availability, etc.), it may not always be feasible to impose direct safety constraints on a deployed model. Therefore, an efficient and reliable alternative is required. To this end, we presen… ▽ More

    Submitted 19 August, 2024; v1 submitted 9 March, 2024; originally announced March 2024.

  11. arXiv:2402.18065  [pdf, other

    cs.RO

    A Probabilistic Motion Model for Skid-Steer Wheeled Mobile Robot Navigation on Off-Road Terrains

    Authors: Ananya Trivedi, Mark Zolotas, Adeeb Abbas, Sarvesh Prajapati, Salah Bazzi, Taskın Padır

    Abstract: Skid-Steer Wheeled Mobile Robots (SSWMRs) are increasingly being used for off-road autonomy applications. When turning at high speeds, these robots tend to undergo significant skidding and slipping. In this work, using Gaussian Process Regression (GPR) and Sigma-Point Transforms, we estimate the non-linear effects of tire-terrain interaction on robot velocities in a probabilistic fashion. Using th… ▽ More

    Submitted 29 February, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: Accepted for publication at IEEE ICRA 2024

  12. arXiv:2402.07107  [pdf, other

    cs.LG cs.AI

    Echoes of Socratic Doubt: Embracing Uncertainty in Calibrated Evidential Reinforcement Learning

    Authors: Alex Christopher Stutts, Danilo Erricolo, Theja Tulabandhula, Amit Ranjan Trivedi

    Abstract: We present a novel statistical approach to incorporating uncertainty awareness in model-free distributional reinforcement learning involving quantile regression-based deep Q networks. The proposed algorithm, $\textit{Calibrated Evidential Quantile Regression in Deep Q Networks (CEQR-DQN)}$, aims to address key challenges associated with separately estimating aleatoric and epistemic uncertainty in… ▽ More

    Submitted 3 June, 2024; v1 submitted 11 February, 2024; originally announced February 2024.

  13. arXiv:2402.05624  [pdf, other

    cs.CL cs.AI cs.HC

    Efficient Models for the Detection of Hate, Abuse and Profanity

    Authors: Christoph Tillmann, Aashka Trivedi, Bishwaranjan Bhattacharjee

    Abstract: Large Language Models (LLMs) are the cornerstone for many Natural Language Processing (NLP) tasks like sentiment analysis, document classification, named entity recognition, question answering, summarization, etc. LLMs are often trained on data which originates from the web. This data is prone to having content with Hate, Abuse and Profanity (HAP). For a detailed definition of HAP, please refer to… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 8 pages, 7 figures

  14. arXiv:2401.17481  [pdf, other

    cs.RO

    Navigating the Unknown: Uncertainty-Aware Compute-in-Memory Autonomy of Edge Robotics

    Authors: Nastaran Darabi, Priyesh Shukla, Dinithi Jayasuriya, Divake Kumar, Alex C. Stutts, Amit Ranjan Trivedi

    Abstract: This paper addresses the challenging problem of energy-efficient and uncertainty-aware pose estimation in insect-scale drones, which is crucial for tasks such as surveillance in constricted spaces and for enabling non-intrusive spatial intelligence in smart homes. Since tiny drones operate in highly dynamic environments, where factors like lighting and human movement impact their predictive accura… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  15. arXiv:2401.12379  [pdf, other

    cs.AI cs.DB cs.PL

    Analyzing the Effectiveness of Large Language Models on Text-to-SQL Synthesis

    Authors: Richard Roberson, Gowtham Kaki, Ashutosh Trivedi

    Abstract: This study investigates various approaches to using Large Language Models (LLMs) for Text-to-SQL program synthesis, focusing on the outcomes and insights derived. Employing the popular Text-to-SQL dataset, spider, the goal was to input a natural language question along with the database schema and output the correct SQL SELECT query. The initial approach was to fine-tune a local and open-source mo… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

  16. arXiv:2401.06800  [pdf, other

    cs.CL cs.AI

    Reinforcement Learning for Optimizing RAG for Domain Chatbots

    Authors: Mandar Kulkarni, Praveen Tangarajan, Kyung Kim, Anusua Trivedi

    Abstract: With the advent of Large Language Models (LLM), conversational assistants have become prevalent for domain use cases. LLMs acquire the ability to contextual question answering through training, and Retrieval Augmented Generation (RAG) further enables the bot to answer domain-specific questions. This paper describes a RAG-based approach for building a chatbot that answers user's queries using Frequ… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

  17. arXiv:2401.06356  [pdf, other

    cs.LG

    An Empirical Investigation into the Effect of Parameter Choices in Knowledge Distillation

    Authors: Md Arafat Sultan, Aashka Trivedi, Parul Awasthy, Avirup Sil

    Abstract: We present a large-scale empirical study of how choices of configuration parameters affect performance in knowledge distillation (KD). An example of such a KD parameter is the measure of distance between the predictions of the teacher and the student, common choices for which include the mean squared error (MSE) and the KL-divergence. Although scattered efforts have been made to understand the dif… ▽ More

    Submitted 18 February, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

  18. arXiv:2312.11344  [pdf, other

    cs.CL cs.AI cs.HC

    Muted: Multilingual Targeted Offensive Speech Identification and Visualization

    Authors: Christoph Tillmann, Aashka Trivedi, Sara Rosenthal, Santosh Borse, Rong Zhang, Avirup Sil, Bishwaranjan Bhattacharjee

    Abstract: Offensive language such as hate, abuse, and profanity (HAP) occurs in various content on the web. While previous work has mostly dealt with sentence level annotations, there have been a few recent attempts to identify offensive spans as well. We build upon this work and introduce Muted, a system to identify multilingual HAP content by displaying offensive arguments and their targets using heat map… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Journal ref: EMNLP 2023 Demo Track

  19. arXiv:2312.09938  [pdf, other

    cs.LG cs.AI cs.MA

    Assume-Guarantee Reinforcement Learning

    Authors: Milad Kazemi, Mateo Perez, Fabio Somenzi, Sadegh Soudjani, Ashutosh Trivedi, Alvaro Velasquez

    Abstract: We present a modular approach to \emph{reinforcement learning} (RL) in environments consisting of simpler components evolving in parallel. A monolithic view of such modular environments may be prohibitively large to learn, or may require unrealizable communication between the components in the form of a centralized controller. Our proposed approach is based on the assume-guarantee paradigm where t… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: This is the extended version of the paper accepted in the SRRAI Special Track at the Conference on Artificial Intelligence (AAAI-24)

  20. arXiv:2312.08602  [pdf, other

    cs.LO cs.LG

    Omega-Regular Decision Processes

    Authors: Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

    Abstract: Regular decision processes (RDPs) are a subclass of non-Markovian decision processes where the transition and reward functions are guarded by some regular property of the past (a lookback). While RDPs enable intuitive and succinct representation of non-Markovian decision processes, their expressive power coincides with finite-state Markov decision processes (MDPs). We introduce omega-regular decis… ▽ More

    Submitted 13 December, 2023; originally announced December 2023.

  21. arXiv:2311.11979  [pdf, other

    cs.SE cs.CL

    On the Potential and Limitations of Few-Shot In-Context Learning to Generate Metamorphic Specifications for Tax Preparation Software

    Authors: Dananjay Srinivas, Rohan Das, Saeid Tizpaz-Niari, Ashutosh Trivedi, Maria Leonor Pacheco

    Abstract: Due to the ever-increasing complexity of income tax laws in the United States, the number of US taxpayers filing their taxes using tax preparation software (henceforth, tax software) continues to increase. According to the U.S. Internal Revenue Service (IRS), in FY22, nearly 50% of taxpayers filed their individual income taxes using tax software. Given the legal consequences of incorrectly filing… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: Accepted to the Proceedings of the Natural Legal Language Processing Workshop, EMNLP 2023

  22. arXiv:2311.07695  [pdf, other

    cs.FL eess.SY

    Co-Buchi Barrier Certificates for Discrete-time Dynamical Systems

    Authors: Vishnu Murali, Ashutosh Trivedi, Majid Zamani

    Abstract: Barrier certificates provide functional overapproximations for the reachable set of dynamical systems and provide inductive guarantees on the safe evolution of the system. Formally a barrier certificate is a real-valued function over the state set that is required to be non-positive for the initial states, positive over the set of unsafe states and nonincreasing along the state transitions. These… ▽ More

    Submitted 13 November, 2023; originally announced November 2023.

  23. arXiv:2310.19094  [pdf, other

    cs.DC

    Performance Characterization of NVMe Flash Devices with Zoned Namespaces (ZNS)

    Authors: Krijn Doekemeijer, Nick Tehrany, Balakrishnan Chandrasekaran, Matias Bjørling, Animesh Trivedi

    Abstract: The recent emergence of NVMe flash devices with Zoned Namespace support, ZNS SSDs, represents a significant new advancement in flash storage. ZNS SSDs introduce a new storage abstraction of append-only zones with a set of new I/O (i.e., append) and management (zone state machine transition) commands. With the new abstraction and commands, ZNS SSDs offer more control to the host software stack than… ▽ More

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: Paper to appear in the https://clustercomp.org/2023/program/

  24. arXiv:2310.12248  [pdf, other

    cs.LG cs.LO

    A PAC Learning Algorithm for LTL and Omega-regular Objectives in MDPs

    Authors: Mateo Perez, Fabio Somenzi, Ashutosh Trivedi

    Abstract: Linear temporal logic (LTL) and omega-regular objectives -- a superset of LTL -- have seen recent use as a way to express non-Markovian objectives in reinforcement learning. We introduce a model-based probably approximately correct (PAC) learning algorithm for omega-regular objectives in Markov decision processes (MDPs). As part of the development of our algorithm, we introduce the epsilon-recurre… ▽ More

    Submitted 20 February, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

  25. arXiv:2310.08797  [pdf, other

    cs.CL cs.AI

    A Comparative Analysis of Task-Agnostic Distillation Methods for Compressing Transformer Language Models

    Authors: Takuma Udagawa, Aashka Trivedi, Michele Merler, Bishwaranjan Bhattacharjee

    Abstract: Large language models have become a vital component in modern NLP, achieving state of the art performance in a variety of tasks. However, they are often inefficient for real-world deployment due to their expensive inference costs. Knowledge distillation is a promising technique to improve their efficiency while retaining most of their effectiveness. In this paper, we reproduce, compare and analyze… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP 2023 Industry Track

  26. arXiv:2310.02880  [pdf, other

    cs.OS

    Persistent Memory File Systems: A Survey

    Authors: Wiebe van Breukelen, Animesh Trivedi

    Abstract: Persistent Memory (PM) is non-volatile byte-addressable memory that offers read and write latencies in the order of magnitude smaller than flash storage, such as SSDs. This survey discusses how file systems address the most prominent challenges in the implementation of file systems for Persistent Memory. First, we discuss how the properties of Persistent Memory change file system design. Second, w… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  27. arXiv:2309.11048  [pdf, other

    cs.LG

    Containing Analog Data Deluge at Edge through Frequency-Domain Compression in Collaborative Compute-in-Memory Networks

    Authors: Nastaran Darabi, Amit R. Trivedi

    Abstract: Edge computing is a promising solution for handling high-dimensional, multispectral analog data from sensors and IoT devices for applications such as autonomous drones. However, edge devices' limited storage and computing resources make it challenging to perform complex predictive modeling at the edge. Compute-in-memory (CiM) has emerged as a principal paradigm to minimize energy for deep learning… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: arXiv admin note: text overlap with arXiv:2307.03863, arXiv:2309.01771

  28. arXiv:2309.11018  [pdf, other

    cs.LG cs.CV cs.RO

    Conformalized Multimodal Uncertainty Regression and Reasoning

    Authors: Domenico Parente, Nastaran Darabi, Alex C. Stutts, Theja Tulabandhula, Amit Ranjan Trivedi

    Abstract: This paper introduces a lightweight uncertainty estimator capable of predicting multimodal (disjoint) uncertainty bounds by integrating conformal prediction with a deep-learning regressor. We specifically discuss its application for visual odometry (VO), where environmental features such as flying domain symmetries and sensor measurements under ambiguities and occlusion can result in multimodal un… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

  29. arXiv:2309.11006  [pdf, other

    cs.RO cs.CV

    STARNet: Sensor Trustworthiness and Anomaly Recognition via Approximated Likelihood Regret for Robust Edge Autonomy

    Authors: Nastaran Darabi, Sina Tayebati, Sureshkumar S., Sathya Ravi, Theja Tulabandhula, Amit R. Trivedi

    Abstract: Complex sensors such as LiDAR, RADAR, and event cameras have proliferated in autonomous robotics to enhance perception and understanding of the environment. Meanwhile, these sensors are also vulnerable to diverse failure mechanisms that can intricately interact with their operation environment. In parallel, the limited availability of training data on complex sensors also affects the reliability o… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

  30. arXiv:2309.09593  [pdf, other

    cs.CV cs.IT cs.RO

    Mutual Information-calibrated Conformal Feature Fusion for Uncertainty-Aware Multimodal 3D Object Detection at the Edge

    Authors: Alex C. Stutts, Danilo Erricolo, Sathya Ravi, Theja Tulabandhula, Amit Ranjan Trivedi

    Abstract: In the expanding landscape of AI-enabled robotics, robust quantification of predictive uncertainties is of great importance. Three-dimensional (3D) object detection, a critical robotics operation, has seen significant advancements; however, the majority of current works focus only on accuracy and ignore uncertainty quantification. Addressing this gap, our novel study integrates the principles of c… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

  31. arXiv:2309.01771  [pdf, other

    cs.AR cs.LG

    ADC/DAC-Free Analog Acceleration of Deep Neural Networks with Frequency Transformation

    Authors: Nastaran Darabi, Maeesha Binte Hashem, Hongyi Pan, Ahmet Cetin, Wilfred Gomes, Amit Ranjan Trivedi

    Abstract: The edge processing of deep neural networks (DNNs) is becoming increasingly important due to its ability to extract valuable information directly at the data source to minimize latency and energy consumption. Frequency-domain model compression, such as with the Walsh-Hadamard transform (WHT), has been identified as an efficient alternative. However, the benefits of frequency-domain processing are… ▽ More

    Submitted 4 September, 2023; originally announced September 2023.

  32. arXiv:2308.07469  [pdf, other

    cs.LG cs.AI cs.FL

    Omega-Regular Reward Machines

    Authors: Ernst Moritz Hahn, Mateo Perez, Sven Schewe, Fabio Somenzi, Ashutosh Trivedi, Dominik Wojtczak

    Abstract: Reinforcement learning (RL) is a powerful approach for training agents to perform tasks, but designing an appropriate reward mechanism is critical to its success. However, in many cases, the complexity of the learning objectives goes beyond the capabilities of the Markovian assumption, necessitating a more sophisticated reward mechanism. Reward machines and omega-regular languages are two formalis… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: To appear in ECAI-2023

  33. arXiv:2307.11866  [pdf, other

    cs.OS

    A Survey on the Integration of NAND Flash Storage in the Design of File Systems and the Host Storage Software Stack

    Authors: Nick Tehrany, Krijn Doekemeijer, Animesh Trivedi

    Abstract: With the ever-increasing amount of data generate in the world, estimated to reach over 200 Zettabytes by 2025, pressure on efficient data storage systems is intensifying. The shift from HDD to flash-based SSD provides one of the most fundamental shifts in storage technology, increasing performance capabilities significantly. However, flash storage comes with different characteristics than prior HD… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

  34. arXiv:2307.11860  [pdf, other

    cs.OS

    Understanding (Un)Written Contracts of NVMe ZNS Devices with zns-tools

    Authors: Nick Tehrany, Krijn Doekemeijer, Animesh Trivedi

    Abstract: Operational and performance characteristics of flash SSDs have long been associated with a set of Unwritten Contracts due to their hidden, complex internals and lack of control from the host software stack. These unwritten contracts govern how data should be stored, accessed, and garbage collected. The emergence of Zoned Namespace (ZNS) flash devices with their open and standardized interface allo… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

  35. arXiv:2307.07631  [pdf, other

    cs.LG

    Towards Model-Size Agnostic, Compute-Free, Memorization-based Inference of Deep Learning

    Authors: Davide Giacomini, Maeesha Binte Hashem, Jeremiah Suarez, Swarup Bhunia, Amit Ranjan Trivedi

    Abstract: The rapid advancement of deep neural networks has significantly improved various tasks, such as image and speech recognition. However, as the complexity of these models increases, so does the computational cost and the number of parameters, making it difficult to deploy them on resource-constrained devices. This paper proposes a novel memorization-based inference (MBI) that is compute free and onl… ▽ More

    Submitted 14 July, 2023; originally announced July 2023.

  36. arXiv:2307.03863  [pdf, other

    cs.AR cs.LG

    Memory-Immersed Collaborative Digitization for Area-Efficient Compute-in-Memory Deep Learning

    Authors: Shamma Nasrin, Maeesha Binte Hashem, Nastaran Darabi, Benjamin Parpillon, Farah Fahim, Wilfred Gomes, Amit Ranjan Trivedi

    Abstract: This work discusses memory-immersed collaborative digitization among compute-in-memory (CiM) arrays to minimize the area overheads of a conventional analog-to-digital converter (ADC) for deep learning inference. Thereby, using the proposed scheme, significantly more CiM arrays can be accommodated within limited footprint designs to improve parallelism and minimize external memory accesses. Under t… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

  37. arXiv:2305.17519  [pdf, ps, other

    cs.LO eess.SY

    Closure Certificates

    Authors: Vishnu Murali, Ashutosh Trivedi, Majid Zamani

    Abstract: A barrier certificate, defined over the states of a dynamical system, is a real-valued function whose zero level set characterizes an inductively verifiable state invariant separating reachable states from unsafe ones. When combined with powerful decision procedures such as sum-of-squares programming (SOS) or satisfiability-modulo-theory solvers (SMT) barrier certificates enable an automated deduc… ▽ More

    Submitted 5 March, 2024; v1 submitted 27 May, 2023; originally announced May 2023.

    Comments: 14 pages, 5 figures. To appear in 27th ACM International Conference on Hybrid Systems: Computation and Control Hong-Kong, 13-16 May 2024

  38. arXiv:2305.17115  [pdf, other

    cs.LO cs.LG

    Policy Synthesis and Reinforcement Learning for Discounted LTL

    Authors: Rajeev Alur, Osbert Bastani, Kishor Jothimurugan, Mateo Perez, Fabio Somenzi, Ashutosh Trivedi

    Abstract: The difficulty of manually specifying reward functions has led to an interest in using linear temporal logic (LTL) to express objectives for reinforcement learning (RL). However, LTL has the downside that it is sensitive to small perturbations in the transition probabilities, which prevents probably approximately correct (PAC) learning without additional assumptions. Time discounting provides a wa… ▽ More

    Submitted 29 May, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

  39. arXiv:2304.04199  [pdf, other

    cs.SE cs.LG

    Information-Theoretic Testing and Debugging of Fairness Defects in Deep Neural Networks

    Authors: Verya Monjezi, Ashutosh Trivedi, Gang Tan, Saeid Tizpaz-Niari

    Abstract: The deep feedforward neural networks (DNNs) are increasingly deployed in socioeconomic critical decision support software systems. DNNs are exceptionally good at finding minimal, sufficient statistical patterns within their training data. Consequently, DNNs may learn to encode decisions -- amplifying existing biases or introducing new ones -- that may disadvantage protected individuals/groups and… ▽ More

    Submitted 9 April, 2023; originally announced April 2023.

    Comments: 2023 IEEE/ACM 45th International Conference on Software Engineering (ICSE 2023)

  40. arXiv:2303.09639  [pdf, other

    cs.CL

    Neural Architecture Search for Effective Teacher-Student Knowledge Transfer in Language Models

    Authors: Aashka Trivedi, Takuma Udagawa, Michele Merler, Rameswar Panda, Yousef El-Kurdi, Bishwaranjan Bhattacharjee

    Abstract: Large pretrained language models have achieved state-of-the-art results on a variety of downstream tasks. Knowledge Distillation (KD) into a smaller student model addresses their inefficiency, allowing for deployment in resource-constrained environments. However, KD can be ineffective when the student is manually selected from a set of existing options, since it can be a sub-optimal choice within… ▽ More

    Submitted 13 October, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

    Comments: 11 pages, 5 figures

  41. arXiv:2303.09528  [pdf, ps, other

    cs.LG cs.AI math.OC

    Reinforcement Learning for Omega-Regular Specifications on Continuous-Time MDP

    Authors: Amin Falah, Shibashis Guha, Ashutosh Trivedi

    Abstract: Continuous-time Markov decision processes (CTMDPs) are canonical models to express sequential decision-making under dense-time and stochastic environments. When the stochastic evolution of the environment is only available via sampling, model-free reinforcement learning (RL) is the algorithm-of-choice to compute optimal decision sequence. RL, on the other hand, requires the learning objective to b… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: Full version of paper accepted to ICAPS 2023

  42. arXiv:2303.03739  [pdf, other

    cs.RO

    Path Planning Under Uncertainty to Localize mmWave Sources

    Authors: Kai Pfeiffer, Yuze Jia, Mingsheng Yin, Akshaj Kumar Veldanda, Yaqi Hu, Amee Trivedi, Jeff Zhang, Siddharth Garg, Elza Erkip, Sundeep Rangan, Ludovic Righetti

    Abstract: In this paper, we study a navigation problem where a mobile robot needs to locate a mmWave wireless signal. Using the directionality properties of the signal, we propose an estimation and path planning algorithm that can efficiently navigate in cluttered indoor environments. We formulate Extended Kalman filters for emitter location estimation in cases where the signal is received in line-of-sight… ▽ More

    Submitted 8 March, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

  43. arXiv:2303.02207  [pdf, other

    cs.CV cs.AI cs.LG cs.RO eess.IV

    Lightweight, Uncertainty-Aware Conformalized Visual Odometry

    Authors: Alex C. Stutts, Danilo Erricolo, Theja Tulabandhula, Amit Ranjan Trivedi

    Abstract: Data-driven visual odometry (VO) is a critical subroutine for autonomous edge robotics, and recent progress in the field has produced highly accurate point predictions in complex environments. However, emerging autonomous edge robotics devices like insect-scale drones and surgical robots lack a computationally efficient framework to estimate VO's predictive uncertainties. Meanwhile, as edge roboti… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

  44. arXiv:2302.14176  [pdf, other

    cs.AI cs.CE math.OC

    Reinforcement Learning with Depreciating Assets

    Authors: Taylor Dohmen, Ashutosh Trivedi

    Abstract: A basic assumption of traditional reinforcement learning is that the value of a reward does not change once it is received by an agent. The present work forgoes this assumption and considers the situation where the value of a reward decays proportionally to the time elapsed since it was obtained. Emphasizing the inflection point occurring at the time of payment, we use the term asset to refer to a… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: Full version of extended abstract appearing in the proceedings of AAMAS 2023

  45. arXiv:2301.06727  [pdf

    cs.ET physics.app-ph

    Roadmap for Unconventional Computing with Nanotechnology

    Authors: Giovanni Finocchio, Jean Anne C. Incorvia, Joseph S. Friedman, Qu Yang, Anna Giordano, Julie Grollier, Hyunsoo Yang, Florin Ciubotaru, Andrii Chumak, Azad J. Naeemi, Sorin D. Cotofana, Riccardo Tomasello, Christos Panagopoulos, Mario Carpentieri, Peng Lin, Gang Pan, J. Joshua Yang, Aida Todri-Sanial, Gabriele Boschetto, Kremena Makasheva, Vinod K. Sangwan, Amit Ranjan Trivedi, Mark C. Hersam, Kerem Y. Camsari, Peter L. McMahon , et al. (26 additional authors not shown)

    Abstract: In the "Beyond Moore's Law" era, with increasing edge intelligence, domain-specific computing embracing unconventional approaches will become increasingly prevalent. At the same time, adopting a variety of nanotechnologies will offer benefits in energy cost, computational speed, reduced footprint, cyber resilience, and processing power. The time is ripe for a roadmap for unconventional computing w… ▽ More

    Submitted 27 February, 2024; v1 submitted 17 January, 2023; originally announced January 2023.

    Comments: 80 pages accepted in Nano Futures

    Journal ref: Nano Futures (2024)

  46. arXiv:2211.17217  [pdf, ps, other

    eess.SY cs.LG

    A Tutorial on Neural Networks and Gradient-free Training

    Authors: Turibius Rozario, Arjun Trivedi, Ankit Goel

    Abstract: This paper presents a compact, matrix-based representation of neural networks in a self-contained tutorial fashion. Specifically, we develop neural networks as a composition of several vector-valued functions. Although neural networks are well-understood pictorially in terms of interconnected neurons, neural networks are mathematical nonlinear functions constructed by composing several vector-valu… ▽ More

    Submitted 26 November, 2022; originally announced November 2022.

    Comments: Submitted to 2023 American Control Conference. Contains 8 pages, 10 figures, and 3 tables

  47. arXiv:2210.15559  [pdf, other

    cs.CV cs.AI cs.LG cs.RO eess.IV

    Robust Monocular Localization of Drones by Adapting Domain Maps to Depth Prediction Inaccuracies

    Authors: Priyesh Shukla, Sureshkumar S., Alex C. Stutts, Sathya Ravi, Theja Tulabandhula, Amit R. Trivedi

    Abstract: We present a novel monocular localization framework by jointly training deep learning-based depth prediction and Bayesian filtering-based pose reasoning. The proposed cross-modal framework significantly outperforms deep learning-only predictions with respect to model scalability and tolerance to environmental variations. Specifically, we show little-to-no degradation of pose accuracy even with ext… ▽ More

    Submitted 27 October, 2022; originally announced October 2022.

  48. arXiv:2209.05448  [pdf, other

    cs.FL

    Composing Copyless Streaming String Transducers

    Authors: Rajeev Alur, Taylor Dohmen, Ashutosh Trivedi

    Abstract: Streaming string transducers (SSTs) implement string-to-string transformations by reading each input word in a single left-to-right pass while maintaining fragments of potential outputs in a finite set of string variables. These variables get updated on transitions of the transducer, where they can be assigned new values described by concatenations of variables and output symbols. An SST is called… ▽ More

    Submitted 7 February, 2024; v1 submitted 12 September, 2022; originally announced September 2022.

  49. arXiv:2207.13416  [pdf, other

    cs.FL

    Optimal Repair For Omega-regular Properties

    Authors: Vrunda Dave, Shankara Narayanan Krishna, Vishnu Murali, Ashutosh Trivedi

    Abstract: This paper presents an optimization based framework to automate system repair against omega-regular properties. In the proposed formalization of optimal repair, the systems are represented as Kripke structures, the properties as $ω$-regular languages, and the repair space as repair machines -- weighted omega-regular transducers equipped with Büchi conditions -- that rewrite strings and associate a… ▽ More

    Submitted 27 July, 2022; originally announced July 2022.

    Comments: 24 pages, 7 page appendix, 4 Tikz figures, 1 PNG figure, to appear in The 20th International Symposium on Automated Technology for Verification and Analysis (ATVA) 2022

  50. arXiv:2207.04159  [pdf, other

    cs.DC

    The SPEC-RG Reference Architecture for the Compute Continuum

    Authors: Matthijs Jansen, Auday Al-Dulaimy, Alessandro V. Papadopoulos, Animesh Trivedi, Alexandru Iosup

    Abstract: As the next generation of diverse workloads like autonomous driving and augmented/virtual reality evolves, computation is shifting from cloud-based services to the edge, leading to the emergence of a cloud-edge compute continuum. This continuum promises a wide spectrum of deployment opportunities for workloads that can leverage the strengths of cloud (scalable infrastructure, high reliability) and… ▽ More

    Submitted 2 March, 2023; v1 submitted 8 July, 2022; originally announced July 2022.

    Comments: 14 pages, SPEC-RG technical report