Showing 1–50 of 63 results for author: Thakur, A

Searching in archive cs.
  1. arXiv:2408.05316  [pdf, other]

    cond-mat.mtrl-sci cs.LO

    Towards Verifying Exact Conditions of Density Functional Theory Approximations

    Authors: Sameerah Helal, Zhe Tao, Cindy Rubio-González, Francois Gygi, Aditya V. Thakur

    Abstract: Density Functional Theory (DFT) is used extensively in the computation of electronic properties of matter, with various applications. Approximating the exchange-correlation (XC) functional is the key to the Kohn-Sham DFT approach, the basis of most DFT calculations. The choice of this density functional approximation (DFA) depends crucially on the particular system under study, which has resulted…

    Submitted 12 August, 2024; v1 submitted 9 August, 2024; originally announced August 2024.

  2. arXiv:2407.11806  [pdf, other]

    cs.CR

    MaskedHLS: Domain-Specific High-Level Synthesis of Masked Cryptographic Designs

    Authors: Nilotpola Sarma, Anuj Singh Thakur, Chandan Karfa

    Abstract: The design and synthesis of masked cryptographic hardware implementations that are secure against power side-channel attacks (PSCAs) in the presence of glitches is a challenging task. High-Level Synthesis (HLS) is a promising technique for generating masked hardware directly from masked software, offering opportunities for design space exploration. However, conventional HLS tools make modification…

    Submitted 16 July, 2024; originally announced July 2024.

  3. arXiv:2407.11214  [pdf, ps, other]

    cs.AI cs.CL cs.LG cs.LO cs.PL

    PutnamBench: Evaluating Neural Theorem-Provers on the Putnam Mathematical Competition

    Authors: George Tsoukalas, Jasper Lee, John Jennings, Jimmy Xin, Michelle Ding, Michael Jennings, Amitayush Thakur, Swarat Chaudhuri

    Abstract: We present PutnamBench, a new multilingual benchmark for evaluating the ability of neural theorem-provers to solve competition mathematics problems. PutnamBench consists of 1697 hand-constructed formalizations of 640 theorems sourced from the William Lowell Putnam Mathematical Competition, the premier undergraduate-level mathematics competition in North America. All the theorems have formalization…

    Submitted 15 July, 2024; originally announced July 2024.

  4. arXiv:2406.12624  [pdf, other]

    cs.CL cs.AI

    Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges

    Authors: Aman Singh Thakur, Kartik Choudhary, Venkat Srinik Ramayapally, Sankaran Vaidyanathan, Dieuwke Hupkes

    Abstract: Offering a promising solution to the scalability challenges associated with human evaluation, the LLM-as-a-judge paradigm is rapidly gaining traction as an approach to evaluating large language models (LLMs). However, there are still many open questions about the strengths and weaknesses of this paradigm, and what potential biases it may hold. In this paper, we present a comprehensive study of the…

    Submitted 1 July, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  5. arXiv:2405.15795  [pdf, ps, other]

    cs.NE cs.DC

    D-CODE: Data Colony Optimization for Dynamic Network Efficiency

    Authors: Tannu Pandey, Ayush Thakur

    Abstract: The paper introduces D-CODE, a new framework blending Data Colony Optimization (DCO) algorithms inspired by biological colonies' collective behaviours with Dynamic Efficiency (DE) models for real-time adaptation. DCO utilizes metaheuristic strategies from ant colonies, bee swarms, and fungal networks to efficiently explore complex data landscapes, while DE enables continuous resource recalibration…

    Submitted 8 May, 2024; originally announced May 2024.

  6. arXiv:2405.00716  [pdf, other]

    cs.CL cs.AI

    Large Language Models in the Clinic: A Comprehensive Benchmark

    Authors: Andrew Liu, Hongjian Zhou, Yining Hua, Omid Rohanian, Anshul Thakur, Lei Clifton, David A. Clifton

    Abstract: The adoption of large language models (LLMs) to assist clinicians has attracted remarkable attention. Existing works mainly adopt the close-ended question-answering (QA) task with answer options for evaluation. However, many clinical decisions involve answering open-ended questions without pre-set options. To better understand LLMs in the clinic, we construct a benchmark ClinicBench. We first coll…

    Submitted 26 June, 2024; v1 submitted 25 April, 2024; originally announced May 2024.

  7. arXiv:2405.00004  [pdf, other]

    cs.DC

    Self-healing Nodes with Adaptive Data-Sharding

    Authors: Ayush Thakur, Sanskar Chauhan, Ilisha Tomar, Vaibhavi Paul, Deepak Gupta

    Abstract: Data sharding, a technique for partitioning and distributing data among multiple servers or nodes, offers enhancements in the scalability, performance, and fault tolerance of extensive distributed systems. Nonetheless, this strategy introduces novel challenges, including load balancing among shards, management of node failures and data loss, and adaptation to evolving data and workload patterns. T…

    Submitted 19 January, 2024; originally announced May 2024.

  8. arXiv:2404.15731  [pdf, other]

    cs.LG

    MD-NOMAD: Mixture density nonlinear manifold decoder for emulating stochastic differential equations and uncertainty propagation

    Authors: Akshay Thakur, Souvik Chakraborty

    Abstract: We propose a neural operator framework, termed mixture density nonlinear manifold decoder (MD-NOMAD), for stochastic simulators. Our approach leverages an amalgamation of the pointwise operator learning neural architecture nonlinear manifold decoder (NOMAD) with mixture density-based methods to estimate conditional probability distributions for stochastic output functions. MD-NOMAD harnesses the a…

    Submitted 24 April, 2024; originally announced April 2024.

  9. arXiv:2404.08940  [pdf, other]

    cs.IR cs.CL cs.LG

    Introducing Super RAGs in Mistral 8x7B-v1

    Authors: Ayush Thakur, Raghav Gupta

    Abstract: The relentless pursuit of enhancing Large Language Models (LLMs) has led to the advent of Super Retrieval-Augmented Generation (Super RAGs), a novel approach designed to elevate the performance of LLMs by integrating external knowledge sources with minimal structural modifications. This paper presents the integration of Super RAGs into the Mistral 8x7B v1, a state-of-the-art LLM, and examines the…

    Submitted 13 April, 2024; originally announced April 2024.

  10. arXiv:2403.16024  [pdf, other]

    cs.LG cs.CV cs.GR

    A Unified Module for Accelerating STABLE-DIFFUSION: LCM-LORA

    Authors: Ayush Thakur, Rashmi Vashisth

    Abstract: This paper presents a comprehensive study on the unified module for accelerating stable-diffusion processes, specifically focusing on the lcm-lora module. Stable-diffusion processes play a crucial role in various scientific and engineering domains, and their acceleration is of paramount importance for efficient computational performance. The standard iterative procedures for solving fixed-source d…

    Submitted 24 March, 2024; originally announced March 2024.

  11. arXiv:2403.15450  [pdf, other]

    cs.CL cs.IR

    Loops On Retrieval Augmented Generation (LoRAG)

    Authors: Ayush Thakur, Rashmi Vashisth

    Abstract: This paper presents Loops On Retrieval Augmented Generation (LoRAG), a new framework designed to enhance the quality of retrieval-augmented text generation through the incorporation of an iterative loop mechanism. The architecture integrates a generative model, a retrieval mechanism, and a dynamic loop module, allowing for iterative refinement of the generated text through interactions with releva…

    Submitted 18 March, 2024; originally announced March 2024.

  12. arXiv:2403.08812  [pdf, other]

    cs.HC cs.GR cs.LG

    Gore Diffusion LoRA Model

    Authors: Ayush Thakur, Ashwani Kumar Dubey

    Abstract: The Emergence of Artificial Intelligence (AI) has significantly impacted our engagement with violence, sparking ethical deliberations regarding the algorithmic creation of violent imagery. This paper scrutinizes the "Gore Diffusion LoRA Model," an innovative AI model proficient in generating hyper-realistic visuals portraying intense violence and bloodshed. Our exploration encompasses the model's…

    Submitted 9 February, 2024; originally announced March 2024.

  13. arXiv:2403.08261  [pdf, other]

    cs.CV cs.AI eess.IV

    CoroNetGAN: Controlled Pruning of GANs via Hypernetworks

    Authors: Aman Kumar, Khushboo Anand, Shubham Mandloi, Ashutosh Mishra, Avinash Thakur, Neeraj Kasera, Prathosh A P

    Abstract: Generative Adversarial Networks (GANs) have proven to exhibit remarkable performance and are widely used across many generative computer vision applications. However, the unprecedented demand for the deployment of GANs on resource-constrained edge devices still poses a challenge due to huge number of parameters involved in the generation process. This has led to focused attention on the area of co…

    Submitted 13 March, 2024; originally announced March 2024.

  14. arXiv:2403.06895  [pdf, other]

    cs.CV

    GRITv2: Efficient and Light-weight Social Relation Recognition

    Authors: N K Sagar Reddy, Neeraj Kasera, Avinash Thakur

    Abstract: Our research focuses on the analysis and improvement of the Graph-based Relation Inference Transformer (GRIT), which serves as an important benchmark in the field. We conduct a comprehensive ablation study using the PISC-fine dataset, to find and explore improvement in efficiency and performance of GRITv2. Our research has provided a new state-of-the-art relation recognition model on the PISC rela…

    Submitted 11 March, 2024; originally announced March 2024.

  15. arXiv:2311.02010  [pdf, other]

    cs.CY

    A cast of thousands: How the IDEAS Productivity project has advanced software productivity and sustainability

    Authors: Lois Curfman McInnes, Michael Heroux, David E. Bernholdt, Anshu Dubey, Elsa Gonsiorowski, Rinku Gupta, Osni Marques, J. David Moulton, Hai Ah Nam, Boyana Norris, Elaine M. Raybourn, Jim Willenbring, Ann Almgren, Ross Bartlett, Kita Cranfill, Stephen Fickas, Don Frederick, William Godoy, Patricia Grubel, Rebecca Hartman-Baker, Axel Huebl, Rose Lynch, Addi Malviya Thakur, Reed Milewicz, Mark C. Miller, et al. (9 additional authors not shown)

    Abstract: Computational and data-enabled science and engineering are revolutionizing advances throughout science and society, at all scales of computing. For example, teams in the U.S. DOE Exascale Computing Project have been tackling new frontiers in modeling, simulation, and analysis by exploiting unprecedented exascale computing capabilities-building an advanced software ecosystem that supports next-gene…

    Submitted 16 February, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

    Comments: 12 pages, 1 figure

  16. arXiv:2310.04353  [pdf, other]

    cs.LG cs.AI cs.LO cs.PL

    An In-Context Learning Agent for Formal Theorem-Proving

    Authors: Amitayush Thakur, George Tsoukalas, Yeming Wen, Jimmy Xin, Swarat Chaudhuri

    Abstract: We present an in-context learning agent for formal theorem-proving in environments like Lean and Coq. Current state-of-the-art models for the problem are finetuned on environment-specific proof data. By contrast, our approach, called COPRA, repeatedly asks a high-capacity, general-purpose large language model (GPT-4) to propose tactic applications from within a stateful backtracking search. Propos…

    Submitted 8 August, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

  17. arXiv:2309.13716  [pdf, other]

    cs.CV eess.IV

    MOSAIC: Multi-Object Segmented Arbitrary Stylization Using CLIP

    Authors: Prajwal Ganugula, Y S S S Santosh Kumar, N K Sagar Reddy, Prabhath Chellingi, Avinash Thakur, Neeraj Kasera, C Shyam Anand

    Abstract: Style transfer driven by text prompts paved a new path for creatively stylizing the images without collecting an actual style image. Despite having promising results, with text-driven stylization, the user has no control over the stylization. If a user wants to create an artistic image, the user requires fine control over the stylization of various entities individually in the content image, which…

    Submitted 24 September, 2023; originally announced September 2023.

    Comments: Camera ready, New Ideas in Vision Transformers workshop, ICCV 2023

  18. arXiv:2306.12100  [pdf, other]

    cs.CV cs.LG

    Efficient ResNets: Residual Network Design

    Authors: Aditya Thakur, Harish Chauhan, Nikunj Gupta

    Abstract: ResNets (or Residual Networks) are one of the most commonly used models for image classification tasks. In this project, we design and train a modified ResNet model for CIFAR-10 image classification. In particular, we aimed at maximizing the test accuracy on the CIFAR-10 benchmark while keeping the size of our ResNet model under the specified fixed budget of 5 million trainable parameters. Model s…

    Submitted 21 June, 2023; originally announced June 2023.

  19. arXiv:2305.03711  [pdf, other]

    cs.LG cs.CY

    Medical records condensation: a roadmap towards healthcare data democratisation

    Authors: Yujiang Wang, Anshul Thakur, Mingzhi Dong, Pingchuan Ma, Stavros Petridis, Li Shang, Tingting Zhu, David A. Clifton

    Abstract: The prevalence of artificial intelligence (AI) has envisioned an era of healthcare democratisation that promises every stakeholder a new and better way of life. However, the advancement of clinical AI research is significantly hurdled by the dearth of data democratisation in healthcare. To truly democratise data for AI studies, challenges are two-fold: 1. the sensitive information in clinical data…

    Submitted 8 January, 2024; v1 submitted 5 May, 2023; originally announced May 2023.

  20. arXiv:2305.03710  [pdf, other]

    cs.LG cs.CR

    Data Encoding For Healthcare Data Democratisation and Information Leakage Prevention

    Authors: Anshul Thakur, Tingting Zhu, Vinayak Abrol, Jacob Armstrong, Yujiang Wang, David A. Clifton

    Abstract: The lack of data democratization and information leakage from trained models hinder the development and acceptance of robust deep learning-based healthcare solutions. This paper argues that irreversible data encoding can provide an effective solution to achieve data democratization without violating the privacy constraints imposed on healthcare data and clinical models. An ideal encoding framework…

    Submitted 5 May, 2023; originally announced May 2023.

  21. arXiv:2305.03219  [pdf]

    cs.LG stat.ME

    All models are local: time to replace external validation with recurrent local validation

    Authors: Alex Youssef, Michael Pencina, Anshul Thakur, Tingting Zhu, David Clifton, Nigam H. Shah

    Abstract: External validation is often recommended to ensure the generalizability of ML models. However, it neither guarantees generalizability nor equates to a model's clinical usefulness (the ultimate goal of any clinical decision-support tool). External validation is misaligned with current healthcare ML needs. First, patient data changes across time, geography, and facilities. These changes create signi…

    Submitted 13 May, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

  22. Architecture-Preserving Provable Repair of Deep Neural Networks

    Authors: Zhe Tao, Stephanie Nawas, Jacqueline Mitchell, Aditya V. Thakur

    Abstract: Deep neural networks (DNNs) are becoming increasingly important components of software, and are considered the state-of-the-art solution for a number of problems, such as image recognition. However, DNNs are far from infallible, and incorrect behavior of DNNs can have disastrous real-world consequences. This paper addresses the problem of architecture-preserving V-polytope provable repair of DNNs.…

    Submitted 16 August, 2023; v1 submitted 7 April, 2023; originally announced April 2023.

    Comments: Accepted paper at PLDI 2023. Tool is available at https://github.com/95616ARG/APRNN/

  23. arXiv:2302.03416  [pdf, other]

    cs.SE

    Just-in-Time Code Duplicates Extraction

    Authors: Eman Abdullah AlOmar, Anton Ivanov, Zarina Kurbatova, Yaroslav Golubev, Mohamed Wiem Mkaouer, Ali Ouni, Timofey Bryksin, Le Nguyen, Amit Kini, Aditya Thakur

    Abstract: Refactoring is a critical task in software maintenance, and is usually performed to enforce better design and coding practices, while coping with design defects. The Extract Method refactoring is widely used for merging duplicate code fragments into a single new method. Several studies attempted to recommend Extract Method refactoring opportunities using different techniques, including program sli…

    Submitted 7 February, 2023; originally announced February 2023.

    Comments: 32 pages, 9 figures

  24. arXiv:2212.05612  [pdf, other]

    cs.AI cs.CL cs.LG

    Multimodal and Explainable Internet Meme Classification

    Authors: Abhinav Kumar Thakur, Filip Ilievski, Hông-Ân Sandlin, Zhivar Sourati, Luca Luceri, Riccardo Tommasini, Alain Mermoud

    Abstract: In the current context where online platforms have been effectively weaponized in a variety of geo-political events and social issues, Internet memes make fair content moderation at scale even more difficult. Existing work on meme classification and tracking has focused on black-box methods that do not explicitly consider the semantics of the memes or the context of their creation. In this paper,…

    Submitted 6 April, 2023; v1 submitted 11 December, 2022; originally announced December 2022.

  25. Giving RSEs a Larger Stage through the Better Scientific Software Fellowship

    Authors: William F. Godoy, Ritu Arora, Keith Beattie, David E. Bernholdt, Sarah E. Bratt, Daniel S. Katz, Ignacio Laguna, Amiya K. Maji, Addi Malviya Thakur, Rafael M. Mudafort, Nitin Sukhija, Damian Rouson, Cindy Rubio-González, Karan Vahi

    Abstract: The Better Scientific Software Fellowship (BSSwF) was launched in 2018 to foster and promote practices, processes, and tools to improve developer productivity and software sustainability of scientific codes. BSSwF's vision is to grow the community with practitioners, leaders, mentors, and consultants to increase the visibility of scientific software production and sustainability. Over the last fiv…

    Submitted 14 November, 2022; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: submitted to Computing in Science & Engineering (CiSE), Special Issue on the Future of Research Software Engineers in the US

  26. arXiv:2210.10530  [pdf, other]

    cs.LG cs.AI stat.ME

    Adversarial De-confounding in Individualised Treatment Effects Estimation

    Authors: Vinod Kumar Chauhan, Soheila Molaei, Marzia Hoque Tania, Anshul Thakur, Tingting Zhu, David A. Clifton

    Abstract: Observational studies have recently received significant attention from the machine learning community due to the increasingly available non-experimental observational data and the limitations of the experimental studies, such as considerable cost, impracticality, small and less representative sample sizes, etc. In observational studies, de-confounding is a fundamental problem of individualised tr…

    Submitted 24 January, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: accepted to AISTATS 2023

  27. arXiv:2210.01970  [pdf, other]

    cs.LG

    Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements

    Authors: Leandro von Werra, Lewis Tunstall, Abhishek Thakur, Alexandra Sasha Luccioni, Tristan Thrush, Aleksandra Piktus, Felix Marty, Nazneen Rajani, Victor Mustar, Helen Ngo, Omar Sanseviero, Mario Šaško, Albert Villanova, Quentin Lhoest, Julien Chaumond, Margaret Mitchell, Alexander M. Rush, Thomas Wolf, Douwe Kiela

    Abstract: Evaluation is a key part of machine learning (ML), yet there is a lack of support and tooling to enable its informed and systematic practice. We introduce Evaluate and Evaluation on the Hub --a set of tools to facilitate the evaluation of models and datasets in ML. Evaluate is a library to support best practices for measurements, metrics, and comparisons of data and models. Its goal is to support…

    Submitted 6 October, 2022; v1 submitted 30 September, 2022; originally announced October 2022.

  28. arXiv:2208.05606  [pdf, other]

    cs.LG physics.comp-ph

    Multi-fidelity wavelet neural operator with application to uncertainty quantification

    Authors: Akshay Thakur, Tapas Tripura, Souvik Chakraborty

    Abstract: Operator learning frameworks, because of their ability to learn nonlinear maps between two infinite dimensional functional spaces and utilization of neural networks in doing so, have recently emerged as one of the more pertinent areas in the field of applied machine learning. Although these frameworks are extremely capable when it comes to modeling complex phenomena, they require an extensive amou…

    Submitted 28 July, 2023; v1 submitted 10 August, 2022; originally announced August 2022.

  29. COPER: Continuous Patient State Perceiver

    Authors: Vinod Kumar Chauhan, Anshul Thakur, Odhran O'Donoghue, David A. Clifton

    Abstract: In electronic health records (EHRs), irregular time-series (ITS) occur naturally due to patient health dynamics, reflected by irregular hospital visits, diseases/conditions and the necessity to measure different vitals signs at each visit etc. ITS present challenges in training machine learning algorithms which mostly are built on assumption of coherent fixed dimensional feature space. In this pap…

    Submitted 24 November, 2022; v1 submitted 5 August, 2022; originally announced August 2022.

    Comments: 2 figures; presented in IEEE International Conference on Biomedical and Health Informatics (IEEE BHI-2022)

  30. arXiv:2206.12681  [pdf, other]

    cs.CV

    UltraMNIST Classification: A Benchmark to Train CNNs for Very Large Images

    Authors: Deepak K. Gupta, Udbhav Bamba, Abhishek Thakur, Akash Gupta, Suraj Sharan, Ertugrul Demir, Dilip K. Prasad

    Abstract: Convolutional neural network (CNN) approaches available in the current literature are designed to work primarily with low-resolution images. When applied on very large images, challenges related to GPU memory, smaller receptive field than needed for semantic correspondence and the need to incorporate multi-scale features arise. The resolution of input images can be reduced, however, with significa…

    Submitted 25 June, 2022; originally announced June 2022.

  31. arXiv:2204.02573  [pdf]

    cs.CV cs.AI

    Detecting key Soccer match events to create highlights using Computer Vision

    Authors: Narayana Darapaneni, Prashant Kumar, Nikhil Malhotra, Vigneswaran Sundaramurthy, Abhaya Thakur, Shivam Chauhan, Krishna Chaitanya Thangeda, Anwesh Reddy Paduri

    Abstract: The research and data science community has been fascinated with the development of automatic systems for the detection of key events in a video. Special attention in this field is given to sports video analytics which could help in identifying key events during a match and help in preparing a strategy for the games going forward. For this paper, we have chosen Football (soccer) as a sport where w…

    Submitted 6 April, 2022; originally announced April 2022.

  32. arXiv:2201.07753  [pdf, other]

    stat.ML cs.LG

    Deep Capsule Encoder-Decoder Network for Surrogate Modeling and Uncertainty Quantification

    Authors: Akshay Thakur, Souvik Chakraborty

    Abstract: We propose a novel capsule based deep encoder-decoder model for surrogate modeling and uncertainty quantification of systems in mechanics from sparse data. The proposed framework is developed by adapting Capsule Network (CapsNet) architecture into image-to-image regression encoder-decoder network. Specifically, the aim is to exploit the benefits of CapsNet over convolution neural network…

    Submitted 19 January, 2022; originally announced January 2022.

    Comments: 18 pages

  33. arXiv:2201.07729  [pdf]

    cs.HC

    Ergonomics Integrated Design Methodology using Parameter Optimization, Computer-Aided Design, and Digital Human Modelling: A Case Study of a Cleaning Equipment

    Authors: Neelesh Kr. Sharma, Mayank Tiwari, Atul Thakur, Anindya K. Ganguli

    Abstract: Challenges of enhancing productivity by amplifying efficiency and man-machine compatibility of equipment can be achieved by adopting advanced technologies. This study aims to present and exemplify methodology for incorporating ergonomics pro-actively into the design using computer-aided design and digital human modeling-based analysis. The cleaning equipment is parametrized to detect the critical…

    Submitted 5 April, 2022; v1 submitted 19 January, 2022; originally announced January 2022.

    Comments: page count: 33; word count (Excluding references and abstract): 5413; abstract word count: 161; number of figures: 11; number of tables: 3

  34. arXiv:2112.15230  [pdf, other]

    cs.SE

    AntiCopyPaster: Extracting Code Duplicates As Soon As They Are Introduced in the IDE

    Authors: Eman Abdullah AlOmar, Anton Ivanov, Zarina Kurbatova, Yaroslav Golubev, Mohamed Wiem Mkaouer, Ali Ouni, Timofey Bryksin, Le Nguyen, Amit Kini, Aditya Thakur

    Abstract: We developed a plugin for IntelliJ IDEA called AntiCopyPaster, which tracks the pasting of code fragments inside the IDE and suggests the appropriate Extract Method refactoring to combat the propagation of duplicates. Unlike the existing approaches, our tool is integrated with the developer's workflow, and pro-actively recommends refactorings. Since not all code fragments need to be extracted, we…

    Submitted 2 September, 2022; v1 submitted 30 December, 2021; originally announced December 2021.

    Comments: 4 pages, 3 figures

  35. arXiv:2110.13809  [pdf, ps, other]

    cs.LG stat.ML

    A deep learning based surrogate model for stochastic simulators

    Authors: Akshay Thakur, Souvik Chakraborty

    Abstract: We propose a deep learning-based surrogate model for stochastic simulators. The basic idea is to use generative neural network to approximate the stochastic response. The challenge with such a framework resides in designing the network architecture and selecting loss-function suitable for stochastic response. While we utilize a simple feed-forward neural network, we propose to use conditional maxi…

    Submitted 24 October, 2021; originally announced October 2021.

  36. arXiv:2109.14076  [pdf, other]

    cs.CL cs.AI cs.LG

    RAFT: A Real-World Few-Shot Text Classification Benchmark

    Authors: Neel Alex, Eli Lifland, Lewis Tunstall, Abhishek Thakur, Pegah Maham, C. Jess Riedel, Emmie Hine, Carolyn Ashurst, Paul Sedille, Alexis Carlier, Michael Noetel, Andreas Stuhlmüller

    Abstract: Large pre-trained language models have shown promise for few-shot learning, completing text-based tasks given only a few task-specific examples. Will models soon solve classification tasks that have so far been reserved for human research assistants? Existing benchmarks are not designed to measure progress in applied settings, and so don't directly answer this question. The RAFT benchmark (Real-wo…

    Submitted 18 January, 2022; v1 submitted 28 September, 2021; originally announced September 2021.

    Comments: Dataset, submission instructions, code and leaderboard available at https://raft.elicit.org

  37. arXiv:2109.02846  [pdf, other]

    cs.CL

    Datasets: A Community Library for Natural Language Processing

    Authors: Quentin Lhoest, Albert Villanova del Moral, Yacine Jernite, Abhishek Thakur, Patrick von Platen, Suraj Patil, Julien Chaumond, Mariama Drame, Julien Plu, Lewis Tunstall, Joe Davison, Mario Šaško, Gunjan Chhablani, Bhavitvya Malik, Simon Brandeis, Teven Le Scao, Victor Sanh, Canwen Xu, Nicolas Patry, Angelina McMillan-Major, Philipp Schmid, Sylvain Gugger, Clément Delangue, Théo Matussière, Lysandre Debut, et al. (7 additional authors not shown)

    Abstract: The scale, variety, and quantity of publicly-available NLP datasets has grown rapidly as researchers propose new tasks, larger models, and novel benchmarks. Datasets is a community library for contemporary NLP designed to support this ecosystem. Datasets aims to standardize end-user interfaces, versioning, and documentation, while providing a lightweight front-end that behaves similarly for small…

    Submitted 6 September, 2021; originally announced September 2021.

    Comments: EMNLP Demo 2021

  38. arXiv:2108.04351   

    cs.CV cs.AI cs.LG

    Adversarial Open Domain Adaption Framework (AODA): Sketch-to-Photo Synthesis

    Authors: Amey Thakur, Mega Satish

    Abstract: This paper aims to demonstrate the efficiency of the Adversarial Open Domain Adaption framework for sketch-to-photo synthesis. The unsupervised open domain adaption for generating realistic photos from a hand-drawn sketch is challenging as there is no such sketch of that class for training data. The absence of learning supervision and the huge domain gap between both the freehand drawing and pictu…

    Submitted 19 August, 2021; v1 submitted 28 July, 2021; originally announced August 2021.

    Comments: This was an undergraduate research effort, and in retrospect, it isn't comprehensive enough

  39. White-Box Cartoonization Using An Extended GAN Framework

    Authors: Amey Thakur, Hasan Rizvi, Mega Satish

    Abstract: In the present study, we propose to implement a new framework for estimating generative models via an adversarial process to extend an existing GAN framework and develop a white-box controllable image cartoonization, which can generate high-quality cartooned images/videos from real-world photos and videos. The learning purposes of our system are based on three distinct representations: surface rep…

    Submitted 9 July, 2021; originally announced July 2021.

    Comments: 5 pages, 6 figures. International Journal of Engineering Applied Sciences and Technology, 2021

  40. Chat Room Using HTML, PHP, CSS, JS, AJAX

    Authors: Amey Thakur, Karan Dhiman

    Abstract: Earlier there was no mode of online communication between users. In big or small organizations communication between users posed a challenge. There was a requirement to record these communications and store the data for further evaluation. The idea is to automate the existing Simple Chat Room system and make the users utilize the software so that their valuable information is stored digitally and…

    Submitted 28 June, 2021; originally announced June 2021.

    Comments: 4 pages, 5 figures

    Journal ref: International Research Journal of Engineering and Technology (IRJET) Volume: 08 Issue: 06 | June 2021

  41. arXiv:2104.04413  [pdf, other]

    cs.LG

    Provable Repair of Deep Neural Networks

    Authors: Matthew Sotoudeh, Aditya V. Thakur

    Abstract: Deep Neural Networks (DNNs) have grown in popularity over the past decade and are now being used in safety-critical domains such as aircraft collision avoidance. This has motivated a large number of techniques for finding unsafe behavior in DNNs. In contrast, this paper tackles the problem of correcting a DNN once unsafe behavior is found. We introduce the provable repair problem, which is the pro…

    Submitted 24 April, 2021; v1 submitted 9 April, 2021; originally announced April 2021.

    Comments: Accepted paper at PLDI 2021. Tool will be available at https://github.com/95616ARG/PRDNN/

  42. arXiv:2103.11470  [pdf, other]

    cs.RO cs.AI

    NeBula: Quest for Robotic Autonomy in Challenging Environments; TEAM CoSTAR at the DARPA Subterranean Challenge

    Authors: Ali Agha, Kyohei Otsu, Benjamin Morrell, David D. Fan, Rohan Thakker, Angel Santamaria-Navarro, Sung-Kyun Kim, Amanda Bouman, Xianmei Lei, Jeffrey Edlund, Muhammad Fadhil Ginting, Kamak Ebadi, Matthew Anderson, Torkom Pailevanian, Edward Terry, Michael Wolf, Andrea Tagliabue, Tiago Stegun Vaquero, Matteo Palieri, Scott Tepsuporn, Yun Chang, Arash Kalantari, Fernando Chavez, Brett Lopez, Nobuhiro Funabiki, et al. (47 additional authors not shown)

    Abstract: This paper presents and discusses algorithms, hardware, and software architecture developed by the TEAM CoSTAR (Collaborative SubTerranean Autonomous Robots), competing in the DARPA Subterranean Challenge. Specifically, it presents the techniques utilized within the Tunnel (2019) and Urban (2020) competitions, where CoSTAR achieved 2nd and 1st place, respectively. We also discuss CoSTAR's demonstr…

    Submitted 18 October, 2021; v1 submitted 21 March, 2021; originally announced March 2021.

    Comments: For team website, see https://costar.jpl.nasa.gov/. Accepted for publication in the Journal of Field Robotics, 2021

  43. arXiv:2101.03263  [pdf, other]

    cs.LG cs.PL

    SyReNN: A Tool for Analyzing Deep Neural Networks

    Authors: Matthew Sotoudeh, Aditya V. Thakur

    Abstract: Deep Neural Networks (DNNs) are rapidly gaining popularity in a variety of important domains. Formally, DNNs are complicated vector-valued functions which come in a variety of sizes and applications. Unfortunately, modern DNNs have been shown to be vulnerable to a variety of attacks and buggy behavior. This has motivated recent work in formally analyzing the properties of such DNNs. This paper int…

    Submitted 8 January, 2021; originally announced January 2021.

    Comments: Accepted paper at TACAS 2021. Tool is available at https://github.com/95616ARG/SyReNN

  44. arXiv:2012.15247  [pdf, other]

    eess.IV cs.CV

    Automatic Polyp Segmentation using U-Net-ResNet50

    Authors: Saruar Alam, Nikhil Kumar Tomar, Aarati Thakur, Debesh Jha, Ashish Rauniyar

    Abstract: Polyps are the predecessors to colorectal cancer which is considered as one of the leading causes of cancer-related deaths worldwide. Colonoscopy is the standard procedure for the identification, localization, and removal of colorectal polyps. Due to variability in shape, size, and surrounding tissue similarity, colorectal polyps are often missed by the clinicians during colonoscopy. With the use…

    Submitted 30 December, 2020; originally announced December 2020.

  45. LOCUS: A Multi-Sensor Lidar-Centric Solution for High-Precision Odometry and 3D Mapping in Real-Time

    Authors: M. Palieri, B. Morrell, A Thakur, K. Ebadi, J. Nash, A. Chatterjee, C. Kanellakis, L. Carlone, C. Guaragnella, A. Agha-mohammadi

    Abstract: A reliable odometry source is a prerequisite to enable complex autonomy behaviour in next-generation robots operating in extreme environments. In this work, we present a high-precision lidar odometry system to achieve robust and real-time operation under challenging perceptual conditions. LOCUS (Lidar Odometry for Consistent operation in Uncertain Settings), provides an accurate multi-stage scan m…

    Submitted 28 December, 2020; originally announced December 2020.

    Comments: Accepted for publication at IEEE Robotics and Automation Letters, 2020

  46. arXiv:2012.11206  [pdf]

    cs.CR cs.NI

    Edge Computing in Transportation: Security Issues and Challenges

    Authors: Nikheel Soni, Reza Malekian, Arnav Thakur

    Abstract: As the amount of data that needs to be processed in real-time due to recent application developments increase, the need for a new computing paradigm is required. Edge computing resolves this issue by offloading computing resources required by intelligent transportation systems such as the Internet of Vehicles from the cloud closer to the end devices to improve performance however, it is susceptibl…

    Submitted 21 December, 2020; originally announced December 2020.

  47. arXiv:2009.06592  [pdf, other]

    cs.SE cs.AI cs.PL

    Analogy-Making as a Core Primitive in the Software Engineering Toolbox

    Authors: Matthew Sotoudeh, Aditya V. Thakur

    Abstract: An analogy is an identification of structural similarities and correspondences between two objects. Computational models of analogy making have been studied extensively in the field of cognitive science to better understand high-level human cognition. For instance, Melanie Mitchell and Douglas Hofstadter sought to better understand high-level perception by developing the Copycat algorithm for comp…

    Submitted 14 September, 2020; originally announced September 2020.

    Comments: Conference paper at SPLASH 'Onward!' 2020. Code is available at https://github.com/95616ARG/sifter

  48. arXiv:2009.05865  [pdf, other]

    cs.PL

    Memory-Efficient Fixpoint Computation

    Authors: Sung Kook Kim, Arnaud J. Venet, Aditya V. Thakur

    Abstract: Practical adoption of static analysis often requires trading precision for performance. This paper focuses on improving the memory efficiency of abstract interpretation without sacrificing precision or time efficiency. Computationally, abstract interpretation reduces the problem of inferring program invariants to computing a fixpoint of a set of equations. This paper presents a method to minimize…

    Submitted 12 September, 2020; originally announced September 2020.

    Comments: Extended version of conference paper at the 27th Static Analysis Symposium (SAS 2020). Code is available at https://github.com/95616ARG/mikos_sas2020

  49. arXiv:2009.05660  [pdf, ps, other]

    cs.LG cs.PL stat.ML

    Abstract Neural Networks

    Authors: Matthew Sotoudeh, Aditya V. Thakur

    Abstract: Deep Neural Networks (DNNs) are rapidly being applied to safety-critical domains such as drone and airplane control, motivating techniques for verifying the safety of their behavior. Unfortunately, DNN verification is NP-hard, with current algorithms slowing exponentially with the number of nodes in the DNN. This paper introduces the notion of Abstract Neural Networks (ANNs), which can be used to…

    Submitted 11 September, 2020; originally announced September 2020.

    Comments: Extended version of conference paper at the 27th Static Analysis Symposium (SAS 2020). Code is available at https://github.com/95616ARG/abstract_neural_networks

  50. arXiv:2005.06400  [pdf]

    cs.OH

    White Paper on Business of 6G

    Authors: Seppo Yrjola, Petri Ahokangas, Marja Matinmikko-Blue, Risto Jurva, Vivek Kant, Pasi Karppinen, Marianne Kinnula, Harilaos Koumaras, Mika Rantakokko, Volker Ziegler, Abhishek Thakur, Hans-Jurgen Zepernick

    Abstract: Developing products, services and vertical applications for the future digitized society in the 6G era requires a multidisciplinary approach and a re-definition of how we create, deliver and consume network resources, data and services for both communications and sensing purposes. This development will change and disrupt the traditional business models and ecosystem roles of digital service provid…

    Submitted 16 July, 2020; v1 submitted 30 April, 2020; originally announced May 2020.

    Comments: This draft white paper has been written by an international expert group, led by the Finnish 6G Flagship program (6gflagship.com) at the University of Oulu, within a series of twelve 6G white papers to be published in their final format in June 2020