Showing 1–50 of 137 results for author: Sharma, D

Searching in archive cs.
  1. arXiv:2407.03172  [pdf, other]

    cs.CV cs.AI stat.AP

    IMC 2024 Methods & Solutions Review

    Authors: Shyam Gupta, Dhanisha Sharma, Songling Huang

    Abstract: For the past three years, Kaggle has been hosting the Image Matching Challenge, which focuses on solving a 3D image reconstruction problem using a collection of 2D images. Each year, this competition fosters the development of innovative and effective methodologies by its participants. In this paper, we introduce an advanced ensemble technique that we developed, achieving a score of 0.153449 on th…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 8 Pages, 9 figures

  2. arXiv:2407.02598  [pdf, other]

    cs.CV cs.AI

    AutoSplat: Constrained Gaussian Splatting for Autonomous Driving Scene Reconstruction

    Authors: Mustafa Khan, Hamidreza Fazlali, Dhruv Sharma, Tongtong Cao, Dongfeng Bai, Yuan Ren, Bingbing Liu

    Abstract: Realistic scene reconstruction and view synthesis are essential for advancing autonomous driving systems by simulating safety-critical scenarios. 3D Gaussian Splatting excels in real-time rendering and static scene reconstructions but struggles with modeling driving scenarios due to complex backgrounds, dynamic objects, and sparse views. We propose AutoSplat, a framework employing Gaussian splatti…

    Submitted 3 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

  3. arXiv:2407.00774  [pdf, other]

    quant-ph cs.LG

    Advantages of quantum support vector machine in cross-domain classification of quantum states

    Authors: Diksha Sharma, Vivek Balasaheb Sabale, Parvinder Singh, Atul Kumar

    Abstract: In this study, we use cross-domain classification using quantum machine learning for quantum advantages to address the entanglement versus separability paradigm. We further demonstrate the efficient classification of Bell diagonal states into zero and non-zero discord classes. The inherited structure of quantum states and its relation with a particular class of quantum states are exploited to intu…

    Submitted 30 June, 2024; originally announced July 2024.

  4. arXiv:2406.16625  [pdf, other]

    cs.RO

    GATSBI: An Online GTSP-Based Algorithm for Targeted Surface Bridge Inspection and Defect Detection

    Authors: Harnaik Dhami, Charith Reddy, Vishnu Dutt Sharma, Troi Williams, Pratap Tokekar

    Abstract: We study the problem of visual surface inspection of infrastructure for defects using an Unmanned Aerial Vehicle (UAV). We do not assume that the geometric model of the infrastructure is known beforehand. Our planner, termed GATSBI, plans a path in a receding horizon fashion to inspect all points on the surface of the infrastructure. The input to GATSBI consists of a 3D occupancy map created onlin…

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: 10 pages, 12 figures, 2 tables. Submitted to IEEE TAES. arXiv admin note: text overlap with arXiv:2012.04803

  5. arXiv:2406.15958  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    Bone Fracture Classification using Transfer Learning

    Authors: Shyam Gupta, Dhanisha Sharma

    Abstract: The manual examination of X-ray images for fractures is a time-consuming process that is prone to human error. In this work, we introduce a robust yet simple training loop for the classification of fractures, which significantly outperforms existing methods. Our method achieves superior performance in less than ten epochs and utilizes the latest dataset to deliver the best-performing model for thi…

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: Code is publicly available at https://github.com/shyamgupta196/Bone-Fracture-Classification

  6. arXiv:2406.05199  [pdf, other]

    eess.AS cs.SD

    XANE: eXplainable Acoustic Neural Embeddings

    Authors: Sri Harsha Dumpala, Dushyant Sharma, Chandramouli Shama Sastri, Stanislav Kruchinin, James Fosburgh, Patrick A. Naylor

    Abstract: We present a novel method for extracting neural embeddings that model the background acoustics of a speech signal. The extracted embeddings are used to estimate specific parameters related to the background acoustic properties of the signal in a non-intrusive manner, which allows the embeddings to be explainable in terms of those parameters. We illustrate the value of these embeddings by performin…

    Submitted 7 June, 2024; originally announced June 2024.

  7. arXiv:2405.15911  [pdf, other]

    cs.LG

    Learning accurate and interpretable decision trees

    Authors: Maria-Florina Balcan, Dravyansh Sharma

    Abstract: Decision trees are a popular tool in machine learning and yield easy-to-understand models. Several techniques have been proposed in the literature for learning a decision tree classifier, with different techniques working well for data from different domains. In this work, we develop approaches to design decision tree learning algorithms given repeated access to data from the same domain. We propo…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 26 pages, UAI 2024

  8. arXiv:2405.05469  [pdf, other]

    cs.CR

    PLLM-CS: Pre-trained Large Language Model (LLM) for Cyber Threat Detection in Satellite Networks

    Authors: Mohammed Hassanin, Marwa Keshk, Sara Salim, Majid Alsubaie, Dharmendra Sharma

    Abstract: Satellite networks are vital in facilitating communication services for various critical infrastructures. These networks can seamlessly integrate with a diverse array of systems. However, some of these systems are vulnerable due to the absence of effective intrusion detection systems, which can be attributed to limited research and the high costs associated with deploying, fine-tuning, monitoring,…

    Submitted 8 May, 2024; originally announced May 2024.

  9. arXiv:2405.04829  [pdf, other]

    cs.CL

    Fine-tuning Pre-trained Named Entity Recognition Models For Indian Languages

    Authors: Sankalp Bahad, Pruthwik Mishra, Karunesh Arora, Rakesh Chandra Balabantaray, Dipti Misra Sharma, Parameswari Krishnamurthy

    Abstract: Named Entity Recognition (NER) is a useful component in Natural Language Processing (NLP) applications. It is used in various tasks such as Machine Translation, Summarization, Information Retrieval, and Question-Answering systems. The research on NER is centered around English and some other major languages, whereas limited attention has been given to Indian languages. We analyze the challenges an…

    Submitted 10 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: 8 pages, accepted in NAACL-SRW, 2024

  10. arXiv:2405.01394  [pdf, other]

    cs.AI

    Analysis of a Modular Autonomous Driving Architecture: The Top Submission to CARLA Leaderboard 2.0 Challenge

    Authors: Weize Zhang, Mohammed Elmahgiubi, Kasra Rezaee, Behzad Khamidehi, Hamidreza Mirkhani, Fazel Arasteh, Chunlin Li, Muhammad Ahsan Kaleem, Eduardo R. Corral-Soto, Dhruv Sharma, Tongtong Cao

    Abstract: In this paper we present the architecture of the Kyber-E2E submission to the map track of CARLA Leaderboard 2.0 Autonomous Driving (AD) challenge 2023, which achieved first place. We employed a modular architecture for our solution consisting of five main components: sensing, localization, perception, tracking/prediction, and planning/control. Our solution leverages state-of-the-art language-assiste…

    Submitted 21 March, 2024; originally announced May 2024.

  11. arXiv:2404.02512  [pdf, other]

    cs.CL

    Towards Large Language Model driven Reference-less Translation Evaluation for English and Indian Languages

    Authors: Vandan Mujadia, Pruthwik Mishra, Arafat Ahsan, Dipti Misra Sharma

    Abstract: With the primary focus on evaluating the effectiveness of large language models for automatic reference-less translation assessment, this work presents our experiments on mimicking human direct assessment to evaluate the quality of translations in English and Indian languages. We constructed a translation evaluation task where we performed zero-shot learning, in-context example-driven learning, an…

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: text overlap with arXiv:2311.09216

  12. arXiv:2403.13577  [pdf, other]

    cs.AR

    HCiM: ADC-Less Hybrid Analog-Digital Compute in Memory Accelerator for Deep Learning Workloads

    Authors: Shubham Negi, Utkarsh Saxena, Deepika Sharma, Kaushik Roy

    Abstract: Analog Compute-in-Memory (CiM) accelerators are increasingly recognized for their efficiency in accelerating Deep Neural Networks (DNN). However, their dependence on Analog-to-Digital Converters (ADCs) for accumulating partial sums from crossbars leads to substantial power and area overhead. Moreover, the high area overhead of ADCs constrains the throughput due to the limited number of ADCs that c…

    Submitted 20 March, 2024; originally announced March 2024.

  13. arXiv:2403.12876  [pdf, other]

    cs.RO cs.HC

    LAVA: Long-horizon Visual Action based Food Acquisition

    Authors: Amisha Bhaskar, Rui Liu, Vishnu D. Sharma, Guangyao Shi, Pratap Tokekar

    Abstract: Robotic Assisted Feeding (RAF) addresses the fundamental need for individuals with mobility impairments to regain autonomy in feeding themselves. The goal of RAF is to use a robot arm to acquire and transfer food to individuals from the table. Existing RAF methods primarily focus on solid foods, leaving a gap in manipulation strategies for semi-solid and deformable foods. This study introduces Lon…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: 8 pages, 8 figures

  14. arXiv:2402.09553  [pdf, other]

    cs.AI cs.LG stat.ML

    Statistical and Machine Learning Models for Predicting Fire and Other Emergency Events

    Authors: Dilli Prasad Sharma, Nasim Beigi-Mohammadi, Hongxiang Geng, Dawn Dixon, Rob Madro, Phil Emmenegger, Carlos Tobar, Jeff Li, Alberto Leon-Garcia

    Abstract: Emergency events in a city cause considerable economic loss to individuals, their families, and the community. Accurate and timely prediction of events can help the emergency fire and rescue services in preparing for and mitigating the consequences of emergency events. In this paper, we present a systematic development of predictive models for various types of emergency events in the City of Edmon…

    Submitted 14 February, 2024; originally announced February 2024.

    Journal ref: IEEE Access 12 (2024) 56880-56909

  15. arXiv:2312.14542  [pdf, other]

    cs.CL

    Automatic Data Retrieval for Cross Lingual Summarization

    Authors: Nikhilesh Bhatnagar, Ashok Urlana, Vandan Mujadia, Pruthwik Mishra, Dipti Misra Sharma

    Abstract: Cross-lingual summarization involves the summarization of text written in one language to a different one. There is a body of research addressing cross-lingual summarization from English to other European languages. In this work, we aim to perform cross-lingual summarization from English to Hindi. We propose that pairing up the coverage of newsworthy events in textual and video format can prove to be h…

    Submitted 22 December, 2023; originally announced December 2023.

    Comments: 6 pages, 6 tables, 2 figures, conference: ICON 2023

  16. arXiv:2312.11805  [pdf, other]

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee, et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr…

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  17. arXiv:2312.11395  [pdf, other]

    cs.CL cs.AI

    Verb Categorisation for Hindi Word Problem Solving

    Authors: Harshita Sharma, Pruthwik Mishra, Dipti Misra Sharma

    Abstract: Word problem solving is a challenging NLP task that deals with solving mathematical problems described in natural language. Recently, there has been renewed interest in developing word problem solvers for Indian languages. As part of this paper, we have built a Hindi arithmetic word problem solver which makes use of verbs. Additionally, we have created verb categorization data for Hindi. Verbs are…

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: 16 pages, 17 figures, ICON 2023 Conference

    ACM Class: I.2.7

  18. arXiv:2312.03483  [pdf, other]

    cs.CL cs.LG

    Exploring Answer Information Methods for Question Generation with Transformers

    Authors: Talha Chafekar, Aafiya Hussain, Grishma Sharma, Deepak Sharma

    Abstract: There has been a lot of work in question generation where different methods to provide target answers as input have been employed. This experimentation has been mostly carried out for RNN-based models. We use three different methods and their combinations for incorporating answer information and explore their effect on several automatic evaluation metrics. The methods that are used are answer pro…

    Submitted 6 December, 2023; originally announced December 2023.

  19. arXiv:2311.09216  [pdf, other]

    cs.CL cs.AI

    Assessing Translation capabilities of Large Language Models involving English and Indian Languages

    Authors: Vandan Mujadia, Ashok Urlana, Yash Bhaskar, Penumalla Aditya Pavani, Kukkapalli Shravya, Parameswari Krishnamurthy, Dipti Misra Sharma

    Abstract: Generative Large Language Models (LLMs) have achieved remarkable advancements in various NLP tasks. In this work, our aim is to explore the multilingual capabilities of large language models by using machine translation as a task involving English and 22 Indian languages. We first investigate the translation capabilities of raw large language models, followed by exploring the in-context learning c…

    Submitted 15 November, 2023; originally announced November 2023.

  20. arXiv:2311.00855  [pdf, other]

    cs.AI cs.LG cs.MA

    A Multi-Agent Reinforcement Learning Framework for Evaluating the U.S. Ending the HIV Epidemic Plan

    Authors: Dinesh Sharma, Ankit Shah, Chaitra Gopalappa

    Abstract: Human immunodeficiency virus (HIV) is a major public health concern in the United States, with about 1.2 million people living with HIV and 35,000 newly infected each year. There are considerable geographical disparities in HIV burden and care access across the U.S. The 2019 Ending the HIV Epidemic (EHE) initiative aims to reduce new infections by 90% by 2030, by improving coverage of diagnoses, t…

    Submitted 6 November, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: Added acknowledgement

  21. arXiv:2310.18494  [pdf, other]

    eess.IV cs.CV

    Knowledge-based in silico models and dataset for the comparative evaluation of mammography AI for a range of breast characteristics, lesion conspicuities and doses

    Authors: Elena Sizikova, Niloufar Saharkhiz, Diksha Sharma, Miguel Lago, Berkman Sahiner, Jana G. Delfino, Aldo Badano

    Abstract: To generate evidence regarding the safety and efficacy of artificial intelligence (AI) enabled medical devices, AI models need to be evaluated on a diverse population of patient cases, some of which may not be readily available. We propose an evaluation approach for testing medical imaging AI models that relies on in silico imaging pipelines in which stochastic digital models of human anatomy (in…

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023 Datasets and Benchmarks Track

  22. arXiv:2310.07021  [pdf, other]

    cs.RO cs.CV

    Pre-Trained Masked Image Model for Mobile Robot Navigation

    Authors: Vishnu Dutt Sharma, Anukriti Singh, Pratap Tokekar

    Abstract: 2D top-down maps are commonly used for the navigation and exploration of mobile robots through unknown areas. Typically, the robot builds the navigation maps incrementally from local observations using onboard sensors. Recent works have shown that predicting the structural patterns in the environment through learning-based approaches can greatly enhance task efficiency. While many such works build…

    Submitted 25 March, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Accepted at ICRA 2024

  23. arXiv:2310.04925  [pdf, other]

    cs.LG

    Crystal-GFN: sampling crystals with desirable properties and constraints

    Authors: Mila AI4Science, Alex Hernandez-Garcia, Alexandre Duval, Alexandra Volokhova, Yoshua Bengio, Divya Sharma, Pierre Luc Carrier, Yasmine Benabed, Michał Koziarski, Victor Schmidt

    Abstract: Accelerating material discovery holds the potential to greatly help mitigate the climate crisis. Discovering new solid-state materials such as electrocatalysts, super-ionic conductors or photovoltaic materials can have a crucial impact, for instance, in improving the efficiency of renewable energy production and storage. In this paper, we introduce Crystal-GFN, a generative model of crystal struct…

    Submitted 13 December, 2023; v1 submitted 7 October, 2023; originally announced October 2023.

    Comments: Main paper (10 pages) + references + appendix

  24. arXiv:2308.06882  [pdf, other]

    q-fin.ST cs.LG q-fin.CP stat.AP

    Quantifying Outlierness of Funds from their Categories using Supervised Similarity

    Authors: Dhruv Desai, Ashmita Dhiman, Tushar Sharma, Deepika Sharma, Dhagash Mehta, Stefano Pasquali

    Abstract: Mutual fund categorization has become a standard tool for the investment management industry and is extensively used by allocators for portfolio construction and manager selection, as well as by fund managers for peer analysis and competitive positioning. As a result, an (unintended) miscategorization or lack of precision can significantly impact allocation decisions and investment fund managers. H…

    Submitted 13 August, 2023; originally announced August 2023.

    Comments: 8 pages, 5 tables, 8 figures

  25. arXiv:2307.04004  [pdf, other]

    cs.RO cs.MA

    MAP-NBV: Multi-agent Prediction-guided Next-Best-View Planning for Active 3D Object Reconstruction

    Authors: Harnaik Dhami, Vishnu D. Sharma, Pratap Tokekar

    Abstract: Next-Best View (NBV) planning is a long-standing problem of determining where a robot that is viewing an object should obtain its next best view of that object from. There are a number of methods for choosing NBV based on the observed part of the object. In this paper, we investigate how predicting the unobserved part helps with the efficiency of reconstructing the object. We present Multi-Agent P…

    Submitted 24 June, 2024; v1 submitted 8 July, 2023; originally announced July 2023.

    Comments: 8 pages, 7 figures, 1 table. Submitted to IROS 2024

  26. arXiv:2307.02144  [pdf, other]

    cs.IT math.CO

    Kolam Simulation using Angles at Lattice Points

    Authors: Tulasi Bharathi, Shailaja D Sharma, Nithin Nagaraj

    Abstract: Kolam is a ritual art form practised by people in South India and consists of rule-bound geometric patterns of dots and lines. Single loop Kolams are mathematical closed loop patterns drawn over a grid of dots and conforming to certain heuristics. In this work, we propose a novel encoding scheme where we map the angular movements of Kolam at lattice points into sequences containing $4$ distinct sy…

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: 19 pages, 31 figures

  27. arXiv:2306.11227  [pdf]

    cs.AR cs.OS

    An Introduction to the Compute Express Link (CXL) Interconnect

    Authors: Debendra Das Sharma, Robert Blankenship, Daniel S. Berger

    Abstract: The Compute Express Link (CXL) is an open industry-standard interconnect between processors and devices such as accelerators, memory buffers, smart network interfaces, persistent memory, and solid-state drives. CXL offers coherency and memory semantics with bandwidth that scales with PCIe bandwidth while achieving significantly lower latency than PCIe. All major CPU vendors, device vendors, and da…

    Submitted 7 May, 2024; v1 submitted 19 June, 2023; originally announced June 2023.

  28. arXiv:2306.07098  [pdf, other]

    cs.LG cs.AI

    Efficiently Learning the Graph for Semi-supervised Learning

    Authors: Dravyansh Sharma, Maxwell Jones

    Abstract: Computational efficiency is a major bottleneck in using classic graph-based approaches for semi-supervised learning on datasets with a large number of unlabeled examples. Known techniques to improve efficiency typically involve an approximation of the graph regularization objective, but suffer from two major drawbacks: first, the graph is assumed to be known or constructed with heuristic hyperparameter…

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: 29 pages, 9 figures

  29. arXiv:2306.06375  [pdf, ps, other]

    cs.LG eess.SP math.OC

    Optimized Gradient Tracking for Decentralized Online Learning

    Authors: Shivangi Dubey Sharma, Ketan Rajawat

    Abstract: This work considers the problem of decentralized online learning, where the goal is to track the optimum of the sum of time-varying functions, distributed across several nodes in a network. The local availability of the functions and their gradients necessitates coordination and consensus among the nodes. We put forth the Generalized Gradient Tracking (GGT) framework that unifies a number of exist…

    Submitted 13 February, 2024; v1 submitted 10 June, 2023; originally announced June 2023.

    Comments: 30 pages, 6 Figures

  30. arXiv:2306.02960  [pdf, other]

    cs.CV cs.LG

    Best of Both Worlds: Hybrid SNN-ANN Architecture for Event-based Optical Flow Estimation

    Authors: Shubham Negi, Deepika Sharma, Adarsh Kumar Kosta, Kaushik Roy

    Abstract: In the field of robotics, event-based cameras are emerging as a promising low-power alternative to traditional frame-based cameras for capturing high-speed motion and high dynamic range scenes. This is due to their sparse and asynchronous event outputs. Spiking Neural Networks (SNNs), with their asynchronous event-driven compute, show great potential for extracting the spatio-temporal features from…

    Submitted 19 March, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

  31. arXiv:2305.11355  [pdf, other]

    cs.CL

    MD3: The Multi-Dialect Dataset of Dialogues

    Authors: Jacob Eisenstein, Vinodkumar Prabhakaran, Clara Rivera, Dorottya Demszky, Devyani Sharma

    Abstract: We introduce a new dataset of conversational speech representing English from India, Nigeria, and the United States. The Multi-Dialect Dataset of Dialogues (MD3) strikes a new balance between open-ended conversational speech and task-oriented dialogue by prompting participants to perform a series of short information-sharing tasks. This facilitates quantitative cross-dialectal comparison, while av…

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: InterSpeech 2023

  32. arXiv:2305.05519   

    cs.RO

    ProxMaP: Proximal Occupancy Map Prediction for Efficient Indoor Robot Navigation

    Authors: Vishnu Dutt Sharma, Jingxi Chen, Pratap Tokekar

    Abstract: In a typical path planning pipeline for a ground robot, we build a map (e.g., an occupancy grid) of the environment as the robot moves around. While navigating indoors, a ground robot's knowledge about the environment may be limited due to occlusions. Therefore, the map will have many as-yet-unknown regions that may need to be avoided by a conservative planner. Instead, if a robot is able to corre…

    Submitted 9 May, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

    Comments: This is an incremental work over an existing arxiv submission of the author. It will be re-uploaded as a version of that work [arXiv:2203.04177]

  33. arXiv:2304.11465  [pdf, other]

    cs.RO

    Pred-NBV: Prediction-guided Next-Best-View for 3D Object Reconstruction

    Authors: Harnaik Dhami, Vishnu D. Sharma, Pratap Tokekar

    Abstract: Prediction-based active perception has shown the potential to improve the navigation efficiency and safety of the robot by anticipating the uncertainty in the unknown environment. The existing works for 3D shape prediction make an implicit assumption about the partial observations and therefore cannot be used for real-world planning and do not consider the control effort for next-best-view plannin…

    Submitted 7 August, 2023; v1 submitted 22 April, 2023; originally announced April 2023.

    Comments: 6 pages, 4 figures, 2 tables. Accepted to IROS 2023

  34. arXiv:2304.11238  [pdf, ps, other]

    eess.IV cs.CV cs.LG

    Adapting model-based deep learning to multiple acquisition conditions: Ada-MoDL

    Authors: Aniket Pramanik, Sampada Bhave, Saurav Sajib, Samir D. Sharma, Mathews Jacob

    Abstract: Purpose: The aim of this work is to introduce a single model-based deep network that can provide high-quality reconstructions from undersampled parallel MRI data acquired with multiple sequences, acquisition settings and field strengths. Methods: A single unrolled architecture, which offers good reconstructions for multiple acquisition settings, is introduced. The proposed scheme adapts the mode…

    Submitted 21 April, 2023; originally announced April 2023.

  35. arXiv:2304.03370  [pdf, other]

    cs.LG cs.CR

    Reliable learning in challenging environments

    Authors: Maria-Florina Balcan, Steve Hanneke, Rattana Pukdee, Dravyansh Sharma

    Abstract: The problem of designing learners that provide guarantees that their predictions are provably correct is of increasing importance in machine learning. However, learning theoretic guarantees have only been considered in very specific settings. In this work, we consider the design and analysis of reliable learners in challenging test-time environments as encountered in modern machine learning proble…

    Submitted 29 October, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

    Journal ref: NeurIPS 2023

  36. Detection of Homophobia & Transphobia in Dravidian Languages: Exploring Deep Learning Methods

    Authors: Deepawali Sharma, Vedika Gupta, Vivek Kumar Singh

    Abstract: The increase in abusive content on online social media platforms is impacting the social life of online users. Use of offensive and hate speech has been making social media toxic. Homophobia and transphobia constitute offensive comments against the LGBT+ community. It becomes imperative to detect and handle these comments, to timely flag or issue a warning to users indulging in such behaviour. Howeve…

    Submitted 3 April, 2023; originally announced April 2023.

    Journal ref: Advanced Network Technologies and Intelligent Computing. ANTIC 2022. Communications in Computer and Information Science, vol 1798. Springer, Cham

  37. arXiv:2212.11562  [pdf]

    cs.CR

    Role of Cybersecurity and Blockchain in Battlefield of Things

    Authors: Gaurav Sharma, Deepak Kumar Sharma, Adarsh Kumar

    Abstract: The Internet of Things is an essential component in the growth of an ecosystem that enables quick and precise judgments to be made for communication on the battleground. The usage of the battlefield of things (BoT) is, however, subject to several restrictions for a variety of reasons. There is a potential for instances of replay, data manipulation, breaches of privacy, and other similar occurrence…

    Submitted 22 December, 2022; originally announced December 2022.

  38. arXiv:2212.06218  [pdf, other]

    cs.CV

    Comparison Of Deep Object Detectors On A New Vulnerable Pedestrian Dataset

    Authors: Devansh Sharma, Tihitina Hade, Qing Tian

    Abstract: Pedestrian safety is one primary concern in autonomous driving. The under-representation of vulnerable groups in today's pedestrian datasets points to an urgent need for a dataset of vulnerable road users. In order to help train comprehensive models and subsequently drive research to improve the accuracy of vulnerable pedestrian identification, we first introduce a new dataset for vulnerable pedes…

    Submitted 12 February, 2024; v1 submitted 12 December, 2022; originally announced December 2022.

    Comments: 7 pages, 4 Figures

  39. arXiv:2211.12072  [pdf, other]

    cs.AR cs.NI eess.SP

    Design and Performance Analysis of Hardware Realization of 3GPP Physical Layer for 5G Cell Search

    Authors: Khalid Lodhi, Jayant Chhillar, Sumit J. Darak, Divisha Sharma

    Abstract: 5G Cell Search (CS) is the first step for user equipment (UE) to initiate communication with the 5G node B (gNB) every time it is powered ON. In cellular networks, CS is accomplished via synchronization signals (SS) broadcast by the gNB. 5G 3rd generation partnership project (3GPP) specifications offer a detailed discussion on the SS generation at gNB but a limited understanding of their blind s…

    Submitted 22 November, 2022; originally announced November 2022.

  40. arXiv:2211.04987  [pdf, other]

    cs.LG cs.AI

    Interpretable Deep Reinforcement Learning for Green Security Games with Real-Time Information

    Authors: Vishnu Dutt Sharma, John P. Dickerson, Pratap Tokekar

    Abstract: Green Security Games with real-time information (GSG-I) add the real-time information about the agents' movement to the typical GSG formulation. Prior works on GSG-I have used deep reinforcement learning (DRL) to learn the best policy for the agent in such an environment without any need to store the huge number of state representations for GSG-I. However, the decision-making process of DRL method…

    Submitted 9 November, 2022; originally announced November 2022.

  41. arXiv:2211.01338  [pdf, other]

    eess.AS cs.CL cs.MM cs.SD eess.IV

    Technology Pipeline for Large Scale Cross-Lingual Dubbing of Lecture Videos into Multiple Indian Languages

    Authors: Anusha Prakash, Arun Kumar, Ashish Seth, Bhagyashree Mukherjee, Ishika Gupta, Jom Kuriakose, Jordan Fernandes, K V Vikram, Mano Ranjith Kumar M, Metilda Sagaya Mary, Mohammad Wajahat, Mohana N, Mudit Batra, Navina K, Nihal John George, Nithya Ravi, Pruthwik Mishra, Sudhanshu Srivastava, Vasista Sai Lodagala, Vandan Mujadia, Kada Sai Venkata Vineeth, Vrunda Sukhadia, Dipti Sharma, Hema Murthy, Pushpak Bhattacharya, et al. (2 additional authors not shown)

    Abstract: Cross-lingual dubbing of lecture videos requires the transcription of the original audio, correction and removal of disfluencies, domain term discovery, text-to-text translation into the target language, chunking of text using target language rhythm, text-to-speech synthesis followed by isochronous lipsyncing to the original video. This task becomes challenging when the source and target languages…

    Submitted 1 November, 2022; originally announced November 2022.

  42. arXiv:2210.12215

    cs.CL

    Gui at MixMT 2022 : English-Hinglish: An MT approach for translation of code mixed data

    Authors: Akshat Gahoi, Jayant Duneja, Anshul Padhi, Shivam Mangale, Saransh Rajput, Tanvi Kamble, Dipti Misra Sharma, Vasudeva Varma

    Abstract: Code-mixed machine translation has become an important task in multilingual communities and extending the task of machine translation to code mixed data has become a common task for these languages. In the shared tasks of WMT 2022, we try to tackle the same for both English + Hindi to Hinglish and Hinglish to English. The first task dealt with both Roman and Devanagari script as we had monolingual…

    Submitted 21 October, 2022; originally announced October 2022.

  43. arXiv:2210.00275

    cs.CV

    Offline Handwritten Amharic Character Recognition Using Few-shot Learning

    Authors: Mesay Samuel, Lars Schmidt-Thieme, DP Sharma, Abiot Sinamo, Abey Bruck

    Abstract: Few-shot learning is an important, but challenging problem of machine learning aimed at learning from only fewer labeled training examples. It has become an active area of research due to deep learning requiring huge amounts of labeled dataset, which is not feasible in the real world. Learning from a few examples is also an important attempt towards learning like humans. Few-shot learning has prov…

    Submitted 1 October, 2022; originally announced October 2022.

    Comments: PanAfriCon AI 2022 virtual conference paper

  44. arXiv:2209.09292

    cs.RO

    D2CoPlan: A Differentiable Decentralized Planner for Multi-Robot Coverage

    Authors: Vishnu Dutt Sharma, Lifeng Zhou, Pratap Tokekar

    Abstract: Centralized approaches for multi-robot coverage planning problems suffer from the lack of scalability. Learning-based distributed algorithms provide a scalable avenue in addition to bringing data-oriented feature generation capabilities to the table, allowing integration with other learning-based approaches. To this end, we present a learning-based, differentiable distributed coverage planner (D2C…

    Submitted 19 September, 2022; originally announced September 2022.

  45. arXiv:2209.09118

    cs.CV cs.CL cs.IR cs.LG

    OCR for TIFF Compressed Document Images Directly in Compressed Domain Using Text segmentation and Hidden Markov Model

    Authors: Dikshit Sharma, Mohammed Javed

    Abstract: In today's technological era, document images play an important and integral part in our day to day life, and specifically with the surge of Covid-19, digitally scanned documents have become key source of communication, thus avoiding any sort of infection through physical contact. Storage and transmission of scanned document images is a very memory intensive task, hence compression techniques are…

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: The paper has 14 figures and 1 table

  46. The role of entanglement for enhancing the efficiency of quantum kernels towards classification

    Authors: Diksha Sharma, Parvinder Singh, Atul Kumar

    Abstract: Quantum kernels are considered as potential resources to illustrate benefits of quantum computing in machine learning. Considering the impact of hyperparameters on the performance of a classical machine learning model, it is imperative to identify promising hyperparameters using quantum kernel methods in order to achieve quantum advantages. In this work, we analyse and classify sentiments of textu…

    Submitted 1 April, 2023; v1 submitted 12 September, 2022; originally announced September 2022.

  47. arXiv:2208.00285

    cs.DC cs.PL

    Fence Synthesis under the C11 Memory Model

    Authors: Sanjana Singh, Divyanjali Sharma, Ishita Jaju, Subodh Sharma

    Abstract: The C/C++11 (C11) standard offers a spectrum of ordering guarantees on memory access operations. The combinations of such orderings pose a challenge in developing correct and efficient weak memory programs. A common solution to preclude those program outcomes that violate the correctness specification is using C11 synchronization-fences, which establish ordering on program events. The challenge is…

    Submitted 2 August, 2022; v1 submitted 30 July, 2022; originally announced August 2022.

  48. arXiv:2207.10199

    cs.LG stat.ML

    Provably tuning the ElasticNet across instances

    Authors: Maria-Florina Balcan, Mikhail Khodak, Dravyansh Sharma, Ameet Talwalkar

    Abstract: An important unresolved challenge in the theory of regularization is to set the regularization coefficients of popular techniques like the ElasticNet with general provable guarantees. We consider the problem of tuning the regularization parameters of Ridge regression, LASSO, and the ElasticNet across multiple problem instances, a setting that encompasses both cross-validation and multi-task hyperp…

    Submitted 15 January, 2024; v1 submitted 20 July, 2022; originally announced July 2022.

  49. arXiv:2205.09185

    physics.ins-det cs.LG hep-ex nucl-ex physics.comp-ph

    AI-assisted Optimization of the ECCE Tracking System at the Electron Ion Collider

    Authors: C. Fanelli, Z. Papandreou, K. Suresh, J. K. Adkins, Y. Akiba, A. Albataineh, M. Amaryan, I. C. Arsene, C. Ayerbe Gayoso, J. Bae, X. Bai, M. D. Baker, M. Bashkanov, R. Bellwied, F. Benmokhtar, V. Berdnikov, J. C. Bernauer, F. Bock, W. Boeglin, M. Borysova, E. Brash, P. Brindza, W. J. Briscoe, M. Brooks, S. Bueltmann, et al. (258 additional authors not shown)

    Abstract: The Electron-Ion Collider (EIC) is a cutting-edge accelerator facility that will study the nature of the "glue" that binds the building blocks of the visible matter in the universe. The proposed experiment will be realized at Brookhaven National Laboratory in approximately 10 years from now, with detector design and R&D currently ongoing. Notably, EIC is one of the first large-scale facilities to…

    Submitted 19 May, 2022; v1 submitted 18 May, 2022; originally announced May 2022.

    Comments: 16 pages, 18 figures, 2 appendices, 3 tables

  50. A Driver-Vehicle Model for ADS Scenario-based Testing

    Authors: Rodrigo Queiroz, Divit Sharma, Ricardo Caldas, Krzysztof Czarnecki, Sergio García, Thorsten Berger, Patrizio Pelliccione

    Abstract: Scenario-based testing for automated driving systems (ADS) must be able to simulate traffic scenarios that rely on interactions with other vehicles. Although many languages for high-level scenario modelling have been proposed, they lack the features to precisely and reliably control the required micro-simulation, while also supporting behavior reuse and test reproducibility for a wide range of int…

    Submitted 29 May, 2024; v1 submitted 5 May, 2022; originally announced May 2022.

    Comments: 15 pages, 15 figures