Showing 1–20 of 20 results for author: Sharma, U

Searching in archive cs.
  1. arXiv:2408.08560  [pdf, other]

    cs.CV cs.LG

    A training regime to learn unified representations from complementary breast imaging modalities

    Authors: Umang Sharma, Jungkyu Park, Laura Heacock, Sumit Chopra, Krzysztof Geras

    Abstract: Full Field Digital Mammograms (FFDMs) and Digital Breast Tomosynthesis (DBT) are the two most widely used imaging modalities for breast cancer screening. Although DBT has increased cancer detection compared to FFDM, its widespread adoption in clinical practice has been slowed by increased interpretation times and a perceived decrease in the conspicuity of specific lesion types. Specifically, the n…

    Submitted 16 August, 2024; originally announced August 2024.

  2. arXiv:2407.00121  [pdf, other]

    cs.LG cs.AI cs.CL

    Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks

    Authors: Ibrahim Abdelaziz, Kinjal Basu, Mayank Agarwal, Sadhana Kumaravel, Matthew Stallone, Rameswar Panda, Yara Rizk, GP Bhargav, Maxwell Crouse, Chulaka Gunasekara, Shajith Ikbal, Sachin Joshi, Hima Karanam, Vineet Kumar, Asim Munawar, Sumit Neelam, Dinesh Raghu, Udit Sharma, Adriana Meza Soria, Dheeraj Sreedhar, Praveen Venkateswaran, Merve Unuvar, David Cox, Salim Roukos, Luis Lastras, et al. (1 additional author not shown)

    Abstract: Large language models (LLMs) have recently shown tremendous promise in serving as the backbone to agentic systems, as demonstrated by their performance in multi-faceted, challenging benchmarks like SWE-Bench and Agent-Bench. However, to realize the true potential of LLMs as autonomous agents, they must learn to identify, call, and interact with external tools and application program interfaces (AP…

    Submitted 27 June, 2024; originally announced July 2024.

  3. arXiv:2406.08848  [pdf, other]

    cs.CL cs.AI

    An Approach to Build Zero-Shot Slot-Filling System for Industry-Grade Conversational Assistants

    Authors: G P Shrivatsa Bhargav, Sumit Neelam, Udit Sharma, Shajith Ikbal, Dheeraj Sreedhar, Hima Karanam, Sachindra Joshi, Pankaj Dhoolia, Dinesh Garg, Kyle Croutwater, Haode Qi, Eric Wayne, J William Murdock

    Abstract: We present an approach to build Large Language Model (LLM) based slot-filling system to perform Dialogue State Tracking in conversational assistants serving across a wide variety of industry-grade applications. Key requirements of this system include: 1) usage of smaller-sized models to meet low latency requirements and to enable convenient and cost-effective cloud and customer premise deployments…

    Submitted 13 June, 2024; originally announced June 2024.

  4. arXiv:2406.04318  [pdf, other]

    cs.LG cs.AI cs.CV

    Adaptive Sampling of k-Space in Magnetic Resonance for Rapid Pathology Prediction

    Authors: Chen-Yu Yen, Raghav Singhal, Umang Sharma, Rajesh Ranganath, Sumit Chopra, Lerrel Pinto

    Abstract: Magnetic Resonance (MR) imaging, despite its proven diagnostic utility, remains an inaccessible imaging modality for disease surveillance at the population level. A major factor rendering MR inaccessible is lengthy scan times. An MR scanner collects measurements associated with the underlying anatomy in the Fourier space, also known as the k-space. Creating a high-fidelity image requires collectin…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: ICML 2024. Project website at https://adaptive-sampling-mr.github.io

  5. arXiv:2405.06676  [pdf, other]

    cs.CL cs.AI cs.AR

    EDA Corpus: A Large Language Model Dataset for Enhanced Interaction with OpenROAD

    Authors: Bing-Yue Wu, Utsav Sharma, Sai Rahul Dhanvi Kankipati, Ajay Yadav, Bintu Kappil George, Sai Ritish Guntupalli, Austin Rovinski, Vidya A. Chhabria

    Abstract: Large language models (LLMs) serve as powerful tools for design, providing capabilities for both task automation and design assistance. Recent advancements have shown tremendous potential for facilitating LLM integration into the chip design process; however, many of these works rely on data that are not publicly available and/or not permissively licensed for use in LLM training and distribution.…

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: Under review at Workshop on LLM-Aided Design (LAD'24)

  6. arXiv:2401.01626   

    cs.LG cs.AI

    On the Expressive Power of Graph Neural Networks

    Authors: Ashwin Nalwade, Kelly Marshall, Axel Eladi, Umang Sharma

    Abstract: The study of Graph Neural Networks has received considerable interest in the past few years. By extending deep learning to graph-structured data, GNNs can solve a diverse set of tasks in fields including social science, chemistry, and medicine. The development of GNN architectures has largely been focused on improving empirical performance on tasks like node or graph classification. However, a lin…

    Submitted 8 March, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

    Comments: We felt that significantly more work was needed to improve the quality before it should be put out in its current state. No replacement is available at the moment or in the near future

  7. Grove: a Separation-Logic Library for Verifying Distributed Systems (Extended Version)

    Authors: Upamanyu Sharma, Ralf Jung, Joseph Tassarotti, M. Frans Kaashoek, Nickolai Zeldovich

    Abstract: Grove is a concurrent separation logic library for verifying distributed systems. Grove is the first to handle time-based leases, including their interaction with reconfiguration, crash recovery, thread-level concurrency, and unreliable networks. This paper uses Grove to verify several distributed system components written in Go, including GroveKV, a realistic distributed multi-threaded key-value…

    Submitted 14 September, 2023; v1 submitted 6 September, 2023; originally announced September 2023.

    Comments: Extended version of paper appearing at SOSP 2023

  8. arXiv:2209.05828  [pdf, other]

    cs.AI cs.DB

    Expressive Reasoning Graph Store: A Unified Framework for Managing RDF and Property Graph Databases

    Authors: Sumit Neelam, Udit Sharma, Sumit Bhatia, Hima Karanam, Ankita Likhyani, Ibrahim Abdelaziz, Achille Fokoue, L. V. Subramaniam

    Abstract: Resource Description Framework (RDF) and Property Graph (PG) are the two most commonly used data models for representing, storing, and querying graph data. We present Expressive Reasoning Graph Store (ERGS) -- a graph store built on top of JanusGraph (a Property Graph store) that also allows storing and querying of RDF datasets. First, we describe how RDF data can be translated into a Property Gra…

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 16 pages, 3 figures, 9 tables

  9. Multimodal spatiotemporal graph neural networks for improved prediction of 30-day all-cause hospital readmission

    Authors: Siyi Tang, Amara Tariq, Jared Dunnmon, Umesh Sharma, Praneetha Elugunti, Daniel Rubin, Bhavik N. Patel, Imon Banerjee

    Abstract: Measures to predict 30-day readmission are considered an important quality factor for hospitals as accurate predictions can reduce the overall cost of care by identifying high risk patients before they are discharged. While recent deep learning-based studies have shown promising empirical results on readmission prediction, several limitations exist that may hinder widespread clinical utility, such…

    Submitted 14 April, 2022; originally announced April 2022.

    Journal ref: IEEE Journal of Biomedical and Health Informatics, vol. 27, no. 4, pp. 2071-2082, April 2023

  10. arXiv:2203.11054  [pdf, other]

    cs.CL cs.AI

    Targeted Extraction of Temporal Facts from Textual Resources for Improved Temporal Question Answering over Knowledge Bases

    Authors: Nithish Kannen, Udit Sharma, Sumit Neelam, Dinesh Khandelwal, Shajith Ikbal, Hima Karanam, L Venkata Subramaniam

    Abstract: Knowledge Base Question Answering (KBQA) systems have the goal of answering complex natural language questions by reasoning over relevant facts retrieved from Knowledge Bases (KB). One of the major challenges faced by these systems is their inability to retrieve all relevant facts due to factors such as incomplete KB and entity/relation linking errors. In this paper, we address this particular cha…

    Submitted 21 March, 2022; originally announced March 2022.

    ACM Class: I.2.7; I.2.4

  11. arXiv:2201.05793  [pdf, other]

    cs.CL cs.AI

    A Benchmark for Generalizable and Interpretable Temporal Question Answering over Knowledge Bases

    Authors: Sumit Neelam, Udit Sharma, Hima Karanam, Shajith Ikbal, Pavan Kapanipathi, Ibrahim Abdelaziz, Nandana Mihindukulasooriya, Young-Suk Lee, Santosh Srivastava, Cezar Pendus, Saswati Dana, Dinesh Garg, Achille Fokoue, G P Shrivatsa Bhargav, Dinesh Khandelwal, Srinivas Ravishankar, Sairam Gurajada, Maria Chang, Rosario Uceda-Sosa, Salim Roukos, Alexander Gray, Guilherme Lima, Ryan Riegel, Francois Luus, L Venkata Subramaniam

    Abstract: Knowledge Base Question Answering (KBQA) tasks that involve complex reasoning are emerging as an important research direction. However, most existing KBQA datasets focus primarily on generic multi-hop reasoning over explicit facts, largely ignoring other reasoning types such as temporal, spatial, and taxonomic reasoning. In this paper, we present a benchmark dataset for temporal reasoning, TempQA-…

    Submitted 15 January, 2022; originally announced January 2022.

    Comments: 7 pages, 2 figures, 7 tables. arXiv admin note: substantial text overlap with arXiv:2109.13430

  12. arXiv:2109.13430  [pdf, other]

    cs.CL cs.AI

    SYGMA: System for Generalizable Modular Question Answering Over Knowledge Bases

    Authors: Sumit Neelam, Udit Sharma, Hima Karanam, Shajith Ikbal, Pavan Kapanipathi, Ibrahim Abdelaziz, Nandana Mihindukulasooriya, Young-Suk Lee, Santosh Srivastava, Cezar Pendus, Saswati Dana, Dinesh Garg, Achille Fokoue, G P Shrivatsa Bhargav, Dinesh Khandelwal, Srinivas Ravishankar, Sairam Gurajada, Maria Chang, Rosario Uceda-Sosa, Salim Roukos, Alexander Gray, Guilherme Lima, Ryan Riegel, Francois Luus, L Venkata Subramaniam

    Abstract: Knowledge Base Question Answering (KBQA) tasks that involve complex reasoning are emerging as an important research direction. However, most KBQA systems struggle with generalizability, particularly on two dimensions: (a) across multiple reasoning types where both datasets and systems have primarily focused on multi-hop reasoning, and (b) across multiple knowledge bases, where KBQA approaches are…

    Submitted 27 September, 2021; originally announced September 2021.

  13. arXiv:2108.09766  [pdf]

    cs.CY

    Role of Information and ICTs as Determinants of Farmer's Adaptive Capacity to Climate Risk: An Empirical Study From Haryana, India

    Authors: Priya Chetri, Upasna Sharma, P. Vigneswara Ilavarasan

    Abstract: Using the primary data collected for 463 farmers in six districts of Haryana, India, the present study attempts to understand the constituents of farmer's adaptive capacity at local level and how it can be enhanced. We use path analysis technique using the lavaan package in RStudio to empirically test the role of information. We find that information is a direct and significant contributor to enha…

    Submitted 22 August, 2021; originally announced August 2021.

    Comments: In proceedings of the 1st Virtual Conference on Implications of Information and Digital Technologies for Development, 2021

  14. arXiv:2104.08391  [pdf, other]

    cs.CV

    Learning To Count Everything

    Authors: Viresh Ranjan, Udbhav Sharma, Thu Nguyen, Minh Hoai

    Abstract: Existing works on visual counting primarily focus on one specific category at a time, such as people, animals, and cells. In this paper, we are interested in counting everything, that is to count objects from any category given only a few annotated instances from that category. To this end, we pose counting as a few-shot regression task. To tackle this task, we present a novel method that takes a…

    Submitted 16 April, 2021; originally announced April 2021.

    Comments: CVPR 2021

  15. arXiv:2102.06701  [pdf, other]

    cs.LG cond-mat.dis-nn stat.ML

    Explaining Neural Scaling Laws

    Authors: Yasaman Bahri, Ethan Dyer, Jared Kaplan, Jaehoon Lee, Utkarsh Sharma

    Abstract: The population loss of trained deep neural networks often follows precise power-law scaling relations with either the size of the training dataset or the number of parameters in the network. We propose a theory that explains the origins of and connects these scaling laws. We identify variance-limited and resolution-limited scaling behavior for both dataset and model size, for a total of four scali…

    Submitted 28 April, 2024; v1 submitted 12 February, 2021; originally announced February 2021.

    Comments: 11 pages, 3 figures + Supplement (expanded). This version to appear in PNAS

    Journal ref: PNAS 121 (27) e2311878121 (2024)

  16. arXiv:2012.01707  [pdf, other]

    cs.CL cs.AI

    Leveraging Abstract Meaning Representation for Knowledge Base Question Answering

    Authors: Pavan Kapanipathi, Ibrahim Abdelaziz, Srinivas Ravishankar, Salim Roukos, Alexander Gray, Ramon Astudillo, Maria Chang, Cristina Cornelio, Saswati Dana, Achille Fokoue, Dinesh Garg, Alfio Gliozzo, Sairam Gurajada, Hima Karanam, Naweed Khan, Dinesh Khandelwal, Young-Suk Lee, Yunyao Li, Francois Luus, Ndivhuwo Makondo, Nandana Mihindukulasooriya, Tahira Naseem, Sumit Neelam, Lucian Popa, Revanth Reddy, et al. (5 additional authors not shown)

    Abstract: Knowledge base question answering (KBQA) is an important task in Natural Language Processing. Existing approaches face significant challenges including complex question understanding, necessity for reasoning, and lack of large end-to-end training datasets. In this work, we propose Neuro-Symbolic Question Answering (NSQA), a modular KBQA system, that leverages (1) Abstract Meaning Representation (AM…

    Submitted 2 June, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: Accepted to Findings of ACL

  17. arXiv:2006.13155  [pdf, other]

    cs.AI cs.LG cs.LO

    Logical Neural Networks

    Authors: Ryan Riegel, Alexander Gray, Francois Luus, Naweed Khan, Ndivhuwo Makondo, Ismail Yunus Akhalwaya, Haifeng Qian, Ronald Fagin, Francisco Barahona, Udit Sharma, Shajith Ikbal, Hima Karanam, Sumit Neelam, Ankita Likhyani, Santosh Srivastava

    Abstract: We propose a novel framework seamlessly providing key properties of both neural nets (learning) and symbolic logic (knowledge and reasoning). Every neuron has a meaning as a component of a formula in a weighted real-valued logic, yielding a highly interpretable disentangled representation. Inference is omnidirectional rather than focused on predefined target variables, and corresponds to logical re…

    Submitted 23 June, 2020; originally announced June 2020.

    Comments: 10 pages (incl. references), 38 pages supplementary, 7 figures, 9 tables, 6 algorithms. In submission to NeurIPS 2020

  18. arXiv:2004.10802  [pdf, other]

    cs.LG stat.ML

    A Neural Scaling Law from the Dimension of the Data Manifold

    Authors: Utkarsh Sharma, Jared Kaplan

    Abstract: When data is plentiful, the loss achieved by well-trained neural networks scales as a power-law $L \propto N^{-\alpha}$ in the number of network parameters $N$. This empirical scaling law holds for a wide variety of data modalities, and may persist over many orders of magnitude. The scaling law can be explained if neural models are effectively just performing regression on a data manifold of intrinsic…

    Submitted 22 April, 2020; originally announced April 2020.

    Comments: 16+12 pages, 11+11 figures

  19. arXiv:1309.7312  [pdf]

    cs.CL

    Development and Transcription of Assamese Speech Corpus

    Authors: Himangshu Sarma, Navanath Saharia, Utpal Sharma, Smriti Kumar Sinha, Mancha Jyoti Malakar

    Abstract: A balanced speech corpus is the basic need for any speech processing task. In this report we describe our effort on development of Assamese speech corpus. We mainly focused on some issues and challenges faced during development of the corpus. Being a less computationally aware language, this is the first effort to develop speech corpus for Assamese. As corpus development is an ongoing process, in…

    Submitted 27 September, 2013; originally announced September 2013.

    Comments: 4 pages, National Conference

  20. arXiv:0912.1820  [pdf]

    cs.CL

    Parsing of part-of-speech tagged Assamese Texts

    Authors: Mirzanur Rahman, Sufal Das, Utpal Sharma

    Abstract: A natural language (or ordinary language) is a language that is spoken, written, or signed by humans for general-purpose communication, as distinguished from formal languages (such as computer-programming languages or the "languages" used in the study of formal logic). The computational activities required for enabling a computer to carry out information processing using natural language is call…

    Submitted 9 December, 2009; originally announced December 2009.

    Comments: International Journal of Computer Science Issues, IJCSI Volume 6, Issue 1, pp28-34, November 2009

    Journal ref: M. Rahman, S. Das and U. Sharma, "Parsing of part-of-speech tagged Assamese Texts", International Journal of Computer Science Issues, IJCSI, Volume 6, Issue 1, pp28-34, November 2009