Zum Hauptinhalt springen

Showing 1–44 of 44 results for author: Deshpande, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.13661  [pdf, ps, other

    cs.GT

    Optimal Strategies in Ranked Choice Voting

    Authors: Sanyukta Deshpande, Nikhil Garg, Sheldon Jacobson

    Abstract: Ranked Choice Voting (RCV) and Single Transferable Voting (STV) are widely valued; but are complex to understand due to intricate per-round vote transfers. Questions like determining how far a candidate is from winning or identifying effective election strategies are computationally challenging as minor changes in voter rankings can lead to significant ripple effects - for example, lending support… ▽ More

    Submitted 18 July, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

  2. arXiv:2407.10732  [pdf, other

    cs.CE

    Gaussian process regression + deep neural network autoencoder for probabilistic surrogate modeling in nonlinear mechanics of solids

    Authors: Saurabh Deshpande, Hussein Rappel, Mark Hobbs, Stéphane P. A. Bordas, Jakub Lengiewicz

    Abstract: Many real-world applications demand accurate and fast predictions, as well as reliable uncertainty estimates. However, quantifying uncertainty on high-dimensional predictions is still a severely under-invested problem, especially when input-output relationships are non-linear. To handle this problem, the present work introduces an innovative approach that combines autoencoder deep neural networks… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  3. arXiv:2406.14442  [pdf, other

    cs.LG cs.AI cs.CE q-bio.BM q-bio.MN

    Graph Representation Learning Strategies for Omics Data: A Case Study on Parkinson's Disease

    Authors: Elisa Gómez de Lope, Saurabh Deshpande, Ramón Viñas Torné, Pietro Liò, Enrico Glaab, Stéphane P. A. Bordas

    Abstract: Omics data analysis is crucial for studying complex diseases, but its high dimensionality and heterogeneity challenge classical statistical and machine learning methods. Graph neural networks have emerged as promising alternatives, yet the optimal strategies for their design and optimization in real-world biomedical challenges remain unclear. This study evaluates various graph representation learn… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: Submitted to Machine Learning in Computational Biology 2024 as an extended abstract, 2 pages + 1 appendix

  4. arXiv:2405.02664  [pdf, other

    cs.AI cs.IR

    MedPromptExtract (Medical Data Extraction Tool): Anonymization and Hi-fidelity Automated data extraction using NLP and prompt engineering

    Authors: Roomani Srivastava, Suraj Prasad, Lipika Bhat, Sarvesh Deshpande, Barnali Das, Kshitij Jadhav

    Abstract: A major roadblock in the seamless digitization of medical records remains the lack of interoperability of existing records. Extracting relevant medical information required for further treatment planning or even research is a time consuming labour intensive task involving expenditure of valuable time of doctors. In this demo paper we present, MedPromptExtract an automated tool using a combination… ▽ More

    Submitted 6 June, 2024; v1 submitted 4 May, 2024; originally announced May 2024.

    Comments: 4 pages, 3 figures, pre-print sumitted to CIKM 2024

  5. arXiv:2403.05749  [pdf, other

    eess.SY cs.DM

    Characterizing Flow Complexity in Transportation Networks using Graph Homology

    Authors: Shashank A Deshpande, Hamsa Balakrishnan

    Abstract: Series-parallel network topologies generally exhibit simplified dynamical behavior and avoid high combinatorial complexity. A comprehensive analysis of how flow complexity emerges with a graph's deviation from series-parallel topology is therefore of fundamental interest. We introduce the notion of a robust $k$-path on a directed acycylic graph, with increasing values of the length $k$ reflecting… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

    Comments: 7 pages, 3 figures, letter

  6. arXiv:2401.16914  [pdf, other

    cs.LG cond-mat.mtrl-sci

    Energy-conserving equivariant GNN for elasticity of lattice architected metamaterials

    Authors: Ivan Grega, Ilyes Batatia, Gábor Csányi, Sri Karlapati, Vikram S. Deshpande

    Abstract: Lattices are architected metamaterials whose properties strongly depend on their geometrical design. The analogy between lattices and graphs enables the use of graph neural networks (GNNs) as a faster surrogate model compared to traditional methods such as finite element modelling. In this work, we generate a big dataset of structure-property relationships for strut-based lattices. The dataset is… ▽ More

    Submitted 20 March, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: International Conference on Learning Representations 2024

  7. arXiv:2309.03812  [pdf, other

    cs.CV cs.AI cs.LG

    AnthroNet: Conditional Generation of Humans via Anthropometrics

    Authors: Francesco Picetti, Shrinath Deshpande, Jonathan Leban, Soroosh Shahtalebi, Jay Patel, Peifeng Jing, Chunpu Wang, Charles Metze III, Cameron Sun, Cera Laidlaw, James Warren, Kathy Huynh, River Page, Jonathan Hogins, Adam Crespi, Sujoy Ganguly, Salehe Erfanian Ebadi

    Abstract: We present a novel human body model formulated by an extensive set of anthropocentric measurements, which is capable of generating a wide range of human body shapes and poses. The proposed model enables direct modeling of specific human identities through a deep generative architecture, which can produce humans in any arbitrary pose. It is the first of its kind to have been trained end-to-end usin… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

    Comments: AnthroNet's Unity data generator source code is available at: https://unity-technologies.github.io/AnthroNet/

  8. arXiv:2308.07414  [pdf, other

    cs.GT

    Votemandering: Strategies and Fairness in Political Redistricting

    Authors: Sanyukta Deshpande, Ian G Ludden, Sheldon H Jacobson

    Abstract: Gerrymandering, the deliberate manipulation of electoral district boundaries for political advantage, is a persistent issue in U.S. redistricting cycles. This paper introduces and analyzes a new phenomenon, 'votemandering'- a strategic blend of gerrymandering and targeted political campaigning, devised to gain more seats by circumventing fairness measures. It leverages accurate demographic and soc… ▽ More

    Submitted 15 August, 2023; v1 submitted 14 August, 2023; originally announced August 2023.

  9. arXiv:2308.03897  [pdf, other

    cs.ET quant-ph

    Hardware Architecture for a Quantum Computer Trusted Execution Environment

    Authors: Theodoros Trochatos, Chuanqi Xu, Sanjay Deshpande, Yao Lu, Yongshan Ding, Jakub Szefer

    Abstract: The cloud-based environments in which today's and future quantum computers will operate, raise concerns about the security and privacy of user's intellectual property. Quantum circuits submitted to cloud-based quantum computer providers represent sensitive or proprietary algorithms developed by users that need protection. Further, input data is hard-coded into the circuits, and leakage of the circ… ▽ More

    Submitted 7 August, 2023; originally announced August 2023.

  10. arXiv:2306.00382  [pdf, other

    stat.ME cs.LG

    Calibrated and Conformal Propensity Scores for Causal Effect Estimation

    Authors: Shachi Deshpande, Volodymyr Kuleshov

    Abstract: Propensity scores are commonly used to estimate treatment effects from observational data. We argue that the probabilistic output of a learned propensity score model should be calibrated -- i.e., a predictive treatment probability of 90% should correspond to 90% of individuals being assigned the treatment group -- and we propose simple recalibration techniques to ensure this property. We prove tha… ▽ More

    Submitted 4 June, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: 23 pages, 3 figures

    ACM Class: I.2.m

  11. arXiv:2305.05006  [pdf, other

    eess.IV cs.CV

    Synthesis of Annotated Colorectal Cancer Tissue Images from Gland Layout

    Authors: Srijay Deshpande, Fayyaz Minhas, Nasir Rajpoot

    Abstract: Generating realistic tissue images with annotations is a challenging task that is important in many computational histopathology applications. Synthetically generated images and annotations are valuable for training and evaluating algorithms in this domain. To address this, we propose an interactive framework generating pairs of realistic colorectal cancer histology images with corresponding gland… ▽ More

    Submitted 4 April, 2024; v1 submitted 8 May, 2023; originally announced May 2023.

  12. arXiv:2304.06122  [pdf, other

    cs.CY cs.AI

    Analyzing ChatGPT's Aptitude in an Introductory Computer Engineering Course

    Authors: Sanjay Deshpande, Jakub Szefer

    Abstract: ChatGPT has recently gathered attention from the general public and academia as a tool that is able to generate plausible and human-sounding text answers to various questions. One potential use, or abuse, of ChatGPT is in answering various questions or even generating whole essays and research papers in an academic or classroom setting. While recent works have explored the use of ChatGPT in the co… ▽ More

    Submitted 14 April, 2023; v1 submitted 13 March, 2023; originally announced April 2023.

    Comments: 5 pages

  13. arXiv:2303.06274  [pdf

    cs.CV cs.LG

    CoNIC Challenge: Pushing the Frontiers of Nuclear Detection, Segmentation, Classification and Counting

    Authors: Simon Graham, Quoc Dang Vu, Mostafa Jahanifar, Martin Weigert, Uwe Schmidt, Wenhua Zhang, Jun Zhang, Sen Yang, Jinxi Xiang, Xiyue Wang, Josef Lorenz Rumberger, Elias Baumann, Peter Hirsch, Lihao Liu, Chenyang Hong, Angelica I. Aviles-Rivero, Ayushi Jain, Heeyoung Ahn, Yiyu Hong, Hussam Azzuni, Min Xu, Mohammad Yaqub, Marie-Claire Blache, Benoît Piégu, Bertrand Vernay , et al. (64 additional authors not shown)

    Abstract: Nuclear detection, segmentation and morphometric profiling are essential in helping us further understand the relationship between histology and patient outcome. To drive innovation in this area, we setup a community-wide challenge using the largest available dataset of its kind to assess nuclear segmentation and cellular composition. Our challenge, named CoNIC, stimulated the development of repro… ▽ More

    Submitted 14 March, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

  14. arXiv:2302.12196  [pdf, other

    cs.LG

    Calibrated Regression Against An Adversary Without Regret

    Authors: Shachi Deshpande, Charles Marx, Volodymyr Kuleshov

    Abstract: We are interested in probabilistic prediction in online settings in which data does not follow a probability distribution. Our work seeks to achieve two goals: (1) producing valid probabilities that accurately reflect model confidence; and (2) ensuring that traditional notions of performance (e.g., high accuracy) still hold. We introduce online algorithms guaranteed to achieve these goals on arbit… ▽ More

    Submitted 4 June, 2024; v1 submitted 23 February, 2023; originally announced February 2023.

  15. arXiv:2212.13780  [pdf, other

    eess.IV cs.CV cs.LG

    SynCLay: Interactive Synthesis of Histology Images from Bespoke Cellular Layouts

    Authors: Srijay Deshpande, Muhammad Dawood, Fayyaz Minhas, Nasir Rajpoot

    Abstract: Automated synthesis of histology images has several potential applications in computational pathology. However, no existing method can generate realistic tissue images with a bespoke cellular layout or user-defined histology parameters. In this work, we propose a novel framework called SynCLay (Synthesis from Cellular Layouts) that can construct realistic and high-quality histology images from use… ▽ More

    Submitted 28 December, 2022; originally announced December 2022.

  16. Convolution, aggregation and attention based deep neural networks for accelerating simulations in mechanics

    Authors: Saurabh Deshpande, Raúl I. Sosa, Stéphane P. A. Bordas, Jakub Lengiewicz

    Abstract: Deep learning surrogate models are being increasingly used in accelerating scientific simulations as a replacement for costly conventional numerical techniques. However, their use remains a significant challenge when dealing with real-world complex examples. In this work, we demonstrate three types of neural network architectures for efficient learning of highly non-linear deformations of solid bo… ▽ More

    Submitted 24 March, 2023; v1 submitted 1 December, 2022; originally announced December 2022.

    Journal ref: Front. Mater. 10:1128954

  17. arXiv:2212.00219  [pdf, other

    stat.ML cs.LG stat.OT

    Are you using test log-likelihood correctly?

    Authors: Sameer K. Deshpande, Soumya Ghosh, Tin D. Nguyen, Tamara Broderick

    Abstract: Test log-likelihood is commonly used to compare different models of the same data or different approximate inference algorithms for fitting the same probabilistic model. We present simple examples demonstrating how comparisons based on test log-likelihood can contradict comparisons according to other objectives. Specifically, our examples show that (i) approximate Bayesian inference algorithms tha… ▽ More

    Submitted 18 January, 2024; v1 submitted 30 November, 2022; originally announced December 2022.

    Comments: Presented at the ICBINB Workshop at NeurIPS 2022. This version accepted at TMLR, available at https://openreview.net/forum?id=n2YifD4Dxo

  18. MAgNET: A Graph U-Net Architecture for Mesh-Based Simulations

    Authors: Saurabh Deshpande, Stéphane P. A. Bordas, Jakub Lengiewicz

    Abstract: In many cutting-edge applications, high-fidelity computational models prove to be too slow for practical use and are therefore replaced by much faster surrogate models. Recently, deep learning techniques have increasingly been utilized to accelerate such predictions. To enable learning on large-dimensional and complex data, specific neural network architectures have been developed, including convo… ▽ More

    Submitted 2 April, 2024; v1 submitted 1 November, 2022; originally announced November 2022.

    Journal ref: Engineering Applications of Artificial Intelligence, Volume 133, Part B, 2024, 108055

  19. arXiv:2207.05016  [pdf, other

    cs.GT math.OC

    Capacity Management in a Pandemic with Endogenous Patient Choices and Flows

    Authors: Sanyukta Deshpande, Lavanya Marla, Alan Scheller-Wolf, Siddharth Prakash Singh

    Abstract: Motivated by the experiences of a healthcare service provider during the Covid-19 pandemic, we aim to study the decisions of a provider that operates both an Emergency Department (ED) and a medical Clinic. Patients contact the provider through a phone call or may present directly at the ED: patients can be COVID (suspected/confirmed) or non-COVID, and have different severities. Depending on the se… ▽ More

    Submitted 11 July, 2022; originally announced July 2022.

  20. Understanding Urban Water Consumption using Remotely Sensed Data

    Authors: Shaswat Mohanty, Anirudh Vijay, Shailesh Deshpande

    Abstract: Urban metabolism is an active field of research that deals with the estimation of emissions and resource consumption from urban regions. The analysis could be carried out through a manual surveyor by the implementation of elegant machine learning algorithms. In this exploratory work, we estimate the water consumption by the buildings in the region captured by satellite imagery. To this end, we bre… ▽ More

    Submitted 5 January, 2023; v1 submitted 3 May, 2022; originally announced May 2022.

    Comments: 4 pages, 2 figures, IEEE Conference Proceedings (IGARSS 2022)

  21. arXiv:2204.08491  [pdf, other

    cs.LG cs.CL cs.CV

    Active Learning Helps Pretrained Models Learn the Intended Task

    Authors: Alex Tamkin, Dat Nguyen, Salil Deshpande, Jesse Mu, Noah Goodman

    Abstract: Models can fail in unpredictable ways during deployment due to task ambiguity, when multiple behaviors are consistent with the provided training data. An example is an object classifier trained on red squares and blue circles: when encountering blue squares, the intended behavior is undefined. We investigate whether pretrained models are better active learners, capable of disambiguating between th… ▽ More

    Submitted 18 April, 2022; originally announced April 2022.

  22. arXiv:2203.09672  [pdf, other

    cs.LG

    Deep Multi-Modal Structural Equations For Causal Effect Estimation With Unstructured Proxies

    Authors: Shachi Deshpande, Kaiwen Wang, Dhruv Sreenivas, Zheng Li, Volodymyr Kuleshov

    Abstract: Estimating the effect of intervention from observational data while accounting for confounding variables is a key task in causal inference. Oftentimes, the confounders are unobserved, but we have access to large amounts of additional unstructured data (images, text) that contain valuable proxy signal about the missing confounders. This paper argues that leveraging this unstructured data can greatl… ▽ More

    Submitted 11 December, 2022; v1 submitted 17 March, 2022; originally announced March 2022.

    Comments: NeurIPS 2022 (accepted version)

  23. arXiv:2203.02649  [pdf, other

    cs.CR

    Towards an Antivirus for Quantum Computers

    Authors: Sanjay Deshpande, Chuanqi Xu, Theodoros Trochatos, Yongshan Ding, Jakub Szefer

    Abstract: Researchers are today exploring models for cloud-based usage of quantum computers where multi-tenancy can be used to share quantum computer hardware among multiple users. Multi-tenancy has a promise of allowing better utilization of the quantum computer hardware, but also opens up the quantum computer to new types of security attacks. As this and other recent research shows, it is possible to perf… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

    Comments: 4 pages, 5 figures, HOST 2022 author version

  24. arXiv:2203.02510  [pdf, ps, other

    q-bio.QM cs.CV cs.LG eess.IV

    Cellular Segmentation and Composition in Routine Histology Images using Deep Learning

    Authors: Muhammad Dawood, Raja Muhammad Saad Bashir, Srijay Deshpande, Manahil Raza, Adam Shephard

    Abstract: Identification and quantification of nuclei in colorectal cancer haematoxylin \& eosin (H\&E) stained histology images is crucial to prognosis and patient management. In computational pathology these tasks are referred to as nuclear segmentation, classification and composition and are used to extract meaningful interpretable cytological and architectural features for downstream analysis. The CoNIC… ▽ More

    Submitted 4 March, 2022; originally announced March 2022.

  25. arXiv:2203.01183  [pdf

    eess.IV cs.GR cs.HC cs.MM

    Omnidirectional MediA Format (OMAF): Toolbox for Virtual Reality Services

    Authors: Sachin Deshpande, Miska M. Hannuksela

    Abstract: This paper provides an overview of the Omnidirectional Media Format (OMAF) standard, second edition, which has been recently finalized. OMAF specifies the media format for coding, storage, delivery, and rendering of omnidirectional media, including video, audio, images, and timed text. Additionally, OMAF supports multiple viewpoints corresponding to omnidirectional cameras and overlay images or vi… ▽ More

    Submitted 2 March, 2022; originally announced March 2022.

    Comments: 7 pages, 1 figure. This document is the accepted version of the paper that has been published in 2021 IEEE Conference on Standards for Communications and Networking (CSCN)

    Journal ref: 2021 IEEE Conference on Standards for Communications and Networking (CSCN), 2021, pp. 20-25

  26. arXiv:2112.07184  [pdf, other

    cs.LG

    Calibrated and Sharp Uncertainties in Deep Learning via Density Estimation

    Authors: Volodymyr Kuleshov, Shachi Deshpande

    Abstract: Accurate probabilistic predictions can be characterized by two properties -- calibration and sharpness. However, standard maximum likelihood training yields models that are poorly calibrated and thus inaccurate -- a 90% confidence interval typically does not contain the true outcome 90% of the time. This paper argues that calibration is important in practice and is easy to maintain by performing l… ▽ More

    Submitted 19 September, 2022; v1 submitted 14 December, 2021; originally announced December 2021.

    ACM Class: I.2; I.5

  27. arXiv:2112.04620  [pdf, other

    cs.LG stat.ML

    Online Calibrated and Conformal Prediction Improves Bayesian Optimization

    Authors: Shachi Deshpande, Charles Marx, Volodymyr Kuleshov

    Abstract: Accurate uncertainty estimates are important in sequential model-based decision-making tasks such as Bayesian optimization. However, these estimates can be imperfect if the data violates assumptions made by the model (e.g., Gaussianity). This paper studies which uncertainties are needed in model-based decision-making and in Bayesian optimization, and argues that uncertainties can benefit from cali… ▽ More

    Submitted 25 June, 2024; v1 submitted 8 December, 2021; originally announced December 2021.

    ACM Class: I.2; I.5

    Journal ref: Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, May 2024; PMLR 238:1450-1458

  28. Probabilistic Deep Learning for Real-Time Large Deformation Simulations

    Authors: Saurabh Deshpande, Jakub Lengiewicz, Stéphane P. A. Bordas

    Abstract: For many novel applications, such as patient-specific computer-aided surgery, conventional solution techniques of the underlying nonlinear problems are usually computationally too expensive and are lacking information about how certain can we be about their predictions. In the present work, we propose a highly efficient deep-learning surrogate framework that is able to accurately predict the respo… ▽ More

    Submitted 4 July, 2022; v1 submitted 2 November, 2021; originally announced November 2021.

    Journal ref: Computer Methods in Applied Mechanics and Engineering, 2022, Volume 398

  29. arXiv:2109.09248  [pdf, other

    cs.GT

    Wages and Utilities in a Closed Economy

    Authors: Sanyukta Deshpande, Milind A. Sohoni

    Abstract: The broad objective of this paper is to propose a mathematical model for the study of causes of wage inequality and relate it to choices of consumption, the technologies of production, and the composition of labor in an economy. The paper constructs a Simple Closed Model, or an SCM, for short, for closed economies, in which the consumption and the production parts are clearly separated and yet cou… ▽ More

    Submitted 17 August, 2023; v1 submitted 19 September, 2021; originally announced September 2021.

  30. arXiv:2108.07031  [pdf, other

    cs.PL cs.PF physics.comp-ph

    On the performance of GPU accelerated q-LSKUM based meshfree solvers in Fortran, C++, Python, and Julia

    Authors: Nischay Ram Mamidi, Kumar Prasun, Dhruv Saxena, Anil Nemili, Bharatkumar Sharma, S. M. Deshpande

    Abstract: This report presents a comprehensive analysis of the performance of GPU accelerated meshfree CFD solvers for two-dimensional compressible flows in Fortran, C++, Python, and Julia. The programming model CUDA is used to develop the GPU codes. The meshfree solver is based on the least squares kinetic upwind method with entropy variables (q-LSKUM). To assess the computational efficiency of the GPU sol… ▽ More

    Submitted 16 August, 2021; originally announced August 2021.

    Comments: 42 pages, 3 figures

    ACM Class: D.3.0; J.2

  31. arXiv:2106.06510  [pdf, other

    stat.ML cs.LG stat.CO

    Measuring the robustness of Gaussian processes to kernel choice

    Authors: William T. Stephenson, Soumya Ghosh, Tin D. Nguyen, Mikhail Yurochkin, Sameer K. Deshpande, Tamara Broderick

    Abstract: Gaussian processes (GPs) are used to make medical and scientific decisions, including in cardiac care and monitoring of atmospheric carbon dioxide levels. Notably, the choice of GP kernel is often somewhat arbitrary. In particular, uncountably many kernels typically align with qualitative prior knowledge (e.g.\ function smoothness or stationarity). But in practice, data analysts choose among a han… ▽ More

    Submitted 12 March, 2022; v1 submitted 11 June, 2021; originally announced June 2021.

    Comments: AISTATS 2022

  32. arXiv:2008.07331  [pdf, other

    cs.LG cs.AI cs.HC cs.RO stat.ML

    Interactive Visualization for Debugging RL

    Authors: Shuby Deshpande, Benjamin Eysenbach, Jeff Schneider

    Abstract: Visualization tools for supervised learning allow users to interpret, introspect, and gain an intuition for the successes and failures of their models. While reinforcement learning practitioners ask many of the same questions, existing tools are not applicable to the RL setting as these tools address challenges typically found in the supervised learning regime. In this work, we design and implemen… ▽ More

    Submitted 18 August, 2020; v1 submitted 14 August, 2020; originally announced August 2020.

    Comments: Builds on preliminary work presented at ICML 2020 (WHI) arXiv:2007.05577. An interactive demo of the system can be at https://tinyurl.com/y5gv5t4m

  33. arXiv:2008.04526  [pdf, other

    eess.IV cs.CV

    SAFRON: Stitching Across the Frontier for Generating Colorectal Cancer Histology Images

    Authors: Srijay Deshpande, Fayyaz Minhas, Simon Graham, Nasir Rajpoot

    Abstract: Synthetic images can be used for the development and evaluation of deep learning algorithms in the context of limited availability of data. In the field of computational pathology, where histology images are large in size and visual context is crucial, synthesis of large high resolution images via generative modeling is a challenging task. This is due to memory and computational constraints hinder… ▽ More

    Submitted 26 March, 2021; v1 submitted 11 August, 2020; originally announced August 2020.

  34. arXiv:2007.09186  [pdf, other

    cs.IR

    AWS CORD-19 Search: A Neural Search Engine for COVID-19 Literature

    Authors: Parminder Bhatia, Lan Liu, Kristjan Arumae, Nima Pourdamghani, Suyog Deshpande, Ben Snively, Mona Mona, Colby Wise, George Price, Shyam Ramaswamy, Xiaofei Ma, Ramesh Nallapati, Zhiheng Huang, Bing Xiang, Taha Kass-Hout

    Abstract: Coronavirus disease (COVID-19) has been declared as a pandemic by WHO with thousands of cases being reported each day. Numerous scientific articles are being published on the disease raising the need for a service which can organize, and query them in a reliable fashion. To support this cause we present AWS CORD-19 Search (ACS), a public, COVID-19 specific, neural search engine that is powered by… ▽ More

    Submitted 7 October, 2020; v1 submitted 17 July, 2020; originally announced July 2020.

  35. arXiv:2007.05577  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Vizarel: A System to Help Better Understand RL Agents

    Authors: Shuby Deshpande, Jeff Schneider

    Abstract: Visualization tools for supervised learning have allowed users to interpret, introspect, and gain intuition for the successes and failures of their models. While reinforcement learning practitioners ask many of the same questions, existing tools are not applicable to the RL setting. In this work, we describe our initial attempt at constructing a prototype of these ideas, through identifying possib… ▽ More

    Submitted 10 July, 2020; originally announced July 2020.

    Comments: Accepted to ICML 2020 Workshop on Human Interpretability in Machine Learning (Spotlight)

  36. arXiv:2007.02149  [pdf

    cs.CV cs.AI cs.CY

    Human Assisted Artificial Intelligence Based Technique to Create Natural Features for OpenStreetMap

    Authors: Piyush Yadav, Dipto Sarkar, Shailesh Deshpande, Edward Curry

    Abstract: In this work, we propose an AI-based technique using freely available satellite images like Landsat and Sentinel to create natural features over OSM in congruence with human editors acting as initiators and validators. The method is based on Interactive Machine Learning technique where human inputs are coupled with the machine to solve complex problems efficiently as compare to pure autonomous pro… ▽ More

    Submitted 8 July, 2020; v1 submitted 4 July, 2020; originally announced July 2020.

    Comments: 3 pages, 2 Figures, Submitted to FOSS4G Europe 2020 Academic Track (Postponed to 2021)

  37. Computational Model for Urban Growth Using Socioeconomic Latent Parameters

    Authors: Piyush Yadav, Shamsuddin Ladha, Shailesh Deshpande, Edward Curry

    Abstract: Land use land cover changes (LULCC) are generally modeled using multi-scale spatio-temporal variables. Recently, Markov Chain (MC) has been used to model LULCC. However, the model is derived from the proportion of LULCC observed over a given period and it does not account for temporal factors such as macro-economic, socio-economic, etc. In this paper, we present a richer model based on Hidden Mark… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

    Comments: 12 pages

    Journal ref: ECML PKDD 2018 Lecture Notes in Computer Science vol 11329 Springer Cham

  38. AutoKnow: Self-Driving Knowledge Collection for Products of Thousands of Types

    Authors: Xin Luna Dong, Xiang He, Andrey Kan, Xian Li, Yan Liang, Jun Ma, Yifan Ethan Xu, Chenwei Zhang, Tong Zhao, Gabriel Blanco Saldana, Saurabh Deshpande, Alexandre Michetti Manduca, Jay Ren, Surender Pal Singh, Fan Xiao, Haw-Shiuan Chang, Giannis Karamanolakis, Yuning Mao, Yaqing Wang, Christos Faloutsos, Andrew McCallum, Jiawei Han

    Abstract: Can one build a knowledge graph (KG) for all products in the world? Knowledge graphs have firmly established themselves as valuable sources of information for search and question answering, and it is natural to wonder if a KG can contain information about products offered at online retail sites. There have been several successful examples of generic KGs, but organizing information about products p… ▽ More

    Submitted 24 June, 2020; originally announced June 2020.

    Comments: KDD 2020

  39. arXiv:2006.12669  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Approximate Cross-Validation for Structured Models

    Authors: Soumya Ghosh, William T. Stephenson, Tin D. Nguyen, Sameer K. Deshpande, Tamara Broderick

    Abstract: Many modern data analyses benefit from explicitly modeling dependence structure in data -- such as measurements across time or space, ordered words in a sentence, or genes in a genome. A gold standard evaluation technique is structured cross-validation (CV), which leaves out some data subset (such as data within a time interval or data in a geographic region) in each fold. But CV here can be prohi… ▽ More

    Submitted 1 December, 2020; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: 25 pages, 8 figures. NeurIPS 2020 camera ready. v2 fixes typos and provides additional empirical results. Code: https://github.com/SoumyaTGhosh/structured-infinitesimal-jackknife

  40. arXiv:1906.03479  [pdf, other

    cs.LG astro-ph.IM physics.comp-ph stat.ML

    Learning Radiative Transfer Models for Climate Change Applications in Imaging Spectroscopy

    Authors: Shubhankar Deshpande, Brian D. Bue, David R. Thompson, Vijay Natraj, Mario Parente

    Abstract: According to a recent investigation, an estimated 33-50% of the world's coral reefs have undergone degradation, believed to be as a result of climate change. A strong driver of climate change and the subsequent environmental impact are greenhouse gases such as methane. However, the exact relation climate change has to the environmental condition cannot be easily established. Remote sensing methods… ▽ More

    Submitted 8 June, 2019; originally announced June 2019.

    Comments: Accepted to International Conference on Machine Learning (ICML) 2019 Workshop: Climate Change: How Can AI Help?

  41. arXiv:1902.06371  [pdf, other

    cs.NI

    Achieving Throughput via Fine-Grained Path Planning in Small World DTNs

    Authors: Dhrubojyoti Roy, Mukundan Sridharan, Satyajeet Deshpande, Anish Arora

    Abstract: We explore the benefits of using fine-grained statistics in small world DTNs to achieve high throughput without the aid of external infrastructure. We first design an empirical node-pair inter-contacts model that predicts meetings within a time frame of suitable length, typically of the order of days, with a probability above some threshold, and can be readily computed with low overhead. This temp… ▽ More

    Submitted 17 February, 2019; originally announced February 2019.

    Comments: arXiv admin note: text overlap with arXiv:1310.1162

  42. arXiv:1902.05064  [pdf, other

    q-bio.GN cs.LG stat.ML

    PLIT: An alignment-free computational tool for identification of long non-coding RNAs in plant transcriptomic datasets

    Authors: S. Deshpande, J. Shuttleworth, J. Yang, S. Taramonli, M. England

    Abstract: Long non-coding RNAs (lncRNAs) are a class of non-coding RNAs which play a significant role in several biological processes. RNA-seq based transcriptome sequencing has been extensively used for identification of lncRNAs. However, accurate identification of lncRNAs in RNA-seq datasets is crucial for exploring their characteristic functions in the genome as most coding potential computation (CPC) to… ▽ More

    Submitted 12 February, 2019; originally announced February 2019.

    Comments: 36 pages. Author's accepted version (Green OA)

    Journal ref: Computers in Biology and Medicine, 105, pp. 169 - 181, Elevier, 2019

  43. arXiv:1807.08820  [pdf, other

    cs.LG stat.ML

    RAIM: Recurrent Attentive and Intensive Model of Multimodal Patient Monitoring Data

    Authors: Yanbo Xu, Siddharth Biswal, Shriprasad R Deshpande, Kevin O Maher, Jimeng Sun

    Abstract: With the improvement of medical data capturing, vast amount of continuous patient monitoring data, e.g., electrocardiogram (ECG), real-time vital signs and medications, become available for clinical decision support at intensive care units (ICUs). However, it becomes increasingly challenging to model such data, due to high density of the monitoring data, heterogeneous data types and the requiremen… ▽ More

    Submitted 23 July, 2018; originally announced July 2018.

  44. arXiv:1310.1162   

    cs.NI

    A Little Prediction Goes a Long Way: Routing in Semi-Deterministic Delay Tolerant Networks

    Authors: Dhrubojyoti Roy, Mukundan Sridharan, Satyajeet Deshpande, Anish Arora

    Abstract: Realizing delay-capacity in intermittently connected mobile networks remains a largely open question, with state-of-the-art routing schemes typically focusing either on delay or on capacity. We show the feasibility of routing with both high goodput and desired delay constraints, with REAPER (for Reliable, Efficient, and Predictive Routing), a fully distributed convergecast routing framework that j… ▽ More

    Submitted 22 January, 2014; v1 submitted 4 October, 2013; originally announced October 2013.

    Comments: This paper has been withdrawn by the authors. Withdrawn since document intended to be anonymous