Zum Hauptinhalt springen

Showing 1–30 of 30 results for author: Krishnan, R G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.10258  [pdf, other

    cs.CV cs.LG

    NeRF-US: Removing Ultrasound Imaging Artifacts from Neural Radiance Fields in the Wild

    Authors: Rishit Dagli, Atsuhiro Hibi, Rahul G. Krishnan, Pascal N. Tyrrell

    Abstract: Current methods for performing 3D reconstruction and novel view synthesis (NVS) in ultrasound imaging data often face severe artifacts when training NeRF-based approaches. The artifacts produced by current approaches differ from NeRF floaters in general scenes because of the unique nature of ultrasound capture. Furthermore, existing models fail to produce reasonable 3D reconstructions when ultraso… ▽ More

    Submitted 20 August, 2024; v1 submitted 13 August, 2024; originally announced August 2024.

  2. arXiv:2408.05437  [pdf, other

    cs.LG

    Predicting Long-Term Allograft Survival in Liver Transplant Recipients

    Authors: Xiang Gao, Michael Cooper, Maryam Naghibzadeh, Amirhossein Azhie, Mamatha Bhat, Rahul G. Krishnan

    Abstract: Liver allograft failure occurs in approximately 20% of liver transplant recipients within five years post-transplant, leading to mortality or the need for retransplantation. Providing an accurate and interpretable model for individualized risk estimation of graft failure is essential for improving post-transplant care. To this end, we introduce the Model for Allograft Survival (MAS), a simple line… ▽ More

    Submitted 10 August, 2024; originally announced August 2024.

    Comments: Accepted at MLHC 2024

  3. arXiv:2407.07018  [pdf, other

    cs.LG cs.CL stat.ME

    End-To-End Causal Effect Estimation from Unstructured Natural Language Data

    Authors: Nikita Dhawan, Leonardo Cotta, Karen Ullrich, Rahul G. Krishnan, Chris J. Maddison

    Abstract: Knowing the effect of an intervention is critical for human decision-making, but current approaches for causal effect estimation rely on manual data collection and structuring, regardless of the causal assumptions. This increases both the cost and time-to-completion for studies. We show how large, diverse observational text data can be mined with large language models (LLMs) to produce inexpensive… ▽ More

    Submitted 23 August, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

    Comments: 28 pages, 11 figures

  4. arXiv:2406.00426  [pdf, other

    cs.LG

    InterpreTabNet: Distilling Predictive Signals from Tabular Data by Salient Feature Interpretation

    Authors: Jacob Si, Wendy Yusi Cheng, Michael Cooper, Rahul G. Krishnan

    Abstract: Tabular data are omnipresent in various sectors of industries. Neural networks for tabular data such as TabNet have been proposed to make predictions while leveraging the attention mechanism for interpretability. However, the inferred attention masks are often dense, making it challenging to come up with rationales about the predictive signal. To remedy this, we propose InterpreTabNet, a variant o… ▽ More

    Submitted 11 June, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

    Comments: ICML 2024 Spotlight

  5. arXiv:2404.07266  [pdf, other

    cs.LG

    Sequential Decision Making with Expert Demonstrations under Unobserved Heterogeneity

    Authors: Vahid Balazadeh, Keertana Chidambaram, Viet Nguyen, Rahul G. Krishnan, Vasilis Syrgkanis

    Abstract: We study the problem of online sequential decision-making given auxiliary demonstrations from experts who made their decisions based on unobserved contextual information. These demonstrations can be viewed as solving related but slightly different tasks than what the learner faces. This setting arises in many application domains, such as self-driving cars, healthcare, and finance, where expert dem… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  6. arXiv:2403.18910  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    A Geometric Explanation of the Likelihood OOD Detection Paradox

    Authors: Hamidreza Kamkari, Brendan Leigh Ross, Jesse C. Cresswell, Anthony L. Caterini, Rahul G. Krishnan, Gabriel Loaiza-Ganem

    Abstract: Likelihood-based deep generative models (DGMs) commonly exhibit a puzzling behaviour: when trained on a relatively complex dataset, they assign higher likelihood values to out-of-distribution (OOD) data from simpler sources. Adding to the mystery, OOD samples are never generated by these DGMs despite having higher likelihoods. This two-pronged paradox has yet to be conclusively explained, making l… ▽ More

    Submitted 11 June, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: ICML 2024

  7. arXiv:2402.07344  [pdf, other

    cs.LG cs.AI

    Measurement Scheduling for ICU Patients with Offline Reinforcement Learning

    Authors: Zongliang Ji, Anna Goldenberg, Rahul G. Krishnan

    Abstract: Scheduling laboratory tests for ICU patients presents a significant challenge. Studies show that 20-40% of lab tests ordered in the ICU are redundant and could be eliminated without compromising patient safety. Prior work has leveraged offline reinforcement learning (Offline-RL) to find optimal policies for ordering lab tests based on patient information. However, new ICU patient datasets have sin… ▽ More

    Submitted 11 February, 2024; originally announced February 2024.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2023, December 10th, 2023, New Orleans, United States, 11 pages

  8. arXiv:2311.18780  [pdf, other

    cs.LG

    MultiResFormer: Transformer with Adaptive Multi-Resolution Modeling for General Time Series Forecasting

    Authors: Linfeng Du, Ji Xin, Alex Labach, Saba Zuberi, Maksims Volkovs, Rahul G. Krishnan

    Abstract: Transformer-based models have greatly pushed the boundaries of time series forecasting recently. Existing methods typically encode time series data into $\textit{patches}$ using one or a fixed set of patch lengths. This, however, could result in a lack of ability to capture the variety of intricate temporal dependencies present in real-world multi-periodic time series. In this paper, we propose Mu… ▽ More

    Submitted 8 February, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

  9. arXiv:2311.02221  [pdf, other

    cs.LG stat.ML

    Structured Neural Networks for Density Estimation and Causal Inference

    Authors: Asic Q. Chen, Ruian Shi, Xiang Gao, Ricardo Baptista, Rahul G. Krishnan

    Abstract: Injecting structure into neural networks enables learning functions that satisfy invariances with respect to subsets of inputs. For instance, when learning generative models using neural networks, it is advantageous to encode the conditional independence structure of observed variables, often in the form of Bayesian networks. We propose the Structured Neural Network (StrNN), which injects structur… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: 10 pages with 5 figures, to be published in Neural Information Processing Systems 2023

  10. arXiv:2308.07480  [pdf, other

    cs.LG stat.ME

    Order-based Structure Learning with Normalizing Flows

    Authors: Hamidreza Kamkari, Vahid Balazadeh, Vahid Zehtab, Rahul G. Krishnan

    Abstract: Estimating the causal structure of observational data is a challenging combinatorial search problem that scales super-exponentially with graph size. Existing methods use continuous relaxations to make this problem computationally tractable but often restrict the data-generating process to additive noise models (ANMs) through explicit or implicit assumptions. We present Order-based Structure Learni… ▽ More

    Submitted 17 February, 2024; v1 submitted 14 August, 2023; originally announced August 2023.

  11. arXiv:2306.11912  [pdf, other

    cs.LG

    Copula-Based Deep Survival Models for Dependent Censoring

    Authors: Ali Hossein Gharari Foomani, Michael Cooper, Russell Greiner, Rahul G. Krishnan

    Abstract: A survival dataset describes a set of instances (e.g. patients) and provides, for each, either the time until an event (e.g. death), or the censoring time (e.g. when lost to follow-up - which is a lower bound on the time until the event). We consider the challenge of survival prediction: learning, from such data, a predictive model that can produce an individual survival distribution for a novel i… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: 23 pages, 7 figures

  12. arXiv:2305.12031  [pdf, other

    cs.CL cs.AI

    Clinical Camel: An Open Expert-Level Medical Language Model with Dialogue-Based Knowledge Encoding

    Authors: Augustin Toma, Patrick R. Lawler, Jimmy Ba, Rahul G. Krishnan, Barry B. Rubin, Bo Wang

    Abstract: We present Clinical Camel, an open large language model (LLM) explicitly tailored for clinical research. Fine-tuned from LLaMA-2 using QLoRA, Clinical Camel achieves state-of-the-art performance across medical benchmarks among openly available medical LLMs. Leveraging efficient single-GPU training, Clinical Camel surpasses GPT-3.5 in five-shot evaluations on all assessed benchmarks, including 64.3… ▽ More

    Submitted 17 August, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: for model weights, see https://huggingface.co/wanglab/

  13. arXiv:2304.13017  [pdf, other

    cs.LG

    DuETT: Dual Event Time Transformer for Electronic Health Records

    Authors: Alex Labach, Aslesha Pokhrel, Xiao Shi Huang, Saba Zuberi, Seung Eun Yi, Maksims Volkovs, Tomi Poutanen, Rahul G. Krishnan

    Abstract: Electronic health records (EHRs) recorded in hospital settings typically contain a wide range of numeric time series data that is characterized by high sparsity and irregular observations. Effective modelling for such data must exploit its time series nature, the semantic relationship between different types of observations, and information in the sparsity structure of the data. Self-supervised Tr… ▽ More

    Submitted 15 August, 2023; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: Accepted at MLHC 2023, camera-ready version

  14. arXiv:2303.01841  [pdf, other

    cs.LG cs.AI

    Anamnesic Neural Differential Equations with Orthogonal Polynomial Projections

    Authors: Edward De Brouwer, Rahul G. Krishnan

    Abstract: Neural ordinary differential equations (Neural ODEs) are an effective framework for learning dynamical systems from irregularly sampled time series data. These models provide a continuous-time latent representation of the underlying dynamical system where new observations at arbitrary time points can be used to update the latent representation of the dynamical system. Existing parameterizations fo… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: Accepted at ICLR 2023

    Journal ref: International Conference on Learning Representations (ICLR) 2023

  15. arXiv:2212.02742  [pdf, other

    cs.LG

    A Learning Based Hypothesis Test for Harmful Covariate Shift

    Authors: Tom Ginsberg, Zhongyuan Liang, Rahul G. Krishnan

    Abstract: The ability to quickly and accurately identify covariate shift at test time is a critical and often overlooked component of safe machine learning systems deployed in high-risk domains. While methods exist for detecting when predictions should not be made on out-of-distribution test examples, identifying distributional level differences between training and test time can help determine when a model… ▽ More

    Submitted 1 March, 2023; v1 submitted 5 December, 2022; originally announced December 2022.

  16. arXiv:2211.07076  [pdf, other

    cs.LG

    Learning predictive checklists from continuous medical data

    Authors: Yukti Makhija, Edward De Brouwer, Rahul G. Krishnan

    Abstract: Checklists, while being only recently introduced in the medical domain, have become highly popular in daily clinical practice due to their combined effectiveness and great interpretability. Checklists are usually designed by expert clinicians that manually collect and analyze available evidence. However, the increasing quantity of available medical data is calling for a partially automated checkli… ▽ More

    Submitted 13 November, 2022; originally announced November 2022.

    Comments: Extended Abstract presented at Machine Learning for Health (ML4H) symposium 2022, November 28th, 2022, New Orleans, United States & Virtual, http://www.ml4h.cc, 7 pages

    Journal ref: Machine Learning for Health (ML4H) symposium 2022

  17. arXiv:2210.08139  [pdf, other

    cs.LG

    Partial Identification of Treatment Effects with Implicit Generative Models

    Authors: Vahid Balazadeh, Vasilis Syrgkanis, Rahul G. Krishnan

    Abstract: We consider the problem of partial identification, the estimation of bounds on the treatment effects from observational data. Although studied using discrete treatment variables or in specific causal graphs (e.g., instrumental variables), partial identification has been recently explored using tools from deep generative modeling. We propose a new method for partial identification of average treatm… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

  18. arXiv:2208.02301  [pdf, other

    cs.LG

    HiCu: Leveraging Hierarchy for Curriculum Learning in Automated ICD Coding

    Authors: Weiming Ren, Ruijing Zeng, Tongzi Wu, Tianshu Zhu, Rahul G. Krishnan

    Abstract: There are several opportunities for automation in healthcare that can improve clinician throughput. One such example is assistive tools to document diagnosis codes when clinicians write notes. We study the automation of medical code prediction using curriculum learning, which is a training strategy for machine learning models that gradually increases the hardness of the learning tasks from easy to… ▽ More

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: To appear at Machine Learning for Healthcare Conference (MLHC2022)

  19. arXiv:2206.02647  [pdf, other

    cs.CV

    Scaling Vision Transformers to Gigapixel Images via Hierarchical Self-Supervised Learning

    Authors: Richard J. Chen, Chengkuan Chen, Yicong Li, Tiffany Y. Chen, Andrew D. Trister, Rahul G. Krishnan, Faisal Mahmood

    Abstract: Vision Transformers (ViTs) and their multi-scale and hierarchical variations have been successful at capturing image representations but their use has been generally studied for low-resolution images (e.g. - 256x256, 384384). For gigapixel whole-slide imaging (WSI) in computational pathology, WSIs can be as large as 150000x150000 pixels at 20X magnification and exhibit a hierarchical structure of… ▽ More

    Submitted 6 June, 2022; originally announced June 2022.

    Comments: Accepted to CVPR 2022 (Oral)

  20. arXiv:2204.08324  [pdf, other

    cs.CV cs.AI

    Hierarchical Optimal Transport for Comparing Histopathology Datasets

    Authors: Anna Yeaton, Rahul G. Krishnan, Rebecca Mieloszyk, David Alvarez-Melis, Grace Huynh

    Abstract: Scarcity of labeled histopathology data limits the applicability of deep learning methods to under-profiled cancer types and labels. Transfer learning allows researchers to overcome the limitations of small datasets by pre-training machine learning models on larger datasets similar to the small target dataset. However, similarity between datasets is often determined heuristically. In this paper, w… ▽ More

    Submitted 20 April, 2022; v1 submitted 18 April, 2022; originally announced April 2022.

  21. arXiv:2204.05229  [pdf, other

    cs.LG stat.ML

    Mixture-of-experts VAEs can disregard variation in surjective multimodal data

    Authors: Jannik Wolff, Tassilo Klein, Moin Nabi, Rahul G. Krishnan, Shinichi Nakajima

    Abstract: Machine learning systems are often deployed in domains that entail data from multiple modalities, for example, phenotypic and genotypic characteristics describe patients in healthcare. Previous works have developed multimodal variational autoencoders (VAEs) that generate several modalities. We consider subjective data, where single datapoints from one modality (such as class labels) describe multi… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: Accepted at the NeurIPS 2021 workshop on Bayesian Deep Learning

  22. arXiv:2203.00585  [pdf, other

    cs.CV q-bio.TO

    Self-Supervised Vision Transformers Learn Visual Concepts in Histopathology

    Authors: Richard J. Chen, Rahul G. Krishnan

    Abstract: Tissue phenotyping is a fundamental task in learning objective characterizations of histopathologic biomarkers within the tumor-immune microenvironment in cancer pathology. However, whole-slide imaging (WSI) is a complex computer vision in which: 1) WSIs have enormous image resolutions with precludes large-scale pixel-level efforts in data curation, and 2) diversity of morphological phenotypes res… ▽ More

    Submitted 1 March, 2022; originally announced March 2022.

    Comments: Learning Meaningful Representations of Life (NeurIPS 2021)

  23. arXiv:2110.14993  [pdf, other

    cs.LG stat.ML

    Using Time-Series Privileged Information for Provably Efficient Learning of Prediction Models

    Authors: Rickard K. A. Karlsson, Martin Willbo, Zeshan Hussain, Rahul G. Krishnan, David Sontag, Fredrik D. Johansson

    Abstract: We study prediction of future outcomes with supervised models that use privileged information during learning. The privileged information comprises samples of time series observed between the baseline time of prediction and the future outcome; this information is only available at training time which differs from the traditional supervised learning. Our question is when using this privileged data… ▽ More

    Submitted 5 May, 2022; v1 submitted 28 October, 2021; originally announced October 2021.

    Journal ref: Proceedings of The 25th International Conference on Artificial Intelligence and Statistics, PMLR 151:5459-5484, 2022

  24. arXiv:2102.11218  [pdf, other

    cs.LG

    Neural Pharmacodynamic State Space Modeling

    Authors: Zeshan Hussain, Rahul G. Krishnan, David Sontag

    Abstract: Modeling the time-series of high-dimensional, longitudinal data is important for predicting patient disease progression. However, existing neural network based approaches that learn representations of patient state, while very flexible, are susceptible to overfitting. We propose a deep generative model that makes use of a novel attention-based neural architecture inspired by the physics of how tre… ▽ More

    Submitted 17 June, 2021; v1 submitted 22 February, 2021; originally announced February 2021.

    Comments: To appear at the International Conference on Machine Learning (ICML) 2021

  25. arXiv:2102.07005  [pdf, other

    stat.ML cs.LG

    Clustering Interval-Censored Time-Series for Disease Phenotyping

    Authors: Irene Y. Chen, Rahul G. Krishnan, David Sontag

    Abstract: Unsupervised learning is often used to uncover clusters in data. However, different kinds of noise may impede the discovery of useful patterns from real-world time-series data. In this work, we focus on mitigating the interference of interval censoring in the task of clustering for disease phenotyping. We develop a deep generative, continuous-time model of time-series data that clusters time-serie… ▽ More

    Submitted 5 December, 2021; v1 submitted 13 February, 2021; originally announced February 2021.

    Comments: AAAI 2022

  26. arXiv:1802.05814  [pdf, other

    stat.ML cs.IR cs.LG

    Variational Autoencoders for Collaborative Filtering

    Authors: Dawen Liang, Rahul G. Krishnan, Matthew D. Hoffman, Tony Jebara

    Abstract: We extend variational autoencoders (VAEs) to collaborative filtering for implicit feedback. This non-linear probabilistic model enables us to go beyond the limited modeling capacity of linear factor models which still largely dominate collaborative filtering research.We introduce a generative model with multinomial likelihood and use Bayesian inference for parameter estimation. Despite widespread… ▽ More

    Submitted 15 February, 2018; originally announced February 2018.

    Comments: 10 pages, 3 figures. WWW 2018

  27. arXiv:1710.06085  [pdf, other

    stat.ML cs.LG

    On the challenges of learning with inference networks on sparse, high-dimensional data

    Authors: Rahul G. Krishnan, Dawen Liang, Matthew Hoffman

    Abstract: We study parameter estimation in Nonlinear Factor Analysis (NFA) where the generative model is parameterized by a deep neural network. Recent work has focused on learning such models using inference (or recognition) networks; we identify a crucial problem when modeling large, sparse, high-dimensional datasets -- underfitting. We study the extent of underfitting, highlighting that its severity incr… ▽ More

    Submitted 17 October, 2017; originally announced October 2017.

    Comments: 14 pages, 3 tables, 11 figures

  28. arXiv:1609.09869  [pdf, other

    stat.ML cs.AI cs.LG

    Structured Inference Networks for Nonlinear State Space Models

    Authors: Rahul G. Krishnan, Uri Shalit, David Sontag

    Abstract: Gaussian state space models have been used for decades as generative models of sequential data. They admit an intuitive probabilistic interpretation, have a simple functional form, and enjoy widespread adoption. We introduce a unified algorithm to efficiently learn a broad class of linear and non-linear state space models, including variants where the emission and transition distributions are mode… ▽ More

    Submitted 5 December, 2016; v1 submitted 30 September, 2016; originally announced September 2016.

    Comments: To appear in the Thirty-First AAAI Conference on Artificial Intelligence, February 2017, 13 pages, 11 figures with supplement, changed to AAAI formatting style, added references

  29. arXiv:1511.05121  [pdf, other

    stat.ML cs.LG

    Deep Kalman Filters

    Authors: Rahul G. Krishnan, Uri Shalit, David Sontag

    Abstract: Kalman Filters are one of the most influential models of time-varying phenomena. They admit an intuitive probabilistic interpretation, have a simple functional form, and enjoy widespread adoption in a variety of disciplines. Motivated by recent variational methods for learning deep generative models, we introduce a unified algorithm to efficiently learn a broad spectrum of Kalman filters. Of parti… ▽ More

    Submitted 25 November, 2015; v1 submitted 16 November, 2015; originally announced November 2015.

    Comments: 17 pages, 14 figures: Fixed typo in Fig. 1(b) and added reference

  30. arXiv:1511.02124  [pdf, other

    stat.ML cs.LG math.OC

    Barrier Frank-Wolfe for Marginal Inference

    Authors: Rahul G. Krishnan, Simon Lacoste-Julien, David Sontag

    Abstract: We introduce a globally-convergent algorithm for optimizing the tree-reweighted (TRW) variational objective over the marginal polytope. The algorithm is based on the conditional gradient method (Frank-Wolfe) and moves pseudomarginals within the marginal polytope through repeated maximum a posteriori (MAP) calls. This modular structure enables us to leverage black-box MAP solvers (both exact and ap… ▽ More

    Submitted 25 November, 2015; v1 submitted 6 November, 2015; originally announced November 2015.

    Comments: 25 pages, 12 figures, To appear in Neural Information Processing Systems (NIPS) 2015, Corrected reference and cleaned up bibliography