Zum Hauptinhalt springen

Showing 1–14 of 14 results for author: Sushil, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.13833  [pdf, other

    cs.CL

    Biomedical Large Languages Models Seem not to be Superior to Generalist Models on Unseen Medical Data

    Authors: Felix J. Dorfner, Amin Dada, Felix Busch, Marcus R. Makowski, Tianyu Han, Daniel Truhn, Jens Kleesiek, Madhumita Sushil, Jacqueline Lammert, Lisa C. Adams, Keno K. Bressem

    Abstract: Large language models (LLMs) have shown potential in biomedical applications, leading to efforts to fine-tune them on domain-specific data. However, the effectiveness of this approach remains unclear. This study evaluates the performance of biomedically fine-tuned LLMs against their general-purpose counterparts on a variety of clinical tasks. We evaluated their performance on clinical case challen… ▽ More

    Submitted 25 August, 2024; originally announced August 2024.

    Comments: 10 pages, 3 tables, 1 figure

  2. arXiv:2403.02558  [pdf

    cs.CL cs.CV

    The Minimum Information about CLinical Artificial Intelligence Checklist for Generative Modeling Research (MI-CLAIM-GEN)

    Authors: Brenda Y. Miao, Irene Y. Chen, Christopher YK Williams, Jaysón Davidson, Augusto Garcia-Agundez, Shenghuan Sun, Travis Zack, Suchi Saria, Rima Arnaout, Giorgio Quer, Hossein J. Sadaei, Ali Torkamani, Brett Beaulieu-Jones, Bin Yu, Milena Gianfrancesco, Atul J. Butte, Beau Norgeot, Madhumita Sushil

    Abstract: Recent advances in generative models, including large language models (LLMs), vision language models (VLMs), and diffusion models, have accelerated the field of natural language and image processing in medicine and marked a significant paradigm shift in how biomedical models can be developed and deployed. While these models are highly adaptable to new tasks, scaling and evaluating their usage pres… ▽ More

    Submitted 11 July, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  3. arXiv:2401.13887  [pdf

    cs.CL cs.LG

    A comparative study of zero-shot inference with large language models and supervised modeling in breast cancer pathology classification

    Authors: Madhumita Sushil, Travis Zack, Divneet Mandair, Zhiwei Zheng, Ahmed Wali, Yan-Ning Yu, Yuwei Quan, Atul J. Butte

    Abstract: Although supervised machine learning is popular for information extraction from clinical notes, creating large annotated datasets requires extensive domain expertise and is time-consuming. Meanwhile, large language models (LLMs) have demonstrated promising transfer learning capability. In this study, we explored whether recent LLMs can reduce the need for large-scale data annotations. We curated a… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  4. arXiv:2309.10895  [pdf, ps, other

    cs.HC cs.MA

    Large Language Models as Agents in the Clinic

    Authors: Nikita Mehandru, Brenda Y. Miao, Eduardo Rodriguez Almaraz, Madhumita Sushil, Atul J. Butte, Ahmed Alaa

    Abstract: Recent developments in large language models (LLMs) have unlocked new opportunities for healthcare, from information synthesis to clinical decision support. These new LLMs are not just capable of modeling language, but can also act as intelligent "agents" that interact with stakeholders in open-ended conversations and even influence clinical decision-making. Rather than relying on benchmarks that… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: 4 pages

  5. CORAL: Expert-Curated medical Oncology Reports to Advance Language Model Inference

    Authors: Madhumita Sushil, Vanessa E. Kennedy, Divneet Mandair, Brenda Y. Miao, Travis Zack, Atul J. Butte

    Abstract: Both medical care and observational studies in oncology require a thorough understanding of a patient's disease progression and treatment history, often elaborately documented in clinical notes. Despite their vital role, no current oncology information representation and annotation schema fully encapsulates the diversity of information recorded within these notes. Although large language models (L… ▽ More

    Submitted 11 January, 2024; v1 submitted 7 August, 2023; originally announced August 2023.

    Comments: Source code available at: https://github.com/MadhumitaSushil/OncLLMExtraction

  6. arXiv:2306.09877  [pdf

    cs.CL

    Revealing the impact of social circumstances on the selection of cancer therapy through natural language processing of social work notes

    Authors: Shenghuan Sun, Travis Zack, Christopher Y. K. Williams, Atul J. Butte, Madhumita Sushil

    Abstract: We aimed to investigate the impact of social circumstances on cancer therapy selection using natural language processing to derive insights from social worker documentation. We developed and employed a Bidirectional Encoder Representations from Transformers (BERT) based approach, using a hierarchical multi-step BERT model (BERT-MS) to predict the prescription of targeted cancer therapy to patients… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    Comments: 18 pages, 4 figures, 2 Tables

  7. arXiv:2301.06570  [pdf

    cs.CL stat.ME

    Cross-institution text mining to uncover clinical associations: a case study relating social factors and code status in intensive care medicine

    Authors: Madhumita Sushil, Atul J. Butte, Ewoud Schuit, Maarten van Smeden, Artuur M. Leeuwenberg

    Abstract: Objective: Text mining of clinical notes embedded in electronic medical records is increasingly used to extract patient characteristics otherwise not or only partly available, to assess their association with relevant health outcomes. As manual data labeling needed to develop text mining models is resource intensive, we investigated whether off-the-shelf text mining models developed at external in… ▽ More

    Submitted 16 January, 2023; originally announced January 2023.

    MSC Class: 68T50; 68U35; 62-xx; 62P10; 92C60; 92D30 ACM Class: I.2.7; G.3

  8. arXiv:2212.01462  [pdf

    cs.CL

    Topic Modeling on Clinical Social Work Notes for Exploring Social Determinants of Health Factors

    Authors: Shenghuan Sun, Travis Zack, Madhumita Sushil, Atul J. Butte

    Abstract: Most research studying social determinants of health (SDoH) has focused on physician notes or structured elements of the electronic medical record (EMR). We hypothesize that clinical notes from social workers, whose role is to ameliorate social and economic factors, might provide a richer source of data on SDoH. We sought to perform topic modeling to identify robust topics of discussion within a l… ▽ More

    Submitted 2 December, 2022; originally announced December 2022.

    Comments: Under review

  9. arXiv:2210.06566  [pdf

    cs.CL

    Developing a general-purpose clinical language inference model from a large corpus of clinical notes

    Authors: Madhumita Sushil, Dana Ludwig, Atul J. Butte, Vivek A. Rudrapatna

    Abstract: Several biomedical language models have already been developed for clinical language inference. However, these models typically utilize general vocabularies and are trained on relatively small clinical corpora. We sought to evaluate the impact of using a domain-specific vocabulary and a large clinical training corpus on the performance of these language models in clinical language inference. We tr… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Under review

  10. arXiv:2005.07111  [pdf, other

    cs.CL cs.LG stat.ML

    Distilling neural networks into skipgram-level decision lists

    Authors: Madhumita Sushil, Simon Šuster, Walter Daelemans

    Abstract: Several previous studies on explanation for recurrent neural networks focus on approaches that find the most important input segments for a network as its explanations. In that case, the manner in which these input segments combine with each other to form an explanatory pattern remains unknown. To overcome this, some previous work tries to find patterns (called rules) in the data that explain neur… ▽ More

    Submitted 18 May, 2020; v1 submitted 14 May, 2020; originally announced May 2020.

  11. arXiv:1910.07350  [pdf, ps, other

    cs.CL

    Why can't memory networks read effectively?

    Authors: Simon Šuster, Madhumita Sushil, Walter Daelemans

    Abstract: Memory networks have been a popular choice among neural architectures for machine reading comprehension and question answering. While recent work revealed that memory networks can't truly perform multi-hop reasoning, we show in the present paper that vanilla memory networks are ineffective even in single-hop reading comprehension. We analyze the reasons for this on two cloze-style datasets, one fr… ▽ More

    Submitted 16 October, 2019; originally announced October 2019.

  12. arXiv:1808.09744  [pdf, other

    cs.CL cs.LG stat.ML

    Rule induction for global explanation of trained models

    Authors: Madhumita Sushil, Simon Šuster, Walter Daelemans

    Abstract: Understanding the behavior of a trained network and finding explanations for its outputs is important for improving the network's performance and generalization ability, and for ensuring trust in automated systems. Several approaches have previously been proposed to identify and visualize the most important features by analyzing a trained network. However, the relations between different features… ▽ More

    Submitted 29 August, 2018; originally announced August 2018.

    Comments: Accepted at the Workshop on 'Analyzing and interpreting neural networks for NLP' (BlackboxNLP), EMNLP 2018

  13. Patient representation learning and interpretable evaluation using clinical notes

    Authors: Madhumita Sushil, Simon Šuster, Kim Luyckx, Walter Daelemans

    Abstract: We have three contributions in this work: 1. We explore the utility of a stacked denoising autoencoder and a paragraph vector model to learn task-independent dense patient representations directly from clinical notes. To analyze if these representations are transferable across tasks, we evaluate them in multiple supervised setups to predict patient mortality, primary diagnostic and procedural cate… ▽ More

    Submitted 3 July, 2018; originally announced July 2018.

    Comments: Accepted manuscript at Journal of Biomedical Informatics

    Journal ref: Journal of Biomedical Informatics Volume 84C (2018) pp. 103-113

  14. arXiv:1711.05198  [pdf, other

    cs.CL

    Unsupervised patient representations from clinical notes with interpretable classification decisions

    Authors: Madhumita Sushil, Simon Šuster, Kim Luyckx, Walter Daelemans

    Abstract: We have two main contributions in this work: 1. We explore the usage of a stacked denoising autoencoder, and a paragraph vector model to learn task-independent dense patient representations directly from clinical notes. We evaluate these representations by using them as features in multiple supervised setups, and compare their performance with those of sparse representations. 2. To understand and… ▽ More

    Submitted 14 November, 2017; originally announced November 2017.

    Comments: Accepted poster at NIPS 2017 Workshop on Machine Learning for Health (https://ml4health.github.io/2017/)