Showing 1–18 of 18 results for author: Dunnmon, J

  1. arXiv:2211.11176 [pdf, other]

    cs.LG cs.AI eess.SP

    Modeling Multivariate Biosignals With Graph Neural Networks and Structured State Space Models

    Authors: Siyi Tang, Jared A. Dunnmon, Liangqiong Qu, Khaled K. Saab, Tina Baykaner, Christopher Lee-Messer, Daniel L. Rubin

    Abstract: Multivariate biosignals are prevalent in many medical domains, such as electroencephalography, polysomnography, and electrocardiography. Modeling spatiotemporal dependencies in multivariate biosignals is challenging due to (1) long-range temporal dependencies and (2) complex spatial correlations between the electrodes. To address these challenges, we propose representing multivariate biosignals as…

    Submitted 29 April, 2023; v1 submitted 20 November, 2022; originally announced November 2022.

    Comments: Published as a conference paper at CHIL 2023

  2. arXiv:2206.00897 [pdf, other]

    cs.CV cs.CY

    xView3-SAR: Detecting Dark Fishing Activity Using Synthetic Aperture Radar Imagery

    Authors: Fernando Paolo, Tsu-ting Tim Lin, Ritwik Gupta, Bryce Goodman, Nirav Patel, Daniel Kuster, David Kroodsma, Jared Dunnmon

    Abstract: Unsustainable fishing practices worldwide pose a major threat to marine resources and ecosystems. Identifying vessels that do not show up in conventional monitoring systems -- known as "dark vessels" -- is key to managing and securing the health of marine environments. With the rise of satellite-based synthetic aperture radar (SAR) imaging and modern machine learning (ML), it is now possible to…

    Submitted 5 November, 2022; v1 submitted 2 June, 2022; originally announced June 2022.

    Comments: Accepted to NeurIPS 2022. 10 pages (25 with references and supplement)

  3. Multimodal spatiotemporal graph neural networks for improved prediction of 30-day all-cause hospital readmission

    Authors: Siyi Tang, Amara Tariq, Jared Dunnmon, Umesh Sharma, Praneetha Elugunti, Daniel Rubin, Bhavik N. Patel, Imon Banerjee

    Abstract: Measures to predict 30-day readmission are considered an important quality factor for hospitals as accurate predictions can reduce the overall cost of care by identifying high risk patients before they are discharged. While recent deep learning-based studies have shown promising empirical results on readmission prediction, several limitations exist that may hinder widespread clinical utility, such…

    Submitted 14 April, 2022; originally announced April 2022.

    Journal ref: IEEE Journal of Biomedical and Health Informatics, vol. 27, no. 4, pp. 2071-2082, April 2023

  4. arXiv:2203.14960 [pdf, other]

    cs.LG cs.AI

    Domino: Discovering Systematic Errors with Cross-Modal Embeddings

    Authors: Sabri Eyuboglu, Maya Varma, Khaled Saab, Jean-Benoit Delbrouck, Christopher Lee-Messer, Jared Dunnmon, James Zou, Christopher Ré

    Abstract: Machine learning models that achieve high overall accuracy often make systematic errors on important subsets (or slices) of data. Identifying underperforming slices is particularly challenging when working with high-dimensional inputs (e.g. images, audio), where important slices are often unlabeled. In order to address this issue, recent studies have proposed automated slice discovery methods (SDM…

    Submitted 21 May, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

    Comments: ICLR 2022 (Oral)

  5. arXiv:2108.02016 [pdf, other]

    eess.IV cs.CV

    OncoNet: Weakly Supervised Siamese Network to automate cancer treatment response assessment between longitudinal FDG PET/CT examinations

    Authors: Anirudh Joshi, Sabri Eyuboglu, Shih-Cheng Huang, Jared Dunnmon, Arjun Soin, Guido Davidzon, Akshay Chaudhari, Matthew P Lungren

    Abstract: FDG PET/CT imaging is a resource intensive examination critical for managing malignant disease and is particularly important for longitudinal assessment during therapy. Approaches to automate longitudinal analysis present many challenges including lack of available longitudinal datasets, managing complex large multimodal imaging examinations, and need for detailed annotations for traditional superv…

    Submitted 3 August, 2021; originally announced August 2021.

  6. arXiv:2104.08336 [pdf, other]

    eess.SP cs.AI cs.LG

    Self-Supervised Graph Neural Networks for Improved Electroencephalographic Seizure Analysis

    Authors: Siyi Tang, Jared A. Dunnmon, Khaled Saab, Xuan Zhang, Qianying Huang, Florian Dubost, Daniel L. Rubin, Christopher Lee-Messer

    Abstract: Automated seizure detection and classification from electroencephalography (EEG) can greatly improve seizure diagnosis and treatment. However, several modeling challenges remain unaddressed in prior automated seizure detection and classification studies: (1) representing non-Euclidean data structure in EEGs, (2) accurately classifying rare seizure types, and (3) lacking a quantitative interpretabi…

    Submitted 13 March, 2022; v1 submitted 16 April, 2021; originally announced April 2021.

    Comments: Published as a conference paper at ICLR 2022

    Journal ref: ICLR 2022

  7. arXiv:2011.12945 [pdf, other]

    cs.LG cs.CV

    No Subclass Left Behind: Fine-Grained Robustness in Coarse-Grained Classification Problems

    Authors: Nimit S. Sohoni, Jared A. Dunnmon, Geoffrey Angus, Albert Gu, Christopher Ré

    Abstract: In real-world classification tasks, each class often comprises multiple finer-grained "subclasses." As the subclass labels are frequently unavailable, models trained using only the coarser-grained class labels often exhibit highly variable performance across different subclasses. This phenomenon, known as hidden stratification, has important consequences for models deployed in safety-critical appl…

    Submitted 10 April, 2022; v1 submitted 25 November, 2020; originally announced November 2020.

    Comments: 40 pages. Published as a conference paper at NeurIPS 2020

  8. arXiv:2010.08006 [pdf]

    cs.LG cs.CV eess.IV

    Data Valuation for Medical Imaging Using Shapley Value: Application on A Large-scale Chest X-ray Dataset

    Authors: Siyi Tang, Amirata Ghorbani, Rikiya Yamashita, Sameer Rehman, Jared A. Dunnmon, James Zou, Daniel L. Rubin

    Abstract: The reliability of machine learning models can be compromised when trained on low quality data. Many large-scale medical imaging datasets contain low quality labels extracted from sources such as medical reports. Moreover, images within a dataset may have heterogeneous quality due to artifacts and biases arising from equipment or measurement errors. Therefore, algorithms that can automatically ide…

    Submitted 15 October, 2020; originally announced October 2020.

  9. arXiv:2004.05316 [pdf, other]

    cs.LG stat.ML

    Ivy: Instrumental Variable Synthesis for Causal Inference

    Authors: Zhaobin Kuang, Frederic Sala, Nimit Sohoni, Sen Wu, Aldo Córdova-Palomera, Jared Dunnmon, James Priest, Christopher Ré

    Abstract: A popular way to estimate the causal effect of a variable x on y from observational data is to use an instrumental variable (IV): a third variable z that affects y only through x. The more strongly z is associated with x, the more reliable the estimate is, but such strong IVs are difficult to find. Instead, practitioners combine more commonly available IV candidates---which are not necessarily str…

    Submitted 11 April, 2020; originally announced April 2020.

  10. arXiv:2003.07977 [pdf, other]

    eess.IV cs.LG stat.ML

    Assessing Robustness to Noise: Low-Cost Head CT Triage

    Authors: Sarah M. Hooper, Jared A. Dunnmon, Matthew P. Lungren, Sanjiv Sam Gambhir, Christopher Ré, Adam S. Wang, Bhavik N. Patel

    Abstract: Automated medical image classification with convolutional neural networks (CNNs) has great potential to impact healthcare, particularly in resource-constrained healthcare systems where fewer trained radiologists are available. However, little is known about how well a trained CNN can perform on images with the increased noise levels, different acquisition protocols, or additional artifacts that ma…

    Submitted 28 March, 2020; v1 submitted 17 March, 2020; originally announced March 2020.

    Comments: AI for Affordable Healthcare Workshop at ICLR 2020. First two authors have equal contribution; last two authors have equal contribution. Revision made to manuscript header according to workshop guidelines on 3/28/20

  11. arXiv:1909.12475 [pdf, other]

    cs.LG stat.ML

    Hidden Stratification Causes Clinically Meaningful Failures in Machine Learning for Medical Imaging

    Authors: Luke Oakden-Rayner, Jared Dunnmon, Gustavo Carneiro, Christopher Ré

    Abstract: Machine learning models for medical image analysis often suffer from poor performance on important subsets of a population that are not identified during training or testing. For example, overall performance of a cancer detection model may be high, but the model still consistently misses a rare but aggressive cancer subtype. We refer to this problem as hidden stratification, and observe that it re…

    Submitted 15 November, 2019; v1 submitted 26 September, 2019; originally announced September 2019.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2019 - Extended Abstract

  12. arXiv:1903.11101 [pdf, other]

    cs.LG eess.IV stat.ML

    Cross-Modal Data Programming Enables Rapid Medical Machine Learning

    Authors: Jared Dunnmon, Alexander Ratner, Nishith Khandwala, Khaled Saab, Matthew Markert, Hersh Sagreiya, Roger Goldman, Christopher Lee-Messer, Matthew Lungren, Daniel Rubin, Christopher Ré

    Abstract: Labeling training datasets has become a key barrier to building medical machine learning models. One strategy is to generate training labels programmatically, for example by applying natural language processing pipelines to text reports associated with imaging studies. We propose cross-modal data programming, which generalizes this intuitive strategy in a theoretically-grounded way that enables si…

    Submitted 26 March, 2019; originally announced March 2019.

  13. arXiv:1902.07087 [pdf, other]

    cs.CL cs.LG cs.SI

    Predicting US State-Level Agricultural Sentiment as a Measure of Food Security with Tweets from Farming Communities

    Authors: Jared Dunnmon, Swetava Ganguli, Darren Hau, Brooke Husic

    Abstract: The ability to obtain accurate food security metrics in developing areas where relevant data can be sparse is critically important for policy makers tasked with implementing food aid programs. As a result, a great deal of work has been dedicated to predicting important food security metrics such as annual crop yields using a variety of methods including simulation, remote sensing, weather models,…

    Submitted 25 April, 2019; v1 submitted 13 February, 2019; originally announced February 2019.

    Comments: Second revised version corrects typographical errors and adds a few additional references

    Report number: Final report for research project conducted as part of the Sustainability and Artificial Intelligence Laboratory (SAIL) at Stanford University and the Winter 2017 offering of CS 224N Natural Language Processing with Deep Learning

  14. arXiv:1902.05433 [pdf, other]

    cs.CV cs.LG

    Predicting Food Security Outcomes Using Convolutional Neural Networks (CNNs) for Satellite Tasking

    Authors: Swetava Ganguli, Jared Dunnmon, Darren Hau

    Abstract: Obtaining reliable data describing local Food Security Metrics (FSM) at a granularity that is informative to policy-makers requires expensive and logistically difficult surveys, particularly in the developing world. We train a CNN on publicly available satellite data describing land cover classification and use both transfer learning and direct training to build a model for FSM prediction purely f…

    Submitted 25 April, 2019; v1 submitted 12 February, 2019; originally announced February 2019.

    Comments: Research performed as part of the Sustainability and Artificial Intelligence Laboratory (SAIL) at Stanford University. Second revised version corrects typographical errors and adds a few references

    Report number: Prepared as submission for final project of the Fall 2016 offering of CS 221 Artificial Intelligence at Stanford University

  15. arXiv:1810.02840 [pdf, other]

    stat.ML cs.LG

    Training Complex Models with Multi-Task Weak Supervision

    Authors: Alexander Ratner, Braden Hancock, Jared Dunnmon, Frederic Sala, Shreyash Pandey, Christopher Ré

    Abstract: As machine learning models continue to increase in complexity, collecting large hand-labeled training sets has become one of the biggest roadblocks in practice. Instead, weaker forms of supervision that provide noisier but cheaper labels are often used. However, these weak supervision sources have diverse and unknown accuracies, may output correlated labels, and may label different tasks or apply…

    Submitted 7 December, 2018; v1 submitted 5 October, 2018; originally announced October 2018.

  16. arXiv:1709.01643 [pdf, other]

    stat.ML cs.CV cs.LG

    Learning to Compose Domain-Specific Transformations for Data Augmentation

    Authors: Alexander J. Ratner, Henry R. Ehrenberg, Zeshan Hussain, Jared Dunnmon, Christopher Ré

    Abstract: Data augmentation is a ubiquitous technique for increasing the size of labeled training sets by leveraging task-specific data transformations that preserve class labels. While it is often easy for domain experts to specify individual transformations, constructing and tuning the more sophisticated compositions typically needed to achieve state-of-the-art results is a time-consuming manual task in p…

    Submitted 30 September, 2017; v1 submitted 5 September, 2017; originally announced September 2017.

    Comments: To appear at Neural Information Processing Systems (NIPS) 2017

    Journal ref: Advances in Neural Information Processing Systems 30, 2017, 3236--3246

  17. arXiv:1705.06362 [pdf, other]

    cs.CV

    Optimizing and Visualizing Deep Learning for Benign/Malignant Classification in Breast Tumors

    Authors: Darvin Yi, Rebecca Lynn Sawyer, David Cohn III, Jared Dunnmon, Carson Lam, Xuerong Xiao, Daniel Rubin

    Abstract: Breast cancer has the highest incidence and second highest mortality rate for women in the US. Our study aims to utilize deep learning for benign/malignant classification of mammogram tumors using a subset of cases from the Digital Database of Screening Mammography (DDSM). Though it was a small dataset from the view of Deep Learning (about 1000 patients), we show that currently state of the art ar…

    Submitted 17 May, 2017; originally announced May 2017.

  18. arXiv:1705.01142 [pdf, other]

    q-fin.ST cs.CE

    Machine Learning for Better Models for Predicting Bond Prices

    Authors: Swetava Ganguli, Jared Dunnmon

    Abstract: Bond prices are a reflection of extremely complex market interactions and policies, making prediction of future prices difficult. This task becomes even more challenging due to the dearth of relevant information, and accuracy is not the only consideration--in trading situations, time is of the essence. Thus, machine learning in the context of bond price predictions should be both fast and accurate…

    Submitted 31 March, 2017; originally announced May 2017.

    Comments: Submitted for publication