Skip to main content

Showing 1–50 of 53 results for author: Ng, A Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.09798  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    Many-Shot In-Context Learning in Multimodal Foundation Models

    Authors: Yixing Jiang, Jeremy Irvin, Ji Hun Wang, Muhammad Ahmed Chaudhry, Jonathan H. Chen, Andrew Y. Ng

    Abstract: Large language models are well-known to be effective at few-shot in-context learning (ICL). Recent advancements in multimodal foundation models have enabled unprecedentedly long context windows, presenting an opportunity to explore their capability to perform ICL with many more demonstrating examples. In this work, we evaluate the performance of multimodal foundation models scaling from few-shot t… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  2. arXiv:2401.14486  [pdf, other

    cs.CV cs.LG

    CloudTracks: A Dataset for Localizing Ship Tracks in Satellite Images of Clouds

    Authors: Muhammad Ahmed Chaudhry, Lyna Kim, Jeremy Irvin, Yuzu Ido, Sonia Chu, Jared Thomas Isobe, Andrew Y. Ng, Duncan Watson-Parris

    Abstract: Clouds play a significant role in global temperature regulation through their effect on planetary albedo. Anthropogenic emissions of aerosols can alter the albedo of clouds, but the extent of this effect, and its consequent impact on temperature change, remains uncertain. Human-induced clouds caused by ship aerosol emissions, commonly referred to as ship tracks, provide visible manifestations of t… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 11 pages, 5 figures, submitted to Journal of Machine Learning Research

  3. arXiv:2312.02200  [pdf, other

    cs.CV cs.AI stat.AP

    An Empirical Study of Automated Mislabel Detection in Real World Vision Datasets

    Authors: Maya Srikanth, Jeremy Irvin, Brian Wesley Hill, Felipe Godoy, Ishan Sabane, Andrew Y. Ng

    Abstract: Major advancements in computer vision can primarily be attributed to the use of labeled datasets. However, acquiring labels for datasets often results in errors which can harm model performance. Recent works have proposed methods to automatically identify mislabeled images, but developing strategies to effectively implement them in real world datasets has been sparsely explored. Towards improved d… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  4. arXiv:2312.02199  [pdf, other

    cs.CV cs.AI cs.LG eess.IV stat.AP

    USat: A Unified Self-Supervised Encoder for Multi-Sensor Satellite Imagery

    Authors: Jeremy Irvin, Lucas Tao, Joanne Zhou, Yuntao Ma, Langston Nashold, Benjamin Liu, Andrew Y. Ng

    Abstract: Large, self-supervised vision models have led to substantial advancements for automatically interpreting natural images. Recent works have begun tailoring these methods to remote sensing data which has rich structure with multi-sensor, multi-spectral, and temporal information providing massive amounts of self-labeled data that can be used for self-supervised pre-training. In this work, we develop… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  5. arXiv:2311.17449  [pdf, other

    cs.CV

    Weakly-semi-supervised object detection in remotely sensed imagery

    Authors: Ji Hun Wang, Jeremy Irvin, Beri Kohen Behar, Ha Tran, Raghav Samavedam, Quentin Hsu, Andrew Y. Ng

    Abstract: Deep learning for detecting objects in remotely sensed imagery can enable new technologies for important applications including mitigating climate change. However, these models often require large datasets labeled with bounding box annotations which are expensive to curate, prohibiting the development of models for new tasks and geographies. To address this challenge, we develop weakly-semi-superv… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: Tackling Climate Change with Machine Learning at NeurIPS 2023

  6. arXiv:2301.01842  [pdf, other

    cs.CV cs.CY

    Detecting Neighborhood Gentrification at Scale via Street-level Visual Data

    Authors: Tianyuan Huang, Timothy Dai, Zhecheng Wang, Hesu Yoon, Hao Sheng, Andrew Y. Ng, Ram Rajagopal, Jackelyn Hwang

    Abstract: Neighborhood gentrification plays a significant role in shaping the social and economic well-being of both individuals and communities at large. While some efforts have been made to detect gentrification in cities, existing approaches rely mainly on estimated measures from survey data, require substantial work of human labeling, and are limited in characterizing the neighborhood as a whole. We pro… ▽ More

    Submitted 4 January, 2023; originally announced January 2023.

  7. arXiv:2208.13027  [pdf, other

    cs.LG cs.AI

    Improving debris flow evacuation alerts in Taiwan using machine learning

    Authors: Yi-Lin Tsai, Jeremy Irvin, Suhas Chundi, Andrew Y. Ng, Christopher B. Field, Peter K. Kitanidis

    Abstract: Taiwan has the highest susceptibility to and fatalities from debris flows worldwide. The existing debris flow warning system in Taiwan, which uses a time-weighted measure of rainfall, leads to alerts when the measure exceeds a predefined threshold. However, this system generates many false alarms and misses a substantial fraction of the actual debris flows. Towards improving this system, we implem… ▽ More

    Submitted 2 September, 2022; v1 submitted 27 August, 2022; originally announced August 2022.

    Comments: Supplementary information: https://drive.google.com/file/d/1Y17YxXo5rhIbUuZzwLo99pmttbh28v9X/view?usp=sharing

  8. arXiv:2207.11166  [pdf, other

    cs.CV

    METER-ML: A Multi-Sensor Earth Observation Benchmark for Automated Methane Source Mapping

    Authors: Bryan Zhu, Nicholas Lui, Jeremy Irvin, Jimmy Le, Sahil Tadwalkar, Chenghao Wang, Zutao Ouyang, Frankie Y. Liu, Andrew Y. Ng, Robert B. Jackson

    Abstract: Reducing methane emissions is essential for mitigating global warming. To attribute methane emissions to their sources, a comprehensive dataset of methane source infrastructure is necessary. Recent advancements with deep learning on remotely sensed imagery have the potential to identify the locations and characteristics of methane sources, but there is a substantial lack of publicly available data… ▽ More

    Submitted 15 August, 2022; v1 submitted 22 July, 2022; originally announced July 2022.

    Comments: Workshop on Complex Data Challenges in Earth Observation at IJCAI-ECAI 2022

  9. arXiv:2201.01449  [pdf, other

    eess.IV cs.CV cs.LG

    Deep Learning-Based Sparse Whole-Slide Image Analysis for the Diagnosis of Gastric Intestinal Metaplasia

    Authors: Jon Braatz, Pranav Rajpurkar, Stephanie Zhang, Andrew Y. Ng, Jeanne Shen

    Abstract: In recent years, deep learning has successfully been applied to automate a wide variety of tasks in diagnostic histopathology. However, fast and reliable localization of small-scale regions-of-interest (ROI) has remained a key challenge, as discriminative morphologic features often occupy only a small fraction of a gigapixel-scale whole-slide image (WSI). In this paper, we propose a sparse WSI ana… ▽ More

    Submitted 4 January, 2022; originally announced January 2022.

  10. arXiv:2108.01764  [pdf, other

    cs.CL cs.AI

    Q-Pain: A Question Answering Dataset to Measure Social Bias in Pain Management

    Authors: Cécile Logé, Emily Ross, David Yaw Amoah Dadey, Saahil Jain, Adriel Saporta, Andrew Y. Ng, Pranav Rajpurkar

    Abstract: Recent advances in Natural Language Processing (NLP), and specifically automated Question Answering (QA) systems, have demonstrated both impressive linguistic fluency and a pernicious tendency to reflect social biases. In this study, we introduce Q-Pain, a dataset for assessing bias in medical QA in the context of pain management, one of the most challenging forms of clinical decision-making. Alon… ▽ More

    Submitted 3 August, 2021; originally announced August 2021.

    Comments: Accepted to the 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks

  11. arXiv:2106.14463  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    RadGraph: Extracting Clinical Entities and Relations from Radiology Reports

    Authors: Saahil Jain, Ashwin Agrawal, Adriel Saporta, Steven QH Truong, Du Nguyen Duong, Tan Bui, Pierre Chambon, Yuhao Zhang, Matthew P. Lungren, Andrew Y. Ng, Curtis P. Langlotz, Pranav Rajpurkar

    Abstract: Extracting structured clinical information from free-text radiology reports can enable the use of radiology report information for a variety of critical healthcare applications. In our work, we present RadGraph, a dataset of entities and relations in full-text chest X-ray radiology reports based on a novel information extraction schema we designed to structure radiology reports. We release a devel… ▽ More

    Submitted 29 August, 2021; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: Accepted to the 35th Conference on Neural Information Processing Systems (NeurIPS 2021) Track on Datasets and Benchmarks

  12. arXiv:2106.04452  [pdf, other

    physics.med-ph cs.LG eess.SP

    3KG: Contrastive Learning of 12-Lead Electrocardiograms using Physiologically-Inspired Augmentations

    Authors: Bryan Gopal, Ryan W. Han, Gautham Raghupathi, Andrew Y. Ng, Geoffrey H. Tison, Pranav Rajpurkar

    Abstract: We propose 3KG, a physiologically-inspired contrastive learning approach that generates views using 3D augmentations of the 12-lead electrocardiogram. We evaluate representation quality by fine-tuning a linear layer for the downstream task of 23-class diagnosis on the PhysioNet 2020 challenge training data and find that 3KG achieves a $9.1\%$ increase in mean AUC over the best self-supervised base… ▽ More

    Submitted 20 September, 2021; v1 submitted 21 April, 2021; originally announced June 2021.

    Comments: 11 pages, 3 figures, paper revision with new set of experiments and comparison to previous methods

  13. arXiv:2105.02489  [pdf, other

    cs.LG cs.CV

    Learning Neighborhood Representation from Multi-Modal Multi-Graph: Image, Text, Mobility Graph and Beyond

    Authors: Tianyuan Huang, Zhecheng Wang, Hao Sheng, Andrew Y. Ng, Ram Rajagopal

    Abstract: Recent urbanization has coincided with the enrichment of geotagged data, such as street view and point-of-interest (POI). Region embedding enhanced by the richer data modalities has enabled researchers and city administrators to understand the built environment, socioeconomics, and the dynamics of cities better. While some efforts have been made to simultaneously use multi-modal inputs, existing m… ▽ More

    Submitted 6 May, 2021; originally announced May 2021.

  14. arXiv:2104.00793  [pdf, ps, other

    eess.IV cs.CV cs.LG

    Effect of Radiology Report Labeler Quality on Deep Learning Models for Chest X-Ray Interpretation

    Authors: Saahil Jain, Akshay Smit, Andrew Y. Ng, Pranav Rajpurkar

    Abstract: Although deep learning models for chest X-ray interpretation are commonly trained on labels generated by automatic radiology report labelers, the impact of improvements in report labeling on the performance of chest X-ray classification models has not been systematically investigated. We first compare the CheXpert, CheXbert, and VisualCheXbert labelers on the task of extracting accurate chest X-ra… ▽ More

    Submitted 27 November, 2021; v1 submitted 1 April, 2021; originally announced April 2021.

    Comments: In Neural Information Processing Systems (NeurIPS) Workshop on Data-Centric AI (DCAI)

  15. arXiv:2103.14339  [pdf, other

    cs.CV cs.AI cs.LG

    MedSelect: Selective Labeling for Medical Image Classification Combining Meta-Learning with Deep Reinforcement Learning

    Authors: Akshay Smit, Damir Vrabac, Yujie He, Andrew Y. Ng, Andrew L. Beam, Pranav Rajpurkar

    Abstract: We propose a selective learning method using meta-learning and deep reinforcement learning for medical image interpretation in the setting of limited labeling resources. Our method, MedSelect, consists of a trainable deep learning selector that uses image embeddings obtained from contrastive pretraining for determining which images to label, and a non-parametric selector that uses cosine similarit… ▽ More

    Submitted 26 March, 2021; originally announced March 2021.

  16. arXiv:2103.09957  [pdf, other

    cs.CV cs.AI cs.LG

    CheXbreak: Misclassification Identification for Deep Learning Models Interpreting Chest X-rays

    Authors: Emma Chen, Andy Kim, Rayan Krishnan, Jin Long, Andrew Y. Ng, Pranav Rajpurkar

    Abstract: A major obstacle to the integration of deep learning models for chest x-ray interpretation into clinical settings is the lack of understanding of their failure modes. In this work, we first investigate whether there are patient subgroups that chest x-ray models are likely to misclassify. We find that patient age and the radiographic finding of lung lesion, pneumothorax or support devices are stati… ▽ More

    Submitted 20 July, 2021; v1 submitted 17 March, 2021; originally announced March 2021.

    Comments: In Proceedings of the 2021 Conference on Machine Learning for Health Care, 2021. In ACM Conference on Health, Inference, and Learning (ACM-CHIL) Workshop 2021

  17. arXiv:2103.04590  [pdf, other

    cs.CV cs.AI cs.LG

    CheXseen: Unseen Disease Detection for Deep Learning Interpretation of Chest X-rays

    Authors: Siyu Shi, Ishaan Malhi, Kevin Tran, Andrew Y. Ng, Pranav Rajpurkar

    Abstract: We systematically evaluate the performance of deep learning models in the presence of diseases not labeled for or present during training. First, we evaluate whether deep learning models trained on a subset of diseases (seen diseases) can detect the presence of any one of a larger set of diseases. We find that models tend to falsely classify diseases outside of the subset (unseen diseases) as "no… ▽ More

    Submitted 17 May, 2021; v1 submitted 8 March, 2021; originally announced March 2021.

    Comments: Accepted at MIDL Conference 2021. Previous version accepted at ACM Conference on Health, Inference, and Learning (ACM-CHIL) Workshop 2021

  18. arXiv:2102.11467  [pdf, other

    eess.IV cs.CV cs.LG

    VisualCheXbert: Addressing the Discrepancy Between Radiology Report Labels and Image Labels

    Authors: Saahil Jain, Akshay Smit, Steven QH Truong, Chanh DT Nguyen, Minh-Thanh Huynh, Mudit Jain, Victoria A. Young, Andrew Y. Ng, Matthew P. Lungren, Pranav Rajpurkar

    Abstract: Automatic extraction of medical conditions from free-text radiology reports is critical for supervising computer vision models to interpret medical images. In this work, we show that radiologists labeling reports significantly disagree with radiologists labeling corresponding chest X-ray images, which reduces the quality of report labels as proxies for image labels. We develop and evaluate methods… ▽ More

    Submitted 15 March, 2021; v1 submitted 22 February, 2021; originally announced February 2021.

    Comments: Accepted to ACM Conference on Health, Inference, and Learning (ACM-CHIL) 2021

  19. arXiv:2102.10663  [pdf, other

    eess.IV cs.CV cs.LG

    MedAug: Contrastive learning leveraging patient metadata improves representations for chest X-ray interpretation

    Authors: Yen Nhi Truong Vu, Richard Wang, Niranjan Balachandar, Can Liu, Andrew Y. Ng, Pranav Rajpurkar

    Abstract: Self-supervised contrastive learning between pairs of multiple views of the same image has been shown to successfully leverage unlabeled data to produce meaningful visual representations for both natural and medical images. However, there has been limited work on determining how to select pairs for medical images, where availability of patient metadata can be leveraged to improve representations.… ▽ More

    Submitted 17 October, 2021; v1 submitted 21 February, 2021; originally announced February 2021.

  20. arXiv:2102.10484  [pdf, other

    cs.CV cs.AI cs.LG

    CheXseg: Combining Expert Annotations with DNN-generated Saliency Maps for X-ray Segmentation

    Authors: Soham Gadgil, Mark Endo, Emily Wen, Andrew Y. Ng, Pranav Rajpurkar

    Abstract: Medical image segmentation models are typically supervised by expert annotations at the pixel-level, which can be expensive to acquire. In this work, we propose a method that combines the high quality of pixel-level expert annotations with the scale of coarse DNN-generated saliency maps for training multi-label semantic segmentation models. We demonstrate the application of our semi-supervised met… ▽ More

    Submitted 17 May, 2021; v1 submitted 20 February, 2021; originally announced February 2021.

    Comments: Accepted to Medical Imaging with Deep Learning (MIDL) Conference 2021

  21. arXiv:2102.08660  [pdf, other

    eess.IV cs.CV cs.LG

    CheXternal: Generalization of Deep Learning Models for Chest X-ray Interpretation to Photos of Chest X-rays and External Clinical Settings

    Authors: Pranav Rajpurkar, Anirudh Joshi, Anuj Pareek, Andrew Y. Ng, Matthew P. Lungren

    Abstract: Recent advances in training deep learning models have demonstrated the potential to provide accurate chest X-ray interpretation and increase access to radiology expertise. However, poor generalization due to data distribution shifts in clinical settings is a key barrier to implementation. In this study, we measured the diagnostic performance for 8 different chest X-ray models when applied to (1) s… ▽ More

    Submitted 20 February, 2021; v1 submitted 17 February, 2021; originally announced February 2021.

    Comments: Accepted to ACM Conference on Health, Inference, and Learning (ACM-CHIL) 2021. arXiv admin note: substantial text overlap with arXiv:2011.06129

  22. arXiv:2101.06871  [pdf, other

    cs.CV cs.AI cs.LG

    CheXtransfer: Performance and Parameter Efficiency of ImageNet Models for Chest X-Ray Interpretation

    Authors: Alexander Ke, William Ellsworth, Oishi Banerjee, Andrew Y. Ng, Pranav Rajpurkar

    Abstract: Deep learning methods for chest X-ray interpretation typically rely on pretrained models developed for ImageNet. This paradigm assumes that better ImageNet architectures perform better on chest X-ray tasks and that ImageNet-pretrained weights provide a performance boost over random initialization. In this work, we compare the transfer performance and parameter efficiency of 16 popular convolutiona… ▽ More

    Submitted 20 February, 2021; v1 submitted 17 January, 2021; originally announced January 2021.

  23. arXiv:2011.07227  [pdf, other

    cs.CV cs.AI cs.LG

    OGNet: Towards a Global Oil and Gas Infrastructure Database using Deep Learning on Remotely Sensed Imagery

    Authors: Hao Sheng, Jeremy Irvin, Sasankh Munukutla, Shawn Zhang, Christopher Cross, Kyle Story, Rose Rustowicz, Cooper Elsworth, Zutao Yang, Mark Omara, Ritesh Gautam, Robert B. Jackson, Andrew Y. Ng

    Abstract: At least a quarter of the warming that the Earth is experiencing today is due to anthropogenic methane emissions. There are multiple satellites in orbit and planned for launch in the next few years which can detect and quantify these emissions; however, to attribute methane emissions to their sources on the ground, a comprehensive database of the locations and characteristics of emission sources w… ▽ More

    Submitted 14 November, 2020; originally announced November 2020.

    Comments: Tackling Climate Change with Machine Learning at NeurIPS 2020 (Spotlight talk)

  24. arXiv:2011.06129  [pdf, other

    eess.IV cs.CV cs.LG

    CheXphotogenic: Generalization of Deep Learning Models for Chest X-ray Interpretation to Photos of Chest X-rays

    Authors: Pranav Rajpurkar, Anirudh Joshi, Anuj Pareek, Jeremy Irvin, Andrew Y. Ng, Matthew Lungren

    Abstract: The use of smartphones to take photographs of chest x-rays represents an appealing solution for scaled deployment of deep learning models for chest x-ray interpretation. However, the performance of chest x-ray algorithms on photos of chest x-rays has not been thoroughly investigated. In this study, we measured the diagnostic performance for 8 different chest x-ray models when applied to photos of… ▽ More

    Submitted 11 November, 2020; originally announced November 2020.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2020 - Extended Abstract

  25. arXiv:2011.05479  [pdf, other

    cs.CV cs.LG eess.IV

    ForestNet: Classifying Drivers of Deforestation in Indonesia using Deep Learning on Satellite Imagery

    Authors: Jeremy Irvin, Hao Sheng, Neel Ramachandran, Sonja Johnson-Yu, Sharon Zhou, Kyle Story, Rose Rustowicz, Cooper Elsworth, Kemen Austin, Andrew Y. Ng

    Abstract: Characterizing the processes leading to deforestation is critical to the development and implementation of targeted forest conservation and management policies. In this work, we develop a deep learning model called ForestNet to classify the drivers of primary forest loss in Indonesia, a country with one of the highest deforestation rates in the world. Using satellite imagery, ForestNet identifies… ▽ More

    Submitted 10 November, 2020; originally announced November 2020.

    Comments: Tackling Climate Change with Machine Learning at NeurIPS 2020

  26. arXiv:2010.15269  [pdf, other

    eess.IV cs.CV cs.LG

    GloFlow: Global Image Alignment for Creation of Whole Slide Images for Pathology from Video

    Authors: Viswesh Krishna, Anirudh Joshi, Philip L. Bulterys, Eric Yang, Andrew Y. Ng, Pranav Rajpurkar

    Abstract: The application of deep learning to pathology assumes the existence of digital whole slide images of pathology slides. However, slide digitization is bottlenecked by the high cost of precise motor stages in slide scanners that are needed for position information used for slide stitching. We propose GloFlow, a two-stage method for creating a whole slide image using optical flow-based image registra… ▽ More

    Submitted 12 November, 2020; v1 submitted 28 October, 2020; originally announced October 2020.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2020 - Extended Abstract

  27. arXiv:2010.05352  [pdf, other

    cs.CV cs.AI cs.LG

    MoCo-CXR: MoCo Pretraining Improves Representation and Transferability of Chest X-ray Models

    Authors: Hari Sowrirajan, Jingbo Yang, Andrew Y. Ng, Pranav Rajpurkar

    Abstract: Contrastive learning is a form of self-supervision that can leverage unlabeled data to produce pretrained models. While contrastive learning has demonstrated promising results on natural image classification tasks, its application to medical imaging tasks like chest X-ray interpretation has been limited. In this work, we propose MoCo-CXR, which is an adaptation of the contrastive learning method M… ▽ More

    Submitted 17 May, 2021; v1 submitted 11 October, 2020; originally announced October 2020.

    Comments: Accepted at Medical Imaging with Deep Learning (MIDL) Conference 2021

  28. arXiv:2010.04715  [pdf, other

    cs.LG stat.AP stat.ML

    Short-Term Solar Irradiance Forecasting Using Calibrated Probabilistic Models

    Authors: Eric Zelikman, Sharon Zhou, Jeremy Irvin, Cooper Raterink, Hao Sheng, Anand Avati, Jack Kelly, Ram Rajagopal, Andrew Y. Ng, David Gagne

    Abstract: Advancing probabilistic solar forecasting methods is essential to supporting the integration of solar energy into the electricity grid. In this work, we develop a variety of state-of-the-art probabilistic models for forecasting solar irradiance. We investigate the use of post-hoc calibration techniques for ensuring well-calibrated probabilistic predictions. We train and evaluate the models using p… ▽ More

    Submitted 14 October, 2020; v1 submitted 9 October, 2020; originally announced October 2020.

  29. arXiv:2009.08123  [pdf, other

    cs.CV cs.AI cs.LG

    DLBCL-Morph: Morphological features computed using deep learning for an annotated digital DLBCL image set

    Authors: Damir Vrabac, Akshay Smit, Rebecca Rojansky, Yasodha Natkunam, Ranjana H. Advani, Andrew Y. Ng, Sebastian Fernandez-Pol, Pranav Rajpurkar

    Abstract: Diffuse Large B-Cell Lymphoma (DLBCL) is the most common non-Hodgkin lymphoma. Though histologically DLBCL shows varying morphologies, no morphologic features have been consistently demonstrated to correlate with prognosis. We present a morphologic analysis of histology sections from 209 DLBCL cases with associated clinical and cytogenetic data. Duplicate tissue core sections were arranged in tiss… ▽ More

    Submitted 24 September, 2020; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: Corrections to folder structure figure

  30. arXiv:2007.06199  [pdf, other

    eess.IV cs.CV cs.LG

    CheXphoto: 10,000+ Photos and Transformations of Chest X-rays for Benchmarking Deep Learning Robustness

    Authors: Nick A. Phillips, Pranav Rajpurkar, Mark Sabini, Rayan Krishnan, Sharon Zhou, Anuj Pareek, Nguyet Minh Phu, Chris Wang, Mudit Jain, Nguyen Duong Du, Steven QH Truong, Andrew Y. Ng, Matthew P. Lungren

    Abstract: Clinical deployment of deep learning algorithms for chest x-ray interpretation requires a solution that can integrate into the vast spectrum of clinical workflows across the world. An appealing approach to scaled deployment is to leverage the ubiquity of smartphones by capturing photos of x-rays to share with clinicians using messaging services like WhatsApp. However, the application of chest x-ra… ▽ More

    Submitted 11 December, 2020; v1 submitted 13 July, 2020; originally announced July 2020.

  31. arXiv:2006.03680  [pdf, other

    stat.ML cs.CV cs.LG

    Evaluating the Disentanglement of Deep Generative Models through Manifold Topology

    Authors: Sharon Zhou, Eric Zelikman, Fred Lu, Andrew Y. Ng, Gunnar Carlsson, Stefano Ermon

    Abstract: Learning disentangled representations is regarded as a fundamental task for improving the generalization, robustness, and interpretability of generative models. However, measuring disentanglement has been challenging and inconsistent, often dependent on an ad-hoc external model or specific to a certain dataset. To address this, we present a method for quantifying disentanglement that only uses the… ▽ More

    Submitted 17 March, 2021; v1 submitted 5 June, 2020; originally announced June 2020.

    Comments: Published at ICLR 2021

  32. arXiv:2004.09167  [pdf, other

    cs.CL cs.IR cs.LG

    CheXbert: Combining Automatic Labelers and Expert Annotations for Accurate Radiology Report Labeling Using BERT

    Authors: Akshay Smit, Saahil Jain, Pranav Rajpurkar, Anuj Pareek, Andrew Y. Ng, Matthew P. Lungren

    Abstract: The extraction of labels from radiology text reports enables large-scale training of medical imaging models. Existing approaches to report labeling typically rely either on sophisticated feature engineering based on medical domain knowledge or manual annotations by experts. In this work, we introduce a BERT-based approach to medical image report labeling that exploits both the scale of available r… ▽ More

    Submitted 18 October, 2020; v1 submitted 20 April, 2020; originally announced April 2020.

    Comments: Accepted to EMNLP 2020

  33. arXiv:2002.11379  [pdf, other

    eess.IV cs.CV cs.LG

    CheXpedition: Investigating Generalization Challenges for Translation of Chest X-Ray Algorithms to the Clinical Setting

    Authors: Pranav Rajpurkar, Anirudh Joshi, Anuj Pareek, Phil Chen, Amirhossein Kiani, Jeremy Irvin, Andrew Y. Ng, Matthew P. Lungren

    Abstract: Although there have been several recent advances in the application of deep learning algorithms to chest x-ray interpretation, we identify three major challenges for the translation of chest x-ray algorithms to the clinical setting. We examine the performance of the top 10 performing models on the CheXpert challenge leaderboard on three tasks: (1) TB detection, (2) pathology detection on photos of… ▽ More

    Submitted 11 March, 2020; v1 submitted 26 February, 2020; originally announced February 2020.

    Comments: Accepted as workshop paper at ACM Conference on Health, Inference, and Learning (CHIL) 2020

  34. arXiv:2002.02917  [pdf, other

    cs.CV cs.LG

    Data augmentation with Mobius transformations

    Authors: Sharon Zhou, Jiequan Zhang, Hang Jiang, Torbjorn Lundh, Andrew Y. Ng

    Abstract: Data augmentation has led to substantial improvements in the performance and generalization of deep models, and remain a highly adaptable method to evolving model architectures and varying amounts of data---in particular, extremely scarce amounts of available training data. In this paper, we present a novel method of applying Mobius transformations to augment input images during training. Mobius t… ▽ More

    Submitted 7 June, 2020; v1 submitted 7 February, 2020; originally announced February 2020.

  35. arXiv:1910.03225  [pdf, other

    cs.LG stat.ML

    NGBoost: Natural Gradient Boosting for Probabilistic Prediction

    Authors: Tony Duan, Anand Avati, Daisy Yi Ding, Khanh K. Thai, Sanjay Basu, Andrew Y. Ng, Alejandro Schuler

    Abstract: We present Natural Gradient Boosting (NGBoost), an algorithm for generic probabilistic prediction via gradient boosting. Typical regression models return a point estimate, conditional on covariates, but probabilistic regression models output a full probability distribution over the outcome space, conditional on the covariates. This allows for predictive uncertainty estimation -- crucial in applica… ▽ More

    Submitted 9 June, 2020; v1 submitted 8 October, 2019; originally announced October 2019.

    Comments: Accepted for ICML 2020

  36. arXiv:1906.05433  [pdf, other

    cs.CY cs.AI cs.LG stat.ML

    Tackling Climate Change with Machine Learning

    Authors: David Rolnick, Priya L. Donti, Lynn H. Kaack, Kelly Kochanski, Alexandre Lacoste, Kris Sankaran, Andrew Slavin Ross, Nikola Milojevic-Dupont, Natasha Jaques, Anna Waldman-Brown, Alexandra Luccioni, Tegan Maharaj, Evan D. Sherwin, S. Karthik Mukkavilli, Konrad P. Kording, Carla Gomes, Andrew Y. Ng, Demis Hassabis, John C. Platt, Felix Creutzig, Jennifer Chayes, Yoshua Bengio

    Abstract: Climate change is one of the greatest challenges facing humanity, and we, as machine learning experts, may wonder how we can help. Here we describe how machine learning can be a powerful tool in reducing greenhouse gas emissions and helping society adapt to a changing climate. From smart grids to disaster management, we identify high impact problems where existing gaps can be filled by machine lea… ▽ More

    Submitted 5 November, 2019; v1 submitted 10 June, 2019; originally announced June 2019.

    Comments: For additional resources, please visit the website that accompanies this paper: https://www.climatechange.ai/

  37. arXiv:1901.07031  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    CheXpert: A Large Chest Radiograph Dataset with Uncertainty Labels and Expert Comparison

    Authors: Jeremy Irvin, Pranav Rajpurkar, Michael Ko, Yifan Yu, Silviana Ciurea-Ilcus, Chris Chute, Henrik Marklund, Behzad Haghgoo, Robyn Ball, Katie Shpanskaya, Jayne Seekins, David A. Mong, Safwan S. Halabi, Jesse K. Sandberg, Ricky Jones, David B. Larson, Curtis P. Langlotz, Bhavik N. Patel, Matthew P. Lungren, Andrew Y. Ng

    Abstract: Large, labeled datasets have driven deep learning methods to achieve expert-level performance on a variety of medical imaging tasks. We present CheXpert, a large dataset that contains 224,316 chest radiographs of 65,240 patients. We design a labeler to automatically detect the presence of 14 observations in radiology reports, capturing uncertainties inherent in radiograph interpretation. We invest… ▽ More

    Submitted 21 January, 2019; originally announced January 2019.

    Comments: Published in AAAI 2019

  38. arXiv:1712.06957  [pdf, other

    physics.med-ph cs.AI

    MURA: Large Dataset for Abnormality Detection in Musculoskeletal Radiographs

    Authors: Pranav Rajpurkar, Jeremy Irvin, Aarti Bagul, Daisy Ding, Tony Duan, Hershel Mehta, Brandon Yang, Kaylie Zhu, Dillon Laird, Robyn L. Ball, Curtis Langlotz, Katie Shpanskaya, Matthew P. Lungren, Andrew Y. Ng

    Abstract: We introduce MURA, a large dataset of musculoskeletal radiographs containing 40,561 images from 14,863 studies, where each study is manually labeled by radiologists as either normal or abnormal. To evaluate models robustly and to get an estimate of radiologist performance, we collect additional labels from six board-certified Stanford radiologists on the test set, consisting of 207 musculoskeletal… ▽ More

    Submitted 22 May, 2018; v1 submitted 11 December, 2017; originally announced December 2017.

    Comments: 1st Conference on Medical Imaging with Deep Learning (MIDL 2018)

  39. arXiv:1711.05225  [pdf, other

    cs.CV cs.LG stat.ML

    CheXNet: Radiologist-Level Pneumonia Detection on Chest X-Rays with Deep Learning

    Authors: Pranav Rajpurkar, Jeremy Irvin, Kaylie Zhu, Brandon Yang, Hershel Mehta, Tony Duan, Daisy Ding, Aarti Bagul, Curtis Langlotz, Katie Shpanskaya, Matthew P. Lungren, Andrew Y. Ng

    Abstract: We develop an algorithm that can detect pneumonia from chest X-rays at a level exceeding practicing radiologists. Our algorithm, CheXNet, is a 121-layer convolutional neural network trained on ChestX-ray14, currently the largest publicly available chest X-ray dataset, containing over 100,000 frontal-view X-ray images with 14 diseases. Four practicing academic radiologists annotate a test set, on w… ▽ More

    Submitted 25 December, 2017; v1 submitted 14 November, 2017; originally announced November 2017.

  40. arXiv:1707.01836  [pdf, other

    cs.CV

    Cardiologist-Level Arrhythmia Detection with Convolutional Neural Networks

    Authors: Pranav Rajpurkar, Awni Y. Hannun, Masoumeh Haghpanahi, Codie Bourn, Andrew Y. Ng

    Abstract: We develop an algorithm which exceeds the performance of board certified cardiologists in detecting a wide range of heart arrhythmias from electrocardiograms recorded with a single-lead wearable monitor. We build a dataset with more than 500 times the number of unique patients than previously studied corpora. On this dataset, we train a 34-layer convolutional neural network which maps a sequence o… ▽ More

    Submitted 6 July, 2017; originally announced July 2017.

  41. arXiv:1703.02573  [pdf, other

    cs.LG cs.CL

    Data Noising as Smoothing in Neural Network Language Models

    Authors: Ziang Xie, Sida I. Wang, Jiwei Li, Daniel Lévy, Aiming Nie, Dan Jurafsky, Andrew Y. Ng

    Abstract: Data noising is an effective technique for regularizing neural network models. While noising is widely adopted in application domains such as vision and speech, commonly used noising primitives have not been developed for discrete sequence-level settings such as language modeling. In this paper, we derive a connection between input noising in neural network language models and smoothing in $n$-gra… ▽ More

    Submitted 7 March, 2017; originally announced March 2017.

    Comments: ICLR 2017

  42. arXiv:1603.09727  [pdf, other

    cs.CL cs.AI

    Neural Language Correction with Character-Based Attention

    Authors: Ziang Xie, Anand Avati, Naveen Arivazhagan, Dan Jurafsky, Andrew Y. Ng

    Abstract: Natural language correction has the potential to help language learners improve their writing skills. While approaches with separate classifiers for different error types have high precision, they do not flexibly handle errors such as redundancy or non-idiomatic phrasing. On the other hand, word and phrase-based machine translation methods are not designed to cope with orthographic errors, and hav… ▽ More

    Submitted 31 March, 2016; originally announced March 2016.

    Comments: 10 pages

  43. arXiv:1504.01716  [pdf, other

    cs.RO cs.CV

    An Empirical Evaluation of Deep Learning on Highway Driving

    Authors: Brody Huval, Tao Wang, Sameep Tandon, Jeff Kiske, Will Song, Joel Pazhayampallil, Mykhaylo Andriluka, Pranav Rajpurkar, Toki Migimatsu, Royce Cheng-Yue, Fernando Mujica, Adam Coates, Andrew Y. Ng

    Abstract: Numerous groups have applied a variety of deep learning techniques to computer vision problems in highway perception scenarios. In this paper, we presented a number of empirical evaluations of recent deep learning advances. Computer vision, combined with deep learning, has the potential to bring about a relatively inexpensive, robust solution to autonomous driving. To prepare deep learning for ind… ▽ More

    Submitted 16 April, 2015; v1 submitted 7 April, 2015; originally announced April 2015.

    Comments: Added a video for lane detection

  44. arXiv:1412.5567  [pdf, other

    cs.CL cs.LG cs.NE

    Deep Speech: Scaling up end-to-end speech recognition

    Authors: Awni Hannun, Carl Case, Jared Casper, Bryan Catanzaro, Greg Diamos, Erich Elsen, Ryan Prenger, Sanjeev Satheesh, Shubho Sengupta, Adam Coates, Andrew Y. Ng

    Abstract: We present a state-of-the-art speech recognition system developed using end-to-end deep learning. Our architecture is significantly simpler than traditional speech systems, which rely on laboriously engineered processing pipelines; these traditional systems also tend to perform poorly when used in noisy environments. In contrast, our system does not need hand-designed components to model backgroun… ▽ More

    Submitted 19 December, 2014; v1 submitted 17 December, 2014; originally announced December 2014.

  45. arXiv:1408.2873  [pdf, ps, other

    cs.CL cs.LG cs.NE

    First-Pass Large Vocabulary Continuous Speech Recognition using Bi-Directional Recurrent DNNs

    Authors: Awni Y. Hannun, Andrew L. Maas, Daniel Jurafsky, Andrew Y. Ng

    Abstract: We present a method to perform first-pass large vocabulary continuous speech recognition using only a neural network and language model. Deep neural network acoustic models are now commonplace in HMM-based speech recognition systems, but building such systems is a complex, domain-specific task. Recent work demonstrated the feasibility of discarding the HMM sequence modeling framework by directly p… ▽ More

    Submitted 8 December, 2014; v1 submitted 12 August, 2014; originally announced August 2014.

  46. arXiv:1406.7806  [pdf, other

    cs.CL cs.LG cs.NE stat.ML

    Building DNN Acoustic Models for Large Vocabulary Speech Recognition

    Authors: Andrew L. Maas, Peng Qi, Ziang Xie, Awni Y. Hannun, Christopher T. Lengerich, Daniel Jurafsky, Andrew Y. Ng

    Abstract: Deep neural networks (DNNs) are now a central component of nearly all state-of-the-art speech recognition systems. Building neural network acoustic models requires several design decisions including network architecture, size, and training loss function. This paper offers an empirical investigation on which aspects of DNN acoustic model design are most important for speech recognition system perfo… ▽ More

    Submitted 20 January, 2015; v1 submitted 30 June, 2014; originally announced June 2014.

  47. arXiv:1302.1552  [pdf

    cs.LG stat.ML

    An Information-Theoretic Analysis of Hard and Soft Assignment Methods for Clustering

    Authors: Michael Kearns, Yishay Mansour, Andrew Y. Ng

    Abstract: Assignment methods are at the heart of many algorithms for unsupervised learning and clustering - in particular, the well-known K-means and Expectation-Maximization (EM) algorithms. In this work, we study several different methods of assignment, including the "hard" assignments used by K-means and the ?soft' assignments used by EM. While it is known that K-means minimizes the distortion on the da… ▽ More

    Submitted 6 February, 2013; originally announced February 2013.

    Comments: Appears in Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence (UAI1997)

    Report number: UAI-P-1997-PG-282-293

  48. arXiv:1301.3878  [pdf

    cs.AI cs.LG

    PEGASUS: A Policy Search Method for Large MDPs and POMDPs

    Authors: Andrew Y. Ng, Michael I. Jordan

    Abstract: We propose a new approach to the problem of searching a space of policies for a Markov decision process (MDP) or a partially observable Markov decision process (POMDP), given a model. Our approach is based on the following observation: Any (PO)MDP can be transformed into an "equivalent" POMDP in which all state transitions (given the current state and action) are deterministic. This reduces the… ▽ More

    Submitted 16 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Sixteenth Conference on Uncertainty in Artificial Intelligence (UAI2000)

    Report number: UAI-P-2000-PG-406-415

  49. arXiv:1301.3666  [pdf, other

    cs.CV cs.LG

    Zero-Shot Learning Through Cross-Modal Transfer

    Authors: Richard Socher, Milind Ganjoo, Hamsa Sridhar, Osbert Bastani, Christopher D. Manning, Andrew Y. Ng

    Abstract: This work introduces a model that can recognize objects in images even if no training data is available for the objects. The only necessary knowledge about the unseen categories comes from unsupervised large text corpora. In our zero-shot framework distributional information in language can be seen as spanning a semantic basis for understanding what objects look like. Most previous zero-shot learn… ▽ More

    Submitted 19 March, 2013; v1 submitted 16 January, 2013; originally announced January 2013.

  50. arXiv:1301.3618  [pdf, ps, other

    cs.CL cs.LG

    Learning New Facts From Knowledge Bases With Neural Tensor Networks and Semantic Word Vectors

    Authors: Danqi Chen, Richard Socher, Christopher D. Manning, Andrew Y. Ng

    Abstract: Knowledge bases provide applications with the benefit of easily accessible, systematic relational knowledge but often suffer in practice from their incompleteness and lack of knowledge of new entities and relations. Much work has focused on building or extending them by finding patterns in large unannotated text corpora. In contrast, here we mainly aim to complete a knowledge base by predicting ad… ▽ More

    Submitted 15 March, 2013; v1 submitted 16 January, 2013; originally announced January 2013.