
Showing 1–10 of 10 results for author: Adila, D

  1. arXiv:2406.03642

    cs.CL cs.LG

    Is Free Self-Alignment Possible?

    Authors: Dyah Adila, Changho Shin, Yijing Zhang, Frederic Sala

    Abstract: Aligning pretrained language models (LMs) is a complex and resource-intensive process, often requiring access to large amounts of ground-truth preference data and substantial compute. Are these costs necessary? That is, is it possible to align using only inherent model knowledge and without additional training? We tackle this challenge with AlignEZ, a novel approach that uses (1) self-generated pr…

    Submitted 5 June, 2024; originally announced June 2024.

  2. arXiv:2406.03631

    cs.LG

    Discovering Bias in Latent Space: An Unsupervised Debiasing Approach

    Authors: Dyah Adila, Shuai Zhang, Boran Han, Yuyang Wang

    Abstract: The question-answering (QA) capabilities of foundation models are highly sensitive to prompt variations, rendering their performance susceptible to superficial, non-meaning-altering changes. This vulnerability often stems from the model's preference or bias towards specific input characteristics, such as option position or superficial image features in multi-modal settings. We propose to rectify t…

    Submitted 5 June, 2024; originally announced June 2024.

    Journal ref: ICML 2024

  3. arXiv:2401.12225

    cs.CV cs.LG

    Multimodal Data Curation via Object Detection and Filter Ensembles

    Authors: Tzu-Heng Huang, Changho Shin, Sui Jiet Tay, Dyah Adila, Frederic Sala

    Abstract: We propose an approach for curating multimodal data that we used for our entry in the 2023 DataComp competition filtering track. Our technique combines object detection and weak supervision-based ensembling. In the first of two steps in our approach, we employ an out-of-the-box zero-shot object detection model to extract granular information and produce a variety of filter designs. In the second s…

    Submitted 5 January, 2024; originally announced January 2024.

    Comments: Appeared in the Towards the Next Generation of Computer Vision Datasets (TNGCV) workshop at ICCV 2023

  4. arXiv:2309.04344

    cs.LG cs.AI

    Zero-Shot Robustification of Zero-Shot Models

    Authors: Dyah Adila, Changho Shin, Linrong Cai, Frederic Sala

    Abstract: Zero-shot inference is a powerful paradigm that enables the use of large pretrained models for downstream classification tasks without further training. However, these models are vulnerable to inherited biases that can impact their performance. The traditional solution is fine-tuning, but this undermines the key advantage of pretrained models, which is their ability to be used out-of-the-box. We p…

    Submitted 12 February, 2024; v1 submitted 8 September, 2023; originally announced September 2023.

    Comments: International Conference on Learning Representations (ICLR), 2024

  5. arXiv:2307.12226

    cs.LG cs.AI stat.ML

    Geometry-Aware Adaptation for Pretrained Models

    Authors: Nicholas Roberts, Xintong Li, Dyah Adila, Sonia Cromp, Tzu-Heng Huang, Jitian Zhao, Frederic Sala

    Abstract: Machine learning models -- including prominent zero-shot models -- are often trained on datasets whose labels are only a small proportion of a larger label space. Such spaces are commonly equipped with a metric that relates the labels via distances between them. We propose a simple approach to exploit this information to adapt the trained model to reliably predict new classes -- or, in the case of…

    Submitted 27 November, 2023; v1 submitted 23 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023

  6. arXiv:2303.17713

    cs.LG cs.CY stat.ML

    Mitigating Source Bias for Fairer Weak Supervision

    Authors: Changho Shin, Sonia Cromp, Dyah Adila, Frederic Sala

    Abstract: Weak supervision enables efficient development of training sets by reducing the need for ground truth labels. However, the techniques that make weak supervision attractive -- such as integrating any source of signal to estimate unknown labels -- also entail the danger that the produced pseudolabels are highly biased. Surprisingly, given everyday use and the potential for increased bias, weak super…

    Submitted 29 November, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: NeurIPS 2023

  7. arXiv:2208.14362

    cs.LG cs.AI cs.CV stat.ML

    AutoWS-Bench-101: Benchmarking Automated Weak Supervision with 100 Labels

    Authors: Nicholas Roberts, Xintong Li, Tzu-Heng Huang, Dyah Adila, Spencer Schoenberg, Cheng-Yu Liu, Lauren Pick, Haotian Ma, Aws Albarghouthi, Frederic Sala

    Abstract: Weak supervision (WS) is a powerful method to build labeled datasets for training supervised models in the face of little-to-no labeled data. It replaces hand-labeling data with aggregating multiple noisy-but-cheap label estimates expressed by labeling functions (LFs). While it has been used successfully in many domains, weak supervision's application scope is limited by the difficulty of construc…

    Submitted 24 November, 2023; v1 submitted 30 August, 2022; originally announced August 2022.

    Comments: NeurIPS 2022 Datasets and Benchmarks Track

  8. arXiv:2203.13270

    stat.ML cs.LG

    Shoring Up the Foundations: Fusing Model Embeddings and Weak Supervision

    Authors: Mayee F. Chen, Daniel Y. Fu, Dyah Adila, Michael Zhang, Frederic Sala, Kayvon Fatahalian, Christopher Ré

    Abstract: Foundation models offer an exciting new paradigm for constructing models with out-of-the-box embeddings and a few labeled examples. However, it is not clear how to best apply foundation models without labeled data. A potential approach is to fuse foundation models with weak supervision frameworks, which use weak label sources -- pre-trained models, heuristics, crowd-workers -- to construct pseudol…

    Submitted 1 August, 2022; v1 submitted 24 March, 2022; originally announced March 2022.

    Comments: UAI 2022 Camera Ready

  9. arXiv:2111.14730

    cs.CL cs.AI cs.LG

    Understanding Out-of-distribution: A Perspective of Data Dynamics

    Authors: Dyah Adila, Dongyeop Kang

    Abstract: Despite machine learning models' success in Natural Language Processing (NLP) tasks, predictions from these models frequently fail on out-of-distribution (OOD) samples. Prior works have focused on developing state-of-the-art methods for detecting OOD. The fundamental question of how OOD samples differ from in-distribution samples remains unanswered. This paper explores how data dynamics in trainin…

    Submitted 29 November, 2021; originally announced November 2021.

  10. arXiv:2106.02118

    eess.IV cs.CV cs.LG

    A Prospective Observational Study to Investigate Performance of a Chest X-ray Artificial Intelligence Diagnostic Support Tool Across 12 U.S. Hospitals

    Authors: Ju Sun, Le Peng, Taihui Li, Dyah Adila, Zach Zaiman, Genevieve B. Melton, Nicholas Ingraham, Eric Murray, Daniel Boley, Sean Switzer, John L. Burns, Kun Huang, Tadashi Allen, Scott D. Steenburg, Judy Wawira Gichoya, Erich Kummerfeld, Christopher Tignanelli

    Abstract: Importance: An artificial intelligence (AI)-based model to predict COVID-19 likelihood from chest x-ray (CXR) findings can serve as an important adjunct to accelerate and improve immediate clinical decision making. Despite significant efforts, many limitations and biases exist in previously developed AI diagnostic models for COVID-19. Utilizing a large set of local and int…

    Submitted 6 June, 2021; v1 submitted 3 June, 2021; originally announced June 2021.

    Comments: Check out the medRxiv version at https://doi.org/10.1101/2021.06.04.21258316 for updates