Showing 1–7 of 7 results for author: Gamble, P

Searching in archive cs.
  1. arXiv:2403.13313  [pdf, other]

    cs.AI cs.CL

    Polaris: A Safety-focused LLM Constellation Architecture for Healthcare

    Authors: Subhabrata Mukherjee, Paul Gamble, Markel Sanz Ausin, Neel Kant, Kriti Aggarwal, Neha Manjunath, Debajyoti Datta, Zhengliang Liu, Jiayuan Ding, Sophia Busacca, Cezanne Bianco, Swapnil Sharma, Rae Lasko, Michelle Voisard, Sanchay Harneja, Darya Filippova, Gerry Meixiong, Kevin Cha, Amir Youssefi, Meyhaa Buvanesh, Howard Weingram, Sebastian Bierman-Lytle, Harpreet Singh Mangat, Kim Parikh, Saad Godil, et al. (1 additional author not shown)

    Abstract: We develop Polaris, the first safety-focused LLM constellation for real-time patient-AI healthcare conversations. Unlike prior LLM works in healthcare focusing on tasks like question answering, our work specifically focuses on long multi-turn voice conversations. Our one-trillion parameter constellation system is composed of several multibillion parameter LLMs as co-operative agents: a stateful pr…

    Submitted 20 March, 2024; originally announced March 2024.

  2. arXiv:2212.13138  [pdf, other]

    cs.CL

    Large Language Models Encode Clinical Knowledge

    Authors: Karan Singhal, Shekoofeh Azizi, Tao Tu, S. Sara Mahdavi, Jason Wei, Hyung Won Chung, Nathan Scales, Ajay Tanwani, Heather Cole-Lewis, Stephen Pfohl, Perry Payne, Martin Seneviratne, Paul Gamble, Chris Kelly, Nathaneal Scharli, Aakanksha Chowdhery, Philip Mansfield, Blaise Aguera y Arcas, Dale Webster, Greg S. Corrado, Yossi Matias, Katherine Chou, Juraj Gottweis, Nenad Tomasev, Yun Liu, et al. (5 additional authors not shown)

    Abstract: Large language models (LLMs) have demonstrated impressive capabilities in natural language understanding and generation, but the quality bar for medical and clinical applications is high. Today, attempts to assess models' clinical knowledge typically rely on automated evaluations on limited benchmarks. There is no standard to evaluate model predictions and reasoning across a breadth of tasks. To a…

    Submitted 26 December, 2022; originally announced December 2022.

  3. arXiv:1911.01888  [pdf, ps, other]

    cs.CR cs.LG eess.AS

    Reducing audio membership inference attack accuracy to chance: 4 defenses

    Authors: Michael Lomnitz, Nina Lopatina, Paul Gamble, Zigfried Hampel-Arias, Lucas Tindall, Felipe A. Mejia, Maria Alejandra Barrios

    Abstract: It is critical to understand the privacy and robustness vulnerabilities of machine learning models, as their implementation expands in scope. In membership inference attacks, adversaries can determine whether a particular set of data was used in training, putting the privacy of the data at risk. Existing work has mostly focused on image related tasks; we generalize this type of attack to speaker i…

    Submitted 31 October, 2019; originally announced November 2019.

    Comments: 7 pages, 2 figures, 7 tables

  4. arXiv:1906.06449  [pdf, other]

    cs.LG stat.ML

    Robust or Private? Adversarial Training Makes Models More Vulnerable to Privacy Attacks

    Authors: Felipe A. Mejia, Paul Gamble, Zigfried Hampel-Arias, Michael Lomnitz, Nina Lopatina, Lucas Tindall, Maria Alejandra Barrios

    Abstract: Adversarial training was introduced as a way to improve the robustness of deep learning models to adversarial attacks. This training method improves robustness against adversarial attacks, but increases the models' vulnerability to privacy attacks. In this work we demonstrate how model inversion attacks, extracting training data directly from the model, previously thought to be intractable become f…

    Submitted 14 June, 2019; originally announced June 2019.

    Comments: 11 pages, 11 figures

  5. arXiv:1804.10669  [pdf, other]

    cs.SD cs.AI eess.AS

    Deep Speech Denoising with Vector Space Projections

    Authors: Jeff Hetherly, Paul Gamble, Maria Barrios, Cory Stephenson, Karl Ni

    Abstract: We propose an algorithm to denoise speakers from a single microphone in the presence of non-stationary and dynamic noise. Our approach is inspired by the recent success of neural network models separating speakers from other speakers and singers from instrumental accompaniment. Unlike prior art, we leverage embedding spaces produced with source-contrastive estimation, a technique derived from nega…

    Submitted 27 April, 2018; originally announced April 2018.

    Comments: arXiv admin note: text overlap with arXiv:1705.04662

  6. arXiv:1804.05053  [pdf, other]

    cs.SD eess.AS

    Voices Obscured in Complex Environmental Settings (VOICES) corpus

    Authors: Colleen Richey, Maria A. Barrios, Zeb Armstrong, Chris Bartels, Horacio Franco, Martin Graciarena, Aaron Lawson, Mahesh Kumar Nandwana, Allen Stauffer, Julien van Hout, Paul Gamble, Jeff Hetherly, Cory Stephenson, Karl Ni

    Abstract: This paper introduces the Voices Obscured In Complex Environmental Settings (VOICES) corpus, a freely available dataset under Creative Commons BY 4.0. This dataset will promote speech and signal processing research of speech recorded by far-field microphones in noisy room conditions. Publicly available speech corpora are mostly composed of isolated speech at close-range microphony. A typical appro…

    Submitted 15 May, 2018; v1 submitted 13 April, 2018; originally announced April 2018.

    Comments: Submitted to Interspeech 2018

  7. arXiv:1803.09565  [pdf]

    q-bio.QM cs.CR cs.DB q-bio.GN

    SIG-DB: leveraging homomorphic encryption to Securely Interrogate privately held Genomic DataBases

    Authors: Alexander J. Titus, Audrey Flower, Patrick Hagerty, Paul Gamble, Charlie Lewis, Todd Stavish, Kevin P. OConnell, Greg Shipley, Stephanie M. Rogers

    Abstract: Genomic data are becoming increasingly valuable as we develop methods to utilize the information at scale and gain a greater understanding of how genetic information relates to biological function. Advances in synthetic biology and the decreased cost of sequencing are increasing the amount of privately held genomic data. As the quantity and value of private genomic data grows, so does the incentiv…

    Submitted 26 March, 2018; originally announced March 2018.

    Comments: 38 pages, 3 figures, 4 tables, 1 supplemental table, 7 supplemental figures

    Report number: PMID: 30180163

    Journal ref: PLoS Computational Biology; 2018 Sep 4; 14(9):e1006454