Showing 1–9 of 9 results for author: Katariya, N

  1. arXiv:2111.09381  [pdf, other]

    cs.CL cs.AI cs.LG

    MEDCOD: A Medically-Accurate, Emotive, Diverse, and Controllable Dialog System

    Authors: Rhys Compton, Ilya Valmianski, Li Deng, Costa Huang, Namit Katariya, Xavier Amatriain, Anitha Kannan

    Abstract: We present MEDCOD, a Medically-Accurate, Emotive, Diverse, and Controllable Dialog system with a unique approach to the natural language generator module. MEDCOD has been developed and evaluated specifically for the history taking task. It integrates the advantage of a traditional modular approach to incorporate (medical) domain knowledge with modern deep learning techniques to generate flexible,…

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: 9 pages. Accepted at Machine Learning for Health (ML4H) 2021

  2. arXiv:2111.07564  [pdf, other]

    cs.CL cs.AI cs.LG

    Adding more data does not always help: A study in medical conversation summarization with PEGASUS

    Authors: Varun Nair, Namit Katariya, Xavier Amatriain, Ilya Valmianski, Anitha Kannan

    Abstract: Medical conversation summarization is integral in capturing information gathered during interactions between patients and physicians. Summarized conversations are used to facilitate patient hand-offs between physicians, and as part of providing care in the future. Summaries, however, can be time-consuming to produce and require domain expertise. Modern pre-trained NLP models such as PEGASUS have e…

    Submitted 28 November, 2021; v1 submitted 15 November, 2021; originally announced November 2021.

    Comments: Accepted to Machine Learning for Healthcare Workshop, NeurIPS 2021

  3. arXiv:2110.07356  [pdf, other]

    cs.CL cs.AI cs.LG

    Medically Aware GPT-3 as a Data Generator for Medical Dialogue Summarization

    Authors: Bharath Chintagunta, Namit Katariya, Xavier Amatriain, Anitha Kannan

    Abstract: In medical dialogue summarization, summaries must be coherent and must capture all the medically relevant information in the dialogue. However, learning effective models for summarization requires large amounts of labeled data which is especially hard to obtain. We present an algorithm to create synthetic training data with an explicit focus on capturing medically relevant information. We utilize G…

    Submitted 9 September, 2021; originally announced October 2021.

    Comments: Accepted to Machine Learning for Healthcare 2021

  4. arXiv:2009.08666  [pdf, other]

    cs.CL cs.AI cs.LG

    Dr. Summarize: Global Summarization of Medical Dialogue by Exploiting Local Structures

    Authors: Anirudh Joshi, Namit Katariya, Xavier Amatriain, Anitha Kannan

    Abstract: Understanding a medical conversation between a patient and a physician poses a unique natural language understanding challenge since it combines elements of standard open ended conversation with very domain specific elements that require expertise and medical knowledge. Summarization of medical conversations is a particularly important aspect of medical conversation understanding since it addresse…

    Submitted 18 September, 2020; originally announced September 2020.

    Comments: Accepted for publication in Findings of EMNLP at EMNLP 2020

  5. arXiv:2008.13546  [pdf, other]

    cs.IR cs.CL cs.LG

    Effective Transfer Learning for Identifying Similar Questions: Matching User Questions to COVID-19 FAQs

    Authors: Clara H. McCreery, Namit Katariya, Anitha Kannan, Manish Chablani, Xavier Amatriain

    Abstract: People increasingly search online for answers to their medical questions but the rate at which medical questions are asked online significantly exceeds the capacity of qualified people to answer them. This leaves many questions unanswered or inadequately answered. Many of these questions are not unique, and reliable identification of similar questions would enable more efficient and effective ques…

    Submitted 4 August, 2020; originally announced August 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1910.04192

  6. arXiv:1911.08554  [pdf, other]

    cs.CL cs.AI cs.LG

    Classification as Decoder: Trading Flexibility for Control in Medical Dialogue

    Authors: Sam Shleifer, Manish Chablani, Anitha Kannan, Namit Katariya, Xavier Amatriain

    Abstract: Generative seq2seq dialogue systems are trained to predict the next word in dialogues that have already occurred. They can learn from large unlabeled conversation datasets, build a deeper understanding of conversational context, and generate a wide variety of responses. This flexibility comes at the cost of control, a concerning tradeoff in doctor/patient interactions. Inaccuracies, typos, or unde…

    Submitted 15 November, 2019; originally announced November 2019.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2019 - Extended Abstract. arXiv admin note: substantial text overlap with arXiv:1910.03476

  7. arXiv:1910.04192  [pdf, other]

    cs.LG cs.CL stat.ML

    Domain-Relevant Embeddings for Medical Question Similarity

    Authors: Clara McCreery, Namit Katariya, Anitha Kannan, Manish Chablani, Xavier Amatriain

    Abstract: The rate at which medical questions are asked online far exceeds the capacity of qualified people to answer them, and many of these questions are not unique. Identifying same-question pairs could enable questions to be answered more effectively. While many research efforts have focused on the problem of general question similarity for non-medical applications, these approaches do not generalize we…

    Submitted 14 November, 2019; v1 submitted 9 October, 2019; originally announced October 2019.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2019 - Extended Abstract

  8. arXiv:1910.03476  [pdf, other]

    cs.CL cs.LG

    Classification As Decoder: Trading Flexibility For Control In Neural Dialogue

    Authors: Sam Shleifer, Manish Chablani, Namit Katariya, Anitha Kannan, Xavier Amatriain

    Abstract: Generative seq2seq dialogue systems are trained to predict the next word in dialogues that have already occurred. They can learn from large unlabeled conversation datasets, build a deep understanding of conversational context, and generate a wide variety of responses. This flexibility comes at the cost of control. Undesirable responses in the training data will be reproduced by the model at infere…

    Submitted 17 October, 2019; v1 submitted 4 October, 2019; originally announced October 2019.

  9. arXiv:1910.02830  [pdf, other]

    cs.LG cs.AI stat.ML

    Open Set Medical Diagnosis

    Authors: Viraj Prabhu, Anitha Kannan, Geoffrey J. Tso, Namit Katariya, Manish Chablani, David Sontag, Xavier Amatriain

    Abstract: Machine-learned diagnosis models have shown promise as medical aides but are trained under a closed-set assumption, i.e. that models will only encounter conditions on which they have been trained. However, it is practically infeasible to obtain sufficient training data for every human condition, and once deployed such models will invariably face previously unseen conditions. We frame machine-learn…

    Submitted 7 October, 2019; originally announced October 2019.

    Comments: Abbreviated version to appear at Machine Learning for Healthcare (ML4H) Workshop at NeurIPS 2019