Zum Hauptinhalt springen

Showing 1–17 of 17 results for author: Khosla, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2312.17172  [pdf, other

    cs.CV cs.AI cs.CL

    Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action

    Authors: Jiasen Lu, Christopher Clark, Sangho Lee, Zichen Zhang, Savya Khosla, Ryan Marten, Derek Hoiem, Aniruddha Kembhavi

    Abstract: We present Unified-IO 2, the first autoregressive multimodal model that is capable of understanding and generating image, text, audio, and action. To unify different modalities, we tokenize inputs and outputs -- images, text, audio, action, bounding boxes, etc., into a shared semantic space and then process them with a single encoder-decoder transformer model. Since training with such diverse moda… ▽ More

    Submitted 28 December, 2023; originally announced December 2023.

    Comments: 38 pages, 20 figures

  2. arXiv:2312.12624  [pdf, other

    cs.CL

    Building a Llama2-finetuned LLM for Odia Language Utilizing Domain Knowledge Instruction Set

    Authors: Guneet Singh Kohli, Shantipriya Parida, Sambit Sekhar, Samirit Saha, Nipun B Nair, Parul Agarwal, Sonal Khosla, Kusumlata Patiyal, Debasish Dhal

    Abstract: Building LLMs for languages other than English is in great demand due to the unavailability and performance of multilingual LLMs, such as understanding the local context. The problem is critical for low-resource languages due to the need for instruction sets. In a multilingual country like India, there is a need for LLMs supporting Indic languages to provide generative AI and LLM-based technologie… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  3. arXiv:2312.06141  [pdf, other

    cs.AI

    Survey on Memory-Augmented Neural Networks: Cognitive Insights to AI Applications

    Authors: Savya Khosla, Zhen Zhu, Yifei He

    Abstract: This paper explores Memory-Augmented Neural Networks (MANNs), delving into how they blend human-like memory processes into AI. It covers different memory types, like sensory, short-term, and long-term memory, linking psychological theories with AI applications. The study investigates advanced architectures such as Hopfield Networks, Neural Turing Machines, Correlation Matrix Memories, Memformer, a… ▽ More

    Submitted 12 December, 2023; v1 submitted 11 December, 2023; originally announced December 2023.

  4. arXiv:2311.07850  [pdf, other

    cs.CL cs.AI cs.DB cs.LG

    Bring Your Own KG: Self-Supervised Program Synthesis for Zero-Shot KGQA

    Authors: Dhruv Agarwal, Rajarshi Das, Sopan Khosla, Rashmi Gangadharaiah

    Abstract: We present BYOKG, a universal question-answering (QA) system that can operate on any knowledge graph (KG), requires no human-annotated training data, and can be ready to use within a day -- attributes that are out-of-scope for current KGQA systems. BYOKG draws inspiration from the remarkable ability of humans to comprehend information present in an unseen KG through exploration -- starting at rand… ▽ More

    Submitted 21 May, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

  5. arXiv:2211.00928  [pdf, ps, other

    cs.LG cs.AI

    Neural Active Learning on Heteroskedastic Distributions

    Authors: Savya Khosla, Chew Kin Whye, Jordan T. Ash, Cyril Zhang, Kenji Kawaguchi, Alex Lamb

    Abstract: Models that can actively seek out the best quality training data hold the promise of more accurate, adaptable, and efficient machine learning. Active learning techniques often tend to prefer examples that are the most difficult to classify. While this works well on homogeneous datasets, we find that it can lead to catastrophic failures when performed on multiple distributions with different degree… ▽ More

    Submitted 23 July, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

  6. arXiv:2207.09099  [pdf, other

    cs.CL cs.LG

    Analyzing Bagging Methods for Language Models

    Authors: Pranab Islam, Shaan Khosla, Arthur Lok, Mudit Saxena

    Abstract: Modern language models leverage increasingly large numbers of parameters to achieve performance on natural language understanding tasks. Ensembling these models in specific configurations for downstream tasks show even further performance improvements. In this paper, we perform an analysis of bagging language models and compare single language models to bagged ensembles that are roughly equivalent… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

  7. arXiv:2111.11159  [pdf

    cs.CL

    Investigating Cross-Linguistic Gender Bias in Hindi-English Across Domains

    Authors: Somya Khosla

    Abstract: Measuring, evaluating and reducing Gender Bias has come to the forefront with newer and improved language embeddings being released every few months. But could this bias vary from domain to domain? We see a lot of work to study these biases in various embedding models but limited work has been done to debias Indic languages. We aim to measure and study this bias in Hindi language, which is a highe… ▽ More

    Submitted 22 November, 2021; originally announced November 2021.

    Comments: 19 pages, WIP, Conclusions pending

    ACM Class: I.2.7

  8. arXiv:2104.10215  [pdf, other

    cs.CL cs.AI cs.LG

    Evaluating the Impact of a Hierarchical Discourse Representation on Entity Coreference Resolution Performance

    Authors: Sopan Khosla, James Fiacco, Carolyn Rose

    Abstract: Recent work on entity coreference resolution (CR) follows current trends in Deep Learning applied to embeddings and relatively simple task-related features. SOTA models do not make use of hierarchical representations of discourse structure. In this work, we leverage automatically constructed discourse parse trees within a neural approach and demonstrate a significant improvement on two benchmark e… ▽ More

    Submitted 20 April, 2021; originally announced April 2021.

    Comments: Also contains the Appendix. Accepted to NAACL 2021 as a short paper

  9. arXiv:2103.10730  [pdf, other

    cs.CL

    MuRIL: Multilingual Representations for Indian Languages

    Authors: Simran Khanuja, Diksha Bansal, Sarvesh Mehtani, Savya Khosla, Atreyee Dey, Balaji Gopalan, Dilip Kumar Margam, Pooja Aggarwal, Rajiv Teja Nagipogu, Shachi Dave, Shruti Gupta, Subhash Chandra Bose Gali, Vish Subramanian, Partha Talukdar

    Abstract: India is a multilingual society with 1369 rationalized languages and dialects being spoken across the country (INDIA, 2011). Of these, the 22 scheduled languages have a staggering total of 1.17 billion speakers and 121 languages have more than 10,000 speakers (INDIA, 2011). India also has the second largest (and an ever growing) digital footprint (Statista, 2020). Despite this, today's state-of-th… ▽ More

    Submitted 2 April, 2021; v1 submitted 19 March, 2021; originally announced March 2021.

  10. arXiv:2010.05738  [pdf, other

    cs.CL cs.AI cs.LG

    Using Type Information to Improve Entity Coreference Resolution

    Authors: Sopan Khosla, Carolyn Rose

    Abstract: Coreference resolution (CR) is an essential part of discourse analysis. Most recently, neural approaches have been proposed to improve over SOTA models from earlier paradigms. So far none of the published neural models leverage external semantic knowledge such as type information. This paper offers the first such model and evaluation, demonstrating modest gains in accuracy by introducing either go… ▽ More

    Submitted 12 October, 2020; originally announced October 2020.

    Comments: Accepted as Long Paper at CODI workshop EMNLP 2020

  11. arXiv:2010.02246  [pdf, other

    cs.CL cs.LG

    MedFilter: Improving Extraction of Task-relevant Utterances from Doctor-Patient Conversations through Integration of Discourse Structure and Ontological Knowledge

    Authors: Sopan Khosla, Shikhar Vashishth, Jill Fain Lehman, Carolyn Rose

    Abstract: Information extraction from conversational data is particularly challenging because the task-centric nature of conversation allows for effective communication of implicit information by humans, but is challenging for machines. The challenges may differ between utterances depending on the role of the speaker within the conversation, especially when relevant expertise is distributed asymmetrically a… ▽ More

    Submitted 21 June, 2022; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: Accepted as Long Paper to EMNLP 2020

  12. arXiv:2008.04820  [pdf, other

    cs.CL cs.IR cs.LG

    LTIatCMU at SemEval-2020 Task 11: Incorporating Multi-Level Features for Multi-Granular Propaganda Span Identification

    Authors: Sopan Khosla, Rishabh Joshi, Ritam Dutt, Alan W Black, Yulia Tsvetkov

    Abstract: In this paper we describe our submission for the task of Propaganda Span Identification in news articles. We introduce a BERT-BiLSTM based span-level propaganda classification model that identifies which token spans within the sentence are indicative of propaganda. The "multi-granular" model incorporates linguistic knowledge at various levels of text granularity, including word, sentence and docum… ▽ More

    Submitted 20 August, 2020; v1 submitted 11 August, 2020; originally announced August 2020.

  13. Surveys without Questions: A Reinforcement Learning Approach

    Authors: Atanu R Sinha, Deepali Jain, Nikhil Sheoran, Sopan Khosla, Reshmi Sasidharan

    Abstract: The 'old world' instrument, survey, remains a tool of choice for firms to obtain ratings of satisfaction and experience that customers realize while interacting online with firms. While avenues for survey have evolved from emails and links to pop-ups while browsing, the deficiencies persist. These include - reliance on ratings of very few respondents to infer about all customers' online interactio… ▽ More

    Submitted 11 June, 2020; originally announced June 2020.

    Comments: The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19)

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, July 2019, pp. 257-64

  14. arXiv:2006.05513  [pdf

    physics.med-ph cs.CV eess.IV

    A Deep Learning-Based Method for Automatic Segmentation of Proximal Femur from Quantitative Computed Tomography Images

    Authors: Chen Zhao, Joyce H. Keyak, Jinshan Tang, Tadashi S. Kaneko, Sundeep Khosla, Shreyasee Amin, Elizabeth J. Atkinson, Lan-Juan Zhao, Michael J. Serou, Chaoyang Zhang, Hui Shen, Hong-Wen Deng, Weihua Zhou

    Abstract: Purpose: Proximal femur image analyses based on quantitative computed tomography (QCT) provide a method to quantify the bone density and evaluate osteoporosis and risk of fracture. We aim to develop a deep-learning-based method for automatic proximal femur segmentation. Methods and Materials: We developed a 3D image segmentation method based on V-Net, an end-to-end fully convolutional neural netwo… ▽ More

    Submitted 1 July, 2020; v1 submitted 9 June, 2020; originally announced June 2020.

  15. arXiv:2005.01795  [pdf, other

    cs.CL cs.AI cs.LG stat.ML

    Generating SOAP Notes from Doctor-Patient Conversations Using Modular Summarization Techniques

    Authors: Kundan Krishna, Sopan Khosla, Jeffrey P. Bigham, Zachary C. Lipton

    Abstract: Following each patient visit, physicians draft long semi-structured clinical summaries called SOAP notes. While invaluable to clinicians and researchers, creating digital SOAP notes is burdensome, contributing to physician burnout. In this paper, we introduce the first complete pipelines to leverage deep summarization models to generate these notes based on transcripts of conversations between phy… ▽ More

    Submitted 2 June, 2021; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: Published at ACL 2021 Main Conference

  16. Interpolated Adversarial Training: Achieving Robust Neural Networks without Sacrificing Too Much Accuracy

    Authors: Alex Lamb, Vikas Verma, Kenji Kawaguchi, Alexander Matyasko, Savya Khosla, Juho Kannala, Yoshua Bengio

    Abstract: Adversarial robustness has become a central goal in deep learning, both in the theory and the practice. However, successful methods to improve the adversarial robustness (such as adversarial training) greatly hurt generalization performance on the unperturbed data. This could have a major impact on how the adversarial robustness affects real world systems (i.e. many may opt to forego robustness if… ▽ More

    Submitted 19 October, 2022; v1 submitted 16 June, 2019; originally announced June 2019.

    Comments: This is the latest version, which is published in the Journal, "Neural Networks", in 2022. All the previous results are unchanged. First two authors contributed equally

    Journal ref: Neural Networks, volume 154, pages 218-233 (2022)

  17. arXiv:1805.07966  [pdf, other

    cs.CL cs.AI

    Aff2Vec: Affect--Enriched Distributional Word Representations

    Authors: Sopan Khosla, Niyati Chhaya, Kushal Chawla

    Abstract: Human communication includes information, opinions, and reactions. Reactions are often captured by the affective-messages in written as well as verbal communications. While there has been work in affect modeling and to some extent affective content generation, the area of affective word distributions in not well studied. Synsets and lexica capture semantic relationships across words. These models… ▽ More

    Submitted 21 May, 2018; originally announced May 2018.