Zum Hauptinhalt springen

Showing 1–3 of 3 results for author: Gunda, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.17008  [pdf, other

    cs.IR cs.LG

    Evaluation of Table Representations to Answer Questions from Tables in Documents : A Case Study using 3GPP Specifications

    Authors: Sujoy Roychowdhury, Sumit Soman, HG Ranjani, Avantika Sharma, Neeraj Gunda, Sai Krishna Bala

    Abstract: With the ubiquitous use of document corpora for question answering, one important aspect which is especially relevant for technical documents is the ability to extract information from tables which are interspersed with text. The major challenge in this is that unlike free-flow text or isolated set of tables, the representation of a table in terms of what is a relevant chunk is not obvious. We con… ▽ More

    Submitted 30 August, 2024; originally announced August 2024.

    Comments: 10 pages, 4 figures, 2 tables

    MSC Class: 68T50 ACM Class: I.2.7

  2. arXiv:2407.12873  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Evaluation of RAG Metrics for Question Answering in the Telecom Domain

    Authors: Sujoy Roychowdhury, Sumit Soman, H G Ranjani, Neeraj Gunda, Vansh Chhabra, Sai Krishna Bala

    Abstract: Retrieval Augmented Generation (RAG) is widely used to enable Large Language Models (LLMs) perform Question Answering (QA) tasks in various domains. However, RAG based on open-source LLM for specialized domains has challenges of evaluating generated responses. A popular framework in the literature is the RAG Assessment (RAGAS), a publicly available library which uses LLMs for evaluation. One disad… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: Accepted for publication in ICML 2024 Workshop on Foundation Models in the Wild

    MSC Class: 68T50 ACM Class: I.2.7

  3. arXiv:2406.12336  [pdf, other

    cs.CL cs.AI cs.LG

    A Compass for Navigating the World of Sentence Embeddings for the Telecom Domain

    Authors: Sujoy Roychowdhury, Sumit Soman, H. G. Ranjani, Vansh Chhabra, Neeraj Gunda, Subhadip Bandyopadhyay, Sai Krishna Bala

    Abstract: A plethora of sentence embedding models makes it challenging to choose one, especially for domains such as telecom, rich with specialized vocabulary. We evaluate multiple embeddings obtained from publicly available models and their domain-adapted variants, on both point retrieval accuracies as well as their (95\%) confidence intervals. We establish a systematic method to obtain thresholds for simi… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 10 pages, 3 figures, 4 tables

    MSC Class: 68T50 ACM Class: I.2.7