Zum Hauptinhalt springen

Showing 1–5 of 5 results for author: Jayakumar, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.05967  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark

    Authors: David Romero, Chenyang Lyu, Haryo Akbarianto Wibowo, Teresa Lynn, Injy Hamed, Aditya Nanda Kishore, Aishik Mandal, Alina Dragonetti, Artem Abzaliev, Atnafu Lambebo Tonja, Bontu Fufa Balcha, Chenxi Whitehouse, Christian Salamea, Dan John Velasco, David Ifeoluwa Adelani, David Le Meur, Emilio Villa-Cueva, Fajri Koto, Fauzan Farooqui, Frederico Belcavello, Ganzorig Batnasan, Gisela Vallejo, Grainne Caulfield, Guido Ivetta, Haiyue Song , et al. (50 additional authors not shown)

    Abstract: Visual Question Answering (VQA) is an important task in multimodal AI, and it is often used to test the ability of vision-language models to understand and reason on knowledge present in both visual and textual data. However, most of the current VQA models use datasets that are primarily focused on English and a few major world languages, with images that are typically Western-centric. While recen… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  2. arXiv:2403.13903  [pdf, other

    cs.CL

    Leveraging Linguistically Enhanced Embeddings for Open Information Extraction

    Authors: Fauzan Farooqui, Thanmay Jayakumar, Pulkit Mathur, Mansi Radke

    Abstract: Open Information Extraction (OIE) is a structured prediction (SP) task in Natural Language Processing (NLP) that aims to extract structured $n$-ary tuples - usually subject-relation-object triples - from free text. The word embeddings in the input text can be enhanced with linguistic features, usually Part-of-Speech (PoS) and Syntactic Dependency Parse (SynDP) labels. However, past enhancement tec… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Accepted at LREC-COLING 2024 Main Conference, Long Paper

  3. arXiv:2401.15006  [pdf, other

    cs.CL cs.AI

    Airavata: Introducing Hindi Instruction-tuned LLM

    Authors: Jay Gala, Thanmay Jayakumar, Jaavid Aktar Husain, Aswanth Kumar M, Mohammed Safi Ur Rahman Khan, Diptesh Kanojia, Ratish Puduppully, Mitesh M. Khapra, Raj Dabre, Rudra Murthy, Anoop Kunchukuttan

    Abstract: We announce the initial release of "Airavata," an instruction-tuned LLM for Hindi. Airavata was created by fine-tuning OpenHathi with diverse, instruction-tuning Hindi datasets to make it better suited for assistive tasks. Along with the model, we also share the IndicInstruct dataset, which is a collection of diverse instruction-tuning datasets to enable further research for Indic LLMs. Additional… ▽ More

    Submitted 26 February, 2024; v1 submitted 26 January, 2024; originally announced January 2024.

    Comments: Work in progress

  4. arXiv:2401.14280  [pdf, other

    cs.CL cs.AI

    RomanSetu: Efficiently unlocking multilingual capabilities of Large Language Models via Romanization

    Authors: Jaavid Aktar Husain, Raj Dabre, Aswanth Kumar, Jay Gala, Thanmay Jayakumar, Ratish Puduppully, Anoop Kunchukuttan

    Abstract: This study addresses the challenge of extending Large Language Models (LLMs) to non-English languages that use non-Roman scripts. We propose an approach that utilizes the romanized form of text as an interface for LLMs, hypothesizing that its frequent informal use and shared tokens with English enhance cross-lingual alignment. Our approach involves the continual pretraining of an English LLM like… ▽ More

    Submitted 23 June, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: Accepted to ACL 2024

  5. arXiv:2311.08890  [pdf, other

    cs.CL

    Large Language Models are legal but they are not: Making the case for a powerful LegalLLM

    Authors: Thanmay Jayakumar, Fauzan Farooqui, Luqman Farooqui

    Abstract: Realizing the recent advances in Natural Language Processing (NLP) to the legal sector poses challenging problems such as extremely long sequence lengths, specialized vocabulary that is usually only understood by legal professionals, and high amounts of data imbalance. The recent surge of Large Language Models (LLMs) has begun to provide new opportunities to apply NLP in the legal domain due to th… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 7 pages, Accepted at Natural Legal Language Processing Workshop, EMNLP 2023