Showing 1–2 of 2 results for author: Mundra, N

Searching in archive cs.
  1. arXiv:2407.05841  [pdf, other]

    cs.CL cs.LG

    An Empirical Comparison of Vocabulary Expansion and Initialization Approaches for Language Models

    Authors: Nandini Mundra, Aditya Nanda Kishore, Raj Dabre, Ratish Puduppully, Anoop Kunchukuttan, Mitesh M. Khapra

    Abstract: Language Models (LMs) excel in natural language processing tasks for English but show reduced performance in most other languages. This problem is commonly tackled by continually pre-training and fine-tuning these models for said languages. A significant issue in this process is the limited vocabulary coverage in the original model's tokenizer, leading to inadequate representation of new languages…

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: Under review

  2. arXiv:2305.07491  [pdf, other]

    cs.CL

    A Comprehensive Analysis of Adapter Efficiency

    Authors: Nandini Mundra, Sumanth Doddapaneni, Raj Dabre, Anoop Kunchukuttan, Ratish Puduppully, Mitesh M. Khapra

    Abstract: Adapters have been positioned as a parameter-efficient fine-tuning (PEFT) approach, whereby a minimal number of parameters are added to the model and fine-tuned. However, adapters have not been sufficiently analyzed to understand if PEFT translates to benefits in training/deployment efficiency and maintainability/extensibility. Through extensive experiments on many adapters, tasks, and languages i…

    Submitted 12 May, 2023; originally announced May 2023.