Showing 1–4 of 4 results for author: Soto, V

  1. arXiv:2405.11446  [pdf, other]

    cs.CL cs.LG

    MAML-en-LLM: Model Agnostic Meta-Training of LLMs for Improved In-Context Learning

    Authors: Sanchit Sinha, Yuguang Yue, Victor Soto, Mayank Kulkarni, Jianhua Lu, Aidong Zhang

    Abstract: Adapting large language models (LLMs) to unseen tasks with in-context training samples without fine-tuning remains an important research problem. To learn a robust LLM that adapts well to unseen tasks, multiple meta-training approaches have been proposed such as MetaICL and MetaICT, which involve meta-training pre-trained LLMs on a wide variety of diverse tasks. These meta-training approaches esse…

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: KDD 2024, 11 pages (9 main, 2 ref, 1 App). OpenReview: https://openreview.net/forum?id=JwecLNhWDy&referrer=%5BAuthor%20Console%5D(%2Fgroup%3Fid%3DKDD.org%2F2024%2FResearch_Track%2FAuthors%23your-submissions)

  2. Part of speech tagging for code switched data

    Authors: Fahad AlGhamdi, Giovanni Molina, Mona Diab, Thamar Solorio, Abdelati Hawwari, Victor Soto, Julia Hirschberg

    Abstract: We address the problem of Part of Speech tagging (POS) in the context of linguistic code switching (CS). CS is the phenomenon where a speaker switches between two languages or variants of the same language within or across utterances, known as intra-sentential or inter-sentential CS, respectively. Processing CS data is especially challenging in intra-sentential data given state of the art monoling…

    Submitted 3 November, 2019; v1 submitted 27 September, 2019; originally announced September 2019.

    Comments: Association for Computational Linguistics

  3. arXiv:1906.04138  [pdf, other]

    cs.CL

    Named Entity Recognition on Code-Switched Data: Overview of the CALCS 2018 Shared Task

    Authors: Gustavo Aguilar, Fahad AlGhamdi, Victor Soto, Mona Diab, Julia Hirschberg, Thamar Solorio

    Abstract: In the third shared task of the Computational Approaches to Linguistic Code-Switching (CALCS) workshop, we focus on Named Entity Recognition (NER) on code-switched social-media data. We divide the shared task into two competitions based on the English-Spanish (ENG-SPA) and Modern Standard Arabic-Egyptian (MSA-EGY) language pairs. We use Twitter data and 9 entity types to establish a new dataset fo…

    Submitted 10 June, 2019; originally announced June 2019.

    Comments: ACL 2018 (CALCS)

    Journal ref: Proceedings of the Third Workshop on Computational Approaches to Linguistic Code-Switching, 2018, 138-147

  4. arXiv:1703.08537  [pdf, ps, other]

    cs.CL

    Crowdsourcing Universal Part-Of-Speech Tags for Code-Switching

    Authors: Victor Soto, Julia Hirschberg

    Abstract: Code-switching is the phenomenon by which bilingual speakers switch between multiple languages during communication. The importance of developing language technologies for codeswitching data is immense, given the large populations that routinely code-switch. High-quality linguistic annotations are extremely valuable for any NLP task, and performance is often limited by the amount of high-quality l…

    Submitted 24 March, 2017; originally announced March 2017.

    Comments: Submitted to Interspeech 2017