Zum Hauptinhalt springen

Showing 1–3 of 3 results for author: Sadeq, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.17260  [pdf, other

    cs.CL

    Mitigating Hallucination in Fictional Character Role-Play

    Authors: Nafis Sadeq, Zhouhang Xie, Byungkyu Kang, Prarit Lamba, Xiang Gao, Julian McAuley

    Abstract: Role-playing has wide-ranging applications in customer support, embodied agents, computational social science, etc. The influence of parametric world knowledge of large language models (LLMs) often causes role-playing characters to act out of character and hallucinate about things outside the scope of their knowledge. In this work, we focus on the evaluation and mitigation of hallucination in fict… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  2. arXiv:2304.01597  [pdf, other

    cs.CL

    Unsupervised Improvement of Factual Knowledge in Language Models

    Authors: Nafis Sadeq, Byungkyu Kang, Prarit Lamba, Julian McAuley

    Abstract: Masked language modeling (MLM) plays a key role in pretraining large language models. But the MLM objective is often dominated by high-frequency words that are sub-optimal for learning factual knowledge. In this work, we propose an approach for influencing MLM pretraining in a way that can improve language model performance on a variety of knowledge-intensive tasks. We force the language model to… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

  3. arXiv:2210.11771  [pdf, other

    cs.CL

    InforMask: Unsupervised Informative Masking for Language Model Pretraining

    Authors: Nafis Sadeq, Canwen Xu, Julian McAuley

    Abstract: Masked language modeling is widely used for pretraining large language models for natural language understanding (NLU). However, random masking is suboptimal, allocating an equal masking rate for all tokens. In this paper, we propose InforMask, a new unsupervised masking strategy for training masked language models. InforMask exploits Pointwise Mutual Information (PMI) to select the most informati… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.