Showing 1–2 of 2 results for author: Anh, D D

  1. arXiv:2407.10998  [pdf, other]

    cs.CL cs.LG

    Discrete Diffusion Language Model for Long Text Summarization

    Authors: Do Huu Dat, Do Duc Anh, Anh Tuan Luu, Wray Buntine

    Abstract: While diffusion models excel at conditionally generating high-quality images, prior works on discrete diffusion models were not evaluated on conditional long-text generation. In this work, we address the limitations of prior discrete diffusion models for conditional long-text generation, particularly in long sequence-to-sequence tasks such as abstractive summarization. Despite fast decoding speeds c…

    Submitted 25 June, 2024; originally announced July 2024.

  2. arXiv:2402.11746  [pdf, other]

    cs.CL cs.AI

    Language Models are Homer Simpson! Safety Re-Alignment of Fine-tuned Language Models through Task Arithmetic

    Authors: Rishabh Bhardwaj, Do Duc Anh, Soujanya Poria

    Abstract: Aligned language models face a significant limitation as their fine-tuning often results in compromised safety. To tackle this, we propose a simple method RESTA that performs LLM safety realignment. RESTA stands for REstoring Safety through Task Arithmetic. At its core, it involves a simple arithmetic addition of a safety vector to the weights of the compromised model. We demonstrate the effective…

    Submitted 18 February, 2024; originally announced February 2024.
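
The RESTA abstract above describes its core step as a simple arithmetic addition of a safety vector to the weights of the compromised model. The following is only a minimal sketch of that one step, not the authors' implementation: it assumes weights are stored as plain Python lists keyed by parameter name, and the function name `add_safety_vector`, the parameter names, and all numeric values are hypothetical. How the safety vector is obtained is not covered by the truncated abstract and is not shown here.

```python
def add_safety_vector(compromised, safety_vector):
    """Return new weights equal to compromised + safety_vector, element-wise.

    Both arguments are dicts mapping a parameter name to a flat list of floats.
    This storage layout is an assumption made for illustration only.
    """
    if compromised.keys() != safety_vector.keys():
        raise ValueError("parameter names must match")

    restored = {}
    for name, weights in compromised.items():
        delta = safety_vector[name]
        if len(weights) != len(delta):
            raise ValueError(f"shape mismatch for parameter {name!r}")
        # Element-wise addition of the safety vector to the model weights.
        restored[name] = [w + d for w, d in zip(weights, delta)]
    return restored


# Hypothetical toy values, chosen only to make the arithmetic visible.
compromised = {"layer.weight": [1.0, 2.0], "layer.bias": [0.5]}
safety_vector = {"layer.weight": [0.5, -1.0], "layer.bias": [0.25]}

restored = add_safety_vector(compromised, safety_vector)
print(restored)
```

The function builds a new dict rather than modifying its inputs, so the compromised weights remain available for comparison.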