Showing 1–1 of 1 results for author: Fawi, M

  1. arXiv:2408.14572 [pdf, ps, other]

    cs.LG cs.AI cs.CL

    CURLoRA: Stable LLM Continual Fine-Tuning and Catastrophic Forgetting Mitigation

    Authors: Muhammad Fawi

    Abstract: This paper introduces CURLoRA, a novel approach to fine-tuning large language models (LLMs) that leverages CUR matrix decomposition in the context of Low-Rank Adaptation (LoRA). Our method addresses two critical challenges in LLM fine-tuning: mitigating catastrophic forgetting during continual learning and reducing the number of trainable parameters. We propose a unique modification to the CUR dec…

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: Code available at https://github.com/MNoorFawi/curlora