Zum Hauptinhalt springen

Showing 1–3 of 3 results for author: Nikdan, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.10994  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    Panza: A Personalized Text Writing Assistant via Data Playback and Local Fine-Tuning

    Authors: Armand Nicolicioiu, Eugenia Iofinova, Eldar Kurtic, Mahdi Nikdan, Andrei Panferov, Ilia Markov, Nir Shavit, Dan Alistarh

    Abstract: The availability of powerful open-source large language models (LLMs) opens exciting use-cases, such as automated personal assistants that adapt to the user's unique data and demands. Two key desiderata for such assistants are personalization-in the sense that the assistant should reflect the user's own style-and privacy-in the sense that users may prefer to always store their personal data locall… ▽ More

    Submitted 24 June, 2024; originally announced July 2024.

    Comments: Panza is available at https://github.com/IST-DASLab/PanzaMail

  2. arXiv:2401.04679  [pdf, other

    cs.CL cs.AI cs.LG

    RoSA: Accurate Parameter-Efficient Fine-Tuning via Robust Adaptation

    Authors: Mahdi Nikdan, Soroush Tabesh, Elvir Crnčević, Dan Alistarh

    Abstract: We investigate parameter-efficient fine-tuning (PEFT) methods that can provide good accuracy under limited computational and memory budgets in the context of large language models (LLMs). We present a new PEFT method called Robust Adaptation (RoSA) inspired by robust principal component analysis that jointly trains $\textit{low-rank}$ and $\textit{highly-sparse}$ components on top of a set of fixe… ▽ More

    Submitted 3 June, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

  3. arXiv:2302.04852  [pdf, other

    cs.LG

    SparseProp: Efficient Sparse Backpropagation for Faster Training of Neural Networks

    Authors: Mahdi Nikdan, Tommaso Pegolotti, Eugenia Iofinova, Eldar Kurtic, Dan Alistarh

    Abstract: We provide a new efficient version of the backpropagation algorithm, specialized to the case where the weights of the neural network being trained are sparse. Our algorithm is general, as it applies to arbitrary (unstructured) sparsity and common layer types (e.g., convolutional or linear). We provide a fast vectorized implementation on commodity CPUs, and show that it can yield speedups in end-to… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.