Zum Hauptinhalt springen

Showing 1–9 of 9 results for author: Iofinova, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.10994  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    Panza: A Personalized Text Writing Assistant via Data Playback and Local Fine-Tuning

    Authors: Armand Nicolicioiu, Eugenia Iofinova, Eldar Kurtic, Mahdi Nikdan, Andrei Panferov, Ilia Markov, Nir Shavit, Dan Alistarh

    Abstract: The availability of powerful open-source large language models (LLMs) opens exciting use-cases, such as automated personal assistants that adapt to the user's unique data and demands. Two key desiderata for such assistants are personalization-in the sense that the assistant should reflect the user's own style-and privacy-in the sense that users may prefer to always store their personal data locall… ▽ More

    Submitted 24 June, 2024; originally announced July 2024.

    Comments: Panza is available at https://github.com/IST-DASLab/PanzaMail

  2. arXiv:2310.04816  [pdf, other

    cs.CV cs.AI cs.LG

    Hacking Generative Models with Differentiable Network Bending

    Authors: Giacomo Aldegheri, Alina Rogalska, Ahmed Youssef, Eugenia Iofinova

    Abstract: In this work, we propose a method to 'hack' generative models, pushing their outputs away from the original training distribution towards a new objective. We inject a small-scale trainable module between the intermediate layers of the model and train it for a low number of iterations, keeping the rest of the network frozen. The resulting output images display an uncanny quality, given by the tensi… ▽ More

    Submitted 12 December, 2023; v1 submitted 7 October, 2023; originally announced October 2023.

    Comments: 12 pages, 10 figures, Machine Learning for Creativity and Design Workshop at NeurIPS 2023

    ACM Class: I.2.6; I.2.10

  3. arXiv:2310.04519  [pdf, other

    cs.LG

    SPADE: Sparsity-Guided Debugging for Deep Neural Networks

    Authors: Arshia Soltani Moakhar, Eugenia Iofinova, Elias Frantar, Dan Alistarh

    Abstract: It is known that sparsity can improve interpretability for deep neural networks. However, existing methods in the area either require networks that are pre-trained with sparsity constraints, or impose sparsity after the fact, altering the network's general behavior. In this paper, we demonstrate, for the first time, that sparsity can instead be incorporated into the interpretation process itself,… ▽ More

    Submitted 19 July, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: Published at ICML 2024. 33 pages

  4. arXiv:2308.02060  [pdf, other

    cs.LG cs.AI

    Accurate Neural Network Pruning Requires Rethinking Sparse Optimization

    Authors: Denis Kuznedelev, Eldar Kurtic, Eugenia Iofinova, Elias Frantar, Alexandra Peste, Dan Alistarh

    Abstract: Obtaining versions of deep neural networks that are both highly-accurate and highly-sparse is one of the main challenges in the area of model compression, and several high-performance pruning techniques have been investigated by the community. Yet, much less is known about the interaction between sparsity and the standard stochastic optimization techniques used for training sparse networks, and mo… ▽ More

    Submitted 8 September, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

  5. arXiv:2304.12622  [pdf, other

    cs.CV cs.LG

    Bias in Pruned Vision Models: In-Depth Analysis and Countermeasures

    Authors: Eugenia Iofinova, Alexandra Peste, Dan Alistarh

    Abstract: Pruning - that is, setting a significant subset of the parameters of a neural network to zero - is one of the most popular methods of model compression. Yet, several recent works have raised the issue that pruning may induce or exacerbate bias in the output of the compressed model. Despite existing evidence for this phenomenon, the relationship between neural network pruning and induced bias is no… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: 8 Pages / 49 with references and appendix. Accepted to CVPR 2023

  6. arXiv:2302.04852  [pdf, other

    cs.LG

    SparseProp: Efficient Sparse Backpropagation for Faster Training of Neural Networks

    Authors: Mahdi Nikdan, Tommaso Pegolotti, Eugenia Iofinova, Eldar Kurtic, Dan Alistarh

    Abstract: We provide a new efficient version of the backpropagation algorithm, specialized to the case where the weights of the neural network being trained are sparse. Our algorithm is general, as it applies to arbitrary (unstructured) sparsity and common layer types (e.g., convolutional or linear). We provide a fast vectorized implementation on commodity CPUs, and show that it can yield speedups in end-to… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

  7. arXiv:2111.13445  [pdf, other

    cs.CV cs.AI cs.LG

    How Well Do Sparse Imagenet Models Transfer?

    Authors: Eugenia Iofinova, Alexandra Peste, Mark Kurtz, Dan Alistarh

    Abstract: Transfer learning is a classic paradigm by which models pretrained on large "upstream" datasets are adapted to yield good results on "downstream" specialized datasets. Generally, more accurate models on the "upstream" dataset tend to provide better transfer accuracy "downstream". In this work, we perform an in-depth investigation of this phenomenon in the context of convolutional neural networks (… ▽ More

    Submitted 21 April, 2022; v1 submitted 26 November, 2021; originally announced November 2021.

    Comments: Accepted to CVPR'22. This version: 25 pages, 9 figures (including appendix). **Includes extended upstream training results, which are not present in the CVPR version.**

  8. arXiv:2106.12379  [pdf, other

    cs.LG cs.AI

    AC/DC: Alternating Compressed/DeCompressed Training of Deep Neural Networks

    Authors: Alexandra Peste, Eugenia Iofinova, Adrian Vladu, Dan Alistarh

    Abstract: The increasing computational requirements of deep neural networks (DNNs) have led to significant interest in obtaining DNN models that are sparse, yet accurate. Recent work has investigated the even harder case of sparse training, where the DNN weights are, for as much as possible, already sparse to reduce computational costs during training. Existing sparse training methods are often empirical an… ▽ More

    Submitted 15 December, 2021; v1 submitted 23 June, 2021; originally announced June 2021.

    Comments: Accepted at NeurIPS 2021

  9. arXiv:2106.11732  [pdf, other

    cs.LG stat.ML

    FLEA: Provably Robust Fair Multisource Learning from Unreliable Training Data

    Authors: Eugenia Iofinova, Nikola Konstantinov, Christoph H. Lampert

    Abstract: Fairness-aware learning aims at constructing classifiers that not only make accurate predictions, but also do not discriminate against specific groups. It is a fast-growing area of machine learning with far-reaching societal impact. However, existing fair learning methods are vulnerable to accidental or malicious artifacts in the training data, which can cause them to unknowingly produce unfair cl… ▽ More

    Submitted 11 January, 2023; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: 10 pages in main text; 42 pages including bibliography and appendix. Published in Transactions of Machine Learning Research (TMLR), 2022, https://openreview.net/forum?id=XsPopigZX; project website at https://github.com/ISTAustria-CVML/FLEA