Zum Hauptinhalt springen

Showing 1–3 of 3 results for author: Dasu, V A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.08152  [pdf, other

    cs.CR cs.AI cs.CL cs.LG

    Privacy-Preserving Data Deduplication for Enhancing Federated Learning of Language Models

    Authors: Aydin Abadi, Vishnu Asutosh Dasu, Sumanta Sarkar

    Abstract: Deduplication is a vital preprocessing step that enhances machine learning model performance and saves training time and energy. However, enhancing federated learning through deduplication poses challenges, especially regarding scalability and potential privacy violations if deduplication involves sharing all clients' data. In this paper, we address the problem of deduplication in a federated setu… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  2. arXiv:2407.04268  [pdf, other

    cs.LG cs.AI cs.SE

    NeuFair: Neural Network Fairness Repair with Dropout

    Authors: Vishnu Asutosh Dasu, Ashish Kumar, Saeid Tizpaz-Niari, Gang Tan

    Abstract: This paper investigates neuron dropout as a post-processing bias mitigation for deep neural networks (DNNs). Neural-driven software solutions are increasingly applied in socially critical domains with significant fairness implications. While neural networks are exceptionally good at finding statistical patterns from data, they may encode and amplify existing biases from the historical data. Existi… ▽ More

    Submitted 12 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

    Comments: Paper accepted at ACM ISSTA 2024

  3. arXiv:2310.16152  [pdf, other

    cs.CR cs.LG

    FLTrojan: Privacy Leakage Attacks against Federated Language Models Through Selective Weight Tampering

    Authors: Md Rafi Ur Rashid, Vishnu Asutosh Dasu, Kang Gu, Najrin Sultana, Shagufta Mehnaz

    Abstract: Federated learning (FL) has become a key component in various language modeling applications such as machine translation, next-word prediction, and medical record analysis. These applications are trained on datasets from many FL participants that often include privacy-sensitive data, such as healthcare records, phone/credit card numbers, login credentials, etc. Although FL enables computation with… ▽ More

    Submitted 25 May, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: 20 pages (including bibliography and Appendix), Submitted to ACM CCS '24