Showing 1–12 of 12 results for author: Wagle, S

Searching in archive cs.
  1. arXiv:2404.09861  [pdf, other]

    cs.LG

    Unsupervised Federated Optimization at the Edge: D2D-Enabled Learning without Labels

    Authors: Satyavrat Wagle, Seyyedali Hosseinalipour, Naji Khosravan, Christopher G. Brinton

    Abstract: Federated learning (FL) is a popular solution for distributed machine learning (ML). While FL has traditionally been studied for supervised ML tasks, in many applications, it is impractical to assume availability of labeled data across devices. To this end, we develop Cooperative Federated unsupervised Contrastive Learning (CF-CL) to facilitate FL across edge devices with unlabeled datasets.…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 16 pages, 11 figures

  2. arXiv:2403.08092  [pdf, other]

    cs.CV

    Mitigating the Impact of Attribute Editing on Face Recognition

    Authors: Sudipta Banerjee, Sai Pranaswi Mullangi, Shruti Wagle, Chinmay Hegde, Nasir Memon

    Abstract: Through a large-scale study over diverse face images, we show that facial attribute editing using modern generative AI models can severely degrade automated face recognition systems. This degradation persists even with identity-preserving generative models. To mitigate this issue, we propose two novel techniques for local and global attribute editing. We empirically ablate twenty-six facial semant…

    Submitted 9 April, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: Under review

  3. arXiv:2402.09629  [pdf, other]

    cs.LG

    Smart Information Exchange for Unsupervised Federated Learning via Reinforcement Learning

    Authors: Seohyun Lee, Anindya Bijoy Das, Satyavrat Wagle, Christopher G. Brinton

    Abstract: One of the main challenges of decentralized machine learning paradigms such as Federated Learning (FL) is the presence of local non-i.i.d. datasets. Device-to-device (D2D) transfers between distributed devices have been shown to be an effective tool for dealing with this problem and to be robust to stragglers. In an unsupervised case, however, it is not obvious how data exchanges should take place due to…

    Submitted 14 February, 2024; originally announced February 2024.

  4. arXiv:2311.12289  [pdf, other]

    cs.CL cs.AI

    ATLANTIC: Structure-Aware Retrieval-Augmented Language Model for Interdisciplinary Science

    Authors: Sai Munikoti, Anurag Acharya, Sridevi Wagle, Sameera Horawalavithana

    Abstract: Large language models record impressive performance on many natural language processing tasks. However, their knowledge capacity is limited to the pretraining corpus. Retrieval augmentation offers an effective solution by retrieving context from external knowledge sources to complement the language model. However, existing retrieval augmentation techniques ignore the structural relationships betwe…

    Submitted 20 November, 2023; originally announced November 2023.

    ACM Class: I.2.7

  5. arXiv:2311.09358  [pdf, other]

    cs.CL cs.AI

    Empirical evaluation of Uncertainty Quantification in Retrieval-Augmented Language Models for Science

    Authors: Sridevi Wagle, Sai Munikoti, Anurag Acharya, Sara Smith, Sameera Horawalavithana

    Abstract: Large language models (LLMs) have shown remarkable achievements in natural language processing tasks, producing high-quality outputs. However, LLMs still exhibit limitations, including the generation of factually incorrect information. In safety-critical applications, it is important to assess the confidence of LLM-generated content to make informed decisions. Retrieval Augmented Language Models (…

    Submitted 15 November, 2023; originally announced November 2023.

    ACM Class: I.2.7

  6. arXiv:2311.04348  [pdf, other]

    cs.CL cs.AI

    Evaluating the Effectiveness of Retrieval-Augmented Large Language Models in Scientific Document Reasoning

    Authors: Sai Munikoti, Anurag Acharya, Sridevi Wagle, Sameera Horawalavithana

    Abstract: Despite the dramatic progress in Large Language Model (LLM) development, LLMs often provide seemingly plausible but not factual information, often referred to as hallucinations. Retrieval-augmented LLMs provide a non-parametric approach to solve these issues by retrieving relevant information from external data sources and augmenting the training process. These models help to trace evidence from an e…

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: 5 pages

    ACM Class: I.2.7

  7. arXiv:2310.10920  [pdf, other]

    cs.CL cs.AI

    NuclearQA: A Human-Made Benchmark for Language Models for the Nuclear Domain

    Authors: Anurag Acharya, Sai Munikoti, Aaron Hellinger, Sara Smith, Sridevi Wagle, Sameera Horawalavithana

    Abstract: As LLMs have become increasingly popular, they have been used in almost every field. But as the application for LLMs expands from generic fields to narrow, focused science domains, there exists an ever-increasing gap in ways to evaluate their efficacy in those fields. For the benchmarks that do exist, a lot of them focus on questions that don't require proper understanding of the subject in questi…

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 9 pages

    ACM Class: I.2.7

  8. arXiv:2305.18432  [pdf]

    cs.LG cs.HC

    Interactive Decision Tree Creation and Enhancement with Complete Visualization for Explainable Modeling

    Authors: Boris Kovalerchuk, Andrew Dunn, Alex Worland, Sridevi Wagle

    Abstract: To increase the interpretability and prediction accuracy of the Machine Learning (ML) models, visualization of ML models is a key part of the ML process. Decision Trees (DTs) are essential in machine learning (ML) because they are used to understand many black box ML models including Deep Learning models. In this research, two new methods for creation and enhancement with complete visualizing Deci…

    Submitted 28 May, 2023; originally announced May 2023.

    Comments: 36 pages, 45 figures, 5 tables

  9. arXiv:2208.02856  [pdf, other]

    cs.LG

    Embedding Alignment for Unsupervised Federated Learning via Smart Data Exchange

    Authors: Satyavrat Wagle, Seyyedali Hosseinalipour, Naji Khosravan, Mung Chiang, Christopher G. Brinton

    Abstract: Federated learning (FL) has been recognized as one of the most promising solutions for distributed machine learning (ML). In most of the current literature, FL has been studied for supervised ML tasks, in which edge devices collect labeled data. Nevertheless, in many applications, it is impractical to assume existence of labeled data across devices. To this end, we develop a novel methodology, Coo…

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: Accepted for publication in IEEE Global Communications Conferences (GLOBECOM), 2022

  10. arXiv:2205.04035  [pdf]

    cs.LG

    Visualization of Decision Trees based on General Line Coordinates to Support Explainable Models

    Authors: Alex Worland, Sridevi Wagle, Boris Kovalerchuk

    Abstract: Visualization of Machine Learning (ML) models is an important part of the ML process to enhance the interpretability and prediction accuracy of the ML models. This paper proposes a new method SPC-DT to visualize the Decision Tree (DT) as interpretable models. These methods use a version of General Line Coordinates called Shifted Paired Coordinates (SPC). In SPC, each n-D point is visualized in a s…

    Submitted 9 May, 2022; originally announced May 2022.

    Comments: 8 pages, 21 figures, 5 tables

  11. arXiv:2107.04971  [pdf]

    cs.LG cs.AI

    Self-service Data Classification Using Interactive Visualization and Interpretable Machine Learning

    Authors: Sridevi Narayana Wagle, Boris Kovalerchuk

    Abstract: Machine learning algorithms often produce models considered as complex black-box models by both end users and developers. They fail to explain the model in terms of the domain they are designed for. The proposed Iterative Visual Logical Classifier (IVLC) is an interpretable machine learning algorithm that allows end users to design a model and classify data with more confidence and without having…

    Submitted 11 July, 2021; originally announced July 2021.

    Comments: 37 pages, 33 figures, 7 tables

  12. arXiv:2004.08488  [pdf, other]

    cs.DC

    Network-Aware Optimization of Distributed Learning for Fog Computing

    Authors: Su Wang, Yichen Ruan, Yuwei Tu, Satyavrat Wagle, Christopher G. Brinton, Carlee Joe-Wong

    Abstract: Fog computing promises to enable machine learning tasks to scale to large amounts of data by distributing processing across connected devices. Two key challenges to achieving this goal are heterogeneity in devices' compute resources and topology constraints on which devices can communicate with each other. We address these challenges by developing the first network-aware distributed learning optimi…

    Submitted 21 April, 2021; v1 submitted 17 April, 2020; originally announced April 2020.

    Comments: Accepted for publication in IEEE/ACM Transactions on Networking (16 pages)