Zum Hauptinhalt springen

Showing 1–14 of 14 results for author: Thopalli, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.00331  [pdf, other

    cs.CV

    DECIDER: Leveraging Foundation Model Priors for Improved Model Failure Detection and Explanation

    Authors: Rakshith Subramanyam, Kowshik Thopalli, Vivek Narayanaswamy, Jayaraman J. Thiagarajan

    Abstract: Reliably detecting when a deployed machine learning model is likely to fail on a given input is crucial for ensuring safe operation. In this work, we propose DECIDER (Debiasing Classifiers to Identify Errors Reliably), a novel approach that leverages priors from large language models (LLMs) and vision-language models (VLMs) to detect failures in image classification models. DECIDER utilizes LLMs t… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: Accepted at ECCV (European Conference on Computer Vision) 2024

  2. arXiv:2406.17117  [pdf, other

    cs.CV

    Speeding Up Image Classifiers with Little Companions

    Authors: Yang Liu, Kowshik Thopalli, Jayaraman Thiagarajan

    Abstract: Scaling up neural networks has been a key recipe to the success of large language and vision models. However, in practice, up-scaled models can be disproportionately costly in terms of computations, providing only marginal improvements in performance; for example, EfficientViT-L3-384 achieves <2% improvement on ImageNet-1K accuracy over the base L1-224 model, while requiring $14\times$ more multip… ▽ More

    Submitted 26 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

  3. arXiv:2406.00529  [pdf, other

    cs.LG cs.CV stat.ML

    On the Use of Anchoring for Training Vision Models

    Authors: Vivek Narayanaswamy, Kowshik Thopalli, Rushil Anirudh, Yamen Mubarka, Wesam Sakla, Jayaraman J. Thiagarajan

    Abstract: Anchoring is a recent, architecture-agnostic principle for training deep neural networks that has been shown to significantly improve uncertainty estimation, calibration, and extrapolation capabilities. In this paper, we systematically explore anchoring as a general protocol for training vision models, providing fundamental insights into its training and inference processes and their implications… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  4. arXiv:2305.13284  [pdf, other

    cs.CV cs.AI

    Target-Aware Generative Augmentations for Single-Shot Adaptation

    Authors: Kowshik Thopalli, Rakshith Subramanyam, Pavan Turaga, Jayaraman J. Thiagarajan

    Abstract: In this paper, we address the problem of adapting models from a source domain to a target domain, a task that has become increasingly important due to the brittle generalization of deep neural networks. While several test-time adaptation techniques have emerged, they typically rely on synthetic toolbox data augmentations in cases of limited target data availability. We consider the challenging set… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: Accepted at International Conference Machine Learning (ICML) 2023

  5. arXiv:2210.16692  [pdf, other

    cs.CV cs.LG stat.ML

    Single-Shot Domain Adaptation via Target-Aware Generative Augmentation

    Authors: Rakshith Subramanyam, Kowshik Thopalli, Spring Berman, Pavan Turaga, Jayaraman J. Thiagarajan

    Abstract: The problem of adapting models from a source domain using data from any target domain of interest has gained prominence, thanks to the brittle generalization in deep neural networks. While several test-time adaptation techniques have emerged, they typically rely on synthetic data augmentations in cases of limited target data availability. In this paper, we consider the challenging setting of singl… ▽ More

    Submitted 29 October, 2022; originally announced October 2022.

  6. arXiv:2207.04185  [pdf, other

    cs.CV cs.LG

    Domain Alignment Meets Fully Test-Time Adaptation

    Authors: Kowshik Thopalli, Pavan Turaga, Jayaraman J. Thiagarajan

    Abstract: A foundational requirement of a deployed ML model is to generalize to data drawn from a testing distribution that is different from training. A popular solution to this problem is to adapt a pre-trained model to novel domains using only unlabeled data. In this paper, we focus on a challenging variant of this problem, where access to the original source data is restricted. While fully test-time ada… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

    Comments: 16 Pages including references, 5 figures

  7. arXiv:2201.01806  [pdf, other

    cs.LG cs.CV

    Revisiting Deep Subspace Alignment for Unsupervised Domain Adaptation

    Authors: Kowshik Thopalli, Jayaraman J Thiagarajan, Rushil Anirudh, Pavan K Turaga

    Abstract: Unsupervised domain adaptation (UDA) aims to transfer and adapt knowledge from a labeled source domain to an unlabeled target domain. Traditionally, subspace-based methods form an important class of solutions to this problem. Despite their mathematical elegance and tractability, these methods are often found to be ineffective at producing domain-invariant features with complex, real-world datasets… ▽ More

    Submitted 5 January, 2022; originally announced January 2022.

    Comments: arXiv admin note: text overlap with arXiv:1906.04338

  8. arXiv:2112.09802  [pdf, other

    cs.LG cs.CV

    Automated Domain Discovery from Multiple Sources to Improve Zero-Shot Generalization

    Authors: Kowshik Thopalli, Sameeksha Katoch, Pavan Turaga, Jayaraman J. Thiagarajan

    Abstract: Domain generalization (DG) methods aim to develop models that generalize to settings where the test distribution is different from the training data. In this paper, we focus on the challenging problem of multi-source zero shot DG (MDG), where labeled training data from multiple source domains is available but with no access to data from the target domain. A wide range of solutions have been propos… ▽ More

    Submitted 3 November, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

  9. arXiv:2103.11374  [pdf, other

    cs.CV cs.RO

    MaAST: Map Attention with Semantic Transformersfor Efficient Visual Navigation

    Authors: Zachary Seymour, Kowshik Thopalli, Niluthpol Mithun, Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar

    Abstract: Visual navigation for autonomous agents is a core task in the fields of computer vision and robotics. Learning-based methods, such as deep reinforcement learning, have the potential to outperform the classical solutions developed for this task; however, they come at a significantly increased computational load. Through this work, we design a novel approach that focuses on performing better or comp… ▽ More

    Submitted 21 March, 2021; originally announced March 2021.

    Comments: 6 pages, 5 figures, accepted at ICRA 2021

  10. arXiv:2002.03875  [pdf, other

    stat.ML cs.LG

    Calibrate and Prune: Improving Reliability of Lottery Tickets Through Prediction Calibration

    Authors: Bindya Venkatesh, Jayaraman J. Thiagarajan, Kowshik Thopalli, Prasanna Sattigeri

    Abstract: The hypothesis that sub-network initializations (lottery) exist within the initializations of over-parameterized networks, which when trained in isolation produce highly generalizable models, has led to crucial insights into network initialization and has enabled efficient inferencing. Supervised models with uncalibrated confidences tend to be overconfident even when making wrong prediction. In th… ▽ More

    Submitted 30 September, 2020; v1 submitted 10 February, 2020; originally announced February 2020.

  11. arXiv:1911.10600  [pdf, other

    cs.CV

    Invenio: Discovering Hidden Relationships Between Tasks/Domains Using Structured Meta Learning

    Authors: Sameeksha Katoch, Kowshik Thopalli, Jayaraman J. Thiagarajan, Pavan Turaga, Andreas Spanias

    Abstract: Exploiting known semantic relationships between fine-grained tasks is critical to the success of recent model agnostic approaches. These approaches often rely on meta-optimization to make a model robust to systematic task or domain shifts. However, in practice, the performance of these methods can suffer, when there are no coherent semantic relationships between the tasks (or domains). We present… ▽ More

    Submitted 1 February, 2020; v1 submitted 24 November, 2019; originally announced November 2019.

    Comments: Semantic structure development for tasks/domains essential for efficient knowledge transfer

  12. arXiv:1906.04338  [pdf, other

    stat.ML cs.CV cs.LG

    SALT: Subspace Alignment as an Auxiliary Learning Task for Domain Adaptation

    Authors: Kowshik Thopalli, Jayaraman J. Thiagarajan, Rushil Anirudh, Pavan Turaga

    Abstract: Unsupervised domain adaptation aims to transfer and adapt knowledge learned from a labeled source domain to an unlabeled target domain. Key components of unsupervised domain adaptation include: (a) maximizing performance on the target, and (b) aligning the source and target domains. Traditionally, these tasks have either been considered as separate, or assumed to be implicitly addressed together w… ▽ More

    Submitted 18 December, 2019; v1 submitted 10 June, 2019; originally announced June 2019.

  13. arXiv:1811.04491  [pdf, other

    cs.CV

    Multiple Subspace Alignment Improves Domain Adaptation

    Authors: Kowshik Thopalli, Rushil Anirudh, Jayaraman J. Thiagarajan, Pavan Turaga

    Abstract: We present a novel unsupervised domain adaptation (DA) method for cross-domain visual recognition. Though subspace methods have found success in DA, their performance is often limited due to the assumption of approximating an entire dataset using a single low-dimensional subspace. Instead, we develop a method to effectively represent the source and target datasets via a collection of low-dimension… ▽ More

    Submitted 11 November, 2018; originally announced November 2018.

    Comments: under review in ICASSP 2019

  14. arXiv:1807.10400  [pdf, other

    cs.CV

    Perturbation Robust Representations of Topological Persistence Diagrams

    Authors: Anirudh Som, Kowshik Thopalli, Karthikeyan Natesan Ramamurthy, Vinay Venkataraman, Ankita Shukla, Pavan Turaga

    Abstract: Topological methods for data analysis present opportunities for enforcing certain invariances of broad interest in computer vision, including view-point in activity analysis, articulation in shape analysis, and measurement invariance in non-linear dynamical modeling. The increasing success of these methods is attributed to the complementary information that topology provides, as well as availabili… ▽ More

    Submitted 26 July, 2018; originally announced July 2018.

    Comments: 19 pages, 4 figures, 6 tables