
Showing 1–50 of 156 results for author: Babu, R

Searching in archive cs.

  1. arXiv:2408.07841

    cs.LG cs.AI eess.SY

    SustainDC -- Benchmarking for Sustainable Data Center Control

    Authors: Avisek Naug, Antonio Guillen, Ricardo Luna, Vineet Gundecha, Desik Rengarajan, Sahand Ghorbanpour, Sajad Mousavi, Ashwin Ramesh Babu, Dejan Markovikj, Lekhapriya D Kashyap, Soumyendu Sarkar

    Abstract: Machine learning has driven an exponential increase in computational demand, leading to massive data centers that consume significant amounts of energy and contribute to climate change. This makes sustainable data center control a priority. In this paper, we introduce SustainDC, a set of Python environments for benchmarking multi-agent reinforcement learning (MARL) algorithms for data centers (DC)…

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: Under review at Advances in Neural Information Processing Systems 2024 (NeurIPS 2024)

  2. arXiv:2408.05083

    cs.CV

    PreciseControl: Enhancing Text-To-Image Diffusion Models with Fine-Grained Attribute Control

    Authors: Rishubh Parihar, Sachidanand VS, Sabariswaran Mani, Tejan Karmali, R. Venkatesh Babu

    Abstract: Recently, we have seen a surge of personalization methods for text-to-image (T2I) diffusion models to learn a concept using a few images. Existing approaches, when used for face personalization, struggle to achieve convincing inversion with identity preservation and rely on semantic text-based editing of the generated face. However, a more fine-grained control is desired for facial attribute editing…

    Submitted 24 July, 2024; originally announced August 2024.

    Comments: ECCV 2024, Project page: https://rishubhpar.github.io/PreciseControl.home/

  3. arXiv:2407.21674

    cs.CV cs.AI

    Synthetic Simplicity: Unveiling Bias in Medical Data Augmentation

    Authors: Krishan Agyakari Raja Babu, Rachana Sathish, Mrunal Pattanaik, Rahul Venkataramani

    Abstract: Synthetic data is becoming increasingly integral in data-scarce fields such as medical imaging, serving as a substitute for real data. However, its inherent statistical characteristics can significantly impact downstream tasks, potentially compromising deployment performance. In this study, we empirically investigate this issue and uncover a critical phenomenon: downstream neural networks often ex…

    Submitted 31 July, 2024; originally announced July 2024.

  4. arXiv:2407.15446

    cs.CV

    Text2Place: Affordance-aware Text Guided Human Placement

    Authors: Rishubh Parihar, Harsh Gupta, Sachidanand VS, R. Venkatesh Babu

    Abstract: For a given scene, humans can easily reason about the location and pose for placing objects. Designing a computational model to reason about these affordances poses a significant challenge, mirroring the intuitive reasoning abilities of humans. This work tackles the problem of realistic human insertion in a given background scene, termed Semantic Human Placement. This task is extremely chal…

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: ECCV 2024, Project Page: https://rishubhpar.github.io/Text2Place/

  5. arXiv:2406.10197

    cs.CV cs.AI cs.LG

    Crafting Parts for Expressive Object Composition

    Authors: Harsh Rangwani, Aishwarya Agarwal, Kuldeep Kulkarni, R. Venkatesh Babu, Srikrishna Karanam

    Abstract: Text-to-image generation from large generative models like Stable Diffusion, DALLE-2, etc., has become a common base for various tasks due to their superior quality and extensive knowledge bases. As image composition and generation are creative processes, artists need control over various parts of the images being generated. We find that just adding details about parts in the base text prompt…

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Project Page Will Be Here: https://rangwani-harsh.github.io/PartCraft

  6. arXiv:2406.05796

    cs.LG cs.CV

    ProFeAT: Projected Feature Adversarial Training for Self-Supervised Learning of Robust Representations

    Authors: Sravanti Addepalli, Priyam Dey, R. Venkatesh Babu

    Abstract: The need for abundant labelled data in supervised Adversarial Training (AT) has prompted the use of Self-Supervised Learning (SSL) techniques with AT. However, the direct application of existing SSL methods to adversarial training has been sub-optimal due to the increased training complexity of combining SSL with AT. A recent approach, DeACL, mitigates this by utilizing supervision from a standard…

    Submitted 9 June, 2024; originally announced June 2024.

  7. arXiv:2404.12498

    cs.LG cs.AI eess.SY

    A Configurable Pythonic Data Center Model for Sustainable Cooling and ML Integration

    Authors: Avisek Naug, Antonio Guillen, Ricardo Luna Gutierrez, Vineet Gundecha, Sahand Ghorbanpour, Sajad Mousavi, Ashwin Ramesh Babu, Soumyendu Sarkar

    Abstract: There have been growing discussions on estimating and subsequently reducing the operational carbon footprint of enterprise data centers. The design and intelligent control for data centers have an important impact on data center carbon footprint. In this paper, we showcase PyDCM, a Python library that enables extremely fast prototyping of data center design and applies reinforcement learning-enabl…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: NeurIPS 2023 Workshop on Tackling Climate Change with Machine Learning https://www.climatechange.ai/papers/neurips2023/15. arXiv admin note: substantial text overlap with arXiv:2310.03906

  8. arXiv:2404.10991

    cs.AI cs.LG eess.SY

    Function Approximation for Reinforcement Learning Controller for Energy from Spread Waves

    Authors: Soumyendu Sarkar, Vineet Gundecha, Sahand Ghorbanpour, Alexander Shmakov, Ashwin Ramesh Babu, Avisek Naug, Alexandre Pichard, Mathieu Cocho

    Abstract: The industrial multi-generator Wave Energy Converters (WEC) must handle multiple simultaneous waves coming from different directions called spread waves. These complex devices in challenging circumstances need controllers with multiple objectives of energy capture efficiency, reduction of structural stress to limit maintenance, and proactive protection against high waves. The Multi-Agent Reinforce…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: IJCAI 2023, Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, August 2023

    Journal ref: IJCAI 2023, Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence, August 2023, Article No 688, Pages 6201 to 6209

  9. arXiv:2404.10786

    cs.DC cs.AI cs.LG cs.MA eess.SY

    Sustainability of Data Center Digital Twins with Reinforcement Learning

    Authors: Soumyendu Sarkar, Avisek Naug, Antonio Guillen, Ricardo Luna, Vineet Gundecha, Ashwin Ramesh Babu, Sajad Mousavi

    Abstract: The rapid growth of machine learning (ML) has led to an increased demand for computational power, resulting in larger data centers (DCs) and higher energy consumption. To address this issue and reduce carbon emissions, intelligent design and control of DC components such as IT servers, cabinets, HVAC cooling, flexible load shifting, and battery energy storage are essential. However, the complexity…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 2024 Proceedings of the AAAI Conference on Artificial Intelligence

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 38, no. 20, pp. 22322-22330, Mar. 2024

  10. arXiv:2404.02900

    cs.CV cs.AI cs.LG

    DeiT-LT Distillation Strikes Back for Vision Transformer Training on Long-Tailed Datasets

    Authors: Harsh Rangwani, Pradipto Mondal, Mayank Mishra, Ashish Ramayee Asokan, R. Venkatesh Babu

    Abstract: Vision Transformer (ViT) has emerged as a prominent architecture for various computer vision tasks. In ViT, we divide the input image into patch tokens and process them through a stack of self-attention blocks. However, unlike Convolutional Neural Networks (CNN), ViT's simple architecture has no informative inductive bias (e.g., locality). Due to this, ViT requires a large amount of data for…

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: CVPR 2024. Project Page: https://rangwani-harsh.github.io/DeiT-LT

  11. arXiv:2403.18985

    cs.LG cs.AI cs.CR cs.CV cs.MA

    Robustness and Visual Explanation for Black Box Image, Video, and ECG Signal Classification with Reinforcement Learning

    Authors: Soumyendu Sarkar, Ashwin Ramesh Babu, Sajad Mousavi, Vineet Gundecha, Avisek Naug, Sahand Ghorbanpour

    Abstract: We present a generic Reinforcement Learning (RL) framework optimized for crafting adversarial attacks on different model types spanning from ECG signal analysis (1D), image classification (2D), and video classification (3D). The framework focuses on identifying sensitive regions and inducing misclassifications with minimal distortions and various distortion types. The novel RL method outperforms s…

    Submitted 22 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: AAAI Proceedings reference: https://ojs.aaai.org/index.php/AAAI/article/view/30579

    Journal ref: 2024 Proceedings of the AAAI Conference on Artificial Intelligence

  12. arXiv:2403.14092

    cs.LG cs.AI cs.MA eess.SY

    Carbon Footprint Reduction for Sustainable Data Centers in Real-Time

    Authors: Soumyendu Sarkar, Avisek Naug, Ricardo Luna, Antonio Guillen, Vineet Gundecha, Sahand Ghorbanpour, Sajad Mousavi, Dejan Markovikj, Ashwin Ramesh Babu

    Abstract: As machine learning workloads significantly increase energy consumption, sustainable data centers with low carbon emissions are becoming a top priority for governments and corporations worldwide. This requires a paradigm shift in optimizing power consumption in cooling and IT loads, shifting flexible loads based on the availability of renewable energy in the power grid, and leveraging battery stor…

    Submitted 25 March, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Journal ref: 2024 Proceedings of the AAAI Conference on Artificial Intelligence

  13. arXiv:2402.18206

    cs.CV

    Balancing Act: Distribution-Guided Debiasing in Diffusion Models

    Authors: Rishubh Parihar, Abhijnya Bhat, Abhipsa Basu, Saswat Mallick, Jogendra Nath Kundu, R. Venkatesh Babu

    Abstract: Diffusion Models (DMs) have emerged as powerful generative models with unprecedented image generation capability. These models are widely used for data augmentation and creative applications. However, DMs reflect the biases present in the training datasets. This is especially concerning in the context of faces, where the DM prefers one demographic subgroup over others (e.g., female vs. male). In this w…

    Submitted 29 May, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: CVPR 2024. Project Page: https://ab-34.github.io/balancing_act/

  14. arXiv:2311.16294

    cs.CV

    Aligning Non-Causal Factors for Transformer-Based Source-Free Domain Adaptation

    Authors: Sunandini Sanyal, Ashish Ramayee Asokan, Suvaansh Bhambri, Pradyumna YM, Akshay Kulkarni, Jogendra Nath Kundu, R Venkatesh Babu

    Abstract: Conventional domain adaptation algorithms aim to achieve better generalization by aligning only the task-discriminative causal factors between a source and target domain. However, we find that retaining the spurious correlation between causal and non-causal factors plays a vital role in bridging the domain gap and improving target adaptation. Therefore, we propose to build a framework that disenta…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: WACV 2024. Project Page: https://val.cds.iisc.ac.in/C-SFTrans/

  15. arXiv:2311.16052

    cs.CV

    Exploring Attribute Variations in Style-based GANs using Diffusion Models

    Authors: Rishubh Parihar, Prasanna Balaji, Raghav Magazine, Sarthak Vora, Tejan Karmali, Varun Jampani, R. Venkatesh Babu

    Abstract: Existing attribute editing methods treat semantic attributes as binary, resulting in a single edit per attribute. However, attributes such as eyeglasses, smiles, or hairstyles exhibit a vast range of diversity. In this work, we formulate the task of diverse attribute editing by modeling the multidimensional nature of attribute edits. This enables users to generate multiple plausible edits…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: NeurIPS Workshop on Diffusion Models 2023

  16. arXiv:2310.18679

    cs.CL cs.AI cs.LG

    N-Critics: Self-Refinement of Large Language Models with Ensemble of Critics

    Authors: Sajad Mousavi, Ricardo Luna Gutiérrez, Desik Rengarajan, Vineet Gundecha, Ashwin Ramesh Babu, Avisek Naug, Antonio Guillen, Soumyendu Sarkar

    Abstract: We propose a self-correction mechanism for Large Language Models (LLMs) to mitigate issues such as toxicity and fact hallucination. This method involves refining model outputs through an ensemble of critics and the model's own feedback. Drawing inspiration from human behavior, we explore whether LLMs can emulate the self-correction process observed in humans who often engage in self-reflection and…

    Submitted 8 November, 2023; v1 submitted 28 October, 2023; originally announced October 2023.

    Journal ref: NeurIPS 2023 Workshop on Robustness of Few-shot and Zero-shot Learning in Foundation Models 2023 (NeurIPS 2023)

  17. arXiv:2310.18626

    cs.CV cs.AI cs.LG

    Benchmark Generation Framework with Customizable Distortions for Image Classifier Robustness

    Authors: Soumyendu Sarkar, Ashwin Ramesh Babu, Sajad Mousavi, Zachariah Carmichael, Vineet Gundecha, Sahand Ghorbanpour, Ricardo Luna Gutierrez, Antonio Guillen, Avisek Naug

    Abstract: We present a novel framework for generating adversarial benchmarks to evaluate the robustness of image classification models. Our framework allows users to customize the types of distortions to be optimally applied to images, which helps address the specific distortions relevant to their deployment. The benchmark can generate datasets at various distortion levels to assess the robustness of differ…

    Submitted 8 November, 2023; v1 submitted 28 October, 2023; originally announced October 2023.

    Comments: 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

  18. arXiv:2310.08255

    cs.CV

    Leveraging Vision-Language Models for Improving Domain Generalization in Image Classification

    Authors: Sravanti Addepalli, Ashish Ramayee Asokan, Lakshay Sharma, R. Venkatesh Babu

    Abstract: Vision-Language Models (VLMs) such as CLIP are trained on large amounts of image-text pairs, resulting in remarkable generalization across several data distributions. However, in several cases, their expensive training and data collection/curation costs do not justify the end application. This motivates a vendor-client paradigm, where a vendor trains a large-scale VLM and grants only input-output…

    Submitted 9 March, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: Project page: http://val.cds.iisc.ac.in/VL2V-ADiP/

  19. RTDK-BO: High Dimensional Bayesian Optimization with Reinforced Transformer Deep kernels

    Authors: Alexander Shmakov, Avisek Naug, Vineet Gundecha, Sahand Ghorbanpour, Ricardo Luna Gutierrez, Ashwin Ramesh Babu, Antonio Guillen, Soumyendu Sarkar

    Abstract: Bayesian Optimization (BO), guided by Gaussian process (GP) surrogates, has proven to be an invaluable technique for efficient, high-dimensional, black-box optimization, a critical problem inherent to many applications such as industrial design and scientific computing. Recent contributions have introduced reinforcement learning (RL) to improve the optimization performance on both single function…

    Submitted 8 November, 2023; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: 2023 IEEE 19th International Conference on Automation Science and Engineering (CASE)

  20. PyDCM: Custom Data Center Models with Reinforcement Learning for Sustainability

    Authors: Avisek Naug, Antonio Guillen, Ricardo Luna Gutiérrez, Vineet Gundecha, Dejan Markovikj, Lekhapriya Dheeraj Kashyap, Lorenz Krause, Sahand Ghorbanpour, Sajad Mousavi, Ashwin Ramesh Babu, Soumyendu Sarkar

    Abstract: The increasing global emphasis on sustainability and reducing carbon emissions is pushing governments and corporations to rethink their approach to data center design and operation. Given their high energy consumption and exponentially large computational workloads, data centers are prime candidates for optimizing power consumption, especially in areas such as cooling and IT energy usage. A signif…

    Submitted 26 March, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: The 10th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation (BuildSys '23), November 15-16, 2023, Istanbul, Turkey

    Journal ref: 2023 BuildSys '23: Proceedings of the 10th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation

  21. arXiv:2309.07668

    cs.CV

    CoRF : Colorizing Radiance Fields using Knowledge Distillation

    Authors: Ankit Dhiman, R Srinath, Srinjay Sarkar, Lokesh R Boregowda, R Venkatesh Babu

    Abstract: Neural radiance field (NeRF) based methods enable high-quality novel-view synthesis for multi-view images. This work presents a method for synthesizing colorized novel views from input grey-scale multi-view images. When we apply image or video-based colorization methods on the generated grey-scale novel views, we observe artifacts due to inconsistency across views. Training a radiance field networ…

    Submitted 14 September, 2023; originally announced September 2023.

    Comments: AI3DCC @ ICCV 2023

  22. arXiv:2308.14023

    cs.CV

    Domain-Specificity Inducing Transformers for Source-Free Domain Adaptation

    Authors: Sunandini Sanyal, Ashish Ramayee Asokan, Suvaansh Bhambri, Akshay Kulkarni, Jogendra Nath Kundu, R. Venkatesh Babu

    Abstract: Conventional Domain Adaptation (DA) methods aim to learn domain-invariant feature representations to improve the target adaptation performance. However, we motivate that domain-specificity is equally important since in-domain trained models hold crucial domain-specific properties that are beneficial for adaptation. Hence, we propose to build a framework that supports disentanglement and learning o…

    Submitted 27 August, 2023; originally announced August 2023.

    Comments: ICCV 2023. Project page: http://val.cds.iisc.ac.in/DSiT-SFDA

  23. arXiv:2308.13021

    cs.HC

    Augmenting a Firefighters PPE -- Gas Mask SCBA

    Authors: Kunal Aneja, Tejaswini Ramkumar Babu, Rachel Chan

    Abstract: PPE (Personal Protective Equipment) has allowed firefighters to perform their everyday tasks without getting harmed since the mid 1800s. Now, the advancement of technology has given rise to the improvements of PPE. PPE can now include sensors to detect any number of environmental hazards (chemical, biological, temperature etc.). As the GT class of CS3750, we have decided to create a version of an…

    Submitted 24 August, 2023; originally announced August 2023.

  24. arXiv:2308.10337

    cs.CV

    Strata-NeRF : Neural Radiance Fields for Stratified Scenes

    Authors: Ankit Dhiman, Srinath R, Harsh Rangwani, Rishubh Parihar, Lokesh R Boregowda, Srinath Sridhar, R Venkatesh Babu

    Abstract: Neural Radiance Field (NeRF) approaches learn the underlying 3D representation of a scene and generate photo-realistic novel views with high fidelity. However, most proposed settings concentrate on modelling a single object or a single level of a scene. In the real world, we may capture a scene at multiple levels, resulting in a layered capture. For example, tourists usually capture a mon…

    Submitted 20 August, 2023; originally announced August 2023.

    Comments: ICCV 2023, Project Page: https://ankitatiisc.github.io/Strata-NeRF/

  25. arXiv:2306.06462

    cs.LG

    Boosting Adversarial Robustness using Feature Level Stochastic Smoothing

    Authors: Sravanti Addepalli, Samyak Jain, Gaurang Sriramanan, R. Venkatesh Babu

    Abstract: Advances in adversarial defenses have led to a significant improvement in the robustness of Deep Neural Networks. However, the robust accuracy of present state-of-the-art defenses is far from the requirements in critical applications such as robotics and autonomous navigation systems. Further, in practical use cases, network prediction alone might not suffice, and assignment of a confidence value f…

    Submitted 10 June, 2023; originally announced June 2023.

    Comments: CVPR Workshops 2021. First three authors contributed equally

  26. arXiv:2306.00559

    cs.CV cs.AI cs.LG

    We never go out of Style: Motion Disentanglement by Subspace Decomposition of Latent Space

    Authors: Rishubh Parihar, Raghav Magazine, Piyush Tiwari, R. Venkatesh Babu

    Abstract: Real-world objects perform complex motions that involve multiple independent motion components. For example, while talking, a person continuously changes their expressions, head, and body pose. In this work, we propose a novel method to decompose motion in videos by using a pretrained image GAN model. We discover disentangled motion subspaces in the latent space of widely used style-based GAN mode…

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: AI for content creation, CVPRW-2023

  27. arXiv:2305.11080

    cs.CV cs.CL

    Inspecting the Geographical Representativeness of Images from Text-to-Image Models

    Authors: Abhipsa Basu, R. Venkatesh Babu, Danish Pruthi

    Abstract: Recent progress in generative models has resulted in models that produce both realistic as well as relevant images for most textual inputs. These models are being used to generate millions of images every day, and hold the potential to drastically impact areas such as generative art, digital marketing and data augmentation. Given their outsized impact, it is important to ensure that the generated c…

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: Preprint, 15 pages, 9 figures

  28. arXiv:2304.10446

    cs.LG cs.CV

    Certified Adversarial Robustness Within Multiple Perturbation Bounds

    Authors: Soumalya Nandi, Sravanti Addepalli, Harsh Rangwani, R. Venkatesh Babu

    Abstract: Randomized smoothing (RS) is a well known certified defense against adversarial attacks, which creates a smoothed classifier by predicting the most likely class under random noise perturbations of inputs during inference. While initial work focused on robustness to $\ell_2$ norm perturbations using noise sampled from a Gaussian distribution, subsequent works have shown that different noise distrib…

    Submitted 20 April, 2023; originally announced April 2023.

  29. arXiv:2304.07560

    cs.CV

    Continual Domain Adaptation through Pruning-aided Domain-specific Weight Modulation

    Authors: Prasanna B, Sunandini Sanyal, R. Venkatesh Babu

    Abstract: In this paper, we propose to develop a method to address unsupervised domain adaptation (UDA) in a practical setting of continual learning (CL). The goal is to update the model on continually changing domains while preserving domain-specific knowledge to prevent catastrophic forgetting of past-seen domains. To this end, we build a framework for preserving domain-specific features utilizing the inh…

    Submitted 15 April, 2023; originally announced April 2023.

    Comments: CVPR CLVision Workshop 2023, For code see https://github.com/PrasannaB29/PACDA

  30. arXiv:2304.05866

    cs.CV cs.LG

    NoisyTwins: Class-Consistent and Diverse Image Generation through StyleGANs

    Authors: Harsh Rangwani, Lavish Bansal, Kartik Sharma, Tejan Karmali, Varun Jampani, R. Venkatesh Babu

    Abstract: StyleGANs are at the forefront of controllable image generation as they produce a latent space that is semantically disentangled, making it suitable for image editing and manipulation. However, the performance of StyleGANs severely degrades when trained via class-conditioning on large-scale long-tailed datasets. We find that one reason for degradation is the collapse of latents for each class in t…

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: CVPR 2023. Project Page: https://rangwani-harsh.github.io/NoisyTwins/

  31. arXiv:2304.00306

    cs.CV

    CapsFlow: Optical Flow Estimation with Capsule Networks

    Authors: Rahul Chand, Rajat Arora, K Ram Prabhakar, R Venkatesh Babu

    Abstract: We present a framework to use recently introduced Capsule Networks for solving the problem of Optical Flow, one of the fundamental computer vision tasks. Most of the existing state-of-the-art deep architectures use a correlation operation to match features. While the correlation layer is sensitive to the choice of hyperparameters and does not put a prior on the underlying structure o…

    Submitted 1 December, 2023; v1 submitted 1 April, 2023; originally announced April 2023.

    Comments: Newer version added to correct issue in the conference name of the previous version uploaded on April 1st

  32. arXiv:2303.15528

    cs.CV eess.IV

    Few-Shot Domain Adaptation for Low Light RAW Image Enhancement

    Authors: K. Ram Prabhakar, Vishal Vinod, Nihar Ranjan Sahoo, R. Venkatesh Babu

    Abstract: Enhancing practical low light raw images is a difficult task due to severe noise and color distortions from short exposure time and limited illumination. Despite the success of existing Convolutional Neural Network (CNN) based methods, their performance is not adaptable to different camera domains. In addition, such methods also require large datasets with short-exposure and corresponding long-exp…

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: BMVC 2021 Best Student Paper Award (Runner-Up). Project Page: https://val.cds.iisc.ac.in/HDR/BMVC21/index.html

    Journal ref: 32nd British Machine Vision Conference 2021, BMVC 2021, 327

  33. arXiv:2302.14685

    cs.LG cs.AI cs.CV

    DART: Diversify-Aggregate-Repeat Training Improves Generalization of Neural Networks

    Authors: Samyak Jain, Sravanti Addepalli, Pawan Sahu, Priyam Dey, R. Venkatesh Babu

    Abstract: Generalization of neural networks is crucial for deploying them safely in the real world. Common training strategies to improve generalization involve the use of data augmentations, ensembling and model averaging. In this work, we first establish a surprisingly simple but strong benchmark for generalization which utilizes diverse augmentations within a training minibatch, and show that this can le…

    Submitted 10 June, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

    Comments: CVPR 2023. First two authors contributed equally

  34. arXiv:2212.13827

    cs.LG cs.CV

    Escaping Saddle Points for Effective Generalization on Class-Imbalanced Data

    Authors: Harsh Rangwani, Sumukh K Aithal, Mayank Mishra, R. Venkatesh Babu

    Abstract: Real-world datasets exhibit imbalances of varying types and degrees. Several techniques based on re-weighting and margin adjustment of loss are often used to enhance the performance of neural networks, particularly on minority classes. In this work, we analyze the class-imbalanced learning problem by examining the loss landscape of neural networks trained with re-weighting and margin-based techniq…

    Submitted 28 December, 2022; originally announced December 2022.

    Comments: NeurIPS 2022. Code: https://github.com/val-iisc/Saddle-LongTail

  35. arXiv:2210.15909

    cs.CV cs.LG

    Subsidiary Prototype Alignment for Universal Domain Adaptation

    Authors: Jogendra Nath Kundu, Suvaansh Bhambri, Akshay Kulkarni, Hiran Sarkar, Varun Jampani, R. Venkatesh Babu

    Abstract: Universal Domain Adaptation (UniDA) deals with the problem of knowledge transfer between two datasets with domain-shift as well as category-shift. The goal is to categorize unlabeled target samples, either into one of the "known" categories or into a single "unknown" category. A major problem in UniDA is negative transfer, i.e. misalignment of "known" and "unknown" classes. To this end, we first u…

    Submitted 28 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022. Project page: https://sites.google.com/view/spa-unida

  36. arXiv:2210.15318

    cs.LG cs.CV

    Efficient and Effective Augmentation Strategy for Adversarial Training

    Authors: Sravanti Addepalli, Samyak Jain, R. Venkatesh Babu

    Abstract: Adversarial training of Deep Neural Networks is known to be significantly more data-hungry when compared to standard training. Furthermore, complex data augmentations such as AutoAugment, which have led to substantial gains in standard training of image classifiers, have not been successful with Adversarial Training. We first explain this contrasting behavior by viewing augmentation during trainin…

    Submitted 27 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022

  37. arXiv:2210.09866

    cs.CV cs.LG

    Towards Efficient and Effective Self-Supervised Learning of Visual Representations

    Authors: Sravanti Addepalli, Kaushal Bhogale, Priyam Dey, R. Venkatesh Babu

    Abstract: Self-supervision has emerged as a propitious method for visual representation learning after the recent paradigm shift from handcrafted pretext tasks to instance-similarity based approaches. Most state-of-the-art methods enforce similarity between various augmentations of a given image, while some methods additionally use contrastive approaches to explicitly ensure diverse representations. While t…

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: ECCV 2022

  38. arXiv:2210.09852

    cs.LG cs.CR cs.CV stat.ML

    Scaling Adversarial Training to Large Perturbation Bounds

    Authors: Sravanti Addepalli, Samyak Jain, Gaurang Sriramanan, R. Venkatesh Babu

    Abstract: The vulnerability of Deep Neural Networks to Adversarial Attacks has fuelled research towards building robust models. While most Adversarial Training algorithms aim at defending against attacks constrained within low magnitude Lp norm bounds, real-world adversaries are not limited by such constraints. In this work, we aim to achieve adversarial robustness within larger bounds, against perturbations that m…

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: ECCV 2022

  39. arXiv:2210.01360

    cs.LG

    Learning an Invertible Output Mapping Can Mitigate Simplicity Bias in Neural Networks

    Authors: Sravanti Addepalli, Anshul Nasery, R. Venkatesh Babu, Praneeth Netrapalli, Prateek Jain

    Abstract: Deep Neural Networks are known to be brittle to even minor distribution shifts compared to the training distribution. While one line of work has demonstrated that Simplicity Bias (SB) of DNNs - bias towards learning only the simplest features - is a key reason for this brittleness, another recent line of work has surprisingly found that diverse/ complex features are indeed learned by the backbone,…

    Submitted 4 October, 2022; originally announced October 2022.

  40. Skip Training for Multi-Agent Reinforcement Learning Controller for Industrial Wave Energy Converters

    Authors: Soumyendu Sarkar, Vineet Gundecha, Sahand Ghorbanpour, Alexander Shmakov, Ashwin Ramesh Babu, Alexandre Pichard, Mathieu Cocho

    Abstract: Recent Wave Energy Converters (WEC) are equipped with multiple legs and generators to maximize energy generation. Traditional controllers have shown limitations to capture complex wave patterns and the controllers must efficiently maximize the energy capture. This paper introduces a Multi-Agent Reinforcement Learning controller (MARL), which outperforms the traditionally used spring damper control…

    Submitted 12 September, 2022; originally announced September 2022.

    Comments: 2022 IEEE 18th International Conference on Automation Science and Engineering (CASE) August 20-24, 2022

    Report number: 02

    Journal ref: 2022 IEEE 18th International Conference on Automation Science and Engineering (CASE)

  41. arXiv:2208.09932  [pdf, other]

    cs.CV cs.LG eess.IV

    Improving GANs for Long-Tailed Data through Group Spectral Regularization

    Authors: Harsh Rangwani, Naman Jaswani, Tejan Karmali, Varun Jampani, R. Venkatesh Babu

    Abstract: Deep long-tailed learning aims to train useful deep networks on practical, real-world imbalanced distributions, wherein most labels of the tail classes are associated with a few samples. There has been a large body of work to train discriminative models for visual recognition on long-tailed distribution. In contrast, we aim to train conditional Generative Adversarial Networks, a class of image gen…

    Submitted 21 August, 2022; originally announced August 2022.

    Comments: ECCV 2022. Project Page: https://sites.google.com/view/gsr-eccv22

  42. arXiv:2208.03764  [pdf, other]

    cs.CV

    Hierarchical Semantic Regularization of Latent Spaces in StyleGANs

    Authors: Tejan Karmali, Rishubh Parihar, Susmit Agrawal, Harsh Rangwani, Varun Jampani, Maneesh Singh, R. Venkatesh Babu

    Abstract: Progress in GANs has enabled the generation of high-resolution photorealistic images of astonishing quality. StyleGANs allow for compelling attribute modification on such images via mathematical operations on the latent style vectors in the W/W+ space that effectively modulate the rich hierarchical representations of the generator. Such operations have recently been generalized beyond mere attribu…

    Submitted 7 August, 2022; originally announced August 2022.

    Comments: ECCV 2022. Project page: https://sites.google.com/view/hsr-eccv22/

  43. arXiv:2207.13247  [pdf, other]

    cs.CV cs.LG

    Concurrent Subsidiary Supervision for Unsupervised Source-Free Domain Adaptation

    Authors: Jogendra Nath Kundu, Suvaansh Bhambri, Akshay Kulkarni, Hiran Sarkar, Varun Jampani, R. Venkatesh Babu

    Abstract: The prime challenge in unsupervised domain adaptation (DA) is to mitigate the domain shift between the source and target domains. Prior DA works show that pretext tasks could be used to mitigate this domain shift by learning domain invariant representations. However, in practice, we find that most existing pretext tasks are ineffective against other established techniques. Thus, we theoretically a…

    Submitted 26 July, 2022; originally announced July 2022.

    Comments: ECCV 2022. Project page: https://sites.google.com/view/sticker-sfda

  44. Everything is There in Latent Space: Attribute Editing and Attribute Style Manipulation by StyleGAN Latent Space Exploration

    Authors: Rishubh Parihar, Ankit Dhiman, Tejan Karmali, R. Venkatesh Babu

    Abstract: Unconstrained Image generation with high realism is now possible using recent Generative Adversarial Networks (GANs). However, it is quite challenging to generate images with a given set of attributes. Recent methods use style-based GAN models to perform image editing by leveraging the semantic hierarchy present in the layers of the generator. We present Few-shot Latent-based Attribute Manipulatio…

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: Project page: https://sites.google.com/view/flamelatentediting

  45. arXiv:2207.01229  [pdf, other]

    cs.CV

    Segmentation Guided Deep HDR Deghosting

    Authors: K. Ram Prabhakar, Susmit Agrawal, R. Venkatesh Babu

    Abstract: We present a motion segmentation guided convolutional neural network (CNN) approach for high dynamic range (HDR) image deghosting. First, we segment the moving regions in the input sequence using a CNN. Then, we merge static and moving regions separately with different fusion networks and combine fused features to generate the final ghost-free HDR image. Our motion segmentation guided HDR fusion a…

    Submitted 4 July, 2022; originally announced July 2022.

  46. arXiv:2206.08213  [pdf, other]

    cs.LG cs.CV

    A Closer Look at Smoothness in Domain Adversarial Training

    Authors: Harsh Rangwani, Sumukh K Aithal, Mayank Mishra, Arihant Jain, R. Venkatesh Babu

    Abstract: Domain adversarial training has been ubiquitous for achieving invariant representations and is used widely for various domain adaptation tasks. In recent times, methods converging to smooth optima have shown improved generalization for supervised learning tasks like classification. In this work, we analyze the effect of smoothness enhancing formulations on domain adversarial training, the objectiv…

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: ICML 2022. Code: https://github.com/val-iisc/SDAT

  47. arXiv:2206.08009  [pdf, other]

    cs.CV cs.LG

    Balancing Discriminability and Transferability for Source-Free Domain Adaptation

    Authors: Jogendra Nath Kundu, Akshay Kulkarni, Suvaansh Bhambri, Deepesh Mehta, Shreyas Kulkarni, Varun Jampani, R. Venkatesh Babu

    Abstract: Conventional domain adaptation (DA) techniques aim to improve domain transferability by learning domain-invariant representations; while concurrently preserving the task-discriminability knowledge gathered from the labeled source data. However, the requirement of simultaneous access to labeled source and unlabeled target renders them unsuitable for the challenging source-free DA setting. The trivi…

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: ICML 2022. Project page: https://sites.google.com/view/mixup-sfda

  48. arXiv:2204.11022  [pdf, other]

    cs.CR cs.CV

    Towards Data-Free Model Stealing in a Hard Label Setting

    Authors: Sunandini Sanyal, Sravanti Addepalli, R. Venkatesh Babu

    Abstract: Machine learning models deployed as a service (MLaaS) are susceptible to model stealing attacks, where an adversary attempts to steal the model within a restricted access framework. While existing attacks demonstrate near-perfect clone-model performance using softmax predictions of the classification network, most of the APIs allow access to only the top-1 labels. In this work, we show that it is…

    Submitted 23 April, 2022; originally announced April 2022.

    Comments: CVPR 2022, Project Page: https://sites.google.com/view/dfms-hl

  49. arXiv:2204.02958  [pdf, other]

    cs.CV

    LEAD: Self-Supervised Landmark Estimation by Aligning Distributions of Feature Similarity

    Authors: Tejan Karmali, Abhinav Atrishi, Sai Sree Harsha, Susmit Agrawal, Varun Jampani, R. Venkatesh Babu

    Abstract: In this work, we introduce LEAD, an approach to discover landmarks from an unannotated collection of category-specific images. Existing works in self-supervised landmark detection are based on learning dense (pixel-level) feature representations from an image, which are further used to learn landmarks in a semi-supervised manner. While there have been advances in self-supervised learning of image…

    Submitted 6 April, 2022; originally announced April 2022.

    Comments: WACV 2022. Project Page at http://sites.google.com/view/lead-wacv22

  50. arXiv:2204.01971  [pdf, other]

    cs.CV cs.AI

    Non-Local Latent Relation Distillation for Self-Adaptive 3D Human Pose Estimation

    Authors: Jogendra Nath Kundu, Siddharth Seth, Anirudh Jamkhandi, Pradyumna YM, Varun Jampani, Anirban Chakraborty, R. Venkatesh Babu

    Abstract: Available 3D human pose estimation approaches leverage different forms of strong (2D/3D pose) or weak (multi-view or depth) paired supervision. Barring synthetic or in-studio domains, acquiring such supervision for each new target environment is highly inconvenient. To this end, we cast 3D pose learning as a self-supervised adaptation problem that aims to transfer the task knowledge from a labeled…

    Submitted 6 April, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

    Comments: NeurIPS 2021. Project page: https://sites.google.com/view/sa3dhp