
Showing 1–16 of 16 results for author: Rout, L

  1. arXiv:2405.17401  [pdf, other]

    cs.LG cs.CV stat.ML

    RB-Modulation: Training-Free Personalization of Diffusion Models using Stochastic Optimal Control

    Authors: Litu Rout, Yujia Chen, Nataniel Ruiz, Abhishek Kumar, Constantine Caramanis, Sanjay Shakkottai, Wen-Sheng Chu

    Abstract: We propose Reference-Based Modulation (RB-Modulation), a new plug-and-play solution for training-free personalization of diffusion models. Existing training-free approaches exhibit difficulties in (a) style extraction from reference images in the absence of additional style or content text descriptions, (b) unwanted content leakage from reference style images, and (c) effective composition of styl…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Preprint. Under review

  2. arXiv:2312.00852  [pdf, other]

    cs.LG cs.CV stat.ML

    Beyond First-Order Tweedie: Solving Inverse Problems using Latent Diffusion

    Authors: Litu Rout, Yujia Chen, Abhishek Kumar, Constantine Caramanis, Sanjay Shakkottai, Wen-Sheng Chu

    Abstract: Sampling from the posterior distribution poses a major computational challenge in solving inverse problems using latent diffusion models. Common methods rely on Tweedie's first-order moments, which are known to induce a quality-limiting bias. Existing second-order approximations are impractical due to prohibitive computational costs, making standard reverse diffusion processes intractable for post…

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: Preprint

  3. arXiv:2307.00619  [pdf, other]

    cs.LG cs.AI stat.ML

    Solving Linear Inverse Problems Provably via Posterior Sampling with Latent Diffusion Models

    Authors: Litu Rout, Negin Raoof, Giannis Daras, Constantine Caramanis, Alexandros G. Dimakis, Sanjay Shakkottai

    Abstract: We present the first framework to solve linear inverse problems leveraging pre-trained latent diffusion models. Previously proposed algorithms (such as DPS and DDRM) only apply to pixel-space diffusion models. We theoretically analyze our algorithm showing provable sample recovery in a linear model setting. The algorithmic insight obtained from our analysis extends to more general settings often c…

    Submitted 2 July, 2023; originally announced July 2023.

    Comments: Preprint

  4. arXiv:2302.06570  [pdf, ps, other]

    stat.ML cs.LG math.OC

    Beyond Uniform Smoothness: A Stopped Analysis of Adaptive SGD

    Authors: Matthew Faw, Litu Rout, Constantine Caramanis, Sanjay Shakkottai

    Abstract: This work considers the problem of finding a first-order stationary point of a non-convex function with potentially unbounded smoothness constant using a stochastic gradient oracle. We focus on the class of $(L_0,L_1)$-smooth functions proposed by Zhang et al. (ICLR'20). Empirical evidence suggests that these functions more closely capture practical machine learning problems as compared to the pe…

    Submitted 13 February, 2023; originally announced February 2023.

  5. arXiv:2302.01217  [pdf, other]

    stat.ML cs.AI cs.LG math.ST

    A Theoretical Justification for Image Inpainting using Denoising Diffusion Probabilistic Models

    Authors: Litu Rout, Advait Parulekar, Constantine Caramanis, Sanjay Shakkottai

    Abstract: We provide a theoretical justification for sample recovery using diffusion based image inpainting in a linear model setting. While most inpainting algorithms require retraining with each new mask, we prove that diffusion based inpainting generalizes well to unseen masks without retraining. We analyze a recently proposed popular diffusion based inpainting algorithm called RePaint (Lugmayr et al., 2…

    Submitted 2 February, 2023; originally announced February 2023.

    Comments: 30 pages, 5 figures, 1 Table

  6. arXiv:2209.13570  [pdf, other]

    stat.ML cs.LG

    Hierarchical Sliced Wasserstein Distance

    Authors: Khai Nguyen, Tongzheng Ren, Huy Nguyen, Litu Rout, Tan Nguyen, Nhat Ho

    Abstract: Sliced Wasserstein (SW) distance has been widely used in different application scenarios since it can be scaled to a large number of supports without suffering from the curse of dimensionality. The value of sliced Wasserstein distance is the average of transportation cost between one-dimensional representations (projections) of original measures that are obtained by Radon Transform (RT). Despite i…

    Submitted 6 February, 2023; v1 submitted 27 September, 2022; originally announced September 2022.

    Comments: Accepted to ICLR 2023, 29 pages, 8 figures, 3 tables

  7. arXiv:2202.01116  [pdf, other]

    eess.IV cs.CV cs.LG

    An Optimal Transport Perspective on Unpaired Image Super-Resolution

    Authors: Milena Gazdieva, Litu Rout, Alexander Korotin, Andrey Kravchenko, Alexander Filippov, Evgeny Burnaev

    Abstract: Real-world image super-resolution (SR) tasks often do not have paired datasets, which limits the application of supervised techniques. As a result, the tasks are usually approached by unpaired techniques based on Generative Adversarial Networks (GANs), which yield complex training losses with several regularization terms, e.g., content or identity losses. We theoretically investigate optimization…

    Submitted 11 October, 2022; v1 submitted 2 February, 2022; originally announced February 2022.

  8. arXiv:2110.02999  [pdf, other]

    cs.LG

    Generative Modeling with Optimal Transport Maps

    Authors: Litu Rout, Alexander Korotin, Evgeny Burnaev

    Abstract: With the discovery of Wasserstein GANs, Optimal Transport (OT) has become a powerful tool for large-scale generative modeling tasks. In these tasks, OT cost is typically used as the loss for training GANs. In contrast to this approach, we show that the OT map itself can be used as a generative model, providing comparable performance. Previous analogous approaches consider OT maps as generative mod…

    Submitted 5 March, 2022; v1 submitted 6 October, 2021; originally announced October 2021.

    Comments: ICLR 2022

  9. arXiv:2010.00522  [pdf, other]

    cs.LG cs.CV math.OC stat.ML

    Understanding the Role of Adversarial Regularization in Supervised Learning

    Authors: Litu Rout

    Abstract: Despite numerous attempts sought to provide empirical evidence of adversarial regularization outperforming sole supervision, the theoretical understanding of such phenomena remains elusive. In this study, we aim to resolve whether adversarial regularization indeed performs better than sole supervision at a fundamental level. To bring this insight into fruition, we study vanishing gradient issue, a…

    Submitted 1 October, 2020; originally announced October 2020.

    Comments: Under Review

  10. arXiv:2010.00521  [pdf, other]

    cs.LG cs.CV stat.ML

    Why Adversarial Interaction Creates Non-Homogeneous Patterns: A Pseudo-Reaction-Diffusion Model for Turing Instability

    Authors: Litu Rout

    Abstract: Long after Turing's seminal Reaction-Diffusion (RD) model, the elegance of his fundamental equations alleviated much of the skepticism surrounding pattern formation. Though Turing model is a simplification and an idealization, it is one of the best-known theoretical models to explain patterns as a reminiscent of those observed in nature. Over the years, concerted efforts have been made to align th…

    Submitted 8 December, 2020; v1 submitted 1 October, 2020; originally announced October 2020.

    Comments: 35th AAAI Conference on Artificial Intelligence

  11. arXiv:2004.03879  [pdf, other]

    cs.CV

    Monte-Carlo Siamese Policy on Actor for Satellite Image Super Resolution

    Authors: Litu Rout, Saumyaa Shah, S Manthira Moorthi, Debajyoti Dhar

    Abstract: In the past few years supervised and adversarial learning have been widely adopted in various complex computer vision tasks. It seems natural to wonder whether another branch of artificial intelligence, commonly known as Reinforcement Learning (RL) can benefit such complex vision tasks. In this study, we explore the plausible usage of RL in super resolution of remote sensing imagery. Guided by rec…

    Submitted 8 April, 2020; originally announced April 2020.

    Comments: Computer Vision and Pattern Recognition (CVPR) Workshop on Large Scale Computer Vision for Remote Sensing Imagery

  12. arXiv:2004.03867  [pdf, other]

    eess.IV cs.CV

    S2A: Wasserstein GAN with Spatio-Spectral Laplacian Attention for Multi-Spectral Band Synthesis

    Authors: Litu Rout, Indranil Misra, S Manthira Moorthi, Debajyoti Dhar

    Abstract: Intersection of adversarial learning and satellite image processing is an emerging field in remote sensing. In this study, we intend to address synthesis of high resolution multi-spectral satellite imagery using adversarial learning. Guided by the discovery of attention mechanism, we regulate the process of band synthesis through spatio-spectral Laplacian attention. Further, we use Wasserstein GAN…

    Submitted 8 April, 2020; originally announced April 2020.

    Comments: Computer Vision and Pattern Recognition (CVPR) Workshop on Large Scale Computer Vision for Remote Sensing Imagery

  13. arXiv:1910.13993  [pdf, ps, other]

    cs.LG cs.CV stat.ML

    Is Supervised Learning With Adversarial Features Provably Better Than Sole Supervision?

    Authors: Litu Rout

    Abstract: Generative Adversarial Networks (GAN) have shown promising results on a wide variety of complex tasks. Recent experiments show adversarial training provides useful gradients to the generator that helps attain better performance. In this paper, we intend to theoretically analyze whether supervised learning with adversarial features can outperform sole supervision, or not. First, we show that superv…

    Submitted 20 April, 2020; v1 submitted 30 October, 2019; originally announced October 2019.

  14. Learning Rotation Adaptive Correlation Filters in Robust Visual Object Tracking

    Authors: Litu Rout, Priya Mariam Raju, Deepak Mishra, Rama Krishna Sai Subrahmanyam Gorthi

    Abstract: Visual object tracking is one of the major challenges in the field of computer vision. Correlation Filter (CF) trackers are one of the most widely used categories in tracking. Though numerous tracking algorithms based on CFs are available today, most of them fail to efficiently detect the object in an unconstrained environment with dynamically changing object appearance. In order to tackle such ch…

    Submitted 4 June, 2019; originally announced June 2019.

    Comments: Published in ACCV 2018

    Journal ref: ACCV: Asian Conference on Computer Vision 2018

  15. arXiv:1905.02749  [pdf, other]

    cs.CV eess.IV

    DeepSWIR: A Deep Learning Based Approach for the Synthesis of Short-Wave InfraRed Band using Multi-Sensor Concurrent Datasets

    Authors: Litu Rout, Yatharath Bhateja, Ankur Garg, Indranil Mishra, S Manthira Moorthi, Debjyoti Dhar

    Abstract: Convolutional Neural Network (CNN) is achieving remarkable progress in various computer vision tasks. In the past few years, the remote sensing community has observed Deep Neural Network (DNN) finally taking off in several challenging fields. In this study, we propose a DNN to generate a predefined High Resolution (HR) synthetic spectral band using an ensemble of concurrent Low Resolution (LR) ban…

    Submitted 7 May, 2019; originally announced May 2019.

  16. arXiv:1709.06057  [pdf, other]

    cs.CV

    Rotation Adaptive Visual Object Tracking with Motion Consistency

    Authors: Litu Rout, Sidhartha, Gorthi R. K. S. S. Manyam, Deepak Mishra

    Abstract: Visual Object tracking research has undergone significant improvement in the past few years. The emergence of tracking by detection approach in tracking paradigm has been quite successful in many ways. Recently, deep convolutional neural networks have been extensively used in most successful trackers. Yet, the standard approach has been based on correlation or feature selection with minimal consid…

    Submitted 22 November, 2017; v1 submitted 18 September, 2017; originally announced September 2017.

    Comments: Accepted conference paper WACV 2018