Zum Hauptinhalt springen

Showing 1–4 of 4 results for author: Dewan, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.13735  [pdf, other

    cs.CV cs.LG

    StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images

    Authors: Rushikesh Zawar, Shaurya Dewan, Andrew F. Luo, Margaret M. Henderson, Michael J. Tarr, Leila Wehbe

    Abstract: Understanding the semantics of visual scenes is a fundamental challenge in Computer Vision. A key aspect of this challenge is that objects sharing similar semantic meanings or functions can exhibit striking visual differences, making accurate identification and categorization difficult. Recent advancements in text-to-image frameworks have led to models that implicitly capture natural scene statist… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Dataset website: https://stablesemantics.github.io/StableSemantics

  2. arXiv:2406.05191  [pdf, other

    cs.CV

    DiffusionPID: Interpreting Diffusion via Partial Information Decomposition

    Authors: Shaurya Dewan, Rushikesh Zawar, Prakanshul Saxena, Yingshan Chang, Andrew Luo, Yonatan Bisk

    Abstract: Text-to-image diffusion models have made significant progress in generating naturalistic images from textual inputs, and demonstrate the capacity to learn and represent complex visual-semantic relationships. While these diffusion models have achieved remarkable success, the underlying mechanisms driving their performance are not yet fully accounted for, with many unanswered questions surrounding w… ▽ More

    Submitted 12 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

  3. arXiv:2401.04198  [pdf, other

    cs.LG cs.AI

    Curiosity & Entropy Driven Unsupervised RL in Multiple Environments

    Authors: Shaurya Dewan, Anisha Jain, Zoe LaLena, Lifan Yu

    Abstract: The authors of 'Unsupervised Reinforcement Learning in Multiple environments' propose a method, alpha-MEPOL, to tackle unsupervised RL across multiple environments. They pre-train a task-agnostic exploration policy using interactions from an entire environment class and then fine-tune this policy for various tasks using supervision. We expanded upon this work, with the goal of improving performanc… ▽ More

    Submitted 8 January, 2024; originally announced January 2024.

  4. arXiv:2212.02493  [pdf, other

    cs.CV

    Canonical Fields: Self-Supervised Learning of Pose-Canonicalized Neural Fields

    Authors: Rohith Agaram, Shaurya Dewan, Rahul Sajnani, Adrien Poulenard, Madhava Krishna, Srinath Sridhar

    Abstract: Coordinate-based implicit neural networks, or neural fields, have emerged as useful representations of shape and appearance in 3D computer vision. Despite advances, however, it remains challenging to build neural fields for categories of objects without datasets like ShapeNet that provide "canonicalized" object instances that are consistently aligned for their 3D position and orientation (pose). W… ▽ More

    Submitted 17 May, 2023; v1 submitted 5 December, 2022; originally announced December 2022.