Zum Hauptinhalt springen

Showing 1–15 of 15 results for author: Annadani, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.03209  [pdf, other

    cs.LG cs.AI

    Challenges and Considerations in the Evaluation of Bayesian Causal Discovery

    Authors: Amir Mohammad Karimi Mamaghan, Panagiotis Tigas, Karl Henrik Johansson, Yarin Gal, Yashas Annadani, Stefan Bauer

    Abstract: Representing uncertainty in causal discovery is a crucial component for experimental design, and more broadly, for safe and reliable causal decision making. Bayesian Causal Discovery (BCD) offers a principled approach to encapsulating this uncertainty. Unlike non-Bayesian causal discovery, which relies on a single estimated causal graph and model parameters for assessment, evaluating BCD presents… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  2. arXiv:2405.16718  [pdf, other

    cs.LG cs.AI

    Amortized Active Causal Induction with Deep Reinforcement Learning

    Authors: Yashas Annadani, Panagiotis Tigas, Stefan Bauer, Adam Foster

    Abstract: We present Causal Amortized Active Structure Learning (CAASL), an active intervention design policy that can select interventions that are adaptive, real-time and that does not require access to the likelihood. This policy, an amortized network based on the transformer, is trained with reinforcement learning on a simulator of the design environment, and a reward function that measures how close th… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  3. arXiv:2307.13917  [pdf, other

    cs.LG stat.ME

    BayesDAG: Gradient-Based Posterior Inference for Causal Discovery

    Authors: Yashas Annadani, Nick Pawlowski, Joel Jennings, Stefan Bauer, Cheng Zhang, Wenbo Gong

    Abstract: Bayesian causal discovery aims to infer the posterior distribution over causal models from observed data, quantifying epistemic uncertainty and benefiting downstream tasks. However, computational challenges arise due to joint inference over combinatorial space of Directed Acyclic Graphs (DAGs) and nonlinear functions. Despite recent progress towards efficient posterior inference over DAGs, existin… ▽ More

    Submitted 8 December, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: NeurIPS 2023

  4. arXiv:2302.10607  [pdf, other

    cs.LG cs.AI stat.ME

    Differentiable Multi-Target Causal Bayesian Experimental Design

    Authors: Yashas Annadani, Panagiotis Tigas, Desi R. Ivanova, Andrew Jesson, Yarin Gal, Adam Foster, Stefan Bauer

    Abstract: We introduce a gradient-based approach for the problem of Bayesian optimal experimental design to learn causal models in a batch setting -- a critical component for causal discovery from finite data where interventions can be costly or risky. Existing methods rely on greedy approximations to construct a batch of experiments while using black-box methods to optimize over a single target-state pair… ▽ More

    Submitted 2 June, 2023; v1 submitted 21 February, 2023; originally announced February 2023.

    Comments: Camera-ready version ICML 2023

  5. arXiv:2211.13715  [pdf, other

    stat.ML cs.AI cs.LG stat.ME

    Trust Your $\nabla$: Gradient-based Intervention Targeting for Causal Discovery

    Authors: Mateusz Olko, Michał Zając, Aleksandra Nowak, Nino Scherrer, Yashas Annadani, Stefan Bauer, Łukasz Kuciński, Piotr Miłoś

    Abstract: Inferring causal structure from data is a challenging task of fundamental importance in science. Observational data are often insufficient to identify a system's causal structure uniquely. While conducting interventions (i.e., experiments) can improve the identifiability, such samples are usually challenging and expensive to obtain. Hence, experimental design approaches for causal discovery aim to… ▽ More

    Submitted 3 April, 2024; v1 submitted 24 November, 2022; originally announced November 2022.

    Comments: Accepted to 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  6. arXiv:2210.13583  [pdf, other

    cs.LG cs.AI stat.ME

    Learning Latent Structural Causal Models

    Authors: Jithendaraa Subramanian, Yashas Annadani, Ivaxi Sheth, Nan Rosemary Ke, Tristan Deleu, Stefan Bauer, Derek Nowrouzezahrai, Samira Ebrahimi Kahou

    Abstract: Causal learning has long concerned itself with the accurate recovery of underlying causal mechanisms. Such causal modelling enables better explanations of out-of-distribution data. Prior works on causal learning assume that the high-level causal variables are given. However, in machine learning tasks, one often operates on low-level data like image pixels or high-dimensional vectors. In such setti… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: 21 pages, 19 figures

  7. arXiv:2207.05723  [pdf, other

    cs.LG cs.AI stat.ML

    Latent Variable Models for Bayesian Causal Discovery

    Authors: Jithendaraa Subramanian, Yashas Annadani, Ivaxi Sheth, Stefan Bauer, Derek Nowrouzezahrai, Samira Ebrahimi Kahou

    Abstract: Learning predictors that do not rely on spurious correlations involves building causal representations. However, learning such a representation is very challenging. We, therefore, formulate the problem of learning a causal representation from high dimensional data and study causal recovery with synthetic data. This work introduces a latent variable decoder model, Decoder BCD, for Bayesian causal d… ▽ More

    Submitted 10 August, 2022; v1 submitted 12 July, 2022; originally announced July 2022.

    Comments: 7 figures, Published at the ICML 2022 Workshop on Spurious Correlations, Invariance, and Stability

  8. arXiv:2203.02016  [pdf, other

    cs.LG cs.AI stat.ML

    Interventions, Where and How? Experimental Design for Causal Models at Scale

    Authors: Panagiotis Tigas, Yashas Annadani, Andrew Jesson, Bernhard Schölkopf, Yarin Gal, Stefan Bauer

    Abstract: Causal discovery from observational and interventional data is challenging due to limited data and non-identifiability: factors that introduce uncertainty in estimating the underlying structural causal model (SCM). Selecting experiments (interventions) based on the uncertainty arising from both factors can expedite the identification of the SCM. Existing methods in experimental design for causal d… ▽ More

    Submitted 21 October, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: Presented at the thirty-sixth Conference on Neural Information Processing Systems (2022)

  9. arXiv:2109.02429  [pdf, other

    stat.ML cs.LG

    Learning Neural Causal Models with Active Interventions

    Authors: Nino Scherrer, Olexa Bilaniuk, Yashas Annadani, Anirudh Goyal, Patrick Schwab, Bernhard Schölkopf, Michael C. Mozer, Yoshua Bengio, Stefan Bauer, Nan Rosemary Ke

    Abstract: Discovering causal structures from data is a challenging inference problem of fundamental importance in all areas of science. The appealing properties of neural networks have recently led to a surge of interest in differentiable neural network-based methods for learning causal structures from data. So far, differentiable causal discovery has focused on static datasets of observational or fixed int… ▽ More

    Submitted 5 March, 2022; v1 submitted 6 September, 2021; originally announced September 2021.

  10. arXiv:2106.07635  [pdf, other

    cs.LG cs.AI stat.ML

    Variational Causal Networks: Approximate Bayesian Inference over Causal Structures

    Authors: Yashas Annadani, Jonas Rothfuss, Alexandre Lacoste, Nino Scherrer, Anirudh Goyal, Yoshua Bengio, Stefan Bauer

    Abstract: Learning the causal structure that underlies data is a crucial step towards robust real-world decision making. The majority of existing work in causal inference focuses on determining a single directed acyclic graph (DAG) or a Markov equivalence class thereof. However, a crucial aspect to acting intelligently upon the knowledge about causal structure which has been inferred from finite data demand… ▽ More

    Submitted 14 June, 2021; originally announced June 2021.

    Comments: 10 pages, 6 figures

  11. arXiv:2006.07796  [pdf, other

    cs.LG cs.CV stat.ML

    Structure by Architecture: Structured Representations without Regularization

    Authors: Felix Leeb, Guilia Lanzillotta, Yashas Annadani, Michel Besserve, Stefan Bauer, Bernhard Schölkopf

    Abstract: We study the problem of self-supervised structured representation learning using autoencoders for downstream tasks such as generative modeling. Unlike most methods which rely on matching an arbitrary, relatively unstructured, prior distribution for sampling, we propose a sampling technique that relies solely on the independence of latent variables, thereby avoiding the trade-off between reconstruc… ▽ More

    Submitted 15 February, 2024; v1 submitted 14 June, 2020; originally announced June 2020.

    Comments: Published at ICLR 2023

  12. arXiv:1907.10430   

    cs.LG stat.ML

    Noise Contrastive Variational Autoencoders

    Authors: Octavian-Eugen Ganea, Yashas Annadani, Gary Bécigneul

    Abstract: We take steps towards understanding the "posterior collapse (PC)" difficulty in variational autoencoders (VAEs),~i.e. a degenerate optimum in which the latent codes become independent of their corresponding inputs. We rely on calculus of variations and theoretically explore a few popular VAE models, showing that PC always occurs for non-parametric encoders and decoders. Inspired by the popular noi… ▽ More

    Submitted 31 July, 2019; v1 submitted 23 July, 2019; originally announced July 2019.

    Comments: There is a mistake common to all the main proofs. In summary, what we find are saddle points or global maxima of the respective loss functions and not the global minima. We apologize for this

  13. arXiv:1803.03049  [pdf, other

    cs.CV

    Preserving Semantic Relations for Zero-Shot Learning

    Authors: Yashas Annadani, Soma Biswas

    Abstract: Zero-shot learning has gained popularity due to its potential to scale recognition models without requiring additional training data. This is usually achieved by associating categories with their semantic information like attributes. However, we believe that the potential offered by this paradigm is not yet fully exploited. In this work, we propose to utilize the structure of the space spanned by… ▽ More

    Submitted 8 March, 2018; originally announced March 2018.

    Comments: CVPR 2018

  14. arXiv:1611.04357  [pdf, other

    cs.CV

    Selfie Detection by Synergy-Constraint Based Convolutional Neural Network

    Authors: Yashas Annadani, Vijayakrishna Naganoor, Akshay Kumar Jagadish, Krishnan Chemmangat

    Abstract: Categorisation of huge amount of data on the multimedia platform is a crucial task. In this work, we propose a novel approach to address the subtle problem of selfie detection for image database segregation on the web, given rapid rise in number of selfies clicked. A Convolutional Neural Network (CNN) is modeled to learn a synergy feature in the common subspace of head and shoulder orientation, de… ▽ More

    Submitted 14 November, 2016; originally announced November 2016.

    Comments: 8 Pages, Accepted for Publication at IEEE SITIS 2016

  15. arXiv:1611.00218  [pdf, other

    cs.CV

    Sliding Dictionary Based Sparse Representation For Action Recognition

    Authors: Yashas Annadani, D L Rakshith, Soma Biswas

    Abstract: The task of action recognition has been in the forefront of research, given its applications in gaming, surveillance and health care. In this work, we propose a simple, yet very effective approach which works seamlessly for both offline and online action recognition using the skeletal joints. We construct a sliding dictionary which has the training data along with their time stamps. This is used t… ▽ More

    Submitted 1 November, 2016; originally announced November 2016.

    Comments: 7 Pages