Zum Hauptinhalt springen

Showing 1–23 of 23 results for author: Karaletsos, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.00809  [pdf, other

    cs.LG stat.ML

    Position: Bayesian Deep Learning is Needed in the Age of Large-Scale AI

    Authors: Theodore Papamarkou, Maria Skoularidou, Konstantina Palla, Laurence Aitchison, Julyan Arbel, David Dunson, Maurizio Filippone, Vincent Fortuin, Philipp Hennig, José Miguel Hernández-Lobato, Aliaksandr Hubin, Alexander Immer, Theofanis Karaletsos, Mohammad Emtiyaz Khan, Agustinus Kristiadi, Yingzhen Li, Stephan Mandt, Christopher Nemeth, Michael A. Osborne, Tim G. J. Rudner, David Rügamer, Yee Whye Teh, Max Welling, Andrew Gordon Wilson, Ruqi Zhang

    Abstract: In the current landscape of deep learning research, there is a predominant emphasis on achieving high predictive accuracy in supervised tasks involving large image and language datasets. However, a broader perspective reveals a multitude of overlooked metrics, tasks, and data types, such as uncertainty, active and continual learning, and scientific data, that demand attention. Bayesian deep learni… ▽ More

    Submitted 6 August, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024

  2. arXiv:2311.02794  [pdf, other

    stat.ML cs.AI cs.LG q-bio.QM

    Modelling Cellular Perturbations with the Sparse Additive Mechanism Shift Variational Autoencoder

    Authors: Michael Bereket, Theofanis Karaletsos

    Abstract: Generative models of observations under interventions have been a vibrant topic of interest across machine learning and the sciences in recent years. For example, in drug discovery, there is a need to model the effects of diverse interventions on cells in order to characterize unknown biological mechanisms of action. We propose the Sparse Additive Mechanism Shift Variational Autoencoder, SAMS-VAE,… ▽ More

    Submitted 15 January, 2024; v1 submitted 5 November, 2023; originally announced November 2023.

    Comments: Presented at the 37th Conference on Neural Information Processing Systems (NeurIPS 2023) (Post-NeurIPS fixes: cosmetic fixes, updated references, added simulation to appendix)

  3. arXiv:2309.16108  [pdf, other

    cs.CV cs.AI cs.LG

    Channel Vision Transformers: An Image Is Worth 1 x 16 x 16 Words

    Authors: Yujia Bao, Srinivasan Sivanandan, Theofanis Karaletsos

    Abstract: Vision Transformer (ViT) has emerged as a powerful architecture in the realm of modern computer vision. However, its application in certain imaging fields, such as microscopy and satellite imaging, presents unique challenges. In these domains, images often contain multiple channels, each carrying semantically distinct and independent information. Furthermore, the model must demonstrate robustness… ▽ More

    Submitted 18 April, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

  4. arXiv:2305.19402  [pdf, other

    cs.CV cs.AI cs.CL

    Contextual Vision Transformers for Robust Representation Learning

    Authors: Yujia Bao, Theofanis Karaletsos

    Abstract: We introduce Contextual Vision Transformers (ContextViT), a method designed to generate robust image representations for datasets experiencing shifts in latent factors across various groups. Derived from the concept of in-context learning, ContextViT incorporates an additional context token to encapsulate group-specific information. This integration allows the model to adjust the image representat… ▽ More

    Submitted 28 September, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

  5. arXiv:2212.00136  [pdf, other

    q-bio.QM cs.LG

    DEL-Dock: Molecular Docking-Enabled Modeling of DNA-Encoded Libraries

    Authors: Kirill Shmilovich, Benson Chen, Theofanis Karaletsos, Mohammad M. Sultan

    Abstract: DNA-Encoded Library (DEL) technology has enabled significant advances in hit identification by enabling efficient testing of combinatorially-generated molecular libraries. DEL screens measure protein binding affinity though sequencing reads of molecules tagged with unique DNA-barcodes that survive a series of selection experiments. Computational models have been deployed to learn the latent bindin… ▽ More

    Submitted 14 December, 2022; v1 submitted 30 November, 2022; originally announced December 2022.

  6. arXiv:2211.02377  [pdf, other

    stat.ML cs.LG

    Black-box Coreset Variational Inference

    Authors: Dionysis Manousakas, Hippolyt Ritter, Theofanis Karaletsos

    Abstract: Recent advances in coreset methods have shown that a selection of representative datapoints can replace massive volumes of data for Bayesian inference, preserving the relevant statistical information and significantly accelerating subsequent downstream tasks. Existing variational coreset constructions rely on either selecting subsets of the observed datapoints, or jointly performing approximate in… ▽ More

    Submitted 15 January, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

    Comments: NeurIPS 2022

  7. arXiv:2110.00276  [pdf, other

    stat.ML cs.LG

    TyXe: Pyro-based Bayesian neural nets for Pytorch

    Authors: Hippolyt Ritter, Theofanis Karaletsos

    Abstract: We introduce TyXe, a Bayesian neural network library built on top of Pytorch and Pyro. Our leading design principle is to cleanly separate architecture, prior, inference and likelihood specification, allowing for a flexible workflow where users can quickly iterate over combinations of these components. In contrast to existing packages TyXe does not implement any layer classes, and instead relies o… ▽ More

    Submitted 1 October, 2021; originally announced October 2021.

    Comments: Previously presented at PROBPROG 2020

  8. arXiv:2106.09222  [pdf, other

    stat.ML cs.CR cs.CV cs.LG

    Localized Uncertainty Attacks

    Authors: Ousmane Amadou Dia, Theofanis Karaletsos, Caner Hazirbas, Cristian Canton Ferrer, Ilknur Kaynar Kabul, Erik Meijer

    Abstract: The susceptibility of deep learning models to adversarial perturbations has stirred renewed attention in adversarial examples resulting in a number of attacks. However, most of these attacks fail to encompass a large spectrum of adversarial perturbations that are imperceptible to humans. In this paper, we present localized uncertainty attacks, a novel class of threat models against deterministic a… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: CVPR 2021 Workshop on Adversarial Machine Learning in Computer Vision

  9. arXiv:2102.12648  [pdf, other

    stat.ML cs.AI cs.LG

    Stochastic Aggregation in Graph Neural Networks

    Authors: Yuanqing Wang, Theofanis Karaletsos

    Abstract: Graph neural networks (GNNs) manifest pathologies including over-smoothing and limited discriminating power as a result of suboptimally expressive aggregating mechanisms. We herein present a unifying framework for stochastic aggregation (STAG) in GNNs, where noise is (adaptively) injected into the aggregation process from the neighborhood to form node embeddings. We provide theoretical arguments t… ▽ More

    Submitted 25 February, 2021; v1 submitted 24 February, 2021; originally announced February 2021.

  10. arXiv:2006.05468  [pdf, other

    stat.ML cs.LG

    Variational Auto-Regressive Gaussian Processes for Continual Learning

    Authors: Sanyam Kapoor, Theofanis Karaletsos, Thang D. Bui

    Abstract: Through sequential construction of posteriors on observing data online, Bayes' theorem provides a natural framework for continual learning. We develop Variational Auto-Regressive Gaussian Processes (VAR-GPs), a principled posterior updating mechanism to solve sequential tasks in continual learning. By relying on sparse inducing point approximations for scalable posteriors, we propose a novel auto-… ▽ More

    Submitted 12 June, 2021; v1 submitted 9 June, 2020; originally announced June 2020.

    Comments: International Conference on Machine Learning (ICML), 2021

  11. arXiv:2002.04033  [pdf, other

    stat.ML cs.LG

    Hierarchical Gaussian Process Priors for Bayesian Neural Network Weights

    Authors: Theofanis Karaletsos, Thang D. Bui

    Abstract: Probabilistic neural networks are typically modeled with independent weight priors, which do not capture weight correlations in the prior and do not provide a parsimonious interface to express properties in function space. A desirable class of priors would represent weights compactly, capture correlations between weights, facilitate calibrated reasoning about uncertainty, and allow inclusion of pr… ▽ More

    Submitted 10 February, 2020; originally announced February 2020.

    Comments: 12 pages main paper, 13 pages appendix

  12. arXiv:2002.03072  [pdf, other

    cs.LG cs.AI stat.ML

    Generalized Hidden Parameter MDPs Transferable Model-based RL in a Handful of Trials

    Authors: Christian F. Perez, Felipe Petroski Such, Theofanis Karaletsos

    Abstract: There is broad interest in creating RL agents that can solve many (related) tasks and adapt to new tasks and environments after initial training. Model-based RL leverages learned surrogate models that describe dynamics and rewards of individual tasks, such that planning in a good surrogate can lead to good control of the true system. Rather than solving each task individually from scratch, hierarc… ▽ More

    Submitted 7 February, 2020; originally announced February 2020.

    Comments: paper presented at AAAI 2020 as oral presentation, 9 pages

  13. arXiv:1901.05906  [pdf, other

    cs.LG stat.ML

    Applying SVGD to Bayesian Neural Networks for Cyclical Time-Series Prediction and Inference

    Authors: Xinyu Hu, Paul Szerlip, Theofanis Karaletsos, Rohit Singh

    Abstract: A regression-based BNN model is proposed to predict spatiotemporal quantities like hourly rider demand with calibrated uncertainties. The main contributions of this paper are (i) A feed-forward deterministic neural network (DetNN) architecture that predicts cyclical time series data with sensitivity to anomalous forecasting events; (ii) A Bayesian framework applying SVGD to train large neural netw… ▽ More

    Submitted 17 January, 2019; originally announced January 2019.

    Comments: Third workshop on Bayesian Deep Learning (NeurIPS 2018)

  14. arXiv:1812.03399  [pdf, other

    cs.LG stat.ML

    Efficient transfer learning and online adaptation with latent variable models for continuous control

    Authors: Christian F. Perez, Felipe Petroski Such, Theofanis Karaletsos

    Abstract: Traditional model-based RL relies on hand-specified or learned models of transition dynamics of the environment. These methods are sample efficient and facilitate learning in the real world but fail to generalize to subtle variations in the underlying dynamics, e.g., due to differences in mass, friction, or actuators across robotic agents or across time. We propose using variational inference to l… ▽ More

    Submitted 8 December, 2018; originally announced December 2018.

    Comments: Presented at Continual Learning Workshop, NeurIPS 2018, Montreal, Canada. 5 pages, 4 figures

  15. arXiv:1810.09538  [pdf, other

    cs.LG cs.PL stat.ML

    Pyro: Deep Universal Probabilistic Programming

    Authors: Eli Bingham, Jonathan P. Chen, Martin Jankowiak, Fritz Obermeyer, Neeraj Pradhan, Theofanis Karaletsos, Rohit Singh, Paul Szerlip, Paul Horsfall, Noah D. Goodman

    Abstract: Pyro is a probabilistic programming language built on Python as a platform for developing advanced probabilistic models in AI research. To scale to large datasets and high-dimensional models, Pyro uses stochastic variational inference algorithms and probability distributions built on top of PyTorch, a modern GPU-accelerated deep learning framework. To accommodate complex or model-specific algorith… ▽ More

    Submitted 18 October, 2018; originally announced October 2018.

    Comments: Submitted to JMLR MLOSS track

  16. arXiv:1810.00555  [pdf, other

    stat.ML cs.AI cs.LG

    Probabilistic Meta-Representations Of Neural Networks

    Authors: Theofanis Karaletsos, Peter Dayan, Zoubin Ghahramani

    Abstract: Existing Bayesian treatments of neural networks are typically characterized by weak prior and approximate posterior distributions according to which all the weights are drawn independently. Here, we consider a richer prior distribution in which units in the network are represented by latent variables, and the weights between units are drawn conditionally on the values of the collection of those va… ▽ More

    Submitted 1 October, 2018; originally announced October 2018.

    Comments: presented at UAI 2018 Uncertainty In Deep Learning Workshop (UDL AUG. 2018)

  17. arXiv:1806.01856  [pdf, other

    stat.ML cs.LG

    Pathwise Derivatives for Multivariate Distributions

    Authors: Martin Jankowiak, Theofanis Karaletsos

    Abstract: We exploit the link between the transport equation and derivatives of expectations to construct efficient pathwise gradient estimators for multivariate distributions. We focus on two main threads. First, we use null solutions of the transport equation to construct adaptive control variates that can be used to construct gradient estimators with reduced variance. Second, we consider the case of mult… ▽ More

    Submitted 22 March, 2019; v1 submitted 5 June, 2018; originally announced June 2018.

    Comments: To appear at AISTATS 2019; 16 pages

  18. arXiv:1805.09294  [pdf, other

    stat.ML cs.LG

    Likelihood-free inference with emulator networks

    Authors: Jan-Matthis Lueckmann, Giacomo Bassetto, Theofanis Karaletsos, Jakob H. Macke

    Abstract: Approximate Bayesian Computation (ABC) provides methods for Bayesian inference in simulation-based stochastic models which do not permit tractable likelihoods. We present a new ABC method which uses probabilistic neural emulator networks to learn synthetic likelihoods on simulated data -- both local emulators which approximate the likelihood for specific observed data, as well as global ones which… ▽ More

    Submitted 20 May, 2019; v1 submitted 23 May, 2018; originally announced May 2018.

    Comments: In Advances in Approximate Bayesian Inference (AABI 2018)

    Journal ref: PMLR 96:32-53, 2019

  19. arXiv:1612.05048  [pdf, other

    stat.ML cs.AI

    Adversarial Message Passing For Graphical Models

    Authors: Theofanis Karaletsos

    Abstract: Bayesian inference on structured models typically relies on the ability to infer posterior distributions of underlying hidden variables. However, inference in implicit models or complex posterior distributions is hard. A popular tool for learning implicit models are generative adversarial networks (GANs) which learn parameters of generators by fooling discriminators. Typically, GANs are considered… ▽ More

    Submitted 15 December, 2016; originally announced December 2016.

    Comments: (12 pages, 2 figures) Presented at NIPS Advances In Approximate Inference 2016 (AABI 2016)

  20. arXiv:1603.07810  [pdf, other

    cs.CV cs.AI cs.LG

    Conditional Similarity Networks

    Authors: Andreas Veit, Serge Belongie, Theofanis Karaletsos

    Abstract: What makes images similar? To measure the similarity between images, they are typically embedded in a feature-vector space, in which their distance preserve the relative dissimilarity. However, when learning such similarity embeddings the simplifying assumption is commonly made that images are only compared to one unique measure of similarity. A main reason for this is that contradicting notions o… ▽ More

    Submitted 10 April, 2017; v1 submitted 24 March, 2016; originally announced March 2016.

    Comments: CVPR 2017

  21. arXiv:1602.03551  [pdf, other

    cs.CL stat.AP

    Knowledge Transfer with Medical Language Embeddings

    Authors: Stephanie L. Hyland, Theofanis Karaletsos, Gunnar Rätsch

    Abstract: Identifying relationships between concepts is a key aspect of scientific knowledge synthesis. Finding these links often requires a researcher to laboriously search through scien- tific papers and databases, as the size of these resources grows ever larger. In this paper we describe how distributional semantics can be used to unify structured knowledge graphs with unstructured text to predict new r… ▽ More

    Submitted 10 February, 2016; originally announced February 2016.

    Comments: 6 pages, 2 figures, to appear at SDM-DMMH 2016

  22. arXiv:1510.00259  [pdf, other

    cs.CL cs.LG stat.ML

    A Generative Model of Words and Relationships from Multiple Sources

    Authors: Stephanie L. Hyland, Theofanis Karaletsos, Gunnar Rätsch

    Abstract: Neural language models are a powerful tool to embed words into semantic vector spaces. However, learning such models generally relies on the availability of abundant and diverse training examples. In highly specialised domains this requirement may not be met due to difficulties in obtaining a large corpus, or the limited range of expression in average use. Such domains may encode prior knowledge a… ▽ More

    Submitted 3 December, 2015; v1 submitted 1 October, 2015; originally announced October 2015.

    Comments: 8 pages, 5 figures; incorporated feedback from reviewers; to appear in Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence 2016

  23. arXiv:1506.05011  [pdf, other

    stat.ML cs.CV cs.LG

    Bayesian representation learning with oracle constraints

    Authors: Theofanis Karaletsos, Serge Belongie, Gunnar Rätsch

    Abstract: Representation learning systems typically rely on massive amounts of labeled data in order to be trained to high accuracy. Recently, high-dimensional parametric models like neural networks have succeeded in building rich representations using either compressive, reconstructive or supervised criteria. However, the semantic structure inherent in observations is oftentimes lost in the process. Human… ▽ More

    Submitted 1 March, 2016; v1 submitted 16 June, 2015; originally announced June 2015.

    Comments: 16 pages, publishes in ICLR 16