Showing 1–8 of 8 results for author: Hubin, A

Searching in archive cs.
  1. arXiv:2402.00809  [pdf, other]

    cs.LG stat.ML

    Position: Bayesian Deep Learning is Needed in the Age of Large-Scale AI

    Authors: Theodore Papamarkou, Maria Skoularidou, Konstantina Palla, Laurence Aitchison, Julyan Arbel, David Dunson, Maurizio Filippone, Vincent Fortuin, Philipp Hennig, José Miguel Hernández-Lobato, Aliaksandr Hubin, Alexander Immer, Theofanis Karaletsos, Mohammad Emtiyaz Khan, Agustinus Kristiadi, Yingzhen Li, Stephan Mandt, Christopher Nemeth, Michael A. Osborne, Tim G. J. Rudner, David Rügamer, Yee Whye Teh, Max Welling, Andrew Gordon Wilson, Ruqi Zhang

    Abstract: In the current landscape of deep learning research, there is a predominant emphasis on achieving high predictive accuracy in supervised tasks involving large image and language datasets. However, a broader perspective reveals a multitude of overlooked metrics, tasks, and data types, such as uncertainty, active and continual learning, and scientific data, that demand attention. Bayesian deep learni…

    Submitted 6 August, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

    Comments: Proceedings of the 41st International Conference on Machine Learning, Vienna, Austria. PMLR 235, 2024

  2. arXiv:2305.03395  [pdf, other]

    stat.ML cs.LG stat.CO stat.ME

    Sparsifying Bayesian neural networks with latent binary variables and normalizing flows

    Authors: Lars Skaaret-Lund, Geir Storvik, Aliaksandr Hubin

    Abstract: Artificial neural networks (ANNs) are powerful machine learning methods used in many modern applications such as facial recognition, machine translation, and cancer diagnostics. A common issue with ANNs is that they usually have millions or billions of trainable parameters, and therefore tend to overfit to the training data. This is especially problematic in applications where it is important to h…

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: 24 pages, 10 figures

    MSC Class: 62-02; 62-09; 62F07; 62F15; 62J12; 62J05; 62J99; 62M05; 05A16; 60J22; 92D20; 90C27; 90C59

    ACM Class: G.1.2; G.1.6; G.2.1; G.3; I.2.0; I.2.6; I.2.8; I.5.1; I.6; I.6.4

  3. arXiv:2305.00934  [pdf, other]

    stat.ML cs.LG

    Variational Inference for Bayesian Neural Networks under Model and Parameter Uncertainty

    Authors: Aliaksandr Hubin, Geir Storvik

    Abstract: Bayesian neural networks (BNNs) have recently regained a significant amount of attention in the deep learning community due to the development of scalable approximate Bayesian inference techniques. There are several advantages of using a Bayesian approach: Parameter and prediction uncertainties become easily available, facilitating rigorous statistical analysis. Furthermore, prior knowledge can be…

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: arXiv admin note: text overlap with arXiv:1903.07594

    MSC Class: 62-02; 62-09; 62F07; 62F15; 62J12; 62J05; 62J99; 62M05; 05A16; 60J22; 92D20; 90C27; 90C59

    ACM Class: G.1.2; G.1.6; G.2.1; G.3; I.2.0; I.2.6; I.2.8; I.5.1; I.6; I.6.4

  4. skweak: Weak Supervision Made Easy for NLP

    Authors: Pierre Lison, Jeremy Barnes, Aliaksandr Hubin

    Abstract: We present skweak, a versatile, Python-based software toolkit enabling NLP developers to apply weak supervision to a wide range of NLP tasks. Weak supervision is an emerging machine learning paradigm based on a simple idea: instead of labelling data points by hand, we use labelling functions derived from domain knowledge to automatically obtain annotations for a given dataset. The resulting labels…

    Submitted 19 April, 2021; originally announced April 2021.

  5. arXiv:2004.14723  [pdf, other]

    cs.CL cs.LG stat.ML

    Named Entity Recognition without Labelled Data: A Weak Supervision Approach

    Authors: Pierre Lison, Aliaksandr Hubin, Jeremy Barnes, Samia Touileb

    Abstract: Named Entity Recognition (NER) performance often degrades rapidly when applied to target domains that differ from the texts observed during training. When in-domain labelled data is available, transfer learning techniques can be used to adapt existing NER models to the target domain. But what should one do when there is no hand-labelled data for the target domain? This paper presents a simple but…

    Submitted 30 April, 2020; originally announced April 2020.

    Comments: Accepted to ACL 2020 (long paper)

  6. arXiv:2003.02929  [pdf, ps, other]

    stat.ML cs.LG stat.CO stat.ME

    Flexible Bayesian Nonlinear Model Configuration

    Authors: Aliaksandr Hubin, Geir Storvik, Florian Frommlet

    Abstract: Regression models are used in a wide range of applications providing a powerful scientific tool for researchers from different fields. Linear, or simple parametric, models are often not sufficient to describe complex relationships between input variables and a response. Such relationships can be better described through flexible approaches such as neural networks, but this results in less interpre…

    Submitted 23 November, 2021; v1 submitted 5 March, 2020; originally announced March 2020.

    Comments: 42 pages; 18 Tables. arXiv admin note: text overlap with arXiv:1806.02160

    MSC Class: 62-02; 62-09; 62F07; 62F15; 62J12; 62J05; 62J99; 62M05; 05A16; 60J22; 92D20; 90C27; 90C59

    Journal ref: Journal of Artificial Intelligence Research (2021), Volume 72, Pages 901-942

  7. arXiv:1912.09733  [pdf, other]

    stat.ML cs.LG math.OC stat.CO

    An adaptive simulated annealing EM algorithm for inference on non-homogeneous hidden Markov models

    Authors: Aliaksandr Hubin

    Abstract: Non-homogeneous hidden Markov models (NHHMM) are a subclass of dependent mixture models used for semi-supervised learning, where both transition probabilities between the latent states and mean parameter of the probability distribution of the responses (for a given state) depend on the set of $p$ covariates. A priori we do not know which (and how) covariates influence the transition probabilities…

    Submitted 20 December, 2019; originally announced December 2019.

    Comments: 8 pages, 6 figures, 4 tables. Accepted version of the article published in AIIPCC 2019

  8. arXiv:1903.07594  [pdf, other]

    stat.ML cs.LG math.OC stat.CO stat.ME

    Combining Model and Parameter Uncertainty in Bayesian Neural Networks

    Authors: Aliaksandr Hubin, Geir Storvik

    Abstract: Bayesian neural networks (BNNs) have recently regained a significant amount of attention in the deep learning community due to the development of scalable approximate Bayesian inference techniques. There are several advantages of using a Bayesian approach: Parameter and prediction uncertainties become easily available, facilitating rigorous statistical analysis. Furthermore, prior knowledge can be incorp…

    Submitted 25 May, 2019; v1 submitted 18 March, 2019; originally announced March 2019.

    Comments: 16 pages, 8 Figures, 2 Tables

    MSC Class: 62-02; 62-09; 62F07; 62F15; 62J12; 62J05; 62J99; 62M05; 05A16; 60J22; 92D20; 90C27; 90C59

    ACM Class: G.1.2; G.1.6; G.2.1; G.3; I.2.0; I.2.6; I.2.8; I.5.1; I.6; I.6.4