Zum Hauptinhalt springen

Showing 1–33 of 33 results for author: Laviolette, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2112.04008  [pdf, ps, other

    cs.CL cs.LG

    Multinational Address Parsing: A Zero-Shot Evaluation

    Authors: Marouane Yassine, David Beauchemin, François Laviolette, Luc Lamontagne

    Abstract: Address parsing consists of identifying the segments that make up an address, such as a street name or a postal code. Because of its importance for tasks like record linkage, address parsing has been approached with many techniques, the latest relying on neural networks. While these models yield notable results, previous work on neural networks has only focused on parsing addresses from a single s… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: Accepted in the International Journal of Information Science and Technology (iJIST). arXiv admin note: text overlap with arXiv:2006.16152

  2. arXiv:2110.15137  [pdf, other

    cs.LG

    PAC-Bayesian Learning of Aggregated Binary Activated Neural Networks with Probabilities over Representations

    Authors: Louis Fortier-Dubois, Gaël Letarte, Benjamin Leblanc, François Laviolette, Pascal Germain

    Abstract: Considering a probability distribution over parameters is known as an efficient strategy to learn a neural network with non-differentiable activation functions. We study the expectation of a probabilistic neural network as a predictor by itself, focusing on the aggregation of binary activated neural networks with normal distributions over real-valued weights. Our work leverages a recent analysis d… ▽ More

    Submitted 14 April, 2023; v1 submitted 28 October, 2021; originally announced October 2021.

  3. How to Certify Machine Learning Based Safety-critical Systems? A Systematic Literature Review

    Authors: Florian Tambon, Gabriel Laberge, Le An, Amin Nikanjam, Paulina Stevia Nouwou Mindom, Yann Pequignot, Foutse Khomh, Giulio Antoniol, Ettore Merlo, François Laviolette

    Abstract: Context: Machine Learning (ML) has been at the heart of many innovations over the past years. However, including it in so-called 'safety-critical' systems such as automotive or aeronautic has proven to be very challenging, since the shift in paradigm that ML brings completely changes traditional certification approaches. Objective: This paper aims to elucidate challenges related to the certifica… ▽ More

    Submitted 1 December, 2021; v1 submitted 26 July, 2021; originally announced July 2021.

    Comments: 60 pages (92 pages with references and complements), submitted to a journal (Automated Software Engineering). Changes: Emphasizing difference traditional software engineering / ML approach. Adding Related Works, Threats to Validity and Complementary Materials. Adding a table listing papers reference for each section/subsections

    Journal ref: Autom Softw Eng 29, 38 (2022)

  4. arXiv:2010.12995  [pdf, other

    cs.LG cs.AI stat.ML

    Out-of-distribution detection for regression tasks: parameter versus predictor entropy

    Authors: Yann Pequignot, Mathieu Alain, Patrick Dallaire, Alireza Yeganehparast, Pascal Germain, Josée Desharnais, François Laviolette

    Abstract: It is crucial to detect when an instance lies downright too far from the training samples for the machine learning model to be trusted, a challenge known as out-of-distribution (OOD) detection. For neural networks, one approach to this task consists of learning a diversity of predictors that all can explain the training data. This information can be used to estimate the epistemic uncertainty at a… ▽ More

    Submitted 11 September, 2023; v1 submitted 24 October, 2020; originally announced October 2020.

  5. Leveraging Subword Embeddings for Multinational Address Parsing

    Authors: Marouane Yassine, David Beauchemin, François Laviolette, Luc Lamontagne

    Abstract: Address parsing consists of identifying the segments that make up an address such as a street name or a postal code. Because of its importance for tasks like record linkage, address parsing has been approached with many techniques. Neural network methods defined a new state-of-the-art for address parsing. While this approach yielded notable results, previous work has only focused on applying neura… ▽ More

    Submitted 2 May, 2021; v1 submitted 29 June, 2020; originally announced June 2020.

    Comments: Accepted to IEEE CiSt'20

    Journal ref: 2020 6th IEEE Congress on Information Science and Technology (CiSt)

  6. arXiv:2004.11503  [pdf, other

    cs.DM

    General Cops and Robbers Games with randomness

    Authors: Frédéric Simard, Josée Desharnais, François Laviolette

    Abstract: Cops and Robbers games have been studied for the last few decades in computer science and mathematics. As in general pursuit evasion games, pursuers (cops) seek to capture evaders (robbers); however, players move in turn and are constrained to move on a discrete structure, usually a graph, and know the exact location of their opponent. In 2017, Bonato and MacGillivray presented a general character… ▽ More

    Submitted 23 April, 2020; originally announced April 2020.

    Comments: 36 pages, submitted to the journal Theoretical Computer Science

  7. arXiv:2001.10657  [pdf, other

    cs.LG stat.ML

    The Indian Chefs Process

    Authors: Patrick Dallaire, Luca Ambrogioni, Ludovic Trottier, Umut Güçlü, Max Hinne, Philippe Giguère, Brahim Chaib-Draa, Marcel van Gerven, Francois Laviolette

    Abstract: This paper introduces the Indian Chefs Process (ICP), a Bayesian nonparametric prior on the joint space of infinite directed acyclic graphs (DAGs) and orders that generalizes Indian Buffet Processes. As our construction shows, the proposed distribution relies on a latent Beta Process controlling both the orders and outgoing connection probabilities of the nodes, and yields a probability distributi… ▽ More

    Submitted 28 January, 2020; originally announced January 2020.

  8. arXiv:1912.11037  [pdf, other

    cs.HC cs.LG eess.SP

    Unsupervised Domain Adversarial Self-Calibration for Electromyographic-based Gesture Recognition

    Authors: Ulysse Côté-Allard, Gabriel Gagnon-Turcotte, Angkoon Phinyomark, Kyrre Glette, Erik Scheme, François Laviolette, Benoit Gosselin

    Abstract: Surface electromyography (sEMG) provides an intuitive and non-invasive interface from which to control machines. However, preserving the myoelectric control system's performance over multiple days is challenging, due to the transient nature of the signals obtained with this recording technique. In practice, if the system is to remain usable, a time-consuming and periodic recalibration is necessary… ▽ More

    Submitted 9 October, 2020; v1 submitted 21 December, 2019; originally announced December 2019.

    Comments: 12 pages + 2 pages appendices. The last three authors shared senior authorship

    Journal ref: in IEEE Access, vol. 8, pp. 177941-177955, 2020

  9. arXiv:1912.09380  [pdf, other

    cs.LG cs.CV cs.HC stat.ML

    A Transferable Adaptive Domain Adversarial Neural Network for Virtual Reality Augmented EMG-Based Gesture Recognition

    Authors: Ulysse Côté-Allard, Gabriel Gagnon-Turcotte, Angkoon Phinyomark, Kyrre Glette, Erik Scheme, François Laviolette, Benoit Gosselin

    Abstract: Within the field of electromyography-based (EMG) gesture recognition, disparities exist between the offline accuracy reported in the literature and the real-time usability of a classifier. This gap mainly stems from two factors: 1) The absence of a controller, making the data collected dissimilar to actual control. 2) The difficulty of including the four main dynamic factors (gesture intensity, li… ▽ More

    Submitted 14 February, 2021; v1 submitted 16 December, 2019; originally announced December 2019.

    Comments: 10 Pages. The last three authors shared senior authorship

  10. Interpreting Deep Learning Features for Myoelectric Control: A Comparison with Handcrafted Features

    Authors: Ulysse Côté-Allard, Evan Campbell, Angkoon Phinyomark, François Laviolette, Benoit Gosselin, Erik Scheme

    Abstract: The research in myoelectric control systems primarily focuses on extracting discriminative representations from the electromyographic (EMG) signal by designing handcrafted features. Recently, deep learning techniques have been applied to the challenging task of EMG-based gesture recognition. The adoption of these techniques slowly shifts the focus from feature engineering to feature learning. Howe… ▽ More

    Submitted 20 March, 2020; v1 submitted 30 November, 2019; originally announced December 2019.

    Comments: The first two authors shared first authorship. The last three authors shared senior authorship. 32 pages

    Journal ref: Frontiers in Bioengineering and Biotechnology, 8, 158 (2020)

  11. arXiv:1905.12131  [pdf, other

    cs.LG stat.ML

    Adaptive Deep Kernel Learning

    Authors: Prudencio Tossou, Basile Dura, Francois Laviolette, Mario Marchand, Alexandre Lacoste

    Abstract: Deep kernel learning provides an elegant and principled framework for combining the structural properties of deep learning algorithms with the flexibility of kernel methods. By means of a deep neural network, we learn a parametrized kernel operator that can be combined with a differentiable kernel algorithm during inference. While previous work within this framework has focused on learning a singl… ▽ More

    Submitted 11 December, 2020; v1 submitted 28 May, 2019; originally announced May 2019.

  12. arXiv:1905.10259  [pdf, other

    cs.LG stat.ML

    Dichotomize and Generalize: PAC-Bayesian Binary Activated Deep Neural Networks

    Authors: Gaël Letarte, Pascal Germain, Benjamin Guedj, François Laviolette

    Abstract: We present a comprehensive study of multilayer neural networks with binary activation, relying on the PAC-Bayesian theory. Our contributions are twofold: (i) we develop an end-to-end framework to train a binary activated deep neural network, (ii) we provide nonvacuous PAC-Bayesian generalization bounds for binary activated deep neural networks. Our results are obtained by minimizing the expected l… ▽ More

    Submitted 4 February, 2020; v1 submitted 24 May, 2019; originally announced May 2019.

    Journal ref: NeurIPS 2019

  13. arXiv:1801.07756  [pdf, other

    cs.LG stat.ML

    Deep Learning for Electromyographic Hand Gesture Signal Classification Using Transfer Learning

    Authors: Ulysse Côté-Allard, Cheikh Latyr Fall, Alexandre Drouin, Alexandre Campeau-Lecours, Clément Gosselin, Kyrre Glette, François Laviolette, Benoit Gosselin

    Abstract: In recent years, deep learning algorithms have become increasingly more prominent for their unparalleled ability to automatically learn discriminant features from large amounts of data. However, within the field of electromyography-based gesture recognition, deep learning algorithms are seldom employed as they require an unreasonable amount of effort from a single person, to generate tens of thous… ▽ More

    Submitted 25 January, 2019; v1 submitted 10 January, 2018; originally announced January 2018.

    Comments: Source code and datasets available: https://github.com/Giguelingueling/MyoArmbandDataset

  14. arXiv:1710.04234  [pdf, other

    stat.ML cs.DS cs.LG stat.AP

    Maximum Margin Interval Trees

    Authors: Alexandre Drouin, Toby Dylan Hocking, François Laviolette

    Abstract: Learning a regression function using censored or interval-valued output data is an important problem in fields such as genomics and medicine. The goal is to learn a real-valued prediction function, and the training output labels indicate an interval of possible values. Whereas most existing algorithms for this task are linear models, in this paper we investigate learning nonlinear tree models. We… ▽ More

    Submitted 27 October, 2017; v1 submitted 11 October, 2017; originally announced October 2017.

    Comments: Accepted for presentation at the 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA

  15. arXiv:1612.01030  [pdf, other

    q-bio.GN cs.LG stat.ML

    Large scale modeling of antimicrobial resistance with interpretable classifiers

    Authors: Alexandre Drouin, Frédéric Raymond, Gaël Letarte St-Pierre, Mario Marchand, Jacques Corbeil, François Laviolette

    Abstract: Antimicrobial resistance is an important public health concern that has implications in the practice of medicine worldwide. Accurately predicting resistance phenotypes from genome sequences shows great promise in promoting better use of antimicrobial agents, by determining which antibiotics are likely to be effective in specific clinical cases. In healthcare, this would allow for the design of tre… ▽ More

    Submitted 3 December, 2016; originally announced December 2016.

    Comments: Peer-reviewed and accepted for presentation at the Machine Learning for Health Workshop, NIPS 2016, Barcelona, Spain

  16. arXiv:1506.04573  [pdf, other

    stat.ML cs.LG

    A New PAC-Bayesian Perspective on Domain Adaptation

    Authors: Pascal Germain, Amaury Habrard, François Laviolette, Emilie Morvant

    Abstract: We study the issue of PAC-Bayesian domain adaptation: We want to learn, from a source domain, a majority vote model dedicated to a target one. Our theoretical contribution brings a new perspective by deriving an upper-bound on the target risk where the distributions' divergence---expressed as a ratio---controls the trade-off between a source error measure and the target voters' disagreement. Our b… ▽ More

    Submitted 26 July, 2016; v1 submitted 15 June, 2015; originally announced June 2015.

    Comments: Published at ICML 2016

  17. arXiv:1506.02535  [pdf, ps, other

    cs.LG

    Efficient Learning of Ensembles with QuadBoost

    Authors: Louis Fortier-Dubois, François Laviolette, Mario Marchand, Louis-Emile Robitaille, Jean-Francis Roy

    Abstract: We first present a general risk bound for ensembles that depends on the Lp norm of the weighted combination of voters which can be selected from a continuous set. We then propose a boosting method, called QuadBoost, which is strongly supported by the general risk bound and has very simple rules for assigning the voters' weights. Moreover, QuadBoost exhibits a rate of decrease of its empirical erro… ▽ More

    Submitted 20 November, 2015; v1 submitted 8 June, 2015; originally announced June 2015.

    Comments: 9 pages

  18. arXiv:1505.07818  [pdf, other

    stat.ML cs.LG cs.NE

    Domain-Adversarial Training of Neural Networks

    Authors: Yaroslav Ganin, Evgeniya Ustinova, Hana Ajakan, Pascal Germain, Hugo Larochelle, François Laviolette, Mario Marchand, Victor Lempitsky

    Abstract: We introduce a new representation learning approach for domain adaptation, in which data at training and test time come from similar but different distributions. Our approach is directly inspired by the theory on domain adaptation suggesting that, for effective domain transfer to be achieved, predictions must be made based on features that cannot discriminate between the training (source) and test… ▽ More

    Submitted 26 May, 2016; v1 submitted 28 May, 2015; originally announced May 2015.

    Comments: Published in JMLR: http://jmlr.org/papers/v17/15-239.html

    Journal ref: Journal of Machine Learning Research 2016, vol. 17, p. 1-35

  19. arXiv:1505.06249  [pdf, other

    q-bio.GN cs.LG stat.ML

    Greedy Biomarker Discovery in the Genome with Applications to Antimicrobial Resistance

    Authors: Alexandre Drouin, Sébastien Giguère, Maxime Déraspe, François Laviolette, Mario Marchand, Jacques Corbeil

    Abstract: The Set Covering Machine (SCM) is a greedy learning algorithm that produces sparse classifiers. We extend the SCM for datasets that contain a huge number of features. The whole genetic material of living organisms is an example of such a case, where the number of feature exceeds 10^7. Three human pathogens were used to evaluate the performance of the SCM at predicting antimicrobial resistance. Our… ▽ More

    Submitted 22 May, 2015; originally announced May 2015.

    Comments: Peer-reviewed and accepted for an oral presentation in the Greed is Great workshop at the International Conference on Machine Learning, Lille, France, 2015

  20. arXiv:1503.08329  [pdf, other

    stat.ML cs.LG

    Risk Bounds for the Majority Vote: From a PAC-Bayesian Analysis to a Learning Algorithm

    Authors: Pascal Germain, Alexandre Lacasse, François Laviolette, Mario Marchand, Jean-Francis Roy

    Abstract: We propose an extensive analysis of the behavior of majority votes in binary classification. In particular, we introduce a risk bound for majority votes, called the C-bound, that takes into account the average quality of the voters and their average disagreement. We also propose an extensive PAC-Bayesian analysis that shows how the C-bound can be estimated from various observations contained in th… ▽ More

    Submitted 28 July, 2015; v1 submitted 28 March, 2015; originally announced March 2015.

    Comments: Published in JMLR http://jmlr.org/papers/v16/germain15a.html

    Journal ref: Journal of Machine Learning Research 2015, vol. 16, p. 787-860

  21. arXiv:1503.06944  [pdf, other

    stat.ML cs.LG

    PAC-Bayesian Theorems for Domain Adaptation with Specialization to Linear Classifiers

    Authors: Pascal Germain, Amaury Habrard, François Laviolette, Emilie Morvant

    Abstract: In this paper, we provide two main contributions in PAC-Bayesian theory for domain adaptation where the objective is to learn, from a source distribution, a well-performing majority vote on a different target distribution. On the one hand, we propose an improvement of the previous approach proposed by Germain et al. (2013), that relies on a novel distribution pseudodistance based on a disagreement… ▽ More

    Submitted 9 August, 2016; v1 submitted 24 March, 2015; originally announced March 2015.

    Comments: This report is a long version of our paper entitled A PAC-Bayesian Approach for Domain Adaptation with Specialization to Linear Classifiers published in the proceedings of the International Conference on Machine Learning (ICML) 2013. We improved our main results, extended our experiments, and proposed an extension to multisource domain adaptation

  22. arXiv:1501.03002  [pdf, ps, other

    stat.ML cs.LG

    An Improvement to the Domain Adaptation Bound in a PAC-Bayesian context

    Authors: Pascal Germain, Amaury Habrard, Francois Laviolette, Emilie Morvant

    Abstract: This paper provides a theoretical analysis of domain adaptation based on the PAC-Bayesian theory. We propose an improvement of the previous domain adaptation bound obtained by Germain et al. in two ways. We first give another generalization bound tighter and easier to interpret. Moreover, we provide a new analysis of the constant term appearing in the bound that can be of high interest for develop… ▽ More

    Submitted 13 January, 2015; originally announced January 2015.

    Comments: NIPS 2014 Workshop on Transfer and Multi-task learning: Theory Meets Practice, Dec 2014, Montr{é}al, Canada

  23. arXiv:1501.03001  [pdf, other

    stat.ML cs.LG

    On Generalizing the C-Bound to the Multiclass and Multi-label Settings

    Authors: Francois Laviolette, Emilie Morvant, Liva Ralaivola, Jean-Francis Roy

    Abstract: The C-bound, introduced in Lacasse et al., gives a tight upper bound on the risk of a binary majority vote classifier. In this work, we present a first step towards extending this work to more complex outputs, by providing generalizations of the C-bound to the multiclass and multi-label settings.

    Submitted 13 January, 2015; originally announced January 2015.

    Comments: NIPS 2014 Workshop on Representation and Learning Methods for Complex Outputs, Dec 2014, Montr{é}al, Canada

  24. arXiv:1412.4446  [pdf, other

    stat.ML cs.LG cs.NE

    Domain-Adversarial Neural Networks

    Authors: Hana Ajakan, Pascal Germain, Hugo Larochelle, François Laviolette, Mario Marchand

    Abstract: We introduce a new representation learning algorithm suited to the context of domain adaptation, in which data at training and test time come from similar but different distributions. Our algorithm is directly inspired by theory on domain adaptation suggesting that, for effective domain transfer to be achieved, predictions must be made based on a data representation that cannot discriminate betwee… ▽ More

    Submitted 9 February, 2015; v1 submitted 14 December, 2014; originally announced December 2014.

    Comments: The first version of this paper was accepted at the "Second Workshop on Transfer and Multi-Task Learning: Theory meets Practice" (NIPS 2014, Montreal, Canada). See: https://sites.google.com/site/multitaskwsnips2014/

  25. arXiv:1412.1463  [pdf, ps, other

    cs.LG cs.CE

    On the String Kernel Pre-Image Problem with Applications in Drug Discovery

    Authors: Sébastien Giguère, Amélie Rolland, François Laviolette, Mario Marchand

    Abstract: The pre-image problem has to be solved during inference by most structured output predictors. For string kernels, this problem corresponds to finding the string associated to a given input. An algorithm capable of solving or finding good approximations to this problem would have many applications in computational biology and other fields. This work uses a recent result on combinatorial optimizatio… ▽ More

    Submitted 3 December, 2014; v1 submitted 3 December, 2014; originally announced December 2014.

    Comments: Peer-reviewed and accepted for presentation at Machine Learning in Computational Biology 2014, Montréal, Québec, Canada

    ACM Class: I.2.6; K.3.2

  26. arXiv:1412.1074  [pdf, other

    q-bio.GN cs.CE cs.LG stat.ML

    Learning interpretable models of phenotypes from whole genome sequences with the Set Covering Machine

    Authors: Alexandre Drouin, Sébastien Giguère, Vladana Sagatovich, Maxime Déraspe, François Laviolette, Mario Marchand, Jacques Corbeil

    Abstract: The increased affordability of whole genome sequencing has motivated its use for phenotypic studies. We address the problem of learning interpretable models for discrete phenotypes from whole genomes. We propose a general approach that relies on the Set Covering Machine and a k-mer representation of the genomes. We show results for the problem of predicting the resistance of Pseudomonas Aeruginosa… ▽ More

    Submitted 2 December, 2014; originally announced December 2014.

    Comments: Presented at Machine Learning in Computational Biology 2014, Montréal, Québec, Canada

  27. arXiv:1402.0796  [pdf, other

    cs.LG stat.ML

    Sequential Model-Based Ensemble Optimization

    Authors: Alexandre Lacoste, Hugo Larochelle, François Laviolette, Mario Marchand

    Abstract: One of the most tedious tasks in the application of machine learning is model selection, i.e. hyperparameter selection. Fortunately, recent progress has been made in the automation of this process, through the use of sequential model-based optimization (SMBO) methods. This can be used to optimize a cross-validation performance of a learning algorithm over the value of its hyperparameters. However,… ▽ More

    Submitted 4 February, 2014; originally announced February 2014.

  28. arXiv:1212.2340  [pdf, other

    stat.ML cs.LG

    PAC-Bayesian Learning and Domain Adaptation

    Authors: Pascal Germain, Amaury Habrard, François Laviolette, Emilie Morvant

    Abstract: In machine learning, Domain Adaptation (DA) arises when the distribution gen- erating the test (target) data differs from the one generating the learning (source) data. It is well known that DA is an hard task even under strong assumptions, among which the covariate-shift where the source and target distributions diverge only in their marginals, i.e. they have the same labeling function. Another p… ▽ More

    Submitted 11 December, 2012; originally announced December 2012.

    Comments: https://sites.google.com/site/multitradeoffs2012/

    Journal ref: Multi-Trade-offs in Machine Learning, NIPS 2012 Workshop, Lake Tahoe : United States (2012)

  29. arXiv:1207.7253  [pdf, other

    q-bio.QM cs.LG q-bio.BM stat.ML

    Learning a peptide-protein binding affinity predictor with kernel ridge regression

    Authors: Sébastien Giguère, Mario Marchand, François Laviolette, Alexandre Drouin, Jacques Corbeil

    Abstract: We propose a specialized string kernel for small bio-molecules, peptides and pseudo-sequences of binding interfaces. The kernel incorporates physico-chemical properties of amino acids and elegantly generalize eight kernels, such as the Oligo, the Weighted Degree, the Blended Spectrum, and the Radial Basis Function. We provide a low complexity dynamic programming algorithm for the exact computation… ▽ More

    Submitted 31 July, 2012; originally announced July 2012.

    Comments: 22 pages, 4 figures, 5 tables

    MSC Class: 92B05 ACM Class: I.2.6; J.3; G.3; G.4; I.5.2

    Journal ref: BMC Bioinformatics 2013, 14:82

  30. arXiv:1110.6886  [pdf, other

    cs.LG cs.IT stat.ML

    PAC-Bayesian Inequalities for Martingales

    Authors: Yevgeny Seldin, François Laviolette, Nicolò Cesa-Bianchi, John Shawe-Taylor, Peter Auer

    Abstract: We present a set of high-probability inequalities that control the concentration of weighted averages of multiple (possibly uncountably many) simultaneously evolving and interdependent martingales. Our results extend the PAC-Bayesian analysis in learning theory from the i.i.d. setting to martingales opening the way for its application to importance weighted sampling, reinforcement learning, and ot… ▽ More

    Submitted 30 July, 2012; v1 submitted 31 October, 2011; originally announced October 2011.

  31. arXiv:1110.6755  [pdf, other

    cs.LG

    PAC-Bayes-Bernstein Inequality for Martingales and its Application to Multiarmed Bandits

    Authors: Yevgeny Seldin, Nicolò Cesa-Bianchi, Peter Auer, François Laviolette, John Shawe-Taylor

    Abstract: We develop a new tool for data-dependent analysis of the exploration-exploitation trade-off in learning under limited feedback. Our tool is based on two main ingredients. The first ingredient is a new concentration inequality that makes it possible to control the concentration of weighted averages of multiple (possibly uncountably many) simultaneously evolving and interdependent martingales. The s… ▽ More

    Submitted 30 January, 2012; v1 submitted 31 October, 2011; originally announced October 2011.

  32. arXiv:1105.4585  [pdf, ps, other

    cs.LG stat.ML

    PAC-Bayesian Analysis of the Exploration-Exploitation Trade-off

    Authors: Yevgeny Seldin, Nicolò Cesa-Bianchi, François Laviolette, Peter Auer, John Shawe-Taylor, Jan Peters

    Abstract: We develop a coherent framework for integrative simultaneous analysis of the exploration-exploitation and model order selection trade-offs. We improve over our preceding results on the same subject (Seldin et al., 2011) by combining PAC-Bayesian analysis with Bernstein-type inequality for martingales. Such a combination is also of independent interest for studies of multiple simultaneously evolvin… ▽ More

    Submitted 23 May, 2011; originally announced May 2011.

    Comments: On-line Trading of Exploration and Exploitation 2 - ICML-2011 workshop. http://explo.cs.ucl.ac.uk/workshop/

  33. arXiv:1105.2416  [pdf, ps, other

    cs.LG stat.ML

    PAC-Bayesian Analysis of Martingales and Multiarmed Bandits

    Authors: Yevgeny Seldin, François Laviolette, John Shawe-Taylor, Jan Peters, Peter Auer

    Abstract: We present two alternative ways to apply PAC-Bayesian analysis to sequences of dependent random variables. The first is based on a new lemma that enables to bound expectations of convex functions of certain dependent random variables by expectations of the same functions of independent Bernoulli random variables. This lemma provides an alternative tool to Hoeffding-Azuma inequality to bound concen… ▽ More

    Submitted 19 May, 2011; v1 submitted 12 May, 2011; originally announced May 2011.