Showing 1–9 of 9 results for author: Chopin, N

Searching in archive cs.
  1. arXiv:2405.14335

    stat.ML cs.LG

    Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and Learning

    Authors: Otmane Sakhi, Imad Aouali, Pierre Alquier, Nicolas Chopin

    Abstract: This work investigates the offline formulation of the contextual bandit problem, where the goal is to leverage past interactions collected under a behavior policy to evaluate, select, and learn new, potentially better-performing, policies. Motivated by critical applications, we move beyond point estimators. Instead, we adopt the principle of pessimism where we construct upper bounds that assess a…

    Submitted 23 May, 2024; originally announced May 2024.

  2. arXiv:2308.01566

    cs.LG cs.IR stat.ML

    Fast Slate Policy Optimization: Going Beyond Plackett-Luce

    Authors: Otmane Sakhi, David Rohde, Nicolas Chopin

    Abstract: An increasingly important building block of large scale machine learning systems is based on returning slates: ordered lists of items given a query. Applications of this technology include search, information retrieval and recommender systems. When the action space is large, decision systems are restricted to a particular structure to complete online queries quickly. This paper addresses the o…

    Submitted 29 December, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

    Comments: Transactions on Machine Learning Research

  3. arXiv:2306.15422

    stat.CO cs.DC stat.ME

    Debiasing Piecewise Deterministic Markov Process samplers using couplings

    Authors: Adrien Corenflos, Matthew Sutton, Nicolas Chopin

    Abstract: Monte Carlo methods - such as Markov chain Monte Carlo (MCMC) and piecewise deterministic Markov process (PDMP) samplers - provide asymptotically exact estimators of expectations under a target distribution. There is growing interest in alternatives to this asymptotic regime, in particular in constructing estimators that are exact in the limit of an infinite amount of computing processors, rather…

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: 30 pages, 3 figures. This is a preliminary version which does not include all the experiments

  4. arXiv:2210.13132

    stat.ML cs.LG

    PAC-Bayesian Offline Contextual Bandits With Guarantees

    Authors: Otmane Sakhi, Pierre Alquier, Nicolas Chopin

    Abstract: This paper introduces a new principled approach for off-policy learning in contextual bandits. Unlike previous work, our approach does not derive learning principles from intractable or loose bounds. We analyse the problem through the PAC-Bayesian lens, interpreting policies as mixtures of decision rules. This allows us to propose novel generalization bounds and provide tractable algorithms to opt…

    Submitted 27 May, 2023; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: Accepted to ICML 2023

  5. arXiv:2206.03369

    stat.ML cs.LG eess.SP stat.CO

    Computational Doob's h-transforms for Online Filtering of Discretely Observed Diffusions

    Authors: Nicolas Chopin, Andras Fulop, Jeremy Heng, Alexandre H. Thiery

    Abstract: This paper is concerned with online filtering of discretely observed nonlinear diffusion processes. Our approach is based on the fully adapted auxiliary particle filter, which involves Doob's $h$-transforms that are typically intractable. We propose a computational framework to approximate these $h$-transforms by solving the underlying backward Kolmogorov equations using nonlinear Feynman-Kac form…

    Submitted 30 May, 2023; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: 20 pages

    Journal ref: ICML 2023

  6. arXiv:2202.02264

    stat.CO cs.DC stat.ML

    De-Sequentialized Monte Carlo: a parallel-in-time particle smoother

    Authors: Adrien Corenflos, Nicolas Chopin, Simo Särkkä

    Abstract: Particle smoothers are SMC (Sequential Monte Carlo) algorithms designed to approximate the joint distribution of the states given observations from a state-space model. We propose dSMC (de-Sequentialized Monte Carlo), a new particle smoother that is able to process $T$ observations in $\mathcal{O}(\log T)$ time on parallel architecture. This compares favourably with standard particle smoothers, th…

    Submitted 4 February, 2022; originally announced February 2022.

    Comments: 31 pages, 6 figures

  7. arXiv:1702.00564

    stat.AP cs.CL stat.ME stat.ML

    Modelling dependency completion in sentence comprehension as a Bayesian hierarchical mixture process: A case study involving Chinese relative clauses

    Authors: Shravan Vasishth, Nicolas Chopin, Robin Ryder, Bruno Nicenboim

    Abstract: We present a case-study demonstrating the usefulness of Bayesian hierarchical mixture modelling for investigating cognitive processes. In sentence comprehension, it is widely assumed that the distance between linguistic co-dependents affects the latency of dependency resolution: the longer the distance, the longer the retrieval time (the distance-based account). An alternative theory, direct-acces…

    Submitted 5 May, 2017; v1 submitted 2 February, 2017; originally announced February 2017.

    Comments: 6 pages, 2 figures. To appear in the Proceedings of the Cognitive Science Conference 2017, London, UK

  8. arXiv:1211.5901

    stat.ML cs.LG stat.CO

    Bayesian learning of noisy Markov decision processes

    Authors: Sumeetpal S. Singh, Nicolas Chopin, Nick Whiteley

    Abstract: We consider the inverse reinforcement learning problem, that is, the problem of learning from, and then predicting or mimicking a controller based on state/action data. We propose a statistical model for such data, derived from the structure of a Markov decision process. Adopting a Bayesian approach to inference, we show how latent variables of the model can be estimated, and how predictions about…

    Submitted 26 November, 2012; originally announced November 2012.

  9. arXiv:1205.4304

    stat.OT cs.DL physics.soc-ph

    In praise of the referee

    Authors: Nicolas Chopin, Andrew Gelman, Kerrie L. Mengersen, Christian P. Robert

    Abstract: There has been a lively debate in many fields, including statistics and related applied fields such as psychology and biomedical research, on possible reforms of the scholarly publishing system. Currently, referees contribute so much to improve scientific papers, both directly through constructive criticism and indirectly through the threat of rejection. We discuss ways in which new approaches to…

    Submitted 19 May, 2012; originally announced May 2012.

    Comments: 13 pages