Showing 1–12 of 12 results for author: Teboul, O

Searching in archive cs.
  1. arXiv:2209.03143  [pdf, other]

    cs.SD cs.LG eess.AS

    AudioLM: a Language Modeling Approach to Audio Generation

    Authors: Zalán Borsos, Raphaël Marinier, Damien Vincent, Eugene Kharitonov, Olivier Pietquin, Matt Sharifi, Dominik Roblek, Olivier Teboul, David Grangier, Marco Tagliasacchi, Neil Zeghidour

    Abstract: We introduce AudioLM, a framework for high-quality audio generation with long-term consistency. AudioLM maps the input audio to a sequence of discrete tokens and casts audio generation as a language modeling task in this representation space. We show how existing audio tokenizers provide different trade-offs between reconstruction quality and long-term structure, and we propose a hybrid tokenizati…

    Submitted 25 July, 2023; v1 submitted 7 September, 2022; originally announced September 2022.

  2. arXiv:2202.01653  [pdf, other]

    cs.LG

    Learning strides in convolutional neural networks

    Authors: Rachid Riad, Olivier Teboul, David Grangier, Neil Zeghidour

    Abstract: Convolutional neural networks typically contain several downsampling operators, such as strided convolutions or pooling layers, that progressively reduce the resolution of intermediate representations. This provides some shift-invariance while reducing the computational complexity of the whole architecture. A critical hyperparameter of such layers is their stride: the integer factor of downsamplin…

    Submitted 3 February, 2022; originally announced February 2022.

    Comments: Spotlight at ICLR 2022; open-source code available at https://github.com/google-research/diffstride

  3. arXiv:2201.12324  [pdf, other]

    cs.LG stat.ML

    Optimal Transport Tools (OTT): A JAX Toolbox for all things Wasserstein

    Authors: Marco Cuturi, Laetitia Meng-Papaxanthos, Yingtao Tian, Charlotte Bunne, Geoff Davis, Olivier Teboul

    Abstract: Optimal transport tools (OTT-JAX) is a Python toolbox that can solve optimal transport problems between point clouds and histograms. The toolbox builds on various JAX features, such as automatic and custom reverse mode differentiation, vectorization, just-in-time compilation and accelerators support. The toolbox covers elementary computations, such as the resolution of the regularized OT problem,…

    Submitted 28 January, 2022; originally announced January 2022.

    Comments: 4 pages

  4. arXiv:2105.13802  [pdf, other]

    cs.SD cs.LG eess.AS

    DIVE: End-to-end Speech Diarization via Iterative Speaker Embedding

    Authors: Neil Zeghidour, Olivier Teboul, David Grangier

    Abstract: We introduce DIVE, an end-to-end speaker diarization algorithm. Our neural algorithm presents the diarization task as an iterative process: it repeatedly builds a representation for each speaker before predicting the voice activity of each speaker conditioned on the extracted representations. This strategy intrinsically resolves the speaker ordering ambiguity without requiring the classical permut…

    Submitted 28 May, 2021; originally announced May 2021.

  5. arXiv:2103.09879  [pdf, other]

    cs.SD cs.AI eess.AS

    Self-Supervised Learning of Audio Representations from Permutations with Differentiable Ranking

    Authors: Andrew N Carr, Quentin Berthet, Mathieu Blondel, Olivier Teboul, Neil Zeghidour

    Abstract: Self-supervised pre-training using so-called "pretext" tasks has recently shown impressive performance across a wide range of modalities. In this work, we advance self-supervised learning from permutations, by pre-training a model to reorder shuffled parts of the spectrogram of an audio signal, to improve downstream classification performance. We make two main contributions. First, we overcome the…

    Submitted 17 March, 2021; originally announced March 2021.

  6. arXiv:2101.08596  [pdf, other]

    cs.SD cs.LG eess.AS

    LEAF: A Learnable Frontend for Audio Classification

    Authors: Neil Zeghidour, Olivier Teboul, Félix de Chaumont Quitry, Marco Tagliasacchi

    Abstract: Mel-filterbanks are fixed, engineered audio features which emulate human perception and have been used through the history of audio understanding up to today. However, their undeniable qualities are counterbalanced by the fundamental limitations of handmade representations. In this work we show that we can train a single learnable frontend that outperforms mel-filterbanks on a wide range of audio…

    Submitted 21 January, 2021; originally announced January 2021.

    Comments: Accepted at ICLR 2021

  7. arXiv:2004.12508  [pdf, other]

    stat.ME cs.LG stat.AP

    Noisy Adaptive Group Testing using Bayesian Sequential Experimental Design

    Authors: Marco Cuturi, Olivier Teboul, Quentin Berthet, Arnaud Doucet, Jean-Philippe Vert

    Abstract: When the infection prevalence of a disease is low, Dorfman showed 80 years ago that testing groups of people can prove more efficient than testing people individually. Our goal in this paper is to propose new group testing algorithms that can operate in a noisy setting (tests can be mistaken) to decide adaptively (looking at past results) which groups to test next, with the goal to converge to a g…

    Submitted 22 July, 2020; v1 submitted 26 April, 2020; originally announced April 2020.

    Comments: Latest version, with updated experiments, new conclusions on LBP vs SMC decoding and new approach

  8. arXiv:2002.08871  [pdf, other]

    stat.ML cs.LG

    Fast Differentiable Sorting and Ranking

    Authors: Mathieu Blondel, Olivier Teboul, Quentin Berthet, Josip Djolonga

    Abstract: The sorting operation is one of the most commonly used building blocks in computer programming. In machine learning, it is often used for robust statistics. However, seen as a function, it is piecewise linear and as a result includes many kinks where it is non-differentiable. More problematic is the related ranking operator, often used for order statistics and ranking metrics. It is a piecewise co…

    Submitted 29 June, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

    Comments: In proceedings of ICML 2020

  9. arXiv:2002.08676  [pdf, other]

    cs.LG math.OC stat.ML

    Learning with Differentiable Perturbed Optimizers

    Authors: Quentin Berthet, Mathieu Blondel, Olivier Teboul, Marco Cuturi, Jean-Philippe Vert, Francis Bach

    Abstract: Machine learning pipelines often rely on optimization procedures to make discrete decisions (e.g., sorting, picking closest neighbors, or shortest paths). Although these discrete decisions are easily computed, they break the back-propagation of computational graphs. In order to expand the scope of learning problems that can be solved in an end-to-end fashion, we propose a systematic method to tran…

    Submitted 9 June, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

  10. arXiv:2002.03229  [pdf, other]

    cs.LG stat.ML

    Supervised Quantile Normalization for Low-rank Matrix Approximation

    Authors: Marco Cuturi, Olivier Teboul, Jonathan Niles-Weed, Jean-Philippe Vert

    Abstract: Low rank matrix factorization is a fundamental building block in machine learning, used for instance to summarize gene expression profile data or word-document counts. To be robust to outliers and differences in scale across features, a matrix factorization step is usually preceded by ad-hoc feature normalization steps, such as \texttt{tf-idf} scaling or data whitening. We propose in this work to…

    Submitted 3 July, 2020; v1 submitted 8 February, 2020; originally announced February 2020.

    Comments: New version with genomics experiments

    Journal ref: ICML 2020

  11. arXiv:1907.00868  [pdf, other]

    cs.LG cs.AI stat.ML

    MULEX: Disentangling Exploitation from Exploration in Deep RL

    Authors: Lucas Beyer, Damien Vincent, Olivier Teboul, Sylvain Gelly, Matthieu Geist, Olivier Pietquin

    Abstract: An agent learning through interactions should balance its action selection process between probing the environment to discover new rewards and using the information acquired in the past to adopt useful behaviour. This trade-off is usually obtained by perturbing either the agent's actions (e.g., e-greedy or Gibbs sampling) or the agent's parameters (e.g., NoisyNet), or by modifying the reward it re…

    Submitted 1 July, 2019; originally announced July 2019.

  12. arXiv:1905.11885  [pdf, other]

    cs.LG stat.ML

    Differentiable Ranks and Sorting using Optimal Transport

    Authors: Marco Cuturi, Olivier Teboul, Jean-Philippe Vert

    Abstract: Sorting an array is a fundamental routine in machine learning, one that is used to compute rank-based statistics, cumulative distribution functions (CDFs), quantiles, or to select closest neighbors and labels. The sorting function is however piece-wise constant (the sorting permutation of a vector does not change if the entries of that vector are infinitesimally perturbed) and therefore has no gra…

    Submitted 2 November, 2019; v1 submitted 28 May, 2019; originally announced May 2019.