
Showing 1–3 of 3 results for author: Maison, L

Searching in archive cs.
  1. arXiv:2310.12688  [pdf, other]

    cs.LG cs.AI stat.ML

    Compression of Recurrent Neural Networks using Matrix Factorization

    Authors: Lucas Maison, Hélion du Mas des Bourboux, Thomas Courtat

    Abstract: Compressing neural networks is a key step when deploying models for real-time or embedded applications. Factorizing the model's matrices using low-rank approximations is a promising method for achieving compression. While it is possible to set the rank before training, this approach is neither flexible nor optimal. In this work, we propose a post-training rank-selection method called Rank-Tuning t…

    Submitted 19 October, 2023; originally announced October 2023.

  2. arXiv:2306.03773  [pdf, other]

    eess.AS cs.CL cs.LG cs.SD

    Some voices are too common: Building fair speech recognition systems using the Common Voice dataset

    Authors: Lucas Maison, Yannick Estève

    Abstract: Automatic speech recognition (ASR) systems become increasingly efficient thanks to new advances in neural network training like self-supervised learning. However, they are known to be unfair toward certain groups, for instance, people speaking with an accent. In this work, we use the French Common Voice dataset to quantify the biases of a pre-trained wav2vec 2.0 model toward several demographic gr…

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: 5 pages, 3 figures. Accepted to Interspeech 2023

  3. arXiv:2303.07924  [pdf, other]

    cs.LG cs.CL cs.SD eess.AS

    Improving Accented Speech Recognition with Multi-Domain Training

    Authors: Lucas Maison, Yannick Estève

    Abstract: Thanks to the rise of self-supervised learning, automatic speech recognition (ASR) systems now achieve near-human performance on a wide variety of datasets. However, they still lack generalization capability and are not robust to domain shifts like accent variations. In this work, we use speech audio representing four different French accents to create fine-tuning datasets that improve the robustn…

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: 5 pages, 2 figures. Accepted to ICASSP 2023