Zum Hauptinhalt springen

Showing 1–8 of 8 results for author: Donato, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.09841  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Anatomy of Industrial Scale Multilingual ASR

    Authors: Francis McCann Ramirez, Luka Chkhetiani, Andrew Ehrenberg, Robert McHardy, Rami Botros, Yash Khare, Andrea Vanzo, Taufiquzzaman Peyash, Gabriel Oexle, Michael Liang, Ilya Sklyar, Enver Fakhan, Ahmed Etefy, Daniel McCrystal, Sam Flamini, Domenic Donato, Takuya Yoshioka

    Abstract: This paper describes AssemblyAI's industrial-scale automatic speech recognition (ASR) system, designed to meet the requirements of large-scale, multilingual ASR serving various application needs. Our system leverages a diverse training dataset comprising unsupervised (12.5M hours), supervised (188k hours), and pseudo-labeled (1.6M hours) data across four languages. We provide a detailed descriptio… ▽ More

    Submitted 16 April, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  2. arXiv:2404.07341  [pdf, other

    eess.AS cs.CL cs.LG cs.SD

    Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping

    Authors: Kevin Zhang, Luka Chkhetiani, Francis McCann Ramirez, Yash Khare, Andrea Vanzo, Michael Liang, Sergio Ramirez Martin, Gabriel Oexle, Ruben Bousbib, Taufiquzzaman Peyash, Michael Nguyen, Dillon Pulliam, Domenic Donato

    Abstract: This paper presents Conformer-1, an end-to-end Automatic Speech Recognition (ASR) model trained on an extensive dataset of 570k hours of speech audio data, 91% of which was acquired from publicly available sources. To achieve this, we perform Noisy Student Training after generating pseudo-labels for the unlabeled public data using a strong Conformer RNN-T baseline model. The addition of these pseu… ▽ More

    Submitted 12 April, 2024; v1 submitted 10 April, 2024; originally announced April 2024.

  3. arXiv:2207.08583  [pdf, other

    cs.CL

    MAD for Robust Reinforcement Learning in Machine Translation

    Authors: Domenic Donato, Lei Yu, Wang Ling, Chris Dyer

    Abstract: We introduce a new distributed policy gradient algorithm and show that it outperforms existing reward-aware training procedures such as REINFORCE, minimum risk training (MRT) and proximal policy optimization (PPO) in terms of training stability and generalization performance when optimizing machine translation models. Our algorithm, which we call MAD (on account of using the mean absolute deviatio… ▽ More

    Submitted 18 July, 2022; originally announced July 2022.

  4. arXiv:2202.11444  [pdf, other

    cs.CL cs.AI cs.LG

    Enabling arbitrary translation objectives with Adaptive Tree Search

    Authors: Wang Ling, Wojciech Stokowiec, Domenic Donato, Laurent Sartran, Lei Yu, Austin Matthews, Chris Dyer

    Abstract: We introduce an adaptive tree search algorithm, that can find high-scoring outputs under translation models that make no assumptions about the form or structure of the search objective. This algorithm -- a deterministic variant of Monte Carlo tree search -- enables the exploration of new kinds of models that are unencumbered by constraints imposed to make decoding tractable, such as autoregressivi… ▽ More

    Submitted 23 February, 2022; originally announced February 2022.

    Comments: 17 pages, 3 figures

  5. arXiv:2112.11446  [pdf, other

    cs.CL cs.AI

    Scaling Language Models: Methods, Analysis & Insights from Training Gopher

    Authors: Jack W. Rae, Sebastian Borgeaud, Trevor Cai, Katie Millican, Jordan Hoffmann, Francis Song, John Aslanides, Sarah Henderson, Roman Ring, Susannah Young, Eliza Rutherford, Tom Hennigan, Jacob Menick, Albin Cassirer, Richard Powell, George van den Driessche, Lisa Anne Hendricks, Maribeth Rauh, Po-Sen Huang, Amelia Glaese, Johannes Welbl, Sumanth Dathathri, Saffron Huang, Jonathan Uesato, John Mellor , et al. (55 additional authors not shown)

    Abstract: Language modelling provides a step towards intelligent communication systems by harnessing large repositories of written human knowledge to better predict and understand the world. In this paper, we present an analysis of Transformer-based language model performance across a wide range of model scales -- from models with tens of millions of parameters up to a 280 billion parameter model called Gop… ▽ More

    Submitted 21 January, 2022; v1 submitted 8 December, 2021; originally announced December 2021.

    Comments: 120 pages

  6. Diverse Pretrained Context Encodings Improve Document Translation

    Authors: Domenic Donato, Lei Yu, Chris Dyer

    Abstract: We propose a new architecture for adapting a sentence-level sequence-to-sequence transformer by incorporating multiple pretrained document context signals and assess the impact on translation performance of (1) different pretraining approaches for generating these signals, (2) the quantity of parallel data for which document context is available, and (3) conditioning on source, target, or source a… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

    Journal ref: ACL 2021 (1299-1311)

  7. arXiv:0807.4325  [pdf, ps, other

    physics.soc-ph cond-mat.stat-mech cs.IR physics.data-an

    Schroedinger-like PageRank equation and localization in the WWW

    Authors: Nicola Perra, Vinko Zlatic, Alessandro Chessa, Claudio Conti, Debora Donato, Guido Caldarelli

    Abstract: The WorldWide Web is one of the most important communication systems we use in our everyday life. Despite its central role, the growth and the development of the WWW is not controlled by any central authority. This situation has created a huge ensemble of connections whose complexity can be fruitfully described and quantified by network theory. One important application that allows to sort out t… ▽ More

    Submitted 28 July, 2008; originally announced July 2008.

    Comments: 5 pages

  8. Preferential attachment in the growth of social networks: the case of Wikipedia

    Authors: A. Capocci, V. D. P. Servedio, F. Colaiori, L. S. Buriol, D. Donato, S. Leonardi, G. Caldarelli

    Abstract: We present an analysis of the statistical properties and growth of the free on-line encyclopedia Wikipedia. By describing topics by vertices and hyperlinks between them as edges, we can represent this encyclopedia as a directed graph. The topological properties of this graph are in close analogy with that of the World Wide Web, despite the very different growth mechanism. In particular we measur… ▽ More

    Submitted 27 February, 2006; v1 submitted 3 February, 2006; originally announced February 2006.

    Comments: 4 pages, 4 figures, revtex