Zum Hauptinhalt springen

Showing 1–18 of 18 results for author: Ponti, M A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.05240  [pdf, other

    cs.LG

    Decoupling Decision-Making in Fraud Prevention through Classifier Calibration for Business Logic Action

    Authors: Emanuele Luzio, Moacir Antonelli Ponti, Christian Ramirez Arevalo, Luis Argerich

    Abstract: Machine learning models typically focus on specific targets like creating classifiers, often based on known population feature distributions in a business context. However, models calculating individual features adapt over time to improve precision, introducing the concept of decoupling: shifting from point evaluation to data distribution. We use calibration strategies as strategy for decoupling m… ▽ More

    Submitted 21 February, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Journal ref: Long version of the paper of ACM-SAC 2024

  2. arXiv:2312.11240  [pdf, other

    cs.SD eess.AS

    Evaluation of Barlow Twins and VICReg self-supervised learning for sound patterns of bird and anuran species

    Authors: Fábio Felix Dias, Moacir Antonelli Ponti, Mílton Cezar Ribeiro, Rosane Minghim

    Abstract: Taking advantage of the structure of large datasets to pre-train Deep Learning models is a promising strategy to decrease the need for supervised data. Self-supervised learning methods, such as contrastive and its variation are a promising way towards obtaining better representations in many Deep Learning applications. Soundscape ecology is one application in which annotations are expensive and sc… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: 10 pages, 2 figures, 3 tables

  3. arXiv:2311.16894  [pdf, other

    cs.LG cs.CV

    Dendrogram distance: an evaluation metric for generative networks using hierarchical clustering

    Authors: Gustavo Sutter Carvalho, Moacir Antonelli Ponti

    Abstract: We present a novel metric for generative modeling evaluation, focusing primarily on generative networks. The method uses dendrograms to represent real and fake data, allowing for the divergence between training and generated samples to be computed. This metric focus on mode collapse, targeting generators that are not able to capture all modes in the training set. To evaluate the proposed method it… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

  4. arXiv:2303.16769  [pdf, other

    cs.CV

    Sketch-an-Anchor: Sub-epoch Fast Model Adaptation for Zero-shot Sketch-based Image Retrieval

    Authors: Leo Sampaio Ferraz Ribeiro, Moacir Antonelli Ponti

    Abstract: Sketch-an-Anchor is a novel method to train state-of-the-art Zero-shot Sketch-based Image Retrieval (ZSSBIR) models in under an epoch. Most studies break down the problem of ZSSBIR into two parts: domain alignment between images and sketches, inherited from SBIR, and generalization to unseen data, inherent to the zero-shot protocol. We argue one of these problems can be considerably simplified and… ▽ More

    Submitted 29 March, 2023; originally announced March 2023.

  5. arXiv:2210.11327  [pdf, other

    cs.LG stat.ML

    Improving Data Quality with Training Dynamics of Gradient Boosting Decision Trees

    Authors: Moacir Antonelli Ponti, Lucas de Angelis Oliveira, Mathias Esteban, Valentina Garcia, Juan Martín Román, Luis Argerich

    Abstract: Real world datasets contain incorrectly labeled instances that hamper the performance of the model and, in particular, the ability to generalize out of distribution. Also, each example might have different contribution towards learning. This motivates studies to better understanding of the role of data instances with respect to their contribution in good metrics in models. In this paper we propose… ▽ More

    Submitted 22 February, 2024; v1 submitted 20 October, 2022; originally announced October 2022.

  6. arXiv:2204.00618  [pdf, other

    eess.AS cs.CL cs.SD

    ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversion

    Authors: Edresson Casanova, Christopher Shulby, Alexander Korolev, Arnaldo Candido Junior, Anderson da Silva Soares, Sandra Aluísio, Moacir Antonelli Ponti

    Abstract: We explore cross-lingual multi-speaker speech synthesis and cross-lingual voice conversion applied to data augmentation for automatic speech recognition (ASR) systems in low/medium-resource scenarios. Through extensive experiments, we show that our approach permits the application of speech synthesis and voice conversion to improve ASR systems using only one target-language speaker during model tr… ▽ More

    Submitted 20 May, 2023; v1 submitted 29 March, 2022; originally announced April 2022.

    Comments: This paper was accepted at INTERSPEECH 2023

  7. arXiv:2201.02099  [pdf, other

    cs.SD cs.MM eess.AS

    Implementing simple spectral denoising for environmental audio recordings

    Authors: Fábio Felix Dias, Moacir Antonelli Ponti, Rosane Minghim

    Abstract: This technical report details changes applied to a noise filter to facilitate its application and improve its results. The filter is applied to denoise natural sounds recorded in the wild and to generate an acoustic index used in soundscape analysis.

    Submitted 6 January, 2022; originally announced January 2022.

  8. arXiv:2112.02418  [pdf, other

    cs.SD cs.CL eess.AS

    YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone

    Authors: Edresson Casanova, Julian Weber, Christopher Shulby, Arnaldo Candido Junior, Eren Gölge, Moacir Antonelli Ponti

    Abstract: YourTTS brings the power of a multilingual approach to the task of zero-shot multi-speaker TTS. Our method builds upon the VITS model and adds several novel modifications for zero-shot multi-speaker and multilingual training. We achieved state-of-the-art (SOTA) results in zero-shot multi-speaker TTS and results comparable to SOTA in zero-shot voice conversion on the VCTK dataset. Additionally, our… ▽ More

    Submitted 30 April, 2023; v1 submitted 4 December, 2021; originally announced December 2021.

    Comments: An Erratum was added on the last page of this paper

    Journal ref: Proceedings of the 39th International Conference on Machine Learning, PMLR 162:2709-2720, 2022

  9. arXiv:2109.02752  [pdf, other

    cs.LG cs.CV

    Training Deep Networks from Zero to Hero: avoiding pitfalls and going beyond

    Authors: Moacir Antonelli Ponti, Fernando Pereira dos Santos, Leo Sampaio Ferraz Ribeiro, Gabriel Biscaro Cavallari

    Abstract: Training deep neural networks may be challenging in real world data. Using models as black-boxes, even with transfer learning, can result in poor generalization or inconclusive results when it comes to small datasets or specific applications. This tutorial covers the basic steps as well as more recent options to improve models, in particular, but not restricted to, supervised learning. It can be p… ▽ More

    Submitted 13 October, 2021; v1 submitted 6 September, 2021; originally announced September 2021.

    Comments: Extended version of SIBGRAPI 2021 Tutorial Paper

  10. arXiv:2104.05557  [pdf, other

    eess.AS cs.SD

    SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model

    Authors: Edresson Casanova, Christopher Shulby, Eren Gölge, Nicolas Michael Müller, Frederico Santos de Oliveira, Arnaldo Candido Junior, Anderson da Silva Soares, Sandra Maria Aluisio, Moacir Antonelli Ponti

    Abstract: In this paper, we propose SC-GlowTTS: an efficient zero-shot multi-speaker text-to-speech model that improves similarity for speakers unseen during training. We propose a speaker-conditional architecture that explores a flow-based decoder that works in a zero-shot scenario. As text encoders, we explore a dilated residual convolutional-based encoder, gated convolutional-based encoder, and transform… ▽ More

    Submitted 15 June, 2021; v1 submitted 2 April, 2021; originally announced April 2021.

    Comments: Accepted on Interspeech 2021

  11. arXiv:2005.05144  [pdf, other

    eess.AS cs.CL cs.LG

    TTS-Portuguese Corpus: a corpus for speech synthesis in Brazilian Portuguese

    Authors: Edresson Casanova, Arnaldo Candido Junior, Christopher Shulby, Frederico Santos de Oliveira, João Paulo Teixeira, Moacir Antonelli Ponti, Sandra Maria Aluisio

    Abstract: Speech provides a natural way for human-computer interaction. In particular, speech synthesis systems are popular in different applications, such as personal assistants, GPS applications, screen readers and accessibility tools. However, not all languages are on the same level when in terms of resources and systems for speech synthesis. This work consists of creating publicly available resources fo… ▽ More

    Submitted 29 January, 2022; v1 submitted 11 May, 2020; originally announced May 2020.

  12. arXiv:2002.11213  [pdf, other

    cs.CL cs.SD eess.AS

    Speech2Phone: A Novel and Efficient Method for Training Speaker Recognition Models

    Authors: Edresson Casanova, Arnaldo Candido Junior, Christopher Shulby, Frederico Santos de Oliveira, Lucas Rafael Stefanel Gris, Hamilton Pereira da Silva, Sandra Maria Aluisio, Moacir Antonelli Ponti

    Abstract: In this paper we present an efficient method for training models for speaker recognition using small or under-resourced datasets. This method requires less data than other SOTA (State-Of-The-Art) methods, e.g. the Angular Prototypical and GE2E loss functions, while achieving similar results to those methods. This is done using the knowledge of the reconstruction of a phoneme in the speaker's voice… ▽ More

    Submitted 18 June, 2021; v1 submitted 25 February, 2020; originally announced February 2020.

    Comments: Submitted to BRACIS

  13. arXiv:1901.09819  [pdf, other

    cs.CV

    Generalization of feature embeddings transferred from different video anomaly detection domains

    Authors: Fernando Pereira dos Santos, Leonardo Sampaio Ferraz Ribeiro, Moacir Antonelli Ponti

    Abstract: Detecting anomalous activity in video surveillance often involves using only normal activity data in order to learn an accurate detector. Due to lack of annotated data for some specific target domain, one could employ existing data from a source domain to produce better predictions. Hence, transfer learning presents itself as an important tool. But how to analyze the resulting data space? This pap… ▽ More

    Submitted 28 January, 2019; originally announced January 2019.

  14. arXiv:1811.08495  [pdf, other

    cs.CV

    Are pre-trained CNNs good feature extractors for anomaly detection in surveillance videos?

    Authors: Tiago S. Nazare, Rodrigo F. de Mello, Moacir A. Ponti

    Abstract: Recently, several techniques have been explored to detect unusual behaviour in surveillance videos. Nevertheless, few studies leverage features from pre-trained CNNs and none of then present a comparison of features generate by different models. Motivated by this gap, we compare features extracted by four state-of-the-art image classification networks as a way of describing patches from security v… ▽ More

    Submitted 20 November, 2018; originally announced November 2018.

  15. arXiv:1811.00473  [pdf, other

    cs.CV

    Unsupervised representation learning using convolutional and stacked auto-encoders: a domain and cross-domain feature space analysis

    Authors: Gabriel B. Cavallari, Leonardo Sampaio Ferraz Ribeiro, Moacir Antonelli Ponti

    Abstract: A feature learning task involves training models that are capable of inferring good representations (transformations of the original space) from input data alone. When working with limited or unlabelled data, and also when multiple visual domains are considered, methods that rely on large annotated datasets, such as Convolutional Neural Networks (CNNs), cannot be employed. In this paper we investi… ▽ More

    Submitted 1 November, 2018; originally announced November 2018.

    Comments: SIBGRAPI 2018 - Conference on Graphics, Patterns and Images

  16. arXiv:1806.07908  [pdf, other

    cs.LG cs.CV stat.ML

    Como funciona o Deep Learning

    Authors: Moacir Antonelli Ponti, Gabriel B. Paranhos da Costa

    Abstract: Deep Learning methods are currently the state-of-the-art in many problems which can be tackled via machine learning, in particular classification problems. However there is still lack of understanding on how those methods work, why they work and what are the limitations involved in using them. In this chapter we will describe in detail the transition from shallow to deep networks, include examples… ▽ More

    Submitted 20 June, 2018; originally announced June 2018.

    Comments: Book chapter, in Portuguese, 31 pages

    Journal ref: In: Tópicos em Gerenciamento de Dados e Informações, SBC, Cap.3, ISBN 978-85-7669-400-7, pp.63-93, 2017

  17. arXiv:1805.02627  [pdf, other

    cs.LG stat.ML

    Computing the Shattering Coefficient of Supervised Learning Algorithms

    Authors: Rodrigo Fernandes de Mello, Moacir Antonelli Ponti, Carlos Henrique Grossi Ferreira

    Abstract: The Statistical Learning Theory (SLT) provides the theoretical guarantees for supervised machine learning based on the Empirical Risk Minimization Principle (ERMP). Such principle defines an upper bound to ensure the uniform convergence of the empirical risk Remp(f), i.e., the error measured on a given data sample, to the expected value of risk R(f) (a.k.a. actual risk), which depends on the Joint… ▽ More

    Submitted 14 May, 2018; v1 submitted 7 May, 2018; originally announced May 2018.

  18. arXiv:1711.10292  [pdf, other

    cs.LG

    Providing theoretical learning guarantees to Deep Learning Networks

    Authors: Rodrigo Fernandes de Mello, Martha Dais Ferreira, Moacir Antonelli Ponti

    Abstract: Deep Learning (DL) is one of the most common subjects when Machine Learning and Data Science approaches are considered. There are clearly two movements related to DL: the first aggregates researchers in quest to outperform other algorithms from literature, trying to win contests by considering often small decreases in the empirical risk; and the second investigates overfitting evidences, questioni… ▽ More

    Submitted 28 November, 2017; originally announced November 2017.

    Comments: Submitted to JMLR