
Showing 1–9 of 9 results for author: Martínez, A M

Searching in archive cs.
  1. arXiv:2203.09148  [pdf, other]

    cs.SD cs.AI cs.CL cs.LG eess.AS

    Prediction of speech intelligibility with DNN-based performance measures

    Authors: Angel Mario Castro Martinez, Constantin Spille, Jana Roßbach, Birger Kollmeier, Bernd T. Meyer

    Abstract: This paper presents a speech intelligibility model based on automatic speech recognition (ASR), combining phoneme probabilities from deep neural networks (DNNs) and a performance measure that estimates the word error rate from these probabilities. This model requires neither the clean speech reference nor the word labels during testing, as the ASR decoding step, which finds the most likely sequence…

    Submitted 17 March, 2022; originally announced March 2022.

    Journal ref: Computer Speech & Language, 74, p.101329 (2022)

  2. arXiv:2111.15651  [pdf, other]

    cs.CV cs.LG

    Leveraging The Topological Consistencies of Learning in Deep Neural Networks

    Authors: Stuart Synakowski, Fabian Benitez-Quiroz, Aleix M. Martinez

    Abstract: Recently, methods have been developed to accurately predict the testing performance of a Deep Neural Network (DNN) on a particular task, given statistics of its underlying topological structure. However, further leveraging this newly found insight for practical applications is intractable due to the high computational cost in terms of time and memory. In this work, we define a new class of topolog…

    Submitted 30 November, 2021; originally announced November 2021.

  3. arXiv:1908.03679  [pdf, other]

    eess.IV cs.CV cs.LG

    Distance Map Loss Penalty Term for Semantic Segmentation

    Authors: Francesco Caliva, Claudia Iriondo, Alejandro Morales Martinez, Sharmila Majumdar, Valentina Pedoia

    Abstract: Convolutional neural networks for semantic segmentation suffer from low performance at object boundaries. In medical imaging, accurate representation of tissue surfaces and volumes is important for tracking of disease biomarkers such as tissue morphology and shape features. In this work, we propose a novel distance map derived loss penalty term for semantic segmentation. We propose to use distance…

    Submitted 9 August, 2019; originally announced August 2019.

    Comments: Medical Imaging with Deep Learning (MIDL2019) Conference [arXiv:1907.08612], Extended Abstract

    Report number: MIDL/2019/ExtendedAbstract/B1eIcvS45V

  4. arXiv:1808.04399  [pdf, other]

    cs.CV

    Cross-Cultural and Cultural-Specific Production and Perception of Facial Expressions of Emotion in the Wild

    Authors: Ramprakash Srinivasan, Aleix M. Martinez

    Abstract: Automatic recognition of emotion from facial expressions is an intense area of research, with a potentially long list of important applications. Yet, the study of emotion requires knowing which facial expressions are used within and across cultures in the wild, not in controlled lab conditions; but such studies do not exist. Which and how many cross-cultural and cultural-specific facial expressions…

    Submitted 13 August, 2018; originally announced August 2018.

  5. arXiv:1807.09251  [pdf, other]

    cs.CV

    GANimation: Anatomically-aware Facial Animation from a Single Image

    Authors: Albert Pumarola, Antonio Agudo, Aleix M. Martinez, Alberto Sanfeliu, Francesc Moreno-Noguer

    Abstract: Recent advances in Generative Adversarial Networks (GANs) have shown impressive results for the task of facial expression synthesis. The most successful architecture is StarGAN, which conditions the GAN generation process with images of a specific domain, namely a set of images of persons sharing the same expression. While effective, this approach can only generate a discrete number of expressions, determ…

    Submitted 28 August, 2018; v1 submitted 24 July, 2018; originally announced July 2018.

    Comments: Accepted as oral at ECCV 2018. Code available at https://github.com/albertpumarola/GANimation. Added minor updates

  6. AMIDST: a Java Toolbox for Scalable Probabilistic Machine Learning

    Authors: Andrés R. Masegosa, Ana M. Martínez, Darío Ramos-López, Rafael Cabañas, Antonio Salmerón, Thomas D. Nielsen, Helge Langseth, Anders L. Madsen

    Abstract: The AMIDST Toolbox is software for scalable probabilistic machine learning with a special focus on (massive) streaming data. The toolbox supports a flexible modeling language based on probabilistic graphical models with latent variables and temporal dependencies. The specified models can be learnt from large data sets using parallel or distributed implementations of Bayesian learning algorit…

    Submitted 4 April, 2017; originally announced April 2017.

    ACM Class: I.2.6

  7. arXiv:1703.01210  [pdf, other]

    cs.CV

    EmotioNet Challenge: Recognition of facial expressions of emotion in the wild

    Authors: C. Fabian Benitez-Quiroz, Ramprakash Srinivasan, Qianli Feng, Yan Wang, Aleix M. Martinez

    Abstract: This paper details the methodology and results of the EmotioNet challenge. This challenge is the first to test the ability of computer vision algorithms in the automatic analysis of a large number of images of facial expressions of emotion in the wild. The challenge was divided into two tracks. The first track tested the ability of current computer vision algorithms in the automatic detection of a…

    Submitted 3 March, 2017; originally announced March 2017.

  8. On the Relevance of Auditory-Based Gabor Features for Deep Learning in Automatic Speech Recognition

    Authors: Angel Mario Castro Martinez, Sri Harish Mallidi, Bernd T. Meyer

    Abstract: Previous studies support the idea of merging auditory-based Gabor features with deep learning architectures to achieve robust automatic speech recognition; however, the cause behind the gain of such a combination is still unknown. We believe these representations provide the deep learning decoder with more discriminable cues. Our aim with this paper is to validate this hypothesis by performing exper…

    Submitted 14 February, 2017; originally announced February 2017.

    Comments: Accepted to Computer Speech & Language

  9. arXiv:1604.07990  [pdf, other]

    cs.AI cs.DC stat.ML

    Probabilistic Graphical Models on Multi-Core CPUs using Java 8

    Authors: Andres R. Masegosa, Ana M. Martinez, Hanen Borchani

    Abstract: In this paper, we discuss software design issues related to the development of parallel computational intelligence algorithms on multi-core CPUs, using the new Java 8 functional programming features. In particular, we focus on probabilistic graphical models (PGMs) and present the parallelisation of a collection of algorithms that deal with inference and learning of PGMs from data. Namely, maximum…

    Submitted 27 April, 2016; originally announced April 2016.

    Comments: Pre-print version of the paper published in the special issue on Computational Intelligence Software of IEEE Computational Intelligence Magazine

    Journal ref: IEEE Computational Intelligence Magazine, 11(2), 41-54. 2016