Zum Hauptinhalt springen

Showing 1–29 of 29 results for author: Schuster, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.03132  [pdf, other

    cs.SD cs.AI cs.CL cs.LG eess.AS

    Speaker- and Text-Independent Estimation of Articulatory Movements and Phoneme Alignments from Speech

    Authors: Tobias Weise, Philipp Klumpp, Kubilay Can Demir, Paula Andrea Pérez-Toro, Maria Schuster, Elmar Noeth, Bjoern Heismann, Andreas Maier, Seung Hee Yang

    Abstract: This paper introduces a novel combination of two tasks, previously treated separately: acoustic-to-articulatory speech inversion (AAI) and phoneme-to-articulatory (PTA) motion estimation. We refer to this joint task as acoustic phoneme-to-articulatory speech inversion (APTAI) and explore two different approaches, both working speaker- and text-independently during inference. We use a multi-task le… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: to be published in Interspeech 2024 proceedings

  2. arXiv:2404.08064  [pdf

    eess.AS cs.AI cs.CR cs.LG

    The Impact of Speech Anonymization on Pathology and Its Limits

    Authors: Soroosh Tayebi Arasteh, Tomas Arias-Vergara, Paula Andrea Perez-Toro, Tobias Weise, Kai Packhaeuser, Maria Schuster, Elmar Noeth, Andreas Maier, Seung Hee Yang

    Abstract: Integration of speech into healthcare has intensified privacy concerns due to its potential as a non-invasive biomarker containing individual biometric information. In response, speaker anonymization aims to conceal personally identifiable information while retaining crucial linguistic content. However, the application of anonymization techniques to pathological speech, a critical area where priva… ▽ More

    Submitted 22 June, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

  3. arXiv:2312.14571  [pdf, other

    cs.LG

    Data is Moody: Discovering Data Modification Rules from Process Event Logs

    Authors: Marco Bjarne Schuster, Boris Wiegand, Jilles Vreeken

    Abstract: Although event logs are a powerful source to gain insight about the behavior of the underlying business process, existing work primarily focuses on finding patterns in the activity sequences of an event log, while ignoring event attribute data. Event attribute data has mostly been used to predict event occurrences and process outcome, but the state of the art neglects to mine succinct and interpre… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

  4. arXiv:2204.06450  [pdf, other

    cs.SD cs.LG eess.AS

    The effect of speech pathology on automatic speaker verification -- a large-scale study

    Authors: Soroosh Tayebi Arasteh, Tobias Weise, Maria Schuster, Elmar Noeth, Andreas Maier, Seung Hee Yang

    Abstract: Navigating the challenges of data-driven speech processing, one of the primary hurdles is accessing reliable pathological speech data. While public datasets appear to offer solutions, they come with inherent risks of potential unintended exposure of patient health information via re-identification attacks. Using a comprehensive real-world pathological speech corpus, with over n=3,800 test subjects… ▽ More

    Submitted 22 November, 2023; v1 submitted 13 April, 2022; originally announced April 2022.

    Comments: Published in Scientific Reports

    Journal ref: Sci Rep 13, 20476 (2023)

  5. arXiv:2204.04016  [pdf, other

    eess.AS cs.CL cs.LG cs.SD q-bio.QM

    Disentangled Latent Speech Representation for Automatic Pathological Intelligibility Assessment

    Authors: Tobias Weise, Philipp Klumpp, Kubilay Can Demir, Andreas Maier, Elmar Noeth, Bjoern Heismann, Maria Schuster, Seung Hee Yang

    Abstract: Speech intelligibility assessment plays an important role in the therapy of patients suffering from pathological speech disorders. Automatic and objective measures are desirable to assist therapists in their traditionally subjective and labor-intensive assessments. In this work, we investigate a novel approach for obtaining such a measure using the divergence in disentangled latent speech represen… ▽ More

    Submitted 27 June, 2022; v1 submitted 8 April, 2022; originally announced April 2022.

    Comments: Submitted and Accepted at INTERSPEECH2022

  6. GPGM-SLAM: a Robust SLAM System for Unstructured Planetary Environments with Gaussian Process Gradient Maps

    Authors: Riccardo Giubilato, Cedric Le Gentil, Mallikarjuna Vayugundla, Martin J. Schuster, Teresa Vidal-Calleja, Rudolph Triebel

    Abstract: Simultaneous Localization and Mapping (SLAM) techniques play a key role towards long-term autonomy of mobile robots due to the ability to correct localization errors and produce consistent maps of an environment over time. Contrarily to urban or man-made environments, where the presence of unique objects and structures offer unique cues for localization, the appearance of unstructured natural envi… ▽ More

    Submitted 14 September, 2021; originally announced September 2021.

    Comments: Submission to Field Robotics (www.journalfieldrobotics.org), under review

    Journal ref: Field Robotics, Vol. 2, 2022

  7. arXiv:2105.02020  [pdf, other

    cs.RO

    Multi-Modal Loop Closing in Unstructured Planetary Environments with Visually Enriched Submaps

    Authors: Riccardo Giubilato, Mallikarjuna Vayugundla, Wolfgang Stürzl, Martin J. Schuster, Armin Wedler, Rudolph Triebel

    Abstract: Future planetary missions will rely on rovers that can autonomously explore and navigate in unstructured environments. An essential element is the ability to recognize places that were already visited or mapped. In this work, we leverage the ability of stereo cameras to provide both visual and depth information, guiding the search and validation of loop closures from a multi-modal perspective. We… ▽ More

    Submitted 14 September, 2021; v1 submitted 5 May, 2021; originally announced May 2021.

    Comments: Accepted at the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2021)

  8. arXiv:2008.06341  [pdf, other

    nlin.CG cs.OH

    Probabilistic Cellular Automata for Granular Media in Video Games

    Authors: Jonathan Devlin, Micah D. Schuster

    Abstract: Granular materials are very common in the everyday world. Media such as sand, soil, gravel, food stuffs, pharmaceuticals, etc. all have similar irregular flow since they are composed of numerous small solid particles. In video games, simulating these materials increases immersion and can be used for various game mechanics. Computationally, full scale simulation is not typically feasible except o… ▽ More

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: Cellular Automata, Sandpile

  9. arXiv:2002.04374  [pdf, other

    cs.LG cs.CL eess.AS stat.ML

    Convolutional Neural Networks and a Transfer Learning Strategy to Classify Parkinson's Disease from Speech in Three Different Languages

    Authors: J. C. Vásquez-Correa, T. Arias-Vergara, C. D. Rios-Urrego, M. Schuster, J. Rusz, J. R. Orozco-Arroyave, E. Nöth

    Abstract: Parkinson's disease patients develop different speech impairments that affect their communication capabilities. The automatic assessment of the speech of the patients allows the development of computer aided tools to support the diagnosis and the evaluation of the disease severity. This paper introduces a methodology to classify Parkinson's disease from speech in three different languages: Spanish… ▽ More

    Submitted 11 February, 2020; originally announced February 2020.

    Journal ref: In Iberoamerican Congress on Pattern Recognition (pp. 697-706) 2019

  10. arXiv:1903.03070  [pdf, ps, other

    cs.LO cs.DS math.AC math.LO

    An algorithmic approach to the existence of ideal objects in commutative algebra

    Authors: Thomas Powell, Peter M Schuster, Franziskus Wiesnet

    Abstract: The existence of ideal objects, such as maximal ideals in nonzero rings, plays a crucial role in commutative algebra. These are typically justified using Zorn's lemma, and thus pose a challenge from a computational point of view. Giving a constructive meaning to ideal objects is a problem which dates back to Hilbert's program, and today is still a central theme in the area of dynamical algebra, wh… ▽ More

    Submitted 7 March, 2019; originally announced March 2019.

  11. arXiv:1903.01462  [pdf, other

    physics.ins-det cs.LG nucl-ex

    Deep learning based pulse shape discrimination for germanium detectors

    Authors: P. Holl, L. Hauertmann, B. Majorovits, O. Schulz, M. Schuster, A. J. Zsigmond

    Abstract: Experiments searching for rare processes like neutrinoless double beta decay heavily rely on the identification of background events to reduce their background level and increase their sensitivity. We present a novel machine learning based method to recognize one of the most abundant classes of background events in these experiments. By combining a neural network for feature extraction with a smal… ▽ More

    Submitted 2 June, 2019; v1 submitted 4 March, 2019; originally announced March 2019.

    Comments: Published in Eur. Phys. J. C. 9 pages, 10 figures, 3 tables

    Journal ref: Eur. Phys. J. C (2019) 79: 450

  12. arXiv:1902.08295  [pdf, other

    cs.LG stat.ML

    Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling

    Authors: Jonathan Shen, Patrick Nguyen, Yonghui Wu, Zhifeng Chen, Mia X. Chen, Ye Jia, Anjuli Kannan, Tara Sainath, Yuan Cao, Chung-Cheng Chiu, Yanzhang He, Jan Chorowski, Smit Hinsu, Stella Laurenzo, James Qin, Orhan Firat, Wolfgang Macherey, Suyog Gupta, Ankur Bapna, Shuyuan Zhang, Ruoming Pang, Ron J. Weiss, Rohit Prabhavalkar, Qiao Liang, Benoit Jacob , et al. (66 additional authors not shown)

    Abstract: Lingvo is a Tensorflow framework offering a complete solution for collaborative deep learning research, with a particular focus towards sequence-to-sequence models. Lingvo models are composed of modular building blocks that are flexible and easily extensible, and experiment configurations are centralized and highly customizable. Distributed training and quantized inference are supported directly w… ▽ More

    Submitted 21 February, 2019; originally announced February 2019.

  13. arXiv:1804.10292  [pdf, other

    cs.FL

    Streaming Rewriting Games: Winning Strategies and Complexity

    Authors: Christian Coester, Thomas Schwentick, Martin Schuster

    Abstract: Context-free games on strings are two-player rewriting games based on a set of production rules and a regular target language. In each round, the first player selects a position of the current string; then the second player replaces the symbol at that position according to one of the production rules. The first player wins as soon as the current string belongs to the target language. In this paper… ▽ More

    Submitted 26 April, 2018; originally announced April 2018.

  14. arXiv:1804.09849  [pdf, other

    cs.CL cs.AI

    The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation

    Authors: Mia Xu Chen, Orhan Firat, Ankur Bapna, Melvin Johnson, Wolfgang Macherey, George Foster, Llion Jones, Niki Parmar, Mike Schuster, Zhifeng Chen, Yonghui Wu, Macduff Hughes

    Abstract: The past year has witnessed rapid advances in sequence-to-sequence (seq2seq) modeling for Machine Translation (MT). The classic RNN-based approaches to MT were first out-performed by the convolutional seq2seq model, which was then out-performed by the more recent Transformer model. Each of these new approaches consists of a fundamental architecture accompanied by a set of modeling and training tec… ▽ More

    Submitted 26 April, 2018; v1 submitted 25 April, 2018; originally announced April 2018.

  15. arXiv:1802.09984  [pdf, ps, other

    cs.DB cs.PL

    Formal Semantics of the Language Cypher

    Authors: Nadime Francis, Alastair Green, Paolo Guagliardo, Leonid Libkin, Tobias Lindaaker, Victor Marsault, Stefan Plantikow, Mats Rydberg, Martin Schuster, Petra Selmer, Andrés Taylor

    Abstract: Cypher is a query language for property graphs. It was originally designed and implemented as part of the Neo4j graph database, and it is currently used in a growing number of commercial systems, industrial applications and research projects. In this work, we provide denotational semantics of the core fragment of the read-only part of Cypher, which features in particular pattern matching, filterin… ▽ More

    Submitted 20 March, 2018; v1 submitted 27 February, 2018; originally announced February 2018.

    Comments: 22 pages

  16. arXiv:1712.05884  [pdf, other

    cs.CL

    Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions

    Authors: Jonathan Shen, Ruoming Pang, Ron J. Weiss, Mike Schuster, Navdeep Jaitly, Zongheng Yang, Zhifeng Chen, Yu Zhang, Yuxuan Wang, RJ Skerry-Ryan, Rif A. Saurous, Yannis Agiomyrgiannakis, Yonghui Wu

    Abstract: This paper describes Tacotron 2, a neural network architecture for speech synthesis directly from text. The system is composed of a recurrent sequence-to-sequence feature prediction network that maps character embeddings to mel-scale spectrograms, followed by a modified WaveNet model acting as a vocoder to synthesize timedomain waveforms from those spectrograms. Our model achieves a mean opinion s… ▽ More

    Submitted 15 February, 2018; v1 submitted 15 December, 2017; originally announced December 2017.

    Comments: Accepted to ICASSP 2018

  17. arXiv:1701.01337  [pdf, ps, other

    cs.DS

    New Abilities and Limitations of Spectral Graph Bisection

    Authors: Martin R. Schuster, Maciej Liskiewicz

    Abstract: Spectral based heuristics belong to well-known commonly used methods which determines provably minimal graph bisection or outputs "fail" when the optimality cannot be certified. In this paper we focus on Boppana's algorithm which belongs to one of the most prominent methods of this type. It is well known that the algorithm works well in the random \emph{planted bisection model} -- the standard cla… ▽ More

    Submitted 28 April, 2017; v1 submitted 5 January, 2017; originally announced January 2017.

  18. arXiv:1611.04558  [pdf, other

    cs.CL cs.AI

    Google's Multilingual Neural Machine Translation System: Enabling Zero-Shot Translation

    Authors: Melvin Johnson, Mike Schuster, Quoc V. Le, Maxim Krikun, Yonghui Wu, Zhifeng Chen, Nikhil Thorat, Fernanda Viégas, Martin Wattenberg, Greg Corrado, Macduff Hughes, Jeffrey Dean

    Abstract: We propose a simple solution to use a single Neural Machine Translation (NMT) model to translate between multiple languages. Our solution requires no change in the model architecture from our base system but instead introduces an artificial token at the beginning of the input sentence to specify the required target language. The rest of the model, which includes encoder, decoder and attention, rem… ▽ More

    Submitted 21 August, 2017; v1 submitted 14 November, 2016; originally announced November 2016.

  19. arXiv:1609.08144  [pdf, other

    cs.CL cs.AI cs.LG

    Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation

    Authors: Yonghui Wu, Mike Schuster, Zhifeng Chen, Quoc V. Le, Mohammad Norouzi, Wolfgang Macherey, Maxim Krikun, Yuan Cao, Qin Gao, Klaus Macherey, Jeff Klingner, Apurva Shah, Melvin Johnson, Xiaobing Liu, Łukasz Kaiser, Stephan Gouws, Yoshikiyo Kato, Taku Kudo, Hideto Kazawa, Keith Stevens, George Kurian, Nishant Patil, Wei Wang, Cliff Young, Jason Smith , et al. (6 additional authors not shown)

    Abstract: Neural Machine Translation (NMT) is an end-to-end learning approach for automated translation, with the potential to overcome many of the weaknesses of conventional phrase-based translation systems. Unfortunately, NMT systems are known to be computationally expensive both in training and in translation inference. Also, most NMT systems have difficulty with rare words. These issues have hindered NM… ▽ More

    Submitted 8 October, 2016; v1 submitted 26 September, 2016; originally announced September 2016.

  20. arXiv:1609.00150  [pdf, ps, other

    cs.LG

    Reward Augmented Maximum Likelihood for Neural Structured Prediction

    Authors: Mohammad Norouzi, Samy Bengio, Zhifeng Chen, Navdeep Jaitly, Mike Schuster, Yonghui Wu, Dale Schuurmans

    Abstract: A key problem in structured output prediction is direct optimization of the task reward function that matters for test evaluation. This paper presents a simple and computationally efficient approach to incorporate task reward into a maximum likelihood framework. By establishing a link between the log-likelihood and expected reward objectives, we show that an optimal regularized expected reward is… ▽ More

    Submitted 4 January, 2017; v1 submitted 1 September, 2016; originally announced September 2016.

    Comments: NIPS 2016

  21. arXiv:1606.02879  [pdf, ps, other

    cs.FL

    Transducer-based Rewriting Games for Active XML

    Authors: Martin Schuster

    Abstract: Context-free games are two-player rewriting games that are played on nested strings representing XML documents with embedded function symbols. These games were introduced to model rewriting processes for intensional documents in the Active XML framework, where input documents are to be rewritten into a given target schema by calls to external services. This paper studies the setting where depend… ▽ More

    Submitted 9 June, 2016; originally announced June 2016.

    Comments: Extended version of MFCS 2016 conference paper

    ACM Class: F.2.m; F.4.2; H.3.5

  22. arXiv:1603.04467  [pdf, other

    cs.DC cs.LG

    TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems

    Authors: Martín Abadi, Ashish Agarwal, Paul Barham, Eugene Brevdo, Zhifeng Chen, Craig Citro, Greg S. Corrado, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Ian Goodfellow, Andrew Harp, Geoffrey Irving, Michael Isard, Yangqing Jia, Rafal Jozefowicz, Lukasz Kaiser, Manjunath Kudlur, Josh Levenberg, Dan Mane, Rajat Monga, Sherry Moore, Derek Murray, Chris Olah , et al. (15 additional authors not shown)

    Abstract: TensorFlow is an interface for expressing machine learning algorithms, and an implementation for executing such algorithms. A computation expressed using TensorFlow can be executed with little or no change on a wide variety of heterogeneous systems, ranging from mobile devices such as phones and tablets up to large-scale distributed systems of hundreds of machines and thousands of computational de… ▽ More

    Submitted 16 March, 2016; v1 submitted 14 March, 2016; originally announced March 2016.

    Comments: Version 2 updates only the metadata, to correct the formatting of Martín Abadi's name

  23. arXiv:1602.02410  [pdf, other

    cs.CL

    Exploring the Limits of Language Modeling

    Authors: Rafal Jozefowicz, Oriol Vinyals, Mike Schuster, Noam Shazeer, Yonghui Wu

    Abstract: In this work we explore recent advances in Recurrent Neural Networks for large scale Language Modeling, a task central to language understanding. We extend current models to deal with two key challenges present in this task: corpora and vocabulary sizes, and complex, long term structure of language. We perform an exhaustive study on techniques such as character Convolutional Neural Networks or Lon… ▽ More

    Submitted 11 February, 2016; v1 submitted 7 February, 2016; originally announced February 2016.

  24. arXiv:1412.5910  [pdf, ps, other

    cs.DB cs.FL

    Games for Active XML Revisited

    Authors: Martin Schuster, Thomas Schwentick

    Abstract: The paper studies the rewriting mechanisms for intensional documents in the Active XML framework, abstracted in the form of active context-free games. The safe rewriting problem studied in this paper is to decide whether the first player, Juliet, has a winning strategy for a given game and (nested) word; this corresponds to a successful rewriting strategy for a given intensional document. The pape… ▽ More

    Submitted 18 December, 2014; originally announced December 2014.

    Comments: To be published in ICDT 2015

    ACM Class: F.2.m; F.4.2; H.3.5

  25. arXiv:1312.3005  [pdf, ps, other

    cs.CL

    One Billion Word Benchmark for Measuring Progress in Statistical Language Modeling

    Authors: Ciprian Chelba, Tomas Mikolov, Mike Schuster, Qi Ge, Thorsten Brants, Phillipp Koehn, Tony Robinson

    Abstract: We propose a new benchmark corpus to be used for measuring progress in statistical language modeling. With almost one billion words of training data, we hope this benchmark will be useful to quickly evaluate novel language modeling techniques, and to compare their contribution when combined with other advanced techniques. We show performance of several well-known types of language models, with the… ▽ More

    Submitted 4 March, 2014; v1 submitted 10 December, 2013; originally announced December 2013.

    Comments: Accompanied by a code.google.com project allowing anyone to generate the benchmark data, and use it to compare their language model against the ones described in the paper

  26. arXiv:1308.2690  [pdf, ps, other

    cs.LO math.AC math.LO

    Induction in Algebra: a First Case Study

    Authors: Peter M Schuster

    Abstract: Many a concrete theorem of abstract algebra admits a short and elegant proof by contradiction but with Zorn's Lemma (ZL). A few of these theorems have recently turned out to follow in a direct and elementary way from the Principle of Open Induction distinguished by Raoult. The ideal objects characteristic of any invocation of ZL are eliminated, and it is made possible to pass from classical to in… ▽ More

    Submitted 20 September, 2013; v1 submitted 12 August, 2013; originally announced August 2013.

    Journal ref: Logical Methods in Computer Science, Volume 9, Issue 3 (September 17, 2013) lmcs:959

  27. arXiv:1212.3501  [pdf, ps, other

    cs.DB

    On optimum left-to-right strategies for active context-free games

    Authors: Henrik Björklund, Martin Schuster, Thomas Schwentick, Joscha Kulbatzki

    Abstract: Active context-free games are two-player games on strings over finite alphabets with one player trying to rewrite the input string to match a target specification. These games have been investigated in the context of exchanging Active XML (AXML) data. While it was known that the rewriting problem is undecidable in general, it is shown here that it is EXPSPACE-complete to decide for a given context… ▽ More

    Submitted 14 December, 2012; originally announced December 2012.

    Comments: To appear in ICDT 2013

  28. arXiv:1207.4694  [pdf, ps, other

    cs.DS cs.CC

    A New Upper Bound for the Traveling Salesman Problem in Cubic Graphs

    Authors: Maciej Liskiewicz, Martin R. Schuster

    Abstract: We provide a new upper bound for traveling salesman problem (TSP) in cubic graphs, i.e. graphs with maximum vertex degree three, and prove that the problem for an $n$-vertex graph can be solved in $O(1.2553^n)$ time and in linear space. We show that the exact TSP algorithm of Eppstein, with some minor modifications, yields the stated result. The previous best known upper bound $O(1.251^n)$ was cla… ▽ More

    Submitted 30 November, 2012; v1 submitted 19 July, 2012; originally announced July 2012.

  29. arXiv:1203.6536  [pdf, other

    math.CO cs.DM

    Computing the Ramsey Number $R(K_5-P_3,K_5)$

    Authors: Jesse A. Calvert, Michael J. Schuster, Stanisław P. Radziszowski

    Abstract: We give a computer-assisted proof of the fact that $R(K_5-P_3, K_5)=25$. This solves one of the three remaining open cases in Hendry's table, which listed the Ramsey numbers for pairs of graphs on 5 vertices. We find that there exist no $(K_5-P_3,K_5)$-good graphs containing a $K_4$ on 23 or 24 vertices, where a graph $F$ is $(G,H)$-good if $F$ does not contain $G$ and the complement of $F$ does n… ▽ More

    Submitted 29 March, 2012; originally announced March 2012.

    MSC Class: 05C55

    Journal ref: Journal of Combinatorial Mathematics and Combinatorial Computing, 82 (2012) 131-140