Zum Hauptinhalt springen

Showing 1–6 of 6 results for author: Noufi, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2210.15001  [pdf, other

    eess.AS cs.SD

    Acoustically-Driven Phoneme Removal That Preserves Vocal Affect Cues

    Authors: Camille Noufi, Jonathan Berger, Karen J. Parker, Daniel L. Bowling

    Abstract: In this paper, we propose a method for removing linguistic information from speech for the purpose of isolating paralinguistic indicators of affect. The immediate utility of this method lies in clinical tests of sensitivity to vocal affect that are not confounded by language, which is impaired in a variety of clinical populations. The method is based on simultaneous recordings of speech audio and… ▽ More

    Submitted 14 March, 2023; v1 submitted 26 October, 2022; originally announced October 2022.

    Comments: To be seen in proceedings of the 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (DOI coming soon)

  2. arXiv:2209.04473  [pdf, other

    eess.AS cs.SD eess.SP

    Reconstructing the Dynamic Directivity of Unconstrained Speech

    Authors: Camille Noufi, Dejan Markovic, Peter Dodds

    Abstract: This article presents a method for estimating and reconstructing the spatial energy distribution pattern of natural speech, which is crucial for achieving realistic vocal presence in virtual communication settings. The method comprises two stages. First, recordings of speech captured by a real, static microphone array are used to create an egocentric virtual array that tracks the movement of the s… ▽ More

    Submitted 5 September, 2023; v1 submitted 9 September, 2022; originally announced September 2022.

    Comments: In proceedings of I3DA 2023 - The 2023 International Conference on Immersive and 3D Audio. DOI coming soon

  3. arXiv:2209.04406  [pdf, other

    q-bio.NC cs.SD eess.AS

    Longitudinal Acoustic Speech Tracking Following Pediatric Traumatic Brain Injury

    Authors: Camille Noufi, Adam C. Lammert, Daryush D. Mehta, James R. Williamson, Gregory Ciccarelli, Douglas Sturim, Jordan R. Green, Thomas F. Quatieri, Thomas F. Campbell

    Abstract: Recommendations for common outcome measures following pediatric traumatic brain injury (TBI) support the integration of instrumental measurements alongside perceptual assessment in recovery and treatment plans. A comprehensive set of sensitive, robust and non-invasive measurements is therefore essential in assessing variations in speech characteristics over time following pediatric TBI. In this ar… ▽ More

    Submitted 9 September, 2022; originally announced September 2022.

  4. arXiv:2209.02855  [pdf, other

    cs.SD cs.HC eess.AS eess.SY

    The Role of Vocal Persona in Natural and Synthesized Speech

    Authors: Camille Noufi, Lloyd May, Jonathan Berger

    Abstract: The inclusion of voice persona in synthesized voice can be significant in a broad range of human-computer-interaction (HCI) applications, including augmentative and assistive communication (AAC), artistic performance, and design of virtual agents. We propose a framework to imbue compelling and contextually-dependent expression within a synthesized voice by introducing the role of the vocal persona… ▽ More

    Submitted 31 October, 2022; v1 submitted 6 September, 2022; originally announced September 2022.

    Comments: To be published in the proceedings of the 17th IEEE International Conference on Automatic Face and Gesture Recognition as part of the Workshop on Socially Interactive Human-like Virtual Agents (SIVA '23)

  5. arXiv:2203.03022  [pdf, ps, other

    cs.SD cs.AI cs.LG eess.AS stat.ML

    HEAR: Holistic Evaluation of Audio Representations

    Authors: Joseph Turian, Jordie Shier, Humair Raj Khan, Bhiksha Raj, Björn W. Schuller, Christian J. Steinmetz, Colin Malloy, George Tzanetakis, Gissel Velarde, Kirk McNally, Max Henry, Nicolas Pinto, Camille Noufi, Christian Clough, Dorien Herremans, Eduardo Fonseca, Jesse Engel, Justin Salamon, Philippe Esling, Pranay Manocha, Shinji Watanabe, Zeyu Jin, Yonatan Bisk

    Abstract: What audio embedding approach generalizes best to a wide range of downstream tasks across a variety of everyday domains without fine-tuning? The aim of the HEAR benchmark is to develop a general-purpose audio representation that provides a strong basis for learning in a wide variety of tasks and scenarios. HEAR evaluates audio representations using a benchmark suite across a variety of domains, in… ▽ More

    Submitted 29 May, 2022; v1 submitted 6 March, 2022; originally announced March 2022.

    Comments: to appear in Proceedings of Machine Learning Research (PMLR): NeurIPS 2021 Competition Track

  6. arXiv:2007.09060  [pdf, other

    cs.SD cs.CV cs.IR cs.LG eess.AS

    Self-Supervised Learning of Context-Aware Pitch Prosody Representations

    Authors: Camille Noufi, Prateek Verma

    Abstract: In music and speech, meaning is derived at multiple levels of context. Affect, for example, can be inferred both by a short sound token and by sonic patterns over a longer temporal window such as an entire recording. In this letter, we focus on inferring meaning from this dichotomy of contexts. We show how contextual representations of short sung vocal lines can be implicitly learned from fundamen… ▽ More

    Submitted 1 August, 2021; v1 submitted 17 July, 2020; originally announced July 2020.