Zum Hauptinhalt springen

Showing 1–4 of 4 results for author: Marinoni, C

Searching in archive eess. Search in all archives.
.
  1. arXiv:2402.09245  [pdf, other

    eess.AS cs.LG eess.SP

    Overview of the L3DAS23 Challenge on Audio-Visual Extended Reality

    Authors: Christian Marinoni, Riccardo Fosco Gramaccioni, Changan Chen, Aurelio Uncini, Danilo Comminiello

    Abstract: The primary goal of the L3DAS23 Signal Processing Grand Challenge at ICASSP 2023 is to promote and support collaborative research on machine learning for 3D audio signal processing, with a specific emphasis on 3D speech enhancement and 3D Sound Event Localization and Detection in Extended Reality applications. As part of our latest competition, we provide a brand-new dataset, which maintains the s… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: Accepted to 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023)

  2. arXiv:2309.07195  [pdf, other

    cs.SD cs.ET eess.AS

    Diffusion models for audio semantic communication

    Authors: Eleonora Grassucci, Christian Marinoni, Andrea Rodriguez, Danilo Comminiello

    Abstract: Directly sending audio signals from a transmitter to a receiver across a noisy channel may absorb consistent bandwidth and be prone to errors when trying to recover the transmitted bits. On the contrary, the recent semantic communication approach proposes to send the semantics and then regenerate semantically consistent content at the receiver without exactly recovering the bitstream. In this pape… ▽ More

    Submitted 13 September, 2023; originally announced September 2023.

    Comments: Submitted to IEEE ICASSP 2024

  3. arXiv:2202.10372  [pdf, other

    eess.AS cs.LG cs.SD eess.SP

    L3DAS22 Challenge: Learning 3D Audio Sources in a Real Office Environment

    Authors: Eric Guizzo, Christian Marinoni, Marco Pennese, Xinlei Ren, Xiguang Zheng, Chen Zhang, Bruno Masiero, Aurelio Uncini, Danilo Comminiello

    Abstract: The L3DAS22 Challenge is aimed at encouraging the development of machine learning strategies for 3D speech enhancement and 3D sound localization and detection in office-like environments. This challenge improves and extends the tasks of the L3DAS21 edition. We generated a new dataset, which maintains the same general characteristics of L3DAS21 datasets, but with an extended number of data points a… ▽ More

    Submitted 21 February, 2022; originally announced February 2022.

    Comments: Accepted to 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022). arXiv admin note: substantial text overlap with arXiv:2104.05499

    Journal ref: 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2022, pp. 9186-9190

  4. arXiv:2104.05499  [pdf, ps, other

    eess.AS cs.LG cs.SD eess.SP

    L3DAS21 Challenge: Machine Learning for 3D Audio Signal Processing

    Authors: Eric Guizzo, Riccardo F. Gramaccioni, Saeid Jamili, Christian Marinoni, Edoardo Massaro, Claudia Medaglia, Giuseppe Nachira, Leonardo Nucciarelli, Ludovica Paglialunga, Marco Pennese, Sveva Pepe, Enrico Rocchi, Aurelio Uncini, Danilo Comminiello

    Abstract: The L3DAS21 Challenge is aimed at encouraging and fostering collaborative research on machine learning for 3D audio signal processing, with particular focus on 3D speech enhancement (SE) and 3D sound localization and detection (SELD). Alongside with the challenge, we release the L3DAS21 dataset, a 65 hours 3D audio corpus, accompanied with a Python API that facilitates the data usage and results s… ▽ More

    Submitted 29 April, 2021; v1 submitted 12 April, 2021; originally announced April 2021.

    Comments: Documentation paper for the L3DAS21 Challenge for IEEE MLSP 2021. Further information on www.l3das.com/mlsp2021

    Journal ref: 2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP), 2021, pp. 1-6