Zum Hauptinhalt springen

Showing 1–12 of 12 results for author: Davies, M E P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2401.08902  [pdf, other

    cs.SD cs.DL cs.IR cs.LG eess.AS

    Similar but Faster: Manipulation of Tempo in Music Audio Embeddings for Tempo Prediction and Search

    Authors: Matthew C. McCallum, Florian Henkel, Jaehun Kim, Samuel E. Sandberg, Matthew E. P. Davies

    Abstract: Audio embeddings enable large scale comparisons of the similarity of audio files for applications such as search and recommendation. Due to the subjectivity of audio similarity, it can be desirable to design systems that answer not only whether audio is similar, but similar in what way (e.g., wrt. tempo, mood or genre). Previous works have proposed disentangled embedding spaces where subspaces rep… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted to the International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024

  2. arXiv:2401.08891  [pdf, other

    cs.SD cs.LG eess.AS

    Tempo estimation as fully self-supervised binary classification

    Authors: Florian Henkel, Jaehun Kim, Matthew C. McCallum, Samuel E. Sandberg, Matthew E. P. Davies

    Abstract: This paper addresses the problem of global tempo estimation in musical audio. Given that annotating tempo is time-consuming and requires certain musical expertise, few publicly available data sources exist to train machine learning models for this task. Towards alleviating this issue, we propose a fully self-supervised approach that does not rely on any human labeled data. Our method builds on the… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted to the International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024

  3. arXiv:2401.08889  [pdf, other

    cs.SD cs.IR cs.LG cs.MM eess.AS

    On the Effect of Data-Augmentation on Local Embedding Properties in the Contrastive Learning of Music Audio Representations

    Authors: Matthew C. McCallum, Matthew E. P. Davies, Florian Henkel, Jaehun Kim, Samuel E. Sandberg

    Abstract: Audio embeddings are crucial tools in understanding large catalogs of music. Typically embeddings are evaluated on the basis of the performance they provide in a wide range of downstream tasks, however few studies have investigated the local properties of the embedding spaces themselves which are important in nearest neighbor algorithms, commonly used in music search and recommendation. In this wo… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: Accepted to the International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024

  4. arXiv:2308.10355  [pdf, other

    eess.AS cs.SD

    Local Periodicity-Based Beat Tracking for Expressive Classical Piano Music

    Authors: Ching-Yu Chiu, Meinard Müller, Matthew E. P. Davies, Alvin Wen-Yu Su, Yi-Hsuan Yang

    Abstract: To model the periodicity of beats, state-of-the-art beat tracking systems use "post-processing trackers" (PPTs) that rely on several empirically determined global assumptions for tempo transition, which work well for music with a steady tempo. For expressive classical music, however, these assumptions can be too rigid. With two large datasets of Western classical piano music, namely the Aligned Sc… ▽ More

    Submitted 20 August, 2023; originally announced August 2023.

    Comments: Accepted to IEEE/ACM Transactions on Audio, Speech, and Language Processing (July 2023)

  5. Tempo vs. Pitch: understanding self-supervised tempo estimation

    Authors: Giovana Morais, Matthew E. P. Davies, Marcelo Queiroz, Magdalena Fuentes

    Abstract: Self-supervision methods learn representations by solving pretext tasks that do not require human-generated labels, alleviating the need for time-consuming annotations. These methods have been applied in computer vision, natural language processing, environmental sound analysis, and recently in music information retrieval, e.g. for pitch estimation. Particularly in the context of music, there are… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: 5 pages, 3 figures, published on 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing

  6. An Analysis Method for Metric-Level Switching in Beat Tracking

    Authors: Ching-Yu Chiu, Meinard Müller, Matthew E. P. Davies, Alvin Wen-Yu Su, Yi-Hsuan Yang

    Abstract: For expressive music, the tempo may change over time, posing challenges to tracking the beats by an automatic model. The model may first tap to the correct tempo, but then may fail to adapt to a tempo change, or switch between several incorrect but perceptually plausible ones (e.g., half- or double-tempo). Existing evaluation metrics for beat tracking do not reflect such behaviors, as they typical… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted to IEEE Signal Processing Letters (Oct. 2022)

  7. arXiv:2203.16165  [pdf, other

    eess.AS cs.AI cs.MM

    Symbolic music generation conditioned on continuous-valued emotions

    Authors: Serkan Sulun, Matthew E. P. Davies, Paula Viana

    Abstract: In this paper we present a new approach for the generation of multi-instrument symbolic music driven by musical emotion. The principal novelty of our approach centres on conditioning a state-of-the-art transformer based on continuous-valued valence and arousal labels. In addition, we provide a new large-scale dataset of symbolic music paired with emotion labels in terms of valence and arousal. We… ▽ More

    Submitted 4 May, 2022; v1 submitted 30 March, 2022; originally announced March 2022.

    Comments: Published in IEEE Access

    Journal ref: volume:10, year:2022, pages:44617-44626

  8. arXiv:2011.07274  [pdf, other

    eess.AS cs.AI cs.LG cs.SD

    On Filter Generalization for Music Bandwidth Extension Using Deep Neural Networks

    Authors: Serkan Sulun, Matthew E. P. Davies

    Abstract: In this paper, we address a sub-topic of the broad domain of audio enhancement, namely musical audio bandwidth extension. We formulate the bandwidth extension problem using deep neural networks, where a band-limited signal is provided as input to the network, with the goal of reconstructing a full-bandwidth output. Our main contribution centers on the impact of the choice of low pass filter when t… ▽ More

    Submitted 6 January, 2021; v1 submitted 14 November, 2020; originally announced November 2020.

    Comments: Qualitative examples on https://serkansulun.com/bwe. Source code on https://github.com/serkansulun/deep-music-enhancer

  9. arXiv:2011.01637  [pdf, other

    cs.SD cs.IR

    Shift If You Can: Counting and Visualising Correction Operations for Beat Tracking Evaluation

    Authors: A. Sá Pinto, I. Domingues, M. E. P. Davies

    Abstract: In this late-breaking abstract we propose a modified approach for beat tracking evaluation which poses the problem in terms of the effort required to transform a sequence of beat detections such that they maximise the well-known F-measure calculation when compared to a sequence of ground truth annotations. Central to our approach is the inclusion of a shifting operation conducted over an additiona… ▽ More

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: ISMIR 2020 Late Breaking/Demo

  10. arXiv:2008.11529  [pdf, other

    eess.AS cs.SD

    TIV.lib: an open-source library for the tonal description of musical audio

    Authors: António Ramires, Gilberto Bernardes, Matthew E. P. Davies, Xavier Serra

    Abstract: In this paper, we present TIV.lib, an open-source library for the content-based tonal description of musical audio signals. Its main novelty relies on the perceptually-inspired Tonal Interval Vector space based on the Discrete Fourier transform, from which multiple instantaneous and global representations, descriptors and metrics are computed - e.g., harmonic change, dissonance, diatonicity, and m… ▽ More

    Submitted 26 August, 2020; originally announced August 2020.

  11. arXiv:1811.02411  [pdf, other

    cs.SD eess.AS

    An audio-only method for advertisement detection in broadcast television content

    Authors: António Ramires, Diogo Cocharro, Matthew E. P. Davies

    Abstract: We address the task of advertisement detection in broadcast television content. While typically approached from a video-only or audio-visual perspective, we present an audio-only method. Our approach centres on the detection of short silences which exist at the boundaries between programming and advertising, as well as between the advertisements themselves. To identify advertising regions we first… ▽ More

    Submitted 6 November, 2018; originally announced November 2018.

    Journal ref: Proc. of RecPad-2017, Amadora, Portugal, pp. 21-22, October, 2017

  12. arXiv:1811.02406  [pdf, other

    cs.SD eess.AS

    User Specific Adaptation in Automatic Transcription of Vocalised Percussion

    Authors: António Ramires, Rui Penha, Matthew E. P. Davies

    Abstract: The goal of this work is to develop an application that enables music producers to use their voice to create drum patterns when composing in Digital Audio Workstations (DAWs). An easy-to-use and user-oriented system capable of automatically transcribing vocalisations of percussion sounds, called LVT - Live Vocalised Transcription, is presented. LVT is developed as a Max for Live device which follo… ▽ More

    Submitted 6 November, 2018; originally announced November 2018.

    Journal ref: Proc. of RecPad-2017, Amadora, Portugal, pp. 19-20, October, 2017