Search | arXiv e-print repository

On zero-shot learning in neural state estimation of power distribution systems

Authors: Aleksandr Berezin, Stephan Balduin, Thomas Oberließen, Sebastian Peter, Eric MSP Veith

Abstract: This paper addresses the challenge of neural state estimation in power distribution systems. We identified a research gap in the current state of the art, which lies in the inability of models to adapt to changes in the power grid, such as loss of sensors and branch switching. Our experiments demonstrate that graph neural networks are the most promising models for this use case and that their perf… ▽ More This paper addresses the challenge of neural state estimation in power distribution systems. We identified a research gap in the current state of the art, which lies in the inability of models to adapt to changes in the power grid, such as loss of sensors and branch switching. Our experiments demonstrate that graph neural networks are the most promising models for this use case and that their performance can degrade with scale. We propose augmentations to remedy this issue and perform a comprehensive grid search of different model configurations for common zero-shot learning scenarios in neural state estimation. △ Less

Submitted 11 August, 2024; originally announced August 2024.

Comments: 13 pages, 2 figures, associated source code available at https://gitlab.com/transense/nse-tl-paper

arXiv:2408.04309 [pdf, other]

TheGlueNote: Learned Representations for Robust and Flexible Note Alignment

Authors: Silvan David Peter, Gerhard Widmer

Abstract: Note alignment refers to the task of matching individual notes of two versions of the same symbolically encoded piece. Methods addressing this task commonly rely on sequence alignment algorithms such as Hidden Markov Models or Dynamic Time Warping (DTW) applied directly to note or onset sequences. While successful in many cases, such methods struggle with large mismatches between the versions. In… ▽ More Note alignment refers to the task of matching individual notes of two versions of the same symbolically encoded piece. Methods addressing this task commonly rely on sequence alignment algorithms such as Hidden Markov Models or Dynamic Time Warping (DTW) applied directly to note or onset sequences. While successful in many cases, such methods struggle with large mismatches between the versions. In this work, we learn note-wise representations from data augmented with various complex mismatch cases, e.g. repeats, skips, block insertions, and long trills. At the heart of our approach lies a transformer encoder network - TheGlueNote - which predicts pairwise note similarities for two 512 note subsequences. We postprocess the predicted similarities using flavors of weightedDTW and pitch-separated onsetDTW to retrieve note matches for two sequences of arbitrary length. Our approach performs on par with the state of the art in terms of note alignment accuracy, is considerably more robust to version mismatches, and works directly on any pair of MIDI files. △ Less

Submitted 8 August, 2024; originally announced August 2024.

Comments: to be published in Proceedings of the 25th International Society for Music Information Retrieval Conference (ISMIR), 2024

arXiv:2401.00471 [pdf, other]

doi 10.1145/3625135.3625141

Sounding Out Reconstruction Error-Based Evaluation of Generative Models of Expressive Performance

Authors: Silvan David Peter, Carlos Eduardo Cancino-Chacón, Emmanouil Karystinaios, Gerhard Widmer

Abstract: Generative models of expressive piano performance are usually assessed by comparing their predictions to a reference human performance. A generative algorithm is taken to be better than competing ones if it produces performances that are closer to a human reference performance. However, expert human performers can (and do) interpret music in different ways, making for different possible references… ▽ More Generative models of expressive piano performance are usually assessed by comparing their predictions to a reference human performance. A generative algorithm is taken to be better than competing ones if it produces performances that are closer to a human reference performance. However, expert human performers can (and do) interpret music in different ways, making for different possible references, and quantitative closeness is not necessarily aligned with perceptual similarity, raising concerns about the validity of this evaluation approach. In this work, we present a number of experiments that shed light on this problem. Using precisely measured high-quality performances of classical piano music, we carry out a listening test indicating that listeners can sometimes perceive subtle performance difference that go unnoticed under quantitative evaluation. We further present tests that indicate that such evaluation frameworks show a lot of variability in reliability and validity across different reference performances and pieces. We discuss these results and their implications for quantitative evaluation, and hope to foster a critical appreciation of the uncertainties involved in quantitative assessments of such performances within the wider music information retrieval (MIR) community. △ Less

Submitted 31 December, 2023; originally announced January 2024.

Journal ref: 10th International Conference on Digital Libraries for Musicology, November 10, 2023, Milan, Italy

arXiv:2401.00466 [pdf, other]

doi 10.5281/zenodo.10265367

Online Symbolic Music Alignment with Offline Reinforcement Learning

Authors: Silvan David Peter

Abstract: Symbolic Music Alignment is the process of matching performed MIDI notes to corresponding score notes. In this paper, we introduce a reinforcement learning (RL)-based online symbolic music alignment technique. The RL agent - an attention-based neural network - iteratively estimates the current score position from local score and performance contexts. For this symbolic alignment task, environment s… ▽ More Symbolic Music Alignment is the process of matching performed MIDI notes to corresponding score notes. In this paper, we introduce a reinforcement learning (RL)-based online symbolic music alignment technique. The RL agent - an attention-based neural network - iteratively estimates the current score position from local score and performance contexts. For this symbolic alignment task, environment states can be sampled exhaustively and the reward is dense, rendering a formulation as a simplified offline RL problem straightforward. We evaluate the trained agent in three ways. First, in its capacity to identify correct score positions for sampled test contexts; second, as the core technique of a complete algorithm for symbolic online note-wise alignment; and finally, as a real-time symbolic score follower. We further investigate the pitch-based score and performance representations used as the agent's inputs. To this end, we develop a second model, a two-step Dynamic Time Warping (DTW)-based offline alignment algorithm leveraging the same input representation. The proposed model outperforms a state-of-the-art reference model of offline symbolic music alignment. △ Less

Submitted 31 December, 2023; originally announced January 2024.

Journal ref: Proceedings of the 24th International Society for Music Information Retrieval Conference, {ISMIR} 2023, Milan, Italy, November 5-9, 2023

arXiv:2305.09489 [pdf, other]

Discrete Diffusion Probabilistic Models for Symbolic Music Generation

Authors: Matthias Plasser, Silvan Peter, Gerhard Widmer

Abstract: Denoising Diffusion Probabilistic Models (DDPMs) have made great strides in generating high-quality samples in both discrete and continuous domains. However, Discrete DDPMs (D3PMs) have yet to be applied to the domain of Symbolic Music. This work presents the direct generation of Polyphonic Symbolic Music using D3PMs. Our model exhibits state-of-the-art sample quality, according to current quantit… ▽ More Denoising Diffusion Probabilistic Models (DDPMs) have made great strides in generating high-quality samples in both discrete and continuous domains. However, Discrete DDPMs (D3PMs) have yet to be applied to the domain of Symbolic Music. This work presents the direct generation of Polyphonic Symbolic Music using D3PMs. Our model exhibits state-of-the-art sample quality, according to current quantitative evaluation metrics, and allows for flexible infilling at the note level. We further show, that our models are accessible to post-hoc classifier guidance, widening the scope of possible applications. However, we also cast a critical view on quantitative evaluation of music sample quality via statistical metrics, and present a simple algorithm that can confound our metrics with completely spurious, non-musical samples. △ Less

Submitted 16 May, 2023; originally announced May 2023.

Comments: In Proceedings of the 32nd International Joint Conference on Artificial Intelligence (IJCAI-23), Macau, China

arXiv:2304.12939 [pdf, other]

The ACCompanion: Combining Reactivity, Robustness, and Musical Expressivity in an Automatic Piano Accompanist

Authors: Carlos Cancino-Chacón, Silvan Peter, Patricia Hu, Emmanouil Karystinaios, Florian Henkel, Francesco Foscarin, Nimrod Varga, Gerhard Widmer

Abstract: This paper introduces the ACCompanion, an expressive accompaniment system. Similarly to a musician who accompanies a soloist playing a given musical piece, our system can produce a human-like rendition of the accompaniment part that follows the soloist's choices in terms of tempo, dynamics, and articulation. The ACCompanion works in the symbolic domain, i.e., it needs a musical instrument capable… ▽ More This paper introduces the ACCompanion, an expressive accompaniment system. Similarly to a musician who accompanies a soloist playing a given musical piece, our system can produce a human-like rendition of the accompaniment part that follows the soloist's choices in terms of tempo, dynamics, and articulation. The ACCompanion works in the symbolic domain, i.e., it needs a musical instrument capable of producing and playing MIDI data, with explicitly encoded onset, offset, and pitch for each played note. We describe the components that go into such a system, from real-time score following and prediction to expressive performance generation and online adaptation to the expressive choices of the human player. Based on our experience with repeated live demonstrations in front of various audiences, we offer an analysis of the challenges of combining these components into a system that is highly reactive and precise, while still a reliable musical partner, robust to possible performance errors and responsive to expressive variations. △ Less

Submitted 30 May, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

Comments: In Proceedings of the 32nd International Joint Conference on Artificial Intelligence (IJCAI-23), Macao, China. The differences/extensions with the previous version include a technical appendix, added missing links, and minor text updates. 10 pages, 4 figures

arXiv:2206.01104 [pdf, other]

The match file format: Encoding Alignments between Scores and Performances

Authors: Francesco Foscarin, Emmanouil Karystinaios, Silvan David Peter, Carlos Cancino-Chacón, Maarten Grachten, Gerhard Widmer

Abstract: This paper presents the specifications of match: a file format that extends a MIDI human performance with note-, beat-, and downbeat-level alignments to a corresponding musical score. This enables advanced analyses of the performance that are relevant for various tasks, such as expressive performance modeling, score following, music transcription, and performer classification. The match file inclu… ▽ More This paper presents the specifications of match: a file format that extends a MIDI human performance with note-, beat-, and downbeat-level alignments to a corresponding musical score. This enables advanced analyses of the performance that are relevant for various tasks, such as expressive performance modeling, score following, music transcription, and performer classification. The match file includes a set of score-related descriptors that makes it usable also as a bare-bones score representation. For applications that require the use of structural score elements (e.g., voices, parts, beams, slurs), the match file can be easily combined with the symbolic score. To support the practical application of our work, we release a corrected and upgraded version of the Vienna4x22 dataset of scores and performances aligned with match files. △ Less

Submitted 2 June, 2022; originally announced June 2022.

Journal ref: Proceedings of the Music Encoding Conference (MEC), 2022, Halifax, Canada

arXiv:2206.01071 [pdf, other]

Partitura: A Python Package for Symbolic Music Processing

Authors: Carlos Cancino-Chacón, Silvan David Peter, Emmanouil Karystinaios, Francesco Foscarin, Maarten Grachten, Gerhard Widmer

Abstract: Partitura is a lightweight Python package for handling symbolic musical information. It provides easy access to features commonly used in music information retrieval tasks, like note arrays (lists of timed pitched events) and 2D piano roll matrices, as well as other score elements such as time and key signatures, performance directives, and repeat structures. Partitura can load musical scores (in… ▽ More Partitura is a lightweight Python package for handling symbolic musical information. It provides easy access to features commonly used in music information retrieval tasks, like note arrays (lists of timed pitched events) and 2D piano roll matrices, as well as other score elements such as time and key signatures, performance directives, and repeat structures. Partitura can load musical scores (in MEI, MusicXML, Kern, and MIDI formats), MIDI performances, and score-to-performance alignments. The package includes some tools for music analysis, such as automatic pitch spelling, key signature identification, and voice separation. Partitura is an open-source project and is available at https://github.com/CPJKU/partitura/. △ Less

Submitted 2 June, 2022; originally announced June 2022.

Journal ref: Proceedings of the Music Encoding Conference (MEC), 2022, Halifax, Canada

arXiv:2204.09621 [pdf, other]

Extraction of Unaliased High-Frequency Micro-Doppler Signature using FMCW radar

Authors: Soorya Peter, Vinod Veera Reddy

Abstract: Micro-Doppler signature is a potent feature that has been used for target identification and micro-motion parameter estimation. The extraction of high frequency micro-Doppler signature from frequency modulated continuous wave (FMCW) radar along with the target range and velocity is the problem considered in this article. The severe aliasing of the high micro-Doppler frequency spread is circumvente… ▽ More Micro-Doppler signature is a potent feature that has been used for target identification and micro-motion parameter estimation. The extraction of high frequency micro-Doppler signature from frequency modulated continuous wave (FMCW) radar along with the target range and velocity is the problem considered in this article. The severe aliasing of the high micro-Doppler frequency spread is circumvented by the fast time processing in the proposed method. The use of range-Doppler (RD) filtering and empirical mode decomposition (EMD) enables effective out-of-band and in-band noise suppression. Simulation studies and experimental results present the effectiveness of the proposed approach. △ Less

Submitted 20 April, 2022; originally announced April 2022.

arXiv:2011.08520 [pdf, other]

doi 10.1016/j.ymssp.2020.106796

Experimental assessment of polynomial nonlinear state-space and nonlinear-mode models for near-resonant vibrations

Authors: Maren Scheel, Gleb Kleyman, Ali Tatar, Matthew R. W. Brake, Simon Peter, Jean-Philippe Noël, Matthew S. Allen, Malte Krack

Abstract: In the present paper, two existing nonlinear system identification methodologies are used to identify data-driven models. The first methodology focuses on identifying the system using steady-state excitations. To accomplish this, a phase-locked loop controller is implemented to acquire periodic oscillations near resonance and construct a nonlinear-mode model. This model is based on amplitude-depen… ▽ More In the present paper, two existing nonlinear system identification methodologies are used to identify data-driven models. The first methodology focuses on identifying the system using steady-state excitations. To accomplish this, a phase-locked loop controller is implemented to acquire periodic oscillations near resonance and construct a nonlinear-mode model. This model is based on amplitude-dependent modal properties, i.e. does not require nonlinear basis functions. The second methodology exploits uncontrolled experiments with broadband random inputs to build polynomial nonlinear state-space models using advanced system identification tools. The methods are applied to two experimental test rigs, a magnetic cantilever beam and a free-free beam with a lap joint. The respective models of both methods and both specimens are then challenged to predict dynamic, near-resonant behavior observed under different sine and sine-sweep excitations. The vibration prediction of the nonlinear-mode and state-space models clearly highlight the capabilities and limitations of the models. The nonlinear-mode model, by design, yields a perfect match at resonance peaks and high accuracy in close vicinity. However, it is limited to well-spaced modes and sinusoidal excitation. The state-space model covers a wider dynamic range, including transient excitations. However, the real-life nonlinearities considered in this study can only be approximated by polynomial basis functions. Consequently, the identified state-space models are found to be highly input-dependent, in particular for sinusoidal excitations where they are found to lead to a low predictive capability. △ Less

Submitted 17 November, 2020; originally announced November 2020.

Comments: The final version of this article is available online at http://doi.org/10.1016/j.ymssp.2020.106796

Journal ref: Mechanical Systems and Signal Processing 143 (2020) 106796

arXiv:2011.08500 [pdf, other]

doi 10.1016/j.jsv.2018.07.010

A Phase Resonance Approach for Modal Testing of Structures with Nonlinear Dissipation

Authors: Maren Scheel, Simon Peter, Remco I. Leine, Malte Krack

Abstract: The concept of nonlinear modes is useful for the dynamical characterization of nonlinear mechanical systems. While efficient and broadly applicable methods are now available for the computation of nonlinear modes, nonlinear modal testing is still in its infancy. The purpose of this work is to overcome its present limitation to conservative nonlinearities. Our approach relies on the recently extend… ▽ More The concept of nonlinear modes is useful for the dynamical characterization of nonlinear mechanical systems. While efficient and broadly applicable methods are now available for the computation of nonlinear modes, nonlinear modal testing is still in its infancy. The purpose of this work is to overcome its present limitation to conservative nonlinearities. Our approach relies on the recently extended periodic motion concept, according to which nonlinear modes of damped systems are defined as family of periodic motions induced by an appropriate artificial excitation that compensates the natural dissipation. The particularly simple experimental implementation with only a single-point, single-frequency, phase resonant forcing is analyzed in detail. The method permits the experimental extraction of natural frequencies, modal damping ratios and deflection shapes (including harmonics), for each mode of interest, as function of the vibration level. The accuracy, robustness and current limitations of the method are first demonstrated numerically. The method is then verified experimentally for a friction-damped system. Moreover, a self-contained measure for estimating the quality of the extracted modal properties is investigated. The primary advantages over alternative vibration testing methods are noise robustness, broad applicability and short measurement duration. The central limitation of the identified modal quantities is that they only characterize the system in the regime near isolated resonances. △ Less

Submitted 17 November, 2020; originally announced November 2020.

Comments: The final version of this article is available online at http://doi.org/10.1016/j.jsv.2018.07.010

Journal ref: Journal of Sound and Vibration 435 (2018) 56-73

arXiv:2008.02194 [pdf, other]

On the Characterization of Expressive Performance in Classical Music: First Results of the Con Espressione Game

Authors: Carlos Cancino-Chacón, Silvan Peter, Shreyan Chowdhury, Anna Aljanaki, Gerhard Widmer

Abstract: A piece of music can be expressively performed, or interpreted, in a variety of ways. With the help of an online questionnaire, the Con Espressione Game, we collected some 1,500 descriptions of expressive character relating to 45 performances of 9 excerpts from classical piano pieces, played by different famous pianists. More specifically, listeners were asked to describe, using freely chosen word… ▽ More A piece of music can be expressively performed, or interpreted, in a variety of ways. With the help of an online questionnaire, the Con Espressione Game, we collected some 1,500 descriptions of expressive character relating to 45 performances of 9 excerpts from classical piano pieces, played by different famous pianists. More specifically, listeners were asked to describe, using freely chosen words (preferably: adjectives), how they perceive the expressive character of the different performances. In this paper, we offer a first account of this new data resource for expressive performance research, and provide an exploratory analysis, addressing three main questions: (1) how similarly do different listeners describe a performance of a piece? (2) what are the main dimensions (or axes) for expressive character emerging from this?; and (3) how do measurable parameters of a performance (e.g., tempo, dynamics) and mid- and high-level features that can be predicted by machine learning models (e.g., articulation, arousal) relate to these expressive dimensions? The dataset that we publish along with this paper was enriched by adding hand-corrected score-to-performance alignments, as well as descriptive audio features such as tempo and dynamics curves. △ Less

Submitted 5 August, 2020; originally announced August 2020.

Comments: 8 pages, 2 figures, accepted for the 21st International Society for Music Information Retrieval Conference (ISMIR 2020)

Showing 1–12 of 12 results for author: Peter, S