-
On zero-shot learning in neural state estimation of power distribution systems
Authors:
Aleksandr Berezin,
Stephan Balduin,
Thomas Oberließen,
Sebastian Peter,
Eric MSP Veith
Abstract:
This paper addresses the challenge of neural state estimation in power distribution systems. We identified a research gap in the current state of the art, which lies in the inability of models to adapt to changes in the power grid, such as loss of sensors and branch switching. Our experiments demonstrate that graph neural networks are the most promising models for this use case and that their perf…
▽ More
This paper addresses the challenge of neural state estimation in power distribution systems. We identified a research gap in the current state of the art, which lies in the inability of models to adapt to changes in the power grid, such as loss of sensors and branch switching. Our experiments demonstrate that graph neural networks are the most promising models for this use case and that their performance can degrade with scale. We propose augmentations to remedy this issue and perform a comprehensive grid search of different model configurations for common zero-shot learning scenarios in neural state estimation.
△ Less
Submitted 11 August, 2024;
originally announced August 2024.
-
TheGlueNote: Learned Representations for Robust and Flexible Note Alignment
Authors:
Silvan David Peter,
Gerhard Widmer
Abstract:
Note alignment refers to the task of matching individual notes of two versions of the same symbolically encoded piece. Methods addressing this task commonly rely on sequence alignment algorithms such as Hidden Markov Models or Dynamic Time Warping (DTW) applied directly to note or onset sequences. While successful in many cases, such methods struggle with large mismatches between the versions. In…
▽ More
Note alignment refers to the task of matching individual notes of two versions of the same symbolically encoded piece. Methods addressing this task commonly rely on sequence alignment algorithms such as Hidden Markov Models or Dynamic Time Warping (DTW) applied directly to note or onset sequences. While successful in many cases, such methods struggle with large mismatches between the versions. In this work, we learn note-wise representations from data augmented with various complex mismatch cases, e.g. repeats, skips, block insertions, and long trills. At the heart of our approach lies a transformer encoder network - TheGlueNote - which predicts pairwise note similarities for two 512 note subsequences. We postprocess the predicted similarities using flavors of weightedDTW and pitch-separated onsetDTW to retrieve note matches for two sequences of arbitrary length. Our approach performs on par with the state of the art in terms of note alignment accuracy, is considerably more robust to version mismatches, and works directly on any pair of MIDI files.
△ Less
Submitted 8 August, 2024;
originally announced August 2024.
-
Sounding Out Reconstruction Error-Based Evaluation of Generative Models of Expressive Performance
Authors:
Silvan David Peter,
Carlos Eduardo Cancino-Chacón,
Emmanouil Karystinaios,
Gerhard Widmer
Abstract:
Generative models of expressive piano performance are usually assessed by comparing their predictions to a reference human performance. A generative algorithm is taken to be better than competing ones if it produces performances that are closer to a human reference performance. However, expert human performers can (and do) interpret music in different ways, making for different possible references…
▽ More
Generative models of expressive piano performance are usually assessed by comparing their predictions to a reference human performance. A generative algorithm is taken to be better than competing ones if it produces performances that are closer to a human reference performance. However, expert human performers can (and do) interpret music in different ways, making for different possible references, and quantitative closeness is not necessarily aligned with perceptual similarity, raising concerns about the validity of this evaluation approach. In this work, we present a number of experiments that shed light on this problem. Using precisely measured high-quality performances of classical piano music, we carry out a listening test indicating that listeners can sometimes perceive subtle performance difference that go unnoticed under quantitative evaluation. We further present tests that indicate that such evaluation frameworks show a lot of variability in reliability and validity across different reference performances and pieces. We discuss these results and their implications for quantitative evaluation, and hope to foster a critical appreciation of the uncertainties involved in quantitative assessments of such performances within the wider music information retrieval (MIR) community.
△ Less
Submitted 31 December, 2023;
originally announced January 2024.
-
Online Symbolic Music Alignment with Offline Reinforcement Learning
Authors:
Silvan David Peter
Abstract:
Symbolic Music Alignment is the process of matching performed MIDI notes to corresponding score notes. In this paper, we introduce a reinforcement learning (RL)-based online symbolic music alignment technique. The RL agent - an attention-based neural network - iteratively estimates the current score position from local score and performance contexts. For this symbolic alignment task, environment s…
▽ More
Symbolic Music Alignment is the process of matching performed MIDI notes to corresponding score notes. In this paper, we introduce a reinforcement learning (RL)-based online symbolic music alignment technique. The RL agent - an attention-based neural network - iteratively estimates the current score position from local score and performance contexts. For this symbolic alignment task, environment states can be sampled exhaustively and the reward is dense, rendering a formulation as a simplified offline RL problem straightforward. We evaluate the trained agent in three ways. First, in its capacity to identify correct score positions for sampled test contexts; second, as the core technique of a complete algorithm for symbolic online note-wise alignment; and finally, as a real-time symbolic score follower. We further investigate the pitch-based score and performance representations used as the agent's inputs. To this end, we develop a second model, a two-step Dynamic Time Warping (DTW)-based offline alignment algorithm leveraging the same input representation. The proposed model outperforms a state-of-the-art reference model of offline symbolic music alignment.
△ Less
Submitted 31 December, 2023;
originally announced January 2024.
-
Discrete Diffusion Probabilistic Models for Symbolic Music Generation
Authors:
Matthias Plasser,
Silvan Peter,
Gerhard Widmer
Abstract:
Denoising Diffusion Probabilistic Models (DDPMs) have made great strides in generating high-quality samples in both discrete and continuous domains. However, Discrete DDPMs (D3PMs) have yet to be applied to the domain of Symbolic Music. This work presents the direct generation of Polyphonic Symbolic Music using D3PMs. Our model exhibits state-of-the-art sample quality, according to current quantit…
▽ More
Denoising Diffusion Probabilistic Models (DDPMs) have made great strides in generating high-quality samples in both discrete and continuous domains. However, Discrete DDPMs (D3PMs) have yet to be applied to the domain of Symbolic Music. This work presents the direct generation of Polyphonic Symbolic Music using D3PMs. Our model exhibits state-of-the-art sample quality, according to current quantitative evaluation metrics, and allows for flexible infilling at the note level. We further show, that our models are accessible to post-hoc classifier guidance, widening the scope of possible applications. However, we also cast a critical view on quantitative evaluation of music sample quality via statistical metrics, and present a simple algorithm that can confound our metrics with completely spurious, non-musical samples.
△ Less
Submitted 16 May, 2023;
originally announced May 2023.
-
The ACCompanion: Combining Reactivity, Robustness, and Musical Expressivity in an Automatic Piano Accompanist
Authors:
Carlos Cancino-Chacón,
Silvan Peter,
Patricia Hu,
Emmanouil Karystinaios,
Florian Henkel,
Francesco Foscarin,
Nimrod Varga,
Gerhard Widmer
Abstract:
This paper introduces the ACCompanion, an expressive accompaniment system. Similarly to a musician who accompanies a soloist playing a given musical piece, our system can produce a human-like rendition of the accompaniment part that follows the soloist's choices in terms of tempo, dynamics, and articulation. The ACCompanion works in the symbolic domain, i.e., it needs a musical instrument capable…
▽ More
This paper introduces the ACCompanion, an expressive accompaniment system. Similarly to a musician who accompanies a soloist playing a given musical piece, our system can produce a human-like rendition of the accompaniment part that follows the soloist's choices in terms of tempo, dynamics, and articulation. The ACCompanion works in the symbolic domain, i.e., it needs a musical instrument capable of producing and playing MIDI data, with explicitly encoded onset, offset, and pitch for each played note. We describe the components that go into such a system, from real-time score following and prediction to expressive performance generation and online adaptation to the expressive choices of the human player. Based on our experience with repeated live demonstrations in front of various audiences, we offer an analysis of the challenges of combining these components into a system that is highly reactive and precise, while still a reliable musical partner, robust to possible performance errors and responsive to expressive variations.
△ Less
Submitted 30 May, 2023; v1 submitted 24 April, 2023;
originally announced April 2023.
-
The match file format: Encoding Alignments between Scores and Performances
Authors:
Francesco Foscarin,
Emmanouil Karystinaios,
Silvan David Peter,
Carlos Cancino-Chacón,
Maarten Grachten,
Gerhard Widmer
Abstract:
This paper presents the specifications of match: a file format that extends a MIDI human performance with note-, beat-, and downbeat-level alignments to a corresponding musical score. This enables advanced analyses of the performance that are relevant for various tasks, such as expressive performance modeling, score following, music transcription, and performer classification. The match file inclu…
▽ More
This paper presents the specifications of match: a file format that extends a MIDI human performance with note-, beat-, and downbeat-level alignments to a corresponding musical score. This enables advanced analyses of the performance that are relevant for various tasks, such as expressive performance modeling, score following, music transcription, and performer classification. The match file includes a set of score-related descriptors that makes it usable also as a bare-bones score representation. For applications that require the use of structural score elements (e.g., voices, parts, beams, slurs), the match file can be easily combined with the symbolic score. To support the practical application of our work, we release a corrected and upgraded version of the Vienna4x22 dataset of scores and performances aligned with match files.
△ Less
Submitted 2 June, 2022;
originally announced June 2022.
-
Partitura: A Python Package for Symbolic Music Processing
Authors:
Carlos Cancino-Chacón,
Silvan David Peter,
Emmanouil Karystinaios,
Francesco Foscarin,
Maarten Grachten,
Gerhard Widmer
Abstract:
Partitura is a lightweight Python package for handling symbolic musical information. It provides easy access to features commonly used in music information retrieval tasks, like note arrays (lists of timed pitched events) and 2D piano roll matrices, as well as other score elements such as time and key signatures, performance directives, and repeat structures. Partitura can load musical scores (in…
▽ More
Partitura is a lightweight Python package for handling symbolic musical information. It provides easy access to features commonly used in music information retrieval tasks, like note arrays (lists of timed pitched events) and 2D piano roll matrices, as well as other score elements such as time and key signatures, performance directives, and repeat structures. Partitura can load musical scores (in MEI, MusicXML, Kern, and MIDI formats), MIDI performances, and score-to-performance alignments. The package includes some tools for music analysis, such as automatic pitch spelling, key signature identification, and voice separation. Partitura is an open-source project and is available at https://github.com/CPJKU/partitura/.
△ Less
Submitted 2 June, 2022;
originally announced June 2022.
-
Extraction of Unaliased High-Frequency Micro-Doppler Signature using FMCW radar
Authors:
Soorya Peter,
Vinod Veera Reddy
Abstract:
Micro-Doppler signature is a potent feature that has been used for target identification and micro-motion parameter estimation. The extraction of high frequency micro-Doppler signature from frequency modulated continuous wave (FMCW) radar along with the target range and velocity is the problem considered in this article. The severe aliasing of the high micro-Doppler frequency spread is circumvente…
▽ More
Micro-Doppler signature is a potent feature that has been used for target identification and micro-motion parameter estimation. The extraction of high frequency micro-Doppler signature from frequency modulated continuous wave (FMCW) radar along with the target range and velocity is the problem considered in this article. The severe aliasing of the high micro-Doppler frequency spread is circumvented by the fast time processing in the proposed method. The use of range-Doppler (RD) filtering and empirical mode decomposition (EMD) enables effective out-of-band and in-band noise suppression. Simulation studies and experimental results present the effectiveness of the proposed approach.
△ Less
Submitted 20 April, 2022;
originally announced April 2022.
-
Experimental assessment of polynomial nonlinear state-space and nonlinear-mode models for near-resonant vibrations
Authors:
Maren Scheel,
Gleb Kleyman,
Ali Tatar,
Matthew R. W. Brake,
Simon Peter,
Jean-Philippe Noël,
Matthew S. Allen,
Malte Krack
Abstract:
In the present paper, two existing nonlinear system identification methodologies are used to identify data-driven models. The first methodology focuses on identifying the system using steady-state excitations. To accomplish this, a phase-locked loop controller is implemented to acquire periodic oscillations near resonance and construct a nonlinear-mode model. This model is based on amplitude-depen…
▽ More
In the present paper, two existing nonlinear system identification methodologies are used to identify data-driven models. The first methodology focuses on identifying the system using steady-state excitations. To accomplish this, a phase-locked loop controller is implemented to acquire periodic oscillations near resonance and construct a nonlinear-mode model. This model is based on amplitude-dependent modal properties, i.e. does not require nonlinear basis functions. The second methodology exploits uncontrolled experiments with broadband random inputs to build polynomial nonlinear state-space models using advanced system identification tools. The methods are applied to two experimental test rigs, a magnetic cantilever beam and a free-free beam with a lap joint. The respective models of both methods and both specimens are then challenged to predict dynamic, near-resonant behavior observed under different sine and sine-sweep excitations. The vibration prediction of the nonlinear-mode and state-space models clearly highlight the capabilities and limitations of the models. The nonlinear-mode model, by design, yields a perfect match at resonance peaks and high accuracy in close vicinity. However, it is limited to well-spaced modes and sinusoidal excitation. The state-space model covers a wider dynamic range, including transient excitations. However, the real-life nonlinearities considered in this study can only be approximated by polynomial basis functions. Consequently, the identified state-space models are found to be highly input-dependent, in particular for sinusoidal excitations where they are found to lead to a low predictive capability.
△ Less
Submitted 17 November, 2020;
originally announced November 2020.
-
A Phase Resonance Approach for Modal Testing of Structures with Nonlinear Dissipation
Authors:
Maren Scheel,
Simon Peter,
Remco I. Leine,
Malte Krack
Abstract:
The concept of nonlinear modes is useful for the dynamical characterization of nonlinear mechanical systems. While efficient and broadly applicable methods are now available for the computation of nonlinear modes, nonlinear modal testing is still in its infancy. The purpose of this work is to overcome its present limitation to conservative nonlinearities. Our approach relies on the recently extend…
▽ More
The concept of nonlinear modes is useful for the dynamical characterization of nonlinear mechanical systems. While efficient and broadly applicable methods are now available for the computation of nonlinear modes, nonlinear modal testing is still in its infancy. The purpose of this work is to overcome its present limitation to conservative nonlinearities. Our approach relies on the recently extended periodic motion concept, according to which nonlinear modes of damped systems are defined as family of periodic motions induced by an appropriate artificial excitation that compensates the natural dissipation. The particularly simple experimental implementation with only a single-point, single-frequency, phase resonant forcing is analyzed in detail. The method permits the experimental extraction of natural frequencies, modal damping ratios and deflection shapes (including harmonics), for each mode of interest, as function of the vibration level. The accuracy, robustness and current limitations of the method are first demonstrated numerically. The method is then verified experimentally for a friction-damped system. Moreover, a self-contained measure for estimating the quality of the extracted modal properties is investigated. The primary advantages over alternative vibration testing methods are noise robustness, broad applicability and short measurement duration. The central limitation of the identified modal quantities is that they only characterize the system in the regime near isolated resonances.
△ Less
Submitted 17 November, 2020;
originally announced November 2020.
-
On the Characterization of Expressive Performance in Classical Music: First Results of the Con Espressione Game
Authors:
Carlos Cancino-Chacón,
Silvan Peter,
Shreyan Chowdhury,
Anna Aljanaki,
Gerhard Widmer
Abstract:
A piece of music can be expressively performed, or interpreted, in a variety of ways. With the help of an online questionnaire, the Con Espressione Game, we collected some 1,500 descriptions of expressive character relating to 45 performances of 9 excerpts from classical piano pieces, played by different famous pianists. More specifically, listeners were asked to describe, using freely chosen word…
▽ More
A piece of music can be expressively performed, or interpreted, in a variety of ways. With the help of an online questionnaire, the Con Espressione Game, we collected some 1,500 descriptions of expressive character relating to 45 performances of 9 excerpts from classical piano pieces, played by different famous pianists. More specifically, listeners were asked to describe, using freely chosen words (preferably: adjectives), how they perceive the expressive character of the different performances. In this paper, we offer a first account of this new data resource for expressive performance research, and provide an exploratory analysis, addressing three main questions: (1) how similarly do different listeners describe a performance of a piece? (2) what are the main dimensions (or axes) for expressive character emerging from this?; and (3) how do measurable parameters of a performance (e.g., tempo, dynamics) and mid- and high-level features that can be predicted by machine learning models (e.g., articulation, arousal) relate to these expressive dimensions? The dataset that we publish along with this paper was enriched by adding hand-corrected score-to-performance alignments, as well as descriptive audio features such as tempo and dynamics curves.
△ Less
Submitted 5 August, 2020;
originally announced August 2020.