
Showing 1–20 of 20 results for author: Martin, C P

Searching in archive cs.
  1. Tonal Cognition in Sonification: Exploring the Needs of Practitioners in Sonic Interaction Design

    Authors: Minsik Choi, Josh Andres, Charles Patrick Martin

    Abstract: Research into tonal music examines the structural relationships among sounds and how they align with our auditory perception. The exploration of integrating tonal cognition into sonic interaction design, particularly for practitioners lacking extensive musical knowledge, and developing an accessible software tool, remains limited. We report on a study of designers to understand the sound creation…

    Submitted 30 August, 2024; originally announced August 2024.

    Comments: To be published in: Proceedings of the 19th Audio Mostly Conference: A Conference on Explorations in Sonic Cultures, Milan, Italy, 2024

  2. arXiv:2405.15338  [pdf, other]

    cs.SD eess.AS

    SoundLoCD: An Efficient Conditional Discrete Contrastive Latent Diffusion Model for Text-to-Sound Generation

    Authors: Xinlei Niu, Jing Zhang, Christian Walder, Charles Patrick Martin

    Abstract: We present SoundLoCD, a novel text-to-sound generation framework, which incorporates a LoRA-based conditional discrete contrastive latent diffusion model. Unlike recent large-scale sound generation models, our model can be efficiently trained under limited computational resources. The integration of a contrastive learning strategy further enhances the connection between text conditions and the gen…

    Submitted 24 May, 2024; originally announced May 2024.

  3. arXiv:2404.15637  [pdf, other]

    cs.SD cs.MM eess.AS

    HybridVC: Efficient Voice Style Conversion with Text and Audio Prompts

    Authors: Xinlei Niu, Jing Zhang, Charles Patrick Martin

    Abstract: We introduce HybridVC, a voice conversion (VC) framework built upon a pre-trained conditional variational autoencoder (CVAE) that combines the strengths of a latent model with contrastive learning. HybridVC supports text and audio prompts, enabling more flexible voice style conversion. HybridVC models a latent distribution conditioned on speaker embeddings acquired by a pretrained speaker encoder…

    Submitted 24 April, 2024; originally announced April 2024.

  4. arXiv:2306.02568  [pdf, other]

    stat.ML cs.LG

    Latent Optimal Paths by Gumbel Propagation for Variational Bayesian Dynamic Programming

    Authors: Xinlei Niu, Christian Walder, Jing Zhang, Charles Patrick Martin

    Abstract: We propose the stochastic optimal path which solves the classical optimal path problem by a probability-softening solution. This unified approach transforms a wide range of DP problems into directed acyclic graphs in which all paths follow a Gibbs distribution. We show the equivalence of the Gibbs distribution to a message-passing algorithm by the properties of the Gumbel distribution and give all…

    Submitted 25 June, 2024; v1 submitted 4 June, 2023; originally announced June 2023.

    Comments: Accepted by ICML 2024

  5. arXiv:2210.09291  [pdf, other]

    cs.HC

    Embodying the Glitch: Perspectives on Generative AI in Dance Practice

    Authors: Benedikte Wallace, Charles P. Martin

    Abstract: What role does the break from realism play in the potential for generative artificial intelligence as a creative tool? Through exploration of glitch, we examine the prospective value of these artefacts in creative practice. This paper describes findings from an exploration of AI-generated "mistakes" when using movement produced by a generative deep learning model as an inspiration source in dance…

    Submitted 5 October, 2022; originally announced October 2022.

  6. Composing an Ensemble Standstill Work for Myo and Bela

    Authors: Charles Patrick Martin, Alexander Refsum Jensenius, Jim Torresen

    Abstract: This paper describes the process of developing a standstill performance work using the Myo gesture control armband and the Bela embedded computing platform. The combination of Myo and Bela allows a portable and extensible version of the standstill performance concept while introducing muscle tension as an additional control parameter. We describe the technical details of our setup and introduce My…

    Submitted 4 December, 2020; originally announced December 2020.

    ACM Class: H.5.5

    Journal ref: Proceedings of the International Conference on New Interfaces for Musical Expression, 2018, pp. 196-197

  7. arXiv:2012.02322  [pdf, other]

    cs.HC cs.SD eess.AS

    A Laptop Ensemble Performance System using Recurrent Neural Networks

    Authors: Rohan Proctor, Charles Patrick Martin

    Abstract: The popularity of applying machine learning techniques in musical domains has created an inherent availability of freely accessible pre-trained neural network (NN) models ready for use in creative applications. This work outlines the implementation of one such application in the form of an assistance tool designed for live improvisational performances by laptop ensembles. The primary intention was…

    Submitted 3 December, 2020; originally announced December 2020.

    ACM Class: H.5.5; H.5.3

    Journal ref: Proceedings of the International Conference on New Interfaces for Musical Expression, 2020, pp. 43-48

  8. arXiv:2012.02311  [pdf, other]

    cs.HC cs.SD eess.AS

    Sonic Sculpture: Activating Engagement with Head-Mounted Augmented Reality

    Authors: Charles Patrick Martin, Zeruo Liu, Yichen Wang, Wennan He, Henry Gardner

    Abstract: This work examines how head-mounted AR can be used to build an interactive sonic landscape to engage with a public sculpture. We describe a sonic artwork, "Listening To Listening", that has been designed to accompany a real-world sculpture with two prototype interaction schemes. Our artwork is created for the HoloLens platform so that users can have an individual experience in a mixed reality cont…

    Submitted 3 December, 2020; originally announced December 2020.

    ACM Class: H.5.5; H.5.1

    Journal ref: Proceedings of the International Conference on New Interfaces for Musical Expression, 2020, pp. 48-52

  9. arXiv:2011.13453  [pdf, other]

    cs.SD eess.AS

    Towards Movement Generation with Audio Features

    Authors: Benedikte Wallace, Charles P. Martin, Jim Torresen, Kristian Nymoen

    Abstract: Sound and movement are closely coupled, particularly in dance. Certain audio features have been found to affect the way we move to music. Is this relationship between sound and movement something which can be modelled using machine learning? This work presents initial experiments wherein high-level audio features calculated from a set of music pieces are included in a movement generation model tra…

    Submitted 26 November, 2020; originally announced November 2020.

  10. arXiv:2003.13254  [pdf, other]

    cs.RO cs.NE

    Environmental Adaptation of Robot Morphology and Control through Real-world Evolution

    Authors: Tønnes F. Nygaard, Charles P. Martin, David Howard, Jim Torresen, Kyrre Glette

    Abstract: Robots operating in the real world will experience a range of different environments and tasks. It is essential for the robot to have the ability to adapt to its surroundings to work efficiently in changing conditions. Evolutionary robotics aims to solve this by optimizing both the control and body (morphology) of a robot, allowing adaptation to internal, as well as external factors. Most work in…

    Submitted 20 October, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

  11. arXiv:1905.05626  [pdf, other]

    cs.RO

    Lessons Learned from Real-World Experiments with DyRET: the Dynamic Robot for Embodied Testing

    Authors: Tønnes F. Nygaard, Jørgen Nordmoen, Charles P. Martin, Kyrre Glette

    Abstract: Robots are used in more and more complex environments, and are expected to be able to adapt to changes and unknown situations. The easiest and quickest way to adapt is to change the control system of the robot, but for increasingly complex environments one should also change the body of the robot -- its morphology -- to better fit the task at hand. The theory of Embodied Cognition states that cont…

    Submitted 14 May, 2019; originally announced May 2019.

    Comments: Accepted to the Learning Legged Locomotion Workshop @ ICRA 2019

  12. arXiv:1904.05009  [pdf, other]

    cs.SD cs.HC cs.NE eess.AS

    An Interactive Musical Prediction System with Mixture Density Recurrent Neural Networks

    Authors: Charles P Martin, Jim Torresen

    Abstract: This paper is about creating digital musical instruments where a predictive neural network model is integrated into the interactive system. Rather than predicting symbolic music (e.g., MIDI notes), we suggest that predicting future control data from the user and precise temporal information can lead to new and interesting interactive possibilities. We propose that a mixture density recurrent neura…

    Submitted 10 April, 2019; originally announced April 2019.

    Comments: Accepted for presentation at the International Conference on New Interfaces for Musical Expression (NIME), June 2019

  13. arXiv:1902.04403  [pdf, other]

    cs.RO

    Evolving Robots on Easy Mode: Towards a Variable Complexity Controller for Quadrupeds

    Authors: Tønnes Frostad Nygaard, Charles Patrick Martin, Jim Torresen, Kyrre Glette

    Abstract: The complexity of a legged robot's environment or task can inform how specialised its gait must be to ensure success. Evolving specialised robotic gaits demands many evaluations - acceptable for computer simulations, but not for physical robots. For some tasks, a more general gait, with lower optimization costs, could be satisfactory. In this paper, we introduce a new type of gait controller where…

    Submitted 12 February, 2019; originally announced February 2019.

    Comments: Accepted to EvoApplications19

  14. Data Driven Analysis of Tiny Touchscreen Performance with MicroJam

    Authors: Charles P Martin, Jim Torresen

    Abstract: The widespread adoption of mobile devices, such as smartphones and tablets, has made touchscreens a common interface for musical performance. New mobile musical instruments have been designed that embrace collaborative creation and that explore the affordances of mobile devices, as well as their constraints. While these have been investigated from design and user experience perspectives, there is…

    Submitted 2 February, 2019; originally announced February 2019.

    Journal ref: Computer Music Journal, 43(4), 41-57 (2020)

  15. arXiv:1901.07859  [pdf, other]

    cs.LG cs.AI stat.ML

    How do Mixture Density RNNs Predict the Future?

    Authors: Kai Olav Ellefsen, Charles Patrick Martin, Jim Torresen

    Abstract: Gaining a better understanding of how and what machine learning systems learn is important to increase confidence in their decisions and catalyze further research. In this paper, we analyze the predictions made by a specific type of recurrent neural network, mixture density RNNs (MD-RNNs). These networks learn to model predictions as a combination of multiple Gaussian distributions, making them pa…

    Submitted 23 January, 2019; originally announced January 2019.

  16. arXiv:1805.03388  [pdf, other]

    cs.RO

    Real-World Evolution Adapts Robot Morphology and Control to Hardware Limitations

    Authors: Tønnes F. Nygaard, Charles P. Martin, Eivind Samuelsen, Jim Torresen, Kyrre Glette

    Abstract: For robots to handle the numerous factors that can affect them in the real world, they must adapt to changes and unexpected events. Evolutionary robotics tries to solve some of these issues by automatically optimizing a robot for a specific environment. Most of the research in this field, however, uses simplified representations of the robotic system in software simulations. The large gap between…

    Submitted 9 May, 2018; originally announced May 2018.

    Comments: Accepted to the 2018 Genetic and Evolutionary Computation Conference (GECCO)

  17. arXiv:1805.02965  [pdf, other]

    cs.RO

    Exploring Mechanically Self-Reconfiguring Robots for Autonomous Design

    Authors: Tønnes F. Nygaard, Charles P. Martin, Jim Torresen, Kyrre Glette

    Abstract: Evolutionary robotics has aimed to optimize robot control and morphology to produce better and more robust robots. Most previous research only addresses optimization of control, and does this only in simulation. We have developed a four-legged mammal-inspired robot that features a self-reconfiguring morphology. In this paper, we discuss the possibilities opened up by being able to efficiently do e…

    Submitted 8 May, 2018; originally announced May 2018.

    Comments: Accepted to the 2018 ICRA Workshop on Autonomous Robot Design

  18. arXiv:1803.05629  [pdf, other]

    cs.RO

    Self-Modifying Morphology Experiments with DyRET: Dynamic Robot for Embodied Testing

    Authors: Tønnes F. Nygaard, Charles P. Martin, Jim Torresen, Kyrre Glette

    Abstract: If robots are to become ubiquitous, they will need to be able to adapt to complex and dynamic environments. Robots that can adapt their bodies while deployed might be flexible and robust enough to meet this challenge. Previous work on dynamic robot morphology has focused on simulation, combining simple modules, or switching between locomotion modes. Here, we present an alternative approach: a self…

    Submitted 23 July, 2019; v1 submitted 15 March, 2018; originally announced March 2018.

    Comments: Accepted to ICRA19. Corrections to table II, July 2019

  19. arXiv:1801.10492  [pdf, other]

    cs.SD cs.AI cs.HC cs.NE eess.AS

    Deep Predictive Models in Interactive Music

    Authors: Charles P. Martin, Kai Olav Ellefsen, Jim Torresen

    Abstract: Musical performance requires prediction to operate instruments, to perform in groups and to improvise. In this paper, we investigate how a number of digital musical instruments (DMIs), including two of our own, have applied predictive machine learning models that assist users by predicting unknown states of musical processes. We characterise these predictions as focussed within a musical instrumen…

    Submitted 19 December, 2018; v1 submitted 31 January, 2018; originally announced January 2018.

  20. arXiv:1711.10746  [pdf, other]

    cs.HC cs.NE cs.SD eess.AS

    RoboJam: A Musical Mixture Density Network for Collaborative Touchscreen Interaction

    Authors: Charles P. Martin, Jim Torresen

    Abstract: RoboJam is a machine-learning system for generating music that assists users of a touchscreen music app by performing responses to their short improvisations. This system uses a recurrent artificial neural network to generate sequences of touchscreen interactions and absolute timings, rather than high-level musical notes. To accomplish this, RoboJam's network uses a mixture density layer to predic…

    Submitted 29 November, 2017; originally announced November 2017.

    Journal ref: Computational Intelligence in Music, Sound, Art and Design. EvoMUSART 2018. Lecture Notes in Computer Science, vol 10783