Zum Hauptinhalt springen

Showing 1–18 of 18 results for author: Jones, O P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.04328  [pdf, other

    cs.LG

    The Brain's Bitter Lesson: Scaling Speech Decoding With Self-Supervised Learning

    Authors: Dulhan Jayalath, Gilad Landau, Brendan Shillingford, Mark Woolrich, Oiwi Parker Jones

    Abstract: The past few years have produced a series of spectacular advances in the decoding of speech from brain activity. The engine of these advances has been the acquisition of labelled data, with increasingly large datasets acquired from single subjects. However, participants exhibit anatomical and other individual differences, and datasets use varied scanners and task designs. As a result, prior work h… ▽ More

    Submitted 2 July, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: 10 pages, 4 figures, under review

  2. arXiv:2404.09256  [pdf, other

    cs.LG eess.SP

    Foundational GPT Model for MEG

    Authors: Richard Csaky, Mats W. J. van Es, Oiwi Parker Jones, Mark Woolrich

    Abstract: Deep learning techniques can be used to first training unsupervised models on large amounts of unlabelled data, before fine-tuning the models on specific tasks. This approach has seen massive success for various kinds of data, e.g. images, language, audio, and holds the promise of improving performance in various downstream tasks (e.g. encoding or decoding brain data). However, there has been limi… ▽ More

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: Code available on GitHub (https://github.com/ricsinaruto/MEG-transfer-decoding). Part of PhD thesis (https://ricsinaruto.github.io/docs/thesis_final_appendix.pdf)

  3. arXiv:2404.03073  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Mai Ho'omāuna i ka 'Ai: Language Models Improve Automatic Speech Recognition in Hawaiian

    Authors: Kaavya Chaparala, Guido Zarrella, Bruce Torres Fischer, Larry Kimura, Oiwi Parker Jones

    Abstract: In this paper we address the challenge of improving Automatic Speech Recognition (ASR) for a low-resource language, Hawaiian, by incorporating large amounts of independent text data into an ASR foundation model, Whisper. To do this, we train an external language model (LM) on ~1.5M words of Hawaiian text. We then use the LM to rescore Whisper and compute word error rates (WERs) on a manually curat… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

  4. arXiv:2402.16308  [pdf, other

    cs.RO

    DreamUp3D: Object-Centric Generative Models for Single-View 3D Scene Understanding and Real-to-Sim Transfer

    Authors: Yizhe Wu, Haitz Sáez de Ocáriz Borde, Jack Collins, Oiwi Parker Jones, Ingmar Posner

    Abstract: 3D scene understanding for robotic applications exhibits a unique set of requirements including real-time inference, object-centric latent representation learning, accurate 6D pose estimation and 3D reconstruction of objects. Current methods for scene understanding typically rely on a combination of trained models paired with either an explicit or learnt volumetric representation, all of which hav… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

  5. Reaching Through Latent Space: From Joint Statistics to Path Planning in Manipulation

    Authors: Chia-Man Hung, Shaohong Zhong, Walter Goodwin, Oiwi Parker Jones, Martin Engelcke, Ioannis Havoutis, Ingmar Posner

    Abstract: We present a novel approach to path planning for robotic manipulators, in which paths are produced via iterative optimisation in the latent space of a generative model of robot poses. Constraints are incorporated through the use of constraint satisfaction classifiers operating on the same space. Optimisation leverages gradients through our learned models that provide a simple way to combine goal r… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: 10 pages, 6 figures, 4 tables

    ACM Class: I.2.6; I.2.9; I.2.10

    Journal ref: IEEE Robotics and Automation Letters 7.2 (2022): 5334-5341

  6. arXiv:2208.10248  [pdf, other

    cs.CL cs.LG

    Composing RNNs and FSTs for Small Data: Recovering Missing Characters in Old Hawaiian Text

    Authors: Oiwi Parker Jones, Brendan Shillingford

    Abstract: In contrast to the older writing system of the 19th century, modern Hawaiian orthography employs characters for long vowels and glottal stops. These extra characters account for about one-third of the phonemes in Hawaiian, so including them makes a big difference to reading comprehension and pronunciation. However, transliterating between older and newer texts is a laborious task when performed ma… ▽ More

    Submitted 23 July, 2022; originally announced August 2022.

    Comments: This paper originally appeared in a NeurIPS Workshop in 2018: IRASL - Interpretability and Robustness in Audio, Speech, and Language. It builds on a shorter paper that appeared in the Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP). See acknowledgements for details

  7. arXiv:2206.03591  [pdf, other

    cs.CV cs.AI

    ObPose: Leveraging Pose for Object-Centric Scene Inference and Generation in 3D

    Authors: Yizhe Wu, Oiwi Parker Jones, Ingmar Posner

    Abstract: We present ObPose, an unsupervised object-centric inference and generation model which learns 3D-structured latent representations from RGB-D scenes. Inspired by prior art in 2D representation learning, ObPose considers a factorised latent space, separately encoding object location (where) and appearance (what). ObPose further leverages an object's pose (i.e. location and orientation), defined via… ▽ More

    Submitted 9 June, 2023; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: 14 pages, 4 figures

    MSC Class: 68T07

  8. arXiv:2205.14102  [pdf

    cs.LG eess.SP q-bio.NC

    Group-level Brain Decoding with Deep Learning

    Authors: Richard Csaky, Mats Van Es, Oiwi Parker Jones, Mark Woolrich

    Abstract: Decoding brain imaging data are gaining popularity, with applications in brain-computer interfaces and the study of neural representations. Decoding is typicallysubject-specific and does not generalise well over subjects, due to high amounts ofbetween subject variability. Techniques that overcome this will not only providericher neuroscientific insights but also make it possible for group-level mo… ▽ More

    Submitted 19 January, 2024; v1 submitted 27 May, 2022; originally announced May 2022.

    Comments: Published in Human Brain Mapping

  9. arXiv:2205.01179  [pdf, other

    cs.RO cs.LG

    VAE-Loco: Versatile Quadruped Locomotion by Learning a Disentangled Gait Representation

    Authors: Alexander L. Mitchell, Wolfgang Merkt, Mathieu Geisert, Siddhant Gangapurwala, Martin Engelcke, Oiwi Parker Jones, Ioannis Havoutis, Ingmar Posner

    Abstract: Quadruped locomotion is rapidly maturing to a degree where robots are able to realise highly dynamic manoeuvres. However, current planners are unable to vary key gait parameters of the in-swing feet midair. In this work we address this limitation and show that it is pivotal in increasing controller robustness by learning a latent space capturing the key stance phases constituting a particular gait… ▽ More

    Submitted 12 July, 2023; v1 submitted 2 May, 2022; originally announced May 2022.

    Comments: 16 pages, 13 figures, 1 table, accepted by IEEE Transactions on Robotics (T-RO) as an extended paper. arXiv admin note: substantial text overlap with arXiv:2112.04809

  10. arXiv:2112.04809  [pdf, other

    cs.RO cs.LG

    Next Steps: Learning a Disentangled Gait Representation for Versatile Quadruped Locomotion

    Authors: Alexander L. Mitchell, Wolfgang Merkt, Mathieu Geisert, Siddhant Gangapurwala, Martin Engelcke, Oiwi Parker Jones, Ioannis Havoutis, Ingmar Posner

    Abstract: Quadruped locomotion is rapidly maturing to a degree where robots now routinely traverse a variety of unstructured terrains. However, while gaits can be varied typically by selecting from a range of pre-computed styles, current planners are unable to vary key gait parameters continuously while the robot is in motion. The synthesis, on-the-fly, of gaits with unexpected operational characteristics o… ▽ More

    Submitted 29 March, 2022; v1 submitted 9 December, 2021; originally announced December 2021.

    Comments: 7 pages, 4 figures, accepted at IEEE International Conference on Robotics and Automation (ICRA), 2022

  11. arXiv:2105.14895  [pdf, other

    cs.RO

    APEX: Unsupervised, Object-Centric Scene Segmentation and Tracking for Robot Manipulation

    Authors: Yizhe Wu, Oiwi Parker Jones, Martin Engelcke, Ingmar Posner

    Abstract: Recent advances in unsupervised learning for object detection, segmentation, and tracking hold significant promise for applications in robotics. A common approach is to frame these tasks as inference in probabilistic latent-variable models. In this paper, however, we show that the current state-of-the-art struggles with visually complex scenes such as typically encountered in robot manipulation ta… ▽ More

    Submitted 12 September, 2021; v1 submitted 31 May, 2021; originally announced May 2021.

    Comments: 8 pages, 5 figures

    MSC Class: I.2.9

  12. arXiv:2104.09958  [pdf, other

    cs.CV cs.LG stat.ML

    GENESIS-V2: Inferring Unordered Object Representations without Iterative Refinement

    Authors: Martin Engelcke, Oiwi Parker Jones, Ingmar Posner

    Abstract: Advances in unsupervised learning of object-representations have culminated in the development of a broad range of methods for unsupervised object segmentation and interpretable object-centric scene generation. These methods, however, are limited to simulated and real-world datasets with limited visual complexity. Moreover, object representations are often inferred using RNNs which do not scale we… ▽ More

    Submitted 25 January, 2022; v1 submitted 20 April, 2021; originally announced April 2021.

    Comments: NeurIPS 2021 camera-ready version; 26 pages, 19 figures

  13. arXiv:2011.14389  [pdf, other

    cs.RO cs.CV cs.LG eess.SP

    There and Back Again: Learning to Simulate Radar Data for Real-World Applications

    Authors: Rob Weston, Oiwi Parker Jones, Ingmar Posner

    Abstract: Simulating realistic radar data has the potential to significantly accelerate the development of data-driven approaches to radar processing. However, it is fraught with difficulty due to the notoriously complex image formation process. Here we propose to learn a radar sensor model capable of synthesising faithful radar observations based on simulated elevation maps. In particular, we adopt an adve… ▽ More

    Submitted 29 November, 2020; originally announced November 2020.

    Comments: 6 pages + 2 references

  14. arXiv:2007.06245  [pdf, other

    cs.LG stat.ML

    Reconstruction Bottlenecks in Object-Centric Generative Models

    Authors: Martin Engelcke, Oiwi Parker Jones, Ingmar Posner

    Abstract: A range of methods with suitable inductive biases exist to learn interpretable object-centric representations of images without supervision. However, these are largely restricted to visually simple images; robust object discovery in real-world sensory datasets remains elusive. To increase the understanding of such inductive biases, we empirically investigate the role of "reconstruction bottlenecks… ▽ More

    Submitted 24 November, 2020; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: 10 pages, 7 Figures, Workshop on Object-Oriented Learning at ICML 2020

  15. arXiv:2007.01520  [pdf, other

    cs.RO cs.LG

    First Steps: Latent-Space Control with Semantic Constraints for Quadruped Locomotion

    Authors: Alexander L. Mitchell, Martin Engelcke, Oiwi Parker Jones, David Surovik, Siddhant Gangapurwala, Oliwier Melon, Ioannis Havoutis, Ingmar Posner

    Abstract: Traditional approaches to quadruped control frequently employ simplified, hand-derived models. This significantly reduces the capability of the robot since its effective kinematic range is curtailed. In addition, kinodynamic constraints are often non-differentiable and difficult to implement in an optimisation approach. In this work, these challenges are addressed by framing quadruped control as o… ▽ More

    Submitted 20 November, 2020; v1 submitted 3 July, 2020; originally announced July 2020.

    Comments: 8 pages, 7 figures, accepted at IROS 2020

  16. arXiv:1909.13561  [pdf, other

    cs.LG cs.CV cs.RO stat.ML

    Imagine That! Leveraging Emergent Affordances for 3D Tool Synthesis

    Authors: Yizhe Wu, Sudhanshu Kasewa, Oliver Groth, Sasha Salter, Li Sun, Oiwi Parker Jones, Ingmar Posner

    Abstract: In this paper we explore the richness of information captured by the latent space of a vision-based generative model. The model combines unsupervised generative learning with a task-based performance predictor to learn and to exploit task-relevant object affordances given visual observations from a reaching task, involving a scenario and a stick-like tool. While the learned embedding of the genera… ▽ More

    Submitted 7 October, 2020; v1 submitted 30 September, 2019; originally announced September 2019.

    Comments: 12 pages, 6 figures

    ACM Class: I.2.10; I.2.6

  17. arXiv:1907.13052  [pdf, other

    cs.LG cs.CV cs.NE cs.RO stat.ML

    GENESIS: Generative Scene Inference and Sampling with Object-Centric Latent Representations

    Authors: Martin Engelcke, Adam R. Kosiorek, Oiwi Parker Jones, Ingmar Posner

    Abstract: Generative latent-variable models are emerging as promising tools in robotics and reinforcement learning. Yet, even though tasks in these domains typically involve distinct objects, most state-of-the-art generative models do not explicitly capture the compositional nature of visual scenes. Two recent exceptions, MONet and IODINE, decompose scenes into objects in an unsupervised fashion. Their unde… ▽ More

    Submitted 23 November, 2020; v1 submitted 30 July, 2019; originally announced July 2019.

    Comments: Published at the International Conference on Learning Representations (ICLR) 2020

  18. arXiv:1907.12887  [pdf, other

    cs.CV cs.AI cs.LG cs.RO stat.ML

    End-to-end Recurrent Multi-Object Tracking and Trajectory Prediction with Relational Reasoning

    Authors: Fabian B. Fuchs, Adam R. Kosiorek, Li Sun, Oiwi Parker Jones, Ingmar Posner

    Abstract: The majority of contemporary object-tracking approaches do not model interactions between objects. This contrasts with the fact that objects' paths are not independent: a cyclist might abruptly deviate from a previously planned trajectory in order to avoid colliding with a car. Building upon HART, a neural class-agnostic single-object tracker, we introduce a multi-object tracking method MOHART cap… ▽ More

    Submitted 28 September, 2020; v1 submitted 12 July, 2019; originally announced July 2019.