
Showing 1–4 of 4 results for author: Oostermeijer, K

  1. arXiv:2310.13032  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE

    Quality-Diversity through AI Feedback

    Authors: Herbie Bradley, Andrew Dai, Hannah Teufel, Jenny Zhang, Koen Oostermeijer, Marco Bellagente, Jeff Clune, Kenneth Stanley, Grégory Schott, Joel Lehman

    Abstract: In many text-generation problems, users may prefer not only a single response, but a diverse range of high-quality outputs from which to choose. Quality-diversity (QD) search algorithms aim at such outcomes, by continually improving and diversifying a population of candidates. However, the applicability of QD to qualitative domains, like creative writing, has been limited by the difficulty of algo…

    Submitted 7 December, 2023; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: minor additions to supplementary results

  2. arXiv:2305.15296  [pdf, other]

    cs.CV cs.AI cs.LG

    MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation

    Authors: Marco Bellagente, Manuel Brack, Hannah Teufel, Felix Friedrich, Björn Deiseroth, Constantin Eichenberg, Andrew Dai, Robert Baldock, Souradeep Nanda, Koen Oostermeijer, Andres Felipe Cruz-Salinas, Patrick Schramowski, Kristian Kersting, Samuel Weinbach

    Abstract: The recent popularity of text-to-image diffusion models (DM) can largely be attributed to the intuitive interface they provide to users. The intended generation can be expressed in natural language, with the model producing faithful interpretations of text prompts. However, expressing complex or nuanced ideas in text alone can be difficult. To ease image generation, we propose MultiFusion that all…

    Submitted 20 December, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: Proceedings of Advances in Neural Information Processing Systems: Annual Conference on Neural Information Processing Systems (NeurIPS)

  3. arXiv:2212.02936  [pdf, other]

    cs.CV

    M-VADER: A Model for Diffusion with Multimodal Context

    Authors: Samuel Weinbach, Marco Bellagente, Constantin Eichenberg, Andrew Dai, Robert Baldock, Souradeep Nanda, Björn Deiseroth, Koen Oostermeijer, Hannah Teufel, Andres Felipe Cruz-Salinas

    Abstract: We introduce M-VADER: a diffusion model (DM) for image generation where the output can be specified using arbitrary combinations of images and text. We show how M-VADER enables the generation of images specified using combinations of image and text, and combinations of multiple images. Previously, a number of successful DM image generation algorithms have been introduced that make it possible to s…

    Submitted 7 December, 2022; v1 submitted 6 December, 2022; originally announced December 2022.

    Comments: 22 pages, 14 figures, 2 tables, fixed figure 3

  4. arXiv:2011.04092  [pdf, other]

    cs.SD cs.NE eess.AS

    Frequency Gating: Improved Convolutional Neural Networks for Speech Enhancement in the Time-Frequency Domain

    Authors: Koen Oostermeijer, Qing Wang, Jun Du

    Abstract: One of the strengths of traditional convolutional neural networks (CNNs) is their inherent translational invariance. However, for the task of speech enhancement in the time-frequency domain, this property cannot be fully exploited due to a lack of invariance in the frequency direction. In this paper we propose to remedy this inefficiency by introducing a method, which we call Frequency Gating, to…

    Submitted 8 November, 2020; originally announced November 2020.