Zum Hauptinhalt springen

Showing 1–14 of 14 results for author: Golkov, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.20917  [pdf, ps, other

    cs.LG cs.AI cs.CV stat.ML

    How to Choose a Reinforcement-Learning Algorithm

    Authors: Fabian Bongratz, Vladimir Golkov, Lukas Mautner, Luca Della Libera, Frederik Heetmeyer, Felix Czaja, Julian Rodemann, Daniel Cremers

    Abstract: The field of reinforcement learning offers a large variety of concepts and methods to tackle sequential decision-making problems. This variety has become so large that choosing an algorithm for a task at hand can be challenging. In this work, we streamline the process of choosing reinforcement-learning algorithms and action-distribution families. We provide a structured overview of existing method… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: 40 pages

    MSC Class: 62M45 ACM Class: I.2.8; I.2.6; I.5.1

  2. arXiv:2305.07524  [pdf

    physics.med-ph cs.AI

    Joint MR sequence optimization beats pure neural network approaches for spin-echo MRI super-resolution

    Authors: Hoai Nam Dang, Vladimir Golkov, Thomas Wimmer, Daniel Cremers, Andreas Maier, Moritz Zaiss

    Abstract: Current MRI super-resolution (SR) methods only use existing contrasts acquired from typical clinical sequences as input for the neural network (NN). In turbo spin echo sequences (TSE) the sequence parameters can have a strong influence on the actual resolution of the acquired image and have consequently a considera-ble impact on the performance of the NN. We propose a known-operator learning appro… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

    Comments: 13 pages, 4 figures, 3 tables, submitted to MICCAI 2023 for review

  3. arXiv:2304.05864  [pdf, other

    cs.CV cs.LG

    Scale-Equivariant Deep Learning for 3D Data

    Authors: Thomas Wimmer, Vladimir Golkov, Hoai Nam Dang, Moritz Zaiss, Andreas Maier, Daniel Cremers

    Abstract: The ability of convolutional neural networks (CNNs) to recognize objects regardless of their position in the image is due to the translation-equivariance of the convolutional operation. Group-equivariant CNNs transfer this equivariance to other transformations of the input. Dealing appropriately with objects and object parts of different scale is challenging, and scale can vary for multiple reason… ▽ More

    Submitted 12 April, 2023; originally announced April 2023.

    Comments: 12 pages, 4 figures

  4. arXiv:2109.11398  [pdf, other

    cs.CV

    Scene Graph Generation for Better Image Captioning?

    Authors: Maximilian Mozes, Martin Schmitt, Vladimir Golkov, Hinrich Schütze, Daniel Cremers

    Abstract: We investigate the incorporation of visual relationships into the task of supervised image caption generation by proposing a model that leverages detected objects and auto-generated visual relationships to describe images in natural language. To do so, we first generate a scene graph from raw image pixels by identifying individual objects and visual relationships between them. This scene graph the… ▽ More

    Submitted 23 September, 2021; originally announced September 2021.

    Comments: Technical report. This work was done and the paper was written in 2019

  5. arXiv:2102.06942  [pdf, other

    cs.CV cs.LG cs.NE

    Rotation-Equivariant Deep Learning for Diffusion MRI

    Authors: Philip Müller, Vladimir Golkov, Valentina Tomassini, Daniel Cremers

    Abstract: Convolutional networks are successful, but they have recently been outperformed by new neural networks that are equivariant under rotations and translations. These new networks work better because they do not struggle with learning each possible orientation of each image feature separately. So far, they have been proposed for 2D and 3D data. Here we generalize them to 6D diffusion MRI data, ensuri… ▽ More

    Submitted 13 February, 2021; originally announced February 2021.

    Comments: 24 pages, 8 figures

  6. arXiv:2010.15084  [pdf, other

    eess.AS cs.SD

    Speech Synthesis and Control Using Differentiable DSP

    Authors: Giorgio Fabbro, Vladimir Golkov, Thomas Kemp, Daniel Cremers

    Abstract: Modern text-to-speech systems are able to produce natural and high-quality speech, but speech contains factors of variation (e.g. pitch, rhythm, loudness, timbre)\ that text alone cannot contain. In this work we move towards a speech synthesis system that can produce diverse speech renditions of a text by allowing (but not requiring) explicit control over the various factors of variation. We propo… ▽ More

    Submitted 28 October, 2020; originally announced October 2020.

    Comments: 6 pages, 3 figures, for associated audio files, see https://thesmith1.github.io/DDSPeech/

  7. arXiv:2007.07029  [pdf, ps, other

    q-bio.BM cs.LG q-bio.QM stat.ML

    Deep Learning for Virtual Screening: Five Reasons to Use ROC Cost Functions

    Authors: Vladimir Golkov, Alexander Becker, Daniel T. Plop, Daniel Čuturilo, Neda Davoudi, Jeffrey Mendenhall, Rocco Moretti, Jens Meiler, Daniel Cremers

    Abstract: Computer-aided drug discovery is an essential component of modern drug development. Therein, deep learning has become an important tool for rapid screening of billions of molecules in silico for potential hits containing desired chemical features. Despite its importance, substantial challenges persist in training these models, such as severe class imbalance, high decision thresholds, and lack of g… ▽ More

    Submitted 25 June, 2020; originally announced July 2020.

    Comments: 10 pages

    MSC Class: 68T07 (Primary) 62H30; 92E99; 68T10; 62F07 (Secondary) ACM Class: G.3; I.2.1; I.2.6; I.5.1; J.3

  8. arXiv:1910.14594  [pdf, other

    cs.LG cs.CV cs.NE stat.ML

    Deep Learning for 2D and 3D Rotatable Data: An Overview of Methods

    Authors: Luca Della Libera, Vladimir Golkov, Yue Zhu, Arman Mielke, Daniel Cremers

    Abstract: Convolutional networks are successful due to their equivariance/invariance under translations. However, rotatable data such as images, volumes, shapes, or point clouds require processing with equivariance/invariance under rotations in cases where the rotational orientation of the coordinate system does not affect the meaning of the data (e.g. object classification). On the other hand, estimation/p… ▽ More

    Submitted 22 November, 2021; v1 submitted 31 October, 2019; originally announced October 2019.

    Comments: Improved Definition 1, improved and merged Sections 3.3-3.4, minor additional changes

    MSC Class: 62M45; 68T45; 62H35; 65D18; 68U10 ACM Class: I.2.6; I.5.1; G.3

  9. arXiv:1905.03389  [pdf, other

    cs.NE cs.AI cs.CV cs.LG stat.ML

    Learning to Evolve

    Authors: Jan Schuchardt, Vladimir Golkov, Daniel Cremers

    Abstract: Evolution and learning are two of the fundamental mechanisms by which life adapts in order to survive and to transcend limitations. These biological phenomena inspired successful computational methods such as evolutionary algorithms and deep learning. Evolution relies on random mutations and on random genetic recombination. Here we show that learning to evolve, i.e. learning to mutate and recombin… ▽ More

    Submitted 8 May, 2019; originally announced May 2019.

    MSC Class: 62M45; 68T05; 68W25; 68T20; 90C40; 91A22; 92D15; 92D25 ACM Class: G.1.6; I.2.6; I.2.8; G.3; I.5.1

  10. arXiv:1806.02997  [pdf, other

    stat.ML cs.AI cs.CV cs.LG cs.NE

    q-Space Novelty Detection with Variational Autoencoders

    Authors: Aleksei Vasilev, Vladimir Golkov, Marc Meissner, Ilona Lipp, Eleonora Sgarlata, Valentina Tomassini, Derek K. Jones, Daniel Cremers

    Abstract: In machine learning, novelty detection is the task of identifying novel unseen data. During training, only samples from the normal class are available. Test samples are classified as normal or abnormal by assignment of a novelty score. Here we propose novelty detection methods based on training variational autoencoders (VAEs) on normal data. Since abnormal samples are not used during training, we… ▽ More

    Submitted 25 October, 2018; v1 submitted 8 June, 2018; originally announced June 2018.

    Comments: 11 pages, 2 figures

    MSC Class: 62F15; 62G07; 62M45; 68T30 ACM Class: G.3; H.3.3; I.2.4; I.2.6; I.4.6; I.5; I.5.4; J.3

  11. arXiv:1801.07648  [pdf, other

    cs.LG cs.AI cs.CV cs.NE stat.ML

    Clustering with Deep Learning: Taxonomy and New Methods

    Authors: Elie Aljalbout, Vladimir Golkov, Yawar Siddiqui, Maximilian Strobel, Daniel Cremers

    Abstract: Clustering methods based on deep neural networks have proven promising for clustering real-world data because of their high representational power. In this paper, we propose a systematic taxonomy of clustering methods that utilize deep neural networks. We base our taxonomy on a comprehensive review of recent work and validate the taxonomy in a case study. In this case study, we show that the taxon… ▽ More

    Submitted 13 September, 2018; v1 submitted 23 January, 2018; originally announced January 2018.

    MSC Class: 62H30; 62M45; 91C20 ACM Class: H.3.3; I.2.6; I.5; I.5.3; I.5.4

  12. arXiv:1710.10686  [pdf, ps, other

    cs.LG cs.AI cs.CV cs.NE stat.ML

    Regularization for Deep Learning: A Taxonomy

    Authors: Jan Kukačka, Vladimir Golkov, Daniel Cremers

    Abstract: Regularization is one of the crucial ingredients of deep learning, yet the term regularization has various definitions, and regularization methods are often studied separately from each other. In our work we present a systematic, unifying taxonomy to categorize existing methods. We distinguish methods that affect data, network architectures, error terms, regularization terms, and optimization proc… ▽ More

    Submitted 29 October, 2017; originally announced October 2017.

    MSC Class: 62M45 ACM Class: I.2.6; I.5

  13. arXiv:1704.04039  [pdf, other

    q-bio.BM cs.LG q-bio.QM stat.ML

    3D Deep Learning for Biological Function Prediction from Physical Fields

    Authors: Vladimir Golkov, Marcin J. Skwark, Atanas Mirchev, Georgi Dikov, Alexander R. Geanes, Jeffrey Mendenhall, Jens Meiler, Daniel Cremers

    Abstract: Predicting the biological function of molecules, be it proteins or drug-like compounds, from their atomic structure is an important and long-standing problem. Function is dictated by structure, since it is by spatial interactions that molecules interact with each other, both in terms of steric complementarity, as well as intermolecular forces. Thus, the electron density field and electrostatic pot… ▽ More

    Submitted 13 April, 2017; originally announced April 2017.

    ACM Class: I.2.6; J.3

  14. arXiv:1504.06852  [pdf, other

    cs.CV cs.LG

    FlowNet: Learning Optical Flow with Convolutional Networks

    Authors: Philipp Fischer, Alexey Dosovitskiy, Eddy Ilg, Philip Häusser, Caner Hazırbaş, Vladimir Golkov, Patrick van der Smagt, Daniel Cremers, Thomas Brox

    Abstract: Convolutional neural networks (CNNs) have recently been very successful in a variety of computer vision tasks, especially on those linked to recognition. Optical flow estimation has not been among the tasks where CNNs were successful. In this paper we construct appropriate CNNs which are capable of solving the optical flow estimation problem as a supervised learning task. We propose and compare tw… ▽ More

    Submitted 4 May, 2015; v1 submitted 26 April, 2015; originally announced April 2015.

    Comments: Added supplementary material

    ACM Class: I.2.6; I.4.8