
Towards noise contrastive estimation with soft targets for conditional models

Authors: Johannes Hugger, Virginie Uhlmann

Abstract: Soft targets combined with the cross-entropy loss have been shown to improve the generalization performance of deep neural networks on supervised classification tasks. The standard cross-entropy loss, however, assumes data to be categorically distributed, which may often not be the case in practice. In contrast, InfoNCE does not rely on such an explicit assumption but instead implicitly estimates the true conditional through negative sampling. Unfortunately, it cannot be combined with soft targets in its standard formulation, hindering its use in combination with sophisticated training strategies. In this paper, we address this limitation by proposing a loss function that is compatible with probabilistic targets. Our new soft target InfoNCE loss is conceptually simple, efficient to compute, and can be motivated through the framework of noise contrastive estimation. Using a toy example, we demonstrate shortcomings of the categorical distribution assumption of cross-entropy, and discuss implications of sampling from soft distributions. We observe that soft target InfoNCE performs on par with strong soft target cross-entropy baselines and outperforms hard target NLL and InfoNCE losses on popular benchmarks, including ImageNet. Finally, we provide a simple implementation of our loss, geared towards supervised classification and fully compatible with deep classification models trained with cross-entropy.

Submitted 15 July, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

arXiv:2102.01389 [pdf, other]

AURA-net: robust segmentation of phase-contrast microscopy images with few annotations

Authors: Ethan Cohen, Virginie Uhlmann

Abstract: We present AURA-net, a convolutional neural network (CNN) for the segmentation of phase-contrast microscopy images. AURA-net uses transfer learning to accelerate training and attention mechanisms to help the network focus on relevant image features. In this way, it can be trained efficiently with a very limited amount of annotations. Our network can thus be used to automate the segmentation of datasets that are generally considered too small for deep learning techniques. AURA-net also uses a loss inspired by active contours that is well-adapted to the specificity of phase-contrast images, further improving performance. We show that AURA-net outperforms state-of-the-art alternatives on several small (fewer than 100 images) datasets.

Submitted 2 February, 2021; originally announced February 2021.

Comments: Accepted at ISBI 2021

arXiv:1803.05748 [pdf, other]

doi 10.1007/978-3-319-66709-6_21

Diverse M-Best Solutions by Dynamic Programming

Authors: Carsten Haubold, Virginie Uhlmann, Michael Unser, Fred A. Hamprecht

Abstract: Many computer vision pipelines involve dynamic programming primitives such as finding a shortest path or the minimum energy solution in a tree-shaped probabilistic graphical model. In such cases, extracting not merely the best, but the set of M-best solutions is useful to generate a rich collection of candidate proposals that can be used in downstream processing. In this work, we show how M-best solutions of tree-shaped graphical models can be obtained by dynamic programming on a special graph with M layers. The proposed multi-layer concept is optimal for searching M-best solutions, and so flexible that it can also approximate M-best diverse solutions. We illustrate its usefulness with applications to object detection, panorama stitching and centerline extraction. Note: We have observed that an assumption in section 4 of our paper is not always fulfilled; see the attached corrigendum for details.

Submitted 15 March, 2018; originally announced March 2018.

Comments: Includes supplementary and corrigendum

Journal ref: Haubold, C., Uhlmann, V., Unser, M., Hamprecht, F. A.: Diverse M-best solutions by dynamic programming. In: Roth, V., Vetter, T. (eds.) Pattern Recognition. GCPR 2017. vol 10496, pp. 255-267. Springer, Cham (2017)

arXiv:1702.05003 [pdf, other]

Gaussian and Sparse Processes Are Limits of Generalized Poisson Processes

Authors: Julien Fageot, Virginie Uhlmann, Michael Unser

Abstract: The theory of sparse stochastic processes offers a broad class of statistical models to study signals. In this framework, signals are represented as realizations of random processes that are solutions of linear stochastic differential equations driven by white Lévy noises. Among these processes, generalized Poisson processes based on compound-Poisson noises admit an interpretation as random L-splines with random knots and weights. We demonstrate that every generalized Lévy process, from Gaussian to sparse, can be understood as the limit in law of a sequence of generalized Poisson processes. This enables a new conceptual understanding of sparse processes and suggests simple algorithms for the numerical generation of such objects.

Submitted 16 February, 2017; originally announced February 2017.

Comments: 16 pages, 11 figures

Showing 1–4 of 4 results for author: Uhlmann, V