Showing 1–7 of 7 results for author: Minnen, D

Searching in archive eess.
  1. Advancing The Rate-Distortion-Computation Frontier For Neural Image Compression

    Authors: David Minnen, Nick Johnston

    Abstract: The rate-distortion performance of neural image compression models has exceeded the state-of-the-art for non-learned codecs, but neural codecs are still far from widespread deployment and adoption. The largest obstacle is having efficient models that are feasible on a wide variety of consumer hardware. Comparative research and evaluation is difficult due to the lack of standard benchmarking platfo…

    Submitted 26 September, 2023; originally announced November 2023.

    Comments: Published in 2023 IEEE International Conference on Image Processing (ICIP)

  2. arXiv:2212.13824

    cs.CV cs.LG eess.IV

    Multi-Realism Image Compression with a Conditional Generator

    Authors: Eirikur Agustsson, David Minnen, George Toderici, Fabian Mentzer

    Abstract: By optimizing the rate-distortion-realism trade-off, generative compression approaches produce detailed, realistic images, even at low bit rates, instead of the blurry reconstructions produced by rate-distortion optimized models. However, previous methods do not explicitly control how much detail is synthesized, which results in a common criticism of these methods: users might be worried that a mi…

    Submitted 30 March, 2023; v1 submitted 28 December, 2022; originally announced December 2022.

    Comments: CVPR'23 Camera Ready

  3. arXiv:2206.07307

    cs.CV cs.LG eess.IV

    VCT: A Video Compression Transformer

    Authors: Fabian Mentzer, George Toderici, David Minnen, Sung-Jin Hwang, Sergi Caelles, Mario Lucic, Eirikur Agustsson

    Abstract: We show how transformers can be used to vastly simplify neural video compression. Previous methods have been relying on an increasing number of architectural biases and priors, including motion prediction and warping operations, resulting in complex models. Instead, we independently map input frames to representations and use a transformer to model their dependencies, letting it predict the distri…

    Submitted 12 October, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: NeurIPS'22 Camera Ready Version. Code: https://goo.gle/vct-paper

  4. arXiv:2107.12038

    eess.IV cs.CV

    Neural Video Compression using GANs for Detail Synthesis and Propagation

    Authors: Fabian Mentzer, Eirikur Agustsson, Johannes Ballé, David Minnen, Nick Johnston, George Toderici

    Abstract: We present the first neural video compression method based on generative adversarial networks (GANs). Our approach significantly outperforms previous neural and non-neural video compression methods in a user study, setting a new state-of-the-art in visual quality for neural methods. We show that the GAN loss is crucial to obtain this high visual quality. Two components make the GAN loss effective:…

    Submitted 12 July, 2022; v1 submitted 26 July, 2021; originally announced July 2021.

    Comments: First two authors contributed equally. ECCV Camera ready version

  5. arXiv:2007.08739

    eess.IV cs.CV cs.IT cs.LG

    Channel-wise Autoregressive Entropy Models for Learned Image Compression

    Authors: David Minnen, Saurabh Singh

    Abstract: In learning-based approaches to image compression, codecs are developed by optimizing a computational model to minimize a rate-distortion objective. Currently, the most effective learned image codecs take the form of an entropy-constrained autoencoder with an entropy model that uses both forward and backward adaptation. Forward adaptation makes use of side information and can be efficiently integr…

    Submitted 16 July, 2020; originally announced July 2020.

    Comments: Published at the IEEE International Conference on Image Processing (ICIP) 2020

  6. arXiv:2007.03034

    cs.IT eess.IV

    Nonlinear Transform Coding

    Authors: Johannes Ballé, Philip A. Chou, David Minnen, Saurabh Singh, Nick Johnston, Eirikur Agustsson, Sung Jin Hwang, George Toderici

    Abstract: We review a class of methods that can be collected under the name nonlinear transform coding (NTC), which over the past few years have become competitive with the best linear transform codecs for images, and have superseded them in terms of rate-distortion performance under established perceptual quality metrics such as MS-SSIM. We assess the empirical rate-distortion performance of NTC with the…

    Submitted 23 October, 2020; v1 submitted 6 July, 2020; originally announced July 2020.

    Comments: 17 pages, 14 figures. Accepted for publication in IEEE Journal of Selected Topics in Signal Processing

  7. arXiv:1802.01436

    eess.IV cs.IT

    Variational image compression with a scale hyperprior

    Authors: Johannes Ballé, David Minnen, Saurabh Singh, Sung Jin Hwang, Nick Johnston

    Abstract: We describe an end-to-end trainable model for image compression based on variational autoencoders. The model incorporates a hyperprior to effectively capture spatial dependencies in the latent representation. This hyperprior relates to side information, a concept universal to virtually all modern image codecs, but largely unexplored in image compression using artificial neural networks (ANNs). Unl…

    Submitted 1 May, 2018; v1 submitted 31 January, 2018; originally announced February 2018.

    Comments: accepted as a conference contribution to International Conference on Learning Representations 2018