
Showing 1–29 of 29 results for author: Theis, L

Searching in archive cs.
  1. arXiv:2407.12970  [pdf, other]

    cs.IT

    Gaussian Channel Simulation with Rotated Dithered Quantization

    Authors: Szymon Kobus, Lucas Theis, Deniz Gündüz

    Abstract: Channel simulation involves generating a sample $Y$ from the conditional distribution $P_{Y|X}$, where $X$ is a remote realization sampled from $P_X$. This paper introduces a novel approach to approximate Gaussian channel simulation using dithered quantization. Our method concurrently simulates $n$ channels, reducing the upper bound on the excess information by half compared to one-dimensional met…

    Submitted 17 July, 2024; originally announced July 2024.

  2. arXiv:2403.04493  [pdf, ps, other]

    cs.LG stat.ML

    What makes an image realistic?

    Authors: Lucas Theis

    Abstract: The last decade has seen tremendous progress in our ability to generate realistic-looking data, be it images, text, audio, or video. Here, we discuss the closely related problem of quantifying realism, that is, designing functions that can reliably tell realistic data from unrealistic data. This problem turns out to be significantly harder to solve and remains poorly understood, despite its preval…

    Submitted 21 May, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Journal ref: Proceedings of the 41st International Conference on Machine Learning, 2024

  3. arXiv:2312.02753  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    C3: High-performance and low-complexity neural compression from a single image or video

    Authors: Hyunjik Kim, Matthias Bauer, Lucas Theis, Jonathan Richard Schwarz, Emilien Dupont

    Abstract: Most neural compression models are trained on large datasets of images or videos in order to generalize to unseen data. Such generalization typically requires large and expressive architectures with a high decoding complexity. Here we introduce C3, a neural compression method with strong rate-distortion (RD) performance that instead overfits a small model to each image or video separately. The res…

    Submitted 5 December, 2023; originally announced December 2023.

  4. arXiv:2310.05986  [pdf, other]

    cs.CV

    The Unreasonable Effectiveness of Linear Prediction as a Perceptual Metric

    Authors: Daniel Severo, Lucas Theis, Johannes Ballé

    Abstract: We show how perceptual embeddings of the visual system can be constructed at inference-time with no training data or deep neural network features. Our perceptual embeddings are solutions to a weighted least squares (WLS) problem, defined at the pixel-level, and solved at inference-time, that can capture global and local image characteristics. The distance in embedding space is used to define a per…

    Submitted 6 October, 2023; originally announced October 2023.

  5. arXiv:2310.03629  [pdf, other]

    cs.IT cs.CV eess.IV

    Wasserstein Distortion: Unifying Fidelity and Realism

    Authors: Yang Qiu, Aaron B. Wagner, Johannes Ballé, Lucas Theis

    Abstract: We introduce a distortion measure for images, Wasserstein distortion, that simultaneously generalizes pixel-level fidelity on the one hand and realism or perceptual quality on the other. We show how Wasserstein distortion reduces to a pure fidelity constraint or a pure realism constraint under different parameter choices and discuss its metric properties. Pairs of images that are close under Wasse…

    Submitted 28 March, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

  6. arXiv:2305.18231  [pdf, other]

    eess.IV cs.CV cs.LG stat.ML

    High-Fidelity Image Compression with Score-based Generative Models

    Authors: Emiel Hoogeboom, Eirikur Agustsson, Fabian Mentzer, Luca Versari, George Toderici, Lucas Theis

    Abstract: Despite the tremendous success of diffusion generative models in text-to-image generation, replicating this success in the domain of image compression has proven difficult. In this paper, we demonstrate that diffusion can significantly improve perceptual quality at a given bit-rate, outperforming state-of-the-art approaches PO-ELIC and HiFiC as measured by FID score. This is achieved using a simpl…

    Submitted 7 March, 2024; v1 submitted 26 May, 2023; originally announced May 2023.

  7. Adaptive Greedy Rejection Sampling

    Authors: Gergely Flamich, Lucas Theis

    Abstract: We consider channel simulation protocols between two communicating parties, Alice and Bob. First, Alice receives a target distribution $Q$, unknown to Bob. Then, she employs a shared coding distribution $P$ to send the minimum amount of information to Bob so that he can simulate a single sample $X \sim Q$. For discrete distributions, Harsha et al. (2009) developed a well-known channel simulation p…

    Submitted 20 April, 2023; originally announced April 2023.

    Comments: Accepted to 2023 IEEE International Symposium on Information Theory (ISIT). 9 pages, 3 figures

    MSC Class: 94A40 (Primary) 68Q11; 68Q17 (Secondary) ACM Class: E.4; H.1.1

  8. arXiv:2206.08889  [pdf, other]

    stat.ML cs.IT cs.LG

    Lossy Compression with Gaussian Diffusion

    Authors: Lucas Theis, Tim Salimans, Matthew D. Hoffman, Fabian Mentzer

    Abstract: We consider a novel lossy compression approach based on unconditional diffusion generative models, which we call DiffC. Unlike modern compression schemes which rely on transform coding and quantization to restrict the transmitted information, DiffC relies on the efficient communication of pixels corrupted by Gaussian noise. We implement a proof of concept and find that it works surprisingly well d…

    Submitted 31 December, 2022; v1 submitted 17 June, 2022; originally announced June 2022.

  9. arXiv:2202.06533  [pdf, other]

    cs.LG cs.IT eess.IV

    An Introduction to Neural Data Compression

    Authors: Yibo Yang, Stephan Mandt, Lucas Theis

    Abstract: Neural compression is the application of neural networks and other machine learning methods to data compression. Recent advances in statistical machine learning have opened up new possibilities for data compression, allowing compression algorithms to be learned end-to-end from data using powerful generative models such as normalizing flows, variational autoencoders, diffusion probabilistic models,…

    Submitted 16 August, 2023; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: Published in Foundations and Trends in Computer Graphics and Vision: Vol. 15, No. 2, pp 113-200. https://www.nowpublishers.com/article/Details/CGV-107

  10. arXiv:2111.00092  [pdf, other]

    cs.CR cs.LG

    Optimal Compression of Locally Differentially Private Mechanisms

    Authors: Abhin Shah, Wei-Ning Chen, Johannes Ballé, Peter Kairouz, Lucas Theis

    Abstract: Compressing the output of ε-locally differentially private (LDP) randomizers naively leads to suboptimal utility. In this work, we demonstrate the benefits of using schemes that jointly compress and privatize the data using shared randomness. In particular, we investigate a family of schemes based on Minimal Random Coding (Havasi et al., 2019) and prove that they offer optimal privacy-accuracy-com…

    Submitted 26 February, 2022; v1 submitted 29 October, 2021; originally announced November 2021.

  11. arXiv:2110.12805  [pdf, other]

    cs.IT stat.ML

    Algorithms for the Communication of Samples

    Authors: Lucas Theis, Noureldin Yosri

    Abstract: The efficient communication of noisy data has applications in several areas of machine learning, such as neural compression or differential privacy, and is also known as reverse channel coding or the channel simulation problem. Here we propose two new coding schemes with practical advantages over existing approaches. First, we introduce ordered random coding (ORC) which uses a simple trick to redu…

    Submitted 25 May, 2022; v1 submitted 25 October, 2021; originally announced October 2021.

    Comments: Proceedings of the 39th International Conference on Machine Learning, 2022

  12. arXiv:2104.13662  [pdf, ps, other]

    cs.IT stat.ML

    A coding theorem for the rate-distortion-perception function

    Authors: Lucas Theis, Aaron B. Wagner

    Abstract: The rate-distortion-perception function (RDPF; Blau and Michaeli, 2019) has emerged as a useful tool for thinking about realism and distortion of reconstructions in lossy compression. Unlike the rate-distortion function, however, it is unknown whether encoders and decoders exist that achieve the rate suggested by the RDPF. Building on results by Li and El Gamal (2018), we show that the RDPF can in…

    Submitted 28 April, 2021; originally announced April 2021.

    Journal ref: ICLR 2021 Neural Compression Workshop

  13. arXiv:2102.09270  [pdf, ps, other]

    cs.IT stat.ML

    On the advantages of stochastic encoders

    Authors: Lucas Theis, Eirikur Agustsson

    Abstract: Stochastic encoders have been used in rate-distortion theory and neural compression because they can be easier to handle. However, in performance comparisons with deterministic encoders they often do worse, suggesting that noise in the encoding process may generally be a bad idea. It is poorly understood if and when stochastic encoders do better than deterministic encoders. In this paper we provid…

    Submitted 29 April, 2021; v1 submitted 18 February, 2021; originally announced February 2021.

    Journal ref: ICLR 2021 Neural Compression Workshop

  14. arXiv:2006.09952  [pdf, other]

    stat.ML cs.CV cs.IT cs.LG

    Universally Quantized Neural Compression

    Authors: Eirikur Agustsson, Lucas Theis

    Abstract: A popular approach to learning encoders for lossy compression is to use additive uniform noise during training as a differentiable approximation to test-time quantization. We demonstrate that a uniform noise channel can also be implemented at test time using universal quantization (Ziv, 1985). This allows us to eliminate the mismatch between training and test phases while maintaining a completely…

    Submitted 21 October, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: Authors contributed equally

  15. arXiv:1909.01436  [pdf, other]

    stat.ML cs.IR cs.LG

    Discriminative Topic Modeling with Logistic LDA

    Authors: Iryna Korshunova, Hanchen Xiong, Mateusz Fedoryszak, Lucas Theis

    Abstract: Despite many years of research into latent Dirichlet allocation (LDA), applying LDA to collections of non-categorical items is still challenging. Yet many problems with much richer data share a similar structure and could benefit from the vast literature on LDA. We propose logistic LDA, a novel discriminative variant of latent Dirichlet allocation which is easy to apply to arbitrary inputs. In par…

    Submitted 7 January, 2020; v1 submitted 3 September, 2019; originally announced September 2019.

    Journal ref: Advances in Neural Information Processing Systems 32, 2019

  16. arXiv:1907.06558  [pdf, other]

    stat.ML cs.LG

    Addressing Delayed Feedback for Continuous Training with Neural Networks in CTR prediction

    Authors: Sofia Ira Ktena, Alykhan Tejani, Lucas Theis, Pranay Kumar Myana, Deepak Dilipkumar, Ferenc Huszár, Steven Yoo, Wenzhe Shi

    Abstract: One of the challenges in display advertising is that the distribution of features and click through rate (CTR) can exhibit large shifts over time due to seasonality, changes to ad campaigns and other factors. The predominant strategy to keep up with these shifts is to train predictive models continuously, on fresh data, in order to prevent them from becoming stale. However, in many ad systems posi…

    Submitted 23 April, 2021; v1 submitted 15 July, 2019; originally announced July 2019.

    Comments: Accepted at RecSys '19

  17. arXiv:1904.01326  [pdf, other]

    cs.CV

    HoloGAN: Unsupervised learning of 3D representations from natural images

    Authors: Thu Nguyen-Phuoc, Chuan Li, Lucas Theis, Christian Richardt, Yong-Liang Yang

    Abstract: We propose a novel generative adversarial network (GAN) for the task of unsupervised learning of 3D representations from natural images. Most generative models rely on 2D kernels to generate images and make few assumptions about the 3D world. These models therefore tend to create blurry images or artefacts in tasks that require a strong 3D understanding, such as novel-view synthesis. HoloGAN inste…

    Submitted 1 October, 2019; v1 submitted 2 April, 2019; originally announced April 2019.

    Comments: International Conference on Computer Vision ICCV 2019. For project page, see https://www.monkeyoverflow.com/#/hologan-unsupervised-learning-of-3d-representations-from-natural-images/

  18. arXiv:1801.05787  [pdf, other]

    cs.CV stat.ML

    Faster gaze prediction with dense networks and Fisher pruning

    Authors: Lucas Theis, Iryna Korshunova, Alykhan Tejani, Ferenc Huszár

    Abstract: Predicting human fixations from images has recently seen large improvements by leveraging deep representations which were pretrained for object recognition. However, as we show in this paper, these networks are highly overparameterized for the task of fixation prediction. We first present a simple yet principled greedy pruning method which we call Fisher pruning. Through a combination of knowledge…

    Submitted 9 July, 2018; v1 submitted 17 January, 2018; originally announced January 2018.

  19. arXiv:1707.02937  [pdf]

    cs.CV

    Checkerboard artifact free sub-pixel convolution: A note on sub-pixel convolution, resize convolution and convolution resize

    Authors: Andrew Aitken, Christian Ledig, Lucas Theis, Jose Caballero, Zehan Wang, Wenzhe Shi

    Abstract: The most prominent problem associated with the deconvolution layer is the presence of checkerboard artifacts in output images and dense labels. To combat this problem, smoothness constraints, post processing and different architecture designs have been proposed. Odena et al. highlight three sources of checkerboard artifacts: deconvolution overlap, random initialization and loss functions. In this…

    Submitted 10 July, 2017; originally announced July 2017.

  20. arXiv:1703.00395  [pdf, other]

    stat.ML cs.CV

    Lossy Image Compression with Compressive Autoencoders

    Authors: Lucas Theis, Wenzhe Shi, Andrew Cunningham, Ferenc Huszár

    Abstract: We propose a new approach to the problem of optimizing autoencoders for lossy image compression. New media formats, changing hardware technology, as well as diverse requirements and content types create a need for compression algorithms which are more flexible than existing codecs. Autoencoders have the potential to address this need, but are difficult to optimize directly due to the inherent non-…

    Submitted 1 March, 2017; originally announced March 2017.

  21. arXiv:1611.09577  [pdf, other]

    cs.CV

    Fast Face-swap Using Convolutional Neural Networks

    Authors: Iryna Korshunova, Wenzhe Shi, Joni Dambre, Lucas Theis

    Abstract: We consider the problem of face swapping in images, where an input identity is transformed into a target identity while preserving pose, facial expression, and lighting. To perform this mapping, we use convolutional neural networks trained to capture the appearance of the target identity from an unstructured collection of his/her photographs. This approach is enabled by framing the face swapping pr…

    Submitted 27 July, 2017; v1 submitted 29 November, 2016; originally announced November 2016.

  22. arXiv:1610.04490  [pdf, other]

    cs.CV cs.LG stat.ML

    Amortised MAP Inference for Image Super-resolution

    Authors: Casper Kaae Sønderby, Jose Caballero, Lucas Theis, Wenzhe Shi, Ferenc Huszár

    Abstract: Image super-resolution (SR) is an underdetermined inverse problem, where a large number of plausible high-resolution images can explain the same downsampled image. Most current single image SR methods use empirical risk minimisation, often with a pixel-wise mean squared error (MSE) loss. However, the outputs from such methods tend to be blurry, over-smoothed and generally appear implausible. A mor…

    Submitted 21 February, 2017; v1 submitted 14 October, 2016; originally announced October 2016.

  23. arXiv:1609.07009  [pdf]

    cs.CV

    Is the deconvolution layer the same as a convolutional layer?

    Authors: Wenzhe Shi, Jose Caballero, Lucas Theis, Ferenc Huszár, Andrew Aitken, Christian Ledig, Zehan Wang

    Abstract: In this note, we want to focus on aspects related to two questions most people asked us at CVPR about the network we presented. Firstly, what is the relationship between our proposed layer and the deconvolution layer? And secondly, why are convolutions in low-resolution (LR) space a better choice? These are key questions we tried to answer in the paper, but we were not able to go into as much dept…

    Submitted 22 September, 2016; originally announced September 2016.

    Comments: This is a note to share some additional insights for our CVPR paper

  24. arXiv:1609.04802  [pdf, other]

    cs.CV stat.ML

    Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network

    Authors: Christian Ledig, Lucas Theis, Ferenc Huszár, Jose Caballero, Andrew Cunningham, Alejandro Acosta, Andrew Aitken, Alykhan Tejani, Johannes Totz, Zehan Wang, Wenzhe Shi

    Abstract: Despite the breakthroughs in accuracy and speed of single image super-resolution using faster and deeper convolutional neural networks, one central problem remains largely unsolved: how do we recover the finer texture details when we super-resolve at large upscaling factors? The behavior of optimization-based super-resolution methods is principally driven by the choice of the objective function. R…

    Submitted 25 May, 2017; v1 submitted 15 September, 2016; originally announced September 2016.

    Comments: 19 pages, 15 figures, 2 tables, accepted for oral presentation at CVPR, main paper + some supplementary material

  25. arXiv:1511.01844  [pdf, other]

    stat.ML cs.LG

    A note on the evaluation of generative models

    Authors: Lucas Theis, Aäron van den Oord, Matthias Bethge

    Abstract: Probabilistic generative models can be used for compression, denoising, inpainting, texture synthesis, semi-supervised learning, unsupervised feature learning, and other tasks. Given this wide range of applications, it is not surprising that a lot of heterogeneity exists in the way these models are formulated, trained, and evaluated. As a consequence, direct comparison between models is often diff…

    Submitted 24 April, 2016; v1 submitted 5 November, 2015; originally announced November 2015.

  26. arXiv:1506.03478  [pdf, other]

    stat.ML cs.CV cs.LG

    Generative Image Modeling Using Spatial LSTMs

    Authors: Lucas Theis, Matthias Bethge

    Abstract: Modeling the distribution of natural images is challenging, partly because of strong statistical dependencies which can extend over hundreds of pixels. Recurrent neural networks have been successful in capturing long-range dependencies in a number of problems but only recently have found their way into generative image models. We here introduce a recurrent image model based on multi-dimensional lo…

    Submitted 18 September, 2015; v1 submitted 10 June, 2015; originally announced June 2015.

  27. arXiv:1505.07672  [pdf, other]

    cs.CV

    A Generative Model of Natural Texture Surrogates

    Authors: Niklas Ludtke, Debapriya Das, Lucas Theis, Matthias Bethge

    Abstract: Natural images can be viewed as patchworks of different textures, where the local image statistics are roughly stationary within a small neighborhood but otherwise vary from region to region. In order to model this variability, we first applied the parametric texture algorithm of Portilla and Simoncelli to image patches of 64×64 pixels in a large database of natural images such that each image pa…

    Submitted 28 May, 2015; originally announced May 2015.

    Comments: 34 pages, 9 figures

  28. arXiv:1411.1045  [pdf, other]

    cs.CV q-bio.NC stat.AP

    Deep Gaze I: Boosting Saliency Prediction with Feature Maps Trained on ImageNet

    Authors: Matthias Kümmerer, Lucas Theis, Matthias Bethge

    Abstract: Recent results suggest that state-of-the-art saliency models perform far from optimal in predicting fixations. This lack of performance has been attributed to an inability to model the influence of high-level image features such as objects. Recent seminal advances in applying deep neural networks to tasks like object recognition suggest that they are able to capture this kind of structure. Howeve…

    Submitted 9 April, 2015; v1 submitted 4 November, 2014; originally announced November 2014.

  29. arXiv:1011.6086  [pdf, other]

    stat.ML cs.LG

    In All Likelihood, Deep Belief Is Not Enough

    Authors: Lucas Theis, Sebastian Gerwinn, Fabian Sinz, Matthias Bethge

    Abstract: Statistical models of natural stimuli provide an important tool for researchers in the fields of machine learning and computational neuroscience. A canonical way to quantitatively assess and compare the performance of statistical models is given by the likelihood. One class of statistical models which has recently gained increasing popularity and has been applied to a variety of complex data are d…

    Submitted 28 November, 2010; originally announced November 2010.

    Journal ref: Journal of Machine Learning Research 12, 3071-3096, 2011