Zum Hauptinhalt springen

Showing 1–8 of 8 results for author: Sønderby, C K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2006.12459  [pdf, other

    cs.LG stat.ML

    IDF++: Analyzing and Improving Integer Discrete Flows for Lossless Compression

    Authors: Rianne van den Berg, Alexey A. Gritsenko, Mostafa Dehghani, Casper Kaae Sønderby, Tim Salimans

    Abstract: In this paper we analyse and improve integer discrete flows for lossless compression. Integer discrete flows are a recently proposed class of models that learn invertible transformations for integer-valued random variables. Their discrete nature makes them particularly suitable for lossless compression with entropy coding schemes. We start by investigating a recent theoretical claim that states th… ▽ More

    Submitted 23 March, 2021; v1 submitted 22 June, 2020; originally announced June 2020.

    Comments: Accepted as a conference paper at the Ninth International Conference on Learning Representations (ICLR) 2021

  2. arXiv:2003.12140  [pdf, other

    cs.LG physics.ao-ph stat.ML

    MetNet: A Neural Weather Model for Precipitation Forecasting

    Authors: Casper Kaae Sønderby, Lasse Espeholt, Jonathan Heek, Mostafa Dehghani, Avital Oliver, Tim Salimans, Shreya Agrawal, Jason Hickey, Nal Kalchbrenner

    Abstract: Weather forecasting is a long standing scientific challenge with direct social and economic impact. The task is suitable for deep neural networks due to vast amounts of continuously collected data and a rich spatial and temporal structure that presents long range dependencies. We introduce MetNet, a neural network that forecasts precipitation up to 8 hours into the future at the high spatial resol… ▽ More

    Submitted 30 March, 2020; v1 submitted 24 March, 2020; originally announced March 2020.

  3. arXiv:1610.06550  [pdf, other

    cs.CL

    Neural Machine Translation with Characters and Hierarchical Encoding

    Authors: Alexander Rosenberg Johansen, Jonas Meinertz Hansen, Elias Khazen Obeid, Casper Kaae Sønderby, Ole Winther

    Abstract: Most existing Neural Machine Translation models use groups of characters or whole words as their unit of input and output. We propose a model with a hierarchical char2word encoder, that takes individual characters both as input and output. We first argue that this hierarchical representation of the character encoder reduces computational complexity, and show that it improves translation performanc… ▽ More

    Submitted 20 October, 2016; originally announced October 2016.

    Comments: 8 pages, 7 figures

  4. arXiv:1610.04490  [pdf, other

    cs.CV cs.LG stat.ML

    Amortised MAP Inference for Image Super-resolution

    Authors: Casper Kaae Sønderby, Jose Caballero, Lucas Theis, Wenzhe Shi, Ferenc Huszár

    Abstract: Image super-resolution (SR) is an underdetermined inverse problem, where a large number of plausible high-resolution images can explain the same downsampled image. Most current single image SR methods use empirical risk minimisation, often with a pixel-wise mean squared error (MSE) loss. However, the outputs from such methods tend to be blurry, over-smoothed and generally appear implausible. A mor… ▽ More

    Submitted 21 February, 2017; v1 submitted 14 October, 2016; originally announced October 2016.

  5. arXiv:1602.05473  [pdf, other

    stat.ML cs.AI cs.LG

    Auxiliary Deep Generative Models

    Authors: Lars Maaløe, Casper Kaae Sønderby, Søren Kaae Sønderby, Ole Winther

    Abstract: Deep generative models parameterized by neural networks have recently achieved state-of-the-art performance in unsupervised and semi-supervised learning. We extend deep generative models with auxiliary variables which improves the variational approximation. The auxiliary variables leave the generative model unchanged but make the variational distribution more expressive. Inspired by the structure… ▽ More

    Submitted 16 June, 2016; v1 submitted 17 February, 2016; originally announced February 2016.

    Comments: Proceedings of the 33rd International Conference on Machine Learning, New York, NY, USA, 2016, JMLR: Workshop and Conference Proceedings volume 48, Proceedings of the 33rd International Conference on Machine Learning, New York, NY, USA, 2016

  6. arXiv:1602.02282  [pdf, other

    stat.ML cs.LG

    Ladder Variational Autoencoders

    Authors: Casper Kaae Sønderby, Tapani Raiko, Lars Maaløe, Søren Kaae Sønderby, Ole Winther

    Abstract: Variational Autoencoders are powerful models for unsupervised learning. However deep models with several layers of dependent stochastic variables are difficult to train which limits the improvements obtained using these highly expressive models. We propose a new inference model, the Ladder Variational Autoencoder, that recursively corrects the generative distribution by a data dependent approximat… ▽ More

    Submitted 27 May, 2016; v1 submitted 6 February, 2016; originally announced February 2016.

  7. arXiv:1509.05329  [pdf, ps, other

    cs.CV

    Recurrent Spatial Transformer Networks

    Authors: Søren Kaae Sønderby, Casper Kaae Sønderby, Lars Maaløe, Ole Winther

    Abstract: We integrate the recently proposed spatial transformer network (SPN) [Jaderberg et. al 2015] into a recurrent neural network (RNN) to form an RNN-SPN model. We use the RNN-SPN to classify digits in cluttered MNIST sequences. The proposed model achieves a single digit error of 1.5% compared to 2.9% for a convolutional networks and 2.0% for convolutional networks with SPN layers. The SPN outputs a z… ▽ More

    Submitted 17 September, 2015; originally announced September 2015.

  8. Convolutional LSTM Networks for Subcellular Localization of Proteins

    Authors: Søren Kaae Sønderby, Casper Kaae Sønderby, Henrik Nielsen, Ole Winther

    Abstract: Machine learning is widely used to analyze biological sequence data. Non-sequential models such as SVMs or feed-forward neural networks are often used although they have no natural way of handling sequences of varying length. Recurrent neural networks such as the long short term memory (LSTM) model on the other hand are designed to handle sequences. In this study we demonstrate that LSTM networks… ▽ More

    Submitted 6 March, 2015; originally announced March 2015.

    Journal ref: Algorithms for Computational Biology 9199 (2015) 68