Skip to main content

Showing 1–14 of 14 results for author: Ganin, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2211.15089  [pdf, other

    cs.CL cs.LG

    Continuous diffusion for categorical data

    Authors: Sander Dieleman, Laurent Sartran, Arman Roshannai, Nikolay Savinov, Yaroslav Ganin, Pierre H. Richemond, Arnaud Doucet, Robin Strudel, Chris Dyer, Conor Durkan, Curtis Hawthorne, Rémi Leblond, Will Grathwohl, Jonas Adler

    Abstract: Diffusion models have quickly become the go-to paradigm for generative modelling of perceptual signals (such as images and sound) through iterative refinement. Their success hinges on the fact that the underlying physical phenomena are continuous. For inherently discrete and categorical data such as language, various diffusion-inspired alternatives have been proposed. However, the continuous natur… ▽ More

    Submitted 15 December, 2022; v1 submitted 28 November, 2022; originally announced November 2022.

    Comments: 26 pages, 8 figures; corrections and additional information about hyperparameters

  2. arXiv:2211.04236  [pdf, other

    cs.CL cs.LG

    Self-conditioned Embedding Diffusion for Text Generation

    Authors: Robin Strudel, Corentin Tallec, Florent Altché, Yilun Du, Yaroslav Ganin, Arthur Mensch, Will Grathwohl, Nikolay Savinov, Sander Dieleman, Laurent Sifre, Rémi Leblond

    Abstract: Can continuous diffusion models bring the same performance breakthrough on natural language they did for image generation? To circumvent the discrete nature of text data, we can simply project tokens in a continuous space of embeddings, as is standard in language modeling. We propose Self-conditioned Embedding Diffusion, a continuous diffusion mechanism that operates on token embeddings and allows… ▽ More

    Submitted 8 November, 2022; originally announced November 2022.

    Comments: 15 pages

  3. arXiv:2209.11142  [pdf, other

    cs.LG cs.AI stat.ML

    A Generalist Neural Algorithmic Learner

    Authors: Borja Ibarz, Vitaly Kurin, George Papamakarios, Kyriacos Nikiforou, Mehdi Bennani, Róbert Csordás, Andrew Dudzik, Matko Bošnjak, Alex Vitvitskyi, Yulia Rubanova, Andreea Deac, Beatrice Bevilacqua, Yaroslav Ganin, Charles Blundell, Petar Veličković

    Abstract: The cornerstone of neural algorithmic reasoning is the ability to solve algorithmic tasks, especially in a way that generalises out of distribution. While recent years have seen a surge in methodological improvements in this area, they mostly focused on building specialist models. Specialist models are capable of learning to neurally execute either only one algorithm or a collection of algorithms… ▽ More

    Submitted 3 December, 2022; v1 submitted 22 September, 2022; originally announced September 2022.

    Comments: To appear at LoG 2022 (Spotlight talk). 23 pages, 11 figures

  4. arXiv:2105.02769  [pdf, other

    cs.CV cs.LG

    Computer-Aided Design as Language

    Authors: Yaroslav Ganin, Sergey Bartunov, Yujia Li, Ethan Keller, Stefano Saliceti

    Abstract: Computer-Aided Design (CAD) applications are used in manufacturing to model everything from coffee mugs to sports cars. These programs are complex and require years of training and experience to master. A component of all CAD models particularly difficult to make are the highly structured 2D sketches that lie at the heart of every 3D construction. In this work, we propose a machine learning model… ▽ More

    Submitted 6 May, 2021; originally announced May 2021.

    Comments: 24 pages, 11 figures, 3 tables

  5. arXiv:2002.10880  [pdf, other

    cs.GR cs.CV cs.LG stat.ML

    PolyGen: An Autoregressive Generative Model of 3D Meshes

    Authors: Charlie Nash, Yaroslav Ganin, S. M. Ali Eslami, Peter W. Battaglia

    Abstract: Polygon meshes are an efficient representation of 3D geometry, and are of central importance in computer graphics, robotics and games development. Existing learning-based approaches have avoided the challenges of working with 3D meshes, instead using alternative object representations that are more compatible with neural architectures and training approaches. We present an approach which models th… ▽ More

    Submitted 23 February, 2020; originally announced February 2020.

  6. arXiv:1910.01007  [pdf, other

    cs.CV cs.LG stat.ML

    Unsupervised Doodling and Painting with Improved SPIRAL

    Authors: John F. J. Mellor, Eunbyung Park, Yaroslav Ganin, Igor Babuschkin, Tejas Kulkarni, Dan Rosenbaum, Andy Ballard, Theophane Weber, Oriol Vinyals, S. M. Ali Eslami

    Abstract: We investigate using reinforcement learning agents as generative models of images (extending arXiv:1804.01118). A generative agent controls a simulated painting environment, and is trained with rewards provided by a discriminator network simultaneously trained to assess the realism of the agent's samples, either unconditional or reconstructions. Compared to prior work, we make a number of improvem… ▽ More

    Submitted 2 October, 2019; originally announced October 2019.

    Comments: See https://learning-to-paint.github.io for an interactive version of this paper, with videos

    ACM Class: I.2; I.4

  7. arXiv:1804.01118  [pdf, other

    cs.CV cs.LG stat.ML

    Synthesizing Programs for Images using Reinforced Adversarial Learning

    Authors: Yaroslav Ganin, Tejas Kulkarni, Igor Babuschkin, S. M. Ali Eslami, Oriol Vinyals

    Abstract: Advances in deep generative networks have led to impressive results in recent years. Nevertheless, such models can often waste their capacity on the minutiae of datasets, presumably due to weak inductive biases in their decoders. This is where graphics engines may come in handy since they abstract away low-level details and represent images as high-level programs. Current methods that combine deep… ▽ More

    Submitted 3 April, 2018; originally announced April 2018.

    Comments: 12 pages, 13 figures

  8. arXiv:1712.04120  [pdf, other

    stat.ML cs.LG

    GibbsNet: Iterative Adversarial Inference for Deep Graphical Models

    Authors: Alex Lamb, Devon Hjelm, Yaroslav Ganin, Joseph Paul Cohen, Aaron Courville, Yoshua Bengio

    Abstract: Directed latent variable models that formulate the joint distribution as $p(x,z) = p(z) p(x \mid z)$ have the advantage of fast and exact sampling. However, these models have the weakness of needing to specify $p(z)$, often with a simple fixed prior that limits the expressiveness of the model. Undirected latent variable models discard the requirement that $p(z)$ be specified with a prior, yet samp… ▽ More

    Submitted 11 December, 2017; originally announced December 2017.

    Comments: NIPS 2017

  9. arXiv:1607.07215  [pdf, other

    cs.CV

    DeepWarp: Photorealistic Image Resynthesis for Gaze Manipulation

    Authors: Yaroslav Ganin, Daniil Kononenko, Diana Sungatullina, Victor Lempitsky

    Abstract: In this work, we consider the task of generating highly-realistic images of a given face with a redirected gaze. We treat this problem as a specific instance of conditional image generation and suggest a new deep architecture that can handle this task very well as revealed by numerical comparison with prior art and a user study. Our deep architecture performs coarse-to-fine warping with an additio… ▽ More

    Submitted 26 July, 2016; v1 submitted 25 July, 2016; originally announced July 2016.

    Comments: Fixed typos, 14 + 2 + 2 pages, ECCV 2016

  10. arXiv:1512.05300  [pdf, other

    cs.CV

    Multiregion Bilinear Convolutional Neural Networks for Person Re-Identification

    Authors: Evgeniya Ustinova, Yaroslav Ganin, Victor Lempitsky

    Abstract: In this work we propose a new architecture for person re-identification. As the task of re-identification is inherently associated with embedding learning and non-rigid appearance description, our architecture is based on the deep bilinear convolutional network (Bilinear-CNN) that has been proposed recently for fine-grained classification of highly non-rigid objects. While the last stages of the o… ▽ More

    Submitted 6 September, 2017; v1 submitted 16 December, 2015; originally announced December 2015.

    Comments: in AVSS 2017

  11. arXiv:1505.07818  [pdf, other

    stat.ML cs.LG cs.NE

    Domain-Adversarial Training of Neural Networks

    Authors: Yaroslav Ganin, Evgeniya Ustinova, Hana Ajakan, Pascal Germain, Hugo Larochelle, François Laviolette, Mario Marchand, Victor Lempitsky

    Abstract: We introduce a new representation learning approach for domain adaptation, in which data at training and test time come from similar but different distributions. Our approach is directly inspired by the theory on domain adaptation suggesting that, for effective domain transfer to be achieved, predictions must be made based on features that cannot discriminate between the training (source) and test… ▽ More

    Submitted 26 May, 2016; v1 submitted 28 May, 2015; originally announced May 2015.

    Comments: Published in JMLR: http://jmlr.org/papers/v17/15-239.html

    Journal ref: Journal of Machine Learning Research 2016, vol. 17, p. 1-35

  12. arXiv:1412.6553  [pdf, other

    cs.CV cs.LG

    Speeding-up Convolutional Neural Networks Using Fine-tuned CP-Decomposition

    Authors: Vadim Lebedev, Yaroslav Ganin, Maksim Rakhuba, Ivan Oseledets, Victor Lempitsky

    Abstract: We propose a simple two-step approach for speeding up convolution layers within large convolutional neural networks based on tensor decomposition and discriminative fine-tuning. Given a layer, we use non-linear least squares to compute a low-rank CP-decomposition of the 4D convolution kernel tensor into a sum of a small number of rank-one tensors. At the second step, this decomposition is used to… ▽ More

    Submitted 24 April, 2015; v1 submitted 19 December, 2014; originally announced December 2014.

  13. arXiv:1409.7495  [pdf, other

    stat.ML cs.LG cs.NE

    Unsupervised Domain Adaptation by Backpropagation

    Authors: Yaroslav Ganin, Victor Lempitsky

    Abstract: Top-performing deep architectures are trained on massive amounts of labeled data. In the absence of labeled data for a certain task, domain adaptation often provides an attractive option given that labeled data of similar nature but from a different domain (e.g. synthetic images) are available. Here, we propose a new approach to domain adaptation in deep architectures that can be trained on large… ▽ More

    Submitted 27 February, 2015; v1 submitted 26 September, 2014; originally announced September 2014.

  14. arXiv:1406.6558  [pdf, other

    cs.CV

    $ N^4 $-Fields: Neural Network Nearest Neighbor Fields for Image Transforms

    Authors: Yaroslav Ganin, Victor Lempitsky

    Abstract: We propose a new architecture for difficult image processing operations, such as natural edge detection or thin object segmentation. The architecture is based on a simple combination of convolutional neural networks with the nearest neighbor search. We focus our attention on the situations when the desired image transformation is too hard for a neural network to learn explicitly. We show that in… ▽ More

    Submitted 3 July, 2014; v1 submitted 25 June, 2014; originally announced June 2014.