Zum Hauptinhalt springen

Showing 1–12 of 12 results for author: Pavllo, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2305.15805  [pdf, other

    cs.CL cs.LG

    Dynamic Context Pruning for Efficient and Interpretable Autoregressive Transformers

    Authors: Sotiris Anagnostidis, Dario Pavllo, Luca Biggio, Lorenzo Noci, Aurelien Lucchi, Thomas Hofmann

    Abstract: Autoregressive Transformers adopted in Large Language Models (LLMs) are hard to scale to long sequences. Despite several works trying to reduce their computational cost, most of LLMs still adopt attention layers between all pairs of tokens in the sequence, thus incurring a quadratic cost. In this study, we present a novel approach that dynamically prunes contextual information while preserving the… ▽ More

    Submitted 31 May, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

  2. arXiv:2211.11674  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    Shape, Pose, and Appearance from a Single Image via Bootstrapped Radiance Field Inversion

    Authors: Dario Pavllo, David Joseph Tan, Marie-Julie Rakotosaona, Federico Tombari

    Abstract: Neural Radiance Fields (NeRF) coupled with GANs represent a promising direction in the area of 3D reconstruction from a single view, owing to their ability to efficiently model arbitrary topologies. Recent work in this area, however, has mostly focused on synthetic datasets where exact ground-truth poses are known, and has overlooked pose estimation, which is important for certain downstream appli… ▽ More

    Submitted 20 March, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: CVPR 2023. Code and models are available at https://github.com/google-research/nerf-from-image

  3. arXiv:2108.03952  [pdf, other

    cs.LG cs.RO

    Safe Deep Reinforcement Learning for Multi-Agent Systems with Continuous Action Spaces

    Authors: Ziyad Sheebaelhamd, Konstantinos Zisis, Athina Nisioti, Dimitris Gkouletsos, Dario Pavllo, Jonas Kohler

    Abstract: Multi-agent control problems constitute an interesting area of application for deep reinforcement learning models with continuous action spaces. Such real-world applications, however, typically come with critical safety constraints that must not be violated. In order to ensure safety, we enhance the well-known multi-agent deep deterministic policy gradient (MADDPG) framework by adding a safety lay… ▽ More

    Submitted 11 August, 2021; v1 submitted 9 August, 2021; originally announced August 2021.

    Comments: ICML 2021 Workshop on Reinforcement Learning for Real Life

  4. arXiv:2106.03763  [pdf, other

    cs.LG

    Vanishing Curvature and the Power of Adaptive Methods in Randomly Initialized Deep Networks

    Authors: Antonio Orvieto, Jonas Kohler, Dario Pavllo, Thomas Hofmann, Aurelien Lucchi

    Abstract: This paper revisits the so-called vanishing gradient phenomenon, which commonly occurs in deep randomly initialized neural networks. Leveraging an in-depth analysis of neural chains, we first show that vanishing gradients cannot be circumvented when the network width scales with less than O(depth), even when initialized with the popular Xavier and He initializations. Second, we extend the analysis… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

  5. arXiv:2103.15627  [pdf, other

    cs.CV cs.GR cs.LG

    Learning Generative Models of Textured 3D Meshes from Real-World Images

    Authors: Dario Pavllo, Jonas Kohler, Thomas Hofmann, Aurelien Lucchi

    Abstract: Recent advances in differentiable rendering have sparked an interest in learning generative models of textured 3D meshes from image collections. These models natively disentangle pose and appearance, enable downstream applications in computer graphics, and improve the ability of generative models to understand the concept of image formation. Although there has been prior work on learning such mode… ▽ More

    Submitted 17 August, 2021; v1 submitted 29 March, 2021; originally announced March 2021.

    Comments: ICCV 2021

  6. arXiv:2006.07660  [pdf, other

    cs.CV cs.GR cs.LG eess.IV

    Convolutional Generation of Textured 3D Meshes

    Authors: Dario Pavllo, Graham Spinks, Thomas Hofmann, Marie-Francine Moens, Aurelien Lucchi

    Abstract: While recent generative models for 2D images achieve impressive visual results, they clearly lack the ability to perform 3D reasoning. This heavily restricts the degree of control over generated objects as well as the possible applications of such models. In this work, we bridge this gap by leveraging recent advances in differentiable rendering. We design a framework that can generate triangle mes… ▽ More

    Submitted 23 October, 2020; v1 submitted 13 June, 2020; originally announced June 2020.

    Comments: NeurIPS 2020, Oral presentation. Code at https://github.com/dariopavllo/convmesh

  7. arXiv:2004.03459  [pdf, other

    cs.CV cs.LG stat.ML

    Hierarchical Image Classification using Entailment Cone Embeddings

    Authors: Ankit Dhall, Anastasia Makarova, Octavian Ganea, Dario Pavllo, Michael Greeff, Andreas Krause

    Abstract: Image classification has been studied extensively, but there has been limited work in using unconventional, external guidance other than traditional image-label pairs for training. We present a set of methods for leveraging information about the semantic hierarchy embedded in class labels. We first inject label-hierarchy knowledge into an arbitrary CNN-based classifier and empirically show that av… ▽ More

    Submitted 25 April, 2020; v1 submitted 2 April, 2020; originally announced April 2020.

    Comments: Accepted in the CVPR 2020 Workshop on Differential Geometry in Computer Vision and Machine Learning

  8. Controlling Style and Semantics in Weakly-Supervised Image Generation

    Authors: Dario Pavllo, Aurelien Lucchi, Thomas Hofmann

    Abstract: We propose a weakly-supervised approach for conditional image generation of complex scenes where a user has fine control over objects appearing in the scene. We exploit sparse semantic maps to control object shapes and classes, as well as textual descriptions or attributes to control both local and global style. In order to condition our model on textual descriptions, we introduce a semantic atten… ▽ More

    Submitted 21 July, 2020; v1 submitted 6 December, 2019; originally announced December 2019.

    Comments: European Conference on Computer Vision (ECCV) 2020, Spotlight. Code at https://github.com/dariopavllo/style-semantics

  9. Modeling Human Motion with Quaternion-based Neural Networks

    Authors: Dario Pavllo, Christoph Feichtenhofer, Michael Auli, David Grangier

    Abstract: Previous work on predicting or generating 3D human pose sequences regresses either joint rotations or joint positions. The former strategy is prone to error accumulation along the kinematic chain, as well as discontinuities when using Euler angles or exponential maps as parameterizations. The latter requires re-projection onto skeleton constraints to avoid bone stretching and invalid configuration… ▽ More

    Submitted 26 October, 2019; v1 submitted 21 January, 2019; originally announced January 2019.

    Comments: Follow-up work of arXiv:1805.06485. This is a pre-print of an article published in IJCV. The final authenticated version is available online at https://doi.org/10.1007/s11263-019-01245-6

    Journal ref: International Journal of Computer Vision (Special Issue on Machine Vision with Deep Learning), 2019. Online ISSN: 1573-1405

  10. arXiv:1811.11742  [pdf, other

    cs.CV

    3D human pose estimation in video with temporal convolutions and semi-supervised training

    Authors: Dario Pavllo, Christoph Feichtenhofer, David Grangier, Michael Auli

    Abstract: In this work, we demonstrate that 3D poses in video can be effectively estimated with a fully convolutional model based on dilated temporal convolutions over 2D keypoints. We also introduce back-projection, a simple and effective semi-supervised training method that leverages unlabeled video data. We start with predicted 2D keypoints for unlabeled video, then estimate 3D poses and finally back-pro… ▽ More

    Submitted 29 March, 2019; v1 submitted 28 November, 2018; originally announced November 2018.

    Comments: CVPR 2019

  11. arXiv:1805.06485  [pdf, other

    cs.CV

    QuaterNet: A Quaternion-based Recurrent Model for Human Motion

    Authors: Dario Pavllo, David Grangier, Michael Auli

    Abstract: Deep learning for predicting or generating 3D human pose sequences is an active research area. Previous work regresses either joint rotations or joint positions. The former strategy is prone to error accumulation along the kinematic chain, as well as discontinuities when using Euler angle or exponential map parameterizations. The latter requires re-projection onto skeleton constraints to avoid bon… ▽ More

    Submitted 31 July, 2018; v1 submitted 16 May, 2018; originally announced May 2018.

    Comments: British Machine Vision Conference (BMVC), 2018

  12. arXiv:1804.02525  [pdf, other

    cs.SI cs.CL cs.IR

    Quootstrap: Scalable Unsupervised Extraction of Quotation-Speaker Pairs from Large News Corpora via Bootstrapping

    Authors: Dario Pavllo, Tiziano Piccardi, Robert West

    Abstract: We propose Quootstrap, a method for extracting quotations, as well as the names of the speakers who uttered them, from large news corpora. Whereas prior work has addressed this problem primarily with supervised machine learning, our approach follows a fully unsupervised bootstrapping paradigm. It leverages the redundancy present in large news corpora, more precisely, the fact that the same quotati… ▽ More

    Submitted 7 April, 2018; originally announced April 2018.

    Comments: Accepted at the 12th International Conference on Web and Social Media (ICWSM), 2018