Zum Hauptinhalt springen

Showing 1–7 of 7 results for author: Rakhuba, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10019  [pdf, other

    cs.LG cs.AI cs.CL cs.CV math.NA

    Group and Shuffle: Efficient Structured Orthogonal Parametrization

    Authors: Mikhail Gorbunov, Nikolay Yudin, Vera Soboleva, Aibek Alanov, Alexey Naumov, Maxim Rakhuba

    Abstract: The increasing size of neural networks has led to a growing demand for methods of efficient fine-tuning. Recently, an orthogonal fine-tuning paradigm was introduced that uses orthogonal matrices for adapting the weights of a pretrained model. In this paper, we introduce a new class of structured matrices, which unifies and generalizes structured classes from previous works. We examine properties o… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  2. arXiv:2211.13771  [pdf, other

    cs.LG cs.CV

    Towards Practical Control of Singular Values of Convolutional Layers

    Authors: Alexandra Senderovich, Ekaterina Bulatova, Anton Obukhov, Maxim Rakhuba

    Abstract: In general, convolutional neural networks (CNNs) are easy to train, but their essential properties, such as generalization error and adversarial robustness, are hard to control. Recent research demonstrated that singular values of convolutional layers significantly affect such elusive properties and offered several methods for controlling them. Nevertheless, these methods present an intractable co… ▽ More

    Submitted 24 November, 2022; originally announced November 2022.

    Comments: Published as a conference paper at NeurIPS 2022

  3. arXiv:2105.14250  [pdf, other

    cs.CV cs.LG

    Cherry-Picking Gradients: Learning Low-Rank Embeddings of Visual Data via Differentiable Cross-Approximation

    Authors: Mikhail Usvyatsov, Anastasia Makarova, Rafael Ballester-Ripoll, Maxim Rakhuba, Andreas Krause, Konrad Schindler

    Abstract: We propose an end-to-end trainable framework that processes large-scale visual data tensors by looking at a fraction of their entries only. Our method combines a neural network encoder with a tensor train decomposition to learn a low-rank latent encoding, coupled with cross-approximation (CA) to learn the representation through a subset of the original samples. CA is an adaptive sampling algorithm… ▽ More

    Submitted 12 November, 2021; v1 submitted 29 May, 2021; originally announced May 2021.

    Journal ref: Proc. International Conference on Computer Vision (ICCV) 2021

  4. arXiv:2103.14974  [pdf, other

    math.OC cs.LG cs.MS math.NA

    Automatic differentiation for Riemannian optimization on low-rank matrix and tensor-train manifolds

    Authors: Alexander Novikov, Maxim Rakhuba, Ivan Oseledets

    Abstract: In scientific computing and machine learning applications, matrices and more general multidimensional arrays (tensors) can often be approximated with the help of low-rank decompositions. Since matrices and tensors of fixed rank form smooth Riemannian manifolds, one of the popular tools for finding low-rank approximations is to use Riemannian optimization. Nevertheless, efficient implementation of… ▽ More

    Submitted 23 October, 2021; v1 submitted 27 March, 2021; originally announced March 2021.

  5. arXiv:2103.04217  [pdf, other

    cs.LG cs.CV stat.ML

    Spectral Tensor Train Parameterization of Deep Learning Layers

    Authors: Anton Obukhov, Maxim Rakhuba, Alexander Liniger, Zhiwu Huang, Stamatios Georgoulis, Dengxin Dai, Luc Van Gool

    Abstract: We study low-rank parameterizations of weight matrices with embedded spectral properties in the Deep Learning context. The low-rank property leads to parameter efficiency and permits taking computational shortcuts when computing mappings. Spectral properties are often subject to constraints in optimization problems, leading to better models and stability of optimization. We start by looking at the… ▽ More

    Submitted 13 July, 2021; v1 submitted 6 March, 2021; originally announced March 2021.

    Comments: Accepted at AISTATS 2021

  6. arXiv:2007.06631  [pdf, other

    cs.LG cs.CV stat.ML

    T-Basis: a Compact Representation for Neural Networks

    Authors: Anton Obukhov, Maxim Rakhuba, Stamatios Georgoulis, Menelaos Kanakis, Dengxin Dai, Luc Van Gool

    Abstract: We introduce T-Basis, a novel concept for a compact representation of a set of tensors, each of an arbitrary shape, which is often seen in Neural Networks. Each of the tensors in the set is modeled using Tensor Rings, though the concept applies to other Tensor Networks. Owing its name to the T-shape of nodes in diagram notation of Tensor Rings, T-Basis is simply a list of equally shaped three-dime… ▽ More

    Submitted 13 July, 2021; v1 submitted 13 July, 2020; originally announced July 2020.

    Comments: Accepted at ICML 2020

  7. arXiv:1412.6553  [pdf, other

    cs.CV cs.LG

    Speeding-up Convolutional Neural Networks Using Fine-tuned CP-Decomposition

    Authors: Vadim Lebedev, Yaroslav Ganin, Maksim Rakhuba, Ivan Oseledets, Victor Lempitsky

    Abstract: We propose a simple two-step approach for speeding up convolution layers within large convolutional neural networks based on tensor decomposition and discriminative fine-tuning. Given a layer, we use non-linear least squares to compute a low-rank CP-decomposition of the 4D convolution kernel tensor into a sum of a small number of rank-one tensors. At the second step, this decomposition is used to… ▽ More

    Submitted 24 April, 2015; v1 submitted 19 December, 2014; originally announced December 2014.