Skip to main content

Showing 1–11 of 11 results for author: Markeeva, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.09308  [pdf, other

    cs.CL cs.LG

    Transformers meet Neural Algorithmic Reasoners

    Authors: Wilfried Bounsi, Borja Ibarz, Andrew Dudzik, Jessica B. Hamrick, Larisa Markeeva, Alex Vitvitskyi, Razvan Pascanu, Petar Veličković

    Abstract: Transformers have revolutionized machine learning with their simple yet effective architecture. Pre-training Transformers on massive text datasets from the Internet has led to unmatched generalization for natural language understanding (NLU) tasks. However, such language models remain fragile when tasked with algorithmic forms of reasoning, where computations must be precise and robust. To address… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: To appear at CVPR 2024 Multimodal Algorithmic Reasoning (MAR) Workshop. 10 pages, 5 figures

  2. arXiv:2406.04229  [pdf, other

    cs.LG cs.AI cs.CL cs.DS stat.ML

    The CLRS-Text Algorithmic Reasoning Language Benchmark

    Authors: Larisa Markeeva, Sean McLeish, Borja Ibarz, Wilfried Bounsi, Olga Kozlova, Alex Vitvitskyi, Charles Blundell, Tom Goldstein, Avi Schwarzschild, Petar Veličković

    Abstract: Eliciting reasoning capabilities from language models (LMs) is a critical direction on the path towards building intelligent systems. Most recent studies dedicated to reasoning focus on out-of-distribution performance on procedurally-generated synthetic benchmarks, bespoke-built to evaluate specific skills only. This trend makes results hard to transfer across publications, slowing down progress.… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Preprint, under review. Comments welcome

  3. arXiv:2305.13786  [pdf, other

    cs.CV cs.AI cs.LG

    Perception Test: A Diagnostic Benchmark for Multimodal Video Models

    Authors: Viorica Pătrăucean, Lucas Smaira, Ankush Gupta, Adrià Recasens Continente, Larisa Markeeva, Dylan Banarse, Skanda Koppula, Joseph Heyward, Mateusz Malinowski, Yi Yang, Carl Doersch, Tatiana Matejovicova, Yury Sulsky, Antoine Miech, Alex Frechette, Hanna Klimczak, Raphael Koster, Junlin Zhang, Stephanie Winkler, Yusuf Aytar, Simon Osindero, Dima Damen, Andrew Zisserman, João Carreira

    Abstract: We propose a novel multimodal video benchmark - the Perception Test - to evaluate the perception and reasoning skills of pre-trained multimodal models (e.g. Flamingo, SeViLA, or GPT-4). Compared to existing benchmarks that focus on computational tasks (e.g. classification, detection or tracking), the Perception Test focuses on skills (Memory, Abstraction, Physics, Semantics) and types of reasoning… ▽ More

    Submitted 30 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: 37th Conference on Neural Information Processing Systems (NeurIPS 2023) Track on Datasets and Benchmarks

  4. arXiv:2304.05342  [pdf, other

    cs.RO

    TT-SDF2PC: Registration of Point Cloud and Compressed SDF Directly in the Memory-Efficient Tensor Train Domain

    Authors: Alexey I. Boyko, Anastasiia Kornilova, Rahim Tariverdizadeh, Mirfarid Musavian, Larisa Markeeva, Ivan Oseledets, Gonzalo Ferrer

    Abstract: This paper addresses the following research question: ``can one compress a detailed 3D representation and use it directly for point cloud registration?''. Map compression of the scene can be achieved by the tensor train (TT) decomposition of the signed distance function (SDF) representation. It regulates the amount of data reduced by the so-called TT-ranks. Using this representation we have prop… ▽ More

    Submitted 11 April, 2023; originally announced April 2023.

  5. arXiv:2301.05747  [pdf, other

    cs.CV cs.AI

    Laser: Latent Set Representations for 3D Generative Modeling

    Authors: Pol Moreno, Adam R. Kosiorek, Heiko Strathmann, Daniel Zoran, Rosalia G. Schneider, Björn Winckler, Larisa Markeeva, Théophane Weber, Danilo J. Rezende

    Abstract: NeRF provides unparalleled fidelity of novel view synthesis: rendering a 3D scene from an arbitrary viewpoint. NeRF requires training on a large number of views that fully cover a scene, which limits its applicability. While these issues can be addressed by learning a prior over scenes in various forms, previous approaches have been either applied to overly simple scenes or struggling to render un… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

    Comments: See https://laser-nv-paper.github.io/ for video results

  6. arXiv:2211.03726  [pdf, other

    cs.CV stat.ML

    TAP-Vid: A Benchmark for Tracking Any Point in a Video

    Authors: Carl Doersch, Ankush Gupta, Larisa Markeeva, Adrià Recasens, Lucas Smaira, Yusuf Aytar, João Carreira, Andrew Zisserman, Yi Yang

    Abstract: Generic motion understanding from video involves not only tracking objects, but also perceiving how their surfaces deform and move. This information is useful to make inferences about 3D shape, physical properties and object interactions. While the problem of tracking arbitrary physical points on surfaces over longer video clips has received some attention, no dataset or benchmark for evaluation e… ▽ More

    Submitted 31 March, 2023; v1 submitted 7 November, 2022; originally announced November 2022.

    Comments: Published in NeurIPS Datasets and Benchmarks track, 2022

  7. arXiv:2107.06825  [pdf, other

    cs.LG cs.CV

    A Generalized Lottery Ticket Hypothesis

    Authors: Ibrahim Alabdulmohsin, Larisa Markeeva, Daniel Keysers, Ilya Tolstikhin

    Abstract: We introduce a generalization to the lottery ticket hypothesis in which the notion of "sparsity" is relaxed by choosing an arbitrary basis in the space of parameters. We present evidence that the original results reported for the canonical basis continue to hold in this broader setting. We describe how structured pruning methods, including pruning units or factorizing fully-connected layers into p… ▽ More

    Submitted 26 July, 2021; v1 submitted 3 July, 2021; originally announced July 2021.

    Comments: Workshop on Sparsity in Neural Networks: Advancing Understanding and Practice (SNN'21). Updates: New curve on Figure 2(left) and discussion on Li et al

    MSC Class: 68T05 ACM Class: I.2.6; I.2.10

  8. arXiv:2106.05237  [pdf, other

    cs.CV cs.AI cs.LG

    Knowledge distillation: A good teacher is patient and consistent

    Authors: Lucas Beyer, Xiaohua Zhai, Amélie Royer, Larisa Markeeva, Rohan Anil, Alexander Kolesnikov

    Abstract: There is a growing discrepancy in computer vision between large-scale models that achieve state-of-the-art performance and models that are affordable in practical applications. In this paper we address this issue and significantly bridge the gap between these two types of models. Throughout our empirical investigation we do not aim to necessarily propose a new method, but strive to identify a robu… ▽ More

    Submitted 21 June, 2022; v1 submitted 9 June, 2021; originally announced June 2021.

    Comments: Lucas, Xiaohua, Amélie, Larisa, and Alex contributed equally; CVPR 2022

  9. arXiv:2004.09222  [pdf, other

    cs.LG stat.ML

    Towards Understanding Normalization in Neural ODEs

    Authors: Julia Gusak, Larisa Markeeva, Talgat Daulbaev, Alexandr Katrutsa, Andrzej Cichocki, Ivan Oseledets

    Abstract: Normalization is an important and vastly investigated technique in deep learning. However, its role for Ordinary Differential Equation based networks (neural ODEs) is still poorly understood. This paper investigates how different normalization techniques affect the performance of neural ODEs. Particularly, we show that it is possible to achieve 93% accuracy in the CIFAR-10 classification task, and… ▽ More

    Submitted 27 April, 2020; v1 submitted 20 April, 2020; originally announced April 2020.

  10. arXiv:2003.05271  [pdf, other

    cs.NE math.NA stat.ML

    Interpolation Technique to Speed Up Gradients Propagation in Neural ODEs

    Authors: Talgat Daulbaev, Alexandr Katrutsa, Larisa Markeeva, Julia Gusak, Andrzej Cichocki, Ivan Oseledets

    Abstract: We propose a simple interpolation-based method for the efficient approximation of gradients in neural ODE models. We compare it with the reverse dynamic method (known in the literature as "adjoint method") to train neural ODEs on classification, density estimation, and inference approximation tasks. We also propose a theoretical justification of our approach using logarithmic norm formalism. As a… ▽ More

    Submitted 30 October, 2020; v1 submitted 11 March, 2020; originally announced March 2020.

  11. arXiv:1903.09973  [pdf, other

    cs.LG cs.CV stat.ML

    MUSCO: Multi-Stage Compression of neural networks

    Authors: Julia Gusak, Maksym Kholiavchenko, Evgeny Ponomarev, Larisa Markeeva, Ivan Oseledets, Andrzej Cichocki

    Abstract: The low-rank tensor approximation is very promising for the compression of deep neural networks. We propose a new simple and efficient iterative approach, which alternates low-rank factorization with a smart rank selection and fine-tuning. We demonstrate the efficiency of our method comparing to non-iterative ones. Our approach improves the compression rate while maintaining the accuracy for a var… ▽ More

    Submitted 15 November, 2019; v1 submitted 24 March, 2019; originally announced March 2019.