Zum Hauptinhalt springen

Showing 1–8 of 8 results for author: Orlov, N

Searching in archive cs. Search in all archives.
.
  1. Interactive Neural Painting

    Authors: Elia Peruzzo, Willi Menapace, Vidit Goel, Federica Arrigoni, Hao Tang, Xingqian Xu, Arman Chopikyan, Nikita Orlov, Yuxiao Hu, Humphrey Shi, Nicu Sebe, Elisa Ricci

    Abstract: In the last few years, Neural Painting (NP) techniques became capable of producing extremely realistic artworks. This paper advances the state of the art in this emerging research domain by proposing the first approach for Interactive NP. Considering a setting where a user looks at a scene and tries to reproduce it on a painting, our objective is to develop a computational framework to assist the… ▽ More

    Submitted 31 July, 2023; originally announced July 2023.

    Comments: This is a preprint version of the paper to appear at Computer Vision and Image Understanding (CVIU). The final journal version will be available at https://www.sciencedirect.com/science/article/pii/S1077314223001583

    Journal ref: 10.1016/j.cviu.2023.103778

  2. arXiv:2211.06220  [pdf, other

    cs.CV

    OneFormer: One Transformer to Rule Universal Image Segmentation

    Authors: Jitesh Jain, Jiachen Li, MangTik Chiu, Ali Hassani, Nikita Orlov, Humphrey Shi

    Abstract: Universal Image Segmentation is not a new concept. Past attempts to unify image segmentation in the last decades include scene parsing, panoptic segmentation, and, more recently, new panoptic architectures. However, such panoptic architectures do not truly unify image segmentation because they need to be trained individually on the semantic, instance, or panoptic segmentation to achieve the best p… ▽ More

    Submitted 26 December, 2022; v1 submitted 10 November, 2022; originally announced November 2022.

    Comments: Project Page: https://praeclarumjj3.github.io/oneformer

  3. arXiv:2209.13465  [pdf, other

    cs.CV cs.AI cs.LG

    AdaFocusV3: On Unified Spatial-temporal Dynamic Video Recognition

    Authors: Yulin Wang, Yang Yue, Xinhong Xu, Ali Hassani, Victor Kulikov, Nikita Orlov, Shiji Song, Humphrey Shi, Gao Huang

    Abstract: Recent research has revealed that reducing the temporal and spatial redundancy are both effective approaches towards efficient video recognition, e.g., allocating the majority of computation to a task-relevant subset of frames or the most valuable image regions of each frame. However, in most existing works, either type of redundancy is typically modeled with another absent. This paper explores th… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: ECCV 2022

  4. arXiv:2206.04655  [pdf, other

    cs.CV

    Towards Layer-wise Image Vectorization

    Authors: Xu Ma, Yuqian Zhou, Xingqian Xu, Bin Sun, Valerii Filev, Nikita Orlov, Yun Fu, Humphrey Shi

    Abstract: Image rasterization is a mature technique in computer graphics, while image vectorization, the reverse path of rasterization, remains a major challenge. Recent advanced deep learning-based models achieve vectorization and semantic interpolation of vector graphs and demonstrate a better topology of generating new figures. However, deep models cannot be easily generalized to out-of-domain testing da… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: Accepted as Oral Presentation at CVPR 2022

  5. arXiv:2112.14238  [pdf, other

    cs.CV cs.AI cs.LG

    AdaFocus V2: End-to-End Training of Spatial Dynamic Networks for Video Recognition

    Authors: Yulin Wang, Yang Yue, Yuanze Lin, Haojun Jiang, Zihang Lai, Victor Kulikov, Nikita Orlov, Humphrey Shi, Gao Huang

    Abstract: Recent works have shown that the computational efficiency of video recognition can be significantly improved by reducing the spatial redundancy. As a representative work, the adaptive focus method (AdaFocus) has achieved a favorable trade-off between accuracy and inference speed by dynamically identifying and attending to the informative regions in each video frame. However, AdaFocus requires a co… ▽ More

    Submitted 11 April, 2022; v1 submitted 28 December, 2021; originally announced December 2021.

    Comments: Accepted by CVPR-2022

  6. arXiv:2112.12782  [pdf, other

    cs.CV cs.LG

    SeMask: Semantically Masked Transformers for Semantic Segmentation

    Authors: Jitesh Jain, Anukriti Singh, Nikita Orlov, Zilong Huang, Jiachen Li, Steven Walton, Humphrey Shi

    Abstract: Finetuning a pretrained backbone in the encoder part of an image transformer network has been the traditional approach for the semantic segmentation task. However, such an approach leaves out the semantic context that an image provides during the encoding stage. This paper argues that incorporating semantic information of the image into pretrained hierarchical transformer-based backbones while fin… ▽ More

    Submitted 13 April, 2022; v1 submitted 23 December, 2021; originally announced December 2021.

    Comments: Updated experiments with Mix-Transformer (MiT) on ADE20K and added an analysis section

  7. arXiv:1911.05189  [pdf, other

    cs.CV cs.LG stat.ML

    Fast Glare Detection in Document Images

    Authors: Dmitry Rodin, Nikita Orlov

    Abstract: Glare is a phenomenon that occurs when the scene has a reflection of a light source or has one in it. This luminescence can hide useful information from the image, making text recognition virtually impossible. In this paper, we propose an approach to detect glare in images taken by users via mobile devices. Our method divides the document into blocks and collects luminance features from the origin… ▽ More

    Submitted 24 October, 2019; originally announced November 2019.

    Comments: 4 pages, Workshop on Industrial Applications of Document Analysis and Recognition 2019

    ACM Class: I.2.10

  8. FaSTExt: Fast and Small Text Extractor

    Authors: Alexander Filonenko, Konstantin Gudkov, Aleksei Lebedev, Nikita Orlov, Ivan Zagaynov

    Abstract: Text detection in natural images is a challenging but necessary task for many applications. Existing approaches utilize large deep convolutional neural networks making it difficult to use them in real-world tasks. We propose a small yet relatively precise text extraction method. The basic component of it is a convolutional neural network which works in a fully-convolutional manner and produces res… ▽ More

    Submitted 14 August, 2019; originally announced August 2019.

    Comments: 6 pages, 8th International Workshop on Camera-Based Document Analysis & Recognition

    ACM Class: I.2.10