Showing 1–6 of 6 results for author: Bulatov, K

Searching in archive cs.
  1. MIDV-2020: A Comprehensive Benchmark Dataset for Identity Document Analysis

    Authors: Konstantin Bulatov, Ekaterina Emelianova, Daniil Tropin, Natalya Skoryukina, Yulia Chernyshova, Alexander Sheshkus, Sergey Usilin, Zuheng Ming, Jean-Christophe Burie, Muhammad Muzzamil Luqman, Vladimir V. Arlazarov

    Abstract: Identity documents recognition is an important sub-field of document analysis, which deals with tasks of robust document detection, type identification, text fields recognition, as well as identity fraud prevention and document authenticity validation given photos, scans, or video frames of an identity document capture. Significant amount of research has been published on this topic in recent year…

    Submitted 1 July, 2021; originally announced July 2021.

    MSC Class: 68T10

    Journal ref: Computer Optics, volume 46, issue 2, p. 252-270, 2022

  2. arXiv:2008.02566

    cs.CV

    Fast Approximate Modelling of the Next Combination Result for Stopping the Text Recognition in a Video

    Authors: Konstantin Bulatov, Nadezhda Fedotova, Vladimir V. Arlazarov

    Abstract: In this paper, we consider a task of stopping the video stream recognition process of a text field, in which each frame is recognized independently and the individual results are combined together. The video stream recognition stopping problem is an under-researched topic with regards to computer vision, but its relevance for building high-performance video recognition systems is clear. Firstly,…

    Submitted 6 August, 2020; originally announced August 2020.

    Comments: 8 pages, 6 figures, 2 tables

    MSC Class: 68T10

  3. Methods of Weighted Combination for Text Field Recognition in a Video Stream

    Authors: Olga Petrova, Konstantin Bulatov, Vladimir L. Arlazarov

    Abstract: Due to a noticeable expansion of document recognition applicability, there is a high demand for recognition on mobile devices. A mobile camera, unlike a scanner, cannot always ensure the absence of various image distortions, therefore the task of improving the recognition precision is relevant. The advantage of mobile devices over scanners is the ability to use video stream input, which allows to…

    Submitted 27 November, 2019; originally announced November 2019.

    Comments: 6 pages, 4 figures, 1 table, accepted and presented at International Conference on Machine Vision 2019 (ICMV 2019)

    Journal ref: Proc. SPIE 11433 ICMV-2019 (2020), 114332L

  4. arXiv:1910.04107

    cs.CV stat.AP

    Next integrated result modelling for stopping the text field recognition process in a video using a result model with per-character alternatives

    Authors: Konstantin Bulatov, Boris Savelyev, Vladimir V. Arlazarov

    Abstract: In the field of document analysis and recognition using mobile devices for capturing, and the field of object recognition in a video stream, an important problem is determining the time when the capturing process should be stopped. Efficient stopping influences not only the total time spent for performing recognition and data entry, but the expected accuracy of the result as well. This paper is di…

    Submitted 9 October, 2019; originally announced October 2019.

    Comments: 6 pages, 3 figures, 1 table, submitted and accepted for the 12th International Conference on Machine Vision (ICMV 2019)

    MSC Class: 68T37

    Journal ref: Proc. SPIE 11433 ICMV-2019 (2020), 114332M

  5. MIDV-2019: Challenges of the modern mobile-based document OCR

    Authors: Konstantin Bulatov, Daniil Matalov, Vladimir V. Arlazarov

    Abstract: Recognition of identity documents using mobile devices has become a topic of a wide range of computer vision research. The portfolio of methods and algorithms for solving such tasks as face detection, document detection and rectification, text field recognition, and other, is growing, and the scarcity of datasets has become an important issue. One of the openly accessible datasets for evaluating s…

    Submitted 9 October, 2019; originally announced October 2019.

    Comments: 6 pages, 3 figures, 3 tables, 18 references, submitted and accepted to the 12th International Conference on Machine Vision (ICMV 2019)

    MSC Class: 68T45

    Journal ref: Proc. SPIE 11433 ICMV-2019 (2020), 114332N

  6. MIDV-500: A Dataset for Identity Documents Analysis and Recognition on Mobile Devices in Video Stream

    Authors: Vladimir V. Arlazarov, Konstantin Bulatov, Timofey Chernov, Vladimir L. Arlazarov

    Abstract: A lot of research has been devoted to identity documents analysis and recognition on mobile devices. However, no publicly available datasets designed for this particular problem currently exist. There are a few datasets which are useful for associated subtasks but in order to facilitate a more comprehensive scientific and technical approach to identity document recognition more specialized dataset…

    Submitted 11 February, 2020; v1 submitted 16 July, 2018; originally announced July 2018.

    Comments: 7 pages, 6 figures, 5 tables

    MSC Class: 68T10

    Journal ref: Computer optics 43 N5 (2019) 818-824