Showing 1–7 of 7 results for author: Raptis, M

  1. arXiv:2310.17674

    cs.CV

    Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis

    Authors: Shangbang Long, Siyang Qin, Yasuhisa Fujii, Alessandro Bissacco, Michalis Raptis

    Abstract: We propose Hierarchical Text Spotter (HTS), a novel method for the joint task of word-level text spotting and geometric layout analysis. HTS can recognize text in an image and identify its 4-level hierarchical structure: characters, words, lines, and paragraphs. The proposed HTS is characterized by two novel components: (1) a Unified-Detector-Polygon (UDP) that produces Bezier Curve polygons of te…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: Accepted to WACV 2024

  2. arXiv:2305.09750

    cs.CV

    ICDAR 2023 Competition on Hierarchical Text Detection and Recognition

    Authors: Shangbang Long, Siyang Qin, Dmitry Panteleev, Alessandro Bissacco, Yasuhisa Fujii, Michalis Raptis

    Abstract: We organize a competition on hierarchical text detection and recognition. The competition is aimed to promote research into deep learning models and systems that can jointly perform text detection and recognition and geometric layout analysis. We present details of the proposed competition organization, including tasks, datasets, evaluations, and schedule. During the competition period (from Janua…

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: ICDAR 2023 competition report by organizers (accepted and to be published officially later)

  3. arXiv:2203.15143

    cs.CV

    Towards End-to-End Unified Scene Text Detection and Layout Analysis

    Authors: Shangbang Long, Siyang Qin, Dmitry Panteleev, Alessandro Bissacco, Yasuhisa Fujii, Michalis Raptis

    Abstract: Scene text detection and document layout analysis have long been treated as two separate tasks in different image domains. In this paper, we bring them together and introduce the task of unified scene text detection and layout analysis. The first hierarchical scene text dataset is introduced to enable this novel research task. We also propose a novel method that is able to simultaneously detect sc…

    Submitted 3 June, 2022; v1 submitted 28 March, 2022; originally announced March 2022.

    Comments: To appear at CVPR 2022. Code Available: https://github.com/tensorflow/models/tree/master/official/projects/unified_detector

  4. arXiv:2203.12054

    cs.CV cs.AI

    Self-supervision through Random Segments with Autoregressive Coding (RandSAC)

    Authors: Tianyu Hua, Yonglong Tian, Sucheng Ren, Michalis Raptis, Hang Zhao, Leonid Sigal

    Abstract: Inspired by the success of self-supervised autoregressive representation learning in natural language (GPT and its variants), and advances in recent visual architecture design with Vision Transformers (ViTs), in this paper, we explore the effect various design choices have on the success of applying such training strategies for visual feature learning. Specifically, we introduce a novel strategy t…

    Submitted 25 October, 2022; v1 submitted 22 March, 2022; originally announced March 2022.

  5. arXiv:2203.09638

    cs.CV cs.LG

    Unified Line and Paragraph Detection by Graph Convolutional Networks

    Authors: Shuang Liu, Renshen Wang, Michalis Raptis, Yasuhisa Fujii

    Abstract: We formulate the task of detecting lines and paragraphs in a document into a unified two-level clustering problem. Given a set of text detection boxes that roughly correspond to words, a text line is a cluster of boxes and a paragraph is a cluster of lines. These clusters form a two-level tree that represents a major part of the layout of a document. We use a graph convolutional network to predict…

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: Accepted to DAS 2022 as an oral paper

  6. arXiv:1908.09231

    cs.CV

    Towards Unconstrained End-to-End Text Spotting

    Authors: Siyang Qin, Alessandro Bissacco, Michalis Raptis, Yasuhisa Fujii, Ying Xiao

    Abstract: We propose an end-to-end trainable network that can simultaneously detect and recognize text of arbitrary shape, making substantial progress on the open problem of reading scene text of irregular shape. We formulate arbitrary shape text detection as an instance segmentation problem; an attention model is then used to decode the textual content of each irregularly shaped text region without rectifi…

    Submitted 24 August, 2019; originally announced August 2019.

    Comments: Accepted to ICCV 2019 as oral presentation

  7. arXiv:1206.4116

    stat.ML cs.AI

    Dependence Maximizing Temporal Alignment via Squared-Loss Mutual Information

    Authors: Makoto Yamada, Leonid Sigal, Michalis Raptis, Masashi Sugiyama

    Abstract: The goal of temporal alignment is to establish time correspondence between two sequences, which has many applications in a variety of areas such as speech processing, bioinformatics, computer vision, and computer graphics. In this paper, we propose a novel temporal alignment method called least-squares dynamic time warping (LSDTW). LSDTW finds an alignment that maximizes statistical dependency bet…

    Submitted 18 June, 2012; originally announced June 2012.

    Comments: 11 pages