Showing 1–6 of 6 results for author: Ingle, R

Searching in archive cs.
  1. arXiv:2308.15037  [pdf, other]

    cs.CV

    Is it an i or an l: Test-time Adaptation of Text Line Recognition Models

    Authors: Debapriya Tula, Sujoy Paul, Gagan Madan, Peter Garst, Reeve Ingle, Gaurav Aggarwal

    Abstract: Recognizing text lines from images is a challenging problem, especially for handwritten documents due to large variations in writing styles. While text line recognition models are generally trained on large corpora of real and synthetic data, such models can still make frequent mistakes if the handwriting is inscrutable or the image acquisition process adds corruptions, such as noise, blur, compre…

    Submitted 29 August, 2023; originally announced August 2023.

  2. arXiv:2308.09671  [pdf, other]

    cs.CL

    OCR Language Models with Custom Vocabularies

    Authors: Peter Garst, Reeve Ingle, Yasuhisa Fujii

    Abstract: Language models are useful adjuncts to optical models for producing accurate optical character recognition (OCR) results. One factor which limits the power of language models in this context is the existence of many specialized domains with language statistics very different from those implied by a general language model - think of checks, medical prescriptions, and many other specialized document…

    Submitted 18 August, 2023; originally announced August 2023.

  3. XTREME-UP: A User-Centric Scarce-Data Benchmark for Under-Represented Languages

    Authors: Sebastian Ruder, Jonathan H. Clark, Alexander Gutkin, Mihir Kale, Min Ma, Massimo Nicosia, Shruti Rijhwani, Parker Riley, Jean-Michel A. Sarr, Xinyi Wang, John Wieting, Nitish Gupta, Anna Katanova, Christo Kirov, Dana L. Dickinson, Brian Roark, Bidisha Samanta, Connie Tao, David I. Adelani, Vera Axelrod, Isaac Caswell, Colin Cherry, Dan Garrette, Reeve Ingle, Melvin Johnson, et al. (2 additional authors not shown)

    Abstract: Data scarcity is a crucial issue for the development of highly multilingual NLP systems. Yet for many under-represented languages (ULs) -- languages for which NLP research is particularly far behind in meeting user needs -- it is feasible to annotate small amounts of data. Motivated by this, we propose XTREME-UP, a benchmark defined by: its focus on the scarce-data scenario rather than zero-shot;…

    Submitted 24 May, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

  4. arXiv:2110.05270  [pdf]

    cs.CV cs.AI

    Investigating Transfer Learning Capabilities of Vision Transformers and CNNs by Fine-Tuning a Single Trainable Block

    Authors: Durvesh Malpure, Onkar Litake, Rajesh Ingle

    Abstract: In recent developments in the field of Computer Vision, a rise is seen in the use of transformer-based architectures. They are surpassing the state-of-the-art set by CNN architectures in accuracy but on the other hand, they are computationally very expensive to train from scratch. As these models are quite recent in the Computer Vision field, there is a need to study their transfer learning capabil…

    Submitted 11 October, 2021; originally announced October 2021.

    Comments: 8 pages, 4 figures

  5. arXiv:2104.07787  [pdf, other]

    cs.CV cs.LG

    Rethinking Text Line Recognition Models

    Authors: Daniel Hernandez Diaz, Siyang Qin, Reeve Ingle, Yasuhisa Fujii, Alessandro Bissacco

    Abstract: In this paper, we study the problem of text line recognition. Unlike most approaches targeting specific domains such as scene-text or handwritten documents, we investigate the general problem of developing a universal architecture that can extract text from any image, regardless of source or input modality. We consider two decoder families (Connectionist Temporal Classification and Transformer) an…

    Submitted 21 April, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

    Comments: 11 pages, 6 figures

  6. arXiv:1904.09150  [pdf, other]

    cs.CV

    A Scalable Handwritten Text Recognition System

    Authors: R. Reeve Ingle, Yasuhisa Fujii, Thomas Deselaers, Jonathan Baccash, Ashok C. Popat

    Abstract: Many studies on (Offline) Handwritten Text Recognition (HTR) systems have focused on building state-of-the-art models for line recognition on small corpora. However, adding HTR capability to a large scale multilingual OCR system poses new challenges. This paper addresses three problems in building such systems: data, efficiency, and integration. Firstly, one of the biggest challenges is obtaining…

    Submitted 14 June, 2019; v1 submitted 19 April, 2019; originally announced April 2019.

    Comments: ICDAR 2019