Zum Hauptinhalt springen

Showing 1–7 of 7 results for author: Clanuwat, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.02665  [pdf, other

    cs.CV cs.LG physics.ao-ph

    Digital Typhoon: Long-term Satellite Image Dataset for the Spatio-Temporal Modeling of Tropical Cyclones

    Authors: Asanobu Kitamoto, Jared Hwang, Bastien Vuillod, Lucas Gautier, Yingtao Tian, Tarin Clanuwat

    Abstract: This paper presents the official release of the Digital Typhoon dataset, the longest typhoon satellite image dataset for 40+ years aimed at benchmarking machine learning models for long-term spatio-temporal data. To build the dataset, we developed a workflow to create an infrared typhoon-centered image for cropping using Lambert azimuthal equal-area projection referring to the best track data. We… ▽ More

    Submitted 5 November, 2023; originally announced November 2023.

    Comments: Accepted by NeurIPS 2023 Datasets and Benchmarks Track (Spotlight)

  2. arXiv:2106.06786  [pdf, other

    cs.CL cs.DL cs.LG

    Predicting the Ordering of Characters in Japanese Historical Documents

    Authors: Alex Lamb, Tarin Clanuwat, Siyu Han, Mikel Bober-Irizar, Asanobu Kitamoto

    Abstract: Japan is a unique country with a distinct cultural heritage, which is reflected in billions of historical documents that have been preserved. However, the change in Japanese writing system in 1900 made these documents inaccessible for the general public. A major research project has been to make these historical documents accessible and understandable. An increasing amount of research has focused… ▽ More

    Submitted 12 June, 2021; originally announced June 2021.

  3. arXiv:2106.02267  [pdf, other

    cs.CV cs.LG

    Ukiyo-e Analysis and Creativity with Attribute and Geometry Annotation

    Authors: Yingtao Tian, Tarin Clanuwat, Chikahiko Suzuki, Asanobu Kitamoto

    Abstract: The study of Ukiyo-e, an important genre of pre-modern Japanese art, focuses on the object and style like other artwork researches. Such study has benefited from the renewed interest by the machine learning community in culturally important topics, leading to interdisciplinary works including collections of images, quantitative approaches, and machine learning-based creativities. They, however, ha… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

  4. arXiv:2002.08595  [pdf, other

    cs.CV cs.LG stat.ML

    KaoKore: A Pre-modern Japanese Art Facial Expression Dataset

    Authors: Yingtao Tian, Chikahiko Suzuki, Tarin Clanuwat, Mikel Bober-Irizar, Alex Lamb, Asanobu Kitamoto

    Abstract: From classifying handwritten digits to generating strings of text, the datasets which have received long-time focus from the machine learning community vary greatly in their subject matter. This has motivated a renewed interest in building datasets which are socially and culturally relevant, so that algorithmic research may have a more direct and immediate impact on society. One such area is in hi… ▽ More

    Submitted 20 February, 2020; originally announced February 2020.

  5. arXiv:1910.09433  [pdf, other

    cs.CV cs.LG

    KuroNet: Pre-Modern Japanese Kuzushiji Character Recognition with Deep Learning

    Authors: Tarin Clanuwat, Alex Lamb, Asanobu Kitamoto

    Abstract: Kuzushiji, a cursive writing style, had been used in Japan for over a thousand years starting from the 8th century. Over 3 millions books on a diverse array of topics, such as literature, science, mathematics and even cooking are preserved. However, following a change to the Japanese writing system in 1900, Kuzushiji has not been included in regular school curricula. Therefore, most Japanese nativ… ▽ More

    Submitted 21 October, 2019; originally announced October 2019.

    Comments: International Conference on Document Recognition (ICDAR) 2019 [oral]

  6. arXiv:1905.05377  [pdf

    cs.CV cs.DL

    A human-inspired recognition system for premodern Japanese historical documents

    Authors: Anh Duc Le, Tarin Clanuwat, Asanobu Kitamoto

    Abstract: Recognition of historical documents is a challenging problem due to the noised, damaged characters and background. However, in Japanese historical documents, not only contains the mentioned problems, pre-modern Japanese characters were written in cursive and are connected. Therefore, character segmentation based methods do not work well. This leads to the idea of creating a new recognition system.… ▽ More

    Submitted 13 May, 2019; originally announced May 2019.

  7. arXiv:1812.01718  [pdf, other

    cs.CV cs.LG stat.ML

    Deep Learning for Classical Japanese Literature

    Authors: Tarin Clanuwat, Mikel Bober-Irizar, Asanobu Kitamoto, Alex Lamb, Kazuaki Yamamoto, David Ha

    Abstract: Much of machine learning research focuses on producing models which perform well on benchmark tasks, in turn improving our understanding of the challenges associated with those tasks. From the perspective of ML researchers, the content of the task itself is largely irrelevant, and thus there have increasingly been calls for benchmark tasks to more heavily focus on problems which are of social or c… ▽ More

    Submitted 3 December, 2018; originally announced December 2018.

    Comments: To appear at Neural Information Processing Systems 2018 Workshop on Machine Learning for Creativity and Design