Image captioning in different languages

van Miltenburg, Emiel

arXiv:2407.09495 (cs)

[Submitted on 31 May 2024]

Title: Image captioning in different languages

Authors: Emiel van Miltenburg

Abstract: This short position paper provides a manually curated list of non-English image captioning datasets (as of May 2024). The list reveals a dearth of datasets in different languages: only 23 different languages are represented. With the addition of the Crossmodal-3600 dataset (Thapliyal et al., 2022, 36 languages) this number increases somewhat, but it remains tiny compared to the thousands of spoken languages that exist. The paper closes with some open questions for the field of Vision & Language.

Subjects:	Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2407.09495 [cs.CL]
	(or arXiv:2407.09495v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2407.09495

Submission history

From: Emiel van Miltenburg
[v1] Fri, 31 May 2024 09:37:54 UTC (56 KB)

