Annotated Vossian Antonomasia Dataset

doi:10.5281/zenodo.7772308

Published March 26, 2023 | Version v1

Dataset Open

Annotated Vossian Antonomasia Dataset

1. Humboldt-Universität zu Berlin
2. Freie Universität Berlin

This dataset is a collection of Vossian Antonomasia (VA). It comprises 6,096 entries, 3,115 of them contain a VA expression in the associated sentence. When a VA expression exists, the source (`*`), target (`|`), and modifier (`/`) are tagged by surrounding the respective words with the indicated character. Each entry also contains

a link to the New York Times article that contains the sentence,
the Wikidata IDs for both, the source and target (if they exist),
the full target name (if it is mentioned in the corresponding NYT article).

Creation: The dataset has been developed through a series of research papers. Initially, Schwab et al. (2019) created a dataset based on the NYT corpus by Sandhaus (2008) with binary labels, source annotations, and the corresponding Wikidata IDs for sources. The annotation of modifier and target was conducted in Schwab et al. (2022). The extraction of the full target name and the Wikidata ID of the target was performed in Schwab et al. (2023).

Files

README.md

Files (2.2 MB)

Name	Size	Download all
README.md md5:b66d3ccc2e6c8ef57e84fe689cc1d9e2	4.3 kB	Preview Download
va_data.tsv md5:48e0984de90f17e8fc4c77d1e97e73b5	2.2 MB	Download

Additional details

Is compiled by: Conference paper: 10.18653/v1/D19-1647 (DOI); Journal article: 10.3389/frai.2022.868249 (DOI)

295

Views

Downloads

Show more details

	All versions	This version
Views	295	295
Downloads	26	26
Data volume	28.7 MB	28.7 MB

More info on how stats are collected....

DOI

Resource type

Dataset

Publisher

Zenodo

Conference

The 7th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature 2023 (SIGHUM 2023) , Dubrovnik, Croatia, 2023-05-05

Languages

English

Creative Commons Attribution 4.0 International

The Creative Commons Attribution license allows re-distribution and re-use of a licensed work on the condition that the creator is appropriately credited. Read more

Technical metadata

Created: March 26, 2023
Modified: March 27, 2023

Annotated Vossian Antonomasia Dataset

Creators

Description

Files

README.md

Files (2.2 MB)

Additional details

Related works