Showing 1–3 of 3 results for author: Bothwell, S

  1. arXiv:2404.16341  [pdf, other]

    cs.CL

    PILA: A Historical-Linguistic Dataset of Proto-Italic and Latin

    Authors: Stephen Bothwell, Brian DuSell, David Chiang, Brian Krostenko

    Abstract: Computational historical linguistics seeks to systematically understand processes of sound change, including during periods at which little to no formal recording of language is attested. At the same time, few computational resources exist which deeply explore phonological and morphological connections between proto-languages and their descendants. This is particularly true for the family of Itali…

    Submitted 25 April, 2024; originally announced April 2024.

    Comments: 12 pages, 1 figure, 9 tables. Accepted at LREC-COLING 2024

    ACM Class: I.2.7

  2. arXiv:2404.07792  [pdf, other]

    cs.CL cs.LG

    Nostra Domina at EvaLatin 2024: Improving Latin Polarity Detection through Data Augmentation

    Authors: Stephen Bothwell, Abigail Swenor, David Chiang

    Abstract: This paper describes submissions from the team Nostra Domina to the EvaLatin 2024 shared task of emotion polarity detection. Given the low-resource environment of Latin and the complexity of sentiment in rhetorical genres like poetry, we augmented the available data through automatic polarity annotation. We present two methods for doing so on the basis of the $k$-means algorithm, and we employ a v…

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: Proceedings of the Third Workshop on Language Technologies for Historical and Ancient Languages

  3. arXiv:2312.00100  [pdf, other]

    cs.CL

    Introducing Rhetorical Parallelism Detection: A New Task with Datasets, Metrics, and Baselines

    Authors: Stephen Bothwell, Justin DeBenedetto, Theresa Crnkovich, Hildegund Müller, David Chiang

    Abstract: Rhetoric, both spoken and written, involves not only content but also style. One common stylistic tool is $\textit{parallelism}$: the juxtaposition of phrases which have the same sequence of linguistic ($\textit{e.g.}$, phonological, syntactic, semantic) features. Despite the ubiquity of parallelism, the field of natural language processing has seldom investigated it, missing a chance to better un…

    Submitted 30 November, 2023; originally announced December 2023.

    Comments: 32 pages, 16 figures, 18 tables. Accepted at EMNLP 2023

    ACM Class: I.2.7