Zum Hauptinhalt springen

Showing 1–1 of 1 results for author: Gavrishina, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2209.13750  [pdf, other

    cs.CL

    RuDSI: graph-based word sense induction dataset for Russian

    Authors: Anna Aksenova, Ekaterina Gavrishina, Elisey Rykov, Andrey Kutuzov

    Abstract: We present RuDSI, a new benchmark for word sense induction (WSI) in Russian. The dataset was created using manual annotation and semi-automatic clustering of Word Usage Graphs (WUGs). Unlike prior WSI datasets for Russian, RuDSI is completely data-driven (based on texts from Russian National Corpus), with no external word senses imposed on annotators. Depending on the parameters of graph clusterin… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: TextGraphs-16 workshop at the CoLING-2022 conference