Showing 1–1 of 1 results for author: Sviridenko, T

  1. arXiv:1604.05372

    cs.CL

    Clustering Comparable Corpora of Russian and Ukrainian Academic Texts: Word Embeddings and Semantic Fingerprints

    Authors: Andrey Kutuzov, Mikhail Kopotev, Tatyana Sviridenko, Lyubov Ivanova

    Abstract: We present our experience in applying distributional semantics (neural word embeddings) to the problem of representing and clustering documents in a bilingual comparable corpus. Our data is a collection of Russian and Ukrainian academic texts, for which topics are their academic fields. In order to build language-independent semantic representations of these documents, we train neural distribution…

    Submitted 18 April, 2016; originally announced April 2016.

    Comments: To be presented at the 9th Workshop on Building and Using Comparable Corpora, co-located with LREC-2016 (https://comparable.limsi.fr/bucc2016/)