Zum Hauptinhalt springen

Showing 1–4 of 4 results for author: Rykov, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.05449  [pdf, other

    cs.CL cs.AI

    SmurfCat at PAN 2024 TextDetox: Alignment of Multilingual Transformers for Text Detoxification

    Authors: Elisei Rykov, Konstantin Zaytsev, Ivan Anisimov, Alexandr Voronin

    Abstract: This paper presents a solution for the Multilingual Text Detoxification task in the PAN-2024 competition of the SmurfCat team. Using data augmentation through machine translation and a special filtering procedure, we collected an additional multilingual parallel dataset for text detoxification. Using the obtained data, we fine-tuned several multilingual sequence-to-sequence models, such as mT0 and… ▽ More

    Submitted 10 July, 2024; v1 submitted 7 July, 2024; originally announced July 2024.

  2. arXiv:2406.18305  [pdf, other

    cs.CL cs.AI

    S3: A Simple Strong Sample-effective Multimodal Dialog System

    Authors: Elisei Rykov, Egor Malkershin, Alexander Panchenko

    Abstract: In this work, we present a conceptually simple yet powerful baseline for the multimodal dialog task, an S3 model, that achieves near state-of-the-art results on two compelling leaderboards: MMMU and AI Journey Contest 2023. The system is based on a pre-trained large language model, pre-trained modality encoders for image and audio, and a trainable modality projector. The proposed effective data mi… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  3. arXiv:2404.06137  [pdf, other

    cs.CL cs.AI

    SmurfCat at SemEval-2024 Task 6: Leveraging Synthetic Data for Hallucination Detection

    Authors: Elisei Rykov, Yana Shishkina, Kseniia Petrushina, Kseniia Titova, Sergey Petrakov, Alexander Panchenko

    Abstract: In this paper, we present our novel systems developed for the SemEval-2024 hallucination detection task. Our investigation spans a range of strategies to compare model predictions with reference standards, encompassing diverse baselines, the refinement of pre-trained encoders through supervised learning, and an ensemble approaches utilizing several high-performing models. Through these exploration… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: 12 pages, 10 tables, 3 figures

  4. arXiv:2209.13750  [pdf, other

    cs.CL

    RuDSI: graph-based word sense induction dataset for Russian

    Authors: Anna Aksenova, Ekaterina Gavrishina, Elisey Rykov, Andrey Kutuzov

    Abstract: We present RuDSI, a new benchmark for word sense induction (WSI) in Russian. The dataset was created using manual annotation and semi-automatic clustering of Word Usage Graphs (WUGs). Unlike prior WSI datasets for Russian, RuDSI is completely data-driven (based on texts from Russian National Corpus), with no external word senses imposed on annotators. Depending on the parameters of graph clusterin… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: TextGraphs-16 workshop at the CoLING-2022 conference