Zum Hauptinhalt springen

Showing 1–2 of 2 results for author: Korotkov, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2309.10539  [pdf, other

    cs.CL cs.AI

    OpenMSD: Towards Multilingual Scientific Documents Similarity Measurement

    Authors: Yang Gao, Ji Ma, Ivan Korotkov, Keith Hall, Dana Alon, Don Metzler

    Abstract: We develop and evaluate multilingual scientific documents similarity measurement models in this work. Such models can be used to find related works in different languages, which can help multilingual researchers find and explore papers more efficiently. We propose the first multilingual scientific documents dataset, Open-access Multilingual Scientific Documents (OpenMSD), which has 74M papers in 1… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: Scripts for constructing the OpenMSD dataset is available at: https://github.com/google-research/google-research/tree/master/OpenMSD

  2. arXiv:2004.14503  [pdf, other

    cs.IR cs.CL

    Zero-shot Neural Passage Retrieval via Domain-targeted Synthetic Question Generation

    Authors: Ji Ma, Ivan Korotkov, Yinfei Yang, Keith Hall, Ryan McDonald

    Abstract: A major obstacle to the wide-spread adoption of neural retrieval models is that they require large supervised training sets to surpass traditional term-based techniques, which are constructed from raw corpora. In this paper, we propose an approach to zero-shot learning for passage retrieval that uses synthetic question generation to close this gap. The question generation system is trained on gene… ▽ More

    Submitted 27 January, 2021; v1 submitted 29 April, 2020; originally announced April 2020.

    Comments: 14 pages, 4 figures