Search | arXiv e-print repository

AXOLOTL'24 Shared Task on Multilingual Explainable Semantic Change Modeling

Authors: Mariia Fedorova, Timothee Mickus, Niko Partanen, Janine Siewert, Elena Spaziani, Andrey Kutuzov

Abstract: This paper describes the organization and findings of AXOLOTL'24, the first multilingual explainable semantic change modeling shared task. We present new sense-annotated diachronic semantic change datasets for Finnish and Russian which were employed in the shared task, along with a surprise test-only German dataset borrowed from an existing source. The setup of AXOLOTL'24 is new to the semantic ch… ▽ More This paper describes the organization and findings of AXOLOTL'24, the first multilingual explainable semantic change modeling shared task. We present new sense-annotated diachronic semantic change datasets for Finnish and Russian which were employed in the shared task, along with a surprise test-only German dataset borrowed from an existing source. The setup of AXOLOTL'24 is new to the semantic change modeling field, and involves subtasks of identifying unknown (novel) senses and providing dictionary-like definitions to these senses. The methods of the winning teams are described and compared, thus paving a path towards explainability in computational approaches to historical change of meaning. △ Less

Submitted 4 July, 2024; originally announced July 2024.

Comments: Proceedings of the 5th Workshop on Computational Approaches to Historical Language Change (ACL'24)

arXiv:2406.14167 [pdf, other]

Definition generation for lexical semantic change detection

Authors: Mariia Fedorova, Andrey Kutuzov, Yves Scherrer

Abstract: We use contextualized word definitions generated by large language models as semantic representations in the task of diachronic lexical semantic change detection (LSCD). In short, generated definitions are used as `senses', and the change score of a target word is retrieved by comparing their distributions in two time periods under comparison. On the material of five datasets and three languages,… ▽ More We use contextualized word definitions generated by large language models as semantic representations in the task of diachronic lexical semantic change detection (LSCD). In short, generated definitions are used as `senses', and the change score of a target word is retrieved by comparing their distributions in two time periods under comparison. On the material of five datasets and three languages, we show that generated definitions are indeed specific and general enough to convey a signal sufficient to rank sets of words by the degree of their semantic change over time. Our approach is on par with or outperforms prior non-supervised sense-based LSCD methods. At the same time, it preserves interpretability and allows to inspect the reasons behind a specific shift in terms of discrete definitions-as-senses. This is another step in the direction of explainable semantic change modeling. △ Less

Submitted 31 July, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

Comments: Findings of ACL 2024

arXiv:2403.18024 [pdf, other]

Enriching Word Usage Graphs with Cluster Definitions

Authors: Mariia Fedorova, Andrey Kutuzov, Nikolay Arefyev, Dominik Schlechtweg

Abstract: We present a dataset of word usage graphs (WUGs), where the existing WUGs for multiple languages are enriched with cluster labels functioning as sense definitions. They are generated from scratch by fine-tuned encoder-decoder language models. The conducted human evaluation has shown that these definitions match the existing clusters in WUGs better than the definitions chosen from WordNet by two ba… ▽ More We present a dataset of word usage graphs (WUGs), where the existing WUGs for multiple languages are enriched with cluster labels functioning as sense definitions. They are generated from scratch by fine-tuned encoder-decoder language models. The conducted human evaluation has shown that these definitions match the existing clusters in WUGs better than the definitions chosen from WordNet by two baseline systems. At the same time, the method is straightforward to use and easy to extend to new languages. The resulting enriched datasets can be extremely helpful for moving on to explainable semantic change modeling. △ Less

Submitted 26 March, 2024; originally announced March 2024.

Comments: LREC-COLING 2024

arXiv:2403.14009 [pdf, other]

A New Massive Multilingual Dataset for High-Performance Language Technologies

Authors: Ona de Gibert, Graeme Nail, Nikolay Arefyev, Marta Bañón, Jelmer van der Linde, Shaoxiong Ji, Jaume Zaragoza-Bernabeu, Mikko Aulamo, Gema Ramírez-Sánchez, Andrey Kutuzov, Sampo Pyysalo, Stephan Oepen, Jörg Tiedemann

Abstract: We present the HPLT (High Performance Language Technologies) language resources, a new massive multilingual dataset including both monolingual and bilingual corpora extracted from CommonCrawl and previously unused web crawls from the Internet Archive. We describe our methods for data acquisition, management and processing of large corpora, which rely on open-source software tools and high-performa… ▽ More We present the HPLT (High Performance Language Technologies) language resources, a new massive multilingual dataset including both monolingual and bilingual corpora extracted from CommonCrawl and previously unused web crawls from the Internet Archive. We describe our methods for data acquisition, management and processing of large corpora, which rely on open-source software tools and high-performance computing. Our monolingual collection focuses on low- to medium-resourced languages and covers 75 languages and a total of ~5.6 trillion word tokens de-duplicated on the document level. Our English-centric parallel corpus is derived from its monolingual counterpart and covers 18 language pairs and more than 96 million aligned sentence pairs with roughly 1.4 billion English tokens. The HPLT language resources are one of the largest open text corpora ever released, providing a great resource for language modeling and machine translation training. We publicly release the corpora, the software, and the tools used in this work. △ Less

Submitted 20 March, 2024; originally announced March 2024.

Comments: LREC-COLING 2024

arXiv:2309.08958 [pdf, other]

Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca

Authors: Pinzhen Chen, Shaoxiong Ji, Nikolay Bogoychev, Andrey Kutuzov, Barry Haddow, Kenneth Heafield

Abstract: Foundational large language models (LLMs) can be instruction-tuned to perform open-domain question answering, facilitating applications like chat assistants. While such efforts are often carried out in a single language, we empirically analyze cost-efficient strategies for multilingual scenarios. Our study employs the Alpaca dataset and machine translations of it to form multilingual data, which i… ▽ More Foundational large language models (LLMs) can be instruction-tuned to perform open-domain question answering, facilitating applications like chat assistants. While such efforts are often carried out in a single language, we empirically analyze cost-efficient strategies for multilingual scenarios. Our study employs the Alpaca dataset and machine translations of it to form multilingual data, which is then used to tune LLMs through either low-rank adaptation or full-parameter training. Under a controlled computation budget, comparisons show that multilingual tuning is on par or better than tuning a model for each language. Furthermore, multilingual tuning with downsampled data can be as powerful and more robust. Our findings serve as a guide for expanding language support through instruction tuning. △ Less

Submitted 30 January, 2024; v1 submitted 16 September, 2023; originally announced September 2023.

Comments: Accepted to Findings of ACL: EACL 2024. Added human evaluation and shortened writing

arXiv:2305.11993 [pdf, other]

Interpretable Word Sense Representations via Definition Generation: The Case of Semantic Change Analysis

Authors: Mario Giulianelli, Iris Luden, Raquel Fernandez, Andrey Kutuzov

Abstract: We propose using automatically generated natural language definitions of contextualised word usages as interpretable word and word sense representations. Given a collection of usage examples for a target word, and the corresponding data-driven usage clusters (i.e., word senses), a definition is generated for each usage with a specialised Flan-T5 language model, and the most prototypical definition… ▽ More We propose using automatically generated natural language definitions of contextualised word usages as interpretable word and word sense representations. Given a collection of usage examples for a target word, and the corresponding data-driven usage clusters (i.e., word senses), a definition is generated for each usage with a specialised Flan-T5 language model, and the most prototypical definition in a usage cluster is chosen as the sense label. We demonstrate how the resulting sense labels can make existing approaches to semantic change analysis more interpretable, and how they can allow users -- historical linguists, lexicographers, or social scientists -- to explore and intuitively explain diachronic trajectories of word meaning. Semantic change analysis is only one of many possible applications of the `definitions as representations' paradigm. Beyond being human-readable, contextualised definitions also outperform token or usage sentence embeddings in word-in-context semantic similarity judgements, making them a new promising type of lexical representation for NLP. △ Less

Submitted 25 July, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

Comments: ACL 2023

arXiv:2305.03880 [pdf, other]

NorBench -- A Benchmark for Norwegian Language Models

Authors: David Samuel, Andrey Kutuzov, Samia Touileb, Erik Velldal, Lilja Øvrelid, Egil Rønningstad, Elina Sigdel, Anna Palatkina

Abstract: We present NorBench: a streamlined suite of NLP tasks and probes for evaluating Norwegian language models (LMs) on standardized data splits and evaluation metrics. We also introduce a range of new Norwegian language models (both encoder and encoder-decoder based). Finally, we compare and analyze their performance, along with other existing LMs, across the different benchmark tests of NorBench. We present NorBench: a streamlined suite of NLP tasks and probes for evaluating Norwegian language models (LMs) on standardized data splits and evaluation metrics. We also introduce a range of new Norwegian language models (both encoder and encoder-decoder based). Finally, we compare and analyze their performance, along with other existing LMs, across the different benchmark tests of NorBench. △ Less

Submitted 5 May, 2023; originally announced May 2023.

Comments: Accepted to NoDaLiDa 2023

arXiv:2303.09859 [pdf, other]

Trained on 100 million words and still in shape: BERT meets British National Corpus

Authors: David Samuel, Andrey Kutuzov, Lilja Øvrelid, Erik Velldal

Abstract: While modern masked language models (LMs) are trained on ever larger corpora, we here explore the effects of down-scaling training to a modestly-sized but representative, well-balanced, and publicly available English text source -- the British National Corpus. We show that pre-training on this carefully curated corpus can reach better performance than the original BERT model. We argue that this ty… ▽ More While modern masked language models (LMs) are trained on ever larger corpora, we here explore the effects of down-scaling training to a modestly-sized but representative, well-balanced, and publicly available English text source -- the British National Corpus. We show that pre-training on this carefully curated corpus can reach better performance than the original BERT model. We argue that this type of corpora has great potential as a language modeling benchmark. To showcase this potential, we present fair, reproducible and data-efficient comparative studies of LMs, in which we evaluate several training objectives and model architectures and replicate previous empirical results in a systematic way. We propose an optimized LM architecture called LTG-BERT. △ Less

Submitted 5 May, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

Comments: Accepted to EACL 2023

arXiv:2209.13750 [pdf, other]

RuDSI: graph-based word sense induction dataset for Russian

Authors: Anna Aksenova, Ekaterina Gavrishina, Elisey Rykov, Andrey Kutuzov

Abstract: We present RuDSI, a new benchmark for word sense induction (WSI) in Russian. The dataset was created using manual annotation and semi-automatic clustering of Word Usage Graphs (WUGs). Unlike prior WSI datasets for Russian, RuDSI is completely data-driven (based on texts from Russian National Corpus), with no external word senses imposed on annotators. Depending on the parameters of graph clusterin… ▽ More We present RuDSI, a new benchmark for word sense induction (WSI) in Russian. The dataset was created using manual annotation and semi-automatic clustering of Word Usage Graphs (WUGs). Unlike prior WSI datasets for Russian, RuDSI is completely data-driven (based on texts from Russian National Corpus), with no external word senses imposed on annotators. Depending on the parameters of graph clustering, different derivative datasets can be produced from raw annotation. We report the performance that several baseline WSI methods obtain on RuDSI and discuss possibilities for improving these scores. △ Less

Submitted 27 September, 2022; originally announced September 2022.

Comments: TextGraphs-16 workshop at the CoLING-2022 conference

arXiv:2209.00154 [pdf, other]

doi 10.3384/nejlt.2000-1533.2022.3478

Contextualized language models for semantic change detection: lessons learned

Authors: Andrey Kutuzov, Erik Velldal, Lilja Øvrelid

Abstract: We present a qualitative analysis of the (potentially erroneous) outputs of contextualized embedding-based methods for detecting diachronic semantic change. First, we introduce an ensemble method outperforming previously described contextualized approaches. This method is used as a basis for an in-depth analysis of the degrees of semantic change predicted for English words across 5 decades. Our fi… ▽ More We present a qualitative analysis of the (potentially erroneous) outputs of contextualized embedding-based methods for detecting diachronic semantic change. First, we introduce an ensemble method outperforming previously described contextualized approaches. This method is used as a basis for an in-depth analysis of the degrees of semantic change predicted for English words across 5 decades. Our findings show that contextualized methods can often predict high change scores for words which are not undergoing any real diachronic semantic shift in the lexicographic sense of the term (or at least the status of these shifts is questionable). Such challenging cases are discussed in detail with examples, and their linguistic categorization is proposed. Our conclusion is that pre-trained contextualized language models are prone to confound changes in lexicographic senses and changes in contextual variance, which naturally stem from their distributional nature, but is different from the types of issues observed in methods based on static embeddings. Additionally, they often merge together syntactic and semantic aspects of lexical entities. We propose a range of possible future solutions to these issues. △ Less

Submitted 31 August, 2022; originally announced September 2022.

Journal ref: Northern European Journal of Language Technology (NEJLT). ISSN 2000-1533. 8(1)

arXiv:2204.05717 [pdf, other]

Do Not Fire the Linguist: Grammatical Profiles Help Language Models Detect Semantic Change

Authors: Mario Giulianelli, Andrey Kutuzov, Lidia Pivovarova

Abstract: Morphological and syntactic changes in word usage (as captured, e.g., by grammatical profiles) have been shown to be good predictors of a word's meaning change. In this work, we explore whether large pre-trained contextualised language models, a common tool for lexical semantic change detection, are sensitive to such morphosyntactic changes. To this end, we first compare the performance of grammat… ▽ More Morphological and syntactic changes in word usage (as captured, e.g., by grammatical profiles) have been shown to be good predictors of a word's meaning change. In this work, we explore whether large pre-trained contextualised language models, a common tool for lexical semantic change detection, are sensitive to such morphosyntactic changes. To this end, we first compare the performance of grammatical profiles against that of a multilingual neural language model (XLM-R) on 10 datasets, covering 7 languages, and then combine the two approaches in ensembles to assess their complementarity. Our results show that ensembling grammatical profiles with XLM-R improves semantic change detection performance for most datasets and languages. This indicates that language models do not fully cover the fine-grained morphological and syntactic signals that are explicitly represented in grammatical profiles. An interesting exception are the test sets where the time spans under analysis are much longer than the time gap between them (for example, century-long spans with a one-year gap between them). Morphosyntactic change is slow so grammatical profiles do not detect in such cases. In contrast, language models, thanks to their access to lexical information, are able to detect fast topical changes. △ Less

Submitted 12 April, 2022; originally announced April 2022.

Comments: 3rd International Workshop on Computational Approaches to Historical Language Change 2022 (LChange'22)

arXiv:2201.05123 [pdf, other]

NorDiaChange: Diachronic Semantic Change Dataset for Norwegian

Authors: Andrey Kutuzov, Samia Touileb, Petter Mæhlum, Tita Ranveig Enstad, Alexandra Wittemann

Abstract: We describe NorDiaChange: the first diachronic semantic change dataset for Norwegian. NorDiaChange comprises two novel subsets, covering about 80 Norwegian nouns manually annotated with graded semantic change over time. Both datasets follow the same annotation procedure and can be used interchangeably as train and test splits for each other. NorDiaChange covers the time periods related to pre- and… ▽ More We describe NorDiaChange: the first diachronic semantic change dataset for Norwegian. NorDiaChange comprises two novel subsets, covering about 80 Norwegian nouns manually annotated with graded semantic change over time. Both datasets follow the same annotation procedure and can be used interchangeably as train and test splits for each other. NorDiaChange covers the time periods related to pre- and post-war events, oil and gas discovery in Norway, and technological developments. The annotation was done using the DURel framework and two large historical Norwegian corpora. NorDiaChange is published in full under a permissive licence, complete with raw annotation data and inferred diachronic word usage graphs (DWUGs). △ Less

Submitted 27 April, 2022; v1 submitted 13 January, 2022; originally announced January 2022.

Comments: LREC'2022 proceedings

arXiv:2109.10397 [pdf, other]

Grammatical Profiling for Semantic Change Detection

Authors: Mario Giulianelli, Andrey Kutuzov, Lidia Pivovarova

Abstract: Semantics, morphology and syntax are strongly interdependent. However, the majority of computational methods for semantic change detection use distributional word representations which encode mostly semantics. We investigate an alternative method, grammatical profiling, based entirely on changes in the morphosyntactic behaviour of words. We demonstrate that it can be used for semantic change detec… ▽ More Semantics, morphology and syntax are strongly interdependent. However, the majority of computational methods for semantic change detection use distributional word representations which encode mostly semantics. We investigate an alternative method, grammatical profiling, based entirely on changes in the morphosyntactic behaviour of words. We demonstrate that it can be used for semantic change detection and even outperforms some distributional semantic methods. We present an in-depth qualitative and quantitative analysis of the predictions made by our grammatical profiling system, showing that they are plausible and interpretable. △ Less

Submitted 21 September, 2021; originally announced September 2021.

Comments: CoNLL 2021

arXiv:2106.08294 [pdf, other]

Three-part diachronic semantic change dataset for Russian

Authors: Andrey Kutuzov, Lidia Pivovarova

Abstract: We present a manually annotated lexical semantic change dataset for Russian: RuShiftEval. Its novelty is ensured by a single set of target words annotated for their diachronic semantic shifts across three time periods, while the previous work either used only two time periods, or different sets of target words. The paper describes the composition and annotation procedure for the dataset. In additi… ▽ More We present a manually annotated lexical semantic change dataset for Russian: RuShiftEval. Its novelty is ensured by a single set of target words annotated for their diachronic semantic shifts across three time periods, while the previous work either used only two time periods, or different sets of target words. The paper describes the composition and annotation procedure for the dataset. In addition, it is shown how the ternary nature of RuShiftEval allows to trace specific diachronic trajectories: `changed at a particular time period and stable afterwards' or `was changing throughout all time periods'. Based on the analysis of the submissions to the recent shared task on semantic change detection for Russian, we argue that correctly identifying such trajectories can be an interesting sub-task itself. △ Less

Submitted 15 June, 2021; originally announced June 2021.

Comments: Accepted to the 2nd International Workshop on Computational Approaches to Historical Language Change 2021 (LChange'21)

arXiv:2105.01192 [pdf, other]

Unreasonable Effectiveness of Rule-Based Heuristics in Solving Russian SuperGLUE Tasks

Authors: Tatyana Iazykova, Denis Kapelyushnik, Olga Bystrova, Andrey Kutuzov

Abstract: Leader-boards like SuperGLUE are seen as important incentives for active development of NLP, since they provide standard benchmarks for fair comparison of modern language models. They have driven the world's best engineering teams as well as their resources to collaborate and solve a set of tasks for general language understanding. Their performance scores are often claimed to be close to or even… ▽ More Leader-boards like SuperGLUE are seen as important incentives for active development of NLP, since they provide standard benchmarks for fair comparison of modern language models. They have driven the world's best engineering teams as well as their resources to collaborate and solve a set of tasks for general language understanding. Their performance scores are often claimed to be close to or even higher than the human performance. These results encouraged more thorough analysis of whether the benchmark datasets featured any statistical cues that machine learning based language models can exploit. For English datasets, it was shown that they often contain annotation artifacts. This allows solving certain tasks with very simple rules and achieving competitive rankings. In this paper, a similar analysis was done for the Russian SuperGLUE (RSG), a recently published benchmark set and leader-board for Russian natural language understanding. We show that its test datasets are vulnerable to shallow heuristics. Often approaches based on simple rules outperform or come close to the results of the notorious pre-trained language models like GPT-3 or BERT. It is likely (as the simplest explanation) that a significant part of the SOTA models performance in the RSG leader-board is due to exploiting these shallow heuristics and that has nothing in common with real language understanding. We provide a set of recommendations on how to improve these datasets, making the RSG leader-board even more representative of the real progress in Russian NLU. △ Less

Submitted 3 May, 2021; originally announced May 2021.

Comments: Accepted to Dialogue'2021

arXiv:2104.06546 [pdf, other]

Large-Scale Contextualised Language Modelling for Norwegian

Authors: Andrey Kutuzov, Jeremy Barnes, Erik Velldal, Lilja Øvrelid, Stephan Oepen

Abstract: We present the ongoing NorLM initiative to support the creation and use of very large contextualised language models for Norwegian (and in principle other Nordic languages), including a ready-to-use software environment, as well as an experience report for data preparation and training. This paper introduces the first large-scale monolingual language models for Norwegian, based on both the ELMo an… ▽ More We present the ongoing NorLM initiative to support the creation and use of very large contextualised language models for Norwegian (and in principle other Nordic languages), including a ready-to-use software environment, as well as an experience report for data preparation and training. This paper introduces the first large-scale monolingual language models for Norwegian, based on both the ELMo and BERT frameworks. In addition to detailing the training process, we present contrastive benchmark results on a suite of NLP tasks for Norwegian. For additional background and access to the data, models, and software, please see http://norlm.nlpl.eu △ Less

Submitted 13 April, 2021; originally announced April 2021.

Comments: Accepted to NoDaLiDa'2021

arXiv:2103.16414 [pdf, other]

Representing ELMo embeddings as two-dimensional text online

Authors: Andrey Kutuzov, Elizaveta Kuzmenko

Abstract: We describe a new addition to the WebVectors toolkit which is used to serve word embedding models over the Web. The new ELMoViz module adds support for contextualized embedding architectures, in particular for ELMo models. The provided visualizations follow the metaphor of `two-dimensional text' by showing lexical substitutes: words which are most semantically similar in context to the words of th… ▽ More We describe a new addition to the WebVectors toolkit which is used to serve word embedding models over the Web. The new ELMoViz module adds support for contextualized embedding architectures, in particular for ELMo models. The provided visualizations follow the metaphor of `two-dimensional text' by showing lexical substitutes: words which are most semantically similar in context to the words of the input sentence. The system allows the user to change the ELMo layers from which token embeddings are inferred. It also conveys corpus information about the query words and their lexical substitutes (namely their frequency tiers and parts of speech). The module is well integrated into the rest of the WebVectors toolkit, providing lexical hyperlinks to word representations in static embedding models. Two web services have already implemented the new functionality with pre-trained ELMo models for Russian, Norwegian and English. △ Less

Submitted 30 March, 2021; originally announced March 2021.

Comments: EACL'2021 demo paper

arXiv:2102.00442 [pdf, other]

Conceptual design of the Spin Physics Detector

Authors: V. M. Abazov, V. Abramov, L. G. Afanasyev, R. R. Akhunzyanov, A. V. Akindinov, N. Akopov, I. G. Alekseev, A. M. Aleshko, V. Yu. Alexakhin, G. D. Alexeev, M. Alexeev, A. Amoroso, I. V. Anikin, V. F. Andreev, V. A. Anosov, A. B. Arbuzov, N. I. Azorskiy, A. A. Baldin, V. V. Balandina, E. G. Baldina, M. Yu. Barabanov, S. G. Barsov, V. A. Baskov, A. N. Beloborodov, I. N. Belov , et al. (270 additional authors not shown)

Abstract: The Spin Physics Detector, a universal facility for studying the nucleon spin structure and other spin-related phenomena with polarized proton and deuteron beams, is proposed to be placed in one of the two interaction points of the NICA collider that is under construction at the Joint Institute for Nuclear Research (Dubna, Russia). At the heart of the project there is huge experience with polarize… ▽ More The Spin Physics Detector, a universal facility for studying the nucleon spin structure and other spin-related phenomena with polarized proton and deuteron beams, is proposed to be placed in one of the two interaction points of the NICA collider that is under construction at the Joint Institute for Nuclear Research (Dubna, Russia). At the heart of the project there is huge experience with polarized beams at JINR. The main objective of the proposed experiment is the comprehensive study of the unpolarized and polarized gluon content of the nucleon. Spin measurements at the Spin Physics Detector at the NICA collider have bright perspectives to make a unique contribution and challenge our understanding of the spin structure of the nucleon. In this document the Conceptual Design of the Spin Physics Detector is presented. △ Less

Submitted 2 February, 2022; v1 submitted 31 January, 2021; originally announced February 2021.

arXiv:2010.06436 [pdf, other]

RuSemShift: a dataset of historical lexical semantic change in Russian

Authors: Julia Rodina, Andrey Kutuzov

Abstract: We present RuSemShift, a large-scale manually annotated test set for the task of semantic change modeling in Russian for two long-term time period pairs: from the pre-Soviet through the Soviet times and from the Soviet through the post-Soviet times. Target words were annotated by multiple crowd-source workers. The annotation process was organized following the DURel framework and was based on sent… ▽ More We present RuSemShift, a large-scale manually annotated test set for the task of semantic change modeling in Russian for two long-term time period pairs: from the pre-Soviet through the Soviet times and from the Soviet through the post-Soviet times. Target words were annotated by multiple crowd-source workers. The annotation process was organized following the DURel framework and was based on sentence contexts extracted from the Russian National Corpus. Additionally, we report the performance of several distributional approaches on RuSemShift, achieving promising results, which at the same time leave room for other researchers to improve. △ Less

Submitted 13 October, 2020; originally announced October 2020.

Comments: Accepted to COLING 2020

arXiv:2010.03481 [pdf, ps, other]

ELMo and BERT in semantic change detection for Russian

Authors: Julia Rodina, Yuliya Trofimova, Andrey Kutuzov, Ekaterina Artemova

Abstract: We study the effectiveness of contextualized embeddings for the task of diachronic semantic change detection for Russian language data. Evaluation test sets consist of Russian nouns and adjectives annotated based on their occurrences in texts created in pre-Soviet, Soviet and post-Soviet time periods. ELMo and BERT architectures are compared on the task of ranking Russian words according to the de… ▽ More We study the effectiveness of contextualized embeddings for the task of diachronic semantic change detection for Russian language data. Evaluation test sets consist of Russian nouns and adjectives annotated based on their occurrences in texts created in pre-Soviet, Soviet and post-Soviet time periods. ELMo and BERT architectures are compared on the task of ranking Russian words according to the degree of their semantic change over time. We use several methods for aggregation of contextualized embeddings from these architectures and evaluate their performance. Finally, we compare unsupervised and supervised techniques in this task. △ Less

Submitted 7 October, 2020; originally announced October 2020.

Comments: The 9th International Conference on Analysis of Images, Social Networks and Texts (AIST 2020)

arXiv:2005.00050 [pdf, other]

UiO-UvA at SemEval-2020 Task 1: Contextualised Embeddings for Lexical Semantic Change Detection

Authors: Andrey Kutuzov, Mario Giulianelli

Abstract: We apply contextualised word embeddings to lexical semantic change detection in the SemEval-2020 Shared Task 1. This paper focuses on Subtask 2, ranking words by the degree of their semantic drift over time. We analyse the performance of two contextualising architectures (BERT and ELMo) and three change detection algorithms. We find that the most effective algorithms rely on the cosine similarity… ▽ More We apply contextualised word embeddings to lexical semantic change detection in the SemEval-2020 Shared Task 1. This paper focuses on Subtask 2, ranking words by the degree of their semantic drift over time. We analyse the performance of two contextualising architectures (BERT and ELMo) and three change detection algorithms. We find that the most effective algorithms rely on the cosine similarity between averaged token embeddings and the pairwise distances between token embeddings. They outperform strong baselines by a large margin (in the post-evaluation phase, we have the best Subtask 2 submission for SemEval-2020 Task 1), but interestingly, the choice of a particular algorithm depends on the distribution of gold scores in the test set. △ Less

Submitted 18 July, 2020; v1 submitted 30 April, 2020; originally announced May 2020.

Comments: To appear in Proceedings of the 14th International Workshop on Semantic Evaluation (SemEval-2020)

arXiv:2003.06651 [pdf, other]

Word Sense Disambiguation for 158 Languages using Word Embeddings Only

Authors: Varvara Logacheva, Denis Teslenko, Artem Shelmanov, Steffen Remus, Dmitry Ustalov, Andrey Kutuzov, Ekaterina Artemova, Chris Biemann, Simone Paolo Ponzetto, Alexander Panchenko

Abstract: Disambiguation of word senses in context is easy for humans, but is a major challenge for automatic approaches. Sophisticated supervised and knowledge-based models were developed to solve this task. However, (i) the inherent Zipfian distribution of supervised training instances for a given word and/or (ii) the quality of linguistic knowledge representations motivate the development of completely u… ▽ More Disambiguation of word senses in context is easy for humans, but is a major challenge for automatic approaches. Sophisticated supervised and knowledge-based models were developed to solve this task. However, (i) the inherent Zipfian distribution of supervised training instances for a given word and/or (ii) the quality of linguistic knowledge representations motivate the development of completely unsupervised and knowledge-free approaches to word sense disambiguation (WSD). They are particularly useful for under-resourced languages which do not have any resources for building either supervised and/or knowledge-based models. In this paper, we present a method that takes as input a standard pre-trained word embedding model and induces a fully-fledged word sense inventory, which can be used for disambiguation in context. We use this method to induce a collection of sense inventories for 158 languages on the basis of the original pre-trained fastText word embeddings by Grave et al. (2018), enabling WSD in these languages. Models and system are available online. △ Less

Submitted 14 March, 2020; originally announced March 2020.

Comments: 10 pages, 5 figures, 4 tables, accepted at LREC 2020

arXiv:1909.03135 [pdf, other]

To lemmatize or not to lemmatize: how word normalisation affects ELMo performance in word sense disambiguation

Authors: Andrey Kutuzov, Elizaveta Kuzmenko

Abstract: We critically evaluate the widespread assumption that deep learning NLP models do not require lemmatized input. To test this, we trained versions of contextualised word embedding ELMo models on raw tokenized corpora and on the corpora with word tokens replaced by their lemmas. Then, these models were evaluated on the word sense disambiguation task. This was done for the English and Russian languag… ▽ More We critically evaluate the widespread assumption that deep learning NLP models do not require lemmatized input. To test this, we trained versions of contextualised word embedding ELMo models on raw tokenized corpora and on the corpora with word tokens replaced by their lemmas. Then, these models were evaluated on the word sense disambiguation task. This was done for the English and Russian languages. The experiments showed that while lemmatization is indeed not necessary for English, the situation is different for Russian. It seems that for rich-morphology languages, using lemmatized training and testing data yields small but consistent improvements: at least for word sense disambiguation. This means that the decisions about text pre-processing before training ELMo should consider the linguistic nature of the language in question. △ Less

Submitted 6 September, 2019; originally announced September 2019.

Comments: Accepted to NODALIDA2019 Deep Learning for Natural Language Processing workshop

arXiv:1907.12674 [pdf, other]

One-to-X analogical reasoning on word embeddings: a case for diachronic armed conflict prediction from news texts

Authors: Andrey Kutuzov, Erik Velldal, Lilja Øvrelid

Abstract: We extend the well-known word analogy task to a one-to-X formulation, including one-to-none cases, when no correct answer exists. The task is cast as a relation discovery problem and applied to historical armed conflicts datasets, attempting to predict new relations of type `location:armed-group' based on data about past events. As the source of semantic information, we use diachronic word embeddi… ▽ More We extend the well-known word analogy task to a one-to-X formulation, including one-to-none cases, when no correct answer exists. The task is cast as a relation discovery problem and applied to historical armed conflicts datasets, attempting to predict new relations of type `location:armed-group' based on data about past events. As the source of semantic information, we use diachronic word embedding models trained on English news texts. A simple technique to improve diachronic performance in such task is demonstrated, using a threshold based on a function of cosine distance to decrease the number of false positives; this approach is shown to be beneficial on two different corpora. Finally, we publish a ready-to-use test set for one-to-X analogy evaluation on historical armed conflicts data. △ Less

Submitted 29 July, 2019; originally announced July 2019.

Comments: 1st International Workshop on Computational Approaches to Historical Language Change (ACL 2019)

arXiv:1906.07040 [pdf, other]

Making Fast Graph-based Algorithms with Graph Metric Embeddings

Authors: Andrey Kutuzov, Mohammad Dorgham, Oleksiy Oliynyk, Chris Biemann, Alexander Panchenko

Abstract: The computation of distance measures between nodes in graphs is inefficient and does not scale to large graphs. We explore dense vector representations as an effective way to approximate the same information: we introduce a simple yet efficient and effective approach for learning graph embeddings. Instead of directly operating on the graph structure, our method takes structural measures of pairwis… ▽ More The computation of distance measures between nodes in graphs is inefficient and does not scale to large graphs. We explore dense vector representations as an effective way to approximate the same information: we introduce a simple yet efficient and effective approach for learning graph embeddings. Instead of directly operating on the graph structure, our method takes structural measures of pairwise node similarities into account and learns dense node representations reflecting user-defined graph distance measures, such as e.g.the shortest path distance or distance measures that take information beyond the graph structure into account. We demonstrate a speed-up of several orders of magnitude when predicting word similarity by vector operations on our embeddings as opposed to directly computing the respective path-based measures, while outperforming various other graph embeddings on semantic similarity and word sense disambiguation tasks and show evaluations on the WordNet graph and two knowledge base graphs. △ Less

Submitted 17 June, 2019; originally announced June 2019.

Comments: In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics (ACL'2019). Florence, Italy

arXiv:1905.06837 [pdf]

Tracing cultural diachronic semantic shifts in Russian using word embeddings: test sets and baselines

Authors: Vadim Fomin, Daria Bakshandaeva, Julia Rodina, Andrey Kutuzov

Abstract: The paper introduces manually annotated test sets for the task of tracing diachronic (temporal) semantic shifts in Russian. The two test sets are complementary in that the first one covers comparatively strong semantic changes occurring to nouns and adjectives from pre-Soviet to Soviet times, while the second one covers comparatively subtle socially and culturally determined shifts occurring in ye… ▽ More The paper introduces manually annotated test sets for the task of tracing diachronic (temporal) semantic shifts in Russian. The two test sets are complementary in that the first one covers comparatively strong semantic changes occurring to nouns and adjectives from pre-Soviet to Soviet times, while the second one covers comparatively subtle socially and culturally determined shifts occurring in years from 2000 to 2014. Additionally, the second test set offers more granular classification of shifts degree, but is limited to only adjectives. The introduction of the test sets allowed us to evaluate several well-established algorithms of semantic shifts detection (posing this as a classification problem), most of which have never been tested on Russian material. All of these algorithms use distributional word embedding models trained on the corresponding in-domain corpora. The resulting scores provide solid comparison baselines for future studies tackling similar tasks. We publish the datasets, code and the trained models in order to facilitate further research in automatically detecting temporal semantic shifts for Russian words, with time periods of different granularities. △ Less

Submitted 29 July, 2019; v1 submitted 16 May, 2019; originally announced May 2019.

Comments: Dialogue 2019

arXiv:1808.05611 [pdf, other]

Learning Graph Embeddings from WordNet-based Similarity Measures

Authors: Andrey Kutuzov, Mohammad Dorgham, Oleksiy Oliynyk, Chris Biemann, Alexander Panchenko

Abstract: We present path2vec, a new approach for learning graph embeddings that relies on structural measures of pairwise node similarities. The model learns representations for nodes in a dense space that approximate a given user-defined graph distance measure, such as e.g. the shortest path distance or distance measures that take information beyond the graph structure into account. Evaluation of the prop… ▽ More We present path2vec, a new approach for learning graph embeddings that relies on structural measures of pairwise node similarities. The model learns representations for nodes in a dense space that approximate a given user-defined graph distance measure, such as e.g. the shortest path distance or distance measures that take information beyond the graph structure into account. Evaluation of the proposed model on semantic similarity and word sense disambiguation tasks, using various WordNet-based similarity measures, show that our approach yields competitive results, outperforming strong graph embedding baselines. The model is computationally efficient, being orders of magnitude faster than the direct computation of graph-based distances. △ Less

Submitted 12 April, 2019; v1 submitted 16 August, 2018; originally announced August 2018.

Comments: Accepted to StarSem 2019

arXiv:1806.03537 [pdf, ps, other]

Diachronic word embeddings and semantic shifts: a survey

Authors: Andrey Kutuzov, Lilja Øvrelid, Terrence Szymanski, Erik Velldal

Abstract: Recent years have witnessed a surge of publications aimed at tracing temporal changes in lexical semantics using distributional methods, particularly prediction-based word embedding models. However, this vein of research lacks the cohesion, common terminology and shared practices of more established areas of natural language processing. In this paper, we survey the current state of academic resear… ▽ More Recent years have witnessed a surge of publications aimed at tracing temporal changes in lexical semantics using distributional methods, particularly prediction-based word embedding models. However, this vein of research lacks the cohesion, common terminology and shared practices of more established areas of natural language processing. In this paper, we survey the current state of academic research related to diachronic word embeddings and semantic shifts detection. We start with discussing the notion of semantic shifts, and then continue with an overview of the existing methods for tracing such time-related shifts with word embedding models. We propose several axes along which these methods can be compared, and outline the main challenges before this emerging subfield of NLP, as well as prospects and possible applications. △ Less

Submitted 13 June, 2018; v1 submitted 9 June, 2018; originally announced June 2018.

Comments: Proceedings of COLING 2018

arXiv:1805.04715 [pdf, other]

doi 10.18653/v1/P18-2010

Unsupervised Semantic Frame Induction using Triclustering

Authors: Dmitry Ustalov, Alexander Panchenko, Andrei Kutuzov, Chris Biemann, Simone Paolo Ponzetto

Abstract: We use dependency triples automatically extracted from a Web-scale corpus to perform unsupervised semantic frame induction. We cast the frame induction problem as a triclustering problem that is a generalization of clustering for triadic data. Our replicable benchmarks demonstrate that the proposed graph-based approach, Triframes, shows state-of-the art results on this task on a FrameNet-derived d… ▽ More We use dependency triples automatically extracted from a Web-scale corpus to perform unsupervised semantic frame induction. We cast the frame induction problem as a triclustering problem that is a generalization of clustering for triadic data. Our replicable benchmarks demonstrate that the proposed graph-based approach, Triframes, shows state-of-the art results on this task on a FrameNet-derived dataset and performing on par with competitive methods on a verb class clustering task. △ Less

Submitted 18 May, 2018; v1 submitted 12 May, 2018; originally announced May 2018.

Comments: 8 pages, 1 figure, 4 tables, accepted at ACL 2018

arXiv:1805.02258 [pdf]

Russian word sense induction by clustering averaged word embeddings

Authors: Andrey Kutuzov

Abstract: The paper reports our participation in the shared task on word sense induction and disambiguation for the Russian language (RUSSE-2018). Our team was ranked 2nd for the wiki-wiki dataset (containing mostly homonyms) and 5th for the bts-rnc and active-dict datasets (containing mostly polysemous words) among all 19 participants. The method we employed was extremely naive. It implied representing c… ▽ More The paper reports our participation in the shared task on word sense induction and disambiguation for the Russian language (RUSSE-2018). Our team was ranked 2nd for the wiki-wiki dataset (containing mostly homonyms) and 5th for the bts-rnc and active-dict datasets (containing mostly polysemous words) among all 19 participants. The method we employed was extremely naive. It implied representing contexts of ambiguous words as averaged word embedding vectors, using off-the-shelf pre-trained distributional models. Then, these vector representations were clustered with mainstream clustering techniques, thus producing the groups corresponding to the ambiguous word senses. As a side result, we show that word embedding models trained on small but balanced corpora can be superior to those trained on large but noisy data - not only in intrinsic evaluation, but also in downstream tasks like word sense induction. △ Less

Submitted 6 May, 2018; originally announced May 2018.

Comments: Proceedings of the 24rd International Conference on Computational Linguistics and Intellectual Technologies (Dialogue-2018)

arXiv:1801.06407 [pdf, other]

doi 10.1007/978-3-319-73013-4_5

Size vs. Structure in Training Corpora for Word Embedding Models: Araneum Russicum Maximum and Russian National Corpus

Authors: Andrey Kutuzov, Maria Kunilovskaya

Abstract: In this paper, we present a distributional word embedding model trained on one of the largest available Russian corpora: Araneum Russicum Maximum (over 10 billion words crawled from the web). We compare this model to the model trained on the Russian National Corpus (RNC). The two corpora are much different in their size and compilation procedures. We test these differences by evaluating the traine… ▽ More In this paper, we present a distributional word embedding model trained on one of the largest available Russian corpora: Araneum Russicum Maximum (over 10 billion words crawled from the web). We compare this model to the model trained on the Russian National Corpus (RNC). The two corpora are much different in their size and compilation procedures. We test these differences by evaluating the trained models against the Russian part of the Multilingual SimLex999 semantic similarity dataset. We detect and describe numerous issues in this dataset and publish a new corrected version. Aside from the already known fact that the RNC is generally a better training corpus than web corpora, we enumerate and explain fine differences in how the models process semantic similarity task, what parts of the evaluation set are difficult for particular models and why. Additionally, the learning curves for both models are described, showing that the RNC is generally more robust as training material for this task. △ Less

Submitted 19 January, 2018; originally announced January 2018.

Journal ref: In: van der Aalst W. et al. (eds) Analysis of Images, Social Networks and Texts. AIST 2017. Lecture Notes in Computer Science, vol 10716. Springer, Cham

arXiv:1707.08660 [pdf, ps, other]

Temporal dynamics of semantic relations in word embeddings: an application to predicting armed conflict participants

Authors: Andrey Kutuzov, Erik Velldal, Lilja Øvrelid

Abstract: This paper deals with using word embedding models to trace the temporal dynamics of semantic relations between pairs of words. The set-up is similar to the well-known analogies task, but expanded with a time dimension. To this end, we apply incremental updating of the models with new training texts, including incremental vocabulary expansion, coupled with learned transformation matrices that let u… ▽ More This paper deals with using word embedding models to trace the temporal dynamics of semantic relations between pairs of words. The set-up is similar to the well-known analogies task, but expanded with a time dimension. To this end, we apply incremental updating of the models with new training texts, including incremental vocabulary expansion, coupled with learned transformation matrices that let us map between members of the relation. The proposed approach is evaluated on the task of predicting insurgent armed groups based on geographical locations. The gold standard data for the time span 1994--2010 is extracted from the UCDP Armed Conflicts dataset. The results show that the method is feasible and outperforms the baselines, but also that important work still remains to be done. △ Less

Submitted 26 July, 2017; originally announced July 2017.

Comments: to appear in EMNLP 2017 proceedings

arXiv:1704.05781 [pdf, other]

Redefining Context Windows for Word Embedding Models: An Experimental Study

Authors: Pierre Lison, Andrey Kutuzov

Abstract: Distributional semantic models learn vector representations of words through the contexts they occur in. Although the choice of context (which often takes the form of a sliding window) has a direct influence on the resulting embeddings, the exact role of this model component is still not fully understood. This paper presents a systematic analysis of context windows based on a set of four distinct… ▽ More Distributional semantic models learn vector representations of words through the contexts they occur in. Although the choice of context (which often takes the form of a sliding window) has a direct influence on the resulting embeddings, the exact role of this model component is still not fully understood. This paper presents a systematic analysis of context windows based on a set of four distinct hyper-parameters. We train continuous Skip-Gram models on two English-language corpora for various combinations of these hyper-parameters, and evaluate them on both lexical similarity and analogy tasks. Notable experimental results are the positive impact of cross-sentential contexts and the surprisingly good performance of right-context windows. △ Less

Submitted 19 April, 2017; originally announced April 2017.

arXiv:1608.03803 [pdf, other]

Redefining part-of-speech classes with distributional semantic models

Authors: Andrey Kutuzov, Erik Velldal, Lilja Øvrelid

Abstract: This paper studies how word embeddings trained on the British National Corpus interact with part of speech boundaries. Our work targets the Universal PoS tag set, which is currently actively being used for annotation of a range of languages. We experiment with training classifiers for predicting PoS tags for words based on their embeddings. The results show that the information about PoS affiliati… ▽ More This paper studies how word embeddings trained on the British National Corpus interact with part of speech boundaries. Our work targets the Universal PoS tag set, which is currently actively being used for annotation of a range of languages. We experiment with training classifiers for predicting PoS tags for words based on their embeddings. The results show that the information about PoS affiliation contained in the distributional vectors allows us to discover groups of words with distributional patterns that differ from other words of the same part of speech. This data often reveals hidden inconsistencies of the annotation process or guidelines. At the same time, it supports the notion of `soft' or `graded' part of speech affiliations. Finally, we show that information about PoS is distributed among dozens of vector components, not limited to only one or two features. △ Less

Submitted 12 August, 2016; originally announced August 2016.

Journal ref: Proceedings of The 20th SIGNLL Conference on Computational Natural Language Learning, 2016, pp. 115-125

arXiv:1604.05372 [pdf, other]

Clustering Comparable Corpora of Russian and Ukrainian Academic Texts: Word Embeddings and Semantic Fingerprints

Authors: Andrey Kutuzov, Mikhail Kopotev, Tatyana Sviridenko, Lyubov Ivanova

Abstract: We present our experience in applying distributional semantics (neural word embeddings) to the problem of representing and clustering documents in a bilingual comparable corpus. Our data is a collection of Russian and Ukrainian academic texts, for which topics are their academic fields. In order to build language-independent semantic representations of these documents, we train neural distribution… ▽ More We present our experience in applying distributional semantics (neural word embeddings) to the problem of representing and clustering documents in a bilingual comparable corpus. Our data is a collection of Russian and Ukrainian academic texts, for which topics are their academic fields. In order to build language-independent semantic representations of these documents, we train neural distributional models on monolingual corpora and learn the optimal linear transformation of vectors from one language to another. The resulting vectors are then used to produce `semantic fingerprints' of documents, serving as input to a clustering algorithm. The presented method is compared to several baselines including `orthographic translation' with Levenshtein edit distance and outperforms them by a large margin. We also show that language-independent `semantic fingerprints' are superior to multi-lingual clustering algorithms proposed in the previous work, at the same time requiring less linguistic resources. △ Less

Submitted 18 April, 2016; originally announced April 2016.

Comments: To be presented at 9th Workshop on Building and Using Comparable Corpora, co-located with LREC-2016 (https://comparable.limsi.fr/bucc2016/)

arXiv:1504.08183 [pdf]

Texts in, meaning out: neural language models in semantic similarity task for Russian

Authors: Andrey Kutuzov, Igor Andreev

Abstract: Distributed vector representations for natural language vocabulary get a lot of attention in contemporary computational linguistics. This paper summarizes the experience of applying neural network language models to the task of calculating semantic similarity for Russian. The experiments were performed in the course of Russian Semantic Similarity Evaluation track, where our models took from the 2n… ▽ More Distributed vector representations for natural language vocabulary get a lot of attention in contemporary computational linguistics. This paper summarizes the experience of applying neural network language models to the task of calculating semantic similarity for Russian. The experiments were performed in the course of Russian Semantic Similarity Evaluation track, where our models took from the 2nd to the 5th position, depending on the task. We introduce the tools and corpora used, comment on the nature of the shared task and describe the achieved results. It was found out that Continuous Skip-gram and Continuous Bag-of-words models, previously successfully applied to English material, can be used for semantic modeling of Russian as well. Moreover, we show that texts in Russian National Corpus (RNC) provide an excellent training material for such models, outperforming other, much larger corpora. It is especially true for semantic relatedness tasks (although stacking models trained on larger corpora on top of RNC models improves performance even more). High-quality semantic vectors learned in such a way can be used in a variety of linguistic tasks and promise an exciting field for further study. △ Less

Submitted 30 April, 2015; originally announced April 2015.

Comments: Proceedings of the Dialog 2015 Conference. Moscow, Russia

arXiv:1409.1612 [pdf, other]

Semantic clustering of Russian web search results: possibilities and problems

Authors: Andrey Kutuzov

Abstract: The paper deals with word sense induction from lexical co-occurrence graphs. We construct such graphs on large Russian corpora and then apply this data to cluster Mail.ru Search results according to meanings of the query. We compare different methods of performing such clustering and different source corpora. Models of applying distributional semantics to big linguistic data are described. The paper deals with word sense induction from lexical co-occurrence graphs. We construct such graphs on large Russian corpora and then apply this data to cluster Mail.ru Search results according to meanings of the query. We compare different methods of performing such clustering and different source corpora. Models of applying distributional semantics to big linguistic data are described. △ Less

Submitted 26 October, 2014; v1 submitted 4 September, 2014; originally announced September 2014.

Comments: Presented at Russian Summer School in Information Retrieval (RuSSIR 2014). To be published in Springer Communications in Computer and Information Science series

arXiv:1112.0793 [pdf]

doi 10.1088/1742-6596/324/1/012039

Crystal electric field parameters for Yb3+ ion in YbRh2Si2

Authors: A. S. Kutuzov, A. M. Skvortsova

Abstract: The tetragonal crystal electric field parameters for Yb3+ ion in YbRh2Si2 are determined from the analysis of the literature data on angle-resolved photoemission, inelastic neutron scattering and electron paramagnetic resonance. The tetragonal crystal electric field parameters for Yb3+ ion in YbRh2Si2 are determined from the analysis of the literature data on angle-resolved photoemission, inelastic neutron scattering and electron paramagnetic resonance. △ Less

Submitted 4 December, 2011; originally announced December 2011.

Comments: 8 pages, 3 figures, 4 tables

Journal ref: J. Phys. Conf. Ser. 324 (2011) 012039

arXiv:1112.0785 [pdf]

doi 10.1088/1742-6596/324/1/012017

Spin relaxation in Kondo lattices

Authors: S. I. Belov, A. S. Kutuzov, B. I. Kochelaev

Abstract: A model of spin relaxation in Kondo lattices is proposed to explain the presence of an electron spin resonance (ESR) signal in the heavy fermion compounds YbRh2Si2 and YbIr2Si2. Coupled equations for dynamical susceptibilities of Kondo ions and conduction electrons are derived by means of the functional derivative method. The perturbational scaling approach reveals the collective spin motion of Yb… ▽ More A model of spin relaxation in Kondo lattices is proposed to explain the presence of an electron spin resonance (ESR) signal in the heavy fermion compounds YbRh2Si2 and YbIr2Si2. Coupled equations for dynamical susceptibilities of Kondo ions and conduction electrons are derived by means of the functional derivative method. The perturbational scaling approach reveals the collective spin motion of Yb ions with conduction electrons in the bottleneck regime. A common energy scale due to the Kondo effect regulates the temperature dependence of the different kinetic coefficients and results in a mutual cancelation of all divergent parts in a collective spin mode. The angular dependence of the ESR linewidth is shown to be in a qualitative agreement with experimental data on YbRh2Si2 and YbIr2Si2. Linewidth contributions other than the Kondo interaction are also discussed. △ Less

Submitted 3 April, 2012; v1 submitted 4 December, 2011; originally announced December 2011.

Comments: In this version there were made corrections in Eqs. (22), (25), (28), (31), (33), (42)

Journal ref: J. Phys. Conf. Ser. 324 (2011) 012017

arXiv:1003.0337 [pdf]

Change of word types to word tokens ratio in the course of translation (based on Russian translations of K. Vonnegut novels)

Authors: Andrey Kutuzov

Abstract: The article provides lexical statistical analysis of K. Vonnegut's two novels and their Russian translations. It is found out that there happen some changes between the speed of word types and word tokens ratio change in the source and target texts. The author hypothesizes that these changes are typical for English-Russian translations, and moreover, they represent an example of Baker's translat… ▽ More The article provides lexical statistical analysis of K. Vonnegut's two novels and their Russian translations. It is found out that there happen some changes between the speed of word types and word tokens ratio change in the source and target texts. The author hypothesizes that these changes are typical for English-Russian translations, and moreover, they represent an example of Baker's translation feature of levelling out. △ Less

Submitted 1 March, 2010; originally announced March 2010.

Comments: 11 pages, 5 figures, to be reported at International Computational Linguistic Conference "Dialog-21"-2010 (http://dialog-21.ru)

arXiv:0908.3557 [pdf, other]

doi 10.1002/pssb.200983058

Low temperature properties of the Electron Spin Resonance in YbRh2Si2

Authors: J. Sichelschmidt, T. Kambe, I. Fazlishanov, D. Zakharov, H. -A. Krug von Nidda, J. Wykhoff, A. Skvortsova, S. Belov, A. Kutuzov, B. I. Kochelaev, V. Pashchenko, M. Lang, C. Krellner, C. Geibel, F. Steglich

Abstract: We present the field and temperature behavior of the narrow Electron Spin Resonance (ESR) response in YbRh2Si2 well below the single ion Kondo temperature. The ESR g factor reflects a Kondo-like field and temperature evolution of the Yb3+ magnetism. Measurements towards low temperatures (>0.5K) have shown distinct crossover anomalies of the ESR parameters upon approaching the regime of a well de… ▽ More We present the field and temperature behavior of the narrow Electron Spin Resonance (ESR) response in YbRh2Si2 well below the single ion Kondo temperature. The ESR g factor reflects a Kondo-like field and temperature evolution of the Yb3+ magnetism. Measurements towards low temperatures (>0.5K) have shown distinct crossover anomalies of the ESR parameters upon approaching the regime of a well defined heavy Fermi liquid. Comparison with the field dependence of specific heat and electrical resistivity reveal that the ESR parameters can be related to quasiparticle mass and cross section and, hence, contain inherent heavy electron properties. △ Less

Submitted 25 August, 2009; originally announced August 2009.

Comments: 4 pages, 6 figures; Manuscript for Proceedings of the International Conference on Quantum Criticality and Novel Phases (QCNP09, Dresden); subm. to pss(b)

Journal ref: Phys. Status solidi B 247, No.3, 747-750 (2010)

arXiv:0907.2074 [pdf, other]

doi 10.1140/epjb/e2009-00386-9

Why could Electron Spin Resonance be observed in a heavy fermion Kondo lattice?

Authors: B. I. Kochelaev, S. I. Belov, A. M. Skvortsova, A. S. Kutuzov, J. Sichelschmidt, J. Wykhoff, C. Geibel, F. Steglich

Abstract: We develop a theoretical basis for understanding the spin relaxation processes in Kondo lattice systems with heavy fermions as experimentally observed by electron spin resonance (ESR). The Kondo effect leads to a common energy scale that regulates a logarithmic divergence of different spin kinetic coefficients and supports a collective spin motion of the Kondo ions with conduction electrons. We… ▽ More We develop a theoretical basis for understanding the spin relaxation processes in Kondo lattice systems with heavy fermions as experimentally observed by electron spin resonance (ESR). The Kondo effect leads to a common energy scale that regulates a logarithmic divergence of different spin kinetic coefficients and supports a collective spin motion of the Kondo ions with conduction electrons. We find that the relaxation rate of a collective spin mode is greatly reduced due to a mutual cancelation of all the divergent contributions even in the case of the strongly anisotropic Kondo interaction. The contribution to the ESR linewidth caused by the local magnetic field distribution is subject to motional narrowing supported by ferromagnetic correlations. The developed theoretical model successfully explains the ESR data of YbRh2Si2 in terms of their dependence on temperature and magnetic field. △ Less

Submitted 22 July, 2009; v1 submitted 12 July, 2009; originally announced July 2009.

Comments: 5pages, 1 Figure

Journal ref: Eur. Phys. J. B 72, 485-489 (2009)

arXiv:0906.4410 [pdf, ps, other]

Some orbits in various models of galactic gravitational field

Authors: N. V. Raspopova, S. A. Kutuzov

Abstract: We consider a gravitational field in steady state galaxy models of two kinds. Some of them are axisymmetrical and others are triaxial. Equipotentials and potential law are given separately in accordance to Kutuzov and Ossipkov (1980). The relatively simple potential law is based on Kuzmin-Malasidze model (1969). Two kinds of models contain four and five structural parameters respectively. One co… ▽ More We consider a gravitational field in steady state galaxy models of two kinds. Some of them are axisymmetrical and others are triaxial. Equipotentials and potential law are given separately in accordance to Kutuzov and Ossipkov (1980). The relatively simple potential law is based on Kuzmin-Malasidze model (1969). Two kinds of models contain four and five structural parameters respectively. One composite model is suggested as well. Some examples of trajectories are calculated in these models. The simplest method to describe orbits is drawing their projections on coordinate planes. However it needs a great amount of calculation and makes troubles in an interpretation of information. In the case of axisymmetrical models a motion in co-moving meridional plane (with cylindrical coordinates R, z) is considered as a common way. In the case of triaxial models one can use three different co-moving planes passing through moving star and corresponding coordinate axis. We describe models in sections 2-4, calculated orbits are discussed in section 5. △ Less

Submitted 24 June, 2009; originally announced June 2009.

Comments: 12 pages, 6 figures, to be published in Dynamics of Galaxies, Proceedings of the International Conference held at Pulkovo Observatory, August 6-10, 2007

arXiv:0810.2942 [pdf, other]

doi 10.1088/0953-8984/20/45/455208

Magnetic susceptibility of YbRh2Si2 and YbIr2Si2 on the basis of a localized 4f electron approach

Authors: A. S. Kutuzov, A. M. Skvortsova, S. I. Belov, J. Sichelschmidt, J. Wykhoff, I. Eremin, C. Krellner, C. Geibel, B. I. Kochelaev

Abstract: We consider the local properties of the Yb3+ ion in the crystal electric field in the Kondo lattice compounds YbRh2Si2 and YbIr2Si2. On this basis we have calculated the magnetic susceptibility taking into account the Kondo interaction in the simplest molecular field approximation. The resulting Curie-Weiss law and Van Vleck susceptibilities could be excellently fitted to experimental results in… ▽ More We consider the local properties of the Yb3+ ion in the crystal electric field in the Kondo lattice compounds YbRh2Si2 and YbIr2Si2. On this basis we have calculated the magnetic susceptibility taking into account the Kondo interaction in the simplest molecular field approximation. The resulting Curie-Weiss law and Van Vleck susceptibilities could be excellently fitted to experimental results in a wide temperature interval where thermodynamic and transport properties show non-Fermi-liquid behaviour for these materials. △ Less

Submitted 16 October, 2008; originally announced October 2008.

Comments: 12 pages, 4 figures, 4 tables

Journal ref: J. Phys.: Condens. Matter 20 (2008) 455208

arXiv:0809.3250 [pdf]

Using descriptive mark-up to formalize translation quality assessment

Authors: Andrey Kutuzov

Abstract: The paper deals with using descriptive mark-up to emphasize translation mistakes. The author postulates the necessity to develop a standard and formal XML-based way of describing translation mistakes. It is considered to be important for achieving impersonal translation quality assessment. Marked-up translations can be used in corpus translation studies; moreover, automatic translation assessmen… ▽ More The paper deals with using descriptive mark-up to emphasize translation mistakes. The author postulates the necessity to develop a standard and formal XML-based way of describing translation mistakes. It is considered to be important for achieving impersonal translation quality assessment. Marked-up translations can be used in corpus translation studies; moreover, automatic translation assessment based on marked-up mistakes is possible. The paper concludes with setting up guidelines for further activity within the described field. △ Less

Submitted 18 September, 2008; originally announced September 2008.

Comments: 9 pages

Journal ref: Published in Russian in 'Translation industry and information supply in international business activities: materials of international conference' - Perm, 2008, pp. 90-101

Showing 1–45 of 45 results for author: Kutuzov, A