Cross-language opinion lexicon extraction using mutual-reinforcement label propagation

PLoS One. 2013 Nov 15;8(11):e79294. doi: 10.1371/journal.pone.0079294. eCollection 2013.

Abstract

There is a growing interest in automatically building opinion lexicon from sources such as product reviews. Most of these methods depend on abundant external resources such as WordNet, which limits the applicability of these methods. Unsupervised or semi-supervised learning provides an optional solution to multilingual opinion lexicon extraction. However, the datasets are imbalanced in different languages. For some languages, the high-quality corpora are scarce or hard to obtain, which limits the research progress. To solve the above problems, we explore a mutual-reinforcement label propagation framework. First, for each language, a label propagation algorithm is applied to a word relation graph, and then a bilingual dictionary is used as a bridge to transfer information between two languages. A key advantage of this model is its ability to make two languages learn from each other and boost each other. The experimental results show that the proposed approach outperforms baseline significantly.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Humans
  • Linguistics*
  • Speech Recognition Software*

Grants and funding

This work was mainly supported by two funds, i.e., the National Natural Science Foundation of China (60933005 & 61100083). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.