Choosing the Right Word: Using Bidirectional LSTM Tagger for Writing Support Systems

Makarenkov, Victor; Rokach, Lior; Shapira, Bracha

doi:10.1016/2019.05.003

Computer Science > Computation and Language

arXiv:1901.02490 (cs)

[Submitted on 8 Jan 2019]

Title:Choosing the Right Word: Using Bidirectional LSTM Tagger for Writing Support Systems

Authors:Victor Makarenkov, Lior Rokach, Bracha Shapira

View PDF

Abstract:Scientific writing is difficult. It is even harder for those for whom English is a second language (ESL learners). Scholars around the world spend a significant amount of time and resources proofreading their work before submitting it for review or publication.
In this paper we present a novel machine learning based application for proper word choice task. Proper word choice is a generalization the lexical substitution (LS) and grammatical error correction (GEC) tasks. We demonstrate and evaluate the usefulness of applying bidirectional Long Short Term Memory (LSTM) tagger, for this task. While state-of-the-art grammatical error correction uses error-specific classifiers and machine translation methods, we demonstrate an unsupervised method that is based solely on a high quality text corpus and does not require manually annotated data. We use a bidirectional Recurrent Neural Network (RNN) with LSTM for learning the proper word choice based on a word's sentential context. We demonstrate and evaluate our application on both a domain-specific (scientific), writing task and a general-purpose writing task. We show that our domain-specific and general-purpose models outperform state-of-the-art general context learning. As an additional contribution of this research, we also share our code, pre-trained models, and a new ESL learner test set with the research community.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1901.02490 [cs.CL]
	(or arXiv:1901.02490v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1901.02490
Journal reference:	Elsevier Engineering Applications of Artificial Intelligence, Volume 84, September 2019, Pages 1-10
Related DOI:	https://doi.org/10.1016/2019.05.003

Submission history

From: Victor Makarenkov [view email]
[v1] Tue, 8 Jan 2019 19:59:33 UTC (929 KB)

Computer Science > Computation and Language

Title:Choosing the Right Word: Using Bidirectional LSTM Tagger for Writing Support Systems

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Choosing the Right Word: Using Bidirectional LSTM Tagger for Writing Support Systems

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators