Zum Hauptinhalt springen

Showing 1–2 of 2 results for author: Nunes, M d G V

Searching in archive cs. Search in all archives.
.
  1. arXiv:1712.08917  [pdf, ps, other

    cs.CL

    Building a Sentiment Corpus of Tweets in Brazilian Portuguese

    Authors: Henrico Bertini Brum, Maria das Graças Volpe Nunes

    Abstract: The large amount of data available in social media, forums and websites motivates researches in several areas of Natural Language Processing, such as sentiment analysis. The popularity of the area due to its subjective and semantic characteristics motivates research on novel methods and approaches for classification. Hence, there is a high demand for datasets on different domains and different lan… ▽ More

    Submitted 24 December, 2017; originally announced December 2017.

    Comments: Accepted for publication in 11th International Conference on Language Resources and Evaluation (LREC 2018)

  2. arXiv:1704.02963  [pdf, other

    cs.CL cs.AI

    Exploring Word Embeddings for Unsupervised Textual User-Generated Content Normalization

    Authors: Thales Felipe Costa Bertaglia, Maria das Graças Volpe Nunes

    Abstract: Text normalization techniques based on rules, lexicons or supervised training requiring large corpora are not scalable nor domain interchangeable, and this makes them unsuitable for normalizing user-generated content (UGC). Current tools available for Brazilian Portuguese make use of such techniques. In this work we propose a technique based on distributed representation of words (or word embeddin… ▽ More

    Submitted 10 April, 2017; originally announced April 2017.

    Comments: Published in Proceedings of the 2nd Workshop on Noisy User-generated Text, 9 pages