Including Dialects and Language Varieties in Author Profiling

Ciobanu, Alina Maria; Zampieri, Marcos; Malmasi, Shervin; Dinu, Liviu P.

Computer Science > Computation and Language

arXiv:1707.00621 (cs)

[Submitted on 3 Jul 2017]

Title:Including Dialects and Language Varieties in Author Profiling

Authors:Alina Maria Ciobanu, Marcos Zampieri, Shervin Malmasi, Liviu P. Dinu

View PDF

Abstract:This paper presents a computational approach to author profiling taking gender and language variety into account. We apply an ensemble system with the output of multiple linear SVM classifiers trained on character and word $n$-grams. We evaluate the system using the dataset provided by the organizers of the 2017 PAN lab on author profiling. Our approach achieved 75% average accuracy on gender identification on tweets written in four languages and 97% accuracy on language variety identification for Portuguese.

Comments:	Proceedings of PAN at CLEF 2017
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1707.00621 [cs.CL]
	(or arXiv:1707.00621v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1707.00621

Submission history

From: Marcos Zampieri [view email]
[v1] Mon, 3 Jul 2017 16:06:16 UTC (94 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2017-07

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Alina Maria Ciobanu
Marcos Zampieri
Shervin Malmasi
Liviu P. Dinu

export BibTeX citation

Computer Science > Computation and Language

Title:Including Dialects and Language Varieties in Author Profiling

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Including Dialects and Language Varieties in Author Profiling

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators