Panning for gold: Comparative analysis of cross-platform approaches for automated detection of political content in textual data

PLoS One. 2024 Nov 18;19(11):e0312865. doi: 10.1371/journal.pone.0312865. eCollection 2024.

Abstract

To understand and measure political information consumption in the high-choice media environment, we need new methods to trace individual interactions with online content and novel techniques to analyse and detect politics-related information. In this paper, we report the results of a comparative analysis of the performance of automated content analysis techniques for detecting political content in the German language across different platforms. Using three validation datasets, we compare the performance of three groups of detection techniques relying on dictionaries, classic supervised machine learning, and deep learning. We also examine the impact of different modes of data preprocessing on the low-cost implementations of these techniques using a large set (n = 66) of models. Our results show the limited impact of preprocessing on model performance, with the best results for less noisy data being achieved by deep learning- and classic machine learning-based models, in contrast to the more robust performance of dictionary-based models on noisy data.

Publication types

  • Comparative Study

MeSH terms

  • Deep Learning
  • Humans
  • Machine Learning
  • Politics*

Grants and funding

The article is written within the project “Reciprocal relations between populist radical-right attitudes and political information behaviour: A longitudinal study of attitude development in high-choice information environments” led by S. Adam (University of Bern) and M. Maier (University of Koblenz-Landau) and sponsored by the Der Schweizerische Nationalfonds (https://www.snf.ch/)(grant number 100001CL_182630/1) and Deutsche Forschungsgemeinschaft (https://www.dfg.de/) (grant number MA 2244/9-1). The funders did not play any role in the study design, data collection and analysis, decision to publish, or preparation of the manuscript.