What Distinguishes Conspiracy from Critical Narratives?
A Computational Analysis of Oppositional Discourse

Damir Korenčić^† [email protected] Universitat Politècnica de València, Spain Ruđer Bošković Institute, Croatia

Berta Chulvi^† Symanto Research, Spain Universitat de València, Spain

Xavier Bonet Casals CLiC - Universitat de Barcelona, Spain

Alejandro Toselli Universitat Politècnica de València, Spain

Mariona Taulé CLiC - Universitat de Barcelona, Spain

Paolo Rosso Universitat Politècnica de València, Spain ValgrAI - Valencian Graduate School and Research Network of Artificial Intelligence, Spain

What Distinguishes Conspiracy from Critical Narratives? A Computational Analysis of Oppositional Discourse

Damir Korenčić^† [email protected] Universitat Politècnica de València, Spain Ruđer Bošković Institute, Croatia

Berta Chulvi^† Symanto Research, Spain Universitat de València, Spain

Xavier Bonet Casals CLiC - Universitat de Barcelona, Spain

Alejandro Toselli Universitat Politècnica de València, Spain

Mariona Taulé CLiC - Universitat de Barcelona, Spain

Paolo Rosso Universitat Politècnica de València, Spain ValgrAI - Valencian Graduate School and Research Network of Artificial Intelligence, Spain

Abstract

The current prevalence of conspiracy theories on the internet is a significant issue, tackled by many computational approaches. However, these approaches fail to recognize the relevance of distinguishing between texts which contain a conspiracy theory and texts which are simply critical and oppose mainstream narratives. Furthermore, little attention is usually paid to the role of inter-group conflict in oppositional narratives. We contribute by proposing a novel topic-agnostic annotation scheme that differentiates between conspiracies and critical texts, and that defines span-level categories of inter-group conflict. We also contribute with the multilingual XAI-DisInfodemics corpus (English and Spanish), which contains a high-quality annotation of Telegram messages related to COVID-19 (5,000 messages per language). We also demonstrate the feasibility of an NLP-based automatization by performing a range of experiments that yield strong baseline solutions. Finally, we perform an analysis which demonstrates that the promotion of intergroup conflict and the presence of violence and anger are key aspects to distinguish between the two types of oppositional narratives, i.e. conspiracy vs. critical.

^†^†footnotetext: Equal contribution^†^†footnotetext: Accepted version of the paper: https://doi.org/10.1111/exsy.13671

Keywords conspiracy theories, oppositional thinking, NLP for computational social science, text classification, sequence labeling

1 Introduction

Conspiracy Theories (CTs) are complex narratives that attempt to explain the ultimate causes of significant events as cover plots orchestrated by secret, powerful, and malicious groups (Douglas and Sutton, 2023). The automatic detection of CTs in written texts has recently gained popularity, and the problem is commonly framed as a binary classification task, which can be detailed more thoroughly with fine-grained approaches corresponding to multi-label or multi-class classifications (Giachanou et al., 2021; Moffitt et al., 2021; Pogorelov et al., 2021, 2023).

However, existing approaches do not distinguish between critical and conspiratorial thinking. This distinction has important implications for automatic content moderation: if models do not differentiate between critical and conspiratorial thinking, there is a high risk of pushing people toward conspiracy communities. Namely, labeling a text as conspiratorial when it is actually opposing mainstream views, could potentially lead people who were simply asking questions closer to conspiracy communities. As several authors from the field of social sciences suggest, a fully-fledged conspiratorial worldview is the final step in a progressive “spiritual journey" that sets out by questioning social and political orthodoxies (Sutton and Douglas, 2022; Funkhouser, 2022; Franks et al., 2017). Additionally, recent research (Phadke et al., 2021) has shown that the level of interaction with conspiratorial users is the most important feature for predicting whether or not users join conspiracy communities.

Another gap in the computational analysis of conspiratorial written texts is the failure to address the role that intergroup conflict (IGC) plays in conspiratorial and critical narratives. Applying the concept of intergroup conflict to the study of language consists of identifying a particular approach to the framing of events, one that emphasizes the hostility between groups, typically by using the “us versus them” narrative, and by fueling the perceived injustice and threat to the ingroup (for a recent review of the IGC, see Böhm et al., 2020). The relevance of connecting the study of conspiracy theories with the theoretical framework of IGC stems from the increasing and potentially violent involvement of conspiratorial communities in political processes. This involvement suggests that the purpose of CTs is to reinforce group dynamics and to coordinate action (Wagner-Egger et al., 2022). Therefore, the tools that enable an IGC-based analysis of conspiratorial texts could offer valuable insights for both social sciences and content moderation.

Motivated by the described issues, we propose a novel annotation scheme that distinguishes between conspiracy and critical texts, and defines important categories of oppositional narratives (Figures 2 and 3 in subsection 3.2 contain examples of this annotation scheme). In addition to the core elements of conspiracy narratives that have been widely used in the literature, such as Agents (the conspirators) and Victims, the proposed annotation scheme identifies two additional categories: Facilitators (those collaborating with the agents, such as the media) and Campaigners (those that unmask the conspiratorial agenda). These types of actors are considered as key players in IGC: on the one hand, the facilitators are tangible targets with whom real conflict is possible (in contrast to the agents, which tend to operate in secret), and on the other hand, the campaigners are those who show their opposition to the facilitators and try to persuade the victims to join the cause.

To summarize, distinguishing between critical discourse on the one hand, and inflammatory, violent, and destabilizing narratives on the other, is important for democratic societies which seek to promote freedom of speech and ensure a civilized exchange of opinions. This distinction is also crucial in order to avoid excessive censorhip.

In order to tackle the described issues, we focused on oppositional Telegram messages related to the COVID-19 pandemic, and constructed the multilingual (English and Spanish) XAI-DisInfodemics corpus, annotated with a novel annotation scheme. We then proceeded to tackle several Natural Language Processing (NLP) tasks related to annotation tasks we have carried out during the construction of the corpus. These include a binary classification task of distinguishing between conspiratorial and critical texts, and a multi-label annotation task at the span level to detect the main elements of both kinds of oppositional narratives (Section 3). Both the code used in the experiments and the data have been made available.¹¹1https://github.com/dkorenci/oppositional-discourse

In this paper, the following research questions are addressed:

•

RQ1: Can the critical-vs-conspiracy classification task be solved successfully with state-of-art NLP methods?
•

RQ2: Can narrative elements, including the IGC-related elements, be automatically detected?
•

RQ3: Is the proposed annotation scheme a useful analytical tool for interdisciplinary research on computational linguistics and social science?

This article is organized in the following sections: in Section (2) we first give an overview of the current research context. In Section (3) we proceed to describe the XAI-DisInfodemics corpus, the annotation scheme, and the annotation process. In Section 4 we tackle RQ1 and RQ2 by analyzing the performance of fine-tuned transformers, and we demonstrate that the critical vs. conspiracy distinction can be detected with a high level of accuracy. We also demonstrate that the detection of oppositional narrative categories is a more challenging task, especially at the fine-grained span level, and that the binary detection of these categories is an approach that can achieve high accuracy for the majority of the categories. In Section 5 we tackle RQ3 by conducting experiments using variables which take into account the Campaigners and Facilitators categories from the annotation scheme. We show that an important distinction between critical and conspiratorial narratives is that the latter promote intergroup conflict more intensely and convey more words related to anger and political violence. Finally, in Section 6 we outline the conclusions and future work.

Our work makes four main contributions: (1) an annotation scheme based on a novel approach to oppositional text analysis that focuses on both the conspiracy vs. critical distinction, and on the IGC-related narrative elements; (2) the multilingual XAI-DisInfodemics corpus, consisting of Telegram messages related to COVID-19; (3) several machine learning experiments that examine the tractability of the scheme-derived tasks and yield strong baseline solutions for future applications; (4) the analyses that demonstrate the usefulness of the proposed approach in computational social sciences.

2 Related Work

The detection of conspiracy theories in written text has been approached by several authors using NLP techniques both in Computational Linguistics and in Computational Social Sciences (CSS). Bessi (2016) used a text scaling method to map conspiratorial text to personality traits and to analyze conspiracies. Topic modeling has been used to extract and analyze common themes in conspiracy texts (Klein et al., 2018; Samory and Mitra, 2018). Samory and Mitra (2018) use syntactic parsing in order to extract “motifs” (agent-action-target triplets) and analyze patterns of their occurrence. Holur et al. (2022) performed detection of insider and outsider entities in conspiracy texts by automatic labeling of noun phrases. Giachanou et al. (2021) used psychological and linguistic features to classify and analyze social media users that spread conspiracies. In contrast, Levy et al. (2021) analyzed the capacity of large language models to generate conspiracies. In recent years, there has been a lot of work on NLP approaches to detect and analyze COVID-19 conspiracies in written texts. Moffitt et al. (2021) developed a classifier of conspiracy tweets and used it for propagation analysis. Two recent MediaEval challenges which focused on classification of conspiracy texts (Pogorelov et al., 2021, 2023) led to a number of approaches demonstrating that the state-of-the-art architecture is a multi-task classifier (Peskine et al., 2021, 2023; Korenčić et al., 2023) based on CT-BERT (Müller et al., 2023).

While other authors also focused on methods that analyze conspiratorial text at the span level (Holur et al., 2022; Samory and Mitra, 2018; Introne et al., 2020), the main contribution of our work is that we take into account the importance of intergroup conflict.

Holur et al. (2022) recognize the relevance of IGC: “conspiracy theories and their constituent threat narratives share a signature semantic structure: an implicitly accepted Insider group; a diverse group of threatening Outsiders”. However, they do not define the insider and outsider labels in the sense of conflicting social groups. The label insider is defined in the sense of familiar or pleasant, and the label outsider as the opposite. These binary classes are in contrast with the more fine-grained and IGC-oriented categories that we define in Section 3.3. Additionally, Holur et al. (2022) restrict the labeling to noun phrases, while we allow the spans to range from a single word to whole clauses.

Samory and Mitra (2018) propose an unsupervised method to extract agent-action-target triplets describing conspiracy narratives, but it fails to capture the IGC-related aspect of the texts. In addition, the subject-predicate-object triplets can fail to match with more abstract agent-action-target triplets. In contrast, we use a flexible span definition that allows for different types of actors and other narrative elements.

Introne et al. (2020) propose a span-level scheme of six categories (event, actor, goal, action, consequence, target), and use it to analyze 236 messages from anti-vaccination forums. They distinguish between conspiracy theories and conspiratorial thinking, a category that implies only passive support. This distinction is not based on annotations grounded in theory, but on the requirement of all the categories being present in a given text. However, in practice fewer elements can convey a conspiracy theory in a very strong manner.

Several corpora have been built to study conspiracy theories or phenomena related to them. Memon and Carley (2020) constructed a dataset of 4,700 tweets with 17 misinformation-related categories, including one for conspiracy spreading tweets. Song et al. (2021) collected a corpus of 1,293 debunked COVID-19 claims and annotated them with a number of misinformation categories, including conspiracy theory. Holur et al. (2022) developed the CT5K corpus, consisting of 5,000 COVID-19 conspiracy messages, collected from several social platforms, which they used for the previously described experiments with insider-outsider classification.

Uscinski et al. (2011) collected a dataset consisting of letters sent to a mainstream US publication, and labeled them as either containing a conspiracy or not. The large scale LOCO corpus (Miani et al., 2021) contains 96,743 texts from a diverse collection of mainstream and conspiracy media outlets. The texts are enriched with website metadata and auto-generated topics. COCO is a corpus of 3,495 texts containing COVID-19 conspiracies (Langguth et al., 2023). The texts were manually annotated with a fine-grained classification scheme encompassing conspiracy sub-topics.

The recent literature review by Mahl et al. (2022) shows that the interest about conspiracy theories in online environments has been exponentially rising within the Social Sciences, and that 80% of work on this topic was done on written content, with about a third of the articles using methods for automated content analysis. This demonstrates the relevance of our research, as we provide a new corpus, conceptual tools, and NLP methods that enable more sophisticated analyses than traditional methods of textual content analysis, which rely on techniques such as text scaling and word counting.

3 Dataset: XAI-DisInfodemic Corpus

This section introduces the XAI-DisInfodemic corpus, a multilingual (English and Spanish) corpus consisting of 5,000 annotated Telegram messages each. These messages contain oppositional non-mainstream views on the COVID-19 pandemic, classified into two categories: critical messages and conspiratorial messages. The aim of this corpus is to distinguish these two types of oppositional narratives, which are not equivalent and have very different effects on society: while conspiracy theories about COVID-19 call into question the whole system of social organisation to fight against a disease, critical thinking may express disagreements that are tractable within the system of social organisation of a given community. These messages have been annotated in two consecutive phases, which correspond to two separate annotation tasks. Firstly, we present the construction of the corpus (subsection 3.1), which includes the source and selection criteria of the messages. Secondly, we describe the annotation methodology for the first task (subsection 3.2) and for the second task (subsection 3.3). For each task, we specify the tagset, the annotation criteria, the annotation process, and the IAA agreement tests. Finally, we present the process of anonymization of the corpus (subsection 3.4).

3.1 Corpus Construction

The messages which constitute the corpus were obtained from public Telegram channels in which users tend to post messages which oppose the mainstream discourse related to the COVID-19 pandemic. Even though the social network X (formerly known as Twitter) is commonly used as a source for this kind of research (Langguth et al., 2023), we decided to opt for Telegram messages for several reasons: firstly, because they tend to be longer than X messages, which allows for a deeper analysis of oppositional narratives. Secondly, Telegram has much less restrictive moderation policies, so it is easier for conspiracy theorists to post their opinions more freely (Vergani et al., 2022).

Refer to caption — Figure 1: The process of corpus construction.

The collection of the messages was carried out as depicted in Figure 1. Firstly, we created a list of 2,273 public Telegram channels, either in English or in Spanish, that contain oppositional non-mainstream views on the COVID-19 pandemic. Locating such channels is challenging since they are familiar only to members of closed groups aligned with channels’ attitudes. The first step in the creation of this list was to use a ‘snowballing’ procedure (Noy, 2008) : one of the researchers became a member of several social networks related to conspiracy theories. Specifically, he joined the extreme right-wing network GAP, several Facebook groups, and started to follow on X (former Twitter) a number of users who were openly critical of the government’s health measures, and/or public disseminators of conspiracy theories. After that, the sequence of characters “t.me/” was used to locate additional oppositional Telegram channels in the aforementioned social networks, since Telegram channels are often identified with the short URL “t.me/”, followed by the name of the channel. The final step was to repeat this last procedure within the Telegram channels themselves, until no new channels were found.

Then the messages were downloaded from the channels and filtered by removing duplicates, short texts (i.e. those containing fewer than 12 tokens), and texts with a large proportion of URLs and mentions. Finally, each message was ranked using a quality index. This quality index is composed of criteria capturing the activity of the channel and the message length. Concretely, the quality score of a message was the sum of the scores of the following sixcriteria: (1) the audience of the channel, (2) the number of authors in the channel, (3) the mean of messages by author, (4) the standard deviation of the mean of messages by author, (5) the number of messages in the channel, and (6) the number of words in the message. Once the messages were sorted using the quality score they were manually reviewed, starting from the top, to discard the messages not related to COVID-19. This way 5,000 comments related to COVID-19 and having highest quality scores were selected for each language. The details of the process of filtering and ranking of the messages can be found in the supplementary material. Table 1 contains basic statistics of the final English and Spanish corpora.

Sprache	Avg.	Std. dev	Min.	Q1	median	Q3	Max.
Spanish	128	123	23	49	98	148	766
Englisch	265	528	12	32	65	266	4,108

Table 1: Statistics of the text length, measured in number of words (whitespace separated tokens), for English and Spanish corpora: the average, the standard deviation, the minimum, the first quartile, the median, the third quartile, and the maximum.

3.2 Conspiracy vs. Critical Annotation

The aim of the first annotation task was to identify whether a message hints at the existence of a conspiracy theory, or whether it is criticizing mainstream views on COVID-19 but do not suggest the existence of a conspiracy. To that purpose, we developed an annotation scheme with two binary labels, Conspiracy and Critical. Each of these labels can have either the value 1 if the message contains the corresponding feature, or the value 0 if it does not contain it. These labels are mutually exclusive (i.e. no message can have the value 1 in both labels simultaneously). Messages annotated with the value 0 in both labels were not included in the final corpus (but are available if required). The total amount of these discarded messages are 767 in English and 360 in Spanish. This low amount of “neutral” messages is due to the fact that the corpus was extracted from Telegram channels which tend to contain oppositional messages rather than general domain ones.

For the first annotation task, we developed an annotation scheme to differentiate between texts hinting at the existence of a conspiracy, and those criticizing mainstream views on COVID-19 but without suggesting the existence of a conspiracy. A comment was labeled as Conspiracy if any of these five criteria were met:

1.

The message frames COVID-19 or a related public health strategy as the result of the agency of a small and malevolent agent (either known or unknown) operating in the dark, with the intention to carry out a hidden plan. This plan could be to control society, to physically or mentally manipulate citizens or to impose a false narrative, among others (Douglas and Sutton, 2023). For example: "Whether you believe it to be psychological, physical or both, vaccines have been weaponized as a tool for population control. This much is now obvious". It can also be considered a conspiracy if the vaccine is described as some sort of intentional ‘poison’, and the official vaccination campaign is portrayed as a deliberate murder or as an experiment carried out by an (explicit or implicit) agent, who is purposely trying to harm society. For example: “Pfizer knew the Covid jabs were killing 3% of those who received it. This video is an excellent and succinct summary of the dangers of the “poison death jab” and the musical chairs game big pharma media and govt agents have been playing on us. […]”. The hidden plan may be presented in the form of a ‘join the dots’ narrative, i.e. unrelated facts are presented as being connected with the intention of ‘proving’ the existence of a powerful agent who is deliberately responsible of large-scale events. For example: “Looks like #Covid and all the evil caused by the vaccines were just a test run for the larger plan of locking us all into areas codes… Denial is no longer an option”.
2.

The message implies that the pandemic is not real, and that "a false version of reality has been promoted or protected by the conspirators and their unwitting stooges" (Douglas and Sutton, 2023). For example: “They have already started their bull shit - so the vaccines didn’t work - the last lock downs didn’t work - but these globalist scum want it all to happen again as their immune compromised sheep have no immune system and are filling the hospital beds and dying […]”.
3.

The message suggests that those who publicly mock or deny the existence of a conspiracy theory and its negative consequences are actually involved in the secret plan and are collaborating with the agents. For example: “[…] Las vacunas experimentales COVID-19 están causando mucho daño. En este caso un infarto un día después del pinchazo. Y los sanitarios lo saben. […]”. (English translation: The experimental COVID-19 vaccines are causing a lot of damage. In this case, a heart attach just one day after the jab. And health staff know about it).
4.

A distinction is made between the ingroup (i.e. those who know the truth and reveal the conspiracy, ‘us’) and the outgroup (i.e. those who remain ignorant, society at large, ‘them’). The outgroup is homogenized and its members are perceived as being manipulated and/or ignorant. For example: "#NeverForget all of this fake staging they did - to be used as demagoguery for the pedantic sheep to use against one another, shaming their neighbours into mass compliance for a ‘global pandemic’ hoax". However, the simple presence of a narrative ‘them’ is not necessarily an indication of a conspiracy theory: when two opposing ideas are presented, it is quite usual to associate each idea to its own champion social entity, but there is a difference between establishing an opposition with another social group (in the sense of ‘their behavior is irrational’ or ‘they refuse to listen’, which is a feature of critical thinking) and presenting ‘them’ as an enemy (which is a feature of conspiracy theories).

On the other hand, messages with critical thinking also contain elements of an oppositional narrative, but focus on the way the pandemics has been managed by authorities, as well as the effects and/or efficacy of the vaccine. A message was labeled as Critical if any of these four criteria were met:

1.

The official vaccination strategy and/or the efficiency of vaccines is called into question (e.g. due to the fact that they were developed and commercialized in a very short period of time). This critical stance does not rely on esoteric explanations and the comment does not deny the validity of scientific protocols, but it rather provides what the author considers to be factual data or objective evidence (e.g. the author’s first-hand experience, second-hand accounts from acquaintances or other online users, etc.). For example: "YouTube has blocked renowned physician Dr. Drew Pinsky from going live on its platform after he showed a vaccine-related eye injury he incurred after receiving an experimental Covid jab. […]".
2.

The comment establishes a link between the vaccine and a number of side effects which are not included in the mainstream narrative. This causal relationship is sometimes referred to as a ‘crime’ due to a (perceived) unusually high mortality after the vaccination campaigns, but the comment does not imply any sort of deliberate intentionality. For example: "The Spike Protein’s Impact: “Vaccination may cause our overall immune system to fail to fight against such bad things” - Shigetoshi Sano, M.D., PhD., Professor and Chair, Kochi University School of Medicine".
3.

Asking for information about potential links between the vaccine and its effects on public health is also considered to be a feature of critical thinking, due to the context in which such requests are taking place (i.e. it is assumed that by making use of this kind of channels the author expects a different answer from what could be obtained at an official health center). For example: "Alguien del grupo tiene o puede tener contacto con médicos/sanitarios que vivan en Reino Unido (zona de Escocia) y estén ayudando/aconsejando a vacunados? Necesitaría algún teléfono, grupo de Telegram… lo que sea que sirva de ayuda. Gracias." (English translation: "Does anyone in the group have or may have contact with doctors/healthcare providers who live in the UK (Scotland area) and are helping/advising vaccinated? I would need any phone numbers, Telegram groups… anything that would be helpful. Thanks"). However, if the comment is simply asking for or giving information of any other kind (e.g. a phone number to contact someone, where a given product can be bought, etc.), it is not considered to include either a conspiracy theory or critical thinking, and is therefore annotated with the value 0 on both labels.
4.

The comment supports a deliberate rejection of the vaccine and/or expresses dissatisfaction with the negative consequences that this rejection entails (e.g. social discrimination, mobbing, administrative constraints, etc.). Oftentimes, the group of the vaccinated and the group of the unvaccinated are presented as antagonistic, but there is no allusion to an agent with a hidden agenda. For example: "Si vives en España,la vacunación es voluntaria. Si los superiores les dicen que se inoculen, que te lo dé por escrito, como no lo van hacer, grábalo y te servirá ante un juicio. Hay muchas personas como tú. RESISTE" (English translation: "If you live in Spain, the vaccine is optional. If your superiors tell you to get jabbed, ask them to give it to you in writing, since they are not going to, record it and it will help you in court. There are many people like you. RESIST").

A key aspect that serves to distinguish conspiratorial thinking from critical thinking is that the latter emphasises impersonal, systemic causes while the former focuses on the agency of a secret group. Recurrent themes in critical thinking are the questioning of the efficacy of vaccines, criticism of the pharmaceutical industry, criticism of the mandatory nature of vaccines, and the side effects of COVID-19 vaccines.

The annotation process of this first task was carried out by two teams of annotators (one for the Spanish corpus and one for the English corpus), all of them expert linguists. Each team was constituted of 3 different annotators, either native or highly proficient speakers in the corresponding language (therefore, each message was annotated by 3 different people). The annotators carried out the task independently, and on a weekly basis meetings were held to discuss disagreements among the annotators and also with the help of two senior annotators (the social psychologist who created the taxonomy, and an expert in corpus linguistics). This process lasted about 5 months. In order to assess the quality of the annotation, we performed a number of tests. For the English corpus, the IAA in terms of Krippendorf’s $\alpha$ is of 0.79 for Conspiracy messages and 0.60 for Critical messages, and the average observed percentage of agreement between the three annotators is of 91.4% for Conspiracy and 80.3% for Critical. For the Spanish corpus, Krippendorf’s $\alpha$ is of 0.80 for Conspiracy and 0.70 for Critical, and the average observed percentage of agreement is of 90.9% for Conspiracy and 84.9% for Critical. Finally, the Gold Standard was obtained by means of a majority vote (i.e. agreement between at least two annotators).

3.3 Oppositional Narrative Annotation

The aim of the second annotation task was to identify the main elements of intergroup conflict (IGC) which appear in oppositional narratives. In order to do that, we selected the span of text in which each element appears and assigned it the corresponding label. The tagset includes six labels, which can be applied both to messages containing a conspiracy theory and messages containing critical thinking. In this section, we describe the labels which compose the tagset, and also present the criteria for span delimitation.

We identified the following six categories of narrative elements (see Figure 2 foran example annotation of a Conspiracy message, and Figure 3 for an example annotation of a Critical message.):

1.

Agents (A): Those responsible for the actions and/or negative effects described in the comment. In Conspiracy, it could be the hidden power that pulls the strings (in Figure 2, "Private owned WHO", "investors like Bill Gates", "pharma companies" and "very evil beings"). In Critical, it could be the actors that design the mainstream public health policies (in Figure 3, "White House chief medical advisor Dr. Anthony Fauci" and "the lead of CDC director Rochelle Walensky, who questioned natural immunity").
2.

Facilitators (F): Those who collaborate with the agents and contribute to the execution of their goals. In Conspiracy, they could be governments or institutions which, either intentionally or unwittingly, collaborate with the conspirators and help the conspiracy move forward (in Figure 2, "the world governments ruled by their puppets", "their media", "the media" and "governments"). In Critical, they could be healthcare workers, mass media or authority figures who abide by governmental instructions (in Figure 3, "university hospitals" and "the vaccinated work - from - home hospital administrators who are firing her for not being vaccinated").
3.

Campaigners (C): Those who oppose the mainstream narrative. In Conspiracy, those who know the truth and expose it to society at large (in Figure 2, "those awake already"). In Critical, those who oppose the enforcement of laws and/or refuse to follow health-related instructions from the authorities (in Figure 3, "Dr Martin Kulldorff").
4.

Victims (V): Those who suffer the consequences of the actions and decisions of the agents and/or the facilitators. In Conspiracy, the people who are deceived by those in power, and suffer, become ill, lose their freedom, or die as a result of a hidden plan (in Figure 2, "people", "most people" and "regular people"). In Critical, the people who receive the negative consequences of the actions and the decisions made by those in power, and also suffer, lose their freedom, become ill, or die as a result of wrong decisions (in Figure 3, "all nurses, doctors and other health care providers").
5.

Objectives (O): The intentions and purposes that the agents are trying to achieve. In Conspiracy, the goals of the conspirators (in Figure 2, "agenda" and "destroying us"). In Critical, the goals of public authorities, pharmaceutical companies, organizations, etc (in Figure 3, "pushing vaccine mandates").
6.

Negative Effects (E): The negative consequences suffered by the victims as a result of the actions and decisions of those in power and/or their collaborators (in Figure 2, "the constant fear mongering" and "pay a hefty price, often with their health, lives, the loss of their loved ones"; in Figure 3, "will be fired if they do not get a Covid vaccine").

The decision of where each label should start and end is not as straightforward as it may seem; for this reason, we established the following criteria for span delimitation (in the examples given below, the span is enclosed within square brackets, and the initial of each label appears afterwards within round brackets):

1.

In general, punctuation marks are excluded from the span. For example, “[…] like you and everyone else who isn’t getting paid by these satanic billionaires and isn’t in on the whole thing? [The government](F)?". However, exclamation and question marks thought to be relevant for the interpretation of the label can be included. For example: [She leaves behind her year old son… How many more will we lose, how many children?](E). Also, if an opening punctuation mark has already been included in the span (e.g. quotes, parenthesis, etc.), its corresponding closing punctuation mark must also be included. For example: “Why SARS-Cov-2 ["vaccinated" people](V) can test HIV+".
2.

Paralinguistic elements, such hashtags, which are considered to be relevant for the interpretation of the comment can be labeled on their own or be included in an adjacent fragment if they both have the same communicative function. For example: "Another super fit athlete dies suddenly at the age of 35. I wonder what could be causing all these youg, fit athletes to die suddenly? #COVIDVaccine [#GenocideAgenda](O)".
3.

The label of categories relative to actors (i.e. Agents, Facilitators, Campaigners and Victims) must include the syntactic determiner of the noun phrase (articles, possessive determiners, etc.). For example: “[The billionaire Luciferian Elites](A) think of themselves as the apex predators of the world and therefore deserving of inheriting the Earth.".
4.

Each individual element in an enumeration must be annotated separately, with its own label. For example: “[…] while [Bill Gates](A), [the Rockefeller and Ford Foundations](A), and [the World Bank](A) control 10 per cent of the world’s germplasms […]".
5.

Only linguistic elements which have full semantic content can be labeled; thus, purely coreferential elements (personal pronouns, demonstrative pronouns, clitic pronouns, etc.) must not receive their own label unless they are part of a larger phrase which contains semantically full elements. For example: “[…] Bill Gates (GAVI, WHO, CDC, Microsoft, most vaccines etc etc) set on destroying and enslaving humanity, by poisoning and starving [us, common people](V)".
6.

In all of these categories referring to an actor of the oppositional narrative, the label must include those elements which help to identify and narrow down the referent (adjectives, defining relative clauses, etc.). For example: “[…] the soldiers of [the satanic billionaires who like to call themselves Elites](A).”.

The second annotation task was carried out by two teams (one per language) composed of two linguists each. Both teams were supervised by the same senior annotator, with whom they discussed doubts on a weekly basis. In turn, whenever it was necessary, this senior annotator held further meetings with another senior annotator and the psychologist who created the taxonomy to clarify doubts regarding the implementation of the annotation criteria. These criteria were updated in the annotation guidelines through an iterative process (i.e. we started with a basic set of criteria, which were adapted during the initial periods of the task to make sure they could accurately be implemented onto our dataset). In order to assess the quality of this annotation task, the Gamma ( $\gamma$ ) measure (Mathet et al., 2015) was used to calculate the IAA. Following our preliminary annotation guidelines, we annotated an initial batch of 300 texts, which yielded an average $\gamma$ of 0.43. After revising and discussing the annotation of this first batch, we fine-tuned the guidelines and annotated a second batch of 300 texts with the updated criteria, which yielded an average gamma of 0.57. Finally, a third batch of 300 annotated texts yielded an average gamma of 0.61, which we deemed acceptable since it is close to or above the average agreement of other highly conceptual span-level annotation schemes (Da San Martino et al., 2019; Weimer et al., 2022a). After that, the rest of the corpus was annotated following the final version of the annotation guidelines. Following this final version, the IAA was calculated in batches of 200 texts. Whenever the average gamma result of a batch was below 0.6, the worst quality messages within this batch were discussed and reannotated in order to guarantee the quality of the annotation. After this reannotation, the final average gamma of the task was of 0.72. Given the difficulty of such a subjective and nuanced task, we deem this result as more than acceptable.

The Gold Standard (GS) of this second annotation task was automatically created applying the following criteria to resolve cases of disagreement:

1.

When the annotators agreed on the label but disagreed in the precise limits of a given span (i.e. there was partial overlapping), the GS consisted of the union of both spans. For example, if Annotator 1 labeled as Facilitators the span “administrators and superiors” and Annotator 2 used the same label but selected the span “superiors who reportedly never intended to approve them in the first place”, the GS was created by joining both spans, i.e. “administrators and superiors who reportedly never intended to approve them in the first place”.
2.

When only one of the annotators labeled a given span, or when there was a disagreement in the labels for the same span, the GS did not incorporate either annotation.

		A	F	C	V	O	E
ES	Alle	3,329 (14.0%)	2,688 (11.3%)	4,231 (17.8%)	5,260 (22.2%)	622 (2.6%)	7,150 (30.2%)
	Conspiracy	1,361 (9.8%)	1,184 (8.6%)	2,133 (15.4%)	3,543 (25.6%)	23 (0.2%)	5,326 (38.5%)
	Critical	1,968 (20.0%)	1,504 (15.2%)	2,098 (21.3%)	1,717 (17.4%)	599 (6.1%)	1,824 (18.5%)
EN	Alle	6,411 (22.4%)	3,462 (12.1%)	6,416 (22.4%)	4,433 (15.5%)	2,073 (7.2%)	5,565 (19.4%)
	Conspiracy	3,333 (21.1%)	1,336 (8.5%)	3,839 (24.4%)	2,734 (17.3%)	615 (3.9%)	3,708 (23.5%)
	Critical	3,078 (23.9%)	2,126 (16.5%)	2,577 (20.0%)	1,699 (13.2%)	1,458 (11.3%)	1,857 (14.4%)

Table 2: Statistics for the gold span-level annotations of the narrative elements. Absolute number and percentage of spans is given, for each of the binary text classes and for all texts, and for each of the six narrative categories: Agents (A), Facilitators (F), Campaigners (C), Victims (V), Objectives (O), Negative Effects (E).

Table 2 shows the amount and the percentages of spans in the GS that have been annotated with each label for each category (Conspiracy or Critical). The low number of certain labels in this table might be due to the way the GS was created, especially in spans that can often be interpreted either as Agent or Facilitator in many messages, which results in label disagreement. The application of criterion (2) results in these instances not being present in the GS. Following the learning with disagreement paradigm (Uma et al., 2021) we provide not only the GS but also the individual annotations of each annotator.

3.4 Anonymization

The final step in the creation of the XAI-DisInfodemic corpus was the anonymization of sensitive personal data contained in the messages. Personal data can be proper nouns of anonymous people (i.e. not public figures), phone numbers, email addresses, bank account numbers, and mentions. The procedure we implemented is a combination of automatic and manual approaches:

1.

Phone numbers and bank account numbers were automatically anonymized.
2.

Mentions (@username) and emails ([email protected]) were automatically extracted, and a senior annotator who was familiar with the corpus decided which usernames needed to be anonymized (i.e. those belonging to private individuals, rather than public figures or Telegram channels). The anonymization process consisted in replacing the characters of an username with a randomly generated sequence of alphanumeric characters, with two random numbers added at the end.
3.

Proper nouns were selected automatically, using the spaCy tool²²2https://spacy.io/, and reviewed manually by the same senior annotator, taking into account the context in which they appeared. For each proper noun that needed to be anonymized, an alternative replacement was manually given, so that gender and format (e.g. first name, full name, short versions of a name, etc.) were maintained. This step is crucial to ensure that co-reference relationships and gender agreement are kept within each message, so that the cohesion of the discourse is not lost.

4 Predictive Machine Learning

In this section we describe a series of machine learning experiments aimed at predicting text-related variables provided by the annotation process described in Sections 3.2 and 3.3. These experiments include the binary classification of texts into the Critical and Conspiracy categories, the detection of word spans representing the elements of oppositional narratives, and the text-level binary classification of the occurrences of the narrative categories.

Our aim was to assess the performance of standard NLP models and to offer robust solutions for researchers as a basis for future improvements. We view these experiments as a first step towards full automatization of manual annotation in future applications, and we leave the exhaustive exploration of many possible improvements for future work.

4.1 Classifying Critical vs. Conspiracy

We first tackled the task of distinguishing between critical and conspiratorial messages, set as a binary classification problem (with Conspiracy as the positive class³³3We note that, in our experiment, the performance of the transformer-based classifiers is invariant to the definition of positive class: the results (rounded to two decimal places) remain the same when Critical is the positive class.) on Gold Standard data. We experimented with BERT (Devlin et al., 2019) and RoBERTa (Liu et al., 2019) models, using the original English models and their Spanish counterparts, BETO (Cañete et al., ) and BERTIN (Rosa et al., 2022). RoBERTa builds upon BERT by using an improved training procedure and more data. For English messages, we also experimented with DeBERTaV3 (He et al., 2023), an advanced model that builds upon the BERT/RoBERTa architecture by using improved pre-training and by disentangling the content and the position attention.

We built the classifiers by fine-tuning the transformer models from the huggingface⁴⁴4https://huggingface.co/models repository, and we used case-sensitive “base” versions. The number of tokens was set to 256. We tuned the models for 3 epochs using the AdamW optimizer, learning rate of $2e^{-5}$ , slanted triangular LR scheduler with a 10% warm-up period, a batch size of 16, and a weight decay of 0.01. All the layers of the transformers were fine-tuned. The dropout rate for the classification head was 0.1. In order to obtain robust performance estimates, the results were computed using five-fold cross-validation (4K train and 1K test texts), with separate per-fold random seeds for initialization of model parameters.

	Englisch		Spanish
model	F1-CN	F1-CR	F1-CN	F1-CR
BERT	0.84	0.92	0.80	0.89
RoBERTa	0.86	0.93	0.78	0.88
DeBERTa	0.88	0.94
human-vs-human	0.82	0.90	0.86	0.91
human-vs-gold	0.91	0.95	0.93	0.96

Table 3: Performance of classifiers on the Critical (CR) vs. Conspiracy (CN) task and agreement between human annotators, in terms of the binary F1 measure.

In order to compare the performance of models with expected human agreement, we computed the agreement between annotators in terms of the binary F1 score. First, the annotations (Section 4.1) were used to extract binary Critical vs. Conspiracy labels for each annotator. Then these labels were viewed as class predictions and evaluated against two definitions of ground truth: labels of other annotators (human-vs-human), and the labels obtained by the majority vote (human-vs-gold). In each case, the final score was formed by averaging over all possible (prediction, ground truth) pairs. Note that the human-vs-human scores are pessimistic estimates of human performance, since labels of a single annotator represent a ground truth that is more variable than the average. On the other hand, the human-vs-gold scores are optimistic estimates since the individual predictions are merged into the average, and can therefore influence it.

Table 3 contains both the models’ and the inter-annotator binary F1 scores. These results show that the classifiers performed well both for Spanish and English, with the English models achieving higher F1 scores. Detection of Critical messages achieves very high scores for all the models and languages, while the Conspiracy scores are somewhat lower, especially for Spanish. For English, DeBERTaV3 performed best, while for Spanish BERT outperformed RoBERTa.

When compared to human scores, all the English models outperform the (pessimistic) human-vs-human scores and stay below the (optimistic) human-vs-gold scores. The best-performing DeBERTa model is close to the human-vs-gold scores. For Spanish, the models’ scores are below both the human-vs-human and human-vs-gold scores, with the performance on critical texts being much closer to the human thresholds than the performance of conspiracy texts.

Possible reasons for variation in the models’ performance across languages are the differences in quality between English and Spanish transformer models, and the differences in language structure, which could make the Spanish classification task more challenging for transformers. The slightly higher human F1 scores for Spanish suggest that the Spanish labeling task is not harder for humans, which is also demonstrated by the IAA scores in Section 3.2.

Confusion matrices of the best-performing models for English (DeBERTa) and Spanish (BERT), displayed in Table 4, offer more insight into performance differences between the Conspiracy and Critical classes. For both languages, the precision for the Conspiracy class is lower than the precision for the Critical class. For English, the precision of Conspiracy detection is 88% compared to 95% for Critical detection, while for Spanish the corresponding precision scores are 78% and 90%. In other words, models confuse conspiratorial messages as critical messages more often than they confuse critical messages as conspiratorial, and this asymmetry is more pronounced for Spanish. This is likely due to the fact that conspiratorial messages can be subtle and indirect in their support for conspiracy theories and are therefore more likely to be confused as critical messages. Interestingly, the IAA scores in Section 3.2 indicate that human agreement on messages being conspiratorial is higher than on messages being critical. This indicates that humans are able to reliably detect conspiracies based on subtle and indirect cues. Therefore, a reasonable first step in improving the Conspiracy vs. Critical classifiers would be an analysis of misclassified conspiratorial messages and the identification of the features that are reliable signals of conspiratorial discourse.

	Englisch		Spanish
predicted	conspiracy	critical	conspiracy	critical
conspiracy	301.60	35.20	284.60	60.80
critical	43.20	620.00	81.00	573.60

Table 4: Confusion matrices (averaged over the five cross-validation folds) for classification of texts into Conspiracy and Critical classes, for the best-performing models: DeBERTa for English and BERT for Spanish.

To summarize, the previous results demonstrate that the Critical vs. Conspiracy task is tractable, and that the expected performance of transformer classifiers is good for Spanish and very good for English, which is an expected result of combining high-quality annotations with strong transformer classifiers. High F1 scores for the detection of critical messages suggest that these classifiers could be reliably applied to make the automatic conspiracy-filtering tools more fair. In this use-case, the texts labeled as Critical could either be automatically unflagged as Conspiracy, or routed to humans for closer inspection.

4.2 Detection of Narrative Elements

We proceeded with the identification, on the fine-grained level of token spans, of the six elements of oppositional narratives defined in our annotation scheme (for details, see Section 3.2). Our task includes both long and overlapping spans (of different categories). Similar tasks have been carried out in the context of propaganda detection (Da San Martino et al., 2019, 2020), skill extraction (Zhang et al., 2022), and the analysis of literary texts (Weimer et al., 2022b).

For evaluation, we used the adapted F1 measure of Da San Martino et al. (2019) which accounts for partially correct predictions by looking at span overlap (span-F1). This approach offers a fairer evaluation in tasks with long spans, and with inherent subjectivity of the span boundaries. For tasks like traditional non-nested NER, where named entities are shorter and expectedly have well-defined boundaries, exact matching is a reasonable method of evaluation.

We approached the task by fine-tuning transformer models with token classification heads. To account for the possibility of overlapping spans with different categories, we used 6 separate per-category heads that performed BIO sequence tagging. We employed multi-task learning (Ruder, 2017) by connecting the per-category taggers to the same transformer backbone. Multi-task learning has several advantages such as improved regularization and implicit data augmentation (Ruder, 2017), and the described approach was successfully deployed for a similar task of span-level skill extraction (Zhang et al., 2022).

We experimented with the same transformers used for binary classification (Section 4.1) and we used the same configuration and hyperparameters. The exception was the number of epochs, which we increased to 10 in order to accommodate for the increased task complexity. Gold Standard data was used and it was adapted for multi-task learning by creating a separate training instance for each text and each span category, which resulted in a large proportion of texts without spans. In order to avoid overfitting to the unlabeled tokens, empty texts were downsampled (per category) to 10% of the original size. Five-fold cross-validation was used to obtain performance estimates, in the same way as in Section 4.1.

The results, presented in the top half of Table 5, show the average and per-category span-F1 scores for both languages. For English, DeBERTaV3 outperforms RoBERTa, which outperforms BERT, while for Spanish RoBERTa outperforms BERT. The per-category performance correlates well with the number of spans annotated with a category, i.e. with the size of the training data for the corresponding token classifier. For example, the (less frequent) Facilitators and Objectives categories are harder to detect. Additionally, the Spanish BERT and RoBERTa models are better than their English counterparts for the Victims and Negative Effects categories which are more frequent in Spanish, while for most of the other categories the English models work better. For English, the most sophisticated DeBERTaV3 model achieves noticeable performance gains for most of the categories.

Additionally, we experimented with the SpanBERT model (Joshi et al., 2020) as the transformer backbone. SpanBERT is a model based on a pre-training method “designed to better represent and predict spans of text” (Joshi et al., 2020). The SpanBERT model achieved an inferior performance of 0.31 average span-F1 on English texts (Spanish SpanBERT model does not exist). This is in line with the skill extraction experiment (Zhang et al., 2022), in which the SpanBERT model was not the best solution and it rarely outperformed BERT.

Finally, we experimented using CRF (Lafferty et al., 2001), a probabilistic model that learns patterns of transition between token classes⁵⁵5We used the pytorch-crf package: https://pytorch-crf.readthedocs.io, as the final token classification layer. We experimented with two CRF-based approaches: one in which the CRF layers were optimized together with the transformer (end-to-end), and the other in which the CRF layers were optimized after training and freezing the transformer (fine-tuned). Both of the CRF-based approaches led to an inferior performance. In terms of the span-F1 measure, the average (end-to-end CRF, fine-tuned CRF) results for English were: (0.23, 0.19) for BERT, (0.37, 0.36) for RoBERTa, and (0.47, 0.46) for DeBERTa. For Spanish, the results were (0.32, 0.38) for BERT, and (0.23, 0.25) for RoBERTa. The likely reason for the inferior performance of CRF is that its inductive bias (an assumption that there is valuable information in the surface patterns of transition between token tags) does not fit well with the nature of the data in our case. This is supported by the average number of spans of the same category per text (in which the category occurs), which is 2.29 for English and 2.39 for Spanish.

In terms of the results on a different but similar task, best span labeling models detecting a single “propaganda” category in English news articles achieved a span-F1 of 0.52 (Da San Martino et al., 2020). While direct comparison is not possible, we take these results as evidence that our approach (average span-F1 of 0.57 for English and 0.50 for Spanish) achieves solid performances on a challenging task.

Regarding RQ2, we can say that while the performance of the proposed approach is not below the top performance expected from a similar task, there is room for further improvement, especially for the less frequent categories. Techniques that could improve performance on the less frequent categories include data augmentation (Feng et al., 2021) and active learning (Erdmann et al., 2019).

		Englisch							Spanish
		Avg.	A	F	C	V	O	E	Avg.	A	F	C	V	O	E
	num. spans	4,727	6,411	3,462	6,416	4,433	2,073	5,565	3,880	3,329	2,688	4,231	5,260	622	7,150
SPAN	BERT	0.50	0.59	0.38	0.54	0.57	0.38	0.54	0.48	0.46	0.39	0.50	0.57	0.33	0.64
	RoBERTa	0.52	0.60	0.41	0.62	0.58	0.36	0.55	0.50	0.51	0.37	0.54	0.62	0.33	0.64
	DeBERTa	0.57	0.63	0.45	0.61	0.63	0.49	0.62
BINARY	BERT	0.73	0.82	0.60	0.79	0.77	0.64	0.78	0.72	0.72	0.60	0.75	0.83	0.59	0.85
	RoBERTa	0.76	0.84	0.64	0.82	0.80	0.66	0.80	0.69	0.71	0.58	0.74	0.83	0.57	0.84
	DeBERTa	0.77	0.84	0.65	0.83	0.80	0.69	0.81

Table 5: Performance of BERT, RoBERTa, and DeBERTaV3 transformers on the tasks of span labeling (SPAN), and binary detection (BINARY) of the text-level narrative categories, in terms of span-F1 and binary F1 measures, respectively. Total number of spans in the dataset for each of the categories is given in the top row. The categories are: Agents (A), Facilitators (F), Campaigners (C), Victims (V), Objectives (O), Negative Effects (E).

4.3 Detection of Narrative Elements

While narrative categories in Section 4.2 were detected on the token level, here we will tackle their text-level detection. For each category we assigned binary labels to texts, labeling a text as positive if the category was present in the text, i.e. represented by at least one Gold Standard span. Such per-category binary variables can convey useful information: the presence of spans labeled as Facilitator or Campaigner is a signal of inter-group conflict (for details, see Section 5).

The models and their configuration are the same as for the Conspiracy vs. Critical (CC) classification in Section 4.1, as well as the training and evaluation. The results in Table 5 show that binary detection of the narrative categories is, on average and for most of the categories, a harder task than CC classification (Table 3). Relative model performances are the same: English models obtain better results than Spanish, with DeBERTaV3 being the most successful, and BERT outperforms RoBERTa in Spanish.

The results in Table 5 also show that the binary per-category scores correlate very well with the per-category scores of the sequence labeling models. This is expected since the per-category labeling scores correlate well with the number of spans annotated with a category, which correlates with the number of positive examples for the per-category classification. Similarly as for sequence labeling, the smallest Facilitators and Objectives categories are the most challenging ones and have similar scores.

Regarding RQ3, these results show that text-level detection is a more viable option in applications that require this level or granularity, and that good results can be achieved for most narrative elements. However, the Facilitators and Objectives categories remain a challenge. One could likely improve performance on these less frequent categories by using techniques such as data augmentation Feng et al. (2021) and active learning Settles (2011).

4.4 Error Analysis

The results indicate that the Facilitators (F) and Objectives (O) categories are the most challenging to detect (Table 5). We focus the error analysis on the Facilitators span since it conveys important information about inter-group conflict (Sections 3, 5). We hypothesize that the model confuses the Agent (A) and Facilitator (F) labels, which have a similar role in terms of the “friend vs. foe” discourse. To examine their relationship, we introduce, for each of the categories, a text-level variable that codifies classification outcomes: False Negative, Correct and False Positive. For both categories and for each language, we calculated the outcomes of the best-performing classifiers (see Table 6). Predictions for all five folds are pooled to cover the entire dataset. Applying the chi-square test, which assesses the independence between two categorical variables, we found that the classification outcomes for Facilitators (F) are not independent of the outcomes for Agent (A) ( $p<0.001$ ).

Next we perform the analysis of the residuals, which measure how much a particular combination of outcomes influences the dependence of the variables – large residuals influence the dependence more, and vice versa (Sharpe, 2015). Table 6 shows that there are 81 English texts with missed Facilitators and falsely detected Agents, and that this combination strongly influences the dependence between variables (high residual of 4.3). In our context, the values of the residuals show that there is a strong correlation between Facilitators being missed (False Negative) and Agents being incorrectly identified (False Positive), for both languages. For Spanish, incorrectly identifying Facilitators (False Positive) was significantly linked to missing Agents (False Negative). These results show a tendency to misclassify Facilitators as Agents in both languages, and to misclassify Agents as Facilitators in Spanish. Future models should tackle this confusion, possibly by applying a multi-task approach with a loss function modified to penalize the problematic cases.

Englisch	A-FN	A-Corr	A-FP
F-FN	24 (-2.0)	418 (-2.1)	81 (4.3)
F-Corr	294 (2.6)	3,452 (1.6)	384 (-4.1)
F-FP	17 (-1.4)	289 (0.2)	40 (0.9)
Spanish	A-FN	A-Corr	A-FP
F-FN	46 (0.7)	481 (-3.1)	65 (3.5)
F-Corr	268 (-2.6)	3,514 (5.3)	269 (-4.6)
F-FP	40 (3.2)	278 (-4.2)	39 (2.6)

Table 6: Counts and adjusted residuals (in brackets) for classification outcome variables in the detection of Facilitators (F) and Agents (A). The outcomes are False Negatives (FN), Correct Classification (Corr), and False Positives (FP).

5 Applying the Annotation Scheme

In this section, we illustrate how the proposed annotation scheme can be used for interdisciplinary research in computational linguistics and computational social sciences. First we do a proof-of-concept analysis of the taxonomy to demonstrate that conspiratorial and critical discourses differ in their potential effects on reality by measuring the presence of anger and violence in written texts. Anger is a core emotion that prepares individuals to correct perceived offenses and take action in violent riots (Zeitzoff, 2013). We hypothesize that conspiracy messages differ from critical messages because they tend to create a specific emotional atmosphere with the use of anger [H1] and words connected with violence [H2].

Secondly, we focus on understanding the role of IGC as a powerful force driving violent mobilizations (McCauley and Moskalenko, 2008). This experiment is motivated by the suggestion that CTs are not just pieces of disinformation but narrative artifacts aimed to reinforce group dynamics and coordinate actions (Wagner-Egger et al., 2022). We hypothesize that a greater degree of IGC will be found in conspiratorial messages [H3], and that the use of angry and violent words will be higher in messages where the IGC is present [H4].

To test the previous hypotheses, we define and compute three new variables: anger, violence and intergroup conflict.

Anger: We calculate the amount of angry words in a messasge by applying the LIWC2007 Spanish dictionary (Ramírez-Esparza et al., 2007) and the LIWC2015 English dictionary (Pennebaker et al., 2015). The LIWC dictionary is a standard method in psychological and linguistic research that groups words in emotional and cognitive categories. We define the anger variable as the percentage of words in a text that belong to the anger category in LIWC.

Violence: We aim to measure discourse that promotes political violence (PV) (Bosi and Demetriou, 2015; Krüger et al., 2018). Based on Krüger et al. (2018), we define this kind of discourse as the pragmatic use of a specific vocabulary related to aggression that implicitly normalizes or legitimizes violent acts in order to achieve specific goals. Therefore, we opt for a dictionary-based approach similar to the one used for anger.

The challenge of this approach is to obtain both English and Spanish words that relate specifically to political violence and do so in the context of COVID-19 discourse (e.g. words related to death and injury can merely discuss the pandemic). Therefore, we use a ChatGPT-based method to obtain a list of context-specific PV words for both languages. The method consists of two steps: 1) creation and application of a language-specific ChatGPT prompt; 2) curation of the list by human experts. The prompt asks ChatGPT to extract, from a single message, a list of words that create a violent and aggressive atmosphere. For each language, we applied the corresponding prompt to a sample of 1,000 messages. The output words were lemmatized and checked by two experts, which resulted in a total of 587 Spanish and 710 English words. The words were then used to calculate the percentages of violent words in messages. We have manually examined a sample of messages in order to validate the indices of anger or violence. The prompts, the list of words, and examples of violent messages are provided as supplementary material. The correlation between the presence of words from the two lexicons in the messages of the whole corpus is significant but moderate (R=.49 for English and R=.37 for Spanish). This suggests, as we expected, that the promotion of violence and the expression of anger are related but not entirely equivalent phenomena.

Intergroup conflict: In order to describe the presence of a narrative that focuses on Intergroup Conflict, we rely on the presence of Campaigners and Facilitators in a message (see Section 3.3 for definitions). These two types of actors are related to IGC because they represent the social groups that are in confrontation with each other in real life. Based on the presence of these categories, we created a categorical variable that identifies four manifestations of IGC: no IGC actors are present in the text (C1), only Campaigners are present (C2), only Facilitators are present (C3), both Campaigners and Facilitators are present (C4). While C1 indicates the absence of IGC, C4 indicates a strong presence of IGC, since both groups in conflict are explicitly mentioned.

5.1 Analysis and Results

Firstly, we investigate the difference in the amount of anger and violence words between conspiratorial and critical texts. Since these variables are not normally distributed according to the Kolmogorov-Smirnov test ( $p<0.001$ ), we used the Mann-Whitney U test for comparison of the two groups.

As we observe in Figure 4, conspiratorial messages convey significantly more anger than critical messages, both in Spanish (U=2352490.5, df=3, $p<0.001$ ) and in English (U=2046581.5, df=3, $p<0.001$ ), which confirms [H1]. Similarly, we observe that conspiratorial messages convey significantly more violent words both in Spanish (U=2046572.5, df=3, $p<0.001$ ) and in English (U=2030201.5, df=3, $p<0.001$ ), which confirms [H2].

Next we test whether a larger degree of IGC is present in conspiratorial messages than in critical messages. To this end, we perform a chi-square test and a residual analysis which helps to interpret the chi-square test: the size of the residual indicates a major contribution of a cell to the chi-square significance (for more details, see Sharpe (2015)). The results confirm a significant relation between the IGC variable, and the conspiracy vs. critical variable, both for Spanish ( $\chi^{2}$ = 182.02, df = 3, $p<0.001$ ) and for English ( $\chi^{2}$ = 159.16, df = 3, $p<0.001$ ). For both languages, the C4 category (which is a clear indicator of IGC) occurs significantly more in conspiracy than in critical messages, while the opposite is true for C1 (indicator of no IGC), which supports [H3]. Table 7 contains the statistics for the combined texts of both languages.

	Critical	Conspiracy	Alle
C1	43.91% (9.6)	34.11% (-9.6)	40.44%
C2	34.34% (3.8)	30.63% (-3.8)	33.02%
C3	12.27% (-7.2)	17.47% (7.2)	14.12%
C4	9.46% (-12.1)	17.77% (12.1)	12.41%

Table 7: Percentages (and residuals) of IGC categories: in critical, conspiracy, and all texts.

Finally, we tested whether the use of anger and violence words is higher in messages in which IGC is expressed. We use the Kruskal-Wallis test to compare the presence of anger and violence words in the different categories of the IGC variable. We observed a statistically significant effect of the IGC variable on the measures of anger and violence words, both in Spanish (H=96.289, df=3, $p<0.001$ ), and in English (H=79.462, df=3, $p<0.001$ ). As we can see in Figure 5, there is a significantly larger amount of both anger and violence words in messages which have the highest IGC (C4), and a lesser amount in the messages which have the lowest IGC (C1), supporting [H4].

6 Conclusions and Future Work

This research introduces a new topic-agnostic annotation scheme to approach the analysis of oppositional narratives (conspiracy vs. critical) and the presence of intergroup conflict in these narratives. With this scheme, we created the XAI-DisInfodemic corpus, consisting of novel English and Spanish datasets of Telegram messages related to the COVID-19 pandemic. To the best of our knowledge, this is the first corpus addressing the important issue of differentiating between conspiratorial and critical messages, and may lead to the improvement of content moderating systems and to the mitigation of censorship.

The machine learning experiments we conducted show that critical and conspiratorial messages can be differentiated with a high level of accuracy. Span-level detection of oppositional narrative elements has proven to be a more challenging task. We provide strong baseline solutions that should be improved in future work. Possible methods of improvement include advanced multi-task learning methods (Ruder, 2017), the use of multi-lingual models, and the creation of transformers adapted to Telegram messages. The results of the text analysis experiments, based on the annotation scheme, demonstrate that conspiratorial discourse is more harmful since it contains higher levels of anger and political violence. These results support the conclusion that censoring critical messages as if they were conspiratorial is to ignore that these messages represent a discourse of a different nature, since messages containing conspiracy theories are more likely to promote emotions such as anger and to use words directly related to violence than critical messages. A future line of research in the area of social psychology should investigate whether these different language features translate into different effects on the audience.

A potential application of these findings is the use of measures for identifying anger and violence in messages in order to improve the decisions of the classifiers and their explainability. Our experiments could also motivate further research on the dissemination of conspiracy theories, since it would be both theoretically and practically relevant to determine if the presence of IGC is related to an increase in the possibility that a given text becomes viral.

An important direction for future research is to examine whether NLP models can apply our annotation scheme with an accuracy that is high enough to support the findings derived from human-annotated messages. Our experiments and analyses are a first step in this direction.

Ethics Statement

This work focuses on differentiating conspiratorial messages from critical messages, which may contribute to the mitigation of unfair censorship. Furthermore, we develop the understanding of oppositional narratives by proposing technical and conceptual tools for the analysis of conspiracy theories, critical texts, and intergroup conflict online. Therefore, our work has the potential to lead to a better understanding of important social challenges, to mitigate adverse effects of conspiracies and censorship, and to contribute to building a safer online environment.

The messages used in our work have been anonymized removing personal data such as names, telephone numbers, addresses, and bank accounts.

Transformer models are prone to biases that can lead to an unfair or harmful misclassification of critical messages as conspiratorial, and vice versa. Potential researchers should therefore take steps to mitigate such biases by performing the appropriate evaluations and model corrections. A related issue is the definition of Critical and Conspiracy classes, which should be carefully distinguished and correctly identified in order to avoid potential censorship.

Training transformers from scratch consumes a considerable amount of energy and contributes to global warming. The approaches that we propose rely on fine-tuning, which is less of an issue since it consumes much less energy. However, if the implementation of such approaches to new text domains or languages requires the construction of new transformers, we encourage researchers and professionals from private sectors to take into account the environmental consequences.

Limitations

Even though we have designed our annotation scheme to be topic agnostic, we have limited our study to the COVID-19 pandemic. Therefore, in order to validate it even further, it could be applied to other controversial topics that are prone to CTs, such as climate change.

We have focused on messages posted on the Telegram platform, and it remains to be tested to what extent our approach is generalizable to texts from other sources. We have only tested our approach on messages written in English and Spanish, two languages which are highly-resourced and belong to a similar cultural framework. Further experiments are needed to assess the universality of our annotation scheme.

Another limitation of our work is that we have not performed a hyperparameter tuning, but we have chosen to rely on hyperparameters that are commonly used to fine-tune a pre-trained model.

Finally, while we are aware of the importance of taking into account the different perspectives of the annotators, we have not used the learning with disagreement paradigm (Uma et al., 2021) to leverage them in our experiments. We plan to do this as part of future research.

Data and Code Availability

We make the code of the experiments, as well as the data, available.⁶⁶6https://github.com/dkorenci/oppositional-discourse

Acknowledgements

This work was done in the framework of the research project XAI-DisInfodemics: eXplainable AI for disinformation and conspiracy detection during infodemics (PLEC2021-007681) funded by MICIU/AEI/ 10.13039/501100011033 and by “European Union NextGenerationEU/PRTR”.

We would like to thank Mariona Coll Ardanuy for all the helpful advice pertaining to the improvement of the article text and concept.

Conflict of Interest Statement

The authors declare no potential conflict of interest.

References

Bessi [2016] Alessandro Bessi. Personality traits and echo chambers on facebook. Computers in Human Behavior, 65:319–324, December 2016. ISSN 0747-5632. doi: 10.1016/j.chb.2016.08.016. URL https://www.sciencedirect.com/science/article/pii/S0747563216305817.
Bosi and Demetriou [2015] Lorenzo Bosi and Chares Demetriou. Social movements and political violence. International Encyclopedia of the Social & Behavioral Sciences, 12 2015. doi: 10.1016/B978-0-08-097086-8.96040-9.
Böhm et al. [2020] Robert Böhm, Hannes Rusch, and Jonathan Baron. The psychology of intergroup conflict: A review of theories and measures. Journal of Economic Behavior & Organization, 178:947–962, 2020. ISSN 0167-2681. doi: https://doi.org/10.1016/j.jebo.2018.01.020. URL https://www.sciencedirect.com/science/article/pii/S0167268118300209.
[4] José Cañete, Gabriel Chaperon, Rodrigo Fuentes, Jou-Hui Ho, Hojin Kang, and Jorge Pérez. Spanish Pre-trained BERT Model and Evaluation Data, August . URL http://arxiv.org/abs/2308.02976. ArXiv:2308.02976.
Da San Martino et al. [2019] Giovanni Da San Martino, Seunghak Yu, Alberto Barrón-Cedeño, Rostislav Petrov, and Preslav Nakov. Fine-Grained Analysis of Propaganda in News Articles. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pages 5636–5646, Hong Kong, China, November 2019. Association for Computational Linguistics. doi: 10.18653/v1/D19-1565. URL https://aclanthology.org/D19-1565.
Da San Martino et al. [2020] Giovanni Da San Martino, Alberto Barrón-Cedeño, Henning Wachsmuth, Rostislav Petrov, and Preslav Nakov. SemEval-2020 Task 11: Detection of Propaganda Techniques in News Articles. In Proceedings of the Fourteenth Workshop on Semantic Evaluation, pages 1377–1414, Barcelona (online), 2020. International Committee for Computational Linguistics. doi: 10.18653/v1/2020.semeval-1.186. URL https://aclanthology.org/2020.semeval-1.186.
Devlin et al. [2019] Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 4171–4186, Minneapolis, Minnesota, June 2019. Association for Computational Linguistics. doi: 10.18653/v1/N19-1423. URL https://aclanthology.org/N19-1423.
Douglas and Sutton [2023] Karen M. Douglas and Robbie M. Sutton. What are conspiracy theories? a definitional approach to their correlates, consequences, and communication. Annual Review of Psychology, 74(1):271–298, 2023. URL https://doi.org/10.1146/annurev-psych-032420-031329.
Erdmann et al. [2019] Alexander Erdmann, David Joseph Wrisley, Benjamin Allen, Christopher Brown, Sophie Cohen-Bodénès, Micha Elsner, Yukun Feng, Brian Joseph, Béatrice Joyeux-Prunel, and Marie-Catherine de Marneffe. Practical, efficient, and customizable active learning for named entity recognition in the digital humanities. In Jill Burstein, Christy Doran, and Thamar Solorio, editors, Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 2223–2234, Minneapolis, Minnesota, June 2019. Association for Computational Linguistics. doi: 10.18653/v1/N19-1231. URL https://aclanthology.org/N19-1231.
Feng et al. [2021] Steven Y. Feng, Varun Gangal, Jason Wei, Sarath Chandar, Soroush Vosoughi, Teruko Mitamura, and Eduard Hovy. A survey of data augmentation approaches for NLP. In Chengqing Zong, Fei Xia, Wenjie Li, and Roberto Navigli, editors, Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, pages 968–988, Online, August 2021. Association for Computational Linguistics. doi: 10.18653/v1/2021.findings-acl.84. URL https://aclanthology.org/2021.findings-acl.84.
Franks et al. [2017] Bradley Franks, Adrian Bangerter, Martin W. Bauer, Matthew Hall, and Mark C. Noort. Beyond “monologicality”? exploring conspiracist worldviews. Frontiers in Psychology, 8, 2017. ISSN 1664-1078. doi: 10.3389/fpsyg.2017.00861. URL https://www.frontiersin.org/articles/10.3389/fpsyg.2017.00861.
Funkhouser [2022] Eric Funkhouser. A tribal mind: Beliefs that signal group identity or commitment. Mind & Language, 37(3):444–464, 2022. doi: https://doi.org/10.1111/mila.12326. URL https://onlinelibrary.wiley.com/doi/abs/10.1111/mila.12326.
Giachanou et al. [2021] Anastasia Giachanou, Bilal Ghanem, and Paolo Rosso. Detection of conspiracy propagators using psycho-linguistic characteristics. Journal of Information Science, 49(1):3–17, 2021. doi: 10.1177/0165551520985486.
He et al. [2023] Pengcheng He, Jianfeng Gao, and Weizhu Chen. DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing. In International Conference on Learning Representations, September 2023. URL https://openreview.net/forum?id=sE7-XhLxHA.
Holur et al. [2022] Pavan Holur, Tianyi Wang, Shadi Shahsavari, Timothy Tangherlini, and Vwani Roychowdhury. Which side are you on? Insider-Outsider classification in conspiracy-theoretic social media. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 4975–4987, Dublin, Ireland, May 2022. Association for Computational Linguistics. doi: 10.18653/v1/2022.acl-long.341. URL https://aclanthology.org/2022.acl-long.341.
Introne et al. [2020] Joshua Introne, Ania Korsunska, Leni Krsova, and Zefeng Zhang. Mapping the Narrative Ecosystem of Conspiracy Theories in Online Anti-vaccination Discussions. In International Conference on Social Media and Society, pages 184–192. Association for Computing Machinery, July 2020. ISBN 978-1-4503-7688-4. doi: 10.1145/3400806.3400828. URL https://dl.acm.org/doi/10.1145/3400806.3400828.
Joshi et al. [2020] Mandar Joshi, Danqi Chen, Yinhan Liu, Daniel S. Weld, Luke Zettlemoyer, and Omer Levy. SpanBERT: Improving Pre-training by Representing and Predicting Spans. Transactions of the Association for Computational Linguistics, 8:64–77, 01 2020. ISSN 2307-387X. doi: 10.1162/tacl\_a\_00300. URL https://doi.org/10.1162/tacl_a_00300.
Klein et al. [2018] Colin Klein, Peter Clutton, and Vince Polito. Topic Modeling Reveals Distinct Interests within an Online Conspiracy Forum. Frontiers in Psychology, 9, 2018. ISSN 1664-1078. URL https://www.frontiersin.org/articles/10.3389/fpsyg.2018.00189.
Korenčić et al. [2023] Damir Korenčić, Ivan Grubišić, Alejandro Hector Toselli, Berta Chulvi, and Paolo Rosso. Tackling Covid-19 Conspiracies on Twitter using BERT Ensembles, GPT-3 Augmentation, and Graph NNs. In Working Notes Proceedings of the MediaEval 2022 Workshop Bergen, Norway and Online, 2023. URL https://2022.multimediaeval.com/paper8969.pdf.
Krüger et al. [2018] Steffen Krüger, Karl Figlio, and Barry Richards. Fomenting political violence: An introduction. Fomenting Political Violence: Fantasy, Language, Media, Action, pages 1–16, 2018.
Lafferty et al. [2001] John D. Lafferty, Andrew McCallum, and Fernando C. N. Pereira. Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In Proceedings of the Eighteenth International Conference on Machine Learning, ICML ’01, page 282–289, San Francisco, CA, USA, 2001. Morgan Kaufmann Publishers Inc. ISBN 1558607781.
Langguth et al. [2023] Johannes Langguth, Daniel Thilo Schroeder, Petra Filkuková, Stefan Brenner, Jesper Phillips, and Konstantin Pogorelov. Coco: an annotated twitter dataset of covid-19 conspiracy theories. Journal of Computational Social Science, pages 1–42, 2023.
Levy et al. [2021] Sharon Levy, Michael Saxon, and William Yang Wang. Investigating Memorization of Conspiracy Theories in Text Generation. In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, pages 4718–4729, Online, August 2021. Association for Computational Linguistics. doi: 10.18653/v1/2021.findings-acl.416. URL https://aclanthology.org/2021.findings-acl.416.
Liu et al. [2019] Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov. Roberta: A robustly optimized bert pretraining approach, 2019. ArXiv:1907.11692.
Mahl et al. [2022] Daniela Mahl, Mike S. Schäfer, and Jing Zeng. Conspiracy theories in online environments: An interdisciplinary literature review and agenda for future research. New Media & Society, 0(0):14614448221075759, 2022. doi: 10.1177/14614448221075759. URL https://doi.org/10.1177/14614448221075759.
Mathet et al. [2015] Yann Mathet, Antoine Widlöcher, and Jean-Philippe Métivier. The Unified and Holistic Method Gamma for Inter-Annotator Agreement Measure and Alignment. Computational Linguistics, 41(3):437–479, September 2015. ISSN 0891-2017. doi: 10.1162/COLI\_a\_00227. URL https://doi.org/10.1162/COLI_a_00227.
McCauley and Moskalenko [2008] Clark McCauley and Sophia Moskalenko. Mechanisms of political radicalization: Pathways toward terrorism. Terrorism and Political Violence, 20(3):415–433, 2008. doi: 10.1080/09546550802073367.
Memon and Carley [2020] Shahan Ali Memon and Kathleen M Carley. Characterizing covid-19 misinformation communities using a novel twitter dataset. In Proceedings of the CIKM 2020 Workshops October 19-20, Galway, Ireland, pages 41–44, 2020.
Miani et al. [2021] Alessandro Miani, Thomas Hills, and Adrian Bangerter. Loco: The 88-million-word language of conspiracy corpus. Behavior research methods, pages 1–24, 2021.
Moffitt et al. [2021] J. D. Moffitt, Catherine King, and Kathleen M. Carley. Hunting conspiracy theories during the covid-19 pandemic. Social Media + Society, 7(3), 2021. doi: 10.1177/20563051211043212.
Müller et al. [2023] Martin Müller, Marcel Salathé, and Per E. Kummervold. Covid-twitter-bert: A natural language processing model to analyse covid-19 content on twitter. Frontiers in Artificial Intelligence, 6, 2023. ISSN 2624-8212. doi: 10.3389/frai.2023.1023281. URL https://www.frontiersin.org/articles/10.3389/frai.2023.1023281.
Noy [2008] Chaim Noy. Sampling knowledge: The hermeneutics of snowball sampling in qualitative research. International Journal of Social Research Methodology, 11(4):327–344, 2008. doi: 10.1080/13645570701401305.
Pennebaker et al. [2015] James W Pennebaker, Ryan L Boyd, Kayla Jordan, and Kate Blackburn. The Development and Psychometric Properties of LIWC2015. Texas ScholarWorks, 2015. URL http://hdl.handle.net/2152/31333.
Peskine et al. [2021] Youri Peskine, Giulio Alfarano, Ismail Harrando, Paolo Papotti, and Raphael Troncy. Detecting covid-19-related conspiracy theories in tweets. In MediaEval 2021, MediaEval Benchmarking Initiative for Multimedia Evaluation Workshop, 13-15 December 2021, 2021.
Peskine et al. [2023] Youri Peskine, Paolo Papotti, and Raphaël Troncy. Detection of COVID-19-Related Conpiracy Theories in Tweets using Transformer-Based Models and Node Embedding Techniques. In Working Notes Proceedings of the MediaEval 2022 Workshop Bergen, Norway and Online, 2023.
Phadke et al. [2021] Shruti Phadke, Mattia Samory, and Tanushree Mitra. What makes people join conspiracy communities? role of social factors in conspiracy engagement. Proc. ACM Hum.-Comput. Interact., 4(CSCW3), jan 2021. doi: 10.1145/3432922. URL https://doi.org/10.1145/3432922.
Pogorelov et al. [2021] Konstantin Pogorelov, Daniel Thilo Schroeder, Stefan Brenner, and Johannes Langguth. FakeNews: Corona Virus and Conspiracies Multimedia Analysis Task at MediaEval 2021. In Working Notes Proceedings of the MediaEval 2021 Workshop Bergen, Norway and Online, 2021.
Pogorelov et al. [2023] Konstantin Pogorelov, Daniel Thilo Schroeder, Stefan Brenner, Asep Maulana, and Johannes Langguth. Combining tweets and connections graph for fakenews detection at mediaeval 2022. In Proceedings of the MediaEval 2022 Workshop, Bergen, Norway and Online, 12-13 January 2023., 2023.
Ramírez-Esparza et al. [2007] Nairán Ramírez-Esparza, James W. Pennebaker, Florencia Andrea García, and Raquel Suriá. La psicología del uso de las palabras: Un programa de computadora que analiza textos en Español. [The psychology of word use: A computer program that analyzes texts in Spanish.]. Revista Mexicana de Psicología, 24(1):85–99, 2007. ISSN 0185-6073(Print).
Rosa et al. [2022] Javier De la Rosa, Eduardo G. Ponferrada, Manu Romero, Paulo Villegas, Pablo González de Prado Salas, and María Grandury. BERTIN: Efficient Pre-Training of a Spanish Language Model using Perplexity Sampling. Procesamiento del Lenguaje Natural, 68(0):13–23, March 2022. ISSN 1989-7553. URL http://journal.sepln.org/sepln/ojs/ojs/index.php/pln/article/view/6403. Number: 0.
Ruder [2017] Sebastian Ruder. An Overview of Multi-Task Learning in Deep Neural Networks, June 2017. URL http://arxiv.org/abs/1706.05098. arXiv:1706.05098.
Samory and Mitra [2018] Mattia Samory and Tanushree Mitra. ’The Government Spies Using Our Webcams’: The Language of Conspiracy Theories in Online Discussions. Proceedings of the ACM on Human-Computer Interaction, 2(CSCW):1–24, November 2018. ISSN 2573-0142. doi: 10.1145/3274421. URL https://dl.acm.org/doi/10.1145/3274421.
Settles [2011] Burr Settles. Closing the loop: Fast, interactive semi-supervised annotation with queries on features and instances. In Regina Barzilay and Mark Johnson, editors, Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, pages 1467–1478, Edinburgh, Scotland, UK., July 2011. Association for Computational Linguistics. URL https://aclanthology.org/D11-1136.
Sharpe [2015] D. Sharpe. Chi-square test is statistically significant: Now what? Practical Assessment, Research, and Evaluation, 20, 2015. doi: 10.7275/tbfa-x148.
Song et al. [2021] Xingyi Song, Johann Petrak, Ye Jiang, Iknoor Singh, Diana Maynard, and Kalina Bontcheva. Classification aware neural topic model for covid-19 disinformation categorisation. PloS one, 16(2):e0247086, 2021.
Sutton and Douglas [2022] Robbie M. Sutton and Karen M. Douglas. Rabbit hole syndrome: Inadvertent, accelerating, and entrenched commitment to conspiracy beliefs. Current Opinion in Psychology, 48:101462, 2022. ISSN 2352-250X. doi: https://doi.org/10.1016/j.copsyc.2022.101462. URL https://www.sciencedirect.com/science/article/pii/S2352250X2200183X.
Uma et al. [2021] A. Uma, T. Fornaciari, D. Hovy, S. Paun, B. Plank, and M. Poesio. Learning from Disagreement: A Survey. Journal of Artificial Intelligence Research, 72:1385–1470, 2021.
Uscinski et al. [2011] Joseph E. Uscinski, Joseph Parent, and Bethany Torres. Conspiracy Theories are for Losers, 2011. URL https://papers.ssrn.com/abstract=1901755. APSA 2011 Annual Meeting Paper.
Vergani et al. [2022] Matteo Vergani, Alfonso Martinez Arranz, Ryan Scrivens, and Liliana Orellana. Hate speech in a telegram conspiracy channel during the first year of the covid-19 pandemic. Social Media+ Society, 8(4):20563051221138758, 2022.
Wagner-Egger et al. [2022] Pascal Wagner-Egger, Adrian Bangerter, Sylvain Delouvée, and Sebastian Dieguez. Awake together: Sociopsychological processes of engagement in conspiracist communities. Current Opinion in Psychology, 47:101417, 2022. ISSN 2352-250X. doi: https://doi.org/10.1016/j.copsyc.2022.101417. URL https://www.sciencedirect.com/science/article/pii/S2352250X22001385.
Weimer et al. [2022a] Anna Mareike Weimer, Florian Barth, Tillmann Dönicke, Luisa Gödeke, Hanna Varachkina, Anke Holler, Caroline Sporleder, and Benjamin Gittel. The (In-)Consistency of Literary Concepts. Operationalising, Annotating and Detecting Literary Comment. Journal of Computational Literary Studies, 1(1), December 2022a. ISSN 2940-1348. doi: 10.48694/jcls.90. URL https://jcls.io/article/id/90/. Number: 1 Publisher: Universitäts- und Landesbibliothek Darmstadt.
Weimer et al. [2022b] Anna Mareike Weimer, Florian Barth, Tillmann Dönicke, Luisa Gödeke, Hanna Varachkina, Anke Holler, Caroline Sporleder, and Benjamin Gittel. The (in-)consistency of literary concepts. operationalising, annotating and detecting literary comment. Journal of Computational Literary Studies, 1, 12 2022b. ISSN 2940-1348. doi: 10.48694/jcls.90. URL https://jcls.io/article/id/90/.
Zeitzoff [2013] Thomas Zeitzoff. Anger, exposure to violence and intragroup conflict: A “lab in the field” experiment in southern israel. Political Psychology, 35, 10 2013. doi: 10.1111/pops.12065.
Zhang et al. [2022] Mike Zhang, Kristian Jensen, Sif Sonniks, and Barbara Plank. SkillSpan: Hard and soft skill extraction from English job postings. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 4962–4984, Seattle, United States, July 2022. Association for Computational Linguistics. doi: 10.18653/v1/2022.naacl-main.366. URL https://aclanthology.org/2022.naacl-main.366.