License: CC BY 4.0
arXiv:2402.15525v1 [cs.CL] 19 Feb 2024

Detecting misinformation through Framing Theory: the Frame Element-based Model

Guan Wang
Auckland University of Technology
Auckland, New Zealand
[email protected]
&Rebecca Frederick
Auckland University of Technology
Auckland, New Zealand
[email protected]
&Jinglong Duan
Auckland University of Technology
Auckland, New Zealand
[email protected]
&William Wong
Auckland University of Technology
Auckland, New Zealand
[email protected]
&Verica Rupar
Auckland University of Technology
Auckland, New Zealand
[email protected]
&Weihua Li
Auckland University of Technology
Auckland, New Zealand
[email protected]
&Quan Bai
University of Tasmania
Hobart, Australia
[email protected]
Abstract

In this paper, we delve into the rapidly evolving challenge of misinformation detection, with a specific focus on the nuanced manipulation of narrative frames — an under-explored area within the AI community. The potential for Generative AI models to generate misleading narratives underscores the urgency of this problem. Drawing from communication and framing theories, we posit that the presentation or ‘framing’ of accurate information can dramatically alter its interpretation, potentially leading to misinformation. We highlight this issue through real-world examples, demonstrating how shifts in narrative frames can transmute fact-based information into misinformation. To tackle this challenge, we propose an innovative approach leveraging the power of pre-trained Large Language Models and deep neural networks to detect misinformation originating from accurate facts portrayed under different frames. These advanced AI techniques offer unprecedented capabilities in identifying complex patterns within unstructured data critical for examining the subtleties of narrative frames. The objective of this paper is to bridge a significant research gap in the AI domain, providing valuable insights and methodologies for tackling framing-induced misinformation, thus contributing to the advancement of responsible and trustworthy AI technologies. Several experiments are intensively conducted and experimental results explicitly demonstrate the various impact of elements of framing theory proving the rationale of applying framing theory to increase the performance in misinformation detection.

Keywords Misinformation detection  \cdot framing analysis  \cdot framing extraction  \cdot deep learning

1 Introduction

Misinformation in today’s media landscape is growing substantively, where fake news and false or misleading information is disseminated through various media channels, e.g., news stories and online social media platforms [1]. Artificial Intelligence (AI) has advanced from exclusively understanding language to Generative AI (GAI) models that can automatically generate articles, posts, and narratives with remarkable sophistication [2]. The accessibility of GAI models such as ChatGPT has expedited the process of creating manipulative misinformation, and in most cases, it can be difficult for readers to distinguish whether the narrative was written by a GAI model or a human author [3]. Automating fact-checking or claim validation is a well-researched task that has achieved high accuracy results with traditional misinformation detection focused on keywords [4, 5]. However, when presented with accurate facts where frames have been used to manipulate the information of the narrative to be misleading, it is difficult to identify the misinformation. The framing of accurate information by selecting and highlighting some aspects while simultaneously omitting other aspects to present an alternative perspective can lead to the communication of a different message than the original, accurate narrative intended [6]. The aforementioned manipulation of accurate information by changing the perspective and frame can result in the propagation of misinformation, thus, framing plays an important role in misinformation detection.

Framing theory illuminates the process by which communicators strategically highlight specific facets of a perceived reality within a communication text [7]. This intentional emphasis serves to advance a distinct problem definition, causal interpretation, moral evaluation, and/or treatment recommendation [7]. Framing involves the selection of some factors about an issue or event, and to make salient, or to emphasize, these factors over other factors. It is about selecting and deciding which parts of a situation or event to make salient to an audience. It also suggests how information is presented and communicated in a narrative - the story that communicates the facts in a meaningful way - can influence an individual’s perception and interpretation of that information and is recognized as an important concept in the communication and social science fields [6, 7, 8, 9, 10]. Additionally, framing theory suggests that four frame elements contribute to how information is presented: problem definition, causal interpretation, moral evaluation, and treatment recommendation [7]. Specifically, the problem definition defines the problem by determining the actions of a causal agent along with their associated costs and benefits and is measured by what is culturally acceptable, while the causal interpretation identifies which forces cause the problem. The moral evaluation makes moral judgments on the causal agent and their effects, with the treatment recommendation offering suggestions to solve the problem and the possible effects these might have. When it comes to misinformation, framing theory suggests that the manner in which information is conveyed or framed can be harnessed to persuade readers into embracing inaccurate or misleading information as truthful. By strategically highlighting specific facts or interpretations while purposefully excluding others, individuals or organizations can craft a specific narrative that aligns with their agenda to mislead the audience [6].

There are numerous research studies concerning the detection of framing, the identification of elements within a frame, and the analysis of framing itself [7, 11, 12]. However, few studies explore how frames impact the emergence of misinformation or which frame element has the greatest impact on the overall frame. It is challenging to classify misinformation stemming from factual information. Therefore, misleading information created by manipulating the frame of a truthful narrative would be undetected by traditional misinformation detection models. In this study, we used three topics that stirred public controversy to evaluate the detection model we were developing: the Three Waters Reform media debate in New Zealand, coverage of Covid-19 globally, and reporting on Nuclear Pollution – more on data sets later in the texts. For example, an excerpt of a factual information narrative with a political frame:

“The proposed three waters reform program harks back to the Havelock North water contamination event in 2016… The government estimates that we’ll need a mind-boggling $120 billion to $185 billion over the next 30 years… The government believes that four entities, aggregating all the water services across the country, offer the best and quickest opportunity to achieve the desired improvements… The review was expanded to cover all three waters, this acknowledges the inter-relationships between the three networks."

Information is presented in a straightforward and factual manner, explaining the motivation behind the reform, the expected costs, and the time frame, describing the government’s belief that larger entities can achieve efficiency gains, and understanding why all three water networks were reviewed. However, an excerpt of a misleading narrative with a semantic frame that uses specific terms to associate the statement with other communication contents or features, including irony, lettering, metaphor and so on [13]. The following example shows the satire/irony which suggests the opposite of the original message:

“Because nothing says ‘clean water’ like shifting responsibility from local government to some fancy-sounding entity, right?… They even established a drinking water regulator to ensure everything meets regulatory standards, because we all know how important it is to regulate things, right?… Because who needs small, local councils when you can have these big entities making all the decisions for you? Efficiency gains are just a bonus, my friends!… Because why bother keeping it simple when you can add some unnecessary complexity?"

Satire, oversimplification, and selective framing are used to mislead as it mocks the idea of clean water as a priority, ignoring the serious health concerns that prompted the government to consider these reforms, downplays the significance of regulatory standards by sarcastically framing them as if they are unnecessary, while the actual cost estimates are not addressed seriously, dismisses the efficiency gains oversimplifying the government’s rationale for proposing larger entities to handle water services and sarcastically dismisses the complexity of reviewing all three waters, suggesting that it is unnecessarily complicated.

Pre-trained Large Language Models (LLM) and deep neural networks have been acknowledged as efficient and effective techniques to address the framing classification and misinformation detection problem since they can learn from unstructured data and identify complex patterns that are difficult to detect using traditional methods [1].

Our hypothesis is that news or articles on the same topic can be converted into misinformation when given different frames, and in this paper, we use the frame of a narrative and the frame elements as key considerations in the process of identifying misinformation.

Our contributions of this research work include:

  • We formally define misinformation that is portrayed from the facts and formulate the misinformation detection problem in the context of Generative AI.

  • We propose a novel model called Framed Element-based Model (FEM), which can effectively identify misinformation stemming from portrayed facts under different framing. To the best of our knowledge, this is the first full research work, tackling the framing-based misinformation detection problem.

  • We are the first to investigate how framing elements affect misinformation detection, treating each element as a separate feature for the language model to process. Our research systematically examines these elements, offering important insights into the subtle ways information can be skewed using framing. This also enhances the accuracy and effectiveness of detecting misinformation.

The rest of this work is organized as follows. In Section 2 we discuss related works by examining misinformation detection and framing theory. In Section 3, we give the formal definitions and formulate the problem. Our proposed FEM model is then explained in Section 4. Experimental setups and datasets are introduced in Section 5. In Section 6, four experiments are conducted to evaluate the proposed model, analyze the parameters of the four framing elements, and introduce a case study which provides tangible illustrations of the model’s effectiveness. Lastly, in Section 7, we conclude the paper and give recommendations for future work.

2 Related Work

2.1 Traditional Misinformation Detection

With the rise of social media, the ease with which information can be distributed and consumed has increased, allowing misinformation also to increase [1]. Traditional rule-based misinformation detection for fact-checking and fake news focused on detecting misinformation by focusing on who provided the information or what the content of the information was. Manual fact-checking relied on the author’s reputation and/or the source to determine the veracity of the information [14]. Similarly, to detect fake news on social media, the social contexts, such as explicit and implicit features of user’s profiles, are evaluated to determine the credibility of the information [15]. In addition to social contexts, fake news detection focuses on the content of the text by extracting linguistic features in order to detect sensational headlines that are frequent in fake news [15]. Moreover, identifying negation keywords, such as ‘no,’ ‘not,’ or ‘never,’ played a significant role in enhancing the classification of rumors [16]. Traditional rule-based approaches relied on information specific to the topic to correctly identify misinformation, therefore, these approaches experienced limitations when detecting misinformation about a new topic [17]. These shortcomings were addressed with the introduction of semi-supervised and unsupervised methods [18].

2.2 Deep Learning Based Misinformation Detection

Many researchers have explored the use of deep learning techniques to automate misinformation detection, such as tensor and transformer-based models and convolutional and recurrent neural networks [1, 19, 20, 21, 22]. Latent patterns and spatial context were extracted from tensor-based models to construct k-nearest-neighbour graphs and belief propagation for semi-supervised misinformation detection [19]. A hybrid of convolutional neural networks (CNN) and recurrent neural networks (RNN) leverages the strengths of CNN in extracting local features and of RNN in capturing long-term dependencies to detect fake news [20]. Another RNN model found that combining sentiment, emotional, irony and hate analysis with bagging, boosting, stacking and voting means, produced a higher accuracy than without the various analyses [22]. An evaluation of transformer-based models Large Language Models, namely, BERT variants to be used as baselines for misinformation detection, can achieve comparable or better performance than more complex state-of-the-art methods [21]. More recently, a transformer-based model, MisRoBÆRTa, utilized RoBERTa and BART to outperform single transformer misinformation detection models [23]. Finally, a hybrid deep learning model integrating features-based models and universal sentence encoding revealed promising results on the PHEME dataset [24].

While these techniques are able to accurately detect misinformation without considering the narrative or frame, their challenge lies in dealing with misinformation stemming from factual events that are skewed to convey a different implication. Furthermore, they also face difficulties handling lengthy news articles that potentially contain both truthful and misleading information.

2.3 Framing Theory

The frame of a piece of text can increase the salience of specific parts of information, i.e., to make information more meaningful, noticeable, or memorable [7]. An example by Entman showed that a frame can influence how a large portion of readers notice, understand, remember, evaluate, or act upon information presented to them [7]. According to Entman, the problem definition, causal interpretation, moral evaluation, and treatment recommendation are the four identifiable elements of a frame [7]. Multiple methods have been developed to detect frames using different approaches. Liu et al. detected frames from news based on the article headlines by fine-tuning a Large Language Model, i.e., BERT [25]. Alternatively, Walter and Ophir leveraged computational tools to develop a novel method, the Analysis of Topic Model Networks, for the inductive identification and categorization of frames [11]. Although both misinformation, frame, and framing element detection are possible, the impact of frames on misinformation detection requires further research. Our proposed FEM explores this impact by incorporating the framing theory presented by Entman to solve the earlier challenge of detecting misinformation stemming from accurate facts that are skewed to be misleading potentially [7]. Additionally, FEM discerns the respective contributions of the four framing elements to the overall accuracy of misinformation detection.

3 Preliminary

In this section, we give formal definitions, and the problem of detecting misinformation portrayed from the facts is also formulated.

3.1 Formal Definition

Definition 1: Narrative generally refers to a way of sharing stories or information, whether it is spoken, written, or shared online. In the current context, the narrative indicates the news stories and articles that are being disseminated online. Let 𝒩={n1,n2,,nn}𝒩subscript𝑛1subscript𝑛2subscript𝑛𝑛\mathcal{N}=\{n_{1},n_{2},\ldots,n_{n}\}caligraphic_N = { italic_n start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_n start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_n start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT } denote the set of narratives, a narrative can be information or misinformation in online social networks, where nisubscript𝑛𝑖n_{i}italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT represents a single narrative.

Definition 2: Information refers to the presentation of facts in a way that aims to convey these facts accurately. The narrative of the information is constructed to reflect its true nature and implications without distorting or omitting key elements. Let 𝒜𝒜\mathcal{I}\in\mathcal{A}caligraphic_I ∈ caligraphic_A refer to the information set:

={(fa,n)fa𝒜n𝒩}conditional-set𝑓𝑎𝑛𝑓𝑎𝒜𝑛𝒩\mathcal{I}=\{(fa,n)\mid fa\in\mathcal{FA}\wedge n\in\mathcal{N}\}caligraphic_I = { ( italic_f italic_a , italic_n ) ∣ italic_f italic_a ∈ caligraphic_F caligraphic_A ∧ italic_n ∈ caligraphic_N } (1)

where fa𝒜𝑓𝑎𝒜fa\in\mathcal{FA}italic_f italic_a ∈ caligraphic_F caligraphic_A refers to a specific fact in a fact set, n𝒩𝑛𝒩n\in\mathcal{N}italic_n ∈ caligraphic_N refers to a narrative of a narrative set for information.

Definition 3: Misinformation, converse to information, involves using the same facts but framing them within a narrative that is designed to mislead, deceive, or manipulate the audiences. The key aspect of misinformation in this work is not the distorted facts, but how they are presented in a misleading narrative. Let 𝒜𝒜\mathcal{M}\in\mathcal{A}caligraphic_M ∈ caligraphic_A represent a set of misinformation which is composed of the content and the specific narrative:

={(fa,n)fa𝒜n𝒩}conditional-set𝑓𝑎𝑛𝑓𝑎𝒜𝑛𝒩\mathcal{M}=\{(fa,n)\mid fa\in\mathcal{FA}\wedge n\in\mathcal{N}\}caligraphic_M = { ( italic_f italic_a , italic_n ) ∣ italic_f italic_a ∈ caligraphic_F caligraphic_A ∧ italic_n ∈ caligraphic_N } (2)

where fa𝒜𝑓𝑎𝒜fa\in\mathcal{FA}italic_f italic_a ∈ caligraphic_F caligraphic_A refers to the same fact in a fact set as information, n𝒩𝑛𝒩n\in\mathcal{N}italic_n ∈ caligraphic_N refers to the narrative of a narrative set for misinformation.

Definition 4: Frame suggests how information is structured and presented in a story, including the perspective from which it is told. It suggests how information is presented in a narrative - the story that communicates the facts in a meaningful way - can influence an individual’s perception and interpretation of that information and is recognized as an important concept in the communication and social science fields. Mathematically, f𝑓fitalic_f represents a frame, and the set of frames is ={fr1,fr2,,frm}𝑓subscript𝑟1𝑓subscript𝑟2𝑓subscript𝑟𝑚\mathcal{FR}=\{fr_{1},fr_{2},\ldots,fr_{m}\}caligraphic_F caligraphic_R = { italic_f italic_r start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_f italic_r start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_f italic_r start_POSTSUBSCRIPT italic_m end_POSTSUBSCRIPT }. The relationship between a frame and a narrative of one article can be represented by :𝒩:𝒩\mathcal{R}:\mathcal{N}\rightarrow\mathcal{FR}caligraphic_R : caligraphic_N → caligraphic_F caligraphic_R, where R(ni)=fri𝑅subscript𝑛𝑖𝑓subscript𝑟𝑖R(n_{i})=fr_{i}italic_R ( italic_n start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_f italic_r start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, and \mathcal{R}caligraphic_R represents the element extractor.

Definition 5: Frame Elements are the specific components used to construct a frame in articles. A frame is generally composed of four elements, and they constitute how information should be displayed in front of the readers and how the readers would perceive the content. Each article has four elements: “problem definition”, “causal interpretation”, “moral evaluation”, and “treatment recommendations”. Let e𝑒eitalic_e represent one of the elements of a frame in an article and i={e1,e2,e3,e4}subscript𝑖subscripte1subscripte2subscripte3subscripte4\mathcal{E}_{i}=\{\text{e}_{1},\text{e}_{2},\text{e}_{3},\text{e}_{4}\}caligraphic_E start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = { e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , e start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , e start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT , e start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT } represents the element set of the article aisubscript𝑎𝑖a_{i}italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT where e1subscript𝑒1e_{1}italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT represents the “problem definition”, e2subscript𝑒2e_{2}italic_e start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT represents the “causal interpretation”, e3subscript𝑒3e_{3}italic_e start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT represents the “moral evaluation”, and e4subscript𝑒4e_{4}italic_e start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT represents the “treatment recommendations”.

3.2 Problem Formulation

The Misinformation Detection problem is defined as the process of classifying articles to identify misinformation stemming from portrayed facts under different narratives thus misleading the audiences. To achieve that, we adopt the Frame Element-based Model (FEM) incorporating the elements of framing theory extracted from the articles. The FEM is trained to understand the semantics and narratives of articles. Having a set of articles 𝒜={ai,a2,,an}𝒜subscript𝑎𝑖subscript𝑎2subscript𝑎𝑛\mathcal{A}=\{a_{i},a_{2},\ldots,a_{n}\}caligraphic_A = { italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_a start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , … , italic_a start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT }, given an article aiAsubscript𝑎𝑖𝐴a_{i}\in Aitalic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ italic_A, the model first extracts the frame elements Ei={e1,e2,e3,e4}subscript𝐸𝑖subscript𝑒1subscript𝑒2subscript𝑒3subscript𝑒4E_{i}=\{e_{1},e_{2},e_{3},e_{4}\}italic_E start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = { italic_e start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT , italic_e start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_e start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT , italic_e start_POSTSUBSCRIPT 4 end_POSTSUBSCRIPT } of the article, and then encode them to get the hidden state hisubscript𝑖h_{i}italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT of it which is later used to calculate the probability to predict if the article is misinformation or not.

P(hi)=softmax(whi+b),𝑃subscript𝑖𝑠𝑜𝑓𝑡𝑚𝑎𝑥𝑤subscript𝑖𝑏P(h_{i})=softmax(w\cdot h_{i}+b),italic_P ( italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) = italic_s italic_o italic_f italic_t italic_m italic_a italic_x ( italic_w ⋅ italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT + italic_b ) , (3)

where P(hi)𝑃subscript𝑖P(h_{i})italic_P ( italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) is the probability that an article aisubscript𝑎𝑖a_{i}italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT contains misinformation, and hisubscript𝑖h_{i}italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT represents the last hidden state of the given article aisubscript𝑎𝑖a_{i}italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT or corresponding element set Eisubscript𝐸𝑖E_{i}italic_E start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT, .

The object of the last step of predicting is defined as minimizing the loss function \mathcal{L}caligraphic_L:

𝐰*,b*=argmin𝐰,b(𝐰,b)+λ𝐰2,superscript𝐰superscript𝑏subscript𝐰𝑏𝐰𝑏𝜆superscriptnorm𝐰2\mathbf{w}^{*},b^{*}=\arg\min_{\mathbf{w},b}\mathcal{L}(\mathbf{w},b)+\lambda% \|\mathbf{w}\|^{2},bold_w start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT , italic_b start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT = roman_arg roman_min start_POSTSUBSCRIPT bold_w , italic_b end_POSTSUBSCRIPT caligraphic_L ( bold_w , italic_b ) + italic_λ ∥ bold_w ∥ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT , (4)

where w*superscript𝑤w^{*}italic_w start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT and b*superscript𝑏b^{*}italic_b start_POSTSUPERSCRIPT * end_POSTSUPERSCRIPT are the target optimal weights, and the loss function \mathcal{L}caligraphic_L which is the cross entropy loss function is defined as:

=1Ni=1N[yilogP(hi)+(1yi)log(1P(hi))],1𝑁superscriptsubscript𝑖1𝑁delimited-[]subscript𝑦𝑖𝑃subscript𝑖1subscript𝑦𝑖1𝑃subscript𝑖\mathcal{L}=-\frac{1}{N}\sum_{i=1}^{N}\left[y_{i}\log P(h_{i})+(1-y_{i})\log(1% -P(h_{i}))\right],caligraphic_L = - divide start_ARG 1 end_ARG start_ARG italic_N end_ARG ∑ start_POSTSUBSCRIPT italic_i = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT italic_N end_POSTSUPERSCRIPT [ italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT roman_log italic_P ( italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) + ( 1 - italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) roman_log ( 1 - italic_P ( italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) ) ] , (5)

where N𝑁Nitalic_N is the number of samples, and yisubscript𝑦𝑖y_{i}italic_y start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is the actual label of the article aisubscript𝑎𝑖a_{i}italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT.

4 Frame Element-based Misinformation Detection Model

Refer to caption

Figure 1: The Architecture of the Frame Element-based Model

In this section, the proposed Frame Element-based Model (FEM) for Misinformation detection is introduced in the context of news articles. Figure 1 demonstrates the overall framework of our proposed model, and in Algorithm 1, we showcase the steps of the whole process.

Initially, the Frame Element Extractor is utilized to process the news article to extract four framing elements: Problem Definition, Causal Interpretation, Moral Evaluation, and Treatment Recommendation. These elements represent the core of how the information is framed. The extracted framing elements, along with the corresponding news article, are then tokenized performing as the fundamental preprocessing step in NLP.

To capture the subtle contextual nuances of each element, following the tokenization, we independently encode each element and the corresponding news article (Lines 6 to 14 in Algorithm 1). This nuanced understanding of different elements is vital as each frame carries different weights and implications for the overall narrative of this news article. The separate encoding also allows us to quantify the impact of each element to reveal the most influential aspects of how the article is framed thus increasing the chance of identifying misinformation.

Independently encoding each element and article is a strategic choice that can enhance the model’s analytical precision, allowing for reliable misinformation detection. Line 6 starts the recurrent process. An empty tensor is created in advance and used to concatenate each embedding from each loop. In the recurrent process, we first encode the article which performs as the main body of the input, and then each element is encoded.

The embeddings of each element embE𝑒𝑚𝑏𝐸embEitalic_e italic_m italic_b italic_E and the news article embA𝑒𝑚𝑏𝐴embAitalic_e italic_m italic_b italic_A are then concatenated to form a dense vector emb𝑒𝑚𝑏embitalic_e italic_m italic_b as the representation of the whole input followed by a dropout layer to prevent overfitting.

embt=concat(embt1,ht),𝑒𝑚subscript𝑏𝑡𝑐𝑜𝑛𝑐𝑎𝑡𝑒𝑚subscript𝑏𝑡1subscript𝑡emb_{t}=concat(emb_{t-1},h_{t}),italic_e italic_m italic_b start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT = italic_c italic_o italic_n italic_c italic_a italic_t ( italic_e italic_m italic_b start_POSTSUBSCRIPT italic_t - 1 end_POSTSUBSCRIPT , italic_h start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT ) , (6)

where embt𝑒𝑚subscript𝑏𝑡emb_{t}italic_e italic_m italic_b start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT represents the concatenated embeddings of the current time step, and htsubscript𝑡h_{t}italic_h start_POSTSUBSCRIPT italic_t end_POSTSUBSCRIPT represents the embeddings of an element embE𝑒𝑚𝑏𝐸embEitalic_e italic_m italic_b italic_E or the article embA𝑒𝑚𝑏𝐴embAitalic_e italic_m italic_b italic_A.

The concatenated embeddings emb𝑒𝑚𝑏embitalic_e italic_m italic_b from the previous layer are then fed to a Bi-LSTM layer. The Bi-LSTM layer is applied for the purpose of capturing the holistic context after all elements and article embeddings are concatenated. This allows the model to understand how different elements of the article relate to each other.

hi=Relu(BiLSTM(emb)),subscript𝑖𝑅𝑒𝑙𝑢𝐵𝑖𝐿𝑆𝑇𝑀𝑒𝑚𝑏h_{i}=Relu(BiLSTM(emb)),italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT = italic_R italic_e italic_l italic_u ( italic_B italic_i italic_L italic_S italic_T italic_M ( italic_e italic_m italic_b ) ) , (7)

where hisubscript𝑖h_{i}italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT is the representation of the input after being processed by a Bi-LSTM layer and a Relu activation function.

A linear layer including a Dropout is applied to map the high-dimensional output representation hisubscript𝑖h_{i}italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT from the Bi-LSTM layer to the target space. The softmax function is used to obtain the probability distribution over the potential classes, which finalizes the prediction process to identify the misinformation.

predicts=softmax(Dropout(hi)WT+b),𝑝𝑟𝑒𝑑𝑖𝑐𝑡𝑠𝑠𝑜𝑓𝑡𝑚𝑎𝑥𝐷𝑟𝑜𝑝𝑜𝑢𝑡subscript𝑖superscript𝑊𝑇𝑏predicts=softmax(Dropout(h_{i})W^{T}+b),italic_p italic_r italic_e italic_d italic_i italic_c italic_t italic_s = italic_s italic_o italic_f italic_t italic_m italic_a italic_x ( italic_D italic_r italic_o italic_p italic_o italic_u italic_t ( italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ) italic_W start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT + italic_b ) , (8)

where predicts𝑝𝑟𝑒𝑑𝑖𝑐𝑡𝑠predictsitalic_p italic_r italic_e italic_d italic_i italic_c italic_t italic_s is the probability distribution of the class labels, WTsuperscript𝑊𝑇W^{T}italic_W start_POSTSUPERSCRIPT italic_T end_POSTSUPERSCRIPT is the learnable weight matrix, b𝑏bitalic_b refers to the bias.

Algorithm 1 Frame Element-based Misinformation detection Algorithm

Input: D=(a,E)𝐷𝑎𝐸D=(a,\ E)italic_D = ( italic_a , italic_E )
Output: 0 (misinformation) or 1 (information)

1:InformationCollect(sources)𝐼𝑛𝑓𝑜𝑟𝑚𝑎𝑡𝑖𝑜𝑛𝐶𝑜𝑙𝑙𝑒𝑐𝑡𝑠𝑜𝑢𝑟𝑐𝑒𝑠Information\leftarrow Collect(sources)italic_I italic_n italic_f italic_o italic_r italic_m italic_a italic_t italic_i italic_o italic_n ← italic_C italic_o italic_l italic_l italic_e italic_c italic_t ( italic_s italic_o italic_u italic_r italic_c italic_e italic_s )
2:MisinformationChatGPT(information,prompt1)𝑀𝑖𝑠𝑖𝑛𝑓𝑜𝑟𝑚𝑎𝑡𝑖𝑜𝑛𝐶𝑎𝑡𝐺𝑃𝑇𝑖𝑛𝑓𝑜𝑟𝑚𝑎𝑡𝑖𝑜𝑛𝑝𝑟𝑜𝑚𝑝𝑡1Misinformation\leftarrow ChatGPT(information,\ prompt1)italic_M italic_i italic_s italic_i italic_n italic_f italic_o italic_r italic_m italic_a italic_t italic_i italic_o italic_n ← italic_C italic_h italic_a italic_t italic_G italic_P italic_T ( italic_i italic_n italic_f italic_o italic_r italic_m italic_a italic_t italic_i italic_o italic_n , italic_p italic_r italic_o italic_m italic_p italic_t 1 )
3:FrameElementsChatGPT(articles,prompt2)𝐹𝑟𝑎𝑚𝑒𝐸𝑙𝑒𝑚𝑒𝑛𝑡𝑠𝐶𝑎𝑡𝐺𝑃𝑇𝑎𝑟𝑡𝑖𝑐𝑙𝑒𝑠𝑝𝑟𝑜𝑚𝑝𝑡2FrameElements\leftarrow ChatGPT(articles,\ prompt2)italic_F italic_r italic_a italic_m italic_e italic_E italic_l italic_e italic_m italic_e italic_n italic_t italic_s ← italic_C italic_h italic_a italic_t italic_G italic_P italic_T ( italic_a italic_r italic_t italic_i italic_c italic_l italic_e italic_s , italic_p italic_r italic_o italic_m italic_p italic_t 2 )
4:Create BERT, BiLSTM, FC Layer as classifier, Dropout, Relu
5:emb:={}assign𝑒𝑚𝑏emb:=\{\}italic_e italic_m italic_b := { }
6:for ai,EiDsubscript𝑎𝑖subscript𝐸𝑖𝐷a_{i},E_{i}\in Ditalic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_E start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∈ italic_D do
7:     embA:=BERT(ai)assign𝑒𝑚𝑏𝐴𝐵𝐸𝑅𝑇subscript𝑎𝑖embA:=BERT(a_{i})italic_e italic_m italic_b italic_A := italic_B italic_E italic_R italic_T ( italic_a start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT )
8:     emb:=concat(emb,embA)assign𝑒𝑚𝑏𝑐𝑜𝑛𝑐𝑎𝑡𝑒𝑚𝑏𝑒𝑚𝑏𝐴emb:=concat(emb,\ embA)italic_e italic_m italic_b := italic_c italic_o italic_n italic_c italic_a italic_t ( italic_e italic_m italic_b , italic_e italic_m italic_b italic_A )
9:     for ejEisubscript𝑒𝑗subscript𝐸𝑖e_{j}\in E_{i}italic_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∈ italic_E start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT do
10:         embE:=BERT(ej)assign𝑒𝑚𝑏𝐸𝐵𝐸𝑅𝑇subscript𝑒𝑗embE:=BERT(e_{j})italic_e italic_m italic_b italic_E := italic_B italic_E italic_R italic_T ( italic_e start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT )
11:         emb:=concat(emb,embE)assign𝑒𝑚𝑏𝑐𝑜𝑛𝑐𝑎𝑡𝑒𝑚𝑏𝑒𝑚𝑏𝐸emb:=concat(emb,\ embE)italic_e italic_m italic_b := italic_c italic_o italic_n italic_c italic_a italic_t ( italic_e italic_m italic_b , italic_e italic_m italic_b italic_E )
12:     end for
13:     emb:=Dropout(emb)assign𝑒𝑚𝑏𝐷𝑟𝑜𝑝𝑜𝑢𝑡𝑒𝑚𝑏emb:=Dropout(emb)italic_e italic_m italic_b := italic_D italic_r italic_o italic_p italic_o italic_u italic_t ( italic_e italic_m italic_b )
14:end for
15:outputs:=BiLSTM(emb)assign𝑜𝑢𝑡𝑝𝑢𝑡𝑠𝐵𝑖𝐿𝑆𝑇𝑀𝑒𝑚𝑏outputs:=BiLSTM(emb)italic_o italic_u italic_t italic_p italic_u italic_t italic_s := italic_B italic_i italic_L italic_S italic_T italic_M ( italic_e italic_m italic_b )
16:h:=Relu(outputs)assign𝑅𝑒𝑙𝑢𝑜𝑢𝑡𝑝𝑢𝑡𝑠h:=Relu(outputs)italic_h := italic_R italic_e italic_l italic_u ( italic_o italic_u italic_t italic_p italic_u italic_t italic_s )
17:logits:=classifier(Dropout(h))assign𝑙𝑜𝑔𝑖𝑡𝑠𝑐𝑙𝑎𝑠𝑠𝑖𝑓𝑖𝑒𝑟𝐷𝑟𝑜𝑝𝑜𝑢𝑡logits:=classifier(Dropout(h))italic_l italic_o italic_g italic_i italic_t italic_s := italic_c italic_l italic_a italic_s italic_s italic_i italic_f italic_i italic_e italic_r ( italic_D italic_r italic_o italic_p italic_o italic_u italic_t ( italic_h ) )
18:predicts:=softmax(logits)assign𝑝𝑟𝑒𝑑𝑖𝑐𝑡𝑠𝑠𝑜𝑓𝑡𝑚𝑎𝑥𝑙𝑜𝑔𝑖𝑡𝑠predicts:=softmax(logits)italic_p italic_r italic_e italic_d italic_i italic_c italic_t italic_s := italic_s italic_o italic_f italic_t italic_m italic_a italic_x ( italic_l italic_o italic_g italic_i italic_t italic_s )

5 Experiment Setups

5.1 Model Setup

To ensure an efficient training process, we conduct our experiments on the Paperspace 111https://www.paperspace.com/ platform utilizing the following tailored computational and training settings to the unique demands of each dataset:

  • GPU Configuration: The model is trained over a span of 100 epochs utilizing NVIDIA’s A6000 48GB GPU and 45 GB 8 CPU.

  • Dropout: To mitigate the risk of overfitting, a dropout rate of 0.30.30.30.3 was applied during training.

  • Learning Rate: The training uses an initial learning rate of 1×1051superscript1051\times 10^{-5}1 × 10 start_POSTSUPERSCRIPT - 5 end_POSTSUPERSCRIPT and it is modulated following a cosine schedule with a warm-up phase. The warm-up steps vary in accordance with the specificities of each dataset.

  • Batch Size: The batch size is determined based on the particular requirements and characteristics of each dataset.

  • Frame Element Extractor: ChatGPT, as a powerful generative AI model, is used as the element extractor. Different extractors can be applied for the same purpose.

5.2 Datasets

In this section, we introduce 4 datasets used to evaluate our model. To assess the generalization capability of the model, we used three single-topic datasets which are the Three Waters Reform dataset, Covid-19 dataset, Nuclear Pollution dataset, and a mixed-topic dataset which is the Kaggle Fake News dataset. The statistics of these datasets are displayed in Table 1.

  • Three Waters Reform dataset is collected from The Knowledge Basket222https://www.knowledge-basket.co.nz/. We only capture the news focus on the “Three Waters Reform" in New Zealand, a topic of substantial political discourse and interest spanning from 2017 to June 2023. This dataset accumulates a total of 1,841 articles. Following the application of our labeling process yields 3,262 articles labeled in concordance with their identified frames and frame elements.

  • Covid-19 is collected using Newsapi 333https://newsapi.org/ which is an API service that allows developers to retrieve news articles from various sources on a worldwide scale. We use “Covid-19" as keywords to retrieve news articles in the period from 01/12/2019 to 20/08/2023. These articles reflect the in-time attitude to the Covid-19 pandemic. This dataset includes 13,386 articles after the pre-processing.

  • Nuclear Pollution dataset is collected using the Newsapi as well and with the keywords “nuclear pollution" over the last 5 years. This dataset provides a comprehensive view of the discourse surrounding nuclear pollution offering a diverse range of perspectives and information. After the data pre-processing, there are 2,431 articles with an average token length of 482.

  • Kaggle Fake News Dataset 444https://www.kaggle.com/datasets/stevenpeutz/misinformation-fake-news-text-dataset-79k contains news articles from multiple sources such as Reuters and so on. For the purpose of our study, we confine our selection to the “TRUE" set and randomly select 3k articles. To augment the dataset to fulfill the research objectives, we produce another set of data by varying the frame of existing news, ultimately resulting in 5,915 labeled samples following the implementation of our augmentation process.

Table 1: The statistics of the datasets after pre-processing.
Dataset articles average length
The Three Waters 3,262 823
Covid-19 13,386 537
Nuclear Pollution 2,431 482
Mixed-topic 5,915 469

5.3 Data Pre-processing

Our proposed methodology commences with the collection of datasets comprised of news articles from reliable sources. These articles constitute our ground truth, representing information opposite to misinformation. Accordingly, we synthesize misinformation based on the framing theory by altering the frames of our collected news articles. This process augments our datasets in a generative method at the document level and unfolds in three structured phases [26].

  • Frame Identification and Element Extraction: Utilizing the capabilities of ChatGPT, we first process the collected news articles to identify their frames and extract four elements of framing theory. These extracted framing elements reflect the news articles’ original and unaltered state. They are annotated with the label “1", signifying their category as information. The frames we harness in this work are selected and proved by domain experts in communication.

  • Frame Alteration: The second stage involves the alteration of the frame, utilizing ChatGPT to manipulate the article narrative while maintaining the original factual information. This step simulates the process of creating misinformation through narrative manipulation, a common way that preserves factual information but skews the frame to mislead readers. 20% of the altered narratives are verified by the domain experts.

  • Element Extraction: In the final step, we process the narrative-manipulated articles through ChatGPT to extract the corresponding four elements of framing theory, labeled “0" along with the manipulated articles, signifying their category as misinformation. This eventually establishes the basis for comparison with the information.

This pre-processing procedure is designed to construct binary-category datasets that are comprised of information and misinformation with elements of framing theory, thereby enabling the nuanced training of our model. Through this process, we not only aim to create datasets that serve as the foundation of misinformation detection but also enhance the understanding of how narrative (framing theory in this work) can be utilized to generate misinformation.

5.4 Evaluation Metrics and Baselines

To evaluate the performance of our proposed model (FEM), we utilize the Confusion Matrix as our primary evaluation measurement. The Confusion Matrix provides a comprehensive visualization of the performance by categorizing predictions into four different classifications [27]:

  • True Positives (TP): when predicted misinformation is actually labeled as misinformation;

  • True Negatives (TN): when predicted information is actually labeled as information;

  • False Positives (FP): when predicted information is actually labeled as misinformation;

  • False Negatives (FN): when predicted misinformation is actually labeled as information.

As our baselines, we perform the fine-tuning of several highly utilized pre-trained transformer-based language models followed by a feed-forward layer as a classifier for misinformation detection. These baselines include:

  • BERT [28] is a groundbreaking transformer-based model in the field of natural language processing (NLP). It is known for its deep bidirectional training, meaning it considers the context from both the left and right sides in all layers. This leads to a more nuanced understanding of language context and semantics. BERT has been highly influential in improving the performance of a wide range of NLP tasks.

  • RoBERTa [29] is built upon BERT by modifying key hyperparameters, training with more data, and longer training times. These changes help RoBERTa outperform BERT on several benchmark NLP tasks. It is known for its improved robustness and efficiency.

  • ALBERT [30] is a version of BERT optimized for lower memory consumption and increased speed. It introduces two major modifications: factorized embedding parameterization and cross-layer parameter sharing. These changes reduce the model’s size without significantly affecting its performance, making it more scalable and efficient.

  • XLNet [31] is an extension of the Transformer model. Instead of the standard transformer, XLNet uses transformer-XL [32]. It combines the best of both autoregressive (AR) and autoencoding (AE) models. Unlike BERT, XLNet learns to predict a word at a position in a sequence considering all permutations of the sequence.

  • LongFormer [33] is designed to handle longer texts. It is an extension of the standard transformer-based model, like BERT, but optimized for lengthy documents. Its key innovation is the introduction of an attention mechanism that scales linearly with sequence length.

6 Experiments and Analysis

6.1 Experiment 1: Model Evaluation - against baselines

In this experiment, we compare the performance of our model with other baseline models on four datasets introduced in Section 5. The results are displayed in Table 2 ,3, 4 and 5 respectively. The results on each dataset consistently show that our model (FEM) incorporating frame elements with the original news article significantly outperforms other models with only articles, presenting the importance of frame elements.

From these results, we can observe that frame elements play a crucial role in understanding and interpreting information. We also demonstrate the results of our model with only frame elements and only texts as input respectively. Compared to the baselines, the results with only frame elements as input are also beyond them.

By analyzing the performance of our model with only frame elements compared to the other baselines, we can observe the significance of these elements. Frame elements contribute to the deeper semantic understanding of the content by narrowing it down to the core theme reducing the distracting noise. This enables the model to grasp not just the explicit meaning but also the implicit intentions and nuances thus increasing the probability of precisely detecting misinformation.

Experiment results in Table 5 on the Mixed-topic dataset demonstrate that frame elements can also provide a more general representation of information, making the model more adaptable and robust to variations of information.

Table 2: Results on the Three Waters Dataset.
Models Accuracy Precision Recall F1_score
BERT 0.8469 0.8188 0.8127 0.8157
RoBERTa 0.8622 0.8784 0.7915 0.8327
ALBERT 0.8086 0.7651 0.8057 0.7849
XLNet 0.8545 0.8113 0.8657 0.8376
LongFormer 0.8591 0.8283 0.8516 0.8398
FEM (text+frames) 0.9862 0.9695 0.9734 0.9715
FEM (only text) 0.8652 0.8316 0.8638 0.8474
FEM (only frames) 0.9278 0.9355 0.9605 0.9478
Table 3: Results on the Covid-19 Dataset.
Models Accuracy Precision Recall F1_score
BERT 0.8372 0.8052 0.8074 0.8063
RoBERTa 0.8547 0.8539 0.7867 0.8190
ALBERT 0.8104 0.7783 0.8163 0.7968
XLNet 0.8429 0.8207 0.8629 0.8412
LongFormer 0.8546 0.8617 0.8694 0.8655
FEM (text+frames) 0.9783 0.9583 0.9708 0.9645
FEM (only text) 0.8865 0.8737 0.8826 0.8781
FEM (only frames) 0.9132 0.9195 0.9361 0.9277
Table 4: Results on the Nuclear Pollution Dataset.
Models Accuracy Precision Recall F1_score
BERT 0.8035 0.7921 0.80167 0.7969
RoBERTa 0.8167 0.8234 0.7826 0.8025
ALBERT 0.8051 0.7568 0.7864 0.7713
XLNet 0.8268 0.8035 0.8284 0.8158
LongFormer 0.8462 0.8254 0.8316 0.8285
FEM (text+frames) 0.9538 0.9429 0.9531 0.9480
FEM (only text) 0.8491 0.8365 0.8537 0.8450
FEM (only frames) 0.9035 0.9216 0.9268 0.9242
Table 5: Results on the Mixed-topic Dataset.
Models Accuracy Precision Recall F1_score
BERT 0.8354 0.8127 0.8165 0.8146
RoBERTa 0.8497 0.8503 0.7902 0.8191
ALBERT 0.8126 0.7816 0.8257 0.8030
XLNet 0.8528 0.8320 0.8783 0.8545
LongFormer 0.8736 0.8542 0.8867 0.8701
FEM (text+frames) 0.9696 0.9582 0.9683 0.9632
FEM (only text) 0.8823 0.8574 0.8929 0.8748
FEM (only frames) 0.9158 0.9207 0.9319 0.9263

6.2 Experiment 2: Parameter Analysis

In this experiment, we conduct a comparative analysis to explore the contribution of each element to misinformation detection. The experimental framework analyzes the composite efficacy of the model equipped with the four elements, i.e., Problem Definition, Causal Interpretation, Moral Evaluation and Treatment Recommendation. Within the area of misinformation detection with frame elements, exploring the individual contribution of distinct frame elements is vital to help us understand how frame elements influence the model’s capability to grasp the veracity of information.

The model with all 4 elements serves as the benchmark for optimal performance, showing a high degree of accuracy, precision, recall and F1-score. This provides a holistic frame element-based analysis of information, thus enhancing the probability of identification of misinformation.

Then, we remove each frame element from all four elements keeping the other three elements remaining. Figure 2 displays all performance metrics while Figure 3 demonstrates the trend of F1-Scores during the training process.

We can observe from all these figures that when the element of Problem Definition is removed from the model, a pronounced decrement in all measurements is demonstrated. This suggests that the recognition of Problem Definitions is instrumental in the precise detection of misinformation, potentially due to its role in pinpointing the core theme within the narrative that may be manipulated. Without the incorporation of the element of Problem Definition, the capability of the model to differentiate between true and misleading content is significantly compromised.

Meanwhile, the absence of the frame of Moral Evaluation also results in a noticeable decline in all performance metrics. It appears to be an important factor in the framing of information indicating that it is often manipulated in misinformation in order to obtain emotional biases or ethical stances.

The model lacking the frame of Problem Definition or Moral Evaluation demonstrates a noticeable drop in Precision and Accuracy indicating a higher rate of false positives. This implies that while the model may still identify genuine instances of misinformation, it is also more likely to incorrectly classify accurate information as misinformation.

On the contrary, a lack of the frame of Causal Interpretation or Treatment Recommendation does not show a substantial decline in performance metrics compared to the benchmark. This observation implies that while they have a role in the misinformation detection process, however, their absence does not critically influence the capability of the model to identify misinformation.

One noticeable difference in the results demonstrated in Figure 1(c) and Figure 2(c) on the Nuclear Pollution dataset is the lack of the frame of Treatment Recommendation. Removing the Treatment Recommendation element also results in a lower performance across all metrics indicating that in the context of nuclear pollution, the treatment recommendation is likely to be a key indicator of the news articles. A lack of this element in this area could also allow misinformation proposing ineffective or misleading responses to harness the readers. The decrement in performance of missing the frame of treatment recommendation also implies that the contribution of each element can vary depending on the subject.

Refer to caption
(a) Three Waters Reform
Refer to caption
(b) Covid-19
Refer to caption
(c) Nuclear Pollution
Refer to caption
(d) Mixed-topic
Figure 2: Measure the performance of removing one of the elements on all four datasets.
Refer to caption
(a) Three Waters Reform
Refer to caption
(b) Covid-19
Refer to caption
(c) Nuclear Pollution
Refer to caption
(d) Mixed-topic
Figure 3: The F1-scores during the training process on all four datasets.

6.3 Experiment 3: Similarity Comparison

Table 6: One single pair similarity and similarities removing one of the elements.
Info vs Mis-info Similarity F1-score
Article Similarity 0.86 0.8474
Elements Similarity(all 4 elements) 0.61 0.9478
Elements Similarity(without problem) 0.79 0.9046
Elements Similarity(without causal) 0.62 0.9454
Elements Similarity(without moral) 0.81 0.9065
Elements Similarity(without treatment) 0.64 0.9354
Table 7: Compare article average similarity with average similarity calculated using 4 elements.
Info vs Mis-info three water covid nuclear mixed
Article Similarity 0.86 0.82 0.83 0.85
Elements Similarity(all 4 elements) 0.58 0.62 0.59 0.61
Elements Similarity(without problem) 0.83 0.81 0.82 0.83
Elements Similarity(without causal) 0.59 0.62 0.60 0.63
Elements Similarity(without moral) 0.81 0.79 0.81 0.80
Elements Similarity(without treatment) 0.60 0.64 0.78 0.63

In this experiment, we analyze the relationship between the similarity between information and misinformation under different conditions relating to the presence or absence of specific frame elements within our model. This experiment represents how closely misinformation mirrors authentic information in terms of framing. The cosine function is used to calculate their similarities:

sim(hi,hj)=hihjhihj,𝑠𝑖𝑚subscript𝑖subscript𝑗subscript𝑖subscript𝑗normsubscript𝑖normsubscript𝑗sim(h_{i},h_{j})=\frac{h_{i}\cdot h_{j}}{\parallel h_{i}\parallel\parallel h_{% j}\parallel},italic_s italic_i italic_m ( italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT , italic_h start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ) = divide start_ARG italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ⋅ italic_h start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT end_ARG start_ARG ∥ italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT ∥ ∥ italic_h start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT ∥ end_ARG , (9)

where hisubscript𝑖h_{i}italic_h start_POSTSUBSCRIPT italic_i end_POSTSUBSCRIPT and hjsubscript𝑗h_{j}italic_h start_POSTSUBSCRIPT italic_j end_POSTSUBSCRIPT represent the final hidden states of two articles or elements from two articles.

Table 6 shows the similarities of one randomly selected article from the Three Waters Reform dataset and the overall performance (F1-score) of the model on this dataset. The similarity between the information article and the misinformation article is used as the benchmark for further analysis. The similarity of 0.86 shows that, without any specialized modification, misinformation is quite successful at resembling a genuine news article.

From Table 6 we can observe that the similarity between the information article and misinformation article is the highest at 0.85, however, the F1 score of the model with only text on this dataset is 0.8474 which is lower than for other conditions especially when utilizing only all 4 elements which holds the lowest similarity 0.61 and highest F1-score 0.9478.

This experiment indicates the alignment of similarity with the model performance as an inverse relationship. The lower the similarity, the higher the performance is in detecting misinformation. This underscores the importance of elements of framing theory in the detection process and points out their potential impact on improving detection accuracy.

We also calculate the average similarity scores between information and misinformation across four distinct datasets under different conditions. Results are displayed in Table 7. The pattern of similarity scores is relatively consistent across different topics, indicating that the manipulation of framing in misinformation follows a similar pattern. However, on the nuclear dataset, when omitting the frame of treatment recommendation, the similarity still remains at a high level, indicating the importance of treatment recommendation in this dataset.

Overall, by comparing the average similarities across all datasets, we can observe the alignment of similarities with omitting distinct elements except the impact when omitting the frame of treatment recommendation on the nuclear dataset. This indicates a unique pattern in the context of a specific topic showing that different elements of framing theory have varying levels of impact depending on the topic. This also underlines the importance of topic-sensitive approaches in misinformation detection.

6.4 Experiment 4: Case study

In this experiment, we conduct a case study to analyze the similarities between two articles written about the same topic where each article has a different frame and frame elements. The articles focus on the government’s proposed water reforms.

The first article has a political frame:

"There’s a lot of change being proposed by the government… Fundamentally, they’re considering shifting responsibility for our three waters: water supply, wastewater, and stormwater, from local government into four large entities… The government now believes that costs of between $120 billion and $185b will be required: between $4 and $6b per year on average… The proposed three waters reform program harks back to the Havelock North water contamination event in 2016… It’s on this basis that the government has concluded that four entities, aggregating all the water services across the country, offer the best and quickest opportunity to achieve the desired improvements to the three-waters networks… It’s too early to ask the community…"

While the second article has a semantic frame to show their satire:

"Oh, boy! The government is proposing some exciting changes, folks. Brace yourselves because they’re considering taking control of our beloved three waters. You know, the precious water supply, wastewater, and stormwater that our local government has been responsible for?… The government estimates that we’ll need a mind-boggling $120 billion to $185 billion over the next 30 years… Well, now they want to hand it over to these big entities called Water Supply Entities. What a brilliant idea, right?… And get this – the government thinks it would be cheaper if larger entities took over the water services. Apparently, they can borrow more, with the government’s backing, of course. I mean, who needs small, local councils when you can have these big entities making all the decisions for you?…"

The similarities calculated in Table 6 of 0.86 indicate that the articles are highly similar as they both share details about the reform. However, once the frame elements are considered, the article similarity decreases to 0.61.

The political frame is informative and objective, presenting a detailed overview using formal language, and statistics to support its claims, while the semantic frame is emotional and opinionated using colloquial language and employing vivid imagery to engage readers emotionally which may risk oversimplification and bias.

To determine the classification without frame elements, our model only encodes the news article. In comparison, for classifying with the frame elements, the elements and news articles are encoded independently with their embeddings concatenated into one vector prior to classification. The inclusion of these extra features enhances the model’s performance.

For example, given the problem definition for the political frame:

“The proposed shift of responsibility for three waters from local government to four large entities known as water supply entities.”

As well as the problem definition for the semantic frame:

“The proposed government takeover of three waters”

The problem definition of each article highlights their differences in framing, the politically framed article’s problem definition is detailed with a neutral tone, while the semantic frame’s problem definition is short with a negative perspective.

Both articles are classified as information without frame elements, however, once the frame elements are considered, the semantic frame is correctly classified as misinformation. This suggests that including the frame elements in our model contributes to the successful classification of misinformation by reinforcing the differences between the two semantically similar articles.

6.5 Discussions

We proposed a Frame Element-based Model (FEM) to distinguish misinformation from information. Several experiments are conducted providing crucial insights into the importance of elements of framing theory in misinformation detection. Results are evaluated by comparing them with baseline models. Furthermore, we analyzed the contribution of each element demonstrating the different roles of the elements. By comparing the article similarities and element similarities, we obtained insights into how the elements improve the performance of detecting misinformation. Based on the experimental results, we have the following insights observed:

  • The results of Experiment 1 on all datasets consistently proved that incorporating the elements of framing theory while detecting misinformation stemming from portrayed facts under different narratives can help improve the performance.

  • The parameter analysis experiment revealed that the absence of certain framing elements, particularly Problem Definition and Moral Evaluation, leads to a significant decrease in the model’s accuracy and precision. This underscores the critical role these elements play in the accurate detection of misinformation.

  • Experiment 2 also demonstrates a finding that the specific element plays a different role on different topics highlighting the potential impact of elements and underscoring the necessity for topic-sensitive approaches in misinformation detection, as different elements of framing theory have varying levels of impact depending on the topic.

  • The similarity comparison experiment further illustrated how misinformation closely mirrors authentic information in terms of framing. The results indicated an inverse relationship between similarity and model performance: lower similarity between the information and misinformation articles led to higher performance in detecting misinformation.

  • The case study shows that not applying the framing theory to the articles can lead to the result of the article with a semantic frame incorrectly classified as information increasing the potential for misleading interpretations. It also shows that while articles may be semantically similar, the choice of framing can greatly impact the narrative of content being misleading or misinterpreted.

7 Conclusion and Future Work

In this work, we introduce the Framed Element-based Model (FEM) to identify misinformation in the context of news articles incorporating the elements of framing theory, i.e., Problem Definition, Causal Interpretation, Moral Evaluation, and Treatment Recommendation. This model leverages ChatGPT and deep neural networks to detect misinformation originating from accurately portrayed facts under different frames. The efficacy of FEM is demonstrated through comprehensive performance comparisons with other methods, highlighting the effectiveness of Framed Element-based approach against traditional misinformation detection models. The contribution of each element is also evaluated and analyzed along with the similarities under different conditions, indicating the importance of the specific element and showcasing how the narrative of an article is framed.

This work has laid a foundational understanding of how elements of framing theory influence the perception and interpretation of information. Building upon the insights obtained, there are several future directions. Future studies can delve into the impact of specific elements across various topics, such as the frame of treatment recommendation on the Nuclear dataset shows more impact than on the other three datasets which raises questions that can be explored in the future. Besides the contribution of each element we explored, relations among these elements also can be explored as a future direction which can help understand and enhance the ability to detect misinformation under more complex scopes.

Acknowledgment

This project has been supported by the AUT DCT Faculty Contestable Research Fund 2023. We acknowledge the support and resources provided by AUT and thank all the participants and collaborators who dedicated their time and effort to this work.

References

  • [1] Md Rafiqul Islam, Shaowu Liu, Xianzhi Wang, and Guandong Xu. Deep learning for misinformation detection on online social networks: a survey and new perspectives. Social Network Analysis and Mining, 10:1–20, 2020.
  • [2] Chiara Longoni, Andrey Fradkin, Luca Cian, and Gordon Pennycook. News from generative artificial intelligence is believed less. In 2022 ACM Conference on Fairness, Accountability, and Transparency, pages 97–106, 2022.
  • [3] Nir Kshetri. Chatgpt in developing economies. IT Professional, 25(2):16–19, 2023.
  • [4] Hannah Rashkin, Eunsol Choi, Jin Yea Jang, Svitlana Volkova, and Yejin Choi. Truth of varying shades: Analyzing language in fake news and political fact-checking. In Proceedings of the 2017 conference on empirical methods in natural language processing, pages 2931–2937, 2017.
  • [5] Andon Tchechmedjiev, Pavlos Fafalios, Katarina Boland, Malo Gasquet, Matthäus Zloch, Benjamin Zapilko, Stefan Dietze, and Konstantin Todorov. Claimskg: A knowledge graph of fact-checked claims. In The Semantic Web–ISWC 2019: 18th International Semantic Web Conference, Auckland, New Zealand, October 26–30, 2019, Proceedings, Part II 18, pages 309–324. Springer, 2019.
  • [6] R.M. Entman. Projections of Power: Framing News, Public Opinion, and U.S. Foreign Policy. Projections of Power: Framing News, Public Opinion, and U.S. Foreign Policy. University of Chicago Press, 2004.
  • [7] Robert M Entman. Framing: Toward clarification of a fractured paradigm. Journal of communication, 43(4):51–58, 1993.
  • [8] Erving Goffman. Frame analysis: An essay on the organization of experience. Harvard University Press, 1974.
  • [9] Dietram A Scheufele. Framing as a theory of media effects. Journal of communication, 49(1):103–122, 1999.
  • [10] Gail Fairhurst and Robert Sarr. The art of framing. San Francisco: Jossey-Bass, 1996.
  • [11] Dror Walter and Yotam Ophir. News frame analysis: An inductive mixed-method computational approach. Communication Methods and Measures, 13(4):248–266, 2019.
  • [12] Maria Touri and Nelya Koteyko. Using corpus linguistic software in the extraction of news frames: towards a dynamic process of frame analysis in journalistic texts. International Journal of Social Research Methodology, 18(6):601–616, 2015.
  • [13] Christian Scheibenzuber, Laurentiu-Marian Neagu, Stefan Ruseti, Benedikt Artmann, Carolin Bartsch, Montgomery Kubik, Mihai Dascalu, Stefan Trausan-Matu, and Nicolae Nistor. Dialog in the echo chamber: Fake news framing predicts emotion, argumentation and dialogic social knowledge building in subsequent online discussions. Computers in Human Behavior, 140:107587, 2023.
  • [14] Zhijiang Guo, Michael Schlichtkrull, and Andreas Vlachos. A Survey on Automated Fact-Checking. Transactions of the Association for Computational Linguistics, 10:178–206, 02 2022.
  • [15] Kai Shu, Suhang Wang, and Huan Liu. Understanding user profiles on social media for fake news detection. In 2018 IEEE conference on multimedia information processing and retrieval (MIPR), pages 430–435. IEEE, 2018.
  • [16] Sejeong Kwon, Meeyoung Cha, Kyomin Jung, Wei Chen, and Yajun Wang. Prominent features of rumor propagation in online social media. In 2013 IEEE 13th International Conference on Data Mining, pages 1103–1108, 2013.
  • [17] Andreas Vlachos and Sebastian Riedel. Fact checking: Task definition and dataset construction. In Cristian Danescu-Niculescu-Mizil, Jacob Eisenstein, Kathleen McKeown, and Noah A. Smith, editors, Proceedings of the ACL 2014 Workshop on Language Technologies and Computational Social Science, pages 18–22, Baltimore, MD, USA, June 2014. Association for Computational Linguistics.
  • [18] Ray Oshikawa, Jing Qian, and William Yang Wang. A survey on natural language processing for fake news detection. CoRR, abs/1811.00770, 2018.
  • [19] Sara Abdali, Gisel G Bastidas, Neil Shah, and Evangelos E Papalexakis. Tensor embeddings for content-based misinformation detection with limited supervision. Disinformation, Misinformation, and Fake News in Social Media: Emerging Research Challenges and Opportunities, pages 117–140, 2020.
  • [20] Jamal Abdul Nasir, Osama Subhani Khan, and Iraklis Varlamis. Fake news detection: A hybrid cnn-rnn based deep learning approach. International Journal of Information Management Data Insights, 1(1):100007, 2021.
  • [21] Kellin Pelrine, Jacob Danovitch, and Reihaneh Rabbany. The surprising performance of simple baselines for misinformation detection. In Proceedings of the Web Conference 2021, pages 3432–3441, 2021.
  • [22] Sanjaikanth E Vadakkethil Somanathan Pillai and Wen-Chen Hu. Misinformation detection using an ensemble method with emphasis on sentiment and emotional analyses. In 2023 IEEE/ACIS 21st International Conference on Software Engineering Research, Management and Applications (SERA), pages 295–300, 2023.
  • [23] Ciprian-Octavian Truică and Elena-Simona Apostol. MisrobÆrta: Transformers versus misinformation. Mathematics, 10(4), 2022.
  • [24] Amani Alzahrani, Tahani Baabdullah, Aeman Almotairi, and Danda B. Rawat. A hybrid deep learning architecture for misinformation detection on social media. In 2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI), pages 199–204, 2023.
  • [25] Siyi Liu, Lei Guo, Kate Mays, Margrit Betke, and Derry Tanti Wijaya. Detecting frames in news headlines and its application to analyzing news framing trends surrounding us gun violence. In Proceedings of the 23rd conference on computational natural language learning (CoNLL), pages 504–514, 2019.
  • [26] Markus Bayer, Marc-André Kaufhold, and Christian Reuter. A survey on data augmentation for text classification. ACM Computing Surveys, 55(7):1–39, 2022.
  • [27] Kai Shu, Amy Sliva, Suhang Wang, Jiliang Tang, and Huan Liu. Fake news detection on social media: A data mining perspective. ACM SIGKDD explorations newsletter, 19(1):22–36, 2017.
  • [28] Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805, 2018.
  • [29] Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov. Roberta: A robustly optimized bert pretraining approach. arXiv preprint arXiv:1907.11692, 2019.
  • [30] Zhenzhong Lan, Mingda Chen, Sebastian Goodman, Kevin Gimpel, Piyush Sharma, and Radu Soricut. Albert: A lite bert for self-supervised learning of language representations. arXiv preprint arXiv:1909.11942, 2019.
  • [31] Zhilin Yang, Zihang Dai, Yiming Yang, Jaime Carbonell, Russ R Salakhutdinov, and Quoc V Le. Xlnet: Generalized autoregressive pretraining for language understanding. Advances in neural information processing systems, 32, 2019.
  • [32] Zihang Dai, Zhilin Yang, Yiming Yang, Jaime G Carbonell, Quoc Le, and Ruslan Salakhutdinov. Transformer-xl: Attentive language models beyond a fixed-length context. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 2978–2988, 2019.
  • [33] Iz Beltagy, Matthew E. Peters, and Arman Cohan. Longformer: The long-document transformer. arXiv:2004.05150, 2020.