Analyzing big data in social media: Text and network analyses of an eating disorder forum

Int J Eat Disord. 2018 Jul;51(7):656-667. doi: 10.1002/eat.22878. Epub 2018 May 10.

Abstract

Objective: Social media plays an important role in everyday life of young people. Numerous studies claim negative effects of social media and media in general on eating disorder risk factors. Despite the availability of big data, only few studies have exploited the possibilities so far in the field of eating disorders.

Method: Methods for data extraction, computerized content analysis, and network analysis will be introduced. Strategies and methods will be exemplified for an ad-hoc dataset of 4,247 posts and 34,118 comments by 3,029 users of the proed forum on Reddit.

Results: Text analysis with latent Dirichlet allocation identified nine topics related to social support and eating disorder specific content. Social network analysis describes the overall communication patterns, and could identify community structures and most influential users. A linear network autocorrelation model was applied to estimate associations in language among network neighbors. The supplement contains R code for data extraction and analyses.

Discussion: This paper provides an introduction to investigating social media data, and will hopefully stimulate big data social media research in eating disorders. When applied in real-time, the methods presented in this manuscript could contribute to improving the safety of ED-related online communication.

Keywords: big data; eating disorders; social media; social network analysis; text analysis.

MeSH terms

  • Algorithms
  • Big Data*
  • Central Nervous System Stimulants / pharmacology
  • Data Mining / methods*
  • Feeding and Eating Disorders / diagnosis*
  • Feeding and Eating Disorders / physiopathology*
  • Humans
  • Internet
  • Language
  • Linear Models
  • Natural Language Processing
  • Programming Languages
  • Risk Factors
  • Social Media
  • Social Networking*
  • Social Support*
  • Software

Substances

  • Central Nervous System Stimulants