Showing 1–9 of 9 results for author: Bertaglia, T

Searching in archive cs.
  1. The Monetisation of Toxicity: Analysing YouTube Content Creators and Controversy-Driven Engagement

    Authors: Thales Bertaglia, Catalina Goanta, Adriana Iamnitchi

    Abstract: YouTube is a major social media platform that plays a significant role in digital culture, with content creators at its core. These creators often engage in controversial behaviour to drive engagement, which can foster toxicity. This paper presents a quantitative analysis of controversial content on YouTube, focusing on the relationship between controversy, toxicity, and monetisation. We introduce…

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: Accepted for publication at the 4th International Workshop on Open Challenges in Online Social Networks (OASIS), held in conjunction with the 35th ACM Conference on Hypertext and Social Media (HT24)

  2. arXiv:2407.12451  [pdf, other]

    cs.CY cs.CL cs.SI

    Across Platforms and Languages: Dutch Influencers and Legal Disclosures on Instagram, YouTube and TikTok

    Authors: Haoyang Gui, Thales Bertaglia, Catalina Goanta, Sybe de Vries, Gerasimos Spanakis

    Abstract: Content monetization on social media fuels a growing influencer economy. Influencer marketing remains largely undisclosed or inappropriately disclosed on social media. Non-disclosure issues have become a priority for national and supranational authorities worldwide, who are starting to impose increasingly harsher sanctions on them. This paper proposes a transparent methodology for measuring whethe…

    Submitted 12 August, 2024; v1 submitted 17 July, 2024; originally announced July 2024.

    Comments: Accepted for publication at the 16th International Conference on Advances in Social Networks Analysis and Mining - ASONAM-2024

  3. arXiv:2407.09202  [pdf, other]

    cs.CY cs.SI

    Influencer Self-Disclosure Practices on Instagram: A Multi-Country Longitudinal Study

    Authors: Thales Bertaglia, Catalina Goanta, Gerasimos Spanakis, Adriana Iamnitchi

    Abstract: This paper presents a longitudinal study of more than ten years of activity on Instagram consisting of over a million posts by 400 content creators from four countries: the US, Brazil, Netherlands and Germany. Our study shows differences in the professionalisation of content monetisation between countries, yet consistent patterns; significant differences in the frequency of posts yet similar user…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Submitted to Online Social Networks and Media

  4. arXiv:2407.08323  [pdf, other]

    cs.CY

    Leveraging GPT for the Generation of Multi-Platform Social Media Datasets for Research

    Authors: Henry Tari, Danial Khan, Justus Rutten, Darian Othman, Rishabh Kaushal, Thales Bertaglia, Adriana Iamnitchi

    Abstract: Social media datasets are essential for research on disinformation, influence operations, social sensing, hate speech detection, cyberbullying, and other significant topics. However, access to these datasets is often restricted due to costs and platform regulations. As such, acquiring datasets that span multiple platforms which are crucial for a comprehensive understanding of the digital ecosystem…

    Submitted 11 July, 2024; originally announced July 2024.

  5. arXiv:2403.15214  [pdf, other]

    cs.CY cs.CL cs.SI

    InstaSynth: Opportunities and Challenges in Generating Synthetic Instagram Data with ChatGPT for Sponsored Content Detection

    Authors: Thales Bertaglia, Lily Heisig, Rishabh Kaushal, Adriana Iamnitchi

    Abstract: Large Language Models (LLMs) raise concerns about lowering the cost of generating texts that could be used for unethical or illegal purposes, especially on social media. This paper investigates the promise of such models to help enforce legal requirements related to the disclosure of sponsored content online. We investigate the use of LLMs for generating synthetic Instagram captions with two objec…

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: To appear at the 18th International AAAI Conference on Web and Social Media (ICWSM 2024) -- please cite accordingly

  6. arXiv:2306.05115  [pdf, ps, other]

    cs.CL cs.SI

    Closing the Loop: Testing ChatGPT to Generate Model Explanations to Improve Human Labelling of Sponsored Content on Social Media

    Authors: Thales Bertaglia, Stefan Huber, Catalina Goanta, Gerasimos Spanakis, Adriana Iamnitchi

    Abstract: Regulatory bodies worldwide are intensifying their efforts to ensure transparency in influencer marketing on social media through instruments like the Unfair Commercial Practices Directive (UCPD) in the European Union, or Section 5 of the Federal Trade Commission Act. Yet enforcing these obligations has proven to be highly problematic due to the sheer scale of the influencer market. The task of au…

    Submitted 8 June, 2023; originally announced June 2023.

    Comments: Accepted to The World Conference on eXplainable Artificial Intelligence, Lisbon, Portugal, July 2023

  7. arXiv:2205.06666  [pdf, ps, other]

    cs.CY cs.AI cs.CL

    The Case for a Legal Compliance API for the Enforcement of the EU's Digital Services Act on Social Media Platforms

    Authors: Catalina Goanta, Thales Bertaglia, Adriana Iamnitchi

    Abstract: In the course of under a year, the European Commission has launched some of the most important regulatory proposals to date on platform governance. The Commission's goals behind cross-sectoral regulation of this sort include the protection of markets and democracies alike. While all these acts propose sophisticated rules for setting up new enforcement institutions and procedures, one aspect remain…

    Submitted 13 May, 2022; originally announced May 2022.

    Comments: Accepted for publication at ACM FAccT Conference 2022

  8. arXiv:1707.02657  [pdf, other]

    cs.CL cs.AI cs.LG

    PELESent: Cross-domain polarity classification using distant supervision

    Authors: Edilson A. Corrêa Jr, Vanessa Q. Marinho, Leandro B. dos Santos, Thales F. C. Bertaglia, Marcos V. Treviso, Henrico B. Brum

    Abstract: The enormous amount of texts published daily by Internet users has fostered the development of methods to analyze this content in several natural language processing areas, such as sentiment analysis. The main goal of this task is to classify the polarity of a message. Even though many approaches have been proposed for sentiment analysis, some of the most successful ones rely on the availability o…

    Submitted 9 July, 2017; originally announced July 2017.

    Comments: Accepted for publication in BRACIS 2017

  9. arXiv:1704.02963  [pdf, other]

    cs.CL cs.AI

    Exploring Word Embeddings for Unsupervised Textual User-Generated Content Normalization

    Authors: Thales Felipe Costa Bertaglia, Maria das Graças Volpe Nunes

    Abstract: Text normalization techniques based on rules, lexicons or supervised training requiring large corpora are not scalable nor domain interchangeable, and this makes them unsuitable for normalizing user-generated content (UGC). Current tools available for Brazilian Portuguese make use of such techniques. In this work we propose a technique based on distributed representation of words (or word embeddin…

    Submitted 10 April, 2017; originally announced April 2017.

    Comments: Published in Proceedings of the 2nd Workshop on Noisy User-generated Text, 9 pages