Guiding sentiment analysis with hierarchical text clustering: Analyzing the German X/Twitter conversation on face masks in the 2020 COVID-19 pandemic

Abstract: Social media are a critical component of the information ecosystem during public health crises. Understanding the public discourse is essential for effective communication and misinformation mitigation. Computational methods can aid these efforts through online social listening. We combined hierarchical text clustering and sentiment analysis to examine the face mask-wearing discourse in Germany during the COVID-19 pandemic using a dataset of 353,420 German X (formerly Twitter) posts from 2020. For sentiment analysis, we annotated a subsample of the data to train a neural network for classifying the sentiments of posts (neutral, negative, or positive). In combination with clustering, this approach uncovered sentiment patterns of different topics and their subtopics, reflecting the online public response to mask mandates in Germany. We show that our approach can be used to examine long-term narratives and sentiment dynamics and to identify specific topics that explain peaks of interest in the social media discourse.

You can find the paper here: https://aclanthology.org/2024.wassa-1.13/

Overview

This repository contains code we used for our paper, specifically for

While we cannot share the raw X (formerly Twitter) data used in our work, we want to make our research as transparent as possible and enable other researchers to try the presented approach on their data. If you have any questions, do not hesitate to contact us.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
hierarchical_clustering		hierarchical_clustering
treemap_visualization		treemap_visualization
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Guiding sentiment analysis with hierarchical text clustering: Analyzing the German X/Twitter conversation on face masks in the 2020 COVID-19 pandemic

Overview

About

Languages

License

ClimSocAna/sentiments-with-hierarchical-clustering

Folders and files

Latest commit

History

Repository files navigation

Guiding sentiment analysis with hierarchical text clustering: Analyzing the German X/Twitter conversation on face masks in the 2020 COVID-19 pandemic

Overview

About

Topics

Resources

License

Stars

Watchers

Forks

Languages