The main purpose of this project is to help researchers estimate and analyze public opinion from social-media comments. The idea is to use text-embedding algorithms to vectorize comments, and then to use the resulting vectors for clustering, classification, analysis of opinion dynamics over time, and similarity comparison against a reference text. The approach is tested on data from the Reddit platform.
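As a rough sketch of that pipeline (assuming sentence-transformers for the embedding step; the repo also evaluates doc2vec and USE, and the model name and example comments below are placeholders, not part of this repo):

```python
from sentence_transformers import SentenceTransformer
from sklearn.cluster import KMeans

# Placeholder comments standing in for preprocessed Reddit data
comments = [
    "We should build the wall!",
    "Healthcare is a human right.",
    "Secure the border now.",
    "Expand public health insurance.",
]

# Vectorize the comments (the model choice here is an assumption)
model = SentenceTransformer("all-MiniLM-L6-v2")
vectors = model.encode(comments)

# The resulting vectors can feed clustering, classification, etc.
clusters = KMeans(n_clusters=2, n_init=10).fit_predict(vectors)
print(clusters)
```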
Folder | Description |
---|---|
preprocessing | preprocessing of the data downloaded from Reddit |
webapp | Streamlit web app |
sBert | testing sBert: vectorization, classification, cos_sim, clustering |
doc2vec | testing doc2vec: vectorization, classification, cos_sim |
USE | testing USE: vectorization, classification, cos_sim |
To run the Streamlit app from the webapp folder:
- Install the streamlit library.
- Set up the environment. The required libraries are listed in requirements.txt.
- Put the three tables ("df_doc2vec", "df_sbert", "df_use") in pickle format into the same folder as my_app.py. Each table needs the columns body (the comment text, type string), vec (the embedding vector), and who (the party label, type int: 1 for Biden, 0 for Trump); a sketch of building such a table follows these steps.
- Run the app:

  `streamlit run my_app.py`
More info: https://docs.streamlit.io/en/stable/streamlit_configuration.html
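As an illustration, the following sketch builds one such table and saves it as a pickle; the embedding model, the example comments, and the exact file name are assumptions, so adjust them to whatever my_app.py actually loads:

```python
import pandas as pd
from sentence_transformers import SentenceTransformer

# Placeholder comments and labels standing in for the real Reddit data
comments = ["Finish the wall!", "Vote blue in November."]
labels = [0, 1]  # who: 0 = trump, 1 = biden

# Model choice is an assumption; the repo also uses doc2vec and USE
model = SentenceTransformer("all-MiniLM-L6-v2")

df = pd.DataFrame({
    "body": comments,                     # comment text, type string
    "vec": list(model.encode(comments)),  # one embedding vector per row
    "who": labels,                        # party label, type int
})

df.to_pickle("df_sbert")  # file name must match what my_app.py expects
```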
Results for sBert, doc2vec, and USE:
- similarity to the reference text: "We should build the wall!" (see the sketch below)
- classification into the correct party
- clusters
- dynamics over time
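For the first two items, here is a minimal sketch of how such results can be computed from the embeddings; sentence-transformers and scikit-learn are assumed, and the comments, labels, and model are placeholders:

```python
from sentence_transformers import SentenceTransformer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics.pairwise import cosine_similarity

model = SentenceTransformer("all-MiniLM-L6-v2")  # model choice is an assumption

comments = [
    "Finish the wall now!",
    "Healthcare is a human right.",
    "Secure the border.",
    "Vote blue this November.",
]
who = [0, 1, 0, 1]  # labels as in the tables above: 0 = trump, 1 = biden

vectors = model.encode(comments)

# Similarity of every comment to the reference text
reference = model.encode(["We should build the wall!"])
scores = cosine_similarity(vectors, reference).ravel()

# Party classification on top of the same embeddings
clf = LogisticRegression().fit(vectors, who)
print(scores, clf.predict(vectors))
```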