Virality Predictor using Kaggle's 'Articles sharing and reading from CI&T DeskDrop'

mirayyuce/Virality_Predictor

Virality Predictor project using the Kaggle dataset 'Articles sharing and reading from CI&T DeskDrop': https://www.kaggle.com/gspmoreira/articles-sharing-reading-from-cit-deskdrop?select=shared_articles.csv

Instructions:
A folder called Virality_Predictor is expected to be under the home directory:
~/Virality_Predictor/
Folder contents:
~/Virality_Predictor/
    Collaborative_Filtering_EN.ipynb
    Collaborative_Filtering_EN_PT.ipynb
    Collaborative_Filtering_PT.ipynb
    Collaborative_Filtering_Utils.py
    TFIDF-Regression-EN.ipynb
    TFIDF-Regression-PT.ipynb
    TFIDF_Regression_Utils.py
    TFIDF_Classification_EN.ipynb
    TFIDF_Classification_PT.ipynb
    TFIDF_Classification_Utils.py
    Utils.py
    data_analysis_articles.ipynb
    data_analysis_users.ipynb
    nltk_data/
        corpora
    datasets/
        cleaned_articles_test_EN_text.csv
        cleaned_articles_test_EN_upsampled_text.csv
        cleaned_articles_test_PT_text.csv
        cleaned_articles_test_PT_upsampled_text.csv
        cleaned_articles_train_EN_text.csv
        cleaned_articles_train_EN_upsampled_text.csv
        cleaned_articles_train_PT_text.csv
        cleaned_articles_train_PT_upsampled_text.csv
        shared_articles.csv
        users_interactions.csv
    models/
        CF_EN_PT_norm.pkl
        CF_EN_PT_raw.pkl
        CF_PT_norm.pkl
        CF_PT_raw.pkl
        Classification_EN_pipeline.pkl
        Classification_PT_pipeline.pkl
        Regression_EN_pipeline.pkl
        Regression_PT_pipeline.pkl
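Because the notebooks expect this layout, it can help to check it before starting Jupyter. The following is only a minimal sketch: the helper name missing_paths and the choice to check just a handful of entries are assumptions made for illustration, not part of the project.

```python
from pathlib import Path

# A subset of the entries from the folder listing; which ones to check
# is an illustrative choice, not something the project itself defines.
EXPECTED = [
    "Utils.py",
    "datasets/shared_articles.csv",
    "datasets/users_interactions.csv",
    "models",
    "nltk_data/corpora",
]


def missing_paths(root=None):
    """Return the expected entries that do not exist under root.

    root defaults to ~/Virality_Predictor, the location the README asks for.
    """
    root = Path(root) if root is not None else Path.home() / "Virality_Predictor"
    return [rel for rel in EXPECTED if not (root / rel).exists()]


if __name__ == "__main__":
    missing = missing_paths()
    if missing:
        print("Missing:", ", ".join(missing))
    else:
        print("Layout looks complete.")
```

An empty result means every checked entry was found. Anything listed should be put in place before opening the notebooks.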
To run the models, run the ‘jupyter notebook’ command from the command line and open the notebook you need. The notebooks show the latest state of the models. The saved models are under the models/ directory.
 
Each .ipynb file corresponds to one problem; for example, Collaborative_Filtering_EN.ipynb covers collaborative filtering on the articles in English.
Each Jupyter notebook contains a commented-out cell with the command for loading the corresponding model.
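The exact loading command is the one in each notebook's commented cell and is not reproduced here. As a rough sketch of what loading a saved model might look like, assuming the .pkl files were written with Python's standard pickle module (if they were saved with another tool such as joblib, that tool's loader would be needed instead), a hypothetical helper load_model could be:

```python
import pickle
from pathlib import Path


def load_model(file_name, models_dir=None):
    """Unpickle one saved model, e.g. 'Classification_EN_pipeline.pkl'.

    models_dir defaults to ~/Virality_Predictor/models. Assumes the file
    was created with the standard pickle module.
    """
    if models_dir is None:
        models_dir = Path.home() / "Virality_Predictor" / "models"
    with open(Path(models_dir) / file_name, "rb") as fh:
        # Only unpickle files you trust: pickle can run arbitrary code.
        return pickle.load(fh)
```

Unpickling only works when the libraries the model was built with are importable, so this should be run inside the project's virtualenv.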
The packages below are installed in the project’s virtualenv.
