Parses through archive and analyses correlations in published articles.

antonbauhofer/archive-crawler

archive crawler with feature extraction

The code in this repository demonstrates a web crawler for a forum whose blog posts are spread over multiple pages. Several features are extracted from the parsed texts, and the correlations between these features are investigated. With a larger dataset, the identified features could be used to train a machine learning model that predicts the popularity of new blog posts.
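The pipeline described above (crawl paginated pages, extract features per post, correlate the features) can be sketched roughly as follows. This is a minimal illustration and not the notebook's actual code: the `<article>` markup, the `fetch` callable, the names `PostParser`, `crawl`, `extract_features` and `pearson`, and the chosen features are all assumptions made for the example. A real crawler would replace the in-memory `PAGES` dictionary with HTTP requests.

```python
from html.parser import HTMLParser
from math import sqrt


class PostParser(HTMLParser):
    """Collects the text of each <article> element (assumed page markup)."""

    def __init__(self):
        super().__init__()
        self.posts = []
        self._inside = False
        self._buf = []

    def handle_starttag(self, tag, attrs):
        if tag == "article":
            self._inside = True
            self._buf = []

    def handle_endtag(self, tag):
        if tag == "article" and self._inside:
            self._inside = False
            # Collapse whitespace so features are not skewed by layout.
            self.posts.append(" ".join("".join(self._buf).split()))

    def handle_data(self, data):
        if self._inside:
            self._buf.append(data)


def crawl(fetch, max_pages=100):
    """Walk pages 1..max_pages; stop when fetch(page) returns nothing."""
    posts = []
    for page in range(1, max_pages + 1):
        html = fetch(page)
        if not html:
            break
        parser = PostParser()
        parser.feed(html)
        parser.close()
        posts.extend(parser.posts)
    return posts


def extract_features(text):
    """Illustrative features only; the notebook may use different ones."""
    words = text.split()
    return {
        "n_words": len(words),
        "mean_word_len": sum(len(w) for w in words) / len(words) if words else 0.0,
        "n_questions": text.count("?"),
    }


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sqrt(sum((x - mx) ** 2 for x in xs))
    sy = sqrt(sum((y - my) ** 2 for y in ys))
    if sx == 0 or sy == 0:
        return float("nan")  # undefined for a constant feature
    return cov / (sx * sy)


# Hypothetical stand-in for the forum: page number -> HTML.
PAGES = {
    1: "<article>Short post?</article><article>A somewhat longer post here</article>",
    2: "<article>One two three four five six</article>",
}

posts = crawl(PAGES.get)
features = [extract_features(p) for p in posts]
r = pearson([f["n_words"] for f in features],
            [f["mean_word_len"] for f in features])
print(f"{len(posts)} posts, corr(n_words, mean_word_len) = {r:.2f}")
```

Passing `fetch` as a parameter keeps the parsing and analysis steps testable without network access. With only three posts the correlation is not meaningful, which is in line with the remark above that a larger dataset would be needed.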

If the Jupyter Notebook does not render, please refer to the PDF.
