GitHub

Youtube Scrapping Project.

This project implements a method for scrapping youtube search result using selenium. Selenium is a perform document that allows for automation of website by the use of webdriver.

Sofware Engineering Technologies Employed:

selenium: Automating website scrolling to get a large source html to parse. The html is parse using find_element* methods provided in selenium. We used an xpath. Inspected the html and obtained the video id
Youtube api: Use the video id to query youtube database using the api; to get all the information necessary for downstream application.
hydra: abstract the project configuration like json file containing credentials, website headers.

Data Science Technologies used:

lemmatizaton
Text representation: BOW, Term Frequency and Inverse Document Frequency(TFiDF)

Application:

The idea of this project is to understand the current issues being discussed on youtube videos for a particular country. The selenium returns the search result for a particular African country. The data is then analyzed to obtain to gain insights into data. Analysis considered are as follows:

EDA: give a higher overview of the data
Topic analysis: Get the top topics discussed in youtube videos for a country.
Document Clustering: understand the kind of clusters available

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.github/workflows		.github/workflows
src		src
.gitignore		.gitignore
Readme.md		Readme.md
youtube_analysis.ipynb		youtube_analysis.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Youtube Scrapping Project.

Sofware Engineering Technologies Employed:

Data Science Technologies used:

Application:

About

Releases

Packages

Languages

rlesiyon/youtube_scrapping

Folders and files

Latest commit

History

Repository files navigation

Youtube Scrapping Project.

Sofware Engineering Technologies Employed:

Data Science Technologies used:

Application:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages