🔎 📚 A fully functional Data Mining project based on movies and shows from Netflix.

spChalk/Movie-Recommendation-System

🔎 📚 Movie-Recommendation-System

Description

Movie Recommendation System is a simple yet fully functional Data Mining project. It showcases the stages that raw data pass through before they are finally used to suggest movies based on a set of given preferences. These preferences can be either movie / TV show titles or free-text descriptions that capture a viewer's taste.

This project was developed to support a study of the movies available on the well-known platform Netflix.

Project's Structure

The project is separated into four (4) main parts:

  1. Data preprocessing (missing data handling, lemmatization).
  2. Studying the data and extracting statistics, in order to fully understand the provided datasets.
  3. Implementation of the recommendation system (using Bag-of-Words (BoW) and TF-IDF models combined with the Jaccard-Tanimoto coefficient and cosine similarity).
  4. Running the program and displaying the final results.
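
To make part 3 more concrete, here is a minimal sketch of how TF-IDF weighting with cosine similarity, and the Jaccard coefficient on token sets, can rank titles against a free-text preference. It is not the project's actual code: the function names, the sample catalog, and the specific TF-IDF formula (`tf * (log(N / df) + 1)`) are assumptions chosen for illustration, and the tokenizer skips the lemmatization step from part 1.

```python
import math
from collections import Counter

# Hypothetical catalog: titles and descriptions invented for this sketch.
CATALOG = {
    "Space Quest": "A crew of astronauts explores a distant planet.",
    "Baking Duel": "Amateur bakers compete in a cake contest.",
    "Star Drift": "Astronauts drift through space after their ship fails.",
}

PUNCTUATION = ".,!?;:()\"'"


def tokenize(text):
    """Lowercase, split on whitespace, strip basic punctuation (no lemmatization)."""
    tokens = (tok.strip(PUNCTUATION) for tok in text.lower().split())
    return [tok for tok in tokens if tok]


def jaccard(tokens_a, tokens_b):
    """Jaccard coefficient |A & B| / |A | B| on the two token sets."""
    a, b = set(tokens_a), set(tokens_b)
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def tfidf_vectors(docs):
    """Return one sparse {term: weight} dict per tokenized document."""
    n_docs = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vectors.append(
            {t: (c / len(doc)) * (math.log(n_docs / df[t]) + 1.0) for t, c in tf.items()}
        )
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def recommend(query, catalog, top_n=2):
    """Rank catalog titles by TF-IDF cosine similarity to a free-text query."""
    titles = list(catalog)
    docs = [tokenize(catalog[t]) for t in titles] + [tokenize(query)]
    vectors = tfidf_vectors(docs)
    query_vec = vectors[-1]
    scored = [(cosine(query_vec, vec), title) for vec, title in zip(vectors, titles)]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [title for _, title in scored[:top_n]]
```

Calling `recommend("astronauts space mission", CATALOG)` ranks the two space-themed entries ahead of the baking one, because they share weighted terms with the query. A BoW variant would use raw term counts in place of the TF-IDF weights, and `jaccard` can replace `cosine` when only term presence matters.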

Data

The project's data consist of three (3) .csv files (netflix_titles.csv, IMDb movies.csv, IMDb ratings.csv), which can be found under the Movie-Recommendation-System/data/ directory in the project's repository. They contain:

  1. Movie and TV Show titles from Netflix.
  2. Movie information from IMDb.
  3. Movie ratings from IMDb.
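
As a rough illustration of the missing data handling mentioned in the preprocessing step, the sketch below reads a CSV file into dictionaries and replaces empty cells with a placeholder. The helper name `load_rows`, the placeholder value, and the column names in the sample are assumptions made for this example. The project may handle missing values differently (for instance by dropping rows).

```python
import csv
import io


def load_rows(fileobj, placeholder="unknown"):
    """Read CSV rows as dicts, replacing empty or missing cells with a placeholder."""
    rows = []
    for row in csv.DictReader(fileobj):
        cleaned = {}
        for key, value in row.items():
            if key is None:
                # Extra cells beyond the header row are ignored in this sketch.
                continue
            text = value.strip() if value else ""
            cleaned[key] = text if text else placeholder
        rows.append(cleaned)
    return rows


# Hypothetical sample with invented column names and values.
SAMPLE = (
    "title,director,description\n"
    "Example Show,,A short description.\n"
    "Another Film,Jane Doe,\n"
)

rows = load_rows(io.StringIO(SAMPLE))
```

For a real file, the same helper could be called as `load_rows(open("data/netflix_titles.csv", newline="", encoding="utf-8"))`, assuming the file is UTF-8 encoded and the command is run from the repository root.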

Collaborators

License

MIT
