MANU-Internship-Project

In the following project, we will look at creating a book recommendation system, where we will cover all the key steps for its creation.

The data was taken from Kaggle https://www.kaggle.com/datasets/mohamedbakhet/amazon-books-reviews/data?select=books_data.csv and only a part of this massive data set was used.

We have visualized the data so that the reader has a clearer picture of what type of data it is, as well as for help in cleaning the data and reducing it. In this same section, we analyze the sentiment to have a clearer picture of the data.

Furthermore, we extract all important properties from the data, such as book genre, author, and publisher. Using the name and description of the books, we used the RoBERTa model to add a vector representation of them.

With that, the next step is to create the graph using Pytorch Geometric. With this, we create a computer representation of our knowledge graph on which we will further train and test the models - GraphSAGE, GAT and Transformers. After a long period of training - we test and compare all models using the hit metric. From our results we get a high level of accuracy and a good sign that this problem could be (and already is) useful and used in real life on a huge amount of data from different fields.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
MANU_Internship_Research.ipynb		MANU_Internship_Research.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MANU-Internship-Project

About

Releases

Packages

Languages

Saveska/MANU-Internship-Project

Folders and files

Latest commit

History

Repository files navigation

MANU-Internship-Project

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages