On a platform with thousands of items, how do you connect each user to the content that is relevant to them? Recommendation systems answer this question by personalizing the content presented to each user.
When used well, recommendation systems bring significant results. At Netflix, for example, the recommendation system reportedly generates about $1 billion in value per year and is the mechanism behind 80% of the content viewed on the platform.
And it doesn't stop there. E-commerce sites, marketplaces, social networks and search engines like Google also use recommendation systems, with the aim of giving each user a unique browsing experience while increasing conversions.
I built a recommendation system for products on an e-commerce platform (Amazon), with a focus on book recommendations.
The data for the analysis was made available on Kaggle, the data competition platform.
- With the increase in the book market and the diversification of readers' tastes, it becomes challenging to provide reading recommendations that are appropriate and relevant to each user. A personalized recommendation model can increase user satisfaction, boost sales and build customer loyalty by offering suggestions that are aligned with their individual preferences.
- Using machine learning algorithms and recommendation techniques, it will be possible to analyze the behavior and preferences of each user based on their purchase history and interactions with books. The personalized recommendation model will consider this information to make book suggestions that are highly relevant and interesting for each user.
- The solution consists of a book recommendation model that analyzes each user's purchase history and preferences to offer personalized suggestions.
- The sample dataset summarizes the behavior of 3,150 Amazon customers.
- One of the datasets (Ratings) has approximately 1,149,780 reviews.
- Users.csv:
Variable | Definition |
---|---|
User-ID | Customer ID |
Location | Customer location |
Age | Customer age |
- Ratings.csv:
Variable | Definition |
---|---|
User-ID | Customer ID |
ISBN | Book code (ISBN) |
Book-Rating | Book rating (0 to 10) |
- Books.csv:
Variable | Definition |
---|---|
ISBN | Book code (ISBN) |
Book-Title | Book title |
Book-Author | Book author |
Year-Of-Publication | Year the book was published |
Publisher | Publisher |
Image-URL-S | Image URL (small) |
Image-URL-M | Image URL (medium) |
Image-URL-L | Image URL (large) |
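
As a minimal sketch of how the three tables fit together, the snippet below joins ratings to books on `ISBN` and to users on `User-ID`. The rows are made-up stand-ins and the function name `merge_tables` is hypothetical; with the real files, each frame would come from `pd.read_csv`, whose separator and encoding options may need adjusting.

```python
import pandas as pd

# Made-up stand-ins for Users.csv, Ratings.csv and Books.csv.
users = pd.DataFrame({
    "User-ID": [1, 2],
    "Location": ["porto, portugal", "austin, texas, usa"],
    "Age": [34.0, None],
})
ratings = pd.DataFrame({
    "User-ID": [1, 1, 2],
    "ISBN": ["111", "222", "111"],
    "Book-Rating": [8, 0, 10],
})
books = pd.DataFrame({
    "ISBN": ["111", "222"],
    "Book-Title": ["Book A", "Book B"],
    "Book-Author": ["Author A", "Author B"],
})


def merge_tables(users, ratings, books):
    """Attach book and user attributes to every rating (hypothetical helper)."""
    # Inner join: keep only ratings whose ISBN exists in the books table.
    merged = ratings.merge(books, on="ISBN", how="inner")
    # Left join: keep every rating even if the user row is missing.
    return merged.merge(users, on="User-ID", how="left")


full = merge_tables(users, ratings, books)
print(full[["User-ID", "Book-Title", "Book-Rating"]])
```

The inner join on `ISBN` is a design choice for this sketch: ratings that point to a book missing from the books table cannot be shown by title, so they are dropped.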
- We need a minimum number of reviews per user for the machine learning model to work well.
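
One possible way to enforce a minimum number of reviews per user is sketched below with pandas. The threshold value and the function name `filter_active_users` are assumptions for illustration, not values taken from the project.

```python
import pandas as pd


def filter_active_users(ratings: pd.DataFrame, min_ratings: int) -> pd.DataFrame:
    """Keep only the ratings of users with at least `min_ratings` ratings."""
    # For each row, count how many ratings its user has in total.
    counts = ratings.groupby("User-ID")["ISBN"].transform("count")
    return ratings[counts >= min_ratings]


# Made-up ratings: user 1 has three ratings, user 2 has one, user 3 has two.
ratings = pd.DataFrame({
    "User-ID": [1, 1, 1, 2, 3, 3],
    "ISBN": ["a", "b", "c", "a", "b", "c"],
    "Book-Rating": [8, 0, 10, 7, 5, 9],
})

# Hypothetical threshold; the real project would use a much larger value.
active = filter_active_users(ratings, min_ratings=2)
print(sorted(active["User-ID"].unique()))
```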
By utilizing machine learning algorithms, we can create systems that offer personalized suggestions to users, helping them discover new products and increasing customer engagement.
Two Machine Learning techniques were applied:
- Nearest Neighbors: an unsupervised algorithm that relies on distance: it computes the distance between items and treats the closest ones as the most similar.
- Cosine Similarity: a metric commonly used in recommendation systems to measure the similarity between two item or user vectors.
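
The two techniques above can be sketched together as follows. This is an illustration under assumptions, not the project's exact code: the ratings are made up, they are assumed to be already joined with book titles, and the helper names `cosine_similarity` and `recommend` are hypothetical. Each book becomes a vector of user ratings, and scikit-learn's `NearestNeighbors` with `metric="cosine"` ranks books by cosine distance, which is 1 minus the cosine similarity.

```python
import numpy as np
import pandas as pd
from sklearn.neighbors import NearestNeighbors

# Made-up ratings, assumed to be already merged with book titles.
ratings = pd.DataFrame({
    "Book-Title": ["A", "A", "A", "B", "B", "B", "C", "C", "C"],
    "User-ID": [1, 2, 3, 1, 2, 3, 1, 2, 3],
    "Book-Rating": [10, 8, 0, 9, 7, 0, 0, 0, 9],
})

# Book x user matrix: one row per book, one column per user.
matrix = ratings.pivot_table(
    index="Book-Title", columns="User-ID", values="Book-Rating", fill_value=0
)


def cosine_similarity(u, v):
    """Cosine of the angle between two non-zero rating vectors."""
    return float(np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v)))


# Brute-force search is enough for a small matrix.
model = NearestNeighbors(metric="cosine", algorithm="brute")
model.fit(matrix.values)


def recommend(title, n=1):
    """Return the n books whose rating vectors are closest to `title`."""
    query = matrix.loc[[title]].values
    # Ask for one extra neighbor because the book itself is returned too.
    _, indices = model.kneighbors(query, n_neighbors=n + 1)
    return [matrix.index[i] for i in indices[0] if matrix.index[i] != title][:n]


print(recommend("A"))
print(cosine_similarity(matrix.loc["A"].values, matrix.loc["B"].values))
```

Here books A and B are rated highly by the same users, so they end up as each other's nearest neighbors, while book C shares no raters with A and has a cosine similarity of zero to it.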
A Streamlit page was developed that builds a list of suggestions based on an initially chosen book.
Due to the size of the dataset, it was necessary to reduce it to 50 thousand rows.
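
The text does not say how the reduction was done. One option, assuming a reproducible random sample is acceptable, is sketched below; the function name `downsample` and the seed are hypothetical, and the demo uses a small frame in place of the real data.

```python
import pandas as pd


def downsample(df: pd.DataFrame, max_rows: int = 50_000, seed: int = 42) -> pd.DataFrame:
    """Return at most `max_rows` rows, sampled reproducibly."""
    if len(df) <= max_rows:
        return df
    # A fixed random_state makes the sample identical on every run.
    return df.sample(n=max_rows, random_state=seed).reset_index(drop=True)


# Small made-up frame standing in for the full ratings table.
demo = pd.DataFrame({"User-ID": range(200), "Book-Rating": [5] * 200})
small = downsample(demo, max_rows=50)
print(len(small))
```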
The Recommendation System can be accessed through this link: Streamlit link
In this project, we sought to solve the challenge of offering accurate and relevant reading recommendations to users, considering the growth of the book market and the diversification of the reading public. By implementing a personalized book recommendation model, it was possible to create a solution that provides tailored suggestions to users, increasing their satisfaction and engagement with the brand.
Using machine learning algorithms and recommendation techniques such as NearestNeighbors and cosine similarity, the model was able to make relevant book suggestions for each user, taking into account their individual preferences and tastes.
- How to build recommendation systems
- NearestNeighbors technique
- Cosine Similarity technique
- Show images for each recommendation in Streamlit Cloud
- Testing more techniques
- Use other datasets