Skip to content

Latest commit

 

History

History
32 lines (23 loc) · 2.49 KB

File metadata and controls

32 lines (23 loc) · 2.49 KB

Online Shoppers Purchasing Intention 🛍️

Introduction

A final project of Data Science Bootcamp Batch 20 in Rakamin Academy. In this project, as a team of Data Scientists from an ecommerce company, we were given the task of being able to predict the purchase intention of customers by machine learning. all files attached to this repository are original, as memories of the first data science project as a team

Dataset

This dataset obtained from Kaggle Online Shoppers Purchasing Intention. The dataset consists of feature vectors belonging to 12,330 sessions, was formed so that each sessionwould belong to a different user in a 1-year period to avoid any tendency to a specific campaign, special day, userprofile, or period.

Problem Statement

On the company's website there are 3 categories of visitors, namely returning visitors, new visitors and other. The number of returning visitors is much higher than new visitors, but the conversion rate for returning visitors is lower than new visitors

Goal

Increase revenue (conversion rate) especially in the returning visitor category by following up predicted users will make purchases/transactions with Call To Action and provide promos to targets.

Objective

Build machine learning model to predict which users visit a website with intention to buy or not.

Business Metrics

  • Conversion Rate (conversion rate of returning visitors).
  • Company revenue before and after modeling.

Stage

In this project, we divide into 4 stages:

  1. Stage 1 - Preparation: We study the selected project and dataset. The key takeaways in this stage are who we are in the project, the problem statement, the goals and objectives we want to achieve and lastly are the business metrics.
  2. Stage 2 - EDA: We start to reach the dataset to get the characteristics of the data. We separated the process into 3 steps, starting from data exploration, EDA, and gathering insights on datasets related to the main problem and objectives.
  3. Stage 3 - Preprocessing: We handle the data to be the cleanest data before starting the modeling process. We address outliers, feature encoding, and determine which feature engineering we choose.
  4. Stage 4 - Supervised Learning: We enter the modeling process and explore several algorithms to improve based on the features and targets we have selected.