Skip to content
View DSKunth's full-sized avatar

Block or report DSKunth

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
DSKunth/README.md

My Bio

Grey Minimalist Designer Linkedin Banner

Hi there! 👋 Greetings! I am Dorothy, and here is my brief bio:

  • I am a finance-focused Data Analyst and my favorite tools are Python, SQL, and Tableau.
  • I participated in an immersive, six-month data analyst/data engineer training program at Ethos AI/Women AI Academy.
  • I became interested in data science during my parental break in 2022, so I expanded my skills and explored the sophisticated technologies surrounding data. Since then I have been learning about the convergence between machine learning, digital innovation, cloud computing, business intelligence, and data science via MOOC platforms (Udacity, DataCamp, and Coursera).
  • I am an Accounting and Business Analyst with a strong affinity for IT and a passion for systems and process improvements.

Learning path:

  • Data Science:
    • Python - Pandas, Numpy, Matplotlib, Seaborn
    • R and RStudio
    • Data Pre-processing
    • Exploratory Data Analysis
    • Data Visualization
    • A/B Testing
    • Intro to Machine Learning - Supervised, Unsupervised, NLP
  • Data Engineering - Data Modelling, Database Design, Data Pipelines
  • SQL and Databases - Google BigQuery, MySQL, PostgreSQL, Cassandra
  • Business Intelligence - Power BI, Tableau
  • Tools and Technologies: Jupyter, Google Colab, PyCharm, DBeaver, Docker, Kafka, Spark, Amazon Redshift, Amazon S3, Google BigQuery, Version control (Git) & GitHub

Portfolio

  • View my portfolio of projects here.

Let's Connect!

Pinned Loading

  1. ABC-Product-Segmentation ABC-Product-Segmentation Public

    The original dataset is a year's worth of electronics sales transactions. It has around 185K records and 11 attributes. Business Objective: Identify the products which generated 80% of profit.

    Jupyter Notebook 2

  2. ETL-Pipeline ETL-Pipeline Public

    This repository contains tasks on how to build an ETL pipeline for the online transaction data of an e-commerce company.

    Python 4 1

  3. Customer-Segmentation Customer-Segmentation Public

    This repository covers data extraction from Amazon Redshift, data preprocessing, exploratory data analysis, and customer segmentation based on RFM using percentile ranking and Kmeans clustering.

    Jupyter Notebook

  4. Deforestation-Exploration Deforestation-Exploration Public

    Create views, simple and complex SQL queries to answer questions from the fictitious ForestQuery management about the deforestation’s global situation, regional outlook, and country-level details

  5. Data-Modeling-with-Postgres Data-Modeling-with-Postgres Public

    Project tasks: create a Postgres database with tables designed to optimize queries on song play analysis,create a database schema by defining fact and dimension tables for a star schema and build a…

    Jupyter Notebook

  6. Product-Range-Analysis Product-Range-Analysis Public

    Masterschool's capstone project integrating skills and tools for data analysis.

    Jupyter Notebook