Skip to content

Repo with some aggregated information about my DS skills for a resume

Notifications You must be signed in to change notification settings

to505to505/ds_skills

Repository files navigation

Data Science skills

In this repository I have added my machine learning and deep learning homeworks, that can help you get a comprehension of my skills and my knowledges in these topics. Most of the notebooks were originally made by me in Russian and translated with DeepL after. Don't be surprised by the choice of words.

Machine Learning Folder

You can find classical machine learning notebooks and notebooks with basic tools/libraries in Machine Learning folder. Kind of tasks: gradient descent, random forest, knn, feature engineering, etc.
Stack: Pandas, Numpy, sklearn, matplotlib.

Deep Learning Folder

There are deep learning notebooks and notebooks with basic dl libraries in this folder. The first notebook is about basic FC structure (regression task) and Pytorch, the second one is about image classification with my own models (hand-made architecture with BasicBlocks from resnet but adjusted to 64x64 images) and pretrained large models (vision transformers). The third notebook is about text classification with embeddings.
Stack: Pytorch, pytorchLightning, wandb, torchvision.

Random Forest from scratch

There is also a report of the university project attached. In this project we built efficient Random Forest models from scratch in java to solve a complex regression problem of predicting students grades. The complexity was due to the diffrence between available features for various students. Theoretical aspects of statistics behind Random Forest is discussed in the report, as well as the most efficient implementation for our specific task.

Harmful Brain Activity Classification

I have also competed in EEG classification competition on Kaggle with a friend, where we used EfficientNet and spectograms as inputs to solve the problem. We weren't seeking high place in the leaderboard (u can search team named dmitrysakharov in the Leaderboard serchbar), but still the basic pipeline is implemented. https://www.kaggle.com/competitions/hms-harmful-brain-activity-classification/overview
Stack: Pytorch, sklearn

About

Repo with some aggregated information about my DS skills for a resume

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published