Skip to content
View aaalexlit's full-sized avatar

Block or report aaalexlit

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
aaalexlit/README.md

Hi, I'm Alex ๐Ÿ‘‹

I'm a seasoned Software Developer and I'm interested in everything ML/AI

๐ŸŒฑ Iโ€™m currently learning with DataTalks Club Data Engineering Zoomcamp

My recent projects:

LLM-powered Question Answering Slack bot

to accompany 3 Zoomcamps by DataTalksClub

Project Repo

Zoomcamp QA Bot

Behind the scenes

DE and ML Zoomcamps branch

Tech stack

Connecting the dots
Llamaindex
Vertor DB
Milvus and Zilliz (Cloud-Native Milvus)
Orchestration
Prefect
Embeddings
BAAI/bge-small-en-v1.5
Re-ranker
Cohere re-ranker

The ingestion

MLOps Zoomcamp branch

Course FAQ Google Document and the Course repo get indexed to the Pinecone vector store.
Then semantic search retrieves the most similar (and hopefully most relevant) pieces to the question asked.
Then this information is passed as a context to a conversational LLM to form the final answer.

The workflow

Components

The Star of the show
LangChain
Vertor DB
Pinecone
Orchestration
Prefect
Semantic Search
Sentence Transformers

Alternate Text


Blood Vessel Segmentation

Fine-tune Ultralytics YOLOv8 segmentation model on 3D Hierarchical Phase-Contrast Tomography (HiP-CT) data from human kidneys to segment blood vessels.

Project Repo

Hacking the Human Vasculature

EDA => Training => Hyperparameter tuning => Deployment as a service (FastAPI) => Containerization => Deployment to AWS EKS

FastAPI Service screenshot


Diabetes classification model training and deployment

Project Repo

MLZoomcamp Midterm project


End-to-end MLOps Pipeline

Workflow diagram

Project Repos

  1. Training pipeline
  2. Prediction web service
  3. Prediction web service IAC

Climate change-related news articles Scientific verification

A tool and an API for Climate change-related news articles Scientific verification and Global warming stance detection

Components

NLP Framework
Haystack
Vertor DB
FAISS
API
FastAPI
UI
Streamlit
Semantic Search
Sentence Transformers

Application UI

developed as a part of Omdena's Detecting Bias in Climate Reporting in English and German Language News Media Local Chapter Challenge

Project Repos

  1. WebApp repo
  2. API repo
  3. Auxiliary repo 1 Auxiliary repo 2

Alex's GitHub stats
Top Langs

Pinned Loading

  1. faq-slack-bot faq-slack-bot Public

    LLM-powered RAG Question Answering Slack bot for DataTalksClub Zoomcamps

    Python 51 8

  2. tube-to-text-coach tube-to-text-coach Public

    A fitness companion app that transforms YouTube videos featuring spoken follow-along routines into written, step-by-step exercise instructions

    Python

  3. capitalbikeshare-mlops capitalbikeshare-mlops Public

    MLOps Zoomcamp course final project. End-to-end MLOps pipeline

    Jupyter Notebook 6

  4. hacking-human-vasculature hacking-human-vasculature Public

    Project for SenNet + HOA - Hacking the Human Vasculature in 3D Kaggle competition + ML Zoomcamp final project

    Jupyter Notebook 2 1

  5. DEMO-image-search-dishes DEMO-image-search-dishes Public

    Forked from weaviate-tutorials/DEMO-image-search-dishes

    Python

  6. ml_zoomcamp_midterm_cdc_diabetes ml_zoomcamp_midterm_cdc_diabetes Public

    Diabetes classification model training and deployment. DataTalks.Club Machine Learning Zoomcamp Midterm project.

    Jupyter Notebook