GitHub - nogibjj/Kaisen_Yao_IDS706_Week3_Individual

Project #1: Continuous Integration using Gitlab Actions of Python Data Science Project

Youtube Video Link

Directory Tree Structure

Kaisen_Yao_IDS706_Week3_Individual/
├── .devcontainer/
│   ├── devcontainer.json
│   └── Dockerfile
├── .github/
│   └── workflows/
│       ├── format.yml
│       ├── install.yml
│       ├── lint.yml
│       └── test.yml
├── .gitignore
├── Dockerfile
├── LICENSE
├── main.ipynb
├── main.py
├── Makefile
├── mylib/
│   ├── __init__.py
│   └── lib.py
├── README.md
├── repeat.sh
├── requirements.txt
├── setup.sh
├── test_lib.py
└── test_main.py

Purpose of Project

The purpose of this project is to build upon the last three mini-projects to simulate best practices of continuous integration in Data Science projects. The project uses a dataset that provides an urbanization index for U.S. congressional districts. It contains details like urbanization index, rural and urban population distributions, and partisan lean.

Preparation

Open codespaces
Wait for container to be built and pinned requirements from requirements.txt to be installed
If running locally, git clone the repository and use make install

Check format and test errors

Format code make format
Lint code make lint
Test code make test

Descriptive statistics and vizualizations

Whenever code is pushed to the repository, the following will be automatically generated and committed via GitHub Actions:

Descriptive statistics of the dataset.
Visualizations, including:

Urbanization Index Distribution (Histogram)
Urbanization Grouping Over Time (Line Chart)
Population Distribution by District Type (Bar Chart)

The descriptive statistics and vizualizations are generated whenever an individaul pushes to my repository via actions-user using make generate_and_push. You can find them here descriptive statistics and vizualizations

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project #1: Continuous Integration using Gitlab Actions of Python Data Science Project

Youtube Video Link

Directory Tree Structure

Purpose of Project

Preparation

Check format and test errors

Descriptive statistics and vizualizations

References

About

Releases

Packages

Contributors 2

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.devcontainer		.devcontainer
.github		.github
mylib		mylib
.DS_Store		.DS_Store
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
district_population_distribution.png		district_population_distribution.png
main.ipynb		main.ipynb
main.py		main.py
repeat.sh		repeat.sh
requirements.txt		requirements.txt
results.png		results.png
setup.sh		setup.sh
summary.md		summary.md
test_lib.py		test_lib.py
test_main.py		test_main.py
urbanization_groups.png		urbanization_groups.png
urbanization_index_distribution.png		urbanization_index_distribution.png

License

nogibjj/Kaisen_Yao_IDS706_Week3_Individual

Folders and files

Latest commit

History

Repository files navigation

Project #1: Continuous Integration using Gitlab Actions of Python Data Science Project

Youtube Video Link

Directory Tree Structure

Purpose of Project

Preparation

Check format and test errors

Descriptive statistics and vizualizations

References

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages