-
-
Notifications
You must be signed in to change notification settings - Fork 168
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MNT: Cleaning the data folder #281
Comments
This issues requires at least a little discussion before starting applying new changes. @giovaniceotto could we help with some insights here? |
Great suggestions @Gui-FernandesBR. I agree that the data folder may no longer be needed. We can move some of the files used for tests to the The only point I disagree with is creating a separate repository for examples. While this would make this repository lighter, it would be a nightmare to manage. Imagine having to sync two separate repositories so that the examples can always run with the latest RocketPy version. I do not believe this is worth the effort. |
Nooo, did I say that? sorry hahahaha What I think we could do is:
|
To give a context: I discussed a little bit with your colleagues on PythonBR 2024 and they welcome contributions. I also mentioned that I would be mostly interested in CI, lib infrastructure and code maintenance in general... Regarding the data folder, why I think it is a serious problem? I started to clone the project on wifi and it took forever on a slow broadband (depending on the context it will make possible contributors to run away). In my home setup it was necessary to plug in a cable to download the project! Adding to this discussion: I used My guess is that it could potentially reduce the size of the whole repository by 90%. I can devote some time investigating (and eventually implementing or helping the team to implement edit: just corrected the name of the plugin. RocketPy on master is 📦 v1.6.1 via 🐍 v3.12.3 (rocketpy-devenv)
❯ cloc .
496 text files.
470 unique files.
75 files ignored.
github.com/AlDanial/cloc v 1.98 T=0.86 s (548.6 files/s, 546445.3 lines/s)
-------------------------------------------------------------------------------
Language files blank comment code
-------------------------------------------------------------------------------
CSV 43 0 0 218429
Python 167 7218 21945 26633
Text 22 3 0 12145
SVG 9 42 43 10185
reStructuredText 183 3093 4180 3488
Jupyter Notebook 17 0 154818 3350
Markdown 7 292 26 579
JSON 2 0 0 348
YAML 11 41 15 289
MATLAB 1 25 137 115
XML 1 0 0 86
TOML 1 12 2 77
CSS 1 15 3 62
make 2 16 8 40
DOS Batch 1 8 1 26
HTML 1 25 256 14
Dockerfile 1 7 12 11
-------------------------------------------------------------------------------
SUM: 470 10797 181446 275877
-------------------------------------------------------------------------------
RocketPy on master is 📦 v1.6.1 via 🐍 v3.12.3 (rocketpy-devenv)
❯ du -h --max-depth=1 --total .
1,9M ./rocketpy
829M ./.git
1,8M ./tests
60K ./.github
16K ./.vscode
162M ./data
40M ./docs
1,1G .
1,1G total
RocketPy on master is 📦 v1.6.1 via 🐍 v3.12.3 (rocketpy-devenv)
❯ cloc tests/ data/
130 text files.
115 unique files.
30 files ignored.
github.com/AlDanial/cloc v 1.98 T=0.42 s (271.8 files/s, 546528.9 lines/s)
-------------------------------------------------------------------------------
Language files blank comment code
-------------------------------------------------------------------------------
CSV 43 0 0 218429
Python 66 1857 2708 6851
Text 6 3 0 1367
-------------------------------------------------------------------------------
SUM: 115 1860 2708 226647
------------------------------------------------------------------------------- |
Yes, this is a known issue for a while now. We always thought that not a lot of people actually clones the repo and those who does may be okay with long download times. We did a big mistake saving Implementing git LFS would be a good idea, but how would non-experienced developers react to it? What would be the possible development overhead with such addition? Alternatively, I wonder if we could simply delete some files from git history. Regarding contributions, we are really excited with new contributions coming from you. |
I am not used with discord and couldn't find the discord link on the project documentation. Can you please provide it? |
Just go through our readme please, "join our community" section. |
Linking this thread here: https://stackoverflow.com/questions/2116778/reduce-git-repository-size |
I just realized that my RocketPy local repo is more than 3.5 GB large. Gonna try cleaning this up |
I tried running
Then I remembered that .venv folders usually consumes a lot of space!
From the 1.14 GB that I currently have, the .git folder is weighting 768 MB (67%), so it is more than clear to me that the problem is related to our git tree. |
@aureliobarbosa I've raised issue #727 to deal specifically with the repo size reduction task. As of this current issue, the goal was to refactor both @aureliobarbosa , let's collaborate and work together on the new #727 issue! |
What I propose is: (soft suggestions)
The text was updated successfully, but these errors were encountered: