-
Notifications
You must be signed in to change notification settings - Fork 6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
BLD: need PyPI package eventually #15
Comments
Some options I saw mentioned upstream recently (because SciPy is considering a datasets subpackage--the discussion is quite interesting: scipy/scipy#8707) are: I wonder how much reinventing the wheel we've done with |
Taking a brief look at each, I think I would lean towards pooch. It looks like it has more example use cases and the documentation seems friendly enough. I would have to do some prototyping to see how these things would work for our different use cases (testing, |
On top of us avoiding slow uploads/downloads of a PyPI package, we also should try to help the community reduce bandwidth where reasonable--PyPI annual bandwidth bill is approaching US $25 million: https://dustingram.com/articles/2021/04/14/powering-the-python-package-index-in-2021/ . We can probably also use CI "caching" features to prevent pulling the logs repo/data in fresh on every single CI flush of the main repo as well, though this currently runs pretty quickly anyway. |
We'll eventually need/want a PyPI package for the project so it can leveraged easily in i.e.,
asv
benchmarks. Jakob mentioned that we might want to consider a "lazy" approach where the PyPI package doesn't contain the data files, but rather can be used to download them. Not sure on the complexity/usage tradeoffs there.The text was updated successfully, but these errors were encountered: