Automatically backup your GitHub repositories to your local machine.
This tool is designed to automatically pull the list of GitHub repositories from one, or more, GitHub organizations and clone (or fetch) them to your local machine. It is designed to be run as part of a scheduled backup process with the ultimate goal of ensuring that you have a local copy of all of your GitHub repositories should the unthinkable happen.
- Backup Multiple Organizations, automatically gathering the full list of repositories for each organization through the GitHub API.
- Backup Starred Repos, automatically gathering the full list of your starred repositories
- Repo Allowlists/Denylists to provide fine-grained control over which repositories are backed up and which are not.
- GitHub Enterprise Support for those of you running your own GitHub instances and not relying on GitHub.com.
# Run the tool directly
./github-backup --config config.yaml
# Or run it in a container
docker run \
-v $(pwd)/config.yaml:/config.yaml \
-v $(pwd)/backups:/backups \
ghcr.io/SierraSoftworks/github-backup:main \
--config /config.yaml
# Run a backup every hour (will use `git fetch` for existing copies)
# You can also omit this if you want to run a one-shot backup
schedule: "0 * * * *"
backups:
- kind: github/repo
from: user # The user associated with the provided credentials
to: /backups/personal
credentials: !UsernamePassword:
username: "<your username>"
password: "<your personal access token>"
properties:
query: "affiliation=owner" # Additional query parameters to pass to GitHub when fetching repositories
- kind: github/repo
from: "users/another-user"
to: /backups/friend
credentials: !Token "your_github_token"
- kind: github/repo
from: "orgs/my-org"
to: /backups/work
filter: '!repo.fork && repo.name contains "awesome"'
- kind: github/release
from: "orgs/my-org"
to: /backups/releases
filter: '!release.prerelease && !asset.source-code'
# You can also backup single repositories directly if you wish
- kind: github/repo
from: "repos/my-org/repo"
to: /backups/work
# This is particularly useful for backing up release artifacts for
# specific projects.
- kind: github/release
from: "repos/my-org/repo"
to: /backups/releases
filter: '!release.prerelease'
In addition to the standard logging output, this tool also supports reporting metrics to an OpenTelemetry-compatible backend. This can be useful for tracking the performance of the tool over time and configuring monitoring in case backups start to fail.
Configuration is conducted through the use of environment variables:
OTEL_EXPORTER_OTLP_ENDPOINT=https://your-otel-collector:4317
OTEL_EXPORTER_OTLP_HEADERS=X-API-KEY=your-api-key
OTEL_TRACES_SAMPLER=traceidratio
OTEL_TRACES_SAMPLER_ARG=1.0
This tool allows you to configure filters to control which GitHub repositories are backed up and
which are not. Filters are used within the backups
section of your configuration file and can
be specified on a per-user or per-organization basis.
When writing a filter, the goal is to write a logical expression which evaluates to true
when
you wish to include a repository and false
when you wish to exclude it. The filter language supports
several operators and properties which can be used to control this process.
For kind: github/repo
and kind: github/star
Field | Type | Description (Example) |
---|---|---|
repo.name |
string |
The name of the repository (Hello-World) |
repo.fullname |
string |
The full-name of the repository (octocat/Hello-World) |
repo.private |
boolean |
Whether the repository is private |
repo.public |
boolean |
Whether the repository is public |
repo.fork |
boolean |
Whether the repository is a fork |
repo.size |
integer |
The size of the repository, in kilobytes (1024). |
repo.archived |
boolean |
Whether the repository is archived |
repo.disabled |
boolean |
Returns whether or not this repository disabled |
repo.default_branch |
string |
The default branch of the repository (main) |
repo.empty |
boolean |
Whether the repository is empty (When a repository is initially created, repo.empty is true ) |
repo.template |
boolean |
Whether this repository acts as a template that can be used to generate new repositories |
repo.forks |
integer |
The number of times this repository is forked |
repo.stargazers |
integer |
The number of people starred this repository |
For kind: github/release
Field | Type | Description (Example) |
---|---|---|
release.tag |
string |
The name of the tag (v1.0.0) |
release.name |
string |
The name of the release (v1.0.0) |
release.draft |
boolean |
Whether the release is a draft (unpublished) release |
release.prerelease |
boolean |
Whether to identify the release as a prerelease or a full release |
release.published |
boolean |
Whether the release is a published (not a draft) release |
asset.name |
string |
The file name of the asset (github-backup-darwin-arm64) |
asset.size |
integer |
The size of the asset, in kilobytes. (1024) |
asset.downloaded |
boolean |
If the asset was downloaded at least once from the GitHub Release |
Here are some examples of filters you might choose to use:
!repo.fork || !repo.archived || !repo.empty
- Do not include repositories which are forks, archived, or empty.repo.private
- Only include private repositories in your list.repo.public && !repo.fork
- Only include public repositories which are not forks.repo.name contains "awesome"
- Only include repositories which have "awesome" in their name.(repo.name contains "awesome" || repo.name contains "cool") && !repo.fork
- Only include repositories which have "awesome" or "cool" in their name and are not forks.!release.prerelease && !asset.source-code
- Only include release artifacts which are not marked as pre-releases and are not source code archives.repo.name in ["git-tool", "grey"]
- Only include repositories with the names "git-tool" or "grey".repo.stargazers >= 5
- Only include repositories with at least 5 stars.