-
Notifications
You must be signed in to change notification settings - Fork 77
Contributing
Typically there are three types of people who use GitHub to access source code: users, developers, and maintainers.
Users simply want access to the latest code. Developers want to make some changes to the code. Maintainers are the ones who make sure the changes to the code make sense, and incorporate them into the code base.
If you want to use software that is hosted on GitHub and get updates when they are released, but you don't think it's likely that you will ever want to fix bugs or add new features yourself, then probably you are a User (you can upgrade yourself to a Developer later if you need to).
Users will want to take the "official" version of the software, make a copy of
it on their own computer, and run the code from there. Using Stormwater-Management-Model
repository as an example, this is done on the command line like this:
$ git clone https://github.com/OpenWaterAnalytics/Stormwater-Management-Model.git
$ cd Stormwater-Management-Model
The git clone
command will create a folder Stormwater-Management-Model
in the current
directory to store the source code. The only thing you really need to decide first is where on
your computer to store the source code. When you want to update Stormwater-Management-Model
to a
newer version of the source code and run it, just go back to that folder in your terminal
and type:
$ git pull
What is actually happening under the hood when you do these things? In addition
to making you a local copy of the source code, the initial git clone
command sets up a relationship between that folder on your computer and the
"origin" of the code. You can see this by typing:
$ git remote -v
origin https://github.com/OpenWaterAnalytics/Stormwater-Management-Model.git (fetch)
origin https://github.com/OpenWaterAnalytics/Stormwater-Management-Model.git (push)
This tells you that git
knows about two "remote" addresses of
Stormwater-Management-Model
: one to fetch
new changes from (if the source code gets updated
by someone else), and one to push
new changes to (if you make changes that
you want to share). In this case the two addresses both point to the "official"
version of the Stormwater-Management-Model
software, and because you're a User, you won't
actually be allowed to push
any changes you make: the remote
copy that
your computer calls origin
is picky about who it allows to make changes
(i.e., only the Maintainers can).
The initial git clone
command also automatically gives your local copy
of the code a "branch" name. You probably won't need to use this functionality,
but the short explanation is that git
allows you to add, delete, and
change files within the Stormwater-Management-Model
directory and then easily go back to where
you started from in case your changes didn't work. One way git
enables
this is by having different "branches" of the same source code, and by default
it creates a branch called "master" when it first clones the remote repository
to your computer:
$ git branch
* master
The asterisk means that master
is the currently active branch. Later, if you
update Stormwater-Management-Model
to a newer version using git pull
, what git
is really doing is git pull origin master
, i.e., pulling changes that
exist in the origin
copy of Stormwater-Management-Model
into your local master
branch.
One of the most challenging concepts with git
source code control
is the fact that its job is to seamlessly manage multiple different versions
of a code base. The idea is that git
allows you to swap back and
forth between these different versions while working within the same directory
structure.
In other words, typically there is only one repository for a given project on a developer's local machine. That repository can have multiple branches within it.
When a developer switches between branches, they are effectively switching
between different copies of that repository. The git
protocol thus
modifies or updates files in-place, and you can keep track of different versions
of the files by keeping track of branch names, instead of storing temporary
copies in other folders, or appending _newVersion
to the filename.
In addition to each developer having a local git repository, they usually become
associated with some number of remote repositories on
GitHub. The developer's local repository can "know"
about any number of remote repositories, and users get to name those remote
repositories whatever they want. Here is a diagram of the connections between
the remote repositories on GitHub and the local computers of each of three
collaborators on the Stormwater-Management-Model
project:
Developers (like blink1073
in the diagram) typically first go to the
official repo of the project
and fork the repository so
that their changes can be sandboxed. The convention is to then set up their
local copy of the codebase with their own fork as the remote origin
, and
connect to the official remote repo with the name upstream
. So after forking
Stormwater-Management-Model to his own GitHub account, user
blink1073
would run::
$ git clone https://github.com/blink1073/Stormwater-Management-Model.git
$ cd Stormwater-Management-Model
$ git remote add upstream https://github.com/OpenWaterAnalytics/Stormwater-Management-Model.git
Now this user has the standard origin
/upstream
configuration, as seen
below. Note the difference in the URIs between origin
and upstream
:
$ git remote -v
origin https://github.com/blink1073/Stormwater-Management-Model.git (fetch)
origin https://github.com/blink1073/Stormwater-Management-Model.git (push)
upstream git://github.com/OpenWaterAnalytics/Stormwater-Management-Model.git (fetch)
upstream git://github.com/OpenWaterAnalytics/Stormwater-Management-Model.git (push)
$ git branch
* master
URIs beginning with git://
are read-only connections, so blink1073
can
pull down new changes from upstream
, but won't be able to directly push his
local changes to upstream. Instead, he would have to push to his fork
(origin
) first, and create a
pull request.
For example, to add some trivial file:
$ git branch fix_branch
$ git branch
fix_branch
* master
$ git checkout fix_branch
$ git branch
* fix_branch
master
$ touch foo.txt
$ git add foo.txt
$ git commit -am 'FIX: Add missing foo file'
$ git push origin fix_branch
This creates a new branch called fix_branch
on the local machine, checks out
that branch, adds a file, commits the change to the branch, and then pushes the
branch to a new remote branch (also called fix_branch
) on the origin
repo (i.e., their fork of the official repo). blink1073
could then navigate
to the website of the upstream repository
and they would find a nice Pull Request button available.
Maintainers would then typically comment on the pull request and ask for
some changes. For example, maybe the user forgot to also add the necessary
bar
file. The user could then do:
$ git branch
* fix_branch
master
$ touch bar.txt
$ git add bar.txt
$ git commit -am 'FIX: Add missing bar file'
$ git push origin fix_branch
After this set of commands, the pull request (PR) is automatically updated to reflect this new addition. The cycle of commenting on and updating the continues until the Maintainers are satisfied with the changes. They will then merge the pull request to incorporate the proposed changes into the upstream GitHub repo.
Once their branch gets merged into the master
branch of
the upstream repo, the developer can do the following to get
up to date on their local machine:
$ git checkout master
$ git fetch upstream
$ git pull upstream/master
$ git branch -d fix_branch
$ git branch
* master
This synchronizes their local master
branch with the master
branch of
the upstream
remote repo, and deletes their local fix_branch
(which is
no longer needed, since its changes have been merged into the upstream master).
Here a couple bash
functions that make Development a bit easier (which can be put in your .bashrc
file):
function git-new() {
# create a new branch based on the latest upstream master
git fetch upstream master
git checkout -b "$1" upstream/master
}
function git-rb() {
# rebase a branch on the latest upstream master
git fetch upstream master
git rebase -i upstream/master
}
Maintainers start out with a similar set up as Developers. However, they might
want to be able to push directly to the upstream
repo as well as pushing to
their fork. Having a repo set up with git://
access instead of
[email protected]
or https://
access will not allow pushing. So
starting from scratch, a maintainer ccordoba12
might fork the upstream repo
and then do::
$ git clone [email protected]:/ccordoba12/Stormwater-Management-Model
$ cd Stormwater-Management-Model
$ git remote add upstream [email protected]:/OpenWaterAnalytics/Stormwater-Management-Model.git
$ git remote add blink git://github.com/blink1073/Stormwater-Management-Model.git
Now the maintainer's local repository has push/pull access to their own personal
development fork and the upstream repo, and has read-only access to
blink1073
's fork:
$ git remote -v
origin [email protected]:/ccordoba12/Stormwater-Management-Model.git (fetch)
origin [email protected]:/ccordoba12/Stormwater-Management-Model.git (push)
blink git://github.com/blink1073/Stormwater-Management-Model.git (fetch)
blink git://github.com/blink1073/Stormwater-Management-Model.git (push)
upstream [email protected]:/OpenWaterAnalytics/Stormwater-Management-Model.git (fetch)
upstream [email protected]:/OpenWaterAnalytics/Stormwater-Management-Model.git (push)
Let's say blink1073
has opened a PR on Github, and the maintainer wants
to test out the code. This can be done this way:
$ git fetch blink
$ git checkout -b blink_branch blink/fix_branch
The first command allows the local repository to know about the changes (if
any) that have occurred on blink1073
's fork of the project
<github.com/blink1073/Stormwater-Management-Model.git>.
In this case, a new branch named fix_branch
has been added.
The second command is more complex. git checkout -b $NAME
is a command
that first creates a branch named $NAME
, then checks it out. The
additional argument blink/fix_branch
tells git
to make the
branch track changes from the remote branch fix_branch
in the remote
repository known as blink
, which you may recall points to
<github.com/blink1073/Stormwater-Management-Model.git>. The full command can thus be interpreted
in human-readable form as "create and check out a branch named
blink_branch
that tracks the changes in the branch
fix_branch
from the remote repo named blink
". The maintainer can
now inspect and test blink1073
's code locally rather than just viewing the
changes on the GitHub pull request page.
Once the code is merged on GitHub, the maintainer can update their local copy in a similar way as the developer did earlier:
$ git checkout master
$ git pull upstream/master
Here is a bash function that allows one to check out a new branch based on a PR #:
function git-pr() {
# create a local branch based on a github PR #
git fetch upstream pull/$1/head:pr/$1
git checkout pr/$1
}
TODO
To be used when there are merge conflicts. Before doing anything we need to install a tool that helps in conflict resolution, we suggest Meld, but you are free to use any other tool you feel comfortable with. Now having install your tool of choice, you need to configure git, so it knows that the tool exists.
Git configuration on Windows (using Meld)
Install Meld, and then open a command prompt and type:
git config --global merge.tool meld
git config --global mergetool.meld.path "X:/PATH TO/MELD/Meld.exe"
Pay attention to the use of slashes (use
/
) and use""
if the path contains spaces.This will let Git know that Meld is installed and available as a mergeing tool.
Now we are ready for rebasing, open a command prompt, and go to your local repository and enter the following:
1.) Checkout to the master branch
git checkout master
2.) Update the master branch on your local repo with the master branch from the upstream repository.
(Read the Developers section on this page to add the upstream
alias if you have not at this point)
git pull upstream master
3.) Update your remote master branch (on your fork) with the latest changes from the master branch in the Stormwater-Management-Model repository (that you just pulled)
git push origin master
4.) Checkout to your local branch
git checkout your_branch_name
5.) Start the rebase
git rebase master
At this point if any problems are found you will get a message saying some conflicts need to be resolved.
6.) Solve conflicts
To solve them using the tool we installed type in the command prompt:
git mergetool
And select the tool of choice, Meld for this example.
meld
Fix the conflicts using the graphical tool and save the changes. For more information on using Meld look at this page
7.) Continue with the rebase
git rebase --continue
If more conflicts appear, repeat step 6 and 7, until all conflicts are solved.
If at any time you feel you did something wrong and want to start from scratch, type in the command prompt:
git rebase --abort
If everything is fixed and went as planned after typing git rebase --continue
, then the rebase will print that it was successfully executed.
8.) Push the rebased local branch to your remote branch
Now all that is left is to push the rebased branch to your remote repo branch. Type in the command prompt:
git push origin your_branch_name --force
Check out all the help articles on Github and the help articles on git.
Adapted from https://github.com/LABSN/expyfun/blob/master/git_flow.rst