Git

Who am I?

Jeroen Budts
PHP & Drupal Developer
At Inuits - An Open Source Consultancy Company
http://budts.be
twitter: @teranex
e-mail: jeroen@inuits.eu

Overview

Basic Git usage
How Git stores its stuff
More advanced Git usage (branching, merging, rebasing, ..)
Working with remotes
Other tools (Subversion integration etc)
Git flow
Github

What is Git?

"Git is a free & open source, distributed version control system designed to handle everything from small to very large projects with speed and efficiency."

Distributed

The entire repository is on your computer
No connection to the server required to work, view log
Very fast

Open source

Created in 2005
Linux Kernel
Linus Torvalds

Basic Git Usage

First initialize the git repository:

git init

Then add files

git add -A
# or
git add .
# or
git add myfile.txt
git add *.rb
git add directory/

And commit the files

git commit

Git will ask you for a commit message.

Committing modified files. Stage the files (more on that later):

git add file.txt

And commit:

git commit

Or skip the staging step and immediately commit:

git commit -a

Removing a file

git rm file.txt

And commit

git commit

Try it

create a new repository
add some text-files
commit
make some changes
commit (with -a for now)
remove a file
commit (with -a)

The Index

A.K.A. The Staging Area

The index contains the changes that will be added to your next commit. Your commit will not contain the changes in your working directory.

Only the changes that were added to the index will be in the commit

If you add a file, or a part of a file, to the index, a copy is made of that file. When you commit, it is that copy which ends up in the commit.

Important: if you make changes to the file after adding it to the index, those changes will not end up in your commit, unless you add the file again to the index!

If you modify files, you need to 'stage' them again, by running git add.
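A minimal sketch of this behaviour, assuming a throw-away repository in a temporary directory (the file name, contents and user identity are made up for the illustration):

```shell
set -eu
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.name "Example User"        # hypothetical identity, local to this repo
git config user.email "user@example.com"

echo "first version" > file.txt
git add file.txt                     # the index now holds "first version"
echo "second version" > file.txt     # edited again, but NOT added again
git commit -q -m "Add file.txt"

committed=$(git show HEAD:file.txt)  # what ended up in the commit
working=$(cat file.txt)              # what is in the working directory
echo "committed: $committed"
echo "working:   $working"
```

The commit contains the copy that was staged; the later edit is still only in the working directory until you run git add again.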

You can also add only parts of a modified file:

git add -p myfile.txt

Tip: use a tool for this

Checking the state of your repository

Checking the state of the working copy

git status

Seeing what has changed

git diff
# shows the changes in your working copy that are not yet in the index

git diff --cached
# shows the changes in your index that will go into the next commit

Tip: use a tool for this

Checking the log of your current branch

git log

git log has many options. For example:

git log --graph --pretty=format:'%Cred%h%Creset -%C(yellow)%d%Creset %s %Cgreen(%cr) %C(bold blue)<%an>%Creset' --abbrev-commit --date=relative

results in:

Tip: add this in your global gitconfig as an alias:

[alias]
    l = log --graph --pretty=format:'%Cred%h%Creset -%C(yellow)%d%Creset %s %Cgreen(%cr) %C(bold blue)<%an>%Creset' --abbrev-commit --date=relative

See the log of all the branches:

git log --all

Try it

edit two text files
check the status
add one of them to the index
commit
add the other to the index
edit the file again
check the status
check the diff and diff --cached

Undoing things

Undoing changes to a file

git checkout -- file.txt
# the -- is optional; it tells Git that what follows is a file, not a branch
git checkout directory/

But not

git checkout branch-name

Note: This gets back the version from the index. If the file has no staged changes, that is the same as the version in the last commit.

Resetting from the index

To reset the file after it was added to the index:

git reset file.txt

The changes are no longer staged, but your working copy still contains them. Then you can use

git checkout file.txt

To completely reset it to the version in the repository.

Removing unversioned files

git clean -f
# to remove untracked files

git clean -f -d
# to remove untracked directories

Completely resetting your working copy

git reset --hard

# optionally followed by
git clean -f

Use with care!

Reverting a commit

After a commit was already pushed out to other people.

git revert <commit-hash>

This will create a new commit that undoes everything introduced by <commit-hash>.
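As a small sketch (throw-away repository, made-up file and commit messages), reverting the most recent commit adds a third commit instead of removing the second one:

```shell
set -eu
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.name "Example User"        # hypothetical identity, local to this repo
git config user.email "user@example.com"

echo "good" > file.txt
git add file.txt
git commit -q -m "Add file"

echo "oops" >> file.txt
git commit -q -a -m "Break things"

# --no-edit accepts the proposed commit message without opening an editor
git revert --no-edit HEAD > /dev/null

count=$(git rev-list --count HEAD)   # number of commits on the branch
content=$(cat file.txt)              # file contents after the revert
git log --format='%h %s'
```

History is only extended, never rewritten, which is why this is safe for commits that other people already have.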

Try it

edit a file
undo the edit
edit the file and add it to the index
edit the same file, get the version back from the index
create a new file
use git clean to remove it

Try it

edit a file and commit
revert the commit with git revert
check the git log
edit some more files and revert everything with git reset --hard

How Git stores data

Knowing this gives you a better understanding of Git

A Git repository is a collection of objects

Types of objects

blobs
trees
commits
tags

blobs

Git stores the contents of a file in a 'blob'.

does not contain any meta data
a blob never changes
a hash of the contents is calculated and used as the name of the blob
the hash will always be the same for the same contents

some examples:

$ git hash-object hello.txt
ce6c1fd146f65c899e6b10e46c89097c644e3229

$ git hash-object say-hi.rb
a8784b043f12b4b0c9114c55ebf33f5c9b44ce8f
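To see that the hash depends only on the contents, here is a quick sketch with three made-up files (the names a.txt, b.txt and c.txt are just for illustration):

```shell
set -eu
dir=$(mktemp -d)
cd "$dir"

printf 'hello\n'  > a.txt
printf 'hello\n'  > b.txt     # same contents, different file name
printf 'hello!\n' > c.txt     # different contents

# hash-object only computes the hash here; nothing is stored
hash_a=$(git hash-object a.txt)
hash_b=$(git hash-object b.txt)
hash_c=$(git hash-object c.txt)

echo "a.txt $hash_a"
echo "b.txt $hash_b"
echo "c.txt $hash_c"
```

a.txt and b.txt get the same hash because the file name is not part of the blob; c.txt gets a different one.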

Trees

A tree is like a directory

Can contain references to 1 or more blobs
Can contain references to 0, 1 or more other trees
Contains the meta data about the files
Itself also identified by a hash

Trees

Commits

If you think about Git, think about commits!

Contains a reference to one tree object
Contains references to one or more parent commits
- one exception: the very first commit in the repo
A commit usually points to its parent commit
In case of a merge, it can point to two or more commits (one commit for each branch which you merged together)
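You can look at a commit object yourself with git cat-file. A sketch, assuming a throw-away repository with two made-up commits:

```shell
set -eu
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.name "Example User"        # hypothetical identity, local to this repo
git config user.email "user@example.com"

echo "one" > file.txt
git add file.txt
git commit -q -m "First commit"

echo "two" >> file.txt
git commit -q -a -m "Second commit"

first=$(git cat-file -p HEAD^)    # raw contents of the first commit object
second=$(git cat-file -p HEAD)    # raw contents of the second commit object
echo "$second"

# Count the 'tree' and 'parent' header lines of each commit
tree_lines=$(echo "$second" | grep -c '^tree ')
first_parents=$(echo "$first" | grep -c '^parent ' || true)
second_parents=$(echo "$second" | grep -c '^parent ')
```

The second commit shows one tree line and one parent line; the very first commit has a tree but no parent.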

Commits

Did I mention that you should think in terms of commits, when working with Git?

If you understand commits, you basically understand Git.

references ('refs')

Because a commit hash is very difficult to remember and not really useful to work with, Git uses references to point to specific commits.

HEAD

One such reference is HEAD

It points to the commit which is currently checked out into your working directory

master

Another important reference is master

master is the 'default' branch in Git
when working on the master branch, the master reference and the HEAD reference point to the same commit

special

HEAD^ & HEAD^^: The commit before HEAD, two commits before HEAD
master~7: 7 commits before master reference
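A short sketch of how these notations resolve, using git rev-parse in a throw-away repository with three made-up commits:

```shell
set -eu
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.name "Example User"        # hypothetical identity, local to this repo
git config user.email "user@example.com"

for n in 1 2 3; do
  echo "$n" > file.txt
  git add file.txt
  git commit -q -m "Commit $n"
done

parent=$(git rev-parse HEAD^)         # one commit before HEAD
tilde1=$(git rev-parse HEAD~1)        # same commit, written with ~
grandparent=$(git rev-parse HEAD^^)   # two commits before HEAD
tilde2=$(git rev-parse HEAD~2)        # same commit, written with ~
subject=$(git log -1 --format=%s HEAD~2)

echo "HEAD^  = $parent"
echo "HEAD^^ = $grandparent ($subject)"
```

For a linear history HEAD^ and HEAD~1 name the same commit, as do HEAD^^ and HEAD~2.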

Branches

references to other commits

very easy to create
a branch has a name
this name is also the reference to the most recent commit for that branch
a commit can be shared by branches

Creating a branch

First create the branch

git branch mywork

Then start working on that branch:

git checkout mywork

Or combine the two steps

git checkout -b mywork

deleting a branch

Deleting a local branch is also possible

git branch -d mywork

branching

We are working on a branch named 'origin'
At commit C2 we decide to split off a new branch named 'mywork'.
Both branches originate from commit C2; commits C3 and C5 have the same parent

merging

When you have been working on a (feature) branch for a while you will probably want to merge those branches back together.

# to merge the origin branch back into your mywork branch (to bring it up to date)
# checkout the target branch
git checkout mywork
# merge the branch into the current branch
git merge origin

Now your repository looks like this: (Notice that commit C7 has two parents)

Merging

When merging two or more branches there are two possibilities:

Merge commit
Fast Forward

Merge commit

When both branches have new commits a merge commit is created.
Git automatically proposes a commit message:

Merge branch 'mywork' into master
The commit has two or more parents

Fast Forward

When the target branch does not have new commits
No merge commit is created
In fact nothing much happens
Except: The reference for the target branch is simply changed to point to the same commit as the source branch
This is the ideal situation: no merge conflicts are possible
Obviously you are not always this lucky (→ rebasing)
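A sketch of a fast-forward in a throw-away repository. The branch name mywork follows the example; the target branch is whatever your Git creates by default (master or main), so the script looks it up instead of assuming a name:

```shell
set -eu
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.name "Example User"        # hypothetical identity, local to this repo
git config user.email "user@example.com"

echo "base" > file.txt
git add file.txt
git commit -q -m "C1"
target=$(git symbolic-ref --short HEAD)    # name of the default branch

git checkout -q -b mywork
echo "work" >> file.txt
git commit -q -a -m "C2"

git checkout -q "$target"
git merge -q mywork        # target has no new commits, so Git fast-forwards

target_tip=$(git rev-parse "$target")
mywork_tip=$(git rev-parse mywork)
commits=$(git rev-list --count HEAD)
echo "$target and mywork both point to $target_tip"
```

No new commit was created: there are still only two commits and both branch references point to the same one.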

Fast Forward

Merge the 'mywork' branch into origin

The origin branch is simply fast-forwarded

Explicitly create merge commit

Sometimes you want a merge commit, even when a simple Fast Forward is possible.

git merge --no-ff
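The same situation as a fast-forward, but now with --no-ff. A sketch in a throw-away repository; the branch name mywork is taken from the example, the rest is made up:

```shell
set -eu
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.name "Example User"        # hypothetical identity, local to this repo
git config user.email "user@example.com"

echo "base" > file.txt
git add file.txt
git commit -q -m "C1"
target=$(git symbolic-ref --short HEAD)    # name of the default branch

git checkout -q -b mywork
echo "work" >> file.txt
git commit -q -a -m "C2"

git checkout -q "$target"
# --no-edit accepts the proposed merge message without opening an editor
git merge -q --no-ff --no-edit mywork

parent_count=$(git cat-file -p HEAD | grep -c '^parent ')
commits=$(git rev-list --count HEAD)
git log --graph --format='%h %s'
```

Even though a fast-forward was possible, the history now has a third commit with two parents, which keeps the feature branch visible in the log.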

Try it

Create a new branch and check it out
Edit a file and commit
Checkout the master branch
Merge the file
Edit files on both branches
Merge both branches

Try it

Edit files on both branches, introducing conflicts
Merge both branches
Resolve the conflicts
Commit

Inspecting an earlier commit

Show

git show 2a2ea8d
git show HEAD^^

Gives you a diff of what exactly that commit changed + metadata for that commit

Inspecting an earlier commit

Checkout

You can checkout a commit to see the repository as it was back then

git checkout <commit-hash>

Note: Git will tell you that you are in 'detached HEAD' state, because your HEAD reference points directly to a commit instead of to a branch.

Resetting to an earlier commit

You can reset your branch to an earlier commit, to throw away the newer commit(s).

git reset HEAD^^

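This will move the reference for the branch to that commit. The changes from the discarded commits stay in your working directory as unstaged modifications; add --hard to throw those away as well.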

Note: only do this if you have not yet pushed these newer commits

Reflog

The reflog is a log of every change to HEAD in your local repository (commits, checkouts, resets, rebases, ...)

git reflog

Useful when you make a mistake during rebasing, merging or resetting.

Your old commits are not 'lost'. They are simply not used anymore and will be removed by git gc after a while.
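A sketch of recovering a 'lost' commit, in a throw-away repository with made-up commits. HEAD@{1} means 'where HEAD was one step ago' according to the reflog:

```shell
set -eu
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.name "Example User"        # hypothetical identity, local to this repo
git config user.email "user@example.com"

echo "one" > file.txt
git add file.txt
git commit -q -m "First"

echo "two" >> file.txt
git commit -q -a -m "Second"
lost=$(git rev-parse HEAD)         # remember the commit we are about to drop

git reset -q --hard HEAD^          # "Second" is no longer on the branch
git reflog                         # ...but the reflog still knows about it

git reset -q --hard 'HEAD@{1}'     # go back to where HEAD was before the reset
recovered=$(git rev-parse HEAD)
git log --format='%h %s'
```

The branch points at "Second" again, with the same hash as before.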

Try it

edit a file
commit
reset to the previous commit
use the reflog to find the newer commit
reset back to the newer commit (no more changes)
reset back to older
make additional changes to the file
commit the file
check the reflog

Rebasing

Instead of merging (with merge commits) you can also rebase (so you can then fast forward)

Some people will tell you that this is very harmful, it can break your repository and destroy the universe. This is NOT TRUE. (At least if you know what you are doing)

What is 'rebasing'?

By rebasing your commits you can actually rewrite your history:

Edit a commit message
Add missing files to a commit
Reorder your commits
Modify the parent of a commit
Merge a few commits together into a single commit
Delete commits from the history
...
And break your repository if you want :)

How not to destroy your repository?

Do not rebase commits which have already been pushed to other people

Each commit which is rebased will get a new, different, hash. People (and Git) who pull this new hash will get confused.

If you do rebase a commit which was already pushed, Git will refuse the push, unless you use the --force option.

Amending changes

The easiest and 'safest' kind of rebase
Only possible for the most recent commit
Lets you add missing files and modify the commit message

After modifying your index again:

git commit --amend
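A sketch of amending, in a throw-away repository (file names and messages are made up). It fixes a typo in the commit message and adds a forgotten file at the same time:

```shell
set -eu
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.name "Example User"        # hypothetical identity, local to this repo
git config user.email "user@example.com"

echo "code" > main.txt
git add main.txt
git commit -q -m "Add fature"              # typo, and README.txt was forgotten
before=$(git rev-parse HEAD)

echo "docs" > README.txt
git add README.txt                         # stage the missing file
git commit -q --amend -m "Add feature"     # replace the most recent commit
after=$(git rev-parse HEAD)

count=$(git rev-list --count HEAD)
subject=$(git log -1 --format=%s)
echo "before: $before"
echo "after:  $after"
```

There is still only one commit, but it has a new hash. That is why amending is a (small) rebase and should not be done on commits you have already pushed.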

Rebasing on top of another branch

This is an alternative to merging with a merge commit.

Let's reuse the example:

Now you want to merge 'mywork' into 'origin' without creating a merge commit

What we did before:

git checkout origin
git merge mywork

What we will do now:

# on the mywork branch
git rebase origin
# fix any merge conflicts
git checkout origin
git merge mywork
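The whole sequence as a sketch in a throw-away repository. Here the default branch (master or main, looked up by the script) plays the role of 'origin' from the example; files and messages are made up and chosen so that no conflicts occur:

```shell
set -eu
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.name "Example User"        # hypothetical identity, local to this repo
git config user.email "user@example.com"

echo "base" > base.txt
git add base.txt
git commit -q -m "Base"
target=$(git symbolic-ref --short HEAD)    # stands in for 'origin' in the example

git checkout -q -b mywork
echo "work" > work.txt
git add work.txt
git commit -q -m "Work on mywork"

git checkout -q "$target"
echo "other" > other.txt
git add other.txt
git commit -q -m "Work on target"          # both branches now have new commits

git checkout -q mywork
git rebase -q "$target"                    # replay mywork's commit on top of target
git checkout -q "$target"
git merge -q mywork                        # now a plain fast-forward

merge_commits=$(git rev-list --merges --count HEAD)
total=$(git rev-list --count HEAD)
git log --graph --format='%h %s'
```

The log is one straight line of three commits, without a merge commit.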

Interactive rebasing

With interactive rebasing you can really rewrite history the way you want it to be. ...And break your repository.

Rebase the commits since the specified commit-hash

git rebase --interactive <commit-hash>

Suppose we have the following commits

To rebase the most recent 3 commits:

git rebase --interactive 4efd195

Try it

edit a file and commit
edit the file again
commit the file with the --amend option

Try it

create and checkout a branch
edit a file and commit
checkout the master branch
edit the same file and commit
do a merge
check the log
undo the merge (reset & checkout)
checkout the branch
rebase the branch on top of master
checkout master and merge
check the log
Bonus: delete the merged branch

Sharing code

Remotes

To share work with other people, you can add one or more remotes

remotes have an address
remotes have a name
you can add multiple remotes to a repository

Easiest method to set this up is by cloning an existing repository instead of initializing your repo.

git clone http://git.drupal.org/project/drupal.git

This will set up everything for you

Manually adding remotes

Sometimes you will want to add a remote

Because you had already created the repository
Or maybe because you want to add one or more additional remotes

Adding a remote

This is done with the git remote command

git remote add github git@github.com:teranex/git-talk.git

This will add my Github repository for this presentation as a remote with the name github

Tip: pretty URLs for remotes

You can add shortcuts in your git config for often used URLs

git config --global url."git@github.com:".insteadOf "gh:"
git config --global url."ssh://myuser@mydomain.net/path/to/repos/".insteadOf "r:"

Then you can use short URLs to clone

git clone gh:teranex/dotvim
git clone r:myrepo.git
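To check how a short URL will be expanded without contacting any server, git ls-remote --get-url can be used. A sketch, assuming the shortcut is written with its trailing colon ("gh:") and stored in a throw-away repository's local config instead of --global, so it does not touch your own settings:

```shell
set -eu
repo=$(mktemp -d)
cd "$repo"
git init -q

# Local (not --global) config, only for this illustration
git config url."git@github.com:".insteadOf "gh:"

# --get-url prints the rewritten URL and exits without talking to the remote
expanded=$(git ls-remote --get-url gh:teranex/dotvim)
echo "$expanded"
```

The prefix gh: is replaced by git@github.com: and the rest of the URL is kept as typed.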

Pulling and pushing

When you have cloned the repository:

git pull

This will pull in the changes from the current branch on the origin

git push

Will push your changes to the origin

However: This will only work for 'tracking' branches.

Remote Branches

It is important to think about remote branches as just branches

By default, Git does not know, nor care, about relationships between branches!

local branch: master
remote ('github') branch: github/master
Git just sees two branches

git pull github master
git push github master
By default, no local branches are created for remote branches

Remote Branches

You can get a good overview of all your local and remote branches and how they are tracking with: git branch -avv

Tracking branches

To create a local branch based on a remote branch:

git checkout --track -b mywork github/mywork

To link an existing local branch to a remote branch:

git branch --set-upstream-to=github/mywork

You can verify this in the git config file (.git/config) in your repository:
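The same settings can also be read back with git config. A sketch, assuming a local bare repository in a temporary directory as stand-in for the 'github' remote (all paths and file names are made up):

```shell
set -eu
base=$(mktemp -d)
git init -q --bare "$base/remote.git"      # stands in for the 'github' remote
git init -q "$base/work"
cd "$base/work"
git config user.name "Example User"        # hypothetical identity, local to this repo
git config user.email "user@example.com"
git remote add github "$base/remote.git"

echo "work" > file.txt
git add file.txt
git commit -q -m "Work"
git checkout -q -b mywork
git push -q github mywork                  # creates the remote branch github/mywork

# Link the current local branch to the remote branch
git branch -q --set-upstream-to=github/mywork

remote=$(git config --get branch.mywork.remote)
merge=$(git config --get branch.mywork.merge)
echo "remote = $remote"
echo "merge  = $merge"
```

These two values are what git pull and git push use when you run them without arguments on the mywork branch.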

Merging while pulling

To better understand pulling, let's see what actually happens. Instead of using git pull, you can also do (while on the master branch):

# pull in the new objects
git fetch github

# merge the remote branch with the local branch
git merge github/master

This works exactly the same as merging two local branches!

Avoiding useless merge commits

Merging a remote branch in your local branch can create a useless merge commit:

You can avoid this by rebasing instead of merging:

git pull --rebase

or, if you want to do it manually:

git fetch github
git rebase github/master

Try it

Clone a repository
Checkout a remote branch (you are now in a detached head state)
Create a local branch for the remote branch

Stash

Sometimes you want to set aside your changes

because you want to rebase (which is not possible in a dirty repository)
to try alternative approaches
...

Stashing

to stash everything

git stash

Or you can give a description

git stash save "something fancy I was working on"

Stashing

to see the stashes

git stash list

to get changes back from the stash

git stash pop

I often use the stash when I want to git pull --rebase, while I have uncommitted changes (git will refuse to do it in that case):

git stash
git pull --rebase
git stash pop
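What stashing does to your working copy, as a sketch in a throw-away repository (file name and contents are made up):

```shell
set -eu
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.name "Example User"        # hypothetical identity, local to this repo
git config user.email "user@example.com"

echo "committed" > file.txt
git add file.txt
git commit -q -m "Add file"

echo "uncommitted edit" >> file.txt        # make the working copy dirty

git stash > /dev/null                      # set the change aside
clean=$(git status --porcelain)            # empty output means: nothing to commit
stashed=$(tail -n 1 file.txt)              # file is back to the committed version

git stash pop > /dev/null                  # bring the change back
restored=$(tail -n 1 file.txt)
echo "after stash: $stashed"
echo "after pop:   $restored"
```

Between git stash and git stash pop the working copy is clean, which is exactly what git pull --rebase needs.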

Try it

make some edits stash them unstash them

Tools

GUI tools

TortoiseGit
Github for Windows
Giggle
Tig
Fugitive
IDE integration

Giggle & GitX: better looking, for Gnome & Mac

Fugitive and gitv plugins for Vim

bash prompt information

Include $(__git_ps1 ' %s ') in your $PS1

Git and Subversion

Git has plugins available to migrate from and/or integrate into other versioning systems as well. One such plugin is git-svn.

git-svn

With git-svn you can:

migrate an existing subversion repository to a git repository, including all the history.
use Git locally to do your work, but push to a central Subversion server.

Local Git, Subversion server

To locally use Git and push to a central Subversion server:

First 'clone' the Subversion repository into a local Git repo

git svn clone -s http://svn.example.com/myproject
# the -s means the subversion repo has a standard layout (trunk/ etc)

Now you can work as usual with your Git repository. Except... instead of running git pull to get the changes from other people, you now do:

git svn rebase

And you don't do git push, but:

git svn dcommit

Git Flow

To use the 'Git Flow' branching strategy a Git plugin is available: git-flow. This plugin makes it really easy to follow Git Flow:

First initialize it to configure the branch names. (I recommend using the defaults)

git flow init

To start a feature branch:

git flow feature start my-exciting-feature

To publish the feature (push it to the remote):

git flow feature publish my-exciting-feature

To finalize the branch:

git flow feature finish my-exciting-feature

Submodules

In a Git repository you can 'link' other repositories to subdirectories. This can be useful, for example when using external libraries or when building your VIM configuration, to pull in all the plugins.

Similar to svn:externals, but not quite the same:

submodules must be registered in the repository
submodules must be pulled-in separately for each repo.
submodules are not updated automatically
submodules are not pushed automatically
a specific commit is referenced in the parent repo

Add a submodule

# from the root of the repository
git submodule add git://path/to/repo path/in/repo

This will modify the .gitmodules file, which is part of the parent repository, and record the exact commit which is checked out in the submodule. Now you can commit, push, etc.

Pulling in submodules

When another repository pulls in the change the following steps are required:

git submodule init

This will update the .git/config file for that repository and register the submodule in the repository

git submodule update

This will check-out all the submodules to the correct commit

# or combine both steps
git submodule update --init

Updating submodules

To update you can checkout another commit in the submodule (for example by pulling) and commit the reference to this new commit in the parent repository.

To update all the modules, you can use something like:

git submodule foreach git pull origin master

Try it

Create a new repo with a commit
Add it as a submodule of your main repo
Make some changes in the Submodule
Check the status of the main repo
Commit in the Submodule
Check the status of the main repo
Commit the updated main repo to point to the new commit in the submodule

Bisect

Helping you to find which commit introduced the bug

Sometimes somebody will introduce bugs into your software or break previously working features. Who knows, maybe even you! While trying to find the source of the problem, it can be useful to know which commit exactly introduced the troubles in paradise. That is exactly what Git bisect is for.

Steps for bisecting

Inform Git about one 'good' commit
And about one 'bad' commit
Git will checkout a commit for you
Verify that commit
Tell Git whether that commit has the problem ('bad'), or not ('good')
Finally the 'bad' commit will be found

an example

Let's say you know the feature was already broken in the previous commit and 10 commits ago the feature still worked.

Start Git Bisect and inform Git about this:

git bisect start
git bisect bad HEAD^
git bisect good HEAD~10

Git will checkout the commit in the middle, so you can test:

Bisecting: 5 revisions left to test after this (roughly 3 steps)

After testing you inform Git about the result

git bisect good # when the feature worked
git bisect bad  # when the feature is b0rken

After you have found the bad commit, reset your repository:

git bisect reset

If you write a script which can verify each commit, you can let Git run it for every commit!
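This is done with git bisect run. A sketch in a throw-away repository with ten made-up commits, where commit 6 adds a line "bug" to app.txt. The check is a hypothetical one-liner: it exits with 0 ('good') when the line is absent and with 1 ('bad') when it is present:

```shell
set -eu
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.name "Example User"        # hypothetical identity, local to this repo
git config user.email "user@example.com"

# Ten commits; commit 6 introduces the "bug" line
for n in 1 2 3 4 5 6 7 8 9 10; do
  if [ "$n" -eq 6 ]; then echo "bug" >> app.txt; fi
  echo "change $n" >> app.txt
  git add app.txt
  git commit -q -m "Commit $n"
done
expected=$(git rev-parse HEAD~4)           # "Commit 6", for comparison afterwards

git bisect start > /dev/null
git bisect bad HEAD > /dev/null            # the newest commit is broken
git bisect good HEAD~9 > /dev/null         # the first commit still worked

# Git checks out commits and runs the check on each until it finds the culprit
git bisect run sh -c '! grep -q "^bug$" app.txt' > /dev/null

found=$(git rev-parse refs/bisect/bad)     # the first bad commit
git bisect reset > /dev/null 2>&1
git log -1 --format='first bad commit: %s' "$found"
```

In a real project the check would typically be your test suite or a small script that reproduces the bug.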

Try it

Find the commit which changed "A tree is a like a **directory**" into "A tree is like a **directory**"

Github

Creating a repo
Using the issue tracker
Forking vs shared repository
Pull requests (via website / local)
Syncing forks
