Running Code Reviews with Confidence

by Emma Jane Hogbin Westby, September 02, 2014

Published in Project Management, Workflow & Tools

Growing up, I learned there were two kinds of reviews I could seek out from my parents. One parent gave reviews in the form of a shower of praise. The other parent, the one with a degree from the Royal College of Art, would put me through a design crit. Today the reviews I seek are for my code, not my horse drawings, but it continues to be a process I both dread and crave.

In this article, I’ll describe my battle-tested process for conducting code reviews, highlighting the questions you should ask during the review process as well as the necessary version control commands to download and review someone’s work. I’ll assume your team uses Git to store its code, but the process works much the same if you’re using any other source control system.

Completing a peer review is time-consuming. In the last project where I introduced mandatory peer reviews, the senior developer and I estimated that it doubled the time to complete each ticket. The reviews introduced more context-switching for the developers, and were a source of increased frustration when it came to keeping the branches up to date while waiting for a code review.

The benefits, however, were huge. Coders gained a greater understanding of the whole project through their reviews, reducing silos and making onboarding easier for new people. Senior developers had better opportunities to ask why decisions were being made in the codebase that could potentially affect future work. And by adopting an ongoing peer review process, we reduced the amount of time needed for human quality assurance testing at the end of each sprint.

Let’s walk through the process. Our first step is to figure out exactly what we’re looking for.

Determine the purpose of the proposed change

Our code review should always begin in a ticketing system, such as Jira or GitHub. It doesn’t matter if the proposed change is a new feature, a bug fix, a security fix, or a typo: every change should start with a description of why the change is necessary, and what the desired outcome will be once the change has been applied. This allows us to accurately assess when the proposed change is complete.

The ticketing system is where you’ll track the discussion about the changes that need to be made after reviewing the proposed work. From the ticketing system, you’ll determine which branch contains the proposed code. Let’s pretend the ticket we’re reviewing today is 61524—it was created to fix a broken link in our website. It could just as equally be a refactoring, or a new feature, but I’ve chosen a bug fix for the example. No matter what the nature of the proposed change is, having each ticket correspond to only one branch in the repository will make it easier to review, and close, tickets.

Set up your local environment and ensure that you can reproduce what is currently the live site—complete with the broken link that needs fixing. When you apply the new code locally, you want to catch any regressions or problems it might introduce. You can only do this if you know, for sure, the difference between what is old and what is new.

Review the proposed changes

At this point you’re ready to dive into the code. I’m going to assume you’re working with Git repositories, on a branch-per-issue setup, and that the proposed change is part of a remote team repository. Working directly from the command line is a good universal approach, and allows me to create copy-paste instructions for teams regardless of platform.

To begin, update your local list of branches.

git fetch

Then list all available branches.

git branch -a

A list of branches will be displayed to your terminal window. It may appear something like this:

* master
remotes/origin/master
remotes/origin/HEAD -> origin/master
remotes/origin/61524-broken-link

The * denotes the name of the branch you are currently viewing (or have “checked out”). Lines beginning with remotes/origin are references to branches we’ve downloaded. We are going to work with a new, local copy of branch 61524-broken-link.

When you clone your project, you’ll have a connection to the remote repository as a whole, but you won’t have a read-write relationship with each of the individual branches in the remote repository. You’ll make an explicit connection as you switch to the branch. This means if you need to run the command git push to upload your changes, Git will know which remote repository you want to publish your changes to.

git checkout --track origin/61524-broken-link

Ta-da! You now have your own copy of the branch for ticket 61524, which is connected (“tracked”) to the origin copy in the remote repository. You can now begin your review!
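
If you want to confirm that the tracking connection really exists, Git can report the upstream of the current branch. The following is a minimal sketch in a throwaway sandbox, not part of the original walkthrough: the directory names (origin.git, author, reviewer), the file contents, and the user details are all assumptions made up for illustration.

```shell
# Sandbox: a bare "origin" plus an author clone and a reviewer clone.
sandbox=$(mktemp -d)
cd "$sandbox"
git init -q --bare origin.git
git --git-dir=origin.git symbolic-ref HEAD refs/heads/master

# The (hypothetical) author publishes master and the ticket branch.
git init -q author
cd author
git symbolic-ref HEAD refs/heads/master   # name the first branch "master"
git config user.name "Example Author"
git config user.email "author@example.com"
git remote add origin ../origin.git
echo "home" > index.html
git add index.html
git commit -q -m "Initial site"
git push -q origin master
git checkout -q -b 61524-broken-link
echo "fixed link" > about.html
git add about.html
git commit -q -m "Fix broken link"
git push -q origin 61524-broken-link

# The reviewer clones, then checks out the ticket branch with tracking.
cd "$sandbox"
git clone -q origin.git reviewer
cd reviewer
git checkout -q --track origin/61524-broken-link

# Ask Git which remote branch the current branch is tracking.
upstream=$(git rev-parse --abbrev-ref --symbolic-full-name '@{upstream}')
echo "Tracking: $upstream"
git branch -vv
```

In a real project only the last two commands matter; everything before them just builds a safe place to try them.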

First, let’s take a look at the commit history for this branch with the command log.

git log master..

Sample output:

Author: emmajane 
Date: Mon Jun 30 17:23:09 2014 -0400

Link to resources page was incorrectly spelled. Fixed.

Resolves #61524.

This gives you the full log message of all the commits that are in the branch 61524-broken-link, but are not also in the master branch. Skim through the messages to get a sense of what’s happening.
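
For a quicker overview, the same range can be printed one line per commit, and the range can be reversed to see whether master has moved on since the branch was created. This sketch runs in a throwaway repository; the file names and user details are assumptions for illustration only.

```shell
# Throwaway repository so nothing here touches real work.
sandbox=$(mktemp -d)
cd "$sandbox"
git init -q .
git symbolic-ref HEAD refs/heads/master   # name the first branch "master"
git config user.name "Example Reviewer"
git config user.email "reviewer@example.com"
echo "home" > index.html
git add index.html
git commit -q -m "Initial site"

git checkout -q -b 61524-broken-link
echo "fixed link" > about.html
git add about.html
git commit -q -m "Link to resources page was incorrectly spelled. Fixed."

# One line per commit that is on the ticket branch but not on master.
only_on_branch=$(git log --oneline master..61524-broken-link)
# The reverse: commits on master that the ticket branch does not have yet.
only_on_master=$(git log --oneline 61524-broken-link..master)

echo "Only on the ticket branch:"
echo "$only_on_branch"
echo "Only on master: ${only_on_master:-nothing}"
```

If the second range is not empty, master has gained commits since the branch forked, and there is some potential for conflicts at merge time.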

Next, take a brief gander through the commit itself using the diff command. This command shows the difference between two snapshots in your repository. You want to compare the code on your checked-out branch to the branch you’ll be merging “to”—which conventionally is the master branch.

git diff master

How to read patch files

When you run the command to output the difference, the information will be presented as a patch file. Patch files are ugly to read. You’re looking for lines beginning with + or -. These are lines that have been added or removed, respectively. Scroll through the changes using the up and down arrows, and press q to quit when you’ve finished reviewing. If you need an even more concise comparison of what’s happened in the patch, consider modifying the diff command to list the changed files, and then look at the changed files one at a time:

git diff master --name-only
git diff master <filename>
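
One caveat with comparing against master directly: if master has gained unrelated commits since the ticket branch was created, those differences show up in the output too. The three-dot form of diff compares from the point where the branch forked instead. The sketch below demonstrates the difference in a throwaway repository; news.html and the other file contents are hypothetical, invented only to make the effect visible.

```shell
# Throwaway repository so nothing here touches real work.
sandbox=$(mktemp -d)
cd "$sandbox"
git init -q .
git symbolic-ref HEAD refs/heads/master   # name the first branch "master"
git config user.name "Example Reviewer"
git config user.email "reviewer@example.com"
echo "home" > index.html
echo "broken link" > about.html
git add index.html about.html
git commit -q -m "Initial site"

git checkout -q -b 61524-broken-link
echo "fixed link" > about.html
git commit -q -a -m "Fix broken link"

# Meanwhile, someone else lands unrelated work on master.
git checkout -q master
echo "news" > news.html
git add news.html
git commit -q -m "Add news page"
git checkout -q 61524-broken-link

# Compares master's tip with the working tree: the unrelated file appears too.
against_tip=$(git diff --name-only master)
# Compares from the commit where the branch forked: only the ticket's changes.
since_fork=$(git diff --name-only master...HEAD)

echo "git diff master:"
echo "$against_tip"
echo "git diff master...HEAD:"
echo "$since_fork"
git diff --stat master...HEAD
```

When master has not changed since the branch was created, both forms produce the same result, so the simpler command is fine in that case.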

Let’s take a look at the format of a patch file.

diff --git a/about.html b/about.html
index a3aa100..a660181 100644
--- a/about.html
+++ b/about.html
@@ -48,5 +48,5 @@
 (2004-05)

- A full list of <a href="emmajane.net/events">public
+ A full list of <a href="http://emmajane.net/events">public
 presentations and workshops</a> Emma has given is available

I tend to skim past the metadata when reading patches and just focus on the lines that start with - or +. This means I start reading at the line immediately following @@. There are a few lines of context provided leading up to the changes. These lines are indented by one space each. The changed lines of code are then displayed with a preceding - (line removed), or + (line added).

Going beyond the command line

Using a Git repository browser, such as gitk, allows you to get a slightly better visual summary of the information we’ve looked at to date. The version of Git that Apple ships with does not include gitk—I used Homebrew to re-install Git and get this utility. Any repository browser will suffice, though, and there are many GUI clients available on the Git website.

gitk

When you run the command gitk, a graphical tool will launch from the command line. An example of the output is given in the following screenshot. Click on each of the commits to get more information about it. Many ticket systems will also allow you to look at the changes in a merge proposal side-by-side, so if you’re finding this cumbersome, click around in your ticketing system to find the comparison tools they might have—I know for sure GitHub offers this feature.

Screenshot of the gitk repository browser.

Now that you’ve had a good look at the code, jot down your answers to the following questions:

Does the code comply with your project’s identified coding standards?
Does the code limit itself to the scope identified in the ticket?
Does the code follow industry best practices in the most efficient way possible?
Has the code been implemented in the best possible way according to all of your internal specifications? It’s important to separate your preferences and stylistic differences from actual problems with the code.

Apply the proposed changes

Now is the time to start up your testing environment and view the proposed change in context. How does it look? Does your solution match what the coder thinks they’ve built? If it doesn’t look right, do you need to clear the cache, or perhaps rebuild the Sass output to update the CSS for the project?

Now is the time to also test the code against whatever test suite you use.

Does the code introduce any regressions?
Does the new code perform as well as the old code? Does it still fall within your project’s performance budget for download and page rendering times?
Are the words all spelled correctly, and do they follow any brand-specific guidelines you have?

Depending on the context for this particular code change, there may be other obvious questions you need to address as part of your code review.

Do your best to create the most comprehensive list of everything you can find wrong (and right) with the code. It’s annoying to get dribbles of feedback from someone as part of the review process, so we’ll try to avoid “just one more thing” wherever we can.

Prepare your feedback

Let’s assume you’ve now got a big juicy list of feedback. Maybe you have no feedback, but I doubt it. If you’ve made it this far in the article, it’s because you love to comb through code as much as I do. Let your freak flag fly and let’s get your review structured in a usable manner for your teammates.

For all the notes you’ve assembled to date, sort them into the following categories:

The code is broken. It doesn’t compile, introduces a regression, fails the testing suite, or in some other way demonstrably fails. These are problems which absolutely must be fixed.
The code does not follow best practices. You have some conventions, the web industry has some guidelines. These fixes are pretty important to make, but they may have some nuances which the developer might not be aware of.
The code isn’t how you would have written it. You’re a developer with battle-tested opinions, and you know you’re right, you just haven’t had the chance to update the Wikipedia page yet to prove it.

Submit your evaluation

Based on this new categorization, you are ready to engage in passive-aggressive coding. If the problem is clearly a typo and falls into one of the first two categories, go ahead and fix it. Obvious typos don’t really need to go back to the original author, do they? Sure, your teammate will be a little embarrassed, but they’ll appreciate you having saved them a bit of time, and you’ll increase the efficiency of the team by reducing the number of round trips the code needs to take between the developer and the reviewer.

If the change you are itching to make falls into the third category: stop. Do not touch the code. Instead, go back to your colleague and get them to describe their approach. Asking “why” might lead to a really interesting conversation about the merits of the approach taken. It may also reveal limitations of the approach to the original developer. By starting the conversation, you open yourself to the possibility that just maybe your way of doing things isn’t the only viable solution.

If you needed to make any changes to the code, they should be absolutely tiny and minor. You should not be making substantive edits in a peer review process. Make the tiny edits, and then add the changes to your local repository as follows:

git add .
git commit -m "[#61524] Correcting <list problem> identified in peer review."

You can keep the message brief, as your changes should be minor. At this point you should push the reviewed code back up to the server for the original developer to double-check and review. Assuming you’ve set up the branch as a tracking branch, it should just be a matter of running the command as follows:

git push
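
A note on the staging step above: git add . stages everything in the current directory, including any scratch files you created while reviewing. One cautious alternative is to stage files by name and check what is staged before committing. This is a sketch in a throwaway repository; review-notes.txt is a hypothetical scratch file, and "typo" stands in for whatever problem you actually corrected.

```shell
# Throwaway repository so nothing here touches real work.
sandbox=$(mktemp -d)
cd "$sandbox"
git init -q .
git symbolic-ref HEAD refs/heads/master   # name the first branch "master"
git config user.name "Example Reviewer"
git config user.email "reviewer@example.com"
echo "broken link" > about.html
git add about.html
git commit -q -m "Initial site"
git checkout -q -b 61524-broken-link

# The reviewer makes a tiny fix and also has personal notes lying around.
echo "fixed link" > about.html
echo "ask about the footer" > review-notes.txt   # hypothetical scratch file

# Stage only the file that belongs in the fix, then list what is staged.
git add about.html
staged=$(git diff --cached --name-only)
echo "Staged: $staged"

git commit -q -m "[#61524] Correcting typo identified in peer review."

# The scratch file is still untracked and was not swept into the commit.
leftover=$(git status --porcelain)
echo "Left out of the commit: $leftover"
```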

Update the issue in your ticketing system as is appropriate for your review. Perhaps the code needs more work, or perhaps it was good as written and it is now time to close the issue.

Repeat the steps in this section until the proposed change is complete, and ready to be merged into the main branch.

Merge the approved change into the trunk

Up to this point you’ve been comparing a ticket branch to the master branch in the repository. This main branch is referred to as the “trunk” of your project. (It’s a tree thing, not an elephant thing.) The final step in the review process will be to merge the ticket branch into the trunk, and clean up the corresponding ticket branches.

Begin by updating your master branch to ensure you can publish your changes after the merge.

git checkout master
git pull origin master

Take a deep breath, and merge your ticket branch back into the main repository. As written, and provided master has gained no new commits since the ticket branch was created, the following command will fast-forward and will not create a new commit in your repository history. The commits will simply shuffle into line on the master branch, making git log --graph appear as though a separate branch has never existed. (If master has moved on in the meantime, Git creates a merge commit regardless.) If you would like to maintain the illusion of a past branch, simply add the parameter --no-ff to the merge command, which will make it clear, via the graph history and a new commit message, that you have merged a branch at this point. Check with your team to see what’s preferred.

git merge 61524-broken-link
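
To see the difference between the two merge styles before choosing one, you can try both in a throwaway repository. In this sketch the second ticket branch, 61525-typo, is hypothetical, as are the file names and user details. It counts the parents of the newest commit on master after each merge: a fast-forward leaves an ordinary commit with one parent, while --no-ff leaves a merge commit with two.

```shell
# Throwaway repository so nothing here touches real work.
sandbox=$(mktemp -d)
cd "$sandbox"
git init -q .
git symbolic-ref HEAD refs/heads/master   # name the first branch "master"
git config user.name "Example Reviewer"
git config user.email "reviewer@example.com"
echo "home" > index.html
git add index.html
git commit -q -m "Initial site"

# First ticket: a plain merge, which fast-forwards here.
git checkout -q -b 61524-broken-link
echo "fixed link" > about.html
git add about.html
git commit -q -m "Fix broken link"
git checkout -q master
git merge -q 61524-broken-link
# rev-list prints the commit followed by its parents; count the words.
ff_words=$(git rev-list --parents -n 1 HEAD | wc -w | tr -d ' ')

# Second, hypothetical ticket: merged with --no-ff to force a merge commit.
git checkout -q -b 61525-typo
echo "contact" > contact.html
git add contact.html
git commit -q -m "Fix typo"
git checkout -q master
git merge -q --no-ff -m "Merge branch '61525-typo'" 61525-typo
noff_words=$(git rev-list --parents -n 1 HEAD | wc -w | tr -d ' ')

echo "After fast-forward: $ff_words ids (commit plus one parent)"
echo "After --no-ff:      $noff_words ids (commit plus two parents)"
git --no-pager log --graph --oneline
```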

The merge will either fail, or it will succeed. If there are no merge errors, you are ready to share the revised master branch by uploading it to the central repository.

git push

If there are merge errors, the original coders are often better equipped to figure out how to fix them, so you may need to ask them to resolve the conflicts for you.
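
If you do hit a conflict and decide to hand it back, you can note which files are affected and then back out of the half-finished merge so that master is left as it was. A minimal sketch in a throwaway repository follows; the conflicting edit on master is invented for the demonstration.

```shell
# Throwaway repository so nothing here touches real work.
sandbox=$(mktemp -d)
cd "$sandbox"
git init -q .
git symbolic-ref HEAD refs/heads/master   # name the first branch "master"
git config user.name "Example Reviewer"
git config user.email "reviewer@example.com"
echo "broken link" > about.html
git add about.html
git commit -q -m "Initial site"

git checkout -q -b 61524-broken-link
echo "fixed link" > about.html
git commit -q -a -m "Fix broken link"

# Someone else edits the same line on master, which sets up a conflict.
git checkout -q master
echo "reworded link" > about.html
git commit -q -a -m "Reword link text"

if git merge -q 61524-broken-link >/dev/null 2>&1; then
  echo "Merged cleanly."
else
  # List the files Git could not merge, then back out of the merge.
  conflicted=$(git diff --name-only --diff-filter=U)
  echo "Conflicts in: $conflicted"
  git merge --abort
fi

# An empty status means master is back to its pre-merge state.
after_abort=$(git status --porcelain)
echo "Status after abort: ${after_abort:-clean}"
```

The list of conflicted files is useful to paste into the ticket when you ask the original coder to resolve them.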

Once the new commits have been successfully integrated into the master branch, you can delete the old copies of the ticket branches both from your local repository and on the central repository. It’s just basic housekeeping at this point.

git branch -d 61524-broken-link
git push origin --delete 61524-broken-link
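
Before deleting anything, it can be reassuring to ask Git which branches are fully merged into master. The sketch below uses a throwaway repository with one merged ticket branch and one hypothetical unfinished branch (61525-typo); file names and user details are assumptions. It also shows that the lowercase -d option refuses to delete a branch that still holds unmerged work.

```shell
# Throwaway repository so nothing here touches real work.
sandbox=$(mktemp -d)
cd "$sandbox"
git init -q .
git symbolic-ref HEAD refs/heads/master   # name the first branch "master"
git config user.name "Example Reviewer"
git config user.email "reviewer@example.com"
echo "home" > index.html
git add index.html
git commit -q -m "Initial site"

# One ticket branch that gets merged...
git checkout -q -b 61524-broken-link
echo "fixed link" > about.html
git add about.html
git commit -q -m "Fix broken link"
git checkout -q master
git merge -q 61524-broken-link

# ...and a second, hypothetical one that is still in progress.
git checkout -q -b 61525-typo
echo "contact" > contact.html
git add contact.html
git commit -q -m "Fix typo"
git checkout -q master

# Branches whose commits are all reachable from master are safe to delete.
merged=$(git branch --merged master)
echo "$merged"

git branch -d 61524-broken-link >/dev/null

# -d refuses to delete a branch with unmerged work, so this attempt fails.
if git branch -d 61525-typo >/dev/null 2>&1; then
  still_there="no"
else
  still_there="yes"
fi
echo "Unmerged branch kept: $still_there"
```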

Conclusion

This is the process that has worked for the teams I’ve been a part of. Without a peer review process, it can be difficult to address problems in a codebase without blame. With it, the code becomes much more collaborative; when a mistake gets in, it’s because we both missed it. And when a mistake is found before it’s committed, we both breathe a sigh of relief that it was found when it was.

Regardless of whether you’re using Git or another source control system, the peer review process can help your team. Peer-reviewed code might take more time to develop, but it contains fewer mistakes, and has a strong, more diverse team supporting it. And, yes, I’ve been known to learn the habits of my reviewers and choose the most appropriate review style for my work, just like I did as a kid.

17 Reader Comments

Anne Franco says:

September 2, 2014 at 4:38 pm

I recently discovered tracking gets set up automatically if you checkout the branch ‘directly’:

git checkout 61524-broken-link

should be all you need.
Emma Jane Hogbin Westby says:

September 2, 2014 at 7:36 pm

@anne Thanks for your comment! To be on the safe side when writing the article, I tried to omit all possible shortcuts that might not be available for older systems. This exact shortcut was actually one that I went back and forth with as it *ought* to work in *nearly* all cases exactly as you’ve described; however, there are two times when it will fail.

*An older version of Git* I did a bit of digging to double check, and I’m pretty sure this functionality was added in 1.6.6 (2009ish). “But that’s AGES ago” I hear you saying…but not all systems ship with an up-to-date version of Git (we’re on the 2.x branch now), so if you’ve got an older system running an even older version of Xcode (for example), this might not work as intended.

*More than one remote with the same branch name* You’ve omitted `origin` to get this to work. If there were a second remote with the same branch name it wouldn’t know which one to pick, and wouldn’t set up tracking. (I find it counter-intuitive that Git will assume tracking when LESS information is provided…but there are more than a few things I find counter-intuitive at the command line.)

Thanks again for the comment. I agree this is a great shortcut to know about!
Martin Lundberg says:

September 3, 2014 at 4:07 pm

Just learned about git push origin --delete I’ve always done git push origin : which is a lot harder to understand (and remember). Also did some digging and found it was added in 1.7.0 (https://raw.githubusercontent.com/git/git/master/Documentation/RelNotes/1.7.0.txt)
Mohammad Umair Khan says:

September 4, 2014 at 5:05 am

I have been using Atlassian’s Crucible/FishEye within our team for peer reviews, and I must say these tools are nice. Readers should try out these tools; they will eliminate time lost in Git commands and branching.
Emma Jane Hogbin Westby says:

September 4, 2014 at 11:02 am

Thanks for digging up when that was introduced, Martin! This is another one that I waffled on. I’ll admit that I typically use :branch-name (not the --delete parameter), as I’ve been using Git for longer than the new parameter’s existed…but as you mentioned, --delete is easier to remember. 🙂
Emma Jane Hogbin Westby says:

September 4, 2014 at 11:11 am

@Mohammad That’s great you’ve found some helpful tools for your team! Gerrit is another popular code review system people might want to look into. Unfortunately these tools don’t help you with the “social” part of the code review (what to review; how to provide feedback in a humane / useable manner). Sometimes the extra infrastructure isn’t worth the overhead for smaller teams. I’m delighted to hear you’ve got a system that works for you!
dleppik says:

September 4, 2014 at 11:21 am

Good advice.

I’m a solo developer, and yet as I mature I’m getting good at self-reviews. I love the fact that JetBrains’s IDEs put a visual diff right in the check-in window. JetBrains is working on a tool specifically for code reviews; their intro video makes it look a lot like the things that are already in the IDEs.

Code that doesn’t compile or which fails simple regression tests shouldn’t be reviewed. Accidents do happen, but don’t waste people’s time by (git) pushing obviously broken code.

One thing I look for in particular is debug code and code that made sense only while debugging (e.g. simple variables that are only used once.) In part, that’s because it’s fairly easy to find [at least in my code 🙁 ] so it gives me something to focus on. Again, I’m doing self-reviews, so looking at other people’s code would be easier to look at with fresh eyes.
Emma Jane Hogbin Westby says:

September 4, 2014 at 11:57 am

@dleppik Great suggestions! Thanks. 🙂
Patel Narendrakumar says:

September 5, 2014 at 4:10 am

Nice article. I am now suffering from code reviews stuff. I used Atlassian’s Crucible/FishEye for code review; it saves more time. Thanks. 🙂
Adam Donahue says:

September 5, 2014 at 2:16 pm

I fully agree that code reviews are an essential and useful part of any development workflow. But I disagree about what you include in the code review process.

Many of the steps you outline would be better performed prior to a review request, and in many cases via an automated process:

(I) Stylistic adherence is better managed via a pre-commit lint, and any issues should prevent a commit without a specific override on the part of the developer.

(II) Unit testing should also be performed prior to review, and rather than having the reviewer run a test himself it should be a policy of the review process that all review requests include the updated unit test source. (So if I’m modifying utils.cc I’ve also updated and committed tests/utils.cc.) The code reviewer should then ensure the unit tests capture and correctly test the updated semantics (including edge cases).

(III) Related to this is that execution of unit tests should be part of the commit process; if unit tests don’t succeed, the commit fails (unless the committing developer specifically overrides this check, and specifically documents the reason).

It’s my general view that a code review should require nothing more on the part of the reviewer than an open diff. There should be no need to set up and run the changes, to create or delete branches locally, etc. It should be possible to do a full review via GitHub, for example, if one is using that as the source repository. Anything beyond that stretches what I’d say is under the purview of a ‘review.’ (Note that this will also increase productivity by allowing a user to avoid having to stash local but uncommitted changes, fetch updates, and so forth, all of which would require unwinding when the review is complete.)

What we’ve found effective is a simple practice of requesting a code review via GitHub, using inline comments to point out issues or suggestions in the code, and commenting on a commit when signing off. (We’ve also incorporated git commit -s in the past but this again is something that might be automated via a hook.)
Emma Jane Hogbin Westby says:

September 5, 2014 at 6:06 pm

The [review article] isn’t how you would have written it. You’re a developer with battle-tested opinions, and you know you’re right, you just haven’t had the chance to update the Wikipedia page yet to prove it.

😉

@adam I can’t tell from your Facebook profile, but it seems like you might be a software developer, or perhaps a backend developer as opposed to a web dev (specifically front end dev)? What you’re describing seems to match a continuous deployment approach to software dev with a full test suite…?

(Un)fortunately this isn’t how all teams approach software development. Some only focus on writing tests for *high value business logic*. Some would argue there are even types of fixes that are caused by “untestable” errors (what if I had simply changed the URL, and it wasn’t a bug caused by simply missing the protocol from the URL?). And even beyond what you’ve described, some will also write acceptance tests based on the output of an image diff (think wraith). There are so many ways to add automated tests that it can get quite overwhelming for a team to know where to start. And, sadly, they were well out of scope for my article.

I appreciate you taking the time to put together such a thoughtful comment though about how you’ve integrated automated testing into your workflow. Hopefully you’ll be able to put together a little article with the full details of how you’ve configured the system. It sounds like a fair amount of work for teams and I’m sure there would be a lot of people who would really appreciate your guidance!
Netguru says:

September 9, 2014 at 7:00 am

Seems like everyone knows that code review is a valuable tool but a lot of teams struggle with implementing it because of the overhead they feel they just can’t afford right now. But apart from making your code better, it can also make you, as a dev, better thanks to the entire culture of feedback that the process is connected to. We’ve enumerated a number of reason why should one care to do code reviews (and how we handle it internally): a quick guide to peer code review
M Parker says:

September 10, 2014 at 8:41 am

Excellent article!

When I tried code snippet #5 (git log master), however, Git didn’t give me the full log messages of all the commits that were in my feature branch and not in the master branch (instead, I saw all the commits to master, and none of the commits on the feature branch I was on).

After a bit of digging in the git-log documentation, I tried git log master.. instead (i.e.: I added two dots to the end of the branch name), and I was able to get the results described.

I’m using Git 2.1.0 on OS/X 10.9.
Emma Jane Hogbin Westby says:

September 10, 2014 at 5:06 pm

Good eye, @M_Parker! You’re absolutely right: those two periods need to be there! I’ve asked the ALA team to make the correction.

You can also explicitly add the name of the branch you’re in so that you don’t (like me!) forget to add the periods. So the correction could equally have been git log master..61524-broken-link OR as @M_Parker correctly pointed out, just git log master.. and while we’re at it, I’d probably throw in the parameter --oneline to give you the short version of the commit messages to get a quick overview before digging into the specifics.

For those following along at home, this is a fun one to play with. You can move the two periods to the other side of “master” and it will give you any changes which have been merged into master but which haven’t been included in the bug-fix branch (i.e. changes that happened AFTER the branch forked off to do its own thing). Hopefully you get nothing back, but if you get something back there’s a potential for conflicts when you try to merge the two branches at the end of the review process.
Jamie Knight says:

September 11, 2014 at 3:07 am

Hiya,

I really liked this article, in my last role (Developer in BBC Platform Engineering) the cost of bugs was extremely high so a good review process was essential.

We did do things slightly differently, each review was normally done in person and we had assigned roles:

* Presenter – person who wrote the code
* Time keeper – reviews over 45 mins rarely work out well. Two reviews work better than one long review! Time keeper provided time remaining count downs. Also helps to prevent holy wars and loooong debates.
* Note taker

The note taker would take notes on points discussed, but it was up to the developer how they acted on the discussion. We had a rule that only the original developer could make changes to the branch (unless they invited someone to help) as we found this helped people feel better ownership of their work.

In many ways, the code review represented the moment where the work stopped being owned by the individual, and started being owned (and supported etc) by the team.

We didn’t have such a refined process for async reviews, but it will be something to look at in the future.

Thanks again for the article,

Cheers,

Jamie + Lion
John Pereless says:

September 15, 2014 at 4:13 am

Very good article on code reviews and advice! I will be referring it for my programming and fundamentals. Thanks
Emma Jane Hogbin Westby says:

September 17, 2014 at 6:42 am

@Jamie_Knight Thank you for adding your comments about synchronous / real-time code reviews! (This is my personal favourite way to deliver bigger code reviews.) I’m glad to know there are teams out there who find the time to do it this way as it can often be very difficult to fit them into the schedule.
