authors	state
Dave Pacheco <dap@joyent.com>	abandoned

RFD 45 Tooling for code reviews and code standards

Status: This has been abandoned as it mostly describes usage of Gerrit. Joyent engineering has since moved away from Gerrit to using GitHub PRs for code review.

As we look to scale contributions for Triton, Manta, and other Joyent software from both new hires and the broader community, we're proposing to formalize the existing process around code review and code standards. Particularly, we'd like to:

provide an easy golden path for sending out code reviews, providing feedback, and iterating on that feedback
make sure that these processes are clear to newcomers. Ideally, these processes should be well-documented and discoverable.
enforce basic standards on all code changes, including that code passes make check (including both style and lint), smoke tests, and code review where appropriate, and that commit messages match the expected form. It would be great if tooling made it easy to do the right thing here, rather than just disallowing the wrong thing.

The purpose of the proposal is to help automate and enforce the process we want, not to impose new, different process just because that's what a tool happens to do. That said, adopting tooling would likely change the workflow a bit, and it'll be important to distinguish between changes that are merely different (e.g., you used to run command X and now you run command Y) from those that are actually worse (e.g., you used to run command X to do something and now there's no place in the workflow to do that).

Overview

This RFD covers:

Our process today
Proposed goals
Proposal: Gerrit for both automation and code review
Alternatives (GitHub, webrev, others)
Gerrit deeper dive
Frequently asked questions and frequently raised fears
Questions to be answered
Road map

Our process today

The process we follow today is basically:

Most components have one or more owners whose feedback is recommended for changes.
Any member of the engineering team can push changes to any engineering repo. Community members can submit changes, but they generally have to be approved by someone on the team.
Code review is recommended, but not strictly required. (There is no mechanism to enforce code review.) The specific mechanism for review varies by engineer and project: webrev, GitHub pull requests, links directly to commits on GitHub, links to patches in gists, and so on. These sometimes get emailed or just dropped into Jabber.
While reviewers can suggest that a change should not proceed, this is not enforced. In practice, people generally do not integrate changes over the objections of reviewers.
The expectation is that no change is integrated that does not pass make check and the built-in test suite. In general, any breakage is considered bad, and needs to be either fixed or backed out quickly.
We have defined requirements around commit messages, but these are not enforced by any tooling, and many commits to many repositories don't match these requirements.

Proposed goals

We want to formalize this so that:

all changes have the opportunity to be reviewed before integration, using an easy, uniform process for all core repositories
we enforce make check and optionally basic smoke tests for each push
we enforce one-commit-per-ticket (except for rare overrides). That is, the same ticket should not be used in multiple commits in the same repository except for the rare case of fixups.
we enforce commit message format

Unchanged from today: we'll continue to let anyone on the team to give final approval for integrating a change (with the assumption that people are acting in good faith).

The automation should support people's existing workflows, including allowing engineers to preserve intermediate commits locally (and even pushing these to a remote branch) while only submitting some of these checkpoints for review.

It would be nice if the system also supported:

incremental diffs (e.g., multiple revisions of the same change)
email notifications of new reviews and review feedback
both general feedback and feedback associated with specific lines of code
ability to interact via email or local editors or something other than the captive browser interface. This might mean being able to reply to emails and have that feedback show up in the tool, or using a CLI (via an API) to submit review, or something else.
code reviews from open-source community members

This system should also be easy for new users to get set up with.

Proposal: Gerrit for both automation and code review

Gerrit is a web-based code review and code management system. Using Gerrit would be a pretty big change from our existing workflow. In a typical configuration:

Gerrit stores the authoritative source repositories. We'd use the Gerrit server instead of GitHub for cloning, pulling, and pushing repositories. People (and CI, through Jenkins) can still clone from GitHub.
When you push changes to Gerrit, we can (but don't have to) enforce that they go through a review and sanity-check process.
Once changes are integrated, Gerrit replicates changes to GitHub, so the GitHub repo is still useful, but changes would always go through Gerrit (even if we wanted to bypass the review process).

Pros:

There's a very clear Golden Path for code review. It would be pretty easy to enforce that all repos work the same way if we want. The UI guides people through the process so that the "easiest path" is also the one that applies whatever process we want.
We can enforce make check, successful test runs, and other criteria on all changes. We can also configure these to be overridable, if desired.
Gerrit has flexibility to implement our existing process, as well as stricter processes if we want that. See the "Gerrit Deeper Dive" below.
There's a command-line interface and REST API that includes support for reviewing changes.
We could set up GitHub-based authentication/authorization so that external community members can use the same system.

Cons:

The philosophy is pretty different from what we're used to. Today, we're used to being able to push directly, and we assume people have done their homework. If they want to get review, they do it however they want. With Gerrit, we would almost certainly want to ensure that the final integration step happens through Gerrit (either via the CLI, API, or web UI), and that feels like a pretty big change.
Current engineers, new team members, and community members have to learn how to use Gerrit. (They're less likely to already know how to use it than, say, GitHub.)

Gerrit provides a lot of structure around the code integration process. There's a deeper dive below.

Alternatives

The automation and code review pieces are theoretically separate.

GitHub-driven automation

We could configure all of our repos to run make check via jenkins when Pull Requests are submitted. Then people could use Pull Requests to submit changes for review (even if they have access to the repository). Note that we don't have to use the PR "Merge" button for this to work, though that would also work as long as we require merges to always fast forward.

Pros: We get the ability to verify that make check and the test suite pass for each change that we review.

Cons: Pull Requests can be annoying to manage, and people sometimes don't bother creating them. For many changes, it's easier to just send people links to commits and then push it to the repo directly. Unless we enforce the PR workflow somehow (including on project owners), then it's not very helpful.

GitHub-driven code review

We could use GitHub Pull Requests for code reviews. (This is similar to the previous section, but we could do either one without the other.)

Pros: we'd have a path for code review. It matches what many community members expect, and it remains part of the history (although some of that might disappear if people destroy their clones).

Cons:

GitHub is a proprietary service over which we have no control; when it is missing features that we deem necessary, changes their business model around features that we depend on or deprecates features we deem essential, we have no recourse. This is a very, very significant con and alone broadly disqualifies GitHub from further consideration.
GitHub's design center (namely, the social aspects of open source) does not fit our use case (namely, the rigor of professional software engineering): it is not designed to rigidly mandate that integrations meet certain criteria, formally track iterative review, separate review from approval, etc. Yes, there are analogues in GitHub for these constructs -- but GitHub's design decisions are unquestionably around making development easily accessible not about making engineering well-formalized and automatically enforced.
Incremental diffs are possible by submitting multiple commits, but it's not always clear whether multiple commits are drafts of the same work (e.g., different checkpoints for reviewers to look at), or just intermediate work that the author didn't want to squash yet. This problem is compounded for large bodies of work that has several rounds of review: GitHub makes it difficult to find the delta between a new round and a previously-reviewed version.
At a smaller scale, there are many downsides to GitHub diffs: e.g., they don't render manual pages. Some people prefer to be able to draft comments and revise them before submitting them, but GitHub submits each individual piece of feedback right away. That this can't be customized or otherwise altered is a concrete example of the peril of depending on a proprietary service.

Webrev-based code review

We could use webrev for code reviews.

Pros:

Some people prefer webrev's presentation of diffs.
Webrevs can be generated offline, copied around, and viewed offline.
Webrevs can be easily archived (e.g., into Manta).

Cons:

Although webrevs can be archived easily, we'd need to build new tools or infrastructure to organize and store them.
Webrevs aren't a communication channel in themselves. They don't provide any notifications, and feedback would need to be sent and stored separately.
There's no one golden path for webrevs today; most people drop them into various Manta directories. Incremental diffs are definitely possible, but they require manually managing each separate webrev. For a change with 3 revisions, you might reasonably want to keep each of the three revisions, plus three incremental webrevs. We would likely want to build some tooling and documentation to make this easier.

As mentioned in the Gerrit section, we can have Gerrit generate webrevs to get many of these benefits.

Phabricator and other options

Gerrit is one of several systems that can manage process like this. We have not explored Phabricator or other similar tools, mainly because we'd heard positive reports about Gerrit and a negative report about Phabricator.

Gerrit Deeper Dive

This section has been replaced with the cr.joyent.us user guide.

Besides code review, there are a few other nice things about Gerrit for our use-case:

It's got a REST API and CLI for working with changes.
It can integrate with GitHub auth.
It's got a decent plugin system. We can use this for Jenkins and JIRA integration.

Frequently asked questions and frequently raised fears

Are we changing our existing policies around code review or pre-integration checks?

No, but we've got several policies around commit message format and "make check" that are not currently enforced, and we're proposing enforcing these. To do that, the long term plan would be that we only do most pushes through the Gerrit Change Review process.

Gerrit's policies appear flexible enough to implement our existing policy (which is essentially that anything goes -- see above), and it may be that Gerrit is valuable even if all of the code review and verification features are optional. But the assumption here is that it's worthwhile to enforce the practices we already consider strongly recommended.

What about merges from upstream (as for illumos)?

While most deployments disallow pushing directly to master without going through the Change process, Gerrit does support direct pushes. Upstream merges may want to bypass the review process and push directly to master, as we've already decided that our policy is to incorporate these changes without review (unless/until we discover breakage). This seems like a special case.

As an author, sometimes I want to be able to control exactly when integration happens, even if all the reviewers are satisfied. Can I do this?

A Change becomes eligible for integration when requirements are met, like when a +2 is provided and no -2's are provided. But the change isn't automatically integrated. There's a separate "Submit" button for this. One approach that some teams use is to provide an implicit -2 from the author on all new changes to give the author a final opportunity to check on things before integrating a change.

I heard that Gerrit only supports single-commit changes. How do we iterate on reviews?

Yes, each revision of a change (called a patchset) has to be encapsulated in a single commit. However, you can submit a new commit to an existing change. This will show up as a new revision. The web UI makes it easy for reviewers to see the latest patchset (which encapsulates the current version of the entire change) or diff it against any previous patchset. That allows reviewers to diff between whatever they saw last time (or any previous time).

Okay, but what if I like to keep a bunch of commits that I don't want to send out for review?

You can do this, too.

The deal with Gerrit is that each patchset (each revision of a change) has to be a single commit. That makes sense, because the patchset represents exactly what's going to land onto master, which is a good thing. Test what you ship, and ship what you test.

But that doesn't mean you can't keep multiple commits locally. All it means is that when you want to send something for review, you have to create a squashed commit -- but that doesn't have to change your working directory or even change "master" on your working copy. We could (and probably should) build a tool that creates a new, temporary branch based on origin/master, cherry-picks all of your local commits onto it, squashes them, submits that to Gerrit as either a new change or a new patchset, and then switches back to the branch you were on. Several of us have been using this workflow internally (without a tool), and it works fine.

You can even push all your intermediate commits to a remote branch. We don't have to apply the Change Review process to every branch in a repo.

What about cross-repository changes?

As far as we can tell at this point, changes to multiple repositories have to be submitted as separate Changes in Gerrit, without any link between them. This is analogous to separate webrevs or separate PRs.

When there's a cross-repo change to repositories A and B, and A explicitly depends on B, then the automated verification might do the wrong thing, depending on how that dependency is expressed. If A depends on B's "master" branch, then the automated verification will pull in the wrong B. If A depends on a specific commit of B (or semver), then the automated verification may pull in the right version. We may need to explore this more.

What about cross-repository flag days?

We'll have to treat these as separate changes, and whoever submits them will need to know the appropriate order to submit them, and that all changes need to be submitted. (This isn't that different from the way pushes are coordinated today.)

I heard Gerrit pollutes commit messages with a piece of change-id metadata.

To make iterating on changes easier, the common Gerrit workflow is that you include this change-id line in your commit message. That's how Gerrit knows this is a new patchset for an existing Change, rather than a new Change. However, this is not required: you can push to the special remote refs/changes/CHANGE_ID, in which case you can leave the metadata out of the commit message. We recommend this approach.

But I like webrev's presentation of diffs!

We could build a Gerrit plugin that generates a webrev, stores it into Manta, and links to it from the Gerrit interface. We would still get the benefits of automated checks and a channel for code review feedback, and we'd also get the webrev benefit of rendering manual pages and generating PDFs.

But I like GitHub's presentation of diffs!

I haven't actually heard this request yet. But there are Gerrit plugins that attempt to synchronize pull requests with Gerrit changes. Much more exploration would be needed to figure out if this is possible, how to do it, and how to avoid creating confusion.

Will we be able to review community contributions in the same way?

While we haven't tested this, the prototype Gerrit uses GitHub auth, so the expectation is that community members could use the same process (provided they're willing to use Gerrit). This may mean that we need to manage access control for repositories in Gerrit instead of GitHub, though.

Questions to be answered

Are we as a team and community happy with Gerrit specifically? Particularly: with the inversion of control (i.e., Gerrit itself does all integrations, and we just push the button)?
Are there other workflows in use today that we want to keep supporting that Gerrit doesn't support?

Road map and next steps

Here's the proposed plan, assuming we don't decide to abort partway through if we discover Gerrit won't work for us:

done: get a Gerrit instance set up so that people can start playing with it
start fleshing out features of the Gerrit instance that we'll want (e.g., commit message validation, running make check)
meanwhile, have interested people start using Gerrit in earnest for their repositories (even if those features aren't all present yet)

For any given repository, the steps would be:

add the repo to Gerrit so that people can submit changes and leave feedback with it
add support for replication from Gerrit to GitHub so that people can also submit changes through Gerrit
disallow pushing to GitHub directly, or pushing to #master in Gerrit without going through the submission process

To this end, we've set up a prototype Gerrit deployment in west1. You can reach it at:

https://cr.joyent.us

This isn't done yet! It's definitely usable, but it's missing things like commit message validation, make check and more. It may be redeployed and updated frequently. The plan is to avoid blowing away any state there.

There are instructions for getting set up with cr.joyent.us and importing projects.

That repository also has a summary of what features have been implemented (e.g., email notification, GitHub replication) and which are on the TODO list.

If you want to take a look at two real code reviews that went through Gerrit:

https://cr.joyent.us/#/c/1/ (node-cueball)
https://cr.joyent.us/#/c/14/ (node-jsprim)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

RFD 45 Tooling for code reviews and code standards

Overview

Our process today

Proposed goals

Proposal: Gerrit for both automation and code review

Alternatives

GitHub-driven automation

GitHub-driven code review

Webrev-based code review

Phabricator and other options

Gerrit Deeper Dive

Frequently asked questions and frequently raised fears

Questions to be answered

Road map and next steps

Files

README.md

Latest commit

History

README.md

File metadata and controls

RFD 45 Tooling for code reviews and code standards

Overview

Our process today

Proposed goals

Proposal: Gerrit for both automation and code review

Alternatives

GitHub-driven automation

GitHub-driven code review

Webrev-based code review

Phabricator and other options

Gerrit Deeper Dive

Frequently asked questions and frequently raised fears

Questions to be answered

Road map and next steps