Principles of Version Control

Version control serves two purposes. The first is to avoid the conflicts that can arise when multiple programmers work on the same set of files in a project. The second is to automatically journal changes to key files in a project.

By requiring programmers to check out a file from a central repository when they wish to work on it, a version control platform can keep a record of who is working on which files and at what times. Depending on the approach taken, this method can either wholly exclude other developers from working on that file at the same time, or allow other developers to work on the file as well and then merge the changes when each developer checks the file back in.

Journaling involves retaining previous versions of each file in a project. Whenever a file is checked in to a repository, the version control platform marks it as current, so that anybody retrieving that file from the repository gets the latest version by default. The previous state of the file is kept as well. Not only does this allow the project administrator or lead architect to roll back a file should a new version prove problematic, it also allows the changes between versions to be listed easily, which is a major boon from a project management perspective.

All version control systems use some variety of repository for storing a copy of the file and directory structure of the project, usually verbatim on disk but sometimes as part of a more complex and often proprietary database format. How this repository is accessed varies from system to system. We examine a typical topology later in this appendix.
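The journaling behavior described above can be sketched in a few lines of PHP. This is only an illustration, assuming PHP 7.4 or later: the class name FileJournal and its methods are hypothetical, versions are held in memory rather than in a real repository, and the change listing is a naive line comparison rather than a true diff.

```php
<?php
// Hypothetical in-memory journal for a single file. A real platform would
// persist versions to disk or to a database.
final class FileJournal
{
    /** @var string[] Every checked-in version, oldest first. */
    private array $versions = [];

    // Store a new version, which becomes the current one.
    // Returns the 1-based number of the new version.
    public function checkIn(string $contents): int
    {
        $this->versions[] = $contents;
        return count($this->versions);
    }

    // Retrieve the latest version by default, or a specific earlier one.
    public function retrieve(?int $version = null): string
    {
        if ($this->versions === []) {
            throw new RuntimeException('Nothing has been checked in yet.');
        }
        $version ??= count($this->versions);
        if (!isset($this->versions[$version - 1])) {
            throw new OutOfRangeException("No such version: $version");
        }
        return $this->versions[$version - 1];
    }

    // Roll back by checking in a copy of an earlier version, so that the
    // problematic version still remains in the journal.
    public function rollBack(int $version): int
    {
        return $this->checkIn($this->retrieve($version));
    }

    // List the lines removed and added between two versions. This naive
    // comparison ignores line order and duplicate lines.
    public function changes(int $from, int $to): array
    {
        $old = explode("\n", $this->retrieve($from));
        $new = explode("\n", $this->retrieve($to));
        return [
            'removed' => array_values(array_diff($old, $new)),
            'added'   => array_values(array_diff($new, $old)),
        ];
    }
}

$journal = new FileJournal();
$journal->checkIn("line one\nline two");
$journal->checkIn("line one\nline 2");
$changes = $journal->changes(1, 2);
```

Rolling back is modeled here as a new check-in of old contents, which is one possible design choice; it keeps the journal append-only, so no history is lost.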

Concurrent vs. Exclusive Versioning

Version control platforms differ immensely in their implementation of the principles discussed earlier, not least in their means of checking out a file.

In any version control platform, when a programmer checks out a file, the latest version is always retrieved from the repository and replaces the programmer's local copy.

However, in a platform employing Exclusive Versioning, a lock is then placed on that file with immediate effect. While the file is checked out, other developers may still retrieve the latest version of that file, but they will not be able to check it out to work on it themselves. Exclusive Versioning is usually enforced by marking each developer's local copy of a file as read-only until that developer checks it out. Of course, this is only notional enforcement, and it requires the developers' cooperation to work well in practice. When the developer working on the file checks it in, the repository is updated to hold the new version and the lock is removed.
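The locking rules just described can be modeled as follows. This is a minimal sketch, assuming PHP 7.4 or later; the class ExclusiveRepository and its method names are hypothetical, and the lock is enforced inside the repository object rather than through read-only local files.

```php
<?php
// Hypothetical model of Exclusive Versioning: one lock per file.
final class ExclusiveRepository
{
    /** @var array<string, string> path => latest contents */
    private array $contents = [];

    /** @var array<string, string> path => developer holding the lock */
    private array $locks = [];

    public function add(string $path, string $contents): void
    {
        $this->contents[$path] = $contents;
    }

    // Anyone may retrieve the latest version, locked or not.
    public function getLatest(string $path): string
    {
        if (!array_key_exists($path, $this->contents)) {
            throw new RuntimeException("$path is not in the repository");
        }
        return $this->contents[$path];
    }

    // Checking out retrieves the latest version and places the lock.
    public function checkOut(string $path, string $developer): string
    {
        $latest = $this->getLatest($path);
        if (isset($this->locks[$path])) {
            throw new RuntimeException(
                "$path is already checked out by {$this->locks[$path]}"
            );
        }
        $this->locks[$path] = $developer;
        return $latest;
    }

    // Checking in stores the new version and removes the lock.
    public function checkIn(string $path, string $developer, string $contents): void
    {
        if (($this->locks[$path] ?? null) !== $developer) {
            throw new RuntimeException("$developer has not checked out $path");
        }
        $this->contents[$path] = $contents;
        unset($this->locks[$path]);
    }
}

$repo = new ExclusiveRepository();
$repo->add('example.txt', 'first draft');
$repo->checkOut('example.txt', 'alice');

$bobWasRefused = false;
try {
    $repo->checkOut('example.txt', 'bob');   // refused: alice holds the lock
} catch (RuntimeException $e) {
    $bobWasRefused = true;
}

$repo->checkIn('example.txt', 'alice', 'second draft');
$bobsCopy = $repo->checkOut('example.txt', 'bob');   // succeeds now
```

The developer names and the file name in the usage lines are placeholders chosen for the illustration.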

Concurrent Versioning adopts quite a different approach. The acts of retrieving the latest version of a file and checking it out to work on are combined so that they are essentially one and the same. In other words, to work on a file, all a developer must do is ensure that he or she has the latest version of that file and then start working on it. When each developer has finished making changes to the file, he or she will check it in. This is where the magic happens. If another developer has checked in a changed version since the developer in question retrieved the file, the two sets of changes will be merged.
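One common way to perform such a merge is to compare both changed versions, line by line, against the version they both started from. The function below is a deliberately simplified sketch of that idea, not the algorithm of any particular platform. The name mergeLines is hypothetical, and the sketch assumes that lines were only modified, never inserted or deleted, so that all three versions have the same number of lines. Where both developers changed the same line differently, it emits conflict markers instead of guessing.

```php
<?php
// Naive three-way merge over arrays of lines.
// $base   is the version both developers started from,
// $mine   is the version being checked in now,
// $theirs is the version another developer has already checked in.
// Assumption: all three have the same number of lines.
function mergeLines(array $base, array $mine, array $theirs): array
{
    $base   = array_values($base);
    $mine   = array_values($mine);
    $theirs = array_values($theirs);

    if (count($base) !== count($mine) || count($base) !== count($theirs)) {
        throw new InvalidArgumentException(
            'This sketch only handles versions with equal line counts.'
        );
    }

    $merged = [];
    $conflicts = 0;
    foreach ($base as $i => $original) {
        if ($mine[$i] === $theirs[$i]) {
            $merged[] = $mine[$i];        // untouched, or changed identically
        } elseif ($mine[$i] === $original) {
            $merged[] = $theirs[$i];      // only the other developer changed it
        } elseif ($theirs[$i] === $original) {
            $merged[] = $mine[$i];        // only this developer changed it
        } else {
            $conflicts++;                 // both changed it, differently
            array_push(
                $merged,
                '<<<<<<< mine',
                $mine[$i],
                '=======',
                $theirs[$i],
                '>>>>>>> theirs'
            );
        }
    }

    return ['lines' => $merged, 'conflicts' => $conflicts];
}

$base = ['alpha', 'beta', 'gamma'];

// Changes to different lines merge cleanly.
$clean = mergeLines($base, ['ALPHA', 'beta', 'gamma'], ['alpha', 'beta', 'GAMMA']);

// Different changes to the same line produce a conflict.
$clash = mergeLines($base, ['ALPHA', 'beta', 'gamma'], ['Alpha!', 'beta', 'gamma']);
```

In the first call each developer touched a different line, so both changes survive in the merged result. In the second call both touched the first line, so the result contains both candidates between markers and a human must choose.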

A Concurrent Versioning Example

To make things a little clearer, consider the following example. This is an imaginary file called helloworld.php that prints "Hello World" in the Web browser. You will see that line numbers have been included. Of course, these would not be included in the code as it is saved.

   1: <?php
   2:   $strToPrint = "Hello World";
   3: ?>
   4: <html>
   5:   <body>
   6:     <?=$strToPrint?>
   7:     <br /><br />
   8:   </body>
   9: </html>

You can call this snippet of code version 1.0 of the file.

Imagine that Jane Doe and John Doe are both working on the project. Assume that John works out of New York and Jane out of Los Angeles.

A meeting has been called with the client. The client wishes to change the code of this particular file so that instead of printing "Hello World," it prints "Goodbye World." John Doe's manager has given John the task of modifying the code to reflect this requirement.

In that meeting, the client also requested that a horizontal rule be drawn underneath the printed text. Jane Doe's manager has given her the task of modifying the code to include that extra line.

In an Exclusive Versioning setup, it would be impossible for John and Jane to make their changes at the same time. John would have to check out the file, make the change, and check it in again. Jane could then check out the file, make her change, and check it in again.

In a Concurrent Versioning setup, no such requirement exists. Say that at 12:00 p.m. (Eastern Time) John does a check out to get the latest version of the file, currently version 1.0. He then starts work on making his change. At 12:01 p.m. (Eastern Time), Jane does a check out as well, also to get the latest version of the file. This is still version 1.0; John hasn't checked anything in yet. Jane starts work on her change.

At 12:05, John is done. The code works fine, so he decides to check in his work. He does so, and the repository saves his newly submitted version as the latest version, version 1.1. Version 1.1 now looks like this:

   1: <?php
   2:   $strToPrint = "Goodbye World";
   3: ?>
   4: <html>
   5:   <body>
   6:     <?=$strToPrint?>
   7:     <br /><br />
   8:   </body>
   9: </html>

At 12:09, Jane is done, too. The code works fine for her, so she now wants to check in her work. Her code now looks like this:

    1: <?php
    2:   $strTime = date("H:i:s");
    3:   $strToPrint = "Goodbye World, it's $strTime";
    4: ?>
    5: <html>
    6:   <body>
    7:     <?=$strToPrint?>
    8:
    9:     <hr />
   10:   </body>
   11: </html>

    1: <?php
    2:   $strTime = date("Y-m-d");
    3:   $strToPrint = "Goodbye World, it's $strDate";
    4: ?>
    5: <html>
    6:   <body>
    7:     <?=$strToPrint?>
    8:
    9:     <hr />
   10:   </body>
   11: </html>

    1: <?php
    2:   $strTime = date("Y-m-d");
<<<<<<< helloworld.php
    3:   $strToPrint = "Goodbye World, it's $strTime";
=======
    3:   $strToPrint = "Goodbye World, it's $strDate";
>>>>>>> 1.3
    4: ?>
    5: <html>
    6:   <body>
    7:     <?=$strToPrint?>
    8:
    9:     <hr />
   10:   </body>
   11: </html>
