Experiences converting Concurrent Versions System (CVS) repositories to Subversion on Mac OS X. A conversion attempt in early 2004 failed due to Berkeley DB wedging. Subversion now supports a new repository format (fsfs) that should avoid such problems. Sites with larger repositories or multiple developers will likely have additional challenges not covered here, such as scheduling the transition, converting existing infrastructure built around CVS, and retraining staff.
Consider also svk, a “decentralized version control system” that “can mirror remote Subversion, Perforce, and CVS repositories to perform offline operations”.
Installation
Install both Subversion and the cvs2svn conversion script. Use a ports or package system if possible, to avoid a tedious manual install.
Install Subversion
Obtain Subversion via a ports or package system, such as Fink for Mac OS X.
$ fink install svn-ssl
Manual Install
If compiling Subversion manually, consult the INSTALL under the source directory for complete install notes. For local use only, build Subversion with the --without-apache option. Omit Berkeley DB support and use the default fsfs repository type:
$ ./configure --without-apache
If building Subvesion via Fink, avoid the system-java14-dev error by installing the Java Developer Tools package from Apple.
Install cvs2svn
The cvs2svn script is also available in Fink. If cvs2svn does not install via Fink, python must still be installed, as the version of python that ships with Mac OS X 10.4.2 does not contain a DBM module:
ERROR: your installation of Python does not contain a suitable
DBM module. This script cannot continue.
to solve: see http://python.org/doc/current/lib/module-anydbm.html
for details.
To use cvs2svn from the source tarball, install python via Fink, then run cvs2svn under the source directory:
$ fink install python24-nox python24-nox-shlibs
$ tar xzf cvs2svn-1.2.1.tar.gz
$ cd cvs2svn-1.2.1
$ /sw/bin/python ./cvs2svn
USAGE: cvs2svn [-v] [-s svn-repos-path] [-p pass] cvs-repos-path
…
cvs2svn can optionally be installed under a bin directory, though the supporting cvs2svn_rcsparse directory must be placed under a PYTHONPATH path for python.
$ mkdir -p ~/lib/python ~/bin
$ chmod +x cvs2svn
$ cp cvs2svn ~/bin
$ cp -r cvs2svn_rcsparse ~/lib/python
$ export PYTHONPATH=$HOME/lib/python
To ensure the correct version of python is used, edit the first line of cvs2svn to point at Fink’s python.
$ cd ~/bin
$ head -1 cvs2svn
#!/sw/bin/python2.4
Conversion
The following are a rough list of steps that need to be done. More will be required if network access and multiple users are involved. Be sure to test the conversion several times before the final switch!
Repositories
If multiple CVS repositories need conversion, they either can be done incrementally, or first combined into a new CVS repository prior to conversion with cvs2svn.
As Subversion allows directories to be copied and moved, the Subversion repository can be rearranged at will. However, spend some time up front considering any layout changes, and how tags and branches will be managed under Subversion (usually via copies to the tags and branchs directories).
CVSROOT
The CVSROOT directory may contain scripts and other data that need to be ported to Subversion. Important files under the CVSROOT directory include the commitinfo and loginfo commit handlers, in addition to the checkoutlist file that lists additional files available under CVSROOT.
The Subversion book documents how commit hooks work under Subversion.
Tags & Branches
Before moving anything, investigate the existing CVS repositories for tags and branches. These the cvs2svn script will automatically place under the tags and branches subdirectories of the Subversion repository, though cvs2svn has --trunk-only and other options to adjust tag and branch handling. The cvs-list-symbols script will help list symbols under a CVS repository:
$ cvs-list-symbols /path/to/cvs/repository
…
If no tags need conversion, remove them from the CVS repository prior to conversion via the cvs-strip-symbols script. This script empties the symbols section of the Revision Control System (RCS) *,v files CVS uses.
Keywords
Subversion supports keyword properties similar to the keywords supported by CVS. To locate files under CVS repositories using keywords, use something like the following command.
$ cd /path/to/cvs/repository
$ egrep -rl \
'\$(Author|Date|Header|Name|Locker|Log|RCSfile|Revision|Source|State|Id)' *
Code that relies on the $Revision$ and other keywords may need to be changed to work properly under Subversion!
cvs2svn now converts some keywords by default, unless the --keywords-off is set. To manually set keywords under a Subversion sandbox after the conversion, use something like the following, which may run afoul binary files:
$ egrep -rl '\$Id: ' * | grep -v /.svn/ | xargs svn propset svn:keywords Id
$ svn commit
Ignores
Read the Subversion book for the exact handling of file ignores. In my case, I have tried the following, but it does not appear to be getting set in subdirectories of the Subversion sandbox. Will have to research this more:
$ cat ~/.cvsignore
*.db
.DS_Store
.*.swp
$ svn propset svn:ignore -F ~/.cvsignore .
$ svn update
$ svn commit .
Miscellaneous
- Backups
- rsync
Backups of a fsfs format repository should be straightforward. Consider also a commit hook to duplicate commited data.
If using rsync with the CVS specific -C option, try --exclude=.svn instead.
Training
To bring new developers up to speed with Subversion, consider: Subversion: The Definitive Guide and the Subversion FAQ. Subversion also contains built in documentation, available via the help option:
$ svn help
…
$ svnadmin help
…