Diff and merge of archives - proposal

John Arbash Meinel john at arbash-meinel.com
Wed Oct 13 19:46:37 BST 2010


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1


...
> If an archive has changed, the contents will be decompressed and
> diffed (note that the contained files and folders will not have their
> own file-ids, so renames will not be detected!).

Note that if someone is versioning a 'tar.gz' file, they often need the
exact binary content. As an example, Debian packages are built from a
tar.gz file, and a sha/md5sum is used to make sure that the tarball is
valid. But that checksum is taken over the compressed content, not the
unpacked files. Because gzip output is not deterministic, this has led
to a lot of hacks (pristine-tar) to get the exact bytes back.

(At the least, it includes timestamp info in the metadata, but you also
have lots of flags and tweaks, such that two people probably won't get
the same content.)
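
To make the timestamp point concrete, here is a minimal sketch using
Python's standard gzip module (not anything bzr does today). The payload
and the gz() helper are made up for illustration, and the mtime argument
to gzip.compress() assumes Python 3.8 or later. It compresses the same
bytes twice with only the header timestamp changed, then compares both
the unpacked data and the checksum of the compressed blob.

```python
import gzip
import hashlib


def gz(payload: bytes, mtime: int) -> bytes:
    """Compress payload, forcing the timestamp stored in the gzip header."""
    return gzip.compress(payload, compresslevel=9, mtime=mtime)


# Hypothetical stand-in for the contents of a tarball.
payload = b"contents of a hypothetical tarball\n" * 100

# Same input, same compression level, only the header timestamp differs.
first = gz(payload, mtime=1)
second = gz(payload, mtime=2)

# The unpacked data is identical...
same_payload = gzip.decompress(first) == gzip.decompress(second)

# ...but a checksum over the compressed bytes is not.
same_checksum = (
    hashlib.sha256(first).hexdigest() == hashlib.sha256(second).hexdigest()
)

print("same unpacked data:", same_payload)
print("same compressed checksum:", same_checksum)
```

This only exercises the timestamp field. Different gzip implementations,
compression levels and flags can change the bytes as well, which this
sketch does not try to show.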

And if they *don't* need the binary blob to be exact, then why do they
need to version it as a binary blob?

Just a thought,
John
=:->
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.9 (Cygwin)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/

iEYEARECAAYFAky1/o0ACgkQJdeBCYSNAAMtEgCfQ0SY/6oBe4VuobSRPn54sb4a
BHcAoM88nL9kg7QYtHVvn9WNJBTJ6NzK
=/6YK
-----END PGP SIGNATURE-----


