RFC: handling large files via fragmenting

Aaron Bentley aaron at aaronbentley.com
Mon Aug 25 18:02:21 BST 2008


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

John Arbash Meinel wrote:
>> We could also approximate this by storing a cheap checksum of each line,
>> and doing an initial match based on the checksums.
> 
> I'm pretty sure that is what Robert was meaning.

Sorry, I should have emphasized the *storing* part of my sentence.  I
was talking about storing this data in the repository.
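
A minimal sketch of the stored-checksum idea, for illustration only. It
assumes lines are byte strings, uses zlib.adler32 as the "cheap checksum",
and uses difflib in place of whatever matcher the repository would really
use. The function names are hypothetical, and the checksum lists are
computed on the fly here where the proposal would read them from storage.

```python
import zlib
from difflib import SequenceMatcher


def line_checksums(lines):
    """Cheap per-line checksums (assumed to be stored next to the text)."""
    return [zlib.adler32(line) for line in lines]


def match_by_checksum(old_lines, new_lines, old_sums, new_sums):
    """Match on checksums first, then confirm each candidate on the text."""
    matcher = SequenceMatcher(None, old_sums, new_sums, autojunk=False)
    confirmed = []
    for i, j, n in matcher.get_matching_blocks():
        for k in range(n):
            # A checksum collision would be a false match, so verify the
            # actual lines before accepting the pair.
            if old_lines[i + k] == new_lines[j + k]:
                confirmed.append((i + k, j + k))
    return confirmed
```

The initial match touches only small integers; full lines are read only to
confirm pairs that the checksums already agree on.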

>> Or another alternative would be to use the compression deltas to seed
>> the diff.  This requires a line-based delta approach, but has the
>> advantage that it cannot produce false matches, only false mismatches.

> It also requires a format that has these readily available.

Indeed, which is why I said "This requires a line-based delta approach".
I'm well aware that groupcompress is not one.
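
A rough sketch of seeding a diff from a line-based delta, under stated
assumptions: copy_ranges is a hypothetical list of (old_start, new_start,
length) tuples extracted from the delta, sorted and non-overlapping in both
texts, and difflib stands in for the real diff algorithm. Lines the delta
copied are known matches; lines it inserted may still match the old text
(a false mismatch), so only those gaps are diffed.

```python
from difflib import SequenceMatcher


def seeded_matching_blocks(old_lines, new_lines, copy_ranges):
    """Return (old_start, new_start, length) matches, seeded by the delta."""
    blocks = []
    old_pos = new_pos = 0
    # A zero-length sentinel makes the loop also cover the trailing gap.
    sentinel = (len(old_lines), len(new_lines), 0)
    for old_start, new_start, length in list(copy_ranges) + [sentinel]:
        # Only the gap before this copy range goes to the diff algorithm.
        gap = SequenceMatcher(
            None,
            old_lines[old_pos:old_start],
            new_lines[new_pos:new_start],
            autojunk=False,
        )
        for i, j, n in gap.get_matching_blocks():
            if n:
                blocks.append((old_pos + i, new_pos + j, n))
        if length:
            # Copied lines are trusted as matches and never re-examined.
            blocks.append((old_start, new_start, length))
        old_pos, new_pos = old_start + length, new_start + length
    return blocks
```

For example, if the delta copied the first two lines and stored the rest as
an insert, the two copied lines are accepted as-is and only the remaining
lines are compared.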

Aaron
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.6 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFIsuWd0F+nu1YWqI0RAkMIAJ98hw9TQnGgz5pSbweSmNxykGiQCgCfQ01k
BcMuqQrGWkIZZW9vuP2q8QA=
=QWq0
-----END PGP SIGNATURE-----


