split inventory progress

Robert Collins robertc at robertcollins.net
Wed Nov 12 09:18:39 GMT 2008


On Wed, 2008-11-12 at 17:30 +1000, Martin Pool wrote:
> On Wed, Nov 12, 2008 at 4:57 PM, Robert Collins
> <robertc at robertcollins.net> wrote:
> > btree(1.9):
> > Commits: 107
> >                      Raw    %    Compressed    %  Objects
> > Revisions:       1074 KiB   0%       266 KiB   3%      107
> > Inventories:    78202 KiB  49%       580 KiB   7%      107
> > Texts:          79439 KiB  50%      6552 KiB  88%     3025
> > Signatures:         0 KiB   0%         0 KiB   0%        0
> > Total:         158716 KiB 100%      7399 KiB 100%     3239
> >
> > split-inventory:
> > Commits: 107
> >                      Raw    %    Compressed    %  Objects
> > Revisions:       1078 KiB   1%       269 KiB   3%      107
> > Inventories:      835 KiB   1%       446 KiB   6%     1243
> > Texts:          79439 KiB  97%      6552 KiB  90%     3025
> > Signatures:         0 KiB   0%         0 KiB   0%        0
> > Total:          81353 KiB 100%      7268 KiB 100%     4375
> 
> For those not used to reading this output, the key point is that the
> inventories have shrunk from being 78MB in uncompressed form, to only
> 835kB.  After gzip compression, the difference is not so extreme.
> Essentially we're recognizing common data between trees at a higher
> level, rather than making gzip do the work, so this will be faster to
> commit and query, and more compact.

Having fixed a bug that prevented pulling further revisions; I have have
some sad news - we were not writing the entire tree, so it isn't as
dramatic as it seemed :(.

It's still pretty good though:

btree(1.9)
Commits: 107
                      Raw    %    Compressed    %  Objects
Revisions:       1074 KiB   0%       266 KiB   3%      107
Inventories:    78202 KiB  49%       580 KiB   7%      107
Texts:          79439 KiB  50%      6552 KiB  88%     3025
Signatures:         0 KiB   0%         0 KiB   0%        0
Total:         158716 KiB 100%      7399 KiB 100%     3239

split-inventory
Commits: 107
                      Raw    %    Compressed    %  Objects
Revisions:       1078 KiB   1%       269 KiB   3%      107
Inventories:     2497 KiB   3%      1296 KiB  15%     2942
Texts:          79439 KiB  95%      6552 KiB  80%     3025
Signatures:         0 KiB   0%         0 KiB   0%        0
Total:          83015 KiB 100%      8118 KiB 100%     6074

so 78MB down to 2.5MB. I'm pulling across a bigger sample set now.

-Rob

-- 
GPG key available at: <http://www.robertcollins.net/keys.txt>.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
Url : https://lists.ubuntu.com/archives/bazaar/attachments/20081112/c380e3ad/attachment.pgp 


More information about the bazaar mailing list