brisbane-core+ric-commit results

John Arbash Meinel john at arbash-meinel.com
Fri Mar 27 04:10:25 GMT 2009


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1


...

> 
> So for branching Launchpad:
> 
> 44m	Old brisbane-core
> 26m	new brisbane-core
> 17m	set btree leaf cache to 4k
> 
> However, part of this is also because as we are generating the new
> content, we are querying it to ensure that the keys are not duplicated.
> Also, we are spilling the new CHK nodes to disk, and then combining them.
> 
> I tried setting "random_id = True" but I added a check, and for some
> very strange reason, we are sending the same key multiple times. It
> seems to be one of the last steps that we do, so I'm not sure if there
> is a bug in the chk streams (near the end), or if it is a bug with
> signatures, or something else.


So it was a bug in "iter_interesting_nodes()" that was accidentally
sending the same chk page twice. (The bug was if there was a single leaf
node with the same content as a leaf node that was referenced by a
split. In this case the node was the empty tree with only the
TREE_ROOT.) There is a simple fix for it.

So my next test was with btree leaf cache at 1k, but setting
'random_id=True' disabling 'combine_backing_indexes'.

At that point, I was at 14m4s (with lsprof).

So I had some ideas about the btree caching issues, but they don't
really seem to fit. So I'm going to think about them some more. Setting
random_id=True and combine_backing_indices=False had a bigger effect
than btree cache size...

John
=:->
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.9 (Cygwin)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iEYEARECAAYFAknMUbEACgkQJdeBCYSNAAOGgACeM3LziQP0h8ABp5bAYzTWMKhf
wnsAn3fHKvsIev3+2DdbLj3egS46Dv9E
=IkVs
-----END PGP SIGNATURE-----



More information about the bazaar mailing list