Better compression
Robert Collins
robertc at robertcollins.net
Sat Jul 26 05:11:51 BST 2008
On Fri, 2008-07-25 at 22:05 -0500, John Arbash Meinel wrote:
> In testing, I get really good results. It also revealed the need for my
> other patch to stream Revisions and Signatures (because otherwise my
> screen fills up with requesting revision 1-by-1.)
We need to check that GenericRepoFetcher is never used with altered
serialisers I think, but otherwsise the other patch should be fine. My
work for reverse-topo will be ready my Monday.
> These are the times for branching bzrtools into a gc repository
>
> Time Stream Revisions LRU Cache
> 37s No No
> 24s Yes No
> 20s No Yes
> 8s Yes Yes...
> Anyway, a 10MB cache gives us a 93% hit rate for bzr.dev texts, which
> seems to make it more than worthwhile. I would consider tweaking it a
> bit, to try to make the estimated size more closely line up with
> reality. But I do take into account the python overhead of string
> objects, and the overhead of storing lots of lines in a list (at least
> 4bytes for a pointer per line, and probably 20+ bytes per string for
> PyObject overhead.)
>
> The LRU Cache is in my groupcompress branch, and the streaming revisions
> is up for review.
I would rather not have the LRU cache to be honest; its working around
asking for the wrong stream type; and will just bloat memory once that
is fixed.
-Rob
--
GPG key available at: <http://www.robertcollins.net/keys.txt>.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 189 bytes
Desc: This is a digitally signed message part
Url : https://lists.ubuntu.com/archives/bazaar/attachments/20080726/743c5f76/attachment.pgp
More information about the bazaar
mailing list