[RFC] store inventory in tab-separated file

Alexander Belchenko bialix at ukr.net
Mon Jan 29 08:22:44 GMT 2007


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Dmitry Vasiliev пишет:
> Alexander Belchenko пишет:
>> Jari Aalto пишет:
>>> Alexander Belchenko <bialix at ukr.net> writes:
>>>
>>>> I wrote draft implementation of new serializer format that use
>>>> tab-separated
>>>> text instead of XML. John Meinel often says that our weakness point is
>>>> inventory. So I make some experiment to rewrote our serializer.
> 
> I like the idea and even thought about it some time ago. I guess
> record-based format is also may give some performance benefits.
> 
>>> Just to comment the separator character:
>>>
>>> * TAB is not the best choice, because it's invisible and easily get's
>>>   corrupted with space
>>>
>>> Perhaps just plain space, colon, semicolon or any other
>>> standard-visible character?
>>
>> This file is not intended to be editable by user.
>> So I say 'no'. TAB is deliberated choice.
> 
> But for CSV files parsing you can use csv builtin module which can be
> easy to use and faster.

I cannot agree with you about csv module.
I don't think it will be faster: it's the pure python module,
it uses regular expressions to parse input,
it more complicated because it's general purpose.
So I'm pretty sure that it cannot be faster of my
strictly specialized implementation.

Alexander
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.5 (MingW32)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iD8DBQFFva7UzYr338mxwCURAvatAJ9kPSPACpJJDg3fnjTjf8XNm3GxjwCeI3yw
a60lr6d9jd3ldsLx56GnJ90=
=qVf2
-----END PGP SIGNATURE-----




More information about the bazaar mailing list