[Bulk] Re: Python 3

John Arbash Meinel john at arbash-meinel.com
Thu Jun 24 00:51:45 BST 2010


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1


...
> http://blog.labix.org/2009/07/02/screwing-up-python-compatibility-unicode-str-bytes
> 
> Our general approach is also to normally convert to/from byte encoding
> on the boundary but the 2.x environment means that either sometimes we
> couldn't follow that approach consistently, or there may be latent
> inadvertent cases where we don't follow it.  This is one reason why
> getting the tests to pass under 2to3, even if we don't want to
> officially support that, may find some interesting bugs.
> 

I'll also note that we used to have file-ids and revision-ids as Unicode
strings in memory, but intentionally switched back to 8-bit strings.
Mostly for performance/memory consumption. (Default Unicode on most
Unixes is UCS-4, which means we bloated every revision-id and file-id by
approximately 4x. And when you have 100,000+ of them, that gets pretty big)


John
=:->

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.9 (Cygwin)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/

iEYEARECAAYFAkwinhAACgkQJdeBCYSNAANPfwCgyUYY8aAhFDykGeGwA0oNZzpa
7hcAnRCMmY5w1627OVh6EhlBO10/vMnZ
=ojl5
-----END PGP SIGNATURE-----



More information about the bazaar mailing list