[Bulk] Re: Python 3
John Arbash Meinel
john at arbash-meinel.com
Thu Jun 24 00:51:45 BST 2010
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
...
> http://blog.labix.org/2009/07/02/screwing-up-python-compatibility-unicode-str-bytes
>
> Our general approach is also to normally convert to/from byte encoding
> on the boundary but the 2.x environment means that either sometimes we
> couldn't follow that approach consistently, or there may be latent
> inadvertent cases where we don't follow it. This is one reason why
> getting the tests to pass under 2to3, even if we don't want to
> officially support that, may find some interesting bugs.
>
I'll also note that we used to have file-ids and revision-ids as Unicode
strings in memory, but intentionally switched back to 8-bit strings.
Mostly for performance/memory consumption. (Default Unicode on most
Unixes is UCS-4, which means we bloated every revision-id and file-id by
approximately 4x. And when you have 100,000+ of them, that gets pretty big)
John
=:->
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.9 (Cygwin)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/
iEYEARECAAYFAkwinhAACgkQJdeBCYSNAANPfwCgyUYY8aAhFDykGeGwA0oNZzpa
7hcAnRCMmY5w1627OVh6EhlBO10/vMnZ
=ojl5
-----END PGP SIGNATURE-----
More information about the bazaar
mailing list