[merge] cache encoding

Robert Collins robertc at robertcollins.net
Fri Aug 18 08:06:23 BST 2006


On Fri, 2006-08-18 at 16:54 +1000, Martin Pool wrote:
> 
> I'd really rather not allow arbitrary non-utf-8 binary, just because
> it
> will cause trouble if we ever do need to decode them.  And the general
> policy is that strings are Unicode, so defining some strings to be
> 8-bit binary is just asking for trouble.
> 
> Of course having them stay in utf-8 and be treated by the program as
> byte string as an optimization is totally fine.  But the interface
> requirement is that they're utf-8. 

We can change the interface: but we need to define that the existing
utf8 strings will be considered bytestrings - at a minimum its going to
mean a code audit.

-Rob
-- 
GPG key available at: <http://www.robertcollins.net/keys.txt>.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 191 bytes
Desc: This is a digitally signed message part
Url : https://lists.ubuntu.com/archives/bazaar/attachments/20060818/de929d12/attachment.pgp 


More information about the bazaar mailing list