[Drizzle-discuss] UTF-8/File Systems

Russel Winder russel.winder at concertant.com
Fri Feb 19 08:53:11 GMT 2010


On Fri, 2010-02-19 at 11:35 +1100, Martin Pool wrote:
> On 19 February 2010 11:22, Tim Penhey <tim at penhey.net> wrote:
> > Does Bazaar have a known issue with UTF-8 filenames?
> 
> If you search for bugs tagged 'unicode' you may find some.
> 
> The main problem is on mac os x where the os 'helpfully' rewrites
> filenames into a normal form.  So mkdir(a) does not guarantee that a
> exists, only that something with a name representing the same
> characters as a exists.
> 
> The other thing people tend to hit is that bzr insists filenames be
> valid in the OS filesystem encoding: if it's declared to be utf-8 then
> the names must actually be utf-8.
> 
> If they want to file or vote for a bug please do.

My experience is that there is only a problem for Mac OS X when the
filesystem is created as case insensitive -- for case sensitive
filestores, there is no problem.  Sadly the Mac OS X default is case
insensitive so the problem exists for more or less all Mac OS X
isntallations.

The folk over at the Unison project (the filestore synchronization tool)
have tackled this successfully and are able to process UTF-8 encoded
files correctly between Mac OS X and Ubuntu.  Of course their code is
written in OCaml so it can't just be used as is, but the algorithms are
known, and known to work.


-- 
Russel.
=============================================================================
Dr Russel Winder      Partner
                                            xmpp: russel at russel.org.uk
Concertant LLP        t: +44 20 7585 2200, +44 20 7193 9203
41 Buckmaster Road,   f: +44 8700 516 084   voip: sip:russel.winder at ekiga.net
London SW11 1EN, UK   m: +44 7770 465 077   skype: russel_winder
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 197 bytes
Desc: This is a digitally signed message part
Url : https://lists.ubuntu.com/archives/bazaar/attachments/20100219/24284476/attachment.pgp 


More information about the bazaar mailing list