Martin Pool martinpool at gmail.com
Fri Sep 2 02:41:57 BST 2005

On 9/2/05, Harald Meland <harald.meland at usit.uio.no> wrote:
> >> The root of the problem is that the XML 1.0 specification doesn't seem
> >> to allow encoding of such "control characters" as e.g. "\x01", if I
> >> understand the well-formedness constraint here correctly:
> >>   http://www.w3.org/TR/REC-xml/#NT-Char

Yes, so it would seem.
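
For concreteness, the Char production linked above permits only tab,
newline, carriage return, and the ranges U+0020-U+D7FF, U+E000-U+FFFD
and U+10000-U+10FFFF.  Here is a minimal Python 3 sketch of checking for
and dropping anything outside that set.  The function names are
hypothetical, made up for illustration; this is not code from bzr or
tailor.

```python
import re

# Matches any character NOT permitted by the XML 1.0 Char production:
#   #x9 | #xA | #xD | [#x20-#xD7FF] | [#xE000-#xFFFD] | [#x10000-#x10FFFF]
_INVALID_XML_CHARS = re.compile(
    r"[^\x09\x0A\x0D\x20-\uD7FF\uE000-\uFFFD\U00010000-\U0010FFFF]"
)


def is_valid_xml_text(text):
    """Return True if every character in text is an XML 1.0 Char."""
    return _INVALID_XML_CHARS.search(text) is None


def strip_invalid_xml_chars(text):
    """Return text with all non-XML-1.0 characters removed (lossy)."""
    return _INVALID_XML_CHARS.sub("", text)
```

So a string containing "\x01" fails the check, and stripping it simply
drops that character while leaving tabs and newlines alone.  Note that
this is lossy: the original bytes cannot be recovered afterwards.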

I'm not sure that supporting control characters is really a good idea;
it seems pretty problematic when the rest of the application wants to
treat the data as "normal" Unicode text.  I can see the attraction of
being able to do lossless imports of existing data.

I'm inclined to say that tailor (or maybe bzr?) should just strip out
those characters.  Would that be a problem for you?  How did it get in


