Bug: Commit message containing control characters
Jan Hudec
bulb at ucw.cz
Fri Sep 2 22:12:07 BST 2005
On Fri, Sep 02, 2005 at 11:41:57 +1000, Martin Pool wrote:
> On 9/2/05, Harald Meland <harald.meland at usit.uio.no> wrote:
> > [Robert Collins]
> >
> > >> The root of the problem is that the XML 1.0 specification doesn't seem
> > >> to allow encoding of such "control characters" as e.g. "\x01", if I
> > >> understand the well-formedness constraint here correctly:
> > >>
> > >> http://www.w3.org/TR/REC-xml/#NT-Char
>
> Yes, so it would seem.
>
> I'm not sure that supporting control characters is really a good idea;
> it seems pretty problematic when the rest of the application wants to
> treat it as "normal" Unicode text. I can see the attraction of being
> able to do lossless imports of existing data.
>
> I'm inclined to say that tailor (or maybe bzr?) should just strip out
> those characters.
I am more inclined to think that ElementTree should entitify them and
de-entitify them when giving them back, as that is the correct approach to
storing general text in XML.
--
Jan 'Bulb' Hudec <bulb at ucw.cz>