[PLUGIN] bzr changeset, time to pull again

Robey Pointer robey at lag.net
Tue Jul 5 00:26:15 BST 2005


On 2 Jul 2005, at 1:19, John A Meinel wrote:

> The current format uses tabs quite liberally for tokenizing. This is
> because 'path\twith\ttab'.encode('unicode_escape') will escape tabs,
> which means you can str.split('\t') on them, and not be ambiguous.
> But some people don't like them, because they don't look any different
> from spaces.
> If someone wants to offer something else, they are welcome to. But to
> the best of my knowledge, the only character that is not allowed in a
> unix *path* is a null (\0). And the only extra one not allowed in a
> filename is a '/'.
> So you're going to have to escape something. And it gets really  
> ugly if
> you have to escape a space character. (Though again, python handles
> u'\u0020' just fine :) But I don't think we'd like to see:
>
> path\u0020to\u0020my\u0020file.txt
> file-id:path\u0020to\u0020my\u0020file.txt-1092309182390-109820381234

If you normalize the path, then "//" would be an unambiguous  
terminator (and you could follow with spaces for readability):

moon landing.txt//  file-id:moon landing.txt-12389127-123871223784

I think this looks ugly, but decided to bring it up for  
completeness.  I agree that the tabs are kinda gross but have no  
better solution.

robey





More information about the bazaar mailing list