<meta http-equiv="content-type" content="text/html; charset=utf-8"><div>It is a practical issue. Program source code is not intended to be handling text from multiple locales. (In fact I think the C standard says source files must be entirely ASCII). However, translation files do need to support it. In this case when inserting part numbers in a translation LRO and RLO are necessary to make the .po file intelligible and usable to the one editing it.</div>
<div><br></div><div>While there are multiple ways to achieve the very same appearance on the screen, most programs not written with this in mind will consider text with different embedded overrides in different places as completely different text... thus resulting in malfunction on things like a database lookup or even a simple string comparison.</div>
<div><br></div><div>While it would be relatively easy to modify msgfmt to treat directionality codes the same way as it treats spaces, perhaps it isn't a bug and I just need someone to explain how to deal with this situation effectively. </div>
<br><div class="gmail_quote">On 24 December 2010 09:10, <span dir="ltr"><<a href="mailto:ubuntu-rtl-request@lists.ubuntu.com">ubuntu-rtl-request@lists.ubuntu.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">
Send Ubuntu-RTL mailing list submissions to<br>
<a href="mailto:ubuntu-rtl@lists.ubuntu.com">ubuntu-rtl@lists.ubuntu.com</a><br>
<br>
To subscribe or unsubscribe via the World Wide Web, visit<br>
<a href="https://lists.ubuntu.com/mailman/listinfo/ubuntu-rtl" target="_blank">https://lists.ubuntu.com/mailman/listinfo/ubuntu-rtl</a><br>
or, via email, send a message with subject or body 'help' to<br>
<a href="mailto:ubuntu-rtl-request@lists.ubuntu.com">ubuntu-rtl-request@lists.ubuntu.com</a><br>
<br>
You can reach the person managing the list at<br>
<a href="mailto:ubuntu-rtl-owner@lists.ubuntu.com">ubuntu-rtl-owner@lists.ubuntu.com</a><br>
<br>
When replying, please edit your Subject line so it is more specific<br>
than "Re: Contents of Ubuntu-RTL digest..."<br>
<br>
<br>
Today's Topics:<br>
<br>
1. gettext handling of LRO and RLO (Chris Scaife)<br>
2. Re: gettext handling of LRO and RLO (Yaron Shahrabani)<br>
<br>
<br>
----------------------------------------------------------------------<br>
<br>
Message: 1<br>
Date: Fri, 24 Dec 2010 08:07:20 +1300<br>
From: Chris Scaife <<a href="mailto:scaife.chris@gmail.com">scaife.chris@gmail.com</a>><br>
Subject: gettext handling of LRO and RLO<br>
To: <a href="mailto:ubuntu-rtl@lists.ubuntu.com">ubuntu-rtl@lists.ubuntu.com</a><br>
Message-ID:<br>
<AANLkTi=uY50Pf9ULG9zosaHtX1dn=<a href="mailto:eZ6n6Fm2WP8ri3h@mail.gmail.com">eZ6n6Fm2WP8ri3h@mail.gmail.com</a>><br>
Content-Type: text/plain; charset="iso-8859-1"<br>
<br>
I do understand that this list is intended for bug reporting.<br>
So I've come across a few issues with the gettext tools and I assume many<br>
here do use these for internationalization and localization.<br>
<br>
Attached is image of part of my ar_SA.po file in which I had to place<br>
standard Unicode LRO (or optionally RLO) overrides... just before the " of<br>
the translation. That way I can insert what is technically considered an<br>
Arabic "part number" containing a mixture of Arabic digits and letters. I<br>
don't want the LRO code to be part of my part number* inside* the quotes, or<br>
it won't be recognized and processed correctly by the software.<br>
<br>
All is well up to the point where I try to run "msgfmt" it gives me:<br>
<br>
> base.ar_SA.utf-8.po:49:8: syntax error<br>
<br>
base.ar_SA.utf-8.po:52:8: syntax error<br>
<br>
msgfmt: found 2 fatal errors<br>
<br>
<br>
Is there any specific bug list this would be more appropriate for?<br>
-------------- next part --------------<br>
An HTML attachment was scrubbed...<br>
URL: <a href="https://lists.ubuntu.com/archives/ubuntu-rtl/attachments/20101224/91307c4a/attachment-0001.htm" target="_blank">https://lists.ubuntu.com/archives/ubuntu-rtl/attachments/20101224/91307c4a/attachment-0001.htm</a><br>
-------------- next part --------------<br>
A non-text attachment was scrubbed...<br>
Name: lro_in_dot_po.png<br>
Type: image/png<br>
Size: 16827 bytes<br>
Desc: not available<br>
Url : <a href="https://lists.ubuntu.com/archives/ubuntu-rtl/attachments/20101224/91307c4a/attachment-0001.png" target="_blank">https://lists.ubuntu.com/archives/ubuntu-rtl/attachments/20101224/91307c4a/attachment-0001.png</a><br>
<br>
------------------------------<br>
<br>
Message: 2<br>
Date: Thu, 23 Dec 2010 22:10:36 +0200<br>
From: Yaron Shahrabani <<a href="mailto:sh.yaron@gmail.com">sh.yaron@gmail.com</a>><br>
Subject: Re: gettext handling of LRO and RLO<br>
To: The Ubuntu RTL Team <<a href="mailto:ubuntu-rtl@lists.ubuntu.com">ubuntu-rtl@lists.ubuntu.com</a>><br>
Message-ID:<br>
<<a href="mailto:AANLkTikdFJd7Rh%2BAmdXiZ6kPMiBHdw5a%2BqpxmhQc3U-b@mail.gmail.com">AANLkTikdFJd7Rh+AmdXiZ6kPMiBHdw5a+qpxmhQc3U-b@mail.gmail.com</a>><br>
Content-Type: text/plain; charset="utf-8"<br>
<br>
Let me get this straight: You want to add LRO outside the quotes... why<br>
would you do that for?<br>
Think about it, lets say you wanted to do the same thing with an app written<br>
in C, you would type something like: printf(<LRO>"");, this command won't<br>
allow compilation since you typed LRO as part of the syntax, same for<br>
gettext, you can't add any symbols outside the quotes because of the exact<br>
same reason...<br>
<br>
Please, I may have misunderstood so please feel free to correct me if it is<br>
so...<br>
<br>
Kind regards,<br>
Yaron Shahrabani<br>
<br>
<Hebrew translator><br>
<br>
<br>
<br>
<br>
On Thu, Dec 23, 2010 at 9:07 PM, Chris Scaife <<a href="mailto:scaife.chris@gmail.com">scaife.chris@gmail.com</a>>wrote:<br>
<br>
I do understand that this list is intended for bug reporting.<br>
So I've come across a few issues with the gettext tools and I assume many<br>
here do use these for internationalization and localization.<br>
<br>
Attached is image of part of my ar_SA.po file in which I had to place<br>
standard Unicode LRO (or optionally RLO) overrides... just before the " of<br>
the translation. That way I can insert what is technically considered an<br>
Arabic "part number" containing a mixture of Arabic digits and letters. I<br>
don't want the LRO code to be part of my part number* inside* the quotes, or<br>
it won't be recognized and processed correctly by the software.<br>
<br>
All is well up to the point where I try to run "msgfmt" it gives me:<br>
<br>
base.ar_SA.utf-8.po:49:8: syntax error<br>
<br>
base.ar_SA.utf-8.po:52:8: syntax error<br>
<br>
msgfmt: found 2 fatal errors<br>
<br>
<br>
Is there any specific bug list this would be more appropriate for?<br>
<br>
<br>
<br>
--<br>
Ubuntu-RTL mailing list<br>
<a href="mailto:Ubuntu-RTL@lists.ubuntu.com">Ubuntu-RTL@lists.ubuntu.com</a><br>
Modify settings or unsubscribe at:<br>
<a href="https://lists.ubuntu.com/mailman/listinfo/ubuntu-rtl" target="_blank">https://lists.ubuntu.com/mailman/listinfo/ubuntu-rtl</a><br>
-------------- next part --------------<br>
An HTML attachment was scrubbed...<br>
URL: <a href="https://lists.ubuntu.com/archives/ubuntu-rtl/attachments/20101223/98f76f4a/attachment.htm" target="_blank">https://lists.ubuntu.com/archives/ubuntu-rtl/attachments/20101223/98f76f4a/attachment.htm</a><br>
<br>
------------------------------<br>
<font color="#888888"><br>
--<br>
Ubuntu-RTL mailing list<br>
<a href="mailto:Ubuntu-RTL@lists.ubuntu.com">Ubuntu-RTL@lists.ubuntu.com</a><br>
Modify settings or unsubscribe at: <a href="https://lists.ubuntu.com/mailman/listinfo/ubuntu-rtl" target="_blank">https://lists.ubuntu.com/mailman/listinfo/ubuntu-rtl</a><br>
<br>
<br>
End of Ubuntu-RTL Digest, Vol 6, Issue 10<br>
*****************************************<br>
</font></blockquote></div><br>