Semi-mechanizing the DTTP translations

Hendrik Knackstedt hendrik.knackstedt at t-online.de
Thu Dec 20 16:39:25 UTC 2012


Am 20.12.2012 13:43, schrieb Pierre Slamich:
> I don't have a clean way to split them right now. I split them by size
> to keep below 900ko (I took 800 for safety), but I then had to adjust
> manually because the strings were split right in the middle.

Ok, I'll take a look at it and see if I can come up with something useful.
>
> If you don't mind, it would be great to take advantage of the German
> process to automate the process as much as possible.
> Would you be willing to expand the pad
> (http://lite.framapad.org/p/ddtpUbuntu) with us (yet another proof of
> French-German partnership ;-P)?

Sure. What do you mean by "the German process"? I'm a bit short on time
right now but just let me know what has to be done and I'll try to get
it done asap.

Regards,
Hendrik
>
> Pierre
>
> On Thu, Dec 20, 2012 at 1:35 PM, Hendrik Knackstedt
> <hendrik.knackstedt at t-online.de
> <mailto:hendrik.knackstedt at t-online.de>> wrote:
>
>     Hey Pierre!
>
>
>     I'd like to test your approach for the German language also. How
>     exactly did you split the files? Did you use an existing
>     program/script or can you provide a script for doing this? Thanks!
>
>     Hendrik
>
>     Am 19.12.2012 15:58, schrieb Pierre Slamich:
>>     Yes, although we might be finished by then ;-)
>>     Thanks to the method we're reviewing and correcting around 1000
>>     strings per day at the moment.
>>
>>     sincerely,
>>     Pierre
>>
>>     On Tue, Dec 18, 2012 at 4:06 PM, Hannie Dumoleyn
>>     <lafeber-dumoleyn2 at zonnet.nl
>>     <mailto:lafeber-dumoleyn2 at zonnet.nl>> wrote:
>>
>>         Hi Pierre, Redmar, and all who are interested,
>>         Would it be an idea to brainstorm on this in
>>         #ubuntu-translators? Perhaps in January 2013?
>>         I agree with Redmar that the msgmerge is a good method,
>>         especially for huge documents. The only snag is that you
>>         still have to approve the fuzzies offline before uploading
>>         the file back to Launchpad. We use this method for the Ubuntu
>>         Manual "Getting started with Ubuntu" (Lucid > Maverick >
>>         ....> Raring) and with success.
>>         Redmar, sorry for not yet having tested your popsort :(
>>         Regards,
>>         Hannie
>>
>>         Op 18-12-12 00:51, Pierre Slamich schreef:
>>>         Hi Hannie, Hi Redmar,
>>>         Thanks a lot for the tips: we're interested in using your
>>>         approach, and more generally it might be interesting
>>>         expending the msmerge approach to all teams that are already
>>>         underway for the DDTP, and the Google one to the teams that
>>>         need to get started.
>>>
>>>         - For the Google Translator Kit approach, I guess we
>>>         could extend the mock project we did for fr_FR to other
>>>         languages (and streamlining our process by using Bazaar) by
>>>         creating a global team responsible for the DDTP Mock project
>>>         and including in this team one member from each language
>>>         team responsible for uploading the machine translated po for
>>>         his or her language.
>>>
>>>         - For the msmerge approach, do you already have a project to
>>>         handle this ? Is there any advantage in msmerging raring
>>>         against releases older than quantal to get more modified
>>>         strings ? How many strings have you been able to recover
>>>         using that approach ?  It might be neat to generate the
>>>         msmerged po for all languages ? Importing them as actual
>>>         translations (not fuzzy) into a mock project like the Google
>>>         Translate one would show them as suggestions for the actual
>>>         DDTP as well.
>>>         The translator would thus be able to pick the human
>>>         translated one when available or to build on the machine
>>>         translated one otherwise.
>>>
>>>         Can we try to schedule some time to coordinate on this so
>>>         that we can use both approaches and try to onboard all the
>>>         other languages teams once we have a rock-solid process ?
>>>
>>>         Pierre
>>>
>>>         Pierre Slamich
>>>         pierre.slamich at gmail.com <mailto:pierre.slamich at gmail.com>
>>>
>>>
>>>         On Mon, Dec 17, 2012 at 10:30 PM, Redmar
>>>         <redmar at ubuntu-nl.org <mailto:redmar at ubuntu-nl.org>> wrote:
>>>
>>>             Hi Pierre,
>>>
>>>             I've actually tried a similar approach for Dutch using
>>>             msgmerge, which
>>>             might also be worth checking out. When you merge the
>>>             translations of an
>>>             older version of ubuntu into the current version (msgmerge
>>>             quantal_ddtp.po raring_ddtp.po -o merged_ddtp.po, for
>>>             example), there
>>>             will be a lot of 'fuzzy' translations for strings that
>>>             are similar (for
>>>             example, meta packages for different programs, debugging
>>>             symbols etc).
>>>             These fuzzy often only need a few small changes (eg
>>>             program name) to be
>>>             accepted, which can really speed up translations. And
>>>             you don't have to
>>>             worry about google putting in a weird translation, since
>>>             it is all based
>>>             on earlier translations done by a human.
>>>
>>>             On a related note, if any of you work on
>>>             ddtp-translations offline, I
>>>             have written a python program that can sort entries in
>>>             ddtp po-files
>>>             based on the popularity of the package. This way, the
>>>             most popular
>>>             packages will be at the top of the po file, and you are
>>>             always sure you
>>>             are working on the most important packages first.
>>>
>>>             You can get the code here:
>>>             bzr branch lp:~redmar/+junk/ddtp_popsort
>>>
>>>             It has a small readme file, please let me know if
>>>             something is unclear
>>>             or not working for you.
>>>
>>>             Regards,
>>>             Redmar
>>>             --
>>>             Ubuntu Dutch Translators
>>>
>>>
>>>             Hannie Dumoleyn schreef op ma 17-12-2012 om 17:58 [+0100]:
>>>             > Hello Pierre,
>>>             > This is a very good idea! I have just uploaded the
>>>             first part of the
>>>             > incomplete Dutch translation (900kb) to GTT.
>>>             > Thanks,
>>>             > Hannie
>>>             >
>>>             > Op 17-12-12 12:55, Pierre Slamich schreef:
>>>             >
>>>             > > The DDTP represent around 50 000 strings to
>>>             translate * 140
>>>             > > languages. On very good weeks, a typical translation
>>>             team translates
>>>             > > 500 strings (see UWN for examples weekly figures).
>>>             > >
>>>             > >
>>>             > > Would take a lot of weeks (years?) with highly
>>>             motivated volunteers
>>>             > > of a large translation team, working non-stop, at
>>>             their best to get
>>>             > > done with it.
>>>             > > Thus we had the idea to delegate initial translation
>>>             suggestions to
>>>             > > Google Translator Kit and review translations with
>>>             humans to speed
>>>             > > the process.
>>>             > >
>>>             > > We successfully did an import for circa 40 000
>>>             French strings  (yup
>>>             > > you read that right) this week-end in a mock project
>>>             called DDTP
>>>             > > Automation
>>>             (https://translations.launchpad.net/ddtpautomation).
>>>             > > To keep it short, the translations from this project
>>>             appear as
>>>             > > suggestions in the French DDTP, and can be reviewed
>>>             by actual
>>>             > > translators.
>>>             > > We've started using them, and it turns out that a
>>>             lot of them are
>>>             > > actually useful and are speeding up the translation
>>>             process a lot.
>>>             > >
>>>             > > We detailed the (somewhat) tedious process in English at
>>>             > > http://lite.framapad.org/p/ddtpUbuntu
>>>             > > Questions and inquiries welcome.
>>>             > >
>>>             > > Pierre
>>>             > >
>>>             > >
>>>             > > ---
>>>             > > pierre.slamich at gmail.com
>>>             <mailto:pierre.slamich at gmail.com>
>>>             > >
>>>             > >
>>>             >
>>>
>>>
>>>             --
>>>             ubuntu-translators mailing list
>>>             ubuntu-translators at lists.ubuntu.com
>>>             <mailto:ubuntu-translators at lists.ubuntu.com>
>>>             https://lists.ubuntu.com/mailman/listinfo/ubuntu-translators
>>>
>>>
>>
>>
>>
>>
>
>
>     --
>     ubuntu-translators mailing list
>     ubuntu-translators at lists.ubuntu.com
>     <mailto:ubuntu-translators at lists.ubuntu.com>
>     https://lists.ubuntu.com/mailman/listinfo/ubuntu-translators
>
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.ubuntu.com/archives/ubuntu-translators/attachments/20121220/692afed1/attachment.html>


More information about the ubuntu-translators mailing list