Gibberish in Ubuntu when dealing with documents / websites

Ansgar Burchardt ansgar at 2008.43-1.org
Wed Nov 26 11:12:26 UTC 2008


Hi,

"Dotan Cohen" <dotancohen at gmail.com> writes:
> As my native language is not ASCII-compatible, myself and other Ubuntu
> users in my area often have problems with Gibberish (wrongly-encoded
> text) when exchanging documents with users of non-UTF-8 operating
> systems *ahem* MS Windows *ahem*. I would like to know if other people
> / locales suffer from this problem. If so, please send to me the
> letters of your alphabet so that I can try to compile a list of
> problematic letters (see my .sig for examples of some alphabets). I am
> trying to write a script that will help decode this Gibberish back
> into the correct encoding, but I need to know which letters are
> problematic for other Ubuntu users. Thanks.

You can use `iconv' to convert text between different encodings.  It can
only handle plain text, though, because it doesn't know anything about
binary file formats.

If you know the input and output encodings, just run

  iconv -f [input-encoding] -t [output-encoding] -o [output-file] [input-file]
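
As a concrete illustration, here is a small sketch that converts a Hebrew
text file from WINDOWS-1255 to UTF-8.  The file names are made up for the
example, and WINDOWS-1255 is only an assumption about what the sending
system used; substitute whatever encoding your documents really arrive in.
The -o option is the one from the command above, as provided by the iconv
that ships with glibc.

```shell
# Create a sample file holding the Hebrew word "shalom" encoded as
# WINDOWS-1255 (the bytes are written as octal escapes, so this works
# in any POSIX shell regardless of the terminal's own encoding).
printf '\371\354\345\355\n' > sample-cp1255.txt

# Convert it to UTF-8.  Both file names are placeholders.
iconv -f WINDOWS-1255 -t UTF-8 -o sample-utf8.txt sample-cp1255.txt

# Display the result; on a UTF-8 terminal this shows the Hebrew word.
cat sample-utf8.txt
```

If the wrong input encoding is given, iconv will either stop with an error
at the first byte it cannot map or silently produce different Gibberish, so
it is worth checking the output by eye.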

To get a list of encodings supported by iconv, run

  iconv -l
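
The list is long, so it can help to filter it.  A minimal sketch, assuming
you are looking for the Windows Hebrew code page (the search string "1255"
is just an example):

```shell
# Show only the supported encoding names that contain "1255".
iconv -l | grep -i 1255
```

Many encodings are listed under several aliases; any of the names shown
should be accepted by -f and -t.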

Regards,
Ansgar

-- 
PGP: 1024D/595FAD19  739E 2D09 0969 BEA9 9797  B055 DDB0 2FF7 595F AD19




