[Bug 1803327] Re: iconv no longer transforms UTF8 to CP1252 as it used to under Ubuntu 16.04 LTS.
Bernard Moreton
bernard.moreton at gmail.com
Thu Nov 15 10:14:35 UTC 2018
Yes, I've done dumps to od -xc, and I agree - iconv is behaving
correctly.
Under Ubuntu 16.04 LTS, passing multibyte characters from a database into RTF, then on into LibreOffice caused the unknown character symbol (�) in LibreOffice.
Using "unix2dos < in.txt | iconv -f UTF-8 -t CP1252 > out.txt" fixed that, then - seemingly by getting pure latin-1 into the RTF file, as the RTF spec. requires.
Now, under Ubuntu 18.04 LTS, it appears to cause the problem If I use
this this transform, I get the unknown character symbol in the LO
document, and if I leave it out, and put multibyte characters into the
RTF, LibreOffice shows them correctly.
So other things have changed, as well as ICONV, and I see other recent
references to multibyte characters in RTF. And in any case, it's not
really desirable to have non-ASCII latin-1 characters in a UTF8
environment.
Thnk you for your care. Please close this bug report.
--
You received this bug notification because you are a member of Ubuntu
Foundations Bugs, which is subscribed to glibc in Ubuntu.
https://bugs.launchpad.net/bugs/1803327
Title:
iconv no longer transforms UTF8 to CP1252 as it used to under Ubuntu
16.04 LTS.
Status in glibc package in Ubuntu:
New
Bug description:
Under Ubuntu 16.04 LTS I used code in the form
unix2dos < in.txt | iconv -f UTF-8 -t CP1252 > out.txt
(as in "man unix2dos", then and also still now under 18.04 LTS)
to transform UTF8 database extract into an RTF file, for onward processing.
This now fails.
inconv now gives output as follows:
echo abc àêü | iconv -f utf-8 -t cp1252
abc ���
echo abc àêü | iconv -f utf-8 -t cp1252//translit
abc ���
echo abc àêü | iconv -f utf-8 -t latin1//translit
abc ���
But inconv WILL transform UTF8 to basic ASCII, if called with TRANLIT:
echo abc àêü | iconv -f utf-8 -t ascii
abc iconv: illegal input sequence at position 4
echo abc àêü | iconv -f utf-8 -t ascii//translit
abc aeu
This is better than not working at all, but the transform to CP1252 used to change accented UTF8 characters to single-byte accented characters
This is the appropriate behaviour, and should be restored..
(FWIW, recode now cannot transform to CP1252 either ...)
ProblemType: Bug
DistroRelease: Ubuntu 18.04
Package: libc-bin 2.27-3ubuntu1
ProcVersionSignature: Ubuntu 4.15.0-38.41-generic 4.15.18
Uname: Linux 4.15.0-38-generic x86_64
ApportVersion: 2.20.9-0ubuntu7.4
Architecture: amd64
CurrentDesktop: X-Cinnamon
Date: Wed Nov 14 10:53:57 2018
Dependencies:
gcc-8-base 8.2.0-1ubuntu2~18.04
libc6 2.27-3ubuntu1
libgcc1 1:8.2.0-1ubuntu2~18.04
InstallationDate: Installed on 2017-03-09 (614 days ago)
InstallationMedia: Ubuntu 16.04 LTS "Xenial Xerus" - Release amd64 (20160420.1)
SourcePackage: glibc
UpgradeStatus: No upgrade log present (probably fresh install)
modified.conffile..etc.ld.so.conf:
include /etc/ld.so.conf.d/*.conf
/lib /lib/esql /usr/local/lib
mtime.conffile..etc.ld.so.conf: 2017-03-21T19:11:53.466413
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/glibc/+bug/1803327/+subscriptions
More information about the foundations-bugs
mailing list