[Bug 1462737] Re: mawk counts characters incorrectly
Jarno Suni
1462737 at bugs.launchpad.net
Sun Jul 19 07:38:22 UTC 2015
I guess it is by design. I think some operations are faster if you count
bytes instead of characters. There could be an option to make mawk
count characters, though.
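
To make the byte/character difference concrete, here is a minimal sketch in Python rather than awk, so it does not depend on which awk or locale is installed. The helper names (byte_length, char_length, byte_match, char_match) are made up for illustration and are not part of mawk or gawk. It also uses a plain substring search, whereas awk's match() takes a regular expression. It assumes the input is UTF-8 and that ä is the single code point U+00E4.

```python
# Illustration only: these helpers are hypothetical and merely mimic the two
# counting strategies discussed in this bug; they are not mawk or gawk code.

def byte_length(s: str) -> int:
    """Length in UTF-8 bytes (the byte-counting behaviour reported for mawk)."""
    return len(s.encode("utf-8"))


def char_length(s: str) -> int:
    """Length in Unicode code points (what the reporter expects)."""
    return len(s)


def byte_match(s: str, needle: str) -> int:
    """1-based byte offset of needle in s, or 0 if it is absent."""
    return s.encode("utf-8").find(needle.encode("utf-8")) + 1


def char_match(s: str, needle: str) -> int:
    """1-based character offset of needle in s, or 0 if it is absent."""
    return s.find(needle) + 1


if __name__ == "__main__":
    a_umlaut = "\u00e4"  # the letter ä, encoded in UTF-8 as bytes C3 A4
    print(byte_length(a_umlaut), char_length(a_umlaut))
    print(byte_match(a_umlaut + "o", "o"), char_match(a_umlaut + "o", "o"))
```

The byte-based helpers give 2 and 3, the same numbers mawk prints in the report; the character-based ones give 1 and 2, which is what was expected.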
** Description changed:
$ echo ä | mawk '{print length($0)}'
- outputs 2. I expect 1.
+ outputs 2. I expect 1.
$ echo äo | mawk '{print match($0,"o")}'
outputs 3. I expect 2.
Probably this is due to the internal representation of UTF-8 characters;
- mawk counts bytes instead of characters.
+ mawk counts bytes instead of characters. gawk behaves the same way if
+ the -b option is used.
ProblemType: Bug
DistroRelease: Ubuntu 14.04
Package: mawk 1.3.3-17ubuntu2
ProcVersionSignature: Ubuntu 3.13.0-53.89-lowlatency 3.13.11-ckt19
Uname: Linux 3.13.0-53-lowlatency x86_64
ApportVersion: 2.14.1-0ubuntu3.11
Architecture: amd64
CurrentDesktop: XFCE
Date: Sun Jun 7 15:52:26 2015
Dependencies:
- gcc-4.9-base 4.9.1-0ubuntu1
- libc6 2.19-0ubuntu6.6
- libgcc1 1:4.9.1-0ubuntu1
- multiarch-support 2.19-0ubuntu6.6
+ gcc-4.9-base 4.9.1-0ubuntu1
+ libc6 2.19-0ubuntu6.6
+ libgcc1 1:4.9.1-0ubuntu1
+ multiarch-support 2.19-0ubuntu6.6
EcryptfsInUse: Yes
InstallationDate: Installed on 2014-09-21 (259 days ago)
InstallationMedia: Ubuntu-Studio 14.04.1 LTS "Trusty Tahr" - Release amd64 (20140722.1)
SourcePackage: mawk
UpgradeStatus: No upgrade log present (probably fresh install)
--
You received this bug notification because you are a member of Ubuntu
Server Team, which is subscribed to mawk in Ubuntu.
https://bugs.launchpad.net/bugs/1462737
Title:
mawk counts characters incorrectly
To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/mawk/+bug/1462737/+subscriptions