[உபுண்டு தமிழகம்] TUNE RFC
ramanraj.k at gmail.com
Tue Nov 28 04:35:53 GMT 2006
The Tamil Nadu Virtual University has published a proposal for a New
Tamil Unicode at http://www.tunerfc.tn.nic.in/tumain.htm and has
requested comments.
Please comment on the proposal there. I kindly request that this
conversation continue in English, with minimal use of Tamil characters,
for the benefit of developers who may refer to the archives when adding
Tamil support to their applications.
The new chart supplies many of the Tamil characters missing from the
current encoding, including fraction symbols and archaic characters. The
new layout would give the correct binary sort order, but I am not sure
whether the Tamil numerals are placed correctly to allow that.
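To illustrate the binary sort order point, here is a minimal Python
sketch comparing plain code point order in the current Tamil block with
the traditional consonant order. The TRADITIONAL list is typed in by
hand for this example and covers only the 18 basic consonants; it is an
assumption of the sketch, not something taken from the proposal. Tamil
letters are written as escapes to keep the archive readable.

```python
import unicodedata

# Traditional order of the 18 basic Tamil consonants, as code points in
# the current Unicode block (hand-written for this sketch):
# KA NGA CA NYA TTA NNA TA NA PA MA YA RA LA VA LLLA LLA RRA NNNA
TRADITIONAL = [
    "\u0B95", "\u0B99", "\u0B9A", "\u0B9E", "\u0B9F", "\u0BA3",
    "\u0BA4", "\u0BA8", "\u0BAA", "\u0BAE", "\u0BAF", "\u0BB0",
    "\u0BB2", "\u0BB5", "\u0BB4", "\u0BB3", "\u0BB1", "\u0BA9",
]

letters = ["\u0BAA", "\u0BA9", "\u0BB5", "\u0BB4"]  # PA, NNNA, VA, LLLA

# Binary order: Python compares strings by code point.
by_code_point = sorted(letters)

# Traditional order: position in the hand-written list above.
by_tradition = sorted(letters, key=TRADITIONAL.index)

for label, result in (("code point", by_code_point), ("traditional", by_tradition)):
    print(label, [unicodedata.name(c) for c in result])
```

In code point order NNNA (U+0BA9) sorts before PA (U+0BAA), although it
is the last consonant in the traditional order.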
Anyway, the sort order is set in the DUCET at
http://www.unicode.org/Public/UCA/4.1.0/allkeys.txt and, if that is
amended suitably, sorting could be done without any complicated locale
settings. (Try looking up the Tamil characters in the default order set
by the DUCET; what is wrong is plainly visible.) It may be possible to
offer a patch for Indic languages so that correct sorting of Indic
language strings is immediately possible with UTF-8. It is also possible
to create new tables so that multilingual sorting is done correctly;
say, an SQL query could collate names beginning with "O" and "ஒ"
together.
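As a rough sketch of the multilingual idea, the Python example below
registers a custom collation with SQLite so that an ORDER BY groups
Latin and Tamil names by a shared first-letter weight. The PRIMARY
table, the collation name "mixed_script", and the sample names are all
made up for illustration; this is not the DUCET and not a full Unicode
Collation Algorithm implementation, only a first-letter comparison.

```python
import sqlite3

# Hypothetical primary weights: a Latin letter and the Tamil letter
# treated as its counterpart share one weight.
PRIMARY = {
    "A": 1, "\u0B85": 1,  # LATIN A, TAMIL LETTER A
    "O": 2, "\u0B92": 2,  # LATIN O, TAMIL LETTER O
    "P": 3, "\u0BAA": 3,  # LATIN P, TAMIL LETTER PA
}

def sort_key(name):
    # Weight of the first letter, then code point order as a tie-break.
    return (PRIMARY.get(name[:1].upper(), 99), name)

def collate_mixed(a, b):
    # SQLite expects a negative, zero, or positive integer.
    ka, kb = sort_key(a), sort_key(b)
    return (ka > kb) - (ka < kb)

def sorted_names(names):
    conn = sqlite3.connect(":memory:")
    conn.create_collation("mixed_script", collate_mixed)
    conn.execute("CREATE TABLE people (name TEXT)")
    conn.executemany("INSERT INTO people VALUES (?)", [(n,) for n in names])
    rows = conn.execute(
        "SELECT name FROM people ORDER BY name COLLATE mixed_script"
    ).fetchall()
    conn.close()
    return [row[0] for row in rows]

names = [
    "Peter",
    "\u0B92\u0BB3\u0BBF",              # Tamil name starting with O
    "Omar",
    "\u0B85\u0BAE\u0BC1\u0BA4\u0BBE",  # Tamil name starting with A
    "Anand",
]
result = sorted_names(names)
print(result)
```

With these weights the Tamil name starting with U+0B92 lands next to
"Omar" instead of after every Latin name, which is what a plain code
point sort would do.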
We may also look into which changes to the existing allocation in
Unicode would help, so that the need to use any Private Use Area is
avoided.