Re: Unicode-based FreeBSD



On Sun, Aug 24, 2008 at 9:15 PM, Frank
<mitchell@xxxxxxxxxxxxxxxxxxxxxxxxx> wrote:
I've heard that Japanese and Chinese users prefer their own coding systems,
because the Unicode Character Set in these languages is limited. Korean also
has Combining Characters, and UTF-8 comes in 3 different Levels depending on
its ability to cope with this. Maybe you need some contacts in other
countries.

This is not true -- at least for Chinese.
I'm a Chinese living in Taiwan and I am probably sure that Unicode is larger
than any other Chinese character sets (including traditional and simplified
Chinese). The UTF-8 support in FreeBSD/Xorg is good enough for me.
I can read/type all Unicode 4.0 characters (including CJKV extension A/B)
in Firefox or any gtk/qt programs if I have the needed font; I can produce
documents with any Unicode characters by LaTeX+CJK package.
It's much better than MS IE and Word because IE and Word only support
Unicode 2.0 (or maybe 3.0, I'm not so sure).

There are two reasons to use any character sets other than UTF-8:
1. compatibility for old programs/services or other OS.
2. the old man wrote the document when Unicode was not so popular and
newbies read the old document.

UTF-8 is more and more popular in Chinese, at least in Taiwan.
Almost everything works well in my daily jobs (of course under the X).
The major missing part is the kiconv UTF-8 support -- currently the kiconv
doesn't support more than two bytes character conversion so there
is no UTF-8 support for Chinese (most Chinese characters are 3-byte or
more). I should mount msdosfs/cd9660 in zh_TW.Big5 and convert the
filename to UTF-8 by lint or screen.

IMHO, If I need Chinese support, I'll go into X. I have no reason to use
Chinese under console even if I can read/type in Chinese. I prefer Firefox
rather than w3m or links. :-)

Regards,
Tz-Huan
_______________________________________________
freebsd-current@xxxxxxxxxxx mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to "freebsd-current-unsubscribe@xxxxxxxxxxx"



Relevant Pages

  • Re: Unicode Support
    ... > Not knowing much about UTF-8 (my Unicode knowledge extends as far as ... > literal strings of this form as long as the character code for quote ... > can never appear in a MBCS (multibyte character sequence). ... then XP Notepad directly understands UNICODE and you can ...
    (alt.lang.asm)
  • Re: VB - Ascii to Unicode and then Unicode to UTF-8 conversion (Very desperate!!)
    ... Latin together) then you have to use a Unicode column type. ... AscW returns the real Unicode character ... for Chinese characters, ... then the next thing to worry about is your CSV file. ...
    (microsoft.public.vb.general.discussion)
  • Re: Unicode-based FreeBSD
    ... than any other Chinese character sets (including traditional and simplified ... The UTF-8 support in FreeBSD/Xorg is good enough for me. ... There are two reasons to use any character sets other than UTF-8: ...
    (freebsd-current)
  • Re: Attention: European C/C++/C#/Java Programmers-Call for Input
    ... Simply make a straight decision now - you will use UTF-8. ... character format) much like UTF-8 which itself ... I would have little more than UNICODE left. ... generator is assembly language. ...
    (comp.arch.embedded)
  • Re: Posting with XHR and ISO-8859-15
    ... UTF-8 code units can be byte values ... Latin-9, and Unicode are the same, so there wouldn't be any troubles ... URIs, I can't use encodeURIComponent. ... ISO-8859-xx in the sense that not every character that can be encoded ...
    (comp.lang.javascript)