Re: DEC-C toupper() of accented characters ?

From: JF Mezei (jfmezei.spamnot_at_istop.com)
Date: 05/15/03


Date: Thu, 15 May 2003 16:18:41 -0400


"Craig A. Berry" wrote:
> As far as breaking legacy code, only code that changes its locale from
> the default could possibly be broken, and only if the new locale is one
> that changes the code points occupied by 7-bit ASCII.

Since the US ASCII is a subset of ISO-LATIN-1, supporting ISO-LATIN 1 would
not break any existing programs.

In a global economy, even british, australians and americans (the bad guys in
the Irak invasion) should learn to support customers with strange names.
Consider that Digital has supported what is essentially the ISO lATIN 1 since
the days of the VT220s, and one wonders why there would still be remnants of
the 7 bit ascii of the 1970s left in Digital code.

Oh, another one which supports proper "uppercase" function is FMS. Yes, that
old, retired product, much older than DEC-C has the proper behaviour to
uppercase any character. (TPU also supports it)
 
Ok, so asking TPU to find an "é" when asking for an "e" is asking too much,
but it can find an "É" when i ask it to look for an "é"



Relevant Pages

  • Re: [9fans] simplicity
    ... locale was UTF-8, rules would match random lines of input, the fix? ... think of dealing with files which actually contain non-ASCII UTF-8). ... If the problem is that Gnu grep converts any non-8-bit character set ... ASCII file if I set my locale to the UTF-8 locale. ...
    (comp.os.plan9)
  • Re: French characters not recognised in C?
    ... if compiled in ISO C mode. ... > on what the default locale is. ... default character set _must_ be either an extension to ... ASCII, or something else entirely (most usually EBCDIC, which itself is ...
    (comp.lang.c)
  • Re: Using non-ascii symbols
    ... while I don't give a damn if those symbols are going to be supported by Python, I don't think the plain ASCII version should be deprecated. ... Think about moving from ASCII to LATIN-1 or UTF-8 as similar to moving ... I think it is a legitimate question, after UTF-8 becomes more and more supported. ... Editors could provide means to easily enter these symbols once programming languages start supporting them: Automatic expansion of ascii combinations, Alt-Combinations or popup menus with all supported symbols. ...
    (comp.lang.python)
  • Re: Anyone using non-UTF-8 locale(s)?
    ... > there's no such thing as a global locale on a UNIX system, ... the "invalid" character because it is not standard ascii. ... How is the access control part supposed to know which locale ...
    (Fedora)
  • Re: French characters not recognised in C?
    ... >>locale which only supports ASCII. ... By default most platforms will use one 8-bit extension to ASCII ... > You're really naive if you believe that this will change the character ... Actually I meant LANG=fr_CA but this is clearly a shell feature so let's ...
    (comp.lang.c)