- From: Jeremy Carroll <jjc@hplb.hpl.hp.com>
- Date: Thu, 12 Dec 2002 12:38:24 +0000
- To: www-archive <www-archive@w3.org>
Received on Thursday, 12 December 2002 07:38:36 UTC
Note following text from RFC 2279
- Character values from 0000 0000 to 0000 007F (US-ASCII repertoire)
correspond to octets 00 to 7F (7 bit US-ASCII values). A direct
consequence is that a plain ASCII string is also a valid UTF-8
string.
- US-ASCII values do not appear otherwise in a UTF-8 encoded
character stream. This provides compatibility with file systems
or other software (e.g. the printf() function in C libraries) that
parse based on US-ASCII values but are transparent to other
values.
Received on Thursday, 12 December 2002 07:38:36 UTC