- From: Tex Texin <tex@i18nguy.com>
- Date: Thu, 06 Nov 2003 00:51:57 -0500
- To: ishida@w3.org
- Cc: public-i18n-geo@w3.org
Thanks for the pointers. My reading of these 2 sections is that the term signature refers to its purpose, not as a name for the character. The one exception is the table and related text in section 15.9. There they are using the term signature (in my estimation) to refer to the pattern of bytes, since in this usage they do not represent a character. ie A detector would be looking for the byte patterns and not thinking in terms of characters or unicode. But the section reminds of another concern, which is the BOM might actually be a ZWNBSP. Although this won't be the case for markup, to the extent more complex systems might be concatenating data from different sources we perhaps should make sure someone doesn't misinterpret the instructions as being a signal to remove BOM's from the front of any type of file. (Unlikely I know, but necessary.) tex Richard Ishida wrote: > > Tex, > > You were right that the term signature is not used in the CharMod. I > was wrong about that. But then 'byte order mark' and 'BOM' don't appear > there either, as far as I can tell. > > But look at section 2.11 of the Unicode Standard v4 (pp47-48) and you > will see a distinction drawn between the terms Byte Order Mark and > Unicode Signature. In section 15.9, it refers to 'Unicode Encoding Form > Signatures'. > > RI > > ============ > Richard Ishida > W3C > > contact info: http://www.w3.org/People/Ishida/ > > http://www.w3.org/International/ > http://www.w3.org/International/geo/ > > W3C Internationalization FAQs > http://www.w3.org/International/questions.html > RSS feed: http://www.w3.org/International/questions.rss -- ------------------------------------------------------------- Tex Texin cell: +1 781 789 1898 mailto:Tex@XenCraft.com Xen Master http://www.i18nGuy.com XenCraft http://www.XenCraft.com Making e-Business Work Around the World -------------------------------------------------------------
Received on Thursday, 6 November 2003 00:52:55 UTC