W3C home > Mailing lists > Public > w3c-wai-ig@w3.org > July to September 2002

RE: non-sgml characters

From: Al Gilman <asgilman@iamdigex.net>
Date: Mon, 15 Jul 2002 10:07:31 -0400
Message-Id: <5.1.0.14.2.20020715095620.0212cb40@pop.iamdigex.net>
To: "Steven McCaffrey" <smccaffr@MAIL.NYSED.GOV>, <jon@spin.ie>, <w3c-wai-ig@w3.org>

At 08:26 AM 2002-07-15, Steven McCaffrey wrote:
>Apparently there is also unicode and SGML.  Is there a overview of the different kinds of icon representation methods that does not get too detailed about the inner workings of each character set. 

Steve,

I don't know if this is going to be readable enough or not.

What Google came up with for me as the lead hit was

 http://www.w3.org/International/O-charset.html

Read down through the Dan Connolly note about "charsets considered harmful" 
and Glenn's response to that.

Another starting point is the Character Model document

 http://www.w3.org/TR/charmod/

Please tell us if either of these works for you.

Al

PS:  For what it's worth the search query I used was

 http://www.google.com/search?q=character+set+vs.+character+encoding


>Hello all:
>
>     I too have been looking for a standard set of icons.  Additionally, though, since I am blind I am really looking for a table that would be like:
>description:code - that is, two text columns.
>(e.g. trademark symbol:&#153)
>I am unfamiliar with the methods to do this so probably confuse things.
>There are "entities" in HTML such as &sup for superscript that JFW 4.01 reads very well (JFW says "superscript").
>Apparently there is also unicode and SGML.  Is there a overview of the different kinds of icon representation methods that does not get too detailed about the inner workings of each character set. 
>  
>Thanks,
>
>
>Steve
>
>
>
>
>
>>>> "Jon Hanna" <jon@spin.ie> 07/15/02 07:54AM >>> 
>
>> I used "&#153;" on a site to represent the TM Symbol, an unregistered 
>> Trademark. Apparently this is a reference to a non-SGML character. 
>
>Yep, 153 is the position in the windows character set. Its Unicode position 
>is 0x2122 (or 8482 in decimal) so you can use &#x2122; or &#8482; 
>
>You could also try &trade; though that depends more on browser support. 
>
>AFAIR There is no character 153 (though I'm not too sure), hence the browser 
>was able to realise you'd made a mistake and guess that you wanted the 
>windows character of that position. 
>
>(It's not an SGML thing as such, SGML uses character sets defined elsewhere, 
>similarly with XML - and hence HTML since HTML has been XML for the last 3 
>years). 
Received on Monday, 15 July 2002 10:07:42 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 19 July 2011 18:14:05 GMT