Re: HTML entities.

(To David, who sent me a private e-mail: I sent this message to the Amaya list because I thought I was relevant for the discussion.)

Characters of code position 161 through 255 in Charset ISO-8859-1 do not transfer well over the Internet. Therefore the need of converting - in the HTML source code - these characters to entity names or numbers.

The above would also concern characters of code positions above 255.

I cut off at 8364 because that's where the list at HTML Spec. ended. But, that was wrong. I forgot the list for Symbols, which ends with 9830.

As for using Charset ISO-8859-1 on documents that includes characters of code positions above 255 - I cannot say, I am probably wrong using it as ISO-8859-1 seems to end at 255. So, Charset US-ASCII should then be used.

Regards,
Thomas Jedenfelt

P.S.
UCS = Universal Character Set (defined in ISO10646).


----- Original Message -----
From: "Dave Woolley" 
Date: Fri, 25 Nov 2005
> >
> > (As previously mentioned, when using Charset ISO-8859-1 it is my 
> > Wish that Amaya will generate Decimal character references for 
> > characters of code position 161 through 8364, for the purpose of 
> > accurately transferring HTML-files over the Internet.)
> 
> If you specify ISO-8859-1 you are making a claim that you can
> accurately transfer all the valid HTML characters up to and
> including 255.  If you cannot rely on that you should be
> specifying US-ASCII instead.
> 
> I don't understand your 8394 cut off.  The cut-off for HTML is
> the UCS cut-off, which is well over 4,000,000.
> 
> --
> David Woolley


-- 
_______________________________________________
Surf the Web in a faster, safer and easier way:
Download Opera 8 at http://www.opera.com

Powered by Outblaze

Received on Friday, 25 November 2005 13:29:50 UTC