- From: Alan Gresley <alan@css-class.com>
- Date: Mon, 17 Jan 2011 00:25:07 +1100
- To: Ms2ger <ms2ger@gmail.com>
- CC: fantasai <fantasai.lists@inkedblade.net>, Peter Linss <peter.linss@hp.com>, Public CSS test suite mailing list <public-css-testsuite@w3.org>, Ian Hickson <ian@hixie.ch>
On 16/01/2011 10:38 PM, Ms2ger wrote:
> On 01/16/2011 06:11 AM, fantasai wrote:
>> Hixie, got a question, see below?
>>
>> On 01/14/2011 03:22 PM, Peter Linss wrote:
>>>
>>> On Jan 14, 2011, at 12:58 PM, Alan Gresley wrote:
>>>
>>>> On 15/01/2011 7:24 AM, Peter Linss wrote:
>>>>> I believe there are a few problems with this test.
>>>>>
>>>>> First, the only style possibilities for the test paragraphs are white
>>>>> text on a green background versus white text on a green background. I
>>>>> presume it's trying to test for the application of the rule in the
>>>>> linked stylesheet but there would be no visible effect either way.
>>>>>
>>>>> Second, I'm trying to figure out if this test requires http or not
>>>>> (and
>>>>> exactly what for that matter this test is trying to test), I'm
>>>>> guessing
>>>>> the rule in the linked style sheet is NOT supposed to match
>>>>> anything? It
>>>>> it relying on the linked stylesheet being served as utf-8? (the
>>>>> server's
>>>>> default, as there is no explicit encoding set on that file) Why does
>>>>> the
>>>>> title state "malformed UTF-8"? Either something's missing here or I'm
>>>>> not getting it...
>>>>
>>>>
>>>> The external stylesheet CSS [1] has this
>>>>
>>>> .t�st { color: white; background: green; }
>>>>
>>>> I presume that � is malformed CSS. Each class of each <p> has a string
>>>> of class"t(Unicode)st". These are the Unicode characters.
>>>>
>>>> é ้ щ ى ι י И
>>>>
>>>>
>>>> 1.
>>>> <http://test.csswg.org/suites/css2.1/20101210/html4/support/character-encoding-038.css>
>>>>
>>>>
>>>>
>>>
>>> The external stylesheet is:
>>> .tést { color: white; background: green; }
>>>
>>> if interpreted as ISO-8859-1 encoding. Which I take to mean that the
>>> stylesheet needs to be served via http with the explicit encoding of
>>> utf-8, so that it does NOT match any of the content. (Meaning the
>>> stylesheet is malformed utf-8, which explains the title.)
>>>
>>> So I presume the stylesheet should be updated to be:
>>> .t�st { color: yellow; background: red; }
>>>
>>> and the test does in fact need the 'http' flag (which I already added).
>>
>> Hixie's server does not serve the original CSS up with a charset
>> parameter,
>> although it does set UTF-8 for the HTML file, so I don't think we should
>> be serving a UTF-8 header for this. (According to CSS2.1, the style sheet
>> must be treated as UTF-8 even without the header: see [1].)
>>
>> I do agree that the style sheet needs to be setting a red background,
>> though, because right now there appears no way for it to fail.
>>
>> What I don't understand is what to do with the rest of the <p
>> class="t*st">s,
>> since there doesn't seem to be anything that could potentially trigger a
>> failure on those, either.
>
> 1. é is 'LATIN SMALL LETTER E WITH ACUTE' (U+00E9),
> which is encoded in windows-1252, -54, -56, -57, -58 as 0xE9.
> 2. ้ is 'THAI CHARACTER MAI THO' (U+0E49),
> which is encoded in windows-874 as 0xE9.
> 3. щ is 'CYRILLIC SMALL LETTER SHCHA' (U+0449),
> which is encoded in iso-8859-5 as 0xE9.
> 4. ى is 'ARABIC LETTER ALEF MAKSURA' (U+0649),
> which is encoded in iso-8859-6 as 0xE9.
> 5. ι is 'GREEK SMALL LETTER IOTA' (U+03B9),
> which is encoded in windows-1253 as 0xE9.
> 6. י is 'HEBREW LETTER YOD' (U+05D9),
> which is encoded in windows-1255 as 0xE9.
> 7. И is 'CYRILLIC CAPITAL LETTER I' (U+0418),
> which is encoded in koi8-r as 0xE9.
>
> Most of these encodings are mentioned in the table of defaults at the
> end of the "Determining the character encoding" section in HTML [1].
>
> It seems unlikely that these would fail in en-US browsers, but much less
> so in foreign locales (even though we don't usually test those).
>
> HTH
> Ms2ger
>
> [1] <http://www.whatwg.org/html/#determining-the-character-encoding>
Well this is all very interesting. Out of coincidence, I logged into an
email account that I had forgotten about and one of the emails had
"you�re" (? within black diamond). I presume the word is "you're."
To clarify something which I noticed from Peter's reply to me if it was
not noticed.
I wrote this in reply to Peter's initial message.
>>>> The external stylesheet CSS [1] has this
>>>>
>>>> .t�st { color: white; background: green; }
_has ? within black diamond_
Peter replied and said this.
>>> The external stylesheet is:
>>> .tést { color: white; background: green; }
_has Latin small letter e with acute_
and then this.
>>> So I presume the stylesheet should be updated to be:
>>> .t�st { color: yellow; background: red; }
_has ? within black diamond_
So are Peter I seeing different characters (Peter sees 'e' with acute
and I'm seeing '?' within black diamond)?
In the external stylesheet on the original test on Hixie's server I see
this.
.tést { color: white; background: green; }
_has Latin small letter e with acute_
So if the é (letter e with acute) is (U+00E9), what is the Unicode for �
(? within black diamond)?
--
Alan http://css-class.com/
Armies Cannot Stop An Idea Whose Time Has Come. - Victor Hugo
Received on Sunday, 16 January 2011 13:25:47 UTC