[Bug 6915] Encoded unicode chars errors as "broken URI fragments"

http://www.w3.org/Bugs/Public/show_bug.cgi?id=6915





--- Comment #2 from David <boujin@yahoo.com>  2009-05-19 08:44:37 ---
I am using xhtml 1.0 sent as text/html and 1.1 sent as application/xhtml+xml. I
am using content negotiation depending on which the user agent accepts. I use
"id" inside an "a".

Example 1 with "id":
<a href="/espa%C3%B1a/espa%C3%B1a" id="espa&#241;a">Espa&#241;a</a>

Example 2 with "#":
<a href="/espa%C3%B1a/espa%C3%B1a#espa%C3%B1a">Espa&#241;a</a>

(Note different encoding for char in text and uri.)

Both validate using the W3C Markup Validator. Unfortunately, Example 2 fails
the W3C Link Checker stating error mentioned in my previous post.



I have read your links where it is stated, "Note that the collection of legal
values in XML 1.0 Section 2.3, production 5 is much larger than that permitted
to be used in the ID and NAME types defined in HTML 4."

html's "id" (and "name") allow: letters (a-z), digits, hyphens, underscores,
colons, and periods.

Unfortunately, I have failed to find a list of valid encodings for the "id" in
xhtml. Nevertheless, since xhtml's list is "much larger", then I guess the
previously mentioned anchor is correct?


-- 
Configure bugmail: http://www.w3.org/Bugs/Public/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the QA contact for the bug.

Received on Tuesday, 19 May 2009 08:44:48 UTC