W3C home > Mailing lists > Public > www-validator-cvs@w3.org > May 2009

[Bug 6915] Encoded unicode chars errors as "broken URI fragments"

From: <bugzilla@wiggum.w3.org>
Date: Tue, 19 May 2009 08:44:38 +0000
To: www-validator-cvs@w3.org
Message-Id: <E1M6KwA-0005aR-RV@wiggum.w3.org>

--- Comment #2 from David <boujin@yahoo.com>  2009-05-19 08:44:37 ---
I am using xhtml 1.0 sent as text/html and 1.1 sent as application/xhtml+xml. I
am using content negotiation depending on which the user agent accepts. I use
"id" inside an "a".

Example 1 with "id":
<a href="/espa%C3%B1a/espa%C3%B1a" id="espa&#241;a">Espa&#241;a</a>

Example 2 with "#":
<a href="/espa%C3%B1a/espa%C3%B1a#espa%C3%B1a">Espa&#241;a</a>

(Note different encoding for char in text and uri.)

Both validate using the W3C Markup Validator. Unfortunately, Example 2 fails
the W3C Link Checker stating error mentioned in my previous post.

I have read your links where it is stated, "Note that the collection of legal
values in XML 1.0 Section 2.3, production 5 is much larger than that permitted
to be used in the ID and NAME types defined in HTML 4."

html's "id" (and "name") allow: letters (a-z), digits, hyphens, underscores,
colons, and periods.

Unfortunately, I have failed to find a list of valid encodings for the "id" in
xhtml. Nevertheless, since xhtml's list is "much larger", then I guess the
previously mentioned anchor is correct?

Configure bugmail: http://www.w3.org/Bugs/Public/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the QA contact for the bug.
Received on Tuesday, 19 May 2009 08:44:48 UTC

This archive was generated by hypermail 2.4.0 : Friday, 17 January 2020 23:02:35 UTC