- From: <bugzilla@jessica.w3.org>
- Date: Wed, 01 Jun 2011 01:18:20 +0000
- To: public-html@w3.org
http://www.w3.org/Bugs/Public/show_bug.cgi?id=12839 Summary: @id: Define how Unicode normalization affects the 'unique identifier' status Product: HTML WG Version: unspecified Platform: PC URL: http://dev.w3.org/html5/spec/elements#concept-id OS/Version: All Status: NEW Severity: normal Priority: P3 Component: HTML5 spec (editor: Ian Hickson) AssignedTo: ian@hixie.ch ReportedBy: xn--mlform-iua@xn--mlform-iua.no QAContact: public-html-bugzilla@w3.org CC: mike@w3.org, public-html-wg-issue-tracking@w3.org, public-html@w3.org PROPOSAL: * DEFINE 'unique identifier'. * SUGGESTED DEFINITION: State that W3C normalization [*] must be performed before it can be established whether the @id is valid. That is: before it can be established whether it constitutes a 'unique identifier'. [*] http://unicode.org/faq/normalization.html#7 This means (and this should perhaps be emphasized) that if two id attributes differ only with regard to their normalization form, then it is a violation of the "unique identifier" requirement. NOTE: private tests of today's user agents (IE8, Firefox4, Opera11, Safari, Chrome) shows that that <a href="#å">link</a> targets <p id="å"> whereas <a href="#å">link</a> targets <p id="å"> Thus, today's user agents do actually treat them as unique identifiers, despite that they both refer to the same "å" (å). However, in order toi avoid author confusion as well as user confusion, this should not be considered valid. It probably also breaks a number of other specs, including Unicode, to treat them as unique. CURRENT STATUS: Spec says: ]] The id attribute specifies its element's unique identifier (ID). The value must be unique amongst all the IDs in the element's home subtree and must contain at least one character.[[ PROBLEM: there is no definition of "unique". Specifically, it does not state whether two @id attributes that differs only with regard to the normalization, are to be considered unique, or not. EXAMPLE: In this example document, the letter 'å' (å) is first represented in decomposed form, and thereafter in composed form: <!DOCTYPE html><title></title><p id="å"><p id="å"> -- Configure bugmail: http://www.w3.org/Bugs/Public/userprefs.cgi?tab=email ------- You are receiving this mail because: ------- You are on the CC list for the bug.
Received on Wednesday, 1 June 2011 01:18:22 UTC