- From: Bjoern Hoehrmann <derhoermi@gmx.net>
- Date: Sun, 17 Oct 2004 11:41:28 +0200
- To: xml-editor@w3.org
Dear XML Core Working Group, I am unable to find your public response that formally addresses an issue raised by David Carlisle on the XML 1.1 Proposed Recommendation which is publicly archived at http://lists.w3.org/Archives/Public/xml-editor/2003OctDec/0048.html which you have apparently rejected (the Recommendation contains the same apparently contradictory text). The relevant text in the Recommendation is: [...] To simplify the tasks of applications, the XML processor MUST behave as if it normalized all line breaks in external parsed entities (including the document entity) on input, before parsing, by translating all of the following to a single #xA character: [...] The characters #x85 and #x2028 cannot be reliably recognized and translated until an entity's encoding declaration (if present) has been read. Therefore, it is a fatal error to use them within the XML declaration or text declaration. [...] I do not understand this either. Please point me to your response to David which will hopefully answer the following questions: * Why is it (in theory) not possible to recognize these characters reliably or (in theory) with less reliability than recognizing any other character such as U+0020? * How can a processor detect this error if it is not possible to recognize the offending characters reliably? * How can a processor detect this error if it is not possible that these characters are present when parsing the XML declaration due to line break normalization? regards.
Received on Sunday, 17 October 2004 09:42:17 UTC