- From: Rick Jelliffe <ricko@topologi.com>
- Date: Tue, 14 Jan 2003 01:24:03 +1100
- To: <www-tag@w3.org>
(Repost from Wednesday. This never made the archive, not sure why.) I think Chris misses out another option: ?) Refactor XML so that there are four kinds of XML processors: headlessWF, WF, typed, and valid. Deprecate WF in favour of WF and typedWF in all W3C specifications. - Headless WF must have no DOCTYPE. - Typed would use the DTD for entity expansion, decoration and type annotating only without validating. Typed may also include built-in defaults for standard entities sets. A typed XML processor will result in all attributes of type ID being so noted in the Infoset. Advantages: - headless gives lightweight XML for those who need it, especially those who want to enforce no DTDs: so reflects common practise in some industries - always makes sure that documents with DTD have exactly the same infoset (except validation info) whether they are valid or just typed - existing mechanism (DTDs) - allows small documents, with just the declarations for IDs in the prolog, so document sizes can be comparable to proposals for inline declarations - typed gives a form of XML that HTML can use without buying into DTD validation: the DTD for XHTML would only have ID attributes (and - addresses HTML's problem, rather than farting around one particular issue at a time: the problem is not "we need IDs" or "HTML needs entities" but that "WF versus Valid is proving to be the wrong split." . Disadvantages: - existing mechanism is deemed poor, but this is partly due to stupidity of making DTD interpretation at whim of parser, - not namespace aware (though this could be fixed at the same time: indeed, one of the ISO DSDL schema languages is to use DTDs externally with namespace awareness and no syntax change: it is W3C who is conservative on adding namespace-awareness to DTDs.) - performs/conflates type annotation and decoration, though why this is so bad when there is a strict processing order eludes me In other words, this should be XML 1.2. No existing documents would become invalid. The HTML entity problem would be resolved. The ID problem would be resolved. The roulette Infoset problem would be resolved. The lack of namespace awareness in DTDs would be resolved. The need for an official lightweight form of XML would be resolved. Cheers Rick Jelliffe
Received on Monday, 13 January 2003 22:00:46 UTC