- From: Noah Mendelsohn <nrm@arcanedomain.com>
- Date: Fri, 30 Sep 2011 09:15:56 -0400
- To: Henri Sivonen <hsivonen@iki.fi>
- CC: public-html-xml@w3.org, Robert Leif <rleif@rleif.com>
On 9/29/2011 4:28 AM, Henri Sivonen wrote:
> On Wed, Sep 28, 2011 at 9:45 PM, Robert Leif<rleif@rleif.com> wrote:
>> My proposals were for XHTML5, which I believe, as of yet, has neither
>> been implemented nor described in any detail.
>
> Your belief is incorrect. The HTML5 spec describes XHTML5 and Firefox,
> Opera, Chrome, Safari and IE9 implement XHTML5 to the extent they
> implement the corresponding features on the non-X HTML side.
>
At the risk of causing more confusion, let me offer a theory as to why
everyone is talking past each other here. Earlier, Robert Leif wrote:
> I believe that possibility of developing a working interface between XML
> and HTML5 can be maximized by concentrating on XHTML5. The simplest
> solution would be to encourage Microsoft to make its HTML5 schema fully
> implement HTML5 and make this schema(s) available to the rest of the
> software community at no charge. Microsoft’s and other schema and XML
> validation tools including browsers would then have to accept interleave
> elements, such as the one in XSD1.1.
This and other writings by Robert suggest that he takes a very
schema-centric view of what XML is, and of how schemas can be fundamental
to improving the HTML/XML story. I don't think that premise is shared by
most of the others in this discussion, and so confusion is resulting.
The first two sentences quoted above, taken together, imply that blessing
as standard an HTML5 schema from Microsoft would somehow cause HTML and XML
to interoperate in a way that the would not without the schema. I think
that's true only in a very particular sense. Obviously, if you are using an
XML stack that happens to to use XSD or some other XML schema language
(e.g. RelaxNG), then having a useful schema that captures some of the
constraints required for valid HTML5 will help you get useful validations
out, or might drive some tooling in an automated way, but that's about all
it gets you. I've tried to word that quite carefully, in particular:
* Even many users who do want to use XML to process their HTML will not
want to run schema validation, especially in production (as opposed to
debugging). For those users, any schema will at most serve as
documentation, and that documentation will almost surely be redundant with
the normative documentation for HTML5 and XHTML5, both of which are
contained in the draft specification for HTML5.
* Even for those who do wish to run XML Schema validation, no schema from
Microsoft or anyone else can capture more than some of the constraints that
you need for HTML validation. Yes, having that level of checking automated
using XSD may be useful, but it can't fundamentally answer questions like:
how compatible are the DOMs you get when you parse as XML vs. HTML, and
that question is fundamental to having scripts run compatibly. Indeed, much
of the polyglot discussion in this thread is aimed at dealing with or
avoiding such incompatibilities, and XSD has little to say about them.
By the way, these limitations on XML schemas are not confined to HTML
validation. Almost any XML vocabulary will have levels of validity
requirements that XSD can't capture. For example, a mathematician using XSD
might wish to contstrain the value of some attribute to be a prime integer.
XSD can only check "integerness", and maybe something like
{1,2,odd-number}. At a yet higher level, a schema for a purchase order can
check that an element value resembles a credit-card number in format, but
almost surely can't check that the card number is valid (e.g. has been
issued and is not stolen).
I'm a fan of XSD and RelaxNG for what they're good at. Once the community
figures out some good way of writing HTML as XML, and once we figure out
which users want to do that, then by all means we should write schemas that
capture as many useful constraints as we can. That will allow tools such as
XSD to automate some checking, more strongly type some fields, perhaps
facilitate autmatic binding into document management systems etc.
What I don't think is that writing such schemas is a first step, or is in
any way fundamental to answering the questions confronting this task force:
who wants to write HTML that is processable as XML?; what are their
requirements? what XML/HTML format can we specify that would be convenient
to use and would meet those needs well?; are there alternate modes (e.g.
XML5) of processing XML-like documents that will result in DOM's that are
more usefully compatible with what HTML users want and expect?; etc. Or
having for the moment mostly failed to find ways to do all of that, at
least document what the requirements are and what good practice is given
the specifications and tools as they exist. I think thatt last bit is
where I think the report is today. A compromise, but a useful step.
Robert: I don't know if the above makes sense to you, but the net of it is
that I think you haven't yet convinced others that a schema-centric
approach is the right way to tackle these issues. My apologies if I've
misunderstood your ideas.
Noah
Received on Friday, 30 September 2011 13:16:30 UTC