Re: Exploring new vocabularies for HTML from David Carlisle on 2008-03-31 (www-math@w3.org from March 2008)

From: David Carlisle <davidc@nag.co.uk>
Date: Mon, 31 Mar 2008 10:01:26 +0100
To: hsivonen@iki.fi
Cc: public-html@w3.org, www-math@w3.org
Message-Id: <200803310901.m2V91QBc026793@edinburgh.nag.co.uk>

> The right way to do either is to run an HTML5 parser.

I don't see how that is likely to happen while the "html parser" is
simply that, with so many hard coded rules for html elements.
If the parsing was abstracted away from html and then some  schema
language was used to specify html5 in terms od that abstraction,
perhaps other languages could least consider whether they wanted to
offfer lax "html-style" parsing in addition to xml. This is essentially
how John Cowan's tag soup works. Now it may be that you've looked at
existing behaviour and decided the only way to model that is build in
special rules everywhere, if that's the case, so be it, but that
severely limits the usefulness of such a parser in a non-html context.



> We can ask browsers to use the XML serialization for clipboad export  
> on platforms that have pre-existing deployed XML-based clipboard  
> flavor for MathML

yes and you would also need to ask all editing systems not to generate
<math>1+2=3</math> so that what they produce could be used as mathml
without having to pass it to a browser and cut it out. The simplest way
to ensure that editors don't produce such corruption is not to imply
that it is legal in the first place. It offers very little benefit to
anyone, and massive oportunities for incompatiblity with the past and
corruption of data (where the system does not imply the element
structure the author expected) in the future.

David

________________________________________________________________________
The Numerical Algorithms Group Ltd is a company registered in England
and Wales with company number 1249803. The registered office is:
Wilkinson House, Jordan Hill Road, Oxford OX2 8DR, United Kingdom.

This e-mail has been scanned for all viruses by Star. The service is
powered by MessageLabs. 
________________________________________________________________________

Received on Monday, 31 March 2008 09:15:56 UTC