- From: David Carlisle <davidc@nag.co.uk>
- Date: Mon, 31 Mar 2008 10:01:26 +0100
- To: hsivonen@iki.fi
- Cc: public-html@w3.org, www-math@w3.org
> The right way to do either is to run an HTML5 parser. I don't see how that is likely to happen while the "html parser" is simply that, with so many hard coded rules for html elements. If the parsing was abstracted away from html and then some schema language was used to specify html5 in terms od that abstraction, perhaps other languages could least consider whether they wanted to offfer lax "html-style" parsing in addition to xml. This is essentially how John Cowan's tag soup works. Now it may be that you've looked at existing behaviour and decided the only way to model that is build in special rules everywhere, if that's the case, so be it, but that severely limits the usefulness of such a parser in a non-html context. > We can ask browsers to use the XML serialization for clipboad export > on platforms that have pre-existing deployed XML-based clipboard > flavor for MathML yes and you would also need to ask all editing systems not to generate <math>1+2=3</math> so that what they produce could be used as mathml without having to pass it to a browser and cut it out. The simplest way to ensure that editors don't produce such corruption is not to imply that it is legal in the first place. It offers very little benefit to anyone, and massive oportunities for incompatiblity with the past and corruption of data (where the system does not imply the element structure the author expected) in the future. David ________________________________________________________________________ The Numerical Algorithms Group Ltd is a company registered in England and Wales with company number 1249803. The registered office is: Wilkinson House, Jordan Hill Road, Oxford OX2 8DR, United Kingdom. This e-mail has been scanned for all viruses by Star. The service is powered by MessageLabs. ________________________________________________________________________
Received on Monday, 31 March 2008 09:15:56 UTC