W3C home > Mailing lists > Public > public-html-xml@w3.org > July 2011

Re: Revised HTML/XML Task Force Report

From: Karl Dubost <karld@opera.com>
Date: Tue, 12 Jul 2011 22:34:18 -0400
Message-Id: <7237DEE5-30C6-4D63-90D2-CCE10A5BF1C0@opera.com>
Cc: Robin Berjon <robin@berjon.com>, public-html-xml@w3.org
To: Larry Masinter <masinter@adobe.com>
removing www-tag,

Larry,

I still do not understand what you are trying to achieve. 

1. Are you really not aware of the issues after these years of discussions?
or  2. Do you want to just improve the report?

Le 12 juil. 2011 à 11:24, Robin Berjon a écrit :
> The stated problem is consuming HTML content with an XML tool chain.


# HTML Entities
One issue html entities for example, put that in your browser of choice

data:application/xhtml+xml,<!doctype><html><title>boo</title><p>&eacute;l&eacute;gant</p></html>

common? yes 
see http://dev.opera.com/articles/view/mama-character-entities/

# WELL FORMED
well-formedness. Most HTML documents of the Web are not wellformed nor valid. 
common? yes.
http://dev.opera.com/articles/view/mama-w3c-validator-research-2/#validated

Many of them in the link, I already gave
http://mzl.la/qqTz5Q


-- 
Karl Dubost - http://dev.opera.com/
Developer Relations & Tools, Opera Software
Received on Wednesday, 13 July 2011 02:35:06 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Wednesday, 13 July 2011 02:35:06 GMT