- From: <bugzilla@jessica.w3.org>
- Date: Mon, 24 Jun 2013 17:02:20 +0000
- To: public-html-admin@w3.org
https://www.w3.org/Bugs/Public/show_bug.cgi?id=22436
Bug ID: 22436
Summary: Give rules for content that is treated as text under a
common heading
Classification: Unclassified
Product: HTML WG
Version: unspecified
Hardware: PC
URL: http://dev.w3.org/html5/html-xhtml-author-guide/
OS: All
Status: NEW
Severity: normal
Priority: P2
Component: HTML/XHTML Compatibility Authoring Guide (ed: Eliot
Graff)
Assignee: eliotgra@microsoft.com
Reporter: xn--mlform-iua@xn--mlform-iua.no
QA Contact: public-html-bugzilla@w3.org
CC: eliotgra@microsoft.com, mike@w3.org,
public-html-admin@w3.org,
public-html-wg-issue-tracking@w3.org
See the thread ”During HTML parsing, are *all* named character references
replaced by their corresponding glyph?”, and in particular this answer from
Michael:
http://www.w3.org/mid/20130624113437.GB37583@sideshowbarker
What Michael said, is easy to forget. Thus, I think this subject needs a little
more description in Polyglot Markup. Right now, only <script> and <style> are
covered - and also <noscript>.
I would propose to
a) ad a section that describes the general issue of content
that, unlike in XML, is treated as text by the HTML parser
Motivation: This a an important and general gotcha and
difference, both within pure HTML, but especialy when
creating polyglots.
b) In practise, this means listing all the elements
that themselves - or their children, are treated
as text by the HTML parsers. (This includes
all elements that begins with the string “<no”, such
as <noscript> and <noframe>, as well as <script>,
<style>, <xmp>, <iframe> and perhaps some more (?)
NB: It may also make sense to mention, in a note
that the “sane” elements, such as <object>,
<video> etc, are not treated that way.
c) The section should give the various usage rules
- some elements are forbidden etc, while others
have special rules for polyglots under this
heading. (Thus, the script/style should go there
- or at least be represented with a link to the
section where their rules are described.)
Btw, note that HTML5 already says that the content of iframe must be empty in
XML, so describing iframe should be a nobrainer. See
http://www.w3.org/TR/html5/embedded-content-0.html#iframe-content-model
And HTML5 has similar things to say about most - if not of these elements, so
it is mostly a collection job.
--
You are receiving this mail because:
You are on the CC list for the bug.
Received on Monday, 24 June 2013 17:02:22 UTC