[Bug 11234] New: Invalidate documents whose text content contains improperly balanced bidi formatting characters

http://www.w3.org/Bugs/Public/show_bug.cgi?id=11234

           Summary: Invalidate documents whose text content contains
                    improperly balanced bidi formatting characters
           Product: HTML WG
           Version: unspecified
          Platform: PC
        OS/Version: Windows XP
            Status: NEW
          Severity: normal
          Priority: P2
         Component: pre-LC1 HTML5 spec (editor: Ian Hickson)
        AssignedTo: ian@hixie.ch
        ReportedBy: aharon.lists.lanin@gmail.com
         QAContact: public-html-bugzilla@w3.org
                CC: mike@w3.org, public-html-wg-issue-tracking@w3.org,
                    public-html@w3.org


As has surfaced in the discussion of bug 10809, it would be helpful to declare
invalid documents where any element's text node children (*not* descendants
generally) contain improperly balanced LRE, RLE, LRO, RLO, or PDF characters.
In other words, for the purposes of validation, treat every LRE, RLE, LRO, or
RLO character as the opening tag of an imaginary element, something like
<bidi-formatting>, and PDF as that imaginary element's closing tag. This
applies to these character's entities, as well, of course.

Examples of invalid usage:

1. <div>&#x202A;</div>
2. <div>&#x202C;</div>
3. <div>&#x202C;&#x202A;</div>
4. <div>&#x202A;&#x202A;&#x202C;</div>
5. <div>&#x202A;<br>&#x202A;&#x202C;</div>
6. <div>&#x202A;<span>&#x202C;</span></div>
7. <div><span>&#x202A;</span>&#x202C;</div>

An example of valid (but not recommended!) usage:

<div>&#x202A;<span>...</span>&#x202C;</div>

-- 
Configure bugmail: http://www.w3.org/Bugs/Public/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.

Received on Friday, 5 November 2010 11:30:36 UTC