Re: tag name state

+1

On Wed, Feb 29, 2012 at 2:34 PM, David Carlisle <davidc@nag.co.uk> wrote:

>
> Jeni said
>
>> And to be specific, my suggestion is that when in the Tag name state
>> [2], if the next character is <, then this is a Parse Error, and the
>> parser emits the current token and reprocesses the current input
>> character (<) in the data state.
>>
>
> _If_ we are going to differ from HTML5 at this point I think I would go
> further. We have a hard requirement I think that any tree have a
> serialisation as namespace well formed XML. If we tokenise a start tag
> at this point that isn't a legal XML name then inevitably there will
> have to be some arbitrary character mangling leading to names such as
>
> oneU00003CtwoU00003CthreeU00003C
>
> How would it work if we split up tag name state into a series of states so
> the only characters accepted are
>
> name start
> optional name characters (excluding ":")
> optional
>  ":"
>  name start
>  optional name characters (excluding ":")
>
> ie only namespace well formed names are accepted.
>
> using the XML 1.1 / XML 1.0 (5th edition) definitions of Name Start and
> Name characters.
>
> In each of these states, if a non-name character is seen it is put back
> and reprocessed in data state. If that happens on the first character, the
> < is put back as data and no tag is tokenised at all.
>
> And same for attribute names of course.
>
> David
>
>
>
>
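
For illustration, here is a rough sketch of how I read David's split states, written as a plain scanner rather than a real tokenizer. All function and variable names are hypothetical. The character ranges are the NameStartChar and NameChar productions from XML 1.0 (5th edition), with ":" taken out so it can be handled as its own state. One assumption that is not in the proposal: a ":" is only accepted when a name start character follows it; otherwise the name ends before the colon.

```python
# Sketch only: names here are hypothetical, not taken from any spec text.

# NameStartChar ranges (XML 1.0 5th ed. / XML 1.1), with ":" left out so
# that the colon can be treated as a separate state.
_START_RANGES = [
    (0x41, 0x5A), (0x5F, 0x5F), (0x61, 0x7A), (0xC0, 0xD6), (0xD8, 0xF6),
    (0xF8, 0x2FF), (0x370, 0x37D), (0x37F, 0x1FFF), (0x200C, 0x200D),
    (0x2070, 0x218F), (0x2C00, 0x2FEF), (0x3001, 0xD7FF), (0xF900, 0xFDCF),
    (0xFDF0, 0xFFFD), (0x10000, 0xEFFFF),
]
# Characters additionally allowed after the first one: "-", ".", digits, etc.
_EXTRA_RANGES = [
    (0x2D, 0x2E), (0x30, 0x39), (0xB7, 0xB7), (0x300, 0x36F), (0x203F, 0x2040),
]


def _in_ranges(ch, ranges):
    return any(lo <= ord(ch) <= hi for lo, hi in ranges)


def is_name_start(ch):
    return _in_ranges(ch, _START_RANGES)


def is_name_char(ch):
    return is_name_start(ch) or _in_ranges(ch, _EXTRA_RANGES)


def _scan_ncname(text, pos):
    """Return the index just past a colon-free name at pos, or pos if none."""
    if pos >= len(text) or not is_name_start(text[pos]):
        return pos
    pos += 1
    while pos < len(text) and is_name_char(text[pos]):
        pos += 1
    return pos


def scan_tag_name(text, lt_pos):
    """text[lt_pos] must be '<'. Return (name, next_pos).

    name is None when the character after '<' is not a name start; the
    caller should then emit '<' as data and carry on from next_pos.
    """
    start = lt_pos + 1
    end = _scan_ncname(text, start)
    if end == start:
        return None, start
    # Assumption of this sketch: accept ":" only if a local part follows.
    if end < len(text) and text[end] == ":":
        local_end = _scan_ncname(text, end + 1)
        if local_end > end + 1:
            end = local_end
    return text[start:end], end
```

With this, scan_tag_name("<one<two", 0) stops the name at "one" and leaves the second "<" to be reprocessed by the caller, so no mangled name is ever produced. What the tokenizer does next with ">", "/" or whitespace is outside this sketch.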


-- 
Innovimax SARL
Consulting, Training & XML Development
9, impasse des Orteaux
75020 Paris
Tel : +33 9 52 475787
Fax : +33 1 4356 1746
http://www.innovimax.fr
RCS Paris 488.018.631
SARL au capital de 10.000 €

Received on Wednesday, 29 February 2012 16:42:51 UTC