- From: Pablo Nieto Caride <pablo.nieto@linguaserve.com>
- Date: Tue, 16 Apr 2013 18:29:13 +0200
- To: "'Jirka Kosek'" <jirka@kosek.cz>
- Cc: <public-multilingualweb-lt@w3.org>
Hi Jirka, Felix, all,
This is the ABNF I created based on http://www.w3.org/TR/xmlschema-2/#nt-charClass. It covers pretty much the subset Yves and Shaun proposed:
Subset:
1. character classes [abc] [a-zA-Z_\-]
2. ranges [a-c]
3. negations [^abc]
4. "^" and "]" must never appear unless backslash-escaped
5. "-" may be backslash-escaped
6. escape sequences "\n", "\r", "\t", "\d", and "\D"
7. literal "\" is escaped as "\\"
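To make the subset concrete, here is a small sketch that runs sample patterns through Python's `re` module. Python's regex dialect is not the XML Schema one; it is used here only on the assumption that it reads these particular constructs the same way. The sample patterns and strings, and the helper name `check`, are illustrative and not taken from the proposal.

```python
import re

# Each entry: (pattern from the subset, string that should match, string that should not).
# All samples are illustrative assumptions, not part of the proposal.
EXAMPLES = [
    (r"[abc]", "b", "d"),          # 1. character class
    (r"[a-zA-Z_\-]", "-", "5"),    # 1. class with an escaped "-"
    (r"[a-c]", "c", "d"),          # 2. range
    (r"[^abc]", "d", "a"),         # 3. negation
    (r"[\^\]]", "]", "a"),         # 4. "^" and "]" backslash-escaped
    (r"[\n\r\t]", "\t", " "),      # 6. single-character escapes
    (r"[\d]", "7", "x"),           # 6. "\d" (digits)
    (r"[\D]", "x", "7"),           # 6. "\D" (non-digits)
    (r"[\\]", "\\", "/"),          # 7. literal backslash
]


def check(pattern: str, text: str) -> bool:
    """Return True if the whole of `text` matches `pattern`."""
    return re.fullmatch(pattern, text) is not None


for pattern, good, bad in EXAMPLES:
    assert check(pattern, good), (pattern, good)
    assert not check(pattern, bad), (pattern, bad)
```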
The rules of the ABNF are:
[1] charClass ::= charClassEsc | charClassExpr | WildcardEsc
[2] charClassEsc ::= SingleCharEsc | MultiCharEsc
[3] SingleCharEsc ::= '\' [nrt\|.?*+(){}#x2D#x5B#x5D#x5E]
[4] MultiCharEsc ::= '\' [dD]
[5] charClassExpr ::= '[' charGroup ']'
[6] charGroup ::= posCharGroup | negCharGroup | charClassSub
[7] posCharGroup ::= ( charRange | charClassEsc )+
[8] charRange ::= seRange | XmlCharIncDash
[9] seRange ::= charOrEsc '-' charOrEsc
[10] charOrEsc ::= XmlChar | SingleCharEsc
[11] XmlChar ::= [^\#x2D#x5B#x5D]
[12] XmlCharIncDash ::= [^\#x5B#x5D]
[13] negCharGroup ::= '^' posCharGroup
[14] charClassSub ::= ( posCharGroup | negCharGroup ) '-' charClassExpr
[15] WildcardEsc ::= '.'
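As a rough illustration of how rules [1] to [15] could be checked in code, here is a minimal recursive-descent recognizer sketch. All function names are hypothetical. It makes two assumptions that the rules themselves do not state: alternatives are tried greedily, and a "-" immediately followed by "[" is always read as the start of a subtraction (rule [14]).

```python
SINGLE_ESC = set("nrt\\|.?*+(){}-[]^")   # characters allowed after '\' in rule [3]
MULTI_ESC = set("dD")                    # characters allowed after '\' in rule [4]

# Every helper takes a string and an index, and returns the index just past
# the matched text, or None if the rule does not match at that index.

def single_char_esc(s, i):               # rule [3]
    return i + 2 if s[i:i+1] == "\\" and s[i+1:i+2] in SINGLE_ESC else None

def multi_char_esc(s, i):                # rule [4]
    return i + 2 if s[i:i+1] == "\\" and s[i+1:i+2] in MULTI_ESC else None

def char_class_esc(s, i):                # rule [2]
    j = single_char_esc(s, i)
    return j if j is not None else multi_char_esc(s, i)

def xml_char(s, i):                      # rule [11]: anything but \ - [ ]
    return i + 1 if i < len(s) and s[i] not in "\\-[]" else None

def xml_char_inc_dash(s, i):             # rule [12]: anything but \ [ ]
    return i + 1 if i < len(s) and s[i] not in "\\[]" else None

def char_or_esc(s, i):                   # rule [10]
    j = xml_char(s, i)
    return j if j is not None else single_char_esc(s, i)

def se_range(s, i):                      # rule [9]
    j = char_or_esc(s, i)
    if j is None or s[j:j+1] != "-":
        return None
    return char_or_esc(s, j + 1)

def pos_char_group(s, i):                # rules [7] and [8], one or more items
    start = i
    while s[i:i+2] != "-[":              # assumption: "-[" starts a subtraction
        j = se_range(s, i)
        if j is None:
            j = xml_char_inc_dash(s, i)
        if j is None:
            j = char_class_esc(s, i)
        if j is None:
            break
        i = j
    return i if i > start else None

def char_group(s, i):                    # rules [6], [13] and [14]
    j = pos_char_group(s, i + 1) if s[i:i+1] == "^" else None
    if j is None:
        j = pos_char_group(s, i)
    if j is None:
        return None
    if s[j:j+1] == "-":                  # only possible when followed by "["
        return char_class_expr(s, j + 1)
    return j

def char_class_expr(s, i):               # rule [5]
    if s[i:i+1] != "[":
        return None
    j = char_group(s, i + 1)
    return j + 1 if j is not None and s[j:j+1] == "]" else None

def is_char_class(s):                    # rules [1] and [15]
    if s == ".":
        return True
    return char_class_esc(s, 0) == len(s) or char_class_expr(s, 0) == len(s)
```

For instance, `is_char_class(r"[a-zA-Z_\-]")` and `is_char_class("[a-z-[aeiou]]")` are accepted, while `is_char_class("[]")` and `is_char_class("[abc")` are rejected. Because the sketch follows the rules literally, and rules [11] and [12] do not exclude "^", an unescaped "^" is accepted as a literal character; enforcing item 4 of the subset would need an extra check.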
It's pretty much the same as nt-charClass; I just removed some rules and tokens that we didn't really need, as Jirka suggested.
Now, if memory serves, we need a RELAX NG schema to validate the grammar, don't we? Or are we going to use the regex after all?
Cheers,
Pablo.
------------------------------------------------------------------------------------
-----Original Message-----
From: Jirka Kosek [mailto:jirka@kosek.cz]
Sent: Tuesday, 16 April 2013 11:38
To: Pablo Nieto Caride
Cc: public-multilingualweb-lt@w3.org
Subject: Re: [Action-484] Create an ABNF based on http://lists.w3.org/Archives/Public/public-multilingualweb-lt/2013Apr/0047.html
On 16.4.2013 11:30, Pablo Nieto Caride wrote:
> I assume both are valid, in that case do we have any preference?
These are just variations. Personally I prefer the W3C style; it's more readable to a normal reader, IMHO.
Jirka
--
------------------------------------------------------------------
Jirka Kosek e-mail: jirka@kosek.cz http://xmlguru.cz
------------------------------------------------------------------
Professional XML consulting and training services
DocBook customization, custom XSLT/XSL-FO document processing
------------------------------------------------------------------
OASIS DocBook TC member, W3C Invited Expert, ISO JTC1/SC34 rep.
------------------------------------------------------------------
Bringing you XML Prague conference http://xmlprague.cz
------------------------------------------------------------------
Received on Tuesday, 16 April 2013 16:29:42 UTC