- From: Jirka Kosek <jirka@kosek.cz>
- Date: Wed, 10 Oct 2012 11:14:03 +0200
- To: Yves Savourel <ysavourel@enlaso.com>
- CC: public-multilingualweb-lt@w3.org
- Message-ID: <50753C5B.1070109@kosek.cz>
On 9.10.2012 15:17, Yves Savourel wrote: > During the call we mentioned: allowedCharacters="." (which is currently in the specification as an example) > And allowedCharacters=".*" > > I think both are wrong. > > I don't think ".*" is a character class as defined here: > http://www.w3.org/TR/xmlschema-2/#charcter-classes Hi, you are right. I forgot that we are using classes only, so * is not allowed. > And "." is a valid pattern for a character class, but corresponds to [^\n\r] no to 'any characters'. (see http://www.w3.org/TR/xmlschema-2/#nt-WildcardEsc) > > Could we then use ".[\n\r]" for "any characters? We can't use .[\n\r] as it is not valid class. What we can do is to redefine "." as matching any character including \n and \r. This should not pose any harm to implementors as RE libraries usually had flag which controls whether \n and \r are covered by . (whole string matching, and line-by-line matching). I think this could be the easiest and cleanest solution both from the spec and implementation PoV. Of course I'm looking for Yves counterarguments as he is already typing relevant Java code to prove that my thinking was wrong :-D Jirka -- ------------------------------------------------------------------ Jirka Kosek e-mail: jirka@kosek.cz http://xmlguru.cz ------------------------------------------------------------------ Professional XML consulting and training services DocBook customization, custom XSLT/XSL-FO document processing ------------------------------------------------------------------ OASIS DocBook TC member, W3C Invited Expert, ISO JTC1/SC34 member ------------------------------------------------------------------
Received on Wednesday, 10 October 2012 09:14:32 UTC