[Bug 24780] Request to clarify proper use of explicit UCS code point numbers in regular expression.

https://www.w3.org/Bugs/Public/show_bug.cgi?id=24780

--- Comment #4 from Tom <spectrum777@outlook.com> ---
(In reply to Michael Kay from comment #1)
> If you want to refer to characters by codepoint in an XSD regex, use XML
> numeric character references, e.g. value="&#x20AC;".
> 
> \x and \u are not permitted in an XSD regex. There are some XSD processors,
> however, that are non-conformant in this regard, and that accept these even
> though they are not allowed by the spec.
> 
> It is indeed confusing that the EBNF notation used to specify what is
> allowed in regular expressions is similar to, but not the same as, the
> regular expression syntax that it is specifying. Hence the note. As far as I
> can see, you have read the note and it has helped to resolve the confusion.

Thanks. I would recommend that the regex section carry a "Note:" saying
something similar to the first sentence of your comment quoted above: to refer
to a character by code point, use an XML numeric character reference. I would
also state in the same note that \x and \u are not permitted. Because the EBNF
uses constructs similar to actual regular expressions, and because escaping can
be a tricky topic, a short note seems worthwhile for the regex section. Only a
suggestion. Thanks again for the clarifications.
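
To illustrate what such a note could show, here is a minimal sketch in Python.
It assumes a made-up schema fragment (the type name "euroAmount" and the
pattern itself are hypothetical, not taken from the spec) and uses the standard
library's xml.etree.ElementTree only to demonstrate that the XML parser
resolves the numeric character reference before any regex engine sees the
pattern value. It does not validate anything against the schema.

```python
import xml.etree.ElementTree as ET

XS = "http://www.w3.org/2001/XMLSchema"

# Hypothetical schema fragment: the code point U+20AC (euro sign) is written
# as an XML numeric character reference, not as \x or \u.
SCHEMA = r"""<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema">
  <xs:simpleType name="euroAmount">
    <xs:restriction base="xs:string">
      <xs:pattern value="&#x20AC;[0-9]+"/>
    </xs:restriction>
  </xs:simpleType>
</xs:schema>"""


def pattern_values(schema_text):
    """Return the value of every xs:pattern facet, as delivered by the XML parser."""
    root = ET.fromstring(schema_text)
    return [p.get("value") for p in root.iter("{%s}pattern" % XS)]


values = pattern_values(SCHEMA)
# The reference has already been replaced by the character itself, so the
# regex processor receives the literal euro sign followed by [0-9]+.
```

Because the reference is expanded at the XML level, the regex grammar itself
needs no escape syntax for code points, which is consistent with \x and \u not
being part of it.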


Received on Monday, 24 February 2014 10:07:32 UTC