- From: Bruce D. Sinclair <b.sinclair@liant.com>
- Date: Thu, 22 Mar 2001 18:07:15 -0600
- To: <www-xml-schema-comments@w3.org>
In Appendix F, Regular Expressions, of XML Schema Part 2: Datatypes, there is a problem with the definition of the \w character sequence. ([39] MultiCharEsc). The subtracted character class given is [\p{P}\p{S}\p{C}], but the text describes this as "(all characters except the set of "punctuation", "separator" and "control" characters)." In the subtracted character class, the sequence \p{S} would eliminate the "symbol" characters, not the "separator" characters. Should this have been \p{Z}? Or is the explanation simply a misinterpretation of \p{S}? Also, the use of "control" for \p{C} is misleading since the control characters are a subset of "Other" characters, that is, \p{Cc} selects the "control" characters. --Bruce D. Sinclair
Received on Thursday, 22 March 2001 19:07:08 UTC