- From: Richard Ishida <ishida@w3.org>
- Date: Fri, 13 Jan 2006 12:27:43 -0000
- To: "'Felix Sasaki'" <fsasaki@w3.org>, <public-i18n-core@w3.org>
Felix,
I don't think it should take long to search for duplicates. We must also add these comments to the last call table, so that we have a complete record of comments and responses. This is important for the transition of the document to CR.
RI
============
Richard Ishida
Internationalization Lead
W3C (World Wide Web Consortium)
http://www.w3.org/People/Ishida/
http://www.w3.org/International/
http://people.w3.org/rishida/blog/
http://www.flickr.com/photos/ishida/
> -----Original Message-----
> From: public-i18n-core-request@w3.org
> [mailto:public-i18n-core-request@w3.org] On Behalf Of Felix Sasaki
> Sent: 12 January 2006 03:12
> To: public-i18n-core@w3.org
> Subject: Fwd: Comments on Character Model for the World Wide
> Web 1.0: Normalization
>
> Hi all,
>
> This is the approved version of the XQuery / XSL Working
> Group comments.
> If you have a look at them, you see that they are very closly
> related to the progress of the QT specifications. Most of
> them are now in canidate recommendation stage. If we still
> want to have an influence on the specs before they become
> recs, we should reply fast. Hence, I propose to postpone
> Francois action item to look for dublicates in the previous
> last call, and talk about the comments during the teleconfs
> in the next week.
>
> Felix
>
> ------- Forwarded message -------
> From: "Jim Melton" <jim.melton@acm.org>
> To: www-i18n-comments@w3.org
> Cc: w3c-xsl-query@w3.org, "C. Michael Sperberg-McQueen"
> <cmsmcq@acm.org>
> Subject: Comments on Character Model for the World Wide Web 1.0:
> Normalization
> Date: Thu, 12 Jan 2006 05:32:12 +0900
>
> Gentlepeople,
>
> A joint teleconference of the XML Query Working Group and the
> XSL Working Group has approved the following as their formal
> comments on the document entitled Character Model for the
> World Wide Web 1.0:
> Normalization. (Please note that these comments are
> substantially the same as the personal comments that I sent
> to you in late December, 2005, the principal change being the
> addition of an example in point (3) below.)
>
> (1) In section 2, Conformance, the list of specification
> conformance criteria include: "make it a conformance
> requirement for implementations to conform to this document",
> and "make it a conformance requirement for content to conform
> to this document". Would you clarify (perhaps only as a
> response to this message) whether or not the XQuery 1.0,
> XPath 2.0, and XSLT 2.0 suite of specifications would be
> cited as non-conforming to this specification if (as I
> believe to be the case) they do not contain an explicit
> statement of those two criteria?
>
> (2) In section 3.2.3, Include-normalized text, bullet 2 uses
> the phrase "clause 1 above". I believe that most readers
> will better understand your meaning if you replace that with
> "bullet 1 above" or "list item 1 above". To many readers,
> the word "clause" refers either to a major subdivision of a
> document (e.g., a chapter) or to a relatively short phrase
> such as a portion of a sentence (e.g., the noun clause).
>
> (3) In section 3.2.4, Fully-normalized text, first numbered
> list, bullet 1 says that a composing character is "the second
> character in the canonical decomposition mapping of some
> character". There are characters in Unicode that are made of
> a "base character" plus two or more composing characters;
> therefore, "a composing character" would be "each character
> after the first in the canonical decomposition mapping of
> some character". One example of such a character would seem
> to be U+1FA4 GREEK SMALL LETTER OMEGA WITH PSILI AND OXIA AND
> YPOGEGRAMMENI, the canonical decomposition of which is
> U+03C9 GREEK SMALL LETTER OMEGA + U+0313 COMBINING COMMA
> ABOVE + U+0301
> COMBINING ACUTE ACCENT + U+0345 COMBINING GREEK YPOGEGRAMMENI.
>
> (4) In section 3.2.4, Fully-normalized text, first numbered
> list, bullet 1 refers to "some character that is not listed
> in the Composition Exclusion Table defined in [UTR #15]".
> However, following the link to the most recent version of UTR
> #15, the section of that document whose title is "Composition
> Exclusion Table" contains neither a table nor a list of
> characters. While this is an apparent failure of UTF #15,
> the dependence on that section of UTR #15 cascades that
> failure into Normalization. However, there is (in section 6
> of UTF #15) a (not terribly
> obvious) reference to "the Composition Exclusion Table
> [Exclusions]". The References entry with that name
> (Exclusions) contains pointers to several versions of such a
> table, the latest of which is available at
> <http://www.unicode.org/Public/UNIDATA/CompositionExclusions.t
> xt>http://www.unicode.org/Public/UNIDATA/CompositionExclusions.txt
> . It would have seemed a Very Good Idea for Normalization to
> point directly to this file, perhaps in addition to the
> reference directly to UTF
> #16 section 6.
>
> (5) In section 3.2.4, Fully-normalized text, second numbered
> list, bullet 2 uses the phrase "clause 1 above". I believe
> that most readers will better understand your meaning if you
> replace that with "bullet 1 above" or "list item 1 above".
> To many readers, the word "clause" refers either to a major
> subdivision of a document (e.g., a chapter) or to a
> relatively short phrase such as a portion of a sentence
> (e.g., the noun clause).
>
> (6) In section 3.2.4, Fully-normalized text, the paragraph
> beginning "Identification of the constructs..." includes the
> statement that "it is the responsibility of the specification
> for a language to specify exactly what constitutes a relevant
> construct". Could you please clarify whether or not the
> XQuery 1.0, XPath 2.0, and XSLT 2.0 suite of specifications
> would be cited as non-conforming to this specification if (as
> I believe to be the case) they do not contain any such
> explicit specification?
>
> (7) In section 3.2.7, Certified and suspect text, the NOTE
> begins with the statement "To normalize text, it is in
> general sufficient to store the last seen character...".
> Perhaps I've missed something important earlier in this
> specification, but I have no idea what that statement means.
> One way of explaining it is to use the example of text "C
> combining-cedilla". When processing that text, I store the
> last seen character (combining-cedilla). And, violá, the
> text is normalized. But that obviously is not the case. So
> what does that statement mean? Could it be expressed in a
> less ambiguous manner?
>
> (8) In section 3.4, Responsibility for normalization, item
> C303 includes an Example that uses the notations "xf:concat"
> and "xf:substring". In both cases (because this document
> does not define any namespace prefixes associated with the
> namespace name associated with XPath/XQuery functions), the
> "xf" should be replaced with "fn", which is the conventional
> prefix used for that namespace.
>
> (9) In section 4, String identity matching, item C312, list
> item 1 includes the statement "In accordance with section
> <http://www.w3.org/TR/2005/WD-charmod-norm-20051027/#sec-Norma
lization>3
> Normalization, this step MUST be performed by the producers
> of the strings to be compared." But section 3 does not make
> such a requirement (it did so in earlier drafts, but has been
> changed in this draft). At the very least, that use of
> "MUST" must (pun intended) be replaced by "SHOULD".
> Furthermore, the requirement to use "Early uniform
> normalization" might be correct because of the use of "as if"
> in the preceding paragraph, but (as section 3 makes clear)
> late normalization will produce identical results.
>
> (10) In appendix A, the reference to XQuery Operators
> includes an outdated list of editors. Jonathan Robie is no
> longer cited as an editor of that specification.
> Furthermore, the most recent edition is now dated 4 November,
> 2005, and is a Candidate Recommendation. (Of course, because
> Normalization was published earlier than that date, you could
> not have known this fact; the next publication of
> Normalization should make this
> change.)
>
> (11) In Appendix B, the final NOTE: says that certain
> characters may be displayed as a blank or as a blank
> rectangle. In some situations (e.g., Firefox 1.0.4 on my
> system without any font that covers Sinhala, a question mark
> ("?") is displayed. It might be appropriate to include that
> possibility in this NOTE.
>
>
> Hope this helps,
> Jim
>
> ==============================================================
> ==========
> Jim Melton --- Editor of ISO/IEC 9075-* (SQL) Phone:
> +1.801.942.0144
> Co-Chair, W3C XML Query WG; F&O (etc.) editor Fax :
> +1.801.942.3345
> Oracle Corporation Oracle Email: jim dot melton at
> oracle dot com
> 1930 Viscounti Drive Standards email: jim dot melton at
> acm dot org
> Sandy, UT 84093-1063 USA Personal email: jim at
> melton dot name
> ==============================================================
> ==========
> = Facts are facts. But any opinions expressed are the
> opinions =
> = only of myself and may or may not reflect the opinions of
> anybody =
> = else with whom I may or may not have discussed the issues
> at hand. =
> ==============================================================
> ==========
> ==============================================================
> ==========
> Jim Melton --- Editor of ISO/IEC 9075-* (SQL) Phone:
> +1.801.942.0144
> Co-Chair, W3C XML Query WG; F&O (etc.) editor Fax :
> +1.801.942.3345
> Oracle Corporation Oracle Email: jim dot melton at
> oracle dot com
> 1930 Viscounti Drive Standards email: jim dot melton at
> acm dot org
> Sandy, UT 84093-1063 USA Personal email: jim at
> melton dot name
> ==============================================================
> ==========
> = Facts are facts. But any opinions expressed are the
> opinions =
> = only of myself and may or may not reflect the opinions of
> anybody =
> = else with whom I may or may not have discussed the issues
> at hand. =
> ==============================================================
> ==========
>
>
Received on Friday, 13 January 2006 12:27:59 UTC