Re: XML literals, canonical form, and normal form C problem

Peter:
 > I believe that this response does not adequately address the issue as it
 > does not address the case of typed literals in Production 7.2.16. 
(Untyped
 > literals are handled correctly in this production.)

We agreed with this which is why we proposed a rewording of that 
production: i.e.

replace
 > > [[
 > >
 > > If the rdf:datatype attribute d is given then o :=
 > typed-literal(literal-value
 > > := t.string-value, literal-datatype := d.string-value) otherwise
 > > t.string-value MUST be a Unicode[UNICODE] string in Normal Form 
C[NFC], o
 > :=
 > > literal(literal-value := t.string-value, literal-language := 
e.language)
 > and
 > > the
 > > ]]
 > >
 > > with
 > > [[
 > > The Unicode [UNICODE] string t.string-value MUST be in Normal Form 
C[NFC].
 > > If the rdf:datatype attribute d is given then o :=
 > typed-literal(literal-value
 > > := t.string-value, literal-datatype := d.string-value), otherwise o :=
 > > literal(literal-value := t.string-value, literal-language := 
e.language).
 > The
 > > ..
 > > ]]




Peter:
 >I think that there needs to be some text somewhere in the RDF documents
 >indicating which portions of an RDF/XML document must be in Normal 
Form >C.

*needs* seems quite strong for this issue.
The formal grammar serves this function.
The best advice to document authors is to read charmod, and avoid 
non-NFC text. The best advice to implementors is the formal grammar.



 >I believe that it is possible to have a valid RDF/XML document which when
 >interpreted as a Unicode string is not in Normal Form C.

Correct. For example, within XML Comments, and XML processing 
instructions. However, such documents will not be valid XML 1.1.

I do not believe further changes are needed on this issue.

Are you dissatisified? If so, I will propose that the WG records your 
objection and moves on.

Jeremy

Received on Thursday, 18 September 2003 10:06:08 UTC