W3C home > Mailing lists > Public > xmlschema-dev@w3.org > October 2001

Re: More surprises (ID, ENTITY, NOTATION, ...)

From: Henry S. Thompson <ht@cogsci.ed.ac.uk>
Date: 23 Oct 2001 09:15:01 +0100
To: Eric van der Vlist <vdv@dyomedea.com>
Cc: xmlschema-dev@w3.org
Message-ID: <f5blmi2g4ca.fsf@cogsci.ed.ac.uk>
Eric van der Vlist <vdv@dyomedea.com> writes:

> 1) The serialization space.
> 
> This is the "value" as
> actually written in the
> document after expension
> of internal and external
> parsed entities. Per XML
> 1.0, this value isn't
> significative and is
> submitted to the
> applications after a
> first whitespace
> processign depending on
> its location (attribute
> or element) and (when in
> an element ) to the
> xml:space attribute.

And EOL normalisation.  I think xml:space never changes anything,
actually, just determines what is reported _in addition_ to the
characters involved.

> 
> 2) The parser space.
> 
> This is the value sent
> by a conform parser to
> the application after
> the XML 1.0 whitespace
> processing.

i.e. the processing specified above under (1), which really belongs here.

> 3) The lexical space.
> 
> This is the parser space
> after W3C XML Schema
> whitespace processing
> depending on the
> xs:whitespace facet of
> the datatype.
> 
> 
> 4) The value space.
> 
> This is the abstract concept behind the datatype.
> 
> 5) The canonical space.
> 
> This is the ensemble of
> the canonical
> representations of the
> elements of the value
> space.
> 
> 
> (The canonical value may
> be difficult to
> determine for "special"
> types such as QNames).
> 
> 
> And the facets can be classified into 3 different categories:
> 
> 1) xs:whitespace
> 
> This facet is the
> definition of the
> whitespace processing
> taking place between the
> parser space and the
> lexical space.
> 
> 
> 2) xs:pattern
> 
> This is the only facet constraining the lexical space.
> 
> 3) Other facets
> 
> They are all constraining the value space.
> 
> Would you agree with this summary?

Yes, subject to minor quibbles above.

ht
-- 
  Henry S. Thompson, HCRC Language Technology Group, University of Edinburgh
          W3C Fellow 1999--2001, part-time member of W3C Team
     2 Buccleuch Place, Edinburgh EH8 9LW, SCOTLAND -- (44) 131 650-4440
	    Fax: (44) 131 650-4587, e-mail: ht@cogsci.ed.ac.uk
		     URL: http://www.ltg.ed.ac.uk/~ht/
Received on Tuesday, 23 October 2001 04:14:14 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 11 January 2011 00:14:24 GMT