Re: non-XML characters (e.g. #1)

 > What do people think?
 >
 > What do we do about this?
 >
 > Is [#1 - #7e] a legal range?

Unequivocally yes.  It describes the input. If the input contains those 
characters, you have to be able to describe it.

Output, on the other hand, is a different issue, because of the XML 
misfeature of excluding most C0 characters from content. (If I were 
redesigning XML, I would allow those characters, but only expressed in 
encoded form.)

So, it is just fine to accept XML illegal characters in the input, as long 
as they don't end up in the output:

 illegal: -[#1-#7e].

And assuring those characters don't get through to the output is the 
grammar author's responsibility.

Steven

Received on Monday, 3 January 2022 10:04:17 UTC