Re: non-XML characters (e.g. #1)

On Mon, 2022-01-03 at 10:03 +0000, Steven Pemberton wrote:
> . (If I were 
> redesigning XML, I would allow those characters, but only expressed
> in encoded form.)

That was in a draft for what became XML 1.1, along with C1 controls
(except for �). Unfortunately the primary use for them seemed to be
wrongly-encoded documents, and allowing C0 just moved the problems for
those documents.

Allowing all codepoints in input seems appropriate - e.g. convert a CSV
to XML where the separator is \1 (control-A, SOH).



If someone wants malformed XML as output, maybe that should be allowed
too?

-- 
Liam Quin, https://www.delightfulcomputing.com/
Available for XML/Document/Information Architecture/XSLT/
XSL/XQuery/Web/Text Processing/A11Y training, work & consulting.
Barefoot Web-slave, antique illustrations:  http://www.fromoldbooks.org

Received on Monday, 3 January 2022 15:12:17 UTC