- From: Steven Pemberton <steven.pemberton@cwi.nl>
- Date: Wed, 30 Nov 2016 09:36:26 +0100
- To: "public-xformsusers@w3.org" <public-xformsusers@w3.org>, "Erik Bruchez" <erik@bruchez.org>
- Message-ID: <op.yrqfi0c4smjzpq@steven-aspire-s7>
You're right, it's not clear. When I wrote this, I meant 9, A and D to be included "characters and escapes that have no equivalent XML character" was intended to mean that you don't have to specially encode, 9 A and D. I propose changing it to read "and most characters of the form \uxxxx less than \u0020". Does that cover it sufficiently? Or should we list 9, A and D explicitly? Steven On Tue, 29 Nov 2016 20:22:49 +0100, Erik Bruchez <erik@bruchez.org> wrote: > All, > > Currently, the spec says: > > "characters and escapes that have no equivalent XML character (\b, > \f, and >characters of the form \uxxxx less than \u0020) are transformed > by adding >\uE000 to them." > > The sentence contradict itself because in XML, the following characters > below >\u0020 are supported: > > - \u0009 > - \u000A > - \u000D > > So we should clarify this, and I suggest that we allow keeping the 3 > characters >above. Consider this piece of JSON: > > { > "firstName": "John", > "lastName": "Smith", > "address": "1000 Main Street\nNew York, NY" > } > > The `\n` in "address" translates to a newline `\u000A`. If we translate > it to >`\uE00A`, it becomes unnecessary inconvenient to handle the > newline on the XML >side. > > Conversely, when converting back from XML to JSON, a `\u000A` in the XML > must >translate into `\n` in the resulting JSON. > > For reference this was raised by a user. Details here: > > https://github.com/orbeon/orbeon-forms/issues/3012 > > Feedback welcome. > > -Erik
Received on Wednesday, 30 November 2016 08:37:10 UTC