Re: [web-annotation] Reference to text encoding in spec perhaps not appropriate from Rob Sanderson via GitHub on 2016-05-21 (public-annotation@w3.org from May 2016)

From: Rob Sanderson via GitHub <sysbot+gh@w3.org>
Date: Sat, 21 May 2016 09:09:01 +0000
To: public-annotation@w3.org
Message-ID: <issue_comment.created-220767424-1463821740-sysbot+gh@w3.org>

I accept the 4.2.5 distinction. The recommendation to use UTF-8 is 
because it MUST be that for recording in the JSON, but that is not 
part of the normalization before counting characters... which I think 
we have agreed to use code points for.

I disagree that normalization should be NFC, as HTML/XML whitespace 
normalization would not be included under that description, and 
there's no way for a browser client to undo the normalization that the
 HTML parser has already done when creating the DOM. (As far as I 
understand, those with more experience please correct me)

I'm happy to take out "and so forth", but unless there's an existing, 
accepted set of normalization operations we can refer to, I hope we 
can put that discussion off until we have closed more pressing issues.

-- 
GitHub Notification of comment by azaroth42
Please view or discuss this issue at 
https://github.com/w3c/web-annotation/issues/227#issuecomment-220767424
 using your GitHub account

Received on Saturday, 21 May 2016 09:09:02 UTC