Re: [whatwg/webidl] “Unicode character” should likely say “Unicode scalar value” in intro to lexical grammar (Issue #1080)

FYI, here is some related wording from other specifications:

In [Infra](https://infra.spec.whatwg.org/#code-points):

> Code points are sometimes referred to as characters and in certain contexts are prefixed with "0x" rather than "U+".

In [CSS](https://drafts.csswg.org/css-syntax-3/#tokenization):

> To **tokenize** a stream of code points into a stream of CSS tokens *input*, repeatedly consume a token from *input* until an <EOF-token> is reached, pushing each of the returned tokens into a stream.

Neither passage appears to rule out surrogate code points, but I am not sure what the situation is for Web IDL. See also [Internationalization Best Practices for Spec Developers](https://w3c.github.io/bp-i18n-specdev/#characters).
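To make the distinction concrete, here is a minimal sketch of the difference between a code point and a scalar value, following Infra's definitions (a code point is U+0000 to U+10FFFF, a surrogate is U+D800 to U+DFFF, and a scalar value is a code point that is not a surrogate). The function names are my own and are not taken from any spec:

```python
# Illustrative only: the helper names below are hypothetical, not spec terms.
SURROGATE_FIRST = 0xD800
SURROGATE_LAST = 0xDFFF
MAX_CODE_POINT = 0x10FFFF


def is_code_point(value: int) -> bool:
    """True for any value in the range U+0000 to U+10FFFF, inclusive."""
    return 0 <= value <= MAX_CODE_POINT


def is_surrogate(value: int) -> bool:
    """True for values in the range U+D800 to U+DFFF, inclusive."""
    return SURROGATE_FIRST <= value <= SURROGATE_LAST


def is_scalar_value(value: int) -> bool:
    """A scalar value is a code point that is not a surrogate."""
    return is_code_point(value) and not is_surrogate(value)


def all_scalar_values(text: str) -> bool:
    """True if every code point in the string is a scalar value."""
    return all(is_scalar_value(ord(ch)) for ch in text)
```

With these definitions, `is_code_point(0xD800)` is true while `is_scalar_value(0xD800)` is false, so an input defined as "a stream of code points" admits lone surrogates, whereas one defined in terms of scalar values does not.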


-- 
Reply to this email directly or view it on GitHub:
https://github.com/whatwg/webidl/issues/1080#issuecomment-1001851049

Message ID: <whatwg/webidl/issues/1080/1001851049@github.com>

Received on Tuesday, 28 December 2021 03:34:33 UTC