Re: [whatwg/webidl] “Unicode character” should likely say “Unicode scalar value” in intro to lexical grammar (Issue #1080) from Fuqiao Xue on 2021-12-28 (public-webapps-github@w3.org from December 2021)

From: Fuqiao Xue <notifications@github.com>
Date: Mon, 27 Dec 2021 19:34:21 -0800
To: whatwg/webidl <webidl@noreply.github.com>
Cc: Subscribed <subscribed@noreply.github.com>
Message-ID: <whatwg/webidl/issues/1080/1001851049@github.com>

FYI - here's some related information from other places:

In [Infra](https://infra.spec.whatwg.org/#code-points):

> Code points are sometimes referred to as characters and in certain contexts are prefixed with "0x" rather than "U+".

In [CSS](https://drafts.csswg.org/css-syntax-3/#tokenization):

> To **tokenize** a stream of code points into a stream of CSS tokens *input*, repeatedly consume a token from *input* until an &lt;EOF-token&gt; is reached, pushing each of the returned tokens into a stream.

It seems that they have not ruled out the surrogate code points, but I am not sure how the situation here is. See also [Internationalization Best Practices for Spec Developers](https://w3c.github.io/bp-i18n-specdev/#characters).


-- 
Reply to this email directly or view it on GitHub:
https://github.com/whatwg/webidl/issues/1080#issuecomment-1001851049
You are receiving this because you are subscribed to this thread.

Message ID: <whatwg/webidl/issues/1080/1001851049@github.com>

Received on Tuesday, 28 December 2021 03:34:33 UTC