- From: Anne van Kesteren <notifications@github.com>
- Date: Mon, 24 Mar 2025 01:59:07 -0700
- To: whatwg/url <url@noreply.github.com>
- Cc: Subscribed <subscribed@noreply.github.com>
Received on Monday, 24 March 2025 08:59:11 UTC
@annevk commented on this pull request.
> @@ -2038,8 +2038,9 @@ and <a>code points</a> in the range U+00A0 to U+10FFFD, inclusive, excluding <a>
<!-- IRI also excludes the ranges U+E000 to U+F8FF, U+FFF0 to U+FFFD, and U+E0000 to U+E09FF, all
inclusive. We don't to align with HTML. -->
-<p class=note>Code points greater than U+007F DELETE will be converted to
-<a lt="percent-encoded byte">percent-encoded bytes</a> by the <a>URL parser</a>.
+<p class=note>For historical reasons, rather than storing codepoints and [=byte/percent-encoding=]
+to ASCII for serialization, URLs instead store their value as ASCII internally, eagerly converting
+code points greater than U+007F DELETE to [=percent-encoded bytes=] during [=URL parser|parsing=].
Apologies for not responding to this more quickly. I think I never merged it because I'm not sure the note is correct. I suspect one could convert at serialization time instead; that's just not how the specification is written.
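To make the two options concrete, here is a minimal sketch, not the specification's algorithm. It assumes a single path-like string and only handles code points above U+007F; the real URL parser applies a different percent-encode set to each component and treats hosts separately. The names `percent_encode_non_ascii`, `EagerPath`, and `LazyPath` are hypothetical, introduced only for illustration.

```python
def percent_encode_non_ascii(text: str) -> str:
    """Percent-encode the UTF-8 bytes of every code point above U+007F."""
    pieces = []
    for ch in text:
        if ord(ch) > 0x7F:
            # One "%XX" triplet per UTF-8 byte of the code point.
            pieces.extend(f"%{byte:02X}" for byte in ch.encode("utf-8"))
        else:
            pieces.append(ch)
    return "".join(pieces)


class EagerPath:
    """Converts while parsing, so only ASCII is stored (how the spec is written)."""

    def __init__(self, raw: str) -> None:
        self.stored = percent_encode_non_ascii(raw)

    def serialize(self) -> str:
        return self.stored


class LazyPath:
    """Stores code points as given and converts only when serializing."""

    def __init__(self, raw: str) -> None:
        self.stored = raw

    def serialize(self) -> str:
        return percent_encode_non_ascii(self.stored)


eager = EagerPath("/caf\u00e9")
lazy = LazyPath("/caf\u00e9")
print(eager.stored)                           # /caf%C3%A9
print(lazy.stored)                            # /café
print(eager.serialize() == lazy.serialize())  # True
```

In this simplified model both strategies serialize identically and differ only in what is stored. Whether that also holds for the full parser is exactly the open question here.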
--
Reply to this email directly or view it on GitHub:
https://github.com/whatwg/url/pull/804#pullrequestreview-2709616364
Message ID: <whatwg/url/pull/804/review/2709616364@github.com>