- From: Anne van Kesteren <notifications@github.com>
- Date: Mon, 24 Mar 2025 01:59:07 -0700
- To: whatwg/url <url@noreply.github.com>
- Cc: Subscribed <subscribed@noreply.github.com>
Received on Monday, 24 March 2025 08:59:11 UTC
@annevk commented on this pull request.

> @@ -2038,8 +2038,9 @@ and <a>code points</a> in the range U+00A0 to U+10FFFD, inclusive, excluding <a>
>  <!-- IRI also excludes the ranges U+E000 to U+F8FF, U+FFF0 to U+FFFD, and U+E0000 to U+E09FF, all inclusive. We don't to align with HTML. -->
>
> -<p class=note>Code points greater than U+007F DELETE will be converted to
> -<a lt="percent-encoded byte">percent-encoded bytes</a> by the <a>URL parser</a>.
> +<p class=note>For historical reasons, rather than storing codepoints and [=byte/percent-encoding=]
> +to ASCII for serialization, URLs instead store their value as ASCII internally, eagerly converting
> +code points greater than U+007F DELETE to [=percent-encoded bytes=] during [=URL parser|parsing=].

Apologies for not responding to this more quickly, but I think I never ended up merging it because I'm not sure this is correct. I suspect one could convert at serialization time instead. It's just not how the specification is written.

--
Reply to this email directly or view it on GitHub:
https://github.com/whatwg/url/pull/804#pullrequestreview-2709616364
You are receiving this because you are subscribed to this thread.

Message ID: <whatwg/url/pull/804/review/2709616364@github.com>
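
For illustration only, the two designs contrasted in the comment above (converting during parsing versus converting at serialization time) can be sketched as follows. This is a minimal sketch and not the URL Standard's algorithm: `percent_encode_non_ascii`, `EagerPath`, and `LazyPath` are hypothetical names introduced here, and the sketch assumes UTF-8 encoding and handles only code points greater than U+007F, ignoring the per-component percent-encode sets.

```python
def percent_encode_non_ascii(text: str) -> str:
    """Percent-encode the UTF-8 bytes of every code point above U+007F."""
    out = []
    for ch in text:
        if ord(ch) > 0x7F:
            # Each UTF-8 byte becomes "%" followed by two uppercase hex digits.
            out.extend(f"%{byte:02X}" for byte in ch.encode("utf-8"))
        else:
            out.append(ch)
    return "".join(out)


class EagerPath:
    """Hypothetical model: convert while parsing, store ASCII only."""

    def __init__(self, raw: str) -> None:
        self.stored = percent_encode_non_ascii(raw)

    def serialize(self) -> str:
        return self.stored


class LazyPath:
    """Hypothetical model: store code points, convert when serializing."""

    def __init__(self, raw: str) -> None:
        self.stored = raw

    def serialize(self) -> str:
        return percent_encode_non_ascii(self.stored)


eager = EagerPath("/caf\u00e9")
lazy = LazyPath("/caf\u00e9")
# The internal values differ, but both models serialize identically.
print(eager.stored, lazy.stored, eager.serialize() == lazy.serialize())
```

Under these assumptions the two models are indistinguishable from their serialized output; they differ only in what is held internally, which is the distinction the review comment raises.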