- From: Henri Sivonen <notifications@github.com>
- Date: Fri, 01 Mar 2024 01:27:14 -0800
- To: whatwg/url <url@noreply.github.com>
- Cc: Subscribed <subscribed@noreply.github.com>
- Message-ID: <whatwg/url/issues/818/1972826176@github.com>
After looking into this more, I think the right abstraction would be for UTS 46 to take an ASCII deny list instead of taking a boolean flag for STD3 rules. What the deny list can modify should probably be constrained so that denying ASCII letters, digits, hyphen or full-stop would not be allowed. I think it would simplify data quite a bit if the caller of UTS 46 was not permitted to allow the ASCII space. (I am not aware of use cases for permitting ASCII space in domain name-like things, and the characteristics of the output get weird if space is allowed.) But whether the rest of ASCII is allowed or denied could be customizable by the caller of UTS 46, and I think acting on that deny list should belong in the UTS 46 algorithms and not in the algorithms in URL. So far, I'm not aware of more than two relevant configurations: the STD3 list (deny everything that I didn't list as must-allow above) and the WHATWG list ("forbidden domain code point"). So far, in the code I'm writing, I'm supporting only these two options. I'm thinking of sending UTS 46 feedback to this effect. @annevk, what do you think? -- Reply to this email directly or view it on GitHub: https://github.com/whatwg/url/issues/818#issuecomment-1972826176 You are receiving this because you are subscribed to this thread. Message ID: <whatwg/url/issues/818/1972826176@github.com>
Received on Friday, 1 March 2024 09:27:18 UTC