Re: [whatwg/url] Refusing a mix of numeric-only and BIDI domains (#543)

The point on empty labels  is actually not right.
The string "a.b.c." (trailing dot) does not represent a DNS name with an empty label; it is a syntactic convention saying "we know c is a top level domain, don't try to append your search path elements to it in order to find it".

It's largely fallen out of use.

My suggestion for a solution would be to add text in the URL standard as follows:

> The IDNA2008 standard, in RFC 5893, gives a rule for evaluating whether or not a single label is suitable for use in a BIDI domain name, and some advice for applications.
>
> RFC 5893 defines the terms "RTL label", "LTR label", "Bidi domain name", and "Bidi rule".
> 
> Based on this advice, the following domain names will be accepted by the URL standard:
> * Domains containing only labels that obey the Bidi rule
> * Domains containing RTL labels followed by an LTR label consisting only of ASCII characters, where the first character is not a digit.
> 

That should be the necessary and sufficient rules for ensuring that display of domain names using the Unicode bidi algorithm don't contain characters that "jump the dot".




-- 
Reply to this email directly or view it on GitHub:
https://github.com/whatwg/url/issues/543#issuecomment-1384923846
You are receiving this because you are subscribed to this thread.

Message ID: <whatwg/url/issues/543/1384923846@github.com>

Received on Tuesday, 17 January 2023 07:08:02 UTC