Re: [whatwg/url] Refusing a mix of numeric-only and BIDI domains (#543)

Okay that makes sense. I think that precondition means that LTR in your second rule can be LDH as well (which guarantees ASCII).

And to be clear, there is the (unstated) precondition that these domains are Bidi domain names, right? As presumably we will not impose these requirements on non-Bidi domain names.

I think with that we'd recommend these changes to UTS 46:

1. Remove step 8 of https://unicode.org/reports/tr46/#Validity_Criteria as Validity Criteria only operates on a single label. (Although it somehow claims to have knowledge about the domain_name string as well...)
2. Add a new step 5 to https://unicode.org/reports/tr46/#Processing. (Note that due to step 4 we will have U-labels.)
   
   > 1. If _CheckBidi_, and the domain_name string is a Bidi domain name, record there was an error if neither of the following conditions is true:
   >    * All labels in the domain_name string satisfy the 6 subrules of The Bidi Rule of RFC 5893, Section 2.
   >    * RTL labels in the domain_name string are immediately followed by an LDH label whose first code point is not of class EN and all labels in the domain_name string are either LDH labels or satisfy the 6 subrules of The Bidi Rule of RFC 5893, Section 2.

I'd appreciate your review and of anyone else still paying attention. 😅

-- 
Reply to this email directly or view it on GitHub:
https://github.com/whatwg/url/issues/543#issuecomment-1420575665

You are receiving this because you are subscribed to this thread.

Message ID: <whatwg/url/issues/543/1420575665@github.com>

Received on Tuesday, 7 February 2023 10:53:44 UTC