Re: [jlreq] [META] Reorganize character classes and its adoption of Unicode based definition (#240)

I admit I have not read this w/ enough detail, but skimming this discussion it occurred to me that the problem is similar on a meta level to the Unicode IndicSyllabicCategories and IndicPositionalCategories, and of course the Unicode Vertical text properties.

How about proposing a set of categories to Unicode, with some defined as "derived" (some algorithmic combination of existing Unicode properties) and some explicitly assigned as a - more fine grained - override).

Done that way, the end result would be a reliance on formal Unicode properties, but also, inside Unicode, the established derivations would surface any changes that might be (inadvertently) introduced by changes in the underlying Unicode properties (like general category or line break). If such properties must change in Unicode for some reason, it would be possible to adjust the derivation or attach an override to keep the layout properties unchanged. On the contrary, if/when the layout properties need to be changed/corrected, that can be done by changing a derivation, changing and override or changing an underlying Unicode property (if appropriate).

Getting this done may require that a Unicode technical report draft is created that defines the relation between standard Unicode properties and the (partially derived) layout properties.


-- 
GitHub Notification of comment by asmusf
Please view or discuss this issue at https://github.com/w3c/jlreq/issues/240#issuecomment-715658610 using your GitHub account


-- 
Sent via github-notify-ml as configured in https://github.com/w3c/github-notify-ml-config

Received on Saturday, 24 October 2020 02:24:03 UTC