Re: [shex] Round tripping language tags case from Eric Prud'hommeaux via GitHub on 2017-10-23 (public-shex-dev@w3.org from October 2017)

From: Eric Prud'hommeaux via GitHub <sysbot+gh@w3.org>
Date: Mon, 23 Oct 2017 11:48:08 +0000
To: public-shex-dev@w3.org
Message-ID: <issue_comment.created-338633541-1508759287-sysbot+gh@w3.org>

I wasn't worried about the validation, just what exactly how to specify the canonical form. I guess you have something in mind like:

> When emiting a ShEx schema, language tags in that schema SHOULD be in the the <a>canonical language tag form</a> in order to comply with [[!BCP47]] section @@!.
> A language tag is in <dfn>canonical language tag form</dfn> if a language tag is split on '-' into a set of sequences and the following rules applied before it is joined again on '-':
> 
> - Each two-letter sequence following a sequence of two or more letters is in uppercase, e.g. `ab-CD-EF-ghi`
> - Each four-letter sequence following a sequence of two or more letters is in title case. e.g. `ab-Cdef-Ghij`

Where in BCP47 do the capitalization rules come from? Can we justify the rules above?

-- 
GitHub Notification of comment by ericprud
Please view or discuss this issue at https://github.com/shexSpec/shex/issues/73#issuecomment-338633541 using your GitHub account

Received on Monday, 23 October 2017 11:48:11 UTC