- From: Steven Pemberton <steven.pemberton@cwi.nl>
- Date: Tue, 15 Mar 2022 14:39:12 +0000
- To: "C. M. Sperberg-McQueen" <cmsmcq@blackmesatech.com>, public-ixml@w3.org
- Message-Id: <1647354467151.1380525782.911820952@cwi.nl>
So an inclusion currently looks like this:
["0"-"9"; "+-()."; #d; L]
giving
<inclusion>
<range from='0' to='9'/>
<literal string="+-()."/>
<literal hex='d'/>
<class code='L'/>
</inclusion>
Since all meaningful (semantic) characters go in attributes, I propose
<inclusion>
<member from='0' to='9'/>
<member string='+-().'/>
<member hex='d'/>
<member class='L'/>
</inclusion>
@from and @to would continue to allow as now e.g. from='#d'
Look OK?
Steven
On Monday 14 March 2022 19:39:02 (+01:00), C. M. Sperberg-McQueen wrote:
> Unless I am reading the specification grammar wrong, inclusions and
> exclusions contain sets, which contain members, which can be literals.
>
> Inclusions, exclusions, and literals can all carry tmarks.
>
> What does it mean to have a tmark both on an inclusion or exclusion and
> on a member literal?
>
> {1} ['a'; 'b']
> {2} ['a'; -'b']
> {3} ['a'; ^'b']
> {4} -['a'; 'b']
> {5} -['a'; -'b']
> {6} -['a'; ^'b']
> {7} ^['a'; 'b']
> {8} ^['a'; -'b']
> {9} ^['a'; ^'b']
>
> If I am reading the grammar correctly, all of the above are
> gramamatical, but what do items 2-8 mean? (And especially 2, 6, and 8,
> where the two tmarks conflict?)
>
> I think it might be wise to revise the grammar so that tmarks are not
> allowed on character set members. Am I wrong?
>
> Michael
>
Received on Tuesday, 15 March 2022 14:39:34 UTC