- From: Domenico Cuciti <domeniko@gmail.com>
- Date: Fri, 11 Mar 2016 19:55:17 +0000
- To: www-voice@w3.org
- Message-ID: <CAEt4PGcxwhzid_TVO=q-_qdQp69s40Vi2WU4mQ5CMSc3tNRwHw@mail.gmail.com>
Hi,
I was trying to validate some PLS documents using the latest schema
available [1], that was referenced in the Appendix A of the PLS
recommendation [2], when I took a closer look at the schema itself and
found a discrepancy with the recommendation itself.
In particular:
the specification of the <lexeme> element requires at least one <grapheme>
and one among <alias>|<phoneme> [3].
Unfortunately the xsd schema defines the <lexeme> element as follows:
<xs:element name="lexeme">
<xs:complexType>
<xs:choice minOccurs="0" maxOccurs="unbounded">
<xs:element ref="p:grapheme"/>
<xs:choice>
<xs:element ref="p:phoneme"/>
<xs:element ref="p:alias"/>
</xs:choice>
<xs:element ref="p:example"/>
</xs:choice>
[...]
Which would validate as correctly formed lexemes the following:
<lexeme/>
<lexeme>
<example>lexeme with only an example</example>
</lexeme>
<lexeme>
<phoneme>lexeme with a phoneme but no grapheme</phoneme>
</lexeme>
<lexeme>
<alias>lexeme with an alias but no grapheme</alias>
</lexeme>
<lexeme>
<grapheme>grapheme without any alias nor phoneme</grapheme>
</lexeme>
These are all invalid PLS lexemes according to the recommendation.
I thought I'd share my findings with the workgroup.
Regards,
Domenico Cuciti
[1]: https://www.w3.org/TR/pronunciation-lexicon/pls.xsd
[2]: https://www.w3.org/TR/pronunciation-lexicon/#AppA
[3]: https://www.w3.org/TR/pronunciation-lexicon/#S4.4
Received on Monday, 14 March 2016 20:11:25 UTC