- From: r12a <ishida@w3.org>
- Date: Wed, 8 Apr 2026 05:56:29 +0100
- To: Internationalization Working Group <public-i18n-core@w3.org>
- Message-ID: <dfaccfb9-1132-9288-6825-48b43aa83551@w3.org>
Here's some info i sent on the original email thread. ri > *From:* r12a <ishida@w3.org> > *Date:* 7 April 2026 at 19:50 > *To:* Steven Hoober <steven@4ourth.com> > *CC:* Fuqiao Xue <xfq@w3.org>, atsushi@w3.org, bert@w3.org > *Subject:* Our AG subgroup could use some Internationalization > Steven Hoober wrote on 07/04/2026 18:46: > > Steven, before this goes much further, please switch the discussion to > a GH issue. We find it difficult to follow and share information from > emails, and we cannot fit them into our review management system. You > could raise an issue in the WCAG repo of your choice and add the > i18n-track label — that will alert us to the issue automatically. > > I suggest you add your questions to the initial comment, and then add > what i'm going to say in another comment. > > > > > Rather than just generally talk about diacritics, i think you may need > to be more specific about what diacritics are relevant here. Leaving > aside the Hawaiian example for now (which i'd like to understand > better), the type of diacritic you seem to be mostly concerned with > are those that indicate vowel sounds in scripts of type 'abjad'. > These scripts include Arabic, Hebrew, and Classical Syriac. Note > carefully that these are scripts, not languages. The Arabic script is > used to write the largest number of languages in the world, after > Latin script. You can find lists of languages used Arabic and Hebrew > scripts at > https://www.w3.org/International/questions/qa-scripts.en.html#how-many-people > > However, it is important to note that not all language orthographies > using the Arabic script hide their diacritics. Languages that hide > the diacritics include Arabic (and its dialects), Pashto, Persian, > Saraiki, Sindhi, and Urdu, etc. Other languages using the Arabic > script always show all diacritics include Fulfulde, Hausa, Kashmiri, > and Wolof, etc. Other languages using the Arabic script don't actually > use diacritics at all to represent vowel sounds; these include Sorani, > Uighur, etc. > > It is important to also recognise that not all marks on the page that > look like diacritics are ignorable, even in abjads. See this > description of the difference between ijam and tashkil > https://r12a.github.io/scripts/arab/homographs#ijam_tashkil. (It's the > tashkil, only, that get hidden in Arabic script.) > > It's also worth noting that Hebrew has 3 spelling variants, one of > which is possibly worth considering as an alternative to flipping > diacritics on and off. See > https://r12a.github.io/scripts/hebr/he.html#spelling. There are also > some diacritics that are more useful than others for disambiguating > sounds. If you have time, you can read more about that in the article > just pointed to. > > Classical Syriac is an abjad, but is not much used these days. Other > uses of the Syriac script include for Assyrian Neo-Aramaic and Turoyo > communities. Their orthographies generally preserve the diacritics, > and so are not abjads. > > > > > > Fine points on your Google doc: > > in the text > > * > > ملاك, pronounced malaak, means “angel” > > * > > مَلك, pronounced malak, is an archaic version that means “angel” > mostly in religious texts > > * > > ملك, pronounced malik, means “king” > > the difference between the first bullet and the second involves letter > changes, not diacritic changes — so it's not a good example. I may be > able to come up with something better, if you need. > > The 3rd bullet doesn't even show the kasra diacritic that indicates > the i. To contrast those properly you'd need to show: > > • مَلَك > • مَلِك > > hope that helps > ri >
Received on Wednesday, 8 April 2026 04:56:32 UTC