Re: [string-search] Updates including Kashmiri examples from Richard (#14)

> In an additional twist to this story, two diacritics with different code points could be used here. In our previous example we used ं [U+0902 DEVANAGARI SIGN ANUSVARA ] to represent the nasal sound because the accompanying vowel-sign rises above the hanging baseline. If the vowel-sign was one that didn't rise above the hanging baseline, we would normally use ँ [U+0901 DEVANAGARI SIGN CANDRABINDU ] instead. The function of both of these diacritics is the same, but their code points are different.

I think this gives the wrong impression. The anusvara in 'hindi' represents an 'n' sound, rather than nasalisation. Candrabindu is a _nasalisation_ mark, and not an alternative for a syllable-final 'n' (there is no nasalisation in 'hindi').  In principle, the anusvara is an alternative for candrabindu when space is constrained, but not the other way around. That said, it does sometimes appear to be the case that some words _are_ spelled as if they were nasalised, or alternatively as if they had a syllable-final nasal, eg. snake at https://r12a.github.io/scripts/devanagari/hi.html#nasalisation  I don't believe that applies to the word 'hindi' though.

-- 
GitHub Notification of comment by r12a
Please view or discuss this issue at https://github.com/w3c/string-search/pull/14#issuecomment-1330828559 using your GitHub account


-- 
Sent via github-notify-ml as configured in https://github.com/w3c/github-notify-ml-config

Received on Tuesday, 29 November 2022 15:29:36 UTC