- From: asmusf via GitHub <sysbot+gh@w3.org>
- Date: Sun, 07 Feb 2016 03:20:00 +0000
- To: public-i18n-archive@w3.org
On 2/6/2016 5:45 PM, aphillips wrote:
>
> So... I'm aware of this. I suppose we probably should mention, for
> example, something like the "Paypal" bug, e.g, U+03A1, U+0420, and
> U+0050 (P) all look absolutely identical but are unrelated.
>
> I have changed the Section 2.2 introduction to be slightly more
> technically accurate. I also added a Note Well box.
>
> @asmusf <https://github.com/asmusf> I partially copied your reply
> above to form part of the note. (I also added you to the
> acknowledgements list).
>
> Please consider the changes and see if these address the problem.
>
> —
> Reply to this email directly or view it on GitHub
>
<https://github.com/w3c/charmod-norm/issues/69#issuecomment-180912036>.
>
You write:
Obviously, "confusable"characters like this can present spoofing
and
other security risks.For more information, see [[UTR39]].
First, the example in this case is not merely about characters that
are
"confusable" - a term that encompasses a wide spectrum of similarity
under a wide variation of circumstances and involving assumptions
about
human perception - but it is more precisely about characters being
"homoglyphs" (in fact, strict homoglyphs, with an appearance that is
identical in all practical scenarios).
The distinction matters, because those that feel that normalization
"should" have addressed certain issues are fine with "mere"
similarities
handled differently.
Second, the example may be "famous" but involves only a subset of
homoglyphs. There are some other examples of homoglyphs that are in
the
same script. I don't think you need to give examples of both; but it
would not be amiss to add that this effect does not require separate
scripts.
My suggested replacement:
==> Similar examples of identical appearance also exist within a
single
script. Because these characters have with an appearance that is
identical for all practical purposes they are an extreme manifestation
of "confusable" characters, which can represent...
(Or you can find a way to break that sentence in two).
A./
--
GitHub Notification of comment by asmusf
Please view or discuss this issue at
https://github.com/w3c/charmod-norm/issues/69#issuecomment-180928288
using your GitHub account
Received on Sunday, 7 February 2016 03:20:02 UTC