Re: [charmod-norm] Arabic & Hebrew unique issues associated with sections 2.4 and 2.5

> If this is what you mean, note that the layout issues don't affect
> text comparison in this document, since once both the source and
> target text have been converted to Unicode all of the presentational
> variation has (hopefully) been resolved (or the text will be muddled).

Indeed, this is one of the use cases I had in mind. But please note 
that code page conversion to Unicode alone might not resolve all the 
issues. Bidi text, specifically, must also be transformed to logical 
bidi layout. That is all this comment is about: **when one converts 
text to Unicode, one must also ensure that the text is transformed to 
logical bidi layout.** Unfortunately, this does not happen 
automatically. One of the reasons is that bidi layout can't be guessed 
just by looking at the text. Think of palindromes such as "swap paws": 
is the text running from left to right or from right to left?
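
To illustrate the difference between converting the encoding and converting the layout, here is a minimal Python sketch. It assumes a hypothetical legacy source that stored a Hebrew word in visual order in ISO-8859-8; the helper `visual_rtl_run_to_logical` is a name invented for this example and handles only a single run of strong right-to-left characters. Real text with mixed directions, digits or punctuation would need a proper bidi implementation rather than a plain reversal.

```python
import unicodedata

# Logical order, i.e. the order in which the word is read and typed:
# shin, lamed, vav, final mem.
logical = "\u05e9\u05dc\u05d5\u05dd"

# Assumption: a legacy system stored the word in visual order
# (leftmost glyph first), encoded in ISO-8859-8.
visual_bytes = logical[::-1].encode("iso8859_8")

# Code page conversion alone: the characters are now Unicode,
# but their order is still the visual order of the source.
decoded = visual_bytes.decode("iso8859_8")


def visual_rtl_run_to_logical(text: str) -> str:
    """Naive layout conversion for one run of strong RTL characters.

    Hypothetical helper for illustration only. It refuses anything
    that is not purely strong right-to-left (bidi class R or AL).
    """
    if not all(unicodedata.bidirectional(ch) in ("R", "AL") for ch in text):
        raise ValueError("mixed-direction text needs a real bidi implementation")
    return text[::-1]


# The explicit layout step is what restores logical order.
restored = visual_rtl_run_to_logical(decoded)
```

Here `decoded` is valid Unicode yet still differs from `logical`; only the separate layout step makes the two compare equal. Note that the caller had to know the source was in visual order: nothing in the decoded characters themselves says so, which is the palindrome problem described above.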

-- 
GitHub Notification of comment by tomerm
Please view or discuss this issue at 
https://github.com/w3c/charmod-norm/issues/80#issuecomment-208034767 
using your GitHub account

Received on Sunday, 10 April 2016 18:16:19 UTC