Re: [charmod-norm] Arabic & Hebrew unique issues associated with sections 2.4 and 2.5

@tomerm This still doesn't make sense to me. What do you mean by 
"layout"? Do you mean application of the Unicode Bidirectional Algorithm? 

The encodings I mentioned are character encodings used to *store* the 
data. They are relevant to this discussion because, unlike character 
encodings that store data in logical order, they store the bytes 
representing each character in "teletype order" (the order in which a 
left-to-right line printer would output them). This can certainly make 
searching more difficult: the source and target text might need to be 
re-ordered before they can be compared, and things such as 
line-breaking may change the order of the characters in storage. 
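
To illustrate the re-ordering problem, here is a minimal Python sketch. It assumes a single, purely right-to-left Hebrew run with no digits, Latin text, or line breaks, so that "teletype order" is simply the reverse of logical order. The helper name `visual_to_logical_rtl_run` is hypothetical; real data would need a proper inverse of the Unicode Bidirectional Algorithm rather than a plain reversal.

```python
def visual_to_logical_rtl_run(visual: str) -> str:
    """Hypothetical helper: reorder a *pure right-to-left* run from
    visual ("teletype") order to logical order.

    Assumption: no mixed-direction text, digits, or line breaks, so a
    plain reversal is enough. Anything else needs the full Unicode
    Bidirectional Algorithm.
    """
    return visual[::-1]


# "shalom" in logical order: shin, lamed, vav, final mem
logical = "\u05E9\u05DC\u05D5\u05DD"

# The same word as a visual-order encoding would store it: the
# characters appear in the order a left-to-right printer emits them.
visual = logical[::-1]

# Comparing the stored sequences directly fails...
naive_match = (visual == logical)

# ...but succeeds once both sides are in logical order.
reordered_match = (visual_to_logical_rtl_run(visual) == logical)
```

In this sketch `naive_match` is false and `reordered_match` is true, which is the sense in which a search may need to re-order the source or target text before comparing.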

If this is what you mean, note that the layout issues don't affect 
text comparison in this document: once both the source and target 
text have been converted to Unicode, all of the presentational 
variation has (hopefully) been resolved. If it hasn't, the text will 
be muddled.


-- 
GitHub Notification of comment by aphillips
Please view or discuss this issue at 
https://github.com/w3c/charmod-norm/issues/80#issuecomment-208018520 
using your GitHub account

Received on Sunday, 10 April 2016 17:11:42 UTC