Re: [Ltru] RE: For review: Tagging text with no language

On Thu, 12 Apr 2007, Mark Davis wrote:

[ about text that is in an unknown language but a known script ]

> For that, I'd suggest und-Latn (or whatever the script is). Since only 
> languages would have scripts, that is sufficiently determinate.

I'm not so sure about it; it depends on what "script" really means. If the 
data is "JuUiYTlajUJO", which is not in any language as far as I know, 
can't we still say that it is in the Latin script?
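
To make the question concrete, here is a minimal Python sketch (my own 
illustration, not anything from the draft). It assumes that the first word 
of a character's Unicode name, such as "LATIN", is a usable stand-in for its 
script, which is only a rough heuristic. The helper names and the small 
mapping table are hypothetical.

```python
import unicodedata

# Hypothetical, deliberately tiny mapping from Unicode name prefixes
# to ISO 15924 script codes; a real tool would use full script data.
NAME_PREFIX_TO_SCRIPT = {
    "LATIN": "Latn",
    "CYRILLIC": "Cyrl",
    "GREEK": "Grek",
}


def scripts_used(text):
    """Return the set of name prefixes for the letters in text.

    Assumption: the first word of unicodedata.name(ch) approximates
    the character's script. Non-letters are ignored.
    """
    prefixes = set()
    for ch in text:
        if ch.isalpha():
            name = unicodedata.name(ch, "")
            prefixes.add(name.split(" ")[0] if name else "UNKNOWN")
    return prefixes


def tag_for(text):
    """Build a tag with an undetermined language and, if possible, a script.

    Returns "und-<Script>" when all letters share one known script,
    and plain "und" otherwise. No language detection is attempted.
    """
    prefixes = scripts_used(text)
    if len(prefixes) == 1:
        script = NAME_PREFIX_TO_SCRIPT.get(next(iter(prefixes)))
        if script:
            return "und-" + script
    return "und"


print(tag_for("JuUiYTlajUJO"))
```

The script is determined from the characters alone, so the sketch labels the 
string above with the Latin script without deciding whether it is in any 
language at all.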

-- 
Jukka "Yucca" Korpela, http://www.cs.tut.fi/~jkorpela/

Received on Thursday, 12 April 2007 17:29:27 UTC