Re: For review: Tagging text with no language

Mark Davis wrote:

>                                   mul = "Multiple
>                mul, if the        languages"
>                protocol only      maybe also
>  chat          permits a          others, since
>                single tag         "chat" has
>                <en, fr>           entered the
>                otherwise          vocabulary of
>                                   many languages

Depends on the context.  If the context is something in the
direction of your comment it's fine.  But if the context is 
"I don't know if that's about a chat or a cat" I'd use "und".

>                                   some language
>                                   the process
>  Igonda flatunicai vbinkli?  mis  recognizes, but
>                                   which is not in
>                                   BCP 47

That would be wrong for almost all languages not yet in the
registry belonging to another collection like "ger".

>                                    something the
>                                    process
>                                    recognizes as
>                                    having
>                                    linguistic
>  podstatné jméno            mis  content, and
>                                    might be in BCP
>                                    47, but it
>                                    doesn't know
>                                    which language
>                                    it is.

If it doesn't know that it can use "und", abusing "mis" is
no option.  I think "art" is about artificial languages for
humans or in fiction, not for programming languages.

Frank

Received on Friday, 13 April 2007 17:20:29 UTC