W3C home > Mailing lists > Public > www-international@w3.org > January to March 2005

Re: XMLLiterals and language

From: Frank Manola <fmanola@acm.org>
Date: Wed, 19 Jan 2005 12:19:00 -0500
Message-ID: <41EE9684.30609@acm.org>
To: Jeremy Carroll <jjc@hplb.hpl.hp.com>
CC: Reto Bachmann-Gmuer <reto@gmuer.ch>, Martin Duerst <duerst@w3.org>, www-rdf-interest@w3.org, www-international@w3.org

Jeremy--

As an American mother-tongue speaker, I'm not sure how I'm to parse your
first sentence :-)

Nevertheless, isn't how the text is to be pronounced a separate issue?
It seems to me some separate metadata would be needed to adequately deal
with (a) what language it is ("chat" in English vs. French is an
oft-used example) vs. (b) how it's to be pronounced;  e.g., the same
English speech could be read in fluent English for a serious effect vs.
"broken" English in a comedy routine (and this without even considering
stuff like how native Texans and New Englanders pronounce ordinary
American English!)  Does overloading all this onto xml:lang really make
sense?

--Frank

Jeremy Carroll wrote:
> 
> 
> I am not at all convinced that this issue is irrelevant outside the
> semantic web domain. e.g. a text-to-speech system should, pronounce
> english words quite differently when in an italian mode, since italian
> speakers typically use italian pronounciation rules for english words
> being used in italian sentences. As an English mother-tongue speaker,
> with reasonable italian the most difficult sentences I find to
> understand are such mixed sentences.
> 
> <span xml:lang="it">
> Abbiamo fatto questo lavoro per il progetto
> <span xml:lang="en">"Question How"</span>
> </span>
> 
> the words "question how" are pronounced quite differently from in
> English (even when the mother tongue italian speaker is a fluent english
> speaker). (bitter experience here!)
> 
> Jeremy
Received on Wednesday, 19 January 2005 17:12:13 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 19:17:04 GMT