- From: Kevin Smathers <ks@micky.hpl.hp.com>
- Date: Wed, 22 Jan 2003 11:24:04 -0800
- To: "Stephen K. Rhoads" <rhoads@thrupoint.net>
- Cc: cco@dydax.com, www-rdf-interest@w3.org
Hi Stephen,
Use the Unicode::Lite module from CPAN.
require Unicode::Lite;
*lat2uni = convertor( 'latin1', 'utf8' );
print lat2uni($text);
Cheers,
-kls
On Wed, Jan 22, 2003 at 01:25:06PM -0500, Stephen K. Rhoads wrote:
>
> So, I added the "iso-8859-1" encoding declaration, and it worked, but ONLY
> when I retrieved the RDF document from a web server using the "Parse URI"
> feature in the RDF Validator. When I cut and paste via a browser window, I
> get the same error. Any thoughts as to why?
>
> Also, I anticipate adding additional languages in the future which go beyond
> the characters in 8859. Thus I would prefer to generate files encoded in
> UTF-8. Any tips on how to do this? I'm using PERL and various text editors
> to generate my XML.
>
> --- Stephen
>
>
> ----- Original Message -----
> From: "Chris Olds" <colds@dydax.com>
> To: "Stephen K. Rhoads" <rhoads@thrupoint.net>
> Cc: <www-rdf-interest@w3.org>
> Sent: Tuesday, January 21, 2003 8:11 PM
> Subject: Re: Diacritic Signs
>
>
>
> On Tue, 21 Jan 2003, Stephen K. Rhoads wrote:
> >
> > Does RDF (or XML, I suppose) have a problem with diacritic signs? The W3
> > RDF Validator chokes on the "é" in "République" in the RDF fragment below.
>
> The default encoding for XML is UTF-8. The "é" in "République" has the
> high bit set, but is not a legal UTF-8 sequence (because it is, in fact,
> a simple 8859-1 (or perhaps -15) accented character).
>
> Solutions: Add an encoding declaration to your XML file, or encode it as
> UTF-8.
>
> /cco
--
// .--=,
.....::://::::::::::::::::::::::::::::.. (o O & kevin_smathers@hp.com
:::::::://:::://://://:/:://::||_// / V K
:::::://:::://:/:|//'/' // _,|' r , 'qk
:'''/____ // / // |_// // || .'~. .~`,
kls \_/-=\_/
Received on Wednesday, 22 January 2003 14:19:55 UTC