W3C home > Mailing lists > Public > www-dom@w3.org > January to March 2001

RE: How can I solve LANGUAGE PROBLEM in XML?

From: Liz <liz@aebeh.com>
Date: Thu, 18 Jan 2001 20:36:13 -0500
To: "Jeffrey Hantin" <jhantin@verisign.com>, "Beynur COSAR" <bcosar@temtas.com.tr>
Cc: "WWW DOM" <www-dom@w3.org>
Message-ID: <MABBKPBNAOONLCPNPFPBOENPCAAA.liz@aebeh.com>
What you do to get around the non-latin Character problem depends on how
often those Characters are used in your XML Document.

You may want to look into the ISO Character Sets found here
http://www.altheim.com/specs/charents.html and xml:lang attribute and also,
the use of XML Entities.

To use the ISO Character Sets you insert the needed ISO number inside the
xml prolog (the first line)
<?xml version="1.0" encoding="ISO-8859-9" ?>
This is if your document is written in a language other than the regular
ASCII code or UNICODE Characters.

You would use the xml:lang for Element(tags) and their childElements that
are written in the other Character code. It is usually used in cases where
documents contain multiple languages (it overrides the language declaration
in the XML prolog)

Or you can list your Characters in a DTD and create Entities out of them
(the coversion code), then in the XML document you insert the &entity;
instead of the actual Characters.

Hope this helps
:-)
Liz

-----Original Message-----
From: www-dom-request@w3.org [mailto:www-dom-request@w3.org]On Behalf Of
Jeffrey Hantin
Sent: Thursday, January 18, 2001 7:12 PM
To: Beynur COSAR
Cc: WWW DOM
Subject: Re: How can I solve LANGUAGE PROBLEM in XML?


Beynur COSAR wrote:
>
> I'm new member from TURKEY.
> I try to develop XML database and its views.
>
> The XML didn't accept this writing type with those characters "ř–«‹›"
> ;
> ____________________________________________________
> <?xml version="1.0" ?>
>  <?xml-stylesheet type="text/css" href="style.css"?>
>   <note>
> <icinde_not>
>   <to>Mustafa «avuĢoglu</to>
>   <from>Beynur COřAR</from>
>   <heading>Reminder</heading>
>   <body>Don't forget me this weekend!</body>
> </note>
> _____________________________________________________
>
> What can I do in this situation? Thank you for your answers!
>

It's probably actually rejecting the unclosed <icinde_not> element.  You
can mark it as empty by inserting a '/' before the closing '>'.  Or,
if it really is the characters the parser is not accepting, you can
either replace them with character references, or specify the
character-set
in use in the xml declaration at the top.

--
Jeffrey Hantin
Received on Thursday, 18 January 2001 20:33:22 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Friday, 22 June 2012 06:13:48 GMT