W3C home > Mailing lists > Public > www-international@w3.org > October to December 2003

AW: About UTF-8, XHTML and Character Encoding

From: Paul Deuter <PaulD@plumtree.com>
Date: Tue, 28 Oct 2003 21:44:55 -0500
Message-Id: <4.2.0.58.J.20031028214441.04c61048@localhost>
To: www-international@w3.org


You may use literal UTF-8 characters.
-Paul


----------
Von: AmirBehzad Eslami [mailto:behzad@delphiarea.com]
Gesendet: Tuesday, October 28, 2003 10:08 AM
An: www-international@w3.org
Betreff: About UTF-8, XHTML and Character Encoding

E-Greetings Every One,

I'm developing a web site using XHTML in Farsi (persian - 'fa'). The page 
encoded in UTF-8 using the following syntax in XHTML:

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1//EN" 
"<http://www.w3.org/TR/xhtml11/DTD/xhtml11.dtd>http://www.w3.org/TR/xhtml11/ 
DTD/xhtml11.<http://www.w3.org/TR/xhtml11/DTD/xhtml11.dtd>dtd">
<html 
xmlns="<http://www.w3.org/1999/xhtml>http://www.w3.org/1999/<http://www.w3.o 
rg/1999/xhtml>xhtml" xml:lang="fa-IR">

The web page contains non US-ASCII characters such as Farsi and Arabic 
characters.
My question is:

Should I use "Character References" while writting the content in an XHTML 
(UTF-8) web page?
Or It is valid to use "Literal UTF-8" characters? (I mean it is not 
necessary to define a character using Numeric Character Reference)


Thanks in advance,
Behzad
Received on Thursday, 30 October 2003 06:02:56 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 19:17:03 GMT