W3C home > Mailing lists > Public > www-international@w3.org > October to December 2003

RE: About UTF-8, XHTML and Character Encoding

From: Richard Ishida <ishida@w3.org>
Date: Tue, 28 Oct 2003 19:31:35 -0000
To: "'AmirBehzad Eslami'" <behzad@delphiarea.com>, <www-international@w3.org>
Message-ID: <000b01c39d8a$1cd7ac90$cb01000a@w3c40upc3ma3j2>

Behzad,

Please use normal characters.  It makes it much easier to maintain the
source code in editors that display the characters, and will reduce file
size considerably. 

NCRs are mainly for use when the encoding doesn't support the character
you need (not the case here) or the author is unable to type in the
actual character (again, I'm assuming that's not the case).

Hope that helps,
RI


============
Richard Ishida
W3C

contact info: http://www.w3.org/People/Ishida/

http://www.w3.org/International/
http://www.w3.org/International/geo/

See the W3C Internationalization FAQ page
http://www.w3.org/International/questions.html


-----Original Message-----
From: www-international-request@w3.org
[mailto:www-international-request@w3.org] On Behalf Of AmirBehzad Eslami
Sent: 28 October 2003 18:08
To: www-international@w3.org
Subject: About UTF-8, XHTML and Character Encoding


E-Greetings Every One,

I'm developing a web site using XHTML in Farsi (persian - 'fa'). The
page encoded in UTF-8 using the following syntax in XHTML:

<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1//EN"
"http://www.w3.org/TR/xhtml11/DTD/xhtml11.dtd">
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="fa-IR">

The web page contains non US-ASCII characters such as Farsi and Arabic
characters.
My question is:

Should I use "Character References" while writting the content in an
XHTML (UTF-8) web page?
Or It is valid to use "Literal UTF-8" characters? (I mean it is not
necessary to define a character using Numeric Character Reference)


Thanks in advance,
Behzad
Received on Tuesday, 28 October 2003 14:36:15 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 19:17:03 GMT