
&apos; not recognized

From: Bjoern Hoehrmann <derhoermi@gmx.net>
Date: Thu, 12 Apr 2001 22:51:13 +0200
To: html-tidy@w3.org
Message-ID: <h35cdtcqn99eifsj8s7ig0g4sg29ts5ikn@4ax.com>
Hi,

   HTML Tidy doesn't recognize &apos; as a valid XML entity (when in
'-xml' mode). XML defines these entities:

  <!ENTITY lt     "&#38;#60;">
  <!ENTITY gt     "&#62;">
  <!ENTITY amp    "&#38;#38;">
  <!ENTITY apos   "&#39;">
  <!ENTITY quot   "&#34;">

Just adding &apos; to entities.c would cause a major interoperability
problem: HTML 4.01 doesn't define this entity, so the entity encoding
routines would generate non-compliant HTML (if ' is ever encoded as
&apos;). What should be done about that?
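
One possible direction, as a rough sketch only: tag each table entry with
the document types that define it, and consult that tag both when parsing
and when encoding. Everything below (the ENT_HTML/ENT_XML flags, the struct
layout, the function names) is hypothetical and is not taken from Tidy's
actual entities.c; the table is a small illustrative subset.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical flags: which document types define a given entity. */
enum { ENT_HTML = 1, ENT_XML = 2 };

struct entity {
    const char *name;
    unsigned    code;   /* character code the entity stands for */
    unsigned    modes;  /* bitmask of ENT_HTML / ENT_XML */
};

/* Illustrative subset only, not Tidy's real table. */
static const struct entity entities[] = {
    { "lt",    60, ENT_HTML | ENT_XML },
    { "gt",    62, ENT_HTML | ENT_XML },
    { "amp",   38, ENT_HTML | ENT_XML },
    { "quot",  34, ENT_HTML | ENT_XML },
    { "apos",  39, ENT_XML },           /* not defined by HTML 4.01 */
    { "nbsp", 160, ENT_HTML },          /* not predefined by XML */
};

#define N_ENTITIES (sizeof entities / sizeof entities[0])

/* Parsing side: character code for `name` in `mode`, or 0 if the
   entity is not defined for that document type. */
unsigned entity_code(const char *name, unsigned mode)
{
    size_t i;
    for (i = 0; i < N_ENTITIES; i++)
        if ((entities[i].modes & mode) && strcmp(entities[i].name, name) == 0)
            return entities[i].code;
    return 0;
}

/* Encoding side: entity name that may be written for `code` in `mode`,
   or NULL, in which case the caller should fall back to a numeric
   character reference or to the literal character. */
const char *entity_name(unsigned code, unsigned mode)
{
    size_t i;
    for (i = 0; i < N_ENTITIES; i++)
        if ((entities[i].modes & mode) && entities[i].code == code)
            return entities[i].name;
    return NULL;
}
```

With a table like this, &apos; would be accepted on input in XML mode, while
the encoder in HTML mode would never get "apos" back and so could not emit
it by accident.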
-- 
Björn Höhrmann | mailto:bjoern@hoehrmann.de | http://www.bjoernsworld.de
am Badedeich 7 | Telefon: +49(0)4667/981028 | http://bjoern.hoehrmann.de
25899 Dagebüll | PGP Pub. KeyID: 0xA4357E78 | http://www.learn.to/quote/
Received on Thursday, 12 April 2001 16:50:36 GMT
