W3C home > Mailing lists > Public > www-validator-cvs@w3.org > July 2007

validator/htdocs/config charset.cfg,1.13,1.14

From: Olivier Thereaux via cvs-syncmail <cvsmail@w3.org>
Date: Thu, 19 Jul 2007 03:59:26 +0000
To: www-validator-cvs@w3.org
Message-Id: <E1IBNAk-0001LD-2W@lionel-hutz.w3.org>

Update of /sources/public/validator/htdocs/config
In directory hutz:/tmp/cvs-serv3877/htdocs/config

Modified Files:
	charset.cfg 
Log Message:
removing the list of supported character encodings, 
as we have a better and more reliable technical way to do this.

Replacing with a mechanism to:
* suggest a better alias if a "bad" encoding alias is used
* refuse to transcode if encoding used is forbidden by policy 
(I don't know any, but the mechanism is here...)

More details:
http://lists.w3.org/Archives/Public/public-qa-dev/2007Jul/0009.html
and surrounding thread.



Index: charset.cfg
===================================================================
RCS file: /sources/public/validator/htdocs/config/charset.cfg,v
retrieving revision 1.13
retrieving revision 1.14
diff -u -d -r1.13 -r1.14
--- charset.cfg	12 Oct 2006 01:15:50 -0000	1.13
+++ charset.cfg	19 Jul 2007 03:59:23 -0000	1.14
@@ -1,60 +1,23 @@
 #
-# List of accepted encodings.
+# List of encodings aliases and forbidden encodings
 #
 # $Id$
-#
-# The Validator will refuse to decode documents in an encoding
-# other than those listed here. The list is independent of what
+
+# This list indicates character encoding aliases that are 
+# not recommended, along with a recommended equivalent, e.g:
+# encoding-obscure = encoding-well-known
+
+# It also lists encoding names that the validator will refuse to treat:
+# bogus_encoding = Encoding Forbidden (Reason why)
+
+# The list is independent of what
 # is supported on a specific system but subject to the Validator
 # policy for acceptable encodings.
-#
 
-utf-8                          = 1
-utf-16                         = 1
-utf-16be                       = 1
-utf-16le                       = 1
-iso-8859-1                     = 1
-iso-8859-2                     = 1
-iso-8859-3                     = 1
-iso-8859-4                     = 1
-iso-8859-5                     = 1
-iso-8859-6                     = 1
-iso-8859-6-i                   = 1
-iso-8859-7                     = 1
-iso-8859-8                     = 1
-iso-8859-8-i                   = 1
-iso-8859-9                     = 1
-iso-8859-10                    = 1
-iso-8859-11                    = 1
-iso-8859-13                    = 1
-iso-8859-14                    = 1
-iso-8859-15                    = 1
-iso-8859-16                    = 1
-us-ascii                       = 1
-iso-2022-jp                    = 1
-shift_jis                      = 1
-euc-jp                         = 1
-gb2312                         = 1
-big5                           = 1
-iso-2022-kr                    = 1
-euc-kr                         = 1
-gb18030                        = 1
-tis-620                        = 1
-koi8-r                         = 1
-koi8-u                         = 1
-windows-1250                   = 1
-windows-1251                   = 1
-windows-1252                   = 1
-windows-1253                   = 1
-windows-1254                   = 1
-windows-1255                   = 1
-windows-1256                   = 1
-windows-1257                   = 1
-macintosh                      = 1
-x-mac-roman                    = 1
-x-sjis                         = 1
-iso8859-1                      = 1
-ascii                          = 1
-iso-8859-1-Windows-3.1-Latin-1 = 1
-ks_c_5601-1987                 = 1
-ksc_5601                       = 1
+
+x-mac-roman		= macintosh
+x-sjis			= shift_jis
+iso8859-1		= iso-8859-1
+ascii			= us-ascii
+# this one is in IANA, but better use only windows-1252
+iso-8859-1-Windows-3.1-Latin-1	= windows-1252
Received on Thursday, 19 July 2007 03:59:30 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Thursday, 26 April 2012 12:54:58 GMT