Re: Proposal to deprecate 'Character encodings' article

From: John C Klensin <john+w3c@jck.com>
Date: Fri, 22 Jan 2016 13:59:42 -0500
To: ishida@w3.org, www International <www-international@w3.org>
Message-ID: <6F9B4F570995CF8C5216C95A@JcK-HP5.jck.com>


--On Friday, January 22, 2016 4:37 PM +0000 ishida@w3.org wrote:

> Someone recently pointed out that the article entitled
> "Character encodings" at
> https://www.w3.org/International/O-charset
> is significantly out of date.
> 
> It doesn't put enough emphasis on the use of UTF-8, it doesn't
> show the HTML5 meta charset markup, references to XHTML can be
> removed, it points to the IANA registry rather than the
> Encoding spec, it says that ISO 8859-1 is a preferred
> encoding, etc.  Furthermore, the content is mostly covered
> elsewhere, in more recent articles.
>...

Richard,

I think explicitly deprecating that document as outdated is a
fine idea (and, in retrospect, probably overdue).

However (and while it is probably part of a separate
discussion), I'm still anxious about having two separate
registries -- at IANA and in the Encoding spec.  We went to
great lengths to make what are now called media types the same
for the web, email, and everything else.  Separate lists for
character encoding identifiers (seen from the IETF and email
perspective as part of the media type picture) really benefit
no one.  Perhaps the solution is to point out the confusion we
have gotten into and see it as another reason for moving to
Unicode encoded in UTF-8, but I'm not sure that is a good reason
for encouraging worse confusion in the interim.

   john
Received on Friday, 22 January 2016 19:00:12 UTC
