- From: Julian Reschke <julian.reschke@gmx.de>
- Date: Thu, 03 Apr 2008 20:58:40 +0200
- To: Mark Nottingham <mnot@mnot.net>
- CC: "Roy T. Fielding" <fielding@gbiv.com>, Jamie Lokier <jamie@shareable.org>, HTTP Working Group <ietf-http-wg@w3.org>
Mark Nottingham wrote: > I know people are getting fatigued by this one, and I'd like to at least > get a shared understanding of the scope of this issue, and maybe carve > off some parts to make it more manageable. Here are a few statements > that I believe capture where we're at; if you disagree, please say so > (hopefully without reopening the entire discussion); > > 1. We are considering allowing UTF-8 in content, specifically (a) in > newly defined headers, and/or (b) in places where TEXT is now. Yes. > 2. We intend to remove the "blanket" RFC2047 encoding associated with > TEXT and (if kept) move it to the definitions of the individual rules, > so that it's clear where such encoding may occur. Candidates for this > include Reason-Phrase, filename-parm, warn-text, as well as the comments > in field-content. Yes. > 3. If RFC2047 encoding is used / referenced, we need to more carefully > specify its use; e.g., regarding what encoding forms are allowable, line > length limits, charsets used, folding. Yes. > 4. From also deserves a look. Ok. > 5. Either the definition of TEXT or CTL may need the C1 control > characters added. TEXT already allows C1 controls (and always did) (<http://greenbytes.de/tech/webdav/draft-ietf-httpbis-p1-messaging-latest.html#rule.TEXT>): TEXT = %x20-7E | %x80-FF | LWS ; any OCTET except CTLs, but including LWS That being said, I'd like it to exclude C1 controls. > As Roy states, #1 should be approached conservatively. I think we can > quickly get a decision on 2, 3, and 5. If we later decide to allow > UTF-8, we can readjust the text (and TEXT) to suit, but if we're going > to punt on this, I'd like to at least nail down the parts of it that we > know we have to deal with. Agreed. BR, Julian
Received on Thursday, 3 April 2008 18:59:28 UTC