- From: Addison Phillips [wM] <aphillips@webmethods.com>
- Date: Thu, 10 Jun 2004 21:13:37 -0700
- To: <andrea.vine@sun.com>, "I18n WSTF" <public-i18n-ws@w3.org>
Incorporated (at last!) Also deleted the comment about RFC 2277. Note: I modified the material about MIME to be more CharMod compliant (using character encoding, not charset, while indicating that the parameter is called charset in MIME). I also wrote some experimental text about Content-Language and how it (doesn't actually) relate to locales. I need to add a couple links and we need to address some of AV's comments. Best Regards, Addison Addison P. Phillips Director, Globalization Architecture webMethods | Delivering Global Business Visibility http://www.webMethods.com Chair, W3C Internationalization (I18N) Working Group Chair, W3C-I18N-WG, Web Services Task Force http://www.w3.org/International Internationalization is an architecture. It is not a feature. > -----Original Message----- > From: public-i18n-ws-request@w3.org > [mailto:public-i18n-ws-request@w3.org]On Behalf Of A. Vine > Sent: 2004年6月10日 16:01 > To: I18n WSTF > Subject: [Fwd: 4.16 Transports] > > > > > > -------- Original Message -------- > Subject: 4.16 Transports > Resent-Date: Wed, 14 Apr 2004 14:33:05 -0400 (EDT) > Resent-From: public-i18n-ws@w3.org > Date: Wed, 14 Apr 2004 11:12:10 -0700 > From: A. Vine <andrea.vine@Sun.COM> > Reply-To: andrea.vine@Sun.COM > Organization: Sun Microsystems > To: I18n WSTF <public-i18n-ws@w3.org> > > > Lots of modification should happen to this section, but I am not the right > person to make them all. > Someone at the F2F (Tex maybe?) had a lot more relevant > information on FTP than > I do. I'm happy to write it up if you give me the info or a pointer. > > I am not very happy with my intro - comments, changes, rewrites welcome. > > I think 4.16.4 "IRIs, URIs, and fun stuff" would be better > written by Martin. I > could take a shot at the "fun stuff" part ;-} > > -------------------------------------------------------- > 4.16 Transports > > Web services may use a variety of transport technologies and > protocols. Many of > these have parameters defined for data identification. These > parameters are > necessary for proper processing of international data. The > specifics of several > transport protocols are discussed in this section. > > 4.16.1 HTTP Accept-Language > {Andrea's note: what about Content-Language? And a blurb on > Content-Type is > added below. Maybe we should call this section "HTTP".} > > The HyperText Transport Protocol (HTTP) is often used for Web > service message > transport. HTTP contains some header fields which are useful for > identifying > sender preferences and capabilities. One of those fields is > Accept-Language. > > Accept-Language takes one or more language identifiers in RFC3066 (or its > replacement) format as its parameters. Each language identifier > can have a > quality value which gives a relative priority. Here is an example: > > {Andrea's note: the below should be set off in the example > format, or indented > and in a different font, or something} > Accept-Language: zh-cn, fr-ch;q=0.8, fr;q=0.7 > > The above could be read as "Simplified Chinese is preferred, but > Swiss French is > acceptable, as are other types of French." There is more > information about the > handling of Accept-Language in the HTTP 1.1 specification. > > A Web service requester using HTTP can include an Accept-Language field to > indicate the languages preferred. The provider can then take > that information > and use it to return human-readable data in the appropriate language. > > {Andrea's note: I put the below paragraph in because I think it > bears mentioning} > The charset of the data can also be specified as a parameter of > the Content-Type > header. However, it is better to specify the SOAP document > charset inside the > document itself, rather than to rely on the transport mechanism > to be the sole > mechanism for identifying the charset. If the charset specified > in HTTP doesn't > agree with the charset inside the document, then the receiver must make a > decision on how to resolve the problem. > > 4.16.2 FTP > > File Transfer Protocol (FTP) is a simple transport mechanism that > can be used > for Web service documents. The main international consideration > in using FTP is > to specify the representation type as I (Image), allowing 8-bit > values to pass > unchanged through the transfer. > > File names, path names, and character encoding issues may intrude here. > > 4.16.3 SMTP > > Simple Mail Transfer Protocol (SMTP) has no particular provisions for > international data. SMTP itself is limited to 7-bit data, but > can transport > 8-bit data. Its main restriction is an 8-bit gateway; that is, > encodings such > as UTF-16 and UTF-32 may not be successfully transmitted and > should be avoided. > > 4.16.3.1 MIME Tags > > Multipurpose Internet Mail Extensions (MIME) tags are necessary > for a multipart > SOAP request, for example, a SOAP message with an attachment. > MIME contains a > number of headers which may be used for international data. > > {somewhere we need to reference RFCs 2045-9, maybe as a MIME > reference in the > reference section} > > MIME can be useful for identifying the charset of attachments which do not > identify their own charset inside the attachment. Examples of > such attachments > are plain text documents which cannot contain a charset tag and > legacy markup > documents which do not contain a charset tag by omission. If the > attachment > contains an internal charset tag, the MIME charset parameter > should be omitted > to avoid an inadvertent mismatch. > > MIME can also contain a Content-Language tag. While it is better > to indicate > the document language inside the document itself, sometimes it > isn't possible. > For example, if there is an image attachment which contains > embedded text, the > Content-Language header can provide the language id. > > See the example in 4.5.2 Character Coding of Attachments. > {Andrea's note: add the following line to the attachments > example in 4.5.2 in > the attachment MIME headers after Content-Type - > > Content-Language: fr > > This avoids having to make a separate example which would be very > similar.} > > 4.16.4 IRIs, URIs, and fun stuff > {Martin's text here :-) } > > > > > > -- > I have always wished that my computer would be as easy to use as > my telephone. > My wish has come true. I no longer know how to use my telephone. > -Bjarne Stroustrup, designer of C++ programming language (1950- )
Received on Friday, 11 June 2004 00:15:02 UTC