- From: Arthur Barstow <Art.Barstow@nokia.com>
- Date: Mon, 14 Jul 2008 08:01:06 -0400
- To: public-webapps <public-webapps@w3.org>
- Message-Id: <A8C6D39A-69EA-4201-919D-AA607D422430@nokia.com>
Comments on the Widgets specs from Addison Phillips of the I18N Core
WG ...
Begin forwarded message:
> From: "ext Phillips, Addison" <addison@amazon.com>
> Date: July 10, 2008 5:01:50 PM EDT
> To: "public-i18n-core-comments@w3.org" <public-i18n-core@w3.org>
> Cc: Marcos Caceres <marcosscaceres@gmail.com>, Arthur Barstow
> <Art.Barstow@nokia.com>, Felix Sasaki <fsasaki@w3.org>,
> "mike@w3.org" <mike@w3.org>
> Subject: RE: Widgets i18n feedback
>
> Hi,
>
> My personal comments on the Widgets specs located at [1] follow. I
> have copied a few members of the WebApps WG on this message so they
> can see progress; these comments will be a Topic in our next
> teleconference, for consideration as the Internationalization WG's
> official comments.
>
> Comments follow.
>
> First, the requirements document:
>
> 1. In the introduction we see the (much better) comment:
>
> --
> As argued by the Widget Landscape document, there is currently no
> formally standardized way to author, package, digitally sign and
> internationalize a widget resource for distribution and deployment
> on the Web.
> --
>
> The widget-land document focuses on localization of widgets, which
> is important. This document should provide a solution to the above
> and this should be referred to as "localization".
> Internationalization remains a problem because JavaScript has no
> locale facet. Internationalized formatting and processing is only
> possible as long as one is happy with the default system locale and
> formats on the host platform. Note: this is usually less of a
> problem for widgets than for AJAX style interactions in a Web page,
> since most widgets are perceived as applications running locally,
> whereas most Web properties manage the user language/locale
> themselves and need a locale in JS to "do the right thing".
>
> 2. [R5] "For example, addressing a resource via an IRI (e.g. new
> Image('../images/pane.png'))."
>
> The use of IRI here is good to see.
>
> 3. [R6] Multilingual Resource Names. The current text is not really
> as strong as I'd like it to be (as you probably would suspect). I'm
> also not sure that it's quite correct. The current text reads:
>
> --
> A conforming specification SHOULD recommend a packaging format that
> is suitable for multilingual contexts, giving authors the ability
> to name files and directories using characters from the Unicode
> character repertoire; in such a case, a conforming specification
> SHOULD recommend the UTF-8 encoding.
> --
>
> Keeping the same "strength" of requirement, I would probably phrase
> this as:
>
> --
> A conforming specification SHOULD recommend a packaging format that
> allows for non-ASCII characters in file and directory names,
> allowing authors to create widgets suitable for various cultures
> and languages, as well as multilingual contexts. The packaging
> format MUST either provide for a declaration of the character
> encoding used or specify what it is. The UTF-8 character encoding
> SHOULD be either the default (if multiple encodings are allowed) or
> sole encoding used.
> --
>
> It would be far better to strongly require encoding support:
>
> --
> A conforming specification MUST declare the character encoding of
> file and directory names used in the packaging format and SHOULD
> use the UTF-8 character encoding. If the UTF-8 encoding is not
> used, the specific encoding MUST either be specified by the
> packaging format or be specifiable in the package itself. Since
> packaged widgets are widely distributed, variation in character
> encoding between different platforms or configurations may render a
> widget with non-ASCII resources inoperable or otherwise degrade the
> user experience unless a comment character encoding is used.
> --
>
> 4. [R7] Internationalization guidelines. Really this should be
> "localization guidelines", since internationalization support is
> not really dealt with anywhere.
>
> 5. [R14] Authorship and Widget Metadata. Note that the author's
> name, email, and organization can all be non-ASCII values.
>
> Next document: [1a]
>
> 6. "File and Folder names". This section contains the following text:
>
> --
> Author requirements: The zip relative path must be encoded as
> either [CP437] or [UTF-8]. Encoding the file name field using
> [UTF-8] is recommended. If the zip relative path is encoded using
> [UTF-8], then the general purpose bit 11 of the local file header
> must be set to 1, otherwise it must be set to 0.
> --
>
> This is good (and addresses my comment 3).
>
> 7. (same section). The text says:
>
> --
> Author requirements: It is recommended that authors keep their path
> lengths below 255 characters. Having excessively long path names
> (eg. over 120 characters) can also result in interoperability
> issues on some operating systems.
> --
>
> I'm pretty sure you do not mean "characters" here. You probably
> mean "bytes", since that is the limitation on some operating
> environments. In a multibyte encoding, such as UTF-8, this means
> that there may be fewer than 255 characters in a 255 byte sequence
> (as few as 63 in the worst case). Please use the word "bytes" here
> if that is what is meant and include a note along the lines of:
> "Note that a character can require more than one byte to encode,
> making the length of the path in characters less than 255."
>
> 8. [non-i18n] I think you mean to s/260/255 in this note:
>
> Note for implementers: as this specification does not put a
> restriction on path length, implementers need to be prepared to
> deal with path lengths longer than 260 characters.
>
> 9. [non-i18n] The "valid zip relative path" ABNF is flawed. It has
> a production called 'ascii-chars', but both utf8-chars and cp437-
> chars use 'ascii-range' instead.
>
> 10. Section 6. The example has no language attributes, non-ASCII,
> or IRI-style values. Probably this example is fine, but it is
> always nice to see examples that use international capabilities.
>
> 11. "Attribute Values and Types". Put quotes around the example
> numbers to make clear where they are. Some cultures use comma as a
> decimal separator and it will be clearer what your examples are if
> you do this.
>
> 12. "URI attribute" You specify BOTH URI and IRI here. And you
> specify path as being strictly URI (3986). But elsewhere you
> consistently use the term IRI. Even more confusingly, the word
> "token" doesn't even appear in 3986 and appears only once (in
> another context) in 3987. I think you should specify the specific
> format, preferably IRI.
>
> 13. The "name" and "description" elements (6.5, 6.6). Here are
> human readable strings with no xml:lang. You should allow an
> xml:lang on these elements. Ideally, you should permit multiple
> descriptions (at least) in multiple languages. These elements are
> often displayed in widget management environments (before the
> widget is invoked or running).
>
> 14. The "author" element (and its attributes) (6.7) The uri and
> email attributes should both be of the IRI-flavor, although non-
> ASCII email names are currently controversial.
>
> 15. The "content" element (6.11) The default 'type' is "text/html"
> with no charset. It would be better if the default included a
> charset. The 'src' attribute uses the word "URI", where you should
> say "IRI".
>
> 16. Section 7. "widget locale". This is specified as:
>
> --
> widget Locale
> The system locale as an RFC3066 language code (eg. en-us)
> --
>
> This should say "The system locale as a BCP 47 language tag (e.g.
> en-US or zh-Hant-TW)"
>
> 17. In this same section, "rules for getting text content": I
> notice that any bidi markup will be removed.
>
> 18. General observation: there is no way to set base directionality
> (for bidi text) on any of the description/name/etc. elements or for
> the widget overall. This may make some bidi languages (Arabic,
> Hebrew, Urdu, etc.) work poorly in a widget.
>
> ===
>
> That's it for now.
>
>
> Best Regards,
>
> Addison
>
>
> [1] http://www.w3.org/TR/2008/WD-widgets-reqs-20080625/
> [a] http://www.w3.org/TR/widgets/
>
>
> Addison Phillips
> Globalization Architect -- Lab126
>
> Internationalization is not a feature.
> It is an architecture.
>
>> -----Original Message-----
>> From: Michael(tm) Smith [mailto:mike@w3.org]
>> Sent: Wednesday, July 09, 2008 8:10 PM
>> To: Phillips, Addison; Addison Phillips
>> Cc: Marcos Caceres; Arthur Barstow; Felix Sasaki
>> Subject: Re: Widgets i18n feedback
>>
>> Addison,
>>
>> We really need to get your input on this by the week of July 28 at
>> the latest.
>>
>> --Mike
>>
>> Marcos Caceres <marcosscaceres@gmail.com>, 2008-07-01 11:41 +1000:
>>
>>> Hi Addison,
>>> I'm just wondering if you could give us an ETA on the i18n input
>> for the
>>> Widget spec?
>>> Kind regards,
>>> Marcos
>>
>> --
>> Michael(tm) Smith
>> http://people.w3.org/mike/
>> http://sideshowbarker.net/
Received on Monday, 14 July 2008 12:02:18 UTC