- From: Richard Ishida <ishida@w3.org>
- Date: Tue, 28 Oct 2003 12:27:19 -0000
- To: "'Karl Dubost'" <karl@w3.org>, <www-i18n-comments@w3.org>
Karl, Thanks for taking the time to make these comments. Due to vacation and travel it may take a while to respond to you, but we will read the comments carefully and get back to you. Best regards, Richard. ============ Richard Ishida W3C contact info: http://www.w3.org/People/Ishida/ http://www.w3.org/International/ http://www.w3.org/International/geo/ See the W3C Internationalization FAQ page http://www.w3.org/International/questions.html > -----Original Message----- > From: www-i18n-comments-request@w3.org > [mailto:www-i18n-comments-request@w3.org] On Behalf Of Karl Dubost > Sent: 25 October 2003 00:57 > To: www-i18n-comments@w3.org > Subject: ATeXHI 1.0 or Babel Scribe 1.0 > > > > Hi, > > this is a few comments with regard to your 1st WD. > ATeXHI 1.0 or Babel Scribe 1.0 > Authoring Techniques for XHTML & HTML Internationalization 1.0 > > First of all, thank you very much for this work it was much needed. I > hope you will have success and good reviews for each of your version. > > * QA Spec Guidelines - http://www.w3.org/TR/qaframe-spec > > The QA Spec Guidelines are entering in CR phase, which is an > implementation phase for the QA WG. It seems that it will be a > wonderful opportunity for both WG, GEO and QA, to implement these > guidelines and for the QA WG to help and create tools when > it's needed. > > This following review is not a review against QA Spec Guidelines > > I have discussed with Richard Ishida on IRC and he told me > that some of > the verbiage was repeating the same principles along the > document. The > document to be read by the outline. I Would encourage the editors to > write atomic statement for each feature and to not repeat the same > verbiage BUT to point to these atomic statement from > different outlines. > > It will be like having modules addressing some > problems, and profiles > collecting a set of modules or features applied to specific > problems or > readers. It will have the advantage for the editor to be easier to > maintain as well and less confusing for the reader in certain > circumstances. > > > > * Abstract > You limit your scope to XHTML 1.0/HTML 4.01. XHTML 1.1 > is already a > specification and includes Ruby, which is an interesting > technology for > the Web and I18N. XHTML 2.0 is in development it may be the > opportunity > to input more I18N stuff in XHTML and when XHTML 2.0 does not address > certain I18N issues to put them in this document. > > * Status > """These are techniques that need to be > addressed from the start of content development > if unnecessary costs and resource issues > are to be avoided later on.""" > > It's never too late to improve a Web site or a > document. It might be > benefitial to point out that if the site does not respect simple > principles of I18N, it can still improve Step by Step the overall > quality. > See http://www.w3.org/QA/2003/03/web-kit where we > mentionned I18N > > * 1.3 Standards addressed > """ote that XHTML source can be served as XML > (using MIME types application/xhtml+xml, > application/xml or text/xml) or HTML > (using the MIME type text/html).""" > > It might happen in the future that text/xml be > deprecated. There's a > lot of discussion around that. It's at risk. > > * 1.4 User agents addressed. > Netscape 7 is a frozen/dead product and will not be > developed anymore, > I would encourage to focus on Mozilla more than Netscape. > If you want that your document fresh and evolving with > tools, you may > want to choose to compatibility charts outside of your main document. > > * 2.1 Internationalizing the page header > > The recommendation is good but your example is not very good. If you > serve your document as text/html, you do not need the XML declaration > <?xml version="1.0" encoding="UTF-8"?> > And if you serve it as application/xhtml+xml, there's no need to put > the xml declaration if your document is utf-8 and utf-16, > it's even not > recommended, because IE 6 Windows have problem with the xml > declaration > and pass in quirks mode when it's here. > It's good to encourage utf-8, and there's an incentive > to do it by > saying that if you use utf-8, you don't need to put the xml > declaration > and therefore IE 6 will be friendly with you. > > """In case of conflict, the Content-Type > charset declaration and the XML declaration > have precedence over the meta charset statement, > according to the HTML 4.01 and XHTML 1.0 > specifications. [Ed. note: Is this true in > practise? esp wrt IE?]""" > > See CUAP - http://www.w3.org/TR/cuap. There is the precedence order. > > """Use meta charset declarations as early as possible > in the head > element.""" > When the browser does not in the http headers the > encoding, it will be > necessary to parse the begining of the document to get the encoding > information. As such, it's indeed preferable to have it at > the start so > the user agent will be able to display with the correct encoding. > Though it might be useful to test or ask to vendors when do they stop > parsing the header to find this information. > > """For HTML use the lang attribute, and for > XHTML use the lang and xml:lang attributes > in the html tag. """ > There's an incentive to use XHTML over HTML by the fact > of being able > to smoothen your transition to XHTML 1.1 or XHTML 2.0. In XHTML 1.0, > you can use only xml:lang if you wish and you will have no > problems to > switch to XHTML 1.1 or XHTML 2.0 where xml:lang is the only possible > attribute. > > One of the reason of using xml:lang or lang attributes > in a document, > is the behaviour of CSS rules. For example, in IE5 Macintosh > if you put > a "q" element for citation, the quotes will be different depending on > the wrapping language. < blabla > in french, " blabla " in english, > etc. You have also rules of selection in CSS 2 depending on the > language too. > Another good point to make, if the document is read by > a translating > agent (automatic translation), it will not have to guess the main > language of the language by an heuristic, therefore performance > improvement for processing it. > The meta statement must be compatible with the html > element, though > it's not mandatory. I guess the html element should have > precedence on > the meta element. > > * 2.2 International Layout considerations > right, left and before, after > An interesting issue which appeared when we designed a > QA stylesheet > for right/left direction languages. We have small red arrows in the > menu and for languages left to right the arrow points to the right. > Luckily enough the arrow was specified with a before CSS > structure and > was in the CSS and not in the HTML with an img element so we > have been > able to create another stylesheet for right to left languages. It has > been less painful than having thousands of pages to modify. > Though it's interesting to understand that a simple > arrow may have > internationalization problems. > > * 3.1 Choose a page encoding > Choose UTF-8 or another Unicode encoding for all content. > > - Give the list or a reference to a list of Unicode encodings > > """* Unicode (UTF-8) forms will be easier to migrate to > XForms.""" > You can add for the reasons I gave before: > * Unicode (UTF-8) forms will be easier to migrate to > XHTML 1.1/XHTML > 2.0 > > """If you don't use a Unicode encoding, select an > encoding that best > supports the languages / characters to be included in the > page text.""" > > This is not testable per se. You might recommend: > Use an encoding > that supports the languages/characters included in the page text. > > """Check that user agents (all agents that must render > the page) > adequately support the page encoding that you have selected. If not, > you might need to use a more widely > supported encoding to achieve an adequate degree of user agent > support.""" > > It contradicts in a sense a principle of accessibility > and of the Web > which says whatever your user agent you should be able to access the > content. Though this said, it doesn't solve the problem. I would not > encourage people to do browser sniffing too, because it challenges > > It's the same for the next technique. """Use character sets and > encodings that will be accessible and common to your > users.""" when you > recommend such techniques, you have to moderate it by explaining the > constraints/difficulties it might create to other users. > > * 3.2 Specifying a page encoding > > """Where practical, declare the page's character > encoding by setting the charset parameter in the > HTTP Content-Type header.""" > > Not where practical, do that all the time. Each time > you have the > opportunity to serve your document with the right encoding in > the HTTP > header, just do it. It has the benefit for the user agent to not have > to guess or parse the begining of the HTML document to know how to > display it. > It's not incompatible with specifying inside the > document for the > reason you gave, saving locally, etc. > > You may give an example for httpd.conf and/or .htaccess > for Apache and > an example for Jigsaw > > Apache httpd.conf and .htaccess > > AddCharset utf-8 .html > > you can also do things like > > <FilesMatch "/somewhere/europe/*.html"> > AddCharset iso-8859-1 .html > </FilesMatch> > > Ask to Yves Lafon on the method for Jigsaw. > > """For XHTML served as text/html, where practical use an XML > declaration with an encoding attribute.""" > > No. When XHTML is served as text/html the XML > declaration becomes > completely irrelevant and as I said gives problem to IE6. And you > explain it just after. The visual checking is not a good > recommendation. :) even if it's done often. > > * 4.1 Choosing & specifying fonts > """Do not use <font> tags - use CSS styles instead.""" > I see in the Ed Note """Ed. note: Describe the evils of > using <font> > to cheat on the charset and represent other > > scripts.]""". It would be good to give techniques and examples > how the Webmaster can switch from the use of font to the use of other > techniques. > > """Always use the serif and sans-serif fallbacks""" > to add "In the font property in CSS". > > 5.3 Specifying the language of a link destination > > """Use the hreflang attribute on the a element.""" > It is supported by CSS :)))) You should read my entry about it. > > http://www.la-grange.net/2002/09/03#hreflang (french) > > CSS rule for it > /* display of the language you linked to */ > a[hreflang]:after { content: " [" attr(hreflang) "] "; } > > What are the benefits of that? > > 1) strong usability benefits, the user will know > browsing your Web > site what is the language of the ressource your are linking > to. Imagine > you are in a document writtent in french and you link to a > reference in > english, but some of your readers do not know english at all. > They will > not have to follow the link to discover afterward they can't read it. > They save time, and bandwidth. > > 2) It will be good if the I18N activity review the CUAP > note and add > comments to it or new checkpoints. Why? Because you might > encourage or > recommend behaviours of user agents. For example, you might recommend > to a user agent which is an automatic translator to respect the > attributes "lang" and "xml:lang" in a document, so it doesn't > translate > things which should not (like trying to translate french to french > sometimes... silly. and to use with intelligence the hreflang > attribute. It means in a context where you have this attribute the > automatic translator will know beforehand the main language > used if the > user follow the link and will give the possibility to translate > adequatly. > Also for indexing search engines like Google it has the > benefits of > knowing the language before to index it and so to be more > effective in > indexing the page. > > * Do not add dir="rtl" to the body tag. > """According to the Microsoft article Authoring HTML > for Middle Eastern Content, the following behaviors > can only be expected in Internet Explorer 5 if the > dir attribute is on the html element, rather than the > body element.""" > > Specify which version of IE mac or windows? > > > This is it for a first review ;) > > > > -- > Karl Dubost - http://www.w3.org/People/karl/ > W3C Conformance Manager > *** Be Strict To Be Cool *** >
Received on Tuesday, 28 October 2003 07:27:43 UTC