[ESW Wiki] Update of "geoGettingStartedwithI18n" by RichardIshida

Dear Wiki user,

You have subscribed to a wiki page or wiki category on "ESW Wiki" for change notification.

The following page has been changed by RichardIshida:
http://esw.w3.org/topic/geoGettingStartedwithI18n


------------------------------------------------------------------------------
+ = ITS WG Collaborative editing page =
+ 
+ Follow the [http://esw.w3.org/topic/itsHowTo conventions] for editing this page.
+ 
+ '''[http://www.w3.org/International/geo/2004/article-devt-process.html Status: Initial Draft]''' 
+ ie. please focus on technical content, rather than wordsmithing at this stage.
+ 
+ Author: Susan K. Miller
+ 
+ 
+ ----
+ 
  = Getting Started with Internationalization =
  This document is the first draft of a article on the GEO site targeted to managers,
  Web designers and developers who are new to I18n. This section includes key concepts, 
  checklists and FAQs regarding '''basic''' I18n and L10n issues. 
  
- = how to edit this page = 
- The whole text of the article  is here, paragraph by paragraph, with relevant comments sandwiched between. Each point made begins with the initials of the author and date (see below), and continues until the next author's initials or the next quote from the article . Multiple points by the same author on the same piece of text are preceded by multiple initials and separated by "===". Related comments, and replies are grouped together were possible. 
- The text of the actual article is in bold. 
- 
- The original bulk of text contains no dates. If adding comments from now on, please use a new paragraph and start the first paragraph in your comment with your initials, followed by the date in square brackets, following the form [RI 2 feb]. 
- 
- 
- --------------------------------------------------------------------------------
- 
- Status: Initial Draft ie. no wordsmithing comments at this stage, please, other than for the question. Comments about structure, etc. very welcome though. 
- 
- '''Are you new to the world of Web site globalization, internationalization and localization? If so, this overview is for you, whether your role is as a project manager, designer or developer. [[BR]][[BR]]
+ Are you new to the world of Web site globalization, internationalization and localization? If so, this overview is for you, whether your role is as a project manager, designer or developer. 
  
- Project managers will find:[[BR]]
+ Project managers will find:
- • A brief discussion of why a Web site should be globalized.[[BR]]
+  - A brief discussion of why a Web site should be globalized.
- • Key concepts, including internationalization, localization and Unicode.[[BR]]
+  - Key concepts, including internationalization, localization and Unicode.
- •	Frequently asked questions about internationalization.[[BR]][[BR]]
+  - Frequently asked questions about internationalization.
  
+ [[RI I assume that the bulleted text will link to the appropriate place each time.]]
+ 
- Web site designers will find:[[BR]]
+ Web site designers will find:
- • An introduction to content, style and page layout issues for internationalizing and localizing Web sites.
+  - An introduction to content, style and page layout issues for internationalizing and localizing Web sites.
- [[BR]]
- • A checklist  of key issues to use as you design your Web site.[[BR]][[BR]]
+  - A checklist  of key issues to use as you design your Web site.
  
- Web site developers will find:[[BR]]
+ Web site developers will find:
- • An introduction to the key concepts including language declarations and 
+  - An introduction to the key concepts including language declarations and character encoding.
- character encoding.[[BR]]
- • Considerations for structuring an internationalized Web site.[[BR]]'''[[BR]]
+  - Considerations for structuring an internationalized Web site.
  
  = Managerial View =
  
- == Why Globalization? == 
+ == Why Globalization? ==
  
+ 
- '''The key reason why companies consider offering multilingual versions of their Web sites is in response to local market cultural and/or legal requirements. In most countries, computer users prefer to work with Web sites in their native language. In order to establish or  increase their presence in target countries, companies may decide to offer their Web sites in a local language.
+ The key reason why companies consider offering multilingual versions of their Web sites is in response to local market cultural and/or legal requirements. In most countries, computer users prefer to work with Web sites in their native language. In order to establish or  increase their presence in target countries, companies may decide to offer their Web sites in a local language.
  
  The globalization of Web sites has become an important issue for companies that want to increase their presence and market and sell their products in international markets. In many case, localization has proven to be the key factor for international product acceptance & success.
  
+ [[RI doesn't this para just repeat the previous one?]]
+ 
  While the practice of web production has matured rapidly over the years, Web site globalization is still very much in its infancy. 
  
  Although nearly every American corporation now has a web presence, less than 15% offer more than one language. With so few examples to build upon, and few established standards, the web manager planning a multilingual site is often left with little direction and support. 
  
- Web site globalization is a complex process, which requires a clear project plan and often the allocation of significant time and resources. '''
+ Web site globalization is [[RI can be?]] a complex process, which requires a clear project plan and often the allocation of significant time and resources.
  
+ 
- == Key concepts == 
+ == Key concepts ==
  
- '''There are many ways to define the key concepts of Web site globalization and we will not attempt to supply the definitive version. For the purposes of this overview, we can say that internationalization is the process of building a Web site so that it can support multiple locales (a country or region and language), while localization is the process of modifying that site for a specific locale. 
+ There are many ways to define the key concepts of Web site globalization and we will not attempt to supply the definitive version. For the purposes of this overview, we can say that internationalization is the process of building a Web site so that it can support multiple locales (a country or region and language), while localization is the process of modifying that site for a specific locale. 
  
- Internationalization generally focuses on the Web site elements that work “behind the scenes” and are transparent to the end user (e.g. encoding, architecture). Localization, on the other hand, focuses on elements that are visible to the user, including the language of the text, the cultural context for the images, and the layout of the Web page. So, we can say that Web site globalization is the sum of internationalization and localization activities.
+ Internationalization generally focuses on the Web site elements that work "behind the scenes" and are transparent to the end user (e.g. encoding, architecture). Localization, on the other hand, [[RI typically/generally]] focuses on elements that are visible to the user, including the language of the text, the cultural context for the images, and the layout of the Web page. So, we can say that Web site globalization is the sum of internationalization and localization activities.
  
- Let’s consider these different facets of globalization in a little more detail.'''
+ Let'€™s consider these different facets of globalization in a little more detail.
  
  == Internationalization  ==
+ 
- '''The Localisation Industry Standards Association (LISA) defines internationalization as... ”the process of generalizing a product so that it can handle multiple languages and cultural conventions without the need for re-design.”
+ The Localisation Industry Standards Association (LISA) defines internationalization as... "the process of generalizing a product so that it can handle multiple languages and cultural conventions without the need for re-design."
  
- In general, the internationalization any product – including a Web site -- is most efficiently done during the product development cycle, as a precursor to the localization. The biggest and most costly problem for many companies is that their English-only Web site has to be completely re-created from the ground up due to the limitations imposed by the English text being embedded in the code and applications that can only handle English text. 
+ In general, the internationalization of any product -- including a Web site -- is most efficiently done during the product development cycle, as a precursor to the localization. The biggest and most costly problem for many companies is that [[RI where?]] their English-only Web site has to be completely re-created from the ground up due to the limitations imposed by the English text being embedded in the code and applications that can only handle English text. 
  
  In fact, an important aspect of internationalization is the separation of text from the source code. Translatable text, i.e. text which is visible to the user, should be moved to separate strings-only resource files. These are the files that can then be handed off to translators.
  
+ [[RI It might help to add some wording to more carefully relate this concept to Web technologies.]]
+ 
- Central to internationalization is the ability to display the character sets and support local standards of a particular country or region. For example, before Web pages can be translated into most Asian languages, they must support double-byte characters. If the page has been coded to support only Western European languages, it must be double-byte enabled (e.g. by using Unicode). For more on this, please refer to the Developer View below.
+ Central to internationalization is the ability to display the character sets and support local standards of a particular country or region. For example, before Web pages can be translated into most Asian languages, they must support double-byte characters [[RI suggest multi-byte]]. If the page has been coded to support only Western European languages, it must be double-byte enabled (e.g. by using Unicode). For more on this, please refer to the Developer View below.
  
+ [[RI 'display the character sets' -> 'support the character sets' OR -> 'display the characters']]
+ 
- Internationalization is often abbreviated as “i18n”, where “18” indicates the number of letters between the “i” and the “n". '''
+ Internationalization is often abbreviated as “i18n”, where “18” indicates the number of letters between the "i" and the "n".
  
  == Localization  ==
  
- '''As with internationalization, many different definitions exist for the term localization. LISA notes that localization: “Involves taking a product and making it linguistically and culturally appropriate to the target locale where it will be used and sold.”
+ As with internationalization, many different definitions exist for the term localization. LISA notes that localization: "Involves taking a product and making it linguistically and culturally appropriate to the target locale where it will be used and sold."
  
- The term localization is derived from the word locale, which generally means a small area or vicinity. In the context of Web site globalization, locale represents a specific combination of language, region and character encoding. For example, Portuguese as spoken in Portugal is one locale; Portuguese spoken in Brazil is a different locale.
+ The term localization is derived from the word locale, which generally means a small area or vicinity. In the context of Web site globalization, locale represents a specific combination of language, region and character encoding [[RI I don't think character encoding is relevant]]. For example, Portuguese as spoken in Portugal is one locale; Portuguese spoken in Brazil is a different locale.
  
- L10n is often used as an abbreviation for localization, where “10” indicates the number of letters between the “l” and the “n".'''
+ L10n is often used as an abbreviation for localization, where “10” indicates the number of letters between the "l" and the "n".
  
  == Globalization ==
  
- '''Because web site internationalization is very much integrated with localization, the process of publishing multi-lingual and multicultural Web sites can be referred to as Web site globalization.  
+ Because web site internationalization is very much integrated with localization, the process of publishing multi-lingual and multicultural Web sites can be referred to as Web site globalization.  
  
  Globalization is a term used in many different ways:
  • At the highest level, we talk about the globalization of business in general as an economic process. 
  • At the enterprise level, a company physically establishes a global presence by setting up local branch or distribution offices. 
  • For  establishing a virtual presence, an enterprise may globalize its web site, by enabling it  for non-English speaking visitors, i.e. internationalizing the site’s back-end software and architecture, and localizing the site’s content.
  
- LISA defines globalization by saying it “addresses the business issues associated with taking a product global. In the globalization of hi-tech prods this involves integrating localization throughout a company, after proper internationalization and product design, as well as marketing, sales and support in the world market.” '''
+ LISA defines globalization by saying it “addresses the business issues associated with taking a product global. In the globalization of hi-tech prods this involves integrating localization throughout a company, after proper internationalization and product design, as well as marketing, sales and support in the world market.” 
  
  == Unicode ==
+ 
- '''One important technical development that has affected localization is the Unicode standard. Unicode provides a unique number for every character, no matter what the platform, program or language. The Unicode standard uses two bytes (16 bits) for all characters instead of just one byte for standard European characters and has been implemented for most computing platforms.  For more on Unicode, see the Developer view.'''
+ One important technical development that has affected localization is the Unicode standard. Unicode provides a unique number for every character, no matter what the platform, program or language. The Unicode standard uses two bytes (16 bits) for all characters instead of just one byte for standard European characters and has been implemented for most computing platforms.  For more on Unicode, see the Developer view.
+ 
+ [[RI Unicode isn't a two-byte encoding any more. Maybe some of this [[http://www.w3.org/International/tutorials/tutorial-char-enc/en/all.html#Slide0040 tutorial text]] can help here.]] 
  
  = YOUR GLOBALIZATION PROJECT =
   
- '''Web sites are becoming increasingly dynamic. Consequently, Web site globalization is a complex process, requiring a clear project plan and often the allocation of significant resources. Below are a few of the issues to keep in mind as you consider the ramifications of I18n. '''
+ Web sites are becoming increasingly dynamic. Consequently, Web site globalization is a complex process, requiring a clear project plan and often the allocation of significant resources. Below are a few of the issues to keep in mind as you consider the ramifications of I18n.
  
  == Increasing use of dynamic web pages ==
  
- '''With XML, the use of databases to create web site content has become more widespread. For Web site l10n, this means that where traditionally large sets of HTML pages and images needed to be localized, now database tables with structured content will be translated. '''
+ With XML, the use of databases to create web site content has become more widespread. For Web site l10n, this means that where traditionally large sets of HTML pages and images needed to be localized, now database tables with structured content will be translated.'
  
  == The challenge of constant change ==
+ 
- '''Many Web sites are updated at least once, if not many times, a day. For multilingual web sites this frequency of updates introduces the challenge of keeping language versions synchronized; ideally, all updates on multilingual web sites are published simultaneously in all target languages. Obviously, this requires an extremely quick turn-around time for translations. Therefore, updating multilingual Web content often requires some form of automation to manage complex workflows.  Even if you already have a content management in place, you’ll need to investigate whether it can handle translated content. '''
+ Many Web sites are updated at least once, if not many times, a day. For multilingual web sites this frequency of updates introduces the challenge of keeping language versions synchronized; ideally, all updates on multilingual web sites are published simultaneously in all target languages. Obviously, this requires an extremely quick turn-around time for translations. Therefore, updating multilingual Web content often requires some form of automation to manage complex workflows.  Even if you already have a content management in place, you’ll need to investigate whether it can handle translated content.
  
  == Translation and content management ==
  
- '''Various tools and web content management systems have been developed to store and manage information in multiple languages on web sites. Setting up an organization and workflow that creates and manages content in multiple languages efficiently is a complicated task that is often underestimated. 
+ Various tools and web content management systems have been developed to store and manage information in multiple languages on web sites. Setting up an organization and workflow that creates and manages content in multiple languages efficiently is a complicated task that is often underestimated. 
  
  Even when the technology is in place to host and manage multilingual web sites, many other issues need to be considered, such as provision of local (not localized) content, allocation of localization budgets, validation process for content translated by a third party service provider, etc. '''
  
  == Legalities ==
  
- '''Keep in mind that laws and regulations in other countries can vary significantly from the practice in your country with regard to: copyright, privacy , advertising, and consumer protection. The country of ownership of your Web site can impact which local laws your company is subject to and how the laws are applied.
+ Keep in mind that laws and regulations in other countries can vary significantly from the practice in your country with regard to: copyright, privacy , advertising, and consumer protection. The country of ownership of your Web site can impact which local laws your company is subject to and how the laws are applied.
  
  For example, if you go to the McDonald’s Corporation (U.S.) Web site and then select  a non-U.S. locale, a separate page warns you: “You are leaving the McDonald's Corporation web site. The policies, including the privacy policy, on the site you are going to may vary from McDonald's. Be sure to review the policies of every site you visit.”
  
- You may need to consider getting local legal advice to address these issues and verify that you are in compliance.  In some countries, your company may be subject to fines within thirty days of the launch of a local Web site.'''
+ You may need to consider getting local legal advice to address these issues and verify that you are in compliance.  In some countries, your company may be subject to fines within thirty days of the launch of a local Web site.
  
  == FAQs == 
  
- '''Q: Isn’t localization just another word for translation?[[BR]]
+ Q: Isn’t localization just another word for translation?[[BR]]
  
  A: No, localization goes beyond just translation of the words. It includes allowances for locale-specific cultural references and regional standards. [[BR]]
  
@@ -124, +138 @@

  
  A: An “international” web site is one that is intended for an international audience, and a “multilingual” web site refers to a web site that uses more than one language. An international web site may or may not be multilingual, just as a multilingual web site may or may not be international.
  
- For more information, please see the article defining international and multilingual sites and the FAQ  that examines the trade-offs between international sites that are monolingual vs. multilingual.'''
+ For more information, please see the article defining international and multilingual sites and the FAQ  that examines the trade-offs between international sites that are monolingual vs. multilingual.
  
- QUESTION: WHAT OTHER Questions COULD BE ASKED AND ANSWERED HERE?
+ [[SM QUESTION: WHAT OTHER Questions COULD BE ASKED AND ANSWERED HERE?]]
  
  = Design View =
  
  == Text content issues ==
  
- ''' The best thing you can do with your content is to ensure clarity and consistency. The text should always be clear and unambiguous:[[BR]]
+ The best thing you can do with your content is to ensure clarity and consistency. The text should always be clear and unambiguous:[[BR]]
  
- • Use consistent phrases and terms. The importance of simple, concise language is magnified when writing for translation. For example, in page navigation decide upfront whether you will use “back” or “previous“; click on”, “click”, “choose”, or “select:, when describing navigation.[[BR]]
+ • Use consistent phrases and terms. The importance of simple, concise language is magnified when writing for translation. For example, in page navigation decide upfront whether you will use “back” or “previous“; click on”, “click”, “choose”, or “select:, when describing navigation.
  
- • Use simple, active verbs.  Compare  “Click the GO button” to “The GO button should now be clicked.”[[BR]]
+ • Use simple, active verbs.  Compare  "Click the GO button" to "The GO button should now be clicked."
  
- • Avoid the use of “telegraphic English”, ie., write full sentences.[[BR]]'''
+ • Avoid the use of "telegraphic English", ie., write full sentences.
  
  
  == Format and style issues ==
  
+ [[RI Allow space for text to expand in translation.
+ Maintain a clear separation between structure and presentation (ie. semantically marked up XTHML with CSS for presentation).
+ Design of forms.]]
+ 
  == Dates ==
- '''Visitors to a web site from varying locales may be confused by date formats. The format MM/DD/YY is unique to the United States. Most of Europe uses DD/MM/YY. Japan uses YY/MM/DD. The separators may be slashes, dashes or periods. Some locales print leading zeroes, others suppress them. If a native Japanese speaker is reading a US English web page from a web site in Germany that contains the date 03/04/02 how do they interpret it?  
+ Visitors to a web site from varying locales may be confused by date formats. The format MM/DD/YY is unique to the United States. Most of Europe uses DD/MM/YY. Japan uses YY/MM/DD. The separators may be slashes, dashes or periods. Some locales print leading zeroes, others suppress them. If a native Japanese speaker is reading a US English web page from a web site in Germany that contains the date 03/04/02 how do they interpret it?  
  
+ [[RI actually the most difficult thing is recognising dates typed into forms.  Form design must therefore be thought through carefully, and perhaps alternatives prepared, according to user.]]
+ 
- For more information, view the solution to this problem published as a Q and A by GEO.(http://www.w3.org/International/questions/qa-date-format)'''
+ For more information, view the solution to this problem published as a Q and A [[RI 'Q & A' -> 'FAQ']] by GEO.(http://www.w3.org/International/questions/qa-date-format)
  
  ==  Calendars ==
- '''Be aware that many Asian countries use different calendars than the Julian one typically used in the Americas and Europe.  
+ Be aware that many Asian countries use different calendars than the Julian one typically used in the Americas and Europe.  
  
+ [[RI 'julian' -> 'gregorian']]
+ 
- Also note that a U.S. calendar week typically displays with Saturday as the last day of the week, while in Europe it is often more common to display Sunday as the last day.'''
+ Also note that a U.S. calendar week typically displays with Saturday as the last day of the week, while in Europe it is often more common to display Sunday as the last day.
  
  == Telephones and Addresses ==
- '''One of the first places you may notice that your Web site doesn’t work beyond your region is when a user from another locale tries to input his or her address and telephone number into your hardcoded form. Telephone numbers and addresses around the world vary substantially from the U.S. standard; these varying formats are well-documented on the Web.
+ One of the first places you may notice that your Web site doesn't work beyond your region is when a user from another locale tries to input his or her address and telephone number into your hardcoded form. Telephone numbers and addresses around the world vary substantially from the U.S. standard; these varying formats are well-documented on the Web.
  
- When displaying addresses and telephone numbers, always include the country code and country name. Specify the time zone.'''
+ When displaying addresses and telephone numbers, always include the country code and country name. Specify the time zone.
  
  == Cultural standards and other issues ==
  
- '''The general rule is to avoid culturally-specific content. This includes: humorous references; references to politics, religion and sacred objects; sports and entertainment events and figures; seasons and holidays. Equivalents in other languages can probably be found for your culturally-specific content, but this requires much more effort than straightforward translation and increases both the time and costs of translation.
+ The general rule is to avoid culturally-specific content. This includes: humorous references; references to politics, religion and sacred objects; sports and entertainment events and figures; seasons and holidays. Equivalents in other languages can probably be found for your culturally-specific content, but this requires much more effort than straightforward translation and increases both the time and costs of translation.
+ 
+ [[RI Hmm.  I think avoid OR take on board the need to change.  See [http://www.w3.org/International/questions/qa-mono-multilingual my FAQ]] for similar arguments.]]
  
  Furthermore, you need take care when referencing or consciously or accidentally attaching meaning to colors, animals, and symbols or hand gestures.
  
@@ -171, +195 @@

  Consider measurement conversion. For example, consider adding “(6.2 km)” when referring to “1 mile”. This will also avoid inconsistencies between conversion done in different target languages; some translators may leave the “1 mile” unchanged, others may completely replace it with the conversion. Again, the key is to anticipate any confusion.
  
  When including examples such as a person’s name, use generic examples that are known world-wide.
- '''
  
  == Images == 
  
@@ -251, +274 @@

  
  Microsoft's Global Development portal has a wealth of information at http://www.microsoft.com/globaldev/default.mspx'''
  
- 
- 

Received on Wednesday, 27 April 2005 13:09:18 UTC