W3C home > Mailing lists > Public > public-i18n-its-ig@w3.org > July 2008

Re: ITS rules for OpenDocument

From: Rodolfo M. Raya <rmraya@maxprograms.com>
Date: Tue, 29 Jul 2008 08:16:46 -0300
To: Felix Sasaki <fsasaki@w3.org>
Cc: public-i18n-its-ig@w3.org
Message-Id: <20080729081646.4ed810d6.rmraya@maxprograms.com>

On Tue, 29 Jul 2008 19:13:00 +0900
Felix Sasaki <fsasaki@w3.org> wrote:

Hi Felix,

>     * an implementation of "within text" which was necessary to get the
>       OpenDocument segmentation right

Segmentation is set at paragraph level, not sentence level. Is that
correct?

>     * a now worky docx file, please check
>       http://www.w3.org/International/its/its-translate-decorator/example/alice-in-wonderland.docx

It is fine. Word 2007 can open it.

>     * the updated content of that file, see
>       http://www.w3.org/International/its/its-translate-decorator/example/alice-in-wonderland.xml
>       . Rodolfo, could you use that file to re-generate your XLIFF file
>       to make comparison easier?

I used the .docx file to generate a new XLIFF. You can download it from

   ftp://charmed.maxprograms.com/pub/alice-in-wonderland.docx.xlf

>     * the re-generated XLIFF files
>       http://www.w3.org/International/its/its-translate-decorator/example/xliff-file-alice.xml
>       http://www.w3.org/International/its/its-translate-decorator/example/xliff-file.xml

Some comments:

1) the official extension for XLIFF files is ".xlf". Don't use ".xml"

2) use a real language code in the generated XLIFF files. Set it to
"en" (English) or anything else, but not to "tbd". XLIFF editors are
able to validate language codes and complain if you use an invalid one.

3) In "xliff-file.xml" you declare source-language="en" in the <file>
element and then use "tbd" in all <source> elements. This is
inconsistent.

4) There are too many inline tags in "xliff-file-alice.xml".  Tags that
contain the whole segment can, in most cases, be excluded. For example,
if the whole segment is enclosed in <bold> tags, you can put the tags
in a skeleton and store clean text in the <source> element. Tags that
appear before or after the segment and don't affect the text can be
exluded from the segment too. 

5) <ph> element has a required attribute: "id". It is missing in your
files.

Best regards,
Rodolfo
-- 
Rodolfo M. Raya <rmraya@maxprograms.com>
http://www.maxprograms.com
Received on Tuesday, 29 July 2008 11:17:42 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 18:33:00 GMT