- From: Felix Sasaki <fsasaki@w3.org>
- Date: Fri, 7 Nov 2014 06:19:37 +0100
- To: Yves Savourel <ysavourel@enlaso.com>
- Cc: "Estreen, Fredrik" <Fredrik.Estreen@lionbridge.com>, public-i18n-its-ig <public-i18n-its-ig@w3.org>
Am 06.11.2014 um 23:13 schrieb Yves Savourel <ysavourel@enlaso.com>: > What do you mean by 'strip the whitespace'? > 1- delete all whitespace? no. > 2- normalize the whitespace? yes. > a- trim the leading and trailing? > b- normalize the leading/trailing? I mean b. - Felix > > [ Text Text ] -> [TextText] > > [ Text Text ] -> [ Text Text ] > > [ Text Text ] -> [Text Text] > > Thanks, > -yves > > > -----Original Message----- > From: Felix Sasaki [mailto:fsasaki@w3.org] > Sent: Thursday, November 6, 2014 2:40 PM > To: Estreen, Fredrik > Cc: Yves Savourel; public-i18n-its-ig > Subject: Re: ACTION-54: Try to come up with example of xliff+its test format / output > > HI Fredrik and Yves, all, > > I would calculate the offset based on element textual content, zero is start of the element, tags themselves are not counted, and > the whitespace is always stripped. Since roundtripping is not needed the whitespace stripping does not hurt. > See the NIF conversion at > http://www.w3.org/TR/its20/#conversion-to-nif > including the note about whitespace stripping. > > Best, > > Felix > > Am 06.11.2014 um 22:24 schrieb Estreen, Fredrik <Fredrik.Estreen@lionbridge.com>: > >> Hi Yves, Felix, >> >> How would this work in cases where xml:space != "preserve"? A generic XML processor might normalize the space and thus invalidate > the offsets if insignificant whitespace is not preserved. >> >> Regards, >> Fredrik Estreen >> >>> -----Original Message----- >>> From: Yves Savourel [mailto:ysavourel@enlaso.com] >>> Sent: den 6 november 2014 15:34 >>> To: 'Felix Sasaki' >>> Cc: 'public-i18n-its-ig' >>> Subject: RE: ACTION-54: Try to come up with example of xliff+its test >>> format / output >>> >>> Hi Felix, >>> >>> Can you specify a bit more how the offset would be computed? >>> It seems the zero is the start of the element (e.g. <source>) content. >>> But how would we count the inline element? >>> >>> <source>Text<sm id='1' translate='no'/>data</source> >>> >>> "Text" = 0,4 >>> "data = 31,35 >>> >>> The problem is that we don't always know how long the inline tag is >>> in the document (you can have extra spaces between attributes, some >>> attributes with default values may be omitted, etc.) >>> >>> Or should we count each inline tag as 1 character? >>> >>> Which would give: >>> >>> "Text" = 0,4 >>> "data = 5,9 >>> >>> >>> Thanks, >>> -yves >>> >> >
Received on Friday, 7 November 2014 05:20:12 UTC