- From: J. Waldram <jwaldram@uwyo.edu>
- Date: Wed, 19 Aug 2009 17:00:57 -0600
- To: Irene Vatton <Irene.Vatton@inria.fr>
- CC: "www-amaya@w3c.org" <www-amaya@w3c.org>
- Message-ID: <alpine.LRH.2.00.0908191154190.31076@leopard.uwyo.edu>
On Tue, 18 Aug 2009, Irene Vatton wrote: > Le lundi 03 août 2009 à 16:36 -0600, J. Waldram a écrit : >> I have searched the archives and have seen only one related post from 2006 >> on an older version of Amaya. Amaya 11.2, with the "Keep multiple >> Spaces" option, the   is represented as a hex byte 0xa0 instead of >>   or  . Is this a mis-translation for the >>   that is rendered by browsers? > > The nbsp character is represented as: > 1) a hex byte 0xa0 when the document is utf-8 encoded > 2) when the document is not utf-8 encoded and there is a doctype > that declares the entity > 3)   or   when the document is not utf-8 encoded and there is > not a doctype that declares the entity > > If Amaya generates a hex byte, the document is considered as utf-8 > encoded. If it's not, the bug is there. I believe that a bug is present in the item 3) case. Does not the   notation allow the renderer/browser to interpret the function without regard of character encoding? Within the case where multiple spaces are preserved is that of multiple spaces (for formatting) that can be removed or allow a line break (by the browser). An example is the two spaces before the beginning of this sentence. I see the need for using nbsp and would suggest that a specific action for an insertion of a nbsp, as in the case of math symbols and <br/>, be used. Alternate logic such as: If (preserve multiple spaces) then if (preserve multiple spaces as nbsp) then put   in document else do not remove the multiple "regular" spaces and allow the browser to format the document as needed fi else remove multiple spaces from document fi could be used. I am not quite sure why removing multiple spaces is a good default action. Is this some style suggestion/action? > >> Since XML does not allow   >> and the   is allowed could this be recoded as a text string not a hex >> byte? Amaya source code views show it as a "~" (tilde) which is less >> obvious in coding and some browsers render it as a literal character. > > Amaya source code views show it as a "~" (tilde) to let know to the > author that there is a nbsp which is not managed as a single space. > A single space can be removed or allows a line break; this is not true > for a nbsp. > >> This is in a >> <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Strict//EN" >> "http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd"> >> >> <meta http-equiv="content-type" content="text/html; charset=iso-8859-1" /> >> <meta http-equiv="Content-Style-Type" content="text/css" /> >> >> I did not want to muck with the source code. I do not >> know enough to see the ramifications of changes. >> Are the relevant parts in translate.c the place to make changes? >> >> Thank you, >> >> -Jim Waldram > -- > Irene Vatton <Irene.Vatton@inria.fr> > INRIA -Jim Waldram
Received on Wednesday, 19 August 2009 23:01:52 UTC