CRLF within URL?
While using libwww-5.0a I run into strange behaviour on
I observe a bit strange HTML...
... <IMG SRC="/
library/images/gifs/toolbar/support.gif" WIDTH=74 HEIGHT=21
Note the CRLF within quoted attribute value. How is this supposed to
be interpreted? Are CR(+LF?) codes supposed to belong into the
attribute value (URL) or not? Currently, at least the version I am
using gives attribute value with CRLF included, and this gets fed back
to the library as is later for the retrieval of the inline image
(after which things stop).
- should library (wwwlib) quote/escape these?
- should application strip these of?
- is my modified "SGML parser" wrong and should ignore CRLF within
Markku Savela (firstname.lastname@example.org), Technical Research Centre of Finland
Multimedia Systems, P.O.Box 1203,FIN-02044 VTT,http://www.vtt.fi/tte/staff/msa/