- From: Norman Walsh <ndw@nwalsh.com>
- Date: Mon, 10 Oct 2011 09:00:35 -0400
- To: XProc Dev <xproc-dev@w3.org>
Received on Monday, 10 October 2011 13:01:13 UTC
Florent Georges <fgeorges@fgeorges.org> writes:
>> I propose that we add a charset option to the pxp:unzip step. If the
>> specified content type is not an XML content type but is a text
>> content type (begins "text/") and a charset parameter is specified,
>> then the result is a c:data element containing the characters of the
>> extracted file.
>
> Why only for text/*? What if application/xml and the entry does not
> have an XML declaration (or if the XML decl does not have the encoding
> pseudo-attribute)? Shouldn't it be possible to help the processor by
> telling it: "hey, I know this entry is ISO-8859-15, make sure the XML
> parser will use this"...
Uuuuhhhmmmm. Yes. :-)
Be seeing you,
norm
--
Norman Walsh
Lead Engineer
MarkLogic Corporation
Phone: +1 413 624 6676
www.marklogic.com
Received on Monday, 10 October 2011 13:01:13 UTC