W3C home > Mailing lists > Public > xproc-dev@w3.org > October 2011

Re: Proposed update to pxp:unzip: charset attribute

From: Norman Walsh <ndw@nwalsh.com>
Date: Mon, 10 Oct 2011 09:00:35 -0400
To: XProc Dev <xproc-dev@w3.org>
Message-ID: <m28votovbg.fsf@nwalsh.com>
Florent Georges <fgeorges@fgeorges.org> writes:
>> I propose that we add a charset option to the pxp:unzip step. If the
>> specified content type is not an XML content type but is a text
>> content type (begins "text/") and a charset parameter is specified,
>> then the result is a c:data element containing the characters of the
>> extracted file.
>
>   Why only for text/*?  What if application/xml and the entry does not
> have an XML declaration (or if the XML decl does not have the encoding
> pseudo-attribute)?  Shouldn't it be possible to help the processor by
> telling it: "hey, I know this entry is ISO-8859-15, make sure the XML
> parser will use this"...

Uuuuhhhmmmm. Yes. :-)

                                        Be seeing you,
                                          norm

-- 
Norman Walsh
Lead Engineer
MarkLogic Corporation
Phone: +1 413 624 6676
www.marklogic.com

Received on Monday, 10 October 2011 13:01:13 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Monday, 10 October 2011 13:01:14 GMT