- From: Imsieke, Gerrit, le-tex <gerrit.imsieke@le-tex.de>
- Date: Fri, 10 Nov 2017 00:37:03 +0100
- To: xproc-dev@w3.org
Did you know that Calabash comes with validator.nu that can also parse HTML? Its advantages are: – it’s more modern and probably also more robust than tidy – you don’t need p:exec – you get results that don’t refer to DTDs in the DOCTYPE (see below) It is typically invoked by: – p:http-request instead of p:load (also for local files) – p:unescape-markup – p:filter Usage examples: https://github.com/le-tex/ora2dbk/blob/master/xpl/htmlbook2docbook.xpl#L207 https://github.com/transpect/xproc-util/blob/master/html5/xpl/load-html5.xpl One issue I had with tidy that it inserted an XHTML 1.0 DOCTYPE declaration and your pipeline got stuck in the XSLT step because it tried unsuccessfully to retrieve http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd. I needed to supply a local DTD and an XML catalog to the pipeline/processor. Gerrit On 10/11/2017 00:05, Imsieke, Gerrit, le-tex wrote: > Hi Andreas, > > On 09/11/2017 23:14, Andreas Mixich wrote: >> <p:insert name="insertGog" match="/en-note/div" position="last-child"> > > The en-note element is in a namespace. Try match="/*:en-note/*:div". > > Gerrit -- Gerrit Imsieke Geschäftsführer / Managing Director le-tex publishing services GmbH Weissenfelser Str. 84, 04229 Leipzig, Germany Phone +49 341 355356 110, Fax +49 341 355356 510 gerrit.imsieke@le-tex.de, http://www.le-tex.de Registergericht / Commercial Register: Amtsgericht Leipzig Registernummer / Registration Number: HRB 24930 Geschäftsführer: Gerrit Imsieke, Svea Jelonek, Thomas Schmidt, Dr. Reinhard Vöckler
Received on Thursday, 9 November 2017 23:37:32 UTC