W3C home > Mailing lists > Public > html-tidy@w3.org > October to December 2013

Ignore processing instructions

From: Christophe Chenon <christophe.chenon@fr.ibm.com>
Date: Wed, 6 Nov 2013 11:50:51 +0100
To: html-tidy@w3.org
Message-ID: <OF72C4082D.0D084D08-ONC1257C1B.00396E96-C1257C1B.003B9A72@fr.ibm.com>
Hi there, 

I'm a faithfull user of Tidy to XMLize near XML files. 

Recently I came accross the following problem: my near-XML file contains 
unexpected processing instructions. They should not be there and are not 
intended to modify Tidy's behaviour. Unfortunately, they do. 

At some point, my near-XML file contains the following:  

<econtext><?idd:break> blah blah... </econtext>

Tidy replaces this with 


and the whole file is gracefully ended with all necessary closing tags, 
ignoring the full bulk of interesting data below this point. 

Ideally, this processing instruction ( <?idd:break> ) would be ignored or 
even suppressed. I don't need it at all. 

Can a new option be created ? I can envision something like heed-procins : 

Another option would be to escape whatever coding is contained in some 
pre-declared elements. Here  any coding in the <econtext> element can be 

The corresponding option to pre-declare such elements could be : 
escape-content-elements : econtext  elem2 elem3 

Thank you for considering, 

Cordialement / Best regards

Christophe Chenon, PhD

Innovation ? Terminology ? Quality

French Translation Services Center

1,  Place  Jean-Baptiste  Clément

 Noisy-le-grand ? 93881 ? France

French   Translation   Services   Center

IBM Corporate Multilingual Terminology

Sauf indication contraire ci-dessus:/ Unless stated otherwise above:
Compagnie IBM France
Siège Social : 17 avenue de l'Europe, 92275 Bois-Colombes Cedex
RCS Nanterre 552 118 465
Forme Sociale : S.A.S.
Capital Social : 653.242.306,20 ?
SIREN/SIRET : 552 118 465 03644 - Code NAF 6202A 

(image/gif attachment: 01-part)

(image/gif attachment: 02-part)

(image/gif attachment: 03-part)

(image/gif attachment: 04-part)

(image/gif attachment: 05-part)

Received on Wednesday, 6 November 2013 10:52:52 UTC

This archive was generated by hypermail 2.3.1 : Tuesday, 6 January 2015 21:38:59 UTC