W3C home > Mailing lists > Public > html-tidy@w3.org > April to June 2004

RE: [tidy] looking to use html-tidy for parsing

From: Cory Nelson <PhrostByte@socal.rr.com>
Date: Sun, 4 Apr 2004 17:51:28 -0700
Message-Id: <200404050051.i350pOu3023791@ms-smtp-01-eri0.socal.rr.com>
To: "'Henrique'" <cube_@hotmail.com>, <html-tidy@w3.org>
There is no way you can do this with tidy.exe (that I know of), but if you
are talking code-wise, here's how:
 
static void RecurseImages(TidyNode node) {
   TidyAttr attr;
 
   node=tidyGetChild(node);
 
   while(node) {
      if(tidyNodeIsIMG(node)) {
         attr=tidyAttrGetSRC(node);
         printf("image found on line %d: %s\n", tidyNodeLine(node),
tidyAttrValue(attr));
      }
      else RecurseImages(node);
      
      node=tidyGetNext(node);
   }
}
 
int main(void) {
   TidyDoc tdoc=tidyCreate();
   tidyParseFile(tdoc, "page.html");
 
   RecurseImages(tidyGetBody(tidyDoc));
 
   tidyRelease(tdoc);
 
   return 0;
}






  _____  

http://www.int64.org <http://www.int64.org/>  - When 4GiB of RAM just isn't
enough.
 

  _____  

From: html-tidy-request@w3.org [mailto:html-tidy-request@w3.org] On Behalf
Of Henrique
Sent: Sunday, April 04, 2004 1:22 PM
To: html-tidy@w3.org
Subject: [tidy] looking to use html-tidy for parsing


I'm trying to use html-tidy to parse an html file to print all 'img' tags
and the contents of the 'src' attribute.
Can someone point me in the right direction?
 
 
Received on Sunday, 4 April 2004 20:51:30 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 3 April 2012 06:13:55 GMT