- From: Sebastian Lange <lange@cyperfection.de>
- Date: Thu, 13 Jul 2000 08:50:02 +0200
- To: html-tidy@w3.org
At 20:14 12.07.2000 -0700, RickR@biztro.com wrote: > ... well, he wrote nothing... ;-) but his question was: Is there any way to get TIDY to recursively tidy up a tree of files? I am currently working on a perl script to convert certain HTML files into XML files, it basically is called like "./html2xml.pl source_path/to/html_files target_path/to/xml_files". The path to the HTML sources is processed recursively, the XML files are written to the target directory (without recursivle re-creating the directory structure, but that should be a minor modification only). If I find some time next week, I'll have a go at it... for instance, you can take this piece of perl code and modify it to your needs.... push (@DirStack, $ARGV[0]); $XMLDir = $ARGV[1]; while ($DirCursor = pop @DirStack) { opendir TheDir, $DirCursor; @DirContent = readdir TheDir; foreach $Filename (@DirContent) { if (($Filename eq '.') || ($Filename eq '..')) { next; } $FilePath = $DirCursor.'/'.$Filename; if (-d $FilePath) { push @DirStack, $FilePath; next; } if ($FilePath =~ /.html?$/i) { open (INFILE, "< $FilePath") or die("ERROR: \'$FilePath\' not openable for reading!\n"); $FILE = join("", <INFILE>); close (INFILE); $FILE = trim(retrieveVariables($FILE)); #printVariablesToScreen(); printXMLFile(); print "Parsed file: " . $FilePath . " -> " . $XMLFilename . "\n"; } } closedir TheDir; >Rick Roth >Biztro, Inc. ><mailto:rickr@biztro.com>rickr@biztro.com > -- Sebastian Lange http://www.sl-chat.de/ Maybe the first chat site that validates as HTML 4.0 even though user input may contain HTML codes. Courtesy to Dave Raggett's HTML Tidy: http://www.w3.org/People/Raggett/tidy/
Received on Thursday, 13 July 2000 02:53:28 UTC