W3C home > Mailing lists > Public > html-tidy@w3.org > October to December 1999

Re: Enhancement request

From: asantos <asantos@automail.pt>
Date: Fri, 3 Dec 1999 00:12:16 -0100
Message-ID: <002501bf3d2b$70d10da0$0501a8c0@ajs.llsa>
To: "Bjoern Hoehrmann" <derhoermi@gmx.net>, <html-tidy@w3.org>
Hi!

I'm affraid that I don't agree with your concept of restricting tidy to be a
"pretty-printer for HTML Code".... after all, there are 3133 lines in
parser.c vs 1594 in pprint.c. Therefore, I think of tidy in a wider sense,
and the -slides option proves my point :).

Unfortunately, I haven't got the time to hack the (estimated) 20 lines of
code that would have to be added to tidy to do the stuff I proposed. It's
not the 20 lines, its understanding the whole of the code, of course.

Re LinkExtor, I'll look into it (thanks for the tip), but the fact is that
there are dozens of extractors, in Perl, Python, C, even Lisp. But on the
other hand, tidy is (on my system, at least) all of 103 kb... which is
better that loading perl's 491 k, plus (assuming) libwww and the rest.

The point is that tidy tracks bleeding edge HTML, is based on a grammar, and
its fast.

Again, thank you for the tip

Armando



-----Original Message-----
From: Bjoern Hoehrmann <derhoermi@gmx.net>
To: asantos <asantos@automail.pt>
Cc: html-tidy@w3.org <html-tidy@w3.org>
Date: Quinta-feira, 2 de Dezembro de 1999 21:29
Subject: Re: Enhancement request


>| I'd like to have 2 new command line options:  --links and --images. I
think
>| it's obvious, --links would print a (unformated ?) list of all the href
URLs
>| for the <a...> tag, --images all the src URLs of <img...> tags. These two
>| options would make tidy very usable in scripts.
>
>Use a perl script instead of html tidy. Tidy is ment to be a pretty-printer
>for HTML Code, not a all-in-one HTML authoring tool.
>
>Look at HTML::LinkExtor from CPAN; it can do exactly what you want.
>
>regards,
>--
>Björn Höhrmann - mailto:bjoern@hoehrmann.de - http://www.bjoernsworld.de
>
>
Received on Thursday, 2 December 1999 19:13:16 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 3 April 2012 06:13:42 GMT