Date: Wed, 4 Nov 1992 16:48:34 -0500 From: Jim Davis <davis@dri.cornell.edu> Message-Id: <199211042148.AA01760@willow.tc.cornell.edu> To: wei@xcf.berkeley.edu, www-talk@nxoc01.cern.ch Subject: improved printing of WWW files If you can't quite manage to live without hardcopy, you may wish sometimes to print WWW files. I have written a couple of scripts to do this. They are particularly useful with Pei Wei's excellent Viola WWW browser. A tar archive is available for anonymous FTP: dri.cornell.edu/pub/davis/print-www.tar It contains: README print-www print-www.l html-to-latex html2latex.sed (modified version of original CERN version) The hardest part was writing the perl script to obtain documents via http protocol - turns out you cant just run pipes through telnet. The conversion from HTML to LaTex is not really robust yet - this is doubly hard since there is no guarentee that the HTML is legal. But at least it works for my test cases. No doubt it will be improved in time. best wishes