- From: Kent Vander Velden <graphix@iastate.edu>
- Date: Sat, 21 Nov 1998 12:43:49 CST
- To: Henrik Frystyk Nielsen <frystyk@w3.org>
- cc: www-lib@w3.org
>> In short, >> this works: >> ./w3c -to "text/latex" http://www.w3.org/ -o w3home.txt >> this does not: >> ./w3c -to "text/plain" http://www.w3.org/ -o w3home.txt > >I don't think that any of these work - the command line tool [1] doesn't >have an HTML parser integrated - I only added the HTML parser to the webbot >[2] (which needs it for finding links) and the line mode browser (because >it's a browser!) [3]. I added the following code to main() in HTLine.c: HTList * converters = HTList_new(); HTConverterInit(converters); HTMLInit(converters); HTFormat_setConversion(converters); that seems to add the converters and I can see with debug enabled that the converter is being found and used. The output when converting to text/plain seems to disappear however. Did I miss something? >The following should work as intended: > > ./www -to "text/latex" http://www.w3.org/ -o w3home.tex > > ./www -to "text/plain" http://www.w3.org/ -o w3home.txt > >(it may not generate fully compliant tex, though). You can remove the [n] >link references by using the "-na" command line option. These work great! I had seen these mentioned in the mailing list archive but somehow but the idea that www became w3c. Thanks! --- Kent Vander Velden kent@iastate.edu
Received on Saturday, 21 November 1998 13:43:50 UTC