Re: HTMLToPlain and libwww 5.2

>>  In short, 
>>    this works:
>>      ./w3c -to "text/latex" http://www.w3.org/ -o w3home.txt
>>    this does not:
>>      ./w3c -to "text/plain" http://www.w3.org/ -o w3home.txt
>
>I don't think that any of these work - the command line tool [1] doesn't
>have an HTML parser integrated - I only added the HTML parser to the webbot
>[2] (which needs it for finding links) and the line mode browser (because
>it's a browser!) [3].

  I added the following code to main() in HTLine.c:

    HTList * converters = HTList_new();
    HTConverterInit(converters);
    HTMLInit(converters);
    HTFormat_setConversion(converters);

  that seems to add the converters and I can see with debug enabled that
the converter is being found and used.  The output when converting to
text/plain seems to disappear however.  Did I miss something?

>The following should work as intended:
>
>	./www -to "text/latex" http://www.w3.org/ -o w3home.tex
>
>	./www -to "text/plain" http://www.w3.org/ -o w3home.txt
>
>(it may not generate fully compliant tex, though). You can remove the [n]
>link references by using the "-na" command line option.

  These work great!  I had seen these mentioned in the mailing
list archive but somehow but the idea that www became w3c.

  Thanks!

---
Kent Vander Velden
kent@iastate.edu

Received on Saturday, 21 November 1998 13:43:50 UTC