Hi!

I'm new to libwww. My requirements are basic: I need to download HTML files and extract the plain text from them. My past attempts at this met with only partial success, so libwww looked like a godsend. However, my experiments with the "showtext" sample have not been very successful: it always trips over <script ...> ... </script> tags and emits the script source as part of the text. (Sites I tried: www.msn.com, www.cnn.com, www.rens.com.)

Any help would be sincerely appreciated, in anticipation of blissful parsing...

Sajit Prabhakaran

Received on Monday, 20 November 2000 02:48:05 GMT
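
To make the behavior asked for above concrete, here is a minimal sketch of text extraction that skips script content. It is an illustration only: it uses Python's standard `html.parser` module rather than libwww, it is not a fix for the "showtext" sample, and the names `TextExtractor` and `html_to_text` are hypothetical. Skipping `<style>` as well is an added assumption that the post does not mention.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collect text nodes, ignoring anything inside <script> or <style>."""

    # Elements whose content is code or styling, not readable text.
    SKIPPED = {"script", "style"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []      # text fragments found so far
        self.skip_depth = 0  # > 0 while inside a skipped element

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self.skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self.skip_depth > 0:
            self.skip_depth -= 1

    def handle_data(self, data):
        # Keep only non-blank text that is outside skipped elements.
        if self.skip_depth == 0 and data.strip():
            self.parts.append(data.strip())


def html_to_text(html):
    """Return the visible text of an HTML string, fragments joined by spaces."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.parts)


if __name__ == "__main__":
    page = (
        "<html><head><script>var x = '<b>hi</b>';</script></head>"
        "<body><p>Hello</p><p>world</p></body></html>"
    )
    print(html_to_text(page))
```

The key point is the depth counter: while it is non-zero, character data is dropped instead of being appended to the output, which is the behavior the "showtext" experiments were missing.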