W3C home > Mailing lists > Public > html-tidy@w3.org > April to June 2005

RE: Web service using XML RPC and html Tidy by extracting data fr om a web page

From: <Kipp.Howard@lexisnexis.com>
Date: Mon, 4 Apr 2005 10:44:38 -0700
Message-ID: <5150922C4A9FFE4DA2F658016BF9FA3C09092DDC@lnxseamail01.internal.courtlink.com>
To: bn.rout@gmail.com, html-tidy@w3.org

biranchi rout wrote:
>   I am a newbie in the web service area. I have written one xml rpc
> server and a client to test it. It works fine.
>      I need to create a web service which will display the formatted
> information by taking input from another web page, I am not sure , how
> they will interact.
>      I have to use some sort of HTML scrapping, anybody has idea, can
> guide me how to go ahead with this.

In the past, I have used Tidy to convert HTML pages into XHTML and then use
XSL to pull data out of the XHTML.  It worked well, but the XSL is not the
prettiest.  Good luck.

-- 
Kipp E. Howard
Sr. Software Engineer, LexisNexis File & Serve
Phone: 425.372.1837 or 800.774.7317 ext 1837
Email: kipp.howard@nospam.lexisnexis.com
Received on Tuesday, 5 April 2005 11:31:24 UTC

This archive was generated by hypermail 2.3.1 : Wednesday, 5 February 2014 07:15:53 UTC