W3C home > Mailing lists > Public > www-style@w3.org > March 2003

Re: machine readable properties

From: Robin Berjon <robin.berjon@expway.fr>
Date: Tue, 18 Mar 2003 19:21:56 +0100
Message-ID: <3E7763C4.70002@expway.fr>
To: Yung-Fong Tang <ftang@netscape.com>
Cc: www-style@w3.org
Yung-Fong Tang wrote:
> Basically you want the Appendix F of CSS2 in a seperate plain text file, 
> right?
> It looks http://www.w3.org/TR/REC-CSS2/css2.txt won't do the job because 
> it contains other formatting information.

Thanks for the pointer. It could do the job as it would be fairly easily 
parsable by matching on the fixed width columns of the table and munging the 
values back into lines.

In the meantime I wrote a brute force Perl screen scraper (ah, that good old 
'90s feel...) that finds regularities in the HTML of the spec to produce an XML 
dump of the properties. I haven't yet checked that it makes much sense beyond 
being WF. I've attached it in case anyone is interested (I can provide the 
script too). Further parsing of the value spaces as well as recognition of 
common options (yes, no, aural, N/A...) could be fairly easily added.

Robin Berjon <robin.berjon@expway.fr>
Research Engineer, Expway        http://expway.fr/
7FC0 6F5F D864 EFB8 08CE  8E74 58E6 D5DB 4889 2488

Received on Tuesday, 18 March 2003 13:20:01 UTC

This archive was generated by hypermail 2.3.1 : Monday, 2 May 2016 14:27:06 UTC