W3C home > Mailing lists > Public > www-xsl-fo@w3.org > March 2002

Enhanced SGML to XML conversion based on SP tools

From: G. Ken Holman <gkholman@CraneSoftwrights.com>
Date: Fri, 15 Mar 2002 22:51:10 -0500
Message-Id: <5.1.0.14.0.20020315224922.0210ff38@junk>
To: XSLFO-WWW <www-xsl-fo@w3.org>
James Clark's SX tool fits many people's requirements for SGML to XML 
conversion.

When I needed more features than offered by SX, I wrote a Python 
application to read the output from NSGMLS and emit XML syntax supporting:

  - specifiable document type declaration information
  - mapping of ISO SDATA entities to Unicode characters
  - gleaning of document type definition internal declaration subset

This is now available as a freely-downloadable resource from our Resource 
Library linked from the top of our home page noted in my trailer below.

T:\sgml>python n2x.py -?

Arguments: {options} {filename}

Options:   -  = use stdin (also true if the filename is absent)
            -? = print help
            -noi or -nointernal = suppress the internal declaration subset
            -non or -nonotation = suppress notation declarations
            -noe or -noerrors = suppress errors for NDATA entity references
            -nos or -noSDATA = suppress sdata entity replacement
            -p:public-id or -public:public-id or "-p:public-id with spaces"
            -s:system-id or -system:system-id or "-s:system-id with spaces"

Assumes:   - the SDATA entity replacements are of the form "[entity-name]"

Input:     - the output from the nsgmls tool found at http://www.jclark.com/sp

IMPORTANT: - the "-bUTF-8" runtime option must be specified for nsgmls

Example:   nsgmls -bUTF-8 <test.sgm >test.nsgmls
            python n2x.py <test.nsmgls >test.xml

Note:      - the options "-noi -nos -noe" reproduces the output from SX

T:\sgml>

We hope that people find this program useful and can take the time to 
report to us any problems they find so we can improve the program for 
everyone.  It meets the needs that we have, but we would like to ensure it 
meets others' needs as well.

Thanks!

.......................... Ken

cc:
XSL List
XML-DEV
XML-DOC
XML-L
XSLFO
XSLFO-WWW
xalan-dev
fop-user
comp.text.xml
microsoft.public.xml

--
Upcoming: 3-days XSLT/XPath and/or 2-days XSLFO: June 17-21, 2002
-       : 3-days XML Information Modeling: July 31-August 2, 2002

G. Ken Holman                mailto:gkholman@CraneSoftwrights.com
Crane Softwrights Ltd.         http://www.CraneSoftwrights.com/f/
Box 266, Kars, Ontario CANADA K0A-2E0 +1(613)489-0999 (Fax:-0995)
ISBN 0-13-065196-6                        Definitive XSLT & XPath
ISBN 1-894049-08-X  Practical Transformation Using XSLT and XPath
ISBN 1-894049-07-1               Practical Formatting Using XSLFO
XSL/XML/DSSSL/SGML/OmniMark services, books(electronic, printed),
articles, training(instructor-live,Internet-live,web/CD,licensed)
Next public training:   2002-04-08,09,10,11,05-06,07,09,10,13,20,
-                                06-04,07,10,11,13,14,17,20,07-31
Received on Friday, 15 March 2002 23:02:19 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Wednesday, 3 October 2007 16:06:09 GMT