W3C home > Mailing lists > Public > www-international@w3.org > January to March 2002

ANN: Gorille 0.2

From: Simon St.Laurent <simonstl@simonstl.com>
Date: 07 Jan 2002 16:38:01 -0500
To: www-international@w3.org
Message-Id: <1010439482.2128.6.camel@localhost.localdomain>
As the shift in Unicode handling between XML 1.0 and 1.1 seems relevant to
W3C internationalization work, I thought I'd post this here.

I'm happy to announce a second release of the configurable Gorille 
XML/Unicode character tester.  Like the earlier release, it uses XML-based 
configuration files to specify which characters should be permitted in 
particular XML contexts.  This version adds support for both namespaces and 
public identifiers, as well as an experimental SAX filter.

Gorille is a small Java package designed to let developers of various kinds 
of XML processors test the content and names of XML structures in their XML 
documents. While Gorille ships with test files for both XML 1.0 and the 
draft XML 1.1, you can create your own configuration files as well.

Gorille uses an XML format to specify lists of characters according to 
either XML 1.0 conventions (with its BaseChar, Ideographic, CombiningChar, 
Digit, and Extender productions) or XML 1.1 conventions (NameStartChar, 
NameChar). Both forms permit specification of the Char and S production for 
content characters and whitespace. I've included sample lists for both XML 
1.0 and XML 1.1, as well as an ASCII-only version of XML 1.0.

Gorille is now hosted at SourceForge, complete with mailing lists and CVS:
http://gorille.sourceforge.net

I would especially like to hear from developers who can give Gorille more 
thorough testing on a wider range of Unicode than I have been able to do so 
far.  The SAX Filter in particular needs some tire-kicking, as most SAX 
parsers already perform the XML 1.0 version of Gorille's tests, making it 
difficult to get large numbers of faulty events into Gorille.

Despite my interests in Unicode and character encoding issues in XML, I 
still live in a largely ASCII universe, and no doubt some subtleties have 
escaped me.

Contributions, bug reports, and general comments on the usefulness or lack 
thereof of this tool are all quite welcome.

Simon St.Laurent
Associate Editor, O'Reilly & Associates
http://simonstl.com
Received on Monday, 7 January 2002 15:34:07 GMT

This archive was generated by hypermail 2.2.0+W3C-0.50 : Tuesday, 2 June 2009 19:16:58 GMT