- From: Robin Berjon <robin@berjon.com>
- Date: Fri, 29 May 2009 15:46:42 +0200
- To: Henry S. Thompson <ht@inf.ed.ac.uk>
- Cc: www-tag@w3.org
On May 29, 2009, at 14:41 , Henry S. Thompson wrote:
> All of which raises the question: What _would_ constitute reliable
> evidence of frequency of usage of the four major schema languages
> (DTD, XSD, Relax NG, Schematron)?
>
> Note once again in closing that this is _not_ a "my language is better
> than your language because more people are using it" discussion, but
> rather an attempt to support the proposition that maintaining and
> improving W3C XML Schema is important for W3C because it has a
> substantial user community on (and off) the Web.
If a survey is made, a good model could perhaps be the State of the
Web 2008 survey:
http://www.webdirections.org/the-state-of-the-web-2008/
That being said, sorry if I'm a little thick but I decreasingly
understand what we would be trying to achieve with such a survey? As
passionately as I may dislike XML Schema, there is little doubt that
it is used a lot out there. Or more precisely, that a lot of languages
are defined using XML Schema — the distinction being that in a fair
number of cases that I've seen the schema may be in the specification
but then no one ever uses it as part of the production chain.
Which brings me to my primary point: more than reflecting on what
schema language should get what resources in times of constraint,
shouldn't we take a step back, look at how XML is used today, and
figure out how we would like to help and shape it over the next
decade? I'm willing to bet that any serious take on this will shake
out precisely what needs be done around the schema issue — and will be
more generally useful.
In my experience, an awful lot of users out there pick XML Schema
either because they don't know about the alternatives, or because
"it's the W3C thing". Since as often as not they don't actually put
the schema to use — in fact it's not rare that their schemata would be
seriously broken, even though betters tools are reducing that issue —
they don't really have an opinion on whether XML Schema or anything
else is what they need. It's just there.
This is strongly tied to the misperception that when defining a
language one absolutely needs to have a schema, and the joint
misperception that having defined a schema, processing rules for user-
agent and versioning strategies are naturally solved issues.
What I'm getting at here is that I believe that what we most need is
some form of guidance — possibly in the form of best practices — for
the usage of XML in various situations. Whether W3C is the best place
to define these (given that they wouldn't produce a standard) is an
exercise I'm happy to leave to another day — though an IG could
perhaps work.
There's a lot of experience in defining XML languages from various
groups, notably inside W3C. It would be great at the very least to
document that. It seems to me that whenever I stumble into a new
standardisation effort both inside W3C and outside (e.g. OMTP, OMA,
3GPP, TV Anytime, MPEG...) I have to restate the very same things
about picking the schema language that best fits based on genuine use
cases and requirements rather than "just because", on the fact that a
schema is rarely enough for validation, on versioning strategies, on
user-agent error-handling, on processing models... and my throat feels
rather raw from all that typing, especially since it usually takes a
few rounds of discussion before people start to understand the
problem. I'd love to have something to point at instead.
I took a first semi-serious stab at it for the latest XML Prague (PDF: http://www.xmlprague.cz/2009/presentations/XMLPrague_2009_proceedings.pdf
, slides + video: http://river-valley.tv/designing-xmlweb-languages-a-review-of-common-mistakes/)
and it was well received — in fact I'm still getting questions about
it even from people who weren't there. I have neither the time, the
resources, nor the experience to elaborate on that in a solid, serious
manner that will cover enough of the XML-using community — but I think
that identifying problematic usage patterns will get us closer to
something widely useful than figuring out which schema language is
used most. The broader view on XML usage ought to shake out what
really needs resources.
--
Robin Berjon - http://berjon.com/
Feel like hiring me? Go to http://robineko.com/
Received on Friday, 29 May 2009 13:47:15 UTC