Online validator and *.wikipedia.org

Hi,

trying to use the online validator on Wikipedia XML docu-
ments and schemas (cf.
<URI:http://www.w3.org/2001/03/webdata/xsv?docAddrs=http://de.wikipedia.org/w/index.php?title=Vorlage:Infobox_Ort_in_Griechenland/XML%26action%3Draw+http://de.wikipedia.org/w/index.php?title=User:Revvar/VM/XML-Schema%26action%3Draw&warnings=on&style=xsl>)
yields:

| validator crash during target reading
| Error: can't retrieve "http://de.wikipedia.org/w/index.php?title=Vorlage:Infobox_Ort_in_Griechenland/XML&action=raw": 403 Forbidden

This ("403 Forbidden") is most probably due to the online
validator not supplying a "User-Agent" header (or using a
default one too common) and thus being rejected by Wikipe-
dia; this policy is explained a bit at
<URI:http://www.mediawiki.org/wiki/API%3aFAQ#do_I_get_HTTP_403_errors.3F>.

  So - if this is the case - it would be nice if the online
validator could provide a suitable "User-Agent" header.
Some information on how to achieve that with Python is
available inter alia at
<URI:http://stackoverflow.com/questions/120061/fetch-a-wikipedia-article-with-python>.

Thanks in advance,
Tim

P. S.: There's also a typo in "report problems (and
       sucesses!) to xmlschema-dev (archive)" :-).

Received on Tuesday, 27 December 2011 05:55:33 UTC