- From: Tim Landscheidt <tim@tim-landscheidt.de>
- Date: Tue, 27 Dec 2011 00:12:36 +0000
- To: xmlschema-dev@w3.org
Hi,

trying to use the online validator on Wikipedia XML documents and schemas (cf. <URI:http://www.w3.org/2001/03/webdata/xsv?docAddrs=http://de.wikipedia.org/w/index.php?title=Vorlage:Infobox_Ort_in_Griechenland/XML%26action%3Draw+http://de.wikipedia.org/w/index.php?title=User:Revvar/VM/XML-Schema%26action%3Draw&warnings=on&style=xsl>) yields:

| validator crash during target reading
| Error: can't retrieve "http://de.wikipedia.org/w/index.php?title=Vorlage:Infobox_Ort_in_Griechenland/XML&action=raw": 403 Forbidden

This ("403 Forbidden") is most probably due to the online validator not supplying a "User-Agent" header (or using a default one that is too common) and thus being rejected by Wikipedia; this policy is explained a bit at <URI:http://www.mediawiki.org/wiki/API%3aFAQ#do_I_get_HTTP_403_errors.3F>.

So, if this is the case, it would be nice if the online validator could provide a suitable "User-Agent" header. Some information on how to achieve that with Python is available inter alia at <URI:http://stackoverflow.com/questions/120061/fetch-a-wikipedia-article-with-python>.

Thanks in advance,
Tim

P. S.: There's also a typo in "report problems (and sucesses!) to xmlschema-dev (archive)" :-).
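
For illustration, a minimal sketch of how a descriptive "User-Agent" header could be sent from Python. It assumes Python 3 and the standard urllib.request module; the validator's actual fetching code is not shown in this message and may look quite different. The USER_AGENT string and the helper names build_request and fetch are hypothetical and chosen only for this example.

```python
import urllib.request

# Hypothetical identification string (an assumption for this sketch);
# a real service would name itself and give a working contact address.
USER_AGENT = "ExampleValidator/0.1 (contact: admin@example.org)"


def build_request(url, user_agent=USER_AGENT):
    """Return a Request object that carries an explicit User-Agent header."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


def fetch(url, user_agent=USER_AGENT, timeout=30):
    """Fetch url with the explicit User-Agent and return the body as bytes."""
    request = build_request(url, user_agent)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read()
```

Calling fetch() with the "action=raw" URL quoted in the error message would then send the custom header instead of the library default. Whether that alone avoids the 403 depends on Wikipedia's current policy.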
Received on Tuesday, 27 December 2011 05:55:33 UTC