
Re: [Fwd: DBpedia Pages now include RDFa]

From: Kingsley Idehen <kidehen@openlinksw.com>
Date: Thu, 17 Sep 2009 11:54:52 -0400
Message-ID: <4AB25BCC.8000801@openlinksw.com>
To: Shane McCarron <shane@aptest.com>
CC: Ivan Herman <ivan@w3.org>, W3C RDFa task force <public-rdf-in-xhtml-tf@w3.org>, public-rdfa@w3.org
Shane McCarron wrote:
> Happy to!
> Basically, what you need to remember is that XHTML+RDFa is an XML 
> dialect, so the document first and foremost needs to be well-formed XML.

This I know, the page is generated :-)
>   In particular this means that elements with EMPTY content models 
> (like img) need to be terminated, attributes need to have values 
> and those values need to be quoted, and ampersands that are literal 
> ampersands need to be expressed as entities.  So... for example, 
> embedded URLs need to have ampersands changed to &amp;.  img elements 
> need to have '/>' at their end.
> You can see the complete set of errors for that page at 
> http://validator.w3.org/check?uri=http%3A%2F%2Fdbpedia.org%2Fpage%2FAlbert_Einstein&charset=(detect+automatically)&doctype=Inline&group=0 
> Some of the errors are not really a big deal - parsers don't care 
> about validation per se.  But they do care about well-formedness so it 
> is important the document be well formed.
> Does this help?  Or do you need some more specific suggestions?
I just need a technical description of the problems; I am very familiar 
with the technology and the technical details.

Basically, I just need rapid-fire identification of each technical issue.

Since we have our own DBMS-hosted XML parser, I just need to check why 
we didn't add validation to the generation process :-)
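
A minimal sketch of such a generation-time check, assuming Python's 
standard-library expat bindings rather than the DBMS-hosted parser 
mentioned above. The function name well_formedness_error and the sample 
fragments (including the example.org URL) are hypothetical placeholders, 
not taken from the DBpedia page. It tests well-formedness only, not DTD 
validity, and a full XHTML page that uses named entities such as &nbsp; 
would need its DTD handled separately.

```python
import xml.parsers.expat
from xml.sax.saxutils import quoteattr


def well_formedness_error(markup):
    """Return None if markup is well-formed XML, else a short description."""
    parser = xml.parsers.expat.ParserCreate()
    try:
        parser.Parse(markup, True)  # True marks this as the final chunk
    except xml.parsers.expat.ExpatError as err:
        reason = xml.parsers.expat.ErrorString(err.code)
        return "line %d, column %d: %s" % (err.lineno, err.offset, reason)
    return None


# Placeholder fragments illustrating the problem classes named in the thread.
samples = {
    "unterminated img": '<p><img src="a.png" alt="x"></p>',
    "terminated img": '<p><img src="a.png" alt="x" /></p>',
    "attribute without value": '<p><input disabled /></p>',
    "unquoted attribute value": '<p><a href=page>x</a></p>',
    "raw ampersand in URL": '<a href="http://example.org/?a=1&b=2">x</a>',
    # quoteattr() escapes the ampersand and adds the surrounding quotes.
    "escaped ampersand in URL": "<a href=%s>x</a>"
    % quoteattr("http://example.org/?a=1&b=2"),
}

for label, markup in samples.items():
    print(label, "->", well_formedness_error(markup) or "well formed")
```

Running the generated markup through a check like this before publishing 
would flag the error classes listed above without waiting for an external 
validator.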

> Kingsley Idehen wrote:
>> Shane McCarron wrote:
>>> I agree that this is great news.  However, when I went and LOOKED at 
>>> the page we use in our examples, 
>>> http://dbpedia.org/page/Albert_Einstein , I see that it is invalid 
>>> in a number of respects.  It is unfortunate that dbpedia is at the 
>>> mercy of the resources they troll.  The markup ends up being hard to 
>>> manage, I suspect.
>>> I was forced to put my parser into "html" mode to get it to swallow 
>>> that page:
>>> http://htmlwg.mn.aptest.com/rdfa/extract_rdfa.pl?format=n3&type=html&uri=http://dbpedia.org/page/Albert_Einstein 
>> Shane,
>> Just provide feedback and we'll have it fixed. I am aware of some of the 
>> issues above.
>> We are in a continuum, nothing is static, etc.
>> Just provide feedback :-)
>> Kingsley
>>> Ivan Herman wrote:
>>>> FYI
>>>> -------- Original Message --------
>>>> Subject: DBpedia Pages now include RDFa
>>>> Resent-Date: Wed, 16 Sep 2009 15:32:01 +0000
>>>> Resent-From: public-lod@w3.org
>>>> Date: Wed, 16 Sep 2009 11:31:19 -0400
>>>> From: Kingsley Idehen <kidehen@openlinksw.com>
>>>> To: public-lod@w3.org
>>>> CC: public-lod <public-lod@w3.org>,
>>>> "dbpedia-discussion@lists.sourceforge.net"
>>>> <dbpedia-discussion@lists.sourceforge.net>
>>>> References:
>>>> <f914914c0909160749s4adaa3b5q517b418a1b0a9619@mail.gmail.com>
>>>> <4AB1024E.4000309@openlinksw.com>
>>>> All,
>>>> Quick FYI.
>>>> All DBpedia pages now include RDFa. Thus, re. the DBpedia Linked Data
>>>> Space, you now have HTML+RDFa as a structured metadata representation
>>>> alternative to N3, Turtle, RDF/XML, and RDF/JSON (*new*).



Kingsley Idehen	      Weblog: http://www.openlinksw.com/blog/~kidehen
President & CEO 
OpenLink Software     Web: http://www.openlinksw.com
Received on Thursday, 17 September 2009 15:55:29 UTC
