- From: Sebastian Hellmann <hellmann@informatik.uni-leipzig.de>
- Date: Fri, 31 Jan 2014 15:52:44 +0100
- To: public-openannotation <public-openannotation@w3.org>, Antoine Isaac <aisaac@few.vu.nl>, Dimitris Kontokostas <kontokostas@informatik.uni-leipzig.de>
- Message-ID: <52EBB8BC.8050905@informatik.uni-leipzig.de>
Hi all, sorry to come late to the discussion and also for not reading all the emails. - One of the biggest problems of NIF was actually not the adoption, but the fact that the existing 40 implementation all interpreted the spec a little bit different. In order to improve this fact, we recently developed a generic validator for data, which allows test-driven data (and web service development). Please find the recently WWW paper here: http://svn.aksw.org/papers/2014/WWW_Databugger/public.pdf (Please keep Dimitris in CC). We are using it for the area of Linguistics: http://svn.aksw.org/papers/2014/ESWC_NLP_Cleansing/public.pdf - As a result of the standardization of the Internationalization Tagset 2.0, it was concluded that: RDFa was completely unsuitable for annotation, mostly because you loose most of the information "what" is the exact target of an annotation (i.e. parsing results in de-contextualised triples). We wrote a bit about this here on page 9 [1] Note the elegant format to embed annotations in RDF ***Disclaimer: this does not cover full OA, but the 20 data categories cover a lot of use cases and tool support is widely given in the language technology world. **** from http://www.w3.org/TR/its20/#conversion-to-nif *<!DOCTYPE html>**<html* xmlns="http://www.w3.org/1999/xhtml"*>* *<head>**<meta* http-equiv="Content-Type" content="text/html;charset=utf-8"* >* *<title>*NIF conversion example*</title>**</head>* *<body>**<h2* translate="yes"*>*Welcome to*<span* its-ta-ident-ref="http://dbpedia.org/resource/Dublin" its-within-text="yes" translate="no"*>*Dublin*</span>* in*<b* translate="no" its-within-text="yes"*>*Ireland*</b>*!*</h2>**</body>**</html>* - Personally, I like the new IRI-enabled N-Triples format: http://www.w3.org/TR/n-triples/, which is compatible with Turtle in HTML: http://www.w3.org/TR/turtle/#in-html (Line based formats are very robust) All the best, Sebastian [1] http://svn.aksw.org/papers/2013/ISWC_NIF/public.pdf -- Sebastian Hellmann Department of Computer Science, University of Leipzig Events: * *30th January, 2014*: 1st DBpedia Community meeting (http://wiki.dbpedia.org/meetings/Amsterdam2014) * *Sept. 2014* MLODE Venha para a Alemanha como PhD: http://bis.informatik.uni-leipzig.de/csf Projects: http://dbpedia.org, http://nlp2rdf.org, http://linguistics.okfn.org, http://dbpedia.org/Wiktionary Homepage: http://bis.informatik.uni-leipzig.de/SebastianHellmann Research Group: http://aksw.org Stop asking, it's here: http://tinyurl.com/sh-thesis-summary http://tinyurl.com/sh-thesis
Received on Friday, 31 January 2014 14:54:40 UTC