- From: Sebastian Hellmann <hellmann@informatik.uni-leipzig.de>
- Date: Fri, 31 Jan 2014 15:52:44 +0100
- To: public-openannotation <public-openannotation@w3.org>, Antoine Isaac <aisaac@few.vu.nl>, Dimitris Kontokostas <kontokostas@informatik.uni-leipzig.de>
- Message-ID: <52EBB8BC.8050905@informatik.uni-leipzig.de>
Hi all,
sorry to come late to the discussion and also for not reading all the
emails.
- One of the biggest problems of NIF was actually not the adoption, but
the fact that the existing 40 implementation all interpreted the spec a
little bit different.
In order to improve this fact, we recently developed a generic validator
for data, which allows test-driven data (and web service development).
Please find the recently WWW paper here:
http://svn.aksw.org/papers/2014/WWW_Databugger/public.pdf (Please keep
Dimitris in CC).
We are using it for the area of Linguistics:
http://svn.aksw.org/papers/2014/ESWC_NLP_Cleansing/public.pdf
- As a result of the standardization of the Internationalization Tagset
2.0, it was concluded that: RDFa was completely unsuitable for
annotation, mostly because you loose most of the information "what" is
the exact target of an annotation (i.e. parsing results in
de-contextualised triples).
We wrote a bit about this here on page 9 [1]
Note the elegant format to embed annotations in RDF
***Disclaimer:
this does not cover full OA, but the 20 data categories cover a lot of
use cases and tool support is widely given in the language technology
world.
****
from http://www.w3.org/TR/its20/#conversion-to-nif
*<!DOCTYPE html>**<html* xmlns="http://www.w3.org/1999/xhtml"*>*
*<head>**<meta* http-equiv="Content-Type" content="text/html;charset=utf-8"* >*
*<title>*NIF conversion example*</title>**</head>*
*<body>**<h2* translate="yes"*>*Welcome to*<span*
its-ta-ident-ref="http://dbpedia.org/resource/Dublin" its-within-text="yes"
translate="no"*>*Dublin*</span>* in*<b* translate="no" its-within-text="yes"*>*Ireland*</b>*!*</h2>**</body>**</html>*
- Personally, I like the new IRI-enabled N-Triples format:
http://www.w3.org/TR/n-triples/, which is compatible with Turtle in
HTML: http://www.w3.org/TR/turtle/#in-html
(Line based formats are very robust)
All the best,
Sebastian
[1] http://svn.aksw.org/papers/2013/ISWC_NIF/public.pdf
--
Sebastian Hellmann
Department of Computer Science, University of Leipzig
Events:
* *30th January, 2014*: 1st DBpedia Community meeting
(http://wiki.dbpedia.org/meetings/Amsterdam2014)
* *Sept. 2014* MLODE
Venha para a Alemanha como PhD: http://bis.informatik.uni-leipzig.de/csf
Projects: http://dbpedia.org, http://nlp2rdf.org,
http://linguistics.okfn.org, http://dbpedia.org/Wiktionary
Homepage: http://bis.informatik.uni-leipzig.de/SebastianHellmann
Research Group: http://aksw.org
Stop asking, it's here:
http://tinyurl.com/sh-thesis-summary
http://tinyurl.com/sh-thesis
Received on Friday, 31 January 2014 14:54:40 UTC