- From: Martynas Jusevičius <martynas@graphity.org>
- Date: Fri, 25 Nov 2016 15:43:09 +0100
- To: Felix Sasaki <fsasaki@w3.org>
- Cc: public-rax@w3.org
Hey all,

(resending from the right address)

sorry for not being able to participate today. I updated my section on XML formats for quads however:
https://www.w3.org/community/rax/wiki/Draft_Material#XML_formats_for_RDF_datasets_.28quads.29

Nice weekend everyone,

Martynas
atomgraph.com

On Fri, Nov 25, 2016 at 2:46 PM, Felix Sasaki <fsasaki@w3.org> wrote:
> See
>
> https://www.w3.org/2016/11/25-rax-minutes.html
>
> and below as text. We discussed the two use cases from Christopher
>
> https://www.w3.org/community/rax/wiki/Draft_Material#Data_acquisition_from_job_postings_via_GATE
> https://www.w3.org/community/rax/wiki/Draft_Material#AutomationML_industry_automation_models_integration
>
> and issues with converting (potentially with back conversion = round tripping) from XML/HTML to RDF. From that we may derive some general patterns that may be worth documenting. We will provide examples of input output in the github - feel free to do the same. Next call would be 9 December.
>
> Best,
>
> Felix
>
> [1]W3C
>
> [1] http://www.w3.org/
>
> - DRAFT -
>
> rax cg
>
> 25 Nov 2016
>
> [2]Agenda
>
> [2] https://lists.w3.org/Archives/Public/public-rax/2016Nov/0008.html
>
> See also: [3]IRC log
>
> [3] http://www.w3.org/2016/11/25-rax-irc
>
> Attendees
>
> Present
> philr, felix, timea, christoph
>
> Regrets
> christian, gerard, jose
>
> Chair
> phil
>
> Scribe
> fsasaki
>
> Contents
>
> * [4]Topics
>   1. [5]meeting start
>   2. [6]bdva summit
>   3. [7]AOB
> * [8]Summary of Action Items
> * [9]Summary of Resolutions
> __________________________________________________________
>
> meeting start
>
> phil: did a review of use cases this morning. not too much change, missed one that christoph added.
>
> [10]https://www.w3.org/community/rax/wiki/Draft_Material#Data_acquisition_from_job_postings_via_GATE
>
> [10] https://www.w3.org/community/rax/wiki/Draft_Material#Data_acquisition_from_job_postings_via_GATE
>
> phil: thanks a lot for adding this, christoph - can you give a brief description?
>
> christoph: sure. have not yet managed to share the descriptions, I have more material, and will get it done to share this
> ... will also add more concrete examples. Application setting is: we collect job postings in the form of plain text from the web
> ... we do named entity recognition with GATE, and we get XML output
> ... beginning and end of each token is annotated
>
> <clange> text text text <start/>recognised entity<end/> text text
>
> christoph: see above XML example. this has to be translated to RDF
>
> <clange> <start id="foo"/>
>
> <clange> <start href="#foo"/>
>
> christoph: start and end tags look like the above
>
> <clange> ids or refs (forgot which direction) are in these start/end tags
>
> christoph: we are using XSLT based tool I developed (trextor) to create RDF. it is quite hard
>
> <clange> krextor
>
> christoph: with XPath it is hard to select elements between start and end tags
> ... that is a bit tricky, you need a good knowledge of XPath, the sibling axes etc.
> ... in context of European project, in which another partner is doing the extraction
>
> phil: is this similar to Martynas' case?
>
> christoph: in terms of XPath complexity, yes
> ... general XML to RDF transformation issue?
>
> [11]https://github.com/fsasaki/its20-extractor/tree/master/wikipedia-extractor
>
> [11] https://github.com/fsasaki/its20-extractor/tree/master/wikipedia-extractor
>
> <philr> felix: I've written various converters
>
> <philr> ...it is always special case issues
>
> <philr> ...XML has various ways to include content
>
> <philr> ...special purpose handling is somewhat unavoidable
>
> <philr> ...example documents with guidance would be useful
>
> scribe: may be useful to give guidance on how to handle various cases
>
> christoph: there are patterns, e.g. parent child relations in XML and RDF properties
> ... for this you can provide high level translation patterns
>
> <philr> clange: High level translation is possible with simple parent-child relationships
>
> <philr> felix: mixture of text and element nodes is challenging
>
> [12]https://github.com/fsasaki/its20-extractor/blob/master/wikipedia-extractor/its-ta-2-nif-wikipedia.xsl#L43
>
> [12] https://github.com/fsasaki/its20-extractor/blob/master/wikipedia-extractor/its-ta-2-nif-wikipedia.xsl#L43
>
> <clange> fsasaki: handling of specific links (specific to wiki markup)
>
> phil: in FREME project we are also doing named entity recognition on plain text.
> our services are capable of returning Turtle files, but we can cover many formats
>
> [13]https://api-dev.freme-project.eu/ckeditor-dev/ckeditor/samples/freme.html
>
> [13] https://api-dev.freme-project.eu/ckeditor-dev/ckeditor/samples/freme.html
>
> various types of output, inline or external using JSON-LD
>
> <scribe> ACTION: felix to provide examples of round tripping as done in the freme project [recorded in [14]http://www.w3.org/2016/11/25-rax-minutes.html#action01]
>
> [14] http://www.w3.org/2016/11/25-rax-minutes.html#action01
>
> bdva summit
>
> <philr> felix: to collect information on what better tooling is needed
>
> <philr> ...best practices and standardization
>
> <philr> ...1.5 hour session on requirements
>
> <philr> clange: is there more I can do if I do not attend the summit?
>
> <philr> felix: it would be good if someone from your organization could attend
>
> <philr> ...questionnaire to bdva members but want input from companies
>
> <philr> Is there a fee to join bdva?
>
> felix: yes, will send info on that
>
> <clange> fsasaki 14:29: EU is not necessarily interested in new standards being developed, but in existing standards to be _applied_ in a better way
>
> thanks, clange
>
> discussion on automationML use case
>
> felix will send further infos on BDVA around
>
> AOB
>
> next meeting 9th of December
>
> phil cannot make it, christian to chair
>
> Summary of Action Items
>
> [NEW] ACTION: felix to provide examples of round tripping as done in the freme project [recorded in [15]http://www.w3.org/2016/11/25-rax-minutes.html#action01]
>
> [15] http://www.w3.org/2016/11/25-rax-minutes.html#action01
>
> Summary of Resolutions
>
> [End of minutes]
> __________________________________________________________
>
> Minutes formatted by David Booth's [16]scribe.perl version 1.148 ([17]CVS log)
> $Date: 2016/11/25 13:41:09 $
>
> [16] http://dev.w3.org/cvsweb/~checkout~/2002/scribe/scribedoc.htm
> [17] http://dev.w3.org/cvsweb/2002/scribe/
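
On the point in the minutes about how hard it is to select what lies between <start/> and <end/> tags with XPath: a minimal sketch of one alternative, walking the sibling sequence in Python instead of using the sibling axes. It assumes that <start> carries the id and the matching <end> carries href="#id" (the minutes leave the direction open), and that both milestones are children of the same parent. The sample sentence, the <b> element and the function name milestone_spans are made up for illustration; this is not how Krextor does it.

```python
import xml.etree.ElementTree as ET


def milestone_spans(parent):
    """Return (id, text) pairs for each <start id="x"/> ... <end href="#x"/> pair
    found among the direct children of `parent`.

    Assumption: <start> holds the id and <end> points back to it via href.
    """
    spans = []
    open_id = None   # id of the <start/> we are currently inside, if any
    buf = []         # text fragments collected since that <start/>
    for child in parent:
        if child.tag == "start":
            open_id = child.get("id")
            # Text directly after an empty element is stored in its tail.
            buf = [child.tail or ""]
        elif (child.tag == "end" and open_id is not None
              and child.get("href") == "#" + open_id):
            spans.append((open_id, "".join(buf)))
            open_id = None
        elif open_id is not None:
            # An ordinary element between the milestones: keep its text
            # content and whatever text follows it.
            buf.append("".join(child.itertext()))
            buf.append(child.tail or "")
    return spans


# Hypothetical input in the style of the example from the minutes.
sample = (
    '<doc>We seek a <start id="e1"/>Java <b>developer</b><end href="#e1"/>'
    ' in <start id="e2"/>Bonn<end href="#e2"/>.</doc>'
)
root = ET.fromstring(sample)
for span_id, text in milestone_spans(root):
    print(span_id, repr(text))
```

Each (id, text) pair could then be turned into whatever RDF the use case needs; the mixed text and element content between the milestones is exactly the part that makes a pure XPath selection awkward.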
Received on Friday, 25 November 2016 14:43:44 UTC