ideas for standard JSON-based semantic representation

There are currently quite a few cloud-based natural language application
development toolkits, all with their own proprietary result formats, even
though their functionality doesn't differ too much. Proprietary formats
shouldn't be necessary. It would be extremely useful to have a standard
representation for natural language results for many reasons; for example,
to make it easier to switch vendors and to encourage the development of
third-party natural language development tools. The EMMA standard
(https://w3c.github.io/emma/emma2_0/emma_2_0_editor_draft.html) was
developed for representing semantic results and has the ability to represent
a rich set of metadata about semantic processing. EMMA would be a good
option for use as a standard with current toolkits. However, EMMA is an XML
format and all of the current toolkit result formats are based on JSON,
which is very popular with developers. I think it should be possible to
develop a JSON format that captures the kind of information that's contained
in EMMA. To that end,  I put together a writeup with some suggestions for
representing natural language results using JSON syntax and added it to the
Voice Interaction GitHub repository

HTML rendered version:
https://w3c.github.io/voiceinteraction/voice%20interaction%20drafts/emmaJSON
.htm 

Repository:
https://github.com/w3c/voiceinteraction/tree/master/voice%20interaction%20dr
afts/emmaJSON.htm
<https://github.com/w3c/voiceinteraction/tree/master/voice%20interaction%20d
rafts/emmaJSON.htm%20%0d> 

 

Please take a look and send comments to this list, or post them in the group
wiki, https://github.com/w3c/voiceinteraction/wiki/Home/_edit 

We have the option to eventually publish some version of this as a Community
Group report.

 

Received on Tuesday, 27 November 2018 19:47:33 UTC