- From: Michael Miller <mmiller@systemsbiology.org>
- Date: Fri, 17 Dec 2010 09:23:39 -0800
- To: Peter.Hendler@kp.org
- Cc: conor-dowling@caregraf.com, public-semweb-lifesci@w3.org, public-semweb-lifesci-request@w3.org
- Message-ID: <69ecf2a4ee8856e5a1474ebacec51642@mail.gmail.com>
hi peter,

"You don't gain anything by decomposing it and recomposing it into RDF."

the scenarios where i have seen this work is where the data itself isn't touched, the application makes the transformation to RDF in memory then applies semantic tools. so if i want to see if there are patients suitable for a clinical trial, a sparql query might describe the criteria quite nicely. then i could run over EHR records, translating to RDF triples in memory then applying the query. depending on the backing store, i've also seen this done successfully by translating the query to sql then turning the query result into rdf triples to answer the SPARQL query.

cheers,
michael

*From:* Peter.Hendler@kp.org [mailto:Peter.Hendler@kp.org]
*Sent:* Thursday, December 16, 2010 4:37 PM
*To:* mscottmarshall@gmail.com
*Cc:* conor-dowling@caregraf.com; mmiller@systemsbiology.org; public-semweb-lifesci@w3.org; public-semweb-lifesci-request@w3.org; twclark@nmr.mgh.harvard.edu
*Subject:* Re: Wait a sec...What about the HL7 RIM An Universal Exchange Language

Just want to be clear about when we use the word HL7. In the USA when you just say HL7 it is assumed you mean Version 2 which is in very wide use and is not OO at all. You almost have to say RIM based or V3 before people assume you mean the OO model (RIM).

There is a specific XML serialization of the V3 RIM called the XML ITS. It generates the XML elements, attributes and the order that they must be serialized in. When you are dealing with a CDA document, then in addition to these rules there are "templates" that are mostly just verbal human readable rules that you must follow for a given template. Templates refer to each other, so the C32 document mentioned in the Meaningful Use rules of the HITECH part of the ARRA act is made from HL7 and IHE and HITSP templates all referring to each other. These templates specify many things including which vocabularies must be used (SNOMED for example for coding the diagnosis).
For a C32 templated CDA you have no wiggle room. The XML is defined. You don't gain anything by decomposing it and recomposing it into RDF. There is nothing to say that you couldn't come up with different ITSs. For example, you could come up with a JSON ITS and I suppose you could also come up with another RDF ITS. I'm just not convinced it would add anything to the understandability and I think it would add a heck of a lot of size to the representation.

Because the diagnosis and much of the clinical data contained in a CDA is in SNOMED, I do see the benefits of using subsumption on the SNOMED codes. For example, you could ask for all the patients that had any form of diabetes or heart disease and then use subsumption to get the lists of possible SNOMED codes that would be in the records of the target patients.

What I don't see is the advantage of using RDF instead of the standard RIM XML syntax. You could come up with an entirely new model for healthcare not based on the RIM and based on RDF, but much of the world (outside the USA) is already using the RIM.

*NOTICE TO RECIPIENT:* If you are not the intended recipient of this e-mail, you are prohibited from sharing, copying, or otherwise using or disclosing its contents. If you have received this e-mail in error, please notify the sender immediately by reply e-mail and permanently delete this e-mail and any attachments without reading, forwarding or saving them. Thank you.

*"M. Scott Marshall" <mscottmarshall@gmail.com>*
12/16/2010 03:47 PM
To conor dowling <conor-dowling@caregraf.com>
cc Michael Miller <mmiller@systemsbiology.org>, Peter Hendler/CA/KAIPERM@KAIPERM, twclark@nmr.mgh.harvard.edu, public-semweb-lifesci@w3.org, public-semweb-lifesci-request@w3.org
Subject Re: Wait a sec...What about the HL7 RIM An Universal Exchange Language

I like Eric Neumann's description of RDF as "recombinant data". Agreed. Choosing something other than HL7 as the lingua franca for assertions doesn't devalue HL7!
We can be happy that we got the information from one machine to another! It's a long haul from the days of big-endian, little-endian. The value is not in the messages (or message syntax) but what is in them (the cargo, the payload). But how will we interoperate between HL7 and CDISC? I suppose that an ontology will help.. BRIDG anyone ;) Cecil?

The way that XML quietly infiltrated all our computer systems was by making it easy to describe and parse data of all shapes and sizes. Will OWL/RDF do the same by making it reasonably easy to describe the meaning of messages and documents?

HL7 isn't going away. It is the standard. So, how can its users take advantage of other (non-HL7) sources of information that are related to the contents of its messages? And how can other systems, for example, clinical research systems relate their information and constraints to HL7 data? See http://hcls.deri.org/coi/demo/ (makes use of pseudo-CDISC and HL7, and Drug Ontology), presented at AMIA.

There should be machine-readable and reason-able links from one set of assertions to the other, that can make use of context (read: provenance). Could the HL7 provenance help us make use of the 'cargo' in another context? i.e. assertion came from message issued by..

-Scott

--
M. Scott Marshall, W3C HCLS IG co-chair, http://www.w3.org/blog/hcls
Leiden University Medical Center / University of Amsterdam
http://staff.science.uva.nl/~marshall

On Thu, Dec 16, 2010 at 11:06 PM, conor dowling <conor-dowling@caregraf.com> wrote:

On Wed, Dec 15, 2010 at 8:47 AM, Michael Miller <mmiller@systemsbiology.org> wrote:

hi all,

"unambiguous identifier for "things""

i agree, this has been a known issue for many years (as you well know, tim) but its importance is just now growing as multi-omics studies and sharing of EHR records is becoming more common.

"It is HL7 V3"

i also agree, in a sense, with this.
HL7 messages capture information as a whole, as an entity, so in that representation it is also true that semantic web technologies would have a hard time, as is, making sense of them because semantic web technologies want a fact by fact representation, e.g. triple store.

But turn this on its head. HL7 messages come from "islands of data" which have undetermined linkage. Think of a lab result that has a local code, rather than LOINC. LOINC is equivalent to a link to the outside. Effectively the local code is meaningless outside.

By its nature, linked data should resolve. If there is a url, you should be able to chase it down. The equivalent of a local code is a resolvable URL which presumably leads to some sort of description of what that local concept means, perhaps enough to translate it to a more commonly understood equivalent.

You ask for any number of triples from a semantic endpoint, enough to capture what you need - all lab result assertions over a period for such-and-such a person. That's no different than a query in HL7 (or any other RPC-like mechanism). The key difference between linked data (specifically) and "islands with protocol access" is linkage: the idea that links always resolve to something meaningful, as opposed to identifiers that, while unambiguous, may lead you nowhere.

The problem with the old school, which Parsa's "30 years of XML and HL7 experience" captures nicely, is wrapped up in this. I've coded this stuff a good bit and everyone gets fixated on the syntax of messages/xml blocks. People are happy if a coded element is "correct", that it "conforms" as opposed to being useful or meaningful. And the problem lies not with them, but the mechanism. It's put the focus on "truck", not "cargo".
Conor

cheers,
michael

*From:* public-semweb-lifesci-request@w3.org [mailto:public-semweb-lifesci-request@w3.org] *On Behalf Of* Peter.Hendler@kp.org
*Sent:* Wednesday, December 15, 2010 8:18 AM
*To:* markw@illuminae.com
*Cc:* public-semweb-lifesci@w3.org; public-semweb-lifesci-request@w3.org; twclark@nmr.mgh.harvard.edu
*Subject:* Wait a sec...What about the HL7 RIM An Universal Exchange Language

The PCAST did not take into consideration (maybe they don't even know) that there is a universal exchange language for healthcare. It is HL7 V3. The CDA is merely one of virtually infinite structures that can be constructed from the RIM. The meta information as well as the clinical data is unambiguously represented by RIM.

There is no reason to ignore the thousands of man years that went into designing the RIM. The RIM Based Application Architecture (RIMBAA) work group at HL7 has had many demonstrations of RIM based applications. We don't need to reinvent the wheel. CDA is only one particular RIM structure designed for one particular use case.

Those of us who have been working at HL7 for years are blown away by the suggestion that there needs to be a different wheel invented.

*Mark <markw@illuminae.com>*
Sent by: public-semweb-lifesci-request@w3.org
12/14/2010 06:44 PM
To "Tim Clark" <twclark@nmr.mgh.harvard.edu>
cc public-semweb-lifesci@w3.org
Subject Re: An Universal Exchange Language

But seriously, Tim, if we were to pursue this problem, we would need some form of unambiguous identifier for "things"...
and given the distributed nature of the biomedical domain, we'd want to make sure that there was some way of resolving that identifier to obtain metadata about it from a variety of disparate sources who might have very different information - clinical, molecular, demographic, etc... hmmmm....
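
Two ideas from this thread lend themselves to a small illustration: michael's point about translating EHR records to triples in memory before querying them, and Peter's point about using subsumption to expand a diagnosis such as "any form of diabetes" into the set of codes it covers. What follows is only a rough sketch under stated assumptions. The record layout, the `hasDiagnosis` predicate, the function names and the concept codes are all hypothetical placeholders (they are not real SNOMED CT identifiers), and a plain Python set of tuples stands in for an RDF store and SPARQL engine.

```python
from collections import defaultdict

# Hypothetical is-a hierarchy (child -> parents). Placeholder codes only,
# not real SNOMED CT identifiers.
IS_A = {
    "dm-type-1": ["diabetes"],
    "dm-type-2": ["diabetes"],
    "diabetes": ["endocrine-disorder"],
    "heart-failure": ["heart-disease"],
}

# Hypothetical EHR rows, as they might come back from some backing store.
EHR_ROWS = [
    {"patient": "p1", "diagnosis": "dm-type-2"},
    {"patient": "p2", "diagnosis": "heart-failure"},
    {"patient": "p3", "diagnosis": "asthma"},
]


def to_triples(rows):
    """Translate rows into (subject, predicate, object) triples kept only in memory."""
    return {(row["patient"], "hasDiagnosis", row["diagnosis"]) for row in rows}


def descendants(code, is_a=IS_A):
    """Return the code plus every code it subsumes (transitive closure over is-a)."""
    children = defaultdict(set)
    for child, parents in is_a.items():
        for parent in parents:
            children[parent].add(child)
    seen, stack = {code}, [code]
    while stack:
        for child in children[stack.pop()]:
            if child not in seen:
                seen.add(child)
                stack.append(child)
    return seen


def patients_with(triples, concept):
    """Select patients whose diagnosis is the concept or anything it subsumes."""
    codes = descendants(concept)
    return sorted(s for (s, p, o) in triples if p == "hasDiagnosis" and o in codes)


triples = to_triples(EHR_ROWS)
print(patients_with(triples, "diabetes"))       # ['p1']
print(patients_with(triples, "heart-disease"))  # ['p2']
```

The source records are never modified here; the triples exist only for the duration of the query, which is the scenario michael describes. In a real system the pattern match in `patients_with` would presumably be a SPARQL query and the hierarchy would come from a terminology service rather than a hard-coded dictionary.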
Received on Friday, 17 December 2010 17:24:14 UTC