- From: Eric Prud'hommeaux <eric@w3.org>
- Date: Fri, 30 Mar 2012 09:05:13 -0400
- To: "public-semweb-lifesci@w3.org" <public-semweb-lifesci@w3.org>
- Cc: "Mead, Charlie (NIH/NCI) [C]" <meadch@mail.nih.gov>, "cecil.o.lynch@accenture.com" <cecil.o.lynch@accenture.com>
Note below the follow-up on Fri 30 March 11AM EST on #hcls.
We'll be digging more deeply into the SemWeb mechansims used to
classify and dispatch incident (in this case, tests or diagnoses of
TB) reports.
Conference Details:
Date of Call: Friday, March 30, 2012
Time of Call: 11:00 am Eastern Time, 3 pm UK, 4 pm CET
Dial-In #: +1.617.761.6200 (Cambridge, MA)
VoIP address: sip:zakim@voip.w3.org
Participant Access Code: 4257 ("@@HCLS")
IRC Channel: irc.w3.org<http://irc.w3.org> port 6665 channel HCLS (see W3C IRC page for
details, or see Web IRC), Quick Start: Use
http://www.mibbit.com/chat/?server=irc.w3.org:6665&channel=%23hcls
Duration: ~1 hour
Convener: Charlie Mead
Scribe: TBD
* Eric Prud'hommeaux <eric@w3.org> [2012-03-27 17:59-0400]
> http://www.w3.org/2012/03/27-HCLS-minutes
>
> HCLS Health Care Domain
>
> 27 Mar 2012
>
> See also: [2]IRC log
>
> Attendees
>
> Chair
> Charlie Mead
>
> Scribe
> ericP
>
> Special Guest Star
> Cecil Lynch
>
> Follow-up Meeting
> Fri 30 March 11AM EST on #hcls
>
> Contents
>
> * [3]OWL in CDC's Tuberculosis Surveillance/Response
> __________________________________________________________________
>
> [slide 3]
>
> <Joanne_Luciano> slides aren't numbered :-(
>
> <Joanne_Luciano> ah, but the browser numbers them!
>
> <egombocz> If you look at them not in show mode, you can see the
> numbers on the side thumbnails
>
> Cecil: antibiotic-resistent airline passenger promted review on
> Tuberculosis Information Management System (TIMS)
> ... reporting a TB case required passing a brittle set of messaging and
> business rules
>
> [slide 4: Message Processing Integration]
>
> Joanne_Luciano: each state wanted their own standard?
>
> Cecil: CDC wanted a standard
> ... states would take anything which makes reporting easier
> ... [re: slide 4]
> ... choices about how to import messages to CDC
> ... .. after message had some processing
> ... .. as a Web Service RPC
>
> [slide 5: Deployment Architecture]
>
> Cecil: going with existing CDC infrastructure
> ... staring from left:
> ... .. some source, usually state or large counties (53 jurisdictions)
> reports
>
> <Joanne_Luciano> is going with the CDC one of those three options on
> slide 4 or is it another one (not listed on slide 4)?
>
> Cecil: .. goes into data messaging broker, which validates syntax
>
> <Joanne_Luciano> looks like it's option 1 on slide 4
>
> Cecil: .. if a valid TB message, off to content validation queue
> ... .. also split into components for e.g. line listing of incoming
> cases
> ... .. after validation, email with contents of alert sent to CDC's TB
> group
>
> Joanne_Luciano: this is slide 3 option 1?
>
> Cecil: this is slide option 3 (RPC)
> ... we had tried driving real-time alerting from biosense
> ... we took messages off the first transport, never queued in DMB
> [slide 5 left]
> ... the HL7 2.x standard is fairly loose
> ... flexible, can take any payload
> ... can be structured in any way
> ... segments are well-defined, but segment structure requires point to
> point negotiation
> ... p2p neg is a guideline
>
> charlie: HL7 2.x is a syntactic standard and a semantics guideline
>
> [slide 6: Message Content Validation Architecture]
>
> <Joanne_Luciano> JMS?
>
> Cecil: after leaving broker, falls into JMS interface
> ... because this has the 2.5 validation, we don't need the 2.x
> syntactic validation
> ... so we don't do the validation
> ... before we went live, we validated and found 2 errors in HL7
> messaging
> ... (was a benefit of 2-tier validation)
> ... once live, we don't do syntacit validation
> ... but we do parse out components
> ... questions like birthday and date of problem were found via OBX
> extractions
> ... an OWL ontology tells us how to process a message
> ... the ontology links all the knowledge
> ... it guides parsing the message by aligning the OBX-extracted facts
> with an RDF graph
> ... we can then use the JESS reasoner for evaluating these facts
> ... JESS (Java Expert System Shell) is a rules FW/BW chaining rules
> engine
> ... has a protege plugin, interprets SWRL
> ... good commercial tool for high-volume processing
> ... paid for by tax dollars, only free for government use
> ... $75K otherwise
>
> <Stuart> Drools
>
> <iker> DROOLS
>
> <mr_sticky> Drools is from JBoss
>
> <mr_sticky> [4]http://www.jboss.org/drools
>
> Cecil: we tried Drools, which has FW/BW chaining and similar fact
> structure
> ... use JESS if you're processing millions of facts
>
> Joanne_Luciano: and Jena?
>
> Cecil: no experience with it
> ... at OTR, we pass what we expect to see and what we got as two graphs
> ... the choreography of the OTR framework works out that something is a
> question about an e.g. resistance pattern of anitbiotic
> ... we have a set of "listeners" (patterns)
> ... we built this on V3 semantics, but mapped back to V2 syntax
> ... once we've matched the graph against the patterns, we pass it to
> jess
> ... we give jess the profile for an e.g. normal patient, MDR (multi
> drug resistant) patient, XDR (extensive drug resistant) (potential
> super-spreader)
> ... the reasoning framework decides if an event needs action
> ... another listener strains through alerts from JESS for outbound
> messaging
> ... we also use the output for visualization
> ... folks don't need to need to use SAS to extract this data from
> mid-tier, instead just using graph representations
> ... with agreement from CDC, we could have sent output messages back to
> reporters
> ... output:
> ... .. drug resistant
> ... .. appropriateness of drugging (per WHO codes)
> ... .. predictive analysis of whether someone is likely to fall off
> treatment based on patient history
>
> [slide 7: Types of problems that could be solved by extending the TB
> framework]
>
> Cecil: had to bend to time and budget limitations
> ... we could have added a d2rq interface to retrofit the pre-existing
> data
> ... a lot we could have done
>
> [slide 8: The use of an OWL ontology]
>
> Cecil
>
> [slide 9: HL7 Message Artifact Taxonomy]
>
> Cecil: this is how we mapped the OBX structure to the ontology
>
> [slide 11: Rule Processing]
>
> [slide 12: Message Content Validation Rule Implementation]
>
> Cecil: this demonstrates the advantage of using OWL
> ... the blue is what we deleted
> ... (from TIMS)
> ... went from 358 to 175
> ... reduces frustration of reporters facing conflicting rules
> ... beyond OWL being able to do syntax, vocabulary, rule processing, we
> see the advantage of declarative rules
>
> [slde 13: Message Content Validation Rules]
>
> Cecil: with tons of volume and response time requirements, you need a
> more efficient bw-chaining system (JESS)
>
> [slide 14: Message Content Validation Results View]
>
> Cecil: sample output
>
> [slide 15: Processing Results]
>
> Cecil: average processing time 3.5s round trip
> ... far faster than a human, and more accurate
> ... scales up to ~350k messages/day
> ... ~300K TB messages/year
> ... could scale to influenza
> ... at worst case (4 month window), 50-75M, so ~ 200K message/day
> ... in a surveillance, you're also looking at folks who don't have it
> ... feeds from 800 VA hospitals, + laps a quest and labcore, ...
> ... congress says we need response in 2 mins
> ... had to put everything in memory
> ... biosense lost funding
>
> mscottm: summary of SemWeb advantages is very different from our usual
> tech demos in HCLS
> ... what are your SemWeb wins?
> ... what could be improved?
>
> charlie: would like formal continuation
> ... to help us find focal points in HCLS
>
> Cecil: SemWeb is a flexible way to extract knowledge
> ... we were given a TB messaging system and a deadline
> ... 7 days before deadline, CDC said we'd like to upgrade a 1.2 of our
> implementation guideline
> ... had around 35 new rules and 100 terminology changes
> ... because everything CDC gave us was in the OWL. expected to do it in
> 4 days
> ... made it on 4 days with no additional charge to CDC
> ... big commercial motivation is the flexibility at responding to
> rapidly changing knowledge
> ... at NCI, i wanted to build an EMR system
> ... NCO SHARP projects kind of get to this
> ... win 1: rapid software engineering
> ... win 2: rule validation
> ... win 3: can infer things that a human has problems inspecting
>
> <mscottm> Nice to hear that experience in the field confirms my main
> sales pitch about advantage of SemWeb tech for software: easier
> maintenance and change, agile development, effectively lower cost.
>
> Cecil: .. (large systems (e.g. BRIDG's UML) hard to swap into a brain)
> __________________________________________________________________
>
>
> Minutes formatted by David Booth's [5]scribe.perl version 1.136
> ([6]CVS log)
> $Date: 2012/03/27 21:57:08 $
>
> References
>
> 1. http://www.w3.org/
> 2. http://www.w3.org/2012/03/27-HCLS-irc
> 3. http://www.w3.org/2012/03/CSTE_TB.ppt
> 4. http://www.jboss.org/drools
> 5. http://dev.w3.org/cvsweb/~checkout~/2002/scribe/scribedoc.htm
> 6. http://dev.w3.org/cvsweb/2002/scribe/
>
> --
> -ericP
--
-ericP
Received on Friday, 30 March 2012 13:05:47 UTC