Preview of OWL 2 ontology for clinical pharmacogenomics and decision support (now called 'Genomic CDS')

Dear all,

A preview release (version 0.8) of the Genomic CDS ontology is now available at 
http://www.genomic-cds.org/ont/genomic-cds.owl

There is also a demo version that contains some example patient data:
http://www.genomic-cds.org/ont/genomic-cds-demo.owl

Among other things, it contains logical representations of pharmacogenetic alleles and their defining SNPs, as well as logical representations of selected pharmacogenetic guidelines from the Clinical Pharmacogenetics Implementation Consortium and the Dutch Pharmacogenetics Working Group. The Genomic CDS ontology aims to unify several functionalities in a single resource, being:
* A knowledge base for clinical pharmacogenomics/pharmacogenetics that can be used for question-answering (e.g., which SNPs are associated with this drug?)
* A rule base for clinical decision support (e.g., inferring that a patient with a specific set of SNPs requires a lowered dose of warfarin and generating a CDS message that can be viewed by clinicians)
* A tool for checking data consistency (e.g., highlighting which allele definitions in PharmGKB are overlapping, or which clinical decision support rules are matching the same group of patients)

The ontology is OWL 2 DL serialized in Manchester Syntax. Protege 4 with the TrOWL reasoner plugin is recommended for working with the ontology. Please note that it the ontology is quite complex, so reasoning only works on decent hardware.

The use of OWL 2 (and Manchester Syntax) makes it possible to capture complex pharmacogenetic definitions and data in a relatively intuitive, yet ontology-based and machine-readable format. For example, the OWL description of a few SNPs of a patient looks like this:
-----------Individual: example_patient
Types:
	human,
	(has some rs10929302_A)  and (has some rs10929302_G),
	(has some rs1208_A)  and (has some rs1208_G),
	(has some rs12571421_A)  and (has some rs12571421_G),
	(has some rs12769205_A)  and (has some rs12769205_G),
	has exactly 2 rs10264272_C,
	has exactly 2 rs10276036_T,
	has exactly 2 rs1041983_C,
	has exactly 2 rs1042713_G,
-----------Equally, a logical description of SNPs associated with the CYP2C9*3 allele look like this...-----------Class: 'human with CYP2C9 *3'        EquivalentTo:         has some rs1057910_C        SubClassOf:         has some 'CYP2C9 *3',        (has some rs1057910_C)         and (has some rs1057911_A)         and (has some rs1799853_C)         and (has some rs2256871_A)         and (has some rs28371685_C)         and (has some rs28371686_C)         and (has some rs56165452_T)         and (has some rs57505750_C)         and (has some rs67807361_C)         and (has some rs72558184_G)         and (has some rs72558187_T)         and (has some rs72558188_AGAAATGGAA)         and (has some rs72558189_G)         and (has some rs72558190_C)         and (has some rs72558192_A)         and (has some rs72558193_A)         and (has some rs7900194_G)         and (has some rs9332130_A)         and (has some rs9332131_A)         and (has some rs9332239_C),-----------... and a logical description of the group of humans that might require a specific initial warfarin dose based on their genetic traits looks like this:-----------Class: 'human triggering CDS rule 7'    Annotations:         label "human triggering CDS rule 7",        'CDS message' "3-4 mg warfarin per day should be considered as a starting dose range for a patient with this genotype according to the Warfarin drug label (Bristol-Myers Squibb).",        'relevant for' <http://www.genomic-cds.org/ont/genomic-cds.owl#Warfarin>        EquivalentTo:         (has some 'CYP2C9 *1')         and (has some 'CYP2C9 *3')         and (has exactly 2 rs9923231_C)        SubClassOf:         'Human triggering CDS rule'-----------To make long-term development and maintenance easier, the ontology is generated from intuitive spreadsheets (so that independent domain experts can easily edit and validate the knowledge) and automated imports from primary datasets such as dbSNP.An issue tracker for ontology development is available athttp://code.google.com/p/genomic-cds/issues/listAlso, a website for the project is under construction at
http://www.genomic-cds.org/

The ontology also has a home on BioPortal at
http://bioportal.bioontology.org/ontologies/3179

Needless to say, all of this is just a partial research prototype. Not all required information is in the ontology, some information might be several months out of date, and accuracy and quality have not been validated. 
We can discuss the ontology in today's teleconference call (the first in the new Wednesday timeslot, an announcement will be sent out soon).

Cheers,
Matthias Samwald
http://samwald.info/

Received on Wednesday, 21 November 2012 15:20:12 UTC