Copyright © 2006-2007 W3C® (MIT, ERCIM, Keio), All Rights Reserved. W3C liability, trademark and document use rules apply.
CDISC's Study Data Tabulation Model (SDTM) is used to define the study components in terms of domains and observations for a given clinical trial study. However, the ability to use it for sets of biomarkers that serve to define surrogate endpoints and/or evidence of mechanism is not currently not possible / or not well described. We intend to propose an augmentation for the SDTM model using RDF-OWL that will support the inclusion of biomarker data and genotyping from subjects, associated with known mechanisms and endpoint descriptors.
This HCLSIG task force focuses on the topic of “applying semantics to R&D Informatics efforts in support Drug Safety and Efficacy” within clinial trials, as well as post-market surveillance. We also intend to demonstrate how Semantic Web standards can be applied to issues related to these in the near-term. Specifically, the task force focuses on the following areas for scenarios and activities: Identify/address challenges and needs regarding Biomarkers and Pharmacogenomics in coordination with FDA guidelines Semantic applications around Drug Safety: Signals and Notification Possible applications of Semantic Web in Clinical Trial planning, management, analysis, and reporting (e.g., EDC and EHR Single-Source, data security, integrity) Facilitating electronic submissions as per the Common Technical Document (eCTD) specifications, http://www.fda.gov/cder/guidance/7087rev.htm ) X Use Cases document to illustrate, in detail, the techniques XX provides for associating documents with appropriate instructions for extracting any embedded data.
This section describes the status of this document at the time of its publication. Other documents may supersede this document. A list of current W3C publications and the latest revision of this technical report can be found in the W3C technical reports index at http://www.w3.org/TR/.
This document is a Interets Group Note, developed by the Healthcare and Life Sciences Interest Group.
As of the publication of this Interest Group Note the HCLS Interest Group has completed work on this document. Changes from the previous Working Draft are indicated in a log of changes. Comments on this document may be sent to public-semweb-lifesci@w3.org. Further discussion on this material may be sent to the Semantic Web Interest Group mailing list, semantic-web@w3.org (also with public archive).
Publication as a Working Group Note does not imply endorsement by the W3C Membership. This is a draft document and may be updated, replaced or obsoleted by other documents at any time. It is inappropriate to cite this document as other than work in progress.
This document was produced by a group operating under the 5 February 2004 W3C Patent Policy. W3C maintains a public list of any patent disclosures made in connection with the deliverables of the group; that page also includes instructions for disclosing a patent. An individual who has actual knowledge of a patent which the individual believes contains Essential Claim(s) must disclose the information in accordance with section 6 of the W3C Patent Policy.
This HCLSIG task force (DSE) focuses on the topic of “applying semantics to R&D Informatics efforts in support Drug Safety and Efficacy” within clinical trials, as well as post-market surveillance. We also intend to demonstrate how Semantic Web standards can be applied to issues related to these in the near-term. Specifically, the task force focuses on the following areas for scenarios and activities [HCLS].
Digital data from both Non-clinical (animal) and Clinical Studies needs to be organized according to the following areas:
The objectives focused on a few key items related to the SDTM model and possible extensions to it: Develop and document Scenarios for some of the above identified areas Identify and validate some initial Best Practices for handling safety and efficacy information through semantics, which incorporate current vocabulary conventions Create one or more public Semantic Web-based Demonstrations (see Clinical Trial Demo) Coordination and collaboration with relevant organizations, possibly CDISC, ICH, HL7-RCRIM, EMEA, FDA, NCI-caBIG
The requirement to convert ODM/XML to RDF may not be approach the problem by addressing SDTM elements; data + metadata , codelists and definitions embedded in one study, instead use references to metadata and defs.
The use of information to improve the development of Efficacious and Safe Drugs rests on the proper and timely utilization of diverse information sets, and the adoption and compliance of well-defined policies. As information becomes more diverse and policies more central to the pharmaceutical industry, the development of information systems that are better suited to handle multiple information types (data and ontologies) while complying with defined policies (rules and actions) will become essential. Semantic Web technology standards offer potential solutions for: Aggregating Study Datasets, around Biomarkers (and following eCTD guidelines) Enhancing management of non-clinical and clinical controlled vocabularies that will be certainly expanding and evolving (adaptability) Providing fast access to current safety information though semantic-enabled channels (Pharmacovigilance) Applying Rules, Integrity, and Security in support of policy compliance and management (HIPAA, CFR21Part11 and Sarbanes-Oxley)
The Study Data Tabulation Model (SDTM) is used to define the study components in terms of domains and observations for a given clinical trial study. However, the ability to use it for sets of biomarkers that serve to define surrogate endpoints and/or evidence ofd mechanism is not currently possible. We intend to propose an augmented SDTM model using RDF-OWL that will support the inclusion of biomarker data from subjects, associated with known mechanisms and endpoint descriptors.
The following examples are work in progress (collaborative whiteboard) of how to define and organize clinical data ala the SDTM model using an RDF approach. N3 is being used here to make editing and comprehension easier. Some basic syntactical rules are reviewed here:
@prefix cdisc: <http://www.cdisc.org/sdtm/vocab> .
@prefix dse: <http://www.w3.org/2001/sw/hcls/dse> .
@prefix nci: <http://nci.nih.gov/cadsr/vocabulary> .
@prefix nist: <http://nist.gov/units> .
@prefix time <http://www.w3.org/2006/time> .
// Sex Text Code: 'MALE', 'FEMALE', 'UNKNOWN', 'Intersex'
<http://clinic.com/study/T2271>
a cdisc:Study ;
cdisc:subject <http://clinic.com/study/T2271/subject/S83221> ;
cdisc:subject <http://clinic.com/study/T2271/subject/S74343> ;
... .
<http://clinic.com/study/T2271/subject/S83221>
a cdisc:Subject ;
nci:sex_code nci:Female ;
// here I assume cdisc:Diastolic_BP is a subproperty of cdisc:VSTest --
cdisc:observation <http://clinic.com/study/T2271/subject/S83221/observation/O6622> ;
cdisc:observation <http://clinic.com/study/T2271/subject/S83221/observation/O6561> ;
... .
<http://clinic.com/study/T2271/subject/S83221/observation/O6622>
a cdisc:Diastolic_BP ;
cdisc:obs_context cdisc:patient_lying ;
cdisc:obs_value "98" ;
cdisc:obs_units nist:mmHg .
<http://clinic.com/study/T2271/subject/S83221/observation/O6561>
a cdisc:Pulse ;
cdisc:obs_context cdisc:patient_lying ;
cdisc:obs_value "64";
cdisc:obs_units nist:bpm .
Bundling Diastolic_BP, Systolic_BP, and Pulse together under one observation could be done in the following way...
<http://clinic.com/study/T2271/subject/S83221/observation/O6622>
a cdisc:Vital_sign ; // cdisc:Vital_sign is a subclass of cdisc:Observation
cdisc:obs_context [ cdisc:position cdisc:patient_lying ; cdisc:note cdisc:patient_fainted ; cdisc:patient_status cdisc:non_critical . ] ;
cdisc:diastolic [ a cdisc:Diastolic_BP ;
cdisc:obs_value "98" ;
cdisc:obs_units nist:mmHg .
] ;
cdisc:systolic [ a cdisc:Systolic_BP ;
cdisc:obs_value "152" ;
cdisc:obs_units nist:mmHg .
] ;
cdisc:pulse [ a cdisc:Pulse ;
cdisc:obs_value "64";
cdisc:obs_units nist:bpm .
] .
Example Based on simulated Clinical Data from Stephen Dobson
<http://clinic.com/study/T2271/subject/4183542663506>
a cdisc:Subject ;
nci:sex_code nci:Female ;
cdisc:treatment <http://clinic.com/study/T2271/subject/4183542663506/observation/O2241> ;
cdisc:vitalSigns <http://clinic.com/study/T2271/subject/4183542663506/observation/O6561> ;
cdisc:adverseEvent <http://clinic.com/study/T2271/subject/4183542663506/observation/O6622> ;
// ROUTE DRGGROUP DOSE pid treatment tpfday tptday
// IV B 7 MG 4183542663506 7mg then 14mg SEMWEB 6/11/84 7/11/84
<http://clinic.com/study/T2271/subject/S83221/observation/O2241 >
a cdisc:Treatment ; // cdisc:Treatment is a subclass of cdisc:Observation
cdisc:design_arm <http://clinic.com/study/T2271/treated_B/double_dose> ;
dse:route cdisc:IV_route ;
dse:drug_group "B" ;
cdisc:dose "7" ;
cdisc:dose_units nist:mg ;
cdisc:treatment "7mg then 14mg SEMWEB" ;
cdisc:first_date "6/11/84" ;
cdisc:term_date "7/11/84" ;
... // How best to define Treatments and Experimental Design ? using cdisc:design_arm to link back to design graph?
// VTLTEXT VTLRES VISIT_ID pid collday related
// Standing Diastolic BP (mmHg) 75 BASELINE 4183542663506 6/11/84 1
<http://clinic.com/study/T2271/subject/S83221/observation/O6561 >
a cdisc:Vital_sign ; // cdisc:Vital_sign is a subclass of cdisc:Observation
cdisc:visit_id cdisc:BASELINE ;
cdisc:visit_date "6/11/84" ;
dse:obs_context [ cdisc:position cdisc:patient_standing . ] ;
cdisc:diastolic [ a cdisc:StandingDiastolic_BP ;
dse:vtltext "Standing Diastolic BP (mmHg)" ;
dse:related_measure "1" ;
dsecdisc:obs_value "75" ;
dse:obs_units nist:mmHg .
] .
// pid AEFDAY AETDAY AESEV AESEVT AESER AESERT PREFTEXT BODYTEXT
// 4183542663506 6 9 2 MODERATE 2 NO ABDOMEN ENLARGED BODY AS A WHOLE
<http://clinic.com/study/T2271/subject/S83221/observation/O6622 >
a cdisc:Adverse_Event ; // cdisc:Adverse_Event is a subclass of cdisc:Observation
cdisc:visit_id cdisc:BASELINE ;
time:first_date "6" ;
time:term_date "9" ;
time:duration_days "2" ;
dse:severity AE:MODERATE ;
dse:rating "2" ;
dse:RT "NO" ;
dse:prefText "NO ABDOMEN ENLARGED" ;
dse:bodyText "BODY AS A WHOLE" ;
dse:obs_context [ cdisc:position cdisc:patient_standing . ] .
Below, in the related resources section, two examples are attached on how the current XML output looks like from CDISC usage of NCI caDSR for the so called SDTM Controlled Terminologies. These inlude the permissable values as strings to be incorporated in SDTM datasets, e.g:
This would enable the publication of observation types ontologies. As a formal descriptions of the required patient and measurement context such as these for the measurement of blood pressure:
Such observation types ontologies could also be the place to describe the different classifications such as this type of observation 'is-a-Metabolic_Marker', according to nciOncology and 'is-a-Finding', according to SDTM general classes.
This demo illustrates how clinical trials data formatted as RDF can be visualized. It takes advantage of MIT's SIMILE's Exhibit technology, and shows how easy it is to merge and visualize aggregated data (graphs) through the Web. The data was obtained from four separate SDTM generated Excel documents, that were converted and merged using SIMILE's BABEL utility.
CDISC's SDTM model can be mapped into an RDF based model, provided that some key data entities (e.g., Subjects, Observations, Interventions) are specified using URIs. There are clear advantages of using such a model, since additional links and metadata can easily be extended to any study set.
The more complex issues we have not yet addressed include an appropriate mapping of terminology codes and strings into OWL or SKOS defined URI entities.
HCLSIG/Drug Safety and Efficacy/SDTM Notes Draft 1.0 (last edited 2007-11-05 08:40:14 by 212)This concludes the SDTM Note. Further DSE discussions can be found at http://esw.w3.org/topic/HCLSIG/Drug_Safety_and_Efficacy.
The editor would like to thank the following Working Group members for authoring this document:
This document is a product of the Drug Safety and Efficacy Task Force of the HCLS Interest Group.
$Log: Note_DSE_20071108.html,v $ Revision 1.1 2007/11/05 04:33:51 eneumann2 Note_DSE_20071108.html Revision 1.0 2007/11/05 18:39:11 eneumann