RE: Unstructured vs. Structured (was: HL7 and patient records in RDF/OWL?)

Matthew Cockerill wrote:
> I couldn't agree more.
> 
> Spreadsheets (and equivalently, CSV files) are a large fraction of
> the 'additional datafiles' that BioMed Central receives from authors.
> 
> What would be great would be to be able to define some simple
> standards and/or templates which authors could follow in their
> spreadsheets, to allow the automatic recognition of key life science
> identifiers, and quantitative attributes,  and so the generation of
> RDF. 

This would be great; to an extent though it's already happening. There
are many attempts to data standards for different areas of biology. As 
Sean points out LSID's are good for identifiers. 

Have you ever looked at the additional data files for BMC, and asked 
what kind of data is generally in them. Is it all microarray? Is it
all different kinds? 

Phil

Received on Wednesday, 15 February 2006 12:30:15 UTC