
[BioRDF] Easily digestible Entrez Gene records

From: Alan Ruttenberg <alanruttenberg@gmail.com>
Date: Mon, 1 May 2006 11:26:49 -0400
Message-Id: <A0523036-BA21-4C9E-99B9-61E81DD411C2@gmail.com>
Cc: Eric Miller <em@w3.org>
To: public-semweb-lifesci@w3.org

There was some interest in working with a simpler form of Entrez data
for some of our efforts. I'm on the way to releasing the code, but in
the interim, if people need some easily digestible Entrez Gene data,
you can download it from the Nuts and Bolts wiki page:

http://tinyurl.com/l8shw

This is a tab-delimited file of Entrez Gene records for human, mouse
and rat. It is generated by code that parses the ASN.1 format
release files and pulls out just the fields I was interested in for
the project I work on. The fields are listed below. Where a field can
have multiple values, as in SYNONYMS or REFSEQ-PROTEIN, the values are
separated by "|".

ID	
STATUS	
NAME	
TYPE	
SPECIES	
LOCUSLINK	
CURRENT-ID	
CURRENT-LOCUSLINK	
REFSEQ-MRNA	
REFSEQ-PROTEIN	
OMIM	
UNIGENE	
GO	
CHROMOSOME	
STRAND	
START	
END	
SYNONYMS	
SUMMARY
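
For anyone who wants to read the file without writing a parser from scratch, here is a minimal Python sketch. It is not the code that generated the file. It assumes the columns appear in the order listed above, that a first row starting with "ID" (if there is one) is a header, and that only the two fields named above need splitting on "|". The function name parse_records and its multi_valued parameter are made up for illustration; pass extra field names if other columns turn out to be multi-valued.

```python
# Column names in the order given in the message. That the file's columns
# follow this same order is an assumption.
FIELDS = [
    "ID", "STATUS", "NAME", "TYPE", "SPECIES", "LOCUSLINK", "CURRENT-ID",
    "CURRENT-LOCUSLINK", "REFSEQ-MRNA", "REFSEQ-PROTEIN", "OMIM", "UNIGENE",
    "GO", "CHROMOSOME", "STRAND", "START", "END", "SYNONYMS", "SUMMARY",
]

# Only these two fields are named as multi-valued in the message.
DEFAULT_MULTI_VALUED = ("SYNONYMS", "REFSEQ-PROTEIN")


def parse_records(lines, multi_valued=DEFAULT_MULTI_VALUED):
    """Yield one dict per record from an iterable of tab-delimited lines."""
    for line in lines:
        line = line.rstrip("\r\n")
        if not line:
            continue  # skip blank lines
        values = line.split("\t")
        if values[0] == "ID":
            continue  # assumed header row, if the file has one
        # Pad short rows so every field name gets a value.
        values += [""] * (len(FIELDS) - len(values))
        record = dict(zip(FIELDS, values))
        for name in multi_valued:
            raw = record[name]
            # An empty field becomes an empty list rather than [""].
            record[name] = raw.split("|") if raw else []
        yield record
```

To use it, open the downloaded file in text mode and iterate, for example `for rec in parse_records(open(path, encoding="utf-8")): ...`, where path is wherever you saved the file. The encoding is a guess as well.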

-Alan
Received on Monday, 1 May 2006 15:26:56 GMT
