- From: Harry Halpin <hhalpin@ibiblio.org>
- Date: Wed, 07 Mar 2007 00:17:08 -0500
- To: Harry Halpin <hhalpin@ibiblio.org>
- Cc: public-cwm-talk@w3.org
Never mind - it's apparently bit rot with --pipe...

Harry Halpin wrote:
> Not sure which error this is on. I have a bunch of autogenerated RDF
> files I got from scraping various patent databases that I'm trying to
> merge into a large RDF file.
>
> This is the error:
>
> @@@@@@ toXML.py 382: (0, u'http://www.w3.org/2001/XMLSchema#string'
>
> This produces weird output:
> ..
> <pat:dateFiled rdf:datatype="(0,
> u'http://www.w3.org/2001/XMLSchema#dateTime')">1990-06-29</pat:dateFiled>
> ...
> Ouch, I don't want that (0,blahblah) in my output. I'm assuming somehow
> the problem is in my scraper, but those seem like valid URIs to me.
>
> Input:
> http://www.ibiblio.org/hhalpin/homepage/notes/5161193.rdf
> http://www.ibiblio.org/hhalpin/homepage/notes/4130865.rdf
> http://www.ibiblio.org/hhalpin/homepage/notes/4203154.rdf
>
> Output:
> http://www.ibiblio.org/hhalpin/homepage/notes/example.rdf
>
> Example Commandline:
> cwm --rdf 5161193.rdf 4130865.rdf 4203154.rdf --rdf --pipe > example.rdf
>
> Rapper does not seem to give me any errors on these files, but I'm not
> sure how to merge lots of files using rapper.
>
> Also, what's a good command line utility to merge lots of small RDF/XML
> files into very large ones? We have about 6,000 files, each containing
> an average of about 70 triples, that we want to make a big graph out
> of. I was planning on just putting cwm in my pipeline...
>
>
>

--
-harry

Harry Halpin, University of Edinburgh
http://www.ibiblio.org/hhalpin 6B522426
Received on Wednesday, 7 March 2007 05:17:23 UTC