The USPTO Linked Patent Dataset release

From: Mofeed <mounir@informatik.uni-leipzig.de>
Date: Thu, 16 Feb 2017 10:30:06 +0100
To: aksw@lists.informatik.uni-leipzig.de, sda@lists.iai.uni-bonn.de, public-lod@w3.org, semantic-web@w3.org
Message-ID: <00b34301-5ac4-8640-84c5-f0a3dcb18d43@informatik.uni-leipzig.de>
Dear everyone,

We are happy to announce the release of the USPTO Linked Patent Dataset.

Patents are widely used to protect intellectual property and as a measure 
of innovation output. Each year, the USPTO grants over 150,000 patents to 
individuals and companies all over the world. In fact, more than 200,000 
patent grants were issued in the US in 2013. However, accessing, searching 
and analyzing those patents is often still cumbersome and inefficient.

Our dataset is the result of converting USPTO XML patent data from the 
years 2002-2016 into RDF. This supports integration with other data 
sources in order to further simplify use cases such as trend analysis, 
structured patent search and exploration, and the measurement of societal 
progress.


The USPTO Linked Patent Dataset contains 13,014,651 entities, of which 
2,355,579 are patents. The other entities represent applicants, 
inventors, agents, examiners (primary and secondary) and assignees. In 
total, the dataset comprises approximately 168 million triples describing 
the patent information.

The complete description of the dataset and the SPARQL endpoint are 
available on the DataHub: 
https://datahub.io/dataset/linked-uspto-patent-data.

We really appreciate feedback and are open to collaborations.

*If you happen to have use cases utilizing this dataset, please contact us.*

Kind regards,

Mofeed Hassan, Amrapali Zaveri, Jens Lehmann
Received on Tuesday, 28 February 2017 22:25:07 UTC
