Patent Assignment Dataset

The USPTO allows parties to record assignments of patents and patent applications in order to maintain, as far as possible, a complete history of claimed interests in a patent. The USPTO also permits recording of other documents that affect title (such as certificates of name change and mergers of businesses) or are relevant to patent ownership (such as licensing agreements, security interests, mortgages, and liens). The Patent Assignment Dataset contains detailed information on 6.8 million patent assignments and other transactions recorded at the USPTO since 1970, involving roughly 11.1 million patents and patent applications. It is derived from the transfer documents that parties record with the USPTO.

A document describing these data is available here: “The USPTO Patent Assignment Dataset: Descriptions and Analysis.” Users are requested to cite this documentation when using these data. It is also available as: Marco, Alan C., Graham, Stuart J.H., Myers, Amanda F., D'Agostino, Paul A., and Apple, Kirsten, The USPTO Patent Assignment Dataset: Descriptions and Analysis (July 27, 2015). Available at SSRN: http://ssrn.com/abstract=2636461.

The OCE developed these data files for public use and encourages users to identify fixes and improvements. Please provide all feedback to: EconomicsData@uspto.gov

Data Files

USPTO Patent Assignment Dataset Schema

Download full set of 2015 data files [.dta format (936 MB)] [.csv format (1.09 GB)]

Download individual data files:

File Name               2014* DTA   2014* CSV   2015** DTA   2015** CSV
assignment              138 MB      231 MB      146 MB       245 MB
assignor                128 MB      162 MB      137 MB       175 MB
assignee                131 MB      166 MB      139 MB       177 MB
documentid              289 MB      367 MB      314 MB       404 MB
assignment_conveyance   13.4 MB     17 MB       13.4 MB      18.2 MB
documentid_admin        104 MB      89.8 MB     186 MB       97.4 MB

Direct Download here: 2014, 2015.

Download full set of 2014 data files [.dta format (773 MB)] [.csv format (1.04 GB)]

These data comprise an aggregation of raw XML data found here.
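
Because the data are split across several files, most analyses need to link them on a shared record identifier. The following is a minimal sketch in Python of such a link between the assignor and assignee files. The column names rf_id, or_name, and ee_name are assumptions made for illustration and should be checked against the USPTO Patent Assignment Dataset Schema; the two sample strings are made-up rows standing in for assignor.csv and assignee.csv, not real data.

```python
import csv
import io
from collections import defaultdict


def group_by_key(rows, key):
    """Collect rows into lists keyed by the value of one column."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[row[key]].append(row)
    return grouped


def join_parties(assignor_rows, assignee_rows, key="rf_id"):
    """Pair each assignor with every assignee sharing the same key.

    This is an inner join: records missing from either side are dropped.
    The column names (rf_id, or_name, ee_name) are assumed, not confirmed.
    """
    assignees = group_by_key(assignee_rows, key)
    pairs = []
    for row in assignor_rows:
        for ee in assignees.get(row[key], []):
            pairs.append((row[key], row["or_name"], ee["ee_name"]))
    return pairs


# Made-up sample rows standing in for assignor.csv and assignee.csv.
assignor_csv = "rf_id,or_name\n1,ALPHA INC\n2,BETA LLC\n"
assignee_csv = "rf_id,ee_name\n1,GAMMA CORP\n1,DELTA BANK\n"

pairs = join_parties(
    csv.DictReader(io.StringIO(assignor_csv)),
    csv.DictReader(io.StringIO(assignee_csv)),
)
```

With the real files, the io.StringIO objects would be replaced by open file handles. At several hundred megabytes per file, grouping only the smaller side in memory and streaming the other, as done here, keeps memory use bounded by one file.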

* Note: the 2014 DTA files have been created using the Stata-13 format. Long string values are stored using the new strL variable type.

** Note: the 2015 DTA files have been created using the Stata-14 format. Long string values are stored using the new strL variable type.
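
Since the two vintages use different Stata formats, it can help to confirm which format a downloaded DTA file uses before opening it with software that may not support strL. The sketch below, in Python, reads the release number from the first bytes of a file. It relies on the published .dta layout, in which Stata 13 writes release 117 and Stata 14 writes release 118 inside an opening tag, while older formats store the release in the first byte. The function name dta_release is hypothetical.

```python
import re


def dta_release(header: bytes) -> int:
    """Return the .dta format release number from a file's leading bytes."""
    if not header:
        raise ValueError("empty header")
    # Releases 117 and later open with an XML-like tag carrying the number.
    match = re.match(rb"<stata_dta><header><release>(\d{3})</release>", header)
    if match:
        return int(match.group(1))
    # Older releases (for example 114 or 115) use the first byte instead.
    return header[0]


# Usage with a downloaded file (path is illustrative):
#     with open("assignment.dta", "rb") as fh:
#         release = dta_release(fh.read(64))
```

A result of 117 would be consistent with the 2014 files and 118 with the 2015 files, assuming the files were written as the notes above describe.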