https://www.usa.gov/government-workshttps://www.usa.gov/government-works
Contains four research datasets containing time series and micro-level data by National Bureau of Economic Research (NBER) technology sub-category on applications, grants, and in-force patents spanning two centuries of innovation.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
The file geoc_inv.txt contains identifiers for patent first filings (corresponding to appln_id in PATSTAT), latitude, longitude, city, region, and country of the inventor. Missing coordinates have been imputed from equivalents and other second filings or from information on the location of applicants. The file also contains a variable indicating the source of information ('source'): 1: information comes from the first filing itself 2: information comes from direct equivalent 3: information comes from other subsequent filings 4: information comes from the applicant’s location in first filings 5: information comes from the applicant’s location in the equivalent 6: information comes from the applicant’s location in other subsequent filings; the column 'coord_source' indicates the source of coordinates (whether they come from geolocalisation services, from geonames, or from PatentsView). It is possible to select certain types of first filings based on column 'type'. For example, Paris Convention priority filings can be retrieved by specifying type=priority. The file geoc_app.txt contains location information of applicants. Sources of information (first filings, equivalents, etc.) are thus browsed in reverse order. A detailed data description can be found in de Rassenfosse, Kozak, Seliger 2019: Geocoding of worldwide patent data, published in 'Scientific Data' and available at https://doi.org/10.1038/s41597-019-0264-6. Please note the following: The files geoc_inv_person.txt and geoc_app_person.txt contain person IDs for inventors and applicants, respectively, whenever the location information comes from PATSTAT. If not, the person_id is = 0. These files are not described in the paper. They have been made accessible to improve interoperability with PATSTAT data. Some files had to be zipped in order to upload them to Harvard Dataverse.
Contains detailed information on roughly 6 million patent assignments and other transactions recorded at the USPTO since 1970 and involving over 10 million patents and patent applications.
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
The USPTO grants US patents to inventors and assignees all over the world. For researchers in particular, PatentsView is intended to encourage the study and understanding of the intellectual property (IP) and innovation system; to serve as a fundamental function of the government in creating “public good” platforms in these data; and to eliminate redundant cleaning, converting and matching of these data by individual researchers, thus freeing up researcher time to do what they do best—study IP, innovation, and technological change.
PatentsView Data is a database that longitudinally links inventors, their organizations, locations, and overall patenting activity. The dataset uses data derived from USPTO bulk data files.
Fork this notebook to get started on accessing data in the BigQuery dataset using the BQhelper package to write SQL queries.
“PatentsView” by the USPTO, US Department of Agriculture (USDA), the Center for the Science of Science and Innovation Policy, New York University, the University of California at Berkeley, Twin Arch Technologies, and Periscopic, used under CC BY 4.0.
Data Origin: https://bigquery.cloud.google.com/dataset/patents-public-data:patentsview
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Google Patents Research Data contains the output of much of the data analysis work used in Google Patents (patents.google.com), including machine translations of titles and abstracts from Google Translate, embedding vectors, extracted top terms, similar documents, and forward references.
Contains Artificial Intelligence Patent Landscape data classifying 13,244,037 granted patents and PGPubs published from 1976 through 2021 in eight AI component technologies using state-of-the art machine learning based models.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This page provides the data resulting from linking assignees and assignors in the USPTO Patent Assignment Dataset to Compustat gvkeys. We work with a version of the USPTO PAD that was gracefully shared with us by Stuart Graham. Such version precedes by one year the first release available at the USPTO website (https://www.uspto.gov/ip-policy/economic-research/research-datasets/patent-assignment-dataset). The version that we use covers 5,534,135 transactions recorded at the USPTO between January 1970 and January 2013 (inclusive). While the first transaction date is January 1970, the number of transactions recorded in the initial years is negligible. Data coverage seems sufficient for the years 1981-2012.
If you use the code or data, please cite the following two papers:
Arque-Castells, P., and Spulber, D. (2022). Measuring the Private and Social Returns to R&D: Unintended Spillovers versus Technology Markets. Journal of Political Economy. https://doi.org/10.1086/719908
Arqué Castells, Pere and Spulber, Daniel F., Firm Matching in the Market for Technology: Business Stealing and Business Creation (September 17, 2021). Northwestern Law & Econ Research Paper No. 18-14, Available at SSRN: https://ssrn.com/abstract=3041558 or http://dx.doi.org/10.2139/ssrn.3041558
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
The Office of the Chief Economist (OCE) is responsible for advising the Under Secretary of Commerce for Intellectual Property and Director of the USPTO on the economic implications of policies and programs affecting the U.S. intellectual property (IP) system. The office disseminates detailed patent and trademark data, undertakes research, and conducts economic analysis on a variety of IP issues. OCE works with policy makers, collaborates with academics, and engages the public more generally through conferences it organizes, the publicly accessible research datasets it provides, and its publications.
The USPTO OCE Patent Assignment Dataset contains detailed data patent assignments and other transactions recorded at the USPTO since 1970.
"USPTO OCE Patent Assignment Data" by the USPTO, for public use. Marco, Alan C., Graham, Stuart J.H., Myers, Amanda F., D'Agostino, Paul A and Apple, Kirsten, "The USPTO Patent Assignment Dataset: Descriptions and Analysis" (July 27, 2015).
Data Origin: https://bigquery.cloud.google.com/dataset/patents-public-data:uspto_oce_assignment
Banner photo by Jeff Sheldon on Unsplash
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Google Patents Public Data, provided by IFI CLAIMS Patent Services, is a worldwide bibliographic and US full-text dataset of patent publications. Patent information accessibility is critical for examining new patents, informing public policy decisions, managing corporate investment in intellectual property, and promoting future scientific innovation. The growing number of available patent data sources means researchers often spend more time downloading, parsing, loading, syncing and managing local databases than conducting analysis. With these new datasets, researchers and companies can access the data they need from multiple sources in one place, thus spending more time on analysis than data preparation.
The Google Patents Public Data dataset contains a collection of publicly accessible, connected database tables for empirical analysis of the international patent system.
Data Origin: https://bigquery.cloud.google.com/dataset/patents-public-data:patents
For more info, see the documentation at https://developers.google.com/web/tools/chrome-user-experience-report/
“Google Patents Public Data” by IFI CLAIMS Patent Services and Google is licensed under a Creative Commons Attribution 4.0 International License.
Banner photo by Helloquence on Unsplash
USPTO Patent Examiner Data System (PEDS) API Data contains data from the examination process of USPTO patent applications. PEDS contains the bibliographic, published document and patent term extension data tabs in Public PAIR from 1981 to present. There is also some data dating back to 1935.
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
The Patent Claims Research Dataset contain detailed information on claims from U.S. patents granted between 1976 and 2014 and U.S. patent applications published between 2001 and 2014. The dataset is derived from the Patent Application Publication Full-Text and Patent Grant Full Text files, available at https://bulkdata.uspto.gov/, to which the Office of Chief Economist (OCE) applied a Python algorithm to identify individual claims as well as the dependency relationship between claims. From the parsed claims text, OCE created six data files containing individually-parsed claims, claim-level statistics, and document-level statistics, including newly-developed measures of patent scope.
USPTO OCE Patent Claims Research data contains detailed information on claims from U.S. patents granted between 1976 and 2014 and U.S. patent applications published between 2001 and 2014.
"USPTO OCE Patent Claims Research Data" by the USPTO, for public use. Marco, Alan C. and Sarnoff, Joshua D. and deGrazia, Charles, "Patent Claims and Patent Scope" (October 2016). USPTO Economic Working Paper 2016-04.
Data Origin: https://bigquery.cloud.google.com/dataset/patents-public-data:uspto_oce_claims
Banner photo by William Iven on Unsplash
Attribution-NonCommercial 1.0 (CC BY-NC 1.0)https://creativecommons.org/licenses/by-nc/1.0/
License information was derived automatically
This Zenodo page describes data collection, processing, and different open access data files related to the text of USPTO patent documents. The document "Data Description Zenodo.pdf" provides more details. If you use the code or data, please cite the following paper:
Arts S, Hou J, Gomez JC (2021). Natural language processing to identify the creation and impact of new technologies in patent text: Code, data, and new measures. Research Policy, 50(2), 104144. (https://doi.org/10.1016/j.respol.2020.104144)
Patent data is aggregated across multiple Intellectual Property (IP) registries, including USPTO, CIPO, EUIPO and WIPO (USA, Canada, Europe). Our complete dataset of active patent records is updated weekly. Customized reports available based on company lists, or full dataset via raw feed or one-off reports. Full bibliographic data provided for each IP record; including filing date, grant date, expiry date, inventor(s), IPC, full text abstract, title, etc. Ownership/entity relationship mapping, ticker mapping, ISIN mapping, Crunchbase uuid mapping, Crunchbase domain mapping. We also provide our proprietary IP Activity Score for each owner, which can assist to compare recent innovation activity amongst owners, as reflected in their Intellectual Property filings.
Ipqwery's Patent data is also available as a combined dataset with our Trademark dataset, enabling full IP profiles for corporate entities.
Patent Examination Data System lets customers retrieve and download multiple records of USPTO patent application or patent filing status at no cost. PEDS contains the bibliographic, published document and patent term extension data tabs in Public PAIR from 1981 to present. There is also some data dating back to 1935. Customers can download the entire dataset containing the data for all indexed documents. There are over 13 million records in PEDS. The data can be accessed by anyone using the web interface or the provided Application Programming Interface (API). PEDS is updated daily and mirrors the data available in the Patent Application Location and Monitoring (PALM) system. PEDS provides access to public applications including: published patent applications and patents, and PCT applications that have not been published by WIPO. Any applications that have not been released by the USPTO will not be available in PEDS.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
A patent is a set of exclusive rights granted to an inventor by a sovereign state for a solution, be it a product or a process, for a solution to a particular technological problem. The United States Patent and Trademark Office (USPTO) is part of the US department of Commerce that provides patents to businesses and inventors for their inventions in addition to registration of products and intellectual property identification. Each year, the USPTO grants over 150,000 patents to individuals and companies all over the world. As of December 2011, 8,743,423 patents have been issued and 16,020,302 applications have been received. The USPTO patents are accepted in electronic form and are filed as PDF documents. However, the indexing is not perfect and it is cumbersome to search through the PDF documents. Additionally, Google has also made all the patents available for download in XML format, albeit only from the years 2002 to 2015. Thus, we converted this bulk of data (spanning 13 years) from XML to RDF to conform to the Linked Data principles.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
assistive
https://fred.stlouisfed.org/legal/#copyright-public-domainhttps://fred.stlouisfed.org/legal/#copyright-public-domain
Graph and download economic data for U.S. Granted Patents: Design Patents Originating in the District of Columbia (PATENTUSDCDESIGN) from 1992 to 2020 about patent granted, DC, intellectual property, origination, patents, and USA.
USPTO OCE Patent Claims Research data contains detailed information on claims from U.S. patents granted between 1976 and 2014 and U.S. patent applications published between 2001 and 2014.
Contains the full text of each patent application (non-provisional utility and plant) published weekly (Thursdays) from March 15, 2001 to present (excludes images/drawings). Subset of the Patent Application Full Text Data with Embedded TIFF Images.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This database links innovation data to Compustat firms. When using the data, please cite "Knowledge Spillovers and Corporate Investment in Scientific Research" (Arora, Belenzon and Sheer), NBER WP 23187. A special thanks and appreciation go to Bernardo Dionisi , Honggi Lee, Dror Shvadron and JK Suh for their diligent work and dedication to this effort over the past several years.
This project introduces major data extension and improvement to the historical NBER patent dataset, which should be valuable for all researchers working with patent data linked to firms. In updating the data to match between Compustat and patents to 2015, we address two major challenges: name changes and ownership changes. These challenges are central to how patents are assigned to firms over time. To be consistent over the sample period, we reconstruct the complete historical data covered in the NBER data files.
About 30% of the Compustat firms in our sample change their name at least once. Accounting for name changes improves the accuracy and scope of matches to patents (and other assets), ownership structure, and dynamic reassignments of GVKEY codes to companies. Dynamic reassignment means that, for instance, if a sample firm merges with another firm, the patents of the merged firm are included in the stock of patents linked to the Compustat record from that point onward, but not before.
For ownership and subsidiary data we rely on a wide range of M&A data, including SDC, historical snapshots of ORBIS files for 2002-2015, 10-K SEC filings, and NBER2006 as well as perform extensive manual checks that help us uncover firms’ structure and ownership changes before proceeding to the patent match. Thus, we have extended and improved the NBER patent data. In the enclosed "Data Appendix", we document our data construction work, present several examples (“case studies”), and outline the improvements we made to existing NBER historical patent data.
https://www.usa.gov/government-workshttps://www.usa.gov/government-works
Contains four research datasets containing time series and micro-level data by National Bureau of Economic Research (NBER) technology sub-category on applications, grants, and in-force patents spanning two centuries of innovation.