Attribution-ShareAlike 4.0 (CC BY-SA 4.0) https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Google Patents Public Data, provided by IFI CLAIMS Patent Services, is a worldwide bibliographic and US full-text dataset of patent publications. Patent information accessibility is critical for examining new patents, informing public policy decisions, managing corporate investment in intellectual property, and promoting future scientific innovation. The growing number of available patent data sources means researchers often spend more time downloading, parsing, loading, syncing and managing local databases than conducting analysis. With these new datasets, researchers and companies can access the data they need from multiple sources in one place, thus spending more time on analysis than data preparation.
The Google Patents Public Data dataset contains a collection of publicly accessible, connected database tables for empirical analysis of the international patent system.
Data Origin: https://bigquery.cloud.google.com/dataset/patents-public-data:patents
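As a rough illustration of querying these connected tables, the sketch below only builds a BigQuery Standard SQL string and does not run it. The table name `publications` and the columns `publication_number`, `filing_date` and `country_code` are assumptions not stated in the description above, and `build_publication_query` is a hypothetical helper. Check the dataset schema before relying on any of them.

```python
def build_publication_query(country_code: str, limit: int = 10) -> str:
    """Build a Standard SQL query string for the patents dataset.

    Assumption: the dataset exposes a `publications` table with
    `publication_number`, `filing_date` and `country_code` columns.
    """
    # Accept only two-letter codes so nothing unexpected is interpolated.
    if not (len(country_code) == 2 and country_code.isalpha()):
        raise ValueError("country_code must be a two-letter code")
    if limit < 1:
        raise ValueError("limit must be positive")
    return (
        "SELECT publication_number, filing_date\n"
        "FROM `patents-public-data.patents.publications`\n"
        f"WHERE country_code = '{country_code.upper()}'\n"
        f"LIMIT {int(limit)}"
    )


if __name__ == "__main__":
    print(build_publication_query("us", 5))
```

The resulting string could be pasted into the BigQuery console or handed to a client library. Running it requires a Google Cloud project and may incur query costs.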
For more info, see the documentation at https://developers.google.com/web/tools/chrome-user-experience-report/
“Google Patents Public Data” by IFI CLAIMS Patent Services and Google is licensed under a Creative Commons Attribution 4.0 International License.
Banner photo by Helloquence on Unsplash
The Bulk Search and Download API allows searching published patent applications (pre-grant publications, pgpubs) and issued patents (patent grants) across various fields from 2001 to present, and requesting a custom zip package for given patent application or patent IDs. Up to 100 document numbers can be downloaded at a time. Wait time varies with the number of requests being processed.
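Because a download request is capped at 100 document numbers, a longer list has to be split into batches before submission. The following is a minimal sketch of that batching step only. The helper name `batch_document_numbers` and the placeholder IDs are hypothetical, and no actual API endpoint or request format is assumed.

```python
from typing import Iterator, List, Sequence

# Limit taken from the description: up to 100 document numbers per request.
MAX_IDS_PER_REQUEST = 100


def batch_document_numbers(
    ids: Sequence[str], size: int = MAX_IDS_PER_REQUEST
) -> Iterator[List[str]]:
    """Yield consecutive slices of `ids`, each with at most `size` entries."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(ids), size):
        yield list(ids[start:start + size])


if __name__ == "__main__":
    # Placeholder identifiers, not real document numbers.
    placeholder_ids = [f"DOC{i:04d}" for i in range(250)]
    for batch in batch_document_numbers(placeholder_ids):
        print(len(batch), batch[0], batch[-1])
```

Each yielded batch would correspond to one package request. Since wait time depends on server load, a caller would likely submit batches sequentially rather than all at once.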
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
A patent is a set of exclusive rights granted to an inventor by a sovereign state for a solution, be it a product or a process, to a particular technological problem. The United States Patent and Trademark Office (USPTO) is the part of the U.S. Department of Commerce that grants patents to businesses and inventors for their inventions, in addition to registering products and identifying intellectual property. Each year, the USPTO grants over 150,000 patents to individuals and companies all over the world. As of December 2011, 8,743,423 patents have been issued and 16,020,302 applications have been received. The USPTO patents are accepted in electronic form and are filed as PDF documents. However, the indexing is not perfect and it is cumbersome to search through the PDF documents. Google has also made all the patents available for download in XML format, albeit only from the years 2002 to 2015. Thus, we converted this bulk of data (spanning 13 years) from XML to RDF to conform to the Linked Data principles.
https://dataverse.harvard.edu/api/datasets/:persistentId/versions/2.1/customlicense?persistentId=doi:10.7910/DVN/PG6THV
PatentCity is a dataset that provides information on each individual patent filed at the US patent office since 1836, at the UK patent office since 1894, at the French patent office since 1903 and at the German patent office (including East Germany) since 1877. Each entry is a patent publication along with standard information taken from patent offices (publication number, date of publication, technological classes…), which is enriched with additional details extracted from the text of the patents. These include the name of each patentee (assignee or inventor), their geocoded address and, when applicable, their occupation and citizenship. PatentCity can be used in a variety of disciplines (geography, economics, history of science…) and has been designed to be easily merged with existing geographical or technological sources. GitHub of the project: github.com/cverluise/patentcity Documentation: cverluise.github.io/patentcity Descriptive paper: www.longtermproductivity.com/perso/Patentcity_desc.pdf
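The XML-to-RDF conversion mentioned above can be pictured with a small sketch. This is not the authors' pipeline: the XML element names, the `example.org` base URIs and the helper `patent_to_ntriples` are all hypothetical, and the output is plain N-Triples built with the standard library only.

```python
import xml.etree.ElementTree as ET
from typing import List

# Hypothetical namespaces; a real conversion would use a published vocabulary.
BASE = "http://example.org/patent/"
VOCAB = "http://example.org/vocab#"

# Hypothetical, simplified record; real patent XML is far richer.
SAMPLE_XML = (
    "<patent><number>US0000000</number>"
    "<title>Example title</title><year>2010</year></patent>"
)


def escape_literal(value: str) -> str:
    """Escape backslashes, quotes and newlines for an N-Triples literal."""
    return value.replace("\\", "\\\\").replace('"', '\\"').replace("\n", "\\n")


def patent_to_ntriples(xml_text: str) -> List[str]:
    """Turn each child element of <patent> into one subject-predicate-literal triple."""
    root = ET.fromstring(xml_text)
    number = root.findtext("number")
    if not number:
        raise ValueError("record has no <number> element")
    subject = f"<{BASE}{number}>"
    triples = []
    for child in root:
        text = (child.text or "").strip()
        if child.tag == "number" or not text:
            continue  # the number becomes the subject URI, not a property
        triples.append(f'{subject} <{VOCAB}{child.tag}> "{escape_literal(text)}" .')
    return triples


if __name__ == "__main__":
    for line in patent_to_ntriples(SAMPLE_XML):
        print(line)
```

Giving every patent a dereferenceable URI and expressing its fields as triples is the core of the Linked Data idea. Linking those URIs to external resources would be a further step not shown here.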
Using a Bayesian supervised learning approach, we identify individual inventors from the U.S. utility patent database, from 1975 to the present. An interface to calculate and illustrate patent co-authorship networks and social network measures is also provided. The network representation does not require bounding the social network beforehand. We provide descriptive statistics of individual and collaborative variables and illustrate examples of networks for an individual, an organization, a technology, and a region. The paper provides an overview of the technical algorithms and pointers to the data, code, and documentation, with the hope of further open development by the research community. Go here for the NBER pdpass file: https://sites.google.com/site/patentdataproject/Home/downloads. It is old and has not been updated.
Published by the European Patent Office, PATSTAT Global provides information on patent applications and granted patents collected from national and regional patent offices worldwide. The dataset has been formatted to facilitate statistical analysis. It can be used to research when a patent application was filed, how the patent progressed through the process, if and when it was granted, who the inventors were, and the textual abstract of the patent itself. PATSTAT Global is extracted from the PATSTAT Online database. It is a snapshot of the source database at the time of extraction, which is the end of January for the spring edition and the end of July for the autumn edition. Files are in .csv format. More information is available on the PATSTAT website. DATA AVAILABLE FOR PERIOD: 1900-July 2018
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Patent pledges are voluntary public commitments that patent holders make to limit the enforcement or exploitation of their patent rights. Such pledges have been made for decades and appear in industries ranging from software to automotive to green tech to biotech. Originally compiled by Prof. Jorge L. Contreras (University of Utah) and now curated by the Center for Advanced Studies in Bioscience Innovation Law (CeBIL) at the University of Copenhagen, this dataset offers the most comprehensive public record of patent pledges to date. The database covers more than 300 pledges spanning software, telecommunications, green technology, automotive, biotechnology, medical devices, and AI.
Each record includes:
Pledgor name
Date of pledge (exact or best-estimate)
Excerpt of the pledge text
Patent families / technologies covered
Pledge type (e.g., non-assert, FRAND-style)
Source URL
Tracking number (an ID that matches the filename of the archived snapshot capturing the pledge as it appeared online)
Available for download:
Master spreadsheet of all pledge metadata
PDF, PNG snapshot or MP4 of each pledge at time of collection
The CeBIL research team, led by Dr. Gabriela Lenarczyk in close collaboration with Professor Timo Minssen (CeBIL Director), will update the dataset quarterly and welcomes community submissions of new pledges or errata. (Initial public release: V1, June 2025; subsequent versions will follow Dataverse semantic-versioning conventions.)
Patent Pledge literature:
Jorge L. Contreras, Patent Pledges as Portfolio Management Tools: Benefits, Obligations and Enforcement in A Modern Guide to Patenting. Challenges of Patenting in the 21st Century, Nicholas Thumm & Knut Blind (eds.), Edward Elgar (Jun. 2025)
Gabriela Lenarczyk, Mateo Aboy, OpenAI's Patent Pledge: A Post-Moderna Analysis, Journal of Intellectual Property Law & Practice 006 (2025)
Jorge L. Contreras, Voluntary Intellectual Property Pledges and COVID-19 in Intellectual Property, COVID-19 and the Next Pandemic, Haochen Sun & Madhavi Sunder (eds.), Cambridge University Press (Dec. 2024)
Gabriela Lenarczyk, Timo Minssen, Mateo Aboy, The nature, scope and validity of patent pledges, Journal of Intellectual Property Law & Practice 805, 19(11) (2024)
Gabriela Lenarczyk, Patent pledges na tle polskich instytucji prawnych, ze szczególnym uwzględnieniem licencji otwartej [‘Patent Pledges in the Context of Polish Legal Institutions, with Special Emphasis on Licences of Right’, book written in Polish], Publishing House of ILS PAS (2024)
Gaétan de Rassenfosse, Alfons Palangkaraya, Do Patent Pledges Accelerate Innovation?, Research Policy 52(5), (2023)
Jorge L. Contreras, No Take-Backs: Moderna’s Attempt to Renege on Its Vaccine Patent Pledge, Bill of Health blog, Aug. 29, 2022
Richard Li-dar Wang, Chung-Lun Shen, Tung-Che Wu & Wesley Wei-Wen Hsiao, A concise framework to facilitate open COVID pledge of non-disclosed technologies: In terms of non-disclosed patent applications and trade secrets, Journal of the Formosan Medical Association, 121(8), (Aug. 2022)
Jorge L. Contreras, The Open COVID Pledge: Design, Implementation and Preliminary Assessment of an Intellectual Property Commons, 2021 Utah L. Rev. 833 (2021)
Ginevra Assia Antonelli, Maria Isabella Leone, Riccardo Ricci, Exploring the Open COVID Pledge in the fight against COVID-19: a semantic analysis of the Manifesto, the pledgors and the featured patents, R&D Management Special Issue: Providing solutions in emergencies: R&D and innovation management during Covid-19, 52(2) (2022)
Jorge L. Contreras, Michael Eisen, Ariel Ganz, Mark Lemley, Jenny Molloy, Diane M. Peters, Frank Tietze, Pledging Intellectual Property for Covid-19, 38 Nature Biotechnology 1146 (2020)
Jorge L. Contreras, Deconstructing Moderna’s COVID-19 Patent Pledge, Bill of Health blog, Oct. 21, 2020
Jorge L. Contreras, Pledging Intellectual Property for Distributed Design in Viral Design – The COVID-19 Crisis as a Global Test Bed for Distributed Design (Distributed Design Platform, 2020)
Jonas F. Ehrnsperger & Frank Tietze, Motives for Patent Pledges: A Qualitative Study, CTM Working Paper Series, University of Cambridge (2019)
Jonas F. Ehrnsperger & Frank Tietze, IP Pledges, Open IP or Patent Pools? Developing Taxonomies in the Thicket of Terminologies, CTM Working Paper Series, University of Cambridge (2019)
Jorge L. Contreras, Bronwyn H. Hall & Christian Helmers, Pledging Patents for the Public Good: Rise and Fall of the Eco-Patent Commons, 57 Houston L. Rev. 61-109 (2019)
Jorge L. Contreras, The Evolving Patent Pledge Landscape, CIGI Papers No. 166, Apr. 3, 2018
Natacha Estèves, Open models for patents: Giving patents a new lease on life?, The Journal of World Intellectual Property, 21(1-2) (Mar. 2018)
Jorge L. Contreras & Meredith Jacob (eds.), Patent Pledges: Global Perspectives on Patent Law’s Private Ordering Frontier, Edward...
Comprehensive dataset of 8 Patent offices in Washington, United States as of June, 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Perfect for market research, lead generation, competitive analysis, and business intelligence. Download a complimentary sample to evaluate data quality and completeness.
Public Domain Mark 1.0 https://creativecommons.org/publicdomain/mark/1.0/
License information was derived automatically
Offers display/download of Patent Trial and Appeal Board (PTAB), formerly the Board of Patent Appeals and Interferences (BPAI), Precedential Opinions (PDF); offers display/download of PTAB Informative Opinions (PDF, sorted by most recent and alphabetically); and also offers search, display, and download of PTAB Final Decisions. Search requires one or a combination of the following: application number; patent number; appeal number; interference number; publication number; inventor name; decision date; issue date; publication date; start date; end date; search document text (free form); records per page (60 default; 15; 30; 45; or all). Additionally, there is a button to retrieve all BPAI decisions. Format is PDF. Unavailable during daily database backups from 01:00 - 05:00 AM U.S. Eastern Time. http://www.uspto.gov/ip/boards/bpai/decisions/prec/index.jsp http://www.uspto.gov/ip/boards/bpai/decisions/inform/index.jsp http://e-foia.uspto.gov/Foia/PTABReadingRoom.jsp
Comprehensive dataset of 7 Patent offices in New York, United States as of June, 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Perfect for market research, lead generation, competitive analysis, and business intelligence. Download a complimentary sample to evaluate data quality and completeness.
The Patent Trial and Appeal Board (PTAB) API is a RESTful API with an easy-to-use search interface. You can browse USPTO PTAB public documents, search for specific content, and request a bulk download of PTAB content. The PTAB API synchronizes in close to real time with the PTAB E2E (End-to-End) system, making the latest public America Invents Act (AIA) trial information and documents available. PTAB API v2 has text search capabilities for decision documents.
Comprehensive dataset of 8 Patent offices in Nevada, United States as of June, 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Perfect for market research, lead generation, competitive analysis, and business intelligence. Download a complimentary sample to evaluate data quality and completeness.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
USPTO-2M is a dataset downloaded from the United States Patent and Trademark Office. It contains 2 million records that have been cleaned and organized into JSON format. It can serve as a benchmark dataset for the patent classification task. Provided by Jie Hu from Guizhou University of Finance and Economics and Dr. Jianjun Hu at the University of South Carolina. Citation: Li, Shaobo, Jie Hu, Yuxin Cui, and Jianjun Hu. "DeepPatent: patent classification with convolutional neural networks and word embedding." Scientometrics 117 (2018): 721-744. A sample of our data: { "Subclass_labels": [ "A43B", "A41D", "A43C" ], "Abstract": "a decorative and or promotional accessory to be secured to a lace such as a shoe lace includes a molded plastic body having a passage longitudinally extending therethrough from a first opening to a second opening the passage is sized and shaped to receive the lace therethrough and to frictionally secure the body in a desired position along the lace the accessory also includes indicia provided on an exterior surface of the accessory which can be in the form of any desired message name number logo graphic or the like an alternative embodiment of the accessory is disclosed which is to be secured to a cap bill this embodiment includes a slot radially extending to the passage which is sized and shaped to receive the cap brim therein and to resiliently grip the bill and removably secure the accessory in a desired position along the bill", "Title": "accessory for shoe laces hat brims and the like", "No": "US08925116" }
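To show how a record of this shape might be prepared for multi-label classification, here is a minimal sketch. The field names come from the sample above; the abstract is shortened to its opening words for brevity, and the helpers `to_example` and `section_counts`, as well as the choice to concatenate title and abstract, are illustrative assumptions rather than the dataset authors' method.

```python
import json
from collections import Counter
from typing import Dict, List, Tuple

# Same fields as the sample record; the abstract is truncated for brevity.
SAMPLE = """
{
  "Subclass_labels": ["A43B", "A41D", "A43C"],
  "Abstract": "a decorative and or promotional accessory to be secured to a lace",
  "Title": "accessory for shoe laces hat brims and the like",
  "No": "US08925116"
}
"""


def to_example(record: Dict) -> Tuple[str, List[str]]:
    """Return (input text, sorted subclass labels) for one record."""
    text = f"{record['Title']}. {record['Abstract']}"
    return text, sorted(record["Subclass_labels"])


def section_counts(labels: List[str]) -> Counter:
    """Count labels by their first character, which is the section letter."""
    return Counter(label[0] for label in labels)


if __name__ == "__main__":
    text, labels = to_example(json.loads(SAMPLE))
    print(labels)
    print(section_counts(labels))
    print(text[:60])
```

A record can carry several subclass labels at once, so a classifier trained on this data would need a multi-label output rather than a single class per patent.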
Comprehensive dataset of 9 Patent offices in Michigan, United States as of July, 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Perfect for market research, lead generation, competitive analysis, and business intelligence. Download a complimentary sample to evaluate data quality and completeness.
Subset and preprocessed version of Chemical reactions from US patents (1976-Sep2016) by Daniel Lowe. It includes 50K randomly selected reactions that were later classified into 10 reaction classes by Nadine Schneider et al.
http://data.europa.eu/eli/dec/2011/833/oj
This dataset contains information about projects and their results funded by the European Union under the Horizon 2020 framework programme for research and innovation from 2014 to 2020.
The dataset is composed of six (6) different subsets (in different formats):
Reference data (programmes, topics, topic keywords, funding schemes (types of action), organisation types and countries) can be found in this dataset: https://data.europa.eu/euodp/en/data/dataset/cordisref-data
EuroSciVoc is available here: https://data.europa.eu/data/datasets/euroscivoc-the-european-science-vocabulary
CORDIS datasets are produced monthly. Therefore, inconsistencies may occur between what is presented on the CORDIS live website and the datasets.
CC0 1.0 Universal Public Domain Dedication https://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
Context: This dataset is provided for the U.S. Patent Phrase to Phrase Matching competition. It adds additional information by providing the meaning of each code in the context column.
For more info, check out the discussion thread, and take a look at this starter notebook to see how to incorporate the data into the competition.
Content: Preprocessing script here: https://www.kaggle.com/code/xhlulu/download-and-process-cpc
Licensing: This data can be found on the USPTO website, where you can find the copyright information:
Pursuant to federal law, most government-produced materials appearing on this website are not subject to copyright restrictions within the United States and are therefore in the public domain. Public domain information may be freely distributed and copied, but it is requested that in any subsequent use the United States Patent and Trademark Office (USPTO) be given appropriate acknowledgement (e.g., “Source: United States Patent and Trademark Office, www.uspto.gov”). The USPTO reserves the right to assert copyright protection internationally.
Acknowledgements: Photo by 2H Media on Unsplash
Original Data Source: Cooperative Patent Classification Codes Meaning
Comprehensive dataset of 7 Patent offices in Georgia, United States as of June, 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Perfect for market research, lead generation, competitive analysis, and business intelligence. Download a complimentary sample to evaluate data quality and completeness.
Comprehensive dataset of 8 Patent offices in Pennsylvania, United States as of June, 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Perfect for market research, lead generation, competitive analysis, and business intelligence. Download a complimentary sample to evaluate data quality and completeness.
Comprehensive dataset of 15 Patent offices in Israel as of June, 2025. Includes verified contact information (email, phone), geocoded addresses, customer ratings, reviews, business categories, and operational details. Perfect for market research, lead generation, competitive analysis, and business intelligence. Download a complimentary sample to evaluate data quality and completeness.