100+ datasets found
  1. Google Patents Public Data

    • kaggle.com
    zip
    Updated Sep 19, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Google BigQuery (2018). Google Patents Public Data [Dataset]. https://www.kaggle.com/datasets/bigquery/patents
    Explore at:
    zip(0 bytes)Available download formats
    Dataset updated
    Sep 19, 2018
    Dataset provided by
    Googlehttp://google.com/
    BigQueryhttps://cloud.google.com/bigquery
    Authors
    Google BigQuery
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Fork this notebook to get started on accessing data in the BigQuery dataset by writing SQL queries using the BQhelper module.

    Context

    Google Patents Public Data, provided by IFI CLAIMS Patent Services, is a worldwide bibliographic and US full-text dataset of patent publications. Patent information accessibility is critical for examining new patents, informing public policy decisions, managing corporate investment in intellectual property, and promoting future scientific innovation. The growing number of available patent data sources means researchers often spend more time downloading, parsing, loading, syncing and managing local databases than conducting analysis. With these new datasets, researchers and companies can access the data they need from multiple sources in one place, thus spending more time on analysis than data preparation.

    Content

    The Google Patents Public Data dataset contains a collection of publicly accessible, connected database tables for empirical analysis of the international patent system.

    Acknowledgements

    Data Origin: https://bigquery.cloud.google.com/dataset/patents-public-data:patents

    For more info, see the documentation at https://developers.google.com/web/tools/chrome-user-experience-report/

    “Google Patents Public Data” by IFI CLAIMS Patent Services and Google is licensed under a Creative Commons Attribution 4.0 International License.

    Banner photo by Helloquence on Unsplash

  2. Patent Data

    • kaggle.com
    Updated Apr 11, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dushyant Rathore (2022). Patent Data [Dataset]. https://www.kaggle.com/datasets/dushyantrathore/patent-data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 11, 2022
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Dushyant Rathore
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    The dataset contains the details of Patent Litigation Cases in the United States from 2000 to 2021. The team collected the litigation data in two phases. The first phase looked at data from 2010, specifically within Texas's Western and Eastern Districts. Unified Patent's Portal includes litigation data that each plaintiff has been marked as NPE (Patent Assertion Entity), NPE (Small Company), or NPE (Individual).

    Using the definitions, Unified first focused on identifying what NPEs were aggregators and then if they involved third-party financing. NPE aggregators were defined as NPEs with more than one affiliated subsidiary bringing patent litigation. An example of this would be IP Edge and the various limited liability companies underneath IP Edge's control that have brought numerous litigations against operating companies. Third-party financing was defined as evidence of any third party with a financial interest other than the assertors.

    With a narrow focus on the Western and Eastern District of Texas, Unified then used several public databases, such as Edgar, USPTO Assignment Records, the NPE Stanford Database, press releases, and its database of NPEs to identify any aggregator and any third-party financial interest, as well as various secretary of state corporate filings or court-ordered disclosures. After these two districts were identified, Unified expanded the data to cover the top five most litigious venues for patents, including the Western and Eastern Districts of Texas, Delaware, and the North and Central Districts of California. (On average, over the past five years, these districts have seen about 70% of all patent litigation.) Once that was completed, that dataset was then expanded to include all jurisdictions from 2010 and on.

    The final step was to complete the data set from 2000 to 2009. The team followed a similar data collection process using Lex Machina, the NPE Stanford Database, and Unified's Portal. Unified identified all of the litigation known to be NPE-related. Using the top five jurisdictions' aggregation and financing data, aggregator entities—such as Intellectual Ventures—were identified using the same methodology. The current dataset covers 2000-2021, determines who is an NPE, notes which NPEs are aggregators, and identifies which aggregators are known to have third-party financing.

    Note: there are currently no reporting requirements Federally, at the state level, or in the courts to publicly disclose the financing details of nonpublic entities. Thus, any data analysis of which litigations are funded or financed is incomplete, as many of these arrangements are closely held, private, and unknown even to the courts and the parties to the actions. This data set describes the minimum known amount of third-party-funded patent litigation. It is necessarily underinclusive of all nonpublic deals for which there is no available evidence or insight. For further generalized industry information on the size and scope of litigation funding for patent litigations, private sources often report on the size and scope of the burgeoning industry in the aggregate. For example, see Westfleet Advisor's 2021 Litigation Finance Report, available at https://www.westfleetadvisors.com/publications/2021-litigation-finance-report/.

  3. IPO: patent data

    • gov.uk
    Updated Aug 17, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Intellectual Property Office (2018). IPO: patent data [Dataset]. https://www.gov.uk/government/publications/ipo-patent-data
    Explore at:
    Dataset updated
    Aug 17, 2018
    Dataset provided by
    GOV.UKhttp://gov.uk/
    Authors
    Intellectual Property Office
    Description

    This data is already available to the public in weekly XML snapshots via our online register but has been compiled into a snapshot to make it easier for statistical research to be undertaken.

    The attached documents describe the data.

    Please note: The GB patent data file may not open with older spreadsheet software.

  4. d

    Historical Patent Data Files for Academia and Researchers (1840 - 2014)

    • catalog.data.gov
    • uspto.data.commerce.gov
    Updated Sep 30, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Office of the Chief Economist (OCE) (2025). Historical Patent Data Files for Academia and Researchers (1840 - 2014) [Dataset]. https://catalog.data.gov/dataset/historical-patent-data-files-for-academia-and-researchers-1840-2014
    Explore at:
    Dataset updated
    Sep 30, 2025
    Dataset provided by
    Office of the Chief Economist (OCE)
    Description

    Contains four research datasets containing time series and micro-level data by National Bureau of Economic Research (NBER) technology sub-category on applications, grants, and in-force patents spanning two centuries of innovation.

  5. Link Compustat – USPTO Patent Assignment Dataset

    • zenodo.org
    • data.niaid.nih.gov
    zip
    Updated Jul 17, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pere Arque-Castells; Daniel F. Spulber; Pere Arque-Castells; Daniel F. Spulber (2024). Link Compustat – USPTO Patent Assignment Dataset [Dataset]. http://doi.org/10.5281/zenodo.6352358
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jul 17, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Pere Arque-Castells; Daniel F. Spulber; Pere Arque-Castells; Daniel F. Spulber
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This page provides the data resulting from linking assignees and assignors in the USPTO Patent Assignment Dataset to Compustat gvkeys. We work with a version of the USPTO PAD that was gracefully shared with us by Stuart Graham. Such version precedes by one year the first release available at the USPTO website (https://www.uspto.gov/ip-policy/economic-research/research-datasets/patent-assignment-dataset). The version that we use covers 5,534,135 transactions recorded at the USPTO between January 1970 and January 2013 (inclusive). While the first transaction date is January 1970, the number of transactions recorded in the initial years is negligible. Data coverage seems sufficient for the years 1981-2012.

    If you use the code or data, please cite the following two papers:

    Arque-Castells, P., and Spulber, D. (2022). Measuring the Private and Social Returns to R&D: Unintended Spillovers versus Technology Markets. Journal of Political Economy. https://doi.org/10.1086/719908

    Arqué Castells, Pere and Spulber, Daniel F., Firm Matching in the Market for Technology: Business Stealing and Business Creation (September 17, 2021). Northwestern Law & Econ Research Paper No. 18-14, Available at SSRN: https://ssrn.com/abstract=3041558 or http://dx.doi.org/10.2139/ssrn.3041558

  6. d

    Patent Assignment Data for Academia and Researchers (1970 - 2023)

    • catalog.data.gov
    Updated Sep 30, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Office of the Chief Economist (OCE) (2025). Patent Assignment Data for Academia and Researchers (1970 - 2023) [Dataset]. https://catalog.data.gov/dataset/patent-assignment-data-for-academia-and-researchers-1970-2021
    Explore at:
    Dataset updated
    Sep 30, 2025
    Dataset provided by
    Office of the Chief Economist (OCE)
    Description

    Contains detailed information on 10.0 million patent assignments and other transactions recorded at the USPTO since 1970 and involving roughly 17.8 million patents and patent applications. It is derived from the recording of patent transfers by parties with the USPTO.

  7. PatentsView Data

    • kaggle.com
    zip
    Updated Feb 12, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Google BigQuery (2019). PatentsView Data [Dataset]. https://www.kaggle.com/datasets/bigquery/patentsview
    Explore at:
    zip(0 bytes)Available download formats
    Dataset updated
    Feb 12, 2019
    Dataset provided by
    BigQueryhttps://cloud.google.com/bigquery
    Authors
    Google BigQuery
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Context

    The USPTO grants US patents to inventors and assignees all over the world. For researchers in particular, PatentsView is intended to encourage the study and understanding of the intellectual property (IP) and innovation system; to serve as a fundamental function of the government in creating “public good” platforms in these data; and to eliminate redundant cleaning, converting and matching of these data by individual researchers, thus freeing up researcher time to do what they do best—study IP, innovation, and technological change.

    Content

    PatentsView Data is a database that longitudinally links inventors, their organizations, locations, and overall patenting activity. The dataset uses data derived from USPTO bulk data files.

    Fork this notebook to get started on accessing data in the BigQuery dataset using the BQhelper package to write SQL queries.

    Acknowledgements

    “PatentsView” by the USPTO, US Department of Agriculture (USDA), the Center for the Science of Science and Innovation Policy, New York University, the University of California at Berkeley, Twin Arch Technologies, and Periscopic, used under CC BY 4.0.

    Data Origin: https://bigquery.cloud.google.com/dataset/patents-public-data:patentsview

    Banner photo by rawpixel on Unsplash

  8. DISCERN 2: Duke Innovation & SCientific Enterprises Research Network

    • zenodo.org
    Updated Aug 30, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ashish Arora; Ashish Arora; Sharon Belenzon; Sharon Belenzon; Larisa Cioaca; Larisa Cioaca; Lia Sheer; Lia Sheer; Hyun Moh (John) Shin; Hyun Moh (John) Shin; Dror Shvadron; Dror Shvadron (2024). DISCERN 2: Duke Innovation & SCientific Enterprises Research Network [Dataset]. http://doi.org/10.5281/zenodo.13619821
    Explore at:
    Dataset updated
    Aug 30, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Ashish Arora; Ashish Arora; Sharon Belenzon; Sharon Belenzon; Larisa Cioaca; Larisa Cioaca; Lia Sheer; Lia Sheer; Hyun Moh (John) Shin; Hyun Moh (John) Shin; Dror Shvadron; Dror Shvadron
    License

    https://cdla.dev/open-use-of-data-agreement-v1-0https://cdla.dev/open-use-of-data-agreement-v1-0

    Description

    The DISCERN dataset was developed to support academic research on corporate innovation by linking data on U.S. publicly listed firms from Standard & Poor’s Compustat database to their patents and scientific publications. A key feature of DISCERN is its comprehensive coverage of firms’ subsidiaries and their ownership changes over time, which is crucial for accurately mapping corporate innovation. Patents and publications may be assigned to various legal entities within a firm’s organizational structure. Subsidiaries may change ownership in M&A events. By accounting for these ownership linkages over time, DISCERN enables researchers to construct more precise measures of firms’ knowledge production and examine the factors influencing their R&D investment decisions.

    Version 2.0 incorporates several key improvements over the previous version of DISCERN. First, we shift to using the PatentsView database as the main source of patent data and OpenAlex as the main source of scientific publication data. PatentsView is publicly available and continuously maintained directly by the United States Patents & Trademarks Office (USPTO). OpenAlex is currently the only open data source of scientific publication metadata. Using freely available data sources allows us to share both the patent and the publication datasets openly. This enhances data access, which was previously limited due to the use of propriety data. Second, the updated dataset now covers the period from 1980 to 2021, providing an additional six years of data. Third, we transition to using Securities and Exchange Commission (SEC) filings as the primary source of subsidiary data, allowing us to trace ownership linkages further back to the mid-1990s and ensuring a higher degree of reliability compared to the Orbis data used in the original version, which was less reliable and had comprehensive coverage only from 2008. Finally, by transitioning to PatentsView and additional data sourced from the USPTO, we expand the scope of the dataset to include pre-grant patent applications and patent re-assignment information. This addition allows users to study patent applications regardless of grant status and to observe ownership transitions beyond those related to mergers and acquisitions.

    A special thanks and appreciation go to Sanskriti Purohit and Ron Rabi for their diligent work and dedication to this effort.

    The dataset is freely available under the O-UDA-1.0 License, permitting unrestricted use for research and commercial purposes. We request that users provide proper citations when utilizing the dataset. The license also allows for the creation of derivative datasets based on DISCERN, with the condition that creators ask their downstream users to cite the original authors appropriately.

    If you use the data, please add these citations:

    1. Arora, A., Belenzon, S., Cioaca, L., Sheer, L, Shin, H.M. & Shvadron, D. (2024). DISCERN 2.0: Duke Innovation & SCientific Enterprises Research Network [Dataset]. In Zenodo (CERN European Organization for Nuclear Research). https://doi.org/10.5281/zenodo.3594642

    2. Arora, A., Belenzon, S., Cioaca, L., Sheer, L, & Shvadron, D. (2024). Back to the Future: Are Big Firms Regaining their Scientific and Technological Dominance? Evidence from DISCERN 2.0 (available soon)

  9. H

    PatentCity: a dataset to study the location of patents since the 19th...

    • dataverse.harvard.edu
    • search.dataone.org
    Updated Apr 17, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Antonin Bergeaud; Verluise Cyril (2022). PatentCity: a dataset to study the location of patents since the 19th century [Dataset]. http://doi.org/10.7910/DVN/PG6THV
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 17, 2022
    Dataset provided by
    Harvard Dataverse
    Authors
    Antonin Bergeaud; Verluise Cyril
    License

    https://dataverse.harvard.edu/api/datasets/:persistentId/versions/2.1/customlicense?persistentId=doi:10.7910/DVN/PG6THVhttps://dataverse.harvard.edu/api/datasets/:persistentId/versions/2.1/customlicense?persistentId=doi:10.7910/DVN/PG6THV

    Description

    PatentCity is a dataset that provides information on each individual patents filed in the US patent office since 1836, on the UK patent office since 1894, on the French patent office since 1903 and on the German patent office (including East Germany) since 1877. Each entry is a patent publication along with standard information taken from patent offices (publication number, date of publication, technological classes…) which are enriched with additional details processed from the text of the patents. This includes: the name of each patentee (assignees or inventors), its geocoded address and when applicable its occupation and citizenship. PatentCity can be used in a variety of disciplines, geography, economics, history of science… and has been designed to be easily merged with existing geographical or technological sources. Github of the project: github.com/cverluise/patentcity Documentation: cverluise.github.io/patentcity Descriptive paper: www.longtermproductivity.com/perso/Patentcity_desc.pdf

  10. d

    The Artificial Intelligence Patent Dataset (AIPD) for Academia and...

    • catalog.data.gov
    Updated Sep 30, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Office of the Chief Economist (OCE) (2025). The Artificial Intelligence Patent Dataset (AIPD) for Academia and Researchers [Dataset]. https://catalog.data.gov/dataset/the-artificial-intelligence-patent-dataset-aipd-for-academia-and-researchers
    Explore at:
    Dataset updated
    Sep 30, 2025
    Dataset provided by
    Office of the Chief Economist (OCE)
    Description

    Contains Artificial Intelligence Patent Landscape data classifying 13,244,037 granted patents and PGPubs published from 1976 through 2023 in eight AI component technologies using state-of-the art machine learning based models.

  11. F

    U.S. Granted Patents: Design Patents Originating in the District of Columbia...

    • fred.stlouisfed.org
    json
    Updated May 21, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2021). U.S. Granted Patents: Design Patents Originating in the District of Columbia [Dataset]. https://fred.stlouisfed.org/series/PATENTUSDCDESIGN
    Explore at:
    jsonAvailable download formats
    Dataset updated
    May 21, 2021
    License

    https://fred.stlouisfed.org/legal/#copyright-public-domainhttps://fred.stlouisfed.org/legal/#copyright-public-domain

    Area covered
    Washington
    Description

    Graph and download economic data for U.S. Granted Patents: Design Patents Originating in the District of Columbia (PATENTUSDCDESIGN) from 1992 to 2020 about patent granted, patents, intellectual property, origination, DC, and USA.

  12. EP full-text data for text analytics

    • data.europa.eu
    csv
    Updated Jul 1, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    European Patent Office (2022). EP full-text data for text analytics [Dataset]. https://data.europa.eu/88u/dataset/https-www-epo-org-searching-for-patents-data-bulk-data-sets-text-analytics-dataset
    Explore at:
    csvAvailable download formats
    Dataset updated
    Jul 1, 2022
    Dataset authored and provided by
    European Patent Officehttp://www.epo.org/
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    A bulk data set consisting of XML-tagged titles, abstracts, descriptions, claims and search reports of European Patent (EP) publications, designed to facilitate natural language processing work.

  13. C

    Commercial Patent Database Report

    • datainsightsmarket.com
    doc, pdf, ppt
    Updated Jul 12, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Data Insights Market (2025). Commercial Patent Database Report [Dataset]. https://www.datainsightsmarket.com/reports/commercial-patent-database-1413170
    Explore at:
    doc, ppt, pdfAvailable download formats
    Dataset updated
    Jul 12, 2025
    Dataset authored and provided by
    Data Insights Market
    License

    https://www.datainsightsmarket.com/privacy-policyhttps://www.datainsightsmarket.com/privacy-policy

    Time period covered
    2025 - 2033
    Area covered
    Global
    Variables measured
    Market Size
    Description

    The global commercial patent database market is experiencing robust growth, driven by the increasing need for intellectual property (IP) management and competitive intelligence among businesses. The market's expansion is fueled by several key factors, including rising R&D investments across various industries (pharmaceuticals, technology, etc.), a surge in patent filings worldwide, and the growing adoption of sophisticated analytical tools for patent data mining. This necessitates comprehensive and user-friendly databases that offer advanced search functionalities, allowing businesses to identify opportunities, track competitors, and protect their own innovations effectively. The market's segmentation reflects the diverse needs of users, encompassing solutions tailored to specific industries and IP management tasks. Leading players are continuously innovating, integrating AI and machine learning capabilities to enhance search precision and data analysis, creating more efficient and insightful platforms. The competitive landscape is characterized by a mix of established players and emerging technology companies, each striving for differentiation through superior user experience, data quality, and analytical features. We estimate the market size to be approximately $2.5 billion in 2025, growing at a compound annual growth rate (CAGR) of 12% between 2025 and 2033. This strong growth is projected to continue throughout the forecast period, primarily due to the ongoing digital transformation across sectors and the increasing reliance on data-driven decision-making. However, challenges remain, including the high cost of access to premium database features and the complex nature of patent data, requiring specialized expertise to interpret effectively. The market will see continued consolidation, with larger players acquiring smaller companies to expand their market reach and product offerings. Furthermore, the focus on user experience and the development of more intuitive interfaces will be critical to broaden the appeal of these databases to a wider range of users, from IP professionals to business strategists. Geographic expansion, particularly in emerging economies with growing R&D activities, will also be a key driver of market growth in the coming years.

  14. d

    The Government Patent Register: Data and Documentation

    • search.dataone.org
    Updated Sep 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gross, Daniel; Sampat, Bhaven (2024). The Government Patent Register: Data and Documentation [Dataset]. http://doi.org/10.7910/DVN/BDCCN9
    Explore at:
    Dataset updated
    Sep 25, 2024
    Dataset provided by
    Harvard Dataverse
    Authors
    Gross, Daniel; Sampat, Bhaven
    Description

    This repository provides the Government Patent Register, a dataset of U.S. government interest patents sourced from administrative records, accompanying the following article: Gross, Daniel P. and Bhaven N. Sampat. 2024. “The Government Patent Register: A new resource for measuring U.S. government-funded patenting." NBER working paper no. 32136. Please sign up here if you'd like to be notified of future updates: https://forms.gle/3fuqUW7jPyLxySLF8.

  15. d

    Patent Application Full Text (2001 - Present)

    • catalog.data.gov
    • s.cnmilf.com
    Updated Sep 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Patents (2025). Patent Application Full Text (2001 - Present) [Dataset]. https://catalog.data.gov/dataset/patent-application-publication-full-text-2001-present-6be21
    Explore at:
    Dataset updated
    Sep 30, 2025
    Dataset provided by
    Patents
    Description

    Contains the full text of each patent application (non-provisional utility and plant) published weekly (Thursdays) from March 15, 2001 to present (excludes images/drawings). Subset of the Patent Application Full Text Data with Embedded TIFF Images.

  16. Patent data

    • figshare.com
    application/x-rar
    Updated Oct 19, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Haodong Yang (2025). Patent data [Dataset]. http://doi.org/10.6084/m9.figshare.30393856.v1
    Explore at:
    application/x-rarAvailable download formats
    Dataset updated
    Oct 19, 2025
    Dataset provided by
    Figsharehttp://figshare.com/
    figshare
    Authors
    Haodong Yang
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Patent information of listed companies

  17. n

    Historical Patent Data Files for Academia and Researchers - Dataset - CKAN

    • nationaldataplatform.org
    Updated Jul 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Historical Patent Data Files for Academia and Researchers - Dataset - CKAN [Dataset]. https://nationaldataplatform.org/catalog/dataset/historical-patent-data-files-for-academia-and-researchers
    Explore at:
    Dataset updated
    Jul 13, 2025
    Description

    Contains four research datasets containing time series and micro-level data by National Bureau of Economic Research (NBER) technology sub-category on applications, grants, and in-force patents spanning two centuries of innovation. For more information: https://www.uspto.gov/learning-and-resources/ip-policy/economic-research/research-datasets

  18. d

    Global Active Patent Data | B2B Intellectual Property Dataset | +10M Records...

    • datarade.ai
    .json, .csv
    Updated Jul 10, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    IPqwery (2024). Global Active Patent Data | B2B Intellectual Property Dataset | +10M Records | 20 Year Historical Data [Dataset]. https://datarade.ai/data-products/ipqwery-global-active-patent-dataset-b2b-intellectual-proper-ipqwery
    Explore at:
    .json, .csvAvailable download formats
    Dataset updated
    Jul 10, 2024
    Dataset authored and provided by
    IPqwery
    Area covered
    Belgium, Portugal, Belarus, Slovenia, Ireland, Hungary, Sweden, Norway, Spain, Italy
    Description

    Patent data is aggregated across multiple Intellectual Property (IP) registries, including USPTO, CIPO, EUIPO and WIPO (USA, Canada, Europe). Our complete dataset of active patent records is updated weekly. Customized reports available based on company lists, or full dataset via raw feed or one-off reports. Full bibliographic data provided for each IP record; including filing date, grant date, expiry date, inventor(s), IPC, full text abstract, title, etc. Ownership/entity relationship mapping, ticker mapping, ISIN mapping, Crunchbase uuid mapping, Crunchbase domain mapping. We also provide our proprietary IP Activity Score for each owner, which can assist to compare recent innovation activity amongst owners, as reflected in their Intellectual Property filings.

    Ipqwery's Patent data is also available as a combined dataset with our Trademark dataset, enabling full IP profiles for corporate entities.

  19. USPTO OCE Patent Claims Research Data

    • kaggle.com
    zip
    Updated Feb 12, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Google BigQuery (2019). USPTO OCE Patent Claims Research Data [Dataset]. https://www.kaggle.com/datasets/bigquery/uspto-oce-claims
    Explore at:
    zip(0 bytes)Available download formats
    Dataset updated
    Feb 12, 2019
    Dataset provided by
    BigQueryhttps://cloud.google.com/bigquery
    Authors
    Google BigQuery
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Fork this notebook to get started on accessing data in the BigQuery dataset by writing SQL queries using the BQhelper module.

    Context

    The Patent Claims Research Dataset contain detailed information on claims from U.S. patents granted between 1976 and 2014 and U.S. patent applications published between 2001 and 2014. The dataset is derived from the Patent Application Publication Full-Text and Patent Grant Full Text files, available at https://bulkdata.uspto.gov/, to which the Office of Chief Economist (OCE) applied a Python algorithm to identify individual claims as well as the dependency relationship between claims. From the parsed claims text, OCE created six data files containing individually-parsed claims, claim-level statistics, and document-level statistics, including newly-developed measures of patent scope.

    Content

    USPTO OCE Patent Claims Research data contains detailed information on claims from U.S. patents granted between 1976 and 2014 and U.S. patent applications published between 2001 and 2014.

    Acknowledgements

    "USPTO OCE Patent Claims Research Data" by the USPTO, for public use. Marco, Alan C. and Sarnoff, Joshua D. and deGrazia, Charles, "Patent Claims and Patent Scope" (October 2016). USPTO Economic Working Paper 2016-04.

    Data Origin: https://bigquery.cloud.google.com/dataset/patents-public-data:uspto_oce_claims

    Banner photo by William Iven on Unsplash

  20. d

    Moat Innovation Data | Market, Product, and Patent Data | 21K+ Public...

    • datarade.ai
    Updated Jul 12, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Moat (2025). Moat Innovation Data | Market, Product, and Patent Data | 21K+ Public Companies | Company-Aligned, Time-Aware [Dataset]. https://datarade.ai/data-products/moat-innovation-data-patent-product-litigation-and-marke-moat
    Explore at:
    .json, .csv, .jsonl, .parquetAvailable download formats
    Dataset updated
    Jul 12, 2025
    Dataset authored and provided by
    Moat
    Area covered
    State of, Montserrat, Palau, Uruguay, New Caledonia, Greenland, Suriname, Madagascar, Bonaire, Dominican Republic
    Description

    Innovation is the engine of long-term growth.

    Moat provides structured patent data rolled up to an ultimate parent and mapped to ticker symbols. Patent ownership is time aware of asset transfers and corporate hierarchy changes. Data is mapped to actual markets and products (not a CPC schema). Patent data can also be combined with market, risk, and/or product data to quantify company and sector specific innovation behavior and trends.

    Dataset creates queryable relationships among products, technologies, patents, entities, investment, risk, talent, and value.

    Datasets can be used for such things as: - Enterprise Valuation - Validate or ascertain enterprise value through intangible asset aligned enterprise values. - Patent Valuation - Estimate of the dollar value of the cost to rebuild a patent portfolio - IP Risk and Litigation - Quantifies risks to each patent and patent portfolio through strength, validity, and litigation metrics. - Innovation Tracking and Analysis - Maps financial, product, and risk data to patents to facilitate comparative analysis and to reveal demonstrated innovation behavior. - Patent Lifecycle and Expiration - Data that estimates the lifecycle and expirations of technology areas and products protected by complex patent strategies.

    Patent data is time-aware and 20 years of historical data is available. Data is updated daily. In depth usage examples can be provided on request.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Google BigQuery (2018). Google Patents Public Data [Dataset]. https://www.kaggle.com/datasets/bigquery/patents
Organization logoOrganization logo

Google Patents Public Data

Worldwide bibliographic and US patent publications (BigQuery)

Explore at:
182 scholarly articles cite this dataset (View in Google Scholar)
zip(0 bytes)Available download formats
Dataset updated
Sep 19, 2018
Dataset provided by
Googlehttp://google.com/
BigQueryhttps://cloud.google.com/bigquery
Authors
Google BigQuery
License

Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically

Description

Fork this notebook to get started on accessing data in the BigQuery dataset by writing SQL queries using the BQhelper module.

Context

Google Patents Public Data, provided by IFI CLAIMS Patent Services, is a worldwide bibliographic and US full-text dataset of patent publications. Patent information accessibility is critical for examining new patents, informing public policy decisions, managing corporate investment in intellectual property, and promoting future scientific innovation. The growing number of available patent data sources means researchers often spend more time downloading, parsing, loading, syncing and managing local databases than conducting analysis. With these new datasets, researchers and companies can access the data they need from multiple sources in one place, thus spending more time on analysis than data preparation.

Content

The Google Patents Public Data dataset contains a collection of publicly accessible, connected database tables for empirical analysis of the international patent system.

Acknowledgements

Data Origin: https://bigquery.cloud.google.com/dataset/patents-public-data:patents

For more info, see the documentation at https://developers.google.com/web/tools/chrome-user-experience-report/

“Google Patents Public Data” by IFI CLAIMS Patent Services and Google is licensed under a Creative Commons Attribution 4.0 International License.

Banner photo by Helloquence on Unsplash

Search
Clear search
Close search
Google apps
Main menu