14 datasets found
  1. PatentsView Data

    • kaggle.com
    zip
    Updated Feb 12, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Google BigQuery (2019). PatentsView Data [Dataset]. https://www.kaggle.com/datasets/bigquery/patentsview
    Explore at:
    zip(0 bytes)Available download formats
    Dataset updated
    Feb 12, 2019
    Dataset provided by
    BigQueryhttps://cloud.google.com/bigquery
    Authors
    Google BigQuery
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Context

    The USPTO grants US patents to inventors and assignees all over the world. For researchers in particular, PatentsView is intended to encourage the study and understanding of the intellectual property (IP) and innovation system; to serve as a fundamental function of the government in creating “public good” platforms in these data; and to eliminate redundant cleaning, converting and matching of these data by individual researchers, thus freeing up researcher time to do what they do best—study IP, innovation, and technological change.

    Content

    PatentsView Data is a database that longitudinally links inventors, their organizations, locations, and overall patenting activity. The dataset uses data derived from USPTO bulk data files.

    Fork this notebook to get started on accessing data in the BigQuery dataset using the BQhelper package to write SQL queries.

    Acknowledgements

    “PatentsView” by the USPTO, US Department of Agriculture (USDA), the Center for the Science of Science and Innovation Policy, New York University, the University of California at Berkeley, Twin Arch Technologies, and Periscopic, used under CC BY 4.0.

    Data Origin: https://bigquery.cloud.google.com/dataset/patents-public-data:patentsview

    Banner photo by rawpixel on Unsplash

  2. d

    PatentsView PatentSearch API (Version 2.3.0)

    • catalog.data.gov
    Updated Sep 30, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Office of the Chief Economist (OCE) (2025). PatentsView PatentSearch API (Version 2.3.0) [Dataset]. https://catalog.data.gov/dataset/patentsview-api-version-1-0-0
    Explore at:
    Dataset updated
    Sep 30, 2025
    Dataset provided by
    Office of the Chief Economist (OCE)
    Description

    The PatentsView PatentSearch API is intended to inspire the exploration and enhanced understanding of US intellectual property (IP) and innovation systems. The database driving the API is regularly updated and integrates the best available tools for inventor disambiguation and data quality control. We hope researchers and developers alike will explore the API to discover people and companies and to visualize trends and patterns across the US innovation landscape.

  3. PatentsView full description text for the 12/31/2024 release, both granted...

    • zenodo.org
    zip
    Updated Mar 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zenodo (2025). PatentsView full description text for the 12/31/2024 release, both granted and pre-grant. [Dataset]. http://doi.org/10.5281/zenodo.15062212
    Explore at:
    zipAvailable download formats
    Dataset updated
    Mar 25, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    These are the detailed description text from granted patents (1976-2014, prefix "g_") and patent applications (2021-2014, prefix "pg_") from the final release of PatentsView on 12/31/2024.

  4. PatentsView Data

    • console.cloud.google.com
    Updated Jul 17, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    https://console.cloud.google.com/marketplace/browse?filter=partner:Google%20Patents%20Public%20Datasets&hl=en_GB (2023). PatentsView Data [Dataset]. https://console.cloud.google.com/marketplace/product/google_patents_public_datasets/patentsview?hl=en_GB
    Explore at:
    Dataset updated
    Jul 17, 2023
    Dataset provided by
    Googlehttp://google.com/
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    PatentsView Data is a dataset that longitudinally links inventors, their organizations, locations, and overall patenting activity. The dataset uses data derived from USPTO bulk data files.

  5. patents_granted_Q12019_USPTO

    • kaggle.com
    zip
    Updated Sep 24, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jayeeta Putatunda (2019). patents_granted_Q12019_USPTO [Dataset]. https://www.kaggle.com/datasets/jputatunda/patents-granted-q12019-uspto
    Explore at:
    zip(32786160 bytes)Available download formats
    Dataset updated
    Sep 24, 2019
    Authors
    Jayeeta Putatunda
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Context

    The Patents View API can easily query all the granted patents in US. But the result table is not very comprehensive and have repeat records for different values for a particular column. I wrote a script to comprehend it and pulled only data rich columns on which various analysis can be done.

    Content

    This dataset consists of all granted patents in US in the first quarter of 2019 (Jan - Mar). I wanted to analyze this to see which industries are leading in innovations, sectors and technologies used in these patents and see if we could draw some patterns.

    This dataset is a publicly available dataset and you can check all available columns here - http://www.patentsview.org/api/patent.html.

    Inspiration

    I am still building my analyzing and visualization dashboard. Open to any questions that you may want to see answered from this dataset.

  6. h

    USPTO-3M

    • huggingface.co
    Updated Dec 31, 2015
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Malav Patel (2015). USPTO-3M [Dataset]. https://huggingface.co/datasets/MalavP/USPTO-3M
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 31, 2015
    Authors
    Malav Patel
    Description

    This dataset was generated using Google's BigQuery API. The query is adapted from Appendix A from the work by Lee and Hsiang. The query is changed to include patents from 2000 - 2015. The specific query is shown below. SELECT STRING_AGG(distinct t2.group_id ORDER BY t2.group_id) AS cpc_ids, t1.id, t1.date, t3.text FROM patents-public-data.patentsview.patent t1, patents-public-data.patentsview.cpc_current t2, patents-public-data.patentsview.claim t3 WHERE t1.id =… See the full description on the dataset page: https://huggingface.co/datasets/MalavP/USPTO-3M.

  7. n

    Data from: Measures Associated to USPTO Patent Technological Significance

    • data.ncl.ac.uk
    csv
    Updated Jul 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rafael A. Corredoira; Brent Goldfarb (2025). Measures Associated to USPTO Patent Technological Significance [Dataset]. http://doi.org/10.25405/data.ncl.29506094.v1
    Explore at:
    csvAvailable download formats
    Dataset updated
    Jul 16, 2025
    Dataset provided by
    Newcastle University
    Authors
    Rafael A. Corredoira; Brent Goldfarb
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Version 1. 15/07/2025This dataset includes measures associated to the technological significance of USPTO patents and supporting variables. It covers patents granted between 1 January 1980 and 31 December 2009.These variables were utilized in “The Changing Nature of Firm Innovation: Short-term Orientation and Influential Innovation in US Public Firms”, Management Science, forthcomingIf you use this dataset in your research or publication, we kindly ask that you acknowledge it by including the following citation: Corredoira, R. A., & Goldfarb, B. D. (2025). Measures associated to USPTO patent technological significance [Data set]. Newcastle University. https://doi.org/10.25405/data.ncl.29506094 Proper citation helps ensure the dataset's impact is recognized and supports continued data sharing.

  8. D

    Supplementary data for: Social Push and the Direction of Innovation -...

    • datalumos.org
    delimited
    Updated Oct 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Elias Einio; Josh Feng; Xavier Jaravel (2025). Supplementary data for: Social Push and the Direction of Innovation - PatentsView data [Dataset]. http://doi.org/10.3886/E238523V1
    Explore at:
    delimitedAvailable download formats
    Dataset updated
    Oct 1, 2025
    Dataset provided by
    University of Utah, David Eccles School of Business
    VATT Institute for Economic Research
    London School of Economics
    Authors
    Elias Einio; Josh Feng; Xavier Jaravel
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    United States
    Description

    This repository includes the version of the PatentsView data (March 2022) that is used to produce some of the exhibits in "Social Push and the Direction of Innovation."Because we needed citations data and the March 2022 version is no longer available, we use the latest available version of g_us_patent_citation.tsv (from April 2025), which is a bit newer than the version available at: https://doi.org/10.3886/E223582V1.

  9. h

    us-patent-descriptions

    • huggingface.co
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Michael Hurhangee, us-patent-descriptions [Dataset]. https://huggingface.co/datasets/mhurhangee/us-patent-descriptions
    Explore at:
    Authors
    Michael Hurhangee
    Description

    US Patent Descriptions

    This dataset contains the descriptions of granted US utility patents, filtered and deduplicated.The original data comes from all granted patents in 2025 up to May 20, available from PatentsView.

      Splits
    

    train: 10,000 rows for model training
    validation: 2,500 rows for validation
    test: 2,500 rows for evaluation

      Columns
    

    patent_id: Identifier for the patent; useful for reconciling with other PatentsView datasets
    description_text: Full… See the full description on the dataset page: https://huggingface.co/datasets/mhurhangee/us-patent-descriptions.

  10. Data from: A Roadmap for Systematically Identifying Opportunities in...

    • figshare.com
    xlsx
    Updated Apr 28, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Behrooz Khademi; Hannele Lampela; Gerrit De Waal; Smyrnios, Kosmas X. (2023). A Roadmap for Systematically Identifying Opportunities in Geographically Bounded Ecosystems Using Patent Analytics [Dataset]. http://doi.org/10.6084/m9.figshare.14782221.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Apr 28, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Behrooz Khademi; Hannele Lampela; Gerrit De Waal; Smyrnios, Kosmas X.
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset contains supplementary material regarding country-specific analysis for the Nordic region.

  11. DISCERN 2: Duke Innovation & SCientific Enterprises Research Network

    • zenodo.org
    Updated Aug 30, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ashish Arora; Ashish Arora; Sharon Belenzon; Sharon Belenzon; Larisa Cioaca; Larisa Cioaca; Lia Sheer; Lia Sheer; Hyun Moh (John) Shin; Hyun Moh (John) Shin; Dror Shvadron; Dror Shvadron (2024). DISCERN 2: Duke Innovation & SCientific Enterprises Research Network [Dataset]. http://doi.org/10.5281/zenodo.13619821
    Explore at:
    Dataset updated
    Aug 30, 2024
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Ashish Arora; Ashish Arora; Sharon Belenzon; Sharon Belenzon; Larisa Cioaca; Larisa Cioaca; Lia Sheer; Lia Sheer; Hyun Moh (John) Shin; Hyun Moh (John) Shin; Dror Shvadron; Dror Shvadron
    License

    https://cdla.dev/open-use-of-data-agreement-v1-0https://cdla.dev/open-use-of-data-agreement-v1-0

    Description

    The DISCERN dataset was developed to support academic research on corporate innovation by linking data on U.S. publicly listed firms from Standard & Poor’s Compustat database to their patents and scientific publications. A key feature of DISCERN is its comprehensive coverage of firms’ subsidiaries and their ownership changes over time, which is crucial for accurately mapping corporate innovation. Patents and publications may be assigned to various legal entities within a firm’s organizational structure. Subsidiaries may change ownership in M&A events. By accounting for these ownership linkages over time, DISCERN enables researchers to construct more precise measures of firms’ knowledge production and examine the factors influencing their R&D investment decisions.

    Version 2.0 incorporates several key improvements over the previous version of DISCERN. First, we shift to using the PatentsView database as the main source of patent data and OpenAlex as the main source of scientific publication data. PatentsView is publicly available and continuously maintained directly by the United States Patents & Trademarks Office (USPTO). OpenAlex is currently the only open data source of scientific publication metadata. Using freely available data sources allows us to share both the patent and the publication datasets openly. This enhances data access, which was previously limited due to the use of propriety data. Second, the updated dataset now covers the period from 1980 to 2021, providing an additional six years of data. Third, we transition to using Securities and Exchange Commission (SEC) filings as the primary source of subsidiary data, allowing us to trace ownership linkages further back to the mid-1990s and ensuring a higher degree of reliability compared to the Orbis data used in the original version, which was less reliable and had comprehensive coverage only from 2008. Finally, by transitioning to PatentsView and additional data sourced from the USPTO, we expand the scope of the dataset to include pre-grant patent applications and patent re-assignment information. This addition allows users to study patent applications regardless of grant status and to observe ownership transitions beyond those related to mergers and acquisitions.

    A special thanks and appreciation go to Sanskriti Purohit and Ron Rabi for their diligent work and dedication to this effort.

    The dataset is freely available under the O-UDA-1.0 License, permitting unrestricted use for research and commercial purposes. We request that users provide proper citations when utilizing the dataset. The license also allows for the creation of derivative datasets based on DISCERN, with the condition that creators ask their downstream users to cite the original authors appropriately.

    If you use the data, please add these citations:

    1. Arora, A., Belenzon, S., Cioaca, L., Sheer, L, Shin, H.M. & Shvadron, D. (2024). DISCERN 2.0: Duke Innovation & SCientific Enterprises Research Network [Dataset]. In Zenodo (CERN European Organization for Nuclear Research). https://doi.org/10.5281/zenodo.3594642

    2. Arora, A., Belenzon, S., Cioaca, L., Sheer, L, & Shvadron, D. (2024). Back to the Future: Are Big Firms Regaining their Scientific and Technological Dominance? Evidence from DISCERN 2.0 (available soon)

  12. The anatomy of Green AI technologies: structure, evolution, and impact -...

    • zenodo.org
    bin, csv
    Updated May 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lorenzo Emer; Lorenzo Emer; Andrea Mina; Andrea Vandin; Andrea Vandin; Andrea Mina (2025). The anatomy of Green AI technologies: structure, evolution, and impact - Dataset and Replicability Material [Dataset]. http://doi.org/10.5281/zenodo.15545361
    Explore at:
    csv, binAvailable download formats
    Dataset updated
    May 30, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Lorenzo Emer; Lorenzo Emer; Andrea Mina; Andrea Vandin; Andrea Vandin; Andrea Mina
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Accompanying material for the paper "The anatomy of Green AI technologies: structure, evolution, and impact" (2025).

    Dataset Construction

    The Green AI Patent Dataset comprises 63 326 unique U.S. patents that intersect environmental (“green”) technologies with artificial‐intelligence components, spanning from 1976 to 2023. It was assembled by combining:

    1. PatentsView (USPTO) – U.S. patents (snapshot of January 2025) labelled under Cooperative Patent Classification classes Y02 and Y04S for climate‐change mitigation/adaptation and smart‐grid technologies.

    2. Artificial Intelligence Patent Dataset (AIPD 2023 - most recent update) – USPTO’s machine‐learning–validated classification of AI‐related patents (predict50_any_ai = 1). Available here: Pairolero, N. et al. The artificial intelligence patent dataset (aipd) 2023 update. USPTO Economic Working Paper 2024-4,
      USPTO (2024). Available at https://www.uspto.gov/sites/default/files/documents/oce-aipd-2023.pdf.

    Variables

    VariableDescriptionCompleteness (non-null count)
    patent_idUnique USPTO patent identifier.63 326
    cpc_subclassSubclasses of "green" CPC taxonomy Y02 / Y04S. Refer to the USTPO's website for more details: https://www.uspto.gov/web/patents/classification/cpc/html/cpc-Y.html63 326
    patent_dateGrant date of the patent (YYYY-MM-DD).63 326
    patent_titleTitle of the patent.63 326
    assigneeDisambiguated assignee organization name.59 479
    countryDisambiguated assignee country.59 155
    forward_citationsNumber of times this patent is cited by later patents (forward citations).63 326
    tech_domainBERTOPIC‐derived technology domain (integer 0–15; –1 marks outliers).62 337
    real_valueMarket‐value proxy associated with the patent, derived from the updated dataset of Kogan, L., Papanikolaou, D., Seru, A. & Stoffman, N. Technological innovation, resource allocation, and growth. The Q. J.
    Econ. 132, 665–712, DOI: 10.1093/qje/qjw040 (2017).
    26 306

    BERTOPIC Topic Mapping

    Each patent was assigned to one of 16 topics (tech_domain), numbered 0–15 (with –1 for outliers). Below is the label, example keywords (with their topic cohesion scores), and the number of patents in each topic:

    IDLabelTop Keywords (score)Count
    0Data Processing & Memory Managementprocessing (0.516), computing (0.461), process (0.449), systems (0.443), memory (0.421)27 435
    1Microgrid & Distributed Energy Systemsmicrogrid (0.487), electricity (0.421), utility (0.401), power (0.380), energy (0.370)5 378
    2Vehicle Control & Autonomous Powertrainsvehicle (0.477), vehicles (0.468), control (0.416), driving (0.387), engine (0.386)3 747
    3Irrigation & Agricultural Water Mgmtirrigation (0.511), systems (0.431), flow (0.353), process (0.348), water (0.333)2 754
    4Photovoltaic & Electrochemical Devicessemiconductor (0.518), photoelectric (0.509), electrodes (0.487), electrode (0.473), photovoltaic (0.470)2 599
    5Clinical Microbiome & Therapeuticsmicrobiome (0.481), clinical (0.371), physiological (0.321), therapeutic (0.320), disease (0.314)2 286
    6Combustion Engine Controlcombustion (0.423), engine (0.373), control (0.342), fuel (0.338), ignition (0.318)2 179
    7Battery Charging & Managementcharging (0.485), charger (0.449), charge (0.425), battery (0.386), batteries (0.377)1 541
    8HVAC & Thermal Regulationhvac (0.515), heater (0.474), cooling (0.471), heating (0.464), evaporator (0.455)1 523
    9Lighting & Illumination Systemslighting (0.621), illumination (0.601), lights (0.545), brightness (0.526), light (0.488)1 219
    10Exhaust & Emission Treatmentexhaust (0.464), catalytic (0.446), purification (0.444), catalyst (0.366), emissions (0.365)1 064
    11Wind Turbine & Rotor Controlturbines (0.498), turbine (0.488), windmill (0.464), wind (0.418), rotor (0.300)988
    12Aircraft Wing Aerodynamics & Controlwing (0.450), aircraft (0.448), wingtip (0.424), apparatus (0.423), aerodynamic (0.418)697
    13Meteorological Radar & Weather Forecastingradar (0.541), meteorological (0.511), weather (0.412), precipitation (0.391), systems (0.372)542
    14Fuel Cell Systems & Electrodesfuel (0.375), cell (0.313), systems (0.295), cells (0.291), controls (0.262)377
    15Turbine Airfoils & Coolingairfoils (0.584), airfoil (0.572), turbine (0.433), engine (0.333), axial (0.321)352
    –1Outliers7 656

    Code availability

    This Zenodo entry contains topic_modeling.ipynb, a fully documented jupyter notebook containing Python code for uncovering latent themes in patent abstracts using BERTopic. It walks through text preprocessing (lowercasing, standard English stopwords plus “herein” and “invention,” tokenization, and boilerplate removal), embedding with the all-MiniLM-L6-v2 SentenceTransformer, dimensionality reduction via UMAP, clustering with HDBSCAN, and topic extraction through class-based TF-IDF. The script also executes a grid search over UMAP and HDBSCAN hyperparameters, computes UMass coherence and topic diversity for each configuration, and saves a CSV of evaluation metrics, enabling straightforward reproduction of our topic-modeling workflow.

    **Note on Patent Abstracts**
    The BERTopic analysis in this notebook was performed on the full text of U.S. patent abstracts. To save space and comply with memory constraints, the abstracts themselves are not included in this repository. However, they can be downloaded directly from the PatentsView portal (see “g_patent_abstract” in the data tables at https://patentsview.org/download/data-download-tables). Each record is linked to our processed dataset via the `patent_id` field, so you can seamlessly merge the raw abstracts with your local copy of the Green AI dataset before running or inspecting the topic model.

    Additional analyses, such as data cleaning, merging, aggregation, and the generation of summary tables and plots, were also performed but are not included here by default, as they consist of straightforward operations using standard open-source libraries (e.g., pandas, NumPy, matplotlib, and seaborn). The full code for these steps can be made available upon request.

  13. d

    Science and Engineering Indicators 2024 support material

    • elsevier.digitalcommonsdata.com
    Updated Feb 15, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Guillaume Roberge (2024). Science and Engineering Indicators 2024 support material [Dataset]. http://doi.org/10.17632/vrg53tc5r2.1
    Explore at:
    Dataset updated
    Feb 15, 2024
    Authors
    Guillaume Roberge
    License

    Attribution-NonCommercial 3.0 (CC BY-NC 3.0)https://creativecommons.org/licenses/by-nc/3.0/
    License information was derived automatically

    Description

    This repository includes the base notebooks used to prepare the patent and trademark data for SEI 2024. This covers the uploading of the PatentsView database, its curation, and the preparation of patent and trademark indicators across all the mapping classifications.

  14. IRIS DOD-SBIR database

    • zenodo.org
    zip
    Updated Sep 1, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Carlo Bottai; Carlo Bottai; Emilio Raiteri; Gaétan de Rassenfosse; Gaétan de Rassenfosse; Emilio Raiteri (2021). IRIS DOD-SBIR database [Dataset]. http://doi.org/10.5281/zenodo.5341454
    Explore at:
    zipAvailable download formats
    Dataset updated
    Sep 1, 2021
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Carlo Bottai; Carlo Bottai; Emilio Raiteri; Gaétan de Rassenfosse; Gaétan de Rassenfosse; Emilio Raiteri
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This database collects and links U.S. federal funded awards to U.S. utility patents, and such patents to virtual patent marking (VPM) pages, in line with two related project: 3PFL and IPRoduct. Specifically, this database looks at awards provided by the U.S. Department of Defense (DOD) within the Small Business Innovation Research (SBIR) and Small Business Technology Transfer Program (STTR) programs from 1984 to 2018.

    The database is part of a project, IRIS - Insights on the "Real" Impact of Science. The project aims at assessing how public investment in research and development (R&D) translates into commercial products for the final consumer.

    The database is composed of three main elements: awards; patents; and web pages. The database provides several information pieces. This has been possible by making use of several sources, that has been properly combined and further elaborated in a convenient way. Information about the awards comes from the Defense Contract Action Data System (DCADS), for the years 1984--2001, and from USAspending.gov, for the years 2001--2018. Most information about the patents is provided by PatentsView, while specific information comes from the Patent Examination Research Dataset (PatEx) or from PATSTAT.

  15. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Google BigQuery (2019). PatentsView Data [Dataset]. https://www.kaggle.com/datasets/bigquery/patentsview
Organization logo

PatentsView Data

Analyze and explore US patent data by the USPTO (BigQuery)

Explore at:
432 scholarly articles cite this dataset (View in Google Scholar)
zip(0 bytes)Available download formats
Dataset updated
Feb 12, 2019
Dataset provided by
BigQueryhttps://cloud.google.com/bigquery
Authors
Google BigQuery
License

Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically

Description

Context

The USPTO grants US patents to inventors and assignees all over the world. For researchers in particular, PatentsView is intended to encourage the study and understanding of the intellectual property (IP) and innovation system; to serve as a fundamental function of the government in creating “public good” platforms in these data; and to eliminate redundant cleaning, converting and matching of these data by individual researchers, thus freeing up researcher time to do what they do best—study IP, innovation, and technological change.

Content

PatentsView Data is a database that longitudinally links inventors, their organizations, locations, and overall patenting activity. The dataset uses data derived from USPTO bulk data files.

Fork this notebook to get started on accessing data in the BigQuery dataset using the BQhelper package to write SQL queries.

Acknowledgements

“PatentsView” by the USPTO, US Department of Agriculture (USDA), the Center for the Science of Science and Innovation Policy, New York University, the University of California at Berkeley, Twin Arch Technologies, and Periscopic, used under CC BY 4.0.

Data Origin: https://bigquery.cloud.google.com/dataset/patents-public-data:patentsview

Banner photo by rawpixel on Unsplash

Search
Clear search
Close search
Google apps
Main menu