72 datasets found
  1. Top NPM GitHub Repositories

    • kaggle.com
    zip
    Updated Dec 6, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mike Lanciano (2021). Top NPM GitHub Repositories [Dataset]. https://www.kaggle.com/datasets/mikelanciano/top-npm-github-repositories
    Explore at:
    zip(298650 bytes)Available download formats
    Dataset updated
    Dec 6, 2021
    Authors
    Mike Lanciano
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    NPM Repository Data Set Genertion

    This was a dataset generated to pull public repository attributes about a the top 10,000 most popular NPM packages for analysis. The initial goal was numeric prediction and nominal classification around security advisories to identify key attributes for consideration in just-in-time supply chain analysis of third party dependencies. Ultimately there were know key public attributes that were statistically significant enough to build confidence to a packages overall security posture.

    How The Data Was Generated

    The data was generated using various APIs and then aggregated, including Deps.dev - OSI, GitHub API, and npms.io. Each API was chosen for the data consistency and reliability and that the data was fed downstream from overlapping sources. Here is how the scripts put this together.

    Data Challanges

    The dataset represents a point in time of the security advisories and is not a complete picture of the security health of overall project. Further only 13% of the entire dataset has an advisories. As such it's really difficult to draw any accurate conclusions and the data poorly correlates.

    Future Considerations

    I think this set serves as a good starting point for others who are interested in deriving more information about repositories such as if there are other attributes in the dataset that correlate or if additional APIs could bring more attributes into the fold. For exmaple, topic tagging wold be ideal. Potential problems in the future might be around assessing project health and DevOps related to repository popularity.

  2. Z

    NCQ Dataset (NPM Package Data)

    • data.niaid.nih.gov
    Updated Jul 26, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Reid, Brittany (2021). NCQ Dataset (NPM Package Data) [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4835693
    Explore at:
    Dataset updated
    Jul 26, 2021
    Dataset provided by
    The University of Adelaide
    Authors
    Reid, Brittany
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset for use in Node Code Query, contains package information in a tab separated csv file. The unzipped size is ~700MB.

    You do not need to manually download this file for use in NCQ, the setup scripts will handle this for you automatically.

    The dataset contains the following fields:

    Mined from the NPM registry:

    Package name

    Description

    Keywords

    License

    repositoryUrl

    timeModified

    Derived from data on the NPM registry:

    Array of Node.js code snippets extracted from the package README using https://github.com/Brittany-Reid/npm-code-snippets

    Number of markdown code blocks in the README (number may be larger than node.js snippets, these are non-filtered)

    Number of lines in the README

    If an install example exists in the README (if a code block exists with npm install or a install header exists)

    If a run example exists in the README (if a code block exists with npm run or a usage header exists)

    Mined from GitHub for packages with a GitHub repository (values will be 0 or false for packages missing this data)

    Number of stars

    Is a fork?

    Number of forks

    Number of watchers

    If a test directory exists (if the top level directory contains a folder called test or tests)

  3. i

    NPM Data for IEEE

    • ieee-dataport.org
    Updated Jun 25, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dongdong You (2020). NPM Data for IEEE [Dataset]. https://ieee-dataport.org/documents/npm-data-ieee
    Explore at:
    Dataset updated
    Jun 25, 2020
    Authors
    Dongdong You
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    signal data

  4. Data from: Investigating the Reproducibility of NPM Packages

    • zenodo.org
    bin, zip
    Updated Jun 25, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anonymous Anonymous; Anonymous Anonymous (2020). Investigating the Reproducibility of NPM Packages [Dataset]. http://doi.org/10.5281/zenodo.3698357
    Explore at:
    zip, binAvailable download formats
    Dataset updated
    Jun 25, 2020
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Anonymous Anonymous; Anonymous Anonymous
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Open-source Dataset


    This dataset contains the NPM packages that we built using our tool-chain. It consists of the diffoscope outputs, the versions built by our tool-chain, and the pre-built packages present on the npmjs registry.

  5. NPM Package Information from Libraries.io

    • zenodo.org
    zip
    Updated Jun 17, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Brittany Reid; Brittany Reid (2020). NPM Package Information from Libraries.io [Dataset]. http://doi.org/10.5281/zenodo.3898749
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 17, 2020
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Brittany Reid; Brittany Reid
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This zip file contains a JSON file of package and repository information for 12 million NPM packages, sourced from libaries.io (10.5281/zenodo.3626071, https://libraries.io/data, January 12, 2020). The file is 2.39GB uncompressed.

    The file was generated from the 'Projects with related Repository fields' csv file, filtering for NPM packages only. The original files contains inconsistent columns between the first row, containing labels, and the subsequent rows, containing data, however the additional, unlabelled column is empty for all NPM packages so it has been ignored, and the subsequent rows shifted up into their correct positions.

    This dataset has not modified otherwise.

    Includes data from Libraries.io.

  6. Data from: How Do Developers Follow Security-Relevant Best Practices When...

    • figshare.com
    pdf
    Updated Sep 2, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anonymous Author (2022). How Do Developers Follow Security-Relevant Best Practices When Using NPM Packages? [Dataset]. http://doi.org/10.6084/m9.figshare.20800696.v1
    Explore at:
    pdfAvailable download formats
    Dataset updated
    Sep 2, 2022
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Anonymous Author
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The data contains the results of best practice violations in NodeJS projects

  7. p

    Lod-opendata - A NPM Package

    • data.public.lu
    code
    Updated Oct 4, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Roberto Entringer (2022). Lod-opendata - A NPM Package [Dataset]. https://data.public.lu/en/datasets/lod-opendata-a-npm-package/
    Explore at:
    codeAvailable download formats
    Dataset updated
    Oct 4, 2022
    Authors
    Roberto Entringer
    Description

    A NPM package for get data of Lëtzebuerger Online Dictionnaire (LOD) from data.public.lu. Repo on Github : https://github.com/robertoentringer/lod-opendata Npm package : https://www.npmjs.com/package/lod-opendata

  8. Data from: Helping or not Helping? Why and How Trivial Packages Impact the...

    • zenodo.org
    zip
    Updated Jan 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Xiaowei Chen; Rabe Abdalkareem; Suhaib Mujahid; Emad Shihab; Xin Xia; Xiaowei Chen; Rabe Abdalkareem; Suhaib Mujahid; Emad Shihab; Xin Xia (2020). Helping or not Helping? Why and How Trivial Packages Impact the npm Ecosystem [Dataset]. http://doi.org/10.5281/zenodo.3417393
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jan 24, 2020
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Xiaowei Chen; Rabe Abdalkareem; Suhaib Mujahid; Emad Shihab; Xin Xia; Xiaowei Chen; Rabe Abdalkareem; Suhaib Mujahid; Emad Shihab; Xin Xia
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Developers often share their code snippets by packaging them and making them available to others through software packages. How much a package does and how big it is can be seen as positive or negative. Recent studies showed that many packages that exist in the npm ecosystem are trivial and may introduce high dependency overhead.

    Hence, one question that arises is why developers choose to publish these trivial packages. Therefore, in this paper, we perform a developer-centered study to empirically examine why developers choose to publish such trivial packages. Specifically, we ask 1) why developers publish trivial packages, 2) what they believe to be the possible negative impacts of these packages, and 3) how such negative issues can be mitigated. The survey response of 59 JavaScript developers who publish trivial npm packages showed that the main reasons for publishing these trivial packages are to provide reusable components, testing & documentation, and separation of concerns. Even the developers who publish these trivial packages admitted to having issues when they publish such packages, which include the maintenance of multiple packages, dependency hell, finding the right package, and the increase of duplicated packages in the ecosystems. Furthermore, we found that the majority of the developers suggested grouping these trivial packages to cope with the problems associated with publishing them. Then, to quantitatively investigate the impact of these trivial packages on the npm ecosystem and its users, we examine grouping these trivial packages. We found that if trivial packages that are always used together are grouped, the ecosystem can reduce the number of dependencies by approximately 13%. Our findings shed light on the impact of publishing trivial packages and show that ecosystems and developer communities need to rethink their publishing policies since it can negatively impact the developers and the entire ecosystem.

    The published data set contains the following:

    1. List of identified trivial npm packages.
    2. The survey questions.
    3. The developers' responses to the survey.
    4. The results of the co-usage analysis of trivial npm packages.

  9. Data from: On the Untriviality of Trivial Packages: An Empirical Study of...

    • zenodo.org
    csv
    Updated Sep 9, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Md Atique Reza Chowdhury; Rabe Abdalkareem; Emad Shihab; Bram Adams; Md Atique Reza Chowdhury; Rabe Abdalkareem; Emad Shihab; Bram Adams (2020). On the Untriviality of Trivial Packages: An Empirical Study of npm JavaScript Packages [Dataset]. http://doi.org/10.5281/zenodo.4019091
    Explore at:
    csvAvailable download formats
    Dataset updated
    Sep 9, 2020
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Md Atique Reza Chowdhury; Rabe Abdalkareem; Emad Shihab; Bram Adams; Md Atique Reza Chowdhury; Rabe Abdalkareem; Emad Shihab; Bram Adams
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Nowadays, developing software would be unthinkable without the use of third-party packages. Although such code reuse helps to achieve rapid continuous delivery of software to end-users, blindly reusing code has its pitfalls. For example, prior work has investigated the rationale for using packages that implement simple functionalities, known as trivial packages. This prior work showed that although these trivial packages were simple, they were popular and prevalent in the npm ecosystem. This popularity and prevalence of trivial packages peaked our interest in questioning the ‘triviality of trivial packages’.

    To better understand and examine the triviality of trivial packages, we mine a large set of JavaScript projects that use trivial npm packages and evaluate their relative centrality. Specifically, we evaluate the triviality from two complementary points of view: based on project usage and ecosystem usage of these trivial packages. Our result shows that trivial packages are being used in central JavaScript files of a software project. Additionally, by analyzing all external package API calls in these JavaScript files, we found that a high percentage of these API calls are attributed to trivial packages. Therefore, these packages play a significant role in JavaScript files. Furthermore, in the package dependency network, we observed that 16.8% packages are trivial and in some cases removing a trivial package can impact approximately 29% of the ecosystem. Overall, our finding indicates that although smaller in size and complexity, trivial packages are highly depended on packages by JavaScript projects. Additionally, our study shows that although they might be called trivial, nothing about trivial packages is trivial.

  10. Data from: On the Discoverability of npm Vulnerabilities in Node.js Projects...

    • zenodo.org
    bin
    Updated Jul 7, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alfadel et al.; Alfadel et al. (2022). On the Discoverability of npm Vulnerabilities in Node.js Projects [Dataset]. http://doi.org/10.5281/zenodo.6803536
    Explore at:
    binAvailable download formats
    Dataset updated
    Jul 7, 2022
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Alfadel et al.; Alfadel et al.
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This package contains the material used in the manuscript.

    For more information on how to understand the package structure, please read the README.md.

  11. u

    Needs and Population Monitoring (NPM) Round 10 Site Assessment - 2018 -...

    • microdata.unhcr.org
    Updated Jun 25, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Needs and Population Monitoring (2024). Needs and Population Monitoring (NPM) Round 10 Site Assessment - 2018 - Bangladesh [Dataset]. https://microdata.unhcr.org/index.php/catalog/study/HDX_needs-and-population-monitoring-npm-bangladesh-round-10-site-assessment_vEXT
    Explore at:
    Dataset updated
    Jun 25, 2024
    Dataset authored and provided by
    Needs and Population Monitoring
    Time period covered
    2018
    Area covered
    Bangladesh
    Description

    Abstract

    Following an outbreak of violence on 25 August 2017 in Rakhine State, Myanmar, a new massive influx of Rohingya refugees to Cox’s Bazar, Bangladesh started in late August 2017. Most of the Rohingya refugees settled in Ukhia and Teknaf Upazilas of Cox’s Bazar, a district bordering Myanmar identified as the main entry area for border crossings.

    These datasets present the result of the NPM Round 10 Baseline and Site Assessment exercises, which collected information related to the Rohingya population distribution and needs during the months of April and May 2018.

    The data collection for NPM baseline survey was conducted between 1 and 17 April 2018: this provides an update about the population distribution and movements; The data collection for NPM Site Assessment survey was conducted between 1 and 20 May 2018: in addition to an update about the population figures, this includeds a multi-sectoral needs assessment.

    The full maps and GIS packages by camp produced based on NPM Baseline and Site Assessment 10 are available at the links below:

    • Please click here to access the data by camp as of May 2018.
    • Please click here to access the data by camp as of April 2018.

    Rohingya refugee population distribution by para in Teknaf upazila. Data collected during NPM Site Assessment 10 between 1 and 20 May 2018.

    • Please click here.

    Geographic coverage

    Bangladesh

    Kind of data

    Observation data/ratings [obs]

  12. s

    Panasonic Npm Import Data India – Buyers & Importers List

    • seair.co.in
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Seair Exim Solutions, Panasonic Npm Import Data India – Buyers & Importers List [Dataset]. https://www.seair.co.in/panasonic-npm-import-data.aspx
    Explore at:
    .text/.csv/.xml/.xls/.binAvailable download formats
    Dataset authored and provided by
    Seair Exim Solutions
    Area covered
    India
    Description

    Access updated Panasonic Npm import data India with HS Code, price, importers list, Indian ports, exporting countries, and verified Panasonic Npm buyers in India.

  13. e

    Npm Impex Solution Export Import Data | Eximpedia

    • eximpedia.app
    Updated Feb 18, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Npm Impex Solution Export Import Data | Eximpedia [Dataset]. https://www.eximpedia.app/companies/npm-impex-solution/98995058
    Explore at:
    Dataset updated
    Feb 18, 2025
    Description

    Npm Impex Solution Export Import Data. Follow the Eximpedia platform for HS code, importer-exporter records, and customs shipment details.

  14. s

    Npm W2 Import Data India – Buyers & Importers List

    • seair.co.in
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Seair Exim, Npm W2 Import Data India – Buyers & Importers List [Dataset]. https://www.seair.co.in
    Explore at:
    .bin, .xml, .csv, .xlsAvailable download formats
    Dataset provided by
    Seair Info Solutions PVT LTD
    Authors
    Seair Exim
    Area covered
    India
    Description

    Subscribers can find out export and import data of 23 countries by HS code or product’s name. This demo is helpful for market analysis.

  15. e

    Npm Process Equipments Export Import Data | Eximpedia

    • eximpedia.app
    Updated Feb 15, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Npm Process Equipments Export Import Data | Eximpedia [Dataset]. https://www.eximpedia.app/companies/npm-process-equipments/18716161
    Explore at:
    Dataset updated
    Feb 15, 2025
    Description

    Npm Process Equipments Export Import Data. Follow the Eximpedia platform for HS code, importer-exporter records, and customs shipment details.

  16. c

    $NPM Hack Threatens JavaScript Ecosystem Price Prediction Data

    • coinbase.com
    Updated Nov 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). $NPM Hack Threatens JavaScript Ecosystem Price Prediction Data [Dataset]. https://www.coinbase.com/en-in/price-prediction/base-npm-hack-threatens-javascript-ecosystem-116c
    Explore at:
    Dataset updated
    Nov 12, 2025
    Variables measured
    Growth Rate, Predicted Price
    Measurement technique
    User-defined projections based on compound growth. This is not a formal financial forecast.
    Description

    This dataset contains the predicted prices of the asset $NPM Hack Threatens JavaScript Ecosystem over the next 16 years. This data is calculated initially using a default 5 percent annual growth rate, and after page load, it features a sliding scale component where the user can then further adjust the growth rate to their own positive or negative projections. The maximum positive adjustable growth rate is 100 percent, and the minimum adjustable growth rate is -100 percent.

  17. IOM Bangladesh - Needs and Population Monitoring (NPM) Round 11 Site...

    • data.humdata.org
    • cloud.csiss.gmu.edu
    • +1more
    pdf, xlsx
    Updated Mar 2, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    International Organization for Migration (IOM) (2023). IOM Bangladesh - Needs and Population Monitoring (NPM) Round 11 Site Assessment [Dataset]. https://data.humdata.org/dataset/3cb08900-d6f1-4b0e-b1d0-697c3db30236?force_layout=desktop
    Explore at:
    xlsx(3335958), pdf(266976), pdf(270336), xlsx(670441)Available download formats
    Dataset updated
    Mar 2, 2023
    Dataset provided by
    International Organization for Migrationhttp://www.iom.int/
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Bangladesh
    Description

    Following an outbreak of violence on 25 August 2017 in Rakhine State, Myanmar, a new massive influx of Rohingya refugees to Cox’s Bazar, Bangladesh started in late August 2017. Most of the Rohingya refugees settled in Ukhia and Teknaf Upazilas of Cox’s Bazar, a district bordering Myanmar identified as the main entry area for border crossings.

    This dataset presents the result of the NPM Round 11 exercise, which collected information related to the Rohingya refugee population distribution and needs during the months of June and July 2018.

    • The data collection for NPM baseline survey was conducted between 2 and 14 June 2018: it provides an update about the population distribution and movements.
    • The data collection for NPM Site Assessment survey was conducted between 1 and 22 July 2018: in addition to an update about the population figures, this includeds a multi-sectoral needs assessment.

    The full maps and GIS packages by camp produced based on NPM Baseline and Site Assessment 11 are available at the links below:

    • Please click here to access the data by camp as of May 2018.
    • Please click here to access the data by camp as of July 2018.

    Rohingya refugee population distribution by para in Teknaf upazila. - Please click here.

  18. e

    Npm C And O Europe Ou Export Import Data | Eximpedia

    • eximpedia.app
    Updated Jan 9, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Npm C And O Europe Ou Export Import Data | Eximpedia [Dataset]. https://www.eximpedia.app/companies/npm-c-and-o-europe-ou/78379928
    Explore at:
    Dataset updated
    Jan 9, 2025
    Area covered
    Europe
    Description

    Npm C And O Europe Ou Export Import Data. Follow the Eximpedia platform for HS code, importer-exporter records, and customs shipment details.

  19. d

    Aerodrome npm

    • dune.com
    Updated Dec 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    hypurrquant (2025). Aerodrome npm [Dataset]. https://dune.com/discover/content/trending?q=aerodrome&resource-type=queries
    Explore at:
    Dataset updated
    Dec 1, 2025
    Dataset authored and provided by
    hypurrquant
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Blockchain data query: Aerodrome npm

  20. e

    Npm Silmet O Export Import Data | Eximpedia

    • eximpedia.app
    Updated Oct 31, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Npm Silmet O Export Import Data | Eximpedia [Dataset]. https://www.eximpedia.app/companies/npm-silmet-o/57383510
    Explore at:
    Dataset updated
    Oct 31, 2025
    Description

    Npm Silmet O Export Import Data. Follow the Eximpedia platform for HS code, importer-exporter records, and customs shipment details.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Mike Lanciano (2021). Top NPM GitHub Repositories [Dataset]. https://www.kaggle.com/datasets/mikelanciano/top-npm-github-repositories
Organization logo

Top NPM GitHub Repositories

Attributes of Popular NPM Dependencies

Explore at:
zip(298650 bytes)Available download formats
Dataset updated
Dec 6, 2021
Authors
Mike Lanciano
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

NPM Repository Data Set Genertion

This was a dataset generated to pull public repository attributes about a the top 10,000 most popular NPM packages for analysis. The initial goal was numeric prediction and nominal classification around security advisories to identify key attributes for consideration in just-in-time supply chain analysis of third party dependencies. Ultimately there were know key public attributes that were statistically significant enough to build confidence to a packages overall security posture.

How The Data Was Generated

The data was generated using various APIs and then aggregated, including Deps.dev - OSI, GitHub API, and npms.io. Each API was chosen for the data consistency and reliability and that the data was fed downstream from overlapping sources. Here is how the scripts put this together.

Data Challanges

The dataset represents a point in time of the security advisories and is not a complete picture of the security health of overall project. Further only 13% of the entire dataset has an advisories. As such it's really difficult to draw any accurate conclusions and the data poorly correlates.

Future Considerations

I think this set serves as a good starting point for others who are interested in deriving more information about repositories such as if there are other attributes in the dataset that correlate or if additional APIs could bring more attributes into the fold. For exmaple, topic tagging wold be ideal. Potential problems in the future might be around assessing project health and DevOps related to repository popularity.

Search
Clear search
Close search
Google apps
Main menu