    Enterprise-Driven Open Source Software

    • opendatalab.com
    • data.niaid.nih.gov
    Cite
    Athens University of Economics and Business (2020). Enterprise-Driven Open Source Software [Dataset]. https://opendatalab.com/OpenDataLab/Enterprise-Driven_Open_Source_etc
    Available download formats: zip (7896769 bytes)
    Dataset updated: Apr 21, 2020
    Dataset provided by: Athens University of Economics and Business
    License: Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
    License information was derived automatically

    Description

    We present a dataset of open source software developed mainly by enterprises rather than volunteers. It can be used to address known generalizability concerns and to perform research on open source business software development. Based on the premise that an enterprise's employees are likely to contribute to a project developed by their organization using the email account it provides, we mine domain names associated with enterprises from open data sources as well as through white- and blacklisting, and apply them through three heuristics to identify 17,264 enterprise GitHub projects. We provide these as a dataset detailing their provenance and properties. A manual evaluation of a dataset sample shows an identification accuracy of 89%. Through an exploratory data analysis we found that projects are staffed by a plurality of enterprise insiders, who appear to be pulling more than their weight, and that in a small percentage of relatively large projects development happens exclusively through enterprise insiders.
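
The email-domain premise in the description can be illustrated with a minimal sketch. This is not the authors' implementation, and it does not reproduce their three heuristics, which this page does not spell out. The function names, the 0.5 threshold, the public-provider blacklist, and the sample domains below are all hypothetical, chosen only to show the general idea of matching committer email domains against a list of enterprise domains.

```python
from collections import Counter

# Hypothetical blacklist of public email providers (an assumption, not from the dataset).
PUBLIC_PROVIDERS = {"gmail.com", "yahoo.com", "outlook.com"}


def email_domain(address):
    """Return the lowercased domain of an email address, or None if malformed."""
    local, sep, domain = address.strip().rpartition("@")
    if not sep or not local or not domain:
        return None
    return domain.lower()


def dominant_enterprise_domain(author_emails, enterprise_domains, threshold=0.5):
    """Return the enterprise domain behind at least `threshold` of the
    commit author emails, or None if no domain reaches that share.

    `enterprise_domains` is an assumed whitelist of enterprise domains;
    `threshold` is an illustrative cut-off, not a value from the dataset.
    """
    domains = [d for d in map(email_domain, author_emails) if d is not None]
    if not domains:
        return None
    # Count only whitelisted domains that are not public email providers.
    counts = Counter(
        d for d in domains
        if d in enterprise_domains and d not in PUBLIC_PROVIDERS
    )
    if not counts:
        return None
    domain, n = counts.most_common(1)[0]
    return domain if n / len(domains) >= threshold else None


# Hypothetical commit author emails for one project.
sample_emails = ["a@example-corp.com", "b@example-corp.com", "c@gmail.com"]
result = dominant_enterprise_domain(sample_emails, {"example-corp.com"})
```

Here two of the three sample authors use the whitelisted domain, so the project would be flagged under the assumed 0.5 threshold. A stricter threshold trades recall for precision, which is the kind of trade-off a manual accuracy evaluation like the one mentioned above would help calibrate.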

49 scholarly articles cite this dataset (View in Google Scholar)