License: CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
This dataset contains raw data and processed data from the Dataverse Community Survey 2022. The main goal of the survey was to help the Global Dataverse Community Consortium (GDCC; https://dataversecommunity.global/) and the Dataverse Project (https://dataverse.org/) decide what actions to take to improve the Dataverse software and the larger ecosystem of integrated tools and services, and to better support community members. The results may also be of interest to other communities working on software and services for managing research data. The survey was designed to map out the current status as well as the roadmaps and priorities of Dataverse installations around the world. The main target group was the people and teams responsible for operating Dataverse installations around the world; a secondary target group was people and teams at organizations that are planning or considering deploying a Dataverse installation. Thirty-four existing and planned Dataverse installations participated in the survey.
The tabular file contains information on known Harvard repositories on GitHub, such as the number of stars, programming language, date last updated, number of open issues, size, number of forks, repository URL, creation date, and description. Each repository has a corresponding JSON file (see primary-data.zip) that was retrieved using the GitHub API with code and a list of repositories available from https://github.com/IQSS/open-source-at-harvard.
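As a rough illustration of how such per-repository JSON files can be retrieved, the hedged sketch below queries the GitHub REST API for a couple of repositories; the repository names here are placeholders, and the full list lives in the repository linked above.

```python
import json
import urllib.request

# Illustrative subset of repositories; the full list is maintained at
# https://github.com/IQSS/open-source-at-harvard
repos = ["IQSS/dataverse", "harvard-lil/perma"]

for full_name in repos:
    url = f"https://api.github.com/repos/{full_name}"
    with urllib.request.urlopen(url) as resp:
        repo = json.load(resp)
    # Fields corresponding to the tabular file's columns
    print(repo["full_name"], repo["stargazers_count"], repo["language"],
          repo["updated_at"], repo["open_issues_count"], repo["size"],
          repo["forks_count"], repo["html_url"], repo["created_at"])
    # Save the raw JSON response, as in primary-data.zip
    with open(full_name.replace("/", "_") + ".json", "w") as f:
        json.dump(repo, f, indent=2)
```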
License: CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
This dataverse hosts the data repository of the article "Open Source Software as Digital Platforms to Innovate". It contains the databases and R code that replicate the main results of the article. The article contains a detailed description of how these databases were constructed and how they are organized.
This article describes novel open source tools for open data publication in open access journal workflows. These comprise a plugin for Open Journal Systems that supports a data submission, citation, review, and publication workflow, and an extension to the Dataverse system that provides a standard deposit API. We describe the function and design of these tools, provide examples of their use, and summarize their initial reception. We conclude by discussing future plans and potential impact.
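To make the deposit side concrete, here is a hedged sketch of a SWORDv2 Atom-entry deposit against a Dataverse server, following the pattern of Dataverse's SWORD data deposit API; the server URL, API token, collection alias, and metadata are placeholders, not values from the article.

```python
import requests

# Hypothetical server and API token; Dataverse's SWORDv2 endpoint takes the
# API token as the basic-auth username with an empty password.
SERVER = "https://demo.dataverse.org"
API_TOKEN = "xxxxxxxx-xxxx-xxxx-xxxx-xxxxxxxxxxxx"
COLLECTION = f"{SERVER}/dvn/api/data-deposit/v1.1/swordv2/collection/dataverse/root"

# Minimal Atom entry describing the dataset to deposit
atom_entry = """<?xml version="1.0"?>
<entry xmlns="http://www.w3.org/2005/Atom"
       xmlns:dcterms="http://purl.org/dc/terms/">
  <title>Replication data for an example article</title>
  <dcterms:creator>Doe, Jane</dcterms:creator>
  <dcterms:description>Deposited from a journal workflow.</dcterms:description>
</entry>"""

resp = requests.post(
    COLLECTION,
    data=atom_entry,
    headers={"Content-Type": "application/atom+xml"},
    auth=(API_TOKEN, ""),
)
print(resp.status_code)  # 201 Created on success
```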
The goal of the Open Source Indicators (OSI) Program was to make automated predictions of significant societal events through the continuous and automated analysis of publicly available data such as news media, social media, informational websites, and satellite imagery. Societal events of interest included civil unrest, disease outbreaks, and election results. Geographic areas of interest included countries in Latin America (LA) and the Middle East and North Africa (MENA). The handbook is intended to serve as a reference document for the OSI Program and a companion to the ground truth event data used for test and evaluation. It provides guidance regarding the types of events considered; the submission of automated predictions or "warnings"; the development of ground truth; the test and evaluation of submitted warnings; performance measures; and other programmatic information. IARPA initiated a solicitation for OSI Research Teams in late summer 2011 for one base year and two option years of research. MITRE was selected as the Test and Evaluation (T&E) Team in November 2011. Following a review of proposals, three teams (BBN, HRL, and Virginia Tech (VT)) were selected. The OSI Program officially began in April 2012; manual event encoding and formal T&E ended in March 2015.
License: CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
Study information: Design ideation study (N = 24) using eye tracking technology. Participants solved a total of twelve design problems while receiving inspirational stimuli on a monitor. Their task was to generate as many solutions to each problem as possible and to explain each solution briefly by thinking aloud. The study allows further insight into how inspirational stimuli improve idea fluency during design ideation. This dataset features processed data from the experiment. Eye tracking data includes gaze data, fixation data, blink data, and pupillometry data for all participants. The study is based on the following research paper and follows the same experimental setup: Goucher-Lambert, K., Moss, J., & Cagan, J. (2019). A neuroimaging investigation of design ideation with and without inspirational stimuli—understanding the meaning of near and far stimuli. Design Studies, 60, 1-38.
Dataset: Most files in the dataset are saved as CSV files or other human-readable file formats. Large files are saved in Hierarchical Data Format (HDF5/H5) to allow for smaller file sizes and higher compression. All data is described thoroughly in 00_ReadMe.txt. The following processed data is included in the dataset:
- Concatenated annotations file of the experimental flow for all participants (CSV).
- All eye tracking raw data in concatenated files, annotated only with participant ID (CSV/HDF5).
- Annotated eye tracking data for ideation routines only, a subset of the files above (CSV/HDF5).
- Audio transcriptions from the Google Cloud Speech-to-Text API for each recording, with annotations (CSV).
- Raw API response for each transcription, including the time offset of each word in a recording (JSON).
- Data for questionnaire feedback and ideas generated during the experiment (CSV).
- Data for the post-experiment survey, including demographic information (TSV).
Python code used for the open-source experimental setup and dataset construction is hosted on GitHub. The repository also includes the code with which the dataset has been further processed.
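As a hedged sketch of working with these files, the snippet below loads a concatenated annotations CSV and an HDF5 gaze table with pandas and joins them per participant; the file names, HDF5 key, and participant_id column are illustrative, and the real layout is documented in 00_ReadMe.txt.

```python
import pandas as pd

# Illustrative file names, key, and join column -- the actual layout is
# documented in 00_ReadMe.txt shipped with the dataset.
annotations = pd.read_csv("annotations_all_participants.csv")

# Assumes the HDF5 table was written in a pandas-compatible layout under
# the key "gaze"; otherwise h5py can be used to inspect the file.
gaze = pd.read_hdf("gaze_data.h5", key="gaze")

# Attach the experimental-flow annotations to each gaze sample
merged = gaze.merge(annotations, on="participant_id", how="left")
print(merged.head())
```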
TwitterThe "Handian" corpus ( 汉典 or Hàn diăn, i.e, the "Han canon" or "Han classics") contains over 18,000 classics of ancient Chinese philosophy, as well as documents of historical and biographical significance, and literary works. The versions of the documents presented here are derived from www.zdic.net under their permissive Creative Commons 1.0 Public Domain Dedication. These significant cultural texts are modeled here and published in the Journal of Cultural Analytics. The Dataverse repository contains the models for the InPhO Topic Explorer (handian-ca.tez), installation instructions (README.md), and supplemental materials (handian-ca-supplemental.pdf). These works are released under the Creative Commons Attribution-Share Alike 4.0 International License (CC BY-SA 4.0).
License: CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
This data set contains the key output files for the results shown in the related publication. Specifically, the dataset should allow reproduction of the REEF3D::CFD simulations. Some deviations in the results may occur with different software versions.
Open source flower images available in the Python distribution. The raw images were converted to TFRecord format in an offline process.
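A hedged sketch of such an offline conversion, assuming ordinary JPEG files on disk and using TensorFlow's TFRecord writer; the paths, labels, and feature keys are illustrative rather than the dataset's actual schema.

```python
import tensorflow as tf

def _bytes_feature(value: bytes) -> tf.train.Feature:
    return tf.train.Feature(bytes_list=tf.train.BytesList(value=[value]))

def _int64_feature(value: int) -> tf.train.Feature:
    return tf.train.Feature(int64_list=tf.train.Int64List(value=[value]))

# Illustrative image paths and integer class labels
images = [("daisy/0001.jpg", 0), ("rose/0002.jpg", 1)]

with tf.io.TFRecordWriter("flowers.tfrecord") as writer:
    for path, label in images:
        raw = tf.io.read_file(path).numpy()  # encoded JPEG bytes
        example = tf.train.Example(features=tf.train.Features(feature={
            "image/encoded": _bytes_feature(raw),
            "image/class/label": _int64_feature(label),
        }))
        writer.write(example.SerializeToString())
```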
License: custom dataset license (https://dataverse.nl/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=doi:10.34894/VQC4OD)
This dataset contains the code and training data for replicating the main results of the article "Write access provisioning and organizational ownership in open source software projects: Exploring the impact on project novelty and survival".
This dataset integrates data from multiple publicly available sources to enhance the social and environmental analytical potential of the 2017 and 2020 HIFLD prison boundaries datasets. The HIFLD prison boundary feature class contains secure detention facilities, ranging in jurisdiction from federal (excluding military) to local governments. Polygon geometry is used to describe the extent of where the incarcerated population is located (fence lines or building footprints). The feature class's attribution describes many physical and social characteristics of detention facilities in the United States and some of its territories, and was populated by open source search of authoritative sources. We have manually coded the corresponding EPA Facility Registry Service (FRS) ID number for every facility for which we could find a reasonable match (source: https://www.epa.gov/frs/frs-facilities-state-single-file-csv-download). This FRS ID number enables finding corresponding environmental permits, inspections, violations, and enforcement actions. We have additionally created socially significant categories: ICE facilities and private prisons. Purpose: This feature class contains secure detention facilities with EPA FRS IDs and additional socially relevant variables for research on the environmental injustices of mass incarceration by Carceral Ecologies.
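A hedged sketch of the kind of join the coded FRS IDs enable, linking the boundary layer to the EPA FRS facility file with geopandas and pandas; the file names and the FRS_ID column name are assumptions, while REGISTRY_ID and PRIMARY_NAME are columns in the EPA FRS download.

```python
import geopandas as gpd
import pandas as pd

# Illustrative file names; the national FRS single-file CSV is available from
# https://www.epa.gov/frs/frs-facilities-state-single-file-csv-download
prisons = gpd.read_file("hifld_prison_boundaries.shp")
frs = pd.read_csv(
    "NATIONAL_SINGLE.CSV",
    usecols=["REGISTRY_ID", "PRIMARY_NAME", "LATITUDE83", "LONGITUDE83"],
    dtype={"REGISTRY_ID": str},
)

# FRS_ID is the manually coded column assumed here; cast to string so the
# join keys match the FRS REGISTRY_ID
prisons["FRS_ID"] = prisons["FRS_ID"].astype(str)
linked = prisons.merge(frs, left_on="FRS_ID", right_on="REGISTRY_ID", how="left")
print(linked[["FRS_ID", "PRIMARY_NAME"]].head())
```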
License: CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
A systematically retrieved dataset consisting of 33 open-source software projects containing a large number of typed artifacts and trace links between them. The artifacts stem from the projects' issue tracking systems and source version control systems, enabling their joint analysis. Enriched with additional metadata, such as time stamps, release versions, component information, and developer comments, the dataset is highly suitable for empirical research, e.g., in requirements and software traceability analysis, software evolution, bug and feature localization, and stakeholder collaboration. It can stimulate new research directions, facilitate the replication of existing studies, and act as a benchmark for the comparison of competing approaches.
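To illustrate one joint analysis such trace links support, here is a minimal sketch that recovers issue-to-commit links by scanning commit messages for issue keys; the Jira-style key pattern and the example messages are assumptions, not the dataset's actual extraction pipeline.

```python
import re

# Jira-style issue keys such as "ABC-123"
ISSUE_KEY = re.compile(r"\b[A-Z][A-Z0-9]+-\d+\b")

# Hypothetical (sha, message) pairs as they might come from a VCS log
commits = [
    ("3f9a2c1", "HHH-1234 fix null pointer in session cache"),
    ("b7d01e4", "refactor build scripts"),
]

# One trace link per issue key mentioned in a commit message
trace_links = [
    (sha, key)
    for sha, message in commits
    for key in ISSUE_KEY.findall(message)
]
print(trace_links)  # [('3f9a2c1', 'HHH-1234')]
```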
License: MIT License (https://opensource.org/licenses/MIT)
This open-source computational tool is designed for the simulation and analysis of planar linkage mechanisms. Aimed at students, educators, and engineers, the software offers a flexible and intuitive environment for modeling mechanical systems. It features a custom domain-specific language for defining mechanisms through variables, equations, and structural data, and combines symbolic preprocessing with numerical solvers for kinematic and dynamic analysis. The tool includes an interactive graphical user interface (GUI) for real-time configuration and visualization. Validated through representative test cases, it delivers accurate results for position, velocity, acceleration, and force analysis. Entirely free of proprietary dependencies, the application serves as an accessible alternative to commercial simulation tools, promoting educational equity and supporting learning through visualization and experimentation.
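Not the tool's own code, but a minimal sketch of the kind of kinematic problem it solves numerically: position analysis of a four-bar linkage from its vector loop-closure equations, with the link lengths chosen arbitrarily.

```python
import numpy as np
from scipy.optimize import fsolve

# Arbitrary link lengths: ground, crank, coupler, rocker [m]
r1, r2, r3, r4 = 0.30, 0.10, 0.25, 0.20
theta2 = np.radians(60.0)  # driven crank angle

def loop_closure(x):
    """Vector loop-closure equations of the four-bar linkage."""
    theta3, theta4 = x
    return [
        r2 * np.cos(theta2) + r3 * np.cos(theta3)
        - r4 * np.cos(theta4) - r1,
        r2 * np.sin(theta2) + r3 * np.sin(theta3)
        - r4 * np.sin(theta4),
    ]

# Solve for the coupler and rocker angles given the crank angle
theta3, theta4 = fsolve(loop_closure, x0=[0.5, 1.5])
print(np.degrees([theta3, theta4]))
```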
License: custom dataset license (https://dataverse.nl/api/datasets/:persistentId/versions/3.2/customlicense?persistentId=doi:10.34894/2WZ0S9)
This dataset collates data from disparate sources, including vector data, raster data, and published reports and maps, to produce a global database of delta protection measures. The dataset includes three layers and a table:
- Polygon layer containing leveed areas (Leveed Areas), imported from vector data or drawn from suitably georeferenced raster/published data.
- Polygon layer containing the boundary area for the research focus (Delta Polygons), as created by Edmonds et al. (2020, doi:10.1038/s41467-020-18531-4).
- Line layer containing levee, defence, or similar features (Levee Lines), imported from vector data or drawn from suitably georeferenced raster/published data.
- Table recording the methodology and decision-making process (Delta Index) for each delta polygon, as well as reasons for excluding data, country code, and processing/review fields.
Metadata for the dataset as a whole and for the individual layers is additionally published and conforms to the INSPIRE standard. The dataset is structured so that each metadata file is within the respective file when downloaded as a zipped archive. In line with an agreed change, the dataset is now attributed to the student only; the paper, which contains further work using the data, can be found at doi:10.5194/nhess-2021-291.
License: Attribution 4.0 International (CC BY 4.0) (https://creativecommons.org/licenses/by/4.0/)
This dataset includes both (1) the community smells detected in seven open source software systems (DSpace, Dataverse, EPrints 3.4, Archivematica, Islandora, PKP OJS, Samvera) used by academic libraries for scholarly communication services and (2) the metrics generated by csDetector (source code available at: https://github.com/Nuri22/csDetector). The data is based on GitHub repository activity over a three-month period: May to July 2023.
The data and programs replicate tables and figures from "Externalities in Knowledge Production: Evidence from a Randomized Field Experiment", by Hinnosaar, Hinnosaar, Kummer, and Slivko. Data were constructed from various sources. Please see the Readme file for additional details.
License: CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
This archive contains replication materials for "Underproduction Analysis of Open Source Software". It contains the extension materials to be used in conjunction with two other dataverses: Champion, Kaylea; Hill, Benjamin Mako, 2021, "Replication data and online supplement for: Underproduction: An Approach for Measuring Risk in Open Source Software", https://doi.org/10.7910/DVN/PUCD2P, Harvard Dataverse, V2, UNF:6:A8MV1fxlZnJtlKI3DnGaRg== [fileUNF]; and Kaylea Champion, 2024, "Replication Data for: Sources of Underproduction in Open Source Software", https://doi.org/10.7910/DVN/N2HIRS, Harvard Dataverse, V1. You will need the contents of all three archives to fully replicate the materials in "Underproduction Analysis of Open Source Software". See README.txt for full details and instructions.
License: CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
This data set contains files of binary data describing the output of a particle-in-cell simulation of magnetotail reconnection in the case of streaming oxygen ions of ionospheric origin. It contains the electric and magnetic fields and average particle data in the simulation domain. The documentation of the variables and arrays is given below. The .dat files were generated with Fortran 90 and are named fields-*.dat, where * has the usual wildcard meaning in a Linux environment. The name "fields" refers to the electromagnetic fields, but the files also contain particle information. The number following "fields", e.g. 00200, refers to the time in units of the inverse of the electron plasma frequency. The variables are structured identically in each file; only the time of evaluation differs.
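A hedged sketch of reading such files from Python, assuming the Fortran code wrote unformatted sequential records; scipy's FortranFile handles the record markers, but the record order, precision, and grid shape below are assumptions that should be checked against the variable documentation.

```python
import numpy as np
from scipy.io import FortranFile

# Hypothetical grid dimensions -- check the variable documentation shipped
# with the dataset for the actual array shapes and record layout.
nx, nz = 512, 256

f = FortranFile("fields-00200.dat", "r")
# Each read_reals call consumes one unformatted Fortran record; Fortran
# arrays are column-major, hence order="F" when reshaping.
ex = f.read_reals(dtype=np.float64).reshape((nx, nz), order="F")
bx = f.read_reals(dtype=np.float64).reshape((nx, nz), order="F")
f.close()

print(ex.mean(), bx.mean())
```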
MDSplus is a software tool designed for data acquisition, storage, and analysis of complex scientific experiments. Over the years, MDSplus has primarily been used for data management for fusion experiments. This paper demonstrates that MDSplus can be used for a much wider variety of systems and experiments. We present a step-by-step tutorial describing how to create a simple experiment, manage the data, and analyze it using MDSplus and Python. To this end, a custom example device was developed to be used as the data source. This device was built on an open-source electronic hardware platform, and it consists of a microcontroller and two sensors. We read data from these sensors, store it in MDSplus, and use JupyterLab to visualize and process it. This project and code demo are available on GitHub at https://github.com/santorofer/MDSplusAndCustomeDevices
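In the spirit of that tutorial, here is a hedged sketch of storing and reading back a sensor signal with the MDSplus Python bindings; the tree name, shot number, node name, and data values are placeholders, and MDSplus must be able to locate the tree (e.g., via an example_path environment variable).

```python
from MDSplus import Tree

SHOT = 1

# Create a new tree with a single numeric node and save the structure;
# MDSplus resolves "example" through an environment variable such as
# example_path pointing at the directory that holds the tree files.
tree = Tree("example", SHOT, "NEW")
tree.addNode("TEMP", "NUMERIC")
tree.write()

# Reopen the tree normally and store hypothetical sensor readings
tree = Tree("example", SHOT)
tree.getNode("TEMP").putData([21.5, 21.7, 22.0])

# Read the data back, e.g. from a JupyterLab notebook
print(tree.getNode("TEMP").data())
```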
Video advertisements, whether on television or the Internet, play an essential role in modern political campaigns. For over two decades, researchers have studied television video ads by analyzing the hand-coded data from the Wisconsin Advertising Project and its successor, the Wesleyan Media Project (WMP). Unfortunately, manually coding more than a hundred variables, such as issue mentions, opponent appearance, and negativity, for many videos is a laborious and expensive process. We propose to automatically code campaign advertisement videos. Applying state-of-the-art machine learning methods, we extract various audio and image features from each video file. We show that our machine coding is comparable to human coding for many variables of the WMP data sets. Since many candidates make their advertisement videos available on the Internet, automated coding can dramatically improve the efficiency and scope of campaign advertisement research. An open-source software package is available for implementing the proposed methodology.
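Not the authors' package, but a minimal sketch of one preprocessing step such a pipeline implies: sampling roughly one image frame per second from an advertisement video with OpenCV for downstream feature extraction; the file name is a placeholder.

```python
import cv2

cap = cv2.VideoCapture("campaign_ad.mp4")  # hypothetical video file
fps = cap.get(cv2.CAP_PROP_FPS)
step = max(int(fps), 1)  # guard against missing FPS metadata

frames = []
index = 0
while True:
    ok, frame = cap.read()
    if not ok:
        break
    if index % step == 0:  # keep roughly one frame per second
        frames.append(frame)
    index += 1
cap.release()
print(f"sampled {len(frames)} frames")
```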