This dataset contains the metadata of the datasets published in 77 Dataverse installations, information about each installation's metadata blocks, and the list of standard licenses that dataset depositors can apply to the datasets they publish in the 36 installations running more recent versions of the Dataverse software. The data is useful for reporting on the quality of dataset- and file-level metadata within and across Dataverse installations. Curators and other researchers can use this dataset to explore how well the Dataverse software and the repositories using it help depositors describe data.

How the metadata was downloaded

The dataset metadata and metadata block JSON files were downloaded from each installation on October 2 and October 3, 2022 using a Python script kept in a GitHub repo at https://github.com/jggautier/dataverse-scripts/blob/main/other_scripts/get_dataset_metadata_of_all_installations.py. To get the metadata from installations that require an installation account API token to use certain Dataverse software APIs, I created a CSV file with two columns: one named "hostname", listing the URL of each installation where I was able to create an account, and another named "apikey", listing my accounts' API tokens. The Python script reads the API tokens from this CSV file to get metadata and other information from installations that require them.
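The actual logic lives in the linked get_dataset_metadata_of_all_installations.py script; the following is only a minimal sketch of the approach it describes, assuming the two-column hostname/apikey CSV layout above and using the standard Dataverse Search and export APIs. The CSV file name, the single-page shortcut, and the output naming are illustrative, not taken from the script.

import csv
import requests

# Read the two-column CSV described above. The file name is illustrative;
# the "hostname" and "apikey" column names match the description.
with open("installations.csv", newline="") as f:
    installations = list(csv.DictReader(f))

for inst in installations:
    base = inst["hostname"].rstrip("/")
    # Installations that require an API token get it via this header.
    headers = {"X-Dataverse-key": inst["apikey"]} if inst.get("apikey") else {}

    # The Search API lists published datasets. Only the first page is
    # fetched here; a real script would page through all results.
    search = requests.get(
        f"{base}/api/search",
        params={"q": "*", "type": "dataset", "per_page": 100},
        headers=headers,
    ).json()

    for item in search["data"]["items"]:
        pid = item["global_id"]
        # Export each dataset's metadata in the "Dataverse JSON" schema.
        export = requests.get(
            f"{base}/api/datasets/export",
            params={"exporter": "dataverse_json", "persistentId": pid},
        )
        safe_name = pid.replace(":", "_").replace("/", "_")
        with open(f"{safe_name}.json", "wb") as out:
            out.write(export.content)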
How the files are organized

├── csv_files_with_metadata_from_most_known_dataverse_installations
│   ├── author(citation).csv
│   ├── basic.csv
│   ├── contributor(citation).csv
│   ├── ...
│   └── topic_classification(citation).csv
├── dataverse_json_metadata_from_each_known_dataverse_installation
│   ├── Abacus_2022.10.02_17.11.19.zip
│   │   ├── dataset_pids_Abacus_2022.10.02_17.11.19.csv
│   │   ├── Dataverse_JSON_metadata_2022.10.02_17.11.19
│   │   │   ├── hdl_11272.1_AB2_0AQZNT_v1.0.json
│   │   │   └── ...
│   │   └── metadatablocks_v5.6
│   │       ├── astrophysics_v5.6.json
│   │       ├── biomedical_v5.6.json
│   │       ├── citation_v5.6.json
│   │       ├── ...
│   │       └── socialscience_v5.6.json
│   ├── ACSS_Dataverse_2022.10.02_17.26.19.zip
│   ├── ADA_Dataverse_2022.10.02_17.26.57.zip
│   ├── Arca_Dados_2022.10.02_17.44.35.zip
│   ├── ...
│   └── World_Agroforestry_-_Research_Data_Repository_2022.10.02_22.59.36.zip
├── dataset_pids_from_most_known_dataverse_installations.csv
├── licenses_used_by_dataverse_installations.csv
└── metadatablocks_from_most_known_dataverse_installations.csv

This dataset contains two directories and three CSV files not in a directory.

One directory, "csv_files_with_metadata_from_most_known_dataverse_installations", contains 18 CSV files with the values from common metadata fields of all 77 Dataverse installations. For example, author(citation)_2022.10.02-2022.10.03.csv contains the "Author" metadata for all published, non-deaccessioned versions of all datasets in the 77 installations, with a row for each author name, affiliation, identifier type, and identifier.

The other directory, "dataverse_json_metadata_from_each_known_dataverse_installation", contains 77 zipped files, one for each of the 77 Dataverse installations whose dataset metadata I was able to download using Dataverse APIs. Each zip file contains a CSV file and two sub-directories:
- The CSV file contains the persistent IDs and URLs of each published dataset in the Dataverse installation, along with a column indicating whether or not the Python script was able to download the Dataverse JSON metadata for each dataset. For installations running Dataverse software versions whose Search APIs report each dataset's owning Dataverse collection name and alias, the CSV file also records which Dataverse collection (within the installation) each dataset was published in.
- One sub-directory contains a JSON file for each of the installation's published, non-deaccessioned dataset versions. The JSON files contain the metadata in the "Dataverse JSON" metadata schema.
- The other sub-directory contains information about the metadata models (the "metadata blocks" in JSON files) that the installation was using when the dataset metadata was downloaded. I saved them so that they can be used when extracting metadata from the Dataverse JSON files (a sketch of this extraction follows this entry).

The dataset_pids_from_most_known_dataverse_installations.csv file contains the dataset PIDs of all published datasets in the 77 Dataverse installations, with a column indicating whether the Python script was able to download each dataset's metadata. It is a union of all of the "dataset_pids_..." files in the 77 zip files.

The licenses_used_by_dataverse_installations.csv file contains information about the licenses that a number of the installations let depositors choose when creating datasets. When I collected ...

Visit https://dataone.org/datasets/sha256%3Ad27d528dae8cf01e3ea915f450426c38fd6320e8c11d3e901c43580f997a3146 for complete metadata about this dataset.
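Since each Dataverse JSON file nests field values under the metadata blocks described above, extracting per-author rows (as in author(citation).csv) means walking the citation block's compound "author" field. A minimal sketch, assuming the standard Dataverse JSON export layout and reusing one of the file names from the tree above:

import json

# Load one Dataverse JSON export (file name taken from the tree above).
with open("hdl_11272.1_AB2_0AQZNT_v1.0.json") as f:
    metadata = json.load(f)

# Field values live under the metadata blocks of the dataset version.
fields = metadata["datasetVersion"]["metadataBlocks"]["citation"]["fields"]

# "author" is a compound field: its value is a list of subfield dicts,
# one per author, keyed by typeNames like "authorName".
for field in fields:
    if field["typeName"] != "author":
        continue
    for author in field["value"]:
        print(
            author.get("authorName", {}).get("value", ""),
            author.get("authorAffiliation", {}).get("value", ""),
            author.get("authorIdentifierScheme", {}).get("value", ""),
            author.get("authorIdentifier", {}).get("value", ""),
            sep=" | ",
        )

The same walk generalizes to the other compound citation-block fields represented by the CSV files in the first directory.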
CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
License information was derived automatically
This dataset contains raw data and processed data from the Dataverse Community Survey 2022. The main goal of the survey was to help the Global Dataverse Community Consortium (GDCC; https://dataversecommunity.global/) and the Dataverse Project (https://dataverse.org/) decide what actions to take to improve the Dataverse software and the larger ecosystem of integrated tools and services, and to better support community members. The results of the survey may also be of interest to other communities working on software and services for managing research data. The survey was designed to map out the current status as well as the roadmaps and priorities of Dataverse installations around the world. The main target group for the survey was the people and teams responsible for operating Dataverse installations around the world; a secondary target group was people and teams at organizations that are planning to deploy or considering deploying a Dataverse installation. Thirty-four existing and planned Dataverse installations participated in the survey.
CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
License information was derived automatically
Comparative review of open access data repositories, collected to inform product development for the Dataverse Project at the Harvard Institute for Quantitative Social Science. More information about the scope, purpose, and development of this review is at https://dataverse.org/blog/comparative-review-various-data-repositories.
CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
License information was derived automatically
A dataset of small image files (screenshots of SPA code, tools, and the development environment) used to demonstrate the files table.
CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
License information was derived automatically
test dataset
CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
License information was derived automatically
Exercising fields used by the schema.org exporter.
Project portal for publishing, citing, sharing, and discovering research data. Software, protocols, and community connections for creating research data repositories that automate professional archival practices, guarantee long-term preservation, and enable researchers to share, retain control of, and receive web visibility and formal academic citations for their data contributions. Researchers, data authors, publishers, data distributors, and affiliated institutions all receive appropriate credit. Hosts multiple dataverses. Each dataverse contains studies or collections of studies, and each study contains cataloging information that describes the data plus the actual data files and complementary files. Data related to social sciences, health, medicine, humanities, or other sciences with an emphasis on human behavior are uploaded to the IQSS Dataverse Network (Harvard). You can create your own dataverse for free and start adding studies for your data files and complementary material (documents, software, etc.). You may also install your own Dataverse Network for your university or organization.
CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
License information was derived automatically
TXST Dataverse Quick Start Guide - Example RDM Dataverse
Experimental data. Visit https://dataone.org/datasets/sha256%3A18735774f162e6915a7d05c2276ae4ddf535e237e1559bebab64d219355e9ca8 for complete metadata about this dataset.
CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
License information was derived automatically
These four datasets are used in conjunction with the senior seminar course, "Storytelling with Data: The Beyonce Edition." The compilation includes information on the singer's music videos, live performances, award nominations and wins, and the chart performance of her songs.
CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
License information was derived automatically
This repo contains data produced from the manuscript entitled "Discovering non-additive heritability using additive GWAS summary statistics". Here, we provide the additive and cis-interaction LD scores used for the real data analyses of 25 well-studied quantitative phenotypes from 349,468 individuals of self-identified European ancestry in the UK Biobank and up to 159,095 individuals in BioBank Japan. Note that for the UK Biobank analysis, LD scores were computed using a reference panel of 489 individuals from the European superpopulation (EUR) of the 1000 Genomes Project. For the BioBank Japan analysis, we downloaded publicly available GWAS summary statistics for the 25 traits from http://jenger.riken.jp/en/result; the initial GWAS study adjusted for age, sex, and the first ten principal components. We then used individuals from the East Asian (EAS) superpopulation of the 1000 Genomes Project Phase 3 as the reference panel to calculate paired LD scores.
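In the standard univariate form, the LD score referenced above is the sum of squared correlations between a SNP and its neighbors in a reference panel. The following is a generic, minimal sketch of that computation, not the manuscript's pipeline; the window size, the random toy panel, and the omission of the usual small-sample bias correction are all simplifying assumptions.

import numpy as np

def ld_scores(genotypes: np.ndarray, window: int = 100) -> np.ndarray:
    """Univariate LD scores from a reference-panel genotype matrix.

    genotypes: (n_individuals, n_snps) allele counts in {0, 1, 2}.
    The LD score of SNP j is the sum of squared correlations r^2
    between SNP j and every SNP k within `window` positions of it.
    """
    # Standardize each SNP column so correlations are simple dot products.
    g = (genotypes - genotypes.mean(axis=0)) / genotypes.std(axis=0)
    n, m = g.shape
    scores = np.empty(m)
    for j in range(m):
        lo, hi = max(0, j - window), min(m, j + window + 1)
        r = g[:, lo:hi].T @ g[:, j] / n  # correlations with SNP j
        scores[j] = np.sum(r ** 2)
    return scores

# Toy example: 489 individuals (the UK Biobank reference-panel size
# mentioned above); the SNP count here is arbitrary.
panel = np.random.default_rng(0).integers(0, 3, size=(489, 1000)).astype(float)
print(ld_scores(panel)[:5])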
CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
License information was derived automatically
Total File Search Selection
Piper, Andrew, 2016, "Fictionality", doi:10.7910/DVN/5WKTZV, Harvard Dataverse, V1. Contains LIWC feature tables for all ~27,000 documents used in this study, R and Python code used to generate statistical results, and all supporting tables. The original CA article was published at http://culturalanalytics.org/2016/12/fictionality/. The Community Norms [http://best-practices.dataverse.org/harvard-policies/community-norms.html], as well as good scientific practice, expect that proper credit is given via citation.
This dataset contains data, documentation, and code files associated with studies performed on snapshots of the contents of Harvard Dataverse taken on 28 and 29 October 2019.
Harvard Dataverse => Digital Library - Projects & Theses - Prof. Dr. Scholz

Introduction and background information to "Digital Library - Projects & Theses - Prof. Dr. Scholz".
The URL of the dataverse: http://dataverse.harvard.edu/dataverse/LibraryProfScholz
The URL of this (introduction) dataset: http://doi.org/10.7910/DVN/R33RS9
YOU MAY HAVE BEEN DIRECTED HERE BECAUSE THE CALLING PAGE HAS NO OTHER ENTRY POINT (with DOI) INTO THIS DATAVERSE. Click on the title of this page to reach the start page of the dataverse!

Introduction to the Data in this Dataverse
This dataverse is about:
- Aircraft Design
- Flight Mechanics
- Aircraft Systems
This dataverse contains research data and software produced by students for their projects and theses on the above topics. Get linked to all other resources from their reports using the URN from the German National Library (DNB) as given in each dataset under "Metadata": https://nbn-resolving.org/html/urn:nbn:de:gbv:18302-aeroJJJJ-MM-DD.01x
Alternative sites that store the data given in this dataverse are http://library.ProfScholz.de and https://archive.org/details/@profscholz. Open an "item". Under "DOWNLOAD OPTIONS", select the file (as far as available) called "ZIP" to download DataXxxx.zip. Alternatively, go to "SHOW ALL"; in the new window, click "View Contents" next to DataXxxx.zip, or select the URL next to "Data-list" to download a single file from DataXxxx.zip.

Data Publishing
Data publishing means publishing research data for (re)use by others. It consists of preparing single files or a dataset containing several files for access on the WWW. This practice is part of the open science movement. There is consensus about the benefits resulting from Open Data, especially in connection with Open Access publishing. It is important to link the publication (e.g. a thesis) with the underlying data and vice versa.
General (not disciplinary) and free data repositories are:
- Harvard Dataverse (this one!)
- figshare (emphasis: multimedia)
- Zenodo (emphasis: results from EU research, mainly text)
- Mendeley Data (emphasis: data associated with journal articles)
To find data repositories, use http://re3data.org. Read more at https://en.wikipedia.org/wiki/Data_publishing
Etalab Open License 2.0 (https://spdx.org/licenses/etalab-2.0.html)
The Sicpa_OpenData libraries facilitate publishing data to the INRAE dataverse in a transparent way: 1/ by simplifying the creation of the metadata document from data already present in the information systems, and 2/ by simplifying the use of the dataverse.org APIs. Available as a DLL, the SicpaOpenData for .NET library can be used from any development based on the Microsoft .NET platform.
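SicpaOpenData itself is a .NET DLL; as an illustration of the kind of raw dataverse.org API call that such wrapper libraries hide, here is a minimal Python sketch creating a dataset through the standard Dataverse native API. The host URL, collection alias, API token, and metadata values are placeholders, and this is not SicpaOpenData's interface.

import json
import requests

HOST = "https://data.example.org"   # hypothetical installation URL
COLLECTION_ALIAS = "my-collection"  # hypothetical collection alias
API_TOKEN = "xxxxxxxx-xxxx-xxxx"    # placeholder token

# A skeletal "Dataverse JSON" metadata document; wrapper libraries
# build this from data already in the information systems.
dataset_json = {
    "datasetVersion": {
        "metadataBlocks": {
            "citation": {
                "displayName": "Citation Metadata",
                "fields": [
                    {"typeName": "title", "typeClass": "primitive",
                     "multiple": False, "value": "Example dataset"},
                    # ... author, datasetContact, dsDescription, subject ...
                ],
            }
        }
    }
}

# Create the dataset in the target collection via the native API.
resp = requests.post(
    f"{HOST}/api/dataverses/{COLLECTION_ALIAS}/datasets",
    headers={"X-Dataverse-key": API_TOKEN, "Content-Type": "application/json"},
    data=json.dumps(dataset_json),
)
print(resp.status_code, resp.json())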
CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
License information was derived automatically
The Cora data contains bibliographic records of machine learning papers that have been manually clustered into groups that refer to the same publication. Originally, Cora was prepared by Andrew McCallum, and his versions of this data set are available on his Data web page. The data is also hosted here. Note that various versions of the Cora data set have been used by many publications in record linkage and entity resolution over the years.
Harvard College course enrollment statistics for the most recent semester, including course, department, class number, and number of students (categorized by affiliation).
CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
License information was derived automatically
Documentation file for the do-files and datasets corresponding to the paper "Public Health Policy at Scale: Impact of a Government-sponsored Information Campaign on Infant Mortality in Denmark" by Onur Altindag, Jane Greve, and Erdal Tekin. This document describes the datasets and the Stata and R programs that replicate the results of the paper in the Review of Economics and Statistics (version accepted in February 2021).
CC0 1.0 Universal Public Domain Dedication (https://creativecommons.org/publicdomain/zero/1.0/)
License information was derived automatically
Replication Data for: "Social Ties in Academia: A Friend is a Treasure"