100+ datasets found
  1. d

    Dataset metadata of known Dataverse installations

    • search.dataone.org
    • dataverse.harvard.edu
    • +1more
    Updated Nov 22, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gautier, Julian (2023). Dataset metadata of known Dataverse installations [Dataset]. http://doi.org/10.7910/DVN/DCDKZQ
    Explore at:
    Dataset updated
    Nov 22, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Gautier, Julian
    Description

    This dataset contains the metadata of the datasets published in 77 Dataverse installations, information about each installation's metadata blocks, and the list of standard licenses that dataset depositors can apply to the datasets they publish in the 36 installations running more recent versions of the Dataverse software. The data is useful for reporting on the quality of dataset and file-level metadata within and across Dataverse installations. Curators and other researchers can use this dataset to explore how well Dataverse software and the repositories using the software help depositors describe data. How the metadata was downloaded The dataset metadata and metadata block JSON files were downloaded from each installation on October 2 and October 3, 2022 using a Python script kept in a GitHub repo at https://github.com/jggautier/dataverse-scripts/blob/main/other_scripts/get_dataset_metadata_of_all_installations.py. In order to get the metadata from installations that require an installation account API token to use certain Dataverse software APIs, I created a CSV file with two columns: one column named "hostname" listing each installation URL in which I was able to create an account and another named "apikey" listing my accounts' API tokens. The Python script expects and uses the API tokens in this CSV file to get metadata and other information from installations that require API tokens. How the files are organized ├── csv_files_with_metadata_from_most_known_dataverse_installations │ ├── author(citation).csv │ ├── basic.csv │ ├── contributor(citation).csv │ ├── ... │ └── topic_classification(citation).csv ├── dataverse_json_metadata_from_each_known_dataverse_installation │ ├── Abacus_2022.10.02_17.11.19.zip │ ├── dataset_pids_Abacus_2022.10.02_17.11.19.csv │ ├── Dataverse_JSON_metadata_2022.10.02_17.11.19 │ ├── hdl_11272.1_AB2_0AQZNT_v1.0.json │ ├── ... │ ├── metadatablocks_v5.6 │ ├── astrophysics_v5.6.json │ ├── biomedical_v5.6.json │ ├── citation_v5.6.json │ ├── ... │ ├── socialscience_v5.6.json │ ├── ACSS_Dataverse_2022.10.02_17.26.19.zip │ ├── ADA_Dataverse_2022.10.02_17.26.57.zip │ ├── Arca_Dados_2022.10.02_17.44.35.zip │ ├── ... │ └── World_Agroforestry_-_Research_Data_Repository_2022.10.02_22.59.36.zip └── dataset_pids_from_most_known_dataverse_installations.csv └── licenses_used_by_dataverse_installations.csv └── metadatablocks_from_most_known_dataverse_installations.csv This dataset contains two directories and three CSV files not in a directory. One directory, "csv_files_with_metadata_from_most_known_dataverse_installations", contains 18 CSV files that contain the values from common metadata fields of all 77 Dataverse installations. For example, author(citation)_2022.10.02-2022.10.03.csv contains the "Author" metadata for all published, non-deaccessioned, versions of all datasets in the 77 installations, where there's a row for each author name, affiliation, identifier type and identifier. The other directory, "dataverse_json_metadata_from_each_known_dataverse_installation", contains 77 zipped files, one for each of the 77 Dataverse installations whose dataset metadata I was able to download using Dataverse APIs. Each zip file contains a CSV file and two sub-directories: The CSV file contains the persistent IDs and URLs of each published dataset in the Dataverse installation as well as a column to indicate whether or not the Python script was able to download the Dataverse JSON metadata for each dataset. For Dataverse installations using Dataverse software versions whose Search APIs include each dataset's owning Dataverse collection name and alias, the CSV files also include which Dataverse collection (within the installation) that dataset was published in. One sub-directory contains a JSON file for each of the installation's published, non-deaccessioned dataset versions. The JSON files contain the metadata in the "Dataverse JSON" metadata schema. The other sub-directory contains information about the metadata models (the "metadata blocks" in JSON files) that the installation was using when the dataset metadata was downloaded. I saved them so that they can be used when extracting metadata from the Dataverse JSON files. The dataset_pids_from_most_known_dataverse_installations.csv file contains the dataset PIDs of all published datasets in the 77 Dataverse installations, with a column to indicate if the Python script was able to download the dataset's metadata. It's a union of all of the "dataset_pids_..." files in each of the 77 zip files. The licenses_used_by_dataverse_installations.csv file contains information about the licenses that a number of the installations let depositors choose when creating datasets. When I collected ... Visit https://dataone.org/datasets/sha256%3Ad27d528dae8cf01e3ea915f450426c38fd6320e8c11d3e901c43580f997a3146 for complete metadata about this dataset.

  2. D

    Dataverse Community Survey 2022 – Data

    • dataverse.azure.uit.no
    • dataverse.no
    • +1more
    docx, pdf, png +3
    Updated Sep 28, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Philipp Conzett; Philipp Conzett (2023). Dataverse Community Survey 2022 – Data [Dataset]. http://doi.org/10.18710/UOC8CP
    Explore at:
    xlsx(236994), png(181083), png(15331), png(48469), png(41638), png(50982), png(60983), text/tsv(249), xlsx(241097), png(42044), text/tsv(162713), text/tsv(311), text/tsv(277), png(21131), text/tsv(297), png(57822), xlsx(237073), png(60756), png(47413), png(105999), png(32773), png(31527), png(14307), png(54715), text/tsv(179), xlsx(237019), png(31829), png(81808), xlsx(237127), png(82266), xlsx(236784), xlsx(237010), png(27105), text/tsv(1408), xlsx(237079), xlsx(285808), png(48538), png(24805), png(58069), png(76504), xlsx(237498), png(51108), xlsx(237345), xlsx(236977), xlsx(236766), text/tsv(250), text/tsv(1157), xlsx(238353), text/tsv(174), png(47343), text/tsv(491), png(27661), png(29299), pdf(240883), text/tsv(493), text/tsv(324), text/tsv(308), png(23763), png(268559), xlsx(237162), text/tsv(490), xlsx(239544), text/tsv(198), png(11423), xlsx(236750), text/tsv(88), text/tsv(271), png(87888), pdf(896782), png(58386), text/tsv(609), png(127124), png(72614), xlsx(237157), text/tsv(402), png(76945), text/tsv(242), text/tsv(3985), xlsx(236817), png(25842), text/tsv(441), png(98233), text/tsv(351), png(72478), xlsx(237457), png(38778), png(76035), xlsx(236969), xlsx(236974), png(141447), png(22419), text/tsv(317), png(9443), xlsx(237380), xlsx(238959), xlsx(237046), text/tsv(330), xlsx(237190), xlsx(237574), xlsx(238387), png(184889), png(37087), png(56694), png(17826), xlsx(237476), text/tsv(359), png(84243), text/tsv(66), text/tsv(781), png(18162), png(46063), png(32522), text/tsv(230), text/tsv(252), png(45840), png(99230), xlsx(239731), xlsx(237302), xlsx(237744), xlsx(237118), png(30077), xlsx(237234), text/tsv(97), txt(70354), text/tsv(119), xlsx(236928), png(58837), text/tsv(3868), docx(121213), text/tsv(256), xlsx(238146), xlsx(240275), text/tsv(216), xlsx(236999), text/tsv(310), png(13788), text/tsv(189), xlsx(237276), text/tsv(1526), png(10617), png(45285), xlsx(236945), png(41932), png(54493), xlsx(236982), png(39604), png(32169), xlsx(237497), text/tsv(84), text/tsv(56), png(68203), png(144980), xlsx(237045), png(35252), text/tsv(1315), text/tsv(166), xlsx(237036)Available download formats
    Dataset updated
    Sep 28, 2023
    Dataset provided by
    DataverseNO
    Authors
    Philipp Conzett; Philipp Conzett
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Time period covered
    2022
    Area covered
    Colombia, Spain, Germany, Austria, France, Mexico, Italy, Hungary, Belgium, Norway
    Description

    This dataset contains raw data and processed data from the Dataverse Community Survey 2022. The main goal of the survey was to help the Global Dataverse Community Consortium (GDCC; https://dataversecommunity.global/) and the Dataverse Project (https://dataverse.org/) decide on what actions to take to improve the Dataverse software and the larger ecosystem of integrated tools and services as well as better support community members. The results from the survey may also be of interest to other communities working on software and services for managing research data. The survey was designed to map out the current status as well as the roadmaps and priorities of Dataverse installations around the world. The main target group for participating in the survey were the people/teams responsible for operating Dataverse installations around the world. A secondary target group were people/teams at organizations that are planning to deploy or considering deploying a Dataverse installation. There were 34 existing and planned Dataverse installations participating in the survey.

  3. H

    Comparative review of data repositories

    • dataverse.harvard.edu
    xlsx
    Updated Oct 27, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Harvard Dataverse (2020). Comparative review of data repositories [Dataset]. http://doi.org/10.7910/DVN/WS9OUR
    Explore at:
    xlsx(24716)Available download formats
    Dataset updated
    Oct 27, 2020
    Dataset provided by
    Harvard Dataverse
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Comparative review of open access data repositories collected to inform product development for the Dataverse Project at the Harvard Institute for Quantitative Social Science More information about the scope, purpose and development of this review is at https://dataverse.org/blog/comparative-review-various-data-repositories.

  4. R

    Dataset of Image files

    • beta.dataverse.org
    Updated Mar 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ellen Kraffmiller (2024). Dataset of Image files [Dataset]. https://beta.dataverse.org/dataset.xhtml;jsessionid=3c0d06bc5a58c1c59ad94a197a17?persistentId=doi%3A10.5072%2FFK2%2FDS4ADB&version=&q=&fileTypeGroupFacet=&fileAccess=&fileSortField=date&tagPresort=true
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 7, 2024
    Dataset provided by
    Root
    Authors
    Ellen Kraffmiller
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    A Dataset of small image files, screenshots of SPA code, tools and development environment, to show files table

  5. R

    test dataset

    • beta.dataverse.org
    • data.dev-wins.com
    • +6more
    Updated Mar 11, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataverse Admin (2024). test dataset [Dataset]. https://beta.dataverse.org/dataset.xhtml?persistentId=doi:10.5072/FK2/8WYJTS
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Mar 11, 2024
    Dataset provided by
    Root
    Authors
    Dataverse Admin
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    test dataset

  6. R

    Max Schema.org

    • beta.dataverse.org
    Updated May 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Philip Durbin; IQSS (2025). Max Schema.org [Dataset]. https://beta.dataverse.org/dataset.xhtml?persistentId=doi:10.5072/FK2/VQTYHD
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 21, 2025
    Dataset provided by
    Root
    Authors
    Philip Durbin; IQSS
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Time period covered
    Jan 1, 2023 - Dec 31, 2023
    Area covered
    United States, Cambridge, MA, Harvard Square
    Dataset funded by
    NIH
    NSF
    Description

    Exercising fields used by schema.org exporter.

  7. d

    Dataverse Network Project

    • dknet.org
    • scicrunch.org
    Updated Jun 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Dataverse Network Project [Dataset]. http://identifiers.org/RRID:SCR_001997
    Explore at:
    Dataset updated
    Jun 3, 2025
    Description

    Project portal for publishing, citing, sharing and discovering research data. Software, protocols, and community connections for creating research data repositories that automate professional archival practices, guarantee long term preservation, and enable researchers to share, retain control of, and receive web visibility and formal academic citations for their data contributions. Researchers, data authors, publishers, data distributors, and affiliated institutions all receive appropriate credit. Hosts multiple dataverses. Each dataverse contains studies or collections of studies, and each study contains cataloging information that describes the data plus the actual data files and complementary files. Data related to social sciences, health, medicine, humanities or other sciences with an emphasis in human behavior are uploaded to the IQSS Dataverse Network (Harvard). You can create your own dataverse for free and start adding studies for your data files and complementary material (documents, software, etc). You may install your own Dataverse Network for your University or organization.

  8. T

    TXST Dataverse Quick Start Guide

    • dataverse.tdl.org
    pdf, tsv
    Updated Sep 11, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Example Name; Example Name (2024). TXST Dataverse Quick Start Guide [Dataset]. http://doi.org/10.18738/T8/TCTUJN
    Explore at:
    pdf(686966), tsv(664)Available download formats
    Dataset updated
    Sep 11, 2024
    Dataset provided by
    Texas Data Repository
    Authors
    Example Name; Example Name
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    TXST Dataverse Quick Start Guide-Example RDM Dataverse

  9. d

    lina dataverse

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Dec 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    hu, lina (2023). lina dataverse [Dataset]. http://doi.org/10.7910/DVN/ID4DGD
    Explore at:
    Dataset updated
    Dec 16, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    hu, lina
    Description

    experimental data. Visit https://dataone.org/datasets/sha256%3A18735774f162e6915a7d05c2276ae4ddf535e237e1559bebab64d219355e9ca8 for complete metadata about this dataset.

  10. T

    Data from: Storytelling with Data - The Beyonce Edition

    • dataverse.tdl.org
    csv, pdf, txt
    Updated Nov 5, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kenton Rambsy; Kenton Rambsy (2020). Storytelling with Data - The Beyonce Edition [Dataset]. http://doi.org/10.18738/T8/XL8NIX
    Explore at:
    txt(3312), csv(62377), csv(19456), csv(53673), csv(8730), pdf(78421)Available download formats
    Dataset updated
    Nov 5, 2020
    Dataset provided by
    Texas Data Repository
    Authors
    Kenton Rambsy; Kenton Rambsy
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This four datasets are used in conjunction with the senior seminar course, “Storytelling with Data: The Beyonce Edition.” This compilation of information related to Beyonce’s live performances related to the singer’s music videos, live performance, award nominations and wins, and chart performances of her songs.

  11. H

    Replication Data for: Discovering non-additive heritability using additive...

    • dataverse.harvard.edu
    • search.dataone.org
    • +1more
    Updated Apr 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Samuel Smith; Gregory Darnell; Dana Udwin; Julian Stamp; Arbel Harpak; Sohini Ramachandran; Lorin Crawford (2024). Replication Data for: Discovering non-additive heritability using additive GWAS summary statistics [Dataset]. http://doi.org/10.7910/DVN/W6MA8J
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 23, 2024
    Dataset provided by
    Harvard Dataverse
    Authors
    Samuel Smith; Gregory Darnell; Dana Udwin; Julian Stamp; Arbel Harpak; Sohini Ramachandran; Lorin Crawford
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    This repo contains data produced from the manuscript entitled: "Discovering non-additive heritability using additive GWAS summary statistics". Here, we provide the additive and cis-interaction LD scores used for the real data analyses of 25 well-studied quantitative phenotypes from 349,468 individuals of self-identified European ancestry in the UK Biobank and up to 159,095 individuals in BioBank Japan. Note that for the UK Biobank analysis, LD scores were computed using a reference panel of 489 individuals from the European superpopulation (EUR) of the 1000 Genomes Project. For the analysis of BioBank Japan, In order to analyze data from BioBank Japan, we downloaded publicly available GWAS summary statistics for the 25 traits from http://jenger.riken.jp/en/result. Summary statistics used age, sex, and the first ten principal components as confounders in the initial GWAS study. We then used individuals from the East Asian (EAS) superpopulation from the 1000 Genomes Project Phase 3 to calculate paired LD scores from a reference panel.

  12. R

    Total File Search Selection

    • beta.dataverse.org
    Updated Sep 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    st fer (2024). Total File Search Selection [Dataset]. https://beta.dataverse.org/dataset.xhtml;jsessionid=0348aaa85d63c6448a07010d45c2?persistentId=doi%3A10.5072%2FFK2%2F0UKSME&version=&q=&fileTypeGroupFacet=&fileAccess=&fileSortField=type&tagPresort=true
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 26, 2024
    Dataset provided by
    Root
    Authors
    st fer
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Total File Search Selection

  13. s

    Fictionality

    • marketplace.sshopencloud.eu
    • dataverse.harvard.edu
    • +1more
    Updated Dec 19, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2016). Fictionality [Dataset]. http://doi.org/10.7910/DVN/5WKTZV
    Explore at:
    Dataset updated
    Dec 19, 2016
    Description

    Piper, Andrew, 2016, "Fictionality", doi:10.7910/DVN/5WKTZV, Harvard Dataverse, V1. Contains LIWC feature tables for all ~27,000 documents used in this study, R and Python code used to generate statistical results, and all supporting tables. Original CA article published at http://culturalanalytics.org/2016/12/fictionality/ See also Community Norms [http://best-practices.dataverse.org/harvard-policies/community-norms.html] as well as good scientific practices expect that proper credit is given via citation.

  14. d

    Harvard Dataverse Optional Feature Use Data

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Nov 23, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Boyd, Ceilyn (2023). Harvard Dataverse Optional Feature Use Data [Dataset]. http://doi.org/10.7910/DVN/9STGWE
    Explore at:
    Dataset updated
    Nov 23, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Boyd, Ceilyn
    Time period covered
    Oct 28, 2019 - Oct 29, 2019
    Description

    This dataset contains data, documentation, and code files associated with studies performed on snapshots of the contents of Harvard Dataverse taken on 28 and 29 October 2019.

  15. d

    Introduction and Background Information

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Nov 22, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Scholz, Dieter (2023). Introduction and Background Information [Dataset]. http://doi.org/10.7910/DVN/R33RS9
    Explore at:
    Dataset updated
    Nov 22, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Scholz, Dieter
    Description

    Harvard Dataverse => Digital Library - Projects & Theses - Prof. Dr. Scholz ----- Introduction and background information to "Digital Library - Projects & Theses - Prof. Dr. Scholz". The URL of the dataverse: http://dataverse.harvard.edu/dataverse/LibraryProfScholz The URL of this (introduction) dataset: http://doi.org/10.7910/DVN/R33RS9 YOU MAY HAVE BEEN DIRECTED HERE, BECAUSE THE CALLING PAGE HAS NO OTHER ENTRY POINT (with DOI) INTO THIS DATAVERSE. Click on the title of this page to reach the start page of the dataverse! Introduction to the Data in this Dataverse This dataverse is about: Aircraft Design Flight Mechanics Aircraft Systems This dataverse contains research data and software produced by students for their projects and theses on above topics. Get linked to all other resources from their reports using the URN from the German National Library (DNB) as given in each dataset under "Metadata": https://nbn-resolving.org/html/urn:nbn:de:gbv:18302-aeroJJJJ-MM-DD.01x Alternative sites that store the data given in this dataverse are: http://library.ProfScholz.de and https://archive.org/details/@profscholz Open an "item". Under "DOWNLOAD OPTIONS" select the file (as far as available) called "ZIP" to download DataXxxx.zip. Alternatively, go to "SHOW ALL"; In the new window select next to DataXxxx.zip click "View Contents" or select URL next to "Data-list". Download single file from DataXxxx.zip. Data Publishing Data publishing means publishing of research data for (re)use by others. It consists of preparing single files or a dataset containing several files for access in the WWW. This practice is part of the open science movement. There is consensus about the benefits resulting from Open Data - especially in connection with Open Access publishing. It is important to link the publication (e.g. thesis) with the underlying data and vice versa. General (not disciplinary) and free data repositories are: Harvard Dataverse (this one!) figshare (emphasis: multi media) Zenodo (emphasis: results from EU research, mainly text) Mendeley Data (emphasis: data associated with journal articles) To find data repositories use http://re3data.org Read more on https://en.wikipedia.org/wiki/Data_publishing

  16. R

    SicpaOpenData for .Net

    • entrepot.recherche.data.gouv.fr
    zip
    Updated May 25, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Thierry Heirman; Thierry Heirman (2023). SicpaOpenData for .Net [Dataset]. http://doi.org/10.57745/B2BOYY
    Explore at:
    zip(584975)Available download formats
    Dataset updated
    May 25, 2023
    Dataset provided by
    Recherche Data Gouv
    Authors
    Thierry Heirman; Thierry Heirman
    License

    https://spdx.org/licenses/etalab-2.0.htmlhttps://spdx.org/licenses/etalab-2.0.html

    Description

    The Sicpa_OpenData libraries allow to facilitate the publication of data to the INRAE dataverse in a transparent way 1/ by simplifying the creation of the metadata document from the data already present in the information systems, 2/ by simplifying the use of dataverse.org APIs. Available as a DLL, the SicpaOpenData for .Net library can be used from all developments using the Microsoft .NET platform

  17. U

    Cora Dataset

    • dataverse-staging.rdmc.unc.edu
    • openicpsr.org
    • +3more
    csv
    Updated Jul 21, 2017
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Andrew McCallum; Andrew McCallum (2017). Cora Dataset [Dataset]. http://doi.org/10.15139/S3/GZOSGN
    Explore at:
    csv(275824)Available download formats
    Dataset updated
    Jul 21, 2017
    Dataset provided by
    UNC Dataverse
    Authors
    Andrew McCallum; Andrew McCallum
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    The Cora data contains bibliographic records of machine learning papers that have been manually clustered into groups that refer to the same publication. Originally, Cora was prepared by Andrew McCallum, and his versions of this data set are available on his Data web page. The data is also hosted here. Note that various versions of the Cora data set have been used by many publications in record linkage and entity resolution over the years.

  18. d

    Course enrollment stats

    • search.dataone.org
    • dataverse.harvard.edu
    Updated Nov 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mehta, Neel (2023). Course enrollment stats [Dataset]. http://doi.org/10.7910/DVN/9MWTYO
    Explore at:
    Dataset updated
    Nov 21, 2023
    Dataset provided by
    Harvard Dataverse
    Authors
    Mehta, Neel
    Description

    Harvard College course enrollment statistics for the most recent semester including course, department, class number, and number of students (categorized by affiliation.)

  19. R

    Replication data for: [Public Health Policy At Scale: Impact of a...

    • dataverse.iza.org
    • dataverse.harvard.edu
    • +1more
    Updated Jul 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Erdal Tekin; Jane Greve; Altindag, Onar; Erdal Tekin; Jane Greve; Altindag, Onar (2024). Replication data for: [Public Health Policy At Scale: Impact of a Government-sponsored Information Campaign on Infant Mortality in Denmark] [Dataset]. http://doi.org/10.7910/DVN/ZOTA28
    Explore at:
    Dataset updated
    Jul 12, 2024
    Dataset provided by
    Research Data Center of IZA (IDSC)
    Authors
    Erdal Tekin; Jane Greve; Altindag, Onar; Erdal Tekin; Jane Greve; Altindag, Onar
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    Denmark
    Description

    Documentation file for do-files and datasets corresponding to paper titled: “Public Health Policy at Scale: Impact of a Government-sponsored Information Campaign on Infant Mortality in Denmark” Onur Altindag, Jane Greve, and Erdal Tekin This document describes the datasets, STATA and R programs that replicate the results for the paper “Public Health Policy at Scale: Impact of Government-sponsored Information Campaign on Infant Mortality in Denmark” by Onur Altindag, Jane Greve, and Erdal Tekin, Review of Economics and Statistics, the version that is accepted on February 2021.

  20. R

    Replication Data for: "Social Ties in Academia: A Friend is a Treasure"

    • dataverse.iza.org
    • dataverse.harvard.edu
    • +1more
    Updated Jun 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tommaso Colussi; Tommaso Colussi (2024). Replication Data for: "Social Ties in Academia: A Friend is a Treasure" [Dataset]. http://doi.org/10.7910/DVN/BSJHXZ
    Explore at:
    Dataset updated
    Jun 12, 2024
    Dataset provided by
    Research Data Center of IZA (IDSC)
    Authors
    Tommaso Colussi; Tommaso Colussi
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    Replication Data for: "Social Ties in Academia: A Friend is a Treasure"

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Gautier, Julian (2023). Dataset metadata of known Dataverse installations [Dataset]. http://doi.org/10.7910/DVN/DCDKZQ

Dataset metadata of known Dataverse installations

Explore at:
Dataset updated
Nov 22, 2023
Dataset provided by
Harvard Dataverse
Authors
Gautier, Julian
Description

This dataset contains the metadata of the datasets published in 77 Dataverse installations, information about each installation's metadata blocks, and the list of standard licenses that dataset depositors can apply to the datasets they publish in the 36 installations running more recent versions of the Dataverse software. The data is useful for reporting on the quality of dataset and file-level metadata within and across Dataverse installations. Curators and other researchers can use this dataset to explore how well Dataverse software and the repositories using the software help depositors describe data. How the metadata was downloaded The dataset metadata and metadata block JSON files were downloaded from each installation on October 2 and October 3, 2022 using a Python script kept in a GitHub repo at https://github.com/jggautier/dataverse-scripts/blob/main/other_scripts/get_dataset_metadata_of_all_installations.py. In order to get the metadata from installations that require an installation account API token to use certain Dataverse software APIs, I created a CSV file with two columns: one column named "hostname" listing each installation URL in which I was able to create an account and another named "apikey" listing my accounts' API tokens. The Python script expects and uses the API tokens in this CSV file to get metadata and other information from installations that require API tokens. How the files are organized ├── csv_files_with_metadata_from_most_known_dataverse_installations │ ├── author(citation).csv │ ├── basic.csv │ ├── contributor(citation).csv │ ├── ... │ └── topic_classification(citation).csv ├── dataverse_json_metadata_from_each_known_dataverse_installation │ ├── Abacus_2022.10.02_17.11.19.zip │ ├── dataset_pids_Abacus_2022.10.02_17.11.19.csv │ ├── Dataverse_JSON_metadata_2022.10.02_17.11.19 │ ├── hdl_11272.1_AB2_0AQZNT_v1.0.json │ ├── ... │ ├── metadatablocks_v5.6 │ ├── astrophysics_v5.6.json │ ├── biomedical_v5.6.json │ ├── citation_v5.6.json │ ├── ... │ ├── socialscience_v5.6.json │ ├── ACSS_Dataverse_2022.10.02_17.26.19.zip │ ├── ADA_Dataverse_2022.10.02_17.26.57.zip │ ├── Arca_Dados_2022.10.02_17.44.35.zip │ ├── ... │ └── World_Agroforestry_-_Research_Data_Repository_2022.10.02_22.59.36.zip └── dataset_pids_from_most_known_dataverse_installations.csv └── licenses_used_by_dataverse_installations.csv └── metadatablocks_from_most_known_dataverse_installations.csv This dataset contains two directories and three CSV files not in a directory. One directory, "csv_files_with_metadata_from_most_known_dataverse_installations", contains 18 CSV files that contain the values from common metadata fields of all 77 Dataverse installations. For example, author(citation)_2022.10.02-2022.10.03.csv contains the "Author" metadata for all published, non-deaccessioned, versions of all datasets in the 77 installations, where there's a row for each author name, affiliation, identifier type and identifier. The other directory, "dataverse_json_metadata_from_each_known_dataverse_installation", contains 77 zipped files, one for each of the 77 Dataverse installations whose dataset metadata I was able to download using Dataverse APIs. Each zip file contains a CSV file and two sub-directories: The CSV file contains the persistent IDs and URLs of each published dataset in the Dataverse installation as well as a column to indicate whether or not the Python script was able to download the Dataverse JSON metadata for each dataset. For Dataverse installations using Dataverse software versions whose Search APIs include each dataset's owning Dataverse collection name and alias, the CSV files also include which Dataverse collection (within the installation) that dataset was published in. One sub-directory contains a JSON file for each of the installation's published, non-deaccessioned dataset versions. The JSON files contain the metadata in the "Dataverse JSON" metadata schema. The other sub-directory contains information about the metadata models (the "metadata blocks" in JSON files) that the installation was using when the dataset metadata was downloaded. I saved them so that they can be used when extracting metadata from the Dataverse JSON files. The dataset_pids_from_most_known_dataverse_installations.csv file contains the dataset PIDs of all published datasets in the 77 Dataverse installations, with a column to indicate if the Python script was able to download the dataset's metadata. It's a union of all of the "dataset_pids_..." files in each of the 77 zip files. The licenses_used_by_dataverse_installations.csv file contains information about the licenses that a number of the installations let depositors choose when creating datasets. When I collected ... Visit https://dataone.org/datasets/sha256%3Ad27d528dae8cf01e3ea915f450426c38fd6320e8c11d3e901c43580f997a3146 for complete metadata about this dataset.

Search
Clear search
Close search
Google apps
Main menu