42 datasets found
  1. Merge script

    • figshare.com
    txt
    Updated May 30, 2018
    Cite
    Jose Aguasvivas; Jon Andoni Duñabeitia (2018). Merge script [Dataset]. http://doi.org/10.6084/m9.figshare.5924797.v2
    Available download formats: txt
    Dataset updated
    May 30, 2018
    Dataset provided by
    figshare
    Authors
    Jose Aguasvivas; Jon Andoni Duñabeitia
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    R script to load the databases and merge them into a single file. This script may be modified and extended as needed.
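
    A minimal sketch of what such a merge script could look like in base R (the folder name, file pattern, and the key column "subject_id" are assumptions for illustration, not the authors' actual code):

    # Load all delimited database files from a folder and merge them
    # into a single file by a shared key column.
    data_dir <- "databases"
    files <- list.files(data_dir, pattern = "\\.txt$", full.names = TRUE)
    tables <- lapply(files, read.delim, stringsAsFactors = FALSE)

    # Merge the tables pairwise on the shared key, keeping all rows.
    merged <- Reduce(function(x, y) merge(x, y, by = "subject_id", all = TRUE), tables)

    write.csv(merged, "merged_database.csv", row.names = FALSE)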

  2. NSF/NCAR GV HIAPER 1 Minute Data Merge

    • data.ucar.edu
    ascii
    Updated Aug 1, 2025
    + more versions
    Cite
    Gao Chen; Jennifer R. Olson; Michael Shook (2025). NSF/NCAR GV HIAPER 1 Minute Data Merge [Dataset]. http://doi.org/10.26023/R1RA-JHKZ-W913
    Available download formats: ascii
    Dataset updated
    Aug 1, 2025
    Authors
    Gao Chen; Jennifer R. Olson; Michael Shook
    Time period covered
    May 18, 2012 - Jun 30, 2012
    Area covered
    Description

    This data set contains NSF/NCAR GV HIAPER 1 Minute Data Merge data collected during the Deep Convective Clouds and Chemistry Experiment (DC3) from 18 May 2012 through 30 June 2012. These are updated merges from the NASA DC3 archive, made available on 13 June 2014. In most cases, variable names have been kept identical to those submitted in the raw data files. However, in some cases, names have been changed (e.g., to eliminate duplication). Units have been standardized throughout the merge. In addition, a "grand merge" has been provided, which includes data from all the individual merged flights throughout the mission. The grand merge follows the naming convention "dc3-mrg60-gV_merge_YYYYMMdd_R5_thruYYYYMMdd.ict" (with "_thruYYYYMMdd" indicating the last flight date included). This data set is in ICARTT format. Please see the header portion of the data files for details on instruments, parameters, quality assurance, quality control, contact information, and data set comments.

  3. Merger of BNV-D data (2008 to 2019) and enrichment

    • data.europa.eu
    zip
    Updated Jan 16, 2025
    Cite
    Patrick VINCOURT (2025). Merger of BNV-D data (2008 to 2019) and enrichment [Dataset]. https://data.europa.eu/data/datasets/5f1c3eca9d149439e50c740f
    Available download formats: zip (18530465)
    Dataset updated
    Jan 16, 2025
    Dataset authored and provided by
    Patrick VINCOURT
    Description

    Merging (in R) of the data published at https://www.data.gouv.fr/fr/datasets/ventes-de-pesticides-par-departement/, joined with two other sources of information associated with marketing authorisations (AMM): uses (https://www.data.gouv.fr/fr/datasets/usages-des-produits-phytosanitaires/) and the "Biocontrol" status of the product, taken from document DGAL/SDQSPV/2020-784 published on 18/12/2020 at https://agriculture.gouv.fr/quest-ce-que-le-biocontrole

    All the initial files (.csv transformed into .txt), the R code used to merge the data, and the different output files are collected in a zip. NB: 1) "YASCUB" stands for {year, AMM, Substance_active, Classification, Usage, Statut_"BioControl"}; substances not on the DGAL/SDQSPV list are coded NA. 2) The file of biocontrol products was cleaned of the duplicates generated by marketing authorisations covering several trade names.
    3) The BNVD_BioC_DY3 table and the output file BNVD_BioC_DY3.txt contain the fields {Code_Region, Region, Dept, Code_Dept, Anne, Usage, Classification, Type_BioC, Quantite_substance}.
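
    For illustration, a hedged sketch of how such a merge and enrichment could be done in R (file and column names, including "amm", are assumptions, not the author's actual code):

    # Stack the yearly BNV-D sales files, then join usage information and
    # flag products present on the biocontrol list, matching on the
    # marketing-authorisation number (AMM).
    sales_files <- list.files("bnvd", pattern = "\\.txt$", full.names = TRUE)
    sales <- do.call(rbind, lapply(sales_files, read.delim))

    usages <- read.delim("usages_produits_phytosanitaires.txt")
    biocontrol <- read.delim("liste_biocontrole_DGAL.txt")

    out <- merge(sales, usages, by = "amm", all.x = TRUE)
    out$type_bioc <- ifelse(out$amm %in% biocontrol$amm, "BioControl", NA)

    write.table(out, "BNVD_enrichi.txt", sep = "\t", row.names = FALSE)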

  4. NASA DC-8 SAGAAERO Data Merge

    • data.ucar.edu
    ascii
    Updated Aug 1, 2025
    Cite
    Gao Chen; Jennifer R. Olson; Michael Shook (2025). NASA DC-8 SAGAAERO Data Merge [Dataset]. http://doi.org/10.26023/ANQE-HZRR-P30K
    Available download formats: ascii
    Dataset updated
    Aug 1, 2025
    Authors
    Gao Chen; Jennifer R. Olson; Michael Shook
    Time period covered
    May 18, 2012 - Jun 22, 2012
    Area covered
    Description

    This data set contains NASA DC-8 SAGAAERO Data Merge data collected during the Deep Convective Clouds and Chemistry Experiment (DC3) from 18 May 2012 through 22 June 2012. These merge files were updated by NASA. The data have been merged to the SAGAAero file timeline. In most cases, variable names have been kept identical to those submitted in the raw data files. However, in some cases, names have been changed (e.g., to eliminate duplication). Units have been standardized throughout the merge. In addition, a "grand merge" has been provided, which includes data from all the individual merged flights throughout the mission. The grand merge follows the naming convention "dc3-mrgSAGAAero-dc8_merge_YYYYMMdd_R*_thruYYYYMMdd.ict" (with "_thruYYYYMMdd" indicating the last flight date included). This data set is in ICARTT format. Please see the header portion of the data files for details on instruments, parameters, quality assurance, quality control, contact information, and data set comments.

  5. Data from: KORUS-AQ Aircraft Merge Data Files

    • catalog.data.gov
    • cmr.earthdata.nasa.gov
    Updated Aug 22, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NASA/LARC/SD/ASDC (2025). KORUS-AQ Aircraft Merge Data Files [Dataset]. https://catalog.data.gov/dataset/korus-aq-aircraft-merge-data-files
    Dataset updated
    Aug 22, 2025
    Dataset provided by
    NASA (http://nasa.gov/)
    Description

    KORUSAQ_Merge_Data are pre-generated merge data files combining various products collected during the KORUS-AQ field campaign. This collection features pre-generated merge files for the DC-8 aircraft. Data collection for this product is complete. The KORUS-AQ field study was conducted in South Korea during May-June 2016. The study was jointly sponsored by NASA and Korea's National Institute of Environmental Research (NIER). The primary objectives were to investigate the factors controlling air quality in Korea (e.g., local emissions, chemical processes, and transboundary transport) and to assess future air quality observing strategies incorporating geostationary satellite observations. To achieve these science objectives, KORUS-AQ adopted a highly coordinated sampling strategy involving surface and airborne measurements with both in-situ and remote sensing instruments. Surface observations provided details on ground-level air quality conditions, while airborne sampling provided an assessment of conditions aloft relevant to satellite observations and necessary to understand the role of emissions, chemistry, and dynamics in determining air quality outcomes. The sampling region covers the South Korean peninsula and surrounding waters, with a primary focus on the Seoul Metropolitan Area. Airborne sampling was primarily conducted from near surface to about 8 km, with extensive profiling to characterize the vertical distribution of pollutants and their precursors. The airborne observational data were collected from three aircraft platforms: the NASA DC-8, NASA B-200, and Hanseo King Air. Surface measurements were conducted from 16 ground sites and 2 ships: R/V Onnuri and R/V Jang Mok. The major data products collected from both the ground and air include in-situ measurements of trace gases (e.g., ozone, reactive nitrogen species, carbon monoxide and dioxide, methane, non-methane and oxygenated hydrocarbon species), aerosols (e.g., microphysical and optical properties and chemical composition), active remote sensing of ozone and aerosols, and passive remote sensing of NO2, CH2O, and O3 column densities. These data products support research focused on examining the impact of photochemistry and transport on ozone and aerosols, evaluating emissions inventories, and assessing the potential use of satellite observations in air quality studies.

  6. NASA DC-8 10 Second Data Merge

    • data.ucar.edu
    ascii
    Updated Aug 1, 2025
    + more versions
    Cite
    Gao Chen; Jennifer R. Olson; Michael Shook (2025). NASA DC-8 10 Second Data Merge [Dataset]. http://doi.org/10.26023/CHJ0-RYQ4-GR10
    Available download formats: ascii
    Dataset updated
    Aug 1, 2025
    Authors
    Gao Chen; Jennifer R. Olson; Michael Shook
    Time period covered
    May 18, 2012 - Jun 22, 2012
    Area covered
    Description

    This data set contains NASA DC-8 10 Second Data Merge data collected during the Deep Convective Clouds and Chemistry Experiment (DC3) from 18 May 2012 through 22 June 2012. These merges are an updated version provided by NASA. In most cases, variable names have been kept identical to those submitted in the raw data files. However, in some cases, names have been changed (e.g., to eliminate duplication). Units have been standardized throughout the merge. In addition, a "grand merge" has been provided, which includes data from all the individual merged flights throughout the mission. The grand merge follows the naming convention "dc3-mrg10-dc8_merge_YYYYMMdd_R*_thruYYYYMMdd.ict" (with "_thruYYYYMMdd" indicating the last flight date included). This data set is in ICARTT format. Please see the header portion of the data files for details on instruments, parameters, quality assurance, quality control, contact information, and data set comments. For the latest information on the updates to this dataset, please see the readme file.

  7. Multilevel modeling of time-series cross-sectional data reveals the dynamic...

    • data.niaid.nih.gov
    • dataone.org
    • +1more
    zip
    Updated Mar 6, 2020
    Cite
    Kodai Kusano (2020). Multilevel modeling of time-series cross-sectional data reveals the dynamic interaction between ecological threats and democratic development [Dataset]. http://doi.org/10.5061/dryad.547d7wm3x
    Available download formats: zip
    Dataset updated
    Mar 6, 2020
    Dataset provided by
    University of Nevada, Reno
    Authors
    Kodai Kusano
    License

    CC0 1.0 Universal: https://spdx.org/licenses/CC0-1.0.html

    Description

    What is the relationship between environment and democracy? The framework of cultural evolution suggests that societal development is an adaptation to ecological threats. Pertinent theories assume that democracy emerges as societies adapt to ecological factors such as higher economic wealth, lower pathogen threats, less demanding climates, and fewer natural disasters. However, previous research confused within-country processes with between-country processes and erroneously interpreted between-country findings as if they generalize to within-country mechanisms. In this article, we analyze a time-series cross-sectional dataset to study the dynamic relationship between environment and democracy (1949-2016), accounting for previous misconceptions in levels of analysis. By separating within-country processes from between-country processes, we find that the relationship between environment and democracy not only differs by countries but also depends on the level of analysis. Economic wealth predicts increasing levels of democracy in between-country comparisons, but within-country comparisons show that democracy declines as countries become wealthier over time. This relationship is only prevalent among historically wealthy countries but not among historically poor countries, whose wealth also increased over time. By contrast, pathogen prevalence predicts lower levels of democracy in both between-country and within-country comparisons. Our longitudinal analyses identifying temporal precedence reveal that not only reductions in pathogen prevalence drive future democracy, but also democracy reduces future pathogen prevalence and increases future wealth. These nuanced results contrast with previous analyses using narrow, cross-sectional data. As a whole, our findings illuminate the dynamic process by which environment and democracy shape each other.

    Methods: Our time-series cross-sectional data combine various online databases. Country names were first identified and matched using the R package "countrycode" (Arel-Bundock, Enevoldsen, & Yetman, 2018) before all datasets were merged. Occasionally, we modified unidentified country names to be consistent across datasets. We then transformed "wide" data into "long" data and merged them using R's Tidyverse framework (Wickham, 2014). Our analysis begins with the year 1949 because one of the key time-variant level-1 variables, pathogen prevalence, was only available from 1949 onward. See our Supplemental Material for all data, Stata syntax, R Markdown for visualization, supplemental analyses, and detailed results (available at https://osf.io/drt8j/).
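
    A hedged sketch of the kind of harmonization and merge described above (file names and columns are hypothetical; the actual scripts are in the Supplemental Material):

    library(tidyverse)
    library(countrycode)

    # Harmonize country names to ISO3 codes, reshape wide yearly columns
    # to long format, then join the sources into one country-year panel.
    democracy <- read_csv("democracy_wide.csv") %>%
      mutate(iso3 = countrycode(country, origin = "country.name", destination = "iso3c")) %>%
      pivot_longer(-c(country, iso3), names_to = "year", values_to = "democracy",
                   names_transform = list(year = as.integer))

    pathogens <- read_csv("pathogen_long.csv") %>%
      mutate(iso3 = countrycode(country, origin = "country.name", destination = "iso3c"))

    panel <- democracy %>%
      left_join(pathogens, by = c("iso3", "year")) %>%
      filter(year >= 1949)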

  8. Data from: A dataset to model Levantine landcover and land-use change...

    • zenodo.org
    • data.niaid.nih.gov
    zip
    Updated Dec 16, 2023
    Cite
    Michael Kempf; Michael Kempf (2023). A dataset to model Levantine landcover and land-use change connected to climate change, the Arab Spring and COVID-19 [Dataset]. http://doi.org/10.5281/zenodo.10396148
    Available download formats: zip
    Dataset updated
    Dec 16, 2023
    Dataset provided by
    Zenodo (http://zenodo.org/)
    Authors
    Michael Kempf; Michael Kempf
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Dec 16, 2023
    Area covered
    Levant
    Description

    Overview

    This dataset is the repository for the following paper submitted to Data in Brief:

    Kempf, M. A dataset to model Levantine landcover and land-use change connected to climate change, the Arab Spring and COVID-19. Data in Brief (submitted: December 2023).

    The Data in Brief article contains the supplement information and is the related data paper to:

    Kempf, M. Climate change, the Arab Spring, and COVID-19 - Impacts on landcover transformations in the Levant. Journal of Arid Environments (revision submitted: December 2023).

    Description/abstract

    The Levant region is highly vulnerable to climate change, experiencing prolonged heat waves that have led to societal crises and population displacement. Since 2010, the area has been marked by socio-political turmoil, including the Syrian civil war and currently the escalation of the so-called Israeli-Palestinian Conflict, which strained neighbouring countries like Jordan due to the influx of Syrian refugees and increases population vulnerability to governmental decision-making. Jordan, in particular, has seen rapid population growth and significant changes in land-use and infrastructure, leading to over-exploitation of the landscape through irrigation and construction. This dataset uses climate data, satellite imagery, and land cover information to illustrate the substantial increase in construction activity and highlights the intricate relationship between climate change predictions and current socio-political developments in the Levant.

    Folder structure

    The main folder after download contains all data; the following subfolders are stored as zipped files:

    “code” stores the 9 code chunks described below under “Code structure” to read, extract, process, analyse, and visualize the data.

    “MODIS_merged” contains the 16-days, 250 m resolution NDVI imagery merged from three tiles (h20v05, h21v05, h21v06) and cropped to the study area, n=510, covering January 2001 to December 2022 and including January and February 2023.

    “mask” contains a single shapefile, which is the merged product of administrative boundaries, including Jordan, Lebanon, Israel, Syria, and Palestine (“MERGED_LEVANT.shp”).

    “yield_productivity” contains .csv files of yield information for all countries listed above.

    “population” contains two files with the same name but different format. The .csv file is for processing and plotting in R. The .ods file is for enhanced visualization of population dynamics in the Levant (Socio_cultural_political_development_database_FAO2023.ods).

    “GLDAS” stores the raw data of the NASA Global Land Data Assimilation System datasets that can be read, extracted (variable name), and processed using code “8_GLDAS_read_extract_trend” from the respective folder. One folder contains data from 1975-2022 and a second the additional January and February 2023 data.

    “built_up” contains the landcover and built-up change data from 1975 to 2022. This folder is subdivided into two subfolders, which contain the raw data and the already processed data. “raw_data” contains the unprocessed datasets and “derived_data” stores the cropped built_up datasets at 5-year intervals, e.g., “Levant_built_up_1975.tif”.

    Code structure

    1_MODIS_NDVI_hdf_file_extraction.R


    This is the first code chunk; it covers the extraction of MODIS data from the .hdf file format. The following packages must be installed, and the raw data must be downloaded using a simple mass downloader, e.g., from Google Chrome. Packages: terra. Download MODIS data after registration from: https://lpdaac.usgs.gov/products/mod13q1v061/ or https://search.earthdata.nasa.gov/search (MODIS/Terra Vegetation Indices 16-Day L3 Global 250m SIN Grid V061, last accessed 9 October 2023). The code reads a list of files, extracts the NDVI, and saves each file to a single .tif file with the indication “NDVI”. Because the study area is quite large, we have to load three spatially different time series (one per tile) and merge them later. Note that the time series are temporally consistent.
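
    A hedged sketch of what this extraction step could look like with terra (paths and the sub-dataset index are assumptions; the first MOD13Q1 sub-dataset is assumed to be the 16-day NDVI):

    library(terra)

    hdf_files <- list.files("MODIS_raw", pattern = "\\.hdf$", full.names = TRUE)
    dir.create("MODIS_NDVI", showWarnings = FALSE)

    for (f in hdf_files) {
      ndvi <- rast(f, subds = 1)  # assumed: first sub-dataset holds the NDVI layer
      out <- file.path("MODIS_NDVI",
                       paste0(tools::file_path_sans_ext(basename(f)), "_NDVI.tif"))
      writeRaster(ndvi, out, overwrite = TRUE)
    }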


    2_MERGE_MODIS_tiles.R


    In this code, we load and merge the three different stacks to produce a large and consistent time series of NDVI imagery across the study area. We use the package gtools to load the files in natural numeric order (1, 2, 3, 4, 5, 6, etc.). Here, we have three stacks, of which we merge the first two (stack 1, stack 2) and store the result. We then merge this stack with stack 3. We produce single files named NDVI_final_*consecutivenumber*.tif. Before saving the final output of single merged files, create a folder called “merged” and set the working directory to this folder, e.g., setwd("your directory_MODIS/merged").
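
    A hedged sketch of this merging step (directory names are assumptions, not the original code):

    library(terra)
    library(gtools)

    # Load the per-tile NDVI files in natural numeric order, then merge the
    # three tiles date by date: tiles 1 and 2 first, then the result with tile 3.
    tile_dirs <- c("h20v05", "h21v05", "h21v06")
    files <- lapply(tile_dirs, function(d)
      mixedsort(list.files(d, pattern = "_NDVI\\.tif$", full.names = TRUE)))

    dir.create("merged", showWarnings = FALSE)
    for (i in seq_along(files[[1]])) {
      m12 <- merge(rast(files[[1]][i]), rast(files[[2]][i]))
      m   <- merge(m12, rast(files[[3]][i]))
      writeRaster(m, file.path("merged", paste0("NDVI_final_", i, ".tif")),
                  overwrite = TRUE)
    }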


    3_CROP_MODIS_merged_tiles.R


    Now we want to crop the derived MODIS tiles to our study area. We use a mask, provided as a .shp file in the repository, named "MERGED_LEVANT.shp". We load the merged .tif files and crop the stack with the vector. Saving to individual files, we name them “NDVI_merged_clip_*consecutivenumber*.tif”. We have now produced single cropped NDVI time series data from MODIS.
    The repository provides the already clipped and merged NDVI datasets.
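
    A hedged sketch of the cropping step (folder names are assumptions; the mask shapefile is the one provided in the repository):

    library(terra)
    library(gtools)

    mask_vec <- vect("MERGED_LEVANT.shp")
    merged_files <- mixedsort(list.files("merged", pattern = "^NDVI_final_.*\\.tif$",
                                         full.names = TRUE))

    # Crop and mask each merged mosaic to the study area outline.
    for (i in seq_along(merged_files)) {
      clipped <- mask(crop(rast(merged_files[i]), mask_vec), mask_vec)
      writeRaster(clipped, paste0("NDVI_merged_clip_", i, ".tif"), overwrite = TRUE)
    }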


    4_TREND_analysis_NDVI.R


    Now we want to perform trend analysis on the derived data. The data we load are tricky, as they contain a 16-day return period across each year for a period of 22 years. Growing season sums contain MAM (March-May), JJA (June-August), and SON (September-November). December is represented as a single file, which means that the period DJF (December-February) is represented by 5 images instead of 6. For the last DJF period (December 2022), the data from January and February 2023 can be added. The code selects the respective images from the stack, depending on which period is under consideration. From these stacks, individual annually resolved growing season sums are generated and the slope is calculated. We can then extract the p-values of the trend and characterize all values with a high confidence level (0.05). Using the ggplot2 package and the melt function from the reshape2 package, we can create a plot of the reclassified NDVI trends together with a local smoother (LOESS) of value 0.3.
    To increase comparability and understand the amplitude of the trends, z-scores were calculated and plotted, which show the deviation of the values from the mean. This has been done for the NDVI values as well as the GLDAS climate variables as a normalization technique.


    5_BUILT_UP_change_raster.R


    Let us look at the landcover changes now. We work with the terra package and get raster data from https://ghsl.jrc.ec.europa.eu/download.php?ds=bu (last accessed 3 March 2023, 100 m resolution, global coverage). One can download the temporal coverage that is aimed for and reclassify it using the code after cropping to the individual study area. Here, I summed up the different rasters to characterize the built-up change in continuous values between 1975 and 2022.
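
    A hedged sketch of such a summation (paths are assumptions; the inputs are the cropped 5-year built-up layers described in the folder structure above):

    library(terra)

    built_files <- list.files("built_up/derived_data",
                              pattern = "^Levant_built_up_.*\\.tif$", full.names = TRUE)
    built <- rast(built_files)   # one layer per epoch, same extent and resolution
    built_change <- sum(built)   # cell-wise sum: higher values = built up in more epochs
    writeRaster(built_change, "Levant_built_up_change_1975_2022.tif", overwrite = TRUE)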


    6_POPULATION_numbers_plot.R


    For this plot, one needs to load the .csv-file “Socio_cultural_political_development_database_FAO2023.csv” from the repository. The ggplot script provided produces the desired plot with all countries under consideration.


    7_YIELD_plot.R


    In this section, we use the country productivity data from the supplement in the repository folder “yield_productivity” (e.g., "Jordan_yield.csv"). Each of the single-country yield datasets is plotted in a ggplot, and the plots are combined using the patchwork package in R.


    8_GLDAS_read_extract_trend


    The last code chunk provides the basis for the trend analysis of the climate variables used in the paper. The raw data can be accessed at https://disc.gsfc.nasa.gov/datasets?keywords=GLDAS%20Noah%20Land%20Surface%20Model%20L4%20monthly&page=1 (last accessed 9 October 2023). The raw data come in .nc file format, and various variables can be extracted using the [“^a variable name”] command from the SpatRaster collection. Each time you run the code, this variable name must be adjusted to the variable of interest (see this link for abbreviations: https://disc.gsfc.nasa.gov/datasets/GLDAS_CLSM025_D_2.0/summary, last accessed 9 October 2023; or see the respective code chunk when reading a .nc file with the ncdf4 package in R, or run print(nc) from the code, or use names() on the SpatRaster collection).
    Choosing one variable, the code uses the MERGED_LEVANT.shp mask from the repository to crop and mask the data to the outline of the study area.
    From the processed data, trend analyses are conducted and z-scores are calculated following the code described above. However, annual trends require the frequency of the time series analysis to be set to value = 12. Regarding, e.g., rainfall, which is measured as annual sums and not means, the chunk r.sum=r.sum/12 has to be removed or set to r.sum=r.sum/1 to avoid calculating annual mean values (see other variables). Seasonal subsets can be calculated as described in the code. Here, 3-month subsets were chosen for the growing seasons, e.g., March-May (MAM), June-August (JJA), September-November (SON), and DJF (December-February, including Jan/Feb of the consecutive year).
    From the data, mean values of 48 consecutive years are calculated and trend analyses are performed as described above. In the same way, p-values are extracted and 95% confidence level values are marked with dots on the raster plot. This analysis can be performed with a much longer time series, other variables, and different spatial extents across the globe due to the availability of the GLDAS variables.

  9. Scripts for Analysis

    • figshare.com
    txt
    Updated Jul 18, 2018
    Cite
    Sneddon Lab UCSF (2018). Scripts for Analysis [Dataset]. http://doi.org/10.6084/m9.figshare.6783569.v2
    Available download formats: txt
    Dataset updated
    Jul 18, 2018
    Dataset provided by
    figshare
    Authors
    Sneddon Lab UCSF
    License

    MIT License: https://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Scripts used for analysis of V1 and V2 datasets:

    • seurat_v1.R - initialize Seurat object from 10X Genomics cellranger outputs. Includes filtering, normalization, regression, variable gene identification, PCA analysis, clustering, and tSNE visualization. Used for v1 datasets.
    • merge_seurat.R - merge two or more Seurat objects into one Seurat object. Performs linear regression to remove batch effects from separate objects. Used for v1 datasets.
    • subcluster_seurat_v1.R - subcluster clusters of interest from a Seurat object. Determines variable genes, performs regression and PCA. Used for v1 datasets.
    • seurat_v2.R - initialize Seurat object from 10X Genomics cellranger outputs. Includes filtering, normalization, regression, variable gene identification, and PCA analysis. Used for v2 datasets.
    • clustering_markers_v2.R - clustering and tSNE visualization for v2 datasets.
    • subcluster_seurat_v2.R - subcluster clusters of interest from a Seurat object. Determines variable genes, performs regression and PCA analysis. Used for v2 datasets.
    • seurat_object_analysis_v1_and_v2.R - downstream analysis and plotting functions for the Seurat object created by seurat_v1.R or seurat_v2.R.
    • merge_clusters.R - merge clusters that do not meet the gene threshold. Used for both v1 and v2 datasets.
    • prepare_for_monocle_v1.R - subcluster cells of interest and perform linear regression, but not scaling, in order to input normalized, regressed values into Monocle with monocle_seurat_input_v1.R.
    • monocle_seurat_input_v1.R - Monocle script using Seurat batch-corrected values as input for v1 merged timecourse datasets.
    • monocle_lineage_trace.R - Monocle script using nUMI as input for the v2 lineage-traced dataset.
    • monocle_object_analysis.R - downstream analysis for the Monocle object: BEAM and plotting.
    • CCA_merging_v2.R - script for merging v2 endocrine datasets with canonical correlation analysis and determining the number of CCs to include in downstream analysis.
    • CCA_alignment_v2.R - script for downstream alignment, clustering, tSNE visualization, and differential gene expression analysis.
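
    As a minimal, hedged illustration of the merging step performed by merge_seurat.R (sample paths and object names are hypothetical; the actual script also performs the v1-specific filtering and regression settings):

    library(Seurat)

    # Build two Seurat objects from cellranger output and merge them,
    # then regress out the sample of origin as a simple batch covariate.
    s1 <- CreateSeuratObject(counts = Read10X("sample1/filtered_feature_bc_matrix"),
                             project = "s1")
    s2 <- CreateSeuratObject(counts = Read10X("sample2/filtered_feature_bc_matrix"),
                             project = "s2")

    combined <- merge(s1, y = s2, add.cell.ids = c("s1", "s2"))
    combined <- NormalizeData(combined)
    combined <- FindVariableFeatures(combined)
    combined <- ScaleData(combined, vars.to.regress = "orig.ident")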

  10. R Script - Sequence Merging Stats, Reorganizing and Averaging Replicate...

    • figshare.com
    txt
    Updated May 18, 2021
    Cite
    Lindsay Putman (2021). R Script - Sequence Merging Stats, Reorganizing and Averaging Replicate Samples [Dataset]. http://doi.org/10.6084/m9.figshare.14605722.v1
    Available download formats: txt
    Dataset updated
    May 18, 2021
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Lindsay Putman
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    R Script used to assess if the merging of sequence data from different sequencing centers was successful, and to reorganize count table data columns and average technical replicates.

  11. NASA DC-8 1 Minute Data Merge

    • data.ucar.edu
    ascii
    Updated Aug 1, 2025
    + more versions
    Cite
    Gao Chen; Jennifer R. Olson; Michael Shook (2025). NASA DC-8 1 Minute Data Merge [Dataset]. http://doi.org/10.26023/VM9C-1C16-H003
    Available download formats: ascii
    Dataset updated
    Aug 1, 2025
    Authors
    Gao Chen; Jennifer R. Olson; Michael Shook
    Time period covered
    May 1, 2012 - Jun 30, 2012
    Area covered
    Description

    This dataset contains NASA DC-8 1 Minute Data Merge data collected during the Deep Convective Clouds and Chemistry Experiment (DC3) from 18 May 2012 through 22 June 2012. This dataset contains updated data provided by NASA. In most cases, variable names have been kept identical to those submitted in the raw data files. However, in some cases, names have been changed (e.g., to eliminate duplication). Units have been standardized throughout the merge. In addition, a "grand merge" has been provided, which includes data from all the individual merged flights throughout the mission. The grand merge follows the naming convention "dc3-mrg60-dc8_merge_YYYYMMdd_R5_thruYYYYMMdd.ict" (with "_thruYYYYMMdd" indicating the last flight date included). This dataset is in ICARTT format. Please see the header portion of the data files for details on instruments, parameters, quality assurance, quality control, contact information, and dataset comments. For more information on updates to this dataset, please see the readme file.

  12. DeepMerge II: Building Robust Deep Learning Algorithms for Merging Galaxy...

    • data.niaid.nih.gov
    Updated Feb 26, 2021
    Cite
    Aleksandra Ciprijanovic; Gregory Snyder; Kathryn Downey; Diana Kafkes (2021). DeepMerge II: Building Robust Deep Learning Algorithms for Merging Galaxy Identification Across Domains (Data) [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4507940
    Dataset updated
    Feb 26, 2021
    Dataset provided by
    Space Telescope Science Institute (https://stsci.edu/)
    University of Chicago
    Fermi National Accelerator Laboratory
    Authors
    Aleksandra Ciprijanovic; Gregory Snyder; Kathryn Downey; Diana Kafkes
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We present the data used in "DeepMerge II: Building Robust Deep Learning Algorithms for Merging Galaxy Identification Across Domains". In this paper, we test domain adaptation techniques, such as Maximum Mean Discrepancy (MMD) and adversarial training with Domain Adversarial Neural Networks (DANNs) for cross-domain studies of merging galaxies. Domain adaptation is performed between two simulated datasets of various levels of observational realism (simulation-to-simulation experiments), and between simulated data and observed telescope images (simulation-to-real experiments). For more details about the datasets please see the paper mentioned above.

    Simulation-to-Simulation Experiments

    Data used to study distant merging galaxies using simulated images from the Illustris-1 cosmological simulation at redshift z=2. The images are 75x75 pixels with three filters applied that mimic Hubble Space Telescope (HST) observations (ACS F814W, NC F356W, WFC3 F160W) with added point-spread function (PSF) and with or without observational noise.

    Source Domain

    • Images: SimSim_SOURCE_X_Illustris2_pristine.npy
    • Labels: SimSim_SOURCE_y_Illustris2_pristine.npy

    Target Domain

    • Images: SimSim_TARGET_X_Illustris2_noisy.npy
    • Labels: SimSim_TARGET_y_Illustris2_noisy.npy

    Simulation-to-Real Experiments

    Data used to study nearby merging galaxies using simulated Illustris-1 images at redshift z=0 and observed Sloan Digital Sky Survey (SDSS) images from the Galaxy Zoo project. All images have three filters. SDSS images have (g,r,i) filters, while simulated Illustris images also mimic the same three SDSS filters with added effects of dust, PSF and observational noise.

    Source Domain

    • Images: SimReal_SOURCE_X_Illustris0.npy
    • Labels: SimReal_SOURCE_y_Illustris0.npy

    Target Domain

    • Images: SimReal_TARGET_X_postmergers_SDSS.npy
    • Labels: SimReal_TARGET_y_postmergers_SDSS.npy

  13. Replication Data for: Wake merging and turbulence transition downstream of...

    • search.dataone.org
    • dataverse.no
    Updated Jun 3, 2025
    Cite
    Hearst, R. Jason; Berstad, Fanny Olivia Johannessen; Neunaber, Ingrid (2025). Replication Data for: Wake merging and turbulence transition downstream of side-by-side porous discs [Dataset]. http://doi.org/10.18710/XAEWC5
    Dataset updated
    Jun 3, 2025
    Dataset provided by
    DataverseNO
    Authors
    Hearst, R. Jason; Berstad, Fanny Olivia Johannessen; Neunaber, Ingrid
    Description

    These are the streamwise velocity time series measured in the wakes of two sets of porous discs in a side-by-side setting, as used in the manuscript "Wake merging and turbulence transition downstream of side-by-side porous discs", which has been accepted by the Journal of Fluid Mechanics. Data were obtained by means of hot-wire anemometry in the Large Scale Wind Tunnel at the Norwegian University of Science and Technology in near-laminar inflow (background turbulence intensity of approximately 0.3%) at an inflow velocity of 10 m/s (diameter-based Reynolds number 125000). Two types of porous discs with diameter D = 0.2 m, one with uniform blockage and one with radially changing blockage, were used. Three spacings, namely 1.5D, 2D and 3D, were investigated. Spanwise profiles were measured at 8D and 30D downstream for each case, and a streamwise profile along the centerline between the discs was additionally obtained. In addition, measurements downstream of both disc types (single disc setting) are provided as a comparison. The scope of these experiments was to study the merging mechanisms of the turbulence when the two wakes meet.

  14. Data from: Merged acoustic-backscatter imagery collected in 2005, 2007, and...

    • datasets.ai
    • s.cnmilf.com
    • +1more
    55
    Cite
    Department of the Interior, Merged acoustic-backscatter imagery collected in 2005, 2007, and 2010, Skagit Bay, Washington [Dataset]. https://datasets.ai/datasets/merged-acoustic-backscactter-imagery-collected-in-2005-2007-and-2010-skagit-bay-washington
    Available download formats: 55
    Dataset authored and provided by
    Department of the Interior
    Description

    These metadata describe the U.S. Geological Survey (USGS), Pacific Coastal and Marine Science Center (PCMSC) merged acoustic-backscatter imagery that was collected in 2005, 2007, and 2010 in Skagit Bay, Washington, and is provided as a 5-m resolution TIFF image. In 2004, 2005, 2007, and 2010 the U.S. Geological Survey (USGS), Pacific Coastal and Marine Science Center (PCMSC) collected bathymetry and acoustic backscatter data in Skagit Bay, Washington using an interferometric bathymetric sidescan sonar system mounted to the USGS R/V Parke Snavely and the USGS R/V Karluk. The research was conducted in coordination with the Swinomish Indian Tribal Community, Skagit River System Cooperative, Skagit Watershed Council, Puget Sound Nearshore Ecosystem Restoration Project, and U.S. Army Corps of Engineers to characterize estuarine habitats and processes, including the sediment budget of the Skagit River and the influence of river-delta channelization on sediment transport. Information quantifying the distribution of habitats and the extent to which sediment transport influences habitats and the morphology of the delta is useful for planning for salmon recovery, agricultural resilience, flood risk protection, and coastal change associated with sea-level rise.

  15. Supplementary materials for the paper: "Designing Multi-Modal Communication...

    • data.4tu.nl
    zip
    Updated Oct 15, 2025
    Cite
    Ruolin Gao; Haoyu Liu; Pavlo Bazilinskyy; Marieke Martens (2025). Supplementary materials for the paper: "Designing Multi-Modal Communication for Merge Negotiation with Automated Vehicles: Insights from a Design Exploration with Prototypes" [Dataset]. http://doi.org/10.4121/be60bbb2-5f7d-4ac9-b755-2fae8ffe061c.v1
    Available download formats: zip
    Dataset updated
    Oct 15, 2025
    Dataset provided by
    4TU.ResearchData
    Authors
    Ruolin Gao; Haoyu Liu; Pavlo Bazilinskyy; Marieke Martens
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Supplementary materials for the paper: Gao, R., Liu, H., Bazilinskyy, P., & Martens, M. (2025, September). Designing Multi-Modal Communication for Merge Negotiation with Automated Vehicles: Insights from a Design Exploration with Prototypes. In Adjunct Proceedings of the 17th International Conference on Automotive User Interfaces and Interactive Vehicular Applications (pp. 40-46).

    Supplementary materials include Arduino code for simple physical interfaces and video stimuli that depict merge scenarios from a driver’s perspective with the corresponding visual HMI (e.g., HUD) where applicable.

  16. Merging Resource Availability with Isotope Mixing Models: The Role of...

    • plos.figshare.com
    ai
    Updated Jun 1, 2023
    Cite
    Justin D. Yeakel; Mark Novak; Paulo R. Guimarães Jr.; Nathaniel J. Dominy; Paul L. Koch; Eric J. Ward; Jonathan W. Moore; Brice X. Semmens (2023). Merging Resource Availability with Isotope Mixing Models: The Role of Neutral Interaction Assumptions [Dataset]. http://doi.org/10.1371/journal.pone.0022015
    Available download formats: ai
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOS ONE
    Authors
    Justin D. Yeakel; Mark Novak; Paulo R. Guimarães Jr.; Nathaniel J. Dominy; Paul L. Koch; Eric J. Ward; Jonathan W. Moore; Brice X. Semmens
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Background: Bayesian mixing models have allowed for the inclusion of uncertainty and prior information in the analysis of trophic interactions using stable isotopes. Formulating prior distributions is relatively straightforward when incorporating dietary data. However, the use of data that are related, but not directly proportional, to diet (such as prey availability data) is often problematic because such information is not necessarily predictive of diet, and the information required to build a reliable prior distribution for all prey species is often unavailable. Omitting prey availability data impacts the estimation of a predator's diet and introduces the strong assumption of consumer ultrageneralism (where all prey are consumed in equal proportions), particularly when multiple prey have similar isotope values. Methodology: We develop a procedure to incorporate prey availability data into Bayesian mixing models conditional on the similarity of isotope values between two prey. If a pair of prey have similar isotope values (resulting in highly uncertain mixing model results), our model increases the weight of availability data in estimating the contribution of prey to a predator's diet. We test the utility of this method in an intertidal community against independently measured feeding rates. Conclusions: Our results indicate that our weighting procedure increases the accuracy by which consumer diets can be inferred in situations where multiple prey have similar isotope values. This suggests that the exchange of formalism for predictive power is merited, particularly when the relationship between prey availability and a predator's diet cannot be assumed for all species in a system.

  17. Code for 'Food-insecure women eat a less diverse diet in a more temporally...

    • zenodo.org
    Updated Jan 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Daniel Nettle; Melissa Bateson; Daniel Nettle; Melissa Bateson (2020). Code for 'Food-insecure women eat a less diverse diet in a more temporally variable way: Evidence from the US National Health and Nutrition Examination Survey, 2013-4' [Dataset]. http://doi.org/10.5281/zenodo.2649032
    Dataset updated
    Jan 24, 2020
    Dataset provided by
    Zenodo (http://zenodo.org/)
    Authors
    Daniel Nettle; Melissa Bateson; Daniel Nettle; Melissa Bateson
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Code to reproduce the analyses in the study 'Food-insecure women eat a less diverse diet in a more temporally variable way: Evidence from the US National Health and Nutrition Examination Survey, 2013-4'

    The analysis requires two R scripts available here, plus original 2013-4 NHANES data files, downloadable from the NHANES website (https://wwwn.cdc.gov/nchs/nhanes/continuousnhanes/default.aspx?BeginYear=2013).

    The first R script, 'merging script.r', takes the original NHANES files, extracts the variables required for the study, merges them into a single data frame, and saves this in .csv format (a minimal sketch of this kind of merge is given after the list of files). The NHANES files it requires are:

    # Demographics, food insecurity and BMI
    DEMO_H.XPT
    FSQ_H.XPT
    BMX_H.XPT

    # Summary files of food recalls
    DR1TOT_H.XPT
    DR2TOT_H.XPT

    # Individual foods files from food recalls
    DR1FF_H.XPT
    DR2FF_H.XPT
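
    A minimal, hedged sketch of the summary-file merge that 'merging script.r' performs (the real script selects many more variables and also handles the individual foods files; SEQN is the NHANES respondent identifier):

    library(haven)

    demo <- read_xpt("DEMO_H.XPT")
    fsq  <- read_xpt("FSQ_H.XPT")
    bmx  <- read_xpt("BMX_H.XPT")
    dr1  <- read_xpt("DR1TOT_H.XPT")
    dr2  <- read_xpt("DR2TOT_H.XPT")

    # Join the one-row-per-respondent files on SEQN and save as .csv.
    nhanes <- Reduce(function(x, y) merge(x, y, by = "SEQN", all.x = TRUE),
                     list(demo, fsq, bmx, dr1, dr2))
    write.csv(nhanes, "nhanes_merged.csv", row.names = FALSE)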

    The second R script takes the .csv file output by the merging script, and reproduces the analyses described in the paper.

    Uploaded by Daniel Nettle, April 23rd 2019.

  18. SBI Cruise NBP03-04a merged bottle dataset

    • dataone.org
    • arcticdata.io
    • +1more
    Updated Oct 22, 2016
    + more versions
    Cite
    Nick R. Bates; Dennis Hansell; Steven Roberts; Service Group, Scripps Institution of Oceanography, University of California - San Diego (2016). SBI Cruise NBP03-04a merged bottle dataset [Dataset]. http://doi.org/10.5065/D6DR2SK4
    Dataset updated
    Oct 22, 2016
    Dataset provided by
    Arctic Data Center
    Authors
    Nick R. Bates; Dennis Hansell; Steven Roberts; Service Group, Scripps Institution of Oceanography, University of California - San Diego
    Time period covered
    Jul 5, 2003 - Aug 20, 2003
    Area covered
    Description

    This data set contains merged bottle data from the SBI cruise on the United States Coast Guard Cutter (USCGC) Nathaniel B. Palmer (NBP03-04a). During this cruise, rosette casts were conducted and a bottle data file was generated by the Scripps Service group from these water samples. Additional groups were funded to measure supplementary parameters from these same water samples. This data set is the first version of the merge of the Scripps Service group bottle data file with the data gathered by these additional groups.

  19. R scripts used to analyze rodent call statistics generated by 'DeepSqueak'

    • figshare.com
    zip
    Updated May 28, 2021
    Cite
    Mathijs Blom (2021). R scripts used to analyze rodent call statistics generated by 'DeepSqueak' [Dataset]. http://doi.org/10.6084/m9.figshare.14696304.v1
    Available download formats: zip
    Dataset updated
    May 28, 2021
    Dataset provided by
    Figshare (http://figshare.com/)
    Authors
    Mathijs Blom
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    The scripts in this folder were used to combine all call statistic files per day into one file, resulting in nine files containing all call statistics per day. The script ‘merging_dataset.R’ was used to combine all days' worth of call statistics and create subsets of two frequency ranges (18-32 and 32-96). The script ‘camera_data’ was used to combine all camera and observation data.
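
    A hedged sketch of this kind of combination step (folder, file, and column names such as "frequency_khz" are assumptions, not the actual scripts):

    # Combine the per-day DeepSqueak call-statistics exports into one table
    # and split it into the two frequency ranges mentioned above.
    files <- list.files("call_stats", pattern = "\\.csv$", full.names = TRUE)
    calls <- do.call(rbind, lapply(files, read.csv))

    low_band  <- subset(calls, frequency_khz >= 18 & frequency_khz < 32)
    high_band <- subset(calls, frequency_khz >= 32 & frequency_khz <= 96)

    write.csv(low_band,  "calls_18_32.csv", row.names = FALSE)
    write.csv(high_band, "calls_32_96.csv", row.names = FALSE)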

  20. Data from: Transfer learning reveals sequence determinants of the...

    • zenodo.org
    application/gzip, zip
    Updated May 29, 2024
    Cite
    Sahin Naqvi; Sahin Naqvi (2024). Transfer learning reveals sequence determinants of the quantitative response to transcription factor dosage [Dataset]. http://doi.org/10.5281/zenodo.11224809
    Available download formats: application/gzip, zip
    Dataset updated
    May 29, 2024
    Dataset provided by
    Zenodo (http://zenodo.org/)
    Authors
    Sahin Naqvi; Sahin Naqvi
    License

    Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Processed data and code for "Transfer learning reveals sequence determinants of the quantitative response to transcription factor dosage," Naqvi et al 2024.

    Directory is organized into 4 subfolders, each tar'ed and gzipped:

    data_analysis.tar.gz - Processed data for modulation of TWIST1 levels and calculation of RE responsiveness to TWIST1 dosage

    • atac_design.txt - design matrix for ATAC-seq TWIST1 titration samples
    • all.sub.150bpclust.greater2.500bp.merge.TWIST1.titr.ATAC.counts.txt - ATAC-seq counts from all samples over all reproducible ATAC-seq peak regions, as defined in Naqvi et al 2023
    • atac_deseq_fitmodels_moded50.R - R code for calculating new version of ED50 and response to full depletion from TWIST1 titration data (note, uses drm.R function from 10.5281/zenodo.7689948, install drc() with this version to avoid errors)

    baseline_models.tar.gz - Code and data for training baseline models to predict RE responsiveness to SOX9/TWIST1 dosage

    • {sox9|twist1}.{0v100|ed50}.{train|valid|test}.txt - Training/testing/validation data (ED50 or full TF depletion effect for SOX9 or TWIST1), split into train/test/validation folds
    • HOCOMOCOv11_core_HUMAN_mono_jaspar_format.all.sub.150bpclust.greater2.500bp.merge.minus300bp.p01.maxscore.mat.cpg.gc.basemean.txt.gz - matrix of predictors for all REs. Quantitative encoding of PWM match for all HOCOMOCO motifs + CpG + GC content, plus unperturbed ATAC-seq signal
    • train_baseline.R - R code to train baseline (LASSO regression or random forest) models using predictor matrix and the provided training data.
      • Note: training the random forest to predict full TF depletion is computationally intensive because it is across all REs, if doing this run on CPU for ~6 hrs.

    chrombpnet_models.tar.gz - Remainder of code, data, and models for fine-tuning and interpreting ChromBPNet models to predict RE responsiveness to SOX9/TWIST1 dosage

    • Fine-tuning code, data, models
      • {all|sox9.direct|twist1.bound.down}.{train|valid|test}.{ed50|0v100.log2fc}.txt - Training/testing/validation data (ED50 or full TF depletion effect for SOX9 or TWIST1), split into train/test/validation folds
      • pretrained.unperturbed.chrombpnet.h5 - Pretrained model of unperturbed ATAC-seq signal in CNCCs, obtained by running ChromBPNet (https://github.com/kundajelab/chrombpnet) on DMSO-treated SOX9/TWIST1-tagged ATAC-seq data
      • finetune_chrombpnet.py - code for fine-tuning the pretrained model for any of the relevant prediction tasks (ED50/ effect of full TF depletion for SOX9/TWIST1)
      • best.model.chrombpnet.{0v100|ed50}.{sox9|twist1}.h5 - output of finetune_chrombpnet.py, best model after 10 training epochs for the indicated task
      • chrombpnet.{0v100|ed50}.{sox9|twist1}.contrib.{h5|bw} - contribution scores for the indicated predictive model, obtained by running chrombpnet contribs_bw on the corresponding model h5 file.
      • chrombpnet.{0v100|ed50}.{sox9|twist1}.contrib.modisco.{h5|bw} - TF-MoDIsCo output from the corresponding contribution score file
    • Interpretation code, data, models
      • contrib_h5_to_projshap_npy.py - code to convert contrib .h5 files into .npy files containing projected SHAP scores (required because the CWM matching code takes this format of contribution scores)
      • sox9.direct.10col.bed, twist1.bound.down.10col.uniq.bed - regions over which CWMs will be matched (likely direct targets of each TF)
      • match_cwms.py - Python code to match individual CWM instances. Takes as input: modisco .h5 file, SHAP .npy file, bed file of regions to be matched. Output is a bed file of all CWM matches (not pruned, contains many redundant matches).
      • chrombpnet.ed50.{sox9|twist1}.contrib.perc05.matchperc10.allmatch.bed - output of match_cwms.py
      • take_max_overlap.py - code to merge output of match_cwms.py into clusters, and then take the maximum (length-normalized) match score in each cluster as the representative CWM match of that cluster. Requires upstream bedtools commands to be piped in, see example usage in file.
      • chrombpnet.ed50.{sox9|twist1}.contrib.perc05.matchperc10.allmatch.maxoverlap.bed - output of take_max_overlap.py. These CWM instances are the ones used throughout the paper.

    modisco_reports.zip - TF-MoDIsCo reports from running on the fine-tuned ChromBPNet models

    • modisco_report_{sox9|twist1}_{0v100|ed50}: folders containing images of discovered CWMs and HTMLs/PDFs of summarized reports from running TF-MoDisCo on the indicated fine-tuned ChromBPNet model

    mirny_model.tar.gz - Code and data for analyzing and fitting Mirny model of TF-nucleosome competition to observed RE dosage response curves

    • twist1.strong.multi.only.ed50.cutoff.true.hill.txt - ED50 and signed hill coefficients for all TWIST1-dependent REs with only buffering Coordinators (mostly one or two) and no other TFs' binding sites. "ed50_new" is the ED50 calculation used in this paper.
    • twist1.strong.weak{1|2|3}.ed50.cutoff.true.hill.txt - ED50 and signed hill coefficients for all TWIST1-dependent REs with only buffering Coordinators (mostly one or two) and the indicated number of sensitizing (weak) Coordinators and no other TFs' binding sites. "ed50_new" is the ED50 calculation used in this paper.
    • MirnyModelAnalysis.py - Python code for analysis of Mirny model of TF-nucleosome competition. Contains implementations of analytic solutions, as well as code to fit model to observed ED50 and hill coefficients in the provided data files.