100+ datasets found
  1. MERGE Dataset (INCOMPLETE. SEE V1.1)

    • zenodo.org
    Updated Feb 7, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pedro Lima Louro; Pedro Lima Louro; Hugo Redinho; Hugo Redinho; Ricardo Santos; Ricardo Santos; Ricardo Malheiro; Ricardo Malheiro; Renato Panda; Renato Panda; Rui Pedro Paiva; Rui Pedro Paiva (2025). MERGE Dataset (INCOMPLETE. SEE V1.1) [Dataset]. http://doi.org/10.5281/zenodo.13904708
    Explore at:
    Dataset updated
    Feb 7, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Pedro Lima Louro; Pedro Lima Louro; Hugo Redinho; Hugo Redinho; Ricardo Santos; Ricardo Santos; Ricardo Malheiro; Ricardo Malheiro; Renato Panda; Renato Panda; Rui Pedro Paiva; Rui Pedro Paiva
    License

    Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
    License information was derived automatically

    Description

    The MERGE dataset is a collection of audio, lyrics, and bimodal datasets for conducting research on Music Emotion Recognition. A complete version is provided for each modality. The audio datasets provide 30-second excerpts for each sample, while full lyrics are provided in the relevant datasets. The amount of available samples in each dataset is the following:

    • MERGE Audio Complete: 3554
    • MERGE Audio Balanced: 3232
    • MERGE Lyrics Complete: 2568
    • MERGE Lyrics Balanced: 2400
    • MERGE Bimodal Complete: 2216
    • MERGE Bimodal Balanced: 2000

    Additional Contents

    Each dataset contains the following additional files:

    • av_values: File containing the arousal and valence values for each sample sorted by their identifier;
    • tvt_dataframes: Train, validate, and test splits for each dataset. Both a 70-15-15 and a 40-30-30 split are provided.

    Metadata

    A metadata spreadsheet is provided for each dataset with the following information for each sample, if available:

    • Song (Audio and Lyrics datasets) - Song identifiers. Identifiers starting with MT were extracted from the AllMusic platform, while those starting with A or L were collected from private collections;
    • Quadrant - Label corresponding to one of the four quadrants from Russell's Circumplex Model;
    • AllMusic Id - For samples starting with A or L, the matching AllMusic identifier is also provided. This was used to complement the available information for the samples originally obtained from the platform;
    • Artist - First performing artist or band;
    • Title - Song title;
    • Relevance - AllMusic metric representing the relevance of the song in relation to the query used;
    • Duration - Song length in seconds;
    • Moods - User-generated mood tags extracted from the AllMusic platform and available in Warriner's affective dictionary;
    • MoodsAll - User-generated mood tags extracted from the AllMusic platform;
    • Genres - User-generated genre tags extracted from the AllMusic platform;
    • Themes - User-generated theme tags extracted from the AllMusic platform;
    • Styles - User-generated style tags extracted from the AllMusic platform;
    • AppearancesTrackIDs - All AllMusic identifiers related with a sample;
    • Sample - Availability of the sample in the AllMusic platform;
    • SampleURL - URL to the 30-second excerpt in AllMusic;
    • ActualYear - Year of song release

    Acknowledgements

    This work is funded by FCT - Foundation for Science and Technology, I.P., within the scope of the projects: MERGE - DOI: 10.54499/PTDC/CCI-COM/3171/2021 financed with national funds (PIDDAC) via the Portuguese State Budget; and project CISUC - UID/CEC/00326/2020 with funds from the European Social Fund, through the Regional Operational Program Centro 2020.

    Renato Panda was supported by Ci2 - FCT UIDP/05567/2020.

  2. R

    Merge Dataset

    • universe.roboflow.com
    zip
    Updated Jun 18, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AirPlane (2023). Merge Dataset [Dataset]. https://universe.roboflow.com/airplane-mhpdr/merge-dataset-9tjjp/dataset/3
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 18, 2023
    Dataset authored and provided by
    AirPlane
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Variables measured
    Airplane Bounding Boxes
    Description

    Merge Dataset

    ## Overview
    
    Merge Dataset is a dataset for object detection tasks - it contains Airplane annotations for 1,090 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [Public Domain license](https://creativecommons.org/licenses/Public Domain).
    
  3. R

    Project Merge Dataset

    • universe.roboflow.com
    zip
    Updated Aug 10, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Swetha (2022). Project Merge Dataset [Dataset]. https://universe.roboflow.com/swetha-ksc31/project-merge
    Explore at:
    zipAvailable download formats
    Dataset updated
    Aug 10, 2022
    Dataset authored and provided by
    Swetha
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    Actions Bounding Boxes
    Description

    PROJECT MERGE

    ## Overview
    
    PROJECT MERGE is a dataset for object detection tasks - it contains Actions annotations for 3,532 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  4. TRACE-A Merge Data

    • s.cnmilf.com
    • gimi9.com
    • +3more
    Updated Aug 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NASA/LARC/SD/ASDC (2025). TRACE-A Merge Data [Dataset]. https://s.cnmilf.com/user74170196/https/catalog.data.gov/dataset/trace-a-merge-data-204e5
    Explore at:
    Dataset updated
    Aug 21, 2025
    Dataset provided by
    NASAhttp://nasa.gov/
    Description

    TRACE-A_Merge_Data is merge data files created from data collected onboard the DC-8 aircraft during the Transport and Atmospheric Chemistry near the Equator - Atlantic (TRACE-A) suborbital campaign. Data collection for this product is complete.The TRACE-A mission was a part of NASA’s Global Tropospheric Experiment (GTE) – an assemblage of missions conducted from 1983-2001 with various research goals and objectives. TRACE-A was conducted in the Atlantic from September 21 to October 24, 1992. TRACE-A had the objective of determining the cause and source of the high concentrations of ozone that accumulated over the Atlantic Ocean between southern Africa and South America from August to October. NASA partnered with the Brazilian Space Agency (INPE) to accomplish this goal.  The NASA DC-8 aircraft and ozonesondes were utilized during TRACE-A to collect the necessary data. The DC-8 was equipped with 19 instruments. A few instruments on the DC-8 include the Differential Absorption Lidar (DIAL), the Laser-Induced Fluorescence, the O3-NO Ethylene/Forward Scattering Spectrometer, the Modified Licor, and the DACOM IR Laser Spectrometer. The DIAL was responsible for a variety of measurements, which include Nadir IR aerosols, Nadir UV aerosols, Zenith IR aerosols, Zenith VS aerosols, ozone, and ozone column. The Laser-Induced Fluorescence instrument collected measurements on NxOy in the atmosphere. Measurements of ozone were recorded by the O3-NO Ethylene/Forward Scattering Spectrometer while the Modified Licor recorded CO2. Finally, the DACOM IR Laser Spectrometer gathered an assortment of data points, including CO, O3, N2O, CH4, and CO2. Ozonesondes played a role in data collection for TRACE-A along with the DC-8 aircraft. The sondes were dropped from the DC-8 aircraft in order to gather data on ozone, temperature, and atmospheric pressure.

  5. R

    Tdl Merge Dataset

    • universe.roboflow.com
    zip
    Updated Apr 10, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chirag (2024). Tdl Merge Dataset [Dataset]. https://universe.roboflow.com/chirag-3o2dd/tdl-merge
    Explore at:
    zipAvailable download formats
    Dataset updated
    Apr 10, 2024
    Dataset authored and provided by
    Chirag
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Variables measured
    ... Bounding Boxes
    Description

    TDL Merge

    ## Overview
    
    TDL Merge is a dataset for object detection tasks - it contains ... annotations for 2,220 images.
    
    ## Getting Started
    
    You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
    
      ## License
    
      This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
    
  6. w

    Websites using merge-options

    • webtechsurvey.com
    csv
    Updated Dec 23, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    WebTechSurvey (2023). Websites using merge-options [Dataset]. https://webtechsurvey.com/technology/merge-options
    Explore at:
    csvAvailable download formats
    Dataset updated
    Dec 23, 2023
    Dataset authored and provided by
    WebTechSurvey
    License

    https://webtechsurvey.com/termshttps://webtechsurvey.com/terms

    Time period covered
    2025
    Area covered
    Global
    Description

    A complete list of live websites using the merge-options technology, compiled through global website indexing conducted by WebTechSurvey.

  7. h

    indo-merge-dataset-3

    • huggingface.co
    Updated Apr 26, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bagas S (2025). indo-merge-dataset-3 [Dataset]. https://huggingface.co/datasets/bagasshw/indo-merge-dataset-3
    Explore at:
    Dataset updated
    Apr 26, 2025
    Authors
    Bagas S
    Description

    bagasshw/indo-merge-dataset-3 dataset hosted on Hugging Face and contributed by the HF Datasets community

  8. d

    MeSH 2025 Update - Merge Report

    • catalog.data.gov
    • datadiscovery.nlm.nih.gov
    • +2more
    Updated May 30, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Library of Medicine (2025). MeSH 2025 Update - Merge Report [Dataset]. https://catalog.data.gov/dataset/mesh-2023-update-merge-report
    Explore at:
    Dataset updated
    May 30, 2025
    Dataset provided by
    National Library of Medicine
    Description

    (Includes MeSH 2023 and 2024 changes) NOTE - There are no Merges for 2025 MeSH. The MeSH 2025 Update - Merge Report describes cases where two or more terms have merged into a single term. The term(s) being merged into another term become(s) an Entry Term. Merges can be between Descriptors and Supplementary Concept Records (SCRs), between Descriptors, or between SCRs. This report includes MeSH changes from previous years, starting from 2023.

  9. Data from: ORACLES Merge Data Files

    • catalog.data.gov
    • s.cnmilf.com
    • +2more
    Updated Aug 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NASA/LARC/SD/ASDC (2025). ORACLES Merge Data Files [Dataset]. https://catalog.data.gov/dataset/oracles-merge-data-files
    Explore at:
    Dataset updated
    Aug 21, 2025
    Dataset provided by
    NASAhttp://nasa.gov/
    Description

    ORACLES_Merge_Data are pre-generated aircraft merge data files created utilizing data collected during the ObseRvations of Aerosols above CLouds and their intEractionS (ORACLES) campaign. These measurements were collected from August 19, 2016 – October 27, 2016, August 1, 2017 – September 4, 2017 and September 21, 2018 – October 27, 2018. ORACLES provides multi-year airborne observations over the complete vertical column of key parameters that drive aerosol-cloud interactions in the southeast Atlantic, an area with some of the largest inter-model differences in aerosol forcing assessments on the planet. The P-3 Orion aircraft was utilized as a low-flying platform for simultaneous in situ and remote sensing measurements of aerosols and clouds and was supplemented by ER-2 remote sensing during the 2016 campaign. Data collection for this product is complete.Southern Africa produces almost one-third of the Earth’s biomass burning aerosol particles. The ORACLES (ObseRvations of Aerosols above CLouds and their intEractionS) experiment was a five year investigation with three intensive observation periods (August 19, 2016 – October 27, 2016; August 1, 2017 – September 4, 2017; September 21, 2018 – October 27, 2018) and was designed to study key processes that determine the climate impacts of African biomass burning aerosols. ORACLES provided multi-year airborne observations over the complete vertical column of the key parameters that drive aerosol-cloud interactions in the southeast Atlantic, an area with some of the largest inter-model differences in aerosol forcing assessments. These inter-model differences in aerosol and cloud distributions, as well as their combined climatic effects in the SE Atlantic are partly due to the persistence of aerosols above clouds. The varying separation of cloud and aerosol layers sampled during ORACLES allow for a process-oriented understanding of how variations in radiative heating profiles impact cloud properties, which is expected to improve model simulations for other remote regions experience long-range aerosol transport above clouds. ORACLES utilized two NASA aircraft, the P-3 and ER-2. The P-3 was used as a low-flying platform for simultaneous in situ and remote sensing measurements of aerosols and clouds in all three campaigns, supplemented by ER-2 remote sensing in 2016. ER-2 observations will be used to enhance satellite-based remote sensing by resolving variability within a particular scene, and by guiding the development of new and improved remote sensing techniques.

  10. Data from: DC3 Merge Data Files

    • catalog.data.gov
    • res1catalogd-o-tdatad-o-tgov.vcapture.xyz
    • +1more
    Updated Jul 11, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NASA/LARC/SD/ASDC (2025). DC3 Merge Data Files [Dataset]. https://catalog.data.gov/dataset/dc3-merge-data-files-e9aa5
    Explore at:
    Dataset updated
    Jul 11, 2025
    Dataset provided by
    NASAhttp://nasa.gov/
    Description

    DC3_Merge_Data are pre-generated merge data files collected during the Deep Convective Clouds and Chemistry (DC3) field campaign. This product contains merged data products collected from instruments onboard the DC-8, NSF/NCAR GV-HIAPER, and DLR-Falcon aircrafts. Data collection for this product is complete.The Deep Convective Clouds and Chemistry (DC3) field campaign sought to understand the dynamical, physical, and lightning processes of deep, mid-latitude continental convective clouds and to define the impact of these clouds on upper tropospheric composition and chemistry. DC3 was conducted from May to June 2012 with a base location of Salina, Kansas. Observations were conducted in northeastern Colorado, west Texas to central Oklahoma, and northern Alabama in order to provide a wide geographic sample of storm types and boundary layer compositions, as well as to sample convection.DC3 had two primary science objectives. The first was to investigate storm dynamics and physics, lightning and its production of nitrogen oxides, cloud hydrometeor effects on wet deposition of species, surface emission variability, and chemistry in anvil clouds. Observations related to this objective focused on the early stages of active convection. The second objective was to investigate changes in upper tropospheric chemistry and composition after active convection. Observations related to this objective focused on the 12-48 hours following convection. This objective also served to explore seasonal change of upper tropospheric chemistry.In addition to using the NSF/NCAR Gulfstream-V (GV) aircraft, the NASA DC-8 was used during DC3 to provide in-situ measurements of the convective storm inflow and remotely-sensed measurements used for flight planning and column characterization. DC3 utilized ground-based radar networks spread across its observation area to measure the physical and kinematic characteristics of storms. Additional sampling strategies relied on lightning mapping arrays, radiosondes, and precipitation collection. Lastly, DC3 used data collected from various satellite instruments to achieve its goals, focusing on measurements from CALIOP onboard CALIPSO and CPL onboard CloudSat. In addition to providing an extensive set of data related to deep, mid-latitude continental convective clouds and analyzing their impacts on upper tropospheric composition and chemistry, DC3 improved models used to predict convective transport. DC3 improved knowledge of convection and chemistry, and provided information necessary to understanding the processes relating to ozone in the upper troposphere.

  11. h

    merge-with-keys-2

    • huggingface.co
    Updated Jun 1, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pierre-Louis B (2025). merge-with-keys-2 [Dataset]. https://huggingface.co/datasets/PLB/merge-with-keys-2
    Explore at:
    Dataset updated
    Jun 1, 2025
    Authors
    Pierre-Louis B
    Description

    merge-with-keys-2

    This dataset was generated using a phospho starter pack. This dataset contains a series of episodes recorded with a robot and multiple cameras. It can be directly used to train a policy using imitation learning. It's compatible with LeRobot and RLDS.

  12. h

    java-merge-dataset

    • huggingface.co
    Updated Sep 2, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Madhan MMR (2023). java-merge-dataset [Dataset]. https://huggingface.co/datasets/madhan2301/java-merge-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 2, 2023
    Authors
    Madhan MMR
    Description

    madhan2301/java-merge-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community

  13. h

    merge-bench-core

    • huggingface.co
    Updated Jul 30, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    zap (2025). merge-bench-core [Dataset]. https://huggingface.co/datasets/kaizen9/merge-bench-core
    Explore at:
    Dataset updated
    Jul 30, 2025
    Authors
    zap
    Description

    kaizen9/merge-bench-core dataset hosted on Hugging Face and contributed by the HF Datasets community

  14. Z

    Dataset of merge conflicts collected from GitHub repositories

    • data.niaid.nih.gov
    • zenodo.org
    Updated Apr 16, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Victor Cacciari Miraldo (2020). Dataset of merge conflicts collected from GitHub repositories [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_3751037
    Explore at:
    Dataset updated
    Apr 16, 2020
    Dataset authored and provided by
    Victor Cacciari Miraldo
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Within each nested folder of the archive you will find files A,O,B and M. They each represent a conflict where file O was altered in two different ways, resulting in A and B. Finally, a developer solved the merge conflict committing M as the solution.

    We have selected these by manually searching for a programming language on GitHub and selecting those repositories that had a large number of forks, commits and contributors.

  15. h

    merge

    • huggingface.co
    Updated Feb 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vova (2024). merge [Dataset]. https://huggingface.co/datasets/ddosxd/merge
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 13, 2024
    Authors
    Vova
    Description

    Что это/откуда тут чаты?

    Это мерж датасетов

    ultrachat no_robots SiberiaSoft/SiberianPersonaChat russian_dialogues

      Зачем?
    

    я это сделал для своей попытки файнтюна мистрала 7б

      Формат
    

    датасет собран в формате сообщений оаи [ { 'role':'user', 'content':'...' }, { 'role':'assistant, 'content':'...' } ]

    и храниться файликом .jsonl. вот так его переконвертировать в формат антропиков (ваще не антропиков ибо у них не… See the full description on the dataset page: https://huggingface.co/datasets/ddosxd/merge.

  16. w

    Websites using merge-images

    • webtechsurvey.com
    csv
    Updated Dec 27, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    WebTechSurvey (2023). Websites using merge-images [Dataset]. https://webtechsurvey.com/technology/merge-images
    Explore at:
    csvAvailable download formats
    Dataset updated
    Dec 27, 2023
    Dataset authored and provided by
    WebTechSurvey
    License

    https://webtechsurvey.com/termshttps://webtechsurvey.com/terms

    Time period covered
    2025
    Area covered
    Global
    Description

    A complete list of live websites using the merge-images technology, compiled through global website indexing conducted by WebTechSurvey.

  17. ARISE C-130 Aircraft Merge Data Files - Dataset - NASA Open Data Portal

    • data.nasa.gov
    • data.staging.idas-ds1.appdat.jsc.nasa.gov
    Updated Apr 1, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    nasa.gov (2025). ARISE C-130 Aircraft Merge Data Files - Dataset - NASA Open Data Portal [Dataset]. https://data.nasa.gov/dataset/arise-c-130-aircraft-merge-data-files-e402f
    Explore at:
    Dataset updated
    Apr 1, 2025
    Dataset provided by
    NASAhttp://nasa.gov/
    Description

    ARISE_Merge_Data_1 is the Arctic Radiation - IceBridge Sea & Ice Experiment (ARISE) 2014 pre-generated aircraft (C-130) merge data files. This product is a result of a joint effort of the Radiation Sciences, Cryospheric Sciences and Airborne Sciences programs of the Earth Science Division in NASA's Science Mission Directorate in Washington. Data collection is complete.ARISE was NASA's first Arctic airborne campaign designed to take simultaneous measurements of ice, clouds and the levels of incoming and outgoing radiation, the balance of which determined the degree of climate warming. Over the past few decades, an increase in global temperatures led to decreased Arctic summer sea ice. Typically, Arctic sea ice reflects sunlight from the Earth. However, a loss of sea ice means there is more open water to absorb heat from the sun, enhancing warming in the region. More open water can also cause the release of more moisture into the atmosphere. This additional moisture could affect cloud formation and the exchange of heat from Earth’s surface to space. Conducted during the peak of summer ice melt (August 28, 2014-October 1, 2014), ARISE was designed to study and collect data on thinning sea ice, measure cloud and atmospheric properties in the Arctic, and to address questions about the relationship between retreating sea ice and the Arctic climate. During the campaign, instruments on NASA’s C-130 aircraft conducted measurements of spectral and broadband radiative flux profiles, quantified surface characteristics, cloud properties, and atmospheric state parameters under a variety of Arctic atmospheric and surface conditions (e.g. open water, sea ice, and land ice). When possible, C-130 flights were coordinated to fly under satellite overpasses. The primary aerial focus of ARISE was over Arctic sea ice and open water, with minor coverage over Greenland land ice. Through these efforts, the ARISE field campaign helped improve cloud and sea ice computer modeling in the Arctic.

  18. h

    variations-merge-test-4

    • huggingface.co
    Updated Jul 27, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jack Vial (2025). variations-merge-test-4 [Dataset]. https://huggingface.co/datasets/jackvial/variations-merge-test-4
    Explore at:
    Dataset updated
    Jul 27, 2025
    Authors
    Jack Vial
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Merged LeRobot Dataset

    This dataset was created by merging multiple LeRobot datasets using the LeRobot merge tool.

      Source Datasets
    

    This merged dataset combines the following 2 datasets:

    jackvial/koch_screwdriver_attach_orange_panel_rse14clean jackvial/koch_screwdriver_attach_orange_panel_e125

      Dataset Statistics
    

    Total Episodes: 139 Total Frames: 26038 Robot Type: koch_screwdriver_follower FPS: 30

      Dataset Structure
    

    meta/info.json: {… See the full description on the dataset page: https://huggingface.co/datasets/jackvial/variations-merge-test-4.

  19. f

    Merge script

    • figshare.com
    txt
    Updated May 30, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jose Aguasvivas; Jon Andoni Duñabeitia (2018). Merge script [Dataset]. http://doi.org/10.6084/m9.figshare.5924797.v2
    Explore at:
    txtAvailable download formats
    Dataset updated
    May 30, 2018
    Dataset provided by
    figshare
    Authors
    Jose Aguasvivas; Jon Andoni Duñabeitia
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Script in R language to load the databases and merge them into an unique file. This script may be modified and extended as needed.

  20. NAAMES C-130 Aircraft Merge Data Files

    • catalog.data.gov
    • cmr.earthdata.nasa.gov
    • +1more
    Updated Aug 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NASA/LARC/SD/ASDC (2025). NAAMES C-130 Aircraft Merge Data Files [Dataset]. https://catalog.data.gov/dataset/naames-c-130-aircraft-merge-data-files-3f82f
    Explore at:
    Dataset updated
    Aug 21, 2025
    Dataset provided by
    NASAhttp://nasa.gov/
    Description

    NAAMES_Merge_Data is the North Atlantic Aerosols and Marine Ecosystems Study (NAAMES) pre-generated aircraft merge data files created using data collected during the NAAMES campaign. NAAMES was a NASA funded Earth-Venture Suborbital (EVS) mission with 4 deployments occurring from 2015-2018. Data collection is complete.The NASA North Atlantic Aerosols and Marine Ecosystems Study (NAAMES) project was the first NASA Earth Venture – Suborbital mission focused on studying the coupled ocean ecosystem and atmosphere. NAAMES utilizes a combination of ship-based, airborne, autonomous sensor, and remote sensing measurements that directly link ocean ecosystem processes, emissions of ocean-generated aerosols and precursor gases, and subsequent atmospheric evolution and processing. Four deployments coincide with the seasonal cycle of phytoplankton in the North Atlantic Ocean: the Winter Transition (November 5 – December 2, 2015), the Bloom Climax (May 11 – June 5, 2016), the Deceleration Phase (August 30 – September 24, 2017), and the Acceleration Phase (March 20 – April 13, 2018). Ship-based measurements were conducted from the Woods Hole Oceanographic Institution Research Vessel Atlantis in the middle of the North Atlantic Ocean, while airborne measurements were conducted on a NASA Wallops Flight Facility C-130 Hercules that was based at St. John's International Airport, Newfoundland, Canada. Data products in the ASDC archive focus on the NAAMES atmospheric aerosol, cloud, and trace gas data from the ship and aircraft, as well as related satellite and model data subsets. While a few ocean-remote sensing data products (e.g., from the high-spectral resolution lidar) are also included in the ASDC archive, most ocean data products reside in a companion archive at SeaBass.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Pedro Lima Louro; Pedro Lima Louro; Hugo Redinho; Hugo Redinho; Ricardo Santos; Ricardo Santos; Ricardo Malheiro; Ricardo Malheiro; Renato Panda; Renato Panda; Rui Pedro Paiva; Rui Pedro Paiva (2025). MERGE Dataset (INCOMPLETE. SEE V1.1) [Dataset]. http://doi.org/10.5281/zenodo.13904708
Organization logo

MERGE Dataset (INCOMPLETE. SEE V1.1)

Explore at:
Dataset updated
Feb 7, 2025
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Pedro Lima Louro; Pedro Lima Louro; Hugo Redinho; Hugo Redinho; Ricardo Santos; Ricardo Santos; Ricardo Malheiro; Ricardo Malheiro; Renato Panda; Renato Panda; Rui Pedro Paiva; Rui Pedro Paiva
License

Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically

Description

The MERGE dataset is a collection of audio, lyrics, and bimodal datasets for conducting research on Music Emotion Recognition. A complete version is provided for each modality. The audio datasets provide 30-second excerpts for each sample, while full lyrics are provided in the relevant datasets. The amount of available samples in each dataset is the following:

  • MERGE Audio Complete: 3554
  • MERGE Audio Balanced: 3232
  • MERGE Lyrics Complete: 2568
  • MERGE Lyrics Balanced: 2400
  • MERGE Bimodal Complete: 2216
  • MERGE Bimodal Balanced: 2000

Additional Contents

Each dataset contains the following additional files:

  • av_values: File containing the arousal and valence values for each sample sorted by their identifier;
  • tvt_dataframes: Train, validate, and test splits for each dataset. Both a 70-15-15 and a 40-30-30 split are provided.

Metadata

A metadata spreadsheet is provided for each dataset with the following information for each sample, if available:

  • Song (Audio and Lyrics datasets) - Song identifiers. Identifiers starting with MT were extracted from the AllMusic platform, while those starting with A or L were collected from private collections;
  • Quadrant - Label corresponding to one of the four quadrants from Russell's Circumplex Model;
  • AllMusic Id - For samples starting with A or L, the matching AllMusic identifier is also provided. This was used to complement the available information for the samples originally obtained from the platform;
  • Artist - First performing artist or band;
  • Title - Song title;
  • Relevance - AllMusic metric representing the relevance of the song in relation to the query used;
  • Duration - Song length in seconds;
  • Moods - User-generated mood tags extracted from the AllMusic platform and available in Warriner's affective dictionary;
  • MoodsAll - User-generated mood tags extracted from the AllMusic platform;
  • Genres - User-generated genre tags extracted from the AllMusic platform;
  • Themes - User-generated theme tags extracted from the AllMusic platform;
  • Styles - User-generated style tags extracted from the AllMusic platform;
  • AppearancesTrackIDs - All AllMusic identifiers related with a sample;
  • Sample - Availability of the sample in the AllMusic platform;
  • SampleURL - URL to the 30-second excerpt in AllMusic;
  • ActualYear - Year of song release

Acknowledgements

This work is funded by FCT - Foundation for Science and Technology, I.P., within the scope of the projects: MERGE - DOI: 10.54499/PTDC/CCI-COM/3171/2021 financed with national funds (PIDDAC) via the Portuguese State Budget; and project CISUC - UID/CEC/00326/2020 with funds from the European Social Fund, through the Regional Operational Program Centro 2020.

Renato Panda was supported by Ci2 - FCT UIDP/05567/2020.

Search
Clear search
Close search
Google apps
Main menu