Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
The MERGE dataset is a collection of audio, lyrics, and bimodal datasets for conducting research on Music Emotion Recognition. A complete version is provided for each modality. The audio datasets provide 30-second excerpts for each sample, while full lyrics are provided in the relevant datasets. The amount of available samples in each dataset is the following:
Each dataset contains the following additional files:
A metadata spreadsheet is provided for each dataset with the following information for each sample, if available:
This work is funded by FCT - Foundation for Science and Technology, I.P., within the scope of the projects: MERGE - DOI: 10.54499/PTDC/CCI-COM/3171/2021 financed with national funds (PIDDAC) via the Portuguese State Budget; and project CISUC - UID/CEC/00326/2020 with funds from the European Social Fund, through the Regional Operational Program Centro 2020.
Renato Panda was supported by Ci2 - FCT UIDP/05567/2020.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
## Overview
Merge Dataset is a dataset for object detection tasks - it contains Airplane annotations for 1,090 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [Public Domain license](https://creativecommons.org/licenses/Public Domain).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
## Overview
PROJECT MERGE is a dataset for object detection tasks - it contains Actions annotations for 3,532 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
TRACE-A_Merge_Data is merge data files created from data collected onboard the DC-8 aircraft during the Transport and Atmospheric Chemistry near the Equator - Atlantic (TRACE-A) suborbital campaign. Data collection for this product is complete.The TRACE-A mission was a part of NASA’s Global Tropospheric Experiment (GTE) – an assemblage of missions conducted from 1983-2001 with various research goals and objectives. TRACE-A was conducted in the Atlantic from September 21 to October 24, 1992. TRACE-A had the objective of determining the cause and source of the high concentrations of ozone that accumulated over the Atlantic Ocean between southern Africa and South America from August to October. NASA partnered with the Brazilian Space Agency (INPE) to accomplish this goal. The NASA DC-8 aircraft and ozonesondes were utilized during TRACE-A to collect the necessary data. The DC-8 was equipped with 19 instruments. A few instruments on the DC-8 include the Differential Absorption Lidar (DIAL), the Laser-Induced Fluorescence, the O3-NO Ethylene/Forward Scattering Spectrometer, the Modified Licor, and the DACOM IR Laser Spectrometer. The DIAL was responsible for a variety of measurements, which include Nadir IR aerosols, Nadir UV aerosols, Zenith IR aerosols, Zenith VS aerosols, ozone, and ozone column. The Laser-Induced Fluorescence instrument collected measurements on NxOy in the atmosphere. Measurements of ozone were recorded by the O3-NO Ethylene/Forward Scattering Spectrometer while the Modified Licor recorded CO2. Finally, the DACOM IR Laser Spectrometer gathered an assortment of data points, including CO, O3, N2O, CH4, and CO2. Ozonesondes played a role in data collection for TRACE-A along with the DC-8 aircraft. The sondes were dropped from the DC-8 aircraft in order to gather data on ozone, temperature, and atmospheric pressure.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
## Overview
TDL Merge is a dataset for object detection tasks - it contains ... annotations for 2,220 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
https://webtechsurvey.com/termshttps://webtechsurvey.com/terms
A complete list of live websites using the merge-options technology, compiled through global website indexing conducted by WebTechSurvey.
bagasshw/indo-merge-dataset-3 dataset hosted on Hugging Face and contributed by the HF Datasets community
(Includes MeSH 2023 and 2024 changes) NOTE - There are no Merges for 2025 MeSH. The MeSH 2025 Update - Merge Report describes cases where two or more terms have merged into a single term. The term(s) being merged into another term become(s) an Entry Term. Merges can be between Descriptors and Supplementary Concept Records (SCRs), between Descriptors, or between SCRs. This report includes MeSH changes from previous years, starting from 2023.
ORACLES_Merge_Data are pre-generated aircraft merge data files created utilizing data collected during the ObseRvations of Aerosols above CLouds and their intEractionS (ORACLES) campaign. These measurements were collected from August 19, 2016 – October 27, 2016, August 1, 2017 – September 4, 2017 and September 21, 2018 – October 27, 2018. ORACLES provides multi-year airborne observations over the complete vertical column of key parameters that drive aerosol-cloud interactions in the southeast Atlantic, an area with some of the largest inter-model differences in aerosol forcing assessments on the planet. The P-3 Orion aircraft was utilized as a low-flying platform for simultaneous in situ and remote sensing measurements of aerosols and clouds and was supplemented by ER-2 remote sensing during the 2016 campaign. Data collection for this product is complete.Southern Africa produces almost one-third of the Earth’s biomass burning aerosol particles. The ORACLES (ObseRvations of Aerosols above CLouds and their intEractionS) experiment was a five year investigation with three intensive observation periods (August 19, 2016 – October 27, 2016; August 1, 2017 – September 4, 2017; September 21, 2018 – October 27, 2018) and was designed to study key processes that determine the climate impacts of African biomass burning aerosols. ORACLES provided multi-year airborne observations over the complete vertical column of the key parameters that drive aerosol-cloud interactions in the southeast Atlantic, an area with some of the largest inter-model differences in aerosol forcing assessments. These inter-model differences in aerosol and cloud distributions, as well as their combined climatic effects in the SE Atlantic are partly due to the persistence of aerosols above clouds. The varying separation of cloud and aerosol layers sampled during ORACLES allow for a process-oriented understanding of how variations in radiative heating profiles impact cloud properties, which is expected to improve model simulations for other remote regions experience long-range aerosol transport above clouds. ORACLES utilized two NASA aircraft, the P-3 and ER-2. The P-3 was used as a low-flying platform for simultaneous in situ and remote sensing measurements of aerosols and clouds in all three campaigns, supplemented by ER-2 remote sensing in 2016. ER-2 observations will be used to enhance satellite-based remote sensing by resolving variability within a particular scene, and by guiding the development of new and improved remote sensing techniques.
DC3_Merge_Data are pre-generated merge data files collected during the Deep Convective Clouds and Chemistry (DC3) field campaign. This product contains merged data products collected from instruments onboard the DC-8, NSF/NCAR GV-HIAPER, and DLR-Falcon aircrafts. Data collection for this product is complete.The Deep Convective Clouds and Chemistry (DC3) field campaign sought to understand the dynamical, physical, and lightning processes of deep, mid-latitude continental convective clouds and to define the impact of these clouds on upper tropospheric composition and chemistry. DC3 was conducted from May to June 2012 with a base location of Salina, Kansas. Observations were conducted in northeastern Colorado, west Texas to central Oklahoma, and northern Alabama in order to provide a wide geographic sample of storm types and boundary layer compositions, as well as to sample convection.DC3 had two primary science objectives. The first was to investigate storm dynamics and physics, lightning and its production of nitrogen oxides, cloud hydrometeor effects on wet deposition of species, surface emission variability, and chemistry in anvil clouds. Observations related to this objective focused on the early stages of active convection. The second objective was to investigate changes in upper tropospheric chemistry and composition after active convection. Observations related to this objective focused on the 12-48 hours following convection. This objective also served to explore seasonal change of upper tropospheric chemistry.In addition to using the NSF/NCAR Gulfstream-V (GV) aircraft, the NASA DC-8 was used during DC3 to provide in-situ measurements of the convective storm inflow and remotely-sensed measurements used for flight planning and column characterization. DC3 utilized ground-based radar networks spread across its observation area to measure the physical and kinematic characteristics of storms. Additional sampling strategies relied on lightning mapping arrays, radiosondes, and precipitation collection. Lastly, DC3 used data collected from various satellite instruments to achieve its goals, focusing on measurements from CALIOP onboard CALIPSO and CPL onboard CloudSat. In addition to providing an extensive set of data related to deep, mid-latitude continental convective clouds and analyzing their impacts on upper tropospheric composition and chemistry, DC3 improved models used to predict convective transport. DC3 improved knowledge of convection and chemistry, and provided information necessary to understanding the processes relating to ozone in the upper troposphere.
merge-with-keys-2
This dataset was generated using a phospho starter pack. This dataset contains a series of episodes recorded with a robot and multiple cameras. It can be directly used to train a policy using imitation learning. It's compatible with LeRobot and RLDS.
madhan2301/java-merge-dataset dataset hosted on Hugging Face and contributed by the HF Datasets community
kaizen9/merge-bench-core dataset hosted on Hugging Face and contributed by the HF Datasets community
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Within each nested folder of the archive you will find files A,O,B and M. They each represent a conflict where file O was altered in two different ways, resulting in A and B. Finally, a developer solved the merge conflict committing M as the solution.
We have selected these by manually searching for a programming language on GitHub and selecting those repositories that had a large number of forks, commits and contributors.
Что это/откуда тут чаты?
Это мерж датасетов
ultrachat no_robots SiberiaSoft/SiberianPersonaChat russian_dialogues
Зачем?
я это сделал для своей попытки файнтюна мистрала 7б
Формат
датасет собран в формате сообщений оаи [ { 'role':'user', 'content':'...' }, { 'role':'assistant, 'content':'...' } ]
и храниться файликом .jsonl. вот так его переконвертировать в формат антропиков (ваще не антропиков ибо у них не… See the full description on the dataset page: https://huggingface.co/datasets/ddosxd/merge.
https://webtechsurvey.com/termshttps://webtechsurvey.com/terms
A complete list of live websites using the merge-images technology, compiled through global website indexing conducted by WebTechSurvey.
ARISE_Merge_Data_1 is the Arctic Radiation - IceBridge Sea & Ice Experiment (ARISE) 2014 pre-generated aircraft (C-130) merge data files. This product is a result of a joint effort of the Radiation Sciences, Cryospheric Sciences and Airborne Sciences programs of the Earth Science Division in NASA's Science Mission Directorate in Washington. Data collection is complete.ARISE was NASA's first Arctic airborne campaign designed to take simultaneous measurements of ice, clouds and the levels of incoming and outgoing radiation, the balance of which determined the degree of climate warming. Over the past few decades, an increase in global temperatures led to decreased Arctic summer sea ice. Typically, Arctic sea ice reflects sunlight from the Earth. However, a loss of sea ice means there is more open water to absorb heat from the sun, enhancing warming in the region. More open water can also cause the release of more moisture into the atmosphere. This additional moisture could affect cloud formation and the exchange of heat from Earth’s surface to space. Conducted during the peak of summer ice melt (August 28, 2014-October 1, 2014), ARISE was designed to study and collect data on thinning sea ice, measure cloud and atmospheric properties in the Arctic, and to address questions about the relationship between retreating sea ice and the Arctic climate. During the campaign, instruments on NASA’s C-130 aircraft conducted measurements of spectral and broadband radiative flux profiles, quantified surface characteristics, cloud properties, and atmospheric state parameters under a variety of Arctic atmospheric and surface conditions (e.g. open water, sea ice, and land ice). When possible, C-130 flights were coordinated to fly under satellite overpasses. The primary aerial focus of ARISE was over Arctic sea ice and open water, with minor coverage over Greenland land ice. Through these efforts, the ARISE field campaign helped improve cloud and sea ice computer modeling in the Arctic.
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Merged LeRobot Dataset
This dataset was created by merging multiple LeRobot datasets using the LeRobot merge tool.
Source Datasets
This merged dataset combines the following 2 datasets:
jackvial/koch_screwdriver_attach_orange_panel_rse14clean jackvial/koch_screwdriver_attach_orange_panel_e125
Dataset Statistics
Total Episodes: 139 Total Frames: 26038 Robot Type: koch_screwdriver_follower FPS: 30
Dataset Structure
meta/info.json: {… See the full description on the dataset page: https://huggingface.co/datasets/jackvial/variations-merge-test-4.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Script in R language to load the databases and merge them into an unique file. This script may be modified and extended as needed.
NAAMES_Merge_Data is the North Atlantic Aerosols and Marine Ecosystems Study (NAAMES) pre-generated aircraft merge data files created using data collected during the NAAMES campaign. NAAMES was a NASA funded Earth-Venture Suborbital (EVS) mission with 4 deployments occurring from 2015-2018. Data collection is complete.The NASA North Atlantic Aerosols and Marine Ecosystems Study (NAAMES) project was the first NASA Earth Venture – Suborbital mission focused on studying the coupled ocean ecosystem and atmosphere. NAAMES utilizes a combination of ship-based, airborne, autonomous sensor, and remote sensing measurements that directly link ocean ecosystem processes, emissions of ocean-generated aerosols and precursor gases, and subsequent atmospheric evolution and processing. Four deployments coincide with the seasonal cycle of phytoplankton in the North Atlantic Ocean: the Winter Transition (November 5 – December 2, 2015), the Bloom Climax (May 11 – June 5, 2016), the Deceleration Phase (August 30 – September 24, 2017), and the Acceleration Phase (March 20 – April 13, 2018). Ship-based measurements were conducted from the Woods Hole Oceanographic Institution Research Vessel Atlantis in the middle of the North Atlantic Ocean, while airborne measurements were conducted on a NASA Wallops Flight Facility C-130 Hercules that was based at St. John's International Airport, Newfoundland, Canada. Data products in the ASDC archive focus on the NAAMES atmospheric aerosol, cloud, and trace gas data from the ship and aircraft, as well as related satellite and model data subsets. While a few ocean-remote sensing data products (e.g., from the high-spectral resolution lidar) are also included in the ASDC archive, most ocean data products reside in a companion archive at SeaBass.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
The MERGE dataset is a collection of audio, lyrics, and bimodal datasets for conducting research on Music Emotion Recognition. A complete version is provided for each modality. The audio datasets provide 30-second excerpts for each sample, while full lyrics are provided in the relevant datasets. The amount of available samples in each dataset is the following:
Each dataset contains the following additional files:
A metadata spreadsheet is provided for each dataset with the following information for each sample, if available:
This work is funded by FCT - Foundation for Science and Technology, I.P., within the scope of the projects: MERGE - DOI: 10.54499/PTDC/CCI-COM/3171/2021 financed with national funds (PIDDAC) via the Portuguese State Budget; and project CISUC - UID/CEC/00326/2020 with funds from the European Social Fund, through the Regional Operational Program Centro 2020.
Renato Panda was supported by Ci2 - FCT UIDP/05567/2020.