Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Within each nested folder of the archive you will find files A,O,B and M. They each represent a conflict where file O was altered in two different ways, resulting in A and B. Finally, a developer solved the merge conflict committing M as the solution.
We have selected these by manually searching for a programming language on GitHub and selecting those repositories that had a large number of forks, commits and contributors.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
## Overview
Merging Raw, Empty, 0.5 Synthetic 2 is a dataset for instance segmentation tasks - it contains Stickers annotations for 1,964 images.
## Getting Started
You can download this dataset for use within your own projects, or fork it into a workspace on Roboflow to create your own model.
## License
This dataset is available under the [CC BY 4.0 license](https://creativecommons.org/licenses/CC BY 4.0).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This resource introduces a merged dataset, integrating Convective Triggering Potential (CTP) and Humidity Index (HI) from three established reanalysis products: the Modern-Era Retrospective Analysis for Research and Applications, version 2 (MERRA2), Climate Forecast System Reanalysis (CFSR), and the European Centre for Medium-Range Weather Forecasts Reanalysis v5 (ERA5). This innovative dataset, crafted using the Triple Collocation (TC) method, addresses the challenges posed using single-source reanalysis data and offers a more reliable representation of atmospheric conditions. It mitigates biases associated with individual datasets and compensates for satellite-derived estimates' shortcomings, such as missing observations and lower vertical resolution. This merged CTP-HI product offers a robust alternative to single-source datasets, enhancing accuracy in characterizing atmospheric conditions and addressing the limitations of satellite-derived data. Verification against the Integrated Global Radiosonde Archive version 2 (IGRA2) in-situ measurements and Atmospheric Infrared Sounder version 7 (AIRSv7) satellite observations ensure reliability for meteorological research. The dataset provides a valuable tool for analyzing atmospheric stability and humidity, with potential implications for weather prediction and climate research.
michaeldinzinger/merged-trecdl dataset hosted on Hugging Face and contributed by the HF Datasets community
This airborne or shipborne geophysical survey recorded the following parameters: Total Field Magnetic. The flight line spacing is 805 m. The survey was flown between 1956-09-01 and 1956-10-31. The data were Digitized from contour maps. Platform: Fixed-wing.
U.S. Government Workshttps://www.usa.gov/government-works
License information was derived automatically
This part of the data release includes 25-m resolution merged multibeam-bathymetry data of the northern portion of the Southern California Continental Borderland. The data are presented as a TIFF file. In February 2016 the University of Washington in cooperation with the U.S. Geological Survey, Pacific Coastal and Marine Science Center (USGS, PCMSC) collected multibeam bathymetry and acoustic backscatter data in Catalina Basin aboard the University of Washington's Research Vessel Thomas G. Thompson. Data were collected using a Kongsberg EM300 multibeam echosounder hull-mounted to the 274-foot R/V Thomas G. Thompson. The USGS, PCMSC processed these data and produced a series of bathymetric surfaces and acoustic backscatter images for scientific research purposes. A 25-m bathymetric surface produced from this work was merged with publically available multibeam bathymetry data, as well as 2015, 2016, and 2017 multibeam bathymetry data collected in the continental borderland region by ...
This work integrated multiple topographic and bathymetric data sources to generate a merged topobathymetric map of western Prince William Sound. We converted all data sources to NAD 83 UTM Zone 6 N and mean higher high water (MHHW) before compiling. In Barry Arm, north of Port Wells, we used a digital terrain model (DTM) derived from subaerial light detection and ranging (lidar) data collected on June 26, 2020, (Daanen and others, 2021) and submarine multibeam sonar bathymetric data collected between August 12 and 23, 2020 (NOAA, 2020). In College Fiord, adjacent to Barry Arm to the east, we used multibeam sonar bathymetric data collected between March 25 and August 26, 2021 (NOAA, 2021). These data were combined at 5 m horizontal resolution. For the subaerial portions of the computational _domain outside of Barry Arm, we used a 5 m interferometric synthetic aperture radar (IFSAR)-derived DTM for Alaska (U.S. Geological Survey, 2018, accessed through Alaska Division of Geological and Geophysical Surveys, 2013). Below the MHHW waterline and outside of Barry Arm and College Fiord, we used one of two existing topobathymetric sources. In Passage Canal, we used an 8/15 arc-second dataset (~12 m grid cells) for Whittier and Passage Canal (NOAA, 2009b). Elsewhere, we used an 8/3 arc-second dataset (~59 m grid cells) for Prince William Sound (NOAA, 2009a). These two topobathymetric datasets were themselves derived from multiple data sources, including, but not limited to: National Ocean Service hydrographic surveys, National Elevation Dataset topography, and digital coastlines datasets. The source data for both topobathymetric datasets were sparse in both deep water and near shore (up to 1.5 km spacing between observations), which necessitated interpolating over those areas. This process, which is detailed by Caldwell and others (2011), gave substantial weight to the shoreline topography in the assignment of interpolated depths in the nearshore zone. Because our results use the more recent and higher resolution IFSAR-derived topography, which has a different shoreline, we re-interpolated a narrow band of nearshore grid cells using a similar methodology. We defined the nearshore re-interpolation zone based on a constant horizontal distance from the edge of valid IFSAR observation. We used a distance of 83 m because it results in the re-interpolation of at least one but no more than two of the 8/3 arc-second topobathymetric grid cells. We first removed any grid cell of either the Prince William Sound topobathymetric dataset (NOAA, 2009a) or the Whittier and Passage Canal topobathymetric dataset (NOAA, 2009b) at its original resolution that overlapped the near-shore re-interpolation zone. After removing the grid cells in the nearshore re-interpolation zone from these two topobathymetric datasets, we bilinearly interpolated and resampled both datasets from their original resolution to match the 5 m resolution of the IFSAR DTM. We then merged the two topobathymetric datasets with the IFSAR DTM. This yielded a 5 m dataset with missing values only in the nearshore re-interpolation zone. We then added the higher resolution and more recent Barry Arm and College Fiord multibeam sonar bathymetric data, as well as the Barry Arm lidar topographic data, directly to the regional dataset in their original footprints. These data were collected closer to shore, well within the re-interpolation zone that we defined for the lower resolution topobathymetric data and had no previously interpolated zones, thus requiring no additional clipping. Accordingly, we allowed the gap between the edge of the multibeam bathymetric data footprints and the lidar- or IFSAR-defined shoreline to represent the interpolation zone for these data. Finally, we interpolated across all the missing nearshore values using bilinear interpolation, thereby generating a single, continuous 5 m topobathymetric raster. References Cited Alaska Division of Geological & Geophysical Surveys [DGGS], 2013, Elevation Datasets of Alaska: Alaska Division of Geological & Geophysical Surveys Digital Data Series 4, https://elevation.alaska.gov/, accessed May 6, 2022, at https://doi.org/10.14509/25239. Caldwell, R. J., Eakins, B. W., and Lim, E., 2011, Digital Elevation Models of Prince William Sound, Alaska: Procedures, Data Sources, and Analysis: NOAA Technical Memorandum NESDIS NGDC-40, accessed June 16, 2022, at https://www.ngdc.noaa.gov/mgg/dat/dems/regional_tr/prince_william_sound_83_mhhw_2009.pdf Daanen, R.P., Wolken, G.J., Wikstrom Jones, K., and Herbst, A.M., 2021, High resolution lidar-derived elevation data for Barry Arm landslide, southcentral Alaska, June 26, 2020: Alaska Division of Geological & Geophysical Surveys Raw Data File 2021–3, 9 p., accessed June 17, 2021, at https://doi.org/10.14509/30593. National Oceanic and Atmospheric Administration [NOAA], 1995, Report for H10655: National Oceanic and Atmospheric Administration [NOAA] web page, accessed July 22, 2021, at https://www.ngdc.noaa.gov/nos/H10001-H12000/H10655.html. National Oceanic and Atmospheric Administration [NOAA], 2020, Report for H13396: National Oceanic and Atmospheric Administration [NOAA] web page, accessed April 5, 2021, at https://www.ngdc.noaa.gov/nos/H12001-H14000/H13396.html National Oceanic and Atmospheric Administration [NOAA], 2021, Report for H13420: National Oceanic and Atmospheric Administration [NOAA] web page, accessed March 15, 2023, at https://www.ngdc.noaa.gov/nos/H12001-H14000/H13420.html. National Oceanic and Atmospheric Administration [NOAA] National Geophysical Data Center, 2009a, Prince William Sound, Alaska 8/3 arc-second MHHW coastal digital elevation model: National Oceanic and Atmospheric Administration [NOAA], National Centers for Environmental Information web page, accessed June 16, 2022, at https://www.ncei.noaa.gov/access/metadata/landing-page/bin/iso?id=gov.noaa.ngdc.mgg.dem:735 National Oceanic and Atmospheric Administration [NOAA] National Geophysical Data Center, 2009b, Whittier, Alaska 8/15 arc-second MHHW coastal digital elevation model: National Oceanic and Atmospheric Administration [NOAA], National Centers for Environmental Information web page, accessed April 5, 2021, at https://www.ncei.noaa.gov/access/metadata/landing-page/bin/iso?id=gov.noaa.ngdc.mgg.dem:530. U.S. Geological Survey, 2018, USGS EROS Archive – Digital Elevation – Interferometric Synthetic Aperture Radar (IFSAR) – Alaska, Accessed May 6, 2022, at https://doi.org/10.5066/P9C064CO.
biglab/jitteredwebsites-merged-224-paraphrased dataset hosted on Hugging Face and contributed by the HF Datasets community
This airborne or shipborne geophysical survey recorded the following parameters: Total Field Magnetic. The flight line spacing is 2000 m for a total of 54785 kilometres. The survey was flown between 1982-04-12 and 1982-05-29. The data were Digitally acquired. Platform: Fixed-wing.
This new merged gridded dataset (M-1B01-2A25-GD) contains precipitation, clouds and atmospheric parameters with 0.25° spatial resolution. It is produced by merging TRMM PR and VIRS measurements with the ERA5 reanalysis dataset at the same spatiotemporal resolution between 40° S and 40° N. The near-surface rain rate, profiles of rain rate and precipitation reflectivity factor, visible and infrared signals and atmospheric parameters (temperature, pressure, geopotential height, specific humidity, divergence, vertical velocity and so on) can be obtained in the dataset. The statistical results indicate that the merging and gridding will not dramatically distort the original data and the new dataset can be used to study the characteristics and distribution of the precipitation and clouds systems. This research was supported by the National Natural Science Foundation of China (Grants 91837310, 41675041)
The Inland Waters dataset (ILW) provides data for lakes and other water bodies across the contiguous United States (CONUS) and Alaska. ILW significantly reduces the processing effort required by end users and is a standardized community resource for lake and reservoir algorithm development and performance assessment.The data is provided for 15,450 CONUS waterbodies with a size of at least one 300 m pixel and over 2,300 resolvable lakes with sizes greater than three 300 m pixels. Alaska has 5,874 lakes resolvable lakes. ILW is a times series containing 10 years of MERIS (2002-2012) and OLCI from both Sentinel-3a (2016-present) and Sentinel-3b (2018-present).
ACSPO Merge VIIRS SST, Great Lakes, 1.3km/pixel (2021-present) cdm_data_type=Other infoUrl=https://coastwatch.glerl.noaa.gov/ institution=CoastWatch Great Lakes Node sourceUrl=(local files) subsetVariables=fileType
Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
This airborne or shipborne geophysical survey recorded the following parameters: Total Field Magnetic. The flight line spacing is 805 m. The survey was flown between 1958-05-01 and 1958-05-31. The data were Digitized from contour maps. Platform: Fixed-wing.
This Dataset is used for training the upcoming LeoPARD reasoning model Today, I’m excited to release VortexHunter23/MERGED-2, a SFT dataset designed for training the LeoPARD shed base model. This dataset can be used by anyone
Disclaimer
As the name suggests. This is a merged dataset and i do not hold any rights (I dont know which all one's i merged so i cant list them here, sry)
If anyone can help for the datasets please reach me out in my hugging face profile or smthg
Closed-TTS/Ja-All-Merged-Qwen3-4B-Tokenized dataset hosted on Hugging Face and contributed by the HF Datasets community
Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
This airborne or shipborne geophysical survey recorded the following parameters: Total Field Magnetic. The flight line spacing is 1609 m. The survey was flown between 1958-05-01 and 1958-05-31. The data were Digitized from contour maps. Platform: Fixed-wing.
https://artefacts.ceda.ac.uk/licences/specific_licences/esacci_ozone_terms_and_conditions.pdfhttps://artefacts.ceda.ac.uk/licences/specific_licences/esacci_ozone_terms_and_conditions.pdf
This dataset comprises the MErged GRIdded Dataset of Ozone Profiles (MEGRIDOP) in the stratosphere with a resolved longitudinal structure, which is derived from data by six limb and occultation satellite instruments: GOMOS, SCIAMACHY and MIPAS on Envisat, OSIRIS on Odin, OMPS on Suomi-NPP, and MLS on Aura. The merged dataset was generated as a contribution to the European Space Agency Climate Change Initiative Ozone project (Ozone_cci). The period of this merged time series of ozone profiles is from late 2001 until the end of 2022.
The monthly mean gridded ozone profiles and deseasonalised anomalies are provided in the altitude range from 10 to 50 km in bins of 10 degree latitude x 20 degree longitude.
For more details please see the associated readme file and Sofieva, V. F., Szeląg, M., Tamminen, J., Kyrölä, E., Degenstein, D., Roth, C., Zawada, D., Rozanov, A., Arosio, C., Burrows, J. P., Weber, M., Laeng, A., Stiller, G. P., von Clarmann, T., Froidevaux, L., Livesey, N., van Roozendael, M. and Retscher, C.: Measurement report: regional trends of stratospheric ozone evaluated using the MErged GRIdded Dataset of Ozone Profiles (MEGRIDOP), Atmos. Chem. Phys., 21(9), 6707–6720, doi:10.5194/acp-21-6707-2021, 2021
The Yukon-Tanana Upland airborne magnetic geophysical data merge provides a merged raster grid of magnetic data for the Yukon-Tanana Upland (YTU), Alaska. This dataset provides a regionally consistent magnetic grid while retaining as much detail as possible of the originating data by leveling all the region's magnetic grids to one another. This 100-m cell-size magnetic grid of the YTU will assist in evaluating the region's geologic structure, geologic processes, tectonic evolution, and mineral resource potential. The grid was compiled from previous studies published by the Alaska Division of Geological & Geophysical Surveys (DGGS).
Open Government Licence - Canada 2.0https://open.canada.ca/en/open-government-licence-canada
License information was derived automatically
This airborne or shipborne geophysical survey recorded the following parameters: Total Field Magnetic. The flight line spacing is 1000 m. The survey was flown between 1985-01-01 and 1985-12-31. The data were Digitally acquired. Platform: Fixed-wing.
This dataset provides information on greenhouse gases and human-produced air pollution, including atmospheric concentrations of carbon dioxide (CO2), methane (CH4), tropospheric ozone (O3), and black carbon (BC) aerosols, collected during airborne campaigns conducted by NASA's Atmospheric Tomography (ATom) mission. This dataset includes merged data from all instruments plus additional data such as numbered profiles and distance flown. Merged data have been created for seven different sampling intervals. In the case of data obtained over longer time intervals (e.g. flask data), the merge files provide (weighted) averages to match the sampling intervals. ATom deploys an extensive gas and aerosol payload on the NASA DC-8 aircraft for a systematic, global-scale sampling of the atmosphere, profiling continuously from 0.2 to 12 km altitude. Flights occurred in each of 4 seasons from 2016 to 2018. Flights originate from the Armstrong Flight Research Center in Palmdale, California, fly north to the western Arctic, south to the South Pacific, east to the Atlantic, north to Greenland, and return to California across central North America. ATom establishes a single, contiguous, global-scale dataset. This comprehensive dataset will be used to improve the representation of chemically reactive gases and short-lived climate forcers in global models of atmospheric chemistry and climate. Profiles of the reactive gases will also provide critical information for the validation of satellite data, particularly in remote areas where in situ data is lacking. Complete aircraft flight information including, but not limited to, latitude, longitude, and altitude are also provided. This data release provides results from all instruments on all four ATom flight campaigns.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Within each nested folder of the archive you will find files A,O,B and M. They each represent a conflict where file O was altered in two different ways, resulting in A and B. Finally, a developer solved the merge conflict committing M as the solution.
We have selected these by manually searching for a programming language on GitHub and selecting those repositories that had a large number of forks, commits and contributors.