Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
License information was derived automatically
196 views (3 recent) Catches in FAO area 27 by country, species, area and year. Source: Eurostat/ICES database on catch statistics - ICES 2011, Copenhagen. Format: Archived dataset in .xls and .csv format. Version 26-06-2019 https://www.ices.dk/data/dataset-collections/Pages/Fish-catch-and-stock-assessment.aspx
Being relatively new to the field, electromechanical actuators in aerospace applications lack the knowledge base compared to ones accumulated for the other actuator types, especially when it comes to fault detection and characterization. Lack of health monitoring data from fielded systems and prohibitive costs of carrying out real flight tests push for the need of building system models and designing affordable but realistic experimental setups. This paper presents our approach to accomplish a comprehensive test environment equipped with fault injection and data collection capabilities. Efforts also include development of multiple models for EMA operations, both in nominal and fault conditions that can be used along with measurement data to generate effective diagnostic and prognostic estimates. A detailed description has been provided about how various failure modes are inserted in the test environment and corresponding data is collected to verify the physics based models under these failure modes that have been developed in parallel. A design of experiment study has been included to outline the details of experimental data collection. Furthermore, some ideas about how experimental results can be extended to real flight environments through actual flight tests and using real flight data have been presented. Finally, the roadmap leading from this effort towards developing successful prognostic algorithms for electromechanical actuators is discussed.*
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Nominal Household consumption distribution by categories, Harvested from Annual Household Survey 2015/16, Government of Nepal, National Planning Commission Secretariat, Central Bureau of Statistics.
These are nominal yield curves, obtained by fitting a parametric form to the prices of off-the-run nominal Treasury coupon securities. The data are available at daily frequency, from 1961 to present.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Total nominal household consumption (Rs. billion), Harvested from Annual Household Survey 2015/16, Government of Nepal, National Planning Commission Secretariat, Central Bureau of Statistics.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
This deposit contains various datasets describing tuna fisheries activities (currently catches and efforts) and different levels of processing on 1° or 5° spatial grids with a monthly temporal resolution. Lower levels of processing have been officially endorsed by FIRMS and are also published on Zenodo : see FIRMS Global Tuna Atlas datasets. Currently, FIRMS datasets only deal with catches and Level 0 data (a global dataset which remains as close as possible from datasets published on tuna RFMOs Website), including a lower spatio-temporal resolution dataset which gives the best estimates of total catches (nominal catches, per year and per ocean).
Data structure
All Global Tuna Atlas datasets comply with a common data format in line with CWP Reference Harmonization standard (https://www.fao.org/3/cc6734en/cc6734en.pdf) which is described in a json file (https://github.com/fdiwg/fdi-formats/blob/main/cwp_rh_generic_gta_taskI.json).
Global Catch dataset (IRD level 2)
IRD Level 2 denotes the series of processing steps applied by the French National Research Institute for Sustainable Development (IRD) to generate this dataset from the primary RFMO catch-and-effort data. Although some steps mirror those used in the FIRMS Level 0 product (DOI: https://doi.org/10.5281/zenodo.5745958), the entire workflow was rerun to integrate early adjustments to IATTC shark and billfish data prior to final aggregation.
This dataset compiles monthly global catch data for tuna, tuna-like species and sharks from 1950 through 2023. Catches are stratified according to the latest CWP standards update :
- month
- species
- gear_type (reporting fishing_gear)
- fishing_fleet (reporting country)
- fishing_mode (type of school used)
- geographic_identifier (1° or 5° grid cell)
- measurement_unit i.e. unit of catch (weight or number)
- measurement (catch)
- measurement_type (landings or retained catches)
- measurement_processing_level (original samples or processed data)
- a `label` column has been added for each field (e.g. `fishing_mode`, `species`, `gear_type`, etc.) to provide clear descriptive metadata
Warning: This dataset is designed to enhance the understanding of fish counts at level 0, and the amount of georeferenced data. It is not suitable for accurately georeferencing data by country or fishing fleet and should not be used for studies on fishing zone legality or quota management. While it offers a georeferenced footprint of captures to reflect reported biomass more closely, significant uncertainty remains regarding the precise locations of the catches.
Global level 2 processing includes the conversion and raising of georeferenced catch data to match nominal dataset values.
To reproduce the data and the workflow we provide a .zip with all the initial data used as well as labeling and the mapping to nominal geometries (see all_rawdata.zip)
Global Effort dataset (IRD Level 0)
We compiled a comprehensive dataset of geo-referenced fishing effort observations from global tuna fisheries, covering the period from 1950 to 2023. These data are collected from the public domain datasets released by the five tuna Regional Fisheries Management Organizations (t-RFMOs): CCSBT, IATTC, ICCAT, IOTC, and WCPFC. As with the catch dataset, the effort data were processed by using the same data generation workflow as the one used for FIRMS-GTA with a different parametrization complying with the standardized data structure promoted by the Coordinating Working Party (CWP) standards for (tuna) fisheries statistics.
Contrariwise to catches, effort values are reported using a significant number of measurement units (23). Only a few mapping between similar tRFMOs units has been managed based on fdiwg codelists (see GitHub repository: https://github.com/fdiwg/fdi-mappings). Each remaining unit reflects different operational aspects depending on the fishing gear, fleet behavior, and the reporting RFMO. The Level 0 global dataset includes all reported units without conversion or aggregation, to preserve the original semantic richness and reflect the heterogeneity in reporting practices.
This IRD Level 0 global effort dataset thus, preserves all original effort records from t-RFMOs and complies with a unified data structure while maintaining the granularity and diversity of reporting. This level of processing is not a standardized or simplified effort dataset. No upper level of processing is currently made available by IRD. Any further aggregation or transformation of effort data should be conducted by the end-user, based on specific scientific goals and with careful consideration of the semantics behind each unit.
Both datasets are enriched with "gear_type_label", "fishing_fleet label", for catch, "species_group" using the FDIWG standards and for efforts "measurement_unit_labels".
Appendix work:
If you are interested in creating a customized version of this Global Tuna Atlas with specific filters or adjustments based on particular issues, please feel free to reach out to us.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset provides values for NOMINAL GDP GROWTH reported in several countries. The data includes current values, previous releases, historical highs and record lows, release frequency, reported unit and currency.
The Mars Express PFS data set contains raw (CODMAC Level 2) measurements from the Planetary Fourier Spectrometer collected during the first extension Mars orbit phases.
https://earth.esa.int/eogateway/documents/20142/1564626/Terms-and-Conditions-for-the-use-of-ESA-Data.pdfhttps://earth.esa.int/eogateway/documents/20142/1564626/Terms-and-Conditions-for-the-use-of-ESA-Data.pdf
The Fundamental Data Record (FDR) for Atmospheric Composition UVN v.1.0 dataset is a cross-instrument Level-1 product [ATMOS_L1B] generated in 2023 and resulting from the ESA FDR4ATMOS project. The FDR contains selected Earth Observation Level 1b parameters (irradiance/reflectance) from the nadir-looking measurements of the ERS-2 GOME and Envisat SCIAMACHY missions for the period ranging from 1995 to 2012. The data record offers harmonised cross-calibrated spectra with focus on spectral windows in the Ultraviolet-Visible-Near Infrared regions for the retrieval of critical atmospheric constituents like ozone (O3), sulphur dioxide (SO2), nitrogen dioxide (NO2) column densities, alongside cloud parameters. The FDR4ATMOS products should be regarded as experimental due to the innovative approach and the current use of a limited-sized test dataset to investigate the impact of harmonization on the Level 2 target species, specifically SO2, O3 and NO2. Presently, this analysis is being carried out within follow-on activities. The FDR4ATMOS V1 is currently being extended to include the MetOp GOME-2 series. Product format For many aspects, the FDR product has improved compared to the existing individual mission datasets: GOME solar irradiances are harmonised using a validated SCIAMACHY solar reference spectrum, solving the problem of the fast-changing etalon present in the original GOME Level 1b data; Reflectances for both GOME and SCIAMACHY are provided in the FDR product. GOME reflectances are harmonised to degradation-corrected SCIAMACHY values, using collocated data from the CEOS PIC sites; SCIAMACHY data are scaled to the lowest integration time within the spectral band using high-frequency PMD measurements from the same wavelength range. This simplifies the use of the SCIAMACHY spectra which were split in a complex cluster structure (with own integration time) in the original Level 1b data; The harmonization process applied mitigates the viewing angle dependency observed in the UV spectral region for GOME data; Uncertainties are provided. Each FDR product provides, within the same file, irradiance/reflectance data for UV-VIS-NIR special regions across all orbits on a single day, including therein information from the individual ERS-2 GOME and Envisat SCIAMACHY measurements. FDR has been generated in two formats: Level 1A and Level 1B targeting expert users and nominal applications respectively. The Level 1A [ATMOS_L1A] data include additional parameters such as harmonisation factors, PMD, and polarisation data extracted from the original mission Level 1 products. The ATMOS_L1A dataset is not part of the nominal dissemination to users. In case of specific requirements, please contact EOHelp. Please refer to the README file for essential guidance before using the data. All the new products are conveniently formatted in NetCDF. Free standard tools, such as Panoply, can be used to read NetCDF data. Panoply is sourced and updated by external entities. For further details, please consult our Terms and Conditions page. Uncertainty characterisation One of the main aspects of the project was the characterization of Level 1 uncertainties for both instruments, based on metrological best practices. The following documents are provided: General guidance on a metrological approach to Fundamental Data Records (FDR) Uncertainty Characterisation document Effect tables NetCDF files containing example uncertainty propagation analysis and spectral error correlation matrices for SCIAMACHY (Atlantic and Mauretania scene for 2003 and 2010) and GOME (Atlantic scene for 2003) reflectance_uncertainty_example_FDR4ATMOS_GOME.nc reflectance_uncertainty_example_FDR4ATMOS_SCIA.nc Known Issues Non-monotonous wavelength axis for SCIAMACHY in FDR data version 1.0 In the SCIAMACHY OBSERVATION group of the atmospheric FDR v1.0 dataset (DOI: 10.5270/ESA-852456e), the wavelength axis (lambda variable) is not monotonically increasing. This issue affects all spectral channels (UV, VIS, NIR) in the SCIAMACHY group, while GOME OBSERVATION data remain unaffected. The root cause of the issue lies in the incorrect indexing of the lambda variable during the NetCDF writing process. Notably, the wavelength values themselves are calculated correctly within the processing chain. Temporary Workaround The wavelength axis is correct in the first record of each product. As a workaround, users can extract the wavelength axis from the first record and apply it to all subsequent measurements within the same product. The first record can be retrieved by setting the first two indices (time and scanline) to 0 (assuming counting of array indices starts at 0). Note that this process must be repeated separately for each spectral range (UV, VIS, NIR) and every daily product. Since the wavelength axis of SCIAMACHY is highly stable over time, using the first record introduces no expected impact on retrieval results. Python pseudo-code example: lambda_...
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Residential property price statistics from different countries. Contains property price indicators (real series are the nominal price series deflated by the consumer price index), both in levels and in growth rates. Can be used for property market analysis. The dataset contains four different files with different metrics, including nominal index, nominal year-on-year changes, real index, and real year-on-year changes. Each file includes data in the format of date, country, and price.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Composition of nominal household income and nominal equivalised income
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Major differences from previous work: For level 2 catch: Catches in tons, raised to match nominal values, now consider the geographic area of the nominal data for improved accuracy. Captures in "Number of fish" are converted to weight based on nominal data. The conversion factors used in the previous version are no longer used, as they did not adequately represent the diversity of captures. Number of fish without corresponding data in nominal are not removed as they were before, creating a huge difference for this measurement_unit between the two datasets. Nominal data from WCPFC includes fishing fleet information, and georeferenced data has been raised based on this instead of solely on the triplet year/gear/species, to avoid random reallocations. Strata for which catches in tons are raised to match nominal data have had their numbers removed. Raising only applies to complete years to avoid overrepresenting specific months, particularly in the early years of georeferenced reporting. Strata where georeferenced data exceed nominal data have not been adjusted downward, as it is unclear if these discrepancies arise from missing nominal data or different aggregation methods in both datasets. The data is not aggregated to 5-degree squares and thus remains unharmonized spatially. Aggregation can be performed using CWP codes for geographic identifiers. For example, an R function is available: source("https://raw.githubusercontent.com/firms-gta/geoflow-tunaatlas/master/sardara_functions/transform_cwp_code_from_1deg_to_5deg.R") Level 0 dataset has been modified creating differences in this new version notably : The species retained are different; only 32 major species are kept. Mappings have been somewhat modified based on new standards implemented by FIRMS. New rules have been applied for overlapping areas. Data is only displayed in 1 degrees square area and 5 degrees square areas. The data is enriched with "Species group", "Gear labels" using the fdiwg standards. These main differences are recapped in the Differences_v2018_v2024.zip Recommendations: To avoid converting data from number using nominal stratas, we recommend the use of conversion factors which could be provided by tRFMOs. In some strata, nominal data appears higher than georeferenced data, as observed during level 2 processing. These discrepancies may result from errors or differences in aggregation methods. Further analysis will examine these differences in detail to refine treatments accordingly. A summary of differences by tRFMOs, based on the number of strata, is included in the appendix. Some nominal data have no equivalent in georeferenced data and therefore cannot be disaggregated. What could be done is to check for each nominal data without equivalence if a georeferenced data exists in different buffers, and to average the distribution of this footprint. Then, disaggregate the nominal data based on the georeferenced data. This would lead to the creation of data (approximately 3%), and would necessitate reducing/removing all georeferenced data without a nominal equivalent or with a lesser equivalent. Tests are currently being conducted with and without this. It would help improve the biomass captured footprint but could lead to unexpected discrepancies with current datasets. For level 0 effort : In some datasets—namely those from ICCAT and the purse seine (PS) data from WCPFC— same effort data has been reported multiple times by using different units which have been kept as is, since no official mapping allows conversion between these units. As a result, users have be remind that some ICCAT and WCPFC effort data are deliberately duplicated : in the case of ICCAT data, lines with identical strata but different effort units are duplicates reporting the same fishing activity with different measurement units. It is indeed not possible to infer strict equivalence between units, as some contain information about others (e.g., Hours.FAD and Hours.FSC may inform Hours.STD). in the case of WCPFC data, effort records were also kept in all originally reported units. Here, duplicates do not necessarily share the same “fishing_mode”, as SETS for purse seiners are reported with an explicit association to fishing_mode, while DAYS are not. This distinction allows SETS records to be separated by fishing mode, whereas DAYS records remain aggregated. Some limited harmonization—particularly between units such as NET-days and Nets—has not been implemented in the current version of the dataset, but may be considered in future releases if a consistent relationship can be established.
https://data.gov.sg/open-data-licencehttps://data.gov.sg/open-data-licence
Dataset from Singapore Department of Statistics. For more information, visit https://data.gov.sg/datasets/d_60ef4eaaf5ab139618aa6ad282899d3e/view
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Nominal per capital consumption by consumption group, Harvested from Annual Household Survey 2015/16, Government of Nepal, National Planning Commission Secretariat, Central Bureau of Statistics.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset is a collection of CBT results from prospective students participating in the primary school teacher professional education program.
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
Abstract: The data set comprises temperature data of the Earth's middle atmosphere region for the years 2002-2012 as derived from the nominal mode mid infrared spectra measurements collected by the MIPAS instrument on board of the Envisat satellite. The data version is V8. TechnicalRemarks: The data is delivered as a set of archive files with each archive containing monthly netCDF files of the respective year. Users of the IDL data language can use the programs which are delivered as an archive, too. For previous version see relations.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The USDA Forest Service (USFS) builds multiple versions of percent tree canopy cover data, in order to serve needs of multiple user communities. These datasets encompass CONUS, Coastal Alaska, Hawaii, U.S. Virgin Islands and Puerto Rico. There are three versions of data within the 2016 TCC Product Suite, which include: The initial model outputs referred to as the Analytical data; A masked version of the initial output referred to as Cartographic data; And a modified version built for the National Land Cover Database and referred to as NLCD data, which includes a canopy cover change dataset derived from subtraction of datasets for the nominal years of 2011 and 2016. The Analytical data are the initial model outputs generated in the production workflow. These data are best suited for users who will carry out their own detailed statistical and uncertainty analyses on the dataset and place lower priority on the visual appearance of the dataset for cartographic purposes. Datasets for the nominal years of 2011 and 2016 are available.
The Cartographic products mask the initial model outputs to improve the visual appearance of the datasets. These data are best suited for users who prioritize visual appearance of the data for cartographic and illustrative purposes. Datasets for the nominal years of 2011 and 2016 are available.
The NLCD data are the result of further processing of the masked data. The goal was to generate three coordinated components. The components are (1) a dataset for the nominal year of 2011, (2) a dataset for the nominal year of 2016, and (3) a dataset that captures the change in canopy cover between the two nominal years of 2011 and 2016. For the NLCD data, the three components meet the criterion of 2011 TCC + change in TCC = 2016 TCC. These NLCD data are best suited for users who require a coordinated three-component data stack where each pixels values meet the criterion of 2011 TCC + change in TCC = 2016 TCC. Datasets for the nominal years of 2011 and 2016 are available, as well as a dataset that captures the change (loss or gain) in canopy cover between those two nominal years of 2011 and 2016, in areas where change was identified.
These tree canopy cover data are accessible for multiple user communities, through multiple channels and platforms, as listed below: Analytical USFS Tree Canopy Cover Datasets (Download) USFS Enterprise Data Warehouse (Image Service) Cartographic USFS Tree Canopy Cover Datasets (Download) USFS Enterprise Data Warehouse (Map Service) NLCD Multi-Resolution Land Characteristics (MRLC) Consortium (Download) USFS Enterprise Data Warehouse (Image Service) The Puerto Rico and the US Virgin Islands TCC NLCD change dataset is comprised of a single layer. The pixel values range from -97 to 98 percent where negative values represent canopy loss and positive values represent canopy gain. The background is represented by the value 127 and data gaps are represented by the value 110 since this is a signed 8-bit image.This record was taken from the USDA Enterprise Data Inventory that feeds into the https://data.gov catalog. Data for this record includes the following resources: ISO-19139 metadata ArcGIS Hub Dataset ArcGIS GeoService For complete information, please visit https://data.gov.
Boosted top tagging is an essential binary classification task for experiments at the Large Hadron Collider (LHC) to measure the properties of the top quark. The ATLAS Top Tagging Open Data Set is a publicly available dataset for the development of Machine Learning (ML) based boosted top tagging algorithms. The dataset consists of a nominal piece used for the training and evaluation of algorithms, and a systematic piece used for estimating the size of systematic uncertainties produced by an algorithm. The nominal data are is split into two orthogonal sets, named train and test. The systematic varied data is split into many more pieces that should only be used for evaluation in most cases. Both nominal sets are composed of equal parts signal (jets initiated by a boosted top quark) and background (jets initiated by light quarks or gluons).
A brief overview of these datasets is as follows. For more detailed information see arxiv:2047.20127.
For each jet, the datasets contain:
There are two rules for using this data set: the contribution to a loss function from any jet should always be weighted by the training weight, and any performance claim is incomplete without an estimate of the systematic uncertainties via the method illustrated in this repository. The ideal model shows high performance but also small systematic uncertainties.
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
This dataset includes nominal and real gross domestic product (GDP) data (including their details) in Thailand from 1993Q1 to 2023Q1. This data is scraped from websites of The Office of the National Economic and Social Development Council, also known as NESDC.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
Major differences from previous work: For level 2: Catches in tons, raised to match nominal values, now consider the geographic area of the nominal data for improved accuracy. Captures in "Number of fish" are converted to weight based on nominal data. The conversion factors used in the previous version are no longer used, as they did not adequately represent the diversity of captures. Number of fish without corresponding data in nominal are not removed as they were before, creating a huge difference for this measurement_unit between the two datasets. Nominal data from WCPFC includes fishing fleet information, and georeferenced data has been raised based on this instead of solely on the triplet year/gear/species, to avoid random reallocations. Strata for which catches in tons are raised to match nominal data have had their numbers removed. Raising only applies to complete years to avoid overrepresenting specific months, particularly in the early years of georeferenced reporting. Strata where georeferenced data exceed nominal data have not been adjusted downward, as it is unclear if these discrepancies arise from missing nominal data or different aggregation methods in both datasets. The data is not aggregated to 5-degree squares and thus remains unharmonized spatially. Aggregation can be performed using CWP codes for geographic identifiers. For example, an R function is available: source("https://raw.githubusercontent.com/firms-gta/geoflow-tunaatlas/master/sardara_functions/transform_cwp_code_from_1deg_to_5deg.R") Level 0 dataset has been modified creating differences in this new version notably : The species retained are different; only 32 major species are kept. Mappings have been somewhat modified based on new standards implemented by FIRMS. New rules have been applied for overlapping areas. Data is only displayed in 1 degrees square area and 5 degrees square areas. The data is enriched with "Species group", "Gear labels" using the fdiwg standards. These main differences are recapped in the Differences_v2018_v2024.zip Recommendations: To avoid converting data from number using nominal stratas, we recommend the use of conversion factors which could be provided by tRFMOs. In some strata, nominal data appears higher than georeferenced data, as observed during level 2 processing. These discrepancies may result from errors or differences in aggregation methods. Further analysis will examine these differences in detail to refine treatments accordingly. A summary of differences by tRFMOs, based on the number of strata, is included in the appendix. Some nominal data have no equivalent in georeferenced data and therefore cannot be disaggregated. What could be done is to check for each nominal data without equivalence if a georeferenced data exists in different buffers, and to average the distribution of this footprint. Then, disaggregate the nominal data based on the georeferenced data. This would lead to the creation of data (approximately 3%), and would necessitate reducing/removing all georeferenced data without a nominal equivalent or with a lesser equivalent. Tests are currently being conducted with and without this. It would help improve the biomass captured footprint but could lead to unexpected discrepancies with current datasets.
Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
License information was derived automatically
196 views (3 recent) Catches in FAO area 27 by country, species, area and year. Source: Eurostat/ICES database on catch statistics - ICES 2011, Copenhagen. Format: Archived dataset in .xls and .csv format. Version 26-06-2019 https://www.ices.dk/data/dataset-collections/Pages/Fish-catch-and-stock-assessment.aspx