100+ datasets found
  1. Dynamic World V1

    • developers.google.com
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Google, Dynamic World V1 [Dataset]. http://doi.org/10.1038/s41597-022-01307-4
    Explore at:
    Dataset provided by
    Googlehttp://google.com/
    World Resources Institute
    Time period covered
    Jun 27, 2015 - Sep 22, 2025
    Area covered
    Earth
    Description

    Dynamic World is a 10m near-real-time (NRT) Land Use/Land Cover (LULC) dataset that includes class probabilities and label information for nine classes. Dynamic World predictions are available for the Sentinel-2 L1C collection from 2015-06-27 to present. The revisit frequency of Sentinel-2 is between 2-5 days depending on latitude. Dynamic World predictions are generated for Sentinel-2 L1C images with CLOUDY_PIXEL_PERCENTAGE <= 35%. Predictions are masked to remove clouds and cloud shadows using a combination of S2 Cloud Probability, Cloud Displacement Index, and Directional Distance Transform. Images in the Dynamic World collection have names matching the individual Sentinel-2 L1C asset names from which they were derived, e.g: ee.Image('COPERNICUS/S2/20160711T084022_20160711T084751_T35PKT') has a matching Dynamic World image named: ee.Image('GOOGLE/DYNAMICWORLD/V1/20160711T084022_20160711T084751_T35PKT'). All probability bands except the "label" band collectively sum to 1. To learn more about the Dynamic World dataset and see examples for generating composites, calculating regional statistics, and working with the time series, see the Introduction to Dynamic World tutorial series. Given Dynamic World class estimations are derived from single images using a spatial context from a small moving window, top-1 "probabilities" for predicted land covers that are in-part defined by cover over time, like crops, can be comparatively low in the absence of obvious distinguishing features. High-return surfaces in arid climates, sand, sunglint, etc may also exhibit this phenomenon. To select only pixels that confidently belong to a Dynamic World class, it is recommended to mask Dynamic World outputs by thresholding the estimated "probability" of the top-1 prediction.

  2. World Development Indicators

    • kaggle.com
    zip
    Updated Apr 10, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    World Bank (2019). World Development Indicators [Dataset]. https://www.kaggle.com/theworldbank/world-development-indicators
    Explore at:
    zip(134125679 bytes)Available download formats
    Dataset updated
    Apr 10, 2019
    Dataset authored and provided by
    World Bankhttp://topics.nytimes.com/top/reference/timestopics/organizations/w/world_bank/index.html
    License

    https://www.worldbank.org/en/about/legal/terms-of-use-for-datasetshttps://www.worldbank.org/en/about/legal/terms-of-use-for-datasets

    Description

    Content

    The primary World Bank collection of development indicators, compiled from officially-recognized international sources. It presents the most current and accurate global development data available, and includes national, regional and global estimates.

    Context

    This is a dataset hosted by the World Bank. The organization has an open data platform found here and they update their information according the amount of data that is brought in. Explore the World Bank using Kaggle and all of the data sources available through the World Bank organization page!

    • Update Frequency: This dataset is updated daily.

    Acknowledgements

    This dataset is maintained using the World Bank's APIs and Kaggle's API.

    Cover photo by Alex Block on Unsplash
    Unsplash Images are distributed under a unique Unsplash License.

  3. G

    VIIRS Stray Light Corrected Nighttime Day/Night Band Composites Version 1

    • developers.google.com
    Updated May 31, 2017
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Earth Observation Group, Payne Institute for Public Policy, Colorado School of Mines (2017). VIIRS Stray Light Corrected Nighttime Day/Night Band Composites Version 1 [Dataset]. https://developers.google.com/earth-engine/datasets/catalog/NOAA_VIIRS_DNB_MONTHLY_V1_VCMSLCFG
    Explore at:
    Dataset updated
    May 31, 2017
    Dataset provided by
    Earth Observation Group, Payne Institute for Public Policy, Colorado School of Mines
    Time period covered
    Jan 1, 2014 - Mar 1, 2025
    Area covered
    Description

    Monthly average radiance composite images using nighttime data from the Visible Infrared Imaging Radiometer Suite (VIIRS) Day/Night Band (DNB). As these data are composited monthly, there are many areas of the globe where it is impossible to get good quality data coverage for that month. This can be due to …

  4. Most popular database management systems worldwide 2024

    • statista.com
    Updated Jun 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Most popular database management systems worldwide 2024 [Dataset]. https://www.statista.com/statistics/809750/worldwide-popularity-ranking-database-management-systems/
    Explore at:
    Dataset updated
    Jun 30, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Jun 2024
    Area covered
    Worldwide
    Description

    As of June 2024, the most popular database management system (DBMS) worldwide was Oracle, with a ranking score of *******; MySQL and Microsoft SQL server rounded out the top three. Although the database management industry contains some of the largest companies in the tech industry, such as Microsoft, Oracle and IBM, a number of free and open-source DBMSs such as PostgreSQL and MariaDB remain competitive. Database Management Systems As the name implies, DBMSs provide a platform through which developers can organize, update, and control large databases. Given the business world’s growing focus on big data and data analytics, knowledge of SQL programming languages has become an important asset for software developers around the world, and database management skills are seen as highly desirable. In addition to providing developers with the tools needed to operate databases, DBMS are also integral to the way that consumers access information through applications, which further illustrates the importance of the software.

  5. Stack Overflow 2022 Developers Survey

    • kaggle.com
    Updated Jul 11, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stack Overflow (2023). Stack Overflow 2022 Developers Survey [Dataset]. https://www.kaggle.com/datasets/stackoverflow/stack-overflow-2022-developers-survey
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 11, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Stack Overflow
    Description

    Description:

    The enclosed data set is the complete, cleaned results of the 2022 Stack Overflow Developer Survey. Free response submissions and personally-identifying information have been removed from the results to protect the privacy of respondents. There are three files besides this README:

    1. survey_results_public.csv - CSV file with main survey results, one respondent per row and one column per answer
    2. survey_results_schema.csv - CSV file with survey schema, i.e., the questions that correspond to each column name
    3. so_survey_2022.pdf - PDF file of the survey instrument

    The survey was fielded from May 11, 2022 to June 1, 2022. The median time spent on the survey for qualified responses was 15.08 minutes.

    Respondents were recruited primarily through channels owned by Stack Overflow. The top 5 sources of respondents were onsite messaging, blog posts, email lists, Meta posts, banner ads, and social media posts. Since respondents were recruited in this way, highly engaged users on Stack Overflow were more likely to notice the links for the survey and click to begin it.

    As an incentive, respondents who finished the survey could opt into a "Census" badge if they completed the survey.

    You can find the official published results online.

    Find previous survey results online, as well.

    Legal:

    This database - The Public 2022 Stack Overflow Developer Survey Results - is made available under the Open Database License (ODbL): http://opendatacommons.org/licenses/odbl/1.0/. Any rights in individual contents of the database are licensed under the Database Contents License: http://opendatacommons.org/licenses/dbcl/1.0/

    TLDR: You are free to share, adapt, and create derivative works from The Public 2022 Stack Overflow Developer Survey Results as long as you attribute Stack Overflow, keep the database open (if you redistribute it), and continue to share-alike any adapted database under the ODbl.

    Acknowledgment:

    Massive, heartfelt thanks to all Stack Overflow contributors and lurking developers of the world who took part in the survey this year. We value your generous participation more than you know.

  6. Commercial Real Estate Data | Commercial Real Estate Professionals in Europe...

    • datarade.ai
    Updated Jan 1, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Success.ai (2018). Commercial Real Estate Data | Commercial Real Estate Professionals in Europe | Verified Global Profiles from 700M+ Dataset | Best Price Guarantee [Dataset]. https://datarade.ai/data-products/commercial-real-estate-data-commercial-real-estate-professi-success-ai
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Jan 1, 2018
    Dataset provided by
    Area covered
    Netherlands, Denmark, Gibraltar, Finland, Serbia, Greece, Monaco, Bulgaria, Isle of Man, Croatia
    Description

    Success.ai’s Commercial Real Estate Data for Commercial Real Estate Professionals in Europe provides a highly detailed dataset tailored for businesses looking to engage with key decision-makers in the European commercial real estate market. Covering developers, property managers, brokers, and investors, this dataset includes verified contact data, decision-maker insights, and firmographic details to empower your outreach and strategic initiatives.

    With access to over 700 million verified global profiles and data from 70 million businesses, Success.ai ensures your marketing, sales, and partnership efforts are powered by accurate, continuously updated, and AI-validated data. Supported by our Best Price Guarantee, this solution is indispensable for navigating Europe’s thriving commercial real estate sector.

    Why Choose Success.ai’s Commercial Real Estate Data?

    1. Verified Contact Data for Targeted Outreach

      • Access verified work emails, phone numbers, and LinkedIn profiles of property developers, brokers, asset managers, and investment leads.
      • AI-driven validation ensures 99% accuracy, reducing communication errors and improving outreach effectiveness.
    2. Comprehensive Coverage Across Europe’s Real Estate Sector

      • Includes profiles from major European real estate markets such as the UK, Germany, France, Italy, and the Netherlands.
      • Gain insights into regional market dynamics, investment opportunities, and commercial real estate trends.
    3. Continuously Updated Datasets

      • Real-time updates capture leadership changes, market expansions, and emerging property developments.
      • Stay aligned with the fast-evolving commercial real estate market and seize opportunities effectively.
    4. Ethical and Compliant

      • Adheres to GDPR, CCPA, and other global data privacy regulations, ensuring responsible use of data and compliance with legal standards.

    Data Highlights:

    • 700M+ Verified Global Profiles: Connect with decision-makers, property managers, and brokers in Europe’s commercial real estate sector.
    • 70M Business Profiles: Access detailed firmographic data, including company sizes, revenue ranges, and geographic footprints.
    • Leadership Insights: Engage with CEOs, asset managers, and real estate directors driving strategic decisions.
    • Market Intelligence: Gain visibility into property development projects, investment trends, and regional opportunities.

    Key Features of the Dataset:

    1. Decision-Maker Profiles in Real Estate

      • Identify and connect with executives, brokers, and property managers overseeing transactions, asset management, and investment strategies.
      • Target professionals responsible for property acquisitions, leasing, and development.
    2. Firmographic and Geographic Insights

      • Access detailed business information, including company structures, geographic locations, and market specializations.
      • Pinpoint key players in specific regions and align outreach with localized market needs.
    3. Advanced Filters for Precision Campaigns

      • Filter companies by industry focus (commercial properties, retail, industrial), revenue size, or project scope.
      • Tailor your campaigns to address specific challenges, such as tenant retention, sustainability initiatives, or market expansion.
    4. AI-Driven Enrichment

      • Profiles enriched with actionable data allow for personalized messaging, highlight unique value propositions, and improve engagement outcomes with real estate professionals.

    Strategic Use Cases:

    1. Sales and Lead Generation

      • Present property management tools, software solutions, or investment opportunities to real estate firms and property managers.
      • Build relationships with brokers and developers seeking innovative solutions to streamline operations or enhance profitability.
    2. Market Research and Competitive Analysis

      • Analyze trends in Europe’s commercial real estate market to guide product development and marketing strategies.
      • Benchmark against competitors to identify growth opportunities, underserved segments, and high-value properties.
    3. Partnership Development and Investment Insights

      • Engage with property developers, asset managers, and brokers exploring strategic partnerships or new investment opportunities.
      • Foster alliances that expand market reach, improve property performance, or drive higher returns.
    4. Recruitment and Workforce Solutions

      • Target HR professionals and hiring managers recruiting for roles in property management, real estate finance, or asset development.
      • Provide workforce optimization tools, training platforms, or recruitment services tailored to the commercial real estate sector.

    Why Choose Success.ai?

    1. Best Price Guarantee
      • Access premium-quality commercial real estate data at competitive prices, ensuring strong ROI for your marketing, sales, and business development efforts. ...
  7. Open Buildings V3 Polygons

    • developers.google.com
    Updated Oct 8, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Google Research - Open Buildings (2022). Open Buildings V3 Polygons [Dataset]. https://developers.google.com/earth-engine/datasets/catalog/GOOGLE_Research_open-buildings_v3_polygons
    Explore at:
    Dataset updated
    Oct 8, 2022
    Dataset provided by
    Googlehttp://google.com/
    Time period covered
    May 30, 2023
    Area covered
    Earth
    Description

    This large-scale open dataset consists of outlines of buildings derived from high-resolution 50 cm satellite imagery. It contains 1.8B building detections in Africa, Latin America, Caribbean, South Asia and Southeast Asia. The inference spanned an area of 58M km². For each building in this dataset we include the polygon describing its footprint on the ground, a confidence score indicating how sure we are that this is a building, and a Plus Code corresponding to the center of the building. There is no information about the type of building, its street address, or any details other than its geometry. Building footprints are useful for a range of important applications: from population estimation, urban planning and humanitarian response to environmental and climate science. The project is based in Ghana, with an initial focus on the continent of Africa and new updates on South Asia, South-East Asia, Latin America and the Caribbean. Inference was carried out during May 2023. For more details see the official website of the Open Buildings dataset.

  8. BETA-FOR_SP3_EnvironmentalAttributes_DLM/ESAWC/SRTM_2023

    • zenodo.org
    Updated Feb 18, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Patrick Kacic; Patrick Kacic (2025). BETA-FOR_SP3_EnvironmentalAttributes_DLM/ESAWC/SRTM_2023 [Dataset]. http://doi.org/10.5281/zenodo.14850688
    Explore at:
    Dataset updated
    Feb 18, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Patrick Kacic; Patrick Kacic
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Time period covered
    Oct 24, 2023
    Description
    This dataset provides additional information on environmental attributes (minimum distance to land cover classes, topographic information) based on the dataset "BETA-FOR_SPZ_Patches_2022/2023" (https://zenodo.org/records/14748236) (centroid coordinates: decimalLongitude, decimalLatitude).
    From the following three geospatial datasets the information on environmental attributes were derived:
    - ESA Worldcover = Global product on land cover (Raster data, https://developers.google.com/earth-engine/datasets/catalog/ESA_WorldCover_v100?hl=en)
    - SRTM = Global Digital Elevation Model (Raster data, https://developers.google.com/earth-engine/datasets/catalog/USGS_SRTMGL1_003?hl=en)
    The following attributes were added to the "BETA-FOR_SPZ_Patches_2022/2023" table and exported as .csv file (tabular data):
    DLM250:
    - min_dist_sie01_p = minimum distance to urban areas [m]
    - min_dist_ver01_l = minimum distance to technical infrastructure (roads) [m]
    - min_dist_veg01_f = minimum distance to agricultural areas [m]
    - min_dist_gew01_l = minimum distance to waterbodies [m]
    Please consider that the DLM250 is spatially discontinuous vector data where e.g. agricultural areas are incompletely assessed.
    ESA WorldCover (ESAWC):
    - min_dist_esawc_30 = minimum distance to grasslands (land cover class value = 30) [m]
    - min_dist_esawc_40 = minimum distance to cropland (land cover class value = 40) [m]
    SRTM:
    - SRTM_elevation = elevation [m]
    - SRTM_slope = slope [°]
    - SRTM_aspect = aspect; 90° = E, 180° = S; 270 ° = W; 360°/0° = N [°]
    The original vector and raster data can be made available upon request, e.g. to inspect benefits and limitations of DLM250 and ESA WorldCover.
  9. Commercial Real Estate Data | Global Real Estate Professionals | Work...

    • datarade.ai
    Updated Oct 27, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Success.ai (2021). Commercial Real Estate Data | Global Real Estate Professionals | Work Emails, Phone Numbers & Verified Profiles | Best Price Guaranteed [Dataset]. https://datarade.ai/data-products/commercial-real-estate-data-global-real-estate-professional-success-ai
    Explore at:
    .bin, .json, .xml, .csv, .xls, .sql, .txtAvailable download formats
    Dataset updated
    Oct 27, 2021
    Dataset provided by
    Area covered
    Hong Kong, Comoros, Burkina Faso, Bolivia (Plurinational State of), Guatemala, Netherlands, Marshall Islands, El Salvador, Korea (Republic of), Sierra Leone
    Description

    Success.ai’s Commercial Real Estate Data and B2B Contact Data for Global Real Estate Professionals is a comprehensive dataset designed to connect businesses with industry leaders in real estate worldwide. With over 170M verified profiles, including work emails and direct phone numbers, this solution ensures precise outreach to agents, brokers, property developers, and key decision-makers in the real estate sector.

    Utilizing advanced AI-driven validation, our data is continuously updated to maintain 99% accuracy, offering actionable insights that empower targeted marketing, streamlined sales strategies, and efficient recruitment efforts. Whether you’re engaging with top real estate executives or sourcing local property experts, Success.ai provides reliable and compliant data tailored to your needs.

    Key Features of Success.ai’s Real Estate Professional Contact Data

    • Comprehensive Industry Coverage Gain direct access to verified profiles of real estate professionals across the globe, including:
    1. Real Estate Agents: Professionals facilitating property sales and purchases.
    2. Brokers: Key intermediaries managing transactions between buyers and sellers.
    3. Property Developers: Decision-makers shaping residential, commercial, and industrial projects.
    4. Real Estate Executives: Leaders overseeing multi-regional operations and business strategies.
    5. Architects & Consultants: Experts driving design and project feasibility.
    • Verified and Continuously Updated Data

    AI-Powered Validation: All profiles are verified using cutting-edge AI to ensure up-to-date accuracy. Real-Time Updates: Our database is refreshed continuously to reflect the most current information. Global Compliance: Fully aligned with GDPR, CCPA, and other regional regulations for ethical data use.

    • Customizable Data Delivery Tailor your data access to align with your operational goals:

    API Integration: Directly integrate data into your CRM or project management systems for seamless workflows. Custom Flat Files: Receive detailed datasets customized to your specifications, ready for immediate application.

    Why Choose Success.ai for Real Estate Contact Data?

    • Best Price Guarantee Enjoy competitive pricing that delivers exceptional value for verified, comprehensive contact data.

    • Precision Targeting for Real Estate Professionals Our dataset equips you to connect directly with real estate decision-makers, minimizing misdirected efforts and improving ROI.

    • Strategic Use Cases

      Lead Generation: Target qualified real estate agents and brokers to expand your network. Sales Outreach: Engage with property developers and executives to close high-value deals. Marketing Campaigns: Drive targeted campaigns tailored to real estate markets and demographics. Recruitment: Identify and attract top talent in real estate for your growing team. Market Research: Access firmographic and demographic data for in-depth industry analysis.

    • Data Highlights 170M+ Verified Professional Profiles 50M Work Emails 30M Company Profiles 700M Global Professional Profiles

    • Powerful APIs for Enhanced Functionality

      Enrichment API Ensure your contact database remains relevant and up-to-date with real-time enrichment. Ideal for businesses seeking to maintain competitive agility in dynamic markets.

    Lead Generation API Boost your lead generation with verified contact details for real estate professionals, supporting up to 860,000 API calls per day for robust scalability.

    • Use Cases for Real Estate Contact Data
    1. Targeted Outreach for New Projects Connect with property developers and brokers to pitch your services or collaborate on upcoming projects.

    2. Real Estate Marketing Campaigns Execute personalized marketing campaigns targeting agents and clients in residential, commercial, or industrial sectors.

    3. Enhanced Sales Strategies Shorten sales cycles by directly engaging with decision-makers and key stakeholders.

    4. Recruitment and Talent Acquisition Access profiles of highly skilled professionals to strengthen your real estate team.

    5. Market Analysis and Intelligence Leverage firmographic and demographic insights to identify trends and optimize business strategies.

    • What Makes Us Stand Out? >> Unmatched Data Accuracy: Our AI-driven validation ensures 99% accuracy for all contact details. >> Comprehensive Global Reach: Covering professionals across diverse real estate markets worldwide. >> Flexible Delivery Options: Access data in formats that seamlessly fit your existing systems. >> Ethical and Compliant Data Practices: Adherence to global standards for secure and responsible data use.

    Success.ai’s B2B Contact Data for Global Real Estate Professionals delivers the tools you need to connect with the right people at the right time, driving efficiency and success in your business operations. From agents and brokers to property developers and executiv...

  10. a

    Data from: Google Earth Engine (GEE)

    • sdgs.amerigeoss.org
    • data.amerigeoss.org
    • +6more
    Updated Nov 28, 2018
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    AmeriGEOSS (2018). Google Earth Engine (GEE) [Dataset]. https://sdgs.amerigeoss.org/datasets/bb1b131beda24006881d1ab019205277
    Explore at:
    Dataset updated
    Nov 28, 2018
    Dataset authored and provided by
    AmeriGEOSS
    Description

    Meet Earth EngineGoogle Earth Engine combines a multi-petabyte catalog of satellite imagery and geospatial datasets with planetary-scale analysis capabilities and makes it available for scientists, researchers, and developers to detect changes, map trends, and quantify differences on the Earth's surface.SATELLITE IMAGERY+YOUR ALGORITHMS+REAL WORLD APPLICATIONSLEARN MOREGLOBAL-SCALE INSIGHTExplore our interactive timelapse viewer to travel back in time and see how the world has changed over the past twenty-nine years. Timelapse is one example of how Earth Engine can help gain insight into petabyte-scale datasets.EXPLORE TIMELAPSEREADY-TO-USE DATASETSThe public data archive includes more than thirty years of historical imagery and scientific datasets, updated and expanded daily. It contains over twenty petabytes of geospatial data instantly available for analysis.EXPLORE DATASETSSIMPLE, YET POWERFUL APIThe Earth Engine API is available in Python and JavaScript, making it easy to harness the power of Google’s cloud for your own geospatial analysis.EXPLORE THE APIGoogle Earth Engine has made it possible for the first time in history to rapidly and accurately process vast amounts of satellite imagery, identifying where and when tree cover change has occurred at high resolution. Global Forest Watch would not exist without it. For those who care about the future of the planet Google Earth Engine is a great blessing!-Dr. Andrew Steer, President and CEO of the World Resources Institute.CONVENIENT TOOLSUse our web-based code editor for fast, interactive algorithm development with instant access to petabytes of data.LEARN ABOUT THE CODE EDITORSCIENTIFIC AND HUMANITARIAN IMPACTScientists and non-profits use Earth Engine for remote sensing research, predicting disease outbreaks, natural resource management, and more.SEE CASE STUDIESREADY TO BE PART OF THE SOLUTION?SIGN UP NOWTERMS OF SERVICE PRIVACY ABOUT GOOGLE

  11. Z

    Dataset for paper "Mitigating the effect of errors in source parameters on...

    • data.niaid.nih.gov
    Updated Sep 28, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Phil-Simon Hardalupas (2022). Dataset for paper "Mitigating the effect of errors in source parameters on seismic (waveform) inversion" [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_6969601
    Explore at:
    Dataset updated
    Sep 28, 2022
    Dataset provided by
    Phil-Simon Hardalupas
    Nicholas Rawlinson
    Nienke Blom
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Dataset corresponding to the journal article "Mitigating the effect of errors in source parameters on seismic (waveform) inversion" by Blom, Hardalupas and Rawlinson, accepted for publication in Geophysical Journal International. In this paper, we demonstrate the effect or errors in source parameters on seismic tomography, with a particular focus on (full) waveform tomography. We study effect both on forward modelling (i.e. comparing waveforms and measurements resulting from a perturbed vs. unperturbed source) and on seismic inversion (i.e. using a source which contains an (erroneous) perturbation to invert for Earth structure. These data were obtained using Salvus, a state-of-the-art (though proprietary) 3-D solver that can be used for wave propagation simulations (Afanasiev et al., GJI 2018).

    This dataset contains:

    The entire Salvus project. This project was prepared using Salvus version 0.11.x and 0.12.2 and should be fully compatible with the latter.

    A number of Jupyter notebooks used to create all the figures, set up the project and do the data processing.

    A number of Python scripts that are used in above notebooks.

    two conda environment .yml files: one with the complete environment as used to produce this dataset, and one with the environment as supplied by Mondaic (the Salvus developers), on top of which I installed basemap and cartopy.

    An overview of the inversion configurations used for each inversion experiment and the name of hte corresponding figures: inversion_runs_overview.ods / .csv .

    Datasets corresponding to the different figures.

    One dataset for Figure 1, showing the effect of a source perturbation in a real-world setting, as previously used by Blom et al., Solid Earth 2020

    One dataset for Figure 2, showing how different methodologies and assumptions can lead to significantly different source parameters, notably including systematic shifts. This dataset was kindly supplied by Tim Craig (Craig, 2019).

    A number of datasets (stored as pickled Pandas dataframes) derived from the Salvus project. We have computed:

    travel-time arrival predictions from every source to all stations (df_stations...pkl)

    misfits for different metrics for both P-wave centered and S-wave centered windows for all components on all stations, comparing every time waveforms from a reference source against waveforms from a perturbed source (df_misfits_cc.28s.pkl)

    addition of synthetic waveforms for different (perturbed) moment tenors. All waveforms are stored in HDF5 (.h5) files of the ASDF (adaptable seismic data format) type

    How to use this dataset:

    To set up the conda environment:

    make sure you have anaconda/miniconda

    make sure you have access to Salvus functionality. This is not absolutely necessary, but most of the functionality within this dataset relies on salvus. You can do the analyses and create the figures without, but you'll have to hack around in the scripts to build workarounds.

    Set up Salvus / create a conda environment. This is best done following the instructions on the Mondaic website. Check the changelog for breaking changes, in that case download an older salvus version.

    Additionally in your conda env, install basemap and cartopy:

    conda-env create -n salvus_0_12 -f environment.yml conda install -c conda-forge basemap conda install -c conda-forge cartopy

    Install LASIF (https://github.com/dirkphilip/LASIF_2.0) and test. The project uses some lasif functionality.

    To recreate the figures: This is extremely straightforward. Every figure has a corresponding Jupyter Notebook. Suffices to run the notebook in its entirety.

    Figure 1: separate notebook, Fig1_event_98.py

    Figure 2: separate notebook, Fig2_TimCraig_Andes_analysis.py

    Figures 3-7: Figures_perturbation_study.py

    Figures 8-10: Figures_toy_inversions.py

    To recreate the dataframes in DATA: This can be done using the example notebook Create_perturbed_thrust_data_by_MT_addition.py and Misfits_moment_tensor_components.M66_M12.py . The same can easily be extended to the position shift and other perturbations you might want to investigate.

    To recreate the complete Salvus project: This can be done using:

    the notebook Prepare_project_Phil_28s_absb_M66.py (setting up project and running simulations)

    the notebooks Moment_tensor_perturbations.py and Moment_tensor_perturbation_for_NS_thrust.py

    For the inversions: using the notebook Inversion_SS_dip.M66.28s.py as an example. See the overview table inversion_runs_overview.ods (or .csv) as to naming conventions.

    References:

    Michael Afanasiev, Christian Boehm, Martin van Driel, Lion Krischer, Max Rietmann, Dave A May, Matthew G Knepley, Andreas Fichtner, Modular and flexible spectral-element waveform modelling in two and three dimensions, Geophysical Journal International, Volume 216, Issue 3, March 2019, Pages 1675–1692, https://doi.org/10.1093/gji/ggy469

    Nienke Blom, Alexey Gokhberg, and Andreas Fichtner, Seismic waveform tomography of the central and eastern Mediterranean upper mantle, Solid Earth, Volume 11, Issue 2, 2020, Pages 669–690, 2020, https://doi.org/10.5194/se-11-669-2020

    Tim J. Craig, Accurate depth determination for moderate-magnitude earthquakes using global teleseismic data. Journal of Geophysical Research: Solid Earth, 124, 2019, Pages 1759– 1780. https://doi.org/10.1029/2018JB016902

  12. d

    Monthly OpenET Image Collections (v2.0) Summarized by 12-Digit Hydrologic...

    • catalog.data.gov
    • data.usgs.gov
    Updated Sep 17, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Monthly OpenET Image Collections (v2.0) Summarized by 12-Digit Hydrologic Unit Codes, 2008-2023 [Dataset]. https://catalog.data.gov/dataset/monthly-openet-image-collections-v2-0-summarized-by-12-digit-hydrologic-unit-codes-2008-20
    Explore at:
    Dataset updated
    Sep 17, 2025
    Dataset provided by
    U.S. Geological Survey
    Description

    This dataset provides monthly summaries of evapotranspiration (ET) data from OpenET v2.0 image collections for the period 2008-2023 for all National Watershed Boundary Dataset subwatersheds (12-digit hydrologic unit codes [HUC12s]) in the US that overlap the spatial extent of OpenET datasets. For each HUC12, this dataset contains spatial aggregation statistics (minimum, mean, median, and maximum) for each of the ET variables from each of the publicly available image collections from OpenET for the six available models (DisALEXI, eeMETRIC, geeSEBAL, PT-JPL, SIMS, SSEBop) and the Ensemble image collection, which is a pixel-wise ensemble of all 6 individual models after filtering and removal of outliers according to the median absolute deviation approach (Melton and others, 2022). Data are available in this data release in two different formats: comma-separated values (CSV) and parquet, a high-performance format that is optimized for storage and processing of columnar data. CSV files containing data for each 4-digit HUC are grouped by 2-digit HUCs for easier access of regional data, and the single parquet file provides convenient access to the entire dataset. For each of the ET models (DisALEXI, eeMETRIC, geeSEBAL, PT-JPL, SIMS, SSEBop), variables in the model-specific CSV data files include: -huc12: The 12-digit hydrologic unit code -ET: Actual evapotranspiration (in millimeters) over the HUC12 area in the month calculated as the sum of daily ET interpolated between Landsat overpasses -statistic: Max, mean, median, or min. Statistic used in the spatial aggregation within each HUC12. For example, maximum ET is the maximum monthly pixel ET value occurring within the HUC12 boundary after summing daily ET in the month -year: 4-digit year -month: 2-digit month -count: Number of Landsat overpasses included in the ET calculation in the month -et_coverage_pct: Integer percentage of the HUC12 with ET data, which can be used to determine how representative the ET statistic is of the entire HUC12 -count_coverage_pct: Integer percentage of the HUC12 with count data, which can be different than the et_coverage_pct value because the “count” band in the source image collection extends beyond the “et” band in the eastern portion of the image collection extent For the Ensemble data, these additional variables are included in the CSV files: -et_mad: Ensemble ET value, computed as the mean of the ensemble after filtering outliers using the median absolute deviation (MAD) -et_mad_count: The number of models used to compute the ensemble ET value after filtering for outliers using the MAD -et_mad_max: The maximum value in the ensemble range, after filtering for outliers using the MAD -et_mad_min: The minimum value in the ensemble range, after filtering for outliers using the MAD -et_sam: A simple arithmetic mean (across the 6 models) of actual ET average without outlier removal Below are the locations of each OpenET image collection used in this summary: DisALEXI: https://developers.google.com/earth-engine/datasets/catalog/OpenET_DISALEXI_CONUS_GRIDMET_MONTHLY_v2_0 eeMETRIC: https://developers.google.com/earth-engine/datasets/catalog/OpenET_EEMETRIC_CONUS_GRIDMET_MONTHLY_v2_0 geeSEBAL: https://developers.google.com/earth-engine/datasets/catalog/OpenET_GEESEBAL_CONUS_GRIDMET_MONTHLY_v2_0 PT-JPL: https://developers.google.com/earth-engine/datasets/catalog/OpenET_PTJPL_CONUS_GRIDMET_MONTHLY_v2_0 SIMS: https://developers.google.com/earth-engine/datasets/catalog/OpenET_SIMS_CONUS_GRIDMET_MONTHLY_v2_0 SSEBop: https://developers.google.com/earth-engine/datasets/catalog/OpenET_SSEBOP_CONUS_GRIDMET_MONTHLY_v2_0 Ensemble: https://developers.google.com/earth-engine/datasets/catalog/OpenET_ENSEMBLE_CONUS_GRIDMET_MONTHLY_v2_0

  13. The ORBIT (Object Recognition for Blind Image Training)-India Dataset

    • zenodo.org
    • data.niaid.nih.gov
    Updated Apr 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gesu India; Gesu India; Martin Grayson; Martin Grayson; Daniela Massiceti; Daniela Massiceti; Cecily Morrison; Cecily Morrison; Simon Robinson; Simon Robinson; Jennifer Pearson; Jennifer Pearson; Matt Jones; Matt Jones (2025). The ORBIT (Object Recognition for Blind Image Training)-India Dataset [Dataset]. http://doi.org/10.5281/zenodo.12608444
    Explore at:
    Dataset updated
    Apr 24, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Gesu India; Gesu India; Martin Grayson; Martin Grayson; Daniela Massiceti; Daniela Massiceti; Cecily Morrison; Cecily Morrison; Simon Robinson; Simon Robinson; Jennifer Pearson; Jennifer Pearson; Matt Jones; Matt Jones
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    India
    Description

    The ORBIT (Object Recognition for Blind Image Training) -India Dataset is a collection of 105,243 images of 76 commonly used objects, collected by 12 individuals in India who are blind or have low vision. This dataset is an "Indian subset" of the original ORBIT dataset [1, 2], which was collected in the UK and Canada. In contrast to the ORBIT dataset, which was created in a Global North, Western, and English-speaking context, the ORBIT-India dataset features images taken in a low-resource, non-English-speaking, Global South context, a home to 90% of the world’s population of people with blindness. Since it is easier for blind or low-vision individuals to gather high-quality data by recording videos, this dataset, like the ORBIT dataset, contains images (each sized 224x224) derived from 587 videos. These videos were taken by our data collectors from various parts of India using the Find My Things [3] Android app. Each data collector was asked to record eight videos of at least 10 objects of their choice.

    Collected between July and November 2023, this dataset represents a set of objects commonly used by people who are blind or have low vision in India, including earphones, talking watches, toothbrushes, and typical Indian household items like a belan (rolling pin), and a steel glass. These videos were taken in various settings of the data collectors' homes and workspaces using the Find My Things Android app.

    The image dataset is stored in the ‘Dataset’ folder, organized by folders assigned to each data collector (P1, P2, ...P12) who collected them. Each collector's folder includes sub-folders named with the object labels as provided by our data collectors. Within each object folder, there are two subfolders: ‘clean’ for images taken on clean surfaces and ‘clutter’ for images taken in cluttered environments where the objects are typically found. The annotations are saved inside a ‘Annotations’ folder containing a JSON file per video (e.g., P1--coffee mug--clean--231220_084852_coffee mug_224.json) that contains keys corresponding to all frames/images in that video (e.g., "P1--coffee mug--clean--231220_084852_coffee mug_224--000001.jpeg": {"object_not_present_issue": false, "pii_present_issue": false}, "P1--coffee mug--clean--231220_084852_coffee mug_224--000002.jpeg": {"object_not_present_issue": false, "pii_present_issue": false}, ...). The ‘object_not_present_issue’ key is True if the object is not present in the image, and the ‘pii_present_issue’ key is True, if there is a personally identifiable information (PII) present in the image. Note, all PII present in the images has been blurred to protect the identity and privacy of our data collectors. This dataset version was created by cropping images originally sized at 1080 × 1920; therefore, an unscaled version of the dataset will follow soon.

    This project was funded by the Engineering and Physical Sciences Research Council (EPSRC) Industrial ICASE Award with Microsoft Research UK Ltd. as the Industrial Project Partner. We would like to acknowledge and express our gratitude to our data collectors for their efforts and time invested in carefully collecting videos to build this dataset for their community. The dataset is designed for developing few-shot learning algorithms, aiming to support researchers and developers in advancing object-recognition systems. We are excited to share this dataset and would love to hear from you if and how you use this dataset. Please feel free to reach out if you have any questions, comments or suggestions.

    REFERENCES:

    1. Daniela Massiceti, Lida Theodorou, Luisa Zintgraf, Matthew Tobias Harris, Simone Stumpf, Cecily Morrison, Edward Cutrell, and Katja Hofmann. 2021. ORBIT: A real-world few-shot dataset for teachable object recognition collected from people who are blind or low vision. DOI: https://doi.org/10.25383/city.14294597

    2. microsoft/ORBIT-Dataset. https://github.com/microsoft/ORBIT-Dataset

    3. Linda Yilin Wen, Cecily Morrison, Martin Grayson, Rita Faia Marques, Daniela Massiceti, Camilla Longden, and Edward Cutrell. 2024. Find My Things: Personalized Accessibility through Teachable AI for People who are Blind or Low Vision. In Extended Abstracts of the 2024 CHI Conference on Human Factors in Computing Systems (CHI EA '24). Association for Computing Machinery, New York, NY, USA, Article 403, 1–6. https://doi.org/10.1145/3613905.3648641

  14. H

    Web-based GIS for spatiotemporal crop climate niche mapping

    • dataverse.harvard.edu
    Updated Jul 8, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    B. G. Peter; J. P. Messina; Z. Lin (2020). Web-based GIS for spatiotemporal crop climate niche mapping [Dataset]. http://doi.org/10.7910/DVN/UFC6B5
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 8, 2020
    Dataset provided by
    Harvard Dataverse
    Authors
    B. G. Peter; J. P. Messina; Z. Lin
    License

    https://dataverse.harvard.edu/api/datasets/:persistentId/versions/3.1/customlicense?persistentId=doi:10.7910/DVN/UFC6B5https://dataverse.harvard.edu/api/datasets/:persistentId/versions/3.1/customlicense?persistentId=doi:10.7910/DVN/UFC6B5

    Description

    Web-based GIS for spatiotemporal crop climate niche mapping Interactive Google Earth Engine Application—Version 2, July 2020 https://cropniche.cartoscience.com https://cartoscience.users.earthengine.app/view/crop-niche Google Earth Engine Code /* ---------------------------------------------------------------------------------------------------------------------- # CropSuit-GEE Authors: Brad G. Peter (bpeter@ua.edu), Joseph P. Messina, and Zihan Lin Organizations: BGP, JPM - University of Alabama; ZL - Michigan State University Last Modified: 06/28/2020 To cite this code use: Peter, B. G.; Messina, J. P.; Lin, Z., 2019, "Web-based GIS for spatiotemporal crop climate niche mapping", https://doi.org/10.7910/DVN/UFC6B5, Harvard Dataverse, V1 ------------------------------------------------------------------------------------------------------------------------- This is a Google Earth Engine crop climate suitability geocommunication and map export tool designed to support agronomic development and deployment of improved crop system technologies. This content is made possible by the support of the American People provided to the Feed the Future Innovation Lab for Sustainable Intensification through the United States Agency for International Development (USAID). The contents are the sole responsibility of the authors and do not necessarily reflect the views of USAID or the United States Government. Program activities are funded by USAID under Cooperative Agreement No. AID-OAA-L-14-00006. ------------------------------------------------------------------------------------------------------------------------- Summarization of input options: There are 14 user options available. The first is a country of interest selection using a 2-digit FIPS code (link available below). This selection is used to produce a rectangular bounding box for export; however, other geometries can be selected with minimal modification to the code. Options 2 and 3 specify the complete temporal range for aggregation (averaged across seasons; single seasons may also be selected). Options 4–7 specify the growing season for calculating total seasonal rainfall and average season temperatures and NDVI (NDVI is for export only and is not used in suitability determination). Options 8–11 specify the climate parameters for the crop of interest (rainfall and temperature max/min). Option 12 enables masking to agriculture, 13 enables exporting of all data layers, and 14 is a text string for naming export files. ------------------------------------------------------------------------------------------------------------------------- ••••••••••••••••••••••••••••••••••••••••••• USER OPTIONS ••••••••••••••••••••••••••••••••••••••••••••••••••••••••••••• */ // CHIRPS data availability: https://developers.google.com/earth-engine/datasets/catalog/UCSB-CHG_CHIRPS_PENTAD // MOD11A2 data availability: https://developers.google.com/earth-engine/datasets/catalog/MODIS_006_MOD11A2 var country = 'MI' // [1] https://en.wikipedia.org/wiki/List_of_FIPS_country_codes var startRange = 2001 // [2] var endRange = 2017 // [3] var startSeasonMonth = 11 // [4] var startSeasonDay = 1 // [5] var endSeasonMonth = 4 // [6] var endSeasonDay = 30 // [7] var precipMin = 750 // [8] var precipMax = 1200 // [9] var tempMin = 22 // [10] var tempMax = 32 // [11] var maskToAg = 'TRUE' // [12] 'TRUE' (default) or 'FALSE' var exportLayers = 'TRUE' // [13] 'TRUE' (default) or 'FALSE' var exportNameHeader = 'crop_suit_maize' // [14] text string for naming export file // ••••••••••••••••••••••••••••••••• NO USER INPUT BEYOND THIS POINT •••••••••••••••••••••••••••••••••••••••••••••••••••• // Access precipitation and temperature ImageCollections and a global countries FeatureCollection var region = ee.FeatureCollection('USDOS/LSIB_SIMPLE/2017') .filterMetadata('country_co','equals',country) var precip = ee.ImageCollection('UCSB-CHG/CHIRPS/PENTAD').select('precipitation') var temp = ee.ImageCollection('MODIS/006/MOD11A2').select(['LST_Day_1km','LST_Night_1km']) var ndvi = ee.ImageCollection('MODIS/006/MOD13Q1').select(['NDVI']) // Create layers for masking to agriculture and masking out water bodies var waterMask = ee.Image('UMD/hansen/global_forest_change_2015').select('datamask').eq(1) var agModis = ee.ImageCollection('MODIS/006/MCD12Q1').select('LC_Type1').mode() .remap([1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17], [0,0,0,0,0,0,0,0,0,0,0,1,0,1,0,0,0]) var agGC = ee.Image('ESA/GLOBCOVER_L4_200901_200912_V2_3').select('landcover') .remap([11,14,20,30,40,50,60,70,90,100,110,120,130,140,150,160,170,180,190,200,210,220,230], [1,1,1,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0]) var cropland = ee.Image('USGS/GFSAD1000_V1').neq(0) var agMask = agModis.add(agGC).add(cropland).gt(0).eq(1) // Modify user input options for processing with raw data var years = ee.List.sequence(startRange,endRange) var bounds = region.geometry().bounds() var tMinMod = (tempMin+273.15)/0.02 var tMaxMod = (tempMax+273.15)/0.02 //...

  15. Used Cars Sales Listings Dataset 2025

    • kaggle.com
    Updated Aug 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pratyush Puri (2025). Used Cars Sales Listings Dataset 2025 [Dataset]. https://www.kaggle.com/datasets/pratyushpuri/used-car-sales-listings-dataset-2025
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 12, 2025
    Dataset provided by
    Kaggle
    Authors
    Pratyush Puri
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Luxury Cosmetics Pop‑Up Events Dataset

    A comprehensive, real-world–anchored synthetic dataset capturing 2,133 luxury beauty pop-up events across global retail hotspots. It focuses on limited-edition product drops, experiential formats, and performance KPIs—especially footfall and sell‑through. The data is designed for analytics use cases such as demand forecasting, footfall modeling, merchandising optimization, pricing analysis, and market expansion studies across regions and venue types.

    What this dataset contains

    • 2,133 events from global hubs across North America, Europe, Middle East, Asia‑Pacific, and Latin America
    • Luxury/premium cosmetics brands and their limited‑release SKUs
    • Event formats and retail venue archetypes typical of pop‑up retail
    • Time windows and lease lengths aligned with short‑term pop‑up activations
    • Core commercial KPIs: price, units sold, sell‑through percentage
    • Footfall KPI: average daily footfall modeled by location/format/marketing intensity

    Ideal use cases

    • Pop‑up ROI and performance benchmarking by brand, city, venue type, and format
    • Footfall prediction and location strategy (high‑street vs mall vs airport vs districts)
    • Limited-edition launch analytics: pricing vs sell‑through dynamics
    • Event planning: lease length and timing windows vs outcomes
    • Territory planning: region and city segmentation performance
    • Portfolio dashboards: cross‑brand comparisons and trend reporting

    File formats

    • CSV, JSON, XLSX, and SQLite (table: popups)

    Target users

    • Retail strategy and analytics teams
    • Growth, trade marketing, and brand managers
    • Data scientists building forecasting and optimization models
    • BI developers building dashboards for pop‑up performance

    Column Dictionary

    ColumnTypeExampleDescription
    event_idstringPOP100282Unique identifier for each pop‑up event.
    brandstringCharlotte TilburyLuxury/premium cosmetics brand running the pop‑up.
    regionstringNorth AmericaMacro market region (North America, Europe, Middle East, Asia‑Pacific, Latin America).
    citystringMiamiCity of the event; occasionally null to simulate real‑world data gaps.
    location_typestringArt/Design DistrictVenue archetype: High‑Street, Luxury Mall, Dept Store Atrium, Airport Duty‑Free, Art/Design District.
    event_typestringFlash EventPop‑up format: Standalone, Shop‑in‑Shop, Mobile Truck, Flash Event, Mall Kiosk.
    start_datedate2024-02-25Event start date.
    end_datedate2024-03-02Event end date; can be null (e.g., ongoing/TBC) to reflect operational uncertainty.
    lease_length_daysinteger6Duration of the activation (days), aligned with short‑term pop‑up leases.
    skustringLE-UQYNQA1ALimited‑release product code tied to the event/dataset scope.
    product_namestringCharlotte Tilbury Glow MascaraBranded product listing (luxury‑oriented descriptors + category).
    price_usdfloat62.21Ticket price (USD) aligned with luxury cosmetics price bands by category.
    avg_daily_footfallinteger1107Estimated average daily visitors based on venue, format, and activation intensity.
    units_soldinteger3056Total units sold during the event window; capped by allocation dynamics.
    sell_through_pctfloat98.9Share of allocated inventory sold (%), proxy for demand strength and launch success.

    Data quality notes

    • City and end_date contain a small proportion of nulls to reflect real‑world reporting gaps (e.g., ongoing events).
    • avg_daily_footfall varies by locati...
  16. World Bank Projects & Operations

    • kaggle.com
    zip
    Updated Jan 5, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    World Bank (2019). World Bank Projects & Operations [Dataset]. https://www.kaggle.com/datasets/theworldbank/world-bank-projects-operations/versions/112
    Explore at:
    zip(0 bytes)Available download formats
    Dataset updated
    Jan 5, 2019
    Dataset authored and provided by
    World Bankhttp://topics.nytimes.com/top/reference/timestopics/organizations/w/world_bank/index.html
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Content

    World Bank Projects & Operations provides access to basic information on all of the World Bank's lending projects from 1947 to the present. The dataset includes basic information such as the project title, task manager, country, project id, sector, themes, commitment amount, product line, procurement notices, contract awards, and financing. It also provides links to publicly disclosed online documents.

    For older projects, there is a link to the Archives catalog, which contains records of older documents. Where available, there are also links to contract awards since July 2000.

    Context

    This is a dataset hosted by the World Bank. The organization has an open data platform found here and they update their information according the amount of data that is brought in. Explore the World Bank using Kaggle and all of the data sources available through the World Bank organization page!

    • Update Frequency: This dataset is updated daily.

    Acknowledgements

    This dataset is maintained using the World Bank's APIs and Kaggle's API.

    Cover photo by rawpixel on Unsplash
    Unsplash Images are distributed under a unique Unsplash License.

  17. GovData360

    • kaggle.com
    zip
    Updated May 15, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    World Bank (2019). GovData360 [Dataset]. https://www.kaggle.com/theworldbank/govdata360
    Explore at:
    zip(29922056 bytes)Available download formats
    Dataset updated
    May 15, 2019
    Dataset authored and provided by
    World Bankhttp://topics.nytimes.com/top/reference/timestopics/organizations/w/world_bank/index.html
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Content

    GovData360 is a compendium of the most important governance indicators, from 26 datasets with worldwide coverage and more than 10 years of info, designed to provide guidance on the design of reforms and the monitoring of impacts. We have an Unbalanced Panel Data by Dataset - Country for around 3260 governance focused indicators.

    Context

    This is a dataset hosted by the World Bank. The organization has an open data platform found here and they update their information according the amount of data that is brought in. Explore the World Bank using Kaggle and all of the data sources available through the World Bank organization page!

    • Update Frequency: This dataset is updated daily.

    Acknowledgements

    This dataset is maintained using the World Bank's APIs and Kaggle's API.

    Cover photo by John Jason on Unsplash
    Unsplash Images are distributed under a unique Unsplash License.

  18. d

    Global Distribution of Economic Activity - Dataset - waterdata

    • waterdata3.staging.derilinx.com
    • wbwaterdata.org
    Updated Mar 16, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2020). Global Distribution of Economic Activity - Dataset - waterdata [Dataset]. https://waterdata3.staging.derilinx.com/dataset/natural-endowment-measuring-economic-growth-outer-space
    Explore at:
    Dataset updated
    Mar 16, 2020
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Data for replicating The Global Spatial Distribution of Economic Activity: Nature, History, and the Role of Trade (forthcoming 2018; with Vernon Henderson, Tim Squires and David N. Weil) Quarterly Journal of Economics We explore the role of natural characteristics in determining the worldwide spatial distribution of economic activity, as proxied by lights at night, observed across 240,000 grid cells. A parsimonious set of 24 physical geography attributes explains 47% of worldwide variation and 35% of within-country variation in lights. We divide geographic characteristics into two groups, those primarily important for agriculture and those primarily important for trade, and confront a puzzle. In examining within-country variation in lights, among countries that developed early, agricultural variables incrementally explain over 6 times as much variation in lights as do trade variables, while among late developing countries the ratio is only about 1.5, even though the latter group is far more dependent on agriculture. Correspondingly, the marginal effects of agricultural variables as a group on lights are larger in absolute value, and those for trade smaller, for early developers than for late developers. We show that this apparent puzzle is explained by persistence and the differential timing of technological shocks in the two sets of countries. For early developers, structural transformation due to rising agricultural productivity began when transport costs were still high, so cities were localized in agricultural regions. When transport costs fell, these agglomerations persisted. In late-developing countries, transport costs fell before structural transformation. To exploit urban scale economies, manufacturing agglomerated in relatively few, often coastal, locations. Consistent with this explanation, countries that developed earlier are more spatially equal in their distribution of education and economic activity than late developers. This dataset is part of the Global Research Program on Spatial Development of Cities funded by the Multi-Donor Trust Fund on Sustainable Urbanization of the World Bank and supported by the U.K. Department for International Development.

  19. EOD – eBird Observation Dataset

    • gbif.org
    • canadensys.net
    • +2more
    Updated Sep 1, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jasdev Imani; Christine Audette; Thomas Auer; Sara Barker; Jessie Barry; Mike Charnoky; Cynthia Crowley; Jenna Curtis; Ian Davies; Courtney Davis; Renata Diaz; Alexander Feinberg; Daniel Fink; Joshua Ganger; John Garrett; Jeff Gerbracht; Cullen Hanks; Mike Hayes; Wesley Hochachka; Marshall Iliff; Adam Jordan; Shawn Ligocki; Taylor Long; William Morris; Stephen Morrow; Lauren Oldham; Francisco Padilla Obregon; Orin Robinson; Amanda Rodewald; Viviana Ruiz-Gutierrez; Matt Schloss; Alli Smith; Jeremy Smith; Andrew Stillman; Myles Stokowski; Matt Strimas-Mackey; Brian Sullivan; Alexander Tedeschi; Drew Weber; Heather Wolf; Christopher Wood; Jasdev Imani; Christine Audette; Thomas Auer; Sara Barker; Jessie Barry; Mike Charnoky; Cynthia Crowley; Jenna Curtis; Ian Davies; Courtney Davis; Renata Diaz; Alexander Feinberg; Daniel Fink; Joshua Ganger; John Garrett; Jeff Gerbracht; Cullen Hanks; Mike Hayes; Wesley Hochachka; Marshall Iliff; Adam Jordan; Shawn Ligocki; Taylor Long; William Morris; Stephen Morrow; Lauren Oldham; Francisco Padilla Obregon; Orin Robinson; Amanda Rodewald; Viviana Ruiz-Gutierrez; Matt Schloss; Alli Smith; Jeremy Smith; Andrew Stillman; Myles Stokowski; Matt Strimas-Mackey; Brian Sullivan; Alexander Tedeschi; Drew Weber; Heather Wolf; Christopher Wood (2025). EOD – eBird Observation Dataset [Dataset]. http://doi.org/10.15468/aomfnb
    Explore at:
    Dataset updated
    Sep 1, 2025
    Dataset provided by
    Cornell Lab of Ornithologyhttp://birds.cornell.edu/
    Global Biodiversity Information Facilityhttps://www.gbif.org/
    Authors
    Jasdev Imani; Christine Audette; Thomas Auer; Sara Barker; Jessie Barry; Mike Charnoky; Cynthia Crowley; Jenna Curtis; Ian Davies; Courtney Davis; Renata Diaz; Alexander Feinberg; Daniel Fink; Joshua Ganger; John Garrett; Jeff Gerbracht; Cullen Hanks; Mike Hayes; Wesley Hochachka; Marshall Iliff; Adam Jordan; Shawn Ligocki; Taylor Long; William Morris; Stephen Morrow; Lauren Oldham; Francisco Padilla Obregon; Orin Robinson; Amanda Rodewald; Viviana Ruiz-Gutierrez; Matt Schloss; Alli Smith; Jeremy Smith; Andrew Stillman; Myles Stokowski; Matt Strimas-Mackey; Brian Sullivan; Alexander Tedeschi; Drew Weber; Heather Wolf; Christopher Wood; Jasdev Imani; Christine Audette; Thomas Auer; Sara Barker; Jessie Barry; Mike Charnoky; Cynthia Crowley; Jenna Curtis; Ian Davies; Courtney Davis; Renata Diaz; Alexander Feinberg; Daniel Fink; Joshua Ganger; John Garrett; Jeff Gerbracht; Cullen Hanks; Mike Hayes; Wesley Hochachka; Marshall Iliff; Adam Jordan; Shawn Ligocki; Taylor Long; William Morris; Stephen Morrow; Lauren Oldham; Francisco Padilla Obregon; Orin Robinson; Amanda Rodewald; Viviana Ruiz-Gutierrez; Matt Schloss; Alli Smith; Jeremy Smith; Andrew Stillman; Myles Stokowski; Matt Strimas-Mackey; Brian Sullivan; Alexander Tedeschi; Drew Weber; Heather Wolf; Christopher Wood
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    Jan 1, 1800 - Dec 31, 2024
    Area covered
    Description

    eBird is a collective enterprise that takes a novel approach to citizen science by developing cooperative partnerships among experts in a wide range of fields: population ecologists, conservation biologists, quantitative ecologists, statisticians, computer scientists, GIS and informatics specialists, application developers, and data administrators. Managed by the Cornell Lab of Ornithology eBird’s goal is to increase data quantity through participant recruitment and engagement globally, but also to quantify and control for data quality issues such as observer variability, imperfect detection of species, and both spatial and temporal bias in data collection. eBird data are openly available and used by a broad spectrum of students, teachers, scientists, NGOs, government agencies, land managers, and policy makers. The result is that eBird has become a major source of biodiversity data, increasing our knowledge of the dynamics of species distributions, and having a direct impact on the conservation of birds and their habitats.

  20. World Bank Quarterly External Debt Statistics

    • kaggle.com
    zip
    Updated May 4, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    World Bank (2019). World Bank Quarterly External Debt Statistics [Dataset]. https://www.kaggle.com/theworldbank/world-bank-quarterly-external-debt-statistics
    Explore at:
    zip(11652734 bytes)Available download formats
    Dataset updated
    May 4, 2019
    Dataset authored and provided by
    World Bankhttp://topics.nytimes.com/top/reference/timestopics/organizations/w/world_bank/index.html
    License

    Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
    License information was derived automatically

    Description

    Content

    More details about each file are in the individual file descriptions.

    Context

    This is a dataset hosted by the World Bank. The organization has an open data platform found here and they update their information according the amount of data that is brought in. Explore the World Bank using Kaggle and all of the data sources available through the World Bank organization page!

    • Update Frequency: This dataset is updated daily.

    Acknowledgements

    This dataset is maintained using the World Bank's APIs and Kaggle's API.

    Cover photo by Markus Spiske on Unsplash
    Unsplash Images are distributed under a unique Unsplash License.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Google, Dynamic World V1 [Dataset]. http://doi.org/10.1038/s41597-022-01307-4
Organization logo

Dynamic World V1

Explore at:
Dataset provided by
Googlehttp://google.com/
World Resources Institute
Time period covered
Jun 27, 2015 - Sep 22, 2025
Area covered
Earth
Description

Dynamic World is a 10m near-real-time (NRT) Land Use/Land Cover (LULC) dataset that includes class probabilities and label information for nine classes. Dynamic World predictions are available for the Sentinel-2 L1C collection from 2015-06-27 to present. The revisit frequency of Sentinel-2 is between 2-5 days depending on latitude. Dynamic World predictions are generated for Sentinel-2 L1C images with CLOUDY_PIXEL_PERCENTAGE <= 35%. Predictions are masked to remove clouds and cloud shadows using a combination of S2 Cloud Probability, Cloud Displacement Index, and Directional Distance Transform. Images in the Dynamic World collection have names matching the individual Sentinel-2 L1C asset names from which they were derived, e.g: ee.Image('COPERNICUS/S2/20160711T084022_20160711T084751_T35PKT') has a matching Dynamic World image named: ee.Image('GOOGLE/DYNAMICWORLD/V1/20160711T084022_20160711T084751_T35PKT'). All probability bands except the "label" band collectively sum to 1. To learn more about the Dynamic World dataset and see examples for generating composites, calculating regional statistics, and working with the time series, see the Introduction to Dynamic World tutorial series. Given Dynamic World class estimations are derived from single images using a spatial context from a small moving window, top-1 "probabilities" for predicted land covers that are in-part defined by cover over time, like crops, can be comparatively low in the absence of obvious distinguishing features. High-return surfaces in arid climates, sand, sunglint, etc may also exhibit this phenomenon. To select only pixels that confidently belong to a Dynamic World class, it is recommended to mask Dynamic World outputs by thresholding the estimated "probability" of the top-1 prediction.

Search
Clear search
Close search
Google apps
Main menu