Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This data set includes daily, population-weighted mean values of various heat metrics for every county in the contiguous United States from 2000-2020. The dataset methodology, usage notes, and additional citations are published in Scientific Data (see reference below for Spangler et al. [2022]). Minimum, maximum, and mean ambient temperature, dew-point temperature, humidex, heat index, net effective temperature, wet-bulb globe temperature, and Universal Thermal Climate Index are included. Note that Monroe County, Florida (FIPS: 12087) and Nantucket County, Massachusetts (FIPS 25019) are missing due to unavailability of ERA5-Land data for Key West, Florida and Nantucket, MA. To use these data, assign the data from the .Rds file to a new data frame in R using the readRDS() function. Please cite the use of this data set with the following reference. Note that additional citations for specific variables can be found in Table 2.
K.R. Spangler, S. Liang, and G.A. Wellenius. "Wet-Bulb Globe Temperature, Universal Thermal Climate Index, and Other Heat Metrics for US Counties, 2000-2020." Scientific Data (2022). doi: 10.1038/s41597-022-01405-3
This data set contains modified Copernicus Climate Change Service information (2022), as described and cited in the manuscript referenced above. Neither the European Commission nor ECMWF is responsible for any use that may be made of the Copernicus information or data it contains. This data set is provided “as is” with no warranty of any kind.
This dataset includes links to the PoroTomo DAS data in both SEG-Y and hdf5 (via h5py and HSDS with h5pyd) formats with tutorial notebooks for use. Data are hosted on Amazon Web Services (AWS) Simple Storage Service (S3) through the Open Energy Data Initiative (OEDI). Also included are links to the documentation for the dataset, Jupyter Notebook tutorials for working with the data as it is stored in AWS S3, and links to data viewers in OEDI for the horizontal (DASH) and vertical (DASV) DAS datasets. Horizontal DAS (DASH) data collection began 3/8/16, paused, and then started again on 3/11/2016 and ended 3/26/2016 using zigzag trenched fiber optic cabels. Vertical DAS (DASV) data collection began 3/17/2016 and ended 3/28/16 using a fiber optic cable through the first 363 m of a vertical well. These are raw data files from the DAS deployment at (DASH) and below (DASV) the surface during testing at the PoroTomo Natural Laboratory at Brady Hot Spring in Nevada. SEG-Y and hdf5 files are stored in 30 second files organized into directories by day. The hdf5 files available via HSDS are stored in daily files. Metadata includes information on the timing of recording gaps and a file count is included that lists the number of files from each day of recording. These data are available for download without login credentials through the free and publicly accessible Open Energy Data Initiative (OEDI) data viewer which allows users to browse and download individual or groups of files.
State and territorial executive orders, administrative orders, resolutions, and proclamations are collected from government websites and cataloged and coded using Microsoft Excel by one coder with one or more additional coders conducting quality assurance.
Data were collected to determine when individuals in states and territories were subject to executive orders, administrative orders, resolutions, and proclamations for COVID-19 that require or recommend people stay in their homes. Data consists exclusively of state and territorial orders, many of which apply to specific counties within their respective state or territory; therefore, data is broken down to the county level.
These data are derived from the publicly available state and territorial executive orders, administrative orders, resolutions, and proclamations (“orders”) for COVID-19 that expressly require or recommend individuals stay at home found by the CDC, COVID-19 Community Intervention and At-Risk Task Force, Monitoring and Evaluation Team & CDC, Center for State, Tribal, Local, and Territorial Support, Public Health Law Program from March 15, 2020 through August 15, 2021. These data will be updated as new orders are collected. Any orders not available through publicly accessible websites are not included in these data. Only official copies of the documents or, where official copies were unavailable, official press releases from government websites describing requirements were coded; news media reports on restrictions were excluded. Recommendations not included in an order are not included in these data. These data do not include mandatory business closures, curfews, or limitations on public or private gatherings. These data do not necessarily represent an official position of the Centers for Disease Control and Prevention.
Terms of UseData Limitations and DisclaimerThe user’s use of and/or reliance on the information contained in the Document shall be at the user’s own risk and expense. MassDEP disclaims any responsibility for any loss or harm that may result to the user of this data or to any other person due to the user’s use of the Document.This is an ongoing data development project. Attempts have been made to contact all PWS systems, but not all have responded with information on their service area. MassDEP will continue to collect and verify this information. Some PWS service areas included in this datalayer have not been verified by the PWS or the municipality involved, but since many of those areas are based on information published online by the municipality, the PWS, or in a publicly available report, they are included in the estimated PWS service area datalayer.Please note: All PWS service area delineations are estimates for broad planning purposes and should only be used as a guide. The data is not appropriate for site-specific or parcel-specific analysis. Not all properties within a PWS service area are necessarily served by the system, and some properties outside the mapped service areas could be served by the PWS – please contact the relevant PWS. Not all service areas have been confirmed by the systems.Please use the following citation to reference these data:MassDEP, Water Utility Resilience Program. 2025. Community and Non-Transient Non-Community Public Water System Service Area (PubV2025_3).IMPORTANT NOTICE: This MassDEP Estimated Water Service datalayer may not be complete, may contain errors, omissions, and other inaccuracies and the data are subject to change. This version is published through MassGIS. We want to learn about the data uses. If you use this dataset, please notify staff in the Water Utility Resilience Program (WURP@mass.gov).This GIS datalayer represents approximate service areas for Public Water Systems (PWS) in Massachusetts. In 2017, as part of its “Enhancing Resilience and Emergency Preparedness of Water Utilities through Improved Mapping” (Critical Infrastructure Mapping Project ), the MassDEP Water Utility Resilience Program (WURP) began to uniformly map drinking water service areas throughout Massachusetts using information collected from various sources. Along with confirming existing public water system (PWS) service area information, the project collected and verified estimated service area delineations for PWSs not previously delineated and will continue to update the information contained in the datalayers. As of the date of publication, WURP has delineated Community (COM) and Non-Transient Non-Community (NTNC) service areas. Transient non-community (TNCs) are not part of this mapping project.Layers and Tables:The MassDEP Estimated Public Water System Service Area data comprises two polygon feature classes and a supporting table. Some data fields are populated from the MassDEP Drinking Water Program’s Water Quality Testing System (WQTS) and Annual Statistical Reports (ASR).The Community Water Service Areas feature class (PWS_WATER_SERVICE_AREA_COMM_POLY) includes polygon features that represent the approximate service areas for PWS classified as Community systems.The NTNC Water Service Areas feature class (PWS_WATER_SERVICE_AREA_NTNC_POLY) includes polygon features that represent the approximate service areas for PWS classified as Non-Transient Non-Community systems.The Unlocated Sites List table (PWS_WATER_SERVICE_AREA_USL) contains a list of known, unmapped active Community and NTNC PWS services areas at the time of publication.ProductionData UniversePublic Water Systems in Massachusetts are permitted and regulated through the MassDEP Drinking Water Program. The WURP has mapped service areas for all active and inactive municipal and non-municipal Community PWSs in MassDEP’s Water Quality Testing Database (WQTS). Community PWS refers to a public water system that serves at least 15 service connections used by year-round residents or regularly serves at least 25 year-round residents.All active and inactive NTNC PWS were also mapped using information contained in WQTS. An NTNC or Non-transient Non-community Water System refers to a public water system that is not a community water system and that has at least 15 service connections or regularly serves at least 25 of the same persons or more approximately four or more hours per day, four or more days per week, more than six months or 180 days per year, such as a workplace providing water to its employees.These data may include declassified PWSs. Staff will work to rectify the status/water services to properties previously served by declassified PWSs and remove or incorporate these service areas as needed.Maps of service areas for these systems were collected from various online and MassDEP sources to create service areas digitally in GIS. Every PWS is assigned a unique PWSID by MassDEP that incorporates the municipal ID of the municipality it serves (or the largest municipality it serves if it serves multiple municipalities). Some municipalities contain more than one PWS, but each PWS has a unique PWSID. The Estimated PWS Service Area datalayer, therefore, contains polygons with a unique PWSID for each PWS service area.A service area for a community PWS may serve all of one municipality (e.g. Watertown Water Department), multiple municipalities (e.g. Abington-Rockland Joint Water Works), all or portions of two or more municipalities (e.g. Provincetown Water Dept which serves all of Provincetown and a portion of Truro), or a portion of a municipality (e.g. Hyannis Water System, which is one of four PWSs in the town of Barnstable).Some service areas have not been mapped but their general location is represented by a small circle which serves as a placeholder. The location of these circles are estimates based on the general location of the source wells or the general estimated location of the service area - these do not represent the actual service area.Service areas were mapped initially from 2017 to 2022 and reflect varying years for which service is implemented for that service area boundary. WURP maintains the dataset quarterly with annual data updates; however, the dataset may not include all current active PWSs. A list of unmapped PWS systems is included in the USL table PWS_WATER_SERVICE_AREA_USL available for download with the dataset. Some PWSs that are not mapped may have come online after this iteration of the mapping project; these will be reconciled and mapped during the next phase of the WURP project. PWS IDs that represent regional or joint boards with (e.g. Tri Town Water Board, Randolph/Holbrook Water Board, Upper Cape Regional Water Cooperative) will not be mapped because their individual municipal service areas are included in this datalayer.PWSs that do not have corresponding sources, may be part of consecutive systems, may have been incorporated into another PWSs, reclassified as a different type of PWS, or otherwise taken offline. PWSs that have been incorporated, reclassified, or taken offline will be reconciled during the next data update.Methodologies and Data SourcesSeveral methodologies were used to create service area boundaries using various sources, including data received from the systems in response to requests for information from the MassDEP WURP project, information on file at MassDEP, and service area maps found online at municipal and PWS websites. When provided with water line data rather than generalized areas, 300-foot buffers were created around the water lines to denote service areas and then edited to incorporate generalizations. Some municipalities submitted parcel data or address information to be used in delineating service areas.Verification ProcessSmall-scale PDF file maps with roads and other infrastructure were sent to every PWS for corrections or verifications. For small systems, such as a condominium complex or residential school, the relevant parcels were often used as the basis for the delineated service area. In towns where 97% or more of their population is served by the PWS and no other service area delineation was available, the town boundary was used as the service area boundary. Some towns responded to the request for information or verification of service areas by stating that the town boundary should be used since all or nearly all of the municipality is served by the PWS.Sources of information for estimated drinking water service areasThe following information was used to develop estimated drinking water service areas:EOEEA Water Assets Project (2005) water lines (these were buffered to create service areas)Horsely Witten Report 2008Municipal Master Plans, Open Space Plans, Facilities Plans, Water Supply System Webpages, reports and online interactive mapsGIS data received from PWSDetailed infrastructure mapping completed through the MassDEP WURP Critical Infrastructure InitiativeIn the absence of other service area information, for municipalities served by a town-wide water system serving at least 97% of the population, the municipality’s boundary was used. Determinations of which municipalities are 97% or more served by the PWS were made based on the Percent Water Service Map created in 2018 by MassDEP based on various sources of information including but not limited to:The Winter population served submitted by the PWS in the ASR submittalThe number of services from WQTS as a percent of developed parcelsTaken directly from a Master Plan, Water Department Website, Open Space Plan, etc. found onlineCalculated using information from the town on the population servedMassDEP staff estimateHorsely Witten Report 2008Calculation based on Water System Areas Mapped through MassDEP WURP Critical Infrastructure Initiative, 2017-2022Information found in publicly available PWS planning documents submitted to MassDEP or as part of infrastructure planningMaintenanceThe
https://digital.nhs.uk/about-nhs-digital/terms-and-conditionshttps://digital.nhs.uk/about-nhs-digital/terms-and-conditions
This is the latest statistical publication of linked HES (Hospital Episode Statistics) and DID (Diagnostic Imaging Dataset) data held by the Health and Social Care Information Centre. The HES-DID linkage provides the ability to undertake national (within England) analysis along acute patient pathways to understand typical imaging requirements for given procedures, and/or the outcomes after particular imaging has been undertaken, thereby enabling a much deeper understanding of outcomes of imaging and to allow assessment of variation in practice. This publication aims to highlight to users the availability of this updated linkage and provide users of the data with some standard information to assess their analysis approach against. The two data sets have been linked using specific patient identifiers collected in HES and DID. The linkage allows the data sets to be linked from April 2012 when the DID data was first collected; however this report focuses on patients who were present in either data set for the period April 2015-February 2016 only. For DID this is provisional 2015/16 data. For HES this is provisional 2015/16 data. The linkage used for this publication was created on 06 June 2016 and released together with this publication on 07 July 2016.
The survey was conducted in Uganda between January 2013 and August 2013 as part of the Africa Enterprise Survey 2013 roll-out, an initiative of the World Bank. Data from 640 establishments was analyzed. Stratified random sampling was used to select the surveyed businesses.
The objective of the survey is to obtain feedback from enterprises on the state of the private sector as well as to help in building a panel of enterprise data that will make it possible to track changes in the business environment over time, thus allowing, for example, impact assessments of reforms. Through interviews with firms in the manufacturing and services sectors, the survey assesses the constraints to private sector growth and creates statistically significant business environment indicators that are comparable across countries.
The standard Enterprise Survey topics include firm characteristics, gender participation, access to finance, annual sales, costs of inputs and labor, workforce composition, bribery, licensing, infrastructure, trade, crime, competition, capacity utilization, land and permits, taxation, informality, business-government relations, innovation and technology, and performance measures. Over 90 percent of the questions objectively ascertain characteristics of a country’s business environment. The remaining questions assess the survey respondents’ opinions on what are the obstacles to firm growth and performance. The mode of data collection is face-to-face interviews.
National
The primary sampling unit of the study is an establishment. An establishment is a physical location where business is carried out and where industrial operations take place or services are provided. A firm may be composed of one or more establishments. For example, a brewery may have several bottling plants and several establishments for distribution. For the purposes of this survey an establishment must make its own financial decisions and have its own financial statements separate from those of the firm. An establishment must also have its own management and control over its payroll.
The whole population, or the universe, covered in the Enterprise Surveys is the non-agricultural private economy. It comprises: all manufacturing sectors according to the ISIC Revision 3.1 group classification (group D), construction sector (group F), services sector (groups G and H), and transport, storage, and communications sector (group I). Note that this population definition excludes the following sectors: financial intermediation (group J), real estate and renting activities (group K, except sub-sector 72, IT, which was added to the population under study), and all public or utilities sectors. Companies with 100% government ownership are not eligible to participate in the Enterprise Surveys.
Sample survey data [ssd]
The sample for Uganda was selected using stratified random sampling. Three levels of stratification were used in this country: firm sector, firm size, and geographic region.
Industry stratification was designed in the way that follows: the universe was stratified into three manufacturing industry (food, textiles and garments, other manufacturing) and two service sectors (retail and other services).
Size stratification was defined following the standardized definition for the Enterprise Surveys: small (5 to 19 employees), medium (20 to 99 employees), and large (more than 99 employees). For stratification purposes, the number of employees was defined on the basis of reported permanent full-time workers. This seems to be an appropriate definition of the labor force since seasonal/casual/part-time employment is not a common practice, except in the sectors of construction and agriculture.
Regional stratification for the Uganda ES was defined in six regions (city and the surrounding business area): Jinja, Kampala, Lira, Mbale, Mbarara, and Wakiso.
Uganda Bureau of Statistics database was used as a sampling frame with the aim of obtaining interviews with 600 establishments.
Given the impact that non-eligible units included in the sample universe may have on the results, adjustments may be needed when computing the appropriate weights for individual observations. The percentage of confirmed non-eligible units as a proportion of the total number of sampled establishments contacted for the survey was 2% (36 out of 1,567 establishments).
Face-to-face [f2f]
The structure of the data base reflects the fact that 3 different versions of the survey instrument were used for all registered establishments. Questionnaires have common questions (Core module) and respectfully additional manufacturing and retail specific questions. The eligible manufacturing industries have been surveyed using the Manufacturing questionnaire (includes the Core module, plus manufacturing specific questions). Retail firms have been interviewed using the Retail questionnaire (includes the Core module plus retail specific questions) and the residual eligible services have been covered using the Core module only. Each variation of the questionnaire is identified by the index variable, a0.
All variables are named using, first, the letter of each section and, second, the number of the variable within the section, i.e. a1 denotes section A, question 1 (some exceptions apply due to comparability reasons). Variable names proceeded by a prefix "KEN" and "A2F" indicate questions specific to some countries in Africa, therefore, they may not be found in the implementation of the rollout in other countries. All other suffixed variables are global and are present in all country surveys over the world. All variables are numeric with the exception of those variables with an "x" at the end of their names. The suffix "x" denotes that the variable is alpha-numeric. In the implementation of the Africa roll out 2011 an experiment was carried in some of the countries to better estimate the effects of the use of show cards in data collection. In some of the sections (i.e. innovation) the enumerators were trained to alternatively implement the section using either show cards or asking only the questions without showing any cards, please see the variable "cards".
Data entry and quality controls are implemented by the contractor and data is delivered to the World Bank in batches (typically 10%, 50% and 100%). These data deliveries are checked for logical consistency, out of range values, skip patterns, and duplicate entries. Problems are flagged by the World Bank and corrected by the implementing contractor through data checks, callbacks, and revisiting establishments.
Survey non-response must be differentiated from item non-response. The former refers to refusals to participate in the survey altogether whereas the latter refers to the refusals to answer some specific questions. Enterprise Surveys suffer from both problems and different strategies were used to address these issues.
Item non-response was addressed by two strategies: a- For sensitive questions that may generate negative reactions from the respondent, such as corruption or tax evasion, enumerators were instructed to collect "Refusal to respond" (-8) as a different option from "Don't know" (-9). b- Establishments with incomplete information were re-contacted in order to complete this information, whenever necessary.
Survey non-response was addressed by maximizing efforts to contact establishments that were initially selected for interview. Attempts were made to contact the establishment for interview at different times/days of the week before a replacement establishment (with similar strata characteristics) was suggested for interview. Survey non-response did occur but substitutions were made in order to potentially achieve strata-specific goals.
The number of contacted establishments per realized interview was 0.41. This number is the result of two factors: explicit refusals to participate in the survey, as reflected by the rate of rejection (which includes rejections of the screener and the main survey) and the quality of the sample frame, as represented by the presence of ineligible units. The number of rejections per contact was 0.20.
This is version 3.1.1.2020f of Met Office Hadley Centre's Integrated Surface Database, HadISD. These data are global sub-daily surface meteorological data that extends HadISD v3.1.0.2019f to include 2020 and so spans 1931-2020. The quality controlled variables in this dataset are: temperature, dewpoint temperature, sea-level pressure, wind speed and direction, cloud data (total, low, mid and high level). Past significant weather and precipitation data are also included, but have not been quality controlled, so their quality and completeness cannot be guaranteed. Quality control flags and data values which have been removed during the quality control process are provided in the qc_flags and flagged_values fields, and ancillary data files show the station listing with a station listing with IDs, names and location information. The data are provided as one NetCDF file per station. Files in the station_data folder station data files have the format "station_code"_HadISD_HadOBS_19310101-20210101_v3-1-1-2020f.nc. The station codes can be found under the docs tab. The station codes file has five columns as follows: 1) station code, 2) station name 3) station latitude 4) station longitude 5) station height. To keep informed about updates, news and announcements follow the HadOBS team on twitter @metofficeHadOBS. For more detailed information e.g bug fixes, routine updates and other exploratory analysis, see the HadISD blog: http://hadisd.blogspot.co.uk/ References: When using the dataset in a paper you must cite the following papers (see Docs for link to the publications) and this dataset (using the "citable as" reference) : Dunn, R. J. H., (2019), HadISD version 3: monthly updates, Hadley Centre Technical Note. Dunn, R. J. H., Willett, K. M., Parker, D. E., and Mitchell, L.: Expanding HadISD: quality-controlled, sub-daily station data from 1931, Geosci. Instrum. Method. Data Syst., 5, 473-491, doi:10.5194/gi-5-473-2016, 2016. Dunn, R. J. H., et al. (2012), HadISD: A Quality Controlled global synoptic report database for selected variables at long-term stations from 1973-2011, Clim. Past, 8, 1649-1679, 2012, doi:10.5194/cp-8-1649-2012 Smith, A., N. Lott, and R. Vose, 2011: The Integrated Surface Database: Recent Developments and Partnerships. Bulletin of the American Meteorological Society, 92, 704–708, doi:10.1175/2011BAMS3015.1 For a homogeneity assessment of HadISD please see this following reference Dunn, R. J. H., K. M. Willett, C. P. Morice, and D. E. Parker. "Pairwise homogeneity assessment of HadISD." Climate of the Past 10, no. 4 (2014): 1501-1522. doi:10.5194/cp-10-1501-2014, 2014.
https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy
The global data entry service market size is poised to experience significant growth, with the market expected to rise from USD 2.5 billion in 2023 to USD 4.8 billion by 2032, achieving a Compound Annual Growth Rate (CAGR) of 7.5% over the forecast period. This growth can be attributed to several factors including the increasing adoption of digital technologies, the rising demand for data accuracy and integrity, and the need for businesses to manage vast amounts of data efficiently.
One of the key growth factors driving the data entry service market is the rapid digital transformation across various industries. As businesses continue to digitize their operations, the volume of data generated has increased exponentially. This data needs to be accurately entered, processed, and managed to derive meaningful insights. The demand for data entry services has surged as companies seek to outsource these non-core activities, enabling them to focus on their primary business operations. Additionally, the widespread adoption of cloud-based solutions and big data analytics has further fueled the demand for efficient data management services.
Another significant driver of market growth is the increasing need for data accuracy and integrity. Inaccurate or incomplete data can lead to poor decision-making, financial losses, and a decrease in operational efficiency. Organizations are increasingly recognizing the importance of maintaining high-quality data and are investing in data entry services to ensure that their databases are accurate, up-to-date, and reliable. This is particularly crucial for industries such as healthcare, BFSI, and retail, where precise data is essential for regulatory compliance, customer relationship management, and operational efficiency.
The cost-effectiveness of outsourcing data entry services is also contributing to market growth. By outsourcing these tasks to specialized service providers, organizations can save on labor costs, reduce operational expenses, and improve productivity. Service providers often have access to advanced tools and technologies, as well as skilled professionals who can perform data entry tasks more efficiently and accurately. This not only leads to cost savings but also allows businesses to reallocate resources to more strategic activities, driving overall growth.
From a regional perspective, the Asia Pacific region is expected to witness the highest growth in the data entry service market during the forecast period. This can be attributed to the region's strong IT infrastructure, the presence of numerous outsourcing service providers, and the growing adoption of digital technologies across various industries. North America and Europe are also significant markets, driven by the high demand for data management services in sectors such as healthcare, BFSI, and retail. The Middle East & Africa and Latin America are anticipated to experience steady growth, supported by increasing investments in digital infrastructure and the rising awareness of the benefits of data entry services.
The data entry service market can be segmented into various service types, including online data entry, offline data entry, data processing, data conversion, data cleansing, and others. Each of these service types plays a crucial role in ensuring the accuracy, integrity, and usability of data. Online data entry services involve entering data directly into an online system or database, which is essential for real-time data management and accessibility. This service type is particularly popular in industries such as e-commerce, where timely and accurate data entry is critical for inventory management and customer service.
Offline data entry services, on the other hand, involve entering data into offline systems or databases, which are later synchronized with online systems. This service type is often used in industries where internet connectivity may be unreliable or where data security is a primary concern. Offline data entry is also essential for processing historical data or data that is collected through physical forms and documents. The demand for offline data entry services is driven by the need for accurate and timely data entry in sectors such as manufacturing, government, and healthcare.
Data processing services involve the manipulation, transformation, and analysis of raw data to produce meaningful information. This includes tasks such as data validation, data sorting, data aggregation, and data analysis. Data processing is a critical componen
These data were automated to provide an accurate high-resolution historical shoreline of Long Island Sound, New York suitable as a geographic information system (GIS) data layer. These data are derived from shoreline maps that were produced by the NOAA National Ocean Service including its predecessor agencies which were based on an office interpretation of imagery and/or field survey. The N...
https://datacatalog1.worldbank.org/public-licenses?fragment=cchttps://datacatalog1.worldbank.org/public-licenses?fragment=cc
National statistical systems are facing significant challenges. These challenges arise from increasing demands for high quality and trustworthy data to guide decision making, coupled with the rapidly changing landscape of the data revolution. To help create a mechanism for learning amongst national statistical systems, the World Bank has developed improved Statistical Performance Indicators (SPI) to monitor the statistical performance of countries. The SPI focuses on five key dimensions of a country’s statistical performance: (i) data use, (ii) data services, (iii) data products, (iv) data sources, and (v) data infrastructure. This will replace the Statistical Capacity Index (SCI) that the World Bank has regularly published since 2004.
The SPI focus on five key pillars of a country’s statistical performance: (i) data use, (ii) data services, (iii) data products, (iv) data sources, and (v) data infrastructure. The SPI are composed of more than 50 indicators and contain data for 186 countries. This set of countries covers 99 percent of the world population. The data extend from 2016-2023, with some indicators going back to 2004.
For more information, consult the academic article published in the journal Scientific Data. https://www.nature.com/articles/s41597-023-01971-0.
https://data.4tu.nl/info/fileadmin/user_upload/Documenten/4TU.ResearchData_Restricted_Data_2022.pdfhttps://data.4tu.nl/info/fileadmin/user_upload/Documenten/4TU.ResearchData_Restricted_Data_2022.pdf
This file contains raw data for cameras and wearables of the ConfLab dataset.
./cameras
contains the overhead video recordings for 9 cameras (cam2-10) in MP4 files.
These cameras cover the whole interaction floor, with camera 2 capturing the
bottom of the scene layout, and camera 10 capturing top of the scene layout.
Note that cam5 ran out of battery before the other cameras and thus the recordings
are cut short. However, cam4 and 6 contain significant overlap with cam 5, to
reconstruct any information needed.
Note that the annotations are made and provided in 2 minute segments.
The annotated portions of the video include the last 3min38sec of x2xxx.MP4
video files, and the first 12 min of x3xxx.MP4 files for cameras (2,4,6,8,10),
with "x" being the placeholder character in the mp4 file names. If one wishes
to separate the video into 2 min segments as we did, the "video-splitting.sh"
script is provided.
./camera-calibration contains the camera instrinsic files obtained from
https://github.com/idiap/multicamera-calibration. Camera extrinsic parameters can
be calculated using the existing intrinsic parameters and the instructions in the
multicamera-calibration repo. The coordinates in the image are provided by the
crosses marked on the floor, which are visible in the video recordings.
The crosses are 1m apart (=100cm).
./wearables
subdirectory includes the IMU, proximity and audio data from each
participant at the Conflab event (48 in total). In the directory numbered
by participant ID, the following data are included:
1. raw audio file
2. proximity (bluetooth) pings (RSSI) file (raw and csv) and a visualization
3. Tri-axial accelerometer data (raw and csv) and a visualization
4. Tri-axial gyroscope data (raw and csv) and a visualization
5. Tri-axial magnetometer data (raw and csv) and a visualization
6. Game rotation vector (raw and csv), recorded in quaternions.
All files are timestamped.
The sampling frequencies are:
- audio: 1250 Hz
- rest: around 50Hz. However, the sample rate is not fixed
and instead the timestamps should be used.
For rotation, the game rotation vector's output frequency is limited by the
actual sampling frequency of the magnetometer. For more information, please refer to
https://invensense.tdk.com/wp-content/uploads/2016/06/DS-000189-ICM-20948-v1.3.pdf
Audio files in this folder are in raw binary form. The following can be used to convert
them to WAV files (1250Hz):
ffmpeg -f s16le -ar 1250 -ac 1 -i /path/to/audio/file
Synchronization of cameras and werables data
Raw videos contain timecode information which matches the timestamps of the data in
the "wearables" folder. The starting timecode of a video can be read as:
ffprobe -hide_banner -show_streams -i /path/to/video
./audio
./sync: contains wav files per each subject
./sync_files: auxiliary csv files used to sync the audio. Can be used to improve the synchronization.
The code used for syncing the audio can be found here:
https://github.com/TUDelft-SPC-Lab/conflab/tree/master/preprocessing/audio
Over the last two observations, the revenue is forecast to significantly increase in all regions. From the selected regions, the ranking by revenue in the data center market is forecast to be led by Central & Western Europe with ***** billion euro. In contrast, the ranking is trailed by Eastern Europe with **** billion euro, recording a difference of ***** billion euro to Central & Western Europe. The Statista Market Insights cover a broad range of additional markets.
All states (including the District of Columbia) provide data to the Centers for Medicare & Medicaid Services (CMS) on a range of Medicaid and Children’s Health Insurance Program (CHIP) eligibility and enrollment metrics. These data reflect state-reported information on Medicaid and CHIP eligibility renewals initiated and scheduled for completion during the reporting period. In addition to reporting the outcomes of renewals at the end of each reporting period, states also provide an update on renewals that were reported pending as of the end of a reporting period. For more information on these data, see Sections II and III of the Eligibility Processing Data Report specifications.
Notes:
Georgia reported data for individuals who continue to be eligible following a change in circumstances and were granted a new 12-month eligibility period during the reporting period, along with data on individuals due for renewal in the month.
North Carolina reports renewal outcomes for only initiated renewals scheduled for completion in the report month, and as such, the data do not reflect renewals that should have been completed in the reporting period that the state was unable to initiate by the end of the report month.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The datasets used in the creation of the predicted Habitat Suitability models includes the CWHR range maps of Californias regularly-occurring vertebrates which were digitized as GIS layers to support the predictions of the CWHR System software. These vector datasets of CWHR range maps are one component of California Wildlife Habitat Relationships (CWHR), a comprehensive information system and predictive model for Californias wildlife. The CWHR System was developed to support habitat conservation and management, land use planning, impact assessment, education, and research involving terrestrial vertebrates in California. CWHR contains information on life history, management status, geographic distribution, and habitat relationships for wildlife species known to occur regularly in California. Range maps represent the maximum, current geographic extent of each species within California. They were originally delineated at a scale of 1:5,000,000 by species-level experts and have gradually been revised at a scale of 1:1,000,000. For more information about CWHR, visit the CWHR webpage (https://www.wildlife.ca.gov/Data/CWHR). The webpage provides links to download CWHR data and user documents such as a look up table of available range maps including species code, species name, and range map revision history; a full set of CWHR GIS data; .pdf files of each range map or species life history accounts; and a User Guide.The models also used the CALFIRE-FRAP compiled "best available" land cover data known as Fveg. This compilation dataset was created as a single data layer, to support the various analyses required for the Forest and Rangeland Assessment, a legislatively mandated function. These data are being updated to support on-going analyses and to prepare for the next FRAP assessment in 2015. An accurate depiction of the spatial distribution of habitat types within California is required for a variety of legislatively-mandated government functions. The California Department of Forestry and Fire Protections CALFIRE Fire and Resource Assessment Program (FRAP), in cooperation with California Department of Fish and Wildlife VegCamp program and extensive use of USDA Forest Service Region 5 Remote Sensing Laboratory (RSL) data, has compiled the "best available" land cover data available for California into a single comprehensive statewide data set. The data span a period from approximately 1990 to 2014. Typically the most current, detailed and consistent data were collected for various regions of the state. Decision rules were developed that controlled which layers were given priority in areas of overlap. Cross-walks were used to compile the various sources into the common classification scheme, the California Wildlife Habitat Relationships (CWHR) system.CWHR range data was used together with the FVEG vegetation maps and CWHR habitat suitability ranks to create Predicted Habitat Suitability maps for species. The Predicted Habitat Suitability maps show the mean habitat suitability score for the species, as defined in CWHR. CWHR defines habitat suitability as NO SUITABILITY (0), LOW (0.33), MEDIUM (0.66), or HIGH (1) for reproduction, cover, and feeding for each species in each habitat stage (habitat type, size, and density combination). The mean is the average of the reproduction, cover, and feeding scores, and can be interpreted as LOW (less than 0.34), MEDIUM (0.34-0.66), and HIGH (greater than 0.66) suitability. Note that habitat suitability ranks were developed based on habitat patch sizes >40 acres in size, and are best interpreted for habitat patches >200 acres in size. The CWHR Predicted Habitat Suitability rasters are named according to the 4 digit alpha-numeric species CWHR ID code. The CWHR Species Lookup Table contains a record for each species including its CWHR ID, scientific name, common name, and range map revision history (available for download at https://www.wildlife.ca.gov/Data/CWHR).
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The study analyzes quantitative micro-level data aggregated to the city-level in urban systems in Europe and the United States. The study demonstrates how urban scaling laws arise from within-city inequality. We show that indicators of interconnectivity, productivity, and innovation have heavy tailed distributions in cities, and that city tails, and their growth with city size, play an important role in the emergence of urban scaling. With agent-based simulation and an analysis of longitudinal micro-level data, we identify a city-size dependent cumulative advantage mechanism behind differences in the tailedness of urban indicators by city size.
The data and code that support the findings of this study are available for download here. We collected the online networking data for Russia and Ukraine through the VKontakte API (https://vk.com/dev/openapi), the data on US patents are from the US Patent and Trademark Office (https://www.patentsview.org) and on research grants from Dimensions (https://www.dimensions.ai). The code for these data collections is available upon request. The Swedish micro-level data come from administrative and tax records and can therefore not be shared; access may be requested from Statistics Sweden (https://scb.se/en/services/guidance-for-researchers-and-universities). Additional information and data may be requested from the authors.
This map symbolizes the relative population counts for the City's 12 Data Divisions, aggregating the tract-level estimates from the the Census Bureau's American Community Survey 2018 five-year samples. Please refer to the map's legend for context to the color shading -- darker hues indicate more population.If you click on each Data Division, you can view other Census demographic information about that Data Division in addition to the population count.About the Census Data:The data comes from the U.S. Census Bureau's American Community Survey's 2014-2018 five-year samples. The American Community Survey (ACS) is an ongoing survey conducted by the federal government that provides vital information annually about America and its population. Information from the survey generates data that help determine how more than $675 billion in federal and state funds are distributed each year.For more information about the Census Bureau's ACS data and process of constructing the survey, visit the ACS's About page.About the City's Data Divisions:As a planning analytic tool, an interdepartmental working group divided Rochester into 12 “data divisions.” These divisions are well-defined and static so they are positioned to be used by the City of Rochester for statistical and planning purposes. Census data is tied to these divisions and serves as the basis for analyses over time. As such, the data divisions are designed to follow census boundaries, while also recognizing natural and human-made boundaries, such as the River, rail lines, and highways. Historical neighborhood boundaries, while informative in the division process, did not drive the boundaries. Data divisions are distinct from the numerous neighborhoods in Rochester. Neighborhood boundaries, like quadrant boundaries, police precincts, and legislative districts often change, which makes statistical analysis challenging when looking at data over time. The data division boundaries, however, are intended to remain unchanged. It is hoped that over time, all City data analysts will adopt the data divisions for the purpose of measuring change over time throughout the city.
This database, compiled by Matthews and Fung (1987), provides information on the distribution and environmental characteristics of natural wetlands. The database was developed to evaluate the role of wetlands in the annual emission of methane from terrestrial sources. The original data consists of five global 1-degree latitude by 1-degree longitude arrays. This subset, for the study area of the Large Scale Biosphere-Atmosphere Experiment in Amazonia (LBA) in South America, retains all five arrays at the 1-degree resolution but only for the area of interest (i.e., longitude 85 deg to 30 deg W, latitude 25 deg S to 10 deg N). The arrays are (1) wetland data source, (2) wetland type, (3) fractional inundation, (4) vegetation type, and (5) soil type. The data subsets are in both ASCII GRID and binary image file formats.The data base is the result of the integration of three independent digital sources: (1) vegetation classified according to the United Nations Educational Scientific and Cultural Organization (UNESCO) system (Matthews, 1983), (2) soil properties from the Food and Agriculture Organization (FAO) soil maps (Zobler, 1986), and (3) fractional inundation in each 1-degree cell compiled from a global map survey of Operational Navigation Charts (ONC). With vegetation, soil, and inundation characteristics of each wetland site identified, the data base has been used for a coherent and systematic estimate of methane emissions from wetlands and for an analysis of the causes for uncertainties in the emission estimate.The complete global data base is available from NASA/GISS [http://www.giss.nasa.gov] and NCAR data set ds765.5 [http://www.ncar.ucar.edu]; the global vegetation types data are available from ORNL DAAC [http://www.daac.ornl.gov].
These data were automated to provide an accurate high-resolution historical shoreline of Wequetequock Cove and Fishers Island Sound, Connecticut and Rhode Island suitable as a geographic information system (GIS) data layer. These data are derived from shoreline maps that were produced by the NOAA National Ocean Service including its predecessor agencies which were based on an office interpre...
These data were compiled to visualize the extent of Lake Powell at various elevation levels. These data represent water surface elevations for Lake Powell at levels critical to the operation of Glen Canyon Dam, at 5 foot intervals from the "Equalization Tier" ("Full Pool") to "Dead Pool", and at maximum and minimum elevations each water year throughout Glen Canyon Dam's operating history. These data were created for Lake Powell in Arizona and Utah. These data were created by the U.S. Geological Survey, Southwest Biological Science Center, Grand Canyon Monitoring & Research Center by reclassifying "Modified topobathymetric elevation data for Lake Powell" (Jones and Root, 2021) at discrete elevation levels and converting them into vector format. These data can be used to visualize locations or resources in Lake Powell at various elevation levels as it continues to change.
Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
License information was derived automatically
These data were used to drive and evaluate Jules Investigation Model (JIM) snow simulations. The data provided are the forcing data used for the "deterministic" runs as described in Winstral et al., 2019. The bias-detecting ensemble (Winstral et al., 2019) used observed snow depths (HS) to detect biases in these deterministic simulations related to precipitation and energy inputs to JIM. Simulations that included the BDE evaluations substantially improved JIM simulations.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This data set includes daily, population-weighted mean values of various heat metrics for every county in the contiguous United States from 2000-2020. The dataset methodology, usage notes, and additional citations are published in Scientific Data (see reference below for Spangler et al. [2022]). Minimum, maximum, and mean ambient temperature, dew-point temperature, humidex, heat index, net effective temperature, wet-bulb globe temperature, and Universal Thermal Climate Index are included. Note that Monroe County, Florida (FIPS: 12087) and Nantucket County, Massachusetts (FIPS 25019) are missing due to unavailability of ERA5-Land data for Key West, Florida and Nantucket, MA. To use these data, assign the data from the .Rds file to a new data frame in R using the readRDS() function. Please cite the use of this data set with the following reference. Note that additional citations for specific variables can be found in Table 2.
K.R. Spangler, S. Liang, and G.A. Wellenius. "Wet-Bulb Globe Temperature, Universal Thermal Climate Index, and Other Heat Metrics for US Counties, 2000-2020." Scientific Data (2022). doi: 10.1038/s41597-022-01405-3
This data set contains modified Copernicus Climate Change Service information (2022), as described and cited in the manuscript referenced above. Neither the European Commission nor ECMWF is responsible for any use that may be made of the Copernicus information or data it contains. This data set is provided “as is” with no warranty of any kind.