100+ datasets found
  1. Summary statistics of cross validation prediction errors applied to...

    • plos.figshare.com
    xls
    Updated Jun 7, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jay Ram Lamichhane; Alfredo Fabi; Roberto Ridolfi; Leonardo Varvaro (2023). Summary statistics of cross validation prediction errors applied to log-transformed data. [Dataset]. http://doi.org/10.1371/journal.pone.0056298.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 7, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Jay Ram Lamichhane; Alfredo Fabi; Roberto Ridolfi; Leonardo Varvaro
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    RMSPE: root mean square prediction error; ASE: average standard error; SMPE: standardized mean prediction error; SRMS: standardized root mean square; Δ: thermal shock.

  2. Medical treatment errors in Germany in 2023, by error type

    • statista.com
    Updated Nov 24, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Medical treatment errors in Germany in 2023, by error type [Dataset]. https://www.statista.com/statistics/582805/distribution-medical-treatment-errors-germany-by-type/
    Explore at:
    Dataset updated
    Nov 24, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    2023
    Area covered
    Germany
    Description

    This statistic shows a distribution of medical treatment errors in Germany in 2023, by type of error. That year, the majority of **** percent of confirmed medical errors had to do with treatment not undertaken, despite the opportunity. At **** percent of all treatment errors, measures undertaken incorrectly were the second-most reason for error in treatment.

  3. Household Survey on Information and Communications Technology, 2014 - West...

    • pcbs.gov.ps
    Updated Jan 28, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Palestinian Central Bureau of statistics (2020). Household Survey on Information and Communications Technology, 2014 - West Bank and Gaza [Dataset]. https://www.pcbs.gov.ps/PCBS-Metadata-en-v5.2/index.php/catalog/465
    Explore at:
    Dataset updated
    Jan 28, 2020
    Dataset provided by
    Palestinian Central Bureau of Statisticshttps://pcbs.gov/
    Authors
    Palestinian Central Bureau of statistics
    Time period covered
    2014
    Area covered
    Gaza Strip, West Bank, Gaza
    Description

    Abstract

    Within the frame of PCBS' efforts in providing official Palestinian statistics in the different life aspects of Palestinian society and because the wide spread of Computer, Internet and Mobile Phone among the Palestinian people, and the important role they may play in spreading knowledge and culture and contribution in formulating the public opinion, PCBS conducted the Household Survey on Information and Communications Technology, 2014.

    The main objective of this survey is to provide statistical data on Information and Communication Technology in the Palestine in addition to providing data on the following: -

    · Prevalence of computers and access to the Internet. · Study the penetration and purpose of Technology use.

    Geographic coverage

    Palestine (West Bank and Gaza Strip) , type of locality (Urban, Rural, Refugee Camps) and governorate

    Analysis unit

    Household. Person 10 years and over .

    Universe

    All Palestinian households and individuals whose usual place of residence in Palestine with focus on persons aged 10 years and over in year 2014.

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    Sampling Frame The sampling frame consists of a list of enumeration areas adopted in the Population, Housing and Establishments Census of 2007. Each enumeration area has an average size of about 124 households. These were used in the first phase as Preliminary Sampling Units in the process of selecting the survey sample.

    Sample Size The total sample size of the survey was 7,268 households, of which 6,000 responded.

    Sample Design The sample is a stratified clustered systematic random sample. The design comprised three phases:

    Phase I: Random sample of 240 enumeration areas. Phase II: Selection of 25 households from each enumeration area selected in phase one using systematic random selection. Phase III: Selection of an individual (10 years or more) in the field from the selected households; KISH TABLES were used to ensure indiscriminate selection.

    Sample Strata Distribution of the sample was stratified by: 1- Governorate (16 governorates, J1). 2- Type of locality (urban, rural and camps).

    Sampling deviation

    -

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    The survey questionnaire consists of identification data, quality controls and three main sections: Section I: Data on household members that include identification fields, the characteristics of household members (demographic and social) such as the relationship of individuals to the head of household, sex, date of birth and age.

    Section II: Household data include information regarding computer processing, access to the Internet, and possession of various media and computer equipment. This section includes information on topics related to the use of computer and Internet, as well as supervision by households of their children (5-17 years old) while using the computer and Internet, and protective measures taken by the household in the home.

    Section III: Data on persons (aged 10 years and over) about computer use, access to the Internet and possession of a mobile phone.

    Cleaning operations

    Preparation of Data Entry Program: This stage included preparation of the data entry programs using an ACCESS package and defining data entry control rules to avoid errors, plus validation inquiries to examine the data after it had been captured electronically.

    Data Entry: The data entry process started on 8 May 2014 and ended on 23 June 2014. The data entry took place at the main PCBS office and in field offices using 28 data clerks.

    Editing and Cleaning procedures: Several measures were taken to avoid non-sampling errors. These included editing of questionnaires before data entry to check field errors, using a data entry application that does not allow mistakes during the process of data entry, and then examining the data by using frequency and cross tables. This ensured that data were error free; cleaning and inspection of the anomalous values were conducted to ensure harmony between the different questions on the questionnaire.

    Response rate

    Response Rates= 79%

    Sampling error estimates

    There are many aspects of the concept of data quality; this includes the initial planning of the survey to the dissemination of the results and how well users understand and use the data. There are three components to the quality of statistics: accuracy, comparability, and quality control procedures.

    Checks on data accuracy cover many aspects of the survey and include statistical errors due to the use of a sample, non-statistical errors resulting from field workers or survey tools, and response rates and their effect on estimations. This section includes:

    Statistical Errors Data of this survey may be affected by statistical errors due to the use of a sample and not a complete enumeration. Therefore, certain differences can be expected in comparison with the real values obtained through censuses. Variances were calculated for the most important indicators.

    Variance calculations revealed that there is no problem in disseminating results nationally or regionally (the West Bank, Gaza Strip), but some indicators show high variance by governorate, as noted in the tables of the main report.

    Non-Statistical Errors Non-statistical errors are possible at all stages of the project, during data collection or processing. These are referred to as non-response errors, response errors, interviewing errors and data entry errors. To avoid errors and reduce their effects, strenuous efforts were made to train the field workers intensively. They were trained on how to carry out the interview, what to discuss and what to avoid, and practical and theoretical training took place during the training course. Training manuals were provided for each section of the questionnaire, along with practical exercises in class and instructions on how to approach respondents to reduce refused cases. Data entry staff were trained on the data entry program, which was tested before starting the data entry process.

    Several measures were taken to avoid non-sampling errors. These included editing of questionnaires before data entry to check field errors, using a data entry application that does not allow mistakes during the process of data entry, and then examining the data by using frequency and cross tables. This ensured that data were error free; cleaning and inspection of the anomalous values were conducted to ensure harmony between the different questions on the questionnaire.

    The sources of non-statistical errors can be summarized as: 1. Some of the households were not at home and could not be interviewed, and some households refused to be interviewed. 2. In unique cases, errors occurred due to the way the questions were asked by interviewers and respondents misunderstood some of the questions.

  4. Child and Working Tax Credits error and fraud statistics 2020 to 2021

    • gov.uk
    Updated Apr 11, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    HM Revenue & Customs (2024). Child and Working Tax Credits error and fraud statistics 2020 to 2021 [Dataset]. https://www.gov.uk/government/statistics/child-and-working-tax-credits-error-and-fraud-statistics-2020-to-2021
    Explore at:
    Dataset updated
    Apr 11, 2024
    Dataset provided by
    GOV.UKhttp://gov.uk/
    Authors
    HM Revenue & Customs
    Description

    HMRC identified a minor historical error in the statistics covering tax year 2020 to 2021 published on 23 June 2022 affecting the final central estimate (£ million) of tax credits error and fraud and number of awards in error and fraud, as a result of incorrect weighting. A corrected version also including data from previously open sample cases was published on 11 April 2024.

    More information on revisions to Official Statistics can be found in the HMRC policy on revisions to official statistics.

    For the tax year 2020 to 2021, the central estimate of the rate of error and fraud favouring the claimant is around 4.7%. This equates to around £710 million paid out incorrectly through error and fraud.

    Media contact:

    HMRC Press Office

    news.desk@hmrc.gov.uk

    Statistical contact:

    benefitsandcredits.analysis@hmrc.gov.uk

    Further details, including data suitability and coverage, are included in the background quality report.

  5. d

    Introduction to Basic Statistics

    • search.dataone.org
    Updated Dec 28, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statistics Canada (2023). Introduction to Basic Statistics [Dataset]. http://doi.org/10.5683/SP3/H3ASWT
    Explore at:
    Dataset updated
    Dec 28, 2023
    Dataset provided by
    Borealis
    Authors
    Statistics Canada
    Description

    A quick refresher course for those who have had statistical training in the past or a fast-paced introduction to basic statistics for beginners. Statistical measures such as percentages, averages, frequency and standard error are used widely. But how are they calculated, and exactly what do they tell us? This one day workshop will help participants develop an appreciation of the potential of statistics and a critical eye of when and how they should or shouldn't be used.

  6. A

    BLM OR CVS Data Errors Table

    • data.amerigeoss.org
    • navigator.blm.gov
    zip
    Updated Jul 28, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    United States (2019). BLM OR CVS Data Errors Table [Dataset]. https://data.amerigeoss.org/sl/dataset/blm-or-cvs-data-errors-table
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jul 28, 2019
    Dataset provided by
    United States
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Description

    CVS_DATAERRORS_TBL:

    The data errors table was where any validation errors or height recalculation utility errors were sent. (Errors in terms of deviations from the established rules of the data collection procedures or project-specific data limitations). Error records corrected were moved to the ErrorHistory table and any remaining records were removed, including warnings, leaving this table blank.

  7. Additional file 5: of A comparative evaluation of hybrid error correction...

    • springernature.figshare.com
    xlsx
    Updated Feb 19, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shuhua Fu; Anqi Wang; Kin Au (2024). Additional file 5: of A comparative evaluation of hybrid error correction methods for error-prone long reads [Dataset]. http://doi.org/10.6084/m9.figshare.7672256.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Feb 19, 2024
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Shuhua Fu; Anqi Wang; Kin Au
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Table S3. Performance statistics on output rate. a. Output rate (%) performance statistics on PacBio data of ten methods using five SR coverages. b. Output rate (%) performance statistics on ONT data of ten methods using five SR coverages. (XLSX 19 kb)

  8. 3d printing errors

    • kaggle.com
    Updated Feb 20, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NilsHagenBeyer (2024). 3d printing errors [Dataset]. https://www.kaggle.com/datasets/nimbus200/3d-printing-errors
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 20, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    NilsHagenBeyer
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    This dataset contains images of 3d printed parts recorded while printing.

    The dataset contains 4 classes and 34 shapes:

    classGOODSRINGINGUNDEREXTRUSIONSPAGHETTI
    images506927982962134

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F5666725%2F92b8fca57767fa55ae4e42d3972b2522%2F1.PNG?generation=1708440162571728&alt=media" alt=""> https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F5666725%2Fc36caa40d8d565bafa02d9f97112a777%2F2.PNG?generation=1708440216287321&alt=media" alt=""> https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F5666725%2F3ddeb2380e1106e9d482f3e6940235d3%2F3.PNG?generation=1708440227278455&alt=media" alt="">

    Labels and methadata:

    imageimage file name
    class0: Good, 1: Under-Extrusion, 2: Stringing, 4: Spaghetti
    layerlayer of completion of the printed part
    ex_mulglobal extrusion multiplier during print
    shapeidentifier of the printed geometry (1-34)
    recordingdatetime coded name of the print/recording
    printbed_colorcolor of the printbed (black, silver)

    Recording Process

    The dataset was recorded in the context of this work: https://github.com/NilsHagenBeyer/FDM_error_detection

    The Images were recorded with ELP-USB13MAFKV76 digital autofocus camera with the Sony IMX214 sensor chip, which has a resolution of 3264x2448, which were later downscaled to 256x256px. All Prints were carried out on a customized Creality Ender-3 Pro 3D.

    The Images were mainly recorded with a black printbed from camera position 1. For testing purposes the dataset contains also few images from camera postition 2 (oblique camera) with a black printbed (significant motion blurr) and camera postition 1 with a silver printbed. The positions can be seen in the image below.

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F5666725%2F253a5f4c3d83233ddbc943fc1f8273e0%2Fexp_setup.png?generation=1721130817484111&alt=media" alt="">

    Folder Structure

    ├── general data

     └── all_images_no_filter.csv      # Full Dataset, unfiltered
    
     └── all_images.csv         # Full Dataset, no spaghetti error
    
     └── black_bed_all.csv       # Full Dataset, no silver bed
    

    ├── images

     └── all_images
     |   └── ...         # All Images: Full Dataset + Silver Bed + Oblique Camera
     |
     └── test_images_silver265
     |   └── ...         # Silver bed test images
     |
     └── test_images_oblique256
        └── ...         # Oblique camera test images
    
  9. l

    Full descriptive statistics and error rates for a non-invasive 23-strong...

    • datastore.landcareresearch.co.nz
    Updated Aug 25, 2018
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2018). Full descriptive statistics and error rates for a non-invasive 23-strong marker panel in kiwi - Dataset - DataStore [Dataset]. https://datastore.landcareresearch.co.nz/en_NZ/dataset/whited-kiwimsats
    Explore at:
    Dataset updated
    Aug 25, 2018
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset includes descriptive statistics and error rates for all 23 microsatellite markers that have been developed to work from non-invasive samples of all five currently recognised species of kiwi. Data associated with paper: Ana Ramón-Laca, Daniel J. White, Jason T. Weir, Hugh A. Robertson. Extraction of DNA from faeces and moulted feathers provides a novel tool for conservation management of New Zealand kiwi (Apteryx spp.) 2018 Mar; 8(6): 3119–3130. Ecology and Evolution.

  10. d

    Data from: Error-Level-Controlled Synthetic Forecasts for Renewable...

    • catalog.data.gov
    • data.openei.org
    • +2more
    Updated Nov 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Renewable Energy Laboratory (NREL) (2023). Error-Level-Controlled Synthetic Forecasts for Renewable Generation [Dataset]. https://catalog.data.gov/dataset/error-level-controlled-synthetic-forecasts-for-renewable-generation
    Explore at:
    Dataset updated
    Nov 30, 2023
    Dataset provided by
    National Renewable Energy Laboratory (NREL)
    Description

    Renewable energy resources, including solar and wind energy, play a significant role in sustainable energy systems. However, the inherent uncertainty and intermittency of renewable generation pose challenges to the safe and efficient operation of power systems. Recognizing the importance of short-term (hours ahead) renewable generation forecasting in power systems operation, it becomes crucial to address the potential inaccuracies in these forecasts. To systematically evaluate the performance of controllers in the presence of imperfect forecasts, we generate synthetic forecasts using actual renewable generation profiles (one from solar and one from wind). These synthetic forecasts incorporate different levels of statistical error, allowing us to control and manipulate the accuracy of the predictions. The primary objective is to employ synthetic forecasts with controlled yet realistic error levels to systematically investigate how controllers adapt to variations in forecast accuracy, providing valuable insights into their robustness and effectiveness under real-world conditions.

  11. f

    Data_Sheet_1.docx

    • datasetcatalog.nlm.nih.gov
    Updated Mar 22, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sterr, Annette; Furlan, Leonardo (2018). Data_Sheet_1.docx [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000727136
    Explore at:
    Dataset updated
    Mar 22, 2018
    Authors
    Sterr, Annette; Furlan, Leonardo
    Description

    Motor learning studies face the challenge of differentiating between real changes in performance and random measurement error. While the traditional p-value-based analyses of difference (e.g., t-tests, ANOVAs) provide information on the statistical significance of a reported change in performance scores, they do not inform as to the likely cause or origin of that change, that is, the contribution of both real modifications in performance and random measurement error to the reported change. One way of differentiating between real change and random measurement error is through the utilization of the statistics of standard error of measurement (SEM) and minimal detectable change (MDC). SEM is estimated from the standard deviation of a sample of scores at baseline and a test–retest reliability index of the measurement instrument or test employed. MDC, in turn, is estimated from SEM and a degree of confidence, usually 95%. The MDC value might be regarded as the minimum amount of change that needs to be observed for it to be considered a real change, or a change to which the contribution of real modifications in performance is likely to be greater than that of random measurement error. A computer-based motor task was designed to illustrate the applicability of SEM and MDC to motor learning research. Two studies were conducted with healthy participants. Study 1 assessed the test–retest reliability of the task and Study 2 consisted in a typical motor learning study, where participants practiced the task for five consecutive days. In Study 2, the data were analyzed with a traditional p-value-based analysis of difference (ANOVA) and also with SEM and MDC. The findings showed good test–retest reliability for the task and that the p-value-based analysis alone identified statistically significant improvements in performance over time even when the observed changes could in fact have been smaller than the MDC and thereby caused mostly by random measurement error, as opposed to by learning. We suggest therefore that motor learning studies could complement their p-value-based analyses of difference with statistics such as SEM and MDC in order to inform as to the likely cause or origin of any reported changes in performance.

  12. Sampling errors by variable class and main activity (API identifier: 53129)

    • datos.gob.es
    • data.europa.eu
    Updated Jun 9, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Instituto Nacional de Estadística (2022). Sampling errors by variable class and main activity (API identifier: 53129) [Dataset]. https://datos.gob.es/en/catalogo/ea0010587-errores-de-muestreo-por-actividad-principal-y-clase-de-variable-identificador-api-53129
    Explore at:
    Dataset updated
    Jun 9, 2022
    Dataset provided by
    National Statistics Institutehttp://www.ine.es/
    Authors
    Instituto Nacional de Estadística
    License

    https://www.ine.es/aviso_legalhttps://www.ine.es/aviso_legal

    Description

    Table of INEBase Sampling errors by variable class and main activity. National. Statistics on Products in the Services Sector

  13. Comparison of asymptotic and exact p-values.

    • figshare.com
    • datasetcatalog.nlm.nih.gov
    • +1more
    xls
    Updated Jun 1, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zhiming Li; Changxing Ma; Mingyao Ai (2023). Comparison of asymptotic and exact p-values. [Dataset]. http://doi.org/10.1371/journal.pone.0242722.t005
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 1, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Zhiming Li; Changxing Ma; Mingyao Ai
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Comparison of asymptotic and exact p-values.

  14. Data from: Reference Measurements of Error Vector Magnitude

    • nist.gov
    • data.nist.gov
    • +2more
    Updated Feb 18, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institute of Standards and Technology (2022). Reference Measurements of Error Vector Magnitude [Dataset]. http://doi.org/10.18434/mds2-2563
    Explore at:
    Dataset updated
    Feb 18, 2022
    Dataset provided by
    National Institute of Standards and Technologyhttp://www.nist.gov/
    License

    https://www.nist.gov/open/licensehttps://www.nist.gov/open/license

    Description

    The experiment here was to demonstrate that we can reliably measure the Reference Waveforms designed in the IEEE P1765 proposed standard and calculate EVM along with the associated uncertainties. The measurements were performed using NIST's calibrated sampling oscilloscope and were traceable to the primary standards. We have uploaded the following two datasets. (1) Table 3 contains the EVM values (in %) for the Reference Waveforms 1--7 after performing the uncertainty analyses. The Monte Carlo means are also compared with the ideal values from the calculations in the IEEE P1765 standard. (2) Figure 3 shows the complete EVM distribution upon performing uncertainty analysis for Reference Waveform 3 as an example. Each of the entries in Table 3 is associated with an EVM distribution similar to that shown in Fig. 3.

  15. Expenditure and Consumption Survey, 2004 - West Bank and Gaza

    • catalog.ihsn.org
    Updated Mar 29, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Palestinian Central Bureau of Statistics (2019). Expenditure and Consumption Survey, 2004 - West Bank and Gaza [Dataset]. https://catalog.ihsn.org/index.php/catalog/3085
    Explore at:
    Dataset updated
    Mar 29, 2019
    Dataset authored and provided by
    Palestinian Central Bureau of Statisticshttps://pcbs.gov/
    Time period covered
    2004 - 2005
    Area covered
    Gaza Strip, West Bank, Gaza
    Description

    Abstract

    The basic goal of this survey is to provide the necessary database for formulating national policies at various levels. It represents the contribution of the household sector to the Gross National Product (GNP). Household Surveys help as well in determining the incidence of poverty, and providing weighted data which reflects the relative importance of the consumption items to be employed in determining the benchmark for rates and prices of items and services. Generally, the Household Expenditure and Consumption Survey is a fundamental cornerstone in the process of studying the nutritional status in the Palestinian territory.

    The raw survey data provided by the Statistical Office was cleaned and harmonized by the Economic Research Forum, in the context of a major research project to develop and expand knowledge on equity and inequality in the Arab region. The main focus of the project is to measure the magnitude and direction of change in inequality and to understand the complex contributing social, political and economic forces influencing its levels. However, the measurement and analysis of the magnitude and direction of change in this inequality cannot be consistently carried out without harmonized and comparable micro-level data on income and expenditures. Therefore, one important component of this research project is securing and harmonizing household surveys from as many countries in the region as possible, adhering to international statistics on household living standards distribution. Once the dataset has been compiled, the Economic Research Forum makes it available, subject to confidentiality agreements, to all researchers and institutions concerned with data collection and issues of inequality. Data is a public good, in the interest of the region, and it is consistent with the Economic Research Forum's mandate to make micro data available, aiding regional research on this important topic.

    Geographic coverage

    The survey data covers urban, rural and camp areas in West Bank and Gaza Strip.

    Analysis unit

    1- Household/families. 2- Individuals.

    Universe

    The survey covered all the Palestinian households who are a usual residence in the Palestinian Territory.

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    Sample and Frame:

    The sampling frame consists of all enumeration areas which were enumerated in 1997; the enumeration area consists of buildings and housing units and is composed of an average of 120 households. The enumeration areas were used as Primary Sampling Units (PSUs) in the first stage of the sampling selection. The enumeration areas of the master sample were updated in 2003.

    Sample Design:

    The sample is a stratified cluster systematic random sample with two stages: First stage: selection of a systematic random sample of 299 enumeration areas. Second stage: selection of a systematic random sample of 12-18 households from each enumeration area selected in the first stage. A person (18 years and more) was selected from each household in the second stage.

    Sample strata:

    The population was divided by: 1- Governorate 2- Type of Locality (urban, rural, refugee camps)

    Sample Size:

    The calculated sample size is 3,781 households.

    Target cluster size:

    The target cluster size or "sample-take" is the average number of households to be selected per PSU. In this survey, the sample take is around 12 households.

    Detailed information/formulas on the sampling design are available in the user manual.

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    The PECS questionnaire consists of two main sections:

    First section: Certain articles / provisions of the form filled at the beginning of the month,and the remainder filled out at the end of the month. The questionnaire includes the following provisions:

    Cover sheet: It contains detailed and particulars of the family, date of visit, particular of the field/office work team, number/sex of the family members.

    Statement of the family members: Contains social, economic and demographic particulars of the selected family.

    Statement of the long-lasting commodities and income generation activities: Includes a number of basic and indispensable items (i.e, Livestock, or agricultural lands).

    Housing Characteristics: Includes information and data pertaining to the housing conditions, including type of shelter, number of rooms, ownership, rent, water, electricity supply, connection to the sewer system, source of cooking and heating fuel, and remoteness/proximity of the house to education and health facilities.

    Monthly and Annual Income: Data pertaining to the income of the family is collected from different sources at the end of the registration / recording period.

    Second section: The second section of the questionnaire includes a list of 54 consumption and expenditure groups itemized and serially numbered according to its importance to the family. Each of these groups contains important commodities. The number of commodities items in each for all groups stood at 667 commodities and services items. Groups 1-21 include food, drink, and cigarettes. Group 22 includes homemade commodities. Groups 23-45 include all items except for food, drink and cigarettes. Groups 50-54 include all of the long-lasting commodities. Data on each of these groups was collected over different intervals of time so as to reflect expenditure over a period of one full year.

    Cleaning operations

    Raw Data

    Both data entry and tabulation were performed using the ACCESS and SPSS software programs. The data entry process was organized in 6 files, corresponding to the main parts of the questionnaire. A data entry template was designed to reflect an exact image of the questionnaire, and included various electronic checks: logical check, range checks, consistency checks and cross-validation. Complete manual inspection was made of results after data entry was performed, and questionnaires containing field-related errors were sent back to the field for corrections.

    Harmonized Data

    • The Statistical Package for Social Science (SPSS) is used to clean and harmonize the datasets.
    • The harmonization process starts with cleaning all raw data files received from the Statistical Office.
    • Cleaned data files are then all merged to produce one data file on the individual level containing all variables subject to harmonization.
    • A country-specific program is generated for each dataset to generate/compute/recode/rename/format/label harmonized variables.
    • A post-harmonization cleaning process is run on the data.
    • Harmonized data is saved on the household as well as the individual level, in SPSS and converted to STATA format.

    Response rate

    The survey sample consists of about 3,781 households interviewed over a twelve-month period between January 2004 and January 2005. There were 3,098 households that completed the interview, of which 2,060 were in the West Bank and 1,038 households were in GazaStrip. The response rate was 82% in the Palestinian Territory.

    Sampling error estimates

    The calculations of standard errors for the main survey estimations enable the user to identify the accuracy of estimations and the survey reliability. Total errors of the survey can be divided into two kinds: statistical errors, and non-statistical errors. Non-statistical errors are related to the procedures of statistical work at different stages, such as the failure to explain questions in the questionnaire, unwillingness or inability to provide correct responses, bad statistical coverage, etc. These errors depend on the nature of the work, training, supervision, and conducting all various related activities. The work team spared no effort at different stages to minimize non-statistical errors; however, it is difficult to estimate numerically such errors due to absence of technical computation methods based on theoretical principles to tackle them. On the other hand, statistical errors can be measured. Frequently they are measured by the standard error, which is the positive square root of the variance. The variance of this survey has been computed by using the “programming package” CENVAR.

  16. h

    Data from: Table 4

    • hepdata.net
    Updated May 4, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2019). Table 4 [Dataset]. http://doi.org/10.17182/hepdata.66629.v2/t4
    Explore at:
    Dataset updated
    May 4, 2019
    Description

    Fiducial $W+b$-jets cross-section correlations of statistical errors in the 1-jet region in bins of $p_T^{b-jet}$.

  17. Replication data for: Misclassification Errors and the Underestimation of...

    • search.gesis.org
    • openicpsr.org
    Updated Feb 26, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GESIS search (2020). Replication data for: Misclassification Errors and the Underestimation of the US Unemployment Rate [Dataset]. http://doi.org/10.3886/E112598V1
    Explore at:
    Dataset updated
    Feb 26, 2020
    Dataset provided by
    Inter-university Consortium for Political and Social Researchhttps://www.icpsr.umich.edu/web/pages/
    GESIS search
    License

    https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de699814https://search.gesis.org/research_data/datasearch-httpwww-da-ra-deoaip--oaioai-da-ra-de699814

    Description

    Abstract (en): Using recent results in the measurement error literature, we show that the official US unemployment rate substantially underestimates the true level of unemployment, due to misclassification errors in the labor force status in the Current Population Survey. During the period from January 1996 to August 2011, the corrected monthly unemployment rates are between 1 and 4.4 percentage points (2.1 percentage points on average) higher than the official rates, and are more sensitive to changes in business cycles. The labor force participation rates, however, are not affected by this correction.

  18. Rates of classification errors of the methods on the ORL database (%).

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated Jun 4, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Qi Zhu; Zhengming Li; Jinxing Liu; Zizhu Fan; Lei Yu; Yan Chen (2023). Rates of classification errors of the methods on the ORL database (%). [Dataset]. http://doi.org/10.1371/journal.pone.0070370.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 4, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Qi Zhu; Zhengming Li; Jinxing Liu; Zizhu Fan; Lei Yu; Yan Chen
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Rates of classification errors of the methods on the ORL database (%).

  19. s

    Amount of errors leading to a goal by Ederson in the Premier League...

    • statista.com
    Updated Jul 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Amount of errors leading to a goal by Ederson in the Premier League 2017-2023 [Dataset]. https://www.statista.com/statistics/1434158/amount-of-errors-leading-to-a-goal-ederson-premier-league/
    Explore at:
    Dataset updated
    Jul 10, 2025
    Dataset authored and provided by
    Statista
    Area covered
    United Kingdom
    Description

    Amidst the season 2022/23, Ederson made one error that lead to a goal for the opposing team. His worst season however was 2019/20, where he unfortunately made ***** errors that allowed the opposing team to score. Find further Premier League statistics regarding the amount of errors leading to a goal for players like Petr Cech, Ashley Williams, and Alex Mccarthy.

  20. n

    Data from: Correction of location errors for presence-only species...

    • data.niaid.nih.gov
    • data-staging.niaid.nih.gov
    • +1more
    zip
    Updated Nov 7, 2014
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Trevor J. Hefley; David M. Baasch; Andrew J. Tyre; Erin E. Blankenship (2014). Correction of location errors for presence-only species distribution models [Dataset]. http://doi.org/10.5061/dryad.h81s5
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 7, 2014
    Dataset provided by
    University of Nebraska–Lincoln
    Authors
    Trevor J. Hefley; David M. Baasch; Andrew J. Tyre; Erin E. Blankenship
    License

    https://spdx.org/licenses/CC0-1.0.htmlhttps://spdx.org/licenses/CC0-1.0.html

    Area covered
    USA, Nebraska
    Description
    1. Species distribution models (SDMs) for presence-only data depend on accurate and precise measurements of geographic and environmental covariates that influence presence and abundance of the species. Some data sets, however, may contain both systematic and random errors in the recorded location of the species. Environmental covariates at the recorded location may differ from those at the true location and result in biased parameter estimates and predictions from SDMs. 2. Regression calibration is a well-developed statistical method that can be used to correct the bias in estimated coefficients and predictions from SDMs when the recorded geographic location differ from the true location for some, but not all locations. We expand the application of regression calibration methods to SDMs and provide illustrative examples using simulated data and opportunistic records of whooping cranes (Grus americana). 3. We found we were able to successfully correct the bias in our SDM parameters estimated from simulated data and opportunistic records of whooping cranes using regression calibration. 4. When modeling species distributions with data that have geographic location errors, we recommend researchers consider the effect of location errors. Correcting for location errors requires that at least a portion of the data have locations recorded without error. Bias correction can result in an increase in variance; this increase in variance should be considered when evaluating the utility of bias correction.
Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Jay Ram Lamichhane; Alfredo Fabi; Roberto Ridolfi; Leonardo Varvaro (2023). Summary statistics of cross validation prediction errors applied to log-transformed data. [Dataset]. http://doi.org/10.1371/journal.pone.0056298.t002
Organization logo

Summary statistics of cross validation prediction errors applied to log-transformed data.

Related Article
Explore at:
xlsAvailable download formats
Dataset updated
Jun 7, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Jay Ram Lamichhane; Alfredo Fabi; Roberto Ridolfi; Leonardo Varvaro
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

RMSPE: root mean square prediction error; ASE: average standard error; SMPE: standardized mean prediction error; SRMS: standardized root mean square; Δ: thermal shock.

Search
Clear search
Close search
Google apps
Main menu