30 datasets found
  1. i

    Household Health Survey 2012-2013, Economic Research Forum (ERF)...

    • catalog.ihsn.org
    • datacatalog.ihsn.org
    Updated Jun 26, 2017
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Central Statistical Organization (CSO) (2017). Household Health Survey 2012-2013, Economic Research Forum (ERF) Harmonization Data - Iraq [Dataset]. https://catalog.ihsn.org/index.php/catalog/6937
    Explore at:
    Dataset updated
    Jun 26, 2017
    Dataset provided by
    Economic Research Forum
    Kurdistan Regional Statistics Office (KRSO)
    Central Statistical Organization (CSO)
    Time period covered
    2012 - 2013
    Area covered
    Iraq
    Description

    Abstract

    The harmonized data set on health, created and published by the ERF, is a subset of Iraq Household Socio Economic Survey (IHSES) 2012. It was derived from the household, individual and health modules, collected in the context of the above mentioned survey. The sample was then used to create a harmonized health survey, comparable with the Iraq Household Socio Economic Survey (IHSES) 2007 micro data set.

    ----> Overview of the Iraq Household Socio Economic Survey (IHSES) 2012:

    Iraq is considered a leader in household expenditure and income surveys where the first was conducted in 1946 followed by surveys in 1954 and 1961. After the establishment of Central Statistical Organization, household expenditure and income surveys were carried out every 3-5 years in (1971/ 1972, 1976, 1979, 1984/ 1985, 1988, 1993, 2002 / 2007). Implementing the cooperation between CSO and WB, Central Statistical Organization (CSO) and Kurdistan Region Statistics Office (KRSO) launched fieldwork on IHSES on 1/1/2012. The survey was carried out over a full year covering all governorates including those in Kurdistan Region.

    The survey has six main objectives. These objectives are:

    1. Provide data for poverty analysis and measurement and monitor, evaluate and update the implementation Poverty Reduction National Strategy issued in 2009.
    2. Provide comprehensive data system to assess household social and economic conditions and prepare the indicators related to the human development.
    3. Provide data that meet the needs and requirements of national accounts.
    4. Provide detailed indicators on consumption expenditure that serve making decision related to production, consumption, export and import.
    5. Provide detailed indicators on the sources of households and individuals income.
    6. Provide data necessary for formulation of a new consumer price index number.

    The raw survey data provided by the Statistical Office were then harmonized by the Economic Research Forum, to create a comparable version with the 2006/2007 Household Socio Economic Survey in Iraq. Harmonization at this stage only included unifying variables' names, labels and some definitions. See: Iraq 2007 & 2012- Variables Mapping & Availability Matrix.pdf provided in the external resources for further information on the mapping of the original variables on the harmonized ones, in addition to more indications on the variables' availability in both survey years and relevant comments.

    Geographic coverage

    National coverage: Covering a sample of urban, rural and metropolitan areas in all the governorates including those in Kurdistan Region.

    Analysis unit

    1- Household/family. 2- Individual/person.

    Universe

    The survey was carried out over a full year covering all governorates including those in Kurdistan Region.

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    ----> Design:

    Sample size was (25488) household for the whole Iraq, 216 households for each district of 118 districts, 2832 clusters each of which includes 9 households distributed on districts and governorates for rural and urban.

    ----> Sample frame:

    Listing and numbering results of 2009-2010 Population and Housing Survey were adopted in all the governorates including Kurdistan Region as a frame to select households, the sample was selected in two stages: Stage 1: Primary sampling unit (blocks) within each stratum (district) for urban and rural were systematically selected with probability proportional to size to reach 2832 units (cluster). Stage two: 9 households from each primary sampling unit were selected to create a cluster, thus the sample size of total survey clusters was 25488 households distributed on the governorates, 216 households in each district.

    ----> Sampling Stages:

    In each district, the sample was selected in two stages: Stage 1: based on 2010 listing and numbering frame 24 sample points were selected within each stratum through systematic sampling with probability proportional to size, in addition to the implicit breakdown urban and rural and geographic breakdown (sub-district, quarter, street, county, village and block). Stage 2: Using households as secondary sampling units, 9 households were selected from each sample point using systematic equal probability sampling. Sampling frames of each stages can be developed based on 2010 building listing and numbering without updating household lists. In some small districts, random selection processes of primary sampling may lead to select less than 24 units therefore a sampling unit is selected more than once , the selection may reach two cluster or more from the same enumeration unit when it is necessary.

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    ----> Preparation:

    The questionnaire of 2006 survey was adopted in designing the questionnaire of 2012 survey on which many revisions were made. Two rounds of pre-test were carried out. Revision were made based on the feedback of field work team, World Bank consultants and others, other revisions were made before final version was implemented in a pilot survey in September 2011. After the pilot survey implemented, other revisions were made in based on the challenges and feedbacks emerged during the implementation to implement the final version in the actual survey.

    ----> Questionnaire Parts:

    The questionnaire consists of four parts each with several sections: Part 1: Socio – Economic Data: - Section 1: Household Roster - Section 2: Emigration - Section 3: Food Rations - Section 4: housing - Section 5: education - Section 6: health - Section 7: Physical measurements - Section 8: job seeking and previous job

    Part 2: Monthly, Quarterly and Annual Expenditures: - Section 9: Expenditures on Non – Food Commodities and Services (past 30 days). - Section 10 : Expenditures on Non – Food Commodities and Services (past 90 days). - Section 11: Expenditures on Non – Food Commodities and Services (past 12 months). - Section 12: Expenditures on Non-food Frequent Food Stuff and Commodities (7 days). - Section 12, Table 1: Meals Had Within the Residential Unit. - Section 12, table 2: Number of Persons Participate in the Meals within Household Expenditure Other Than its Members.

    Part 3: Income and Other Data: - Section 13: Job - Section 14: paid jobs - Section 15: Agriculture, forestry and fishing - Section 16: Household non – agricultural projects - Section 17: Income from ownership and transfers - Section 18: Durable goods - Section 19: Loans, advances and subsidies - Section 20: Shocks and strategy of dealing in the households - Section 21: Time use - Section 22: Justice - Section 23: Satisfaction in life - Section 24: Food consumption during past 7 days

    Part 4: Diary of Daily Expenditures: Diary of expenditure is an essential component of this survey. It is left at the household to record all the daily purchases such as expenditures on food and frequent non-food items such as gasoline, newspapers…etc. during 7 days. Two pages were allocated for recording the expenditures of each day, thus the roster will be consists of 14 pages.

    Cleaning operations

    ----> Raw Data:

    Data Editing and Processing: To ensure accuracy and consistency, the data were edited at the following stages: 1. Interviewer: Checks all answers on the household questionnaire, confirming that they are clear and correct. 2. Local Supervisor: Checks to make sure that questions has been correctly completed. 3. Statistical analysis: After exporting data files from excel to SPSS, the Statistical Analysis Unit uses program commands to identify irregular or non-logical values in addition to auditing some variables. 4. World Bank consultants in coordination with the CSO data management team: the World Bank technical consultants use additional programs in SPSS and STAT to examine and correct remaining inconsistencies within the data files. The software detects errors by analyzing questionnaire items according to the expected parameter for each variable.

    ----> Harmonized Data:

    • The SPSS package is used to harmonize the Iraq Household Socio Economic Survey (IHSES) 2007 with Iraq Household Socio Economic Survey (IHSES) 2012.
    • The harmonization process starts with raw data files received from the Statistical Office.
    • A program is generated for each dataset to create harmonized variables.
    • Data is saved on the household and individual level, in SPSS and then converted to STATA, to be disseminated.

    Response rate

    Iraq Household Socio Economic Survey (IHSES) reached a total of 25488 households. Number of households refused to response was 305, response rate was 98.6%. The highest interview rates were in Ninevah and Muthanna (100%) while the lowest rates were in Sulaimaniya (92%).

  2. National Survey on Population and Employment, ENPE 2012 - Tunisia

    • erfdataportal.com
    Updated Apr 11, 2017
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    National Institute of Statistics - Tunisia (2017). National Survey on Population and Employment, ENPE 2012 - Tunisia [Dataset]. http://www.erfdataportal.com/index.php/catalog/123
    Explore at:
    Dataset updated
    Apr 11, 2017
    Dataset provided by
    National institute of statisticshttp://www.ins.tn/en/
    Economic Research Forum
    Time period covered
    2012
    Area covered
    Tunisia
    Description

    Abstract

    THE CLEANED AND HARMONIZED VERSION OF THE SURVEY DATA PRODUCED AND PUBLISHED BY THE ECONOMIC RESEARCH FORUM REPRESENTS 100% OF THE ORIGINAL SURVEY DATA COLLECTED BY THE NATIONAL INSTITUTE OF STATISTICS (INS) - TUNISIA

    The survey aims at estimating the demographic and educational characteristics of the population. It also calculates the economic indicators of the population such as the number of active individuals, the additional demand for jobs, the number of employed and their characteristics, the number of jobs created, the characteristics of the unemployed and the unemployment rate. Furthermore, this survey estimates these indicators on the household level and their living conditions.

    The results of this survey were compared with the results of the second quarter of the national survey on population and employment 2011. It should also be noted that the National Institute of Statistics -Tunisia uses the unemployment definition and concepts adopted by the International Labour Organization. This definition implies that, the individual did not work during the week preceding the day of the interview, was looking for a job in the month preceding the date of the interview, is available to work within two weeks after the day of the interview.

    In 2010, the National Institute of Statistics has adopted a strict ILO definition for unemployment, by conditioning that the person must perform effective approaches to search for a job in the month preceding the day of the interview.

    Geographic coverage

    Covering a representative sample at the national and regional level (governorates).

    Analysis unit

    1- Household/family. 2- Individual/person.

    Universe

    The survey covered a national sample of households and all individuals permanently residing in surveyed households.

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    THE CLEANED AND HARMONIZED VERSION OF THE SURVEY DATA PRODUCED AND PUBLISHED BY THE ECONOMIC RESEARCH FORUM REPRESENTS 100% OF THE ORIGINAL SURVEY DATA COLLECTED BY THE NATIONAL INSTITUTE OF STATISTICS - TUNISIA (INS)

    The sample is drawn from the frame of the 2004 General Census of Population and Housing.

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    Three modules were designed for data collection:

    • Household Questionnaire (Module 1): Includes questions regarding household characteristics, living conditions, individuals and their demographic, educational and economic characteristics. This module also provides information on internal and external migration.

    • Active Employed Questionnaire (Module 2): Includes questions regarding the characteristics of the employed individuals as occupation, industry and wages for employees.

    • Active Unemployed Questionnaire (Module 3): Includes questions regarding the characteristics of the unemployed as unemployment duration, the last occupation, activity, and the number of days worked during the last year...etc.

    Cleaning operations

    Harmonized Data

    • SPSS software is used to clean and harmonize the datasets.
    • The harmonization process starts with cleaning all raw data files received from the Statistical Agency.
    • Cleaned data files are then all merged to produce one data file on the individual level containing all variables subject to harmonization.
    • A country-specific program is generated for each dataset to generate/ compute/ recode/ rename/ format/ label harmonized variables.
    • A post-harmonization cleaning process is then conducted on the data.
    • Harmonized data is saved on the household as well as the individual level, in SPSS and converted to STATA format.
  3. Food Security in the United States

    • agdatacommons.nal.usda.gov
    zip
    Updated Nov 22, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    US Department of Agriculture, Economic Research Service (2025). Food Security in the United States [Dataset]. http://doi.org/10.15482/USDA.ADC/1294355
    Explore at:
    zipAvailable download formats
    Dataset updated
    Nov 22, 2025
    Dataset provided by
    Economic Research Servicehttp://www.ers.usda.gov/
    United States Department of Agriculturehttp://usda.gov/
    Authors
    US Department of Agriculture, Economic Research Service
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Area covered
    United States
    Description

    The Current Population Survey Food Security Supplement (CPS-FSS) is the source of national and State-level statistics on food insecurity used in USDA's annual reports on household food security. The CPS is a monthly labor force survey of about 50,000 households conducted by the Census Bureau for the Bureau of Labor Statistics. Once each year, after answering the labor force questions, the same households are asked a series of questions (the Food Security Supplement) about food security, food expenditures, and use of food and nutrition assistance programs. Food security data have been collected by the CPS-FSS each year since 1995. Four data sets that complement those available from the Census Bureau are available for download on the ERS website. These are available as ASCII uncompressed or zipped files. The purpose and appropriate use of these additional data files are described below: 1) CPS 1995 Revised Food Security Status data--This file provides household food security scores and food security status categories that are consistent with procedures and variable naming conventions introduced in 1996. This includes the "common screen" variables to facilitate comparisons of prevalence rates across years. This file must be matched to the 1995 CPS Food Security Supplement public-use data file. 2) CPS 1998 Children's and 30-day Food Security data--Subsequent to the release of the April 1999 CPS-FSS public-use data file, USDA developed two additional food security scales to describe aspects of food security conditions in interviewed households not captured by the 12-month household food security scale. This file provides three food security variables (categorical, raw score, and scale score) for each of these scales along with household identification variables to allow the user to match this supplementary data file to the CPS-FSS April 1998 data file. 3) CPS 1999 Children's and 30-day Food Security data--Subsequent to the release of the April 1999 CPS-FSS public-use data file, USDA developed two additional food security scales to describe aspects of food security conditions in interviewed households not captured by the 12-month household food security scale. This file provides three food security variables (categorical, raw score, and scale score) for each of these scales along with household identification variables to allow the user to match this supplementary data file to the CPS-FSS April 1999 data file. 4) CPS 2000 30-day Food Security data--Subsequent to the release of the September 2000 CPS-FSS public-use data file, USDA developed a revised 30-day CPS Food Security Scale. This file provides three food security variables (categorical, raw score, and scale score) for the 30-day scale along with household identification variables to allow the user to match this supplementary data file to the CPS-FSS September 2000 data file. Food security is measured at the household level in three categories: food secure, low food security and very low food security. Each category is measured by a total count and as a percent of the total population. Categories and measurements are broken down further based on the following demographic characteristics: household composition, race/ethnicity, metro/nonmetro area of residence, and geographic region. The food security scale includes questions about households and their ability to purchase enough food and balanced meals, questions about adult meals and their size, frequency skipped, weight lost, days gone without eating, questions about children meals, including diversity, balanced meals, size of meals, skipped meals and hunger. Questions are also asked about the use of public assistance and supplemental food assistance. The food security scale is 18 items that measure insecurity. A score of 0-2 means a house is food secure, from 3-7 indicates low food security, and 8-18 means very low food security. The scale and the data also report the frequency with which each item is experienced. Data are available as .dat files which may be processed in statistical software or through the United State Census Bureau's DataFerret http://dataferrett.census.gov/. Data from 2010 onwards is available below and online. Data from 1995-2009 must be accessed through DataFerrett. DataFerrett is a data analysis and extraction tool to customize federal, state, and local data to suit your requirements. Through DataFerrett, the user can develop an unlimited array of customized spreadsheets that are as versatile and complex as your usage demands then turn those spreadsheets into graphs and maps without any additional software. Resources in this dataset:Resource Title: December 2014 Food Security CPS Supplement. File Name: dec14pub.zipResource Title: December 2013 Food Security CPS Supplement. File Name: dec13pub.zipResource Title: December 2012 Food Security CPS Supplement. File Name: dec12pub.zipResource Title: December 2011 Food Security CPS Supplement. File Name: dec11pub.zipResource Title: December 2010 Food Security CPS Supplement. File Name: dec10pub.zip

  4. Major League Baseball Game Logs

    • kaggle.com
    zip
    Updated Dec 20, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    The Devastator (2023). Major League Baseball Game Logs [Dataset]. https://www.kaggle.com/thedevastator/major-league-baseball-game-logs
    Explore at:
    zip(24569430 bytes)Available download formats
    Dataset updated
    Dec 20, 2023
    Authors
    The Devastator
    License

    Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
    License information was derived automatically

    Description

    Major League Baseball Game Logs

    Historical MLB Game Logs and Player Statistics from 1871-2016

    By Dataquest [source]

    About this dataset

    This comprehensive dataset provides a historical record of Major League Baseball (MLB) games dating back to its inception. It offers an in-depth look into the game's significant aspects, encompassing detailed statistics, player performance information, and game outcomes across multiple seasons.

    The MLB Game Logs dataset is a rich depository of data provided in the form of structured records. Sourced from Retrosheet, this dataset was initially presented in 127 distinct CSV files which have now been amalgamated into a single consolidated file for facilitating seamless analysis.

    Starting from the fundamental game statistics like date and venue of matches, team names and IDs to capturing minute attributes such as day or night match distinction or completion info; all pertinent details are captured meticulously in this voluminous repository. More granular inputs like lengths of games measured via outs or attendance figures lend further richness to this set.

    From a player performance perspective too the set is equally exhaustive housing data on hits home runs stealing bases sacrificing ventures extra-base hits runs batted in (RBIs), winning pitchers losing pitchers saving pitchers all listed alongside their respective players IDs for easy cross-referencing.

    In addition to providing raw data,this dataset carries greatly-detailed column names grounded upon Retrosheets field explanations to proffer better clarity around each field contained within it's ambit thus ensuring users derive maximum value with minimal misinterpretation issues.

    While comprehensive explanations about columns have been included within the data dictionary part of our files ,we recommend referring directly towards Retrosheet field explanation for complete details surround specific fields if so required.

    As part and parcel respect for copyright terms belonging towards Retrosheet we declare :

    The information used here was obtained free of charge from and is copyrighted by Retrosheet.
    Interested parties may contact Retrosheet at www.retrosheet.org.
    
    

    We hope that enthusiasts, researchers, statisticians and other users find value in this rich resource of baseball history

    How to use the dataset

    This dataset is a treasure trove of history and statistics, offering detailed player and game information for MLB games from 1871 to 2016. This 'how-to-use' guide will thus be helpful for beginners or others who are not familiar with the dataset format.

    • Understand columns: Familiarize yourself with the numerous columns in this data set. Each column offers distinct information about each game, such as player performance, location of the match, number of spectators and more. It's okay if you do not understand everything right away.

    • Read documentation: You'll find a 'Retrosheet field explanation' link in the description provided above which explains each column in detail. Do make sure to go through it to get a better understanding.

    • Define your objective: Are you looking at predicting future game outcomes? Or trying to find patterns between attendance and team performance? Defined objectives will help focusing on relevant columns greatly reducing needless exploration efforts.

    • Cleaning Data: A few data points might have missing values or illegible entries; identifying them could help provide accurate insights from analysis.

    • Perform initial EDA (Exploratory Data Analysis): EDA is an approach that includes inspecting, cleaning, transforming, and visualizing raw datasets to inform our understanding of their underlying structure that might inform our selection or creation of statistical models later on down the line:

      • Histograms: Could provide frequency distributions for numeric variables.

      • Box plots: A good way of quickly visualizing where most data points lie.

      • Pivot tables: Aggregating specific groups can give you comprehensive insights into large sets.

    • Statistical Analysis & Machine Learning Models: With clear objectives & prepared dataset at hand trying various machine learning models for prediction like Logistic Regression model for binary outcome prediction (win/lose), Multiple Linear Regression model when outcome variable is numerical (score), decision trees for data segmentation and so on.

    • Visualize: It always help to build charts, graphs or tables for final insights visualization for others.

    This datas...

  5. Aircraft Flux-Raw: U of Wy. (FIFE) - Dataset - NASA Open Data Portal

    • data.nasa.gov
    Updated Apr 1, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    nasa.gov (2025). Aircraft Flux-Raw: U of Wy. (FIFE) - Dataset - NASA Open Data Portal [Dataset]. https://data.nasa.gov/dataset/aircraft-flux-raw-u-of-wy-fife-105f2
    Explore at:
    Dataset updated
    Apr 1, 2025
    Dataset provided by
    NASAhttp://nasa.gov/
    Area covered
    Wyoming
    Description

    The University of Wyoming (UW) King Air atmospheric boundary layer measurement missions were flown in 1987 during IFCs 3 and 4. This Raw Boundary Layer Fluxes data set contains parameters that describe the environment in which the flux data were collected and the flux data itself. The fluctuations in all variables were calculated with three different methods (the arithmetic means removed, the linear trends removed, or filtered with a high-pass recursive filter) prior to the eddy correlation calculations. This data set contains the data with the arithmetic means removed (i.e., RAW). All the flux measurements were obtained with the eddy-correlation method, wherein the aircraft is equipped with an inertial platform, accelerometers, and a gust probe for measurement of earth-relative gusts in the x, y, and z directions. Gusts in these dimensions are then correlated with each other for momentum fluxes and with fluctuations in other variables to obtain the various scalar fluxes, such as temperature (for sensible heat flux) and water vapor mixing ratio (for latent heat flux). The summary of data calculated from each aircraft pass includes various statistics, correlations, and fluxes calculated after the time series for each variable with the arithmetic means removed.

  6. Crimes In India

    • kaggle.com
    zip
    Updated Jul 16, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Saurabh Shahane (2022). Crimes In India [Dataset]. https://www.kaggle.com/datasets/saurabhshahane/crimes-in-india/code
    Explore at:
    zip(139282 bytes)Available download formats
    Dataset updated
    Jul 16, 2022
    Authors
    Saurabh Shahane
    Area covered
    India
    Description

    This dataset contains number of crimes filed under each category of the Indian Penal Code (IPC), number of victims of those crimes, and average crime rate. The data is presented separately by IPC category and sub-category. Data are available at the state/UT level for 2018.

    ● 7060_source_data.csv: The raw data from the source with original administrative dimensions. This dataset may have already been restructured by scraping PDFs, combining files, or pivoting tables to fit the proper tabular format used by NDAP, but the actual data values remain unchanged. ● NDAP_REPORT_7060.csv: The final standardised data using LGD geographic dimensions as seen on NPAP. ● 7060_metadata.csv: Variable-level metadata, including the following fields: ❖ VariableName: The full variable name as it appears in the data ❖ VariableCode: A unique variable code that is used as a short name for the variable during internal processing and can be used for simplicity if desired ❖ Type_Of_Variable: The classification of the column, whether it is a dimension or a variable (i.e. indicator) ❖ Unit_Of_Measure: ❖ Aggregation_Type: The default aggregation function to be used when aggregating each variable ❖ Weighing_Variable_Name: The weight assigned to each variable that is used by default when aggregating ❖ Weighing_Variable_ID: The weighting variable id corresponding to the weighing variable name ❖ Long_Description: A more descriptive definition of the variable ❖ Scaling_factor: Scaling factor from source ● 7060_KEYS.csv: The key which maps source administrative units to the standardised Local Government Directory (LGD) dimensions. This file also contains pre-calculated weights for every constituent unit mapped from the source dimensions into the LGD. You can interpret each row as describing what fraction of the source unit is mapped to a corresponding LGD unit. This file includes the following fields: ❖ src[Unit]Name: The administrative unit name as it appears in the source data. Depending on the dataset, that may include State, District, Subdistrict, Block, Village/Town, etc. ❖ [Unit]Name: The standardised administrative unit name as it appears in the LGD. Depending on the dataset, that may include State, District, Subdistrict, Block, Village/Town, etc. ❖ [Unit]Name: The standardised administrative unit code corresponding to the unit name in the LDG. ❖ Year: The year in which the data was collected or reported. Depending on the dataset, any other temporal variables may also be present (Quarter, Month, Calendar Day, etc.) ❖ Number_Of_Children: The number of LGD units associated with the mapping described by an individual row. Units from the source that have undergone a split will contain multiple children. ❖ Number_Of_Parents: The number of source units associated with the mapping described by an individual row. Units from the source that have undergone a merge will contain multiple parents. ❖ Weighing_Variables: Households, Population, Male Population, Female Population, Land Area (Total, Rural, and Urban versions of each). For each weighing variable there are the following associated fields: ■ Count: the total count of households, population, or land area mapped from the source unit to the LGD unit for that particular row (NumberOfHouseholds, TotalPopulation, LandArea). ■ Mapping_Error: the percentage error due to missing villages in the base data, meaning what fraction of the weighing variable is dropped because the microdata could not be mapped to the LGD. ■ Weighing_Ratio: the weighing ratio for that constituent match of source unit to LGD unit for each particular row. This is the fraction applied to the source data to achieve the LGD-standardised final data

  7. d

    Data from: Aircraft Flux-Raw: U of Wy. (FIFE)

    • catalog.data.gov
    • s.cnmilf.com
    • +4more
    Updated Sep 19, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ORNL_DAAC (2025). Aircraft Flux-Raw: U of Wy. (FIFE) [Dataset]. https://catalog.data.gov/dataset/aircraft-flux-raw-u-of-wy-fife-66c69
    Explore at:
    Dataset updated
    Sep 19, 2025
    Dataset provided by
    ORNL_DAAC
    Area covered
    Wyoming
    Description

    The University of Wyoming (UW) King Air atmospheric boundary layer measurement missions were flown in 1987 during IFCs 3 and 4. This Raw Boundary Layer Fluxes data set contains parameters that describe the environment in which the flux data were collected and the flux data itself. The fluctuations in all variables were calculated with three different methods (the arithmetic means removed, the linear trends removed, or filtered with a high-pass recursive filter) prior to the eddy correlation calculations. This data set contains the data with the arithmetic means removed (i.e., RAW). All the flux measurements were obtained with the eddy-correlation method, wherein the aircraft is equipped with an inertial platform, accelerometers, and a gust probe for measurement of earth-relative gusts in the x, y, and z directions. Gusts in these dimensions are then correlated with each other for momentum fluxes and with fluctuations in other variables to obtain the various scalar fluxes, such as temperature (for sensible heat flux) and water vapor mixing ratio (for latent heat flux). The summary of data calculated from each aircraft pass includes various statistics, correlations, and fluxes calculated after the time series for each variable with the arithmetic means removed.

  8. Census 2011 - South Africa

    • catalog.ihsn.org
    • datacatalog.ihsn.org
    • +1more
    Updated Mar 29, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statistics South Africa (2019). Census 2011 - South Africa [Dataset]. https://catalog.ihsn.org/catalog/4092
    Explore at:
    Dataset updated
    Mar 29, 2019
    Dataset authored and provided by
    Statistics South Africahttp://www.statssa.gov.za/
    Time period covered
    2011
    Area covered
    South Africa
    Description

    Abstract

    Censuses are principal means of collecting basic population and housing statistics required for social and economic development, policy interventions, their implementation and evaluation.The census plays an essential role in public administration. The results are used to ensure: • equity in distribution of government services • distributing and allocating government funds among various regions and districts for education and health services • delineating electoral districts at national and local levels, and • measuring the impact of industrial development, to name a few The census also provides the benchmark for all surveys conducted by the national statistical office. Without the sampling frame derived from the census, the national statistical system would face difficulties in providing reliable official statistics for use by government and the public. Census also provides information on small areas and population groups with minimum sampling errors. This is important, for example, in planning the location of a school or clinic. Census information is also invaluable for use in the private sector for activities such as business planning and market analyses. The information is used as a benchmark in research and analysis.

    Census 2011 was the third democratic census to be conducted in South Africa. Census 2011 specific objectives included: - To provide statistics on population, demographic, social, economic and housing characteristics; - To provide a base for the selection of a new sampling frame; - To provide data at lowest geographical level; and - To provide a primary base for the mid-year projections.

    Geographic coverage

    National

    Analysis unit

    Households, Individuals

    Kind of data

    Census/enumeration data [cen]

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    About the Questionnaire : Much emphasis has been placed on the need for a population census to help government direct its development programmes, but less has been written about how the census questionnaire is compiled. The main focus of a population and housing census is to take stock and produce a total count of the population without omission or duplication. Another major focus is to be able to provide accurate demographic and socio-economic characteristics pertaining to each individual enumerated. Apart from individuals, the focus is on collecting accurate data on housing characteristics and services.A population and housing census provides data needed to facilitate informed decision-making as far as policy formulation and implementation are concerned, as well as to monitor and evaluate their programmes at the smallest area level possible. It is therefore important that Statistics South Africa collects statistical data that comply with the United Nations recommendations and other relevant stakeholder needs.

    The United Nations underscores the following factors in determining the selection of topics to be investigated in population censuses: a) The needs of a broad range of data users in the country; b) Achievement of the maximum degree of international comparability, both within regions and on a worldwide basis; c) The probable willingness and ability of the public to give adequate information on the topics; and d) The total national resources available for conducting a census.

    In addition, the UN stipulates that census-takers should avoid collecting information that is no longer required simply because it was traditionally collected in the past, but rather focus on key demographic, social and socio-economic variables.It becomes necessary, therefore, in consultation with a broad range of users of census data, to review periodically the topics traditionally investigated and to re-evaluate the need for the series to which they contribute, particularly in the light of new data needs and alternative data sources that may have become available for investigating topics formerly covered in the population census. It was against this background that Statistics South Africa conducted user consultations in 2008 after the release of some of the Community Survey products. However, some groundwork in relation to core questions recommended by all countries in Africa has been done. In line with users' meetings, the crucial demands of the Millennium Development Goals (MDGs) should also be met. It is also imperative that Stats SA meet the demands of the users that require small area data.

    Accuracy of data depends on a well-designed questionnaire that is short and to the point. The interview to complete the questionnaire should not take longer than 18 minutes per household. Accuracy also depends on the diligence of the enumerator and honesty of the respondent.On the other hand, disadvantaged populations, owing to their small numbers, are best covered in the census and not in household sample surveys.Variables such as employment/unemployment, religion, income, and language are more accurately covered in household surveys than in censuses.Users'/stakeholders' input in terms of providing information in the planning phase of the census is crucial in making it a success. However, the information provided should be within the scope of the census.

    1. The Household Questionnaire is divided into the following sections:
    2. Household identification particulars
    3. Individual particulars Section A: Demographics Section B: Migration Section C: General Health and Functioning Section D: Parental Survival and Income Section E: Education Section F: Employment Section G: Fertility (Women 12-50 Years Listed) Section H: Housing, Household Goods and Services and Agricultural Activities Section I: Mortality in the Last 12 Months The Household Questionnaire is available in Afrikaans; English; isiZulu; IsiNdebele; Sepedi; SeSotho; SiSwati;Tshivenda;Xitsonga

    4. The Transient and Tourist Hotel Questionnaire (English) is divided into the following sections:

    5. Name, Age, Gender, Date of Birth, Marital Status, Population Group, Country of birth, Citizenship, Province.

    6. The Questionnaire for Institutions (English) is divided into the following sections:

    7. Particulars of the institution

    8. Availability of piped water for the institution

    9. Main source of water for domestic use

    10. Main type of toilet facility

    11. Type of energy/fuel used for cooking, heating and lighting at the institution

    12. Disposal of refuse or rubbish

    13. Asset ownership (TV, Radio, Landline telephone, Refrigerator, Internet facilities)

    14. List of persons in the institution on census night (name, date of birth, sex, population group, marital status, barcode number)

    15. The Post Enumeration Survey Questionnaire (English)

    These questionnaires are provided as external resources.

    Cleaning operations

    Data editing and validation system The execution of each phase of Census operations introduces some form of errors in Census data. Despite quality assurance methodologies embedded in all the phases; data collection, data capturing (both manual and automated), coding, and editing, a number of errors creep in and distort the collected information. To promote consistency and improve on data quality, editing is a paramount phase in identifying and minimising errors such as invalid values, inconsistent entries or unknown/missing values. The editing process for Census 2011 was based on defined rules (specifications).

    The editing of Census 2011 data involved a number of sequential processes: selection of members of the editing team, review of Census 2001 and 2007 Community Survey editing specifications, development of editing specifications for the Census 2011 pre-tests (2009 pilot and 2010 Dress Rehearsal), development of firewall editing specifications and finalisation of specifications for the main Census.

    Editing team The Census 2011 editing team was drawn from various divisions of the organisation based on skills and experience in data editing. The team thus composed of subject matter specialists (demographers and programmers), managers as well as data processors. Census 2011 editing team was drawn from various divisions of the organization based on skills and experience in data editing. The team thus composed of subject matter specialists (demographers and programmers), managers as well as data processors.

    The Census 2011 questionnaire was very complex, characterised by many sections, interlinked questions and skipping instructions. Editing of such complex, interlinked data items required application of a combination of editing techniques. Errors relating to structure were resolved using structural query language (SQL) in Oracle dataset. CSPro software was used to resolve content related errors. The strategy used for Census 2011 data editing was implementation of automated error detection and correction with minimal changes. Combinations of logical and dynamic imputation/editing were used. Logical imputations were preferred, and in many cases substantial effort was undertaken to deduce a consistent value based on the rest of the household’s information. To profile the extent of changes in the dataset and assess the effects of imputation, a set of imputation flags are included in the edited dataset. Imputation flags values include the following: 0 no imputation was performed; raw data were preserved 1 Logical editing was performed, raw data were blank 2 logical editing was performed, raw data were not blank 3 hot-deck imputation was performed, raw data were blank 4 hot-deck imputation was performed, raw data were not blank

    Data appraisal

    Independent monitoring and evaluation of Census field activities Independent monitoring of the Census 2011 field activities was carried out by a team of 31 professionals and 381 Monitoring

  9. Hard Drive Test Data

    • kaggle.com
    zip
    Updated Nov 5, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Backblaze (2016). Hard Drive Test Data [Dataset]. https://www.kaggle.com/backblaze/hard-drive-test-data
    Explore at:
    zip(211395505 bytes)Available download formats
    Dataset updated
    Nov 5, 2016
    Dataset authored and provided by
    Backblaze
    Description

    Context

    Each day, Backblaze takes a snapshot of each operational hard drive that includes basic hard drive information (e.g., capacity, failure) and S.M.A.R.T. statistics reported by each drive. This dataset contains data from the first two quarters in 2016.

    Content

    This dataset contains basic hard drive information and 90 columns or raw and normalized values of 45 different S.M.A.R.T. statistics. Each row represents a daily snapshot of one hard drive.

    • date: Date in yyyy-mm-dd format

    • serial_number: Manufacturer-assigned serial number of the drive

    • model: Manufacturer-assigned model number of the drive

    • capacity_bytes: Drive capacity in bytes

    • failure: Contains a “0” if the drive is OK. Contains a “1” if this is the last day the drive was operational before failing.

    • 90 variables that begin with 'smart': Raw and Normalized values for 45 different SMART stats as reported by the given drive

    Inspiration

    Some items to keep in mind as you process the data:

    • S.M.A.R.T. statistic can vary in meaning based on the manufacturer and model. It may be more informative to compare drives that are similar in model and manufacturer

    • Some S.M.A.R.T. columns can have out-of-bound values

    • When a drive fails, the 'failure' column is set to 1 on the day of failure, and starting the day after, the drive will be removed from the dataset. Each day, new drives are also added. This means that total number of drives each day may vary.

    • S.M.A.R.T. 9 is the number of hours a drive has been in service. To calculate a drive's age in days, divide this number by 24.

    Given the hints above, below are a couple of questions to help you explore the dataset:

    1. What is the median survival time of a hard drive? How does this differ by model/manufacturer?

    2. Can you calculate the probability that a hard drive will fail given the hard drive information and statistics in the dataset?

    Acknowledgement

    The original collection of data can be found here. When using this data, Backblaze asks that you cite Backblaze as the source; you accept that you are solely responsible for how you use the data; and you do not sell this data to anyone.

  10. Descriptive statistics of the raw data for female and male MZ and DZ twins...

    • plos.figshare.com
    xls
    Updated Jun 3, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nilufer Rahmioglu; Toby Andrew; Lynn Cherkas; Gabriela Surdulescu; Ramasamyiyer Swaminathan; Tim Spector; Kourosh R. Ahmadi (2023). Descriptive statistics of the raw data for female and male MZ and DZ twins including total number of individuals (n), means and standard deviations (s.d.) for the liver function test proteins and the major covariates included in the study. [Dataset]. http://doi.org/10.1371/journal.pone.0004435.t001
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 3, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Nilufer Rahmioglu; Toby Andrew; Lynn Cherkas; Gabriela Surdulescu; Ramasamyiyer Swaminathan; Tim Spector; Kourosh R. Ahmadi
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Note AST was not measured in male twins.

  11. DCMS Sectors Economic Estimates 2022: Business Demographics

    • gov.uk
    Updated Nov 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department for Digital, Culture, Media & Sport (2023). DCMS Sectors Economic Estimates 2022: Business Demographics [Dataset]. https://www.gov.uk/government/statistics/dcms-sectors-economic-estimates-2022-business-demographics
    Explore at:
    Dataset updated
    Nov 16, 2023
    Dataset provided by
    GOV.UKhttp://gov.uk/
    Authors
    Department for Digital, Culture, Media & Sport
    Description

    About

    These Economic Estimates are National Statistics providing an estimate of the contribution of DCMS Sectors to the UK economy, measured by the number of businesses.

    We have experimented with using a different, more timely data source to calculate this year’s Business Demographics statistics. As a result, they are not comparable with earlier DCMS Sector Business Demographics publications. More information is provided in these published documents and in the “Call for Feedback” section below.

    Content

    These statistics cover the contributions of the following DCMS sectors to the UK economy;

    • Creative Industries
    • Cultural Sector
    • Digital Sector
    • Gambling
    • Sport
    • Telecoms
    • Tourism (defined in this release as the Tourism Industries)

    Users should note that there is overlap between DCMS Sector definitions and that the Telecoms sector sits wholly within the Digital sector. Estimates are not available for the Civil Society sector, because they are not identifiable in the data source used for this release.

    The release also includes estimates for the Audio Visual sector, which is not a DCMS Sector but is “adjacent” to it and includes some industries also common to DCMS Sectors.

    A definition for each sector is available in the published data tables.

    Released

    These statistics were first published on 8 December 2022

    Call for Feedback

    In this publication we have experimented with using a snapshot of the Inter-Departmental Business Register (IDBR) to generate estimates of DCMS Business Demographics, rather than the Annual Business Survey (ABS) as in previous releases. This has the advantage of being more timely, and commits to most tables included in previous Business Demographics publications. We have used the March 2019, March 2020, March 2021 and March 2022 snapshots from the ONS https://www.ons.gov.uk/businessindustryandtrade/business/activitysizeandlocation/datasets/ukbusinessactivitysizeandlocation">UK business: activity, size and location release rather than raw data from the IDBR.

    We are looking for feedback on this approach. We particularly welcome views on:

    • Continuing with this approach using the more timely IDBR data source.
    • Returning to the previous approach using the ABS as a data source.
    • Experimenting with a ‘mixed approach’ where estimates would be produced using the IDBR snapshot, supplemented with further data from the ABS to produce additional tables.

    Please contact evidence@dcms.gov.uk before Thursday 9th February 2023 with any feedback.

    Hard copy feedback can be sent to:

    DCMS Economic Estimates Team
    Department for Digital, Culture, Media & Sport
    4th Floor - area 4/34
    100 Parliament Street
    London
    SW1A 2BQ

    The UK Statistics Authority

    This release is published in accordance with the Code of Practice for Statistics (2018) produced by the UK Statistics Authority (UKSA). The UKSA has the overall objective of promoting and safeguarding the production and publication of official statistics that serve the public good. It monitors and reports on all official statistics, and promotes good practice in this area.

    Pre-release access

    The accompanying pre-release access document lists ministers and officials who have received privileged early access to this release. In line with best practice, the list has been kept to a minimum and those given access for briefing purposes had a maximum of 24 hours.

    Contact

    Responsible analyst: Eri Hutchinson

    For any queries or feedback, please contact evidence@dcms.gov.uk.

  12. g

    Jobseekers registered with France Travail by area of employment (quarterly,...

    • gimi9.com
    Updated Nov 22, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2024). Jobseekers registered with France Travail by area of employment (quarterly, gross) | gimi9.com [Dataset]. https://gimi9.com/dataset/eu_https-data-dares-travail-emploi-gouv-fr-explore-dataset-dares_defm_zoneeemploi_brut_trim-
    Explore at:
    Dataset updated
    Nov 22, 2024
    Area covered
    France
    Description

    Data These data relate to jobseekers registered at the end of the month at Pôle emploi by category, sex and age group, seniority, and by area of employment (zoning 2020). They are raw and rounded to the ten. There may therefore be slight differences between the sum of the disaggregated data and the aggregated series. The ages used for the different series are the ages at the end of the month (age that the job seeker will have at the end of the month in question). The data are presented by region (including the regional part of the employment areas straddling two regions, in italics). For ease of reading by employment area, statistics for employment areas located in two regions (those with a code starting with "00") are also presented on an aggregated basis at the level of the employment area, irrespective of their regions of origin. Finally, statistics are presented for jobseekers who cannot be located in an employment area (cases where the municipality of residence is not known). For the latter, however, it is possible to distinguish between those residing in metropolitan France and those residing in a DOM. The Drom data presented in this file do not start until January 2000, with jobseekers updated twice a year before that date. None of the data in this file has been the subject of an application for labelling with the Authority for Public Statistics. Abbreviations used: n.o. = not available (mainly divided by 0). n.o.s. = not significant. The ages used for the different series are the average ages during the quarter (average age of the job seeker over the 3 months considered). None of the data in this file has been the subject of an application for labelling with the Authority for Public Statistics. ### Definition Full documentation on data on registered jobseekers and vacancies collected by Pôle emploi is available on the Dares website (see document Methodological documentation - Jobseekers). Information on employment areas (definitions, downloadable files, etc.) is available on the INSEE website. ### Field * the geographical grouping ‘**** Metropolitan France’ includes all the French territories on the European continent (96 departments); * The geographical grouping ‘France’ includes metropolitan France and the overseas departments/regions (DROM), with the exception of Mayotte. ### Source The data are taken from the files of the Monthly Labour Market Statistics (MLTS) of Dares and Pôle emploi. ### Warnings In addition to labour market developments, data on jobseekers can be affected by a number of factors: changes to the rules on compensation or support for jobseekers, procedural changes, incidents. A document presents the main procedural changes and incidents affecting the statistics of jobseekers since 2011 (see document Incidents and procedural changes). More generally, certain precautions are necessary for the interpretation of the monthly series on jobseekers.Documentation provides guidance for the interpretation of the monthly series on jobseekers. Monthly developments in category A, B and C outflows due to non-discounting have been affected for several months. Depending on the month, this affects the number of jobseekers in categories A, B and C upwards (in the event of a decrease in discounting errors) or downwards (in the event of an increase in discounting errors). This could be explained, at least in part, by a sensitivity of discounting behaviour to the number of working days in the update period following the change in the update calendar in January 2016 (see methodological note). Trends should therefore be favoured over month-to-month developments (see the guidance note on the interpretation of the monthly series). Conversely, in August and November 2016 the number of category A, B and C outflows due to non-discounting decreased unusually sharply. On the other hand, in February, March 2016 and September 2016, this number returned to a more usual level.

  13. The raw datasets and statistics for each analysis result graph.

    • plos.figshare.com
    zip
    Updated Jun 11, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Barry Smyth (2023). The raw datasets and statistics for each analysis result graph. [Dataset]. http://doi.org/10.1371/journal.pone.0251513.s002
    Explore at:
    zipAvailable download formats
    Dataset updated
    Jun 11, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Barry Smyth
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Each individual result graph is associated with 4 different comma-separated files: (i) Raw—the (anonymised) raw data behind the means and standard deviations used for a particular result graph; (ii) Paired—the paired statistical significance results; (iii) Successive Male—the statistical significance results to compare successive groups (age and ability) for male runners; and (iv) Successive Female—the corresponding results for the statistical significance tests to compare successive groups (age and ability) of female runners. (ZIP)

  14. I

    Global PCI & PCIe Image Capture Card Market Technological Advancements...

    • statsndata.org
    excel, pdf
    Updated Nov 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stats N Data (2025). Global PCI & PCIe Image Capture Card Market Technological Advancements 2025-2032 [Dataset]. https://www.statsndata.org/report/pci-pcie-image-capture-card-market-71997
    Explore at:
    pdf, excelAvailable download formats
    Dataset updated
    Nov 2025
    Dataset authored and provided by
    Stats N Data
    License

    https://www.statsndata.org/how-to-orderhttps://www.statsndata.org/how-to-order

    Area covered
    Global
    Description

    The PCI and PCIe Image Capture Card market plays a crucial role in various industries, ranging from gaming and streaming to medical imaging and security surveillance. These cards are integral for converting raw video and imaging data from high-definition sources into digital formats that can be processed, recorded,

  15. Kyrgyzstan Roads (OpenStreetMap Export)

    • data.humdata.org
    geojson, geopackage +2
    Updated Oct 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Humanitarian OpenStreetMap Team (HOT) (2025). Kyrgyzstan Roads (OpenStreetMap Export) [Dataset]. https://data.humdata.org/dataset/dee07cb7-c099-4c3c-8cd8-a50bcbf5b962?force_layout=desktop
    Explore at:
    geopackage(20404941), kml(12857189), geojson(13191287), shp(20454777), geojson(64561), kml(63349), geopackage(104308), shp(101664)Available download formats
    Dataset updated
    Oct 4, 2025
    Dataset provided by
    Humanitarian OpenStreetMap Team
    OpenStreetMap//www.openstreetmap.org/
    License

    Open Database License (ODbL) v1.0https://www.opendatacommons.org/licenses/odbl/1.0/
    License information was derived automatically

    Area covered
    Kyrgyzstan
    Description

    OpenStreetMap contains roughly 85.0 thousand km of roads in this region. Based on AI-mapped estimates, this is approximately 34 % of the total road length in the dataset region. The average age of data for the region is 4 years ( Last edited 8 days ago ) and 10% of roads were added or updated in the last 6 months. Read about what this summary means : indicators , metrics

    This theme includes all OpenStreetMap features in this area matching ( Learn what tags means here ) :

    tags['highway'] IS NOT NULL

    Features may have these attributes:

    This dataset is one of many "https://data.humdata.org/organization/hot">OpenStreetMap exports on HDX. See the Humanitarian OpenStreetMap Team website for more information.

  16. d

    Census_sum_15

    • catalog.data.gov
    • datasets.ai
    • +1more
    Updated Nov 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Census_sum_15 [Dataset]. https://catalog.data.gov/dataset/census-sum-15
    Explore at:
    Dataset updated
    Nov 27, 2025
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Description

    The GIS layer "Census_sum_15" provides a standardized tool for examining spatial patterns in abundance and demographic trends of the southern sea otter (Enhydra lutris nereis), based on data collected during the spring 2015 range-wide census. The USGS range-wide sea otter census has been undertaken twice a year since 1982, once in May and once in October, using consistent methodology involving both ground-based and aerial-based counts. The spring census is considered more accurate than the fall count, and provides the primary basis for gauging population trends by State and Federal management agencies. This Shape file includes a series of summary statistics derived from the raw census data, including sea otter density (otters per square km of habitat), linear density (otters per km of coastline), relative pup abundance (ratio of pups to independent animals) and 5-year population trend (calculated as exponential rate of change). All statistics are calculated and plotted for small sections of habitat in order to illustrate local variation in these statistics across the entire mainland distribution of sea otters in California (as of 2015). Sea otter habitat is considered to extend offshore from the mean low tide line and out to the 60m isobath: this depth range includes over 99% of sea otter feeding dives, based on dive-depth data from radio tagged sea otters (Tinker et al 2006, 2007). Sea otter distribution in California (the mainland range) is considered to comprise this band of potential habitat stretching along the coast of California, and bounded to the north and south by range limits defined as "the points farthest from the range center at which 5 or more otters are counted within a 10km contiguous stretch of coastline (as measured along the 10m bathymetric contour) during the two most recent spring censuses, or at which these same criteria were met in the previous year". The polygon corresponding to the range definition was then sub-divided into onshore/offshore strips roughly 500 meters in width. The boundaries between these strips correspond to ATOS (As-The-Otter-Swims) points, which are arbitrary locations established approximately every 500 meters along a smoothed 5 fathom bathymetric contour (line) offshore of the State of California.

  17. d

    Annual California Sea Otter Census: 2019 Census Summary Shapefile

    • catalog.data.gov
    • data.usgs.gov
    • +2more
    Updated Nov 27, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Annual California Sea Otter Census: 2019 Census Summary Shapefile [Dataset]. https://catalog.data.gov/dataset/annual-california-sea-otter-census-2019-census-summary-shapefile
    Explore at:
    Dataset updated
    Nov 27, 2025
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Description

    The GIS shapefile Census_sum_2019 provides a standardized tool for examining spatial patterns in abundance and demographic trends of the southern sea otter (Enhydra lutris nereis), based on data collected during the spring 2019 range-wide census. The USGS spring range-wide sea otter census has been undertaken each year since 1982, using consistent methodology involving both ground-based and aerial-based counts. The spring census provides the primary basis for gauging population trends by State and Federal management agencies. This shapefile includes a series of summary statistics derived from the raw census data, including sea otter density (otters per square kilometer of habitat), linear density (otters per kilometer of coastline), relative pup abundance (ratio of pups to independent animals) and 5-year population trend (calculated as exponential rate of change). All statistics are calculated and plotted for small sections of habitat in order to illustrate local variation in these statistics across the entire mainland distribution of sea otters in California (as of 2019). Sea otter habitat is considered to extend offshore from the mean low tide line and out to the 60 meter isobath: this depth range includes over 99 percent of sea otter feeding dives, based on dive-depth data from radio tagged sea otters (Tinker et al 2006, 2007). Sea otter distribution in California (the mainland range) is considered to comprise this band of potential habitat stretching along the coast of California, and bounded to the north and south by range limits defined by combining independent otters within a moving window of 10-kilometer stretches of coastline (as measured along the 10-meter bathymetric contour; 20 contiguous ATOS intervals each) and taking the northern and southern ATOS values, respectively, of the northernmost and southernmost stretches in which at least five otters were counted for at least 2 consecutive spring surveys during the last 3 years. The polygon corresponding to the range definition was then sub-divided into onshore/offshore strips roughly 500 meters in width. The boundaries between these strips correspond to ATOS (As-The-Otter-Swims) points, which are arbitrary locations established approximately every 500 meters along a smoothed 5 fathom bathymetric contour (line) offshore of the State of California. References: Tinker, M. T., Doak, D. F., Estes, J. A., Hatfield, B. B., Staedler, M. M. and Bodkin, J. L. (2006), INCORPORATING DIVERSE DATA AND REALISTIC COMPLEXITY INTO DEMOGRAPHIC ESTIMATION PROCEDURES FOR SEA OTTERS. Ecological Applications, 16: 2293–2312, https://doi.org/10.1890/1051-0761(2006)016[2293:IDDARC]2.0.CO;2 Tinker, M. T. , D. P. Costa , J. A. Estes , and N. Wieringa . 2007. Individual dietary specialization and dive behaviour in the California sea otter: using archival time–depth data to detect alternative foraging strategies. Deep Sea Research II 54: 330–342, https://doi.org/10.1016/j.dsr2.2006.11.012

  18. d

    Annual California Sea Otter Census: 2017 Census Summary Shapefile

    • catalog.data.gov
    • data.usgs.gov
    • +3more
    Updated Nov 20, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Annual California Sea Otter Census: 2017 Census Summary Shapefile [Dataset]. https://catalog.data.gov/dataset/annual-california-sea-otter-census-2017-census-summary-shapefile
    Explore at:
    Dataset updated
    Nov 20, 2025
    Dataset provided by
    U.S. Geological Survey
    Description

    The GIS shapefile "Census summary of southern sea otter 2017" provides a standardized tool for examining spatial patterns in abundance and demographic trends of the southern sea otter (Enhydra lutris nereis), based on data collected during the spring 2017 range-wide census. The USGS range-wide sea otter census has been undertaken twice a year since 1982, once in May and once in October, using consistent methodology involving both ground-based and aerial-based counts. The spring census is considered more accurate than the fall count, and provides the primary basis for gauging population trends by State and Federal management agencies. This Shape file includes a series of summary statistics derived from the raw census data, including sea otter density (otters per square km of habitat), linear density (otters per km of coastline), relative pup abundance (ratio of pups to independent animals) and 5-year population trend (calculated as exponential rate of change). All statistics are calculated and plotted for small sections of habitat in order to illustrate local variation in these statistics across the entire mainland distribution of sea otters in California (as of 2017). Sea otter habitat is considered to extend offshore from the mean low tide line and out to the 60m isobath: this depth range includes over 99% of sea otter feeding dives, based on dive-depth data from radio tagged sea otters (Tinker et al 2006, 2007). Sea otter distribution in California (the mainland range) is considered to comprise this band of potential habitat stretching along the coast of California, and bounded to the north and south by range limits defined as "the points farthest from the range center at which 5 or more otters are counted within a 10km contiguous stretch of coastline (as measured along the 10m bathymetric contour) during the two most recent spring censuses, or at which these same criteria were met in the previous year". The polygon corresponding to the range definition was then sub-divided into onshore/offshore strips roughly 500 meters in width. The boundaries between these strips correspond to ATOS (As-The-Otter-Swims) points, which are arbitrary locations established approximately every 500 meters along a smoothed 5 fathom bathymetric contour (line) offshore of the State of California. References: Tinker, M. T., Doak, D. F., Estes, J. A., Hatfield, B. B., Staedler, M. M. and Bodkin, J. L. (2006), INCORPORATING DIVERSE DATA AND REALISTIC COMPLEXITY INTO DEMOGRAPHIC ESTIMATION PROCEDURES FOR SEA OTTERS. Ecological Applications, 16: 2293–2312, https://doi.org/10.1890/1051-0761(2006)016[2293:IDDARC]2.0.CO;2 Tinker, M. T. , D. P. Costa , J. A. Estes , and N. Wieringa . 2007. Individual dietary specialization and dive behaviour in the California sea otter: using archival time–depth data to detect alternative foraging strategies. Deep Sea Research II 54: 330–342, https://doi.org/10.1016/j.dsr2.2006.11.012

  19. d

    Korea Customs Service_Import and export performance by province and province...

    • data.go.kr
    xml
    Updated Sep 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Korea Customs Service_Import and export performance by province and province (GW) [Dataset]. https://www.data.go.kr/en/data/15101639/openapi.do
    Explore at:
    xmlAvailable download formats
    Dataset updated
    Sep 10, 2025
    License

    https://data.go.kr/ugs/selectPortalPolicyView.dohttps://data.go.kr/ugs/selectPortalPolicyView.do

    Description

    These are import and export trade statistics by province and city, compiled based on trade data collected through customs clearance procedures and categorized by type. These statistics are compiled by province and city. The HS Code classification system groups similar products together based on their characteristics, using the three 10-digit HS Code numbers. Imports are broadly categorized into three categories: consumer goods, raw materials, and capital goods. Exports are broadly categorized into four categories: food and direct consumer goods, raw materials and fuel, light industrial products, and heavy chemical industrial products. The HS Code is an international commodity classification system that assigns a unique number to every product. The provinces and cities are categorized by 17 cities and provinces, including Seoul, Daegu, Busan, and Gwangju. Import and export trade statistics, by definition, refer to the exchange of goods between a national economy and other countries. They record all goods that are brought into (import) or taken out (export) within a country's economic territory and thereby increase or decrease the country's material resources. Goods simply passing through a country (transit goods) or temporarily imported or exported do not increase or decrease the country's material resources and are therefore not included in trade statistics. The trade statistics area, which serves as the basis for compiling these statistics, is determined by geographical, administrative, and national economic considerations. In most countries, including Korea, the customs territory, which refers to the area to which customs law comprehensively applies, and the trade statistics area are conceptually identical. Therefore, trade statistics encompass the exchange of goods beyond the customs territory. Korea compiles imports based on CIF (cost-on-ship) values, and exports based on FOB (free-on-board) values. Data for the previous month are updated around the 15th of each month to reflect any changes, such as corrections or cancellations, in export and import declarations. The data cycle is monthly. Weight is net weight (kg), and amount is based on import (taxable price in USD) and export (declared price in USD).

  20. Local authority collected waste management - annual results

    • gov.uk
    • s3.amazonaws.com
    Updated Mar 27, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department for Environment, Food & Rural Affairs (2025). Local authority collected waste management - annual results [Dataset]. https://www.gov.uk/government/statistics/local-authority-collected-waste-management-annual-results
    Explore at:
    Dataset updated
    Mar 27, 2025
    Dataset provided by
    GOV.UKhttp://gov.uk/
    Authors
    Department for Environment, Food & Rural Affairs
    Description

    This publication covers annual estimates for waste collected by local authorities in England and the regions. These statistics are based on data submitted by all local authorities in England to WasteDataFlow on the waste they collect and manage.

    The methodology and recycling explainer documents give background and context to this statistical notice, accompanying datasets and the waste and recycling measures they present.

    There is also a further historical note on the definition of local authority collected waste relating to earlier releases.

    The entire raw dataset is available in CSV format and can be found here: https://www.data.gov.uk/dataset/0e0c12d8-24f6-461f-b4bc-f6d6a5bf2de5/wastedataflow-local-authority-waste-management">WasteDataFlow - Local Authority waste management - data.gov.uk

    Historic releases

    https://webarchive.nationalarchives.gov.uk/ukgwa/20250102235615/https://www.gov.uk/government/statistics/local-authority-collected-waste-management-annual-results">2022-2023

    https://webarchive.nationalarchives.gov.uk/ukgwa/20230802024231/https://www.gov.uk/government/statistics/local-authority-collected-waste-management-annual-results-202122">2021- 2022

    https://webarchive.nationalarchives.gov.uk/ukgwa/20220503105415/https://www.gov.uk/government/statistics/local-authority-collected-waste-management-annual-results">2020 - 2021

    https://webarchive.nationalarchives.gov.uk/ukgwa/20210728220801/https://www.gov.uk/government/statistics/local-authority-collected-waste-management-annual-results">2019-2020

    https://webarchive.nationalarchives.gov.uk/ukgwa/20200604042448/https://www.gov.uk/government/statistics/local-authority-collected-waste-management-annual-results">2018 - 2019

    https://webarchive.nationalarchives.gov.uk/ukgwa/20190903035029/https://www.gov.uk/government/statistics/local-authority-collected-waste-management-annual-results">2017 - 2018

    https://webarchive.nationalarchives.gov.uk/ukgwa/20181207030346/https://www.gov.uk/government/statistics/local-authority-collected-waste-management-annual-results">2016 - 2017

    https://webarchive.nationalarchives.gov.uk/ukgwa/20170418015547/https://www.gov.uk/government/statistics/local-authority-collected-waste-management-annual-results">2015 - 2016 This includes the ad hoc release entitled “Provisional 2016/17 local authority data on waste collection and treatment for England (April to June and July to September 2016)”.

    https://webarchive.nationalarchives.gov.uk/ukgwa/20160512131028/https://www.gov.uk/government/statistics/local-authority-collected-waste-management-annual-results">2014 - 2015

    https://webarchive.nationalarchives.gov.uk/ukgwa/20150401112814/https://www.gov.uk/government/statistics/local-authority-collected-waste-management-annual-results">2013 - 2014

    https://webarchive.nationalarchives.gov.uk/ukgwa/20140321171631/https://www.gov.uk/government/publications/local-authority-collected-waste-management-annual-results">2012 - 2013

    Historic datasets

    Historic datasets

    Defra statistics: Waste and Recycling

    Email mailto:WasteStatistics@defra.gov.uk">WasteStatistics@defra.gov.uk

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Central Statistical Organization (CSO) (2017). Household Health Survey 2012-2013, Economic Research Forum (ERF) Harmonization Data - Iraq [Dataset]. https://catalog.ihsn.org/index.php/catalog/6937

Household Health Survey 2012-2013, Economic Research Forum (ERF) Harmonization Data - Iraq

Explore at:
Dataset updated
Jun 26, 2017
Dataset provided by
Economic Research Forum
Kurdistan Regional Statistics Office (KRSO)
Central Statistical Organization (CSO)
Time period covered
2012 - 2013
Area covered
Iraq
Description

Abstract

The harmonized data set on health, created and published by the ERF, is a subset of Iraq Household Socio Economic Survey (IHSES) 2012. It was derived from the household, individual and health modules, collected in the context of the above mentioned survey. The sample was then used to create a harmonized health survey, comparable with the Iraq Household Socio Economic Survey (IHSES) 2007 micro data set.

----> Overview of the Iraq Household Socio Economic Survey (IHSES) 2012:

Iraq is considered a leader in household expenditure and income surveys where the first was conducted in 1946 followed by surveys in 1954 and 1961. After the establishment of Central Statistical Organization, household expenditure and income surveys were carried out every 3-5 years in (1971/ 1972, 1976, 1979, 1984/ 1985, 1988, 1993, 2002 / 2007). Implementing the cooperation between CSO and WB, Central Statistical Organization (CSO) and Kurdistan Region Statistics Office (KRSO) launched fieldwork on IHSES on 1/1/2012. The survey was carried out over a full year covering all governorates including those in Kurdistan Region.

The survey has six main objectives. These objectives are:

  1. Provide data for poverty analysis and measurement and monitor, evaluate and update the implementation Poverty Reduction National Strategy issued in 2009.
  2. Provide comprehensive data system to assess household social and economic conditions and prepare the indicators related to the human development.
  3. Provide data that meet the needs and requirements of national accounts.
  4. Provide detailed indicators on consumption expenditure that serve making decision related to production, consumption, export and import.
  5. Provide detailed indicators on the sources of households and individuals income.
  6. Provide data necessary for formulation of a new consumer price index number.

The raw survey data provided by the Statistical Office were then harmonized by the Economic Research Forum, to create a comparable version with the 2006/2007 Household Socio Economic Survey in Iraq. Harmonization at this stage only included unifying variables' names, labels and some definitions. See: Iraq 2007 & 2012- Variables Mapping & Availability Matrix.pdf provided in the external resources for further information on the mapping of the original variables on the harmonized ones, in addition to more indications on the variables' availability in both survey years and relevant comments.

Geographic coverage

National coverage: Covering a sample of urban, rural and metropolitan areas in all the governorates including those in Kurdistan Region.

Analysis unit

1- Household/family. 2- Individual/person.

Universe

The survey was carried out over a full year covering all governorates including those in Kurdistan Region.

Kind of data

Sample survey data [ssd]

Sampling procedure

----> Design:

Sample size was (25488) household for the whole Iraq, 216 households for each district of 118 districts, 2832 clusters each of which includes 9 households distributed on districts and governorates for rural and urban.

----> Sample frame:

Listing and numbering results of 2009-2010 Population and Housing Survey were adopted in all the governorates including Kurdistan Region as a frame to select households, the sample was selected in two stages: Stage 1: Primary sampling unit (blocks) within each stratum (district) for urban and rural were systematically selected with probability proportional to size to reach 2832 units (cluster). Stage two: 9 households from each primary sampling unit were selected to create a cluster, thus the sample size of total survey clusters was 25488 households distributed on the governorates, 216 households in each district.

----> Sampling Stages:

In each district, the sample was selected in two stages: Stage 1: based on 2010 listing and numbering frame 24 sample points were selected within each stratum through systematic sampling with probability proportional to size, in addition to the implicit breakdown urban and rural and geographic breakdown (sub-district, quarter, street, county, village and block). Stage 2: Using households as secondary sampling units, 9 households were selected from each sample point using systematic equal probability sampling. Sampling frames of each stages can be developed based on 2010 building listing and numbering without updating household lists. In some small districts, random selection processes of primary sampling may lead to select less than 24 units therefore a sampling unit is selected more than once , the selection may reach two cluster or more from the same enumeration unit when it is necessary.

Mode of data collection

Face-to-face [f2f]

Research instrument

----> Preparation:

The questionnaire of 2006 survey was adopted in designing the questionnaire of 2012 survey on which many revisions were made. Two rounds of pre-test were carried out. Revision were made based on the feedback of field work team, World Bank consultants and others, other revisions were made before final version was implemented in a pilot survey in September 2011. After the pilot survey implemented, other revisions were made in based on the challenges and feedbacks emerged during the implementation to implement the final version in the actual survey.

----> Questionnaire Parts:

The questionnaire consists of four parts each with several sections: Part 1: Socio – Economic Data: - Section 1: Household Roster - Section 2: Emigration - Section 3: Food Rations - Section 4: housing - Section 5: education - Section 6: health - Section 7: Physical measurements - Section 8: job seeking and previous job

Part 2: Monthly, Quarterly and Annual Expenditures: - Section 9: Expenditures on Non – Food Commodities and Services (past 30 days). - Section 10 : Expenditures on Non – Food Commodities and Services (past 90 days). - Section 11: Expenditures on Non – Food Commodities and Services (past 12 months). - Section 12: Expenditures on Non-food Frequent Food Stuff and Commodities (7 days). - Section 12, Table 1: Meals Had Within the Residential Unit. - Section 12, table 2: Number of Persons Participate in the Meals within Household Expenditure Other Than its Members.

Part 3: Income and Other Data: - Section 13: Job - Section 14: paid jobs - Section 15: Agriculture, forestry and fishing - Section 16: Household non – agricultural projects - Section 17: Income from ownership and transfers - Section 18: Durable goods - Section 19: Loans, advances and subsidies - Section 20: Shocks and strategy of dealing in the households - Section 21: Time use - Section 22: Justice - Section 23: Satisfaction in life - Section 24: Food consumption during past 7 days

Part 4: Diary of Daily Expenditures: Diary of expenditure is an essential component of this survey. It is left at the household to record all the daily purchases such as expenditures on food and frequent non-food items such as gasoline, newspapers…etc. during 7 days. Two pages were allocated for recording the expenditures of each day, thus the roster will be consists of 14 pages.

Cleaning operations

----> Raw Data:

Data Editing and Processing: To ensure accuracy and consistency, the data were edited at the following stages: 1. Interviewer: Checks all answers on the household questionnaire, confirming that they are clear and correct. 2. Local Supervisor: Checks to make sure that questions has been correctly completed. 3. Statistical analysis: After exporting data files from excel to SPSS, the Statistical Analysis Unit uses program commands to identify irregular or non-logical values in addition to auditing some variables. 4. World Bank consultants in coordination with the CSO data management team: the World Bank technical consultants use additional programs in SPSS and STAT to examine and correct remaining inconsistencies within the data files. The software detects errors by analyzing questionnaire items according to the expected parameter for each variable.

----> Harmonized Data:

  • The SPSS package is used to harmonize the Iraq Household Socio Economic Survey (IHSES) 2007 with Iraq Household Socio Economic Survey (IHSES) 2012.
  • The harmonization process starts with raw data files received from the Statistical Office.
  • A program is generated for each dataset to create harmonized variables.
  • Data is saved on the household and individual level, in SPSS and then converted to STATA, to be disseminated.

Response rate

Iraq Household Socio Economic Survey (IHSES) reached a total of 25488 households. Number of households refused to response was 305, response rate was 98.6%. The highest interview rates were in Ninevah and Muthanna (100%) while the lowest rates were in Sulaimaniya (92%).

Search
Clear search
Close search
Google apps
Main menu