100+ datasets found
  1. d

    User stratification analysis

    • dune.com
    Updated Dec 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ljlapple (2024). User stratification analysis [Dataset]. https://dune.com/discover/content/popular?q=author%3Aljlapple&resource-type=queries
    Explore at:
    Dataset updated
    Dec 9, 2024
    Authors
    ljlapple
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Blockchain data query: User stratification analysis

  2. MOESM20 of tmap: an integrative framework based on topological data analysis...

    • springernature.figshare.com
    • datasetcatalog.nlm.nih.gov
    xlsx
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tianhua Liao; Yuchen Wei; Mingjing Luo; Guo-Ping Zhao; Haokui Zhou (2023). MOESM20 of tmap: an integrative framework based on topological data analysis for population-scale microbiome stratification and association studies [Dataset]. http://doi.org/10.6084/m9.figshare.11446575.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    figshare
    Figsharehttp://figshare.com/
    Authors
    Tianhua Liao; Yuchen Wei; Mingjing Luo; Guo-Ping Zhao; Haokui Zhou
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Additional file 20: Table S1. Comparison of the performances in detecting simulated metadata between envfit, adonis, ANOSIM and tmap.

  3. f

    MOESM24 of tmap: an integrative framework based on topological data analysis...

    • datasetcatalog.nlm.nih.gov
    • springernature.figshare.com
    Updated Dec 24, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Wei, Yuchen; Liao, Tianhua; Luo, Mingjing; Zhao, Guo-Ping; Zhou, Haokui (2019). MOESM24 of tmap: an integrative framework based on topological data analysis for population-scale microbiome stratification and association studies [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000166698
    Explore at:
    Dataset updated
    Dec 24, 2019
    Authors
    Wei, Yuchen; Liao, Tianhua; Luo, Mingjing; Zhao, Guo-Ping; Zhou, Haokui
    Description

    Additional file 24: Table S5. Comparison of the stratification of the AGP microbiomes between tmap and PAM based clustering.

  4. H

    Replication data for "Principal stratification analysis using principal...

    • dataverse.harvard.edu
    • search.dataone.org
    Updated Apr 27, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Peng Ding (2016). Replication data for "Principal stratification analysis using principal scores" by Peng Ding and Jiannan Lu [Dataset]. http://doi.org/10.7910/DVN/L4LWMP
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 27, 2016
    Dataset provided by
    Harvard Dataverse
    Authors
    Peng Ding
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Description

    The paper is available at http://arxiv.org/pdf/1602.01196v1.pdf, and will appear in the Journal of the Royal Statistical Society, Series B.

  5. MOESM21 of tmap: an integrative framework based on topological data analysis...

    • springernature.figshare.com
    • datasetcatalog.nlm.nih.gov
    xlsx
    Updated May 30, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Tianhua Liao; Yuchen Wei; Mingjing Luo; Guo-Ping Zhao; Haokui Zhou (2023). MOESM21 of tmap: an integrative framework based on topological data analysis for population-scale microbiome stratification and association studies [Dataset]. http://doi.org/10.6084/m9.figshare.11446581.v1
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    May 30, 2023
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Tianhua Liao; Yuchen Wei; Mingjing Luo; Guo-Ping Zhao; Haokui Zhou
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Additional file 21: Table S2. Detection of host covariates significantly associated with the FGFP microbiomes using envfit, adonis, ANOSIM and tmap.

  6. f

    MOESM22 of tmap: an integrative framework based on topological data analysis...

    • datasetcatalog.nlm.nih.gov
    Updated Dec 24, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Luo, Mingjing; Zhou, Haokui; Zhao, Guo-Ping; Liao, Tianhua; Wei, Yuchen (2019). MOESM22 of tmap: an integrative framework based on topological data analysis for population-scale microbiome stratification and association studies [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000166590
    Explore at:
    Dataset updated
    Dec 24, 2019
    Authors
    Luo, Mingjing; Zhou, Haokui; Zhao, Guo-Ping; Liao, Tianhua; Wei, Yuchen
    Description

    Additional file 22: Table S3. Co-enrichment subnetworks of pet past 3 months and its co-enriched features of the FGFP microbiomes.

  7. Enterprise Survey 2013 - Djibouti

    • microdata.worldbank.org
    • catalog.ihsn.org
    Updated May 21, 2014
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    World Bank (2014). Enterprise Survey 2013 - Djibouti [Dataset]. https://microdata.worldbank.org/index.php/catalog/1993
    Explore at:
    Dataset updated
    May 21, 2014
    Dataset provided by
    World Bank Grouphttp://www.worldbank.org/
    Authors
    World Bank
    Time period covered
    2013
    Area covered
    Djibouti
    Description

    Abstract

    This survey was conducted in Djibouti between June and September 2013, as part of the Enterprise Survey project, an initiative of the World Bank. The objective of the survey is to obtain feedback from enterprises on the state of the private sector as well as to help in building a panel of enterprise data that will make it possible to track changes in the business environment over time, thus allowing, for example, impact assessments of reforms. Through interviews with firms in the manufacturing and services sectors, the survey assesses the constraints to private sector growth and creates statistically significant business environment indicators that are comparable across countries.

    In Djibouti, data from 266 establishments was analyzed. Stratified random sampling was used to select the surveyed businesses.

    The survey topics include firm characteristics, information about sales and suppliers, competition, infrastructure services, judiciary and law enforcement collaboration, security, government policies, laws and regulations, financing, overall business environment, bribery, capacity utilization, performance and investment activities, and workforce composition.

    In 2011, the innovation module was added to the standard set of Enterprise Surveys questionnaires to examine in detail how introduction of new products and practices influence firms' performance and management.

    Geographic coverage

    Djibouti City

    Analysis unit

    The primary sampling unit of the study is the establishment. An establishment is a physical location where business is carried out and where industrial operations take place or services are provided. A firm may be composed of one or more establishments. For example, a brewery may have several bottling plants and several establishments for distribution. For the purposes of this survey an establishment must make its own financial decisions and have its own financial statements separate from those of the firm. An establishment must also have its own management and control over its payroll.

    Universe

    The whole population, or universe of the study, is the non-agricultural economy. It comprises: all manufacturing sectors according to the group classification of ISIC Revision 3.1: (group D), construction sector (group F), services sector (groups G and H), and transport, storage, and communications sector (group I). Note that this definition excludes the following sectors: financial intermediation (group J), real estate and renting activities (group K, except sub-sector 72, IT, which was added to the population under study), and all public or utilities-sectors.

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    The sample for Djibouti was selected using stratified random sampling. Three levels of stratification were used in this country: industry, establishment size, and region.

    Industry stratification was designed in the way that follows: the universe was stratified into one collective manufacturing industry, and two services industries (retail and other services).

    Size stratification was defined following the standardized definition for the rollout: small (5 to 19 employees), medium (20 to 99 employees), and large (more than 99 employees). For stratification purposes, the number of employees was defined on the basis of reported permanent full-time workers. This seems to be an appropriate definition of the labor force since seasonal/casual/part-time employment is not common practice, apart from the construction and agriculture sectors which are not included in the survey.

    Regional stratification was limited to one location - Djibouti City.

    The sample frame used for the survey in Djibouti was from the Ministry of Justice of Trade.

    The enumerated establishments were then used as the frame for the selection of a sample with the aim of obtaining interviews at 270 establishments with five or more employees. Given the impact that non-eligible units included in the sample universe may have on the results, adjustments may be needed when computing the appropriate weights for individual observations. The percentage of confirmed non-eligible units as a proportion of the total number of sampled establishments contacted for the survey was 13.3% (56 out of 420 contacted establishments).

    The variables a2 (sampling region), a6a (sampling establishment's size), and a4a (sampling sector) contain the establishment's classification into the strata chosen for each country using information from the sample frame. The strata were defined according to the guidelines described above. Variable a4a is coded using ISIC Rev 3.1 codes for the chosen industries for stratification. These codes include most manufacturing industries (15 to 37), retail (52), and (45, 50, 51, 55, 60-64, 72) for other services.

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    The following survey instruments are available: - Manufacturing Questionnaire; - Services Questionnaire.

    All variables are named using, first, the letter of each section and, second, the number of the variable within the section, i.e. a1 denotes section A, question 1. Variable names proceeded by a prefix "MNA" indicate questions specific to the Middle East and North Africa region, therefore, they may not be found in the implementation of the rollout in other countries. All other suffixed variables are global and are present in all economy surveys over the world. All variables are numeric with the exception of those variables with an "x" at the end of their names. The suffix "x" denotes that the variable is alpha-numeric.

    Cleaning operations

    Data entry and quality controls are implemented by the contractor and data is delivered to the World Bank in batches (typically 10%, 50% and 100%). These data deliveries are checked for logical consistency, out of range values, skip patterns, and duplicate entries. Problems are flagged by the World Bank and corrected by the implementing contractor through data checks, callbacks, and revisiting establishments.

    Response rate

    The number of contacted establishments per realized interview was 0.63. This number is the result of two factors: explicit refusals to participate in the survey, as reflected by the rate of rejection (which includes rejections of the screener and the main survey) and the quality of the sample frame, as represented by the presence of ineligible units. The number of rejections per contact was 0.12.

    Item non-response was addressed by two strategies: a- For sensitive questions that may generate negative reactions from the respondent, such as corruption or tax evasion, enumerators were instructed to collect the refusal to respond as a different option from don’t know. b- Establishments with incomplete information were re-contacted in order to complete this information, whenever necessary.

    Survey non-response was addressed by maximizing efforts to contact establishments that were initially selected for interview. Attempts were made to contact the establishment for interview at different times/days of the week before a replacement establishment (with similar strata characteristics) was suggested for interview. Survey non-response did occur but substitutions were made in order to potentially achieve strata-specific goals.

  8. f

    Data from: Diameter Increment Modeling in an Araucaria Forest Fragment Using...

    • scielo.figshare.com
    jpeg
    Updated May 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mailson Roik; Sebastião do Amaral Machado; Afonso Figueiredo Filho; Carlos Roberto Sanquetta; Marcelo Roveda; Thiago Floriani Stepka (2023). Diameter Increment Modeling in an Araucaria Forest Fragment Using Cluster Analysis [Dataset]. http://doi.org/10.6084/m9.figshare.6858227.v1
    Explore at:
    jpegAvailable download formats
    Dataset updated
    May 31, 2023
    Dataset provided by
    SciELO journals
    Authors
    Mailson Roik; Sebastião do Amaral Machado; Afonso Figueiredo Filho; Carlos Roberto Sanquetta; Marcelo Roveda; Thiago Floriani Stepka
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    ABSTRACT The aims of the present study were to test the hypothesis that data stratification by cluster analysis and the use of other variables, in addition to DBH, can improve the precision of the estimates in diametric increment modeling for Mixed Ombrophilous Forest species. The study was carried out in the Irati National Forest. Data from 25 permanent sample plots of 1 ha each were used with all individuals presenting DBH equal to or greater than 10 cm being identified and measured. The increment modeling was performed for the whole forest (non-stratified data), ecological groups and species subgroups (stratified data) defined by cluster analysis. DBH presented a low correlation with the diametric increment and the use of other independent variables had a positive effect on the fitting, reducing the standard error of estimate and increasing the coefficient of determination. The data stratification did not make the models suitable to estimate the diametric increment; however, it provided improvements by reducing the standard error of estimate, suggesting that this technique can be better applied in the search for improvements to diametric modeling in natural forests.

  9. A stratification system for breast cancer based on basoluminal tumor cells...

    • zenodo.org
    application/gzip, bin +1
    Updated Jun 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lasse Meyer; Lasse Meyer (2025). A stratification system for breast cancer based on basoluminal tumor cells and spatial tumor architecture (R data) [Dataset]. http://doi.org/10.5281/zenodo.15396067
    Explore at:
    bin, application/gzip, csvAvailable download formats
    Dataset updated
    Jun 2, 2025
    Dataset provided by
    Zenodohttp://zenodo.org/
    Authors
    Lasse Meyer; Lasse Meyer
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This repository contains all R data objects for reproducing the data analysis in the breast cancer study from Meyer et al., 2025. It also included processed scRNAseq data from 6 breast cancer organoid lines. The code that was used to generate and analyze the data objects is available at https://github.com/BodenmillerGroup/TNBC_publication. Please have a look at the README for further details.

  10. D

    Population Risk Stratification Solutions Market Research Report 2033

    • dataintelo.com
    csv, pdf, pptx
    Updated Sep 30, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dataintelo (2025). Population Risk Stratification Solutions Market Research Report 2033 [Dataset]. https://dataintelo.com/report/population-risk-stratification-solutions-market
    Explore at:
    pdf, csv, pptxAvailable download formats
    Dataset updated
    Sep 30, 2025
    Dataset authored and provided by
    Dataintelo
    License

    https://dataintelo.com/privacy-and-policyhttps://dataintelo.com/privacy-and-policy

    Time period covered
    2024 - 2032
    Area covered
    Global
    Description

    Population Risk Stratification Solutions Market Outlook




    Based on our latest research, the global Population Risk Stratification Solutions market size reached USD 6.2 billion in 2024, with a robust CAGR of 16.7% observed over recent years. The market is anticipated to surge to USD 28.7 billion by 2033, driven by the increasing adoption of advanced analytics and data-driven healthcare management strategies. The primary growth factor is the escalating demand for proactive and preventive healthcare management, especially as healthcare systems worldwide shift towards value-based care and population health initiatives.




    The rapid expansion of the Population Risk Stratification Solutions market is propelled by the rising prevalence of chronic diseases and the growing burden on healthcare infrastructure. With the global population aging and chronic conditions such as diabetes, cardiovascular diseases, and respiratory disorders on the rise, healthcare providers and payers are increasingly leveraging risk stratification solutions to identify high-risk individuals and allocate resources efficiently. This trend is further amplified by the integration of big data analytics, artificial intelligence, and machine learning technologies, which enhance the accuracy and predictive power of risk stratification models. As a result, healthcare organizations are able to intervene earlier, improve patient outcomes, and reduce overall costs, fueling market growth.




    Another significant driver for the Population Risk Stratification Solutions market is the global shift towards value-based care models. Governments and payers are incentivizing healthcare providers to focus on quality outcomes rather than the volume of services delivered. This paradigm shift necessitates the adoption of sophisticated risk stratification tools that can segment patient populations, predict adverse health events, and guide targeted interventions. The ability to stratify risk at a population level enables healthcare systems to allocate resources more effectively, reduce hospital readmissions, and manage chronic diseases proactively. Additionally, regulatory mandates and government initiatives aimed at improving healthcare quality and reducing costs are accelerating the adoption of these solutions across both developed and emerging markets.




    The increasing digitization of healthcare and the proliferation of electronic health records (EHRs) have also played a pivotal role in advancing the Population Risk Stratification Solutions market. The availability of vast amounts of patient data, coupled with advancements in interoperability and data integration, has made it possible to aggregate and analyze information from multiple sources. This comprehensive data ecosystem allows for more accurate risk assessment and personalized care planning. Furthermore, the COVID-19 pandemic has underscored the importance of population health management and the need for robust risk stratification tools to identify vulnerable groups, monitor disease progression, and allocate healthcare resources efficiently. These factors collectively contribute to the sustained growth of the market.




    From a regional perspective, North America continues to dominate the Population Risk Stratification Solutions market, accounting for the largest share in 2024. This leadership is attributed to advanced healthcare infrastructure, high adoption rates of digital health technologies, and a strong focus on value-based care. Europe follows closely, driven by supportive government policies and widespread implementation of population health management programs. The Asia Pacific region is emerging as a high-growth market, fueled by increasing healthcare expenditure, rapid digital transformation, and rising awareness of chronic disease management. Latin America and the Middle East & Africa are also witnessing steady growth, albeit at a slower pace, as healthcare systems in these regions gradually embrace risk stratification solutions to improve outcomes and optimize resource utilization.



    Component Analysis




    The Population Risk Stratification Solutions market is segmented by component into software and services, each playing a critical role in the overall ecosystem. Software solutions form the backbone of risk stratification by providing advanced analytics, data integration, and visualization tools that enable healthcare providers and payers to identify at-risk populations efficiently. These platforms levera

  11. Enterprise Survey 2013 - Georgia

    • microdata.worldbank.org
    • datacatalog.ihsn.org
    • +1more
    Updated Apr 10, 2014
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    World Bank (2014). Enterprise Survey 2013 - Georgia [Dataset]. https://microdata.worldbank.org/index.php/catalog/1930
    Explore at:
    Dataset updated
    Apr 10, 2014
    Dataset provided by
    World Bank Grouphttp://www.worldbank.org/
    European Bank for Reconstruction and Development
    Time period covered
    2012 - 2013
    Area covered
    Georgia
    Description

    Abstract

    This survey was conducted in Georgia between December 2012 and August 2013 as part of the fifth round of the Business Environment and Enterprise Performance Survey (BEEPS V), a joint initiative of the World Bank Group and the European Bank for Reconstruction and Development. The objective of the survey is to obtain feedback from enterprises on the state of the private sector as well as to help in building a panel of enterprise data that will make it possible to track changes in the business environment over time, thus allowing, for example, impact assessments of reforms. Through interviews with firms in the manufacturing and services sectors, the survey assesses the constraints to private sector growth and creates statistically significant business environment indicators that are comparable across countries.

    In Georgia, data from 360 establishments was analyzed. Stratified random sampling was used to select the surveyed businesses.

    The survey topics include firm characteristics, information about sales and suppliers, competition, infrastructure services, judiciary and law enforcement collaboration, security, government policies, laws and regulations, financing, overall business environment, bribery, capacity utilization, performance and investment activities, and workforce composition.

    In 2011, the innovation module was added to the standard set of Enterprise Surveys questionnaires to examine in detail how introduction of new products and practices influence firms' performance and management.

    Geographic coverage

    National

    Analysis unit

    The primary sampling unit of the study is an establishment. The establishment is a physical location where business is carried out and where industrial operations take place or services are provided. A firm may be composed of one or more establishments. For example, a brewery may have several bottling plants and several establishments for distribution. For the purposes of this survey an establishment must make its own financial decisions and have its own financial statements separate from those of the firm. An establishment must also have its own management and control over its payroll.

    Universe

    The whole population, or universe of the study, is the non-agricultural economy. It comprises: all manufacturing sectors according to the group classification of ISIC Revision 3.1: (group D), construction sector (group F), services sector (groups G and H), and transport, storage, and communications sector (group I). Note that this definition excludes the following sectors: financial intermediation (group J), real estate and renting activities (group K, except sub-sector 72, IT, which was added to the population under study), and all public or utilities-sectors.

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    The sample for Georgia ES was selected using stratified random sampling. Three levels of stratification were used in this country: industry, establishment size, and region.

    Industry stratification was designed in the way that follows: the universe was stratified into one manufacturing industry, and two service industries (retail, and other services).

    Size stratification was defined following the standardized definition for the rollout: small (5 to 19 employees), medium (20 to 99 employees), and large (more than 99 employees). For stratification purposes, the number of employees was defined on the basis of reported permanent full-time workers. This seems to be an appropriate definition of the labor force since seasonal/casual/part-time employment is not common practice, apart from the construction and agriculture sectors which are not included in the survey.

    Regional stratification was defined in 6 regions (city and the surrounding business area) throughout Georgia.

    Database from the National Statistical Office of Georgia was used as the frame for the selection of a sample with the aim of obtaining interviews at 360 establishments with five or more employees.

    Given the impact that non-eligible units included in the sample universe may have on the results, adjustments may be needed when computing the appropriate weights for individual observations. The percentage of confirmed non-eligible units as a proportion of the total number of sampled establishments contacted for the survey was 26.4% (341 out of 1,290 establishments).

    The variables a2 (sampling region), a6a (sampling establishment's size), and a4a (sampling sector) contain the establishment's classification into the strata chosen for each country using information from the sample frame. The strata were defined according to the guidelines described above. Variable a4a is coded using ISIC Rev 3.1 codes for the chosen industries for stratification. These codes include most manufacturing industries (15 to 37), retail (52), and (45, 50, 51, 55, 60-64, 72) for other services.

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    Three different versions of the questionnaire were used. The basic questionnaire, the Core Module, includes all common questions asked to all establishments from all sectors. The second expanded variation, the Manufacturing Questionnaire, is built upon the Core Module and adds some specific questions relevant to manufacturing sectors. The third expanded variation, the Retail Questionnaire, is also built upon the Core Module and adds to the core specific questions.

    Cleaning operations

    Data entry and quality controls are implemented by the contractor and data is delivered to the World Bank in batches (typically 10%, 50% and 100%). These data deliveries are checked for logical consistency, out of range values, skip patterns, and duplicate entries. Problems are flagged by the World Bank and corrected by the implementing contractor through data checks, callbacks, and revisiting establishments.

    Response rate

    Survey non-response must be differentiated from item non-response. The former refers to refusals to participate in the survey altogether, while the latter refers to the refusals to answer some specific questions. Enterprise Surveys suffer from both problems and different strategies were used to address these issues.

    Item non-response was addressed by two strategies: a- For sensitive questions that may generate negative reactions from the respondent, such as corruption or tax evasion, enumerators were instructed to collect the refusal to respond as a different option from don’t know. b- Establishments with incomplete information were re-contacted in order to complete this information, whenever necessary.

    Survey non-response was addressed by maximizing efforts to contact establishments that were initially selected for interview. Attempts were made to contact the establishment for interview at different times/days of the week before a replacement establishment (with similar strata characteristics) was suggested for interview. Survey non-response did occur but substitutions were made in order to potentially achieve strata-specific goals.

    The number of contacted establishments per realized interview was 0.28. This number is the result of two factors: explicit refusals to participate in the survey, as reflected by the rate of rejection (which includes rejections of the screener and the main survey) and the quality of the sample frame, as represented by the presence of ineligible units. The number of rejections per contact was 0.19.

  12. Enterprise Survey 2013 - Tajikistan

    • microdata.worldbank.org
    • catalog.ihsn.org
    • +1more
    Updated Aug 30, 2017
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    World Bank (2017). Enterprise Survey 2013 - Tajikistan [Dataset]. https://microdata.worldbank.org/index.php/catalog/2036
    Explore at:
    Dataset updated
    Aug 30, 2017
    Dataset provided by
    World Bank Grouphttp://www.worldbank.org/
    European Bank for Reconstruction and Development
    Time period covered
    2013 - 2014
    Area covered
    Tajikistan
    Description

    Abstract

    This survey was conducted in Tajikistan between February 2013 and April 2014 as part of the fifth round of the Business Environment and Enterprise Performance Survey (BEEPS V), a joint initiative of the World Bank Group and the European Bank for Reconstruction and Development. The objective of the survey is to obtain feedback from enterprises on the state of the private sector as well as to help in building a panel of enterprise data that will make it possible to track changes in the business environment over time, thus allowing, for example, impact assessments of reforms. Through interviews with firms in the manufacturing and services sectors, the survey assesses the constraints to private sector growth and creates statistically significant business environment indicators that are comparable across countries.

    Data from 359 establishments was analyzed. Stratified random sampling was used to select the surveyed businesses.

    The survey topics include firm characteristics, information about sales and suppliers, competition, infrastructure services, judiciary and law enforcement collaboration, security, government policies, laws and regulations, financing, overall business environment, bribery, capacity utilization, performance and investment activities, and workforce composition.

    In 2011, the innovation module was added to the standard set of Enterprise Surveys questionnaires to examine in detail how introduction of new products and practices influence firms' performance and management.

    Geographic coverage

    National

    Analysis unit

    The primary sampling unit of the study is an establishment. An establishment is a physical location where business is carried out and where industrial operations take place or services are provided. A firm may be composed of one or more establishments. For example, a brewery may have several bottling plants and several establishments for distribution. For the purposes of this survey an establishment must make its own financial decisions and have its own financial statements separate from those of the firm. An establishment must also have its own management and control over its payroll.

    Universe

    The whole population, or universe of the study, is the non-agricultural economy. It comprises: all manufacturing sectors according to the group classification of ISIC Revision 3.1: (group D), construction sector (group F), services sector (groups G and H), and transport, storage, and communications sector (group I). Note that this definition excludes the following sectors: financial intermediation (group J), real estate and renting activities (group K, except sub-sector 72, IT, which was added to the population under study), and all public or utilities-sectors.

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    The sample was selected using stratified random sampling. Three levels of stratification were used in this country: industry, establishment size, and region.

    Industry stratification was designed in the way that follows: the universe was stratified into one manufacturing industry, and two service industries (retail, and other services).

    Size stratification was defined following the standardized definition for the rollout: small (5 to 19 employees), medium (20 to 99 employees), and large (more than 99 employees). For stratification purposes, the number of employees was defined on the basis of reported permanent full-time workers. This seems to be an appropriate definition of the labor force since seasonal/casual/part-time employment is not common practice, apart from the construction and agriculture sectors which are not included in the survey.

    Regional stratification was defined in 4 regions (city and the surrounding business area) throughout Tajikistan.

    The database from the Statistical Committee of Tajikistan was used as the frame for the selection of a sample with the aim of obtaining interviews at 360 establishments with five or more employees.

    Given the impact that non-eligible units included in the sample universe may have on the results, adjustments may be needed when computing the appropriate weights for individual observations. The percentage of confirmed non-eligible units as a proportion of the total number of sampled establishments contacted for the survey was 5.6% (89 out of 1,579 establishments).

    In the dataset, the variables a2 (sampling region), a6a (sampling establishment's size), and a4a (sampling sector) contain the establishment's classification into the strata chosen for each country using information from the sample frame. Variable a4a is coded using ISIC Rev 3.1 codes for the chosen industries for stratification. These codes include most manufacturing industries (15 to 37), retail (52), and (45, 50, 51, 55, 60-64, 72) for other services.

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    Three different versions of the questionnaire were used. The basic questionnaire, the Core Module, includes all common questions asked to all establishments from all sectors. The second expanded variation, the Manufacturing Questionnaire, is built upon the Core Module and adds some specific questions relevant to manufacturing sectors. The third expanded variation, the Retail Questionnaire, is also built upon the Core Module and adds to the core specific questions.

    Cleaning operations

    Data entry and quality controls are implemented by the contractor and data is delivered to the World Bank in batches (typically 10%, 50% and 100%). These data deliveries are checked for logical consistency, out of range values, skip patterns, and duplicate entries. Problems are flagged by the World Bank and corrected by the implementing contractor through data checks, callbacks, and revisiting establishments.

    Response rate

    Survey non-response must be differentiated from item non-response. The former refers to refusals to participate in the survey altogether, while the latter refers to the refusals to answer some specific questions. Enterprise Surveys suffer from both problems and different strategies were used to address these issues.

    Item non-response was addressed by two strategies: a- For sensitive questions that may generate negative reactions from the respondent, such as corruption or tax evasion, enumerators were instructed to collect the refusal to respond as a different option from don’t know. b- Establishments with incomplete information were re-contacted in order to complete this information, whenever necessary.

    Survey non-response was addressed by maximizing efforts to contact establishments that were initially selected for interview. Attempts were made to contact the establishment for interview at different times/days of the week before a replacement establishment (with similar strata characteristics) was suggested for interview. Survey non-response did occur but substitutions were made in order to potentially achieve strata-specific goals.

    The number of contacted establishments per realized interview was 0.22. This number is the result of two factors: explicit refusals to participate in the survey, as reflected by the rate of rejection (which includes rejections of the screener and the main survey) and the quality of the sample frame, as represented by the presence of ineligible units. The number of rejections per contact was 0.08.

  13. d

    A visual summary and annotation of the stratification in core MD002361,...

    • data.gov.au
    • researchdata.edu.au
    html, wms
    Updated Jun 24, 2017
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Australian National University (2017). A visual summary and annotation of the stratification in core MD002361, sampled from the shelf edge off Western Australia [Dataset]. https://data.gov.au/data/dataset/a-visual-summary-and-annotation-of-the-stratification-in-core-md002361-sampled-from-the-shelf-e
    Explore at:
    html, wmsAvailable download formats
    Dataset updated
    Jun 24, 2017
    Dataset provided by
    Australian National University
    License

    Attribution 2.5 (CC BY 2.5)https://creativecommons.org/licenses/by/2.5/
    License information was derived automatically

    Area covered
    Western Australia, Australia
    Description

    The core description highlights the stark contrast been glacial and interglacial periods. 1-1.5m bands of red/brown clay indicate interglacial periods, which then alternate with 1-1.5m bands of light grey foraminifera sand and dark grey clays deposited during the glacial periods.

  14. c

    Research data supporting: "A systems based qualitative analysis exploring...

    • repository.cam.ac.uk
    pdf
    Updated Feb 12, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Moorthie, Sowmiya; Taylor, Lily; Dennison, Rebecca; Usher-Smith, Juliet (2025). Research data supporting: "A systems based qualitative analysis exploring the potential to implement risk stratified bowel cancer screening in England" [Dataset]. http://doi.org/10.17863/CAM.115817
    Explore at:
    pdf(229379 bytes), pdf(145111 bytes), pdf(173757 bytes), pdf(152490 bytes)Available download formats
    Dataset updated
    Feb 12, 2025
    Dataset provided by
    University of Cambridge
    Apollo
    Authors
    Moorthie, Sowmiya; Taylor, Lily; Dennison, Rebecca; Usher-Smith, Juliet
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This dataset relates to the STRAT-BCS systems analysis study (A systems based qualitative analysis exploring the potential to implement risk stratified bowel cancer screening in England).

    The dataset contains the study protocol, consent form, participant information sheet and topic guide.

  15. H

    Data from: Socio-spatial stratification of housing tenure trajectories in...

    • dataverse.harvard.edu
    • dataone.org
    Updated Feb 28, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Juta Kawalerowicz; Ida Borg; Eva K. Andersson (2022). Socio-spatial stratification of housing tenure trajectories in Sweden – A longitudinal cohort study [Dataset]. http://doi.org/10.7910/DVN/TPU6TG
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 28, 2022
    Dataset provided by
    Harvard Dataverse
    Authors
    Juta Kawalerowicz; Ida Borg; Eva K. Andersson
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    Sweden
    Description

    This is the R script used for the analysis for Socio-spatial stratification of housing tenure trajectories in Sweden – A longitudinal cohort study. Note that we cannot share the micro-data used for this analysis because it belongs to the SCB, Statistics Sweden. To arrange access to Swedish micro-data go to: https://www.scb.se/en/services/ordering-data-and-statistics/ordering-microdata/

  16. H

    Physical Properties of Lakes: Exploratory Data Visualization

    • hydroshare.org
    • search.dataone.org
    zip
    Updated Jan 29, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gabriela Garcia; Kateri Salk (2021). Physical Properties of Lakes: Exploratory Data Visualization [Dataset]. https://www.hydroshare.org/resource/e22442bc4e4940609003b43747b366e0
    Explore at:
    zip(2.9 MB)Available download formats
    Dataset updated
    Jan 29, 2021
    Dataset provided by
    HydroShare
    Authors
    Gabriela Garcia; Kateri Salk
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Time period covered
    May 27, 1984 - Aug 17, 2016
    Area covered
    Description

    Exploratory Data Visualization for the Physical Properties of Lakes

    This lesson was adapted from educational material written by Dr. Kateri Salk for her Fall 2019 Hydrologic Data Analysis course at Duke University. This is the second part of a two-part exercise focusing on the physical properties of lakes.

    Introduction

    The field of limnology, the study of inland waters, uses a unique graph format to display relationships of variables by depth in a lake (the field of oceanography uses the same convention). Depth is placed on the y-axis in reverse order and the other variable(s) are placed on the x-axis. In this manner, the graph appears as if a cross section were taken from that point in the lake, with the surface at the top of the graph. This lesson introduces physical properties of lakes, namely stratification, and its visualization using the package ggplot2.

    Learning Objectives

    After successfully completing this notebook, you will be able to:

    1. Investigate the concepts of lake stratification and mixing by analyzing monitoring data
    2. Apply data analytics skills to applied questions about physical properties of lakes
    3. Communicate findings with peers through oral, visual, and written modes
  17. a

    Examining the Complex Dynamics Influencing Persistent Acute Malnutrition in...

    • microdataportal.aphrc.org
    Updated Jun 14, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dr. Estelle M. Sidze (2022). Examining the Complex Dynamics Influencing Persistent Acute Malnutrition in Turkana and Samburu Counties – A Longitudinal Mixed Methods Study to Support Community Driven Activity Design, NAWIRI - KENYA [Dataset]. https://microdataportal.aphrc.org/index.php/catalog/129
    Explore at:
    Dataset updated
    Jun 14, 2022
    Dataset provided by
    Dr. Estelle M. Sidze
    Dr. Faith Thuita
    Time period covered
    2021
    Area covered
    KENYA
    Description

    Abstract

    Background: Acute malnutrition in infants and children less than 5 years is persistent in the arid and semi-arid lands (ASALs) of East Africa and the Sahel region despite years of investment. In the ASALs of Kenya, the situation is exacerbated by deep-rooted poverty and unequal access to basic services, sustained community conflicts, migration, poor seasonal rainfall/drought and other shocks. Nutrition specific and nutrition sensitive national and county level programs have either not been developed or not implemented effectively.

    Objectives: To understand and map immediate, underlying, basic and systemic drivers of acute malnutrition for the development of overarching as well as micro-solutions for the sustainable reduction of persistent acute malnutrition (PAM) and inform pilot studies and Phase 2 (second phase of NAWIRI project implementation) activities in Turkana and Samburu Counties.

    Methods: This study will be a longitudinal mixed-methods observational cohort study of children less than 3 years and their mothers and/or caregivers in Samburu and Turkana Counties. Both quantitative and qualitative methods will be utilized in the data collection processes. Data collection is scheduled to begin in January 2021. Data analysis and learning and adapting will be ongoing so that results can inform pilots, theory of change (ToC) review and Phase 2 activities throughout the study.

    Study outcomes: To develop new interventions, and to adapt and contextualize existing interventions to prevent global acute malnutrition (GAM); strengthen social and behavior change (SBC) strategies around maternal, infant and young child nutrition (MIYCN), water and sanitation (WASH), community health systems, gender dynamics, livelihoods and resilience, and to inform improvements of the current nutrition surveillance system.

    Geographic coverage

    Turkana and Samburu Counties.

    Analysis unit

    Children less than 3 years and their mothers and/or caregivers

    Universe

    The survey covered household with children less than 3 years and their mothers and/or caregivers in Samburu and Turkana Counties

    Sampling procedure

    SAMBURU

    The study sample was population-based, with stratification by sub-counties grouped into three survey zones (Central, North, and East) reflecting administrative sub-counties used in the Samburu Standardized Monitoring and Assessment of Relief and Transitions (SMART) Surveys. Stratification by livelihood zones was done through post-stratification analysis. We analyzed the data by livelihood zone because it was hypothesized that undernutrition might be more related to a household's livelihood than to its physical location.

    As noted, the study used mixed-method techniques with quantitative and qualitative data collection. The quantitative component included a household survey and a caregiver survey and covered 699 households. The qualitative data collection activities yielded rich and in-depth insights that will be triangulated with the quantitative survey findings in a companion report. Therefore, this report focuses only on findings from the quantitative survey component. Results are reported for global acute malnutrition (GAM), stunting, and underweight. However, the discussion focuses only on GAM because the purpose of the Nawiri program is to reduce persistent acute malnutrition.

    The baseline data collection was carried out in June and July 2021 following a full household listing operation in the county to establish the sampling frame of households with children under 3 years. Subsequent data collection waves are planned for November-December 2021 (Wave 2), March-April 2022 (Wave 3), September-October 2022 (Wave 4), March-April 2023 (Wave 5), and August-September 2023 (Wave 6).

    TURKANA

    The study sample was population-based, with stratification by sub-counties grouped into four survey zones (Central, North, West, and South) reflecting administrative sub-counties used in the Turkana Standardized Monitoring and Assessment of Relief and Transitions (SMART) Surveys. Stratification by livelihood zones was done through post-stratification analysis. We analyzed the data by livelihood zone because it was hypothesized that undernutrition might be more related to a household's livelihood than to its physical location.
    As noted, the study used mixed-method techniques with quantitative and qualitative data collection. The quantitative component included a household survey and a caregiver survey and covered 1,211 households. The qualitative data collection activities yielded rich and in-depth insights that will be triangulated with the quantitative survey findings in a companion report. Therefore, this report focuses only on findings from the quantitative survey component. Results are reported for global acute malnutrition (GAM), stunting, and underweight. However, the discussion focuses only on GAM because the purpose of the Nawiri program is to reduce persistent acute malnutrition.

    The baseline data collection was carried out in May and June 2021 following a full household listing operation in the county to establish the sampling frame of households with children under 3 years. Anthropometric data were collected from all under-5 children in the sampled households. Subsequent data collection waves are planned for October-November 2021 (Wave 2), March-April 2022 (Wave 3), September-October 2022 (Wave 4), March-April 2023 (Wave 5), and August-September 2023 (Wave 6).

    Sampling deviation

    na

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    HOUSEHOLD QUESTIONNAIRE:background, informed consent, household schedule/roster, household demographics, household characteristics, socio-economic characteristics (socio-economic characteristics, POVERTY PROBABILITY INDEX (PP1), HOUSEHOLD WEALTH RANKING (PERCEPTION)), food consumption, water, hygiene and sanitation (wash) (water access, availability and seasonality, household water insecurity experiences (hwise) scale, hygiene and sanitation), household shocks experienced, social safety nets and economic safety guards.

    WOMEN/CAREGIVER QUESTIONNAIRE:background, informed consent, mother's/caregivers information, births / pregnancy history, pregnancy and antenatal care, family planning, infant and young child feeding practices, supplementation and consumption of iron rich or iron fortified foods, maternal knowledge and attitude, on infant and young child feeding practices, caregiving practices, child feeding utensils hygiene, food safety, hygiene, and sanitation practices, child immunization, health and health seeking practices, acute malnutrition screening (community health volunteers), womens minimum dietary diversity, food insecurity experience scale (hfies), gender, women empowerment, violence and community conflict, psychological wellbeing, anthropometric measurements

    Cleaning operations

    Data quality monitoring processes and checks were implemented throughout the data collection process, during the time of developing the data collection tools (through built-in quality control in the tablet-based platform), during training of fieldworkers, in real time during data collection (routine monitoring by the research team and periodic cross-checks against the protocols), and during the data cleaning process. During fieldwork, data quality was enhanced through regular spot checks and sit-ins by supervisors to verify the authenticity of data collected. Data were then reviewed and certified by the field coordinator before they were transferred to the server.

    The quantitative data were collected using SurveyCTO, a survey platform for electronic data collection that has in-built skips and quality checks. Using this software increased efficiency and reduced the time needed for cleaning the data. In addition, the platform supported offline data capturing for regions with slow or no internet connectivity and data transmission when the internet became available. Fieldwork was conducted by trained fieldworkers using digital tablets with the questionnaire loaded in SurveyCTO. The questionnaire included the following modules: (1) identification and tracking, (2) demographics and household composition, (3) anthropometry of children <5 years and mothers, (4) socioeconomics, (5) household food security, (6) WASH, (7) health-seeking behavior, (8) MIYCN, (9) shock experience/exposure, and (10) shock preparedness and response. Data were uploaded from the tablets onto a secure African Population and Health Research Center (APHRC) server after each day of data collection. Data were synchronized automatically to a server when the tablet was in a location with network coverage. The uploaded data were then checked for quality daily by a data manager and a team dedicated to coordinate field procedures and at the APHRC head office in Nairobi.

    Response rate

    na

  18. s

    Large-scale and multi-omics data analysis for supporting precision medicine...

    • eprints.soton.ac.uk
    Updated Jun 10, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Zhou, Yilu; Wang, Yihua; Ewing, Robert; Davies, Donna (2023). Large-scale and multi-omics data analysis for supporting precision medicine of human disease [Dataset]. http://doi.org/10.5258/SOTON/D2586
    Explore at:
    Dataset updated
    Jun 10, 2023
    Dataset provided by
    University of Southampton
    Authors
    Zhou, Yilu; Wang, Yihua; Ewing, Robert; Davies, Donna
    Description

    Data supporting for thesis titled “Large-scale data analysis and integration to advance precision prognosis, therapy stratification and understanding of human disease”

  19. Enterprise Survey 2009-2019, Panel Data - Slovenia

    • catalog.ihsn.org
    • microdata.worldbank.org
    Updated Jan 19, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    World Bank Group (WBG) (2021). Enterprise Survey 2009-2019, Panel Data - Slovenia [Dataset]. https://catalog.ihsn.org/catalog/9454
    Explore at:
    Dataset updated
    Jan 19, 2021
    Dataset provided by
    World Bank Grouphttp://www.worldbank.org/
    European Bank for Reconstruction and Development (EBRD)
    European Investment Bank (EIB)
    Time period covered
    2008 - 2019
    Area covered
    Slovenia
    Description

    Abstract

    The documentation covers Enterprise Survey panel datasets that were collected in Slovenia in 2009, 2013 and 2019.

    The Slovenia ES 2009 was conducted between 2008 and 2009. The Slovenia ES 2013 was conducted between March 2013 and September 2013. Finally, the Slovenia ES 2019 was conducted between December 2018 and November 2019. The objective of the Enterprise Survey is to gain an understanding of what firms experience in the private sector.

    As part of its strategic goal of building a climate for investment, job creation, and sustainable growth, the World Bank has promoted improving the business environment as a key strategy for development, which has led to a systematic effort in collecting enterprise data across countries. The Enterprise Surveys (ES) are an ongoing World Bank project in collecting both objective data based on firms' experiences and enterprises' perception of the environment in which they operate.

    Geographic coverage

    National

    Analysis unit

    The primary sampling unit of the study is the establishment. An establishment is a physical location where business is carried out and where industrial operations take place or services are provided. A firm may be composed of one or more establishments. For example, a brewery may have several bottling plants and several establishments for distribution. For the purposes of this survey an establishment must take its own financial decisions and have its own financial statements separate from those of the firm. An establishment must also have its own management and control over its payroll.

    Universe

    As it is standard for the ES, the Slovenia ES was based on the following size stratification: small (5 to 19 employees), medium (20 to 99 employees), and large (100 or more employees).

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    The sample for Slovenia ES 2009, 2013, 2019 were selected using stratified random sampling, following the methodology explained in the Sampling Manual for Slovenia 2009 ES and for Slovenia 2013 ES, and in the Sampling Note for 2019 Slovenia ES.

    Three levels of stratification were used in this country: industry, establishment size, and oblast (region). The original sample designs with specific information of the industries and regions chosen are included in the attached Excel file (Sampling Report.xls.) for Slovenia 2009 ES. For Slovenia 2013 and 2019 ES, specific information of the industries and regions chosen is described in the "The Slovenia 2013 Enterprise Surveys Data Set" and "The Slovenia 2019 Enterprise Surveys Data Set" reports respectively, Appendix E.

    For the Slovenia 2009 ES, industry stratification was designed in the way that follows: the universe was stratified into manufacturing industries, services industries, and one residual (core) sector as defined in the sampling manual. Each industry had a target of 90 interviews. For the manufacturing industries sample sizes were inflated by about 17% to account for potential non-response cases when requesting sensitive financial data and also because of likely attrition in future surveys that would affect the construction of a panel. For the other industries (residuals) sample sizes were inflated by about 12% to account for under sampling in firms in service industries.

    For Slovenia 2013 ES, industry stratification was designed in the way that follows: the universe was stratified into one manufacturing industry, and two service industries (retail, and other services).

    Finally, for Slovenia 2019 ES, three levels of stratification were used in this country: industry, establishment size, and region. The original sample design with specific information of the industries and regions chosen is described in "The Slovenia 2019 Enterprise Surveys Data Set" report, Appendix C. Industry stratification was done as follows: Manufacturing – combining all the relevant activities (ISIC Rev. 4.0 codes 10-33), Retail (ISIC 47), and Other Services (ISIC 41-43, 45, 46, 49-53, 55, 56, 58, 61, 62, 79, 95).

    For Slovenia 2009 and 2013 ES, size stratification was defined following the standardized definition for the rollout: small (5 to 19 employees), medium (20 to 99 employees), and large (more than 99 employees). For stratification purposes, the number of employees was defined on the basis of reported permanent full-time workers. This seems to be an appropriate definition of the labor force since seasonal/casual/part-time employment is not a common practice, except in the sectors of construction and agriculture.

    For Slovenia 2009 ES, regional stratification was defined in 2 regions. These regions are Vzhodna Slovenija and Zahodna Slovenija. The Slovenia sample contains panel data. The wave 1 panel “Investment Climate Private Enterprise Survey implemented in Slovenia” consisted of 223 establishments interviewed in 2005. A total of 57 establishments have been re-interviewed in the 2008 Business Environment and Enterprise Performance Survey.

    For Slovenia 2013 ES, regional stratification was defined in 2 regions (city and the surrounding business area) throughout Slovenia.

    Finally, for Slovenia 2019 ES, regional stratification was done across two regions: Eastern Slovenia (NUTS code SI03) and Western Slovenia (SI04).

    Mode of data collection

    Computer Assisted Personal Interview [capi]

    Research instrument

    Questionnaires have common questions (core module) and respectfully additional manufacturing- and services-specific questions. The eligible manufacturing industries have been surveyed using the Manufacturing questionnaire (includes the core module, plus manufacturing specific questions). Retail firms have been interviewed using the Services questionnaire (includes the core module plus retail specific questions) and the residual eligible services have been covered using the Services questionnaire (includes the core module). Each variation of the questionnaire is identified by the index variable, a0.

    Response rate

    Survey non-response must be differentiated from item non-response. The former refers to refusals to participate in the survey altogether whereas the latter refers to the refusals to answer some specific questions. Enterprise Surveys suffer from both problems and different strategies were used to address these issues.

    Item non-response was addressed by two strategies: a- For sensitive questions that may generate negative reactions from the respondent, such as corruption or tax evasion, enumerators were instructed to collect the refusal to respond as (-8). b- Establishments with incomplete information were re-contacted in order to complete this information, whenever necessary. However, there were clear cases of low response.

    For 2009 and 2013 Slovenia ES, the survey non-response was addressed by maximizing efforts to contact establishments that were initially selected for interview. Up to 4 attempts were made to contact the establishment for interview at different times/days of the week before a replacement establishment (with similar strata characteristics) was suggested for interview. Survey non-response did occur but substitutions were made in order to potentially achieve strata-specific goals. Further research is needed on survey non-response in the Enterprise Surveys regarding potential introduction of bias.

    For 2009, the number of contacted establishments per realized interview was 6.18. This number is the result of two factors: explicit refusals to participate in the survey, as reflected by the rate of rejection (which includes rejections of the screener and the main survey) and the quality of the sample frame, as represented by the presence of ineligible units. The relatively low ratio of contacted establishments per realized interview (6.18) suggests that the main source of error in estimates in the Slovenia may be selection bias and not frame inaccuracy.

    For 2013, the number of realized interviews per contacted establishment was 25%. This number is the result of two factors: explicit refusals to participate in the survey, as reflected by the rate of rejection (which includes rejections of the screener and the main survey) and the quality of the sample frame, as represented by the presence of ineligible units. The number of rejections per contact was 44%.

    Finally, for 2019, the number of interviews per contacted establishments was 9.7%. This number is the result of two factors: explicit refusals to participate in the survey, as reflected by the rate of rejection (which includes rejections of the screener and the main survey) and the quality of the sample frame, as represented by the presence of ineligible units. The share of rejections per contact was 75.2%.

  20. Enterprise Survey 2013 - Moldova

    • catalog.ihsn.org
    • datacatalog.ihsn.org
    • +1more
    Updated Mar 29, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    World Bank (2019). Enterprise Survey 2013 - Moldova [Dataset]. https://catalog.ihsn.org/catalog/4534
    Explore at:
    Dataset updated
    Mar 29, 2019
    Dataset provided by
    World Bank Grouphttp://www.worldbank.org/
    European Bank for Reconstruction and Development
    Time period covered
    2012 - 2013
    Area covered
    Moldova
    Description

    Abstract

    This research was conducted in Moldova between November 2012 and December 2013, as part of the fifth round of the Business Environment and Enterprise Performance Survey. The objective of the study is to obtain feedback from enterprises in client countries on the state of the private sector. The research is also used to build a panel of enterprise data that will make it possible to track changes in the business environment over time, thus allowing, for example, impact assessments of reforms. Through face-to-face interviews with firms in the manufacturing and services sectors, the survey assesses the constraints to private sector growth and creates statistically significant business environment indicators that are comparable across countries.

    In Moldova, data from 360 establishments was analyzed. Stratified random sampling was used to select the surveyed businesses.

    The survey topics include firm characteristics, information about sales and suppliers, competition, infrastructure services, judiciary and law enforcement collaboration, security, government policies, laws and regulations, financing, overall business environment, bribery, capacity utilization, performance and investment activities, and workforce composition.

    In 2011, the innovation module was added to the standard set of Enterprise Surveys questionnaires to examine in detail how introduction of new products and practices influence firms' performance and management.

    Geographic coverage

    National

    Analysis unit

    The primary sampling unit of the study is the establishment. An establishment is a physical location where business is carried out and where industrial operations take place or services are provided. A firm may be composed of one or more establishments. For example, a brewery may have several bottling plants and several establishments for distribution. For the purposes of this survey an establishment must make its own financial decisions and have its own financial statements separate from those of the firm. An establishment must also have its own management and control over its payroll.

    Universe

    The manufacturing and services sectors are the primary business sectors of interest. This corresponds to firms classified with International Standard Industrial Classification of All Economic Activities (ISIC) codes 15-37, 45, 50-52, 55, 60-64, and 72 (ISIC Rev.3.1). Formal (registered) companies with five or more employees are targeted for interview. Services firms include construction, retail, wholesale, hotels, restaurants, transport, storage, communications, and IT. Firms with 100% government/state ownership are not eligible to participate in Enterprise Surveys.

    Kind of data

    Sample survey data [ssd]

    Sampling procedure

    The sample was selected using stratified random sampling technique. Three levels of stratification were used: industry, establishment size, and region.

    Industry was stratified into one manufacturing and two service sectors (retail, and other services).

    Size stratification was defined following the standardized definition for the roll-out: small (5 to 19 employees), medium (20 to 99 employees), and large (more than 99 employees). For stratification purposes, the number of employees was defined on the basis of reported permanent full-time workers. This seems to be an appropriate definition of the labor force since seasonal/casual/part-time employment is not a common practice, except in the sectors of construction and agriculture.

    Regional stratification was defined in 4 regions (city and the surrounding business area) throughout Moldova.

    Given the stratified design, sample frames containing a complete and updated list of establishments for the selected regions were required. Great efforts were made to obtain the best source for these listings. However, the quality of the sample frames was not optimal and, therefore, some adjustments were needed to correct for the presence of ineligible units. These adjustments are reflected in the weights computation.

    The sample frame used for the survey in Moldova was from Enterprise Business Survey. The enumerated establishments were then used as the frame for the selection of a sample with the aim of obtaining interviews at 360 establishments with five or more employees.

    Given the impact that non-eligible units included in the sample universe may have on the results, adjustments may be needed when computing the appropriate weights for individual observations. The percentage of confirmed non-eligible units as a proportion of the total number of sampled establishments contacted for the survey was 10.4% (63 out of 606 establishments).

    In the dataset, the variables a2 (sampling region), a6a (sampling establishment's size), and a4a (sampling sector) contain the establishment's classification into the strata chosen for each country using information from the sample frame. Variable a4a coded using ISIC Rev 3.1 codes for the chosen industries for stratification. These codes include most manufacturing industries (15 to 37), retail (52), and (45, 50, 51, 55, 60-64, 72) for other services.

    Mode of data collection

    Face-to-face [f2f]

    Research instrument

    The structure of the data base reflects the fact that three different versions of the questionnaire were used. The basic questionnaire, the Core Module, includes all common questions asked to all establishments from all sectors. The second expanded variation, the Manufacturing Questionnaire, is built upon the Core Module and adds some specific questions relevant to manufacturing sectors. The third expanded variation, the Retail Questionnaire, is also built upon the Core Module and adds to the core specific questions relevant to retail firms. Each variation of the questionnaire is identified by the index variable, a0.

    Cleaning operations

    Data entry and quality controls are implemented by the contractor and data is delivered to the World Bank in batches (typically 10%, 50% and 100%). These data deliveries are checked for logical consistency, out of range values, skip patterns, and duplicate entries. Problems are flagged by the World Bank and corrected by the implementing contractor through data checks, callbacks, and revisiting establishments.

    Response rate

    Survey non-response must be differentiated from item non-response. The former refers to refusals to participate in the survey altogether, while the latter refers to the refusals to answer some specific questions. Enterprise Surveys suffer from both problems and different strategies were used to address these issues.

    Item non-response was addressed by two strategies: a- For sensitive questions that may generate negative reactions from the respondent, such as corruption or tax evasion, enumerators were instructed to collect the refusal to respond as a different option from don't know. b- Establishments with incomplete information were re-contacted in order to complete this information, whenever necessary.

    Survey non-response was addressed by maximizing efforts to contact establishments that were initially selected for interview. Attempts were made to contact the establishment for interview at different times/days of the week before a replacement establishment (with similar strata characteristics) was suggested for interview. Survey non-response did occur but substitutions were made in order to potentially achieve strata-specific goals.

    The number of realized interviews per contacted establishments was 0.59. This number is the result of two factors: explicit refusals to participate in the survey, as reflected by the rate of rejection (which includes rejections of the screener and the main survey) and the quality of the sample frame, as represented by the presence of ineligible units. The number of rejections per contact was 0.22.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
ljlapple (2024). User stratification analysis [Dataset]. https://dune.com/discover/content/popular?q=author%3Aljlapple&resource-type=queries

User stratification analysis

Explore at:
Dataset updated
Dec 9, 2024
Authors
ljlapple
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Blockchain data query: User stratification analysis

Search
Clear search
Close search
Google apps
Main menu