Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Public health-related decision-making on policies aimed at controlling the COVID-19 pandemic outbreak depends on complex epidemiological models that are compelled to be robust and use all relevant available data. This data article provides a new combined worldwide COVID-19 dataset obtained from official data sources with improved systematic measurement errors and a dedicated dashboard for online data visualization and summary. The dataset adds new measures and attributes to the normal attributes of official data sources, such as daily mortality, and fatality rates. We used comparative statistical analysis to evaluate the measurement errors of COVID-19 official data collections from the Chinese Center for Disease Control and Prevention (Chinese CDC), World Health Organization (WHO) and European Centre for Disease Prevention and Control (ECDC). The data is collected by using text mining techniques and reviewing pdf reports, metadata, and reference data. The combined dataset includes complete spatial data such as countries area, international number of countries, Alpha-2 code, Alpha-3 code, latitude, longitude, and some additional attributes such as population. The improved dataset benefits from major corrections on the referenced data sets and official reports such as adjustments in the reporting dates, which suffered from a one to two days lag, removing negative values, detecting unreasonable changes in historical data in new reports and corrections on systematic measurement errors, which have been increasing as the pandemic outbreak spreads and more countries contribute data for the official repositories. Additionally, the root mean square error of attributes in the paired comparison of datasets was used to identify the main data problems. The data for China is presented separately and in more detail, and it has been extracted from the attached reports available on the main page of the CCDC website. This dataset is a comprehensive and reliable source of worldwide COVID-19 data that can be used in epidemiological models assessing the magnitude and timeline for confirmed cases, long-term predictions of deaths or hospital utilization, the effects of quarantine, stay-at-home orders and other social distancing measures, the pandemic’s turning point or in economic and social impact analysis, helping to inform national and local authorities on how to implement an adaptive response approach to re-opening the economy, re-open schools, alleviate business and social distancing restrictions, design economic programs or allow sports events to resume.
Facebook
TwitterThe Best Management Practices Statistical Estimator (BMPSE) version 1.2.0 was developed by the U.S. Geological Survey (USGS), in cooperation with the Federal Highway Administration (FHWA) Office of Project Delivery and Environmental Review to provide planning-level information about the performance of structural best management practices for decision makers, planners, and highway engineers to assess and mitigate possible adverse effects of highway and urban runoff on the Nation's receiving waters (Granato 2013, 2014; Granato and others, 2021). The BMPSE was assembled by using a Microsoft Access® database application to facilitate calculation of BMP performance statistics. Granato (2014) developed quantitative methods to estimate values of the trapezoidal-distribution statistics, correlation coefficients, and the minimum irreducible concentration (MIC) from available data. Granato (2014) developed the BMPSE to hold and process data from the International Stormwater Best Management Practices Database (BMPDB, www.bmpdatabase.org). Version 1.0 of the BMPSE contained a subset of the data from the 2012 version of the BMPDB; the current version of the BMPSE (1.2.0) contains a subset of the data from the December 2019 version of the BMPDB. Selected data from the BMPDB were screened for import into the BMPSE in consultation with Jane Clary, the data manager for the BMPDB. Modifications included identifying water quality constituents, making measurement units consistent, identifying paired inflow and outflow values, and converting BMPDB water quality values set as half the detection limit back to the detection limit. Total polycyclic aromatic hydrocarbons (PAH) values were added to the BMPSE from BMPDB data; they were calculated from individual PAH measurements at sites with enough data to calculate totals. The BMPSE tool can sort and rank the data, calculate plotting positions, calculate initial estimates, and calculate potential correlations to facilitate the distribution-fitting process (Granato, 2014). For water-quality ratio analysis the BMPSE generates the input files and the list of filenames for each constituent within the Graphical User Interface (GUI). The BMPSE calculates the Spearman’s rho (ρ) and Kendall’s tau (τ) correlation coefficients with their respective 95-percent confidence limits and the probability that each correlation coefficient value is not significantly different from zero by using standard methods (Granato, 2014). If the 95-percent confidence limit values are of the same sign, then the correlation coefficient is statistically different from zero. For hydrograph extension, the BMPSE calculates ρ and τ between the inflow volume and the hydrograph-extension values (Granato, 2014). For volume reduction, the BMPSE calculates ρ and τ between the inflow volume and the ratio of outflow to inflow volumes (Granato, 2014). For water-quality treatment, the BMPSE calculates ρ and τ between the inflow concentrations and the ratio of outflow to inflow concentrations (Granato, 2014; 2020). The BMPSE also calculates ρ between the inflow and the outflow concentrations when a water-quality treatment analysis is done. The current version (1.2.0) of the BMPSE also has the option to calculate urban-runoff quality statistics from inflows to BMPs by using computer code developed for the Highway Runoff Database (Granato and Cazenas, 2009;Granato, 2019). Granato, G.E., 2013, Stochastic empirical loading and dilution model (SELDM) version 1.0.0: U.S. Geological Survey Techniques and Methods, book 4, chap. C3, 112 p., CD-ROM https://pubs.usgs.gov/tm/04/c03 Granato, G.E., 2014, Statistics for stochastic modeling of volume reduction, hydrograph extension, and water-quality treatment by structural stormwater runoff best management practices (BMPs): U.S. Geological Survey Scientific Investigations Report 2014–5037, 37 p., http://dx.doi.org/10.3133/sir20145037. Granato, G.E., 2019, Highway-Runoff Database (HRDB) Version 1.1.0: U.S. Geological Survey data release, https://doi.org/10.5066/P94VL32J. Granato, G.E., and Cazenas, P.A., 2009, Highway-Runoff Database (HRDB Version 1.0)--A data warehouse and preprocessor for the stochastic empirical loading and dilution model: Washington, D.C., U.S. Department of Transportation, Federal Highway Administration, FHWA-HEP-09-004, 57 p. https://pubs.usgs.gov/sir/2009/5269/disc_content_100a_web/FHWA-HEP-09-004.pdf Granato, G.E., Spaetzel, A.B., and Medalie, L., 2021, Statistical methods for simulating structural stormwater runoff best management practices (BMPs) with the stochastic empirical loading and dilution model (SELDM): U.S. Geological Survey Scientific Investigations Report 2020–5136, 41 p., https://doi.org/10.3133/sir20205136
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This data set contains the replication data and supplements for the article "Knowing, Doing, and Feeling: A three-year, mixed-methods study of undergraduates’ information literacy development." The survey data is from two samples: - cross-sectional sample (different students at the same point in time) - longitudinal sample (the same students and different points in time)Surveys were distributed via Qualtrics during the students' first and sixth semesters. Quantitative and qualitative data were collected and used to describe students' IL development over 3 years. Statistics from the quantitative data were analyzed in SPSS. The qualitative data was coded and analyzed thematically in NVivo. The qualitative, textual data is from semi-structured interviews with sixth-semester students in psychology at UiT, both focus groups and individual interviews. All data were collected as part of the contact author's PhD research on information literacy (IL) at UiT. The following files are included in this data set: 1. A README file which explains the quantitative data files. (2 file formats: .txt, .pdf)2. The consent form for participants (in Norwegian). (2 file formats: .txt, .pdf)3. Six data files with survey results from UiT psychology undergraduate students for the cross-sectional (n=209) and longitudinal (n=56) samples, in 3 formats (.dat, .csv, .sav). The data was collected in Qualtrics from fall 2019 to fall 2022. 4. Interview guide for 3 focus group interviews. File format: .txt5. Interview guides for 7 individual interviews - first round (n=4) and second round (n=3). File format: .txt 6. The 21-item IL test (Tromsø Information Literacy Test = TILT), in English and Norwegian. TILT is used for assessing students' knowledge of three aspects of IL: evaluating sources, using sources, and seeking information. The test is multiple choice, with four alternative answers for each item. This test is a "KNOW-measure," intended to measure what students know about information literacy. (2 file formats: .txt, .pdf)7. Survey questions related to interest - specifically students' interest in being or becoming information literate - in 3 parts (all in English and Norwegian): a) information and questions about the 4 phases of interest; b) interest questionnaire with 26 items in 7 subscales (Tromsø Interest Questionnaire - TRIQ); c) Survey questions about IL and interest, need, and intent. (2 file formats: .txt, .pdf)8. Information about the assignment-based measures used to measure what students do in practice when evaluating and using sources. Students were evaluated with these measures in their first and sixth semesters. (2 file formats: .txt, .pdf)9. The Norwegain Centre for Research Data's (NSD) 2019 assessment of the notification form for personal data for the PhD research project. In Norwegian. (Format: .pdf)
Facebook
TwitterThe harmonized data set on health, created and published by the ERF, is a subset of Iraq Household Socio Economic Survey (IHSES) 2012. It was derived from the household, individual and health modules, collected in the context of the above mentioned survey. The sample was then used to create a harmonized health survey, comparable with the Iraq Household Socio Economic Survey (IHSES) 2007 micro data set.
----> Overview of the Iraq Household Socio Economic Survey (IHSES) 2012:
Iraq is considered a leader in household expenditure and income surveys where the first was conducted in 1946 followed by surveys in 1954 and 1961. After the establishment of Central Statistical Organization, household expenditure and income surveys were carried out every 3-5 years in (1971/ 1972, 1976, 1979, 1984/ 1985, 1988, 1993, 2002 / 2007). Implementing the cooperation between CSO and WB, Central Statistical Organization (CSO) and Kurdistan Region Statistics Office (KRSO) launched fieldwork on IHSES on 1/1/2012. The survey was carried out over a full year covering all governorates including those in Kurdistan Region.
The survey has six main objectives. These objectives are:
The raw survey data provided by the Statistical Office were then harmonized by the Economic Research Forum, to create a comparable version with the 2006/2007 Household Socio Economic Survey in Iraq. Harmonization at this stage only included unifying variables' names, labels and some definitions. See: Iraq 2007 & 2012- Variables Mapping & Availability Matrix.pdf provided in the external resources for further information on the mapping of the original variables on the harmonized ones, in addition to more indications on the variables' availability in both survey years and relevant comments.
National coverage: Covering a sample of urban, rural and metropolitan areas in all the governorates including those in Kurdistan Region.
1- Household/family. 2- Individual/person.
The survey was carried out over a full year covering all governorates including those in Kurdistan Region.
Sample survey data [ssd]
----> Design:
Sample size was (25488) household for the whole Iraq, 216 households for each district of 118 districts, 2832 clusters each of which includes 9 households distributed on districts and governorates for rural and urban.
----> Sample frame:
Listing and numbering results of 2009-2010 Population and Housing Survey were adopted in all the governorates including Kurdistan Region as a frame to select households, the sample was selected in two stages: Stage 1: Primary sampling unit (blocks) within each stratum (district) for urban and rural were systematically selected with probability proportional to size to reach 2832 units (cluster). Stage two: 9 households from each primary sampling unit were selected to create a cluster, thus the sample size of total survey clusters was 25488 households distributed on the governorates, 216 households in each district.
----> Sampling Stages:
In each district, the sample was selected in two stages: Stage 1: based on 2010 listing and numbering frame 24 sample points were selected within each stratum through systematic sampling with probability proportional to size, in addition to the implicit breakdown urban and rural and geographic breakdown (sub-district, quarter, street, county, village and block). Stage 2: Using households as secondary sampling units, 9 households were selected from each sample point using systematic equal probability sampling. Sampling frames of each stages can be developed based on 2010 building listing and numbering without updating household lists. In some small districts, random selection processes of primary sampling may lead to select less than 24 units therefore a sampling unit is selected more than once , the selection may reach two cluster or more from the same enumeration unit when it is necessary.
Face-to-face [f2f]
----> Preparation:
The questionnaire of 2006 survey was adopted in designing the questionnaire of 2012 survey on which many revisions were made. Two rounds of pre-test were carried out. Revision were made based on the feedback of field work team, World Bank consultants and others, other revisions were made before final version was implemented in a pilot survey in September 2011. After the pilot survey implemented, other revisions were made in based on the challenges and feedbacks emerged during the implementation to implement the final version in the actual survey.
----> Questionnaire Parts:
The questionnaire consists of four parts each with several sections: Part 1: Socio – Economic Data: - Section 1: Household Roster - Section 2: Emigration - Section 3: Food Rations - Section 4: housing - Section 5: education - Section 6: health - Section 7: Physical measurements - Section 8: job seeking and previous job
Part 2: Monthly, Quarterly and Annual Expenditures: - Section 9: Expenditures on Non – Food Commodities and Services (past 30 days). - Section 10 : Expenditures on Non – Food Commodities and Services (past 90 days). - Section 11: Expenditures on Non – Food Commodities and Services (past 12 months). - Section 12: Expenditures on Non-food Frequent Food Stuff and Commodities (7 days). - Section 12, Table 1: Meals Had Within the Residential Unit. - Section 12, table 2: Number of Persons Participate in the Meals within Household Expenditure Other Than its Members.
Part 3: Income and Other Data: - Section 13: Job - Section 14: paid jobs - Section 15: Agriculture, forestry and fishing - Section 16: Household non – agricultural projects - Section 17: Income from ownership and transfers - Section 18: Durable goods - Section 19: Loans, advances and subsidies - Section 20: Shocks and strategy of dealing in the households - Section 21: Time use - Section 22: Justice - Section 23: Satisfaction in life - Section 24: Food consumption during past 7 days
Part 4: Diary of Daily Expenditures: Diary of expenditure is an essential component of this survey. It is left at the household to record all the daily purchases such as expenditures on food and frequent non-food items such as gasoline, newspapers…etc. during 7 days. Two pages were allocated for recording the expenditures of each day, thus the roster will be consists of 14 pages.
----> Raw Data:
Data Editing and Processing: To ensure accuracy and consistency, the data were edited at the following stages: 1. Interviewer: Checks all answers on the household questionnaire, confirming that they are clear and correct. 2. Local Supervisor: Checks to make sure that questions has been correctly completed. 3. Statistical analysis: After exporting data files from excel to SPSS, the Statistical Analysis Unit uses program commands to identify irregular or non-logical values in addition to auditing some variables. 4. World Bank consultants in coordination with the CSO data management team: the World Bank technical consultants use additional programs in SPSS and STAT to examine and correct remaining inconsistencies within the data files. The software detects errors by analyzing questionnaire items according to the expected parameter for each variable.
----> Harmonized Data:
Iraq Household Socio Economic Survey (IHSES) reached a total of 25488 households. Number of households refused to response was 305, response rate was 98.6%. The highest interview rates were in Ninevah and Muthanna (100%) while the lowest rates were in Sulaimaniya (92%).
Facebook
Twitterhttps://dataverse-staging.rdmc.unc.edu/api/datasets/:persistentId/versions/2.0/customlicense?persistentId=hdl:1902.29/CD-0227https://dataverse-staging.rdmc.unc.edu/api/datasets/:persistentId/versions/2.0/customlicense?persistentId=hdl:1902.29/CD-0227
"The Statistical Abstract of the United States, published since 1878, is the standard summary of statistics on the social, political, and economic organization of the United States. It is designed to serve as a convenient volume for statistical reference and as a guide to other statistical publications and sources. The latter function is served by the introductory text to each section, the source note appearing below each table, and Appendix I, which comprises the Guide to Sources of Statisti cs, the Guide to State Statistical Abstracts, and the Guide to Foreign Statistical Abstracts. The Statistical Abstract sections and tables are compiled into one Adobe PDF named StatAbstract2007.pdf. This PDF is bookmarked by section and by table and can be searched using the Acrobat Search feature. The Statistical Abstract on CD-ROM is best viewed using Adobe Acrobat 5, or any subsequent version of Acrobat or Acrobat Reader. The Statistical Abstract tables and the metropolitan areas tables from Appendix II are available as Excel(.xls or .xlw) spreadsheets. In most cases, these spreadsheet files offer the user direct access to more data than are shown either in the publication or Adobe Acrobat. These files usually contain more years of data, more geographic areas, and/or more categories of subjects than those shown in the Acrobat version. The extensive selection of statistics is provided for the United States, with selected data for regions, divisions, states, metropolitan areas, cities, and foreign countries from reports and records of government and private agencies. Software on the disc can be used to perform full-text searches, view official statistics, open tables as Lotus worksheets or Excel workbooks, and link directly to source agencies and organizations for su pporting information. Except as indicated, figures are for the United States as presently constituted. Although emphasis in the Statistical Abstract is primarily given to national data, many tables present data for regions and individual states and a smaller number for metropolitan areas and cities.Statistics for the Commonwealth of Puerto Rico and for island areas of the United States are included in many state tables and are supplemented by information in Section 29. Additional information for states, cities, counties, metropolitan areas, and other small units, as well as more historical data are available in various supplements to the Abstract. Statistics in this edition are generally for the most recent year or period available by summer 2006. Each year over 1,400 tables and charts are reviewed and evaluated; new tables and charts of current interest are added, continuing series are updated, and less timely data are condensed or eliminated. Text notes and appendices are revised as appropriate. This year we have introduced 72 new tables covering a wide range of subject areas. These cover a variety of topics including: learning disability for children, people impacted by the hurricanes in the Gulf Coast area, employees with alternative work arrangements, adult computer and Internet users by selected characteristics, North America cruise industry, women- and minority-owned businesses, and the percentage of the adult population considered to be obese. Some of the annually surveyed topics are population; vital statistics; health and nutrition; education; law enforcement, courts and prison; geography and environment; elections; state and local government; federal government finances and employment; national defense and veterans affairs; social insurance and human services; labor force, employment, and earnings; income, expenditures, and wealth; prices; business enterprise; science and technology; agriculture; natural resources; energy; construction and housing; manufactures; domestic trade and services; transportation; information and communication; banking, finance, and insurance; arts, entertainment, and recreation; accommodation, food services, and other services; foreign commerce and aid; outlying areas; and comparative international statistics." Note to Users: This CD is part of a collection located in the Data Archive of the Odum Institute for Research in Social Science, at the University of North Carolina at Chapel Hill. The collection is located in Room 10, Manning Hall. Users may check the CDs out subscribing to the honor system. Items can be checked out for a period of two weeks. Loan forms are located adjacent to the collection.
Facebook
TwitterThis survey provides information on household income and expenditure leading to measure the levels and changes of the living conditions of the people and to observe the consumption patterns .
Key objectives of the survey - To identify the income patterns in Urban, Rural and Estate Sectors & provinces. - To identify the income patterns by income levels. - Average consumption of food items and non food items - Expenditure patterns by sector and by income level.
National coverage.
Household, Individuals
For this survey a sample of buildings and the occupants therein was drawn from the whole island
Sample survey data [ssd]
A two stage stratified random sample design was used in the survey. Urban, Rural and Estate sectors of the Districts were the domains for stratification. The sample frame was the list of buildings that were prepared for the Census of Population and Housing 2001.
Selection of Primary Sampling Units (PSU's) Primary sampling units are the census blocks prepared for the Census of Population and Housing - 2001. The sample frame, which is a collection of all census blocks in the domain, was used for the selection of primary sampling units. A sample of 500 primary sampling units was selected from the sampling frame for the survey.
Selection of Secondary Sampling Units (SSU's) Secondary Sampling Units are the housing units in the selected 500 primary sampling units (census blocks). From each primary sampling unit 10 housing units (SSU) were selected for the survey. The total sample size of 5000 housing units was selected and distributed among Districts in Sri Lanka.
Face-to-face [f2f]
Questionaires
The survey schedule was designed to collect data by household and separate schedules were used for each household identified according to the definition of the household within the housing units selected for the survey. The survey schedule consists three main sections .
1. Demographic section
2. Expenditure
3. Income
The Demographic characteristics and usual activities of the inmates belonging to the household were reported in the Demographic section of the schedule (and close relatives temporarily living away are also listed in this section). Expenditure section has two sub sections to report food and non-food consumption data separately. Expenditure incurred on their own decisions by boarders and servants are recorded in the sub section under the Main expenditure section. The income has seven sub sections categorized according to the main sources of income.
The exact differences or sampling error ,varies depending on the particular sample selected and the variability is measured by the standard error of the estimate. There is about a 95% chance or level of confidence that an estimate based on a sample will differ by no more than 1.96 standard errors from the true population value because of sampling error. Analyses relating to the HIES are generally conducted at the 95% level of confidence .
confidence interval = Estimate value ± (standard error )*(1.96)
http://www.statistics.gov.lk/HIES/HIES%202007/introduction%20%20HIES.pdf
By visiting the above website a description about the adjustments for non-response could be read in section 1.2 of the Final report.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
GENERAL INFORMATION
Title of Dataset: A dataset from a survey investigating disciplinary differences in data citation
Date of data collection: January to March 2022
Collection instrument: SurveyMonkey
Funding: Alfred P. Sloan Foundation
SHARING/ACCESS INFORMATION
Licenses/restrictions placed on the data: These data are available under a CC BY 4.0 license
Links to publications that cite or use the data:
Gregory, K., Ninkov, A., Ripp, C., Peters, I., & Haustein, S. (2022). Surveying practices of data citation and reuse across disciplines. Proceedings of the 26th International Conference on Science and Technology Indicators. International Conference on Science and Technology Indicators, Granada, Spain. https://doi.org/10.5281/ZENODO.6951437
Gregory, K., Ninkov, A., Ripp, C., Roblin, E., Peters, I., & Haustein, S. (2023). Tracing data:
A survey investigating disciplinary differences in data citation. Zenodo. https://doi.org/10.5281/zenodo.7555266
DATA & FILE OVERVIEW
File List
Additional related data collected that was not included in the current data package: Open ended questions asked to respondents
METHODOLOGICAL INFORMATION
Description of methods used for collection/generation of data:
The development of the questionnaire (Gregory et al., 2022) was centered around the creation of two main branches of questions for the primary groups of interest in our study: researchers that reuse data (33 questions in total) and researchers that do not reuse data (16 questions in total). The population of interest for this survey consists of researchers from all disciplines and countries, sampled from the corresponding authors of papers indexed in the Web of Science (WoS) between 2016 and 2020.
Received 3,632 responses, 2,509 of which were completed, representing a completion rate of 68.6%. Incomplete responses were excluded from the dataset. The final total contains 2,492 complete responses and an uncorrected response rate of 1.57%. Controlling for invalid emails, bounced emails and opt-outs (n=5,201) produced a response rate of 1.62%, similar to surveys using comparable recruitment methods (Gregory et al., 2020).
Methods for processing the data:
Results were downloaded from SurveyMonkey in CSV format and were prepared for analysis using Excel and SPSS by recoding ordinal and multiple choice questions and by removing missing values.
Instrument- or software-specific information needed to interpret the data:
The dataset is provided in SPSS format, which requires IBM SPSS Statistics. The dataset is also available in a coded format in CSV. The Codebook is required to interpret to values.
DATA-SPECIFIC INFORMATION FOR: MDCDataCitationReuse2021surveydata
Number of variables: 94
Number of cases/rows: 2,492
Missing data codes: 999 Not asked
Refer to MDCDatacitationReuse2021Codebook.pdf for detailed variable information.
Facebook
TwitterThe latest estimates from the 2010/11 Taking Part adult survey produced by DCMS were released on 30 June 2011 according to the arrangements approved by the UK Statistics Authority.
30 June 2011
**
April 2010 to April 2011
**
National and Regional level data for England.
**
Further analysis of the 2010/11 adult dataset and data for child participation will be published on 18 August 2011.
The latest data from the 2010/11 Taking Part survey provides reliable national estimates of adult engagement with sport, libraries, the arts, heritage and museums & galleries. This release also presents analysis on volunteering and digital participation in our sectors and a look at cycling and swimming proficiency in England. The Taking Part survey is a continuous annual survey of adults and children living in private households in England, and carries the National Statistics badge, meaning that it meets the highest standards of statistical quality.
These spreadsheets contain the data and sample sizes for each sector included in the survey:
The previous Taking Part release was published on 31 March 2011 and can be found online.
This release is published in accordance with the Code of Practice for Official Statistics (2009), as produced by the http://www.statisticsauthority.gov.uk/">UK Statistics Authority (UKSA). The UKSA has the overall objective of promoting and safeguarding the production and publication of official statistics that serve the public good. It monitors and reports on all official statistics, and promotes good practice in this area.
The document below contains a list of Ministers and Officials who have received privileged early access to this release of Taking Part data. In line with best practice, the list has been kept to a minimum and those given access for briefing purposes had a maximum of 24 hours.
The responsible statistician for this release is Neil Wilson. For any queries please contact the Taking Part team on 020 7211 6968 or takingpart@culture.gsi.gov.uk.
Facebook
TwitterIntroduction
The Annual Survey of Industries (ASI) is the principal source of industrial statistics in India. It provides statistical information to assess changes in the growth, composition and structure of organised manufacturing sector comprising activities related to manufacturing processes, repair services, gas and water supply and cold storage. Industrial sector occupies an important position in the State economy and has a pivotal role to play in the rapid and balanced economic development. The Survey is conducted annually under the statutory provisions of the Collection of Statistics Act 1953, and the Rules framed there-under in 1959, except in the State of Jammu & Kashmir where it is conducted under the State Collection of Statistics Act, 1961 and the rules framed there-under in 1964.
Coverage of the Annual Survey of Industries extends to the entire Factory Sector, comprising industrial units (called factories) registered under section 2(m)(i) and 2(m)(ii) of the Factories Act.1948, wherein a "Factory", which is the primary statistical unit of enumeration for the ASI is defined as:- "Any premises" including the precincts thereof:- (i) wherein ten or more workers are working or were working on any day of the preceding twelve months, and in any part of which a manufacturing process is being carried on with the aid of power or is ordinarily so carried on, or (ii) wherein twenty or more workers are working or were working on any day of the preceding twelve months, and in any part of which a manufacturing process is being carried on without the aid of power. In addition to section 2(m)(i) & 2(m)(ii) of the Factories Act, 1948, electricity units registered with the Central Electricity Authority and Bidi & Cigar units, registered under the Bidi & Cigar Workers (Conditions of Employment) Act,1966 are also covered in ASI.
The primary unit of enumeration in the survey is a factory in the case of manufacturing industries, a workshop in the case of repair services, an undertaking or a licensee in the case of electricity, gas & water supply undertakings and an establishment in the case of bidi & cigar industries. The owner of two or more establishments located in the same State and pertaining to the same industry group and belonging to same scheme (census or sample) is, however, permitted to furnish a single consolidated return. Such consolidated returns are common feature in the case of bidi and cigar establishments, electricity and certain public sector undertakings.
The survey cover factories registered under the Factory Act 1948. Establishments under the control of the Defence Ministry,oil storage and distribution units, restaurants and cafes and technical training institutions not producing anything for sale or exchange were kept outside the coverage of the ASI. The geographical coverage of the Annual Survey of Industries, 2002-03 has been extended to the entire country except the states of Arunachal Pradesh, Mizoram and Sikkim and Union Territory of Lakshadweep.
Census and Sample survey data [cen/ssd]
Sampling Procedure
The sampling design followed in ASI 2002-03 is a circular systematic one. All the factories in the updated frame (universe) are divided into two sectors, viz., Census and Sample.
Census Sector: Census Sector is defined as follows:
a) All industrial units belonging to the six less industrially developed states/ UT's viz. Manipur, Meghalaya, Nagaland, Tripura, Sikkim and Andaman & Nicobar Islands.
b) For the rest of the twenty-six states/ UT's., (i) units having 100 or more workers, and (ii) all factories covered under Joint Returns.
c) After excluding the Census Sector units as defined above, all units belonging to the strata (State by 4-digit of NIC-04) having less than or equal to 4 units are also considered as Census Sector units.
Remaining units, excluding those of Census Sector, called the sample sector, are arranged in order of their number of workers and samples are then drawn circular systematically considering sampling fraction of 20% within each stratum (State X Sector X 4-digit NIC) for all the states. An even number of units with a minimum of 4 are selected and evenly distributed in two sub-samples. The sectors considered here are Biri, Manufacturing and Electricity.
There was no deviation from sample design in ASI 2002-03
Statutory return submitted by factories as well as Face to face
Annual Survey of Industries Questionnaire (in External Resources) is divided into different blocks:
BLOCK A :IDENTIFICATION PARTICULARS BLOCK B : PARTICULARS OF THE FACTORY (TO BE FILLED BY OWNER OF THE FACTORY) BLOCK C : FIXED ASSETS BLOCK D : WORKING CAPITAL & LOANS BLOCK E : EMPLOYMENT AND LABOUR COST BLOCK F : OTHER EXPENSES BLOCK G : OTHER INCOMES BLOCK H : INPUT ITEMS (indigenous items consumed) BLOCK I : INPUT ITEMS – directly imported items only (consumed) BLOCK J : PRODUCTS AND BY-PRODUCTS (manufactured by the unit)
Pre-data entry scrutiny was carried out on the schedules for inter and intra block consistency checks. Such editing was mostly manual, although some editing was automatic. But, for major inconsistencies, the schedules were referred back to NSSO (FOD) for clarifications/modifications.
Code list, State code list, Tabulation program and ASICC code are also may be refered in the External Resources which are used for editing and data processing as well..
Tabulation procedure The tabulation procedure by CSO(ISW) includes both the ASI 2002-03 data and the extracted data from ASI 01-02 for all tabulation purpose. For extracted returns, status of unit (Block A, Item 12) would be in the range 17 to 20. To make results comparable, users are requested to follow the same procedure. For calculation of various parameters, users are requested to refer instruction manual/report. Please note that a separate inflation factor (Multiplier) is available for each unit against records belonging to Block-A ,pos:62-70 (Please refer STRUC03.XLS) for ASI 2002-03 data. The multiplier is calculated for each sub-stratum (i.e. State X NIC'98(4 Digit) X sub-stratum) after adjusting for non-response cases.
Status of unit code 17-20 may always be considered for all processing.
Merging of unit level data As per existing policy to merge unit level data at ultimate digit level of NIC'98 (i.e., 5 digit) for the purpose of dissemination, the data have been merged for industries having less than three units within State, District and NIC'98(5 Digit) with the adjoining industries within district and then to adjoining districts within a state. There may be some NIC'98(5 Digit) ending with '9' which do not figure in the book of NIC '98. These may be treated as 'Others' under the corresponding 4-digit group. To suppress the identity of factories data fields corresponding to PSL number, Industry code as per Frame (4-digit level of NIC-98) and RO/SRO code have been filled with '9' in each record.
It may please be noted that, tables generated from the merged data may not tally with the published results for few industries, since the merging for published data has been done at aggregate-level to minimise the loss of information.
Relative Standard Error (RSE) is calculated in terms of worker, wages to worker and GVA using the formula. Programs developed in Visual Foxpro are used to compute the RSE of estimates.
To check for consistency and reliability of data the same are compared with the NIC-2digit level growth rate at all India Index of Production (IIP) and the growth rates obtained from the National Accounts Statistics at current and constant prices for the registered manufacturing sector.
Facebook
TwitterThis dataset includes percent distribution of births for females by age group in the United States since 1933. The number of states in the reporting area differ historically. In 1915 (when the birth registration area was established), 10 states and the District of Columbia reported births; by 1933, 48 states and the District of Columbia were reporting births, with the last two states, Alaska and Hawaii, added to the registration area in 1959 and 1960, when these regions gained statehood. Reporting area information is detailed in references 1 and 2 below. Trend lines for 1909–1958 are based on live births adjusted for under-registration; beginning with 1959, trend lines are based on registered live births. SOURCES NCHS, National Vital Statistics System, birth data (see https://www.cdc.gov/nchs/births.htm); public-use data files (see https://www.cdc.gov/nchs/data_access/VitalStatsOnline.htm); and CDC WONDER (see http://wonder.cdc.gov/). REFERENCES National Office of Vital Statistics. Vital Statistics of the United States, 1950, Volume I. 1954. Available from: https://www.cdc.gov/nchs/data/vsus/vsus_1950_1.pdf. Hetzel AM. U.S. vital statistics system: major activities and developments, 1950-95. National Center for Health Statistics. 1997. Available from: https://www.cdc.gov/nchs/data/misc/usvss.pdf. National Center for Health Statistics. Vital Statistics of the United States, 1967, Volume I–Natality. 1969. Available from: https://www.cdc.gov/nchs/data/vsus/nat67_1.pdf. Martin JA, Hamilton BE, Osterman MJK, et al. Births: Final data for 2015. National vital statistics reports; vol 66 no 1. Hyattsville, MD: National Center for Health Statistics. 2017. Available from: https://www.cdc.gov/nchs/data/nvsr/nvsr66/nvsr66_01.pdf. Martin JA, Hamilton BE, Osterman MJK, Driscoll AK, Drake P. Births: Final data for 2016. National Vital Statistics Reports; vol 67 no 1. Hyattsville, MD: National Center for Health Statistics. 2018. Available from: https://www.cdc.gov/nvsr/nvsr67/nvsr67_01.pdf. Martin JA, Hamilton BE, Osterman MJK, Driscoll AK, Births: Final data for 2018. National vital statistics reports; vol 68 no 13. Hyattsville, MD: National Center for Health Statistics. 2019. Available from: https://www.cdc.gov/nchs/data/nvsr/nvsr68/nvsr68_13.pdf.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This collection contains a snapshot of the learning resource metadata from ESIP's Data management Training Clearinghouse (DMTC) associated with the closeout (March 30, 2023) of the Institute of Museum and Library Services funded (Award Number: LG-70-18-0092-18) Development of an Enhanced and Expanded Data Management Training Clearinghouse project. The shared metadata are a snapshot associated with the final reporting date for the project, and the associated data report is also based upon the same data snapshot on the same date.
The materials included in the collection consist of the following:
esip-dev-02.edacnm.org.json.zip - a zip archive containing the metadata for 587 published learning resources as of March 30, 2023. These metadata include all publicly available metadata elements for the published learning resources with the exception of the metadata elements containing individual email addresses (submitter and contact) to reduce the exposure of these data.
statistics.pdf - an automatically generated report summarizing information about the collection of materials in the DMTC Clearinghouse, including both published and unpublished learning resources. This report includes the numbers of published and unpublished resources through time; the number of learning resources within subject categories and detailed subject categories, the dates items assigned to each category were first added to the Clearinghouse, and the most recent data that items were added to that category; the distribution of learning resources across target audiences; and the frequency of keywords within the learning resource collection. This report is based on the metadata for published resourced included in this collection, and preliminary metadata for unpublished learning resources that are not included in the shared dataset.
The metadata fields consist of the following:
Fieldname
Description
abstract_data
A brief synopsis or abstract about the learning resource
abstract_format
Declaration for how the abstract description will be represented.
access_conditions
Conditions upon which the resource can be accessed beyond cost, e.g., login required.
access_cost
Yes or No choice stating whether othere is a fee for access to or use of the resource.
accessibililty_features_name
Content features of the resource, such as accessible media, alternatives and supported enhancements for accessibility.
accessibililty_summary
A human-readable summary of specific accessibility features or deficiencies.
author_names
List of authors for a resource derived from the given/first and family/last names of the personal author fields by the system
author_org
- name
- name_identifier
- name_identifier_type
- Name of organization authoring the learning resource.
- The unique identifier for the organization authoring the resource.
- The identifier scheme associated with the unique identifier for the organization authoring the resource.
authors - givenName - familyName - name_identifier - name_identifier_type
- Given or first name of person(s) authoring the resource.
- Last or family name of person(s) authoring the resource.
- The unique identifier for the person(s) authoring the resource.
- The identifier scheme associated with the unique identifier for the person(s) authoring the resource, e.g., ORCID.
citation
Preferred Form of Citation.
completion_time
Intended Time to Complete
contact - name - org - email
- Name of person(s) who has/have been asserted as the contact(s) for the resource in case of questions or follow-up by resource user.
- Name of organization that has/have been asserted as the contact(s) for the resource in case of questions or follow-up by resource user.
- (excluded) Contact email address.
contributor_orgs
- name
- name_identifier
- name_identifier_type
- type
- Name of organization that is a secondary contributor to the learningresource. A contributor can also be an individual person.
- The unique identifier for the organization contributing to the resource.
- The identifier scheme associated with the unique identifier for the organization contributing to the resource.
- Type of contribution to the resource made by an organization.
contributors
- familyName
- givenName
- name_identifier
- name_identifier_type
contributors.type
Type of contribution to the resource made by a person.
created
The date on which the metadata record was first saved as part of the input workflow.
creator
The name of the person creating the MD record for a resource.
credential_status
Declaration of whether a credential is offered for comopletion of the resource.
ed_frameworks - name - description - nodes.name
- The name of the educational framework to which the resource is aligned, if any. An educational framework is a structured description of educational concepts such as a shared curriculum, syllabus or set of learning objectives, or a vocabulary for describing some other aspect of education such as educational levels or reading ability.
- A description of one or more subcategories of an educational framework to which a resource is associated.
- The name of a subcategory of an educational framework to which a resource is associated.
expertise_level
The skill level targeted for the topic being taught.
id
Unique identifier for the MD record generated by the system in UUID format.
keywords
Important phrases or words used to describe the resource.
language_primary
Original language in which the learning resource being described is published or made available.
languages_secondary
Additional languages in which the resource is tranlated or made available, if any.
license
A license for use of that applies to the resource, typically indicated by URL.
locator_data
The identifier for the learning resource used as part of a citation, if available.
locator_type
Designation of citation locatorr type, e.g., DOI, ARK, Handle.
lr_outcomes
Descriptions of what knowledge, skills or abilities students should learn from the resource.
lr_type
A characteristic that describes the predominant type or kind of learning resource.
media_type
Media type of resource.
modification_date
System generated date and time when MD record is modified.
notes
MD Record Input Notes
pub_status
Status of metadata record within the system, i.e., in-process, in-review, pre-pub-review, deprecate-request, deprecated or published.
published
Date of first broadcast / publication.
publisher
The organization credited with publishing or broadcasting the resource.
purpose
The purpose of the resource in the context of education; e.g., instruction, professional education, assessment.
rating
The aggregation of input from all user assessments evaluating users' reaction to the learning resource following Kirkpatrick's model of training evaluation.
ratings
Inputs from users assessing each user's reaction to the learning resource following Kirkpatrick's model of training evaluation.
resource_modification_date
Date in which the resource has last been modified from the original published or broadcast version.
status
System generated publication status of the resource w/in the registry as a yes for published or no for not published.
subject
Subject domain(s) toward which the resource is targeted. There may be more than one value for this field.
submitter_email
(excluded) Email address of person who submitted the resource.
submitter_name
Submission Contact Person
target_audience
Audience(s) for which the resource is intended.
title
The name of the resource.
url
URL that resolves to a downloadable version of the learning resource or to a landing page for the resource that contains important contextual information including the direct resolvable link to the resource, if applicable.
usage_info
Descriptive information about using the resource, not addressed by the License information field.
version
The specific version of the resource, if declared.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Structural business statistics (SBS) describes the structure, conduct and performance of economic activities, down to the most detailed activity level (several hundred economic sectors).
SBS are transmitted annually by the EU Member States on the basis of a legal obligation from 1995 onwards.
SBS covers all activities of the business economy with the exception of agricultural activities and personal services and the data are provided by all EU Member States, Iceland, Norway and Switzerland, some candidate and potential candidate countries. The data are collected by domain of activity (annex) :
The majority of the data is collected by National Statistical Institutes (NSIs) by means of statistical surveys, business registers or from various administrative sources. Regulatory or controlling national offices for financial institutions or central banks often provide the information required for the financial sector (NACE Rev 2 Section K / NACE Rev 1.1 Section J).
Member States apply various statistical methods, according to the data source, such as grossing up, model based estimation or different forms of imputation, to ensure the quality of SBSs produced.
Main characteristics (variables) of the SBS data category:
All SBS characteristics are published on Eurostat’s website by tables and an example of the existent tables is presented below:
More information on the contents of different tables: the detail level and breakdowns required starting with the reference year 2008 is defined in Commission Regulation N° 251/2009. For previous reference years it is included in Commission Regulations (EC) N° 2701/98 and amended by Commission Regulation N°1614/2002 and Commission Regulation N°1669/2003.
Several important derived indicators are generated in the form of ratios of certain monetary characteristics or per head values. A list with the available derived indicators is available below in the Annex.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Ecological theories often encompass multiple levels of biological organization, such as genes, individuals, populations, and communities. Despite substantial progress toward ecological theory spanning multiple levels, ecological data rarely are connected in this way. This is unfortunate because different types of ecological data often emerge from the same underlying processes and, therefore, are naturally connected among levels. Here, we describe an approach to integrate data collected at multiple levels (e.g., individuals, populations) in a single statistical analysis. The resulting integrated models make full use of existing data and might strengthen links between statistical ecology and ecological models and theories that span multiple levels of organization. Integrated models are increasingly feasible due to recent advances in computational statistics, which allow fast calculations of multiple likelihoods that depend on complex mechanistic models. We discuss recently developed integrated models and outline a simple application using data on freshwater fishes in south-eastern Australia. Available data on freshwater fishes include population survey data, mark-recapture data, and individual growth trajectories. We use these data to estimate age-specific survival and reproduction from size-structured data, accounting for imperfect detection of individuals. Given that such parameter estimates would be infeasible without an integrated model, we argue that integrated models will strengthen ecological theory by connecting theoretical and mathematical models directly to empirical data. Although integrated models remain conceptually and computationally challenging, integrating ecological data among levels is likely to be an important step toward unifying ecology among levels.
Facebook
TwitterData licence Germany – Attribution – Version 2.0https://www.govdata.de/dl-de/by-2-0
License information was derived automatically
Data from various sources are updated in the Statistical Information System of the City of Cologne. The annual statistical yearbook publishes these in tabular, graphic and cartographic form at the level of the city districts and districts. Furthermore, definitions and calculation bases are explained. Small-scale statistics at the level of the 86 districts can be obtained from the Cologne district information become. All levels of the local area structure are presented in this publication explained.
This statistical data catalogue supplements the range of small-scale data. Selected structural data can be called up here in compact tabular form at the level of the 570 statistical districts or the 86 districts. The two overviews provide information about which data is available and from which source it originates. The data itself is provided annually.
Notes:
Facebook
TwitterAccording to a survey carried out in August 2020 in the United Kingdom (UK), ** percent of marketing companies collected customer data through their website. Half did so through social media, while a slightly smaller share said they recorded customer data at organized events. Collection via purchase lists and preference centres were the least used methods.
Facebook
TwitterThe Tanzania Demographic and Health Survey (TDHS) is part of the worldwide Demographic and Health Surveys (DHS) programme, which is designed to collect data on fertility, family planning, and maternal and child health.
The primary objective of the 1999 TRCHS was to collect data at the national level (with breakdowns by urban-rural and Mainland-Zanzibar residence wherever warranted) on fertility levels and preferences, family planning use, maternal and child health, breastfeeding practices, nutritional status of young children, childhood mortality levels, knowledge and behaviour regarding HIV/AIDS, and the availability of specific health services within the community.1 Related objectives were to produce these results in a timely manner and to ensure that the data were disseminated to a wide audience of potential users in governmental and nongovernmental organisations within and outside Tanzania. The ultimate intent is to use the information to evaluate current programmes and to design new strategies for improving health and family planning services for the people of Tanzania.
National. The sample was designed to provide estimates for the whole country, for urban and rural areas separately, and for Zanzibar and, in some cases, Unguja and Pemba separately.
Sample survey data
The TRCHS used a three-stage sample design. Overall, 176 census enumeration areas were selected (146 on the Mainland and 30 in Zanzibar) with probability proportional to size on an approximately self-weighting basis on the Mainland, but with oversampling of urban areas and Zanzibar. To reduce costs and maximise the ability to identify trends over time, these enumeration areas were selected from the 357 sample points that were used in the 1996 TDHS, which in turn were selected from the 1988 census frame of enumeration in a two-stage process (first wards/branches and then enumeration areas within wards/branches). Before the data collection, fieldwork teams visited the selected enumeration areas to list all the households. From these lists, households were selected to be interviewed. The sample was designed to provide estimates for the whole country, for urban and rural areas separately, and for Zanzibar and, in some cases, Unguja and Pemba separately. The health facilities component of the TRCHS involved visiting hospitals, health centres, and pharmacies located in areas around the households interviewed. In this way, the data from the two components can be linked and a richer dataset produced.
See detailed sample implementation in the APPENDIX A of the final report.
Face-to-face
The household survey component of the TRCHS involved three questionnaires: 1) a Household Questionnaire, 2) a Women’s Questionnaire for all individual women age 15-49 in the selected households, and 3) a Men’s Questionnaire for all men age 15-59.
The health facilities survey involved six questionnaires: 1) a Community Questionnaire administered to men and women in each selected enumeration area; 2) a Facility Questionnaire; 3) a Facility Inventory; 4) a Service Provider Questionnaire; 5) a Pharmacy Inventory Questionnaire; and 6) a questionnaire for the District Medical Officers.
All these instruments were based on model questionnaires developed for the MEASURE programme, as well as on the questionnaires used in the 1991-92 TDHS, the 1994 TKAP, and the 1996 TDHS. These model questionnaires were adapted for use in Tanzania during meetings with representatives from the Ministry of Health, the University of Dar es Salaam, the Tanzania Food and Nutrition Centre, USAID/Tanzania, UNICEF/Tanzania, UNFPA/Tanzania, and other potential data users. The questionnaires and manual were developed in English and then translated into and printed in Kiswahili.
The Household Questionnaire was used to list all the usual members and visitors in the selected households. Some basic information was collected on the characteristics of each person listed, including his/her age, sex, education, and relationship to the head of the household. The main purpose of the Household Questionnaire was to identify women and men who were eligible for individual interview and children under five who were to be weighed and measured. Information was also collected about the dwelling itself, such as the source of water, type of toilet facilities, materials used to construct the house, ownership of various consumer goods, and use of iodised salt. Finally, the Household Questionnaire was used to collect some rudimentary information about the extent of child labour.
The Women’s Questionnaire was used to collect information from women age 15-49. These women were asked questions on the following topics: · Background characteristics (age, education, religion, type of employment) · Birth history · Knowledge and use of family planning methods · Antenatal, delivery, and postnatal care · Breastfeeding and weaning practices · Vaccinations, birth registration, and health of children under age five · Marriage and recent sexual activity · Fertility preferences · Knowledge and behaviour concerning HIV/AIDS.
The Men’s Questionnaire covered most of these same issues, except that it omitted the sections on the detailed reproductive history, maternal health, and child health. The final versions of the English questionnaires are provided in Appendix E.
Before the questionnaires could be finalised, a pretest was done in July 1999 in Kibaha District to assess the viability of the questions, the flow and logical sequence of the skip pattern, and the field organisation. Modifications to the questionnaires, including wording and translations, were made based on lessons drawn from the exercise.
In all, 3,826 households were selected for the sample, out of which 3,677 were occupied. Of the households found, 3,615 were interviewed, representing a response rate of 98 percent. The shortfall is primarily due to dwellings that were vacant or in which the inhabitants were not at home despite of several callbacks.
In the interviewed households, a total of 4,118 eligible women (i.e., women age 15-49) were identified for the individual interview, and 4,029 women were actually interviewed, yielding a response rate of 98 percent. A total of 3,792 eligible men (i.e., men age 15-59), were identified for the individual interview, of whom 3,542 were interviewed, representing a response rate of 93 percent. The principal reason for nonresponse among both eligible men and women was the failure to find them at home despite repeated visits to the household. The lower response rate among men than women was due to the more frequent and longer absences of men.
The response rates are lower in urban areas due to longer absence of respondents from their homes. One-member households are more common in urban areas and are more difficult to interview because they keep their houses locked most of the time. In urban settings, neighbours often do not know the whereabouts of such people.
The estimates from a sample survey are affected by two types of errors: (1) non-sampling errors, and (2) sampling errors. Non-sampling errors are the results of mistakes made in implementing data collection and data processing, such as failure to locate and interview the correct household, misunderstanding of the questions on the part of either the interviewer or the respondent, and data entry errors. Although numerous efforts were made during the implementation of the TRCHS to minimise this type of error, nonsampling errors are impossible to avoid and difficult to evaluate statistically.
Sampling errors, on the other hand, can be evaluated statistically. The sample of respondents selected in the TRCHS is only one of many samples that could have been selected from the same population, using the same design and expected size. Each of these samples would yield results that differ somewhat from the results of the actual sample selected. Sampling errors are a measure of the variability between all possible samples. Although the degree of variability is not known exactly, it can be estimated from the survey results.
A sampling error is usually measured in terms of the standard error for a particular statistic (mean, percentage, etc.), which is the square root of the variance. The standard error can be used to calculate confidence intervals within which the true value for the population can reasonably be assumed to fall. For example, for any given statistic calculated from a sample survey, the value of that statistic will fall within a range of plus or minus two times the standard error of that statistic in 95 percent of all possible samples of identical size and design.
If the sample of respondents had been selected as a simple random sample, it would have been possible to use straightforward formulas for calculating sampling errors. However, the TRCHS sample is the result of a two-stage stratified design, and, consequently, it was necessary to use more complex formulae. The computer software used to calculate sampling errors for the TRCHS is the ISSA Sampling Error Module (SAMPERR). This module used the Taylor linearisation method of variance estimation for survey estimates that are means or proportions. The Jackknife repeated replication method is used for variance estimation of more complex statistics such as fertility and mortality rate
Note: See detailed sampling error calculation in the APPENDIX B
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Structural business statistics (SBS) describes the structure, conduct and performance of economic activities, down to the most detailed activity level (several hundred economic sectors).
SBS are transmitted annually by the EU Member States on the basis of a legal obligation from 1995 onwards.
SBS covers all activities of the business economy with the exception of agricultural activities and personal services and the data are provided by all EU Member States, Iceland, Norway and Switzerland, some candidate and potential candidate countries. The data are collected by domain of activity (annex) :
The majority of the data is collected by National Statistical Institutes (NSIs) by means of statistical surveys, business registers or from various administrative sources. Regulatory or controlling national offices for financial institutions or central banks often provide the information required for the financial sector (NACE Rev 2 Section K / NACE Rev 1.1 Section J).
Member States apply various statistical methods, according to the data source, such as grossing up, model based estimation or different forms of imputation, to ensure the quality of SBSs produced.
Main characteristics (variables) of the SBS data category:
All SBS characteristics are published on Eurostat’s website by tables and an example of the existent tables is presented below:
More information on the contents of different tables: the detail level and breakdowns required starting with the reference year 2008 is defined in Commission Regulation N° 251/2009. For previous reference years it is included in Commission Regulations (EC) N° 2701/98 and amended by Commission Regulation N°1614/2002 and Commission Regulation N°1669/2003.
Several important derived indicators are generated in the form of ratios of certain monetary characteristics or per head values. A list with the available derived indicators is available below in the Annex.
Facebook
Twitterhttps://dataverse-staging.rdmc.unc.edu/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=hdl:1902.29/CD-0014https://dataverse-staging.rdmc.unc.edu/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=hdl:1902.29/CD-0014
The Statistical Abstract is the Nation's best known and most popular single source of statistics on the social, political, and economic organization of the country. The print version of this reference source has been published since 1878 while the compact disc version first appeared in 1993. This disc is designed to serve as a convenient, easy-to-use statistical reference source and guide to statistical publications and sources. The disc contains over 1,400 tables from over 250 different gove rnmental, private, and international organizations. The 1999 CD reflects improved and enhanced data on the disc and the software used for accessing the information. The enrichments to the data and their access include: a link for table of contents page to a PDF of The Census web site. This enable the user to have direct links to the Statistical Abstract and its supplements and other features, such as Statistics in Brief and Frequently Requested Tables. A link to the table of contents from the first text page of each section facilitates quick movement between sections of the book. New PDFs provide more explanation of several major economic series including the Federal Budget, the National Income and Product Accounts (NIPA), the Consumer Price Index (CPI)and Producer Price Index (PPI), and the new North American Industry Classification System (NAICS). Another PDF provides information on the Federal court system. Links to these supplemental materials are provided from each appropriate table. A separate PDF presents a compilation of tables showing major economic indices, as selected by the Council of Economic Advisors. Maps of each state and their metro areas and component counties, maps outlining National Park sites throughout the country, a map of the United States with major transportation facilities and routes, a U.S.map locating coal mines and facilities, and one depicting the distribution of forest land have been added. As usual, updates have been made to most of the more than 1,500 tables and charts that were on the previous disc with new or more recent data. The spreadsheet files, which are available in both Excel and Lotus formats, will usually have more information than the tables displayed in the book or Adobe Acrobat files. The 1999 year introduced over 100 new tables covering a wide range of subject areas. Several sections have preliminary data from the 1997 Economic Census, which presents industry statistics for the first time based on the North American Industry Classification System (NAICS). Comparative data for 1992 and 1997, based on the Standard Industrial Classification (SIC), are also presented. Tables 872 and 873 in Section 17, Business, present summary data for industries. Other new tables cover such topics as the foreign-born population, health care expenditures, the medicare trust fund, violence in schools, presale handgun checks, recycling programs, defense- related employment and spending, workplace violence, ownership of mutual funds, computer use, results of the 1997 Census of Agriculture, and mail order catalogue sales. In addition to the above new tables, a new section has been developed, the 20th Century Statistics. This section introduces data beginning in 1900 on a broad range of subjects, including population, vital statistics, health, education, income, labor force, communications, agriculture, defense, and other areas. The Industrial Outlook tables, previously in Section 31, have been deleted for lack of updates. For a complete list of new tables, see Appendix VI,p.947. The Adobe Acrobat Reader and Search engine, Version 4.0, is on the disc. The Acrobat Reader allows users to view, navigate, search, and print on demand any of the pages from the book. Note to Users: This CD is part of a collection located in the Data Archive of the Odum Institute for Research in Social Science, at the University of North Carolina at Chapel Hill. The collection is located in Room 10, Manning Hall. Users may check the CDs out subscribing to the honor system. Items can be checked out for a period of two weeks. Loan forms are located adjacent to the collection.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Structural business statistics (SBS) describes the structure, conduct and performance of economic activities, down to the most detailed activity level (several hundred economic sectors).
SBS are transmitted annually by the EU Member States on the basis of a legal obligation from 1995 onwards.
SBS covers all activities of the business economy with the exception of agricultural activities and personal services and the data are provided by all EU Member States, Iceland, Norway and Switzerland, some candidate and potential candidate countries. The data are collected by domain of activity (annex) :
The majority of the data is collected by National Statistical Institutes (NSIs) by means of statistical surveys, business registers or from various administrative sources. Regulatory or controlling national offices for financial institutions or central banks often provide the information required for the financial sector (NACE Rev 2 Section K / NACE Rev 1.1 Section J).
Member States apply various statistical methods, according to the data source, such as grossing up, model based estimation or different forms of imputation, to ensure the quality of SBSs produced.
Main characteristics (variables) of the SBS data category:
All SBS characteristics are published on Eurostat’s website by tables and an example of the existent tables is presented below:
More information on the contents of different tables: the detail level and breakdowns required starting with the reference year 2008 is defined in Commission Regulation N° 251/2009. For previous reference years it is included in Commission Regulations (EC) N° 2701/98 and amended by Commission Regulation N°1614/2002 and Commission Regulation N°1669/2003.
Several important derived indicators are generated in the form of ratios of certain monetary characteristics or per head values. A list with the available derived indicators is available below in the Annex.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Structural business statistics (SBS) describes the structure, conduct and performance of economic activities, down to the most detailed activity level (several hundred economic sectors).
SBS are transmitted annually by the EU Member States on the basis of a legal obligation from 1995 onwards.
SBS covers all activities of the business economy with the exception of agricultural activities and personal services and the data are provided by all EU Member States, Iceland, Norway and Switzerland, some candidate and potential candidate countries. The data are collected by domain of activity (annex) :
The majority of the data is collected by National Statistical Institutes (NSIs) by means of statistical surveys, business registers or from various administrative sources. Regulatory or controlling national offices for financial institutions or central banks often provide the information required for the financial sector (NACE Rev 2 Section K / NACE Rev 1.1 Section J).
Member States apply various statistical methods, according to the data source, such as grossing up, model based estimation or different forms of imputation, to ensure the quality of SBSs produced.
Main characteristics (variables) of the SBS data category:
All SBS characteristics are published on Eurostat’s website by tables and an example of the existent tables is presented below:
More information on the contents of different tables: the detail level and breakdowns required starting with the reference year 2008 is defined in Commission Regulation N° 251/2009. For previous reference years it is included in Commission Regulations (EC) N° 2701/98 and amended by Commission Regulation N°1614/2002 and Commission Regulation N°1669/2003.
Several important derived indicators are generated in the form of ratios of certain monetary characteristics or per head values. A list with the available derived indicators is available below in the Annex.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Public health-related decision-making on policies aimed at controlling the COVID-19 pandemic outbreak depends on complex epidemiological models that are compelled to be robust and use all relevant available data. This data article provides a new combined worldwide COVID-19 dataset obtained from official data sources with improved systematic measurement errors and a dedicated dashboard for online data visualization and summary. The dataset adds new measures and attributes to the normal attributes of official data sources, such as daily mortality, and fatality rates. We used comparative statistical analysis to evaluate the measurement errors of COVID-19 official data collections from the Chinese Center for Disease Control and Prevention (Chinese CDC), World Health Organization (WHO) and European Centre for Disease Prevention and Control (ECDC). The data is collected by using text mining techniques and reviewing pdf reports, metadata, and reference data. The combined dataset includes complete spatial data such as countries area, international number of countries, Alpha-2 code, Alpha-3 code, latitude, longitude, and some additional attributes such as population. The improved dataset benefits from major corrections on the referenced data sets and official reports such as adjustments in the reporting dates, which suffered from a one to two days lag, removing negative values, detecting unreasonable changes in historical data in new reports and corrections on systematic measurement errors, which have been increasing as the pandemic outbreak spreads and more countries contribute data for the official repositories. Additionally, the root mean square error of attributes in the paired comparison of datasets was used to identify the main data problems. The data for China is presented separately and in more detail, and it has been extracted from the attached reports available on the main page of the CCDC website. This dataset is a comprehensive and reliable source of worldwide COVID-19 data that can be used in epidemiological models assessing the magnitude and timeline for confirmed cases, long-term predictions of deaths or hospital utilization, the effects of quarantine, stay-at-home orders and other social distancing measures, the pandemic’s turning point or in economic and social impact analysis, helping to inform national and local authorities on how to implement an adaptive response approach to re-opening the economy, re-open schools, alleviate business and social distancing restrictions, design economic programs or allow sports events to resume.