100+ datasets found

C
Statistical Data Catalog Cologne
ckan.mobidatalab.eu
Updated Jul 26, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Köln (2023). Statistical Data Catalog Cologne [Dataset]. https://ckan.mobidatalab.eu/dataset/statisticaldatacatalogue-coln
Explore at:
http://publications.europa.eu/resource/authority/file-type/csv(307022), http://publications.europa.eu/resource/authority/file-type/csv(272780), http://publications.europa.eu/resource/authority/file-type/json, http://publications.europa.eu/resource/authority/file-type/csv(3746), http://publications.europa.eu/resource/authority/file-type/csv(3752), http://publications.europa.eu/resource/authority/file-type/csv(274184), http://publications.europa.eu/resource/authority/file-type/csv(3735), http://publications.europa.eu/resource/authority/file-type/csv(275264), http://publications.europa.eu/resource/authority/file-type/csv(5356), http://publications.europa.eu/resource/authority/file-type/csv(273265), http://publications.europa.eu/resource/authority/file-type/csv(3730), http://publications.europa.eu/resource/authority/file-type/csv(19787), http://publications.europa.eu/resource/authority/file-type/csv(273515), http://publications.europa.eu/resource/authority/file-type/csv(272571), http://publications.europa.eu/resource/authority/file-type/csv(3748), http://publications.europa.eu/resource/authority/file-type/csv(3753), http://publications.europa.eu/resource/authority/file-type/csv(271286), http://publications.europa.eu/resource/authority/file-type/csv(3754), http://publications.europa.eu/resource/authority/file-type/csv(273516), http://publications.europa.eu/resource/authority/file-type/csv(273403), http://publications.europa.eu/resource/authority/file-type/csv(3764), http://publications.europa.eu/resource/authority/file-type/csv(1215), http://publications.europa.eu/resource/authority/file-type/csv(3758)Available download formats
Dataset updated
Jul 26, 2023
Dataset provided by
Köln
License
Data licence Germany – Attribution – Version 2.0https://www.govdata.de/dl-de/by-2-0
License information was derived automatically
Description
Data from various sources are updated in the Statistical Information System of the City of Cologne. The annual statistical yearbook publishes these in tabular, graphic and cartographic form at the level of the city districts and districts. Furthermore, definitions and calculation bases are explained. Small-scale statistics at the level of the 86 districts can be obtained from the Cologne district information become. All levels of the local area structure are presented in this publication explained.

This statistical data catalogue supplements the range of small-scale data. Selected structural data can be called up here in compact tabular form at the level of the 570 statistical districts or the 86 districts. The two overviews provide information about which data is available and from which source it originates. The data itself is provided annually.

Notes:

Data sources are indicated in the summary tables. When using the data, the data license Germany - attribution - version 2.0 must be observed.

Some values cannot be given to protect statistical confidentiality. For the data sets of the Federal Employment Agency, these are values from 1 to < 10, for all further data records values from 1 to < 5. This is marked in the data by a * .

The differentiation of population figures by gender is currently made according to female and male residents. The case numbers of those who define themselves as non-binary/diverse are so low at a small-scale level that they cannot be reported for reasons of statistical confidentiality.

The determination of residents with a migration background is carried out by combination various characteristics from the resident registration procedure. The data are to be interpreted as estimates. The statistical yearbook of the city of Cologne provides further details.

The information on households comes from the household generation process. This is a statistical procedure in which residents within an address are assigned to a household as far as possible by querying certain criteria. If the procedure does not identify any connections, the allocation to single-person households takes place. The statistical yearbook of the city of Cologne provides further details.

The data set pupils* at general schools (spatial location by place of residence) is available from 2013.

The number of the statistical quarter or district is a spatial location and can be linked to the geodata (see related resource below).
m
COVID-19 Combined Data-set with Improved Measurement Errors
data.mendeley.com
Updated May 13, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Afshin Ashofteh (2020). COVID-19 Combined Data-set with Improved Measurement Errors [Dataset]. http://doi.org/10.17632/nw5m4hs3jr.3
Explore at:
Unique identifier
https://doi.org/10.17632/nw5m4hs3jr.3
Dataset updated
May 13, 2020
Authors
Afshin Ashofteh
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Public health-related decision-making on policies aimed at controlling the COVID-19 pandemic outbreak depends on complex epidemiological models that are compelled to be robust and use all relevant available data. This data article provides a new combined worldwide COVID-19 dataset obtained from official data sources with improved systematic measurement errors and a dedicated dashboard for online data visualization and summary. The dataset adds new measures and attributes to the normal attributes of official data sources, such as daily mortality, and fatality rates. We used comparative statistical analysis to evaluate the measurement errors of COVID-19 official data collections from the Chinese Center for Disease Control and Prevention (Chinese CDC), World Health Organization (WHO) and European Centre for Disease Prevention and Control (ECDC). The data is collected by using text mining techniques and reviewing pdf reports, metadata, and reference data. The combined dataset includes complete spatial data such as countries area, international number of countries, Alpha-2 code, Alpha-3 code, latitude, longitude, and some additional attributes such as population. The improved dataset benefits from major corrections on the referenced data sets and official reports such as adjustments in the reporting dates, which suffered from a one to two days lag, removing negative values, detecting unreasonable changes in historical data in new reports and corrections on systematic measurement errors, which have been increasing as the pandemic outbreak spreads and more countries contribute data for the official repositories. Additionally, the root mean square error of attributes in the paired comparison of datasets was used to identify the main data problems. The data for China is presented separately and in more detail, and it has been extracted from the attached reports available on the main page of the CCDC website. This dataset is a comprehensive and reliable source of worldwide COVID-19 data that can be used in epidemiological models assessing the magnitude and timeline for confirmed cases, long-term predictions of deaths or hospital utilization, the effects of quarantine, stay-at-home orders and other social distancing measures, the pandemic’s turning point or in economic and social impact analysis, helping to inform national and local authorities on how to implement an adaptive response approach to re-opening the economy, re-open schools, alleviate business and social distancing restrictions, design economic programs or allow sports events to resume.
T
Tanzania Source Data Assessment Of Statistical Capacity Scale 0 100
tradingeconomics.com
csv, excel, json, xml
Updated Jun 16, 2017
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
TRADING ECONOMICS (2017). Tanzania Source Data Assessment Of Statistical Capacity Scale 0 100 [Dataset]. https://tradingeconomics.com/tanzania/source-data-assessment-of-statistical-capacity-scale-0--100-wb-data.html
Explore at:
json, csv, excel, xmlAvailable download formats
Dataset updated
Jun 16, 2017
Dataset authored and provided by
TRADING ECONOMICS
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Jan 1, 1976 - Dec 31, 2025
Area covered
Tanzania
Description
Actual value and historical data chart for Tanzania Source Data Assessment Of Statistical Capacity Scale 0 100
U
Statistical Abstract of the United States, 2007
dataverse-staging.rdmc.unc.edu
Updated Oct 27, 2011
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
UNC Dataverse (2011). Statistical Abstract of the United States, 2007 [Dataset]. https://dataverse-staging.rdmc.unc.edu/dataset.xhtml?persistentId=hdl:1902.29/CD-0227
Explore at:
Dataset updated
Oct 27, 2011
Dataset provided by
UNC Dataverse
License
https://dataverse-staging.rdmc.unc.edu/api/datasets/:persistentId/versions/2.0/customlicense?persistentId=hdl:1902.29/CD-0227https://dataverse-staging.rdmc.unc.edu/api/datasets/:persistentId/versions/2.0/customlicense?persistentId=hdl:1902.29/CD-0227
Description
"The Statistical Abstract of the United States, published since 1878, is the standard summary of statistics on the social, political, and economic organization of the United States. It is designed to serve as a convenient volume for statistical reference and as a guide to other statistical publications and sources. The latter function is served by the introductory text to each section, the source note appearing below each table, and Appendix I, which comprises the Guide to Sources of Statisti cs, the Guide to State Statistical Abstracts, and the Guide to Foreign Statistical Abstracts. The Statistical Abstract sections and tables are compiled into one Adobe PDF named StatAbstract2007.pdf. This PDF is bookmarked by section and by table and can be searched using the Acrobat Search feature. The Statistical Abstract on CD-ROM is best viewed using Adobe Acrobat 5, or any subsequent version of Acrobat or Acrobat Reader. The Statistical Abstract tables and the metropolitan areas tables from Appendix II are available as Excel(.xls or .xlw) spreadsheets. In most cases, these spreadsheet files offer the user direct access to more data than are shown either in the publication or Adobe Acrobat. These files usually contain more years of data, more geographic areas, and/or more categories of subjects than those shown in the Acrobat version. The extensive selection of statistics is provided for the United States, with selected data for regions, divisions, states, metropolitan areas, cities, and foreign countries from reports and records of government and private agencies. Software on the disc can be used to perform full-text searches, view official statistics, open tables as Lotus worksheets or Excel workbooks, and link directly to source agencies and organizations for su pporting information. Except as indicated, figures are for the United States as presently constituted. Although emphasis in the Statistical Abstract is primarily given to national data, many tables present data for regions and individual states and a smaller number for metropolitan areas and cities.Statistics for the Commonwealth of Puerto Rico and for island areas of the United States are included in many state tables and are supplemented by information in Section 29. Additional information for states, cities, counties, metropolitan areas, and other small units, as well as more historical data are available in various supplements to the Abstract. Statistics in this edition are generally for the most recent year or period available by summer 2006. Each year over 1,400 tables and charts are reviewed and evaluated; new tables and charts of current interest are added, continuing series are updated, and less timely data are condensed or eliminated. Text notes and appendices are revised as appropriate. This year we have introduced 72 new tables covering a wide range of subject areas. These cover a variety of topics including: learning disability for children, people impacted by the hurricanes in the Gulf Coast area, employees with alternative work arrangements, adult computer and Internet users by selected characteristics, North America cruise industry, women- and minority-owned businesses, and the percentage of the adult population considered to be obese. Some of the annually surveyed topics are population; vital statistics; health and nutrition; education; law enforcement, courts and prison; geography and environment; elections; state and local government; federal government finances and employment; national defense and veterans affairs; social insurance and human services; labor force, employment, and earnings; income, expenditures, and wealth; prices; business enterprise; science and technology; agriculture; natural resources; energy; construction and housing; manufactures; domestic trade and services; transportation; information and communication; banking, finance, and insurance; arts, entertainment, and recreation; accommodation, food services, and other services; foreign commerce and aid; outlying areas; and comparative international statistics." Note to Users: This CD is part of a collection located in the Data Archive of the Odum Institute for Research in Social Science, at the University of North Carolina at Chapel Hill. The collection is located in Room 10, Manning Hall. Users may check the CDs out subscribing to the honor system. Items can be checked out for a period of two weeks. Loan forms are located adjacent to the collection.
🦠 COVID-19 survey of National Statistical Offices
kaggle.com
zip
Updated Sep 10, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
meer atif magsi (2023). 🦠 COVID-19 survey of National Statistical Offices [Dataset]. https://www.kaggle.com/datasets/meeratif/covid-19-survey-of-national-statistical-offices
Explore at:
zip(22535 bytes)Available download formats
Dataset updated
Sep 10, 2023
Authors
meer atif magsi
Description
Global COVID-19 surveys conducted by National Statistical Offices. This dataset has several columns that contain different types of information. Here's a brief explanation of each column:

1.**Country**: This column likely contains the names of the countries for which the survey data is collected. Each row represents data related to a specific country.

2.**Category**: This column might contain information about the type or category of the survey. It could include categories such as healthcare, economic impact, public sentiment, etc. This helps in categorizing the surveys.

3.**Title and Link**: These columns may contain the title or name of the specific survey and a link to the source or webpage where more information about the survey can be found. The link can be useful for referencing the original source of the data.

4.**Description**: This column likely contains a brief description or summary of the survey's objectives, methodology, or key findings. It provides additional context for the survey data.

5.**Source**: This column may contain information about the organization or agency that conducted the survey. It's essential for understanding the authority behind the data.

6.**Date Added**: This column probably contains the date when the survey data was added to the dataset. This helps track the freshness of the data and can be useful for historical analysis.

With this dataset, you can perform various types of analysis, including but not limited to:

Country-based analysis: You can analyze survey data for specific countries to understand the impact of COVID-19 in different regions.

Category-based analysis: You can group surveys by category and analyze trends or patterns related to healthcare, economics, or public sentiment.

Temporal analysis: You can examine how survey data has evolved over time by using the "Date Added" column to track changes and trends.

Source-based analysis: You can assess the reliability and credibility of the data by considering the source of the surveys.

Data visualization: Create visual representations like charts, graphs, and maps to make the data more understandable and informative.

Before conducting any analysis, it's essential to clean and preprocess the data, handle missing values, and ensure data consistency. Additionally, consider the research questions or insights you want to gain from the dataset, which will guide your analysis approach.
U
Statistical Abstract of the United States, 2002
dataverse-staging.rdmc.unc.edu
Updated Nov 30, 2007
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
UNC Dataverse (2007). Statistical Abstract of the United States, 2002 [Dataset]. https://dataverse-staging.rdmc.unc.edu/dataset.xhtml?persistentId=hdl:1902.29/CD-0175
Explore at:
Dataset updated
Nov 30, 2007
Dataset provided by
UNC Dataverse
License
https://dataverse-staging.rdmc.unc.edu/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=hdl:1902.29/CD-0175https://dataverse-staging.rdmc.unc.edu/api/datasets/:persistentId/versions/1.0/customlicense?persistentId=hdl:1902.29/CD-0175
Description
"The Statistical Abstract is the nation's best known and most popular single source of statistics on the social, political, and economic organization of the country. The print version has been published since 1878, and a compact disc version has been available since 1993. Both are designed to serve as a convenient, easy-to-use statistical reference source and guide to statistical publications and sources. The extensive selection of statistics is provided for the United States, with selected d ata for regions, divisions, states, metropolitan areas, cities, and foreign countries from reports and records of government and private agencies. Software on the disc can be used to perform full-text searches, view official statistics, open tables as Lotus worksheets or Excel workbooks, and link directly to source agencies and organizations for supporting information. The disc contains over 1,500 tables from over 250 different governmental, private, and international organizations. Some of the topics are population; vital statistics; health and nutrition; education; law enforcement, courts and prison; geography and environment; elections; state and local government; federal government finances and employment; national defense and veterans affairs; social insurance and human services; labor force, employment, and earnings; income, expenditures, and wealth; prices; business enterprise; science and technology; agriculture; natural resources; energy; construction and housing; manufactures; domestic trade and services; transportation; information and communication; banking, finance, and insurance; arts, entertainment, and recreation; accommodation, food services, and other services; foreign commerce and aid; outlying areas; and comparative international statistics. Significant changes in the 2002 data include new data from the 2000 census and new tables that include data covering resident population's migration status, educational attainment, disability status, ancestry, place of birth, and language spoken at home as well as househol d income, poverty, and selected housing characteristics from the sample portion of the 2000 census. New tables cover topics such as unmarried households, state children's health insurance programs, limitation of activity level caused by chronic conditions, characteristics of homeschooled children, firearm-use offenders, home- based work and flexible work by workers, computer use in the workplace, employee benefits, and computer and Internet use." Note to Users: This CD is part of a collection located in the Data Archive of the Odum Institute for Research in Social Science, at the University of North Carolina at Chapel Hill. The collection is located in Room 10, Manning Hall. Users may check the CDs out subscribing to the honor system. Items can be checked out for a period of two weeks. Loan forms are located adjacent to the collection.
d
The Water Resources Bureau statistical data indicators
data.gov.tw
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Water Resources Agency,Ministry of Economic Affairs, The Water Resources Bureau statistical data indicators [Dataset]. https://data.gov.tw/en/datasets/16936
Explore at:
Dataset authored and provided by
Water Resources Agency,Ministry of Economic Affairs
License
https://data.gov.tw/licensehttps://data.gov.tw/license
Description
Provide water resources statistics indicators.....
Online platforms consumers use to find information worldwide 2023
statista.com
Updated Apr 3, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2024). Online platforms consumers use to find information worldwide 2023 [Dataset]. https://www.statista.com/statistics/1460032/sources-information-internet-worldwide/
Explore at:
Dataset updated
Apr 3, 2024
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
Aug 2023
Area covered
Worldwide
Description
During an August 2023 survey, ** percent of responding consumers aged 16 to 65 from across the globe stated they used manual search engines, such as Google or Bing, most to find information on the internet. Social media platforms ranked second, named by ** percent of respondents.
ODM Data Analysis—A tool for the automatic validation, monitoring and...
plos.figshare.com
datasetcatalog.nlm.nih.gov
mp4
Updated May 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Tobias Johannes Brix; Philipp Bruland; Saad Sarfraz; Jan Ernsting; Philipp Neuhaus; Michael Storck; Justin Doods; Sonja Ständer; Martin Dugas (2023). ODM Data Analysis—A tool for the automatic validation, monitoring and generation of generic descriptive statistics of patient data [Dataset]. http://doi.org/10.1371/journal.pone.0199242
Explore at:
mp4Available download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0199242
Dataset updated
May 31, 2023
Dataset provided by
PLOShttp://plos.org/
Authors
Tobias Johannes Brix; Philipp Bruland; Saad Sarfraz; Jan Ernsting; Philipp Neuhaus; Michael Storck; Justin Doods; Sonja Ständer; Martin Dugas
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
IntroductionA required step for presenting results of clinical studies is the declaration of participants demographic and baseline characteristics as claimed by the FDAAA 801. The common workflow to accomplish this task is to export the clinical data from the used electronic data capture system and import it into statistical software like SAS software or IBM SPSS. This software requires trained users, who have to implement the analysis individually for each item. These expenditures may become an obstacle for small studies. Objective of this work is to design, implement and evaluate an open source application, called ODM Data Analysis, for the semi-automatic analysis of clinical study data.MethodsThe system requires clinical data in the CDISC Operational Data Model format. After uploading the file, its syntax and data type conformity of the collected data is validated. The completeness of the study data is determined and basic statistics, including illustrative charts for each item, are generated. Datasets from four clinical studies have been used to evaluate the application’s performance and functionality.ResultsThe system is implemented as an open source web application (available at https://odmanalysis.uni-muenster.de) and also provided as Docker image which enables an easy distribution and installation on local systems. Study data is only stored in the application as long as the calculations are performed which is compliant with data protection endeavors. Analysis times are below half an hour, even for larger studies with over 6000 subjects.DiscussionMedical experts have ensured the usefulness of this application to grant an overview of their collected study data for monitoring purposes and to generate descriptive statistics without further user interaction. The semi-automatic analysis has its limitations and cannot replace the complex analysis of statisticians, but it can be used as a starting point for their examination and reporting.
i
Household Health Survey 2012-2013, Economic Research Forum (ERF)...
catalog.ihsn.org
datacatalog.ihsn.org
Updated Jun 26, 2017
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Central Statistical Organization (CSO) (2017). Household Health Survey 2012-2013, Economic Research Forum (ERF) Harmonization Data - Iraq [Dataset]. https://catalog.ihsn.org/index.php/catalog/6937
Explore at:
Dataset updated
Jun 26, 2017
Dataset provided by
Economic Research Forum
Kurdistan Regional Statistics Office (KRSO)
Central Statistical Organization (CSO)
Time period covered
2012 - 2013
Area covered
Iraq
Description
Abstract

The harmonized data set on health, created and published by the ERF, is a subset of Iraq Household Socio Economic Survey (IHSES) 2012. It was derived from the household, individual and health modules, collected in the context of the above mentioned survey. The sample was then used to create a harmonized health survey, comparable with the Iraq Household Socio Economic Survey (IHSES) 2007 micro data set.

----> Overview of the Iraq Household Socio Economic Survey (IHSES) 2012:

Iraq is considered a leader in household expenditure and income surveys where the first was conducted in 1946 followed by surveys in 1954 and 1961. After the establishment of Central Statistical Organization, household expenditure and income surveys were carried out every 3-5 years in (1971/ 1972, 1976, 1979, 1984/ 1985, 1988, 1993, 2002 / 2007). Implementing the cooperation between CSO and WB, Central Statistical Organization (CSO) and Kurdistan Region Statistics Office (KRSO) launched fieldwork on IHSES on 1/1/2012. The survey was carried out over a full year covering all governorates including those in Kurdistan Region.

The survey has six main objectives. These objectives are:

Provide data for poverty analysis and measurement and monitor, evaluate and update the implementation Poverty Reduction National Strategy issued in 2009.

Provide comprehensive data system to assess household social and economic conditions and prepare the indicators related to the human development.

Provide data that meet the needs and requirements of national accounts.

Provide detailed indicators on consumption expenditure that serve making decision related to production, consumption, export and import.

Provide detailed indicators on the sources of households and individuals income.

Provide data necessary for formulation of a new consumer price index number.

The raw survey data provided by the Statistical Office were then harmonized by the Economic Research Forum, to create a comparable version with the 2006/2007 Household Socio Economic Survey in Iraq. Harmonization at this stage only included unifying variables' names, labels and some definitions. See: Iraq 2007 & 2012- Variables Mapping & Availability Matrix.pdf provided in the external resources for further information on the mapping of the original variables on the harmonized ones, in addition to more indications on the variables' availability in both survey years and relevant comments.

Geographic coverage

National coverage: Covering a sample of urban, rural and metropolitan areas in all the governorates including those in Kurdistan Region.

Analysis unit

1- Household/family. 2- Individual/person.

Universe

The survey was carried out over a full year covering all governorates including those in Kurdistan Region.

Kind of data

Sample survey data [ssd]

Sampling procedure

----> Design:

Sample size was (25488) household for the whole Iraq, 216 households for each district of 118 districts, 2832 clusters each of which includes 9 households distributed on districts and governorates for rural and urban.

----> Sample frame:

Listing and numbering results of 2009-2010 Population and Housing Survey were adopted in all the governorates including Kurdistan Region as a frame to select households, the sample was selected in two stages: Stage 1: Primary sampling unit (blocks) within each stratum (district) for urban and rural were systematically selected with probability proportional to size to reach 2832 units (cluster). Stage two: 9 households from each primary sampling unit were selected to create a cluster, thus the sample size of total survey clusters was 25488 households distributed on the governorates, 216 households in each district.

----> Sampling Stages:

In each district, the sample was selected in two stages: Stage 1: based on 2010 listing and numbering frame 24 sample points were selected within each stratum through systematic sampling with probability proportional to size, in addition to the implicit breakdown urban and rural and geographic breakdown (sub-district, quarter, street, county, village and block). Stage 2: Using households as secondary sampling units, 9 households were selected from each sample point using systematic equal probability sampling. Sampling frames of each stages can be developed based on 2010 building listing and numbering without updating household lists. In some small districts, random selection processes of primary sampling may lead to select less than 24 units therefore a sampling unit is selected more than once , the selection may reach two cluster or more from the same enumeration unit when it is necessary.

Mode of data collection

Face-to-face [f2f]

Research instrument

----> Preparation:

The questionnaire of 2006 survey was adopted in designing the questionnaire of 2012 survey on which many revisions were made. Two rounds of pre-test were carried out. Revision were made based on the feedback of field work team, World Bank consultants and others, other revisions were made before final version was implemented in a pilot survey in September 2011. After the pilot survey implemented, other revisions were made in based on the challenges and feedbacks emerged during the implementation to implement the final version in the actual survey.

----> Questionnaire Parts:

The questionnaire consists of four parts each with several sections: Part 1: Socio – Economic Data: - Section 1: Household Roster - Section 2: Emigration - Section 3: Food Rations - Section 4: housing - Section 5: education - Section 6: health - Section 7: Physical measurements - Section 8: job seeking and previous job

Part 2: Monthly, Quarterly and Annual Expenditures: - Section 9: Expenditures on Non – Food Commodities and Services (past 30 days). - Section 10 : Expenditures on Non – Food Commodities and Services (past 90 days). - Section 11: Expenditures on Non – Food Commodities and Services (past 12 months). - Section 12: Expenditures on Non-food Frequent Food Stuff and Commodities (7 days). - Section 12, Table 1: Meals Had Within the Residential Unit. - Section 12, table 2: Number of Persons Participate in the Meals within Household Expenditure Other Than its Members.

Part 3: Income and Other Data: - Section 13: Job - Section 14: paid jobs - Section 15: Agriculture, forestry and fishing - Section 16: Household non – agricultural projects - Section 17: Income from ownership and transfers - Section 18: Durable goods - Section 19: Loans, advances and subsidies - Section 20: Shocks and strategy of dealing in the households - Section 21: Time use - Section 22: Justice - Section 23: Satisfaction in life - Section 24: Food consumption during past 7 days

Part 4: Diary of Daily Expenditures: Diary of expenditure is an essential component of this survey. It is left at the household to record all the daily purchases such as expenditures on food and frequent non-food items such as gasoline, newspapers…etc. during 7 days. Two pages were allocated for recording the expenditures of each day, thus the roster will be consists of 14 pages.

Cleaning operations

----> Raw Data:

Data Editing and Processing: To ensure accuracy and consistency, the data were edited at the following stages: 1. Interviewer: Checks all answers on the household questionnaire, confirming that they are clear and correct. 2. Local Supervisor: Checks to make sure that questions has been correctly completed. 3. Statistical analysis: After exporting data files from excel to SPSS, the Statistical Analysis Unit uses program commands to identify irregular or non-logical values in addition to auditing some variables. 4. World Bank consultants in coordination with the CSO data management team: the World Bank technical consultants use additional programs in SPSS and STAT to examine and correct remaining inconsistencies within the data files. The software detects errors by analyzing questionnaire items according to the expected parameter for each variable.

----> Harmonized Data:

The SPSS package is used to harmonize the Iraq Household Socio Economic Survey (IHSES) 2007 with Iraq Household Socio Economic Survey (IHSES) 2012.

The harmonization process starts with raw data files received from the Statistical Office.

A program is generated for each dataset to create harmonized variables.

Data is saved on the household and individual level, in SPSS and then converted to STATA, to be disseminated.

Response rate

Iraq Household Socio Economic Survey (IHSES) reached a total of 25488 households. Number of households refused to response was 305, response rate was 98.6%. The highest interview rates were in Ninevah and Muthanna (100%) while the lowest rates were in Sulaimaniya (92%).
EHRI Statistical Data Mart (EHRI-SDM)
catalog.data.gov
Updated Jan 26, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Office of Personnel Management (2024). EHRI Statistical Data Mart (EHRI-SDM) [Dataset]. https://catalog.data.gov/dataset/ehri-statistical-data-mart-ehri-sdm-30a87
Explore at:
Dataset updated
Jan 26, 2024
Dataset provided by
United States Office of Personnel Managementhttps://opm.gov/
Description
The Enterprise Human Resources Integration-Statistical Data Mart (EHRI-SDM) is a statistically cleansed sub-set of the data contained in the EHRI data warehouse. It contains data about the employee and their position, along with various demographic variables
Statistical Data Analysis using R
figshare.com
txt
Updated May 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Samuel Barsanelli Costa (2023). Statistical Data Analysis using R [Dataset]. http://doi.org/10.6084/m9.figshare.5501035.v1
Explore at:
txtAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.5501035.v1
Dataset updated
May 30, 2023
Dataset provided by
Figsharehttp://figshare.com/
figshare
Authors
Samuel Barsanelli Costa
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
R Scripts contain statistical data analisys for streamflow and sediment data, including Flow Duration Curves, Double Mass Analysis, Nonlinear Regression Analysis for Suspended Sediment Rating Curves, Stationarity Tests and include several plots.
w
Social Security, Health and Related Information
data.wu.ac.at
researchdata.edu.au
+1more
xls
Updated Aug 22, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Department of Human Services (2016). Social Security, Health and Related Information [Dataset]. https://data.wu.ac.at/schema/data_gov_au/ZDc1NzNkZTgtMzI0NC00Yjg2LTgwZDctY2UxMjUwOWQxZjgw
Explore at:
xlsAvailable download formats
Dataset updated
Aug 22, 2016
Dataset provided by
Department of Human Services
License
Attribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
License information was derived automatically
Description
Agencies responsible for the administration and delivery of social security, family assistance, student assistance and related payments and programs publish a range of statistical information online. Relevant agencies include:
• Department of Human Services
• Department of Social Services
• Department of Education and Training
• Department of Health
• Department of Veterans’ Affairs.
Published statistical information generally encompasses customer population data for key payments, with a pre-defined drill-down available for relevant demographic (e.g. gender, age) and geographic (e.g. national, state/territory) characteristics.The Department of Human Services also publishes a comprehensive suite of health related statistical information.
This dataset provides a summary of links to existing sources of statistical information published on a range of government websites, including those noted above and the Australian Institute of Health and Welfare (AIHW). The AIHW provides authoritative information and statistics to promote better health and wellbeing.
Humanservices.gov.au contains information about accessing statistical information. If not available online, you can request statistical information by emailing statistics@humanservices.gov.au. In providing this service, the Department of Human Services liaises with relevant agencies as required.
Most reliable sources of data for market researchers in the U.S. 2017
statista.com
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista, Most reliable sources of data for market researchers in the U.S. 2017 [Dataset]. https://www.statista.com/statistics/917534/market-research-industry-us-most-reliable-sources-of-data/
Explore at:
Dataset authored and provided by
Statistahttp://statista.com/
Time period covered
2017
Area covered
United States
Description
This statistic displays the most reliable sources of data according to professionals in the market research industry in the United States in 2017. During the survey, 32 percent of respondents cited marketing analytics as the most reliable data source.
d
The statistical data of aquaculture area and aquaculture water usage by the...
data.gov.tw
Updated Oct 1, 2002
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Water Resources Agency,Ministry of Economic Affairs (2002). The statistical data of aquaculture area and aquaculture water usage by the Water Resources Agency [Dataset]. https://data.gov.tw/en/datasets/58700
Explore at:
Dataset updated
Oct 1, 2002
Dataset authored and provided by
Water Resources Agency,Ministry of Economic Affairs
License
https://data.gov.tw/licensehttps://data.gov.tw/license
Description
This dataset is commissioned annually to compile data from the previous year and provide a comprehensive overview of water usage for various purposes. It has long been valued by governmental and academic institutions involved in economic development and water resources, aiming to provide clear statistical data for water usage and support the formulation of water policies and resource planning. The Water Resources Agency aims to provide the public with valuable information by publishing the annual Water Usage Statistics Report, using limited data to estimate water usage and sources for resource planning and management. The data is collected at the city and county level and is used to understand actual water usage for various purposes.
A
Algeria DZ: SPI: Pillar 4 Data Sources Score: Scale 0-100
ceicdata.com
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
CEICdata.com, Algeria DZ: SPI: Pillar 4 Data Sources Score: Scale 0-100 [Dataset]. https://www.ceicdata.com/en/algeria/governance-policy-and-institutions/dz-spi-pillar-4-data-sources-score-scale-0100
Explore at:
Dataset provided by
CEICdata.com
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Dec 1, 2016 - Dec 1, 2022
Area covered
Algeria
Variables measured
Money Market Rate
Description
Algeria DZ: SPI: Pillar 4 Data Sources Score: Scale 0-100 data was reported at 45.958 NA in 2022. This records a decrease from the previous number of 49.075 NA for 2021. Algeria DZ: SPI: Pillar 4 Data Sources Score: Scale 0-100 data is updated yearly, averaging 49.892 NA from Dec 2016 (Median) to 2022, with 7 observations. The data reached an all-time high of 52.417 NA in 2018 and a record low of 45.958 NA in 2022. Algeria DZ: SPI: Pillar 4 Data Sources Score: Scale 0-100 data remains active status in CEIC and is reported by World Bank. The data is categorized under Global Database’s Algeria – Table DZ.World Bank.WDI: Governance: Policy and Institutions. The data sources overall score is a composity measure of whether countries have data available from the following sources: Censuses and surveys, administrative data, geospatial data, and private sector/citizen generated data. The data sources (input) pillar is segmented by four types of sources generated by (i) the statistical office (censuses and surveys), and sources accessed from elsewhere such as (ii) administrative data, (iii) geospatial data, and (iv) private sector data and citizen generated data. The appropriate balance between these source types will vary depending on a country’s institutional setting and the maturity of its statistical system. High scores should reflect the extent to which the sources being utilized enable the necessary statistical indicators to be generated. For example, a low score on environment statistics (in the data production pillar) may reflect a lack of use of (and low score for) geospatial data (in the data sources pillar). This type of linkage is inherent in the data cycle approach and can help highlight areas for investment required if country needs are to be met.;Statistical Performance Indicators, The World Bank (https://datacatalog.worldbank.org/dataset/statistical-performance-indicators);Weighted average;
u
Synthetic Administrative Data: Census 1991, 2023
datacatalogue.ukdataservice.ac.uk
Updated Feb 21, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shlomo, N, University of Manchester; Kim, M, University of Manchester (2024). Synthetic Administrative Data: Census 1991, 2023 [Dataset]. http://doi.org/10.5255/UKDA-SN-856310
Explore at:
Unique identifier
https://doi.org/10.5255/UKDA-SN-856310
Dataset updated
Feb 21, 2024
Authors
Shlomo, N, University of Manchester; Kim, M, University of Manchester
Area covered
United Kingdom
Description
We create a synthetic administrative dataset to be used in the development of the R package for calculating quality indicators for administrative data (see: https://github.com/sook-tusk/qualadmin) that mimic the properties of a real administrative dataset according to specifications by the ONS. Taking over 1 million records from a synthetic 1991 UK census dataset, we deleted records, moved records to a different geography and duplicated records to a different geography according to pre-specified proportions for each broad ethnic group (White, Non-white) and gender (males, females). The final size of the synthetic administrative data was 1033664 individuals.
National Statistical Institutes (NSIs) are directing resources into advancing the use of administrative data in official statistics systems. This is a top priority for the UK Office for National Statistics (ONS) as they are undergoing transformations in their statistical systems to make more use of administrative data for future censuses and population statistics. Administrative data are defined as secondary data sources since they are produced by other agencies as a result of an event or a transaction relating to administrative procedures of organisations, public administrations and government agencies. Nevertheless, they have the potential to become important data sources for the production of official statistics by significantly reducing the cost and burden of response and improving the efficiency of such systems. Embedding administrative data in statistical systems is not without costs and it is vital to understand where potential errors may arise. The Total Administrative Data Error Framework sets out all possible sources of error when using administrative data as statistical data, depending on whether it is a single data source or integrated with other data sources such as survey data. For a single administrative data, one of the main sources of error is coverage and representation to the target population of interest. This is particularly relevant when administrative data is delivered over time, such as tax data for maintaining the Business Register. For sub-project 1 of this research project, we develop quality indicators that allow the statistical agency to assess if the administrative data is representative to the target population and which sub-groups may be missing or over-covered. This is essential for producing unbiased estimates from administrative data. Another priority at statistical agencies is to produce a statistical register for population characteristic estimates, such as employment statistics, from multiple sources of administrative and survey data. Using administrative data to build a spine, survey data can be integrated using record linkage and statistical matching approaches on a set of common matching variables. This will be the topic for sub-project 2, which will be split into several topics of research. The first topic is whether adding statistical predictions and correlation structures improves the linkage and data integration. The second topic is to research a mass imputation framework for imputing missing target variables in the statistical register where the missing data may be due to multiple underlying mechanisms. Therefore, the third topic will aim to improve the mass imputation framework to mitigate against possible measurement errors, for example by adding benchmarks and other constraints into the approaches. On completion of a statistical register, estimates for key target variables at local areas can easily be aggregated. However, it is essential to also measure the precision of these estimates through mean square errors and this will be the fourth topic of the sub-project. Finally, this new way of producing official statistics is compared to the more common method of incorporating administrative data through survey weights and model-based estimation approaches. In other words, we evaluate whether it is better 'to weight' or 'to impute' for population characteristic estimates - a key question under investigation by survey statisticians in the last decade.
U
Statistical Methods in Water Resources - Supporting Materials
data.usgs.gov
catalog.data.gov
Updated Apr 7, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Robert Hirsch; Karen Ryberg; Stacey Archfield; Edward Gilroy; Dennis Helsel (2020). Statistical Methods in Water Resources - Supporting Materials [Dataset]. http://doi.org/10.5066/P9JWL6XR
Explore at:
Unique identifier
https://doi.org/10.5066/P9JWL6XR
Dataset updated
Apr 7, 2020
Dataset provided by
United States Geological Surveyhttp://www.usgs.gov/
Authors
Robert Hirsch; Karen Ryberg; Stacey Archfield; Edward Gilroy; Dennis Helsel
License
U.S. Government Workshttps://www.usa.gov/government-works
License information was derived automatically
Description
This dataset contains all of the supporting materials to accompany Helsel, D.R., Hirsch, R.M., Ryberg, K.R., Archfield, S.A., and Gilroy, E.J., 2020, Statistical methods in water resources: U.S. Geological Survey Techniques and Methods, book 4, chapter A3, 454 p., https://doi.org/10.3133/tm4a3. [Supersedes USGS Techniques of Water-Resources Investigations, book 4, chapter A3, version 1.1.]. Supplemental material (SM) for each chapter are available to re-create all examples and figures, and to solve the exercises at the end of each chapter, with relevant datasets provided in an electronic format readable by R. The SM provide (1) datasets as .Rdata files for immediate input into R, (2) datasets as .csv files for input into R or for use with other software programs, (3) R functions that are used in the textbook but not part of a published R package, (4) R scripts to produce virtually all of the figures in the book, and (5) solutions to the exercises as .html and .Rmd files. The suff ...
Data generation volume worldwide 2010-2029
statista.com
Updated Nov 19, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Data generation volume worldwide 2010-2029 [Dataset]. https://www.statista.com/statistics/871513/worldwide-data-created/
Explore at:
Dataset updated
Nov 19, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
Worldwide
Description
The total amount of data created, captured, copied, and consumed globally is forecast to increase rapidly. While it was estimated at ***** zettabytes in 2025, the forecast for 2029 stands at ***** zettabytes. Thus, global data generation will triple between 2025 and 2029. Data creation has been expanding continuously over the past decade. In 2020, the growth was higher than previously expected, caused by the increased demand due to the coronavirus (COVID-19) pandemic, as more people worked and learned from home and used home entertainment options more often.
USA Bureau of Labor Statistics
kaggle.com
zip
Updated Aug 30, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
US Bureau of Labor Statistics (2019). USA Bureau of Labor Statistics [Dataset]. https://www.kaggle.com/bls/bls
Explore at:
zip(0 bytes)Available download formats
Dataset updated
Aug 30, 2019
Dataset provided by
Bureau of Labor Statisticshttp://www.bls.gov/
Authors
US Bureau of Labor Statistics
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Context

The Bureau of Labor Statistics (BLS) is a unit of the United States Department of Labor. It is the principal fact-finding agency for the U.S. government in the broad field of labor economics and statistics and serves as a principal agency of the U.S. Federal Statistical System. The BLS is a governmental statistical agency that collects, processes, analyzes, and disseminates essential statistical data to the American public, the U.S. Congress, other Federal agencies, State and local governments, business, and labor representatives. Source: https://en.wikipedia.org/wiki/Bureau_of_Labor_Statistics

Content

Bureau of Labor Statistics including CPI (inflation), employment, unemployment, and wage data.

Update Frequency: Monthly

Querying BigQuery Tables

Fork this kernel to get started.

Acknowledgements

https://bigquery.cloud.google.com/dataset/bigquery-public-data:bls

https://cloud.google.com/bigquery/public-data/bureau-of-labor-statistics

Dataset Source: http://www.bls.gov/data/

This dataset is publicly available for anyone to use under the following terms provided by the Dataset Source - http://www.data.gov/privacy-policy#data_policy - and is provided "AS IS" without any warranty, express or implied, from Google. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.

Banner Photo by Clark Young from Unsplash.

Inspiration

What is the average annual inflation across all US Cities? What was the monthly unemployment rate (U3) in 2016? What are the top 10 hourly-waged types of work in Pittsburgh, PA for 2016?

Facebook

Twitter

Click to copy link

Link copied

Cite

Köln (2023). Statistical Data Catalog Cologne [Dataset]. https://ckan.mobidatalab.eu/dataset/statisticaldatacatalogue-coln

Statistical Data Catalog Cologne

Explore at:

http://publications.europa.eu/resource/authority/file-type/csv(307022), http://publications.europa.eu/resource/authority/file-type/csv(272780), http://publications.europa.eu/resource/authority/file-type/json, http://publications.europa.eu/resource/authority/file-type/csv(3746), http://publications.europa.eu/resource/authority/file-type/csv(3752), http://publications.europa.eu/resource/authority/file-type/csv(274184), http://publications.europa.eu/resource/authority/file-type/csv(3735), http://publications.europa.eu/resource/authority/file-type/csv(275264), http://publications.europa.eu/resource/authority/file-type/csv(5356), http://publications.europa.eu/resource/authority/file-type/csv(273265), http://publications.europa.eu/resource/authority/file-type/csv(3730), http://publications.europa.eu/resource/authority/file-type/csv(19787), http://publications.europa.eu/resource/authority/file-type/csv(273515), http://publications.europa.eu/resource/authority/file-type/csv(272571), http://publications.europa.eu/resource/authority/file-type/csv(3748), http://publications.europa.eu/resource/authority/file-type/csv(3753), http://publications.europa.eu/resource/authority/file-type/csv(271286), http://publications.europa.eu/resource/authority/file-type/csv(3754), http://publications.europa.eu/resource/authority/file-type/csv(273516), http://publications.europa.eu/resource/authority/file-type/csv(273403), http://publications.europa.eu/resource/authority/file-type/csv(3764), http://publications.europa.eu/resource/authority/file-type/csv(1215), http://publications.europa.eu/resource/authority/file-type/csv(3758)Available download formats

Dataset updated

Jul 26, 2023

Dataset provided by

Köln

License

Data licence Germany – Attribution – Version 2.0https://www.govdata.de/dl-de/by-2-0
License information was derived automatically

Description

Data from various sources are updated in the Statistical Information System of the City of Cologne. The annual statistical yearbook publishes these in tabular, graphic and cartographic form at the level of the city districts and districts. Furthermore, definitions and calculation bases are explained. Small-scale statistics at the level of the 86 districts can be obtained from the Cologne district information become. All levels of the local area structure are presented in this publication explained.

This statistical data catalogue supplements the range of small-scale data. Selected structural data can be called up here in compact tabular form at the level of the 570 statistical districts or the 86 districts. The two overviews provide information about which data is available and from which source it originates. The data itself is provided annually.

Notes:

Data sources are indicated in the summary tables. When using the data, the data license Germany - attribution - version 2.0 must be observed.
Some values cannot be given to protect statistical confidentiality. For the data sets of the Federal Employment Agency, these are values from 1 to < 10, for all further data records values from 1 to < 5. This is marked in the data by a * .
The differentiation of population figures by gender is currently made according to female and male residents. The case numbers of those who define themselves as non-binary/diverse are so low at a small-scale level that they cannot be reported for reasons of statistical confidentiality.
The determination of residents with a migration background is carried out by combination various characteristics from the resident registration procedure. The data are to be interpreted as estimates. The statistical yearbook of the city of Cologne provides further details.
The information on households comes from the household generation process. This is a statistical procedure in which residents within an address are assigned to a household as far as possible by querying certain criteria. If the procedure does not identify any connections, the allocation to single-person households takes place. The statistical yearbook of the city of Cologne provides further details.
The data set pupils* at general schools (spatial location by place of residence) is available from 2013.
The number of the statistical quarter or district is a spatial location and can be linked to the geodata (see related resource below).

Clear search

Close search

Google apps

Main menu

Statistical Data Catalog Cologne

COVID-19 Combined Data-set with Improved Measurement Errors

Tanzania Source Data Assessment Of Statistical Capacity Scale 0 100

Statistical Abstract of the United States, 2007

🦠 COVID-19 survey of National Statistical Offices

Statistical Abstract of the United States, 2002

The Water Resources Bureau statistical data indicators

Online platforms consumers use to find information worldwide 2023

ODM Data Analysis—A tool for the automatic validation, monitoring and...

Household Health Survey 2012-2013, Economic Research Forum (ERF)...

Abstract

Geographic coverage

Analysis unit

Universe

Kind of data

Sampling procedure

Mode of data collection

Research instrument

Cleaning operations

Response rate

EHRI Statistical Data Mart (EHRI-SDM)

Statistical Data Analysis using R

Social Security, Health and Related Information

Most reliable sources of data for market researchers in the U.S. 2017

The statistical data of aquaculture area and aquaculture water usage by the...

Algeria DZ: SPI: Pillar 4 Data Sources Score: Scale 0-100

Synthetic Administrative Data: Census 1991, 2023

Statistical Methods in Water Resources - Supporting Materials

Data generation volume worldwide 2010-2029

USA Bureau of Labor Statistics

Context

Content

Querying BigQuery Tables

Acknowledgements

Inspiration

Statistical Data Catalog Cologne