100+ datasets found

example 1 - time series - USD RUB 1 year data
kaggle.com
zip
Updated Sep 19, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Denis Andrikov (2024). example 1 - time series - USD RUB 1 year data [Dataset]. https://www.kaggle.com/datasets/denisandrikov/example-1-time-series-usd-rub-1-year-data
Explore at:
zip(675 bytes)Available download formats
Dataset updated
Sep 19, 2024
Authors
Denis Andrikov
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
A simple table time series for school probability and statistics. We have to learn how to investigate data: value via time. What we try to do: - mean: average is the sum of all values divided by the number of values. It is also sometimes referred to as mean. - median is the middle number, when in order. Mode is the most common number. Range is the largest number minus the smallest number. - standard deviation s a measure of how dispersed the data is in relation to the mean.
f
UC_vs_US Statistic Analysis.xlsx
figshare.com
xlsx
Updated Jul 9, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
F. (Fabiano) Dalpiaz (2020). UC_vs_US Statistic Analysis.xlsx [Dataset]. http://doi.org/10.23644/uu.12631628.v1
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.23644/uu.12631628.v1
Dataset updated
Jul 9, 2020
Dataset provided by
Utrecht University
Authors
F. (Fabiano) Dalpiaz
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Sheet 1 (Raw-Data): The raw data of the study is provided, presenting the tagging results for the used measures described in the paper. For each subject, it includes multiple columns: A. a sequential student ID B an ID that defines a random group label and the notation C. the used notation: user Story or use Cases D. the case they were assigned to: IFA, Sim, or Hos E. the subject's exam grade (total points out of 100). Empty cells mean that the subject did not take the first exam F. a categorical representation of the grade L/M/H, where H is greater or equal to 80, M is between 65 included and 80 excluded, L otherwise G. the total number of classes in the student's conceptual model H. the total number of relationships in the student's conceptual model I. the total number of classes in the expert's conceptual model J. the total number of relationships in the expert's conceptual model K-O. the total number of encountered situations of alignment, wrong representation, system-oriented, omitted, missing (see tagging scheme below) P. the researchers' judgement on how well the derivation process explanation was explained by the student: well explained (a systematic mapping that can be easily reproduced), partially explained (vague indication of the mapping ), or not present.

Tagging scheme: Aligned (AL) - A concept is represented as a class in both models, either

with the same name or using synonyms or clearly linkable names; Wrongly represented (WR) - A class in the domain expert model is incorrectly represented in the student model, either (i) via an attribute, method, or relationship rather than class, or (ii) using a generic term (e.g., user'' instead ofurban planner''); System-oriented (SO) - A class in CM-Stud that denotes a technical implementation aspect, e.g., access control. Classes that represent legacy system or the system under design (portal, simulator) are legitimate; Omitted (OM) - A class in CM-Expert that does not appear in any way in CM-Stud; Missing (MI) - A class in CM-Stud that does not appear in any way in CM-Expert.

All the calculations and information provided in the following sheets

originate from that raw data.

Sheet 2 (Descriptive-Stats): Shows a summary of statistics from the data collection,

including the number of subjects per case, per notation, per process derivation rigor category, and per exam grade category.

Sheet 3 (Size-Ratio):

The number of classes within the student model divided by the number of classes within the expert model is calculated (describing the size ratio). We provide box plots to allow a visual comparison of the shape of the distribution, its central value, and its variability for each group (by case, notation, process, and exam grade) . The primary focus in this study is on the number of classes. However, we also provided the size ratio for the number of relationships between student and expert model.

Sheet 4 (Overall):

Provides an overview of all subjects regarding the encountered situations, completeness, and correctness, respectively. Correctness is defined as the ratio of classes in a student model that is fully aligned with the classes in the corresponding expert model. It is calculated by dividing the number of aligned concepts (AL) by the sum of the number of aligned concepts (AL), omitted concepts (OM), system-oriented concepts (SO), and wrong representations (WR). Completeness on the other hand, is defined as the ratio of classes in a student model that are correctly or incorrectly represented over the number of classes in the expert model. Completeness is calculated by dividing the sum of aligned concepts (AL) and wrong representations (WR) by the sum of the number of aligned concepts (AL), wrong representations (WR) and omitted concepts (OM). The overview is complemented with general diverging stacked bar charts that illustrate correctness and completeness.

For sheet 4 as well as for the following four sheets, diverging stacked bar

charts are provided to visualize the effect of each of the independent and mediated variables. The charts are based on the relative numbers of encountered situations for each student. In addition, a "Buffer" is calculated witch solely serves the purpose of constructing the diverging stacked bar charts in Excel. Finally, at the bottom of each sheet, the significance (T-test) and effect size (Hedges' g) for both completeness and correctness are provided. Hedges' g was calculated with an online tool: https://www.psychometrica.de/effect_size.html. The independent and moderating variables can be found as follows:

Sheet 5 (By-Notation):

Model correctness and model completeness is compared by notation - UC, US.

Sheet 6 (By-Case):

Model correctness and model completeness is compared by case - SIM, HOS, IFA.

Sheet 7 (By-Process):

Model correctness and model completeness is compared by how well the derivation process is explained - well explained, partially explained, not present.

Sheet 8 (By-Grade):

Model correctness and model completeness is compared by the exam grades, converted to categorical values High, Low , and Medium.
Z
Data Set of Extracted Summary Statistics from Equipment Sensor Data
data.niaid.nih.gov
data.europa.eu
Updated Jan 26, 2021
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Martin Pleschberger (2021). Data Set of Extracted Summary Statistics from Equipment Sensor Data [Dataset]. https://data.niaid.nih.gov/resources?id=zenodo_4462776
Explore at:
Dataset updated
Jan 26, 2021
Dataset provided by
KAI - Kompetenzzentrum Automobil- und Industrieelektronik GmbH
Authors
Martin Pleschberger
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This data set was generated in accordance with the semiconductor industry and contains values of summary statistics from sensor recordings of the high-precision and high-tech production equipment. Basically, the semiconductor production consists of hundreds of process steps performing physical and chemical operations on so-called wafers, i.e. slices based on semiconductor material. In the production chain, each process equipment is equipped with several sensors recording physical parameters like gas flow, temperature, voltage, etc., resulting in so-called sensor data. Out of the sensor data, values of summary statistics are extracted. These are values like mean, standard deviation and gradients. To keep the entire production as stable as possible, these values are used to monitor the whole production in order to intervene in case of deviations.

After the production, each device on the wafer is tested in the most careful way resulting in so-called wafer test data. In some cases, suspicious patterns occur in the wafer test data potentially leading to failure. In this case the root cause must be found in the production chain. For this purpose, the given data is provided. The aim is to find correlations between the wafer test data and the values of summary statistics in order to identify the root cause.

The given data is divided into four data sets: "XTrain.csv", "YTrain.csv", "XTest.csv" and "YTest.csv". "XTrain.csv" and "XTest.csv" represent the values of summary statistics originating in the production chain separated for the purpose of training and validating a statistical model. Included are 114 observations of 77 parameters (values of summary statistics). The "YTrain.csv" and "YTest.csv" contain the corresponding wafer test data (144 observations of one parameter).
C
Statistical Data Catalog Cologne
ckan.mobidatalab.eu
Updated Jul 26, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Köln (2023). Statistical Data Catalog Cologne [Dataset]. https://ckan.mobidatalab.eu/dataset/statisticaldatacatalogue-coln
Explore at:
http://publications.europa.eu/resource/authority/file-type/csv(307022), http://publications.europa.eu/resource/authority/file-type/csv(272780), http://publications.europa.eu/resource/authority/file-type/json, http://publications.europa.eu/resource/authority/file-type/csv(3746), http://publications.europa.eu/resource/authority/file-type/csv(3752), http://publications.europa.eu/resource/authority/file-type/csv(274184), http://publications.europa.eu/resource/authority/file-type/csv(3735), http://publications.europa.eu/resource/authority/file-type/csv(275264), http://publications.europa.eu/resource/authority/file-type/csv(5356), http://publications.europa.eu/resource/authority/file-type/csv(273265), http://publications.europa.eu/resource/authority/file-type/csv(3730), http://publications.europa.eu/resource/authority/file-type/csv(19787), http://publications.europa.eu/resource/authority/file-type/csv(273515), http://publications.europa.eu/resource/authority/file-type/csv(272571), http://publications.europa.eu/resource/authority/file-type/csv(3748), http://publications.europa.eu/resource/authority/file-type/csv(3753), http://publications.europa.eu/resource/authority/file-type/csv(271286), http://publications.europa.eu/resource/authority/file-type/csv(3754), http://publications.europa.eu/resource/authority/file-type/csv(273516), http://publications.europa.eu/resource/authority/file-type/csv(273403), http://publications.europa.eu/resource/authority/file-type/csv(3764), http://publications.europa.eu/resource/authority/file-type/csv(1215), http://publications.europa.eu/resource/authority/file-type/csv(3758)Available download formats
Dataset updated
Jul 26, 2023
Dataset provided by
Köln
License
Data licence Germany – Attribution – Version 2.0https://www.govdata.de/dl-de/by-2-0
License information was derived automatically
Description
Data from various sources are updated in the Statistical Information System of the City of Cologne. The annual statistical yearbook publishes these in tabular, graphic and cartographic form at the level of the city districts and districts. Furthermore, definitions and calculation bases are explained. Small-scale statistics at the level of the 86 districts can be obtained from the Cologne district information become. All levels of the local area structure are presented in this publication explained.

This statistical data catalogue supplements the range of small-scale data. Selected structural data can be called up here in compact tabular form at the level of the 570 statistical districts or the 86 districts. The two overviews provide information about which data is available and from which source it originates. The data itself is provided annually.

Notes:

Data sources are indicated in the summary tables. When using the data, the data license Germany - attribution - version 2.0 must be observed.

Some values cannot be given to protect statistical confidentiality. For the data sets of the Federal Employment Agency, these are values from 1 to < 10, for all further data records values from 1 to < 5. This is marked in the data by a * .

The differentiation of population figures by gender is currently made according to female and male residents. The case numbers of those who define themselves as non-binary/diverse are so low at a small-scale level that they cannot be reported for reasons of statistical confidentiality.

The determination of residents with a migration background is carried out by combination various characteristics from the resident registration procedure. The data are to be interpreted as estimates. The statistical yearbook of the city of Cologne provides further details.

The information on households comes from the household generation process. This is a statistical procedure in which residents within an address are assigned to a household as far as possible by querying certain criteria. If the procedure does not identify any connections, the allocation to single-person households takes place. The statistical yearbook of the city of Cologne provides further details.

The data set pupils* at general schools (spatial location by place of residence) is available from 2013.

The number of the statistical quarter or district is a spatial location and can be linked to the geodata (see related resource below).
Economic Census: Core Statistics: US Industry Product Data
catalog.data.gov
s.cnmilf.com
Updated Jul 19, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Census Bureau (2023). Economic Census: Core Statistics: US Industry Product Data [Dataset]. https://catalog.data.gov/dataset/economic-census-core-statistics-us-industry-product-data
Explore at:
Dataset updated
Jul 19, 2023
Dataset provided by
United States Census Bureauhttp://census.gov/
Area covered
United States
Description
The Economic Census is the U.S. Government's official five-year measure of American business and the economy. It is conducted by the U.S. Census Bureau, and response is required by law. In October through December of the census year, forms are sent out to nearly 4 million businesses, including large, medium and small companies representing all U.S. locations and industries. Respondents were asked to provide a range of operational and performance data for their companies. This dataset presents company, establishments, value of shipments, value of product shipments, percentage of product shipments of the total value of shipments, and percentage of distribution of value of product shipments.
Mental Health Services for Children
kaggle.com
zip
Updated Jan 22, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
The Devastator (2023). Mental Health Services for Children [Dataset]. https://www.kaggle.com/datasets/thedevastator/mental-health-services-for-children/data
Explore at:
zip(291518 bytes)Available download formats
Dataset updated
Jan 22, 2023
Authors
The Devastator
Description
Mental Health Services for Children

Monthly Statistics from England

By data.world's Admin [source]

About this dataset

This dataset provides insight into the mental health services available to children and young people in England. The data includes all primary and secondary levels of care, as well as breakdowns by age group. Information is provided on the number of people in contact with mental health services; open ward stays; open referrals; referrals starting in reporting period; attended contacts; indirect activity; discharged from referral; missed care contacts by DNA reasons and more. With these statistics, analysts may be able to better understand the scope of mental health service usage across different age groups in England and make valuable conclusions about best practices for helping children & young people receive proper care

More Datasets

For more datasets, click here.

Featured Notebooks

🚨 Your notebook can be here! 🚨!

How to use the dataset

This guide provides information on how to use this dataset effectively.

Understanding the Columns:

Each row represents data from a specific month within a reporting period. The first thing to do is to find out what each column represents - this is explained by their titles and descriptions included at the beginning of this dataset. Note that there are primary level columns (e.g., Reporting Period, Breakdown) which provide overall context while secondary level columns (e.g., CYP01 People in contact with children and young peoples' mentally health service…) provide more detail on specific indicators of interest related to that primary level column value pair (i.e., Reporting Period X).

Exploring Data Variables:

The next step is exploring which data variables could potentially be helpful when analyzing initiatives/programs related to mental health care for children & youth in England or developing policies related to them – look through all columns included here for ones you think would be most helpful such as ‘CYP21 – Open ward stays...’ or ‘MHS07a - People with an open hospital spell…’ and note down those that have been considered necessary/relevant based on your particular situation/needs before further analyzing using software packages like Excel or SPSS etc..

Analyzing Data Values:

Now comes the time for analyzing individual values provided under each respective column – take one single numerical data element such as ‘CYP02 – People… CPA end RP’ & run through it all looking at trends over time, averages across different sections by performing calculations via software packages available like tables provided above based upon sorted hierarchies needed.. Then you can then start looking into making meaningful correlations between different pieces of information given herein by cross-referencing contexts against each other resulting if any noticeable patterns found significant enough will make informative decisions towards policy implementations & program improvement opportunities both directly concerned

Research Ideas

Using this dataset to identify key trends in mental health services usage among children and young people in England, such as the number of open ward stays and referrals received.

Using the information to develop targeted solutions on areas of need identified from the data by geographical area or age group, i.e creating campaigns or programs specifically targeting specific groups at a higher risk of experiencing mental health difficulties or engaging with specialist services.

Tracking how well these initiatives are working over time by monitoring relevant metrics such as attendance at appointments, open referrals etc to evaluate their effectiveness in improving access and engagement with mental health services for those most in need

Acknowledgements

If you use this dataset in your research, please credit the original authors. Data Source

License

License: Dataset copyright by authors - You are free to: - Share - copy and redistribute the material in any medium or format for any purpose, even commercially. - Adapt - remix, transform, and build upon the material for any purpose, even commercially. - You must: - Give appropriate credit - Provide a link to the license, and indicate if changes were made. - ShareAlike - You must distribute your contributions under the same license as the original. - ...
Forecast revenue big data market worldwide 2011-2027
statista.com
Updated Mar 15, 2018
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2018). Forecast revenue big data market worldwide 2011-2027 [Dataset]. https://www.statista.com/statistics/254266/global-big-data-market-forecast/
Explore at:
Dataset updated
Mar 15, 2018
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
Worldwide
Description
The global big data market is forecasted to grow to 103 billion U.S. dollars by 2027, more than double its expected market size in 2018. With a share of 45 percent, the software segment would become the large big data market segment by 2027. What is Big data? Big data is a term that refers to the kind of data sets that are too large or too complex for traditional data processing applications. It is defined as having one or some of the following characteristics: high volume, high velocity or high variety. Fast-growing mobile data traffic, cloud computing traffic, as well as the rapid development of technologies such as artificial intelligence (AI) and the Internet of Things (IoT) all contribute to the increasing volume and complexity of data sets. Big data analytics Advanced analytics tools, such as predictive analytics and data mining, help to extract value from the data and generate new business insights. The global big data and business analytics market was valued at 169 billion U.S. dollars in 2018 and is expected to grow to 274 billion U.S. dollars in 2022. As of November 2018, 45 percent of professionals in the market research industry reportedly used big data analytics as a research method.
d
Annual General Canvass Statistics
catalog.data.gov
s.cnmilf.com
+1more
Updated Jun 1, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(Point of Contact, Custodian) (2025). Annual General Canvass Statistics [Dataset]. https://catalog.data.gov/dataset/annual-general-canvass-statistics
Explore at:
Dataset updated
Jun 1, 2025
Dataset provided by
(Point of Contact, Custodian)
Description
This data set contains annual quantities and value for all seafood products that are landed and sold by established seafood dealers and brokers in the Southeast Region (North Carolina through Texas). These types of data, referred to as the general canvass landings statistics, have been collected by the NOAA Fisheries Service, National Marine Fisheries Service and its predecessor agency, the Bureau of Commercial Fisheries. The data are available on computer since the early 1960's. The quantities and values that are reported in this data set include the annual landings that were initiated in 1962. Beginning in 1976, the data were collected monthly. See the sections on Links for the reference to the monthly general canvass landings. The annual general canvass landings include quantities and value for all living marine species and are identified by species (usually the local or common name). These data were collected by field agents employed by the National Marine Fisheries Service or the Bureau of Commercial Fisheries and assigned to local fishing ports. The agents contacted the majority of the seafood dealers or brokers in their assigned areas and recorded the quantities and value for each species or species category from the sales receipts maintained by the seafood dealers. In addition, information on the gear and area of capture is available for most of the landings statistics in the data set. Based on their knowledge of the fishing activity in the area, the agents would estimate the type of fishing gear and area where the fishing was likely to have occurred. More detailed information on the caveats associated with these data is provided in the Characteristics, Caveats and Issues section. However, because these data are summaries, they do not contain information on the quantities of fishing effort or identifications of the fishermen or vessels that caught the fish or shellfish.
Data generation volume worldwide 2010-2029
statista.com
Updated Nov 19, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2025). Data generation volume worldwide 2010-2029 [Dataset]. https://www.statista.com/statistics/871513/worldwide-data-created/
Explore at:
Dataset updated
Nov 19, 2025
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
Worldwide
Description
The total amount of data created, captured, copied, and consumed globally is forecast to increase rapidly. While it was estimated at ***** zettabytes in 2025, the forecast for 2029 stands at ***** zettabytes. Thus, global data generation will triple between 2025 and 2029. Data creation has been expanding continuously over the past decade. In 2020, the growth was higher than previously expected, caused by the increased demand due to the coronavirus (COVID-19) pandemic, as more people worked and learned from home and used home entertainment options more often.
Data quality and methodology (TSM 2024)
gov.uk
Updated Nov 26, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Regulator of Social Housing (2024). Data quality and methodology (TSM 2024) [Dataset]. https://www.gov.uk/government/statistics/data-quality-and-methodology-tsm-2024
Explore at:
Dataset updated
Nov 26, 2024
Dataset provided by
GOV.UKhttp://gov.uk/
Authors
Regulator of Social Housing
Description
Contents

Introduction

Regulatory context

TSM collection

Statistical release methodology

Why not have your say on our statistics in 2024/25?

Introduction

This report describes the quality assurance arrangements for the registered provider (RP) Tenant Satisfaction Measures statistics, providing more detail on the regulatory and operational context for data collections which feed these statistics and the safeguards that aim to maximise data quality.

Background

The statistics we publish are based on data collected directly from local authority registered provider (LARPs) and from private registered providers (PRPs) through the Tenant Satisfaction Measures (TSM) return. We use the data collected through these returns extensively as a source of administrative data. The United Kingdom Statistics Authority (UKSA) encourages public bodies to use administrative data for statistical purposes and, as such, we publish these data.

These data are first being published in 2024, following the first collection and publication of the TSM.

Official Statistics in development status

In February 2018, the UKSA published the Code of Practice for Statistics. This sets standards for organisations producing and publishing statistics, ensuring quality, trustworthiness and value.

These statistics are drawn from our TSM data collection and are being published for the first time in 2024 as official statistics in development.

Official statistics in development are official statistics that are undergoing development. Over the next year we will review these statistics and consider areas for improvement to guidance, validations, data processing and analysis. We will also seek user feedback with a view to improving these statistics to meet user needs and to explore issues of data quality and consistency.

Change of designation name

Until September 2023, ‘official statistics in development’ were called ‘experimental statistics’. Further information can be found on the https://www.ons.gov.uk/methodology/methodologytopicsandstatisticalconcepts/guidetoofficialstatisticsindevelopment">Office for Statistics Regulation website.

User feedback

We are keen to increase the understanding of the data, including the accuracy and reliability, and the value to users. Please https://forms.office.com/e/cetNnYkHfL">complete the form or email feedback, including suggestions for improvements or queries as to the source data or processing to enquiries@rsh.gov.uk.

Publication schedule

We intend to publish these statistics in Autumn each year, with the data pre-announced in the release calendar.

All data and additional information (including a list of individuals (if any) with 24 hour pre-release access) are published on our statistics pages.

Quality assurance of administrative data

The data used in the production of these statistics are classed as administrative data. In 2015 the UKSA published a regulatory standard for the quality assurance of administrative data. As part of our compliance to the Code of Practice, and in the context of other statistics published by the UK Government and its agencies, we have determined that the statistics drawn from the TSMs are likely to be categorised as low-quality risk – medium public interest (with a requirement for basic/enhanced assurance).

The publication of these statistics can be considered as medium publi
Big data and business analytics revenue worldwide 2015-2022
statista.com
Updated Aug 17, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Statista (2021). Big data and business analytics revenue worldwide 2015-2022 [Dataset]. https://www.statista.com/statistics/551501/worldwide-big-data-business-analytics-revenue/
Explore at:
Dataset updated
Aug 17, 2021
Dataset authored and provided by
Statistahttp://statista.com/
Area covered
Worldwide
Description
The global big data and business analytics (BDA) market was valued at ***** billion U.S. dollars in 2018 and is forecast to grow to ***** billion U.S. dollars by 2021. In 2021, more than half of BDA spending will go towards services. IT services is projected to make up around ** billion U.S. dollars, and business services will account for the remainder. Big data High volume, high velocity and high variety: one or more of these characteristics is used to define big data, the kind of data sets that are too large or too complex for traditional data processing applications. Fast-growing mobile data traffic, cloud computing traffic, as well as the rapid development of technologies such as artificial intelligence (AI) and the Internet of Things (IoT) all contribute to the increasing volume and complexity of data sets. For example, connected IoT devices are projected to generate **** ZBs of data in 2025. Business analytics Advanced analytics tools, such as predictive analytics and data mining, help to extract value from the data and generate business insights. The size of the business intelligence and analytics software application market is forecast to reach around **** billion U.S. dollars in 2022. Growth in this market is driven by a focus on digital transformation, a demand for data visualization dashboards, and an increased adoption of cloud.
M
U.S. Hunger Statistics | Historical Data | Chart | 2001-2022
macrotrends.net
csv
Updated Oct 31, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
MACROTRENDS (2025). U.S. Hunger Statistics | Historical Data | Chart | 2001-2022 [Dataset]. https://www.macrotrends.net/datasets/global-metrics/countries/usa/united-states/hunger-statistics
Explore at:
csvAvailable download formats
Dataset updated
Oct 31, 2025
Dataset authored and provided by
MACROTRENDS
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
Jan 1, 2001 - Dec 31, 2022
Area covered
United States
Description
Historical dataset showing U.S. hunger statistics by year from 2001 to 2022.
Processed data and statistics
figshare.com
xlsx
Updated May 9, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sohyoung Won (2025). Processed data and statistics [Dataset]. http://doi.org/10.6084/m9.figshare.28969778.v3
Explore at:
xlsxAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.28969778.v3
Dataset updated
May 9, 2025
Dataset provided by
Figsharehttp://figshare.com/
figshare
Authors
Sohyoung Won
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Each sheet contains data and statistics generated for the research. Refer to the table legends bellow.Supplementary Table S1. Core microbiome in gut.Supplementary Table S2. Core microbiome in lung.Supplementary Table S3. Adjacency matrix of gut and lung microbiome in healthy controls.Supplementary Table S4. Adjacency matrix of gut and lung microbiome in patients with TB before treatment.Supplementary Table S5. Adjacency matrix of gut and lung microbiome in patients with TB after treatment.Supplementary Table S6. Node score of gut and lung microbiome in healthy controls.Supplementary Table S7. Node score of gut and lung microbiome in patients with TB before treatment.Supplementary Table S8. Node score of gut and lung microbiome in patients with TB after treatment.Supplementary Table S9. Significant results from differential abundance analysis of gut microbiota between HCs and patients with TB before treatment in the gut. Log fold change (logFC) represents the relative abundance in patients with TB compared to healthy controls. The logFC, p-values, and false discovery rates (FDR) were obtained from ANCOM-BC, and linear discriminant analysis (LDA) scores were calculated using LEfSe.Supplementary Table S10. Significant results from differential abundance analysis of gut microbiota between HCs and patients with TB before treatment in the lung. The logFC represents the log-transformed expression values of patients with TB relative to those of healthy controls (HC). The logFC, p-values, and false discovery rates (FDR) were obtained from ANCOM-BC, while the linear discriminant analysis (LDA) scores were derived from LEfSe.Supplementary Table S11. Significant results from differential abundance analysis of gut microbiota between patients with TB before and after treatment in the gut. The logFC represents the log-transformed expression values of patients with TB after treatment relative to those of patients with TB before treatment. The logFC, p-values, and false discovery rates (FDR) were obtained from ANCOM-BC, while the linear discriminant analysis (LDA) scores were derived from LEfSe.Supplementary Table S12. Significant results from differential abundance analysis of gut microbiota between patients with TB before and after treatment in the lung. The logFC represents the log-transformed expression values of patients with TB after treatment relative to those of patients with TB before treatment. The logFC, p-values, and false discovery rates (FDR) were obtained from ANCOM-BC, while the linear discriminant analysis (LDA) scores were derived from LEfSe.Supplementary Tables S13. Taxa clustering results in healthy controls.Supplementary Tables S14. Taxa clustering results in patients with TB before treatment.Supplementary Tables S15. Taxa clustering results in patients with TB after treatment.
Jobs created as a result of international sourcing by NACE Rev.2 activity...
ec.europa.eu
Updated Oct 10, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Eurostat (2025). Jobs created as a result of international sourcing by NACE Rev.2 activity aggregate, business function and job skill requirement - experimental statistics [Dataset]. http://doi.org/10.2908/ISS_21SOJOBN
Explore at:
json, application/vnd.sdmx.data+csv;version=1.0.0, tsv, application/vnd.sdmx.data+xml;version=3.0.0, application/vnd.sdmx.genericdata+xml;version=2.1, application/vnd.sdmx.data+csv;version=2.0.0Available download formats
Unique identifier
https://doi.org/10.2908/ISS_21SOJOBN
Dataset updated
Oct 10, 2025
Dataset authored and provided by
Eurostathttps://ec.europa.eu/eurostat
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
2021
Area covered
Poland, Latvia, Austria, Sweden, Finland, Germany, Denmark, Norway, Netherlands, Hungary
Description
The data has been collected through a survey of enterprises with at least 50 persons employed.

The data category covers a group of variables which provide relevant statistical evidence and information about factors driving international sourcing e.g. the impact on the competitiveness, motivations and perceived barriers together with possible employment consequences in the Member State.

There have been four collection rounds:

In 2007 the data has been collected on a voluntary basis for 13 countries: Germany, Czechia, Netherlands, Denmark, Spain, Ireland, Italy, Portugal, Finland, Slovenia, Sweden, the United Kingdom and Norway.

In 2011 the data has been collected on a voluntary basis for 15 countries: Belgium, Bulgaria, Denmark, Estonia, Ireland, France, Latvia, Lithuania, Netherlands, Portugal, Romania, Finland, Slovakia, Sweden and Norway.

In 2018 the data has been collected on a voluntary basis for 16 countries: Austria, Bulgaria, Denmark, Finland, Germany, Hungary, Italy, Latvia, Lithuania, Netherlands, Poland, Portugal, Romania, Slovakia, Sweden and Norway.

In 2021 the data has been collected on a voluntary basis for 17 countries: Austria, Bulgaria, Denmark, Finland, France, Germany, Hungary, Ireland, Italy, Latvia, Lithuania, Malta, Netherlands, Poland, Portugal, Sweden and Norway. French data will be published by the end of 2022.

The data focuses on the relocation of core and support business functions of enterprises in the business economy sector, from domestic to abroad and vice versa, as a result of decisions taken by the domestic enterprises.

In summary, the collected indicators are :

Number of enterprises

Number of jobs created and jobs lost as a result of international sourcing activity

Number of persons employed

The dimensions used to describe the International sourcing in the 2021 collection round are:

Economic activity

Sourcing status

Business function

Sourcing activities

Type of business partners

Country destination of international sourcing

Movement of business functions abroad

Job creation/job loss identification

Motivation factors and their level of importance

Barriers for international sourcing

In the 2021 collection round two new dimensions were include:

GVC arrangements related to enterprises purchasing and supplying goods and services abroad

COVID-19 impact on international sourcing, turnover and trade
Enterprises sourcing complete business functions abroad by NACE Rev.2...
ec.europa.eu
Updated Oct 10, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Eurostat (2025). Enterprises sourcing complete business functions abroad by NACE Rev.2 activity aggregate, type of business function and business partner - experimental statistics [Dataset]. http://doi.org/10.2908/ISS_21SOPTC
Explore at:
application/vnd.sdmx.data+csv;version=1.0.0, tsv, application/vnd.sdmx.genericdata+xml;version=2.1, application/vnd.sdmx.data+csv;version=2.0.0, json, application/vnd.sdmx.data+xml;version=3.0.0Available download formats
Unique identifier
https://doi.org/10.2908/ISS_21SOPTC
Dataset updated
Oct 10, 2025
Dataset authored and provided by
Eurostathttps://ec.europa.eu/eurostat
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Time period covered
2021
Area covered
Norway, Hungary, Finland, Germany, Netherlands, Denmark, Italy, Latvia, Lithuania, Bulgaria
Description
The data has been collected through a survey of enterprises with at least 50 persons employed.

The data category covers a group of variables which provide relevant statistical evidence and information about factors driving international sourcing e.g. the impact on the competitiveness, motivations and perceived barriers together with possible employment consequences in the Member State.

There have been four collection rounds:

In 2007 the data has been collected on a voluntary basis for 13 countries: Germany, Czechia, Netherlands, Denmark, Spain, Ireland, Italy, Portugal, Finland, Slovenia, Sweden, the United Kingdom and Norway.

In 2011 the data has been collected on a voluntary basis for 15 countries: Belgium, Bulgaria, Denmark, Estonia, Ireland, France, Latvia, Lithuania, Netherlands, Portugal, Romania, Finland, Slovakia, Sweden and Norway.

In 2018 the data has been collected on a voluntary basis for 16 countries: Austria, Bulgaria, Denmark, Finland, Germany, Hungary, Italy, Latvia, Lithuania, Netherlands, Poland, Portugal, Romania, Slovakia, Sweden and Norway.

In 2021 the data has been collected on a voluntary basis for 17 countries: Austria, Bulgaria, Denmark, Finland, France, Germany, Hungary, Ireland, Italy, Latvia, Lithuania, Malta, Netherlands, Poland, Portugal, Sweden and Norway. French data will be published by the end of 2022.

The data focuses on the relocation of core and support business functions of enterprises in the business economy sector, from domestic to abroad and vice versa, as a result of decisions taken by the domestic enterprises.

In summary, the collected indicators are :

Number of enterprises

Number of jobs created and jobs lost as a result of international sourcing activity

Number of persons employed

The dimensions used to describe the International sourcing in the 2021 collection round are:

Economic activity

Sourcing status

Business function

Sourcing activities

Type of business partners

Country destination of international sourcing

Movement of business functions abroad

Job creation/job loss identification

Motivation factors and their level of importance

Barriers for international sourcing

In the 2021 collection round two new dimensions were include:

GVC arrangements related to enterprises purchasing and supplying goods and services abroad

COVID-19 impact on international sourcing, turnover and trade
m
Data from: Probability waves: adaptive cluster-based correction by...
data.mendeley.com
narcis.nl
Updated Feb 8, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
DIMITRI ABRAMOV (2021). Probability waves: adaptive cluster-based correction by convolution of p-value series from mass univariate analysis [Dataset]. http://doi.org/10.17632/rrm4rkr3xn.1
Explore at:
Unique identifier
https://doi.org/10.17632/rrm4rkr3xn.1
Dataset updated
Feb 8, 2021
Authors
DIMITRI ABRAMOV
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
dataset and Octave/MatLab codes/scripts for data analysis Background: Methods for p-value correction are criticized for either increasing Type II error or improperly reducing Type I error. This problem is worse when dealing with thousands or even hundreds of paired comparisons between waves or images which are performed point-to-point. This text considers patterns in probability vectors resulting from multiple point-to-point comparisons between two event-related potentials (ERP) waves (mass univariate analysis) to correct p-values, where clusters of signiticant p-values may indicate true H0 rejection. New method: We used ERP data from normal subjects and other ones with attention deficit hyperactivity disorder (ADHD) under a cued forced two-choice test to study attention. The decimal logarithm of the p-vector (p') was convolved with a Gaussian window whose length was set as the shortest lag above which autocorrelation of each ERP wave may be assumed to have vanished. To verify the reliability of the present correction method, we realized Monte-Carlo simulations (MC) to (1) evaluate confidence intervals of rejected and non-rejected areas of our data, (2) to evaluate differences between corrected and uncorrected p-vectors or simulated ones in terms of distribution of significant p-values, and (3) to empirically verify rate of type-I error (comparing 10,000 pairs of mixed samples whit control and ADHD subjects). Results: the present method reduced the range of p'-values that did not show covariance with neighbors (type I and also type-II errors). The differences between simulation or raw p-vector and corrected p-vectors were, respectively, minimal and maximal for window length set by autocorrelation in p-vector convolution. Comparison with existing methods: Our method was less conservative while FDR methods rejected basically all significant p-values for Pz and O2 channels. The MC simulations, gold-standard method for error correction, presented 2.78±4.83% of difference (all 20 channels) from p-vector after correction, while difference between raw and corrected p-vector was 5,96±5.00% (p = 0.0003). Conclusion: As a cluster-based correction, the present new method seems to be biological and statistically suitable to correct p-values in mass univariate analysis of ERP waves, which adopts adaptive parameters to set correction.
M
Taiwan Hunger Statistics | Historical Data | Chart | N/A-N/A
macrotrends.net
csv
Updated Oct 31, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
MACROTRENDS (2025). Taiwan Hunger Statistics | Historical Data | Chart | N/A-N/A [Dataset]. https://www.macrotrends.net/datasets/global-metrics/countries/twn/taiwan/hunger-statistics
Explore at:
csvAvailable download formats
Dataset updated
Oct 31, 2025
Dataset authored and provided by
MACROTRENDS
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Taiwan
Description
Historical dataset showing Taiwan hunger statistics by year from N/A to N/A.
d
Data from: The reproducibility of research and the misinterpretation of...
search.dataone.org
data.niaid.nih.gov
+1more
Updated Apr 6, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
David Colquhoun (2025). The reproducibility of research and the misinterpretation of p-values [Dataset]. http://doi.org/10.5061/dryad.j313j
Explore at:
Unique identifier
https://doi.org/10.5061/dryad.j313j
Dataset updated
Apr 6, 2025
Dataset provided by
Dryad Digital Repository
Authors
David Colquhoun
Time period covered
Nov 2, 2017
Description
We wish to answer this question: If you observe a â€˜significantâ€™ p-value after doing a single unbiased experiment, what is the probability that your result is a false positive? The weak evidence provided by p-values between 0.01 and 0.05 is explored by exact calculations of false positive risks. When you observe pâ€‰=â€‰0.05, the odds in favour of there being a real effect (given by the likelihood ratio) are about 3â€‰:â€‰1. This is far weaker evidence than the odds of 19 to 1 that might, wrongly, be inferred from the p-value. And if you want to limit the false positive risk to 5%, you would have to assume that you were 87% sure that there was a real effect before the experiment was done. If you observe pâ€‰=â€‰0.001 in a well-powered experiment, it gives a likelihood ratio of almost 100â€‰:â€‰1 odds on there being a real effect. That would usually be regarded as conclusive. But the false positive risk would still be 8% if the prior probability of a real effect were only 0.1. And, in this case, if you w...
Business registration and bankruptcy index by NACE Rev.2 activity -...
ec.europa.eu
Updated Nov 14, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Eurostat (2025). Business registration and bankruptcy index by NACE Rev.2 activity - quarterly data [Dataset]. http://doi.org/10.2908/STS_RB_Q
Explore at:
json, tsv, application/vnd.sdmx.data+xml;version=3.0.0, application/vnd.sdmx.data+csv;version=1.0.0, application/vnd.sdmx.data+csv;version=2.0.0, application/vnd.sdmx.genericdata+xml;version=2.1Available download formats
Unique identifier
https://doi.org/10.2908/STS_RB_Q
Dataset updated
Nov 14, 2025
Dataset authored and provided by
Eurostathttps://ec.europa.eu/eurostat
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Montenegro, Euro area - 19 countries (2015-2022), European Union, Luxembourg, Albania, Hungary, Serbia, Estonia, Finland, Greece
Description
Short-term business statistics (STS) give information on a wide range of economic activities. All STS data are index data. Additionally, annual absolute values are released for building permits indicators. Percentage changes are also available for each indicator: Infra-annual percentage changes - changes between two consecutive months or quarters - are calculated on the basis of non-adjusted data (prices) or calendar and seasonally adjusted data (volume and value indicators) and year-on-year changes - comparing a period to the same period one year ago - are calculated on the basis of non-adjusted data (prices and employment) or calendar adjusted data (volume and value indicators).

The index data are generally presented in the following forms:

unadjusted

calendar adjusted

calendar and seasonally adjusted.

Depending on the EBS Regulation data are accessible as monthly, quarterly and annual data.

The STS indicators are listed below in five different sectors, reflecting the dissemination of these data in Eurostat’s online database “Eurobase”.

Based on the national data, Eurostat compiles short-term indicators for the EU and euro area. Among these, a list of indicators, called Principal European Economic Indicators (PEEIs) has been identified by key users as being of primary importance for the conduct of monetary and economic policy of the euro area. The PEEIs contributed by STS are marked with * in the text below.

The euro indicators are released through Eurostat's website.

INDUSTRY

production (volume)*

turnover: Total, Domestic market and Non-domestic market. A further breakdown of the non-domestic turnover into euro area and non-euro area is available for the euro area countries.

producer prices (output prices)*: Total, Domestic market and Non-domestic market. A further breakdown of the non-domestic producer prices into euro area and non-euro area is available for the euro area countries.

import prices*: Total, euro area market, Non euro area market (euro area countries only)

labour input indicators: Number of employees and self-employed persons, Hours worked by employees, Gross wages and salaries

CONSTRUCTION

production (volume)*

building permits indicators*: number of dwellings, square meters of useful floor

producer (output) prices in construction (if not available, they can be approximated by the construction costs variable)

labour input indicators: number of employees and self-employed persons, hours worked by employees, gross wages and salaries

TRADE

volume of sales (deflated turnover)*

turnover (value)

labour input indicators: number of employees and self-employed persons, hours worked by employees, gross wages and salaries

SERVICES

production (volume)*

turnover (in value)

labour input indicators: number of employees and self-employed persons, hours worked by employees, gross wages and salaries

producer prices (output prices)*

MARKET ECONOMY

total market production (volume)

registrations

bankruptcies

National reference metadata of the reporting countries are available in the Annexes to this metadata file.
Principal Statistics Of Manufacturing By Industry Cluster - Value Added,...
data.gov.sg
Updated Nov 12, 2025
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Singapore Department of Statistics (2025). Principal Statistics Of Manufacturing By Industry Cluster - Value Added, Annual [Dataset]. https://data.gov.sg/datasets/d_5300314b2a47151f85b1786e449e860c/view
Explore at:
Dataset updated
Nov 12, 2025
Dataset authored and provided by
Singapore Department of Statistics
License
https://data.gov.sg/open-data-licencehttps://data.gov.sg/open-data-licence
Time period covered
Jan 2000 - Dec 2024
Description
Dataset from Singapore Department of Statistics. For more information, visit https://data.gov.sg/datasets/d_5300314b2a47151f85b1786e449e860c/view

Facebook

Twitter

Click to copy link

Link copied

Cite

Denis Andrikov (2024). example 1 - time series - USD RUB 1 year data [Dataset]. https://www.kaggle.com/datasets/denisandrikov/example-1-time-series-usd-rub-1-year-data

example 1 - time series - USD RUB 1 year data

examples of statistical analysis for "NV SCHOOL"

Explore at:

zip(675 bytes)Available download formats

Dataset updated

Sep 19, 2024

Authors

Denis Andrikov

License

MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically

Description

A simple table time series for school probability and statistics. We have to learn how to investigate data: value via time. What we try to do: - mean: average is the sum of all values divided by the number of values. It is also sometimes referred to as mean. - median is the middle number, when in order. Mode is the most common number. Range is the largest number minus the smallest number. - standard deviation s a measure of how dispersed the data is in relation to the mean.

Clear search

Close search

Google apps

Main menu

example 1 - time series - USD RUB 1 year data

UC_vs_US Statistic Analysis.xlsx

Data Set of Extracted Summary Statistics from Equipment Sensor Data

Statistical Data Catalog Cologne

Economic Census: Core Statistics: US Industry Product Data

Mental Health Services for Children

Mental Health Services for Children

Monthly Statistics from England

About this dataset

More Datasets

Featured Notebooks

How to use the dataset

Understanding the Columns:

Exploring Data Variables:

Analyzing Data Values:

Research Ideas

Acknowledgements

License

Forecast revenue big data market worldwide 2011-2027

Annual General Canvass Statistics

Data generation volume worldwide 2010-2029

Data quality and methodology (TSM 2024)

Contents

Introduction

Background

Official Statistics in development status

Change of designation name

User feedback

Publication schedule

Quality assurance of administrative data

Big data and business analytics revenue worldwide 2015-2022

U.S. Hunger Statistics | Historical Data | Chart | 2001-2022

Processed data and statistics

Jobs created as a result of international sourcing by NACE Rev.2 activity...

Enterprises sourcing complete business functions abroad by NACE Rev.2...

Data from: Probability waves: adaptive cluster-based correction by...

Taiwan Hunger Statistics | Historical Data | Chart | N/A-N/A

Data from: The reproducibility of research and the misinterpretation of...

Business registration and bankruptcy index by NACE Rev.2 activity -...

Principal Statistics Of Manufacturing By Industry Cluster - Value Added,...

example 1 - time series - USD RUB 1 year data

examples of statistical analysis for "NV SCHOOL"