77 datasets found
  1. Youth Tobacco Dataset (2 Decades)

    • kaggle.com
    Updated Jun 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sahir Maharaj (2024). Youth Tobacco Dataset (2 Decades) [Dataset]. https://www.kaggle.com/datasets/sahirmaharajj/youth-tobacco-survey
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 23, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Sahir Maharaj
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    This dataset was developed to provide states with comprehensive data on both middle school and high school students regarding tobacco use, exposure to environmental tobacco smoke, smoking cessation, school curriculum, minors' ability to purchase or otherwise obtain tobacco products, knowledge and attitudes about tobacco, and familiarity with pro-tobacco and anti-tobacco media messages. The dataset uses a two-stage cluster sample design to produce representative samples of students in middle schools (grades 6–8) and high schools (grades 9–12)

    This dataset is valuable for data science due to its coverage of youth tobacco use over nearly two decades. Its rich demographic details and broad geographical spread enable researchers and policymakers to identify trends, behaviors, and risk factors associated with tobacco use among the youth.

    For instance, it can help in understanding how tobacco use prevalence varies across different age groups, genders, races, and educational backgrounds. The stratification of data by location and demographic characteristics allows for targeted analysis that can inform public health strategies and educational campaigns aimed at reducing tobacco use among young people.

    Some analysis of this dataset can include:

    • Statistical assessments of tobacco use trends, examining changes in attitudes towards tobacco, and identifying high-risk groups based on demographic characteristics.
    • Performing time-series analyses to understand how tobacco use has evolved over the years or spatial analyses to identify geographical variations in tobacco use trends.
    • Correlation studies can help uncover associations between tobacco use and factors like education levels, race, and gender.
    • Advanced machine learning models could predict future trends in youth tobacco use or evaluate the potential impact of new tobacco control measures.
  2. w

    Fire statistics data tables

    • gov.uk
    • s3.amazonaws.com
    Updated Jul 10, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ministry of Housing, Communities and Local Government (2025). Fire statistics data tables [Dataset]. https://www.gov.uk/government/statistical-data-sets/fire-statistics-data-tables
    Explore at:
    Dataset updated
    Jul 10, 2025
    Dataset provided by
    GOV.UK
    Authors
    Ministry of Housing, Communities and Local Government
    Description

    On 1 April 2025 responsibility for fire and rescue transferred from the Home Office to the Ministry of Housing, Communities and Local Government.

    This information covers fires, false alarms and other incidents attended by fire crews, and the statistics include the numbers of incidents, fires, fatalities and casualties as well as information on response times to fires. The Ministry of Housing, Communities and Local Government (MHCLG) also collect information on the workforce, fire prevention work, health and safety and firefighter pensions. All data tables on fire statistics are below.

    MHCLG has responsibility for fire services in England. The vast majority of data tables produced by the Ministry of Housing, Communities and Local Government are for England but some (0101, 0103, 0201, 0501, 1401) tables are for Great Britain split by nation. In the past the Department for Communities and Local Government (who previously had responsibility for fire services in England) produced data tables for Great Britain and at times the UK. Similar information for devolved administrations are available at https://www.firescotland.gov.uk/about/statistics/" class="govuk-link">Scotland: Fire and Rescue Statistics, https://statswales.gov.wales/Catalogue/Community-Safety-and-Social-Inclusion/Community-Safety" class="govuk-link">Wales: Community safety and https://www.nifrs.org/home/about-us/publications/" class="govuk-link">Northern Ireland: Fire and Rescue Statistics.

    If you use assistive technology (for example, a screen reader) and need a version of any of these documents in a more accessible format, please email alternativeformats@communities.gov.uk. Please tell us what format you need. It will help us if you say what assistive technology you use.

    Related content

    Fire statistics guidance
    Fire statistics incident level datasets

    Incidents attended

    https://assets.publishing.service.gov.uk/media/686d2aa22557debd867cbe14/FIRE0101.xlsx">FIRE0101: Incidents attended by fire and rescue services by nation and population (MS Excel Spreadsheet, 153 KB) Previous FIRE0101 tables

    https://assets.publishing.service.gov.uk/media/686d2ab52557debd867cbe15/FIRE0102.xlsx">FIRE0102: Incidents attended by fire and rescue services in England, by incident type and fire and rescue authority (MS Excel Spreadsheet, 2.19 MB) Previous FIRE0102 tables

    https://assets.publishing.service.gov.uk/media/686d2aca10d550c668de3c69/FIRE0103.xlsx">FIRE0103: Fires attended by fire and rescue services by nation and population (MS Excel Spreadsheet, 201 KB) Previous FIRE0103 tables

    https://assets.publishing.service.gov.uk/media/686d2ad92557debd867cbe16/FIRE0104.xlsx">FIRE0104: Fire false alarms by reason for false alarm, England (MS Excel Spreadsheet, 492 KB) Previous FIRE0104 tables

    Dwelling fires attended

    https://assets.publishing.service.gov.uk/media/686d2af42cfe301b5fb6789f/FIRE0201.xlsx">FIRE0201: Dwelling fires attended by fire and rescue services by motive, population and nation (MS Excel Spreadsheet, <span class="gem-c-attac

  3. w

    Adult Smoking Prevalence

    • data.wu.ac.at
    • data.europa.eu
    csv, html
    Updated Nov 11, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Lincolnshire County Council (2017). Adult Smoking Prevalence [Dataset]. https://data.wu.ac.at/schema/data_gov_uk/ODljNmUzYzMtNDljYy00ODA3LWFjYTgtMjY0OWMwMDJjOTgz
    Explore at:
    csv, htmlAvailable download formats
    Dataset updated
    Nov 11, 2017
    Dataset provided by
    Lincolnshire County Council
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Description

    This data shows the percentage of adults (age 18 and over) who are current smokers.

    Smoking is the single biggest cause of preventable death and illnesses, and big inequalities exist between and within communities. Smoking is a major risk factor for many diseases, such as lung cancer, chronic obstructive pulmonary disease (COPD, bronchitis and emphysema) and heart disease. It is also associated with cancers in other organs.

    Smoking is a modifiable lifestyle risk factor. Preventing people from starting smoking is important in reducing the health harms and inequalities.

    This data is based on the Office for National Statistics (ONS) Annual Population Survey (APS). In this dataset particularly at district level there may be inherent statistical uncertainty in some data values. Thus as with many other datasets, this data should be used together with other data and resources to obtain a fuller picture.

    Data source: Public Health England, Public Health Outcomes Framework (PHOF) indicator 2.14. This data is updated annually.

  4. Proportion of Adults Who Are Current Smokers (LGHC Indicator)

    • data.chhs.ca.gov
    • data.ca.gov
    • +2more
    chart, csv, xlsx, zip
    Updated Aug 29, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    California Department of Public Health (2024). Proportion of Adults Who Are Current Smokers (LGHC Indicator) [Dataset]. https://data.chhs.ca.gov/dataset/proportion-of-adults-who-are-current-smokers-lghc-indicator-19
    Explore at:
    chart, csv(8316), xlsx(17389), zipAvailable download formats
    Dataset updated
    Aug 29, 2024
    Dataset authored and provided by
    California Department of Public Healthhttps://www.cdph.ca.gov/
    Description

    This is a source dataset for a Let's Get Healthy California indicator at https://letsgethealthy.ca.gov/. Adult smoking prevalence in California, males and females aged 18+, starting in 2012. Caution must be used when comparing the percentages of smokers over time as the definition of ‘current smoker’ was broadened in 1996, and the survey methods were changed in 2012. Current cigarette smoking is defined as having smoked at least 100 cigarettes in lifetime and now smoking every day or some days. Due to the methodology change in 2012, the Centers for Disease Control and Prevention (CDC) recommend not conducting analyses where estimates from 1984 – 2011 are compared with analyses using the new methodology, beginning in 2012. This includes analyses examining trends and changes over time. (For more information, please see the narrative description.) The California Behavioral Risk Factor Surveillance System (BRFSS) is an on-going telephone survey of randomly selected adults, which collects information on a wide variety of health-related behaviors and preventive health practices related to the leading causes of death and disability such as cardiovascular disease, cancer, diabetes and injuries. Data are collected monthly from a random sample of the California population aged 18 years and older. The BRFSS is conducted by Public Health Survey Research Program of California State University, Sacramento under contract from CDPH. The survey has been conducted since 1984 by the California Department of Public Health in collaboration with the Centers for Disease Control and Prevention (CDC). In 2012, the survey methodology of the California BRFSS changed significantly so that the survey would be more representative of the general population. Several changes were implemented: 1) the survey became dual-frame, with both cell and landline random-digit dial components, 2) residents of college housing were eligible to complete the BRFSS, and 3) raking or iterative proportional fitting was used to calculate the survey weights. Due to these changes, estimates from 1984 – 2011 are not comparable to estimates from 2012 and beyond. Center for Disease Control and Policy (CDC) and recommend not conducting analyses where estimates from 1984 – 2011 are compared with analyses using the new methodology, beginning in 2012. This includes analyses examining trends and changes over time.Current cigarette smoking was defined as having smoked at least 100 cigarettes in lifetime and now smoking every day or some days. Prior to 1996, the definition of current cigarettes smoking was having smoked at least 100 cigarettes in lifetime and smoking now.

  5. f

    Data from: Differential impact of smoking on cardiac or non-cardiac death...

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    Updated Oct 30, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Youn, Tae-Jin; Park, Jin Joo; Choi, Wonsuk; Kim, Sun-Hwa; Kang, Si-Hyuck; Chae, In-Ho; Yoon, Chang-Hwan (2019). Differential impact of smoking on cardiac or non-cardiac death according to age [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000087736
    Explore at:
    Dataset updated
    Oct 30, 2019
    Authors
    Youn, Tae-Jin; Park, Jin Joo; Choi, Wonsuk; Kim, Sun-Hwa; Kang, Si-Hyuck; Chae, In-Ho; Yoon, Chang-Hwan
    Description

    Tobacco smoking causes cardiovascular diseases, lung disease, and various cancers. Understanding the population-based characteristics associated with smoking and the cause of death is important to improve survival. This study sought to evaluate the differential impact of smoking on cardiac or non-cardiac death according to age. Data from 514,866 healthy adults who underwent national health screening in South Korea were analyzed. The participants were divided into three groups: never-smoker, ex-smoker or current smoker according to the smoking status. The incidence rates and hazard ratios (HRs) of cardiac or non-cardiac deaths according to smoking status and age groups during the 10-year follow-up were calculated to evaluate the differential risk of smoking. Over the follow-up period, 6,192 and 24,443 cardiac and non-cardiac deaths had occurred, respectively. The estimated incidence rate of cardiac and non-cardiac death gradually increased in older age groups and was higher in current smokers and ex-smokers than that in never-smokers among all age groups. After adjustment of covariates, the HRs for cardiac death of current smokers compared to never-smokers were the highest in individuals in their 40’s (1.82; 95% CI, 1.45–2.28); this gradually decreased to 0.96 (95% CI, 0.67–1.38) in individuals >80 years. In contrast, the HRs for non-cardiac death peaked in individuals in their 50’s, (HR 1.69, 95% CI 1.57–1.82) and was sustained in those >80 years (HR 1.40, 95% CI 1.17–1.69). Ex-smokers did not show elevated risk of cardiac death compared to never-smokers in any age group, whereas they showed significantly higher risk of non-cardiac death in their 60’s and 70’s (HR, 1.29; 95% CI, 1.19–1.39; HR 1.22, 95% CI, 1.12–1.32, respectively). Acute myocardial infarction and lung cancer showed patterns similar to those of cardiac and non-cardiac death, respectively. Smoking was associated with higher relative risk of cardiac death in the middle-aged group and non-cardiac death in the older age group. Ex-smokers in the older age group had elevated risk of non-cardiac death. To prevent early cardiac death and late non-cardiac death, smoking cessation should be emphasized as early as possible.

  6. e

    Smoking Indicators, Borough

    • data.europa.eu
    • data.wu.ac.at
    unknown
    Updated Sep 24, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Department of Health, and Office for National Statistics (2021). Smoking Indicators, Borough [Dataset]. https://data.europa.eu/data/datasets/smoking-indicators-borough
    Explore at:
    unknownAvailable download formats
    Dataset updated
    Sep 24, 2021
    Dataset authored and provided by
    Department of Health, and Office for National Statistics
    Description

    This dataset contains three smoking related indicators.

    Rates of self reported four-week smoking quitters

    Smoking quit rates per 100,000 available from the HNA.

    - These quarterly reports present provisional results from the monitoring of the NHS Stop Smoking Services (NHS SSS) in England. This report includes information on the number of people setting a quit date and the number who successfully quit at the 4 week follow-up. Data for London presented with England comparator. PCT level data available from NHS.

    Number of Deaths Attributable to Smoking per 100,000 population by borough

    Deaths attributable to smoking, directly age-sex standardised rate for persons aged 35 years +. Causes of death considered to be related to smoking are: various cancers, cardiovascular and respiratory diseases, and diseases of the digestive system.

    Numbers of adults smoking by borough

    Prevalence of smoking among persons aged 18 years and over.
    - Population who currently smoke, are ex-smokers, or never smoked by borough. This includes cigarette, cigar or pipe smokers. Data by age is also provided for London with a UK comparator.

    Relevant links: http://www.hscic.gov.uk/Article/1685

    http://www.apho.org.uk/default.aspx?QN=HP_DATATABLES

  7. Deaths from Respiratory Disease - Datasets - Lincolnshire Open Data

    • lincolnshire.ckan.io
    Updated May 11, 2017
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    lincolnshire.ckan.io (2017). Deaths from Respiratory Disease - Datasets - Lincolnshire Open Data [Dataset]. https://lincolnshire.ckan.io/dataset/deaths-from-respiratory-disease
    Explore at:
    Dataset updated
    May 11, 2017
    Dataset provided by
    CKANhttps://ckan.org/
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Description

    This data shows premature deaths (Age under 75) from Respiratory Disease, numbers and rates by gender, as 3-year range. Smoking is the major cause of chronic obstructive pulmonary disease (COPD), one of the major Respiratory diseases. COPD (which includes chronic bronchitis and emphysema) results in many hospital admissions. Respiratory diseases can also be caused by environmental factors (such as pollution, or housing conditions) and influenza. Respiratory disease mortality rates show a socio-economic gradient. Directly Age-Standardised Rates (DASR) are shown in the data, where numbers are sufficient, so that death rates can be directly compared between areas. The DASR calculation applies Age-specific rates to a Standard (European) population to cancel out possible effects on crude rates due to different age structures among populations, thus enabling direct comparisons of rates. A limitation on using mortalities as a proxy for prevalence of health conditions is that mortalities may give an incomplete view of health conditions in an area, as ill-health might not lead to premature death. Data source: Office for Health Improvement and Disparities (OHID) Public Health Outcomes Framework (PHOF) indicator 4.07i. This data is updated annually.

  8. Adult Smoking Prevalence - Datasets - Lincolnshire Open Data

    • lincolnshire.ckan.io
    Updated May 23, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    lincolnshire.ckan.io (2017). Adult Smoking Prevalence - Datasets - Lincolnshire Open Data [Dataset]. https://lincolnshire.ckan.io/dataset/adult-smoking-prevalence
    Explore at:
    Dataset updated
    May 23, 2017
    Dataset provided by
    CKANhttps://ckan.org/
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Area covered
    Lincolnshire
    Description

    This data shows the percentage of adults (age 18 and over) who are current smokers. Smoking is the single biggest cause of preventable death and illnesses, and big inequalities exist between and within communities. Smoking is a major risk factor for many diseases, such as lung cancer, chronic obstructive pulmonary disease (COPD, bronchitis and emphysema) and heart disease. It is also associated with cancers in other organs. Smoking is a modifiable lifestyle risk factor. Preventing people from starting smoking is important in reducing the health harms and inequalities. This data is based on the Office for National Statistics (ONS) Annual Population Survey (APS). The percentage of adults is not age-standardised. In this dataset particularly at district level there may be inherent statistical uncertainty in some data values. Thus as with many other datasets, this data should be used together with other data and resources to obtain a fuller picture. Data source: Office for Health Improvement and Disparities (OHID) Public Health Outcomes Framework (PHOF) indicator 92443 (Number 15). This data is updated annually.

  9. Predicting Heart Failure

    • kaggle.com
    Updated Sep 13, 2022
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aman Chauhan (2022). Predicting Heart Failure [Dataset]. https://www.kaggle.com/datasets/whenamancodes/heart-failure-clinical-records
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 13, 2022
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Aman Chauhan
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Cardiovascular diseases (CVDs) are the number 1 cause of death globally, taking an estimated 17.9 million lives each year, which accounts for 31% of all deaths worlwide. Heart failure is a common event caused by CVDs and this dataset contains 12 features that can be used to predict mortality by heart failure.

    Most cardiovascular diseases can be prevented by addressing behavioural risk factors such as tobacco use, unhealthy diet and obesity, physical inactivity and harmful use of alcohol using population-wide strategies.

    People with cardiovascular disease or who are at high cardiovascular risk (due to the presence of one or more risk factors such as hypertension, diabetes, hyperlipidaemia or already established disease) need early detection and management wherein a machine learning model can be of great help.

    Attribute Information:

    Thirteen (13) clinical features: - age: age of the patient (years) - anaemia: decrease of red blood cells or hemoglobin (boolean) - high blood pressure: if the patient has hypertension (boolean) - creatinine phosphokinase (CPK): level of the CPK enzyme in the blood (mcg/L) - diabetes: if the patient has diabetes (boolean) - ejection fraction: percentage of blood leaving the heart at each contraction (percentage) - platelets: platelets in the blood (kiloplatelets/mL) - sex: woman or man (binary) - serum creatinine: level of serum creatinine in the blood (mg/dL) - serum sodium: level of serum sodium in the blood (mEq/L) - smoking: if the patient smokes or not (boolean) - time: follow-up period (days) - [target] death event: if the patient deceased during the follow-up period (boolean)

    More - Find More Exciting🙀 Datasets Here - An Upvote👍 A Dayᕙ(`▿´)ᕗ , Keeps Aman Hurray Hurray..... ٩(˘◡˘)۶Haha

  10. Adult smoking habits in Great Britain

    • ons.gov.uk
    • cy.ons.gov.uk
    xlsx
    Updated Oct 1, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Office for National Statistics (2024). Adult smoking habits in Great Britain [Dataset]. https://www.ons.gov.uk/peoplepopulationandcommunity/healthandsocialcare/drugusealcoholandsmoking/datasets/adultsmokinghabitsingreatbritain
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Oct 1, 2024
    Dataset provided by
    Office for National Statisticshttp://www.ons.gov.uk/
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Area covered
    United Kingdom
    Description

    Annual data on the proportion of adults in Great Britain who smoke cigarettes, cigarette consumption, the proportion who have never smoked cigarettes and the proportion of smokers who have quit by sex and age over time.

  11. A

    ‘COVID-19 State Data’ analyzed by Analyst-2

    • analyst-2.ai
    Updated Mar 31, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2020). ‘COVID-19 State Data’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-covid-19-state-data-85fa/4a8c7dec/?iid=002-627&v=presentation
    Explore at:
    Dataset updated
    Mar 31, 2020
    Dataset authored and provided by
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Analysis of ‘COVID-19 State Data’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/nightranger77/covid19-state-data on 28 January 2022.

    --- Dataset description provided by original source is as follows ---

    This dataset is a per-state amalgamation of demographic, public health and other relevant predictors for COVID-19.

    Deaths, Infections and Tests by State

    The COVID Tracking Project: https://covidtracking.com/data/api

    Used positive, death and totalTestResults from the API for, respectively, Infected, Deaths and Tested in this dataset. Please read the documentation of the API for more context on those columns

    Predictor Data and Sources

    Population (2020)

    Density is people per meter squared https://worldpopulationreview.com/states/

    ICU Beds and Age 60+

    https://khn.org/news/as-coronavirus-spreads-widely-millions-of-older-americans-live-in-counties-with-no-icu-beds/

    GDP

    https://worldpopulationreview.com/states/gdp-by-state/

    Income per capita (2018)

    https://worldpopulationreview.com/states/per-capita-income-by-state/

    Gini

    https://en.wikipedia.org/wiki/List_of_U.S._states_by_Gini_coefficient

    Unemployment (2020)

    Rates from Feb 2020 and are percentage of labor force
    https://www.bls.gov/web/laus/laumstrk.htm

    Sex (2017)

    Ratio is Male / Female
    https://www.kff.org/other/state-indicator/distribution-by-gender/

    Smoking Percentage (2020)

    https://worldpopulationreview.com/states/smoking-rates-by-state/

    Influenza and Pneumonia Death Rate (2018)

    Death rate per 100,000 people
    https://www.cdc.gov/nchs/pressroom/sosmap/flu_pneumonia_mortality/flu_pneumonia.htm

    Chronic Lower Respiratory Disease Death Rate (2018)

    Death rate per 100,000 people
    https://www.cdc.gov/nchs/pressroom/sosmap/lung_disease_mortality/lung_disease.htm

    Active Physicians (2019)

    https://www.kff.org/other/state-indicator/total-active-physicians/

    Hospitals (2018)

    https://www.kff.org/other/state-indicator/total-hospitals

    Health spending per capita

    Includes spending for all health care services and products by state of residence. Hospital spending is included and reflects the total net revenue. Costs such as insurance, administration, research, and construction expenses are not included.
    https://www.kff.org/other/state-indicator/avg-annual-growth-per-capita/

    Pollution (2019)

    Pollution: Average exposure of the general public to particulate matter of 2.5 microns or less (PM2.5) measured in micrograms per cubic meter (3-year estimate)
    https://www.americashealthrankings.org/explore/annual/measure/air/state/ALL

    Medium and Large Airports

    For each state, number of medium and large airports https://en.wikipedia.org/wiki/List_of_the_busiest_airports_in_the_United_States

    Temperature (2019)

    Note that FL was incorrect in the table, but is corrected in the Hottest States paragraph
    https://worldpopulationreview.com/states/average-temperatures-by-state/
    District of Columbia temperature computed as the average of Maryland and Virginia

    Urbanization (2010)

    Urbanization as a percentage of the population https://www.icip.iastate.edu/tables/population/urban-pct-states

    Age Groups (2018)

    https://www.kff.org/other/state-indicator/distribution-by-age/

    School Closure Dates

    Schools that haven't closed are marked NaN https://www.edweek.org/ew/section/multimedia/map-coronavirus-and-school-closures.html

    Note that some datasets above did not contain data for District of Columbia, this missing data was found via Google searches manually entered.

    --- Original source retains full ownership of the source dataset ---

  12. Smoking habits in the UK and its constituent countries

    • ons.gov.uk
    • cy.ons.gov.uk
    xlsx
    Updated Oct 1, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Office for National Statistics (2024). Smoking habits in the UK and its constituent countries [Dataset]. https://www.ons.gov.uk/peoplepopulationandcommunity/healthandsocialcare/healthandlifeexpectancies/datasets/smokinghabitsintheukanditsconstituentcountries
    Explore at:
    xlsxAvailable download formats
    Dataset updated
    Oct 1, 2024
    Dataset provided by
    Office for National Statisticshttp://www.ons.gov.uk/
    License

    Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
    License information was derived automatically

    Area covered
    United Kingdom
    Description

    Annual data and annual historic data on the proportion of adults who currently smoke, the proportion of ex-smokers and proportion of those who have never smoked, by sex and age.

  13. Smoking prevalence worldwide 2024, by country

    • statista.com
    Updated Jul 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Smoking prevalence worldwide 2024, by country [Dataset]. https://www.statista.com/forecasts/1140759/smoking-prevalence-by-country
    Explore at:
    Dataset updated
    Jul 11, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Time period covered
    Jan 1, 2024 - Dec 31, 2024
    Area covered
    Albania
    Description

    Comparing the *** selected regions regarding the smoking prevalence , Myanmar is leading the ranking (***** percent) and is followed by Serbia with ***** percent. At the other end of the spectrum is Ghana with **** percent, indicating a difference of ***** percentage points to Myanmar. Shown is the estimated share of the adult population (15 years or older) in a given region or country, that smoke on a daily basis. According to the WHO and World bank, smoking refers to the use of cigarettes, pipes or other types of tobacco.The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to *** countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).

  14. O

    ARCHIVED - San Diego County Smoking Attributable Mortality

    • data.sandiegocounty.gov
    application/rdfxml +5
    Updated Mar 29, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    County of San Diego (2019). ARCHIVED - San Diego County Smoking Attributable Mortality [Dataset]. https://data.sandiegocounty.gov/w/8tje-x4na/by4r-nr9x?cur=-NQdIhwMIxd&from=e3bNvMIN4GI
    Explore at:
    csv, application/rdfxml, xml, json, tsv, application/rssxmlAvailable download formats
    Dataset updated
    Mar 29, 2019
    Dataset authored and provided by
    County of San Diego
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Area covered
    San Diego County
    Description

    For current version see: https://www.sandiegocounty.gov/content/sdc/hhsa/programs/phs/community_health_statistics/CHSU_Mortality.html#smoking

    This dataset presents smoking attributable deaths for San Diego County by condition and overall categories for those 35 years of age and older.

    2014-2016. For data by HHSA Region or archived years, please visit www.sdhealthstatistics.com

    Methods: Fractions by the Centers for Disease Control, Smoking‐Attributable Mortality, Morbidity, and Economic Costs (SAMMEC) System. http://www.ncbi.nlm.nih.gov/books/NBK294316/table/ch12.t4/?report=objectonly
    Note: Deaths with unknown age or sex were not included in the analysis. Deaths were pulled using 2016 ICD 10 codes. Source: California Department of Public Health, Center for Health Statistics, Office of Health Information and Research, Vital Records Business Intelligence System (2016). Prepared by County of San Diego, Health & Human Services Agency, Public Health Services, Community Health Statistics Unit, 2019.

    Note: COPD = chronic obstructive pulmonary disease. a - Other cancers consist of cancers of the lip, pharynx and oral cavity, esophagus, stomach, pancreas, larynx, cervix uteri (women), kidney and renal pelvis, bladder, liver, colon and rectum, and acute myeloid leukemia.
    b - Other heart disease comprised of rheumatic heart disease, pulmonary heart disease, and other forms of heart disease.
    c - Cerebrovascular diseases ICD-10 Codes: I60-I69 d - Other vascular diseases are comprised of atherosclerosis, aortic aneurysm, and other arterial diseases. e - Pulmonary diseases consists of pneumonia, influenza, emphysema, bronchitis, and chronic airways obstruction.
    f - Prenatal conditions (All Ages) comprised of ICD-10 codes: K55.0, P00.0, P01.0, P01.1, P01.5, P02.0, P02.1, P02.7, P07.0–P07.3, P10.2, P22.0–P22.9, P25.0–P27.9, P28.0, P28.1, P36.0–P36.9, P52.0–P52.3, and P77 (Dietz et al. 2010).
    g - Sudden Infant Death Syndrome ((All Ages) ICD-10 code R95

  15. Number of smokers worldwide 2014-2029

    • statista.com
    Updated Jul 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Number of smokers worldwide 2014-2029 [Dataset]. https://www.statista.com/forecasts/1167644/smoker-population-forecast-in-the-world
    Explore at:
    Dataset updated
    Jul 11, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    Worldwide
    Description

    The global number of smokers in was forecast to continuously increase between 2024 and 2029 by in total **** million individuals (+**** percent). After the ******** consecutive increasing year, the number of smokers is estimated to reach *** billion individuals and therefore a new peak in 2029. Shown is the estimated share of the adult population (15 years or older) in a given region or country, that smoke. According to the WHO and World bank, smoking refers to the use of cigarettes, pipes or other types of tobacco, be it on a daily or non-daily basis.The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to *** countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).Find more key insights for the number of smokers in countries like Caribbean and Africa.

  16. COVID-19 State Data

    • kaggle.com
    Updated Nov 3, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Night Ranger (2020). COVID-19 State Data [Dataset]. https://www.kaggle.com/nightranger77/covid19-state-data/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 3, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Night Ranger
    Description

    This dataset is a per-state amalgamation of demographic, public health and other relevant predictors for COVID-19.

    Deaths, Infections and Tests by State

    The COVID Tracking Project: https://covidtracking.com/data/api

    Used positive, death and totalTestResults from the API for, respectively, Infected, Deaths and Tested in this dataset. Please read the documentation of the API for more context on those columns

    Predictor Data and Sources

    Population (2020)

    Density is people per meter squared https://worldpopulationreview.com/states/

    ICU Beds and Age 60+

    https://khn.org/news/as-coronavirus-spreads-widely-millions-of-older-americans-live-in-counties-with-no-icu-beds/

    GDP

    https://worldpopulationreview.com/states/gdp-by-state/

    Income per capita (2018)

    https://worldpopulationreview.com/states/per-capita-income-by-state/

    Gini

    https://en.wikipedia.org/wiki/List_of_U.S._states_by_Gini_coefficient

    Unemployment (2020)

    Rates from Feb 2020 and are percentage of labor force
    https://www.bls.gov/web/laus/laumstrk.htm

    Sex (2017)

    Ratio is Male / Female
    https://www.kff.org/other/state-indicator/distribution-by-gender/

    Smoking Percentage (2020)

    https://worldpopulationreview.com/states/smoking-rates-by-state/

    Influenza and Pneumonia Death Rate (2018)

    Death rate per 100,000 people
    https://www.cdc.gov/nchs/pressroom/sosmap/flu_pneumonia_mortality/flu_pneumonia.htm

    Chronic Lower Respiratory Disease Death Rate (2018)

    Death rate per 100,000 people
    https://www.cdc.gov/nchs/pressroom/sosmap/lung_disease_mortality/lung_disease.htm

    Active Physicians (2019)

    https://www.kff.org/other/state-indicator/total-active-physicians/

    Hospitals (2018)

    https://www.kff.org/other/state-indicator/total-hospitals

    Health spending per capita

    Includes spending for all health care services and products by state of residence. Hospital spending is included and reflects the total net revenue. Costs such as insurance, administration, research, and construction expenses are not included.
    https://www.kff.org/other/state-indicator/avg-annual-growth-per-capita/

    Pollution (2019)

    Pollution: Average exposure of the general public to particulate matter of 2.5 microns or less (PM2.5) measured in micrograms per cubic meter (3-year estimate)
    https://www.americashealthrankings.org/explore/annual/measure/air/state/ALL

    Medium and Large Airports

    For each state, number of medium and large airports https://en.wikipedia.org/wiki/List_of_the_busiest_airports_in_the_United_States

    Temperature (2019)

    Note that FL was incorrect in the table, but is corrected in the Hottest States paragraph
    https://worldpopulationreview.com/states/average-temperatures-by-state/
    District of Columbia temperature computed as the average of Maryland and Virginia

    Urbanization (2010)

    Urbanization as a percentage of the population https://www.icip.iastate.edu/tables/population/urban-pct-states

    Age Groups (2018)

    https://www.kff.org/other/state-indicator/distribution-by-age/

    School Closure Dates

    Schools that haven't closed are marked NaN https://www.edweek.org/ew/section/multimedia/map-coronavirus-and-school-closures.html

    Note that some datasets above did not contain data for District of Columbia, this missing data was found via Google searches manually entered.

  17. Data from: Population Assessment of Tobacco and Health (PATH) Study [United...

    • icpsr.umich.edu
    Updated Jun 27, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Inter-university Consortium for Political and Social Research [distributor] (2025). Population Assessment of Tobacco and Health (PATH) Study [United States] Restricted-Use Files [Dataset]. http://doi.org/10.3886/ICPSR36231.v42
    Explore at:
    Dataset updated
    Jun 27, 2025
    Dataset provided by
    Inter-university Consortium for Political and Social Researchhttps://www.icpsr.umich.edu/web/pages/
    License

    https://www.icpsr.umich.edu/web/ICPSR/studies/36231/termshttps://www.icpsr.umich.edu/web/ICPSR/studies/36231/terms

    Area covered
    United States
    Description

    The PATH Study was launched in 2011 to inform the Food and Drug Administration's regulatory activities under the Family Smoking Prevention and Tobacco Control Act (TCA). The PATH Study is a collaboration between the National Institute on Drug Abuse (NIDA), National Institutes of Health (NIH), and the Center for Tobacco Products (CTP), Food and Drug Administration (FDA). The study sampled over 150,000 mailing addresses across the United States to create a national sample of people who use or do not use tobacco. 45,971 adults and youth constitute the first (baseline) wave, Wave 1, of data collected by this longitudinal cohort study. These 45,971 adults and youth along with 7,207 "shadow youth" (youth ages 9 to 11 sampled at Wave 1) make up the 53,178 participants that constitute the Wave 1 Cohort. Respondents are asked to complete an interview at each follow-up wave. Youth who turn 18 by the current wave of data collection are considered "aged-up adults" and are invited to complete the Adult Interview. Additionally, "shadow youth" are considered "aged-up youth" upon turning 12 years old, when they are asked to complete an interview after parental consent. At Wave 4, a probability sample of 14,098 adults, youth, and shadow youth ages 10 to 11 was selected from the civilian, noninstitutionalized population (CNP) at the time of Wave 4. This sample was recruited from residential addresses not selected for Wave 1 in the same sampled Primary Sampling Unit (PSU)s and segments using similar within-household sampling procedures. This "replenishment sample" was combined for estimation and analysis purposes with Wave 4 adult and youth respondents from the Wave 1 Cohort who were in the CNP at the time of Wave 4. This combined set of Wave 4 participants, 52,731 participants in total, forms the Wave 4 Cohort. At Wave 7, a probability sample of 14,863 adults, youth, and shadow youth ages 9 to 11 was selected from the CNP at the time of Wave 7. This sample was recruited from residential addresses not selected for Wave 1 or Wave 4 in the same sampled PSUs and segments using similar within-household sampling procedures. This "second replenishment sample" was combined for estimation and analysis purposes with the Wave 7 adult and youth respondents from the Wave 4 Cohorts who were at least age 15 and in the CNP at the time of Wave 7. This combined set of Wave 7 participants, 46,169 participants in total, forms the Wave 7 Cohort. Please refer to the Restricted-Use Files User Guide that provides further details about children designated as "shadow youth" and the formation of the Wave 1, Wave 4, and Wave 7 Cohorts. Dataset 0002 (DS0002) contains the data from the State Design Data. This file contains 7 variables and 82,139 cases. The state identifier in the State Design file reflects the participant's state of residence at the time of selection and recruitment for the PATH Study. Dataset 1011 (DS1011) contains the data from the Wave 1 Adult Questionnaire. This data file contains 2,021 variables and 32,320 cases. Each of the cases represents a single, completed interview. Dataset 1012 (DS1012) contains the data from the Wave 1 Youth and Parent Questionnaire. This file contains 1,431 variables and 13,651 cases. Dataset 1411 (DS1411) contains the Wave 1 State Identifier data for Adults and has 5 variables and 32,320 cases. Dataset 1412 (DS1412) contains the Wave 1 State Identifier data for Youth (and Parents) and has 5 variables and 13,651 cases. The same 5 variables are in each State Identifier dataset, including PERSONID for linking the State Identifier to the questionnaire and biomarker data and 3 variables designating the state (state Federal Information Processing System (FIPS), state abbreviation, and full name of the state). The State Identifier values in these datasets represent participants' state of residence at the time of Wave 1, which is also their state of residence at the time of recruitment. Dataset 1611 (DS1611) contains the Tobacco Universal Product Code (UPC) data from Wave 1. This data file contains 32 variables and 8,601 cases. This file contains UPC values on the packages of tobacco products used or in the possession of adult respondents at the time of Wave 1. The UPC values can be used to identify and validate the specific products used by respondents and augment the analyses of the characteristics of tobacco products used

  18. Prevalence of smoking in the United States 2001-2029

    • statista.com
    Updated Mar 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Statista (2025). Prevalence of smoking in the United States 2001-2029 [Dataset]. https://www.statista.com/forecasts/1148652/smoking-prevalence-forecast-in-the-united-states
    Explore at:
    Dataset updated
    Mar 3, 2025
    Dataset authored and provided by
    Statistahttp://statista.com/
    Area covered
    United States
    Description

    The smoking prevalence in the United States was forecast to continuously decrease between 2024 and 2029 by in total two percentage points. After the eighth consecutive decreasing year, the smoking prevalence is estimated to reach 19.93 percent and therefore a new minimum in 2029. Shown is the estimated share of the adult population (15 years or older) in a given region or country, that smoke on a daily basis. According to the WHO and World bank, smoking refers to the use of cigarettes, pipes or other types of tobacco.The shown data are an excerpt of Statista's Key Market Indicators (KMI). The KMI are a collection of primary and secondary indicators on the macro-economic, demographic and technological environment in up to 150 countries and regions worldwide. All indicators are sourced from international and national statistical offices, trade associations and the trade press and they are processed to generate comparable data sets (see supplementary notes under details for more information).Find more key insights for the smoking prevalence in countries like Canada and Mexico.

  19. Healthy People 2020 Tobacco Use Objectives

    • catalog.data.gov
    • healthdata.gov
    • +4more
    Updated Jun 28, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Centers for Disease Control and Prevention (2025). Healthy People 2020 Tobacco Use Objectives [Dataset]. https://catalog.data.gov/dataset/healthy-people-2020-tobacco-use-objectives
    Explore at:
    Dataset updated
    Jun 28, 2025
    Dataset provided by
    Centers for Disease Control and Preventionhttp://www.cdc.gov/
    Description

    U.S. Department of Health and Human Services (HHS). Centers for Disease Control and Prevention (CDC). Healthy People 2020 Tobacco Use Objectives. Healthy People 2020. Healthy People 2020 provides a framework for action to reduce tobacco use to the point that it is no longer a public health problem for the Nation. This dataset includes information related to the Healthy People 2020 Tobacco Use objectives, operational definitions, baselines, and targets. Baseline years may vary by objective. Targets represented correspond to the year 2020.

  20. d

    Statistics on Women's Smoking Status at Time of Delivery: England

    • digital.nhs.uk
    Updated Jun 19, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    (2025). Statistics on Women's Smoking Status at Time of Delivery: England [Dataset]. https://digital.nhs.uk/data-and-information/publications/statistical/statistics-on-women-s-smoking-status-at-time-of-delivery-england
    Explore at:
    Dataset updated
    Jun 19, 2025
    License

    https://digital.nhs.uk/about-nhs-digital/terms-and-conditionshttps://digital.nhs.uk/about-nhs-digital/terms-and-conditions

    Time period covered
    Jan 1, 2025 - Mar 31, 2025
    Description

    This report presents statistics on women’s smoking status at time of delivery, at Sub Integrated Care Board (Sub-ICB), Integrated Care Board (ICB), regional and national levels. This release includes finalised data for quarter 4 of 2024-25 using data from the Smoking at Time of Delivery data collection which is submitted by commissioners (presented as SATOD v1). Alongside this, comparative data using the Maternity Services Dataset (MSDS) is also presented using data submitted by Trusts (presented as SATOD v2) as a time series from quarter 1 of 2022-23 to quarter 4 of 2024-25. This is available for the same geographical breakdowns and includes an additional breakdown for Local Authorities. SATOD has been dual reported for the last time in 2024-25. The SATOD v1 data collection has now ceased from Q1 2025-26 and future SATOD reporting will be replaced with SATOD v2 data from MSDS. This decision has been based on how closely the estimates have aligned during dual reporting, a Health and Social Care Statistics consultation carried out in 2024 (link available in Related Links below) and being driven by the need to reduce burden on data collection and duplication across the NHS. MSDS data is submitted monthly by Trusts so sub-ICBs (or other commissioning entities) are no longer required to submit data for this collection from Q1 2025-26. Subsequent reporting will include data from MSDS only. A Methodological Change Notice has now been published, and is available in the Related Links section below. If you would still like to send any final feedback on the SATOD data collection retirement and replacement with MSDS, then please contact us on: england.maternityanalysis@nhs.net

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Sahir Maharaj (2024). Youth Tobacco Dataset (2 Decades) [Dataset]. https://www.kaggle.com/datasets/sahirmaharajj/youth-tobacco-survey
Organization logo

Youth Tobacco Dataset (2 Decades)

A comprehensive dataset of over two decades of data

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 23, 2024
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Sahir Maharaj
License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

This dataset was developed to provide states with comprehensive data on both middle school and high school students regarding tobacco use, exposure to environmental tobacco smoke, smoking cessation, school curriculum, minors' ability to purchase or otherwise obtain tobacco products, knowledge and attitudes about tobacco, and familiarity with pro-tobacco and anti-tobacco media messages. The dataset uses a two-stage cluster sample design to produce representative samples of students in middle schools (grades 6–8) and high schools (grades 9–12)

This dataset is valuable for data science due to its coverage of youth tobacco use over nearly two decades. Its rich demographic details and broad geographical spread enable researchers and policymakers to identify trends, behaviors, and risk factors associated with tobacco use among the youth.

For instance, it can help in understanding how tobacco use prevalence varies across different age groups, genders, races, and educational backgrounds. The stratification of data by location and demographic characteristics allows for targeted analysis that can inform public health strategies and educational campaigns aimed at reducing tobacco use among young people.

Some analysis of this dataset can include:

  • Statistical assessments of tobacco use trends, examining changes in attitudes towards tobacco, and identifying high-risk groups based on demographic characteristics.
  • Performing time-series analyses to understand how tobacco use has evolved over the years or spatial analyses to identify geographical variations in tobacco use trends.
  • Correlation studies can help uncover associations between tobacco use and factors like education levels, race, and gender.
  • Advanced machine learning models could predict future trends in youth tobacco use or evaluate the potential impact of new tobacco control measures.
Search
Clear search
Close search
Google apps
Main menu