Facebook
TwitterThis dataset contains counts of live births for California counties based on information entered on birth certificates. Final counts are derived from static data and include out of state births to California residents, whereas provisional counts are derived from incomplete and dynamic data. Provisional counts are based on the records available when the data was retrieved and may not represent all births that occurred during the time period.
The final data tables include both births that occurred in California regardless of the place of residence (by occurrence) and births to California residents (by residence), whereas the provisional data table only includes births that occurred in California regardless of the place of residence (by occurrence). The data are reported as totals, as well as stratified by parent giving birth's age, parent giving birth's race-ethnicity, and birth place type. See temporal coverage for more information on which strata are available for which years.
Facebook
TwitterBy data.world's Admin [source]
This dataset contains daily visitor-submitted birthdays and associated data from an ongoing experimentation known as the Birthday Paradox. Be enlightened as you learn how many people have chosen the same day of their birthday as yours. Get a better perspective on how this phenomenon varies day-to-day, including recent submissions within the last 24 hours. This experiment is published under the MIT License, giving you access to detailed information behind this perplexing cognitive illusion. Find out now why the probability of two people in the same room having birthday matches is much higher than one might expect!
For more datasets, click here.
- 🚨 Your notebook can be here! 🚨!
This dataset provides data on the Birthday Paradox Visitor Experiments. It contains information such as daily visitor-submitted birthdays, the total number of visitors who have submitted birthdays, the total number of visitors who guessed the same day as their birthday, and more. This dataset can be used to analyze patterns in visitor behavior related to the Birthday Paradox Experiment.
In order to use this dataset effectively and efficiently, it is important to understand its fields and variables:
- Updated: The date when this data was last updated
- Count: The total number of visitors who have submitted birthdays
- Recent: The number of visitors who have submitted birthdays in the last 24 hours
- binnedDay: The day of the week for a given visitor's birthday submission
- binnedGuess: The day of week that a given visitor guessed their birthday would fall on 6) Tally: Total number of visitors who guessed same day as their birthday 7) binnedTally: Total number of visitors grouped by guess dayTo begin using this dataset you should first filter your data based on desired criteria such as date range or binnedDay. For instance, if you are interested in analyzing Birthady Paradox Experiment results for Monday submissions only then you can filter your data by binnedDay = 'Monday'. Then further analyze your filtered query by examining other fields such as binnedGuess and comparing it with tally or binnedTally results accordingly. For example if we look at Monday entries above we should compare 'Monday' tallies with 'Tuesday' guesses (or any other weekday). ` Furthermore understanding updates from recent field can also provide interesting insights into user behavior related to Birthady Paradox Experiment -- trackingt recent entries may yield valuable trends over time.
By exploring various combinations offields available in this dataset users will be ableto gain a better understandingof how user behaviordiffers across different daysofweek both within a singledayandover periodsoftimeaccordingtodifferent criteria providedbythisdataset
- Analyzing the likelihood of whether a person will guess their own birthday correctly.
- Estimating which day of the week is seeing the most number of visitors submitting their birthdays each day and analyzing how this varies over time.
- Investigating how likely it is for two people from different regions to have the same birthday by comparing their respective submission rates on each day of the week
If you use this dataset in your research, please credit the original authors. Data Source
See the dataset description for more information.
File: data.csv | Column name | Description | |:----------------|:-----------------------------------------------------------------------------------| | updated | The date and time the data was last updated. (DateTime) | | count | The total number of visitor submissions. (Integer) | | recent | The number of visitor submissions in the last 24 hours. (Integer) | | binnedDay | The day of the week the visitor submitted their birthday. (String) | | binnedGuess | The day of the week the visitor guessed their birthday. (String) | | tally | The total number of visitor guesses that matched their actual birthdays. (Integer) | | binnedTally | The day of the week the visitor guessed their birthday correctly. (String) |
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset contains counts of live births for California as a whole based on information entered on birth certificates. Final counts are derived from static data and include out of state births to California residents, whereas provisional counts are derived from incomplete and dynamic data. Provisional counts are based on the records available when the data was retrieved and may not represent all births that occurred during the time period.
The final data tables include both births that occurred in California regardless of the place of residence (by occurrence) and births to California residents (by residence), whereas the provisional data table only includes births that occurred in California regardless of the place of residence (by occurrence). The data are reported as totals, as well as stratified by parent giving birth's age, parent giving birth's race-ethnicity, and birth place type. See temporal coverage for more information on which strata are available for which years.
Facebook
TwitterAttribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
SELECTED SOCIAL CHARACTERISTICS IN THE UNITED STATES PLACE OF BIRTH - DP02 Universe - Total population Survey-Program - American Community Survey 5-year estimates Years - 2020, 2021, 2022 People not reporting a place of birth were assigned the state or country of birth of another family member, or were allocated the response of another individual with similar characteristics. People born outside the United States were asked to report their place of birth according to current international boundaries. Since numerous changes in boundaries of foreign countries have occurred in the last century, some people may have reported their place of birth in terms of boundaries that existed at the time of their birth or emigration, or in accordance with their own national preference.
Facebook
TwitterNumber and percentage of live births, by month of birth, 1991 to most recent year.
Facebook
Twitterhttps://www.usa.gov/government-works/https://www.usa.gov/government-works/
The world population has grown rapidly, particularly over the past century: in 1900, there were fewer than 2 billion people on the planet. The world population is around 8045311488 in 2023.
Two metrics determine the change in the world population: the number of babies born and the number of people dying. How many babies are born each year?
There were 133.99 million births in 2022, compared to 92.08 million births in 1950
Facebook
TwitterThis dataset describes birth outcomes (weight, gestational age, sex assigned at birth, presence of birth defects, etc.) and parental factors (age, address, health status, etc.) for people born in North Carolina between 2003 and 2015. This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: Data come from the North Carolina Birth Defects Monitoring Program. These data are not publicly available, but more information can be obtained at https://schs.dph.ncdhhs.gov/units/bdmp/ (accessed 11/9/2021). Format: Data are stored as csv files and contain information on birth records in North Carolina from 2003 to 2015, including addresses of parents and medical information on parents and neonates. This dataset is associated with the following publication: Slawsky, E., A. Weaver, T. Luben, and K. Rappazzo. A Cross-sectional Study of Brownfields and Birth Defects. Birth Defects Research. John Wiley & Sons, Inc., Hoboken, NJ, USA, 114(5-6): 197-207, (2022).
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset was created by smitop
Released under CC0: Public Domain
Facebook
TwitterBy Andy Kriebel [source]
The file contains data on births in the United States from 1994 to 2014. The data includes the following columns: year: The year of the observation. (Integer) month: The month of the observation. (Integer) date_of_month: The date of the observation. (Integer) day_of_week: The day of the week of the observation. (Integer) births: The number of births on the given day. (Integer)
The US Births dataset on Kaggle contains data on births in the United States from 1994 to 2014. The data is broken down by year, month, date of month, day of week, and births.
This dataset can be used to answer questions about when people are born, how common certain birthdays are, and any trends over time. For example, you could use this dataset to find out which day of the week has the most births or which month has the most births
- Determining which day of the year and what time of day that people are mostly born to help with staffing levels in maternity wards
- Identifying trends in baby names over time
- Predicting the number of births on a given day
This data set is a combined effort of the U.S. National Center for Health Statistics and the U.S. Social Security Administration, provided by FiveThirtyEight. It contains data on births in the United States from 1994 to 2014, with the following columns: year, month, date_of_month, day_of_week, births
->Thank you to FiveThirtyEight for providing this dataset!
License
License: Dataset copyright by authors - You are free to: - Share - copy and redistribute the material in any medium or format for any purpose, even commercially. - Adapt - remix, transform, and build upon the material for any purpose, even commercially. - You must: - Give appropriate credit - Provide a link to the license, and indicate if changes were made. - ShareAlike - You must distribute your contributions under the same license as the original. - Keep intact - all notices that refer to this license, including copyright notices.
File: US_births_1994-2014.csv | Column name | Description | |:------------------|:---------------------------------------------| | year | Year of the data. (Integer) | | month | Month of the data. (Integer) | | date_of_month | Day of the month of the data. (Integer) | | day_of_week | Day of the week of the data. (Integer) | | births | Number of births on the given day. (Integer) |
If you use this dataset in your research, please credit Andy Kriebel.
Facebook
TwitterThis Dataset contains town level information on the number of infants and toddlers referred, evaluated, determined eligible, and had an Individual Family Service Plan (IFSP) developed through the Connecticut Birth to Three (B23) System. Data can be viewed by calendar or fiscal year. See data element definitions listed in detail below. Included data are collected by the Office of Early Childhood (OEC) as the lead agency for the B23 System, in accordance with Part C of the federal Individuals with Disabilities Education Act (IDEA) and CGS 17a-248. For more information regarding B23 data, please visit https://www.birth23.org/how-are-we-doing/data/ Note: Data fields with a value of 5 or lower (<6) during the reporting period have been suppressed to protect confidentiality, as denoted with a “-99.99”.
Facebook
TwitterEstimated number of persons on July 1, by 5-year age groups and gender, and median age, for Canada, provinces and territories.
Facebook
TwitterInput datasets on Ohio Birth and Autism will not be made accessible to the public due to the fact that they include individual-level data with PII. Output data are all available in tabulated form within the published manuscript. This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: Input data can be obtained from Applications from owners of the data (Children's Hospital and Ohio Department of Health). The tabulated output data is found in the manuscript. Format: Input datasets on Ohio Birth and Autism will not be made accessible to the public due to the fact that they include individual-level data with PII. Output data are all available in tabulated form within the published manuscript (e.g., results of regression models, measures of central tendency, population characteristics, etc.). This dataset is associated with the following publication: Kaufman, J., M. Wright, G. Rice, N. Connolly, K. Bowers, and J. Anixt. AMBIENT OZONE AND FINE PARTICULATE MATTER EXPOSURES AND AUTISM SPECTRUM DISORDER IN METROPOLITAN CINCINNATI, OHIO. ENVIRONMENTAL RESEARCH. Elsevier B.V., Amsterdam, NETHERLANDS, 171: 218-227, (2019).
Facebook
Twitterhttps://data.gov.sg/open-data-licencehttps://data.gov.sg/open-data-licence
Dataset from Singapore Department of Statistics. For more information, visit https://data.gov.sg/datasets/d_6150f21b0892b3fdde546d2a1af2af82/view
Facebook
TwitterThe census is undertaken by the Office for National Statistics every 10 years and gives us a picture of all the people and households in England and Wales. The most recent census took place in March of 2021.The census asks every household questions about the people who live there and the type of home they live in. In doing so, it helps to build a detailed snapshot of society. Information from the census helps the government and local authorities to plan and fund local services, such as education, doctors' surgeries and roads.Key census statistics for Leicester are published on the open data platform to make information accessible to local services, voluntary and community groups, and residents.Further information about the census and full datasets can be found on the ONS website - https://www.ons.gov.uk/census/aboutcensus/censusproductsCountry of birthThis dataset provides Census 2021 estimates that classify usual residents in England and Wales by their country of birth. The estimates are as at Census Day, 21 March 2021.Definition: The country in which a person was born. For people not born in one of in the four parts of the UK, there was an option to select "elsewhere". People who selected "elsewhere" were asked to write in the current name for their country of birth.
Facebook
TwitterThe Stockholm Birth Cohort Study (SBC) was created in 2004/2005 by a probability matching of two anonymized longitudinal datasets; The Stockholm Metropolitan study and The Swedish Work and Mortality Database (WMD). The former involves all children born 1953 that lived in the Stockholm metropolitan area as of November 1, 1963, while the latter comprises data for the period 1980-2009 on all individuals living in Sweden in 1980 or 1990, and born before 1986.
The study comprises data from both surveys and public register records. The core of the project consists of three surveys from The Stockholm Metropolitan study; The School Study (1966), The Family Study (1968), and The Culture and Leisure Time Study (1985). There is also a wide range of register data, for instance delivery records, occupational and income data, welfare recipiency data, health records, mortality data, educational data, and dependency and child welfare committee data.
The Stockholm Birth Cohort offers unique opportunities for longitudinal research within various fields such as sociology, public health science, and psychology. So far the datasets have resulted in more than 140 publications which have dealt with, among other things, whether and how childhood circumstances affect later social outcomes in adult life.
Purpose:
To aim is to create a new tool for life-course studies of health outcomes as well as social outcomes for research in fields such as psychology, public health science, and sociology.
Facebook
TwitterThis public dataset was created by the Social Security Administration and contains all names from Social Security card applications for births that occurred in the United States after 1879. Note that many people born before 1937 never applied for a Social Security card, so their names are not included in this data. For others who did apply, records may not show the place of birth, and again their names are not included in the data. All data are from a 100% sample of records on Social Security card applications as of the end of February 2015. To safeguard privacy, the Social Security Administration restricts names to those with at least 5 occurrences. This public dataset is hosted in Google BigQuery and is included in BigQuery's 1TB/mo of free tier processing. This means that each user receives 1TB of free BigQuery processing every month, which can be used to run queries on this public dataset. Watch this short video to learn how to get started quickly using BigQuery to access public datasets. What is BigQuery .
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
UK residents by broad country of birth and citizenship groups, broken down by UK country, local authority, unitary authority, metropolitan and London boroughs, and counties. Estimates from the Annual Population Survey.
Facebook
TwitterThis dataset of U.S. mortality trends since 1900 highlights the differences in age-adjusted death rates and life expectancy at birth by race and sex. Age-adjusted death rates (deaths per 100,000) after 1998 are calculated based on the 2000 U.S. standard population. Populations used for computing death rates for 2011–2017 are postcensal estimates based on the 2010 census, estimated as of July 1, 2010. Rates for census years are based on populations enumerated in the corresponding censuses. Rates for noncensus years between 2000 and 2010 are revised using updated intercensal population estimates and may differ from rates previously published. Data on age-adjusted death rates prior to 1999 are taken from historical data (see References below). Life expectancy data are available up to 2017. Due to changes in categories of race used in publications, data are not available for the black population consistently before 1968, and not at all before 1960. More information on historical data on age-adjusted death rates is available at https://www.cdc.gov/nchs/nvss/mortality/hist293.htm. SOURCES CDC/NCHS, National Vital Statistics System, historical data, 1900-1998 (see https://www.cdc.gov/nchs/nvss/mortality_historical_data.htm); CDC/NCHS, National Vital Statistics System, mortality data (see http://www.cdc.gov/nchs/deaths.htm); and CDC WONDER (see http://wonder.cdc.gov). REFERENCES National Center for Health Statistics, Data Warehouse. Comparability of cause-of-death between ICD revisions. 2008. Available from: http://www.cdc.gov/nchs/nvss/mortality/comparability_icd.htm. National Center for Health Statistics. Vital statistics data available. Mortality multiple cause files. Hyattsville, MD: National Center for Health Statistics. Available from: https://www.cdc.gov/nchs/data_access/vitalstatsonline.htm. Kochanek KD, Murphy SL, Xu JQ, Arias E. Deaths: Final data for 2017. National Vital Statistics Reports; vol 68 no 9. Hyattsville, MD: National Center for Health Statistics. 2019. Available from: https://www.cdc.gov/nchs/data/nvsr/nvsr68/nvsr68_09-508.pdf. Arias E, Xu JQ. United States life tables, 2017. National Vital Statistics Reports; vol 68 no 7. Hyattsville, MD: National Center for Health Statistics. 2019. Available from: https://www.cdc.gov/nchs/data/nvsr/nvsr68/nvsr68_07-508.pdf. National Center for Health Statistics. Historical Data, 1900-1998. 2009. Available from: https://www.cdc.gov/nchs/nvss/mortality_historical_data.htm.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The SBC was established by a probability matching of two longitudinal datasets: the Stockholm Metropolitan Study (SMS) and the Swedish Work and Mortality Database (WMD). The SMS dataset contains information on all individuals who were born in 1953 and lived in the greater metropolitan area of Stockholm, Sweden in 1963. Register and survey follow-ups were carried out up until 1986. The WMD dataset consists of all individuals who were born before 1985 and lived in Sweden in 1980 and/or 1990. The WMD includes information on income, work, education, family formation, in-patient care, and mortality for the period of 1981 to 2009. The probability matching rendered it possible for researchers to follow 14,294 members of the SMS cohort from birth (1953) to the age of 56 (2009).
Facebook
TwitterThe National Child Development Study (NCDS) is a continuing longitudinal study that seeks to follow the lives of all those living in Great Britain who were born in one particular week in 1958. The aim of the study is to improve understanding of the factors affecting human development over the whole lifespan.
The NCDS has its origins in the Perinatal Mortality Survey (PMS) (the original PMS study is held at the UK Data Archive under SN 2137). This study was sponsored by the National Birthday Trust Fund and designed to examine the social and obstetric factors associated with stillbirth and death in early infancy among the 17,000 children born in England, Scotland and Wales in that one week. Selected data from the PMS form NCDS sweep 0, held alongside NCDS sweeps 1-3, under SN 5565.
Survey and Biomeasures Data (GN 33004):
To date there have been ten attempts to trace all members of the birth cohort in order to monitor their physical, educational and social development. The first three sweeps were carried out by the National Children's Bureau, in 1965, when respondents were aged 7, in 1969, aged 11, and in 1974, aged 16 (these sweeps form NCDS1-3, held together with NCDS0 under SN 5565). The fourth sweep, also carried out by the National Children's Bureau, was conducted in 1981, when respondents were aged 23 (held under SN 5566). In 1985 the NCDS moved to the Social Statistics Research Unit (SSRU) - now known as the Centre for Longitudinal Studies (CLS). The fifth sweep was carried out in 1991, when respondents were aged 33 (held under SN 5567). For the sixth sweep, conducted in 1999-2000, when respondents were aged 42 (NCDS6, held under SN 5578), fieldwork was combined with the 1999-2000 wave of the 1970 Birth Cohort Study (BCS70), which was also conducted by CLS (and held under GN 33229). The seventh sweep was conducted in 2004-2005 when the respondents were aged 46 (held under SN 5579), the eighth sweep was conducted in 2008-2009 when respondents were aged 50 (held under SN 6137), the ninth sweep was conducted in 2013 when respondents were aged 55 (held under SN 7669), and the tenth sweep was conducted in 2020-24 when the respondents were aged 60-64 (held under SN 9412).
A Secure Access version of the NCDS is available under SN 9413, containing detailed sensitive variables not available under Safeguarded access (currently only sweep 10 data). Variables include uncommon health conditions (including age at diagnosis), full employment codes and income/finance details, and specific life circumstances (e.g. pregnancy details, year/age of emigration from GB).
Four separate datasets covering responses to NCDS over all sweeps are available. National Child Development Deaths Dataset: Special Licence Access (SN 7717) covers deaths; National Child Development Study Response and Outcomes Dataset (SN 5560) covers all other responses and outcomes; National Child Development Study: Partnership Histories (SN 6940) includes data on live-in relationships; and National Child Development Study: Activity Histories (SN 6942) covers work and non-work activities. Users are advised to order these studies alongside the other waves of NCDS.
From 2002-2004, a Biomedical Survey was completed and is available under Safeguarded Licence (SN 8731) and Special Licence (SL) (SN 5594). Proteomics analyses of blood samples are available under SL SN 9254.
Linked Geographical Data (GN 33497):
A number of geographical variables are available, under more restrictive access conditions, which can be linked to the NCDS EUL and SL access studies.
Linked Administrative Data (GN 33396):
A number of linked administrative datasets are available, under more restrictive access conditions, which can be linked to the NCDS EUL and SL access studies. These include a Deaths dataset (SN 7717) available under SL and the Linked Health Administrative Datasets (SN 8697) available under Secure Access.
Multi-omics Data and Risk Scores Data (GN 33592)
Proteomics analyses were run on the blood samples collected from NCDS participants in 2002-2004 and are available under SL SN 9254. Metabolomics analyses were conducted on respondents of sweep 10 and are available under SL SN 9411. Polygenic indices are available under SL SN 9439. Derived summary scores have been created that combine the estimated effects of many different genes on a specific trait or characteristic, such as a person's risk of Alzheimer's disease, asthma, substance abuse, or mental health disorders, for example. These scores can be combined with existing survey data to offer a more nuanced understanding of how cohort members' outcomes may be shaped.
Additional Sub-Studies (GN 33562):
In addition to the main NCDS sweeps, further studies have also been conducted on a range of subjects such as parent migration, unemployment, behavioural studies and respondent essays. The full list of NCDS studies available from the UK Data Service can be found on the NCDS series access data webpage.
How to access genetic and/or bio-medical sample data from a range of longitudinal surveys:
For information on how to access biomedical data from NCDS that are not held at the UKDS, see the CLS Genetic data and biological samples webpage.
Further information about the full NCDS series can be found on the Centre for Longitudinal Studies website.
Facebook
TwitterThis dataset contains counts of live births for California counties based on information entered on birth certificates. Final counts are derived from static data and include out of state births to California residents, whereas provisional counts are derived from incomplete and dynamic data. Provisional counts are based on the records available when the data was retrieved and may not represent all births that occurred during the time period.
The final data tables include both births that occurred in California regardless of the place of residence (by occurrence) and births to California residents (by residence), whereas the provisional data table only includes births that occurred in California regardless of the place of residence (by occurrence). The data are reported as totals, as well as stratified by parent giving birth's age, parent giving birth's race-ethnicity, and birth place type. See temporal coverage for more information on which strata are available for which years.