Facebook
TwitterBy data.world's Admin [source]
This dataset contains an aggregation of birth data from the United Statesbetween 1985 and 2015. It consists of information on mothers' locations by state (including District of Columbia) and county, as well as information such as the month they gave birth, and aggregates giving the sum of births during that month. This data has been provided by both the National Bureau for Economic Research and National Center for Health Statistics, whose shared mission is to understand how life works in order to aid individuals in making decisions about their health and wellbeing. This dataset provides valuable insight into population trends across time and location - for example, which states have higher or lower birthrates than others? Which counties experience dramatic fluctuations over time? Given its scope, this dataset could be used in a number of contexts--from epidemiology research to population forecasting. Be sure to check out our other datasets related to births while you're here!
For more datasets, click here.
- 🚨 Your notebook can be here! 🚨!
This dataset could be used to examine local trends in birth rates over time or analyze births at different geographical locations. In order to maximize your use of this dataset, it is important that you understand what information the various columns contain.
The main columns are: State (including District of Columbia), County (coded using the FIPS county code number), Month (numbering from 1 for January through 12 for December), Year (4-digit year) countyBirths (calculated sum of births that occurred to mothers living in a county for a given month) and stateBirths (calculated sum of births that occurred to mothers living in a state for a given month). These fields should provide enough information for you analyze trends across geographic locations both at monthly and yearly levels. You could also consider combining variables such as
YearwithStateorYearwithMonthor any other grouping combinations depending on your analysis goal.In addition, while all data were downloaded on April 5th 2017, it is worth noting that all sources used followed privacy guidelines as laid out by NCHC so individual births occurring after 2005 are not included due to geolocation concerns.
We hope you find this dataset useful and can benefit from its content! With proper understanding of what each field contains, we are confident you will gain valuable insights on birth rates across counties within the United States during this period
- Establishing county-level trends in birth rates for the US over time.
- Analyzing the relationship between month of birth and health outcomes for US babies after they are born (e.g., infant mortality, neurological development, etc.).
- Comparing state/county-level differences in average numbers of twins born each year
If you use this dataset in your research, please credit the original authors. Data Source
See the dataset description for more information.
File: allBirthData.csv | Column name | Description | |:-----------------|:-----------------------------------------------------------------------------------------------------------------| | State | The numerical order of the state where the mother lives. (Integer) | | Month | The month in which the birth took place. (Integer) | | Year | The year of the birth. (Integer) | | countyBirths | The calculated sum of births that occurred to mothers living in that county for that particular month. (Integer) | | stateBirths | The aggregate number at the level of entire states for any given month-year combination. (Integer) | | County | The county where the mother lives, coded using FIPS County Code. (Integer) |
If you use this dataset in your research, please credit the original authors. If you use this dataset in your research, please credit data.world's Admin.
Facebook
TwitterThis dataset contains counts of live births for California counties based on information entered on birth certificates. Final counts are derived from static data and include out of state births to California residents, whereas provisional counts are derived from incomplete and dynamic data. Provisional counts are based on the records available when the data was retrieved and may not represent all births that occurred during the time period.
The final data tables include both births that occurred in California regardless of the place of residence (by occurrence) and births to California residents (by residence), whereas the provisional data table only includes births that occurred in California regardless of the place of residence (by occurrence). The data are reported as totals, as well as stratified by parent giving birth's age, parent giving birth's race-ethnicity, and birth place type. See temporal coverage for more information on which strata are available for which years.
Facebook
TwitterNumber and percentage of live births, by month of birth, 1991 to most recent year.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset contains counts of live births for California as a whole based on information entered on birth certificates. Final counts are derived from static data and include out of state births to California residents, whereas provisional counts are derived from incomplete and dynamic data. Provisional counts are based on the records available when the data was retrieved and may not represent all births that occurred during the time period.
The final data tables include both births that occurred in California regardless of the place of residence (by occurrence) and births to California residents (by residence), whereas the provisional data table only includes births that occurred in California regardless of the place of residence (by occurrence). The data are reported as totals, as well as stratified by parent giving birth's age, parent giving birth's race-ethnicity, and birth place type. See temporal coverage for more information on which strata are available for which years.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset contains US baby names from the Social Security Administration dating back to 1879. With over 150 years of data, this is one of the most comprehensive datasets on baby names in the US. The data includes the name, year of birth, sex, and number of babies with that name for each year. This dataset is a great resource for anyone interested in studying baby naming trends over time
This dataset is a compilation of over 140 years of data from the Social Security Administration. It includes data on baby names, year of birth, and sex. There are also columns for the number of babies with that name born in that year.
This dataset can be used to track changes in baby naming trends over time, or to study how popular names have changed in popularity. It can also be used to study how naming trends differ between sexes, or between different years
This dataset could be used for a number of things, including: 1. Determining baby name trends over time 2. Finding out what the most popular baby names are in the US 3. Analyzing how baby name popularity has changed over the years
If you use this dataset in your research, please credit @nickgott, @rflprr and the Social Security Administration via Data.gov
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Annual live births in England and Wales by age of mother and father, type of registration, median interval between births, number of previous live-born children and National Statistics Socio-economic Classification (NS-SEC).
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Live births and stillbirths annual summary statistics, by sex, age of mother, whether within marriage or civil partnership, percentage of non-UK-born mothers, birth rates and births by month and mothers' area of usual residence.
Facebook
Twitterhttps://data.gov.sg/open-data-licencehttps://data.gov.sg/open-data-licence
Dataset from Singapore Department of Statistics. For more information, visit https://data.gov.sg/datasets/d_6150f21b0892b3fdde546d2a1af2af82/view
Facebook
TwitterNumber and percentage of live births, by age group of mother, 1991 to most recent year.
Facebook
TwitterThis data set contains the first name statistics for newborns in Münster from 2007 to 2021. Two different lists are made available: A first name hit list with the top 30 most commonly used first names, grouped by year of birth and gender. A list of “first name numbers”. This list shows how many babies have been given multiple first names. First name hitlist The table with the first name hitlist contains the following columns: Year = year of birth Rank = Top 30 rank Gender = girl or boy Name = the chosen name Number = Number of children with this name Please note the following additional information: All given first names are taken into account for the calculation of the first name list, i.e. the second and third names. For example, if “Tom” leads the list in a year, that doesn't mean that Tom was the most popular name, but Tom was the most frequently mentioned first name among the total first, second, third and other given names for babies. First name number The table with the first name number contains the following columns: Year = year of birth Children with.. = How many first names Number = number of children The following is an Excel file, which contains both lists in different spreadsheets, as well as two corresponding CSV files.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
US Social Security applications are a great way to track trends in how babies born in the US are named.
Data.gov releases two datasets that are helplful for this: one at the national level and another at the state level. Note that only names with at least 5 babies born in the same year (/ state) are included in this dataset for privacy.
I've taken the raw files here and combined/normalized them into two CSV files (one for each dataset) as well as a SQLite database with two equivalently-defined tables. The code that did these transformations is available here.
New to data exploration in R? Take the free, interactive DataCamp course, "Data Exploration With Kaggle Scripts," to learn the basics of visualizing data with ggplot. You'll also create your first Kaggle Scripts along the way.
Facebook
TwitterAttribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
License information was derived automatically
Births that occurred by hospital name. Birth events of 5 or more per hospital location are displayed
Facebook
TwitterThis dataset presents the estimated percentage of babies born alive before 37 weeks of pregnancy are completed, by country. Preterm birth is a leading cause of neonatal morbidity and mortality. Understanding national rates supports efforts to improve antenatal care, timely interventions, and newborn outcomes. These estimates are adapted from Liang et al. (2024), based on the Global Burden of Disease Study 2021, and provide a globally comparable measure of preterm birth burden.Data Source:The Lancet: https://www.thelancet.com/journals/eclinm/article/PIIS2589-5370(24)00419-X/fulltext Data Dictionary: The data is collated with the following columns:Column headingContent of this columnPossible valuesRefNumerical counter for each row of data, for ease of identification1+CountryShort name for the country195 countries in total – all 194 WHO member states plus PalestineISO3Three-digit alphabetical codes International Standard ISO 3166-1 assigned by the International Organization for Standardization (ISO). e.g. AFG (Afghanistan)ISO22 letter identifier code for the countrye.g. AF (Afghanistan)ICM_regionICM Region for countryAFR (Africa), AMR (Americas), EMR (Eastern Mediterranean), EUR (Europe), SEAR (South east Asia) or WPR (Western Pacific)CodeUnique project code for each indicator:GGTXXnnnGG=data group e.g. OU for outcomeT = N for novice or E for ExpertXX = identifier number 00 to 30nnn = identifier name eg mmre.g. OUN01sbafor Outcome Novice Indicator 01 skilled birth attendance Short_nameIndicator namee.g. maternal mortality ratioDescriptionText description of the indicator to be used on websitee.g. Maternal mortality ratio (maternal deaths per 100,000 live births)Value_typeDescribes the indicator typeNumeric: decimal numberPercentage: value between 0 & 100Text: value from list of text optionsY/N: yes or noValue_categoryExpect this to be ‘total’ for all indicators for Phase 1, but this could allow future disaggregation, e.g. male/female; urban/ruraltotalYearThe year that the indicator value was reported. For most indicators, we will only report if 2014 or more recente.g. 2020Latest_Value‘LATEST’ if this is the most recent reported value for the indicator since 2014, otherwise ‘No’. Useful for indicators with time trend data.LATEST or NOValueIndicator valuee.g. 99.8. NB Some indicators are calculated to several decimal places. We present the value to the number of decimal places that should be displayed on the Hub.SourceFor Caesarean birth rate [OUN13cbr] ONLY, this column indicates the source of the data, either OECD when reported, or UNICEF otherwise.OECD or UNICEFTargetHow does the latest value compare with Global guidelines / targets?meets targetdoes not meet targetmeets global standarddoes not meet global standardRankGlobal rank for indicator, i.e. the country with the best global score for this indicator will have rank = 1, next = 2, etc. This ranking is only appropriate for a few indicators, others will show ‘na’1-195Rank out ofThe total number of countries who have reported a value for this indicator. Ranking scores will only go as high as this number.Up to 195TrendIf historic data is available, an indication of the change over time. If there is a global target, then the trend is either getting better, static or getting worse. For mmr [OUN04mmr] and nmr [OUN05nmr] the average annual rate of reduction (arr) between 2016 and latest value is used to determine the trend:arr <-1.0 = getting worsearr >=-1.0 AND <=1.0 = staticarr >1.0 = getting betterFor other indicators, the trend is estimated by comparing the average of the last three years with the average ten years ago:decreasing if now < 95% 10 yrs agoincreasing if now > 105% 10 yrs agostatic otherwiseincreasingdecreasing Or, if there is a global target: getting better,static,getting worseNotesClarification comments, when necessary LongitudeFor use with mapping LatitudeFor use with mapping DateDate data uploaded to the Hub the following codes are also possible values: not reported does not apply don’t know This is one of many datasets featured on the Midwives’ Data Hub, a digital platform designed to strengthen midwifery and advocate for better maternal and newborn health services.
Facebook
Twitterhttps://digital.nhs.uk/about-nhs-digital/terms-and-conditionshttps://digital.nhs.uk/about-nhs-digital/terms-and-conditions
This statistical release makes available the most recent monthly data on NHS-funded maternity services in England, using data submitted to the Maternity Services Data Set (MSDS). This is the latest report from the newest version of the data set, MSDS.v.2, which has been in place since April 2019, and the third to include provisional data - see the above change notice for more information. The new data set was a significant change which added support for key policy initiatives such as continuity of carer, as well as increased flexibility through the introduction of new clinical coding. This was a major change, so data quality and coverage has initially reduced from the levels seen in earlier publications. We expect the completeness to continue to get better over time, and are looking at ways of supporting improvements. Last month two new measures were included in this publication for the first time: Saving Babies Lives Element 2 Outcome Indicators i and ii. These measures are the proportion of babies below the 3rd birthweight centile born after 37 weeks gestation, and the proportion of babies born after 39 weeks gestation below the 10th birthweight centile. This data can be found in the Measures file available for download and further information on these new measures can be found in the accompanying Metadata file. The data derived from SNOMED codes is being used in some measures such as those for smoking at booking and birth weight, and others will follow in later publications. SNOMED data is also included in some of the published Clinical Quality Improvement Metrics (CQIMs), where rules have been applied to ensure measure rates are calculated only where data quality is high enough. System suppliers are at different stages of developing their new solution and delivering that to trusts. In some cases, this has limited the aspects of data that could be submitted to NHS Digital. To help Trusts understand to what extent they met the Clinical Negligence Scheme for Trusts (CNST) Maternity Incentive Scheme (MIS) Data Quality Criteria for Safety Action 2, we are producing a CNST Scorecard Dashboard showing trust performance against this criteria. This dashboard can be accessed via the link below. These statistics are classified as experimental and should be used with caution. Experimental statistics are new official statistics undergoing evaluation. More information about experimental statistics can be found on the UK Statistics Authority website. Please note that the percentages presented in this report are based on rounded figures and therefore may not total to 100%.
Facebook
TwitterBy Andy Kriebel [source]
The file contains data on births in the United States from 1994 to 2014. The data includes the following columns: year: The year of the observation. (Integer) month: The month of the observation. (Integer) date_of_month: The date of the observation. (Integer) day_of_week: The day of the week of the observation. (Integer) births: The number of births on the given day. (Integer)
The US Births dataset on Kaggle contains data on births in the United States from 1994 to 2014. The data is broken down by year, month, date of month, day of week, and births.
This dataset can be used to answer questions about when people are born, how common certain birthdays are, and any trends over time. For example, you could use this dataset to find out which day of the week has the most births or which month has the most births
- Determining which day of the year and what time of day that people are mostly born to help with staffing levels in maternity wards
- Identifying trends in baby names over time
- Predicting the number of births on a given day
This data set is a combined effort of the U.S. National Center for Health Statistics and the U.S. Social Security Administration, provided by FiveThirtyEight. It contains data on births in the United States from 1994 to 2014, with the following columns: year, month, date_of_month, day_of_week, births
->Thank you to FiveThirtyEight for providing this dataset!
License
License: Dataset copyright by authors - You are free to: - Share - copy and redistribute the material in any medium or format for any purpose, even commercially. - Adapt - remix, transform, and build upon the material for any purpose, even commercially. - You must: - Give appropriate credit - Provide a link to the license, and indicate if changes were made. - ShareAlike - You must distribute your contributions under the same license as the original. - Keep intact - all notices that refer to this license, including copyright notices.
File: US_births_1994-2014.csv | Column name | Description | |:------------------|:---------------------------------------------| | year | Year of the data. (Integer) | | month | Month of the data. (Integer) | | date_of_month | Day of the month of the data. (Integer) | | day_of_week | Day of the week of the data. (Integer) | | births | Number of births on the given day. (Integer) |
If you use this dataset in your research, please credit Andy Kriebel.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Key figures on fertility, live and stillborn children and multiple births among inhabitants of The Netherlands.
Available selections: - Live born children by sex; - Live born children by age of the mother (31 December), in groups; - Live born children by birth order from the mother; - Live born children by marital status of the mother; - Live born children by country of birth of the mother and origin country of the mother; - Stillborn children by duration of pregnancy; - Births: single and multiple; - Average number of children per female; - Average number of children per male; - Average age of the mother at childbirth by birth order from the mother; - Average age of the father at childbirth by birth order from the mother; - Net replacement factor.
CBS is in transition towards a new classification of the population by origin. Greater emphasis is now placed on where a person was born, aside from where that person’s parents were born. The term ‘migration background’ is no longer used in this regard. The main categories western/non-western are being replaced by categories based on continents and a few countries that share a specific migration history with the Netherlands. The new classification is being implemented gradually in tables and publications on population by origin.
Data available from: 1950 Most of the data is available as of 1950 with the exception of the live born children by country of birth of the mother and origin country of the mother (from 2021, previous periods will be added at a later time), stillborn children by duration of pregnancy (24+) (from 1991), average number of children per male (from 1996) and the average age of the father at childbirth (from 1996).
Status of the figures: All data recorded in this publication are final data.
Changed on 15 augustus 2025: The 2023 figures on stillbirths and (multiple) births are final. Final figures of 2024 have been added.
When will new figures be published? In the third quarter of 2026 final figures of 2025 will be published in this publication.
Facebook
TwitterNumber and percentage of live births, by characteristics of the mother (age, parity, marital status, birthplace) and child (sex, single or multiple births, birth weight) based on weeks of gestation, 2000 to most recent year.
Facebook
TwitterAttribution 3.0 (CC BY 3.0)https://creativecommons.org/licenses/by/3.0/
License information was derived automatically
Yearly registered births – breakdown by Month
Facebook
TwitterThis is a source dataset for a Let's Get Healthy California indicator at https://letsgethealthy.ca.gov/. Infant Mortality is defined as the number of deaths in infants under one year of age per 1,000 live births. Infant mortality is often used as an indicator to measure the health and well-being of a community, because factors affecting the health of entire populations can also impact the mortality rate of infants. Although California’s infant mortality rate is better than the national average, there are significant disparities, with African American babies dying at more than twice the rate of other groups. Data are from the Birth Cohort Files. The infant mortality indicator computed from the birth cohort file comprises birth certificate information on all births that occur in a calendar year (denominator) plus death certificate information linked to the birth certificate for those infants who were born in that year but subsequently died within 12 months of birth (numerator). Studies of infant mortality that are based on information from death certificates alone have been found to underestimate infant death rates for infants of all race/ethnic groups and especially for certain race/ethnic groups, due to problems such as confusion about event registration requirements, incomplete data, and transfers of newborns from one facility to another for medical care. Note there is a separate data table "Infant Mortality by Race/Ethnicity" which is based on death records only, which is more timely but less accurate than the Birth Cohort File. Single year shown to provide state-level data and county totals for the most recent year. Numerator: Infants deaths (under age 1 year). Denominator: Live births occurring to California state residents. Multiple years aggregated to allow for stratification at the county level. For this indicator, race/ethnicity is based on the birth certificate information, which records the race/ethnicity of the mother. The mother can “decline to state”; this is considered to be a valid response. These responses are not displayed on the indicator visualization.
Facebook
Twitterhttps://digital.nhs.uk/about-nhs-digital/terms-and-conditionshttps://digital.nhs.uk/about-nhs-digital/terms-and-conditions
This is a publication on maternity activity in English NHS hospitals. This report examines data relating to delivery and birth episodes in 2023-24, and the booking appointments for these deliveries. This annual publication covers the financial year ending March 2024. Data is included from both the Hospital Episodes Statistics (HES) data warehouse and the Maternity Services Data Set (MSDS). HES contains records of all admissions, appointments and attendances for patients admitted to NHS hospitals in England. The HES data used in this publication are called 'delivery episodes'. The MSDS collects records of each stage of the maternity service care pathway in NHS-funded maternity services, and includes information not recorded in HES. The MSDS is a maturing, national-level dataset. In April 2019, the MSDS transitioned to a new version of the dataset. This version, MSDS v2.0, is an update that introduced a new structure and content - including clinical terminology, in order to meet current clinical practice and incorporate new requirements. It is designed to meet requirements that resulted from the National Maternity Review, which led to the publication of the Better Births report in February 2016. This is the fifth publication of data from MSDS v2.0 and data from 2019-20 onwards is not directly comparable to data from previous years. This publication shows the number of HES delivery episodes during the period, with a number of breakdowns including by method of onset of labour, delivery method and place of delivery. It also shows the number of MSDS deliveries recorded during the period, with a breakdown for the mother's smoking status at the booking appointment by age group. It also provides counts of live born term babies with breakdowns for the general condition of newborns (via Apgar scores), skin-to-skin contact and baby's first feed type - all immediately after birth. There is also data available in a separate file on breastfeeding at 6 to 8 weeks. For the first time information on 'Smoking at Time of Delivery' has been presented using annual data from the MSDS. This includes national data broken down by maternal age, ethnicity and deprivation. From 2025/2026, MSDS will become the official source of 'Smoking at Time of Delivery' information and will replace the historic 'Smoking at Time of Delivery' data which is to become retired. We are currently undergoing dual collection and reporting on a quarterly basis for 2024/25 to help users compare information from the two sources. We are working with data submitters to help reconcile any discrepancies at a local level before any close down activities begin. A link to the dual reporting in the SATOD publication series can be found in the links below. Information on how all measures are constructed can be found in the HES Metadata and MSDS Metadata files provided below. In this publication we have also included an interactive Power BI dashboard to enable users to explore key NHS Maternity Statistics measures. The purpose of this publication is to inform and support strategic and policy-led processes for the benefit of patient care. This report will also be of interest to researchers, journalists and members of the public interested in NHS hospital activity in England. Any feedback on this publication or dashboard can be provided to enquiries@nhsdigital.nhs.uk, under the subject “NHS Maternity Statistics”.
Facebook
TwitterBy data.world's Admin [source]
This dataset contains an aggregation of birth data from the United Statesbetween 1985 and 2015. It consists of information on mothers' locations by state (including District of Columbia) and county, as well as information such as the month they gave birth, and aggregates giving the sum of births during that month. This data has been provided by both the National Bureau for Economic Research and National Center for Health Statistics, whose shared mission is to understand how life works in order to aid individuals in making decisions about their health and wellbeing. This dataset provides valuable insight into population trends across time and location - for example, which states have higher or lower birthrates than others? Which counties experience dramatic fluctuations over time? Given its scope, this dataset could be used in a number of contexts--from epidemiology research to population forecasting. Be sure to check out our other datasets related to births while you're here!
For more datasets, click here.
- 🚨 Your notebook can be here! 🚨!
This dataset could be used to examine local trends in birth rates over time or analyze births at different geographical locations. In order to maximize your use of this dataset, it is important that you understand what information the various columns contain.
The main columns are: State (including District of Columbia), County (coded using the FIPS county code number), Month (numbering from 1 for January through 12 for December), Year (4-digit year) countyBirths (calculated sum of births that occurred to mothers living in a county for a given month) and stateBirths (calculated sum of births that occurred to mothers living in a state for a given month). These fields should provide enough information for you analyze trends across geographic locations both at monthly and yearly levels. You could also consider combining variables such as
YearwithStateorYearwithMonthor any other grouping combinations depending on your analysis goal.In addition, while all data were downloaded on April 5th 2017, it is worth noting that all sources used followed privacy guidelines as laid out by NCHC so individual births occurring after 2005 are not included due to geolocation concerns.
We hope you find this dataset useful and can benefit from its content! With proper understanding of what each field contains, we are confident you will gain valuable insights on birth rates across counties within the United States during this period
- Establishing county-level trends in birth rates for the US over time.
- Analyzing the relationship between month of birth and health outcomes for US babies after they are born (e.g., infant mortality, neurological development, etc.).
- Comparing state/county-level differences in average numbers of twins born each year
If you use this dataset in your research, please credit the original authors. Data Source
See the dataset description for more information.
File: allBirthData.csv | Column name | Description | |:-----------------|:-----------------------------------------------------------------------------------------------------------------| | State | The numerical order of the state where the mother lives. (Integer) | | Month | The month in which the birth took place. (Integer) | | Year | The year of the birth. (Integer) | | countyBirths | The calculated sum of births that occurred to mothers living in that county for that particular month. (Integer) | | stateBirths | The aggregate number at the level of entire states for any given month-year combination. (Integer) | | County | The county where the mother lives, coded using FIPS County Code. (Integer) |
If you use this dataset in your research, please credit the original authors. If you use this dataset in your research, please credit data.world's Admin.