By Andy Kriebel [source]
The file contains data on births in the United States from 1994 to 2014. The data includes the following columns:
- year: The year of the observation. (Integer)
- month: The month of the observation. (Integer)
- date_of_month: The date of the observation. (Integer)
- day_of_week: The day of the week of the observation. (Integer)
- births: The number of births on the given day. (Integer)
The US Births dataset on Kaggle contains data on births in the United States from 1994 to 2014. The data is broken down by year, month, date of month, day of week, and births.
This dataset can be used to answer questions about when people are born, how common certain birthdays are, and any trends over time. For example, you could use this dataset to find out which day of the week or which month has the most births. Possible uses include:
- Determining on which day of the year, and at what time of day, most people are born, to help set staffing levels in maternity wards
- Identifying trends in baby names over time
- Predicting the number of births on a given day
This data set is a combined effort of the U.S. National Center for Health Statistics and the U.S. Social Security Administration, provided by FiveThirtyEight. It contains data on births in the United States from 1994 to 2014, with the following columns: year, month, date_of_month, day_of_week, births
Thank you to FiveThirtyEight for providing this dataset!
License
License: Dataset copyright by authors
- You are free to:
  - Share - copy and redistribute the material in any medium or format for any purpose, even commercially.
  - Adapt - remix, transform, and build upon the material for any purpose, even commercially.
- You must:
  - Give appropriate credit - Provide a link to the license, and indicate if changes were made.
  - ShareAlike - You must distribute your contributions under the same license as the original.
  - Keep intact - all notices that refer to this license, including copyright notices.
File: US_births_1994-2014.csv

| Column name | Description |
|:------------------|:---------------------------------------------|
| year | Year of the data. (Integer) |
| month | Month of the data. (Integer) |
| date_of_month | Day of the month of the data. (Integer) |
| day_of_week | Day of the week of the data. (Integer) |
| births | Number of births on the given day. (Integer) |
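As a rough illustration of the day-of-week question mentioned earlier, the sketch below totals births per `day_of_week` using only the Python standard library. The inline rows are made-up sample values in the documented column layout, not real records, and the function name `births_by_day_of_week` is hypothetical. The description does not say which weekday each `day_of_week` integer stands for, so the code reports the raw integer only.

```python
import csv
import io
from collections import defaultdict

# Made-up sample rows in the documented column layout (not real data).
SAMPLE_CSV = """year,month,date_of_month,day_of_week,births
1994,1,1,6,8000
1994,1,2,7,7800
1994,1,3,1,10500
1994,1,8,6,8200
1994,1,10,1,10900
"""


def births_by_day_of_week(csv_file):
    """Return {day_of_week: total births} from a file-like CSV object."""
    totals = defaultdict(int)
    for row in csv.DictReader(csv_file):
        totals[int(row["day_of_week"])] += int(row["births"])
    return dict(totals)


totals = births_by_day_of_week(io.StringIO(SAMPLE_CSV))
# The key with the largest total is the busiest day in the sample.
busiest_day = max(totals, key=totals.get)
print(totals, busiest_day)
```

To try this on the real file, pass `open("US_births_1994-2014.csv", newline="")` instead of the in-memory sample.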
If you use this dataset in your research, please credit Andy Kriebel.
This dataset was created from the CDC's National Vital Statistics Reports Volume 56, Number 6. The dataset includes all data available from this report by state level and includes births by race and Hispanic origin, births to unmarried women, rates of cesarean delivery, and twin and multiple birth rates. The data are final for 2005. No value is represented by a -1. "Descriptive tabulations of data reported on the birth certificates of the 4.1 million births that occurred in 2005 are presented. Denominators for population-based rates are postcensal estimates derived from the U.S. 2000 census".
https://creativecommons.org/publicdomain/zero/1.0/
This dataset contains US baby names from the Social Security Administration dating back to 1879. With over 150 years of data, this is one of the most comprehensive datasets on baby names in the US. The data includes the name, year of birth, sex, and number of babies with that name for each year. This dataset is a great resource for anyone interested in studying baby naming trends over time.
This dataset is a compilation of over 140 years of data from the Social Security Administration. It includes data on baby names, year of birth, and sex. There are also columns for the number of babies with that name born in that year.
This dataset can be used to track changes in baby naming trends over time, or to study how the popularity of individual names has changed. It can also be used to study how naming trends differ between sexes, or between different years.
This dataset could be used for a number of things, including: 1. Determining baby name trends over time 2. Finding out what the most popular baby names are in the US 3. Analyzing how baby name popularity has changed over the years
If you use this dataset in your research, please credit @nickgott, @rflprr and the Social Security Administration via Data.gov
Attribution-ShareAlike 4.0 (CC BY-SA 4.0) https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
SELECTED SOCIAL CHARACTERISTICS IN THE UNITED STATES PLACE OF BIRTH - DP02 Universe - Total population Survey-Program - American Community Survey 5-year estimates Years - 2020, 2021, 2022 People not reporting a place of birth were assigned the state or country of birth of another family member, or were allocated the response of another individual with similar characteristics. People born outside the United States were asked to report their place of birth according to current international boundaries. Since numerous changes in boundaries of foreign countries have occurred in the last century, some people may have reported their place of birth in terms of boundaries that existed at the time of their birth or emigration, or in accordance with their own national preference.
https://creativecommons.org/publicdomain/zero/1.0/
Cultural diversity in the U.S. has led to great variations in names and naming traditions and names have been used to express creativity, personality, cultural identity, and values. Source: https://en.wikipedia.org/wiki/Naming_in_the_United_States
This public dataset was created by the Social Security Administration and contains all names from Social Security card applications for births that occurred in the United States after 1879. Note that many people born before 1937 never applied for a Social Security card, so their names are not included in this data. For others who did apply, records may not show the place of birth, and again their names are not included in the data.
All data are from a 100% sample of records on Social Security card applications as of the end of February 2015. To safeguard privacy, the Social Security Administration restricts names to those with at least 5 occurrences.
Fork this kernel to get started with this dataset.
https://bigquery.cloud.google.com/dataset/bigquery-public-data:usa_names
https://cloud.google.com/bigquery/public-data/usa-names
Dataset Source: Data.gov. This dataset is publicly available for anyone to use under the following terms provided by the Dataset Source — http://www.data.gov/privacy-policy#data_policy — and is provided "AS IS" without any warranty, express or implied, from Google. Google disclaims all liability for any damages, direct or indirect, resulting from the use of the dataset.
Banner Photo by @dcp from Unsplash.
What are the most common names?
What are the most common female names?
Are there more female or male names?
Female names by a wide margin?
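The questions above can be approached with a simple aggregation. The following is a minimal sketch that assumes records shaped as (name, year, sex, number), matching the fields named in the dataset description. The sample records are invented for illustration, and the helper names `top_names` and `distinct_names_by_sex` are hypothetical.

```python
from collections import Counter

# Hypothetical records (name, year, sex, number); values are invented.
RECORDS = [
    ("Mary", 1950, "F", 120),
    ("Linda", 1950, "F", 150),
    ("James", 1950, "M", 200),
    ("Mary", 1951, "F", 110),
    ("James", 1951, "M", 190),
    ("Robert", 1951, "M", 180),
]


def top_names(records, sex=None, n=3):
    """Return the n most common names, optionally restricted to one sex."""
    counts = Counter()
    for name, _year, rec_sex, number in records:
        if sex is None or rec_sex == sex:
            counts[name] += number
    return counts.most_common(n)


def distinct_names_by_sex(records):
    """Return {sex: number of distinct names} to compare name variety."""
    names = {}
    for name, _year, sex, _number in records:
        names.setdefault(sex, set()).add(name)
    return {sex: len(found) for sex, found in names.items()}


print(top_names(RECORDS))
print(top_names(RECORDS, sex="F", n=1))
print(distinct_names_by_sex(RECORDS))
```

Because names with fewer than 5 occurrences are withheld from the published data, counts of distinct names computed this way are a lower bound.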
This dataset includes Tables 13, 15, 19, and 21 of the Maryland Vital Statistics Annual Report 2005, which cover births by live order, births to unmarried women, births to women receiving first trimester prenatal care, and births to women receiving late or no prenatal care, all for 2005 by race and county. Rates that are based on fewer than five events in the numerator are not presented and are represented here as -1. Baltimore County and Baltimore City have been combined.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
United Kingdom UK: Birth Rate: Crude: per 1000 People data was reported at 11.800 Ratio in 2016. This records a decrease from the previous number of 11.900 Ratio for 2015. United Kingdom UK: Birth Rate: Crude: per 1000 People data is updated yearly, averaging 12.900 Ratio from Dec 1960 (Median) to 2016, with 57 observations. The data reached an all-time high of 18.800 Ratio in 1964 and a record low of 11.300 Ratio in 2002. United Kingdom UK: Birth Rate: Crude: per 1000 People data remains active status in CEIC and is reported by World Bank. The data is categorized under Global Database’s United Kingdom – Table UK.World Bank.WDI: Population and Urbanization Statistics. Crude birth rate indicates the number of live births occurring during the year, per 1,000 population estimated at midyear. Subtracting the crude death rate from the crude birth rate provides the rate of natural increase, which is equal to the rate of population change in the absence of migration. Sources: (1) United Nations Population Division. World Population Prospects: 2017 Revision, (2) Census reports and other statistical publications from national statistical offices, (3) Eurostat: Demographic Statistics, (4) United Nations Statistical Division. Population and Vital Statistics Report (various years), (5) U.S. Census Bureau: International Database, and (6) Secretariat of the Pacific Community: Statistics and Demography Programme. Aggregation: weighted average.
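The two definitions above (crude rate per 1,000 midyear population, and natural increase as crude birth rate minus crude death rate) reduce to simple arithmetic. The sketch below works through them with invented round numbers. The function names and the input figures are illustrative assumptions, not values from the CEIC or World Bank series.

```python
def crude_rate(events, midyear_population):
    """Events (births or deaths) in a year per 1,000 midyear population."""
    return events / midyear_population * 1000


def natural_increase_rate(crude_birth_rate, crude_death_rate):
    """Rate of natural increase per 1,000, ignoring migration."""
    return crude_birth_rate - crude_death_rate


# Invented figures for a hypothetical population of one million.
cbr = crude_rate(events=11_800, midyear_population=1_000_000)
cdr = crude_rate(events=9_000, midyear_population=1_000_000)
rni = natural_increase_rate(cbr, cdr)
print(round(cbr, 3), round(cdr, 3), round(rni, 3))
```

The results are rounded for display only, since floating-point division can leave tiny representation errors in the last digits.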
This dataset describes birth outcomes (weight, gestational age, sex assigned at birth, presence of birth defects, etc.) and parental factors (age, address, health status, etc.) for people born in North Carolina between 2003 and 2015. This dataset is not publicly accessible because: EPA cannot release personally identifiable information regarding living individuals, according to the Privacy Act and the Freedom of Information Act (FOIA). This dataset contains information about human research subjects. Because there is potential to identify individual participants and disclose personal information, either alone or in combination with other datasets, individual level data are not appropriate to post for public access. Restricted access may be granted to authorized persons by contacting the party listed. It can be accessed through the following means: Data come from the North Carolina Birth Defects Monitoring Program. These data are not publicly available, but more information can be obtained at https://schs.dph.ncdhhs.gov/units/bdmp/ (accessed 11/9/2021). Format: Data are stored as csv files and contain information on birth records in North Carolina from 2003 to 2015, including addresses of parents and medical information on parents and neonates. This dataset is associated with the following publication: Slawsky, E., A. Weaver, T. Luben, and K. Rappazzo. A Cross-sectional Study of Brownfields and Birth Defects. Birth Defects Research. John Wiley & Sons, Inc., Hoboken, NJ, USA, 114(5-6): 197-207, (2022).
This dataset includes Table 11 of the Maryland Vital Statistics Annual Report 2005 which includes GENERAL FERTILITY RATES (Total births per 1,000 women ages 15-44.) AND BIRTH RATES (Live births per 1,000 women in specified age group.) BY AGE OF MOTHER, RACE OF MOTHER, REGION, AND POLITICAL SUBDIVISION, MARYLAND, 2005. Rates that are based on fewer than five events in the numerator are not presented and are represented here as -1. Numbers and rates for Baltimore city and Baltimore county are combined.
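Because suppressed rates are stored as -1 rather than left blank, averaging a column directly would pull the result toward zero. The following is a minimal sketch of one way to handle the sentinel before aggregating. The sample values and the helper names `clean_rates` and `mean_of_available` are hypothetical.

```python
SUPPRESSED = -1  # sentinel used in the dataset for rates that are not presented


def clean_rates(values):
    """Replace the -1 sentinel with None so it cannot be mistaken for a rate."""
    return [None if value == SUPPRESSED else value for value in values]


def mean_of_available(values):
    """Average only the rates that were actually published."""
    present = [value for value in clean_rates(values) if value is not None]
    return sum(present) / len(present) if present else None


# Invented sample column containing two suppressed entries.
sample_rates = [12.5, -1, 14.5, -1, 13.0]
print(clean_rates(sample_rates))
print(mean_of_available(sample_rates))
```

Returning `None` when every value is suppressed avoids a division by zero and keeps "no data" distinct from a rate of 0.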
This dataset includes births, deaths and the ratio of births to deaths by metropolitan area for the years 2000-2006. The actual births and deaths for 2000 and estimates were taken from the U.S. Census Components of Population Change. Ratios were calculated based on that data.
The data (name, year of birth, sex, and number) are from a 100 percent sample of Social Security card applications for 1880 on.
List of the data tables as part of the Immigration system statistics Home Office release. Summary and detailed data tables covering the immigration system, including out-of-country and in-country visas, asylum, detention, and returns.
If you have any feedback, please email MigrationStatsEnquiries@homeoffice.gov.uk.
The Microsoft Excel .xlsx files may not be suitable for users of assistive technology.
If you use assistive technology (such as a screen reader) and need a version of these documents in a more accessible format, please email MigrationStatsEnquiries@homeoffice.gov.uk
Please tell us what format you need. It will help us if you say what assistive technology you use.
Immigration system statistics, year ending June 2025
Immigration system statistics quarterly release
Immigration system statistics user guide
Publishing detailed data tables in migration statistics
Policy and legislative changes affecting migration to the UK: timeline
Immigration statistics data archives
Passenger arrivals summary tables, year ending June 2025 (ODS, 31.3 KB): https://assets.publishing.service.gov.uk/media/689efececc5ef8b4c5fc448c/passenger-arrivals-summary-jun-2025-tables.ods
‘Passengers refused entry at the border summary tables’ and ‘Passengers refused entry at the border detailed datasets’ have been discontinued. The latest published versions of these tables are from February 2025 and are available in the ‘Passenger refusals – release discontinued’ section. A similar data series, ‘Refused entry at port and subsequently departed’, is available within the Returns detailed and summary tables.
Electronic travel authorisation detailed datasets, year ending June 2025 (MS Excel Spreadsheet, 57.1 KB): https://assets.publishing.service.gov.uk/media/689efd8307f2cc15c93572d8/electronic-travel-authorisation-datasets-jun-2025.xlsx
ETA_D01: Applications for electronic travel authorisations, by nationality
ETA_D02: Outcomes of applications for electronic travel authorisations, by nationality
Entry clearance visas summary tables, year ending June 2025 (ODS, 56.1 KB): https://assets.publishing.service.gov.uk/media/68b08043b430435c669c17a2/visas-summary-jun-2025-tables.ods
Entry clearance visa applications and outcomes detailed datasets, year ending June 2025 (MS Excel Spreadsheet, 29.6 MB): https://assets.publishing.service.gov.uk/media/689efda51fedc616bb133a38/entry-clearance-visa-outcomes-datasets-jun-2025.xlsx
Vis_D01: Entry clearance visa applications, by nationality and visa type
Vis_D02: Outcomes of entry clearance visa applications, by nationality, visa type, and outcome
Additional data relating to in country and overseas Visa applications can be fo
This is the monthly data for U.S. employment and unemployment by state including some numbers for Puerto Rico. This dataset was accessed on April 7th 2008. The data for February 2008 are preliminary. The data presented are seasonally adjusted although the unadjusted numbers are also available. Unavailable data are represented as -1. The dataset is taken from Tables 3 and 5 from the United States Department of Labor, Bureau of Labor Statistics. It includes the civilian labor force, the unemployed in numbers and percentages, and employment by industry. Data from table 3 "refer to place of residence. Data for Puerto Rico are derived from a monthly household survey similar to the Current Population Survey. Area definitions are based on Office of Management and Budget Bulletin No. 08-01, dated November 20, 2007, and are available at http://www.bls.gov/lau/lausmsa.htm. Estimates for the latest month are subject to revision the following month". Data from table 5 "are counts of jobs by place of work. Estimates are currently projected from 2007 benchmark levels. Estimates subsequent to the current benchmarks are provisional and will be revised when new information becomes available. Data reflect the conversion to the 2007 version of the North American Industry Classification System (NAICS) as the basis for the assignment and tabulation of economic data by industry, replacing NAICS 2002. For more details, see http://www.bls.gov/sae/saenaics07.htm.
This public dataset was created by the Social Security Administration and contains all names from Social Security card applications for births that occurred in the United States after 1879. Note that many people born before 1937 never applied for a Social Security card, so their names are not included in this data. For others who did apply, records may not show the place of birth, and again their names are not included in the data. All data are from a 100% sample of records on Social Security card applications as of the end of February 2015. To safeguard privacy, the Social Security Administration restricts names to those with at least 5 occurrences. This public dataset is hosted in Google BigQuery and is included in BigQuery's 1TB/mo of free tier processing. This means that each user receives 1TB of free BigQuery processing every month, which can be used to run queries on this public dataset. Watch this short video to learn how to get started quickly using BigQuery to access public datasets.
This dataset includes Tables 9A and 9B of the Maryland Vital Statistics Annual Report 2005, which include the number of births and the birth rate for 2005 by race and Hispanic origin and by county. Rates are per 1,000 population. Rates that are based on fewer than five events in the numerator are not presented and are represented here as -1.
This dataset explores Early Care and Education Funding: Head Start Allocation and State-Funded Prekindergarten Participation. This data is state level and expresses the participation per state. Head Start and Early Head Start are comprehensive child development programs that serve children from birth to age 5, their families, and pregnant women. The overall goal of these programs is to increase the school readiness of young children in families earning low incomes. The Head Start program delivers comprehensive services including: education, health, nutrition, screening for developmental delays, and a variety of social services, if the family needs them. The program is designed to meet the social, emotional, physical and cognitive development of children. This data is from Latest Data: Fiscal Year 2004 (Head Start) and School Year 2002-2003 (State Funded Prekindergarten). This data is from National Child Care Information Center. Refer to NCCIC Child Care Database for detailed state information (http://nccic.org/IMS/Results.asp). Compiled by: National Association of Child Care Resources and Referral Agencies (http://www.naccrra.org/randd/head_start/expenditure.php)
This dataset displays all the hazardous waste sites in the United States and its territories as of 5.08. The data comes from the Agency for Toxic Substances and Disease Registry (ATSDR). The dataset contains information about the site: Site ID Site Name CERCLIS # Address City State County Latitude Longitude Population Region # Congressional Districts Federal Facility National Priorities List Status Ownership Status Classification. For more information go to the Agency for Toxic Substances and Disease Registry (ATSDR) website at http://www.atsdr.cdc.gov
This dataset illustrates the largest difference between high and low temperatures and the smallest difference between high and low temperatures in cities with 50,000 people or more. A value of -1 means that the data was not applicable. Also included are the rankings, the inverse ranking to be used for mapping purposes, the population, the name of city and state, and the temperature degree difference. Source: City-Data. URLs: http://www.city-data.com/top2/c489.html and http://www.city-data.com/top2/c490.html. Date accessed: November 13, 2007.
This data shows where there are interconnections between public transportation modes at airports, ferry, and intercity rail and bus stations in the United States. More specifically, according to the Bureau of Transportation Statistics: "The Intermodal Passenger Connectivity Database is a nationwide data table of passenger transportation terminals, with data on the availability of connections among the various scheduled public transportation modes at each facility. In addition to geographic data for each terminal, the data elements describe the availability of rail, air, bus, transit, and ferry services. This data has been collected from various public sources to provide the only nationwide measurement of the degree of connectivity available in the national passenger transportation system. At this point, data has been collected for intercity rail stations and airline airports only. Data on terminals of other modes is being collected and will be released when it is available. It is anticipated that the entire database will be complete by December 31, 2008."
This dataset explores what child care providers earned in 2005 and 2006. The BLS defines child care workers as those who attend to children at schools, businesses, private households, and child care institutions and perform a variety of tasks, such as dressing, feeding, bathing, and overseeing play. It is important to note that some family child care providers are excluded from the numbers because they are self-employed and report their income differently. The definition also excludes preschool teachers and teacher assistants. All data were obtained from the 2005 and 2006 Occupational Employment Statistics Survey, Bureau of Labor Statistics, U.S. Department of Labor. Compiled by the National Association of Child Care Resource and Referral Agencies.