Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset comprises a corpus of names, in both the first and middle position, for approximately 22 million individuals born in England and Wales between 1838 and 2014. The data were obtained from birth records made available by a set of volunteer-run genealogical resources - collectively, the 'UK local BMD project' (http://www.ukbmd.org.uk/local) - and have been re-purposed here to demonstrate the applicability of network analysis methods to an onomastic dataset. The ownership and licensing of the intellectual property constituting the original birth records is detailed at https://www.ukbmd.org.uk/TermsAndConditions. Under section 29A of the UK Copyright, Designs and Patents Act 1988, a copyright exception permits copies to be made of lawfully accessible material in order to conduct text and data mining for non-commercial research. The data included in this dataset represent the outcome of such a text-mining analysis. No birth records are included in this dataset, nor is it possible for records to be reconstructed from the data presented herein. The data comprise an archive of tables, presenting this corpus in various forms: as a rank order of names (in both the first and middle position) by number of registered births per year, and by the total number of births across all years sampled. An overview of the data is also provided, with summary statistics such as the number of usable records registered per year, the most popular names per year, measures of forename diversity, and the surname-to-forename usage ratio (an indicator of which forenames are more likely to be transferred uses of surnames). These tables are extensive but not exhaustive, and errors may be present in the corpus.
Data are also presented both as '.expression' files (an input format readable by the network analysis tool Graphia Professional) and as '.layout' files, a text file format output by Graphia Professional that describes the characteristics of the network so that it may be replicated. Characteristics of the original birth records that allow the identification of individuals - for instance, full name or location of birth - have been removed.
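As an illustration of the rank-order tables described above, the following is a minimal sketch of how names might be ranked by number of registered births per year. It is not the method used to build the archive: the function name `rank_names_by_year`, the (year, name) input shape, the upper-casing of names, and the alphabetical tie-breaking are all assumptions made for the example, and the sample records are invented.

```python
from collections import Counter, defaultdict

def rank_names_by_year(records):
    """Rank names within each year by how often they occur.

    `records` is a hypothetical iterable of (year, name) pairs.
    Returns {year: [(rank, name, count), ...]}.
    """
    counts = defaultdict(Counter)
    for year, name in records:
        # Assumption: names are compared case-insensitively.
        counts[year][name.strip().upper()] += 1

    ranked = {}
    for year, counter in counts.items():
        # Sort by descending count, then alphabetically so ties are deterministic.
        ordered = sorted(counter.items(), key=lambda kv: (-kv[1], kv[0]))
        ranked[year] = [(i + 1, name, n) for i, (name, n) in enumerate(ordered)]
    return ranked

# Invented sample data, for illustration only.
sample = [(1838, "Mary"), (1838, "John"), (1838, "Mary"), (1839, "William")]
table = rank_names_by_year(sample)
```

In this sketch, tied names receive consecutive positions rather than a shared rank; the published tables may handle ties differently.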
Open Data Commons Attribution License (ODC-By) v1.0 https://www.opendatacommons.org/licenses/by/1.0/
License information was derived automatically
In England and Wales, birth, marriage and death (BMD) registration began in July 1837. BMD records were obtained from the ‘UK local BMD’ project (http://www.ukbmd.org.uk/local), a volunteer-led effort to transcribe the local indices of the UK BMD registers for digital preservation. Birth records spanning the complete years 1838-2014 were downloaded in September 2016 from the ‘UK local BMD’ project as part of a previous study describing the application of network methods to onomastic data (Bush et al., 2018; https://www.ncbi.nlm.nih.gov/pubmed/30379928). These records were then updated in January 2018 for a study describing the re-use of birth records in response to child bereavement (Bush, 2019; https://www.tandfonline.com/doi/abs/10.1080/00277738.2018.1536186). Employing the data used for the latter, 23,468,892 birth records were parsed to generate this dataset, which explores trends in alliterative naming within England and Wales. The dataset contains approximately 130,000 to 230,000 records per year from 1838 to 1950, 25,000 to 100,000 records per year from 1951 to 2000, and 5,000 to 15,000 records per year from 2001 to 2014. This supplementary archive comprises tables and figures drawn from analysis of this dataset. These are provided in support of the paper “Ambivalence, avoidance, and appeal: alliterative aspects of Anglo anthroponyms.” The website hosting the original UK local BMD data, www.ukbmd.org.uk, is operated by Weston Technologies Ltd (Crewe, Cheshire, UK), which is the owner or license-holder of the intellectual property constituting the birth records. The data were used for the aforementioned studies pursuant to section 29A of the UK Copyright, Designs and Patents Act 1988, under which a copyright exception permits copies to be made of lawfully accessible material in order to conduct text and data mining for non-commercial research.
This archive contains no copies of any of the original birth records, nor does it present data in a form by which they may be reconstructed.
In several countries, one of the most pronounced trends in contemporary baby naming is to choose a comparatively uncommon name. Although this phenomenon is well documented, studies of uncommon name use are often limited to forenames. This study analyses approximately 22 million full names from England and 1 million from Wales, given between 1838 and 2014. It addresses the hypothesis that, consistent with the contemporary desire to choose an uncommon name, alliterative names – uncommon by definition – would become increasingly popular. More broadly, this study charts long-term trends in alliterative naming. In both England and Wales, the frequency of alliterative names is consistent with a random expectation for much of the 19th century but declines significantly throughout the 20th century, reaching its lowest point in the 1970s. This trend reverses towards the end of the 20th century, with alliterative naming becoming more common in contemporary records. These three aspects of alliterative name use are thematically referred to as ‘ambivalence’, ‘avoidance’ and ‘appeal’, and may reflect changing attitudes towards alliterative naming. The renewed appeal of alliterative names towards the end of the 20th century complements previous research on the preponderance of uncommon names and the contemporary ‘need for uniqueness’ in naming.
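The comparison with a 'random expectation' can be sketched as follows. This is only an illustration under stated assumptions, not the procedure used in the paper: it assumes that a name is alliterative when forename and surname share the same initial letter (the study may define alliteration differently, for example phonetically), and that the random expectation is the probability of a matching initial if forenames and surnames were paired independently, i.e. the sum over letters of the forename-initial frequency multiplied by the surname-initial frequency. The function name `alliteration_rates` and the sample names are invented.

```python
from collections import Counter

def alliteration_rates(full_names):
    """Return (observed, expected) alliteration rates.

    `full_names` is a hypothetical list of (forename, surname) pairs.
    Assumption: alliteration means the two names share an initial letter.
    """
    pairs = [(f[0].upper(), s[0].upper()) for f, s in full_names if f and s]
    n = len(pairs)
    if n == 0:
        raise ValueError("no usable names")

    # Observed share of names whose initials match.
    observed = sum(1 for a, b in pairs if a == b) / n

    # Expected share if forenames and surnames were paired at random:
    # sum over letters of P(forename initial) * P(surname initial).
    fore = Counter(a for a, _ in pairs)
    sur = Counter(b for _, b in pairs)
    expected = sum((fore[letter] / n) * (sur[letter] / n) for letter in fore)
    return observed, expected

# Invented sample data, for illustration only.
sample = [("Peter", "Parker"), ("Mary", "Smith"), ("Sarah", "Jones"), ("John", "Parker")]
observed, expected = alliteration_rates(sample)
```

An observed rate close to the expected rate would correspond to the 'ambivalence' described above, a lower rate to 'avoidance', and a higher rate to 'appeal'. Computing the two rates separately for each birth year would give a trend over time.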