2 datasets found
  1. E

    A corpus of names drawn from the local birth registers of England and Wales,...

    • dtechtive.com
    • find.data.gov.scot
    txt, xlsx, zip
    Updated Jan 25, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    University of Edinburgh (2018). A corpus of names drawn from the local birth registers of England and Wales, 1838-2014 [Dataset]. http://doi.org/10.7488/ds/2294
    Explore at:
    xlsx(30.21 MB), zip(5.395 MB), txt(0.0166 MB)Available download formats
    Dataset updated
    Jan 25, 2018
    Dataset provided by
    University of Edinburgh
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    England, UNITED KINGDOM
    Description

    This dataset comprises a corpus of names, in both the first and middle position, for approximately 22 million individuals born in England and Wales between 1838 and 2014. This data is obtained from birth records made available by a set of volunteer-run genealogical resources - collectively, the 'UK local BMD project' (http://www.ukbmd.org.uk/local) - and has been re-purposed here to demonstrate the applicability of network analysis methods to an onomastic dataset. The ownership and licensing of the intellectual property constituting the original birth records is detailed at https://www.ukbmd.org.uk/TermsAndConditions. Under section 29A of the UK Copyright, Designs and Patents Act 1988, a copyright exception permits copies to be made of lawfully accessible material in order to conduct text and data mining for non-commercial research. The data included in this dataset represents the outcome of such a text-mining analysis. No birth records are included in this dataset, and nor is it possible for records to be reconstructed from the data presented herein. The data comprises an archive of tables, presenting this corpus in various forms: as a rank order of names (in both the first and middle position) by number of registered births per year, and by the total number of births across all years sampled. An overview of the data is also provided, with summary statistics such as the number of usable records registered per year, most popular names per year, and measures of forename diversity and the surname-to-forename usage ratio (an indicator of which forenames are more likely to be transferred uses of surnames). These tables are extensive but not exhaustive, and do not exclude the possibility that errors are present in the corpus. Data are also presented both as '.expression' files (an input format readable by the network analysis tool Graphia Professional) and as '.layout' files, a text file format output by Graphia Professional that describes the characteristics of the network so that it may be replicated. Characteristics of the original birth records that allow the identification of individuals - for instance, full name or location of birth - have been removed.

  2. o

    Supporting material for "Ambivalence, Avoidance and Appeal: Alliterative...

    • ora.ox.ac.uk
    sheet
    Updated Jan 1, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bush, S (2020). Supporting material for "Ambivalence, Avoidance and Appeal: Alliterative Aspects of Anglo Anthroponyms" [Dataset]. http://doi.org/10.5287/bodleian:4JYXKrzzg
    Explore at:
    sheet(2007900)Available download formats
    Dataset updated
    Jan 1, 2020
    Dataset provided by
    University of Oxford
    Authors
    Bush, S
    License

    Open Data Commons Attribution License (ODC-By) v1.0https://www.opendatacommons.org/licenses/by/1.0/
    License information was derived automatically

    Time period covered
    1838 - 2014
    Area covered
    England; Wales
    Description

    In England and Wales, birth, marriage and death (BMD) registration began in July 1837. BMD records were obtained from the ‘UK local BMD’ project (http://www.ukbmd.org.uk/local), a volunteer-led effort to transcribe the local indices of the UK BMD registers for digital preservation. Birth records spanning the complete years 1838-2014 were downloaded in September 2016 from the ‘UK local BMD’ as part of a previous study describing the application of network methods to onomastic data (Bush, et al. 2018; https://www.ncbi.nlm.nih.gov/pubmed/30379928). These records were then updated in January 2018 for a study describing the re-use of birth records in response to child bereavement (Bush, 2019; https://www.tandfonline.com/doi/abs/10.1080/00277738.2018.1536186). Employing the data used for the latter, 23,468,892 birth records were parsed to generate this dataset, which explores trends in alliterative naming within England and Wales. The dataset approximates 130,000 to 230,000 records per year from 1838-1950, 25,000 to 100,000 records per year from 1951-2000, and 5000 to 15,000 records per year from 2001 to 2014. This supplementary archive represents tables and figures drawn from analysis of this dataset. These are provided in support of the paper “Ambivalence, avoidance, and appeal: alliterative aspects of Anglo anthroponyms.” The website hosting the original UK local BMD data, www.ukbmd.org.uk, is operated by Weston Technologies Ltd (Crewe, Cheshire, UK), this company being the owner or license-holder of the intellectual property constituting the birth records. This data was used for the aforementioned studies pursuant to section 29A of the UK Copyright, Designs and Patents Act 1988, where a copyright exception permits copies to be made of lawfully accessible material in order to conduct text and data mining for non-commercial research. This archive contains no copies of any of the original birth records and nor does it present data in a form by which they may be reconstructed. In several countries, one of the most pronounced trends in contemporary baby naming is to choose a comparatively uncommon name. Nevertheless, although a well-documented phenomenon, studies of uncommon name use are often limited to forenames. This study analyses approximately 22 million full names from England and 1 million from Wales, given between 1838 and 2014. It addresses the hypothesis that, consistent with the contemporary desire to choose an uncommon name, alliterative names – uncommon by definition – would become increasingly popular. More broadly, this study charts the long-term trends in alliterative naming over time, which in both England and Wales is consistent with a random expectation for much of the 19th century but declines significantly throughout the 20th century to its lowest use in the 1970s. This trend reverses towards the end of the 20th century, with alliterative naming becoming more common in contemporary records. These three aspects of alliterative name use are thematically referred to as ‘ambivalence’, ‘avoidance’ and ‘appeal’, and may reflect changing attitudes towards alliterative naming. The relatively renewed appeal of alliterative names towards the end of the 20th century complements previous research on the preponderance of uncommon names and the contemporary ‘need for uniqueness’ in naming.

  3. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
University of Edinburgh (2018). A corpus of names drawn from the local birth registers of England and Wales, 1838-2014 [Dataset]. http://doi.org/10.7488/ds/2294

A corpus of names drawn from the local birth registers of England and Wales, 1838-2014

Explore at:
xlsx(30.21 MB), zip(5.395 MB), txt(0.0166 MB)Available download formats
Dataset updated
Jan 25, 2018
Dataset provided by
University of Edinburgh
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Area covered
England, UNITED KINGDOM
Description

This dataset comprises a corpus of names, in both the first and middle position, for approximately 22 million individuals born in England and Wales between 1838 and 2014. This data is obtained from birth records made available by a set of volunteer-run genealogical resources - collectively, the 'UK local BMD project' (http://www.ukbmd.org.uk/local) - and has been re-purposed here to demonstrate the applicability of network analysis methods to an onomastic dataset. The ownership and licensing of the intellectual property constituting the original birth records is detailed at https://www.ukbmd.org.uk/TermsAndConditions. Under section 29A of the UK Copyright, Designs and Patents Act 1988, a copyright exception permits copies to be made of lawfully accessible material in order to conduct text and data mining for non-commercial research. The data included in this dataset represents the outcome of such a text-mining analysis. No birth records are included in this dataset, and nor is it possible for records to be reconstructed from the data presented herein. The data comprises an archive of tables, presenting this corpus in various forms: as a rank order of names (in both the first and middle position) by number of registered births per year, and by the total number of births across all years sampled. An overview of the data is also provided, with summary statistics such as the number of usable records registered per year, most popular names per year, and measures of forename diversity and the surname-to-forename usage ratio (an indicator of which forenames are more likely to be transferred uses of surnames). These tables are extensive but not exhaustive, and do not exclude the possibility that errors are present in the corpus. Data are also presented both as '.expression' files (an input format readable by the network analysis tool Graphia Professional) and as '.layout' files, a text file format output by Graphia Professional that describes the characteristics of the network so that it may be replicated. Characteristics of the original birth records that allow the identification of individuals - for instance, full name or location of birth - have been removed.

Search
Clear search
Close search
Google apps
Main menu