Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This Zenodo entry details the methodology for extracting and reconciling ethnicity data from the Clinical Practice Research Datalink (CPRD), incorporating both General Practitioner (GP) and Hospital Episode Statistics (HES) sources. The approach aims to resolve discrepancies between these sources and provide a standardized single ethnicity value per patient, categorized into 6 and 12 levels according to NHS coding guidelines.
Ethnicity data from the CPRD are recorded in multiple formats. This study harmonizes these data to achieve consistent ethnicity classification across patient records, following a hierarchal reconciliation protocol prioritizing hospital data over GP records.
Ethnicity Levels: Ethnicity data are processed to conform to two levels of granularity:
Source Data Mapping:
Algorithm (AIM-CISC):
Unique Patient Identifiers: Each patient is represented once in hospital data, ensuring a single source of truth for hospital-based ethnicities. This simplifies reconciliation with GP data when discrepancies arise.
Instances were noted where multiple Medcodes map back to a single SNOMED code, highlighting the importance of careful data cross-referencing. For example, two different Medcodes represent the New Zealand European ethnicity, which both map back to the identical SNOMED code.
Full details of every data item available on both the Maternity Indicators dataset and National Community Child Health Database are available through the NWIS Data Dictionary: http://www.datadictionary.wales.nhs.uk/#!WordDocuments/datasetstructure20.htm From 1st April 2019 health service provision for residents of Bridgend local authority moved from Abertawe Bro Morgannwg to Cwm Taf. For more information see the joint statement from Cwm Taf and Abertawe Bro Morgannwg University Health Boards (see weblinks). The health board names have changed with Cwm Taf University Health Board becoming Cwm Taf Morgannwg University Health Board and Abertawe Bro Morgannwg University Health Board becoming Swansea Bay University Health Board. Data for Abertawe Bro Morgannwg and Cwm Taf are available for previous years in this table by selecting the tick boxes in the Area drop-down box.
The Born in Bradford study is tracking the health and wellbeing of over 13,500 children, and their parents born at Bradford Royal Infirmary between March 2007 and December 2010.
Born in Bradford is a prospective pregnancy and birth cohort established to examine how genetic, nutritional, environmental, behavioral and social factors affect health and development during childhood, and subsequently adult life, in a deprived multi-ethnic population. It was developed in close consultation with local communities, clinicians and policy makers with commitment from the outset to undertake research that would both inform interventions to improve health in the city and generate robust science relevant to similar communities in the UK and across the world. Between 2007 and 2011 information on a wide range of characteristics were collected from 12,453 women (and 3,356 partners) who experienced 13,778 pregnancies and delivered 13,818 live births.
Notes
Data Presentation: Born in Bradford Data
Born in Bradford Data Dictionary
Born in Bradford has a number of unique strengths: a) Composition. Half of all the families recruited are living in the UK’s most deprived wards, and 45% are of Pakistani origin. Half of Pakistani-origin mothers and fathers were born outside the UK and over half are related to their partner. This combination enhances the opportunity to study the interplay of deprivation, ethnicity, migration and cultural characteristics and their relationship to social, economic and health outcomes research relevant to many communities across the world.
b) Rich characterization. Detailed information has been collected from parents about demographic, economic, lifestyle, cultural, medical and health factors. Pregnancy oral glucose tolerance tests (OGTT), have been completed in 85% of the cohort and in combination with repeat fetal ultrasound data and subsequent follow-up growth and adiposity (repeat skinfolds, weight and height from birth to current age) will enable BiB uniquely to explore ethnic differences in body composition trajectories through infancy and childhood.
c) Genetic and biomarker data. Maternal, neonatal and follow-up child blood samples have provided biomarker measures of adiposity and immunity, together with stored samples, for which funding has been secured, to assess targeted NMR metabolites in maternal pregnancy fasting samples, cord-blood and infant samples taken at 12-24 months. Genome wide data is available for 9000+ mothers and 8000+ children and funding has been secured for DNA methylation of 1000 mother-child pairs. Our BiB biobank contains 200,000 stored samples.
d) System-wide coverage. The study has successfully linked primary and secondary care, radiology, laboratory and local authority data. This successful data linkage to routine health and education data will allow life-time follow up of clinical outcomes for BiB children and their parents, and educational attainment for children.
e) Community involvement. Close links with members of the public and particularly with cohort members allow the co-production of research in terms of the identification of research questions, monitoring the demands research makes on participants and discussion of the implementation of findings. The study has strong community roots and city-wide support.
Full details of the cohort and related publications can be found on the website
Patient characteristics Children born in the city of Bradford Claims years: 2007-2011 12,453 women with 13,776 pregnancies and 3,448 of their partners Cord blood samples have been obtained and stored and DNA extraction on 10,000 mother\offspring pairs. Sex: Adults: 12,453 women, 3,448 males
Application
If you are interested in working with these data, the application packet, with examples, can be found here: Born in Bradford Application Packet
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This Zenodo entry details the methodology for extracting and reconciling ethnicity data from the Clinical Practice Research Datalink (CPRD), incorporating both General Practitioner (GP) and Hospital Episode Statistics (HES) sources. The approach aims to resolve discrepancies between these sources and provide a standardized single ethnicity value per patient, categorized into 6 and 12 levels according to NHS coding guidelines.
Ethnicity data from the CPRD are recorded in multiple formats. This study harmonizes these data to achieve consistent ethnicity classification across patient records, following a hierarchal reconciliation protocol prioritizing hospital data over GP records.
Ethnicity Levels: Ethnicity data are processed to conform to two levels of granularity:
Source Data Mapping:
Algorithm (AIM-CISC):
Unique Patient Identifiers: Each patient is represented once in hospital data, ensuring a single source of truth for hospital-based ethnicities. This simplifies reconciliation with GP data when discrepancies arise.
Instances were noted where multiple Medcodes map back to a single SNOMED code, highlighting the importance of careful data cross-referencing. For example, two different Medcodes represent the New Zealand European ethnicity, which both map back to the identical SNOMED code.