https://www.pioneerdatahub.co.uk/data/data-request-process/
This highly granular synthetic dataset, created as an asset for the HDR UK Medicines programme, includes information on 680 cancer patients over a period of three years. It includes simulated patient-related data, such as demographics & co-morbidities extracted from ICD-10 and SNOMED-CT codes. It also includes serial, structured data pertaining to the acute care process (readmissions, survival), primary diagnosis, presenting complaint, physiology readings, blood results (infection, inflammatory markers), acuity markers such as the AVPU scale and NEWS2 score, imaging reports, prescribed & administered treatments including fluids, blood products and procedures, information on outpatient admissions, and survival outcomes one year after discharge.
The data were generated using a generative adversarial network model (CTGAN). A flat table of real data was created by consolidating essential information from various key relational tables (medications, demographics). A synthetic version of the flat table was then generated using a customized script based on the SDV package (N. Patki, 2016), which replicated the distribution and logical relationships of the real data.
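The consolidation step described above can be illustrated with a minimal sketch. It is not the PIONEER script: the table contents, column names and the `build_flat_table` helper are hypothetical, and the CTGAN/SDV modelling step that would follow is not shown. It only shows how per-patient rows from relational tables might be collapsed into one flat row per patient.

```python
from collections import defaultdict

# Hypothetical relational tables keyed by patient_id (illustrative values only).
demographics = [
    {"patient_id": 1, "age": 67, "sex": "F"},
    {"patient_id": 2, "age": 54, "sex": "M"},
]
medications = [
    {"patient_id": 1, "drug": "paracetamol"},
    {"patient_id": 1, "drug": "ondansetron"},
    {"patient_id": 2, "drug": "dexamethasone"},
]


def build_flat_table(demographics, medications):
    """Return one row per patient: demographics plus a medication summary."""
    # Group the one-to-many medication rows by patient.
    drugs_by_patient = defaultdict(list)
    for row in medications:
        drugs_by_patient[row["patient_id"]].append(row["drug"])

    flat = []
    for person in demographics:
        drugs = drugs_by_patient.get(person["patient_id"], [])
        flat.append({
            **person,
            "n_medications": len(drugs),
            "medications": ";".join(sorted(drugs)),
        })
    return flat


flat_table = build_flat_table(demographics, medications)
```

A flat table of this shape (one row per patient, fixed columns) is the kind of input a single-table synthesizer expects; how the real script summarised each relational table is not stated in the source.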
Geography: The West Midlands (WM) has a population of 6 million & includes a diverse ethnic & socio-economic mix. University Hospitals Birmingham NHS Foundation Trust (UHB) is one of the largest NHS Trusts in England, providing direct acute services & specialist care across four hospital sites, with 2.2 million patient episodes per year, 2750 beds & an ITU capacity of more than 120 beds. UHB runs a fully electronic healthcare record (EHR) (PICS; Birmingham Systems), a shared primary & secondary care record (Your Care Connected) & a patient portal “My Health”.
Data set availability: Data access is available via the PIONEER Hub for projects which will benefit the public or patients. This can be by developing a new understanding of disease, by providing insights into how to improve care, or by developing new models, tools, treatments, or care processes. Data access can be provided to NHS, academic, commercial, policy and third sector organisations. Applications from SMEs are welcome. There is a single data access process, with public oversight provided by our public review committee, the Data Trust Committee. Contact pioneer@uhb.nhs.uk or visit www.pioneerdatahub.co.uk for more details.
Available supplementary data: Matched controls; ambulance and community data. Unstructured data (images). We can provide the dataset in OMOP and other common data models, and provide the real data via application.
Available supplementary support: Analytics, model build, validation & refinement; A.I. support. Data partner support for ETL (extract, transform & load) processes. Bespoke and “off the shelf” Trusted Research Environment (TRE) build and run. Consultancy with clinical, patient & end-user and purchaser access/support. Support for regulatory requirements. Cohort discovery. Data-driven trials and “fast screen” services to assess population size.
https://cprd.com/data-access
CPRD GOLD linked Cancer Patient Experience Survey (CPES) data include information from patients who have responded to the CPES about their cancer journey from their initial GP visit prior to diagnosis, through diagnosis and treatment and to the ongoing management of their cancer.
Linked Death Registration data from the Office for National Statistics (ONS) include information on the official date and causes of death using ICD-10 codes.
COVID-19 test results for Wales. Details tests, outcomes, and some clinically relevant patient information.
The audit collects information about general diabetes care. Data are submitted by health care services and are relevant to the service each provides, i.e. Secondary Care Bodies = Type 1, GP practices = Type 2. Includes demographics and diabetes-relevant biometric information.
CPRD GOLD linked Death Registration data from the Office for National Statistics (ONS) include information on the official date and causes of death.
Contains organisational survey data of pulmonary rehabilitation services collected between July and September 2019. The dataset includes information on the organisation and resourcing of pulmonary rehabilitation services.
https://saildatabank.com/data/apply-to-work-with-the-data/
Schools and Pupil data for Wales which covers state funded learning centres. Contains information from the Pupil Level Annual School Census (PLASC) and the Welsh Examinations Database (WED). This describes learning centres, outcomes for learners, special educational needs (SEN), attendance summary (prior to 2020), and free school meals (FSM). See table and variable descriptions for further detail.
Attendance data in EDUW was discontinued after 2019 and the Education Daily Attendance Dataset (EDAD) schema replaced it.
Data for 2019/2020 in EOTASPROVISION was found to be unreliable and has been removed by the data owner.
The Covid-19 UK Non-hospital Antibody Testing Results (Pillar 3) dataset, also referred to as iElisa, documents individuals who have undergone a finger-prick test for antibodies resulting from a previous Covid-19 infection.
ISAR is the first global severe asthma registry, a joint initiative in which national registries (both newly created and pre-existing) retain ownership of their own data but open their borders and share data with ISAR for ethically approved research purposes.
This dataset contains a series of service delivery and institutional mortality indicators from the Haiti DHIS2 for the period January 2019 to December 2020. This monthly dataset includes 15 months pre-COVID and 9 months during the pandemic.
The Welsh Cancer Intelligence & Surveillance Unit (WCISU) is the National Cancer Registry for Wales and its primary role is to record, store and report on all incidence of cancer for the resident population of Wales wherever they are treated.
Linked Data Set - Hospital Episode Statistics to Civil Registration of Deaths
This study will assess the prevalence of asthma amongst Hajj pilgrims, and the risk of asthma-related events during the Hajj, using routine data from the Hajj medical service.
The National Exercise Referral Scheme (NERS) is a Public Health Wales (PHW) funded scheme which has been in development since 2007. The Scheme targets clients aged 16 and over who have, or are at risk of developing, a chronic disease.
COVID-19 UK Non-hospital Antigen Testing Results (Pillar 2) data is required by NHS Digital to support COVID-19 requests for linkage, analysis and dissemination.
This dataset contains a series of service delivery and institutional mortality indicators from the Nepal DHIS2 for the period January 2019 to December 2020. This monthly dataset includes 15 months pre-COVID and 9 months during the pandemic.
Information on attendances at emergency care departments in two of the five Health & Social Care Trusts in Northern Ireland; see Emergency Department (Symphony) for the other three Trusts.
The UK Cystic Fibrosis Registry is a national, secure, centralized database sponsored and managed by the Cystic Fibrosis Trust, with UK National Health Service (NHS) research ethics approval and consent from each person for whom data are collected.
The NOGCA data set includes continuously ascertained, record-level data on the care (diagnosis, investigation and management) received in hospitals in England and Wales for patients with invasive, epithelial cancer of the oesophagus, gastro-oesophageal junction