Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Open Government Licence 3.0: http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Findings from the Coronavirus (COVID-19) Infection Survey for England.
On 1 April 2025 responsibility for fire and rescue transferred from the Home Office to the Ministry of Housing, Communities and Local Government.
This information covers fires, false alarms and other incidents attended by fire crews, and the statistics include the numbers of incidents, fires, fatalities and casualties as well as information on response times to fires. The Ministry of Housing, Communities and Local Government (MHCLG) also collects information on the workforce, fire prevention work, health and safety and firefighter pensions. All data tables on fire statistics are below.
MHCLG has responsibility for fire services in England. The vast majority of data tables produced by the Ministry of Housing, Communities and Local Government are for England, but some tables (0101, 0103, 0201, 0501, 1401) cover Great Britain, split by nation. In the past the Department for Communities and Local Government (which previously had responsibility for fire services in England) produced data tables for Great Britain and at times the UK. Similar information for the devolved administrations is available from Scotland: Fire and Rescue Statistics (https://www.firescotland.gov.uk/about/statistics/), Wales: Community safety (https://statswales.gov.wales/Catalogue/Community-Safety-and-Social-Inclusion/Community-Safety) and Northern Ireland: Fire and Rescue Statistics (https://www.nifrs.org/home/about-us/publications/).
If you use assistive technology (for example, a screen reader) and need a version of any of these documents in a more accessible format, please email alternativeformats@communities.gov.uk. Please tell us what format you need. It will help us if you say what assistive technology you use.
Fire statistics guidance
Fire statistics incident level datasets
FIRE0101: Incidents attended by fire and rescue services by nation and population (MS Excel Spreadsheet, 153 KB; https://assets.publishing.service.gov.uk/media/686d2aa22557debd867cbe14/FIRE0101.xlsx). Previous FIRE0101 tables are also available.
FIRE0102: Incidents attended by fire and rescue services in England, by incident type and fire and rescue authority (MS Excel Spreadsheet, 2.19 MB; https://assets.publishing.service.gov.uk/media/686d2ab52557debd867cbe15/FIRE0102.xlsx). Previous FIRE0102 tables are also available.
FIRE0103: Fires attended by fire and rescue services by nation and population (MS Excel Spreadsheet, 201 KB; https://assets.publishing.service.gov.uk/media/686d2aca10d550c668de3c69/FIRE0103.xlsx). Previous FIRE0103 tables are also available.
FIRE0104: Fire false alarms by reason for false alarm, England (MS Excel Spreadsheet, 492 KB; https://assets.publishing.service.gov.uk/media/686d2ad92557debd867cbe16/FIRE0104.xlsx). Previous FIRE0104 tables are also available.
FIRE0201: Dwelling fires attended by fire and rescue services by motive, population and nation (MS Excel Spreadsheet; https://assets.publishing.service.gov.uk/media/686d2af42cfe301b5fb6789f/FIRE0201.xlsx)
Open Government Licence 3.0: http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Headline estimates for England, Wales, Northern Ireland and Scotland.
Data tables containing aggregated information about vehicles in the UK are also available.
A number of changes were introduced to these data files in the 2022 release to help meet the needs of our users and to provide more detail.
Fuel type has been added to:
Historic UK data has been added to:
A new data file, df_VEH0520, has been added.
We welcome any feedback on the structure of our data files, their usability, or any suggestions for improvements; please contact the vehicles statistics team.
CSV files can be used either as a spreadsheet (using Microsoft Excel or similar spreadsheet packages) or programmatically using software packages and languages (for example, R or Python).
When used as a spreadsheet, there will be no formatting, but the file can still be explored like our publication tables. Due to their size, older software might not be able to open the entire file.
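As an illustration of the programmatic route, the sketch below reads one of the larger data files listed below into R with data.table rather than a spreadsheet. It is only a sketch: it assumes the CSV has already been downloaded locally, the column names follow the schema documented with each file, and the filter values ("FORD", "Licensed") are illustrative assumptions rather than values checked against the file.

# Minimal sketch: reading a large vehicles CSV into R instead of a spreadsheet.
# Assumes df_VEH0120_GB.csv (listed below) has been downloaded locally.
library(data.table)

veh <- fread("df_VEH0120_GB.csv")

# Quick structural checks: dimensions and the first few column names.
dim(veh)
head(names(veh), 10)

# Example filter using columns from the documented schema; the values
# "FORD" and "Licensed" are illustrative assumptions, not checked codes.
ford_licensed <- veh[Make == "FORD" & LicenceStatus == "Licensed"]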
df_VEH0120_GB: Vehicles at the end of the quarter by licence status, body type, make, generic model and model: Great Britain (CSV, 58.1 MB; https://assets.publishing.service.gov.uk/media/68494aca74fe8fe0cbb4676c/df_VEH0120_GB.csv)
Scope: All registered vehicles in Great Britain; from 1994 Quarter 4 (end December)
Schema: BodyType, Make, GenModel, Model, Fuel, LicenceStatus, [number of vehicles; 1 column per quarter]
df_VEH0120_UK: Vehicles at the end of the quarter by licence status, body type, make, generic model and model: United Kingdom (CSV, 34.1 MB; https://assets.publishing.service.gov.uk/media/68494acb782e42a839d3a3ac/df_VEH0120_UK.csv)
Scope: All registered vehicles in the United Kingdom; from 2014 Quarter 3 (end September)
Schema: BodyType, Make, GenModel, Model, Fuel, LicenceStatus, [number of vehicles; 1 column per quarter]
df_VEH0160_GB: Vehicles registered for the first time by body type, make, generic model and model: Great Britain (CSV, 24.8 MB; https://assets.publishing.service.gov.uk/media/68494ad774fe8fe0cbb4676d/df_VEH0160_GB.csv)
Scope: All vehicles registered for the first time in Great Britain; from 2001 Quarter 1 (January to March)
Schema: BodyType, Make, GenModel, Model, Fuel, [number of vehicles; 1 column per quarter]
df_VEH0160_UK: Vehicles registered for the first time by body type, make, generic model and model: United Kingdom (CSV, 8.26 MB; https://assets.publishing.service.gov.uk/media/68494ad7aae47e0d6c06e078/df_VEH0160_UK.csv)
Scope: All vehicles registered for the first time in the United Kingdom; from 2014 Quarter 3 (July to September)
Schema: BodyType, Make, GenModel, Model, Fuel, [number of vehicles; 1 column per quarter]
In order to keep the data file df_VEH0124 to a reasonable size, it has been split into two halves: one covering makes starting with A to M, and the other covering makes starting with N to Z.
df_VEH0124_AM: https://assets.
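Given the schemas above (descriptive columns followed by one column of counts per quarter), a common first step is to reshape a file into long format. The sketch below does this for df_VEH0160_GB; it is a hedged example only, since the exact names of the quarterly columns are not documented here and are simply taken as everything that is not a descriptive column.

# Hedged sketch: reshaping the wide quarterly layout described above into a
# long format (one row per vehicle group per quarter).
library(data.table)

newreg <- fread("df_VEH0160_GB.csv")

# Descriptive columns from the documented schema; all remaining columns are
# assumed to be the per-quarter counts.
id_cols      <- c("BodyType", "Make", "GenModel", "Model", "Fuel")
quarter_cols <- setdiff(names(newreg), id_cols)

newreg_long <- melt(
  newreg,
  id.vars       = id_cols,
  measure.vars  = quarter_cols,
  variable.name = "Quarter",
  value.name    = "Vehicles"
)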
Data files containing detailed information about vehicles in the UK are also available, including make and model data.
Some tables have been withdrawn and replaced. The table index for this statistical series has been updated to provide a full map between the old and new numbering systems used on this page.
Tables VEH0101 and VEH1104 have not yet been revised to include the recent changes to Large Goods Vehicles (LGV) and Heavy Goods Vehicles (HGV) definitions for data earlier than 2023 quarter 4. This will be amended as soon as possible.
Overview
VEH0101: Vehicles at the end of the quarter by licence status and body type: Great Britain and United Kingdom (ODS, 151 KB; https://assets.publishing.service.gov.uk/media/6846e8dc57f3515d9611f119/veh0101.ods)
Detailed breakdowns
VEH0103: Licensed vehicles at the end of the year by tax class: Great Britain and United Kingdom (ODS, 33 KB; https://assets.publishing.service.gov.uk/media/6846e8dcd25e6f6afd4c01d5/veh0103.ods)
VEH0105: Licensed vehicles at the end of the quarter by body type, fuel type, keepership (private and company) and upper and lower tier local authority: Great Britain and United Kingdom (ODS, 16.3 MB; https://assets.publishing.service.gov.uk/media/6846e8dd57f3515d9611f11a/veh0105.ods)
VEH0206: Licensed cars at the end of the year by VED band and carbon dioxide (CO2) emissions: Great Britain and United Kingdom (ODS, 42.3 KB; https://assets.publishing.service.gov.uk/media/6846e8dee5a089417c806179/veh0206.ods)
VEH0601: Licensed buses and coaches at the end of the year by body type detail: Great Britain and United Kingdom (ODS, 24.6 KB; https://assets.publishing.service.gov.uk/media/6846e8df5e92539572806176/veh0601.ods)
VEH1102: Licensed vehicles at the end of the year by body type and keepership (private and company): Great Britain and United Kingdom (ODS, 146 KB; https://assets.publishing.service.gov.uk/media/6846e8e0e5a089417c80617b/veh1102.ods)
VEH1103: Licensed vehicles at the end of the quarter by body type and fuel type: Great Britain and United Kingdom (ODS, 992 KB; https://assets.publishing.service.gov.uk/media/6846e8e0e5a089417c80617c/veh1103.ods)
VEH1104: Licensed vehicles at the end of the ... (ODS; https://assets.publishing.service.gov.uk/media/6846e8e15e92539572806177/veh1104.ods)
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This repository stores synthetic datasets derived from the database of the UK Biobank (UKB) cohort.
The datasets were generated for illustrative purposes, in particular for reproducing specific analyses on the health risks associated with long-term exposure to air pollution using the UKB cohort. The code used to create the synthetic datasets is available and documented in a related GitHub repo, with details provided in the section below. These datasets can be freely used for code testing and for illustrating other examples of analyses on the UKB cohort.
Note: while the synthetic versions of the datasets resemble the real ones in several respects, users should be aware that these data are fake and must not be used to test or make inferences about specific research hypotheses. Even more importantly, these data cannot be considered a reliable description of the original UKB data, and they must not be presented as such.
The original datasets are described in the article by Vanoli et al. in Epidemiology (2024) (DOI: 10.1097/EDE.0000000000001796, freely available), which also provides information about the data sources.
The work was supported by the Medical Research Council-UK (Grant ID: MR/Y003330/1).
Content
The synthetic datasets (each stored in two versions, CSV and RDS formats) are the following:
synthbdcohortinfo: basic cohort information regarding the follow-up period and birth/death dates for 502,360 participants.
synthbdbasevar: baseline variables, mostly collected at recruitment.
synthpmdata: annual average exposure to PM2.5 for each participant reconstructed using their residential history.
synthoutdeath: death records that occurred during the follow-up with date and ICD-10 code.
In addition, this repository provides the following files:
codebook: a pdf file with a codebook for the variables of the various datasets, including references to the fields of the original UKB database.
asscentre: a csv file with information on the assessment centres used for recruitment of the UKB participants, including code, names, and location (as northing/easting coordinates of the British National Grid).
Countries_December_2022_GB_BUC: a zip file including the shapefile defining the boundaries of the countries in Great Britain (England, Wales, and Scotland), used for mapping purposes [source].
Generation of the synthetic data
The datasets resemble the real data used in the analysis, and they were generated using the R package synthpop (www.synthpop.org.uk). The generation process involves two steps, namely the synthesis of the main data (cohort info, baseline variables, annual PM2.5 exposure) and then the sampling of death events. The R scripts for performing the data synthesis are provided in the GitHub repo (subfolder Rcode/synthcode).
The first part merges all the data, including the annual PM2.5 levels, into a single wide-format dataset (with a row for each subject), generates a synthetic version, adds fake IDs, and then extracts (and reshapes) the individual datasets. In the second part, a Cox proportional hazards model is fitted on the original data to estimate the risks associated with various predictors (including the main exposure, PM2.5), and these relationships are then used to simulate death events in each year. Details on the modelling aspects are provided in the article.
This process guarantees that the synthetic data do not hold specific information about the original records, thus preserving confidentiality. At the same time, the multivariate distribution and correlation across variables, as well as the mortality risks, resemble those of the original data, so the results of descriptive and inferential analyses are similar to those in the original assessments. However, as noted above, the data are intended only for illustrative purposes, and they must not be used to test other research hypotheses.
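For readers unfamiliar with this kind of pipeline, the sketch below illustrates the two-step idea (synthesis of the main data with synthpop, followed by event simulation from a Cox model) on a small toy data frame. It is not the authors' code: the variable names, the toy data and the single-period event draw are all simplifying assumptions; the actual scripts in the GitHub repo simulate deaths year by year as described above.

# Illustrative sketch only (toy data, not real UKB fields; see Rcode/synthcode
# in the GitHub repo for the actual pipeline).
library(synthpop)
library(survival)

set.seed(42)

# Toy "original" wide-format cohort with placeholder variables.
orig <- data.frame(
  age   = round(rnorm(500, 60, 8)),
  sex   = factor(sample(c("F", "M"), 500, replace = TRUE)),
  pm25  = rnorm(500, 10, 2),      # annual-average PM2.5 exposure
  time  = rexp(500, 0.05),        # follow-up time in years
  event = rbinom(500, 1, 0.2)     # death indicator
)

# Step 1: synthesise the main data and attach fake participant IDs.
synth <- syn(orig[, c("age", "sex", "pm25")], seed = 42)$syn
synth$id <- sprintf("FAKE%06d", seq_len(nrow(synth)))

# Step 2: fit a Cox proportional hazards model on the original data, then use
# the estimated relationships to simulate death events in the synthetic cohort
# (here as a crude single-period draw; the real pipeline simulates each year).
cox <- coxph(Surv(time, event) ~ pm25 + age + sex, data = orig)
lp  <- predict(cox, newdata = synth, type = "lp")
synth$event <- rbinom(nrow(synth), 1, plogis(-2 + lp))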
Open Government Licence 3.0: http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Daily Coronavirus (COVID-19) positive tests in Leicester City Council and surrounding districts. Data for the most recent 4-5 days is likely to be incomplete. Please note automatic updates to this dataset were discontinued on 12 December 2023.
Open Government Licence 3.0: http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Provisional counts of the number of deaths registered in England and Wales, by age, sex, region and Index of Multiple Deprivation (IMD), in the latest weeks for which data are available.
Open Government Licence 3.0: http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
National and subnational mid-year population estimates for the UK and its constituent countries by administrative area, age and sex (including components of population change, median age and population density).
Accessible Tables and Improved Quality
As part of the Analysis Function Reproducible Analytical Pipeline Strategy, the processes used to create all National Travel Survey (NTS) statistics tables have been improved to follow the principles of Reproducible Analytical Pipelines (RAP). This has improved the efficiency and quality of NTS tables, and as a result some historical estimates have seen very minor changes, at the fifth decimal place or beyond.
All NTS tables have also been redesigned in an accessible format so that they can be used by as many people as possible, including people with impaired vision, motor difficulties, cognitive impairments or learning disabilities, and deafness or impaired hearing.
If you wish to provide feedback on these changes then please email national.travelsurvey@dft.gov.uk.
Revision to table NTS9919
On 16 April 2025, the figures in table NTS9919 were revised and recalculated to include only day 1 of the travel diary, where short walks of less than a mile are recorded (from 2017 onwards), whereas previous versions included all days. This is to more accurately capture the proportion of trips which include short walks before a surface rail stage. This revision has resulted in fewer available breakdowns than previously published due to the smaller sample sizes.
NTS0303: Average number of trips, stages, miles and time spent travelling by mode: England, 2002 onwards (ODS, 53.9 KB; https://assets.publishing.service.gov.uk/media/66ce0f118e33f28aae7e1f75/nts0303.ods)
NTS0308: Average number of trips and distance travelled by trip length and main mode; England, 2002 onwards (ODS, 191 KB; https://assets.publishing.service.gov.uk/media/66ce0f128e33f28aae7e1f76/nts0308.ods)
NTS0312: Walks of 20 minutes or more by age and frequency: England, 2002 onwards (ODS, 35.1 KB; https://assets.publishing.service.gov.uk/media/66ce0f12bc00d93a0c7e1f71/nts0312.ods)
NTS0313: Frequency of use of different transport modes: England, 2003 onwards (ODS, 27.1 KB; https://assets.publishing.service.gov.uk/media/66ce0f12bc00d93a0c7e1f72/nts0313.ods)
NTS0412: Commuter trips and distance by employment status and main mode: England, 2002 onwards (ODS, 53.8 KB; https://assets.publishing.service.gov.uk/media/66ce0f1325c035a11941f653/nts0412.ods)
NTS0504: Average number of trips by day of the week or month and purpose or main mode: England, 2002 onwards (ODS, 141 KB; https://assets.publishing.service.gov.uk/media/66ce0f141aaf41b21139cf7d/nts0504.ods)
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Dataset containing the images and labels for the MultNIST data used in the CVPR NAS workshop Unseen-data challenge under the codename "Mateo". The MultNIST dataset is constructed from MNIST images. The intention of this dataset is to require machine learning models to do more than just image classification: they must also perform a calculation, in this case multiplication followed by a mod operation. For each image, three MNIST images were randomly chosen and combined through the colour channels, resulting in a three-colour-channel image in which each MNIST image occupies one colour channel. The data is in a channels-first format with a shape of (n, 3, 28, 28), where n is the number of samples in the corresponding set (50,000 for training, 10,000 for validation, and 10,000 for testing). There are ten classes in the dataset, with 7,000 examples of each, distributed evenly between the three subsets. The label of each image is generated using the formula "(r * g * b) % 10", where r, g, and b are the digits shown in the red, green, and blue colour channels respectively. For example, an image with an RGB configuration of 3, 7, and 4 would have the label 4, since (3 * 7 * 4) % 10 = 4.
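A one-line helper makes the labelling rule concrete; it simply reproduces the modular arithmetic stated above for the worked example.

# The MultNIST label is the product of the three channel digits, mod 10.
multnist_label <- function(r, g, b) (r * g * b) %% 10

multnist_label(3, 7, 4)   # 84 %% 10 = 4, matching the example above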
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
A collection of datasets and Python scripts for extraction and analysis of isograms (and some palindromes and tautonyms) from corpus-based word-lists, specifically Google Ngram and the British National Corpus (BNC). Below follows a brief description, first, of the included datasets and, second, of the included scripts.

1. Datasets
The data from English Google Ngrams and the BNC is available in two formats: as a plain text CSV file and as a SQLite3 database.

1.1 CSV format
The CSV files for each dataset actually come in two parts: one labelled ".csv" and one ".totals". The ".csv" file contains the actual extracted data, and the ".totals" file contains some basic summary statistics about the ".csv" dataset with the same name. The CSV files contain one row per data point, with the columns separated by a single tab stop. There are no labels at the top of the files. Each line has the following columns, in this order (the labels below are what I use in the database, which has an identical structure; see the section below):
Label Data type Description
isogramy int The order of isogramy, e.g. "2" is a second order isogram
length int The length of the word in letters
word text The actual word/isogram in ASCII
source_pos text The Part of Speech tag from the original corpus
count int Token count (total number of occurrences)
vol_count int Volume count (number of different sources which contain the word)
count_per_million int Token count per million words
vol_count_as_percent int Volume count as percentage of the total number of volumes
is_palindrome bool Whether the word is a palindrome (1) or not (0)
is_tautonym bool Whether the word is a tautonym (1) or not (0)
The ".totals" files have a slightly different format, with one row per data point, where the first column is the label and the second column is the associated value. The ".totals" files contain the following data:
Label Data type Description
!total_1grams int The total number of words in the corpus
!total_volumes int The total number of volumes (individual sources) in the corpus
!total_isograms int The total number of isograms found in the corpus (before compacting)
!total_palindromes int How many of the isograms found are palindromes
!total_tautonyms int How many of the isograms found are tautonyms
The CSV files are mainly useful for further automated data processing. For working with the data set directly (e.g. to do statistics or cross-check entries), I would recommend using the database format described below.

1.2 SQLite database format
On the other hand, the SQLite database combines the data from all four of the plain text files, and adds various useful combinations of the two datasets, namely:
• Compacted versions of each dataset, where identical headwords are combined into a single entry.
• A combined compacted dataset, combining and compacting the data from both Ngrams and the BNC.
• An intersected dataset, which contains only those words which are found in both the Ngrams and the BNC dataset.
The intersected dataset is by far the least noisy, but is missing some real isograms, too. The columns/layout of each of the tables in the database is identical to that described for the CSV/.totals files above. To get an idea of the various ways the database can be queried for various bits of data, see the R script described below, which computes statistics based on the SQLite database.

2. Scripts
There are three scripts: one for tidying Ngram and BNC word lists and extracting isograms, one to create a neat SQLite database from the output, and one to compute some basic statistics from the data. The first script can be run using Python 3, the second script can be run using SQLite 3 from the command line, and the third script can be run in R/RStudio (R version 3).

2.1 Source data
The scripts were written to work with word lists from Google Ngram and the BNC, which can be obtained from http://storage.googleapis.com/books/ngrams/books/datasetsv2.html and https://www.kilgarriff.co.uk/bnc-readme.html (download all.al.gz). For Ngram the script expects the path to the directory containing the various files; for BNC, the direct path to the *.gz file.

2.2 Data preparation
Before processing proper, the word lists need to be tidied to exclude superfluous material and some of the most obvious noise. This will also bring them into a uniform format. Tidying and reformatting can be done by running one of the following commands:
python isograms.py --ngrams --indir=INDIR --outfile=OUTFILE
python isograms.py --bnc --indir=INFILE --outfile=OUTFILE
Replace INDIR/INFILE with the input directory or filename and OUTFILE with the filename for the tidied and reformatted output.

2.3 Isogram extraction
After preparing the data as above, isograms can be extracted by running the following command on the reformatted and tidied files:
python isograms.py --batch --infile=INFILE --outfile=OUTFILE
Here INFILE should refer to the output from the previous data cleaning process. Please note that the script will actually write two output files: one named OUTFILE with a word list of all the isograms and their associated frequency data, and one named "OUTFILE.totals" with very basic summary statistics.

2.4 Creating a SQLite3 database
The output data from the above step can be easily collated into a SQLite3 database which allows for easy querying of the data directly for specific properties. The database can be created by following these steps:
1. Make sure the files with the Ngrams and BNC data are named "ngrams-isograms.csv" and "bnc-isograms.csv" respectively. (The script assumes you have both of them; if you only want to load one, just create an empty file for the other one.)
2. Copy the "create-database.sql" script into the same directory as the two data files.
3. On the command line, go to the directory where the files and the SQL script are.
4. Type: sqlite3 isograms.db
5. This will create a database called "isograms.db".
See section 1 for a basic description of the output data and how to work with the database.

2.5 Statistical processing
The repository includes an R script (R version 3) named "statistics.r" that computes a number of statistics about the distribution of isograms by length, frequency, contextual diversity, etc. This can be used as a starting point for running your own stats. It uses RSQLite to access the SQLite database version of the data described above.
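As a hedged illustration (not part of the distributed scripts), the snippet below shows how one of the tab-separated ".csv" outputs could be loaded in R using the column layout documented in section 1.1; the file name follows step 1 of section 2.4.

# Illustration only: reading a tidied isogram list with the documented columns.
cols <- c("isogramy", "length", "word", "source_pos", "count",
          "vol_count", "count_per_million", "vol_count_as_percent",
          "is_palindrome", "is_tautonym")

isograms <- read.delim("ngrams-isograms.csv", header = FALSE,
                       sep = "\t", col.names = cols)

# Example queries: higher-order isograms and palindromic isograms.
higher_order <- subset(isograms, isogramy >= 2)
palindromes  <- subset(isograms, is_palindrome == 1)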
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The benchmark interest rate in the United Kingdom was last recorded at 4.25 percent. This dataset provides actual values, historical data, forecasts, charts, statistics, economic calendar and news for the United Kingdom interest rate.
Open Government Licence 3.0: http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
This dataset was compiled for the Regional Seabed Monitoring Plan (RSMP) baseline assessment reported in Cooper & Barry (2017).
The dataset comprises 33,198 macrofaunal samples (83% with associated data on sediment particle size composition) covering large parts of the UK continental shelf. Whilst most samples come from existing datasets, also included are 2,500 new samples collected specifically for the purpose of this study. These new samples were collected during 2014-2016 from the main English aggregate dredging regions (Humber, Anglian, Thames, Eastern English Channel and South Coast) and at four individual, isolated extraction sites where the RSMP methodology is also being adopted (e.g. Area 457, North-West dredging region; Area 392, North-West dredging region; Area 376, Bristol Channel dredging region; Goodwin Sands, English Channel). This work was funded by the aggregates industry, and carried out by contractors on their behalf. Samples were collected in accordance with a detailed protocols document which included control measures to ensure the quality of faunal and sediment sample processing. Additional samples were acquired to fill in gaps in spatial coverage and to provide a contemporary baseline for sediment composition.
Sources of existing data include both government and industry, with contributions from the marine aggregate dredging, offshore wind, oil and gas, nuclear and port and harbour sectors. Samples have been collected over a period of 48 years from 1969 to 2016, although the vast majority (96%) were acquired since 2000. Samples have been collected during every month of the year, although there is a clear peak during summer months when weather conditions are generally more favourable for fieldwork.
The DOI includes multiple files for use with the R script that accompanies the paper: Cooper, K. M. & Barry, J. A big data approach to macrofaunal baseline assessment, monitoring and sustainable exploitation of the seabed. Scientific Reports 7, doi: 10.1038/s41598-017-11377-9 (2017). Files include:
*At the request of data owners, macrofaunal abundance and sediment particle size data have been redacted from 13 of the 777 surveys (1.7%) in the dataset. Note that metadata and derived variables are still included. Surveys with redacted data include:
SurveyName
Cefas will only make redacted data available where the data requester can provide written permission from the relevant data owner(s) - see below. Note that it is the responsibility of the data requester to seek permission from the relevant data owners.
Data owners for the redacted surveys listed above are:
Description of the C5922DATASET13022017.csv / C5922DATASET13022017REDACTED.csv (raw data)
A variety of gear types have been used for sample collection, including grabs (0.1m2 Hamon, 0.2m2 Hamon, 0.1m2 Day, 0.1m2 Van Veen and 0.1m2 Smith-McIntyre) and cores. Of these various devices, 93% of samples were acquired using either a 0.1m2 Hamon grab or a 0.1m2 Day grab. Sieve sizes used in sample processing include 1mm and 0.5mm, reflecting the conventional preference for 1mm offshore and 0.5mm inshore (see Figure 2). Of the samples collected using either a 0.1m2 Hamon grab or a 0.1m2 Day grab, 88% were processed using a 1mm sieve.
Taxon names were standardised according to the WoRMS (World Register of Marine Species) list using the Taxon Match Tool (http://www.marinespecies.org/aphia.php?p=match). Of the initial 13,449 taxon names, only 4,248 remained after correction. The output from this tool also provides taxonomic aggregation information, allowing data to be analysed at different taxonomic levels - from species to phyla. The final dataset comprises a single-sheet comma-separated values (.csv) file. Colonials accounted for less than 20% of the total number of taxa and, where present, were given a value of 1 in the dataset. This component of the fauna was missing from 325 out of the 777 surveys, reflecting either a true absence, or simply that colonial taxa were ignored by the analyst. Sediment particle size data were provided as percentage weight by sieve mesh size, with the dataset including 99 different sieve sizes. Sediment samples have been processed using sieve techniques, and a combination of sieve and laser diffraction techniques. Key metadata fields include: Sample coordinates (Latitude & Longitude), Survey Name, Gear, Date, Grab Sample Volume (litres) and Water Depth (m). A number of additional explanatory variables are also provided (salinity, temperature, chlorophyll a, Suspended particulate matter, Water depth, Wave Orbital Velocity, Average Current, Bed Stress). In total, the dataset dimensions are 33,198 rows (samples) x 13,588 columns (variables/factors), yielding a matrix of 451,094,424 individual data values.
These historical datasets were created by the ESRC funded project, ‘Victims' access to justice through English criminal courts, 1675 to the present’, carried out between 2018 and 2022. This interdisciplinary project examined public access to justice in England over more than three centuries - from the 1670s to the present. Bringing together leading criminologists and crime historians, it assembled and analysed data on over 235,000 victims involved in trials at the Old Bailey in London in order to understand the rights of, and resources and services available to, victims in the past, present and future. It constructed a new evidence base to establish who these victims were, how they came to be complainants, prosecutors or witnesses, and how they made use of available legal and financial resources. The project director was Pamela Cox and the other co-investigators were Barry Godfrey, Ruth Lamont, Robert Shoemaker, Heather Shore, and Sandra Walklate. Lucy Williams and Elisa Impara were research officers. The results of the data analysis were reported in a monograph: Pamela Cox, Robert Shoemaker and Heather Shore, Victims and Criminal Justice: A History (Oxford University Press, 2023). An additional contemporary dataset was created for the project using aggregated data on crime victims from the Crime Survey for England and Wales 1982-2017. It may be accessed via the UK Data Archive.
There are six datasets in this deposit. Two contain information pertaining to all victims of crimes prosecuted at the Old Bailey between 1674 and 1913, derived from the Old Bailey Proceedings Online (all fields and clean fields); two contain information on victims manually extracted from the Times newspaper between 1910-25 and 1960-75 (all fields and clean fields); one contains data on a sample of Trial Participants at the Old Bailey between 1805 and 1900, collected manually; and one contains judicial statistics extracted from the Parliamentary Papers from 1805-1879.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The LScDC (Leicester Scientific Dictionary-Core Dictionary)
April 2020 by Neslihan Suzen, PhD student at the University of Leicester (ns433@leicester.ac.uk / suzenneslihan@hotmail.com)
Supervised by Prof Alexander Gorban and Dr Evgeny Mirkes

[Version 3] The third version of LScDC (Leicester Scientific Dictionary-Core) is formed using the updated LScD (Leicester Scientific Dictionary) - Version 3*. All steps applied to build the new version of the core dictionary are the same as in Version 2** and can be found in the description of Version 2 below. We did not repeat the explanation. The files provided with this description are also the same as described for LScDC Version 2. The numbers of words in the 3rd versions of LScD and LScDC are summarized below.
LScD (v3): 972,060 words
LScDC (v3): 103,998 words
* Suzen, Neslihan (2019): LScD (Leicester Scientific Dictionary). figshare. Dataset. https://doi.org/10.25392/leicester.data.9746900.v3
** Suzen, Neslihan (2019): LScDC (Leicester Scientific Dictionary-Core). figshare. Dataset. https://doi.org/10.25392/leicester.data.9896579.v2

[Version 2] Getting Started
This file describes a sorted and cleaned list of words from LScD (Leicester Scientific Dictionary), explains the steps for sub-setting the LScD, and gives basic statistics of words in the LSC (Leicester Scientific Corpus), to be found in [1, 2]. The LScDC (Leicester Scientific Dictionary-Core) is a list of words ordered by the number of documents containing the words, and is available in the published CSV file. There are 104,223 unique words (lemmas) in the LScDC. This dictionary is created to be used in future work on the quantification of the sense of research texts. The objective of sub-setting the LScD is to discard words which appear too rarely in the corpus. In text mining algorithms, the use of an enormous number of text data brings challenges to the performance and accuracy of data mining applications. The performance and accuracy of models depend heavily on the type of words (such as stop words and content words) and the number of words in the corpus. Rare occurrence of words in a collection is not useful in discriminating texts in large corpora, as rare words are likely to be non-informative signals (or noise) and redundant in the collection of texts. The selection of relevant words also holds out the possibility of more effective and faster operation of text mining algorithms. To build the LScDC, we decided the following process on LScD: removing words that appear in no more than 10 documents (
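The sub-setting rule described above (discarding words that appear in no more than 10 documents) can be illustrated with a minimal R sketch; the data frame and column names here are placeholders, not the actual LScD file format.

# Toy illustration of the LScDC criterion: keep words appearing in more than
# 10 documents. Column names and values are placeholders.
lscd <- data.frame(
  word      = c("model", "corpus", "rarewordx"),
  doc_count = c(15234, 8921, 4)
)

lscdc <- subset(lscd, doc_count > 10)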
This is a full description of the quality control procedure undertaken and the derived files produced by the MRC-IEU associated with the full UK Biobank (version 3, March 2018) genetic data. This dataset supersedes the earlier version available at DOI: 10.5523/bris.3074krb6t2frj29yh2b03x3wxj. Complete download (zip, 176.9 KiB).
Open Government Licence 3.0: http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Files for use with the R script accompanying the paper Cooper et al. (2018). Note that this script also uses files from https://doi.org/10.14466/CefasDataHub.34 (details provided in the script). Cooper, K.M., Bolam, S.G., Downie, A., Callaway, A., Barry, J. (2018). Biological-based habitat classification approaches promote cost-efficient monitoring: an example using seabed assemblages. Journal of Applied Ecology. Files include: R SCRIPT FINAL.R (R script)
C5922DATASETFAM13022017REDACTED.csv (see below for description)
UKSeaMap2016_SedimentsDissClip.shp (UK SeaMap data clipped to study area. These data are available from http://jncc.defra.gov.uk/ukseamap under an Open Government Licence)
StudyArea.shp (polygon for study area)
FaunalCluster.tif (faunal cluster habitat map in raster format)
PhysicalCluster.tif (physical cluster habitat map in raster format)
FaunalClusterClip.tif (faunal cluster habitat map, clipped to study area, in raster format)
PhysicalClusterClip.tif (physical cluster habitat map, clipped to study area, in raster format)
Description of C5922DATASETFAM13022017REDACTED.csv
This file is based on the RSMP dataset (see https://www.cefas.co.uk/cefas-data-hub/dois/rsmp-baseline-dataset/), but with macrofaunal data output at the level of family or above. A variety of gear types have been used for sample collection, including grabs (0.1m2 Hamon, 0.2m2 Hamon, 0.1m2 Day, 0.1m2 Van Veen and 0.1m2 Smith-McIntyre) and cores. Of these various devices, 93% of samples were acquired using either a 0.1m2 Hamon grab or a 0.1m2 Day grab. Sieve sizes used in sample processing include 1mm and 0.5mm, reflecting the conventional preference for 1mm offshore and 0.5mm inshore (see Figure 2). Of the samples collected using either a 0.1m2 Hamon grab or a 0.1m2 Day grab, 88% were processed using a 1mm sieve. Taxon names were standardised according to the WoRMS (World Register of Marine Species) list using the Taxon Match Tool (http://www.marinespecies.org/aphia.php?p=match). Of the initial 13,449 taxon names, only 774 remained after correction and aggregation to family level. The final dataset comprises a single-sheet comma-separated values (.csv) file. Colonials accounted for less than 20% of the total number of taxa and, where present, were given a value of 1 in the dataset. This component of the fauna was missing from 325 out of the 777 surveys, reflecting either a true absence, or simply that colonial taxa were ignored by the analyst. Sediment particle size data were provided as percentage weight by sieve mesh size, with the dataset including 99 different sieve sizes. Sediment samples have been processed using sieve techniques, and a combination of sieve and laser diffraction techniques. Key metadata fields include: Sample coordinates (Latitude & Longitude), Survey Name, Gear, Date, Grab Sample Volume (litres) and Water Depth (m). A number of additional explanatory variables are also provided (salinity, temperature, chlorophyll a, Suspended particulate matter, Water depth, Wave Orbital Velocity, Average Current, Bed Stress). In total, the dataset dimensions are 33,198 rows (samples) x 900 columns (variables/factors), yielding a matrix of 29,878,200 individual data values.
The English Longitudinal Study of Ageing (ELSA) is a longitudinal survey of ageing and quality of life among older people that explores the dynamic relationships between health and functioning, social networks and participation, and economic position as people plan for, move into and progress beyond retirement. The main objectives of ELSA are to:
Further information may be found on the ELSA project website (https://www.elsa-project.ac.uk/) or the NatCen Social Research ELSA web pages.
Wave 11 data has been deposited - May 2025
For the 45th edition (May 2025), ELSA Wave 11 core and pension grid data and documentation were deposited. Users should note that this dataset version does not contain the survey weights. A version with the survey weights, along with IFS and financial derived datasets, will be deposited in due course. In the meantime, more information about the data collection, or the data collected during this wave of ELSA, can be found in the Wave 11 Technical Report or the User Guide.
Health conditions research with ELSA - June 2021
The ELSA Data team have found some issues with historical data measuring health conditions. If you are intending to do any analysis looking at the following health conditions, please read the ELSA User Guide or, if you still have questions, contact elsadata@natcen.ac.uk for advice on how you should approach your analysis. The affected conditions are: eye conditions (glaucoma; diabetic eye disease; macular degeneration; cataract), CVD conditions (high blood pressure; angina; heart attack; Congestive Heart Failure; heart murmur; abnormal heart rhythm; diabetes; stroke; high cholesterol; other heart trouble) and chronic health conditions (chronic lung disease; asthma; arthritis; osteoporosis; cancer; Parkinson's Disease; emotional, nervous or psychiatric problems; Alzheimer's Disease; dementia; malignant blood disorder; multiple sclerosis or motor neurone disease).
For information on obtaining data from ELSA that are not held at the UKDS, see the ELSA Genetic data access and Accessing ELSA data webpages.
Wave 10 Health data
Users should note that in Wave 10 the health section of the ELSA questionnaire was revised and all respondents were asked anew about their health conditions, rather than following the prior approach of asking those who had taken part in past waves to confirm previously recorded conditions. For this reason, the health conditions feed-forward data were not archived for Wave 10, unlike in previous waves.
Harmonized dataset:
Users of the Harmonized dataset who prefer to use the Stata version will need access to Stata MP software, as the version G3 file contains 11,779 variables (the limit for the standard Stata 'Intercooled' version is 2,047).
ELSA COVID-19 study:
A separate ad-hoc study conducted with ELSA respondents, measuring the socio-economic effects and psychological impact of the lockdown on the aged 50+ population of England, is also available under SN 8688, English Longitudinal Study of Ageing COVID-19 Study.
Attribution 4.0 (CC BY 4.0): https://creativecommons.org/licenses/by/4.0/
License information was derived automatically