Update 29-04-2020: The data is now split into two files based on the variable collection frequency (monthly and yearly). Additional variables added: area size in hectares, number of jobs in the area, number of people living in the area.
I have been inspired by Xavier and his work on Barcelona to explore the city of London! 🇬🇧 💂
The dataset is primarily centered on the housing market of London. However, it contains a lot of additional relevant data:
- Monthly average house prices
- Yearly number of houses
- Yearly number of houses sold
- Yearly percentage of households that recycle
- Yearly life satisfaction
- Yearly median salary of the residents of the area
- Yearly mean salary of the residents of the area
- Monthly number of crimes committed
- Yearly number of jobs
- Yearly number of people living in the area
- Area size in hectares
The data is split by areas of London called boroughs (a flag exists to identify these), but some of the variables have other geographical UK regions for reference (like England, North East, etc.). There have been no changes made to the data except for melting it into a long format from the original tables.
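As a rough illustration of what melting into a long format means, the sketch below turns a wide table (one column per area) into one row per date and area. The column names and prices are made up for the example and are not taken from the dataset; pandas offers the same reshaping through its `melt` function.

```python
# Hypothetical wide table: one row per date, one column per area.
wide_rows = [
    {"date": "2020-01-01", "camden": 850000, "hackney": 600000},
    {"date": "2020-02-01", "camden": 855000, "hackney": 602000},
]


def melt(rows, id_col, var_name, value_name):
    """Turn one column per area into one row per (id, area) pair."""
    long_rows = []
    for row in rows:
        for column, value in row.items():
            if column == id_col:
                continue  # the identifier is repeated on every output row
            long_rows.append(
                {id_col: row[id_col], var_name: column, value_name: value}
            )
    return long_rows


long_rows = melt(wide_rows, id_col="date", var_name="area", value_name="average_price")
for long_row in long_rows:
    print(long_row)
```

The long layout makes it straightforward to filter or group by area without knowing the column names in advance.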
The data has been extracted from London Datastore. It is released under UK Open Government License v2 and v3. The underlying datasets can be found here:
https://data.london.gov.uk/dataset/uk-house-price-index
https://data.london.gov.uk/dataset/number-and-density-of-dwellings-by-borough
https://data.london.gov.uk/dataset/subjective-personal-well-being-borough
https://data.london.gov.uk/dataset/household-waste-recycling-rates-borough
https://data.london.gov.uk/dataset/earnings-place-residence-borough
https://data.london.gov.uk/dataset/recorded_crime_summary
https://data.london.gov.uk/dataset/jobs-and-job-density-borough
https://data.london.gov.uk/dataset/ons-mid-year-population-estimates-custom-age-tables
Cover photo by Frans Ruiter from Unsplash
The dataset lends itself to extensive exploratory data analysis. It could also be framed as a supervised learning regression problem: predicting house price changes of different boroughs over time.
This page is no longer being updated. Please use the UK House Price Index instead. Mix-adjusted house prices, by new/pre-owned dwellings, type of buyer (first time buyer) and region, from February 2002 for London and UK, and average mix-adjusted prices by UK region, and long term Annual House Price Index data since 1969 for London. The ONS House Price Index is mix-adjusted to allow for differences between houses sold (for example type, number of rooms, location) in different months within a year. House prices are modelled using a combination of characteristics to produce a model containing around 100,000 cells (one such cell could be first-time buyer, old dwelling, one bedroom flat purchased in London). Each month estimated prices for all cells are produced by the model and then combined with their appropriate weight to produce mix-adjusted average prices. The index values are based on growth rates in the mix-adjusted average house prices and are annually chain linked. The weights used for mix-adjustment change at the start of each calendar year (i.e. in January). The mix-adjusted prices are therefore not comparable between calendar years, although they are comparable within each calendar year. If you wish to calculate change between years, you should use the mix-adjusted house price index, available in Table 33. The data published in these tables are based on a sub-sample of RMS data. These results will therefore differ from results produced using full sample data. For further information please contact the ONS using the contact details below. House prices, mortgage advances and incomes have been rounded to the nearest £1,000. Data taken from Table 2 and Table 9 of the monthly ONS release. Download from ONS website
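To make the comparability warning concrete, here is a minimal sketch of a mix-adjusted average as a weighted mean of cell prices. The two cells, their prices and the weights are invented for illustration, and the code does not reproduce the ONS model or its chain-linking procedure.

```python
def mix_adjusted_average(cell_prices, cell_weights):
    """Weighted mean of the estimated cell prices."""
    total_weight = sum(cell_weights.values())
    weighted_sum = sum(cell_prices[cell] * cell_weights[cell] for cell in cell_weights)
    return weighted_sum / total_weight


# Hypothetical estimated prices for two cells (a real model has around 100,000).
prices = {"first_time_buyer_flat": 300000.0, "owner_occupier_house": 500000.0}

# Hypothetical weights; the weights are revised each January.
weights_year_1 = {"first_time_buyer_flat": 0.5, "owner_occupier_house": 0.5}
weights_year_2 = {"first_time_buyer_flat": 0.25, "owner_occupier_house": 0.75}

average_year_1 = mix_adjusted_average(prices, weights_year_1)
average_year_2 = mix_adjusted_average(prices, weights_year_2)

print(average_year_1)  # 400000.0
print(average_year_2)  # 450000.0
```

The cell prices are identical in both calls, yet the average moves by £50,000 purely because the weights changed. This is why average prices should not be compared across calendar years and the index should be used instead.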
FOCUS ON LONDON 2011: HOUSING: A GROWING CITY

With the highest average incomes in the country but the least space to grow, demand for housing in London has long outstripped supply, resulting in higher housing costs and rising levels of overcrowding. The pressures of housing demand in London have grown in recent years, in part due to fewer people leaving London to buy homes in other regions. But while new supply during the recession held up better in London than in other regions, it needs to increase significantly in order to meet housing needs and reduce housing costs to more affordable levels. This edition of Focus on London authored by James Gleeson in the Housing Unit looks at housing trends in London, from the demand/supply imbalance to the consequences for affordability and housing need.

PRESENTATION: How much pressure is London’s popularity putting on housing provision in the capital? This interactive presentation looks at the effect on housing pressure of demographic changes, and recent new housing supply, shown by trends in overcrowding and house prices. Click on the start button at the bottom of the slide to access. View Focus on London - Housing: A Growing City on Prezi

FACTS: Some interesting facts from the data… ● Five boroughs with the highest proportion of households that have lived at their address for less than 12 months in 2009/10:
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘Housing Prices in London’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/arnavkulkarni/housing-prices-in-london on 30 September 2021.
--- Dataset description provided by original source is as follows ---
This dataset comprises various house listings in London and the neighbouring region. It also includes the parameters listed below, most of which are self-explanatory.
• Property Name
• Price
• House Type - Contains one of the following types of houses (House, Flat/Apartment, New Development, Duplex, Penthouse, Studio, Bungalow, Mews)
• Area in sq ft
• No. of Bedrooms
• No. of Bathrooms
• No. of Receptions
• Location
• City/County - Includes London, Essex, Middlesex, Hertfordshire, Kent, and Surrey.
• Postal Code
This dataset has various parameters for each house listing which can be used to conduct Exploratory Data Analysis. It can also be used to predict the house prices in various regions of London by means of Regression Analysis or other learning methods.
--- Original source retains full ownership of the source dataset ---
Our Price Paid Data includes information on all property sales in England and Wales that are sold for value and are lodged with us for registration.
Get up to date with the permitted use of our Price Paid Data:
check what to consider when using or publishing our Price Paid Data
If you use or publish our Price Paid Data, you must add the following attribution statement:
Contains HM Land Registry data © Crown copyright and database right 2021. This data is licensed under the Open Government Licence v3.0.
Price Paid Data is released under the Open Government Licence (OGL) (http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/). You need to make sure you understand the terms of the OGL before using the data.
Under the OGL, HM Land Registry permits you to use the Price Paid Data for commercial or non-commercial purposes. However, OGL does not cover the use of third party rights, which we are not authorised to license.
Price Paid Data contains address data processed against Ordnance Survey’s AddressBase Premium product, which incorporates Royal Mail’s PAF® database (Address Data). Royal Mail and Ordnance Survey permit your use of Address Data in the Price Paid Data:
If you want to use the Address Data in any other way, you must contact Royal Mail. Email address.management@royalmail.com.
The following fields comprise the address data included in Price Paid Data:
The August 2025 release includes:
As we will be adding to the August data in future releases, we would not recommend using it in isolation as an indication of market or HM Land Registry activity. When the full dataset is viewed alongside the data we’ve previously published, it adds to the overall picture of market activity.
Your use of Price Paid Data is governed by conditions and by downloading the data you are agreeing to those conditions.
Google Chrome (Chrome 88 onwards) is blocking downloads of our Price Paid Data. Please use another internet browser while we resolve this issue. We apologise for any inconvenience caused.
We update the data on the 20th working day of each month. You can download the:
These include standard and additional price paid data transactions received at HM Land Registry from 1 January 1995 to the most current monthly data.
The data is updated monthly and the average size of this file is 3.7 GB. You can download:
My first attempt at creating a dataset using Octoparse (data scraping).
The dataset contains the listing type (apartment, house, villa, etc.), price, link, and location.
Open Government Licence 3.0 http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Median price paid for residential property in England and Wales by property type and electoral ward. Annual data.
This repository is the third updated version of the attribute-linked residential property price dataset in UK Data Service ReShare 854240 (https://reshare.ukdataservice.ac.uk/854240/). As with the first updated version (ReShare 855033 https://reshare.ukdataservice.ac.uk/855033/) in 2021, this updated dataset contains individual property transactions and associated variables from both Land Registry Price Paid Dataset (LR PPD) and the Ministry for Housing, Communities and Local Government (MHCLG) Domestic Energy Performance Certificate (EPC) data. This is a linked result by address matching between LR-PPD data (1/1/1995-31/10/2023) and Domestic EPCs data (ending with 31/10/2023). It is the whole of the 2023 update dataset published in the Greater London Authority (GLA) London Datastore (https://data.london.gov.uk/dataset/house-price-per-square-metre-in-england-and-wales).
The linked dataset in this repository is the uncorrected version, recording over 21 million transactions with 107 variables in England and Wales between 1/1/1995 and 31/10/2023. We have offered technical validation and data cleaning code in UKDA ReShare 854240 to help users to evaluate the representation and clean up the data.
This repository covers the original LR PPD and Domestic EPCs for the linked data (house price per square metre dataset). Similar to the first updated version, a field header has been added in LR PPD. Six variables (LMK_KEY, address, address 1, address 2, address 3, postcode) in Domestic EPCs are removed. A unique identifier (id) is added in Domestic EPCs; this id is newly created for the Domestic EPCs (downloaded on 2/12/2023). It is not the same id as the id in the Domestic EPCs from UK Data Service ReShare 854240, ReShare 855033 or ReShare 856204. Since November 2021, DLUHC has published Domestic EPCs with the Unique Property Reference Number (UPRN), hence the dataset in this repository contains the UPRN information from the Domestic EPCs.
This house price per square metre dataset was created on 1/4/2021 and is based on the LR PPD, Domestic EPCs and NSPL downloaded on the same day. It covers over 18 million transactions with 104 variables in England and Wales between 1/1/1995 and 26/2/2021. 16 of the 104 variables come from the LR PPD, 84 variables come from Domestic EPCs, one variable (lad21cd) comes from NSPL, and three variables (i.e. id, classt, priceper) are created by the first author. Before the data linkage, a unique identifier (id) is created for all the unique EPCs after removing the individual lodgement identifier (i.e. LMK_KEY variable). During the data linkage, a variable named classt is created to identify 1:1 and 1:n linkage relationships. After the data linkage, a derived house price per square metre variable (i.e. priceper) is obtained by dividing the transaction price paid in the LR PPD by the total floor area variable in the EPC dataset. The NSPL (May 2021 version) is used to assign the local authority unit (lad21cd) to the house price per square metre dataset. During the data linkage process, the transactions in the LR PPD assigned as category B (Additional Price Paid entry) and other property types are removed. Unlike the previous version, this version of the dataset can be described as ‘uncorrected’, as we have not removed transactions with improbable price per square metre values (e.g. where total floor area values are null or 0). This uncorrected version of the data will offer the most flexibility for researchers. Researchers are recommended to clean the uncorrected version according to their research needs.

This repository covers an updated but uncorrected version of the attribute-linked residential property price dataset in UK Data Service ReShare 854240 (https://reshare.ukdataservice.ac.uk/854240/).
It is also the entire uncorrected version of the open access (limited attribute) house price per square metre dataset published by local authority in the Greater London Authority (GLA) London Datastore (https://data.london.gov.uk/dataset/house-price-per-square-metre-in-england-and-wales). This linked dataset contains individual property transactions and associated variables from the Land Registry Price Paid Dataset (LR PPD) linked at address level to all attributes, other than the individual lodgement identifier, address and postcode attributes, contained in Version VI of the Domestic Energy Performance Certificate (EPC) data published by the Ministry for Housing, Communities and Local Government (MHCLG). The linked data in this repository is the uncorrected version, recording over 18 million transactions with 104 variables in England and Wales between 1/1/1995 and 26/2/2021. We have offered technical validation and data cleaning code in UKDA ReShare 854240 to help users evaluate the representation of the linked data for a given time period. The data cleaning code shows our methods for cleaning up unlikely floor size records before using this data in analysis. Users can create their own rules and undertake this clean-up process based on their own experience and research aims. This repository also covers the original LR PPD and Domestic EPCs for the linked data (house price per square metre dataset). A field header has been added to the open access LR PPD in this repository. Six variables (individual lodgement identifier, address, address 1, address 2, address 3, postcode) have been removed from the Domestic EPCs in this repository, and a newly created unique identifier (id) has been added. This id column is newly created for Version VI Domestic EPCs and is not the same id as in the Domestic EPCs from UK Data Service ReShare 854240. The LR PPD dataset is open and available online (https://www.gov.uk/government/statistical-data-sets/price-paid-data-downloads).
The LR PPD records 25,914,817 transactions in England and Wales between 1/1/1995 and 26/02/2021. The Domestic Energy Performance Certificates (EPCs) dataset is open and available online from the Ministry for Housing, Communities and Local Government – MHCLG (https://epc.opendatacommunities.org/). The Domestic EPCs dataset downloaded on 1/4/2021 is the sixth released version. It contains EPCs issued between 1/10/2008 and 20/9/2020 and holds 18,575,357 energy performance data records with 85 fields. Both datasets contain property information at address level, but their address structures differ, so a four-stage matching method (251 matching rules) was designed to link them. Details of data linkage are published in a UCL Open Environment paper: (https://ucl.scienceopen.com/hosted-document?doi=10.14324/111.444/ucloe.000019). The linkage methodology to create this version of the data remains the same as that in UK Data Service ReShare service (https://reshare.ukdataservice.ac.uk/854942/).
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Housing Index in the United Kingdom decreased to 514.20 points in September from 515.60 points in August of 2025. This dataset provides - United Kingdom House Price Index - actual values, historical data, forecast, chart, statistics, economic calendar and news.
This house price per square metre dataset is created through complex address-based matching between the Land Registry’s Price Paid Data (LR-PPD) and property size information from the Domestic Energy Performance Certificates (EPC) data published by the Department for Levelling Up, Housing and Communities (DLUHC, formerly MHCLG). Details of the data linkage are published in the UCL Open: Environment along with the related linkage code via the UK Data Service ReShare repository.
During this data linkage process, the transactions assigned as category B (Additional Price Paid entry) and other property types are removed. Here we publish our latest limited attribute version of the uncorrected house price per square metre dataset in England and Wales with the LR-PPD data (1/1/1995-26/2/2021) and Domestic EPCs data (the sixth version, up to 20/9/2020) downloaded on 1/4/2021 for non-commercial purposes. This uncorrected version of the house price per square metre dataset records over 18 million transactions with 16 variables in England and Wales since 1995. Unlike in our published article, in this uncorrected version we have not removed transactions with improbable price per square metre values, i.e. where either the transaction price or total floor area values are null, 0 or too low to be realistic. This uncorrected version of the data will offer the most flexibility for researchers.
We offer technical validation and data cleaning code via the UKDA ReShare repository to help users evaluate the representation of the linked data for a given time period. The data cleaning code shows our methods for cleaning up unlikely floor size records before using this data in analysis. Users can create their own rules and undertake this clean-up process based on their own experience and research aims.
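As a minimal sketch of what such a user-defined clean-up rule might look like, the example below derives the price per square metre and drops records whose floor area is null, zero or below a threshold. The `tfarea` field name, the 9 square metre threshold and the sample records are assumptions made for illustration; this is not the authors' published cleaning code.

```python
# Hypothetical minimum plausible floor area in square metres (an assumption).
MIN_FLOOR_AREA = 9.0

# Made-up linked records; "tfarea" is an assumed name for total floor area.
records = [
    {"transactionid": "A", "price": 250000, "tfarea": 50.0},
    {"transactionid": "B", "price": 400000, "tfarea": 0.0},
    {"transactionid": "C", "price": 180000, "tfarea": None},
    {"transactionid": "D", "price": 300000, "tfarea": 3.0},
]


def price_per_sqm(price, floor_area):
    """Return price divided by floor area, or None when it cannot be derived."""
    if price is None or not floor_area:  # covers a null or zero floor area
        return None
    return price / floor_area


def clean(rows, min_floor_area=MIN_FLOOR_AREA):
    """Keep rows with a derivable price per square metre and a plausible floor area."""
    cleaned = []
    for row in rows:
        priceper = price_per_sqm(row["price"], row["tfarea"])
        if priceper is None or row["tfarea"] < min_floor_area:
            continue
        cleaned.append({**row, "priceper": priceper})
    return cleaned


cleaned = clean(records)
print(cleaned)
```

Only record A survives here. A stricter or looser threshold, or additional rules on price, can be swapped in to suit the research question.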
This limited attribute version is published by local authority (2021 version). Details of the 16 variables are described in the explanation file. The National Statistics Postcode Lookup NSPL (May 2021 version) is used to assign the local authority unit for your production of area-based statistics. Users can match historical changes in LA boundaries by choosing appropriate aggregations using, for instance, ONSPD and the postcode variable in our dataset.
An extended version of this dataset containing additional variables is available from UK Data Service Reshare service. Users can directly access this full version dataset (tranall_link_01042021.zip) via the following link: https://reshare.ukdataservice.ac.uk/855033/ . Accompanying LR-PPD and EPC data are also supplied through the ReShare service. Users who would like to attach their own additional variables from the LR-PPD data are advised to use the transactionid variable to link to the LR-PPD (LRPPD_01042021.zip). Users who would like to attach additional variables from the EPC data are advised to use the id variable to link to the sixth version Domestic EPCs (epc6_id.zip).
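The linking step described above amounts to a left join on the shared key. The sketch below shows the idea with plain dictionaries; the sample rows and the `town` attribute are invented, and reading the actual zip files is left out.

```python
# Made-up rows from the linked dataset and from the LR-PPD file.
linked_rows = [
    {"transactionid": "T1", "priceper": 5000.0},
    {"transactionid": "T2", "priceper": 4200.0},
]
lrppd_rows = [
    {"transactionid": "T1", "town": "LONDON"},
    {"transactionid": "T3", "town": "LEEDS"},
]


def left_join(left, right, key):
    """Attach the matching right-hand row's fields to every left-hand row."""
    lookup = {row[key]: row for row in right}
    joined = []
    for row in left:
        extra = lookup.get(row[key], {})  # no match: nothing is attached
        joined.append({**extra, **row})   # left-hand values win on conflicts
    return joined


joined_rows = left_join(linked_rows, lrppd_rows, key="transactionid")
for joined_row in joined_rows:
    print(joined_row)
```

The same function works for attaching EPC attributes by passing `key="id"` and the EPC rows as the right-hand side.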
The 2024 update
The 2024 updated version of the house price per square metre dataset extends the data coverage to the end of 2024 (hpm_la_2024.zip). This new version is the result of linking LR-PPD data (01/01/1995–31/10/2024) and Domestic EPCs data (up to 31/10/2024), downloaded on 26/12/2024 for non-commercial purposes. It records over 22 million transactions in England and Wales since 1995.
Unlike the previous versions, this update removes the id variable (created by the authors) and adds the lmk_key variable (originally from the Domestic EPCs dataset). This change was made because the lmk_key serves as a unique identifier with no duplicate records since 2024.
The match rate of the linked data varies over time; therefore, we recommend users carefully choose the time coverage and validate the data coverage using the match rate. Please note that publicly available Domestic EPCs data starts in 2008, resulting in an extremely low match rate for the period between 1995 and 2008.
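One simple way to check coverage before choosing a time window is to compute a yearly match rate. The sketch below assumes the match rate is the share of Price Paid transactions in a year that also appear in the linked data; that definition and the sample years are assumptions, not figures from the dataset.

```python
from collections import Counter


def match_rate_by_year(ppd_years, linked_years):
    """Share of PPD transactions per year that also appear in the linked data."""
    ppd_counts = Counter(ppd_years)
    linked_counts = Counter(linked_years)
    return {
        year: linked_counts[year] / total  # Counter gives 0 for missing years
        for year, total in sorted(ppd_counts.items())
    }


# Made-up transaction years: one entry per transaction.
ppd_years = [1996, 1996, 1996, 1996, 2015, 2015, 2015, 2015]
linked_years = [1996, 2015, 2015, 2015]

rates = match_rate_by_year(ppd_years, linked_years)
print(rates)  # {1996: 0.25, 2015: 0.75}
```

Years with a low rate can then be excluded or treated with caution in any analysis.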
The National Statistics Postcode Lookup (November 2024 version) is used to assign local authorities (2023 version).
Open Government Licence 3.0 http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Median price paid for residential property in England and Wales, for all property types by lower layer super output area. Annual data.
Land Registry Price Paid Data (PPD) have been published as open data since 2013. These data have been transformative for house price variation research in the UK as they are a comprehensive record of residential transactions at address level and cover the whole of England and Wales over a period dating back to 1995. Despite the utility of these data, a lack of attribute information relating to the properties, such as total floor area information, is identified as one of the major shortcomings of the PPD data. This means that the impacts of stock mix on broader price patterns cannot be fully accounted for. This research outlines one approach which addresses this deficiency by combining transaction information from the official open Land Registry Price Paid Data (PPD) with property size information from the official open Domestic Energy Performance Certificates (EPCs). A four-stage data linkage is created to generate a new linked dataset, representing 79% of the full market sales in the Land Registry PPD. This new linked dataset details 5,732,838 transactions in England and Wales between 2011 and 2019, along with each property's total floor area and the number of habitable rooms. Codes for other commonly used spatial units from Output Area to Local Authority are also included in the dataset. This offers greater flexibility for the exploration of house price variation in England and Wales at different spatial scales. The data collection includes the scripts used for linkage, as well as the resulting dataset.
Current residential house price variation research in the UK is limited by the lack of an open and comprehensive house price database that contains transaction prices alongside dwelling attributes such as size. This research outlines one approach which addresses this deficiency in England and Wales by combining transaction information from the official open Land Registry Price Paid Data (PPD) with property size information from the official open Domestic Energy Performance Certificates (EPCs). A four-stage data linkage is created to generate a new linked dataset, representing 79% of the full market sales in the Land Registry PPD. This new linked dataset offers greater flexibility for the exploration of house price (house price per square metre) variation in England and Wales at different spatial scales, from postcode unit upwards, between 2011 and 2019.
Open Government Licence 3.0 http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Summary of UK House Price Index (HPI) price statistics covering England, Scotland, Wales and Northern Ireland. Full UK HPI data are available on GOV.UK.
Through reading this publication you will: • gain an understanding of how house prices are set in economics terms, how they are measured, and why the cost of housing matters for London’s economy and its residents • see whether incomes and earnings in London have kept pace with the costs of home ownership in London, and see how affordability may be affected by future changes in interest rates • find out about the drivers of demand for residential property in London, and how the supply of homes has responded to changing conditions
Open Government Licence 3.0 http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Median monthly rental prices for the private rental market in England by bedroom category, region and administrative area, calculated using data from the Valuation Office Agency and Office for National Statistics.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Price to Rent Ratio in the United Kingdom decreased to 111.37 in the second quarter of 2025 from 113.72 in the first quarter of 2025. This dataset includes a chart with historical data for the United Kingdom Price to Rent Ratio.
http://reference.data.gov.uk/id/open-government-licence
The Index of Private Housing Rental Prices (IPHRP) is a quarterly experimental price index. It tracks the prices paid for renting property from private landlords in Great Britain.
IPHRP is produced from a number of administrative sources and is classified as experimental by ONS.
The index compares trends (rather than levels) in average private sector rents across English regions, Wales and Scotland. It uses a complex mix-adjustment and weighting process to produce a single index for each area. This index uses data on actual new and ongoing rents.
The sample ensures that the index is representative of the stock at regional level and that it isn't distorted by units dropping out of the sample because they switch to LHA or for other reasons. This is an advantage over the VOA dataset where the sample is changing over time and may not be representative.
Tables show monthly data. Data is updated once a quarter.
Index level (January 2011 = 100). Not seasonally adjusted.
See more on the ONS Website
Open Government Licence 3.0 http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Affordability ratios calculated by dividing house prices by gross annual residence-based earnings. Based on the median and lower quartiles of both house prices and earnings in England and Wales.
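The calculation described above can be sketched in a few lines. The house prices and earnings below are invented for illustration; the published ratios are built from official price and earnings statistics, not from small samples like these.

```python
from statistics import median


def affordability_ratio(house_price, annual_earnings):
    """House price divided by gross annual earnings."""
    return house_price / annual_earnings


# Made-up sale prices and gross annual residence-based earnings for one area.
sale_prices = [200000, 250000, 300000]
earnings = [25000, 30000, 40000]

median_ratio = affordability_ratio(median(sale_prices), median(earnings))
print(round(median_ratio, 2))  # 8.33
```

A ratio of about 8.3 means the median-priced home costs a little over eight times median annual earnings. The lower quartile version follows the same pattern, with the lower quartile of each series in place of the median.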