Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Existing Home Sales in the United States increased to 4100 Thousand in October from 4050 Thousand in September of 2025. This dataset provides the latest reported value for - United States Existing Home Sales - plus previous releases, historical high and low, short-term forecast and long-term prediction, economic calendar, survey consensus and news.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
https://raw.githubusercontent.com/Masterx-AI/Project_Housing_Price_Prediction_/main/hs.jpg" alt="">
A simple yet challenging project, to predict the housing price based on certain factors like house area, bedrooms, furnished, nearness to mainroad, etc. The dataset is small yet, it's complexity arises due to the fact that it has strong multicollinearity. Can you overcome these obstacles & build a decent predictive model?
Harrison, D. and Rubinfeld, D.L. (1978) Hedonic prices and the demand for clean air. J. Environ. Economics and Management 5, 81–102. Belsley D.A., Kuh, E. and Welsch, R.E. (1980) Regression Diagnostics. Identifying Influential Data and Sources of Collinearity. New York: Wiley.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
New Home Sales in the United States increased to 800 Thousand units in August from 664 Thousand units in July of 2025. This dataset provides the latest reported value for - United States New Home Sales - plus previous releases, historical high and low, short-term forecast and long-term prediction, economic calendar, survey consensus and news.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
The dataset contains 2000 rows of house-related data, representing various features that could influence house prices. Below, we discuss key aspects of the dataset, which include its structure, the choice of features, and potential use cases for analysis.
The dataset is designed to capture essential attributes for predicting house prices, including:
Area: Square footage of the house, which is generally one of the most important predictors of price. Bedrooms & Bathrooms: The number of rooms in a house significantly affects its value. Homes with more rooms tend to be priced higher. Floors: The number of floors in a house could indicate a larger, more luxurious home, potentially raising its price. Year Built: The age of the house can affect its condition and value. Newly built houses are generally more expensive than older ones. Location: Houses in desirable locations such as downtown or urban areas tend to be priced higher than those in suburban or rural areas. Condition: The current condition of the house is critical, as well-maintained houses (in 'Excellent' or 'Good' condition) will attract higher prices compared to houses in 'Fair' or 'Poor' condition. Garage: Availability of a garage can increase the price due to added convenience and space. Price: The target variable, representing the sale price of the house, used to train machine learning models to predict house prices based on the other features.
Area Distribution: The area of the houses in the dataset ranges from 500 to 5000 square feet, which allows analysis across different types of homes, from smaller apartments to larger luxury houses. Bedrooms and Bathrooms: The number of bedrooms varies from 1 to 5, and bathrooms from 1 to 4. This variance enables analysis of homes with different sizes and layouts. Floors: Houses in the dataset have between 1 and 3 floors. This feature could be useful for identifying the influence of multi-level homes on house prices. Year Built: The dataset contains houses built from 1900 to 2023, giving a wide range of house ages to analyze the effects of new vs. older construction. Location: There is a mix of urban, suburban, downtown, and rural locations. Urban and downtown homes may command higher prices due to proximity to amenities. Condition: Houses are labeled as 'Excellent', 'Good', 'Fair', or 'Poor'. This feature helps model the price differences based on the current state of the house. Price Distribution: Prices range between $50,000 and $1,000,000, offering a broad spectrum of property values. This range makes the dataset appropriate for predicting a wide variety of housing prices, from affordable homes to luxury properties.
3. Correlation Between Features
A key area of interest is the relationship between various features and house price: Area and Price: Typically, a strong positive correlation is expected between the size of the house (Area) and its price. Larger homes are likely to be more expensive. Location and Price: Location is another major factor. Houses in urban or downtown areas may show a higher price on average compared to suburban and rural locations. Condition and Price: The condition of the house should show a positive correlation with price. Houses in better condition should be priced higher, as they require less maintenance and repair. Year Built and Price: Newer houses might command a higher price due to better construction standards, modern amenities, and less wear-and-tear, but some older homes in good condition may retain historical value. Garage and Price: A house with a garage may be more expensive than one without, as it provides extra storage or parking space.
The dataset is well-suited for various machine learning and data analysis applications, including:
House Price Prediction: Using regression techniques, this dataset can be used to build a model to predict house prices based on the available features. Feature Importance Analysis: By using techniques such as feature importance ranking, data scientists can determine which features (e.g., location, area, or condition) have the greatest impact on house prices. Clustering: Clustering techniques like k-means could help identify patterns in the data, such as grouping houses into segments based on their characteristics (e.g., luxury homes, affordable homes). Market Segmentation: The dataset can be used to perform segmentation by location, price range, or house type to analyze trends in specific sub-markets, like luxury vs. affordable housing. Time-Based Analysis: By studying how house prices vary with the year built or the age of the house, analysts can derive insights into the trends of older vs. newer homes.
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset contains real-world property sales data from the UK, combining details from Rightmove and HM Land Registry.
You'll find: - A main property table (properties_main.csv) with info like type, location, latest price, and build type - A sale history table (price_history.csv) listing every known transaction for each property
🧠 This dataset is designed for learning and practice. It includes: - Messy fields (like missing bedrooms or bathroom info) - Currency values in text format (e.g. £280,000) - Linked tables via a unique property_id
Facebook
TwitterOur Price Paid Data includes information on all property sales in England and Wales that are sold for value and are lodged with us for registration.
Get up to date with the permitted use of our Price Paid Data:
check what to consider when using or publishing our Price Paid Data
If you use or publish our Price Paid Data, you must add the following attribution statement:
Contains HM Land Registry data © Crown copyright and database right 2021. This data is licensed under the Open Government Licence v3.0.
Price Paid Data is released under the http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/">Open Government Licence (OGL). You need to make sure you understand the terms of the OGL before using the data.
Under the OGL, HM Land Registry permits you to use the Price Paid Data for commercial or non-commercial purposes. However, OGL does not cover the use of third party rights, which we are not authorised to license.
Price Paid Data contains address data processed against Ordnance Survey’s AddressBase Premium product, which incorporates Royal Mail’s PAF® database (Address Data). Royal Mail and Ordnance Survey permit your use of Address Data in the Price Paid Data:
If you want to use the Address Data in any other way, you must contact Royal Mail. Email address.management@royalmail.com.
The following fields comprise the address data included in Price Paid Data:
The October 2025 release includes:
As we will be adding to the October data in future releases, we would not recommend using it in isolation as an indication of market or HM Land Registry activity. When the full dataset is viewed alongside the data we’ve previously published, it adds to the overall picture of market activity.
Your use of Price Paid Data is governed by conditions and by downloading the data you are agreeing to those conditions.
Google Chrome (Chrome 88 onwards) is blocking downloads of our Price Paid Data. Please use another internet browser while we resolve this issue. We apologise for any inconvenience caused.
We update the data on the 20th working day of each month. You can download the:
These include standard and additional price paid data transactions received at HM Land Registry from 1 January 1995 to the most current monthly data.
Your use of Price Paid Data is governed by conditions and by downloading the data you are agreeing to those conditions.
The data is updated monthly and the average size of this file is 3.7 GB, you can download:
Facebook
TwitterFor every real estate property in Arlington which has been sold, this dataset includes property sales information and can be associated with other Real Estate datasets by the RPC (RealEstatePropertyCode).
Facebook
TwitterDescription: This dataset provides historical housing prices scraped from Centaline Property Hong Kong, one of the largest real estate agencies in Hong Kong. The dataset includes information on the date of the transaction, the property address, floor plan, saleable area, unit rate, source, and district. The dataset covers a period of time spanning several years, allowing for analysis of trends and changes in the Hong Kong housing market.
Columns: Date: the date of the property transaction Address: the address of the property Floor Plan: -- Price: the price of the property Changes: any changes made to the property since the last transaction Saleable Area: the area of the property that can be sold to a buyer Unit Rate: the price per square foot of saleable area Source: the source of the data (Centaline Property Hong Kong/ Land Registry) District: the district in which the property is located in Hong Kong
Facebook
Twitterhttps://brightdata.com/licensehttps://brightdata.com/license
Gain a complete view of the real estate market with our Zillow datasets. Track price trends, rental/sale status, and price per square foot with the Zillow Price History dataset and explore detailed listings with prices, locations, and features using the Zillow Properties Listing dataset. Over 134M records available Price starts at $250/100K records Data formats are available in JSON, NDJSON, CSV, XLSX and Parquet. 100% ethical and compliant data collection Included datapoints:
Zpid
City
State
Home Status
Street Address
Zipcode
Home Type
Living Area Value
Bedrooms
Bathrooms
Price
Property Type
Date Sold
Annual Homeowners Insurance
Price Per Square Foot
Rent Zestimate
Tax Assessed Value
Zestimate
Home Values
Lot Area
Lot Area Unit
Living Area
Living Area Units
Property Tax Rate
Page View Count
Favorite Count
Time On Zillow
Time Zone
Abbreviated Address
Brokerage Name
And much more
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Single Family Home Prices in the United States increased to 415200 USD in October from 412300 USD in September of 2025. This dataset provides - United States Existing Single Family Home Prices- actual values, historical data, forecast, chart, statistics, economic calendar and news.
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This dataset contains data on all Real Property parcels that have sold since 2013 in Allegheny County, PA.
Before doing any market analysis on property sales, check the sales validation codes. Many property "sales" are not considered a valid representation of the true market value of the property. For example, when multiple lots are together on one deed with one price they are generally coded as invalid ("H") because the sale price for each parcel ID number indicates the total price paid for a group of parcels, not just for one parcel. See the Sales Validation Codes Dictionary for a complete explanation of valid and invalid sale codes.
Sales Transactions Disclaimer: Sales information is provided from the Allegheny County Department of Administrative Services, Real Estate Division. Content and validation codes are subject to change. Please review the Data Dictionary for details on included fields before each use. Property owners are not required by law to record a deed at the time of sale. Consequently the assessment system may not contain a complete sales history for every property and every sale. You may do a deed search at http://www.alleghenycounty.us/re/index.aspx directly for the most updated information. Note: Ordinance 3478-07 prohibits public access to search assessment records by owner name. It was signed by the Chief Executive in 2007.
Facebook
Twitterhttps://cdla.io/sharing-1-0/https://cdla.io/sharing-1-0/
The property listings dataset contains information about real estate properties available for sale or rent in Brazil. It includes details such as property type (apartment, house, commercial property), location (city, neighborhood), size (square footage, number of rooms), price, amenities, and contact information for the property owner or real estate agent. This dataset can be used for market analysis, property valuation, and identifying trends in the real estate market.
Sales and Rental Prices Dataset: The sales and rental prices dataset provides information about the prices of real estate properties in Brazil. It includes data on property transactions, including sale prices and rental prices per square meter or per month. This dataset can be used to analyze price trends, compare property prices across different regions, and identify areas with high or low real estate market demand.
Property Characteristics Dataset: The property characteristics dataset contains detailed information about the features and attributes of real estate properties. It includes data such as the number of bedrooms, bathrooms, parking spaces, floor plan, construction year, building amenities, and property condition. This dataset can be used for property classification, identifying popular property features, and evaluating property quality.
Geographical Data: Geographical data includes information about the location and spatial features of real estate properties in Brazil. It can include data such as latitude and longitude coordinates, zoning information, proximity to amenities (schools, hospitals, parks), and neighborhood demographics. This dataset can be used for spatial analysis, identifying hotspots or desirable locations, and understanding the neighborhood characteristics.
Property Market Trends Dataset: The property market trends dataset provides information about market conditions and trends in the real estate sector in Brazil. It includes data such as the number of property listings, average time on the market, price fluctuations, mortgage interest rates, and economic indicators that impact the real estate market. This dataset can be used for market forecasting, understanding market dynamics, and making informed investment decisions.
Real Estate Regulatory Data: Real estate regulatory data includes information about legal and regulatory aspects of the real estate sector in Brazil. It can include data on property ownership, property taxes, zoning regulations, building permits, and legal restrictions on property transactions. This dataset can be used for legal compliance, understanding property ownership rights, and assessing the legal framework for real estate transactions.
Historical Data: Historical real estate data includes past records and trends of property prices, market conditions, and sales volumes in Brazil. This dataset can span several years and can be used to analyze long-term market trends, compare current market conditions with historical data, and assess the performance of the real estate market over time.
Facebook
Twitterhttps://fred.stlouisfed.org/legal/#copyright-public-domainhttps://fred.stlouisfed.org/legal/#copyright-public-domain
Graph and download economic data for Median Sales Price of Houses Sold for the United States (MSPUS) from Q1 1963 to Q2 2025 about sales, median, housing, and USA.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
There are two files: - nsw_property_data.csv - Property data in NSW from 2001 - 3rd of April 2023 - nsw_property_archived_data.csv - Property data in NSW from 1990 - 2000
Objective - Property data is difficult to come by these days. Luckily in New South Wales - Australia, the NSW State Government has provided public dataset of the transactional property sales data (See link below) - The objective is to create a clean / comprehensive dataset with historical information of the property information in NSW Australia, based on the raw data provided by the government - Please reach out to me to provide any feedbacks / improvements and I will try my best to update the dataset as soon as possible
Disclaimer - This is a personal, non-profit project that is intended for the public to access datasets, which can potentially help people make decisions when analysing on the property market.
Copyright - NSW Property Sales Data: © Updated 24/04/2023. Crown in right of NSW through the Valuer General 2023
Data Source NSW data source
Facebook
TwitterThe table Historical Property 08 is part of the dataset Cotality Smart Data Platform: Historical Property, available at https://stanford.redivis.com/datasets/e9sx-cn4k3cyva. It contains 149059118 rows across 220 variables.
Facebook
TwitterThe table Historical Property 05 is part of the dataset Cotality Smart Data Platform: Historical Property, available at https://stanford.redivis.com/datasets/e9sx-cn4k3cyva. It contains 151169051 rows across 220 variables.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Average House Prices in the United States increased to 534100 USD in August from 478200 USD in July of 2025. This dataset includes a chart with historical data for the United States New Home Average Sales Price.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Update Frequency: Yearly
Access to Residential, Condominium, Commercial, Apartment properties and vacant land sales history data.
To download XML and JSON files, click the CSV option below and click the down arrow next to the Download button in the upper right on its page.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
House Price Index YoY in the United States decreased to 1.70 percent in September from 2.40 percent in August of 2025. This dataset includes a chart with historical data for the United States FHFA House Price Index YoY.
Facebook
Twitterhttps://brightdata.com/licensehttps://brightdata.com/license
The Zoopla Dataset provides a detailed repository of information covering property listings available on the Zoopla platform. Tailored to support businesses, researchers, and analysts in the real estate sector, this dataset delivers valuable insights into market trends, property valuations, and consumer preferences within the real estate market.
With key attributes such as property details, pricing data, location information, and listing history, users can conduct thorough analyses to refine property investment strategies, assess market demand, and identify emerging trends.
Whether you're a real estate agent seeking to enhance your property listings, a researcher investigating trends in the housing market, or an analyst aiming to refine investment strategies, the Zoopla Dataset serves as an essential resource for unlocking opportunities and driving success in the competitive landscape of real estate
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Existing Home Sales in the United States increased to 4100 Thousand in October from 4050 Thousand in September of 2025. This dataset provides the latest reported value for - United States Existing Home Sales - plus previous releases, historical high and low, short-term forecast and long-term prediction, economic calendar, survey consensus and news.