Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Detailed Real Estate Data for Predicting House Prices and Analyzing Market Trends
This dataset contains information on 21,613 properties, making it a comprehensive resource for exploring real estate market trends and building predictive models for house prices. The data includes various features capturing property details, location, and market conditions, providing ample opportunities for data exploration, visualization, and machine learning applications.
General Information:
id: Unique identifier for each property. date: Date of sale. Price Details:
price: Sale price of the house. Property Features:
bedrooms: Number of bedrooms. bathrooms: Number of bathrooms (including partials as fractions). sqft_living: Living space area in square feet. sqft_lot: Lot size in square feet. floors: Number of floors. waterfront: Whether the property has a waterfront view. view: Quality of the view rating. condition: Overall condition of the house. grade: Grade of construction and design (scale of 1–13). Additional Metrics:
sqft_above: Square footage of the property above ground. sqft_basement: Basement area in square feet. yr_built: Year the property was built. yr_renovated: Year of last renovation. Location Coordinates:
zipcode: ZIP code of the property. lat and long: Latitude and longitude coordinates. Neighbor Comparisons:
sqft_living15: Average living space of 15 nearest properties. sqft_lot15: Average lot size of 15 nearest properties. This dataset is a valuable resource for anyone interested in real estate analytics, machine learning, or geographic data visualization.
Facebook
Twitterhttp://opendatacommons.org/licenses/dbcl/1.0/http://opendatacommons.org/licenses/dbcl/1.0/
The purpose of this dataset is to provide updated data on the Zillow Observed Rent Index (ZORI). Most of the Zillow datasets on Kaggle have not been updated in four years, and no other dataset except one contains information related to rent. Providing updated data on this will also allow the community to analyze the effects of COVID-19 on rent prices, which could not be done with previous available data sets.
Zillow Observed Rent Index (ZORI): A smoothed measure of the typical observed market rate rent across a given region. ZORI is a repeat-rent index that is weighted to the rental housing stock to ensure representativeness across the entire market, not just those homes currently listed for-rent. The index is dollar-denominated by computing the mean of listed rents that fall into the 40th to 60th percentile range for all homes and apartments in a given region, which is once again weighted to reflect the rental housing stock. Details available in ZORI methodology. https://www.zillow.com/research/methodology-zori-repeat-rent-27092/
This dataset contains two files. The Metro dataset looks at the median rent prices for large US cities. The ZIP code dataset breaks the US cities down by their ZIP codes. Note that the region IDs in both datasets are only used for tracking purposes. Also, some of the ZIP codes under the Region Name are less than the standard five-digit zip code and unreliable. Even if you add zeros in accounting for possible formatting mistakes. It is recommended to remove these entries since there is no way to identify which ZIP code the entry actually represents. These entries are left in here in case some analyst can solve the issue.
Zillow provides many useful open source datasets that relate to housing, which can be found at Zillow Research Data. https://www.zillow.com/research/data/ This dataset was also prompted by an older dataset I came across that only lacked updated data. https://www.kaggle.com/zillow/rent-index Thumbnail and banner picture is from this pixabay artist https://pixabay.com/users/pexels-2286921/
Facebook
TwitterRents for industrial real estate in the U.S. have increased since 2017, with flexible/service space reaching the highest price per square foot in 2024. In just a year, the cost of, flex/service space rose by nearly *****U.S. dollars per square foot. Manufacturing facilities, warehouses, and distribution centers had lower rents and experienced milder growth. Los Angeles, Orange County, and Inland Empire, California, are some of the most expensive markets in the country. Office real estate is pricier Industrial real estate is far from being the most expensive commercial property type. For instance, average rental rates in major U.S. metros for office space are much higher than those for industrial space. This is most likely because office units are generally located in urban areas where there is limited space and thus higher demand, whereas industrial units are more suited to the outskirts of such urban areas. Industrial units, such as warehouses or factories, require much more space because they need to house large, heavy equipment or serve as a storage unit for future shipments. Big-box distribution space is gaining in importance Warehouses and distribution may currently command the lowest average rent per square foot among industrial space types, but the growing popularity of the asset class has earned it considerable gains over the past years. In 2021 and 2022, high occupier demand and insufficient supply led to soaring taking rent of big-box buildings. During that time, the vacancy rate of distribution centers fell below ****percent. The development of industrial and logistics facilities has accelerated since then, with the new supply coming to market, causing the vacancy rate to increase and the pressures on rent to ease.
Facebook
TwitterThis residential real estate data set was created by Redfin, an online real estate brokerage. Published on January 9th, 2022, this data summarize the monthly housing market for every State, Metro, and Zip code in the US from 2012 to 2021. Redfin aggregated this data across multiple listing services and has been gracious enough to include property type in their reporting. Please properly cite and link to RedFin if you end up using this data for your research or project.
Source: RedFin Data Center
Property type defined by RedFin
Source: Building Types
For more definitions, please visit RedFin Data Center Metrics
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
https://raw.githubusercontent.com/Masterx-AI/Project_Housing_Price_Prediction_/main/hs.jpg" alt="">
A simple yet challenging project, to predict the housing price based on certain factors like house area, bedrooms, furnished, nearness to mainroad, etc. The dataset is small yet, it's complexity arises due to the fact that it has strong multicollinearity. Can you overcome these obstacles & build a decent predictive model?
Harrison, D. and Rubinfeld, D.L. (1978) Hedonic prices and the demand for clean air. J. Environ. Economics and Management 5, 81–102. Belsley D.A., Kuh, E. and Welsch, R.E. (1980) Regression Diagnostics. Identifying Influential Data and Sources of Collinearity. New York: Wiley.
Facebook
TwitterOur Price Paid Data includes information on all property sales in England and Wales that are sold for value and are lodged with us for registration.
Get up to date with the permitted use of our Price Paid Data:
check what to consider when using or publishing our Price Paid Data
If you use or publish our Price Paid Data, you must add the following attribution statement:
Contains HM Land Registry data © Crown copyright and database right 2021. This data is licensed under the Open Government Licence v3.0.
Price Paid Data is released under the http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/">Open Government Licence (OGL). You need to make sure you understand the terms of the OGL before using the data.
Under the OGL, HM Land Registry permits you to use the Price Paid Data for commercial or non-commercial purposes. However, OGL does not cover the use of third party rights, which we are not authorised to license.
Price Paid Data contains address data processed against Ordnance Survey’s AddressBase Premium product, which incorporates Royal Mail’s PAF® database (Address Data). Royal Mail and Ordnance Survey permit your use of Address Data in the Price Paid Data:
If you want to use the Address Data in any other way, you must contact Royal Mail. Email address.management@royalmail.com.
The following fields comprise the address data included in Price Paid Data:
The October 2025 release includes:
As we will be adding to the October data in future releases, we would not recommend using it in isolation as an indication of market or HM Land Registry activity. When the full dataset is viewed alongside the data we’ve previously published, it adds to the overall picture of market activity.
Your use of Price Paid Data is governed by conditions and by downloading the data you are agreeing to those conditions.
Google Chrome (Chrome 88 onwards) is blocking downloads of our Price Paid Data. Please use another internet browser while we resolve this issue. We apologise for any inconvenience caused.
We update the data on the 20th working day of each month. You can download the:
These include standard and additional price paid data transactions received at HM Land Registry from 1 January 1995 to the most current monthly data.
Your use of Price Paid Data is governed by conditions and by downloading the data you are agreeing to those conditions.
The data is updated monthly and the average size of this file is 3.7 GB, you can download:
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset expands upon the original London Property Listings by including additional attributes to facilitate deeper analysis of rental properties in London. It is ideal for research and projects related to real estate trends, price categorization, and area-wise analysis in one of the world's busiest markets.
This dataset was prepared and uploaded by Mehmet Emre Sezer. It is intended for educational and non-commercial use.
Facebook
TwitterData, statistics and adopted local options related to property taxes
Facebook
TwitterZillow operates an industry-leading economics and analytics bureau led by Zillow’s Chief Economist, Dr. Stan Humphries. At Zillow, Dr. Humphries and his team of economists and data analysts produce extensive housing data and analysis covering more than 500 markets nationwide. Zillow Research produces various real estate, rental and mortgage-related metrics and publishes unique analyses on current topics and trends affecting the housing market.
At Zillow’s core is our living database of more than 100 million U.S. homes, featuring both public and user-generated information including number of bedrooms and bathrooms, tax assessments, home sales and listing data of homes for sale and for rent. This data allows us to calculate, among other indicators, the Zestimate, a highly accurate, automated, estimated value of almost every home in the country as well as the Zillow Home Value Index and Zillow Rent Index, leading measures of median home values and rents.
The Zillow Rent Index is the median estimated monthly rental price for a given area, and covers multifamily, single family, condominium, and cooperative homes in Zillow’s database, regardless of whether they are currently listed for rent. It is expressed in dollars and is seasonally adjusted. The Zillow Rent Index is published at the national, state, metro, county, city, neighborhood, and zip code levels.
Zillow produces rent estimates (Rent Zestimates) based on proprietary statistical and machine learning models. Within each county or state, the models observe recent rental listings and learn the relative contribution of various home attributes in predicting prevailing rents. These home attributes include physical facts about the home, prior sale transactions, tax assessment information and geographic location as well as the estimated market value of the home (Zestimate). Based on the patterns learned, these models estimate rental prices on all homes, including those not presently for rent. Because of the availability of Zillow rental listing data used to train the models, Rent Zestimates are only available back to November 2010; therefore, each ZRI time series starts on the same date.
The rent index data was calculated from Zillow's proprietary Rent Zestimates and published on its website.
What city has the highest and lowest rental prices in the country? Which metropolitan area is the most expensive to live in? Where have rental prices increased in the past five years and where have they remained the same? What city or state has the lowest cost per square foot?
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Detailed Real Estate Data for Predicting House Prices and Analyzing Market Trends
This dataset contains information on 21,613 properties, making it a comprehensive resource for exploring real estate market trends and building predictive models for house prices. The data includes various features capturing property details, location, and market conditions, providing ample opportunities for data exploration, visualization, and machine learning applications.
General Information:
id: Unique identifier for each property. date: Date of sale. Price Details:
price: Sale price of the house. Property Features:
bedrooms: Number of bedrooms. bathrooms: Number of bathrooms (including partials as fractions). sqft_living: Living space area in square feet. sqft_lot: Lot size in square feet. floors: Number of floors. waterfront: Whether the property has a waterfront view. view: Quality of the view rating. condition: Overall condition of the house. grade: Grade of construction and design (scale of 1–13). Additional Metrics:
sqft_above: Square footage of the property above ground. sqft_basement: Basement area in square feet. yr_built: Year the property was built. yr_renovated: Year of last renovation. Location Coordinates:
zipcode: ZIP code of the property. lat and long: Latitude and longitude coordinates. Neighbor Comparisons:
sqft_living15: Average living space of 15 nearest properties. sqft_lot15: Average lot size of 15 nearest properties. This dataset is a valuable resource for anyone interested in real estate analytics, machine learning, or geographic data visualization.