https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
House price prediction dataset
This dataset comprises housing data for various metropolitan cities of India. It includes: - Collection of prices of new and resale houses - The amenities provided for each house
This housing dataset is useful for a range of stakeholders, including real estate agents, property developers, buyers, renters, and researchers interested in analyzing housing markets and trends in metropolitan cities across India. It can be used for market analysis, price prediction, property recommendations, and various other real estate-related tasks.
Shape of dataset : (6207, 40)
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F11965067%2F75861c40e86a4d2d10c044be79542436%2FCapture.JPG?generation=1704918894425981&alt=media" alt="">
Github Link : https://github.com/TusharPaul01/House-Price-Prediction
For more such dataset & code check : https://www.kaggle.com/tusharpaul2001
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Overview: This dataset was collected and curated to support research on predicting real estate prices using machine learning algorithms, specifically Support Vector Regression (SVR) and Gradient Boosting Machine (GBM). The dataset includes comprehensive information on residential properties, enabling the development and evaluation of predictive models for accurate and transparent real estate appraisals.Data Source: The data was sourced from Department of Lands and Survey real estate listings.Features: The dataset contains the following key attributes for each property:Area (in square meters): The total living area of the property.Floor Number: The floor on which the property is located.Location: Geographic coordinates or city/region where the property is situated.Type of Apartment: The classification of the property, such as studio, one-bedroom, two-bedroom, etc.Number of Bathrooms: The total number of bathrooms in the property.Number of Bedrooms: The total number of bedrooms in the property.Property Age (in years): The number of years since the property was constructed.Property Condition: A categorical variable indicating the condition of the property (e.g., new, good, fair, needs renovation).Proximity to Amenities: The distance to nearby amenities such as schools, hospitals, shopping centers, and public transportation.Market Price (target variable): The actual sale price or listed price of the property.Data Preprocessing:Normalization: Numeric features such as area and proximity to amenities were normalized to ensure consistency and improve model performance.Categorical Encoding: Categorical features like property condition and type of apartment were encoded using one-hot encoding or label encoding, depending on the specific model requirements.Missing Values: Missing data points were handled using appropriate imputation techniques or by excluding records with significant missing information.Usage: This dataset was utilized to train and test machine learning models, aiming to predict the market price of residential properties based on the provided attributes. The models developed using this dataset demonstrated improved accuracy and transparency over traditional appraisal methods.Dataset Availability: The dataset is available for public use under the [CC BY 4.0]. Users are encouraged to cite the related publication when using the data in their research or applications.Citation: If you use this dataset in your research, please cite the following publication:[Real Estate Decision-Making: Precision in Price Prediction through Advanced Machine Learning Algorithms].
This dataset is used for predicting house prices from both images and textual information. It is composed of 535 sample houses from California, USA.
This dataset was created by Sohaib Anwaar
The California Housing dataset is based on 1990 US census and is widely used for machine learning and statistics. It was published in 1990 by Pace, R. Kelley and Ronald Barry, and can be found in the UCI Machine Learning Repository. The California Data set gives the information about Economic and Geographic values of the Houses,and also the economic status of the people present in the California.
Just as in many other countries, the housing market in the UK grew substantially during the coronavirus pandemic, fueled by robust demand and low borrowing costs. Nevertheless, high inflation and the increase in mortgage rates has led to house price growth slowing down. According to the forecast, 2024 is expected to see house prices decrease by ***** percent. Between 2024 and 2028, the average house price growth is projected at *** percent. A contraction after a period of continuous growth In June 2022, the UK's house price index exceeded *** index points, meaning that since 2015 which was the base year for the index, house prices had increased by ** percent. In just two years, between 2020 and 2022, the index surged by ** index points. As the market stood in December 2023, the average price for a home stood at approximately ******* British pounds. Rents are expected to continue to grow According to another forecast, the prime residential market is also expected to see rental prices grow in the next years. Growth is forecast to be stronger in 2024 and slow down in the period between 2025 and 2028. The rental market in London is expected to follow a similar trend, with Central London slightly outperforming Greater London.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Housing Index in the United States decreased to 434.90 points in April from 436.70 points in March of 2025. This dataset provides the latest reported value for - United States House Price Index MoM Change - plus previous releases, historical high and low, short-term forecast and long-term prediction, economic calendar, survey consensus and news.
ttd22/house-price dataset hosted on Hugging Face and contributed by the HF Datasets community
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Hedonic Price Model, used in existing house price modeling, may not address the relationship between house prices and streetscapes perceived at the human eye level. Therefore, in this study, we analyzed the relationship between streetscapes perceived at eye level and single-family home prices in Seoul, Korea, using computer vision technology and machine learning algorithms. We used transaction data for 13,776 single-family housing sales between 2017 and 2019. To measure visually perceived streetscapes, this study used the Deeplab V3+ deep-learning model with 233,106 Google Street View panoramic images. Then, the best machine-learning model was selected by comparing the explanatory powers of the hedonic price model and all alternative machine-learning models. According to the results, the Gradient Boost model, a representative ensemble machine learning model, performed better than XGBoost, Random Forest, and Linear Regression models in predicting single-family house prices. In addition, this study used an interpretable machine learning model of the SHAP method to identify key features that affect single-family home price prediction. This solves the "black box" problem of machine learning models. Finally, by analyzing the nonlinear relationship and interaction effects between perceived streetscape characteristics and house prices, we easily and quickly identified the relationship between variables the hedonic price model partially considers.
Problem Statement 👉 Download the case studies here Investors and buyers in the real estate market faced challenges in accurately assessing property values and market trends. Traditional valuation methods were time-consuming and lacked precision, making it difficult to make informed investment decisions. A real estate firm sought a predictive analytics solution to provide accurate property price forecasts and market insights. Challenge Developing a real estate price prediction system involved… See the full description on the dataset page: https://huggingface.co/datasets/globosetechnology12/Real-Estate-Price-Prediction.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Single Family Home Prices in the United States increased to 422800 USD in May from 414000 USD in April of 2025. This dataset provides - United States Existing Single Family Home Prices- actual values, historical data, forecast, chart, statistics, economic calendar and news.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘Jiffs house price prediction dataset’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/elakiricoder/jiffs-house-price-prediction-dataset on 13 February 2022.
--- Dataset description provided by original source is as follows ---
I have previously shared a classification based dataset to classify the gender which is liked by those who are new to machine learning as it give a pretty good accuracy, which encouraged me to create a regression dataset to predict continues values. I have tried many real world datasets for regression problems which are predicting with lower accuracy and high error rate. As a beginner, I have struggled and worried why and how the dataset performs poorly. This is another main reason why I created this dataset. Although this is a made up dataset, I have considered all the features when deciding the price of the property. If you are a beginner, you would love to try this as the results are stunning..
Since this is a populated data, I will straightaway explain the features and the label. FEATURES 1. land_size_sqm - This the total size of the land in square meters. 2. house_size_sqm - This is the area in which house is located within the land. This is measured in square meters. 3. no_of_rooms - This indicates the number of rooms available in the house. 4. no_of_bathrooms - This shows the number of total bathrooms made in the house. 5. large_living_room - This indicates whether the house includes a larger living room or not. The assumption is that all the houses contain a living room. This feature attempts to classify whether it's large or small where '1' means large and '0' means small. However in the categorical dataset, 1 and 0 are represented with 'yes' and 'No' respectively. 6. parking_space - This indicates whether there is a parking space or not. '1' represents the parking available while '0' represents no parking space available. However in the categorical dataset, 1 and 0 are represented with 'yes' and 'No' respectively. 7. front_garden - This shows whether there is a garden available in front of the house. '1' means the garden available and '0' means no garden available. However in the categorical dataset, 1 and 0 are represented with 'yes' and 'No' respectively. 8. swimming_pool - This shows the availability of the swimming pool at the house. 1 represents the availability of the swimming pool while 0 represents the non availability of the same. However in the categorical dataset, 1 and 0 are represented with 'yes' and 'No' respectively. 9. distance_to_school_km - This shows the distance from the house to the nearest school in Kilometers. 10. wall_fence - This shows whether there is a wall fence or not. '1' mean there is wall fence and '0' means no wall fence. However in the categorical dataset, 1 and 0 are represented with 'yes' and 'No' respectively. 11. **house_age_or_renovated **- This is either the age of the house in years or the period from the date of renovation. 12. water_front - this indicates whether the house is located in front of the water or not. 1 means waterfront and 0 means its not located near the water. However in the categorical dataset, 1 and 0 are represented with 'yes' and 'No' respectively. 13. distance_to_supermarket_km - what is the distance to the nearest supermarket in kilometers.
LABEL property_value - This is the price of the property
Following features are only available in the "house price dataset original v2 cleaned" and "house price dataset original v2 with categorical features" data only. 14. crime_rate - its in float and falls between 0 and 7. lesser the better 15. room_size - As the name suggests, it explains the size of the room. 0 is being 'small', 1 is being 'medium', 2 is 'large' and 3 is being 'Extra large'. However in the categorical dataset, these values are categorical and self explanatory.
I spent around 3 hours creating this dataset. Enjoy..
Share your notebooks to see which algorithm predicts the house price precisely.
--- Original source retains full ownership of the source dataset ---
The UK House Price Index is a National Statistic.
Download the full UK House Price Index data below, or use our tool to https://landregistry.data.gov.uk/app/ukhpi?utm_medium=GOV.UK&utm_source=datadownload&utm_campaign=tool&utm_term=9.30_18_08_21" class="govuk-link">create your own bespoke reports.
Datasets are available as CSV files. Find out about republishing and making use of the data.
Google Chrome is blocking downloads of our UK HPI data files (Chrome 88 onwards). Please use another internet browser while we resolve this issue. We apologise for any inconvenience caused.
This file includes a derived back series for the new UK HPI. Under the UK HPI, data is available from 1995 for England and Wales, 2004 for Scotland and 2005 for Northern Ireland. A longer back series has been derived by using the historic path of the Office for National Statistics HPI to construct a series back to 1968.
Download the full UK HPI background file:
If you are interested in a specific attribute, we have separated them into these CSV files:
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/Average-prices-2021-06.csv?utm_medium=GOV.UK&utm_source=datadownload&utm_campaign=average_price&utm_term=9.30_18_08_21" class="govuk-link">Average price (CSV, 9.2MB)
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/Average-prices-Property-Type-2021-06.csv?utm_medium=GOV.UK&utm_source=datadownload&utm_campaign=average_price_property_price&utm_term=9.30_18_08_21" class="govuk-link">Average price by property type (CSV, 28MB)
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/Sales-2021-06.csv?utm_medium=GOV.UK&utm_source=datadownload&utm_campaign=sales&utm_term=9.30_18_08_21" class="govuk-link">Sales (CSV, 4.7MB)
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/Cash-mortgage-sales-2021-06.csv?utm_medium=GOV.UK&utm_source=datadownload&utm_campaign=cash_mortgage-sales&utm_term=9.30_18_08_21" class="govuk-link">Cash mortgage sales (CSV, 6.1MB)
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/First-Time-Buyer-Former-Owner-Occupied-2021-06.csv?utm_medium=GOV.UK&utm_source=datadownload&utm_campaign=FTNFOO&utm_term=9.30_18_08_21" class="govuk-link">First time buyer and former owner occupier (CSV, 5.9MB)
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/New-and-Old-2021-06.csv?utm_medium=GOV.UK&utm_source=datadownload&utm_campaign=new_build&utm_term=9.30_18_08_21" class="govuk-link">New build and existing resold property (CSV, 17MB)
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/Indices-2021-06.csv?utm_medium=GOV.UK&utm_source=datadownload&utm_campaign=index&utm_term=9.30_18_08_21" class="govuk-link">Index (CSV, 6MB)
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/Indices-seasonally-adjusted-2021-06.csv?utm_medium=GOV.UK&utm_source=datadownload&utm_campaign=index_season_adjusted&utm_term=9.30_18_08_21" class="govuk-link">Index seasonally adjusted (CSV, 192KB)
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/Average-price-seasonally-adjusted-2021-06.csv?utm_medium=GOV.UK&utm_source=datadownload&utm_campaign=average-price_season_adjusted&utm_term=9.30_18_08_21" class="govuk-link">Average price seasonally adjusted</a
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Housing Index in China decreased by 3.20 percent in June from -3.50 percent in May of 2025. This dataset provides the latest reported value for - China Newly Built House Prices YoY Change - plus previous releases, historical high and low, short-term forecast and long-term prediction, economic calendar, survey consensus and news.
House prices in Spain are forecast to fall in 2024, after increasing by 1.2 percent in 2023. Nevertheless, prices are expected to pick up in 2025, with an increase of one percent. The Portuguese housing market, on the other hand, grew by 5.5 percent in 2023, but was forecast to contract in the next two years.
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
S&P/Case-Shiller home price index and 12 demographic and macroeconomic factors in five metropolitan areas: Boston, Dallas, New York, Chicago, and San Francisco (SF) data were collected from the Federal Reserve Bank, FBI, and Freddie Mac. https://fred.stlouisfed.org; http://www.freddiemac.com/pmms/; https://www.philadelphiafed.org/surveys-and-data/community-development-data/consumer-credit-explorer; https://ucr.fbi.gov/crime-in-the-u.s/2005;
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Housing Index in Spain increased to 2033 EUR/SQ. METRE in the first quarter of 2025 from 1972.10 EUR/SQ. METRE in the fourth quarter of 2024. This dataset provides the latest reported value for - Spain House Prices - plus previous releases, historical high and low, short-term forecast and long-term prediction, economic calendar, survey consensus and news.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘Amsterdam House Price Prediction’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/thomasnibb/amsterdam-house-price-prediction on 28 January 2022.
--- Dataset description provided by original source is as follows ---
If you are like me, you might get overwhelmed when having to make big decisions such as buying a house. In such cases, I always like to go for a data driven approach, that will help me find an optimum solution. This involves two steps. First, we need to gather as much data as we can. Second, we need to define a metric for success.
Gathering housing prices requires some effort. A caveat is that the asking prices are not the prices to which the houses were actually sold. Defining a metric for success is somewhat subjective. I consider a house to be a good option if the house price is cheap compared to other listings in the area.
The housing prices have been obtained from Pararius.nl as a snapshot in August 2021. The original data provided features such as price, floor area and the number of rooms. The data has been further enhanced by utilising the Mapbox API to obtain the coordinates of each listing.
Thanks to Pararius
--- Original source retains full ownership of the source dataset ---
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset was created by Richard Sikaonga
Released under MIT
House prices in Norway fell by *** percent and, according to the forecast, are expected to continue to fall until 2024. In 2023, properties were forecast to experience a decline in prices of ** percent. In 2025, growth is projected to recover, rising to **** percent.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
House price prediction dataset
This dataset comprises housing data for various metropolitan cities of India. It includes: - Collection of prices of new and resale houses - The amenities provided for each house
This housing dataset is useful for a range of stakeholders, including real estate agents, property developers, buyers, renters, and researchers interested in analyzing housing markets and trends in metropolitan cities across India. It can be used for market analysis, price prediction, property recommendations, and various other real estate-related tasks.
Shape of dataset : (6207, 40)
https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F11965067%2F75861c40e86a4d2d10c044be79542436%2FCapture.JPG?generation=1704918894425981&alt=media" alt="">
Github Link : https://github.com/TusharPaul01/House-Price-Prediction
For more such dataset & code check : https://www.kaggle.com/tusharpaul2001