Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
https://raw.githubusercontent.com/Masterx-AI/Project_Housing_Price_Prediction_/main/hs.jpg" alt="">
A simple yet challenging project, to predict the housing price based on certain factors like house area, bedrooms, furnished, nearness to mainroad, etc. The dataset is small yet, it's complexity arises due to the fact that it has strong multicollinearity. Can you overcome these obstacles & build a decent predictive model?
Harrison, D. and Rubinfeld, D.L. (1978) Hedonic prices and the demand for clean air. J. Environ. Economics and Management 5, 81–102. Belsley D.A., Kuh, E. and Welsch, R.E. (1980) Regression Diagnostics. Identifying Influential Data and Sources of Collinearity. New York: Wiley.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
So this data set is collected for completing a college project ,which is an android app for calculating the price of houses.
This data is scraped from magic bricks website between june 2021 and july 2021 .
magicbricks.com
With the help of the data available one can make a regression model to predict house prices.
Facebook
Twitterttd22/house-price dataset hosted on Hugging Face and contributed by the HF Datasets community
Facebook
TwitterThe UK House Price Index is a National Statistic.
Download the full UK House Price Index data below, or use our tool to https://landregistry.data.gov.uk/app/ukhpi?utm_medium=GOV.UK&utm_source=datadownload&utm_campaign=tool&utm_term=9.30_23_03_22" class="govuk-link">create your own bespoke reports.
Datasets are available as CSV files. Find out about republishing and making use of the data.
Google Chrome is blocking downloads of our UK HPI data files (Chrome 88 onwards). Please use another internet browser while we resolve this issue. We apologise for any inconvenience caused.
This file includes a derived back series for the new UK HPI. Under the UK HPI, data is available from 1995 for England and Wales, 2004 for Scotland and 2005 for Northern Ireland. A longer back series has been derived by using the historic path of the Office for National Statistics HPI to construct a series back to 1968.
Download the full UK HPI background file:
If you are interested in a specific attribute, we have separated them into these CSV files:
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/Average-prices-2022-01.csv?utm_medium=GOV.UK&utm_source=datadownload&utm_campaign=average_price&utm_term=9.30_23_03_22" class="govuk-link">Average price (CSV, 9.3MB)
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/Average-prices-Property-Type-2022-01.csv?utm_medium=GOV.UK&utm_source=datadownload&utm_campaign=average_price_property_price&utm_term=9.30_23_03_22" class="govuk-link">Average price by property type (CSV, 28.2MB)
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/Sales-2022-01.csv?utm_medium=GOV.UK&utm_source=datadownload&utm_campaign=sales&utm_term=9.30_23_03_22" class="govuk-link">Sales (CSV, 4.7MB)
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/Cash-mortgage-sales-2022-01.csv?utm_medium=GOV.UK&utm_source=datadownload&utm_campaign=cash_mortgage-sales&utm_term=9.30_23_03_22" class="govuk-link">Cash mortgage sales (CSV, 6.4MB)
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/First-Time-Buyer-Former-Owner-Occupied-2022-01.csv?utm_medium=GOV.UK&utm_source=datadownload&utm_campaign=FTNFOO&utm_term=9.30_23_03_22" class="govuk-link">First time buyer and former owner occupier (CSV, 6.1MB)
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/New-and-Old-2022-01.csv?utm_medium=GOV.UK&utm_source=datadownload&utm_campaign=new_build&utm_term=9.30_23_03_22" class="govuk-link">New build and existing resold property (CSV, 17.1MB)
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/Indices-2022-01.csv?utm_medium=GOV.UK&utm_source=datadownload&utm_campaign=index&utm_term=9.30_23_03_22" class="govuk-link">Index (CSV, 5.9MB)
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/Indices-seasonally-adjusted-2022-01.csv?utm_medium=GOV.UK&utm_source=datadownload&utm_campaign=index_season_adjusted&utm_term=9.30_23_03_22" class="govuk-link">Index seasonally adjusted (CSV, 196KB)
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/Average-price-seasonally-adjusted-2022-01.csv?utm_medium=GOV.UK&utm_source=datadownload&utm_campaign=average-price_season_adjusted&utm_term=9.30_23_03_22" class="govuk-link">Average price seasonally a
Facebook
TwitterAttribution-NonCommercial-ShareAlike 3.0 (CC BY-NC-SA 3.0)https://creativecommons.org/licenses/by-nc-sa/3.0/
License information was derived automatically
This dataset contains various features of residential properties along with their corresponding prices. It is suitable for exploring and analyzing factors influencing housing prices and for building predictive models to estimate the price of a property based on its attributes.
| Feature | Description |
|---|---|
| price | The price of the property. |
| area | The total area of the property in square feet. |
| bedrooms | The number of bedrooms in the property. |
| bathrooms | The number of bathrooms in the property. |
| stories | The number of stories (floors) in the property. |
| mainroad | Indicates whether the property is located on a main road (binary: yes/no). |
| guestroom | Indicates whether the property has a guest room (binary: yes/no). |
| basement | Indicates whether the property has a basement (binary: yes/no). |
| hotwaterheating | Indicates whether the property has hot water heating (binary: yes/no). |
| airconditioning | Indicates whether the property has air conditioning (binary: yes/no). |
| parking | The number of parking spaces available with the property. |
| prefarea | Indicates whether the property is in a preferred area (binary: yes/no). |
| furnishingstatus | The furnishing status of the property (e.g., furnished, semi-furnished, unfurnished). |
License: This dataset is made available under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Facebook
TwitterOpen Government Licence 2.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/2/
License information was derived automatically
Average House Price
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
The dataset contains 2000 rows of house-related data, representing various features that could influence house prices. Below, we discuss key aspects of the dataset, which include its structure, the choice of features, and potential use cases for analysis.
The dataset is designed to capture essential attributes for predicting house prices, including:
Area: Square footage of the house, which is generally one of the most important predictors of price. Bedrooms & Bathrooms: The number of rooms in a house significantly affects its value. Homes with more rooms tend to be priced higher. Floors: The number of floors in a house could indicate a larger, more luxurious home, potentially raising its price. Year Built: The age of the house can affect its condition and value. Newly built houses are generally more expensive than older ones. Location: Houses in desirable locations such as downtown or urban areas tend to be priced higher than those in suburban or rural areas. Condition: The current condition of the house is critical, as well-maintained houses (in 'Excellent' or 'Good' condition) will attract higher prices compared to houses in 'Fair' or 'Poor' condition. Garage: Availability of a garage can increase the price due to added convenience and space. Price: The target variable, representing the sale price of the house, used to train machine learning models to predict house prices based on the other features.
Area Distribution: The area of the houses in the dataset ranges from 500 to 5000 square feet, which allows analysis across different types of homes, from smaller apartments to larger luxury houses. Bedrooms and Bathrooms: The number of bedrooms varies from 1 to 5, and bathrooms from 1 to 4. This variance enables analysis of homes with different sizes and layouts. Floors: Houses in the dataset have between 1 and 3 floors. This feature could be useful for identifying the influence of multi-level homes on house prices. Year Built: The dataset contains houses built from 1900 to 2023, giving a wide range of house ages to analyze the effects of new vs. older construction. Location: There is a mix of urban, suburban, downtown, and rural locations. Urban and downtown homes may command higher prices due to proximity to amenities. Condition: Houses are labeled as 'Excellent', 'Good', 'Fair', or 'Poor'. This feature helps model the price differences based on the current state of the house. Price Distribution: Prices range between $50,000 and $1,000,000, offering a broad spectrum of property values. This range makes the dataset appropriate for predicting a wide variety of housing prices, from affordable homes to luxury properties.
3. Correlation Between Features
A key area of interest is the relationship between various features and house price: Area and Price: Typically, a strong positive correlation is expected between the size of the house (Area) and its price. Larger homes are likely to be more expensive. Location and Price: Location is another major factor. Houses in urban or downtown areas may show a higher price on average compared to suburban and rural locations. Condition and Price: The condition of the house should show a positive correlation with price. Houses in better condition should be priced higher, as they require less maintenance and repair. Year Built and Price: Newer houses might command a higher price due to better construction standards, modern amenities, and less wear-and-tear, but some older homes in good condition may retain historical value. Garage and Price: A house with a garage may be more expensive than one without, as it provides extra storage or parking space.
The dataset is well-suited for various machine learning and data analysis applications, including:
House Price Prediction: Using regression techniques, this dataset can be used to build a model to predict house prices based on the available features. Feature Importance Analysis: By using techniques such as feature importance ranking, data scientists can determine which features (e.g., location, area, or condition) have the greatest impact on house prices. Clustering: Clustering techniques like k-means could help identify patterns in the data, such as grouping houses into segments based on their characteristics (e.g., luxury homes, affordable homes). Market Segmentation: The dataset can be used to perform segmentation by location, price range, or house type to analyze trends in specific sub-markets, like luxury vs. affordable housing. Time-Based Analysis: By studying how house prices vary with the year built or the age of the house, analysts can derive insights into the trends of older vs. newer homes.
Facebook
TwitterHouse Price prediction Mini Dataset For Begging notebooks
Facebook
Twitterhttps://crawlfeeds.com/privacy_policyhttps://crawlfeeds.com/privacy_policy
Explore the Redfin USA Properties Dataset, available in CSV format. This extensive dataset provides valuable insights into the U.S. real estate market, including detailed property listings, prices, property types, and more across various states and cities. Perfect for those looking to conduct in-depth market analysis, real estate investment research, or financial forecasting.
Key Features:
Who Can Benefit From This Dataset:
Download the Redfin USA Properties Dataset to access essential information on the U.S. housing market, ideal for professionals in real estate, finance, and data analytics. Unlock key insights to make informed decisions in a dynamic market environment.
Looking for deeper insights or a custom data pull from Redfin?
Send a request with just one click and explore detailed property listings, price trends, and housing data.
đź”— Request Redfin Real Estate Data
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Detailed Real Estate Data for Predicting House Prices and Analyzing Market Trends
This dataset contains information on 21,613 properties, making it a comprehensive resource for exploring real estate market trends and building predictive models for house prices. The data includes various features capturing property details, location, and market conditions, providing ample opportunities for data exploration, visualization, and machine learning applications.
General Information:
id: Unique identifier for each property. date: Date of sale. Price Details:
price: Sale price of the house. Property Features:
bedrooms: Number of bedrooms. bathrooms: Number of bathrooms (including partials as fractions). sqft_living: Living space area in square feet. sqft_lot: Lot size in square feet. floors: Number of floors. waterfront: Whether the property has a waterfront view. view: Quality of the view rating. condition: Overall condition of the house. grade: Grade of construction and design (scale of 1–13). Additional Metrics:
sqft_above: Square footage of the property above ground. sqft_basement: Basement area in square feet. yr_built: Year the property was built. yr_renovated: Year of last renovation. Location Coordinates:
zipcode: ZIP code of the property. lat and long: Latitude and longitude coordinates. Neighbor Comparisons:
sqft_living15: Average living space of 15 nearest properties. sqft_lot15: Average lot size of 15 nearest properties. This dataset is a valuable resource for anyone interested in real estate analytics, machine learning, or geographic data visualization.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This research data file contains the necessary software and the dataset for estimating the missing prices of house units. This approach combines several machine learning techniques (linear regression, support vector regression, the k-nearest neighbors and a multi-layer perceptron neural network) with several dimensionality reduction techniques (non-negative factorization, recursive feature elimination and feature selection with a variance threshold). It includes the input dataset formed with the available house prices in two neighborhoods of Teruel city (Spain) in November 13, 2017 from Idealista website. These two neighborhoods are the center of the city and “Ensanche”.
This dataset supports the research of the authors in the improvement of the setup of agent-based simulations about real-estate market. The work about this dataset has been submitted for consideration for publication to a scientific journal.
The open source python code is composed of all the files with the “.py” extension. The main program can be executed from the “main.py” file. The “boxplotErrors.eps” is a chart generated from the execution of the code, and compares the results of the different combinations of machine learning techniques and dimensionality reduction methods.
The dataset is in the “data” folder. The input raw data of the house prices are in the “dataRaw.csv” file. These were shuffled into the “dataShuffled.csv” file. We used cross-validation to obtain the estimations of house prices. The outputted estimations alongside the real values are stored in different files of the “data” folder, in which each filename is composed by the machine learning technique abbreviation and the dimensionality reduction method abbreviation.
Facebook
Twitterhttps://crawlfeeds.com/privacy_policyhttps://crawlfeeds.com/privacy_policy
The Housing Data Extracted from Homes.com (USA) dataset is a comprehensive collection of 2 million real estate listings sourced from Homes.com, one of the leading real estate platforms in the United States. This dataset offers detailed insights into the U.S. housing market, making it an invaluable resource for real estate professionals, investors, researchers, and analysts.
The dataset contains extensive property details, including location, price, property type (single-family homes, condos, apartments), number of bedrooms and bathrooms, square footage, lot size, year built, and availability status. Organized in CSV format, it provides users with easy access to structured data for analyzing trends, developing investment strategies, or building real estate applications.
Key Features:
Facebook
TwitterODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
UK house prices since 1953 as monthly time-series. Data comes from the Nationwide.
Data can be found in the data/data.csv file. See datapackage.json for source info.
Source: http://www.nationwide....
Facebook
TwitterThis dataset contains prices of New York houses, providing valuable insights into the real estate market in the region. It includes information such as broker titles, house types, prices, number of bedrooms and bathrooms, property square footage, addresses, state, administrative and local areas, street names, and geographical coordinates.
- BROKERTITLE: Title of the broker
- TYPE: Type of the house
- PRICE: Price of the house
- BEDS: Number of bedrooms
- BATH: Number of bathrooms
- PROPERTYSQFT: Square footage of the property
- ADDRESS: Full address of the house
- STATE: State of the house
- MAIN_ADDRESS: Main address information
- ADMINISTRATIVE_AREA_LEVEL_2: Administrative area level 2 information
- LOCALITY: Locality information
- SUBLOCALITY: Sublocality information
- STREET_NAME: Street name
- LONG_NAME: Long name
- FORMATTED_ADDRESS: Formatted address
- LATITUDE: Latitude coordinate of the house
- LONGITUDE: Longitude coordinate of the house
- Price analysis: Analyze the distribution of house prices to understand market trends and identify potential investment opportunities.
- Property size analysis: Explore the relationship between property square footage and prices to assess the value of different-sized houses.
- Location-based analysis: Investigate geographical patterns to identify areas with higher or lower property prices.
- Bedroom and bathroom trends: Analyze the impact of the number of bedrooms and bathrooms on house prices.
- Broker performance analysis: Evaluate the influence of different brokers on the pricing of houses.
If you find this dataset useful, your support through an upvote would be greatly appreciated ❤️🙂 Thank you
Facebook
TwitterThis dataset is designed for beginners to practice regression problems, particularly in the context of predicting house prices. It contains 1000 rows, with each row representing a house and various attributes that influence its price. The dataset is well-suited for learning basic to intermediate-level regression modeling techniques.
Beginner Regression Projects: This dataset can be used to practice building regression models such as Linear Regression, Decision Trees, or Random Forests. The target variable (house price) is continuous, making this an ideal problem for supervised learning techniques.
Feature Engineering Practice: Learners can create new features by combining existing ones, such as the price per square foot or age of the house, providing an opportunity to experiment with feature transformations.
Exploratory Data Analysis (EDA): You can explore how different features (e.g., square footage, number of bedrooms) correlate with the target variable, making it a great dataset for learning about data visualization and summary statistics.
Model Evaluation: The dataset allows for various model evaluation techniques such as cross-validation, R-squared, and Mean Absolute Error (MAE). These metrics can be used to compare the effectiveness of different models.
The dataset is highly versatile for a range of machine learning tasks. You can apply simple linear models to predict house prices based on one or two features, or use more complex models like Random Forest or Gradient Boosting Machines to understand interactions between variables.
It can also be used for dimensionality reduction techniques like PCA or to practice handling categorical variables (e.g., neighborhood quality) through encoding techniques like one-hot encoding.
This dataset is ideal for anyone wanting to gain practical experience in building regression models while working with real-world features.
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset contains 2,000 entries of house price data from all states in Malaysia, providing a comprehensive overview of the country’s real estate market for 2025. Sourced from Brickz, a trusted platform for property transaction insights, it includes detailed information such as property location, tenure, type, median prices, and transaction counts. This dataset is ideal for real estate market analysis, predictive modeling, and exploring trends across Malaysia’s diverse property market.
https://encrypted-tbn1.gstatic.com/licensed-image?q=tbn:ANd9GcR8ttDRWTx7dIxuUegBTsggS4a6tQrnNA6DEW_HJu2DphQNsverV0PYsSkdbSdqm4qRaRuBOh4Txbv11yXMxIKWqh-_WAkeTuQI8Diu-Q" alt="Kuala Lumpur, Malaysia">
Facebook
TwitterThis dataset was created by 173050055
Released under Other (specified in description)
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
A dataset comprising various variables around housing and demographics for the top 50 American cities by population.
Variables:
Zip Code: Zip code within which the listing is present.
Price: Listed price for the property.
Beds: Number of beds mentioned in the listing.
Baths: Number of baths mentioned in the listing.
Living Space: The total size of the living space, in square feet, mentioned in the listing.
Address: Street address of the listing.
City: City name where the listing is located.
State: State name where the listing is located.
Zip Code Population: The estimated number of individuals within the zip code. Data from Simplemaps.com.
Zip Code Density: The estimated number of individuals per square mile within the zip code. Data from Simplemaps.com.
County: County where the listing is located.
Median Household income: Estimated median household income. Data from the U.S. Census Bureau.
Latitude: Latitude of the zip code. ** Data from Simplemaps.com.**
Longitude: Longitude of the zip code. Data from Simplemaps.com.
Facebook
TwitterDatasets are available as CSV files. Find out about republishing and making use of the data.
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/UK-HPI-full-file-2016-08.csv" class="govuk-link">UK HPI full file (CSV, 42.5MB)
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/Average-prices-2016-08.csv" class="govuk-link">Average price.csv
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/Average-prices-Property-Type-2016-08.csv" class="govuk-link">Average price by property type.csv
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/Sales-2016-08.csv" class="govuk-link">Sales.csv
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/Cash-mortgage-sales-2016-08.csv" class="govuk-link">Cash mortgage sales.csv
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/First-Time-Buyer-Former-Owner-Occupied-2016-08.csv" class="govuk-link">First time buyer and former owner occupied.csv
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/New-and-Old-2016-08.csv" class="govuk-link">New build and existing resold property.csv
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/Indices-2016-08.csv" class="govuk-link">Index.csv
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/Indices-seasonally-adjusted-2016-08.csv" class="govuk-link">Index seasonally adjusted.csv
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/Average-price-seasonally-adjusted-2016-08.csv" class="govuk-link">Average Price seasonally adjusted.csv
http://publicdata.landregistry.gov.uk/market-trend-data/house-price-index-data/Repossession-2016-08.csv" class="govuk-link">Repossessions.csv
This file includes a derived back series for the new UK HPI. Under the UK HPI, data is available from 1995 for England and Wales, 2004 for Scotland and 2005 for Northern Ireland. A longer back series has been derived by using the historic path of the ONS HPI to construct a series back to 1968:
The release calendar shows when the next month’s data will be published.
Create your own reports based on the UK House Price Index data, http://landregistry.data.gov.uk/app/ukhpi" class="govuk-link">use our tool.
Facebook
TwitterOur Price Paid Data includes information on all property sales in England and Wales that are sold for value and are lodged with us for registration.
Get up to date with the permitted use of our Price Paid Data:
check what to consider when using or publishing our Price Paid Data
If you use or publish our Price Paid Data, you must add the following attribution statement:
Contains HM Land Registry data © Crown copyright and database right 2021. This data is licensed under the Open Government Licence v3.0.
Price Paid Data is released under the http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/">Open Government Licence (OGL). You need to make sure you understand the terms of the OGL before using the data.
Under the OGL, HM Land Registry permits you to use the Price Paid Data for commercial or non-commercial purposes. However, OGL does not cover the use of third party rights, which we are not authorised to license.
Price Paid Data contains address data processed against Ordnance Survey’s AddressBase Premium product, which incorporates Royal Mail’s PAF® database (Address Data). Royal Mail and Ordnance Survey permit your use of Address Data in the Price Paid Data:
If you want to use the Address Data in any other way, you must contact Royal Mail. Email address.management@royalmail.com.
The following fields comprise the address data included in Price Paid Data:
The October 2025 release includes:
As we will be adding to the October data in future releases, we would not recommend using it in isolation as an indication of market or HM Land Registry activity. When the full dataset is viewed alongside the data we’ve previously published, it adds to the overall picture of market activity.
Your use of Price Paid Data is governed by conditions and by downloading the data you are agreeing to those conditions.
Google Chrome (Chrome 88 onwards) is blocking downloads of our Price Paid Data. Please use another internet browser while we resolve this issue. We apologise for any inconvenience caused.
We update the data on the 20th working day of each month. You can download the:
These include standard and additional price paid data transactions received at HM Land Registry from 1 January 1995 to the most current monthly data.
Your use of Price Paid Data is governed by conditions and by downloading the data you are agreeing to those conditions.
The data is updated monthly and the average size of this file is 3.7 GB, you can download:
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
https://raw.githubusercontent.com/Masterx-AI/Project_Housing_Price_Prediction_/main/hs.jpg" alt="">
A simple yet challenging project, to predict the housing price based on certain factors like house area, bedrooms, furnished, nearness to mainroad, etc. The dataset is small yet, it's complexity arises due to the fact that it has strong multicollinearity. Can you overcome these obstacles & build a decent predictive model?
Harrison, D. and Rubinfeld, D.L. (1978) Hedonic prices and the demand for clean air. J. Environ. Economics and Management 5, 81–102. Belsley D.A., Kuh, E. and Welsch, R.E. (1980) Regression Diagnostics. Identifying Influential Data and Sources of Collinearity. New York: Wiley.