63 datasets found
  1. Real Estate Price Prediction Data

    • figshare.com
    txt
    Updated Aug 8, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mohammad Shbool; Rand Al-Dmour; Bashar Al-Shboul; Nibal Albashabsheh; Najat Almasarwah (2024). Real Estate Price Prediction Data [Dataset]. http://doi.org/10.6084/m9.figshare.26517325.v1
    Explore at:
    txtAvailable download formats
    Dataset updated
    Aug 8, 2024
    Dataset provided by
    Figsharehttp://figshare.com/
    Authors
    Mohammad Shbool; Rand Al-Dmour; Bashar Al-Shboul; Nibal Albashabsheh; Najat Almasarwah
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Overview: This dataset was collected and curated to support research on predicting real estate prices using machine learning algorithms, specifically Support Vector Regression (SVR) and Gradient Boosting Machine (GBM). The dataset includes comprehensive information on residential properties, enabling the development and evaluation of predictive models for accurate and transparent real estate appraisals.Data Source: The data was sourced from Department of Lands and Survey real estate listings.Features: The dataset contains the following key attributes for each property:Area (in square meters): The total living area of the property.Floor Number: The floor on which the property is located.Location: Geographic coordinates or city/region where the property is situated.Type of Apartment: The classification of the property, such as studio, one-bedroom, two-bedroom, etc.Number of Bathrooms: The total number of bathrooms in the property.Number of Bedrooms: The total number of bedrooms in the property.Property Age (in years): The number of years since the property was constructed.Property Condition: A categorical variable indicating the condition of the property (e.g., new, good, fair, needs renovation).Proximity to Amenities: The distance to nearby amenities such as schools, hospitals, shopping centers, and public transportation.Market Price (target variable): The actual sale price or listed price of the property.Data Preprocessing:Normalization: Numeric features such as area and proximity to amenities were normalized to ensure consistency and improve model performance.Categorical Encoding: Categorical features like property condition and type of apartment were encoded using one-hot encoding or label encoding, depending on the specific model requirements.Missing Values: Missing data points were handled using appropriate imputation techniques or by excluding records with significant missing information.Usage: This dataset was utilized to train and test machine learning models, aiming to predict the market price of residential properties based on the provided attributes. The models developed using this dataset demonstrated improved accuracy and transparency over traditional appraisal methods.Dataset Availability: The dataset is available for public use under the [CC BY 4.0]. Users are encouraged to cite the related publication when using the data in their research or applications.Citation: If you use this dataset in your research, please cite the following publication:[Real Estate Decision-Making: Precision in Price Prediction through Advanced Machine Learning Algorithms].

  2. Real Estate Houses Price Prediction Dataset

    • kaggle.com
    Updated Nov 14, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Huda Imran (2023). Real Estate Houses Price Prediction Dataset [Dataset]. https://www.kaggle.com/datasets/hudairr/real-estate-houses-price-prediction-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 14, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Huda Imran
    Description

    Dataset

    This dataset was created by Huda Imran

    Contents

  3. Boston House Price Prediction Dataset

    • kaggle.com
    Updated Dec 28, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    SRIHAAS PIGILAM (2023). Boston House Price Prediction Dataset [Dataset]. https://www.kaggle.com/datasets/srihaaspigilam/boston-house-price-prediction-dataset/data
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 28, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    SRIHAAS PIGILAM
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Area covered
    Boston
    Description

    Title: Boston Housing Price Prediction Dataset

    Description:

    This dataset contains information about housing prices in Boston and is often used for regression analysis and predictive modeling. The dataset is based on the classic Boston Housing dataset, which is frequently used as a benchmark in machine learning.

    Attributes:

    1. CRIM (Per Capita Crime Rate): The per capita crime rate in the neighborhood.
    2. ZN (Proportion of Residential Land Zoned for Large Lots): The proportion of residential land zoned for lots over 25,000 sq. ft.
    3. INDUS (Proportion of Non-Retail Business Acres): The proportion of non-retail business acres per town.
    4. CHAS (Charles River Dummy Variable): A binary variable indicating whether the Charles River bounds the tract (1 if bounded, 0 otherwise).
    5. NOX (Nitric Oxides Concentration): Nitric oxides concentration (parts per 10 million).
    6. RM (Average Number of Rooms per Dwelling): The average number of rooms per dwelling.
    7. AGE (Proportion of Owner-Occupied Units Built Prior to 1940): The proportion of owner-occupied units built prior to 1940.
    8. DIS (Weighted Distances to Employment Centers): Weighted distances to five Boston employment centers.
    9. RAD (Index of Accessibility to Radial Highways): An index representing accessibility to radial highways.
    10. TAX (Full-Value Property Tax Rate per $10,000): The full-value property tax rate per $10,000.
    11. PTRATIO (Pupil-Teacher Ratio): The pupil-teacher ratio by town.
    12. B (1000(Bk - 0.63)^2 where Bk is the Proportion of Black Residents): A measure of the proportion of Black residents adjusted for an offset.
    13. LSTAT (Percentage of Lower Status of the Population): The percentage of lower-status residents in the population.
    14. MEDV (Median Value of Owner-Occupied Homes): The median value of owner-occupied homes in $1000s (Target Variable).

    Objective:

    Predict the median value of owner-occupied homes (MEDV) based on various features to gain insights into factors influencing housing prices.

    Usage:

    This dataset is suitable for regression tasks, machine learning practice, and understanding the dynamics of housing markets.

    Citation:

    The dataset is derived from the UCI Machine Learning Repository and can be cited as follows:

    Harrison Jr., D., & Rubinfeld, D. L. (1978). Hedonic prices and the demand for clean air. Journal of Environmental Economics and Management, 5(1), 81-102.

  4. Mumbai House Price Data (70k Entries)

    • kaggle.com
    Updated Oct 18, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    kevinnadar22 (2024). Mumbai House Price Data (70k Entries) [Dataset]. https://www.kaggle.com/datasets/kevinnadar22/mumbai-house-price-data-70k-entries
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 18, 2024
    Dataset provided by
    Kaggle
    Authors
    kevinnadar22
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Area covered
    Mumbai
    Description

    Mumbai House Price Dataset (70k+ Entries)

    Dataset Overview

    This dataset provides detailed information on housing prices in Mumbai, India. It includes over 70,000 entries and is ideal for analyzing various factors affecting real estate prices in the city. The dataset captures key aspects of residential properties such as price, area, property type, and more, enabling detailed insights into the real estate market trends.

    Note: This data is based on the year 2024

    Sources

    This dataset has been scraped from makaan.com using Python and Requests library

    Potential Use Cases

    • Real Estate Market Analysis: Understanding property price trends across different localities and neighborhoods in Mumbai.
    • Price Prediction Models: Building machine learning models to predict housing prices based on features like area, property type, and location.

    Data Quality

    All columns in this dataset are fully populated with non-null values

  5. A

    ‘Real estate price prediction’ analyzed by Analyst-2

    • analyst-2.ai
    Updated Nov 12, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ‘Real estate price prediction’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-real-estate-price-prediction-920d/3d6c92ac/?iid=005-356&v=presentation
    Explore at:
    Dataset updated
    Nov 12, 2021
    Dataset authored and provided by
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Analysis of ‘Real estate price prediction’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/quantbruce/real-estate-price-prediction on 12 November 2021.

    --- No further description of dataset provided by original source ---

    --- Original source retains full ownership of the source dataset ---

  6. Real Estate Housing Price Prediction

    • kaggle.com
    Updated Nov 10, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Palpha 01 (2024). Real Estate Housing Price Prediction [Dataset]. https://www.kaggle.com/datasets/palpha01/real-estate-housing-price-prediction/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 10, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Palpha 01
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Palpha 01

    Released under Apache 2.0

    Contents

  7. Real Estate Price Prediction

    • kaggle.com
    Updated Aug 4, 2021
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Natalia Lapteva (2021). Real Estate Price Prediction [Dataset]. https://www.kaggle.com/datasets/natalialapteva/real-estate-price-prediction/versions/2
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 4, 2021
    Dataset provided by
    Kaggle
    Authors
    Natalia Lapteva
    Description

    Dataset

    This dataset was created by Natalia Lapteva

    Contents

  8. House Price Prediction Dataset

    • kaggle.com
    Updated Jan 25, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jackson Divakar R (2024). House Price Prediction Dataset [Dataset]. https://www.kaggle.com/datasets/jacksondivakarr/house-price-prediction-dataset/versions/1
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 25, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Jackson Divakar R
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    "Charting the Realms of Real Estate: A Holistic and Expansive Dataset Curated for In-Depth House Price Prediction Analysis, Market Trends Evaluation, and Strategic Decision-Making in the Dynamic Landscape of Property Valuation and Investment"

  9. Synthetic House Price Prediction Datasets

    • kaggle.com
    Updated Jul 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    D.Madhan raj (2025). Synthetic House Price Prediction Datasets [Dataset]. http://doi.org/10.34740/kaggle/dsv/12582291
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 26, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    D.Madhan raj
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    The Synthetic House Price Prediction Datasets is a publicly available Kaggle dataset created by D.Madhan Raj for machine learning experiments. It features a single CSV file containing synthetic data on house attributes such as bedrooms, bathrooms, square footage, house age, location rating, and estimated prices in USD. Designed for regression tasks, the dataset allows users to practice predictive modeling without the constraints of real-world data privacy. It's licensed under Apache 2.0 and includes around 3,203 data rows, making it a handy resource for learning, prototyping, and fine-tuning models learning

  10. A

    ‘Paris Housing Price Prediction’ analyzed by Analyst-2

    • analyst-2.ai
    Updated Sep 30, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ‘Paris Housing Price Prediction’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-paris-housing-price-prediction-4dae/0e686e66/?iid=035-738&v=presentation
    Explore at:
    Dataset updated
    Sep 30, 2021
    Dataset authored and provided by
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Analysis of ‘Paris Housing Price Prediction’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/mssmartypants/paris-housing-price-prediction on 30 September 2021.

    --- Dataset description provided by original source is as follows ---

    Context

    This is a set of data created from imaginary data of house prices in an urban environment - Paris. I recommend using this dataset for educational purposes, for practice and to acquire the necessary knowledge. What I'm trying to do next is to create a classification dataset with same data from this dataset, I'll add a new column for class attribute ofc. Here is a classification dataset ---> classification dataset <---

    Content

    What's inside is more than just rows and columns. You can see house details listed as column names.

    Description

    All attributes are numeric variables and they are listed bellow:

    • squareMeters
    • numberOfRooms
    • hasYard
    • hasPool
    • floors - number of floors
    • cityCode - zip code
    • cityPartRange - the higher the range, the more exclusive the neighbourhood is
    • numPrevOwners - number of prevoious owners
    • made - year
    • isNewBuilt
    • hasStormProtector
    • basement - basement square meters
    • attic - attic square meteres
    • garage - garage size
    • hasStorageRoom
    • hasGuestRoom - number of guest rooms
    • price - predicted value

    Inspiration

    Idea was to create dataset that is good for regression and that gives adequate results.

    --- Original source retains full ownership of the source dataset ---

  11. A

    ‘Delhi House Price Prediction’ analyzed by Analyst-2

    • analyst-2.ai
    Updated Sep 30, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com) (2021). ‘Delhi House Price Prediction’ analyzed by Analyst-2 [Dataset]. https://analyst-2.ai/analysis/kaggle-delhi-house-price-prediction-0274/898e21e9/?iid=018-396&v=presentation
    Explore at:
    Dataset updated
    Sep 30, 2021
    Dataset authored and provided by
    Analyst-2 (analyst-2.ai) / Inspirient GmbH (inspirient.com)
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Area covered
    Delhi
    Description

    Analysis of ‘Delhi House Price Prediction’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/neelkamal692/delhi-house-price-prediction on 27 August 2021.

    --- Dataset description provided by original source is as follows ---

    Content

    This is not a comprehensive list, some of the attributes i left intentionally and some just couldn't extract. Dataset consists of 12 columns and 1259 rows. 6 of the features are numerical valued and rest are categorical. code for extracting Data is available at my Github account.

    Acknowledgements

    The Data has been extracted from MagicBricks (a website, provides common platform to property buyer and seller ).

    Inspiration

    I have done property price prediction on Boston Dataset, so i was wondering, if i can do it for Delhi properties too.

    --- Original source retains full ownership of the source dataset ---

  12. Data from: Housing Price Prediction

    • kaggle.com
    Updated Nov 27, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Probin Kumar Sah (2020). Housing Price Prediction [Dataset]. https://www.kaggle.com/datasets/probinkumarsah/housing-price-prediction/suggestions
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 27, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Probin Kumar Sah
    Description

    Dataset

    This dataset was created by Probin Kumar Sah

    Contents

  13. USA Housing Dataset

    • kaggle.com
    Updated Feb 5, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    ArnavGupta (2025). USA Housing Dataset [Dataset]. https://www.kaggle.com/datasets/arnavgupta1205/usa-housing-dataset/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Feb 5, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    ArnavGupta
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Area covered
    United States
    Description

    This USA Housing Market Dataset (Synthetic) contains 300 rows and 10 columns of real estate-related data designed for housing price prediction, trend analysis, and investment insights. It includes key property details such as price, number of bedrooms and bathrooms, square footage, year built, garage spaces, lot size, zip code, crime rate, and school ratings.

    This dataset is ideal for: ✅ Machine Learning Models for predicting housing prices ✅ Market Research & Investment Analysis ✅ Exploring Property Trends in the USA ✅ Educational Purposes for Data Science and Analytics

    This dataset provides a realistic yet synthetic view of the real estate market, making it useful for data-driven decision-making in the housing industry.

    Let me know if you need any modifications!

  14. Houses price prediction

    • kaggle.com
    Updated Dec 16, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Suliman Almasrey (2020). Houses price prediction [Dataset]. https://www.kaggle.com/sulimanalmasrey/houses-price-prediction/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 16, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Suliman Almasrey
    Description

    Dataset

    This dataset was created by Suliman Almasrey

    Contents

  15. RealEstateData Housing Price Prediction

    • kaggle.com
    Updated Nov 7, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pranav Tonge (2021). RealEstateData Housing Price Prediction [Dataset]. https://www.kaggle.com/pranavtonge/realestatedata-housing-price-prediction/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 7, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Pranav Tonge
    Description

    Dataset

    This dataset was created by Pranav Tonge

    Contents

  16. Houses Price Prediction

    • kaggle.com
    Updated Nov 12, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Merna Assaad (2023). Houses Price Prediction [Dataset]. https://www.kaggle.com/mernaassaad/houses-price-prediction/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 12, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Merna Assaad
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by Merna Assaad

    Released under CC0: Public Domain

    Contents

  17. Housing Prices Prediction

    • kaggle.com
    Updated Jun 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kat Hernandez (2020). Housing Prices Prediction [Dataset]. https://www.kaggle.com/katmaryher/housing-prices-prediction/metadata
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 24, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Kat Hernandez
    Description

    Dataset

    This dataset was created by Kat Hernandez

    Contents

  18. Boston

    • kaggle.com
    zip
    Updated Jul 3, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Arpit Kumar (2020). Boston [Dataset]. https://www.kaggle.com/arpikr/boston
    Explore at:
    zip(14416 bytes)Available download formats
    Dataset updated
    Jul 3, 2020
    Authors
    Arpit Kumar
    Area covered
    Boston
    Description

    Dataset

    This dataset was created by Arpit Kumar

    Contents

    It contains the following files:

  19. Housing Prices Prediction

    • kaggle.com
    Updated Jul 9, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Fahad QureXhi (2024). Housing Prices Prediction [Dataset]. https://www.kaggle.com/fahadqurexhi/housing-prices-prediction/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 9, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Fahad QureXhi
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Fahad QureXhi

    Released under Apache 2.0

    Contents

  20. House Price Predication

    • kaggle.com
    Updated May 7, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sheema Zain (2024). House Price Predication [Dataset]. https://www.kaggle.com/datasets/sheemazain/house-price-predication
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 7, 2024
    Dataset provided by
    Kaggle
    Authors
    Sheema Zain
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    House price prediction Predicting house prices is a common task in data science and machine learning. Here's a high-level overview of how you might approach it:

    Data Collection: Gather a dataset containing features of houses (e.g., size, number of bedrooms, location, amenities) and their corresponding prices. Websites like Zillow, Kaggle, or government housing datasets are good sources.

    Data Preprocessing: Clean the data by handling missing values, encoding categorical variables, and scaling numerical features if necessary. This step ensures that the data is in a suitable format for training a model. Feature Selection/Engineering: Choose relevant features that are likely to influence house prices. You may also create new features based on domain knowledge or data analysis.

    Model Selection: Select a regression model suitable for predicting continuous target variables like house prices. Common choices include Linear Regression, Decision Trees, Random Forests, Gradient Boosting, and Neural Networks.

    Model Training: Split your dataset into training and testing sets to train and evaluate the performance of your model. You can further split the training set for validation purposes or use cross-validation techniques.

    Model Evaluation: Assess the performance of your model using appropriate evaluation metrics such as Mean Absolute Error (MAE), Mean Squared Error (MSE), or Root Mean Squared Error (RMSE).

    Hyperparameter Tuning: Fine-tune your model's hyperparameters to improve its performance. Techniques like grid search or random search can be employed for this purpose.

    Deployment: Once satisfied with your model's performance, deploy it to make predictions on new data. This could be as simple as saving the trained model and creating an interface for users to input house features.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Mohammad Shbool; Rand Al-Dmour; Bashar Al-Shboul; Nibal Albashabsheh; Najat Almasarwah (2024). Real Estate Price Prediction Data [Dataset]. http://doi.org/10.6084/m9.figshare.26517325.v1
Organization logo

Real Estate Price Prediction Data

Explore at:
9 scholarly articles cite this dataset (View in Google Scholar)
txtAvailable download formats
Dataset updated
Aug 8, 2024
Dataset provided by
Figsharehttp://figshare.com/
Authors
Mohammad Shbool; Rand Al-Dmour; Bashar Al-Shboul; Nibal Albashabsheh; Najat Almasarwah
License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

Overview: This dataset was collected and curated to support research on predicting real estate prices using machine learning algorithms, specifically Support Vector Regression (SVR) and Gradient Boosting Machine (GBM). The dataset includes comprehensive information on residential properties, enabling the development and evaluation of predictive models for accurate and transparent real estate appraisals.Data Source: The data was sourced from Department of Lands and Survey real estate listings.Features: The dataset contains the following key attributes for each property:Area (in square meters): The total living area of the property.Floor Number: The floor on which the property is located.Location: Geographic coordinates or city/region where the property is situated.Type of Apartment: The classification of the property, such as studio, one-bedroom, two-bedroom, etc.Number of Bathrooms: The total number of bathrooms in the property.Number of Bedrooms: The total number of bedrooms in the property.Property Age (in years): The number of years since the property was constructed.Property Condition: A categorical variable indicating the condition of the property (e.g., new, good, fair, needs renovation).Proximity to Amenities: The distance to nearby amenities such as schools, hospitals, shopping centers, and public transportation.Market Price (target variable): The actual sale price or listed price of the property.Data Preprocessing:Normalization: Numeric features such as area and proximity to amenities were normalized to ensure consistency and improve model performance.Categorical Encoding: Categorical features like property condition and type of apartment were encoded using one-hot encoding or label encoding, depending on the specific model requirements.Missing Values: Missing data points were handled using appropriate imputation techniques or by excluding records with significant missing information.Usage: This dataset was utilized to train and test machine learning models, aiming to predict the market price of residential properties based on the provided attributes. The models developed using this dataset demonstrated improved accuracy and transparency over traditional appraisal methods.Dataset Availability: The dataset is available for public use under the [CC BY 4.0]. Users are encouraged to cite the related publication when using the data in their research or applications.Citation: If you use this dataset in your research, please cite the following publication:[Real Estate Decision-Making: Precision in Price Prediction through Advanced Machine Learning Algorithms].

Search
Clear search
Close search
Google apps
Main menu