100+ datasets found
  1. Housing Prices Dataset

    • kaggle.com
    zip
    Updated Jan 12, 2022
    Cite
    M Yasser H (2022). Housing Prices Dataset [Dataset]. https://www.kaggle.com/datasets/yasserh/housing-prices-dataset
    Available download formats: zip (4740 bytes)
    Dataset updated
    Jan 12, 2022
    Authors
    M Yasser H
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Description

    A simple yet challenging project: predict the housing price from factors such as house area, number of bedrooms, furnishing status, and nearness to a main road. The dataset is small, yet its complexity arises from strong multicollinearity. Can you overcome these obstacles and build a decent predictive model?
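
    One common way to quantify the multicollinearity mentioned above is the variance inflation factor (VIF). The sketch below is a minimal illustration for the special case of exactly two predictors, where the VIF reduces to 1 / (1 - r²) with r the Pearson correlation between them. The function names and the sample values are hypothetical and are not taken from the dataset.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def vif_two_predictors(x1, x2):
    """VIF of either predictor in a model with exactly two predictors.

    Regressing one predictor on the other gives R^2 = r^2,
    so VIF = 1 / (1 - r^2). A value of 1 means no collinearity.
    """
    r = pearson_r(x1, x2)
    return 1.0 / (1.0 - r * r)


# Made-up illustrative values, NOT taken from the dataset.
area = [1000, 1500, 2000, 2500, 3000]
bedrooms = [1, 2, 2, 3, 4]
print(round(vif_two_predictors(area, bedrooms), 2))
```

    With more than two predictors, each feature would instead be regressed on all the others; this sketch does not cover that case.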

    Acknowledgement:

    Harrison, D. and Rubinfeld, D.L. (1978) Hedonic prices and the demand for clean air. J. Environ. Economics and Management 5, 81–102. Belsley D.A., Kuh, E. and Welsch, R.E. (1980) Regression Diagnostics. Identifying Influential Data and Sources of Collinearity. New York: Wiley.

    Objective:

    • Understand the dataset and clean it up (if required).
    • Build regression models to predict the sale price from a single feature and from multiple features.
    • Evaluate the models and compare their respective scores, such as R² and RMSE.
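
    The two scores named in the objective can be computed directly from their standard definitions. The sketch below is a minimal pure-Python version; the function names and sample numbers are illustrative assumptions, not part of the dataset or its notebooks.

```python
import math


def rmse(y_true, y_pred):
    """Root mean squared error between observed and predicted values."""
    squared_errors = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Made-up prices and predictions, for illustration only.
observed = [100.0, 150.0, 200.0]
predicted = [110.0, 140.0, 205.0]
print(rmse(observed, predicted), r2_score(observed, predicted))
```

    RMSE is expressed in the units of the target, while R² is unitless, so the two are usually reported together when comparing models.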
  2. House Price Prediction Dataset

    • kaggle.com
    zip
    Updated Sep 21, 2024
    Cite
    Zafar (2024). House Price Prediction Dataset [Dataset]. https://www.kaggle.com/datasets/zafarali27/house-price-prediction-dataset
    Available download formats: zip (29372 bytes)
    Dataset updated
    Sep 21, 2024
    Authors
    Zafar
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Description

    House Price Prediction Dataset.

    The dataset contains 2000 rows of house-related data, representing various features that could influence house prices. Below, we discuss key aspects of the dataset, which include its structure, the choice of features, and potential use cases for analysis.

    1. Dataset Features

    The dataset is designed to capture essential attributes for predicting house prices, including:

    • Area: Square footage of the house, which is generally one of the most important predictors of price.
    • Bedrooms & Bathrooms: The number of rooms in a house significantly affects its value. Homes with more rooms tend to be priced higher.
    • Floors: The number of floors in a house could indicate a larger, more luxurious home, potentially raising its price.
    • Year Built: The age of the house can affect its condition and value. Newly built houses are generally more expensive than older ones.
    • Location: Houses in desirable locations such as downtown or urban areas tend to be priced higher than those in suburban or rural areas.
    • Condition: The current condition of the house is critical, as well-maintained houses (in 'Excellent' or 'Good' condition) will attract higher prices compared to houses in 'Fair' or 'Poor' condition.
    • Garage: Availability of a garage can increase the price due to added convenience and space.
    • Price: The target variable, representing the sale price of the house, used to train machine learning models to predict house prices based on the other features.

    2. Feature Distributions

    • Area Distribution: The area of the houses in the dataset ranges from 500 to 5000 square feet, which allows analysis across different types of homes, from smaller apartments to larger luxury houses.
    • Bedrooms and Bathrooms: The number of bedrooms varies from 1 to 5, and bathrooms from 1 to 4. This variance enables analysis of homes with different sizes and layouts.
    • Floors: Houses in the dataset have between 1 and 3 floors. This feature could be useful for identifying the influence of multi-level homes on house prices.
    • Year Built: The dataset contains houses built from 1900 to 2023, giving a wide range of house ages to analyze the effects of new vs. older construction.
    • Location: There is a mix of urban, suburban, downtown, and rural locations. Urban and downtown homes may command higher prices due to proximity to amenities.
    • Condition: Houses are labeled as 'Excellent', 'Good', 'Fair', or 'Poor'. This feature helps model the price differences based on the current state of the house.
    • Price Distribution: Prices range between $50,000 and $1,000,000, offering a broad spectrum of property values. This range makes the dataset appropriate for predicting a wide variety of housing prices, from affordable homes to luxury properties.

    3. Correlation Between Features

    A key area of interest is the relationship between various features and house price:

    • Area and Price: Typically, a strong positive correlation is expected between the size of the house (Area) and its price. Larger homes are likely to be more expensive.
    • Location and Price: Location is another major factor. Houses in urban or downtown areas may show a higher price on average compared to suburban and rural locations.
    • Condition and Price: The condition of the house should show a positive correlation with price. Houses in better condition should be priced higher, as they require less maintenance and repair.
    • Year Built and Price: Newer houses might command a higher price due to better construction standards, modern amenities, and less wear-and-tear, but some older homes in good condition may retain historical value.
    • Garage and Price: A house with a garage may be more expensive than one without, as it provides extra storage or parking space.

    4. Potential Use Cases

    The dataset is well-suited for various machine learning and data analysis applications, including:

    • House Price Prediction: Using regression techniques, this dataset can be used to build a model to predict house prices based on the available features.
    • Feature Importance Analysis: By using techniques such as feature importance ranking, data scientists can determine which features (e.g., location, area, or condition) have the greatest impact on house prices.
    • Clustering: Clustering techniques like k-means could help identify patterns in the data, such as grouping houses into segments based on their characteristics (e.g., luxury homes, affordable homes).
    • Market Segmentation: The dataset can be used to perform segmentation by location, price range, or house type to analyze trends in specific sub-markets, like luxury vs. affordable housing.
    • Time-Based Analysis: By studying how house prices vary with the year built or the age of the house, analysts can derive insights into the trends of older vs. newer homes.

    5. Limitations and ...

  3. Housing Price Data

    • kaggle.com
    zip
    Updated Mar 13, 2024
    Cite
    Saurabh Badole (2024). Housing Price Data [Dataset]. https://www.kaggle.com/datasets/saurabhbadole/housing-price-data
    Available download formats: zip (4762 bytes)
    Dataset updated
    Mar 13, 2024
    Authors
    Saurabh Badole
    License

    Attribution-NonCommercial-ShareAlike 3.0 (CC BY-NC-SA 3.0): https://creativecommons.org/licenses/by-nc-sa/3.0/
    License information was derived automatically

    Description

    Description:

    This dataset contains various features of residential properties along with their corresponding prices. It is suitable for exploring and analyzing factors influencing housing prices and for building predictive models to estimate the price of a property based on its attributes.

    | Feature | Description |
    |:-----------------|:------------|
    | price | The price of the property. |
    | area | The total area of the property in square feet. |
    | bedrooms | The number of bedrooms in the property. |
    | bathrooms | The number of bathrooms in the property. |
    | stories | The number of stories (floors) in the property. |
    | mainroad | Indicates whether the property is located on a main road (binary: yes/no). |
    | guestroom | Indicates whether the property has a guest room (binary: yes/no). |
    | basement | Indicates whether the property has a basement (binary: yes/no). |
    | hotwaterheating | Indicates whether the property has hot water heating (binary: yes/no). |
    | airconditioning | Indicates whether the property has air conditioning (binary: yes/no). |
    | parking | The number of parking spaces available with the property. |
    | prefarea | Indicates whether the property is in a preferred area (binary: yes/no). |
    | furnishingstatus | The furnishing status of the property (e.g., furnished, semi-furnished, unfurnished). |
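
    Most regression libraries expect numeric inputs, so the yes/no columns listed above are typically mapped to 1/0 before modelling. The sketch below shows one way to do that with the standard library; the column names come from the table, while the sample rows, the `encode_row` helper, and the exact spelling of the values ("yes"/"no") are assumptions.

```python
import csv
import io

# Binary columns as named in the feature table above.
BINARY_COLUMNS = [
    "mainroad", "guestroom", "basement",
    "hotwaterheating", "airconditioning", "prefarea",
]
YES_NO = {"yes": 1, "no": 0}


def encode_row(row):
    """Return a copy of the row with yes/no columns mapped to 1/0.

    Raises KeyError on any value other than yes/no, so unexpected
    spellings surface early instead of being silently encoded.
    """
    encoded = dict(row)
    for column in BINARY_COLUMNS:
        encoded[column] = YES_NO[row[column].strip().lower()]
    return encoded


# Made-up sample rows, NOT taken from the dataset.
sample_csv = (
    "price,area,mainroad,guestroom,basement,"
    "hotwaterheating,airconditioning,prefarea\n"
    "100,2000,yes,no,no,no,yes,yes\n"
    "80,1500,no,no,yes,no,no,no\n"
)
rows = [encode_row(r) for r in csv.DictReader(io.StringIO(sample_csv))]
print(rows[0]["mainroad"], rows[1]["mainroad"])
```

    The three-valued furnishingstatus column would need a different treatment, such as one-hot encoding, which this sketch does not cover.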

    Usage:

    • This dataset can be used for exploratory data analysis to understand the relationships between different housing features and prices.
    • It can also be used to build machine learning models for predicting housing prices based on the given features.

    License: This dataset is made available under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

  4. New York Housing Market

    • kaggle.com
    Updated Jan 6, 2024
    Cite
    Nidula Elgiriyewithana ⚡ (2024). New York Housing Market [Dataset]. http://doi.org/10.34740/kaggle/dsv/7351086
    Available format: Croissant, a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 6, 2024
    Dataset provided by
    Kaggle
    Authors
    Nidula Elgiriyewithana ⚡
    Area covered
    New York
    Description

    Description:

    This dataset contains prices of New York houses, providing valuable insights into the real estate market in the region. It includes information such as broker titles, house types, prices, number of bedrooms and bathrooms, property square footage, addresses, state, administrative and local areas, street names, and geographical coordinates.

    Key Features:

    • BROKERTITLE: Title of the broker
    • TYPE: Type of the house
    • PRICE: Price of the house
    • BEDS: Number of bedrooms
    • BATH: Number of bathrooms
    • PROPERTYSQFT: Square footage of the property
    • ADDRESS: Full address of the house
    • STATE: State of the house
    • MAIN_ADDRESS: Main address information
    • ADMINISTRATIVE_AREA_LEVEL_2: Administrative area level 2 information
    • LOCALITY: Locality information
    • SUBLOCALITY: Sublocality information
    • STREET_NAME: Street name
    • LONG_NAME: Long name
    • FORMATTED_ADDRESS: Formatted address
    • LATITUDE: Latitude coordinate of the house
    • LONGITUDE: Longitude coordinate of the house

    Potential Use Cases:

    • Price analysis: Analyze the distribution of house prices to understand market trends and identify potential investment opportunities.
    • Property size analysis: Explore the relationship between property square footage and prices to assess the value of different-sized houses.
    • Location-based analysis: Investigate geographical patterns to identify areas with higher or lower property prices.
    • Bedroom and bathroom trends: Analyze the impact of the number of bedrooms and bathrooms on house prices.
    • Broker performance analysis: Evaluate the influence of different brokers on the pricing of houses.

    If you find this dataset useful, your support through an upvote would be greatly appreciated ❤️🙂 Thank you

  5. US Cities Housing Market Data - Live Dataset

    • kaggle.com
    zip
    Updated Oct 12, 2025
    Cite
    Vincent Vaseghi (2025). US Cities Housing Market Data - Live Dataset [Dataset]. https://www.kaggle.com/datasets/vincentvaseghi/us-cities-housing-market-data
    Available download formats: zip (984945960 bytes)
    Dataset updated
    Oct 12, 2025
    Authors
    Vincent Vaseghi
    Area covered
    United States
    Description

    Redfin is a real estate brokerage and publishes the US housing market data on a regular basis. Using this dataset, you can analyze and visualize housing market data for US cities. Timeline: Starting from February 2012 until the present time (Data is refreshed and updated on a monthly basis)

    The dataset has the following columns:

    - period_begin
    - period_end
    - period_duration
    - region_type
    - region_type_id
    - table_id
    - is_seasonally_adjusted (indicates if prices are seasonally adjusted; f represents False)
    - region
    - city
    - state
    - state_code
    - property_type
    - property_type_id
    - median_sale_price
    - median_sale_price_mom (median sale price changes month over month)
    - median_sale_price_yoy (median sale price changes year over year)
    - median_list_price
    - median_list_price_mom (median list price changes month over month)
    - median_list_price_yoy (median list price changes year over year)
    - median_ppsf (median sale price per square foot)
    - median_ppsf_mom (median sale price per square foot changes month over month)
    - median_ppsf_yoy (median sale price per square foot changes year over year)
    - median_list_ppsf (median list price per square foot)
    - median_list_ppsf_mom (median list price per square foot changes month over month)
    - median_list_ppsf_yoy (median list price per square foot changes year over year)
    - homes_sold (number of homes sold)
    - homes_sold_mom (number of homes sold month over month)
    - homes_sold_yoy (number of homes sold year over year)
    - pending_sales
    - pending_sales_mom
    - pending_sales_yoy
    - new_listings
    - new_listings_mom
    - new_listings_yoy
    - inventory
    - inventory_mom
    - inventory_yoy
    - months_of_supply
    - months_of_supply_mom
    - months_of_supply_yoy
    - median_dom (median days on market until property is sold)
    - median_dom_mom (median days on market changes month over month)
    - median_dom_yoy (median days on market changes year over year)
    - avg_sale_to_list (average sale price to list price ratio)
    - avg_sale_to_list_mom (average sale price to list price ratio changes month over month)
    - avg_sale_to_list_yoy (average sale price to list price ratio changes year over year)
    - sold_above_list
    - sold_above_list_mom
    - sold_above_list_yoy
    - price_drops
    - price_drops_mom
    - price_drops_yoy
    - off_market_in_two_weeks (number of properties that will be taken off the market within 2 weeks)
    - off_market_in_two_weeks_mom (changes in number of properties that will be taken off the market within 2 weeks, month over month)
    - off_market_in_two_weeks_yoy (changes in number of properties that will be taken off the market within 2 weeks, year over year)
    - parent_metro_region
    - parent_metro_region_metro_code
    - last_updated
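
    Many of the columns above come in `_mom` and `_yoy` variants. The description does not state the exact formula, so the sketch below assumes they are relative changes against the value one month or twelve months earlier. The helper names and the sample series are hypothetical.

```python
def relative_change(current, previous):
    """Fractional change from previous to current, e.g. 0.05 for +5%."""
    return (current - previous) / previous


def lagged_change(series, lag):
    """Relative change versus the value `lag` periods earlier.

    Positions without enough history are returned as None.
    """
    return [
        None if i < lag else relative_change(series[i], series[i - lag])
        for i in range(len(series))
    ]


# Made-up monthly median sale prices, NOT taken from the dataset.
monthly_prices = [200000 + 1000 * month for month in range(13)]
mom = lagged_change(monthly_prices, lag=1)   # month over month
yoy = lagged_change(monthly_prices, lag=12)  # year over year
print(mom[1], yoy[12])
```

    Comparing values computed this way against the published `_mom` and `_yoy` columns is a quick check on whether the assumed definition matches the data.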

    Filetype: gzip (gz)
    Support for gzip files in Python: https://docs.python.org/3/library/gzip.html
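
    As a rough illustration of the gzip support mentioned above, the sketch below streams rows from a gzip-compressed delimited text file using only the standard library. The tab delimiter, the file name, and the sample row are assumptions made for the demo; check the actual file before relying on them.

```python
import csv
import gzip
import os
import tempfile


def read_gzipped_table(path, delimiter="\t"):
    """Yield rows of a gzip-compressed delimited text file as dicts.

    The default tab delimiter is an assumption, not a documented fact.
    """
    with gzip.open(path, mode="rt", encoding="utf-8", newline="") as handle:
        yield from csv.DictReader(handle, delimiter=delimiter)


# Demo on a tiny made-up file so the example runs without the real download.
with tempfile.TemporaryDirectory() as tmp_dir:
    sample_path = os.path.join(tmp_dir, "sample.tsv.gz")
    with gzip.open(sample_path, mode="wt", encoding="utf-8", newline="") as out:
        out.write("city\tmedian_sale_price\nExampleville\t250000\n")
    rows = list(read_gzipped_table(sample_path))

print(rows)
```

    Because the function is a generator, rows are decompressed and parsed one at a time, which matters for an archive of this size.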

    Data Source & Credit: Redfin.com

  6. House Pricing Dataset

    • kaggle.com
    zip
    Updated Jan 27, 2025
    Cite
    Aly El-badry (2025). House Pricing Dataset [Dataset]. https://www.kaggle.com/datasets/alyelbadry/house-pricing-dataset
    Available download formats: zip (815554 bytes)
    Dataset updated
    Jan 27, 2025
    Authors
    Aly El-badry
    License

    MIT License: https://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    House Prices Dataset

    Subtitle:

    Detailed Real Estate Data for Predicting House Prices and Analyzing Market Trends

    Description:

    This dataset contains information on 21,613 properties, making it a comprehensive resource for exploring real estate market trends and building predictive models for house prices. The data includes various features capturing property details, location, and market conditions, providing ample opportunities for data exploration, visualization, and machine learning applications.

    Key Features:

    • General Information:

      • id: Unique identifier for each property.
      • date: Date of sale.
    • Price Details:

      • price: Sale price of the house.
    • Property Features:

      • bedrooms: Number of bedrooms.
      • bathrooms: Number of bathrooms (including partials as fractions).
      • sqft_living: Living space area in square feet.
      • sqft_lot: Lot size in square feet.
      • floors: Number of floors.
      • waterfront: Whether the property has a waterfront view.
      • view: Quality of the view rating.
      • condition: Overall condition of the house.
      • grade: Grade of construction and design (scale of 1–13).
    • Additional Metrics:

      • sqft_above: Square footage of the property above ground.
      • sqft_basement: Basement area in square feet.
      • yr_built: Year the property was built.
      • yr_renovated: Year of last renovation.
    • Location Coordinates:

      • zipcode: ZIP code of the property.
      • lat and long: Latitude and longitude coordinates.
    • Neighbor Comparisons:

      • sqft_living15: Average living space of 15 nearest properties.
      • sqft_lot15: Average lot size of 15 nearest properties.

    Use Cases:

    • Predicting house prices using regression models.
    • Identifying the impact of various features (e.g., number of bedrooms, location) on property prices.
    • Analyzing market trends and spatial distribution of real estate prices.

    This dataset is a valuable resource for anyone interested in real estate analytics, machine learning, or geographic data visualization.

  7. Housing Prices Regression 🏘️

    • kaggle.com
    Updated Dec 10, 2024
    Cite
    Den_Kuznetz (2024). Housing Prices Regression 🏘️ [Dataset]. https://www.kaggle.com/datasets/denkuznetz/housing-prices-regression
    Available format: Croissant, a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 10, 2024
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Den_Kuznetz
    License

    Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Task Description: Real Estate Price Prediction

    This task involves predicting the price of real estate properties based on various features that influence the value of a property. The dataset contains several attributes of real estate properties such as square footage, the number of bedrooms, bathrooms, floors, the year the property was built, whether the property has a garden or pool, the size of the garage, the location score, and the distance from the city center.

    The goal is to build a regression model that can predict the Price of a property based on the provided features.

    Dataset Columns:

    ID: A unique identifier for each property.

    Square_Feet: The area of the property in square meters.

    Num_Bedrooms: The number of bedrooms in the property.

    Num_Bathrooms: The number of bathrooms in the property.

    Num_Floors: The number of floors in the property.

    Year_Built: The year the property was built.

    Has_Garden: Indicates whether the property has a garden (1 for yes, 0 for no).

    Has_Pool: Indicates whether the property has a pool (1 for yes, 0 for no).

    Garage_Size: The size of the garage in square meters.

    Location_Score: A score from 0 to 10 indicating the quality of the neighborhood (higher scores indicate better neighborhoods).

    Distance_to_Center: The distance from the property to the city center in kilometers.

    Price: The target variable that represents the price of the property. This is the value we aim to predict.

    Objective: The goal of this task is to develop a regression model that predicts the Price of a real estate property using the other features as inputs. The model should be able to learn the relationship between these features and the price, providing an accurate prediction for unseen data.

  8. Housing Price Dataset of Delhi(India)

    • kaggle.com
    zip
    Updated Nov 23, 2021
    + more versions
    Cite
    Yash Goel (2021). Housing Price Dataset of Delhi(India) [Dataset]. https://www.kaggle.com/datasets/goelyash/housing-price-dataset-of-delhiindia
    Available download formats: zip (966172 bytes)
    Dataset updated
    Nov 23, 2021
    Authors
    Yash Goel
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Area covered
    India, Delhi
    Description

    Context

    This dataset was collected for a college project: an Android app for calculating the price of houses.

    Content

    The data was scraped from the Magicbricks website between June 2021 and July 2021.

    Acknowledgements

    magicbricks.com

    Inspiration

    With the help of the data available one can make a regression model to predict house prices.

  9. House Price Regression Dataset

    • kaggle.com
    zip
    Updated Sep 6, 2024
    Cite
    Prokshitha Polemoni (2024). House Price Regression Dataset [Dataset]. https://www.kaggle.com/datasets/prokshitha/home-value-insights
    Available download formats: zip (27045 bytes)
    Dataset updated
    Sep 6, 2024
    Authors
    Prokshitha Polemoni
    Description

    Home Value Insights: A Beginner's Regression Dataset

    This dataset is designed for beginners to practice regression problems, particularly in the context of predicting house prices. It contains 1000 rows, with each row representing a house and various attributes that influence its price. The dataset is well-suited for learning basic to intermediate-level regression modeling techniques.

    Features:

    1. Square_Footage: The size of the house in square feet. Larger homes typically have higher prices.
    2. Num_Bedrooms: The number of bedrooms in the house. More bedrooms generally increase the value of a home.
    3. Num_Bathrooms: The number of bathrooms in the house. Houses with more bathrooms are typically priced higher.
    4. Year_Built: The year the house was built. Older houses may be priced lower due to wear and tear.
    5. Lot_Size: The size of the lot the house is built on, measured in acres. Larger lots tend to add value to a property.
    6. Garage_Size: The number of cars that can fit in the garage. Houses with larger garages are usually more expensive.
    7. Neighborhood_Quality: A rating of the neighborhood’s quality on a scale of 1-10, where 10 indicates a high-quality neighborhood. Better neighborhoods usually command higher prices.
    8. House_Price (Target Variable): The price of the house, which is the dependent variable you aim to predict.

    Potential Uses:

    1. Beginner Regression Projects: This dataset can be used to practice building regression models such as Linear Regression, Decision Trees, or Random Forests. The target variable (house price) is continuous, making this an ideal problem for supervised learning techniques.

    2. Feature Engineering Practice: Learners can create new features by combining existing ones, such as the price per square foot or age of the house, providing an opportunity to experiment with feature transformations.

    3. Exploratory Data Analysis (EDA): You can explore how different features (e.g., square footage, number of bedrooms) correlate with the target variable, making it a great dataset for learning about data visualization and summary statistics.

    4. Model Evaluation: The dataset allows for various model evaluation techniques such as cross-validation, R-squared, and Mean Absolute Error (MAE). These metrics can be used to compare the effectiveness of different models.
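
    The feature-engineering idea in item 2 can be sketched in a few lines. The column names below follow the feature list for this dataset; the `add_derived_features` helper, the reference year, and the sample row are illustrative assumptions.

```python
def add_derived_features(row, reference_year):
    """Return a copy of the row with two derived columns added.

    Price_Per_SqFt uses the target (House_Price), so it is suitable for
    exploration but would leak the target if fed to a price model.
    """
    derived = dict(row)
    derived["Price_Per_SqFt"] = row["House_Price"] / row["Square_Footage"]
    derived["House_Age"] = reference_year - row["Year_Built"]
    return derived


# Made-up sample row, NOT taken from the dataset.
house = {"Square_Footage": 2000, "Year_Built": 2000, "House_Price": 300000}
enriched = add_derived_features(house, reference_year=2024)
print(enriched["Price_Per_SqFt"], enriched["House_Age"])
```

    The reference year is passed in explicitly so that the derived age stays reproducible regardless of when the code runs.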

    Versatility:

    • The dataset is highly versatile for a range of machine learning tasks. You can apply simple linear models to predict house prices based on one or two features, or use more complex models like Random Forest or Gradient Boosting Machines to understand interactions between variables.

    • It can also be used for dimensionality reduction techniques like PCA or to practice handling categorical variables (e.g., neighborhood quality) through encoding techniques like one-hot encoding.

    • This dataset is ideal for anyone wanting to gain practical experience in building regression models while working with real-world features.

  10. 🏡 Global Housing Market Analysis (2015-2024)

    • kaggle.com
    zip
    Updated Mar 18, 2025
    Cite
    Atharva Soundankar (2025). 🏡 Global Housing Market Analysis (2015-2024) [Dataset]. https://www.kaggle.com/datasets/atharvasoundankar/global-housing-market-analysis-2015-2024
    Available download formats: zip (18363 bytes)
    Dataset updated
    Mar 18, 2025
    Authors
    Atharva Soundankar
    License

    Apache License, v2.0: https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    This dataset provides insights into the global housing market, covering various economic factors from 2015 to 2024. It includes details about property prices, rental yields, interest rates, and household income across multiple countries. This dataset is ideal for real estate analysis, financial forecasting, and market trend visualization.

    📑 Column Descriptions

    | Column Name | Description |
    |:---------------------------|:------------|
    | Country | The country where the housing market data is recorded 🌍 |
    | Year | The year of observation 📅 |
    | Average House Price ($) | The average price of houses in USD 💰 |
    | Median Rental Price ($) | The median monthly rent for properties in USD 🏠 |
    | Mortgage Interest Rate (%) | The average mortgage interest rate percentage 📉 |
    | Household Income ($) | The average annual household income in USD 🏡 |
    | Population Growth (%) | The percentage increase in population over the year 👥 |
    | Urbanization Rate (%) | Percentage of the population living in urban areas 🏙️ |
    | Homeownership Rate (%) | The percentage of people who own their homes 🔑 |
    | GDP Growth Rate (%) | The annual GDP growth percentage 📈 |
    | Unemployment Rate (%) | The percentage of unemployed individuals in the labor force 💼 |
  11. Australian Housing Prices

    • kaggle.com
    zip
    Updated Nov 28, 2022
    Cite
    The Devastator (2022). Australian Housing Prices [Dataset]. https://www.kaggle.com/datasets/thedevastator/australian-housing-data-1000-properties-sampled
    Available download formats: zip (51778 bytes)
    Dataset updated
    Nov 28, 2022
    Authors
    The Devastator
    Area covered
    Australia
    Description

    Australian Housing Prices

    Location, Size, Price, Etc

    By Jeff [source]

    About this dataset

    This dataset contains information on 1000 properties in Australia, including location, size, price, and other details

    How to use the dataset

    If you're looking for Australian housing data, this dataset is a good option. It contains information on over 1000 properties in Australia, including location, size, price, and other details. With this data, you can answer questions such as "What is the average price of a home in Australia?" and "What are the most popular types of homes in Australia?"

    Research Ideas

    • This dataset can be used to predict housing prices in Australia.
    • This dataset can be used to find relationships between housing prices and location.
    • This dataset can be used to find relationships between housing prices and features such as size, number of bedrooms, and number of bathrooms.

    Acknowledgements

    If you use this dataset in your research, please credit the original authors.

    License

    See the dataset description for more information.

    Columns

    File: RealEstateAU_1000_Samples.csv

    | Column name | Description |
    |:----------------|:------------|
    | breadcrumb | A breadcrumb is a text trail that shows the user's location within a website. (String) |
    | category_name | The name of the category that the listing belongs to. (String) |
    | property_type | The type of property being listed. (String) |
    | building_size | The size of the property's building, in square meters. (Numeric) |
    | land_size | The size of the property's land, in square meters. (Numeric) |
    | preferred_size | The preferred size of the property, in square meters. (Numeric) |
    | open_date | The date that the property was first listed for sale. (Date) |
    | listing_agency | The agency that is listing the property. (String) |
    | price | The listing price of the property. (Numeric) |
    | location_number | The number that corresponds to the property's location. (Numeric) |
    | location_type | The type of location that the property is in. (String) |
    | location_name | The name of the location that the property is in. (String) |
    | address | The property's address. (String) |
    | address_1 | The first line of the property's address. (String) |
    | city | The city that the property is located in. (String) |
    | state | The state that the property is located in. (String) |
    | zip_code | The zip code that the property is located in. (String) |
    | phone | The listing agent's phone number. (String) |
    | latitude | The property's latitude. (Numeric) |
    | longitude | The property's longitude. (Numeric) |
    | product_depth | The depth of the product. (Numeric) |
    | bedroom_count | The number of bedrooms in the property. (Numeric) |
    | bathroom_count | The number of bathrooms in the property. (Numeric) |
    | parking_count | The number of parking spaces in the property. (Numeric) |
    | RunDate | The date that the listing was last updated. (Date) |

    Acknowledgements

    If you use this dataset in your research, please credit Jeff.

  12. Housing Prices Dataset - Philippines

    • kaggle.com
    Updated May 3, 2024
    Cite
    Jandrik Lana (2024). Housing Prices Dataset - Philippines [Dataset]. https://www.kaggle.com/datasets/linkanjarad/housing-prices-dataset-philippines
    Available format: Croissant, a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 3, 2024
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Jandrik Lana
    Area covered
    Philippines
    Description

    Dataset on housing prices in the Philippines, scraped from Lamudi in May 2023.

  13. Paris Housing Price Prediction

    • kaggle.com
    zip
    Updated Jun 30, 2021
    + more versions
    Cite
    MsSmartyPants (2021). Paris Housing Price Prediction [Dataset]. https://www.kaggle.com/mssmartypants/paris-housing-price-prediction
    Available download formats: zip (275823 bytes)
    Dataset updated
    Jun 30, 2021
    Authors
    MsSmartyPants
    Area covered
    Paris
    Description

    Context

    This is a synthetic dataset of house prices in an urban environment, Paris. I recommend using it for educational purposes, for practice, and to acquire the necessary knowledge. Next, I plan to create a classification dataset from the same data by adding a new column for the class attribute. A companion classification dataset is available.

    Content

    What's inside is more than just rows and columns. You can see house details listed as column names.

    Description

    All attributes are numeric variables and they are listed below:

    • squareMeters
    • numberOfRooms
    • hasYard
    • hasPool
    • floors - number of floors
    • cityCode - zip code
    • cityPartRange - the higher the range, the more exclusive the neighbourhood is
    • numPrevOwners - number of previous owners
    • made - year
    • isNewBuilt
    • hasStormProtector
    • basement - basement square meters
    • attic - attic square meters
    • garage - garage size
    • hasStorageRoom
    • hasGuestRoom - number of guest rooms
    • price - predicted value

    Inspiration

    The idea was to create a dataset that is well suited to regression and gives adequate results.

  14. Hong Kong Housing Price (2020-2023)

    • kaggle.com
    zip
    Updated Mar 16, 2023
    CyrusTTF (2023). Hong Kong Housing Price (2020-2023) [Dataset]. https://www.kaggle.com/datasets/cyrusttf/hong-kong-housing-price-2020-2023
    Available download formats: zip (4267310 bytes)
    Dataset updated
    Mar 16, 2023
    Authors
    CyrusTTF
    Area covered
    Hong Kong
    Description

    Description: This dataset provides historical housing prices scraped from Centaline Property Hong Kong, one of the largest real estate agencies in Hong Kong. The dataset includes information on the date of the transaction, the property address, floor plan, saleable area, unit rate, source, and district. The dataset covers a period of time spanning several years, allowing for analysis of trends and changes in the Hong Kong housing market.

    Columns:

    • Date: the date of the property transaction
    • Address: the address of the property
    • Floor Plan: --
    • Price: the price of the property
    • Changes: any changes made to the property since the last transaction
    • Saleable Area: the area of the property that can be sold to a buyer
    • Unit Rate: the price per square foot of saleable area
    • Source: the source of the data (Centaline Property Hong Kong / Land Registry)
    • District: the district in which the property is located in Hong Kong
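
    The Unit Rate column is described as the price per square foot of saleable area. A minimal sketch of that arithmetic follows; the function name and the example figures are hypothetical and are not taken from the dataset.

```python
def unit_rate(price: float, saleable_area_sqft: float) -> float:
    """Return the price per square foot of saleable area."""
    if saleable_area_sqft <= 0:
        raise ValueError("saleable area must be positive")
    return price / saleable_area_sqft


# Hypothetical figures for illustration only, not taken from the dataset.
example_price = 6_000_000  # price of the property
example_area = 400         # saleable area in square feet

print(unit_rate(example_price, example_area))  # 15000.0
```

    Recomputing the rate this way could serve as a sanity check against the published Unit Rate column, assuming both Price and Saleable Area are stored as plain numbers.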

  15. Housing Prices Dataset

    • kaggle.com
    zip
    Updated Dec 8, 2017
    + more versions
    173050055 (2017). Housing Prices Dataset [Dataset]. https://www.kaggle.com/datasets/alphaepsilon/housing-prices-dataset
    Available download formats: zip (183401 bytes)
    Dataset updated
    Dec 8, 2017
    Authors
    173050055
    Description

    Dataset

    This dataset was created by 173050055

    Released under Other (specified in description)

    Contents

  16. house_data

    • kaggle.com
    Updated Jul 27, 2022
    Arathi P Raj (2022). house_data [Dataset]. https://www.kaggle.com/datasets/arathipraj/house-data
    Dataset updated
    Jul 27, 2022
    Dataset provided by
    Kaggle (http://kaggle.com/)
    Authors
    Arathi P Raj
    License

    http://opendatacommons.org/licenses/dbcl/1.0/

    Description

    Content

    The dataset consists of house prices in King County, Washington, from sales between May 2014 and May 2015. Along with the house price, it includes 18 house features, the date of sale, and the ID of the sale.

    Attribute information

    1. id - Unique id for each home sold
    2. date - Date of the home sale
    3. price - Price of each home sold
    4. bedrooms - Number of bedrooms
    5. bathrooms - Number of bathrooms
    6. sqft_living - Square footage of the apartment's interior living space
    7. sqft_lot - Square footage of the land space
    8. floors - Number of floors
    9. waterfront - A dummy variable for whether the apartment was overlooking the waterfront or not
    10. view - An index from 0 to 4 of how good the view of the property was
    11. condition - An index from 1 to 5 on the condition of the apartment
    12. grade - An index from 1 to 13, where 1-3 falls short of building construction and design, 7 has an average level of construction and design, and 11-13 have a high-quality level of construction and design
    13. sqft_above - The square footage of the interior housing space that is above ground level
    14. sqft_basement - The square footage of the interior housing space that is below ground level
    15. yr_built - The year the house was initially built
    16. yr_renovated - The year of the house's last renovation
    17. zipcode - What zipcode area the house is in
    18. lat - Latitude
    19. long - Longitude
    20. sqft_living15 - The square footage of interior housing living space for the nearest 15 neighbours
    21. sqft_lot15 - The square footage of the land lots of the nearest 15 neighbours
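
    The grade attribute above is only described for some bands (1-3, 7, and 11-13). A minimal sketch of turning the index into a label, under the assumption that grades 4-6 and 8-10 are left undescribed, could look like this. The function name is hypothetical.

```python
def describe_grade(grade: int) -> str:
    """Map the 1-13 grade index to the wording used in the attribute list."""
    if not 1 <= grade <= 13:
        raise ValueError("grade must be between 1 and 13")
    if grade <= 3:
        return "falls short of building construction and design"
    if grade == 7:
        return "average level of construction and design"
    if grade >= 11:
        return "high quality level of construction and design"
    # Assumption: the attribute list gives no wording for grades 4-6 and 8-10.
    return "not described in the attribute list"


for g in (2, 7, 9, 12):
    print(g, "->", describe_grade(g))
```
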
  17. Housing Price & Real Estate - 2023

    • kaggle.com
    zip
    Updated Oct 8, 2023
    Reena Pinto (2023). Housing Price & Real Estate - 2023 [Dataset]. https://www.kaggle.com/datasets/reenapinto/housing-price-and-real-estate-2023
    Available download formats: zip (260191 bytes)
    Dataset updated
    Oct 8, 2023
    Authors
    Reena Pinto
    License

    https://creativecommons.org/publicdomain/zero/1.0/

    Description

    A housing market prediction that many experts agree on is that it will be a seller’s market. Home prices are expected to rise for some time due to increased demand and limited supply. Millennials are at the age to start investing in the real estate market for the first time. Hence, the demand for residential and commercial projects is rising with every passing day. The future of real estate will witness a rise in demand and limited supply, resulting in it being a seller’s market.

    Your 1 upvote encourages me to upload more trending datasets. Thanks for your support.

    If you liked the dataset, please upvote to upload more trending datasets. Thanks for your support.

  18. American House Prices

    • kaggle.com
    zip
    Updated Dec 9, 2023
    Jeremy Larcher (2023). American House Prices [Dataset]. https://www.kaggle.com/datasets/jeremylarcher/american-house-prices-and-demographics-of-top-cities
    Available download formats: zip (682260 bytes)
    Dataset updated
    Dec 9, 2023
    Authors
    Jeremy Larcher
    License

    Apache License, v2.0 (https://www.apache.org/licenses/LICENSE-2.0)
    License information was derived automatically

    Area covered
    United States
    Description

    A dataset comprising various variables around housing and demographics for the top 50 American cities by population.

    Variables:

    Zip Code: Zip code within which the listing is present.

    Price: Listed price for the property.

    Beds: Number of beds mentioned in the listing.

    Baths: Number of baths mentioned in the listing.

    Living Space: The total size of the living space, in square feet, mentioned in the listing.

    Address: Street address of the listing.

    City: City name where the listing is located.

    State: State name where the listing is located.

    Zip Code Population: The estimated number of individuals within the zip code. Data from Simplemaps.com.

    Zip Code Density: The estimated number of individuals per square mile within the zip code. Data from Simplemaps.com.

    County: County where the listing is located.

    Median Household income: Estimated median household income. Data from the U.S. Census Bureau.

    Latitude: Latitude of the zip code. Data from Simplemaps.com.

    Longitude: Longitude of the zip code. Data from Simplemaps.com.

  19. Mini House Price Data Set

    • kaggle.com
    zip
    Updated Aug 6, 2020
    Vikas Ukani (2020). Mini House Price Data Set [Dataset]. https://www.kaggle.com/datasets/vikasukani/mini-house-price-data-set
    Available download formats: zip (260 bytes)
    Dataset updated
    Aug 6, 2020
    Authors
    Vikas Ukani
    Description

    Context

    A mini house price prediction dataset for beginner notebooks.

  20. House Prices in Malaysia (2025)

    • kaggle.com
    zip
    Updated Jan 3, 2025
    + more versions
    Jien Weng (2025). House Prices in Malaysia (2025) [Dataset]. https://www.kaggle.com/datasets/lyhatt/house-prices-in-malaysia-2025
    Available download formats: zip (39697 bytes)
    Dataset updated
    Jan 3, 2025
    Authors
    Jien Weng
    License

    MIT License (https://opensource.org/licenses/MIT)
    License information was derived automatically

    Area covered
    Malaysia
    Description

    This dataset contains 2,000 entries of house price data from all states in Malaysia, providing a comprehensive overview of the country’s real estate market for 2025. Sourced from Brickz, a trusted platform for property transaction insights, it includes detailed information such as property location, tenure, type, median prices, and transaction counts. This dataset is ideal for real estate market analysis, predictive modeling, and exploring trends across Malaysia’s diverse property market.

    Data Columns (Total 8 Columns):

    1. Township: The specific township where the property is located (e.g., Cheras, Subang Jaya).
    2. Area: The locality or broader area encompassing the township (e.g., Klang Valley, Penang Island).
    3. State: The Malaysian state where the property is situated (e.g., Selangor, Johor, Penang).
    4. Tenure: The property ownership type (e.g., Freehold, Leasehold).
    5. Type: The category of property (e.g., Terrace, Condominium, Semi-Detached).
    6. Median_Price: The median price (in MYR) for properties in the specified township or area.
    7. Median_PSF: The median price per square foot (in MYR) for properties.
    8. Transactions: The number of recorded property transactions.

    Future Plans:

    • Expanded Coverage: This dataset will be regularly updated with additional property data to make it even more versatile.
    • Enhanced Features: Future updates may include rental prices, amenities, or property-specific details to offer deeper insights into Malaysia’s housing market.
M Yasser H (2022). Housing Prices Dataset [Dataset]. https://www.kaggle.com/datasets/yasserh/housing-prices-dataset

Housing Prices Dataset

Housing Prices Prediction - Regression Problem

13 scholarly articles cite this dataset.
Available download formats: zip (4740 bytes)
Dataset updated
Jan 12, 2022
Authors
M Yasser H
License

https://creativecommons.org/publicdomain/zero/1.0/

Description

Description:

A simple yet challenging project: predict the housing price from factors such as house area, number of bedrooms, furnishing status, and nearness to the main road. The dataset is small, yet its complexity arises from strong multicollinearity. Can you overcome these obstacles and build a decent predictive model?

Acknowledgement:

Harrison, D. and Rubinfeld, D.L. (1978) Hedonic prices and the demand for clean air. J. Environ. Economics and Management 5, 81–102. Belsley D.A., Kuh, E. and Welsch, R.E. (1980) Regression Diagnostics. Identifying Influential Data and Sources of Collinearity. New York: Wiley.

Objective:

  • Understand the dataset and clean it up (if required).
  • Build regression models to predict the sale price from a single feature and from multiple features.
  • Evaluate the models and compare their respective scores, such as R2 and RMSE.
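
The objectives above name R2 and RMSE as comparison scores. A minimal sketch of a single-feature least-squares fit and both metrics, using only the Python standard library, might look like this. The function names and the toy numbers are hypothetical and are not taken from the dataset.

```python
import math


def fit_simple_linear(xs, ys):
    """Ordinary least squares for one feature; returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def rmse(y_true, y_pred):
    """Root mean squared error between observed and predicted values."""
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))


def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)
    return 1 - ss_res / ss_tot


# Toy data for illustration only (area vs. price), not from the dataset.
areas = [1000, 1500, 2000, 2500]
prices = [200, 290, 410, 500]

slope, intercept = fit_simple_linear(areas, prices)
predictions = [slope * a + intercept for a in areas]
print("RMSE:", rmse(prices, predictions))
print("R2:", r2_score(prices, predictions))
```

Scoring on the same rows used for fitting, as done here for brevity, tends to flatter the model; a held-out split would give a fairer comparison between single-feature and multi-feature models.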