Facebook
TwitterThis data set is a listing of all property sales by NORA through the following disposition channels. - Auction: Properties put up for auction and sold to the highest bidder. - Development: Properties offered to development partners at a discounted rate to support the development of affordable housing. - Lot Next Door: Properties sold to adjacent parcel owners, with discount opportunities for eligible participants. - Alternative Land Use: Properties sold for development of green space and community gardens. Note: this dataset contains duplicate addresses, which likely represent reversions or quitclaims that NORA sold again.
Facebook
TwitterCC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This dataset contains data on all Real Property parcels that have sold since 2013 in Allegheny County, PA.
Before doing any market analysis on property sales, check the sales validation codes. Many property "sales" are not considered a valid representation of the true market value of the property. For example, when multiple lots are together on one deed with one price they are generally coded as invalid ("H") because the sale price for each parcel ID number indicates the total price paid for a group of parcels, not just for one parcel. See the Sales Validation Codes Dictionary for a complete explanation of valid and invalid sale codes.
Sales Transactions Disclaimer: Sales information is provided from the Allegheny County Department of Administrative Services, Real Estate Division. Content and validation codes are subject to change. Please review the Data Dictionary for details on included fields before each use. Property owners are not required by law to record a deed at the time of sale. Consequently the assessment system may not contain a complete sales history for every property and every sale. You may do a deed search at http://www.alleghenycounty.us/re/index.aspx directly for the most updated information. Note: Ordinance 3478-07 prohibits public access to search assessment records by owner name. It was signed by the Chief Executive in 2007.
Facebook
TwitterApache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Here's an explanation of each column:
MSSubClass: The building class. This column represents the type of dwelling involved in the sale.
MSZoning: The general zoning classification of the sale. This column indicates the general zoning classification of the property.
LotArea: Lot size in square feet. This column represents the area of the land associated with the property.
LotConfig: Lot configuration. This column describes the shape of the property and how it is situated.
BldgType: Type of dwelling. This column indicates the type of building involved in the sale, such as single-family detached, duplex, etc.
OverallCond: Overall condition rating. This column represents the overall condition of the property, rated on a scale from 1 to 10.
YearBuilt: Original construction date. This column indicates the year when the property was originally built.
YearRemodAdd: Remodel date. This column represents the year when the property was remodeled or renovated.
Exterior1st: Exterior covering on the house. This column describes the primary exterior covering material of the property.
BsmtFinSF2: Type 2 finished square feet. This column represents the area of the basement that is finished with a secondary type of finishing material.
TotalBsmtSF: Total square feet of basement area. This column indicates the total area of the basement in square feet.
SalePrice: Sale price of the property. This column represents the sale price of the property in dollars.
These columns provide various attributes and characteristics of the properties in the dataset, which can be used to predict the sale price of houses.
Facebook
TwitterPlease note: this data is live (updated nightly) to reflect the latest changes in the City's systems of record.Overview of the Data:This dataset is a polygon feature layer with the boundaries of all tax parcels owned by the City of Rochester. This includes all public parks, and municipal buildings, as well as vacant land and structures currently owned by the City. The data includes fields with features about each property including property type, date of sale, land value, dimensions, and more.About City Owned Properties:The City's real estate inventory is managed by the Division of Real Estate in the Department of Neighborhood and Business Development. Properties like municipal buildings and parks are expected to be in long term ownership of the City. Properties such as vacant land and vacant structures are ones the City is actively seeking to reposition for redevelopment to increase the City's tax base and economic activity. The City acquires many of these properties through the tax foreclosure auction process when no private entity bids the minimum bid. Some of these properties stay in the City's ownership for years, while others are quickly sold to development partners. For more information please visit the City's webpage for the Division of Real Estate: https://www.cityofrochester.gov/realestate/Data Dictionary: SBL: The twenty-digit unique identifier assigned to a tax parcel. PRINTKEY: A unique identifier for a tax parcel, typically in the format of “Tax map section – Block – Lot". Street Number: The street number where the tax parcel is located. Street Name: The street name where the tax parcel is located. NAME: The street number and street name for the tax parcel. City: The city where the tax parcel is located. Property Class Code: The standardized code to identify the type and/or use of the tax parcel. For a full list of codes, view the NYS Real Property System (RPS) property classification codes guide. Property Class: The name of the property class associated with the property class code. Property Type: The type of property associated with the property class code. There are nine different types of property according to RPS: 100: Agricultural 200: Residential 300: Vacant Land 400: Commercial 500: Recreation & Entertainment 600: Community Services 700: Industrial 800: Public Services 900: Wild, forested, conservation lands and public parks First Owner Name: The name of the property owner of the vacant tax parcel. If there are multiple owners, then the first one is displayed. Postal Address: The USPS postal address for the vacant landowner. Postal City: The USPS postal city, state, and zip code for the vacant landowner. Lot Frontage: The length (in feet) of how wide the lot is across the street. Lot Depth: The length (in feet) of how far the lot goes back from the street. Stated Area: The area of the vacant tax parcel. Current Land Value: The current value (in USD) of the tax parcel. Current Total Assessed Value: The current value (in USD) assigned by a tax assessor, which takes into consideration both the land value, buildings on the land, etc. Current Taxable Value: The amount (in USD) of the assessed value that can be taxed. Tentative Land Value: The current value (in USD) of the land on the tax parcel, subject to change based on appeals, reassessments, and public review. Tentative Total Assessed Value: The preliminary estimate (in USD) of the tax parcel’s assessed value, which includes tentative land value and tentative improvement value. Tentative Taxable Value: The preliminary estimate (in USD) of the tax parcel’s value used to calculate property taxes. Sale Date: The date (MM/DD/YYYY) of when the vacant tax parcel was sold. Sale Price: The price (in USD) of what the vacant tax parcel was sold for. Book: The record book that the property deed or sale is recorded in. Page: The page in the record book where the property deed or sale is recorded in. Deed Type: The type of deed associated with the vacant tax parcel sale. RESCOM: Notes whether the vacant tax parcel is zoned for residential or commercial use. R: Residential C: Commercial BISZONING: Notes the zoning district the vacant tax parcel is in. For more information on zoning, visit the City’s Zoning District map. OWNERSHIPCODE: Code to note type of ownership (if applicable). Number of Residential Units: Notes how many residential units are available on the tax parcel (if applicable). LOW_STREET_NUM: The street number of the vacant tax parcel. HIGH_STREET_NUM: The street number of the vacant tax parcel. GISEXTDATE: The date and time when the data was last updated. SALE_DATE_datefield: The recorded date of sale of the vacant tax parcel (if available). Source: This data comes from the department of Neighborhood and Business Development, Bureau of Real Estate.
Facebook
TwitterAttribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
The table below showcases the total number of homes sold for each zip code in Columbus, Ohio. It's important to understand that the number of homes sold can vary greatly and can change yearly.
Facebook
Twitterhttps://data.gov.tw/licensehttps://data.gov.tw/license
Facebook
Twitterhttps://data.gov.tw/licensehttps://data.gov.tw/license
Real estate sales case transaction information, including the location of the subject property (de-identified), area, total price, and other information. (Tucheng District)
Facebook
Twitterhttps://louisville-metro-opendata-lojic.hub.arcgis.com/pages/terms-of-use-and-licensehttps://louisville-metro-opendata-lojic.hub.arcgis.com/pages/terms-of-use-and-license
Develop Louisville Focuses on the full range of land development activities, including planning and design, vacant property initiatives, advanced planning, housing & community development programs, permits and licensing, land acquisition, public art and clean and green sustainable development partnerships.Data Dictionary:“LBA” is the abbreviation for the Louisville and Jefferson County LBA Authority, Inc."Parcel ID" is an identification code assigned to a piece of real estate by the Jefferson County Property Valuation Administration. The Parcel ID is used for record keeping and tax purposes.“IMPROV” stands for whether or not the real estate parcel had an “improvement” (i.e., a structure) situated on it at the time it was sold. “1” indicates that a structure existed when the parcel was sold and “0” indicates that the parcel was an empty, piece of land.“APPLICANT” is the individual(s) or active business entity that submitted an Application to Purchase the real estate parcel and whose application was presented to and approved by the LBA’s Board of Directors. The Board of Directors must approve each application before a transfer deed is officially recorded with the Office of the County Clerk of Jefferson County, Kentucky.“SALE DATE” is the date that the Applicant signed the transfer deed for the respective real estate parcel.“SALE AMOUNT” is the amount that the Applicant paid to purchase the respective real estate parcel.“SALE PROGRAM” is the LBA’s disposition program that the Applicant participated in to acquire the real estate parcel.The Office of Community Development defines each “Sale Program” as follows:Budget Rate (“Budget Rate Policy for New Construction Projects”) – Applicant submitted a proposed construction project for the empty, piece of land.Cut It Keep It - Applicant requested to maintain the empty piece of land situated on the same block as a real estate parcel owned by the Applicant. Applicant must retain ownership of the lot for three (3) years before the Applicant can sell it.Demo for Deed (“Last Look – Demo for Deed”) – Applicant requested to demolish the structure situated on the real estate parcel and retain the land for a future use.Flex Rate (“Flex Rate Policy for New Construction Projects”) – Applicant submitted a proposed construction project for the empty, piece of land but did not have proof of funding or a timeline as to when the project would be completed.Metro Redevelopment – The real estate parcel was part of a redevelopment project being considered by Metro Government.Minimum Pricing Policy – The pricing policy that was approved by the LBA’s Board of Directors and in effect as of the real estate parcel’s sale date.RFP (“Request for Proposals”) - Applicant requested to rehabilitate the structure in order to place it back into productive use within the neighborhood.Save the Structure (“Last Look – Save the Structure”) - Applicant requested to rehabilitate the structure in order to place it back into productive use within the neighborhood.Side Yard – The Applicant requested to acquire the LBA’s adjoining piece of land to make the Applicant’s occupied, real estate parcel larger and more valuable.SOI (“Solicitation of Interest”) – The LBA assembled two (2) or more real estate parcels and the Applicant submitted a redevelopment project for the subject parcels.For more information about each of the current disposition programs that the LBA offers, please refer to the following website pages:https://louisvilleky.gov/government/community-development/vacant-lot-sales-programshttps://louisvilleky.gov/government/community-development/vacant-structures-saleContact:Connie Suttonconnie.sutton@louisvilleky.gov
Facebook
TwitterProvides a selection of recently sold homes that have similar characteristics, location, age, etc. to a subject property. Used for determining the value of a property in relation to similar properties that have recently sold in the same area.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
https://raw.githubusercontent.com/Masterx-AI/Project_Housing_Price_Prediction_/main/hs.jpg" alt="">
A simple yet challenging project, to predict the housing price based on certain factors like house area, bedrooms, furnished, nearness to mainroad, etc. The dataset is small yet, it's complexity arises due to the fact that it has strong multicollinearity. Can you overcome these obstacles & build a decent predictive model?
Harrison, D. and Rubinfeld, D.L. (1978) Hedonic prices and the demand for clean air. J. Environ. Economics and Management 5, 81–102. Belsley D.A., Kuh, E. and Welsch, R.E. (1980) Regression Diagnostics. Identifying Influential Data and Sources of Collinearity. New York: Wiley.
Facebook
TwitterPurchase cases, Area sold, Purchase amount, Average purchase value for building land: Germany, years, land size classes, type of land
Facebook
TwitterDetroit Land Bank Authority (DLBA) programs help individuals and organizations purchase houses, vacant land, and other properties at discounted prices. This dataset provides information on houses for sale through the DLBA Auction, Own It Now, and Renovation programs. Detroit Land Bank Authority (DLBA) programs help individuals and organizations purchase houses, vacant land, and other properties at discounted prices. This dataset provides information on houses for sale through the DLBA Auction, Own It Now, and Renovation programs.
The Detroit Land Bank Authority (DLBA) works directly with individual buyers, as well as Community Partner organizations and developers to achieve their mission to return the city's blighted and vacant properties to productive use. They utilize a variety of sales programs to make homeownership and land purchases accessible to Detroiters. Each row in the DLBA For Sale dataset represents a DLBA-acquired home that is for sale through the Auction, Own It Now, or Renovation program. Each of these house listings includes data about the listing date, sale price, and location information such as address, parcel number, and neighborhood. For more information about these DLBA sales programs, see the DLBA 'Structures' and Frequently Asked Questions (FAQ) webpages.
Facebook
Twitterhttps://fred.stlouisfed.org/legal/#copyright-citation-requiredhttps://fred.stlouisfed.org/legal/#copyright-citation-required
Graph and download economic data for Real Residential Property Prices for United States (QUSR628BIS) from Q1 1970 to Q2 2025 about residential, HPI, housing, real, price index, indexes, price, and USA.
Facebook
TwitterOpen Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
These datasets are published as part of the requirements on data transparency and are refreshed on the first of the month.
This dataset provides information on the government estate, including various property related characteristics such as: location, ownership, size, tenure and type of property.
The scope of the data includes land and property information for UK central government departments and their arms length bodies including non-ministerial departments, executive agencies, non-departmental public bodies and special health authorities. Whilst these assets are primarily located in the UK,some are located overseas.
Some properties may have more than one entry in the data extract as the government has more than one ‘interest’ in that property. For example, there may be two or more government occupiers in the same property. It also provides information about the ‘holding’ government department and, if relevant, the arm’s length body of the department responsible for the property. This dataset contains non sensitive information on the government estate e.g. commercially sensitive contract data is not published. The dataset also excludes property records that are classed as sensitive e.g. for national security purposes.
All data provided via these data sets are as reported to the Cabinet Office by the holding departments.
Property and Contracts
This dataset covers properties and their associated contracts. A property may have more than one contract associated with it. This data set includes information such as Ownership, Location, Size, Usage, Asset type (Building or Land), Contract Name and Contracted Organisation.
Building
Properties can be made up of one or more buildings and are linked to the property via a property reference. Characteristics such as Building Ownership, Location, Floor Area, Usage, Size and Construction Date are recorded and this entity is linked to the property via the property reference.
Land
Whilst properties can be made up of Building(s) and Land they can also refer exclusively to Land only. Land records include information on Ownership, Location, Size and Usage and this entity is linked to the property via the property reference.
Occupation
Occupations highlight which organisations reside within a given property. The following types of information about occupying organisations is recorded: organisation, location, asset type(e.g. Land, Building), size of the occupation (floor area), type of agreement (e.g. sub-let) and the usage (e.g. Office, Court).
Surplus Property
When a property is no longer required for the purposes of the organisation that currently holds the asset, it is then designated as being Surplus. These can then be made available for disposal which involves the transfer of a freehold or leasehold by way of sale or other agreement. Data such as Ownership, Location, Size, Usage and Contact Information is recorded for surplus property.
Vacant Space
To facilitate better utilisation of the estate; where space is available in properties these can be marked as such and made available to other government departments for co-location purposes. This data set contains Ownership, Location, Size, Information about the Space, and Contact Details.
Facebook
TwitterThis residential real estate data set was created by Redfin, an online real estate brokerage. Published on January 9th, 2022, this data summarize the monthly housing market for every State, Metro, and Zip code in the US from 2012 to 2021. Redfin aggregated this data across multiple listing services and has been gracious enough to include property type in their reporting. Please properly cite and link to RedFin if you end up using this data for your research or project.
Source: RedFin Data Center
Property type defined by RedFin
Source: Building Types
For more definitions, please visit RedFin Data Center Metrics
Facebook
TwitterMIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset contains information about real estate transactions in São Paulo in the first quarter of 2023. The dataset includes information about the location of the transaction, the type of transaction, the value of the transaction, the date of the transaction, the type of financing used, the type of property involved, the built-up area of the property, the year the property was built, and the geographic coordinates of the property.
For organizational reasons, I only considered residential properties and only purchase and sale or auction transactions.
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset contains information related to housing sales, in the form of individual properties. Here's a breakdown of the columns:
| Column Name | Description |
|---|---|
| Lot_Frontage | Linear feet of street connected to the property |
| Lot_Area | Lot size in square feet |
| Bldg_Type | Type of building |
| House_Style | Style of the house |
| Overall_Cond | Overall condition rating of the house |
| Year_Built | Year the house was built |
| Exter_Cond | Exterior condition rating of the house |
| Total_Bsmt_SF | Total square feet of basement area |
| First_Flr_SF | First-floor square feet |
| Second_Flr_SF | Second-floor square feet |
| Full_Bath | Number of full bathrooms |
| Half_Bath | Number of half bathrooms |
| Bedroom_AbvGr | Number of bedrooms above ground |
| Kitchen_AbvGr | Number of kitchens above ground |
| Fireplaces | Number of fireplaces |
| Longitude | Longitude coordinates of the property location |
| Latitude | Latitude coordinates of the property location |
| Sale_Price | Sale price of the property |
The dataset contains 2413 entries and has a mixture of numerical and categorical data. It's likely used for analyzing various factors influencing housing sale prices, such as location, size, condition, and amenities.
Facebook
Twitterhttps://data.gov.tw/licensehttps://data.gov.tw/license
Facebook
TwitterAttribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
France New Houses Sold: Average Sales Price: Region: Center-Val de Loire data was reported at 218.536 EUR th in Mar 2018. This records an increase from the previous number of 194.742 EUR th for Dec 2017. France New Houses Sold: Average Sales Price: Region: Center-Val de Loire data is updated quarterly, averaging 192.745 EUR th from Dec 2001 (Median) to Mar 2018, with 66 observations. The data reached an all-time high of 222.940 EUR th in Sep 2017 and a record low of 136.700 EUR th in Mar 2003. France New Houses Sold: Average Sales Price: Region: Center-Val de Loire data remains active status in CEIC and is reported by Ministry of Ecology, Sustainable Development and Energy. The data is categorized under Global Database’s France – Table FR.P004: New Houses: Sales Price.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
I have previously shared a classification based dataset to classify the gender which is liked by those who are new to machine learning as it give a pretty good accuracy, which encouraged me to create a regression dataset to predict continues values. I have tried many real world datasets for regression problems which are predicting with lower accuracy and high error rate. As a beginner, I have struggled and worried why and how the dataset performs poorly. This is another main reason why I created this dataset. Although this is a made up dataset, I have considered all the features when deciding the price of the property. If you are a beginner, you would love to try this as the results are stunning..
Since this is a populated data, I will straightaway explain the features and the label. FEATURES 1. land_size_sqm - This the total size of the land in square meters. 2. house_size_sqm - This is the area in which house is located within the land. This is measured in square meters. 3. no_of_rooms - This indicates the number of rooms available in the house. 4. no_of_bathrooms - This shows the number of total bathrooms made in the house. 5. large_living_room - This indicates whether the house includes a larger living room or not. The assumption is that all the houses contain a living room. This feature attempts to classify whether it's large or small where '1' means large and '0' means small. However in the categorical dataset, 1 and 0 are represented with 'yes' and 'No' respectively. 6. parking_space - This indicates whether there is a parking space or not. '1' represents the parking available while '0' represents no parking space available. However in the categorical dataset, 1 and 0 are represented with 'yes' and 'No' respectively. 7. front_garden - This shows whether there is a garden available in front of the house. '1' means the garden available and '0' means no garden available. However in the categorical dataset, 1 and 0 are represented with 'yes' and 'No' respectively. 8. swimming_pool - This shows the availability of the swimming pool at the house. 1 represents the availability of the swimming pool while 0 represents the non availability of the same. However in the categorical dataset, 1 and 0 are represented with 'yes' and 'No' respectively. 9. distance_to_school_km - This shows the distance from the house to the nearest school in Kilometers. 10. wall_fence - This shows whether there is a wall fence or not. '1' mean there is wall fence and '0' means no wall fence. However in the categorical dataset, 1 and 0 are represented with 'yes' and 'No' respectively. 11. **house_age_or_renovated **- This is either the age of the house in years or the period from the date of renovation. 12. water_front - this indicates whether the house is located in front of the water or not. 1 means waterfront and 0 means its not located near the water. However in the categorical dataset, 1 and 0 are represented with 'yes' and 'No' respectively. 13. distance_to_supermarket_km - what is the distance to the nearest supermarket in kilometers.
LABEL property_value - This is the price of the property
Following features are only available in the "house price dataset original v2 cleaned" and "house price dataset original v2 with categorical features" data only. 14. crime_rate - its in float and falls between 0 and 7. lesser the better 15. room_size - As the name suggests, it explains the size of the room. 0 is being 'small', 1 is being 'medium', 2 is 'large' and 3 is being 'Extra large'. However in the categorical dataset, these values are categorical and self explanatory.
I spent around 3 hours creating this dataset. Enjoy..
Share your notebooks to see which algorithm predicts the house price precisely.
Facebook
TwitterThis data set is a listing of all property sales by NORA through the following disposition channels. - Auction: Properties put up for auction and sold to the highest bidder. - Development: Properties offered to development partners at a discounted rate to support the development of affordable housing. - Lot Next Door: Properties sold to adjacent parcel owners, with discount opportunities for eligible participants. - Alternative Land Use: Properties sold for development of green space and community gardens. Note: this dataset contains duplicate addresses, which likely represent reversions or quitclaims that NORA sold again.