https://brightdata.com/licensehttps://brightdata.com/license
Real estate datasets from various websites cover all major real estate data points including: property type, size, location, price, bedrooms, baths, address, history, images, and much more. Popular use cases include: forecast housing demand, analyze price fluctuations, improve customer satisfaction, see past prices to monitor market trends, and more.
The Office of Policy and Management maintains a listing of all real estate sales with a sales price of $2,000 or greater that occur between October 1 and September 30 of each year. For each sale record, the file includes: town, property address, date of sale, property type (residential, apartment, commercial, industrial or vacant land), sales price, and property assessment. Data are collected in accordance with Connecticut General Statutes, section 10-261a and 10-261b: https://www.cga.ct.gov/current/pub/chap_172.htm#sec_10-261a and https://www.cga.ct.gov/current/pub/chap_172.htm#sec_10-261b. Annual real estate sales are reported by grand list year (October 1 through September 30 each year). For instance, sales from 2018 GL are from 10/01/2018 through 9/30/2019. Some municipalities may not report data for certain years because when a municipality implements a revaluation, they are not required to submit sales data for the twelve months following implementation.
With Zillow Real Estate Data at its core, it excels in real-time Data Extraction, delivering up-to-the-minute and comprehensive Real Estate Market Data. APISCRAPY's Data Extraction services are your key to staying informed in today's fast-paced real estate landscape.
Attribution-NonCommercial 4.0 (CC BY-NC 4.0)https://creativecommons.org/licenses/by-nc/4.0/
License information was derived automatically
High-quality, free real estate dataset from all around the United States, in CSV format. Over 10.000 records relevant to Real Estate investors, agents, and data scientists. We are working on complete datasets from a wide variety of countries. Don't hesitate to contact us for more information.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Overview: This dataset was collected and curated to support research on predicting real estate prices using machine learning algorithms, specifically Support Vector Regression (SVR) and Gradient Boosting Machine (GBM). The dataset includes comprehensive information on residential properties, enabling the development and evaluation of predictive models for accurate and transparent real estate appraisals.Data Source: The data was sourced from Department of Lands and Survey real estate listings.Features: The dataset contains the following key attributes for each property:Area (in square meters): The total living area of the property.Floor Number: The floor on which the property is located.Location: Geographic coordinates or city/region where the property is situated.Type of Apartment: The classification of the property, such as studio, one-bedroom, two-bedroom, etc.Number of Bathrooms: The total number of bathrooms in the property.Number of Bedrooms: The total number of bedrooms in the property.Property Age (in years): The number of years since the property was constructed.Property Condition: A categorical variable indicating the condition of the property (e.g., new, good, fair, needs renovation).Proximity to Amenities: The distance to nearby amenities such as schools, hospitals, shopping centers, and public transportation.Market Price (target variable): The actual sale price or listed price of the property.Data Preprocessing:Normalization: Numeric features such as area and proximity to amenities were normalized to ensure consistency and improve model performance.Categorical Encoding: Categorical features like property condition and type of apartment were encoded using one-hot encoding or label encoding, depending on the specific model requirements.Missing Values: Missing data points were handled using appropriate imputation techniques or by excluding records with significant missing information.Usage: This dataset was utilized to train and test machine learning models, aiming to predict the market price of residential properties based on the provided attributes. The models developed using this dataset demonstrated improved accuracy and transparency over traditional appraisal methods.Dataset Availability: The dataset is available for public use under the [CC BY 4.0]. Users are encouraged to cite the related publication when using the data in their research or applications.Citation: If you use this dataset in your research, please cite the following publication:[Real Estate Decision-Making: Precision in Price Prediction through Advanced Machine Learning Algorithms].
Open Data Commons Attribution License (ODC-By) v1.0https://www.opendatacommons.org/licenses/by/1.0/
License information was derived automatically
Dataset Overview
This dataset provides a detailed snapshot of real estate properties listed in Dubai, UAE, as of August 2024. The dataset includes over 5,000 listings scraped using the Apify API from Propertyfinder and various other real estate websites in the UAE. The data includes key details such as the number of bedrooms and bathrooms, price, location, size, and whether the listing is verified. All personal identifiers, such as agent names and contact details, have been ethically removed.
Data Science Applications
Given the size and structure of this dataset, it is ideal for the following data science applications:
This dataset provides a practical foundation for both beginners and experts in data science, allowing for the exploration of real estate trends, development of predictive models, and implementation of machine learning algorithms.
# Column Descriptors
# Ethically Mined Data
This dataset was ethically scraped using the Apify API, ensuring compliance with data privacy standards. All personal data such as agent names, phone numbers, and any other sensitive information have been omitted from this dataset to ensure privacy and ethical use. The data is intended solely for educational purposes and should not be used for commercial activities.
# Acknowledgements
This dataset was made possible thanks to the following:
-**Photo by** : Francesca Tosolini on Unsplash
Use the Data Responsibly
Please ensure that this dataset is used responsibly, with respect to privacy and data ethics. This data is provided for educational purposes.
Open Data Commons Attribution License (ODC-By) v1.0https://www.opendatacommons.org/licenses/by/1.0/
License information was derived automatically
This comprehensive real estate dataset contains over 5,000 property listings from South Carolina, collected in 2025 from Realtor.com using apify api. The dataset captures diverse property types including single-family homes, condominiums, land parcels, townhomes, and other residential properties. This dataset provides a rich snapshot of South Carolina's real estate market suitable for predictive modeling, market analysis, and investment research.
This dataset was ethically scraped from publicly available listings on Realtor.com and is provided strictly for educational and learning purposes only. The data collection complied with ethical web scraping practices and contains only publicly accessible information. Users should utilize this dataset exclusively for academic research, educational projects, and learning data science techniques. Any commercial use is strictly prohibited.
Extract detailed property data points — address, URL, prices, floor space, overview, parking, agents, and more — from any real estate listings. The Rankings data contains the ranking of properties as they come in the SERPs of different property listing sites. Furthermore, with our real estate agents' data, you can directly get in touch with the real estate agents/brokers via email or phone numbers.
A. Usecase/Applications possible with the data:
Property pricing - accurate property data for real estate valuation. Gather information about properties and their valuations from Federal, State, or County level websites. Monitor the real estate market across the country and decide the best time to buy or sell based on data
Secure your real estate investment - Monitor foreclosures and auctions to identify investment opportunities. Identify areas within special economic and opportunity zones such as QOZs - cross-map that with commercial or residential listings to identify leads. Ensure the safety of your investments, property, and personnel by analyzing crime data prior to investing.
Identify hot, emerging markets - Gather data about rent, demographic, and population data to expand retail and e-commerce businesses. Helps you drive better investment decisions.
Profile a building’s retrofit history - a building permit is required before the start of any construction activity of a building, such as changing the building structure, remodeling, or installing new equipment. Moreover, many large cities provide public datasets of building permits in history. Use building permits to profile a city’s building retrofit history.
Study market changes - New construction data helps measure and evaluate the size, composition, and changes occurring within the housing and construction sectors.
Finding leads - Property records can reveal a wealth of information, such as how long an owner has currently lived in a home. US Census Bureau data and City-Data.com provide profiles of towns and city neighborhoods as well as demographic statistics. This data is available for free and can help agents increase their expertise in their communities and get a feel for the local market.
Searching for Targeted Leads - Focusing on small, niche areas of the real estate market can sometimes be the most efficient method of finding leads. For example, targeting high-end home sellers may take longer to develop a lead, but the payoff could be greater. Or, you may have a special interest or background in a certain type of home that would improve your chances of connecting with potential sellers. In these cases, focused data searches may help you find the best leads and develop relationships with future sellers.
How does it work?
This table contains property sales information including sale date, price, and amounts for properties within Fairfax County. There is a one to many relationship to the parcel data. Refer to this document for descriptions of the data in the table.
https://fred.stlouisfed.org/legal/#copyright-citation-requiredhttps://fred.stlouisfed.org/legal/#copyright-citation-required
Graph and download economic data for Housing Inventory: Median Days on Market in the United States (MEDDAYONMARUS) from Jul 2016 to Jun 2025 about median and USA.
Nothing is more personal than home. In order to form a meaningful connection with a relevant audience, real estate and home services brands turn to data to fuel a wide variety of strategies.
TRAK's US Real Estate dataset includes enough rich home and real estate focused variables to power highly customizable analytics and direct marketing strategies. Our data is deep and wide, covering everything from financing information to the number of rooms in a home.
There are also the table stakes variables useful for a variety of industries like new movers, homeowners vs. renters, and in-market for a home purchase (premovers).
We work closely with marketers and data teams to recommend an ideal volume and depth of attributes to empower them to crush their goals. Whether it's limiting the geographic area to your market territories, or removing variables that won't have an impact on your business, we right size the data for your organization's needs. At a high level, key categories in our data set includes:
✔ Home Financing Details ✔ Home Ownership vs Renters ✔ In-Market for a Home ✔ Property Type ✔ Home Attributes ✔ Real Estate Investing ✔ New Mover
CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically
This dataset contains data on all Real Property parcels that have sold since 2013 in Allegheny County, PA.
Before doing any market analysis on property sales, check the sales validation codes. Many property "sales" are not considered a valid representation of the true market value of the property. For example, when multiple lots are together on one deed with one price they are generally coded as invalid ("H") because the sale price for each parcel ID number indicates the total price paid for a group of parcels, not just for one parcel. See the Sales Validation Codes Dictionary for a complete explanation of valid and invalid sale codes.
Sales Transactions Disclaimer: Sales information is provided from the Allegheny County Department of Administrative Services, Real Estate Division. Content and validation codes are subject to change. Please review the Data Dictionary for details on included fields before each use. Property owners are not required by law to record a deed at the time of sale. Consequently the assessment system may not contain a complete sales history for every property and every sale. You may do a deed search at http://www.alleghenycounty.us/re/index.aspx directly for the most updated information. Note: Ordinance 3478-07 prohibits public access to search assessment records by owner name. It was signed by the Chief Executive in 2007.
https://cubig.ai/store/terms-of-servicehttps://cubig.ai/store/terms-of-service
1) Data Introduction • The Real Estate DataSet consists of 506 examples, including home prices in the Boston suburbs and various residential and environmental characteristics.
2) Data Utilization (1) Real Estate DataSet has characteristics that: • The dataset provides 13 continuous variables and one binary variable, including crime rate, house size, environmental pollution, accessibility, tax rate, and population characteristics. (2) Real Estate DataSet can be used to: • House Price Forecast: It can be used to develop a regression model that predicts the median price (MEDV) of a house based on various residential and environmental factors. • Analysis of Urban Planning and Policy: It can be used for urban development and policy making by analyzing the impact of residential environmental factors such as crime rates, environmental pollution, and educational environment on housing values.
ImmobilienScout24 is the largest real estate internet platform in Germany. Properties for private as well as commercial use are offered on the website. The dataset covers most characteristics collected on the platform like price, size and characteristics of the housing unit but also automatically generated items like the duration of the advertisement spell.
https://crawlfeeds.com/privacy_policyhttps://crawlfeeds.com/privacy_policy
This dataset contains over 1.1 million property listings extracted from Trulia, one of the largest U.S. real estate marketplaces. Compiled and structured by the CrawlFeeds team, this dataset includes residential property data across the United States — making it a valuable resource for real estate analytics, machine learning, and location-based modeling.
Full listing info: title, description, URL
Detailed location data: city, ZIP code, latitude, longitude
Property specs: bedrooms, bathrooms, floor space, features
Pricing details: current price, currency, status
Metadata: timestamps, image URLs, and breadcrumbs
Format: Clean CSV, ready for modeling and analysis
Housing price prediction models
Real estate investment analysis
Location clustering & zip code segmentation
Building property recommendation engines
Mapping visualizations & geospatial applications
Last crawled: September 2, 2021
Data format: CSV (1.4M+ records)
Create a custom request through CrawlFeeds if you need to re-extract updated listings from Trulia or slice by region, price range, or timestamp.
The Oklahoma Real Property Asset Report is published annually in compliance with the Oklahoma State Government Asset Reduction and Cost Savings Program found in Title 62 O.S. §908. The act requires the Office of Management and Enterprise Services (OMES) to compile and maintain a comprehensive inventory of all real property owned and leased by the state. All data contained in this report was self-reported by each state agency, board, commission, or public trust having the State of Oklahoma as a beneficiary.
This dataset has been published by the Office of the Real Estate Assessor of the City of Virginia Beach and data.virginiabeach.gov. The mission of data.virginiabeach.gov is to provide timely and accurate City information to increase government transparency and access to useful and well organized data by the general public, non-governmental organizations, and City of Virginia Beach employees.
SafeGraph Places provides baseline location information for every record in the SafeGraph product suite via the Places schema and polygon information when applicable via the Geometry schema. The current scope of a place is defined as any location humans can visit with the exception of single-family homes. This definition encompasses a diverse set of places ranging from restaurants, grocery stores, and malls; to parks, hospitals, museums, offices, and industrial parks. Premium sets of Places include apartment buildings, Parking Lots, and Point POIs (such as ATMs or transit stations).
SafeGraph Places is a point of interest (POI) data offering with varying coverage and properties depending on the country. Note that address conventions and formatting vary across countries. SafeGraph has coalesced these fields into the Places schema.
SafeGraph provides clean and accurate geospatial datasets on 51M+ physical places/points of interest (POI) globally. Hundreds of industry leaders like Mapbox, Verizon, Clear Channel, and Esri already rely on SafeGraph POI data to unlock business insights and drive innovation.
Note:- Only publicly available real estate data can be worked upon.
Discover the world of property insights with APISCRAPY's user-friendly services – Realtor Property Data, Realtor Data, and Realtor API. Designed for ease of use, our platform allows anyone, from real estate professionals to researchers and businesses, to effortlessly access publicly available property listings and Property owner Data.
Our Realtor Property Data service provides comprehensive details on property listings, while Realtor API ensures easy integration for streamlined access. Additionally, we offer Zillow Property Data, enriching your property insights with information from one of the leading property platforms.
Key Features:
Realtor Property Data: Dive into detailed property listings effortlessly with our user-friendly platform.
Realtor API Integration: Seamlessly integrate our Realtor API into your systems for easy access to property data.
Zillow Property Data: Enrich your property insights with data from Zillow, one of the leading property platforms.
Publicly Available Property Listings: APISCRAPY ensures access to publicly available property listings, making property data easily accessible.
Easy Integration: Our platform is designed for simplicity, allowing for easy integration into your existing systems.
Whether you're a real estate professional, researcher, or business looking for straightforward access to property information, APISCRAPY's services cater to your needs. Choose us for simple and efficient property data services, where ease of use and accessibility come together for your convenience.
https://fred.stlouisfed.org/legal/#copyright-public-domainhttps://fred.stlouisfed.org/legal/#copyright-public-domain
Graph and download economic data for Median Sales Price of Houses Sold for the United States (MSPUS) from Q1 1963 to Q2 2025 about sales, median, housing, and USA.
https://brightdata.com/licensehttps://brightdata.com/license
Real estate datasets from various websites cover all major real estate data points including: property type, size, location, price, bedrooms, baths, address, history, images, and much more. Popular use cases include: forecast housing demand, analyze price fluctuations, improve customer satisfaction, see past prices to monitor market trends, and more.