This dataset contains information collected by the U.S Census Service concerning housing in the area of Boston Mass. It was obtained from the StatLib archive (http://lib.stat.cmu.edu/datasets/boston), and has been used extensively throughout the literature to benchmark algorithms. However, these comparisons were primarily done outside of Delve and are thus somewhat suspect. The dataset is small in size with only 506 cases.
The Boston Housing Dataset is a derived from information collected by the U.S. Census Service concerning housing in the area of Boston MA. The following describes the dataset columns: 1. CRIM- per capita crime rate by town 2. ZN- proportion of residential land zoned for lots over 25,000 sq.ft. 3. INDUS- proportion of non-retail business acres per town. 4. CHAS- Charles River dummy variable (1 if tract bounds river; 0 otherwise) 5. NOX- nitric oxides concentration (parts per 10 million) 6. RM- average number of rooms per dwelling 7. AGE- proportion of owner-occupied units built prior to 1940 8. DIS- weighted distances to five Boston employment centres 9. RAD- index of accessibility to radial highways 10. TAX- full-value property-tax rate per $10,000 11. PTRATIO- pupil-teacher ratio by town 12. B- 1000(Bk - 0.63)^2 where Bk is the proportion of blacks by town 13. LSTAT- % lower status of the population 14. MED- Median value of owner-occupied homes in **1000's
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘Boston housing dataset’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/altavish/boston-housing-dataset on 30 September 2021.
--- Dataset description provided by original source is as follows ---
Domain: Real Estate
Difficulty: Easy to Medium
Challenges:
1. Missing value treatment
2. Outlier treatment
3. Understanding which variables drive the price of homes in Boston
Summary: The Boston housing dataset contains 506 observations and 14 variables. The dataset contains missing values.
--- Original source retains full ownership of the source dataset ---
This dataset was created by Jetendra Mulinti
https://fred.stlouisfed.org/legal/#copyright-pre-approvalhttps://fred.stlouisfed.org/legal/#copyright-pre-approval
Graph and download economic data for S&P CoreLogic Case-Shiller MA-Boston Home Price Index (BOXRNSA) from Jan 1987 to Apr 2025 about Boston, NH, MA, HPI, housing, price index, indexes, price, and USA.
https://fred.stlouisfed.org/legal/#copyright-public-domainhttps://fred.stlouisfed.org/legal/#copyright-public-domain
Graph and download economic data for All-Transactions House Price Index for Boston, MA (MSAD) (ATNHPIUS14454Q) from Q3 1977 to Q1 2025 about Boston, MA, appraisers, HPI, housing, price index, indexes, price, and USA.
The S&P Case Shiller Boston Home Price Index has risen steadily since February 2020. The index measures changes in the prices of existing single-family homes. The index value was equal to 100 as of January 2000, so if the index value is equal to 130 in a given month, for example, it means that the house prices have increased by 30 percent since 2000. The value of the S&P Case Shiller Boston Home Price Index amounted to nearly 335.36 in August 2024. That was above the national average.
https://choosealicense.com/licenses/unknown/https://choosealicense.com/licenses/unknown/
Dataset Card for 'Ames Iowa: Alternative to the Boston Housing Data Set'
This dataset contains information from the Ames Assessor’s Office about residential properties sold in Ames, IA from 2006 to 2010. This repository is a mirror the original dataset meant to facilitate its consumption. The dataset was originally published by Dean De Cock in Ames, Iowa: Alternative to the Boston Housing Data as an End of Semester Regression Project, it is meant as a resource for teaching machine… See the full description on the dataset page: https://huggingface.co/datasets/cloderic/ames_iowa_housing.
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘Boston Housing’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/schirmerchad/bostonhoustingmlnd on 28 January 2022.
--- Dataset description provided by original source is as follows ---
The dataset for this project originates from the UCI Machine Learning Repository. The Boston housing data was collected in 1978 and each of the 506 entries represent aggregated data about 14 features for homes from various suburbs in Boston, Massachusetts.
https://github.com/udacity/machine-learning
https://archive.ics.uci.edu/ml/datasets/Housing
--- Original source retains full ownership of the source dataset ---
ODC Public Domain Dedication and Licence (PDDL) v1.0http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
This data, maintained by the Mayor’s Office of Housing (MOH), is an inventory of all income-restricted units in the city. This data includes public housing owned by the Boston Housing Authority (BHA), privately- owned housing built with funding from DND and/or on land that was formerly City-owned, and privately-owned housing built without any City subsidy, e.g., created using Low-Income Housing Tax Credits (LIHTC) or as part of the Inclusionary Development Policy (IDP). Information is gathered from a variety of sources, including the City's IDP list, permitting and completion data from the Inspectional Services Department (ISD), newspaper advertisements for affordable units, Community Economic Development Assistance Corporation’s (CEDAC) Expiring Use list, and project lists from the BHA, the Massachusetts Department of Housing and Community Development (DHCD), MassHousing, and the U.S. Department of Housing and Urban Development (HUD), among others. The data is meant to be as exhaustive and up-to-date as possible, but since many units are not required to report data to the City of Boston, MOH is constantly working to verify and update it. See the data dictionary for more information on the structure of the data and important notes.
The database only includes units that have a deed-restriction. It does not include tenant-based (also known as mobile) vouchers, which subsidize rent, but move with the tenant and are not attached to a particular unit. There are over 22,000 tenant-based vouchers in the city of Boston which provide additional affordability to low- and moderate-income households not accounted for here.
The Income-Restricted Housing report can be directly accessed here:
https://www.boston.gov/sites/default/files/file/2023/04/Income%20Restricted%20Housing%202022_0.pdf
Learn more about income-restricted housing (as well as other types of affordable housing) here: https://www.boston.gov/affordable-housing-boston#income-restricted
This dataset was created by Nikhil Pathrikar
https://fred.stlouisfed.org/legal/#copyright-citation-requiredhttps://fred.stlouisfed.org/legal/#copyright-citation-required
Graph and download economic data for Housing Inventory: Active Listing Count Year-Over-Year in Boston-Cambridge-Newton, MA-NH (CBSA) (ACTLISCOUYY14460) from Jul 2017 to May 2025 about Boston, NH, MA, active listing, listing, and USA.
This dataset was created by Benjamin Nnabo
This dataset was created by Iaroslav Shcherbatyi
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Analysis of ‘Boston House Prices-Advanced Regression Techniques’ provided by Analyst-2 (analyst-2.ai), based on source dataset retrieved from https://www.kaggle.com/fedesoriano/the-boston-houseprice-data on 13 February 2022.
--- Dataset description provided by original source is as follows ---
The Boston house-price data of Harrison, D. and Rubinfeld, D.L. 'Hedonic prices and the demand for clean air', J. Environ. Economics & Management, vol.5, 81-102, 1978.
Input features in order: 1) CRIM: per capita crime rate by town 2) ZN: proportion of residential land zoned for lots over 25,000 sq.ft. 3) INDUS: proportion of non-retail business acres per town 4) CHAS: Charles River dummy variable (1 if tract bounds river; 0 otherwise) 5) NOX: nitric oxides concentration (parts per 10 million) [parts/10M] 6) RM: average number of rooms per dwelling 7) AGE: proportion of owner-occupied units built prior to 1940 8) DIS: weighted distances to five Boston employment centres 9) RAD: index of accessibility to radial highways 10) TAX: full-value property-tax rate per $10,000 [$/10k] 11) PTRATIO: pupil-teacher ratio by town 12) B: The result of the equation B=1000(Bk - 0.63)^2 where Bk is the proportion of blacks by town 13) LSTAT: % lower status of the population
Output variable: 1) MEDV: Median value of owner-occupied homes in $1000's [k$]
StatLib - Carnegie Mellon University
Harrison, David & Rubinfeld, Daniel. (1978). Hedonic housing prices and the demand for clean air. Journal of Environmental Economics and Management. 5. 81-102. 10.1016/0095-0696(78)90006-2. LINK
Belsley, David A. & Kuh, Edwin. & Welsch, Roy E. (1980). Regression diagnostics: identifying influential data and sources of collinearity. New York: Wiley LINK
--- Original source retains full ownership of the source dataset ---
https://fred.stlouisfed.org/legal/#copyright-citation-requiredhttps://fred.stlouisfed.org/legal/#copyright-citation-required
Graph and download economic data for Housing Inventory: Median Days on Market Year-Over-Year in Boston-Cambridge-Newton, MA-NH (CBSA) (MEDDAYONMARYY14460) from Jul 2017 to May 2025 about Boston, NH, MA, median, and USA.
This dataset was created by BUTTA GLORY
Financial overview and grant giving statistics of Metropolitan Boston Housing Partnership Inc.
This dataset was created by Nilay Chauhan
This dataset was created by Michaelwang2002
This dataset contains information collected by the U.S Census Service concerning housing in the area of Boston Mass. It was obtained from the StatLib archive (http://lib.stat.cmu.edu/datasets/boston), and has been used extensively throughout the literature to benchmark algorithms. However, these comparisons were primarily done outside of Delve and are thus somewhat suspect. The dataset is small in size with only 506 cases.