Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Summary of results obtained for Boston House Pricing.
https://www.kaggle.com/c/house-prices-advanced-regression-techniques
About this Dataset
Start here if...
You have some experience with R or Python and machine learning basics. This is a perfect competition for data science students who have completed an online course in machine learning and are looking to expand their skill set before trying a featured competition.
Competition Description
Ask a home buyer to describe their dream house, and they probably won't begin with the height of the basement ceiling or the proximity to an east-west railroad. But this playground competition's dataset proves that much more influences price negotiations than the number of bedrooms or a white-picket fence.
With 79 explanatory variables describing (almost) every aspect of residential homes in Ames, Iowa, this competition challenges you to predict the final price of each home.
Practice Skills
Creative feature engineering
Advanced regression techniques like random forest and gradient boosting
Acknowledgments
The Ames Housing dataset was compiled by Dean De Cock for use in data science education. It's an incredible alternative for data scientists looking for a modernized and expanded version of the often cited Boston Housing dataset.
ODC Public Domain Dedication and Licence (PDDL) v1.0 http://www.opendatacommons.org/licenses/pddl/1.0/
License information was derived automatically
Mayor Michelle Wu is committed to creating equal opportunities for businesses of all kinds in Boston. Through the business certification process, the City identifies businesses that are owned by women, minorities, veterans as well as those that are small or local. Once a business is certified with our office, they are included in any vendor outreach efforts for City contracting opportunities and are also connected to resources offered inside and outside of the City.
In order to provide access to more minority-owned and woman-owned businesses, small and small local businesses, and veteran and service disabled veteran-owned small businesses, the City of Boston Directory of certified businesses is now available on Analyze Boston.
If you think you might be eligible for certification, visit our website and apply today.
If you have questions about obtaining certification, please contact stacey.williams@boston.gov
Minority Business Enterprise (MBE) - means a business organization which is beneficially owned or substantially invested in by one or more minority group members as follows:
The firm has not been solely established for the purpose of taking advantage of a special program which has been developed to assist minority-owned businesses.
Woman Business Enterprise (WBE) - means a business organization which is beneficially owned or substantially invested in by one or more women meeting the following criteria:
The business must be at least 51% beneficially owned by a woman.
The woman owner must demonstrate that she has control over management.
The firm has not been solely established for the purpose of taking advantage of a special program which has been developed to assist woman-owned businesses.
Small Business Enterprise (SBE) - means a business with gross receipts that, when averaged over a three-year period, do not exceed the gross income limitations for that particular industry as defined by the Small Local Business Enterprise Office.
Small Local Business Enterprise (SLBE) - means a business which is a Small Business Enterprise, as defined above, and whose principal office is physically located in the City of Boston, as defined by the SLBE certification regulations.
A Veteran Owned Small Business (VOSB) or a Service Disabled Veteran Owned Small Business (SDVOSB) is a business that has already been verified as such by the U.S. Department of Veterans Affairs.
Businesses may qualify for more than one certification.
Businesses are required to renew their certification every three years.
These data are the trackline from the seafloor photograph and video survey conducted in September 2004 using the mini-SeaBOSS sampling system on the R/V Rafael in Boston Harbor and the harbor approaches, Massachusetts. They accompany approximately 170 km of sidescan sonar data that were collected by the National Oceanic and Atmospheric Administration (NOAA) Ship Whiting in 2000 and 2001 and reprocessed by the Massachusetts Office of Coastal Zone Management (CZM) and the U.S. Geological Survey (USGS).
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset tracks annual diversity score from 1991 to 2023 for Clarence R Edwards Middle School vs. Massachusetts and Boston School District
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset tracks annual two or more races student percentage from 2011 to 2023 for Clarence R Edwards Middle School vs. Massachusetts and Boston School District
Attribution 3.0 (CC BY 3.0) https://creativecommons.org/licenses/by/3.0/
License information was derived automatically
This dataset is about: Meteorological observations during ACTIVE cruise from Boston to Plymouth started at 1774-05-05. Please consult parent dataset @ https://doi.org/10.1594/PANGAEA.611088 for more information.
This dataverse repository contains two datasets:
1. A one square meter resolution map of biomass for the City of Boston. Units are Mg biomass per hectare (Mg/ha).
2. A one square meter resolution map of canopy cover for the City of Boston. Units are binary: 0 = no canopy, 1 = canopy.
Both datasets are derived from LiDAR and high resolution remote sensing imagery. Details of the methodology are provided in the following publications:
Raciti, SM, Hutyra, LR, Newell, JD, 2014. Mapping carbon storage in urban trees with multi-source remote sensing data: Relationships between biomass, land use, and demographics in Boston neighborhoods, Science of the Total Environment, 500-501, 72-83. http://dx.doi.org/10.1016/j.scitotenv.2014.08.070
Raciti, SM, Hutyra, LR, Newell, JD, 2015. Corrigendum to “Mapping carbon storage in urban trees with multi-source remote sensing data: Relationships between biomass, land use, and demographics in Boston neighborhoods”, Science of the Total Environment, 538, 1039-1041. http://dx.doi.org/10.1016/j.scitotenv.2015.07.154
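Because biomass is reported per hectare while each map cell covers one square meter, turning the map into totals needs a small unit conversion: one cell is 1/10,000 of a hectare. The sketch below illustrates that arithmetic on plain Python sequences standing in for raster cells. The function names and any values passed to them are hypothetical, reading the actual raster files is left out because their format is not described here, and cells without data are assumed to have been excluded beforehand.

```python
from typing import Iterable

M2_PER_HECTARE = 10_000  # 1 ha = 10,000 m^2
CELL_AREA_M2 = 1.0       # stated resolution: one square meter per cell


def total_biomass_mg(density_mg_per_ha: Iterable[float]) -> float:
    """Sum per-cell biomass density (Mg/ha) into total biomass (Mg)."""
    cell_area_ha = CELL_AREA_M2 / M2_PER_HECTARE
    return sum(d * cell_area_ha for d in density_mg_per_ha)


def canopy_fraction(canopy_cells: Iterable[int]) -> float:
    """Fraction of cells flagged as canopy (1) rather than no canopy (0)."""
    cells = list(canopy_cells)
    if not cells:
        raise ValueError("no cells given")
    return sum(cells) / len(cells)
```

For example, four cells with densities of 100, 200, 0, and 100 Mg/ha would hold about 0.04 Mg in total, and a canopy mask of 1, 1, 0, 1 corresponds to 75% canopy cover.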
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset tracks annual black student percentage from 1991 to 2023 for Clarence R Edwards Middle School vs. Massachusetts and Boston School District
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Seed Bank Dynamics in Boston Urban Community Gardens: Data and Code Repository
Description:
This dataset and accompanying R code support the manuscript titled “Seed Bank Dynamics in Boston Urban Community Gardens: Multiscalar Drivers of Abundance, Diversity, and Ecological Homogenization” (submitted to People and Nature). The study investigates the composition, abundance, and diversity of soil seed banks in 32 community gardens across Boston, Massachusetts, USA, using a multiscalar modeling framework.
The dataset includes cleaned germinant emergence data from greenhouse trials, metadata on plot- and garden-level management practices, and spatial context variables derived from GIS. Code is provided for data wrangling, statistical modeling using mixed-effects models, and figure generation. The modeling framework includes multiscale analyses of germinant abundance and diversity, as well as species composition across plots, gardens, and neighborhoods.
Contents:
data/: Processed seed bank and management data
scripts/: R scripts for analysis and figure creation
README.md: Instructions for file structure and reproduction of analyses
This is a multivariate data set recorded from a patient in the sleep laboratory of the Beth Israel Hospital (now the Beth Israel Deaconess Medical Center) in Boston, Massachusetts. The data set was extracted from record slp60 of the MIT-BIH Polysomnographic Database and was submitted to the Santa Fe Time Series Competition in 1991 by our group. The data are presented in text form and have been split into two sequential parts, b1.txt and b2.txt. Each line contains simultaneous samples of three parameters: the first column is the heart rate, the second is the chest volume (respiration force), and the third is the blood oxygen concentration (measured by ear oximetry). Each measurement is sampled at 2 Hz, so successive lines are 0.5 seconds apart.
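As a minimal sketch of reading this format, the Python below parses lines of three numbers into (heart rate, chest volume, blood oxygen) rows and gives each row a time stamp 0.5 seconds after the previous one. The names `Sample`, `parse_samples`, and `parse_parts` are illustrative, and two details are assumptions rather than facts from the description: that columns are separated by whitespace, and that time starts at zero on the first row of the first part.

```python
from typing import List, NamedTuple

SAMPLE_INTERVAL_S = 0.5  # stated in the description: 2 Hz sampling


class Sample(NamedTuple):
    time_s: float        # seconds since the first row (assumed origin)
    heart_rate: float    # column 1
    chest_volume: float  # column 2 (respiration force)
    blood_oxygen: float  # column 3 (ear oximetry)


def parse_samples(text: str, start_index: int = 0) -> List[Sample]:
    """Parse rows of three whitespace-separated numbers (assumed layout)."""
    samples = []
    index = start_index
    for line in text.splitlines():
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        if len(fields) != 3:
            raise ValueError(f"expected 3 columns, got {len(fields)}: {line!r}")
        hr, chest, oxygen = (float(f) for f in fields)
        samples.append(Sample(index * SAMPLE_INTERVAL_S, hr, chest, oxygen))
        index += 1
    return samples


def parse_parts(first: str, second: str) -> List[Sample]:
    """Join the two sequential parts on one continuous time axis."""
    head = parse_samples(first)
    return head + parse_samples(second, start_index=len(head))
```

In practice the two arguments to `parse_parts` would be the contents of b1.txt and b2.txt, read with something like `open("b1.txt").read()`. Continuing the row index across the parts keeps the time axis unbroken, which matters for any analysis that spans the split.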
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset tracks annual white student percentage from 1991 to 2023 for Clarence R Edwards Middle School vs. Massachusetts and Boston School District
https://github.com/MIT-LCP/license-and-dua/tree/master/drafts
Retrospectively collected medical data has the opportunity to improve patient care through knowledge discovery and algorithm development. Broad reuse of medical data is desirable for the greatest public good, but data sharing must be done in a manner which protects patient privacy. Here we present Medical Information Mart for Intensive Care (MIMIC)-IV, a large deidentified dataset of patients admitted to the emergency department or an intensive care unit at the Beth Israel Deaconess Medical Center in Boston, MA. MIMIC-IV contains data for over 65,000 patients admitted to an ICU and over 200,000 patients admitted to the emergency department. MIMIC-IV incorporates contemporary data and adopts a modular approach to data organization, highlighting data provenance and facilitating both individual and combined use of disparate data sources. MIMIC-IV is intended to carry on the success of MIMIC-III and support a broad set of applications within healthcare.
https://www.icpsr.umich.edu/web/ICPSR/studies/9400/terms
These data measure the effects of blood alcohol content coupled with officer reports at the time of arrest on driving while intoxicated (DWI) case outcomes (jury verdicts and guilty pleas). Court records and relevant police reports for drunk-driving cases drawn from the greater metropolitan areas of Boston, Denver, and Los Angeles were compiled to produce this data collection. Cases were selected to include roughly equal proportions of guilty pleas, guilty verdicts, and not-guilty verdicts. DWI cases were compared on the quality and quantity of evidence concerning the suspect's behavior, with the evidence coming from any mention of 20 standard visual detection cues prior to the stop, 13 attributes of general appearance and behavior immediately after the stop, and the results of as many as 7 field sobriety tests. Questions concerned driving-under-the-influence cues (scoring sheet), observed traffic violations and actual traffic accidents, the verdict, DWI history, whether the stop resulted from an accident, whether the attorney was public or private, and sanctions that followed the verdict. Also included were demographic questions on age, sex, and ethnicity.
Attribution-NonCommercial-ShareAlike 3.0 (CC BY-NC-SA 3.0) https://creativecommons.org/licenses/by-nc-sa/3.0/
License information was derived automatically
Abstract: This dataset includes processed MCS
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset tracks annual Asian student percentage from 1991 to 2023 for Clarence R Edwards Middle School vs. Massachusetts and Boston School District
Chirp seismic-reflection data and associated navigation files were collected from lacustrine and fjord basins in southcentral Alaska following the 2018 Anchorage earthquake. These data were collected from a 25-foot Boston Whaler (R/V Moose Dancer), an 18-foot cataraft (R/V Enterprise), and the R/V Alaskan Gyre in the summers of 2020 and 2021 for use in regional hazard assessments relating to Alaska’s seismic hazards.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
The Isolated urban sound database contains the audio samples used to design urban sound mixtures using SimScene software.
This database has already been used to design the urban sound mixtures found in the "Estimation of the road traffic sound levels based on Non-Negative Matrix Factorization" dataset [1] and in the "Realistic urban sound mixture" dataset [2].
The dataset contains two folders:
- 'event', which includes 231 brief sound samples considered salient, each 1 to 20 seconds long and classified into 21 sound classes (ringing bell, whistling bird, car horn, passing car, hammer, barking dog, siren, footstep, metallic noise, voice...)
- 'background', which includes 162 long-duration sounds (about 1 min 30 s) whose acoustic properties do not vary over time. This category includes, among others, whistling bird, crowd noise, rain, children playing in a schoolyard, and constant traffic noise.
More details on this sound database can be found in [3]
[1] J.-R. Gloaguen, M. Lagrange, A. Can, J.-F. Petiot, Estimation of the road traffic sound levels in urban areas based on non-negative matrix factorization techniques, submitted for publication
[2] J.-R. Gloaguen, A. Can, M. Lagrange, J.-F. Petiot, Road traffic sound level estimation from realistic urban sound mixtures by Non-negative Matrix Factorization, submitted for publication
[3] J.-R. Gloaguen, A. Can, M. Lagrange, J.-F. Petiot, Creation of a corpus of realistic urban sound scenes with controlled acoustic properties, in: Acoustics ’17 Boston, Vol. 141 of The Journal of the Acoustical Society of America, Acoustical Society of America and the European Acoustics Association, Boston, United States, 2017, pp. 4044–4044.
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset tracks annual Hispanic student percentage from 1991 to 2023 for Clarence R Edwards Middle School vs. Massachusetts and Boston School District
Attribution 4.0 (CC BY 4.0) https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
This dataset tracks annual free lunch eligibility from 2000 to 2023 for Clarence R Edwards Middle School vs. Massachusetts and Boston School District