96 datasets found

d
An example data set for exploration of Multiple Linear Regression
catalog.data.gov
data.usgs.gov
Updated Jul 6, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Geological Survey (2024). An example data set for exploration of Multiple Linear Regression [Dataset]. https://catalog.data.gov/dataset/an-example-data-set-for-exploration-of-multiple-linear-regression
Explore at:
Dataset updated
Jul 6, 2024
Dataset provided by
United States Geological Surveyhttp://www.usgs.gov/
Description
This data set contains example data for exploration of the theory of regression based regionalization. The 90th percentile of annual maximum streamflow is provided as an example response variable for 293 streamgages in the conterminous United States. Several explanatory variables are drawn from the GAGES-II data base in order to demonstrate how multiple linear regression is applied. Example scripts demonstrate how to collect the original streamflow data provided and how to recreate the figures from the associated Techniques and Methods chapter.
Stonybrook_AMS578_Multiple_Regression_Dataset
kaggle.com
Updated Dec 20, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Joseph Chan (2020). Stonybrook_AMS578_Multiple_Regression_Dataset [Dataset]. https://www.kaggle.com/josephchan524/stonybrook-ams578-multiple-regression-dataset/code
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Dec 20, 2020
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Joseph Chan
Description
Context

This is a dataset is a Multiple Regression Project from an Applied Math Science Graduate Level Course at Stony Brook (AMS578 Spring 2020).

The class blackboard has a pdf file of a paper by Caspi et al. that reports a finding of a gene-environment interaction. This paper used multiple regression techniques as the methodology for its findings. You should read it for background, as it is the genesis of the models that you will be given. The data that you are analyzing is synthetic. That is, the TA used a model to generate the data. Your task is to find the model that the TA used for your data. For example, one possible model is

The class blackboard also contains a paper by Risch et al. that uses a larger collection of data to assess the findings in Caspi et al. These researchers confirmed that Caspi et al. calculated their results correctly but that no other dataset had the relation reported in Caspi et al. That is, Caspi et al. seem to have reported a false positive (Type I error). The class blackboard contains a recent paper about the genetics of mental illness and a technical appendix giving the specifics. Together these papers are an example of the response of the research community to studying the genetics of mental illness, which is a notoriously difficult research area.

Content

One file contains the patient identifier and the dependent variable value. The second file contains the patient identifier and values of six environment variables called E1 to E6. The third file contains the patient identifier and the twenty independent indicator variables called G1 to G20. The records may not be in correct order in each file, and cases may be missing in one or more of the files. You can process the data with VMLOOKUP or other data merging software.
f
Data from: Multiple linear regression model to evaluate the market value of...
scielo.figshare.com
search.datacite.org
jpeg
Updated May 30, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
David Brandão Nunes; José de Paula Barros Neto; Silvia Maria de Freitas (2023). Multiple linear regression model to evaluate the market value of residential apartments in Fortaleza, CE [Dataset]. http://doi.org/10.6084/m9.figshare.7368278.v1
Explore at:
jpegAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.7368278.v1
Dataset updated
May 30, 2023
Dataset provided by
SciELO journals
Authors
David Brandão Nunes; José de Paula Barros Neto; Silvia Maria de Freitas
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Area covered
Fortaleza
Description
Abstract The valuation of real estate, which assists in the definition of market value, is an important science with a wide field of action, which includes the collection of taxes, commercial transactions, insurance and judicial expertise. This study presents the construction of a linear regression model to determine the market value (dependent variable) of residential apartments in the city of Fortaleza-CE. The studied database presents 17,493 apartments, divided into 227 plan types in a total of 154 projects launched between the years of 2011 and 2014. The model developed was obtained using Multiple Linear Regression associated with the Ridge Regression technique to solve the existing multicollinearity problem. In the analysis of 30 variables (12 quantitative and 18 dummy type qualitative variables), an equation with 6 variables was reached, which meets the theoretical assumptions for its existence.
Linear Regression E-commerce Dataset
kaggle.com
zip
Updated Sep 16, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Saurabh Kolawale (2019). Linear Regression E-commerce Dataset [Dataset]. https://www.kaggle.com/datasets/kolawale/focusing-on-mobile-app-or-website
Explore at:
zip(44169 bytes)Available download formats
Dataset updated
Sep 16, 2019
Authors
Saurabh Kolawale
Description
This dataset is having data of customers who buys clothes online. The store offers in-store style and clothing advice sessions. Customers come in to the store, have sessions/meetings with a personal stylist, then they can go home and order either on a mobile app or website for the clothes they want.

The company is trying to decide whether to focus their efforts on their mobile app experience or their website.
o
Weighted Linear Regression - Dataset - Open Data NI
admin.opendatani.gov.uk
Updated Oct 9, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Weighted Linear Regression - Dataset - Open Data NI [Dataset]. https://admin.opendatani.gov.uk/dataset/weighted-linear-regression
Explore at:
Dataset updated
Oct 9, 2024
License
Open Government Licence 3.0http://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/
License information was derived automatically
Description
The primary objective from this project was to acquire historical shoreline information for all of the Northern Ireland coastline. Having this detailed understanding of the coast’s shoreline position and geometry over annual to decadal time periods is essential in any management of the coast.The historical shoreline analysis was based on all available Ordnance Survey maps and aerial imagery information. Analysis looked at position and geometry over annual to decadal time periods, providing a dynamic picture of how the coastline has changed since the start of the early 1800s.Once all datasets were collated, data was interrogated using the ArcGIS package – Digital Shoreline Analysis System (DSAS). DSAS is a software package which enables a user to calculate rate-of-change statistics from multiple historical shoreline positions. Rate-of-change was collected at 25m intervals and displayed both statistically and spatially allowing for areas of retreat/accretion to be identified at any given stretch of coastline.The DSAS software will produce the following rate-of-change statistics:Net Shoreline Movement (NSM) – the distance between the oldest and the youngest shorelines.Shoreline Change Envelope (SCE) – a measure of the total change in shoreline movement considering all available shoreline positions and reporting their distances, without reference to their specific dates.End Point Rate (EPR) – derived by dividing the distance of shoreline movement by the time elapsed between the oldest and the youngest shoreline positions.Linear Regression Rate (LRR) – determines a rate of change statistic by fitting a least square regression to all shorelines at specific transects.Weighted Linear Regression Rate (WLR) - calculates a weighted linear regression of shoreline change on each transect. It considers the shoreline uncertainty giving more emphasis on shorelines with a smaller error.The end product provided by Ulster University is an invaluable tool and digital asset that has helped to visualise shoreline change and assess approximate rates of historical change at any given coastal stretch on the Northern Ireland coast.
TV Sales Regression
kaggle.com
Updated Jul 13, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sachin Gupta (2023). TV Sales Regression [Dataset]. https://www.kaggle.com/sachinmethdai/tv-sales-regression
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jul 13, 2023
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Sachin Gupta
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Dataset

This dataset was created by Sachin Gupta

Released under CC0: Public Domain

Contents
AirQualityCOVID-dataset
zenodo.org
data.niaid.nih.gov
zip
Updated Apr 23, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Jaime González-Pardo; Jaime González-Pardo; Rodrigo Manzanas; Rodrigo Manzanas; Sandra Ceballos-Santos; Sandra Ceballos-Santos (2023). AirQualityCOVID-dataset [Dataset]. http://doi.org/10.5281/zenodo.5642868
Explore at:
zipAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.5642868
Dataset updated
Apr 23, 2023
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Jaime González-Pardo; Jaime González-Pardo; Rodrigo Manzanas; Rodrigo Manzanas; Sandra Ceballos-Santos; Sandra Ceballos-Santos
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
This repository contains all the data used for the article "Estimating changes in air pollutant levels due to COVID-19 lockdown measures based on a business-as-usual prediction scenario using data mining models: A case-study for urban traffic sites in Spain", submitted to Environmental Software & Modelling by J. González-Pardo et al. (2022) published in Science of the Total Environment (STOTEN). For the sake of reproducibility, it includes Jupyter notebooks with worked examples which allow to reproduce the results shown in that paper.

Contact: jaime.diez.gp@gmail.com

During the course of this research the pyaemet python library has been developed in order to download daily meteorological observations from the Spanish Met Service (AEMET) via its OpenData API REST and it is needed to perform the data curation process.

This research was developed in the framework of the project “Contaminación atmosférica y COVID-19: ¿Qué podemos aprender de esta pandemia?”, selected in the Extraordinary BBVA Foundation grant call for SARS-CoV-2 and COVID-19 research proposals, within the area of ecology and veterinary science.
Dataset for the mechanical performance prediction of asphalt mixtures: a...
zenodo.org
Updated Mar 21, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Nicola Baldo; Nicola Baldo; Fabio Rondinella; Fabio Rondinella; Fabiola Daneluz; Fabiola Daneluz; Pavla Vacková; Pavla Vacková; Jan Valentin; Jan Valentin; Marcin Daniel Gajewski; Marcin Daniel Gajewski; Jan Krol; Jan Krol (2025). Dataset for the mechanical performance prediction of asphalt mixtures: a baseline study of linear and non-linear regression compared with Neural Network modelling within Weave-UNISONO 2021 project, NCN project No 2021/03/Y/ST8/00079, and GACR project GA22-04047K [Dataset]. http://doi.org/10.5281/zenodo.15058842
Explore at:
Unique identifier
https://doi.org/10.5281/zenodo.15058842
Dataset updated
Mar 21, 2025
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Nicola Baldo; Nicola Baldo; Fabio Rondinella; Fabio Rondinella; Fabiola Daneluz; Fabiola Daneluz; Pavla Vacková; Pavla Vacková; Jan Valentin; Jan Valentin; Marcin Daniel Gajewski; Marcin Daniel Gajewski; Jan Krol; Jan Krol
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Summary:

Two selected mixtures were thoroughly investigated in an experimental trial carried out by means of a four-point bending test (4PBT) apparatus. The mixtures were prepared using aggregate, a conventional 50/70 penetration grade bitumen, and limestone filler. Their stiffness moduli (SM) were determined while samples were exposed to loading frequencies from 0.1 to 50 Hz, and testing temperatures ranged from 0 to 30 °C. The main scope of this research was to compare analysis between different modelling approaches: conventional regressions, both linear and non-linear, and artificial neural networks.

The dataset includes:

Outcomes of the 4PBT experimental carried out on two types of asphalt concrete: NMAS16 and NMAS22 mixtures

Stiffness Modulus NMAS16.csv

Stiffness Modulus NMAS22.csv
m
Early Software Size Estimation using Weighted Analysis Class Diagram Metrics...
data.mendeley.com
Updated Jun 9, 2022
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Marriam Daud (2022). Early Software Size Estimation using Weighted Analysis Class Diagram Metrics - Datasets [Dataset]. http://doi.org/10.17632/mnrpcxzk88.1
Explore at:
Unique identifier
https://doi.org/10.17632/mnrpcxzk88.1
Dataset updated
Jun 9, 2022
Authors
Marriam Daud
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
It includes five different datasets. The first four datasets contain student projects collected from different offerings of two undergraduate-level courses – Object-Oriented Analysis and Design (OOAD) and Software Engineering (SE) – taught in a renowned private university in Lahore over a period of six years. The fifth dataset contains real-life industry projects collected from a renowned software house (i.e. member of Pakistan Software Houses Association for IT and ITeS (P@SHA)) in Lahore.

Dataset #1 consists of 31 C++ GUI-based desktop applications. Dataset #2 consists of 19 Java GUI-based desktop applications. Dataset #3 consists of 12 Java web applications. Dataset #4 consists of 31 Java all two categories. Dataset #5 consists of 11 VB.NET GUI-based desktop applications.

Attributes are used as follows: Project Code – Project ID for identification purposes NOC – The total number of classes in a class diagram NOA – The total number of attributes in a class diagram NOM – The total number of methods/operations in a class diagram NODep – The total number of dependency relationships in a class diagram NOAss – The total number of association relationships in a class diagram NOComp – The total number of composition relationships in a class diagram NOAgg – The total number of aggregation relationships in a class diagram NOGen – The total number of generalization relationships in a class diagram NORR – The total number of realization relationships in a class diagram NOOM – The total number of one-to-one multiplicity relationships in a class diagram NOMM – The total number of one-to-many multiplicity relationships in a class diagram NMMM – The total number of many-to-many multiplicity relationships in a class diagram OCP – objective class points EOCP – enhanced objective class points WEOCP – weighted enhanced objective class points SLOC – software size measured in source lines of code
f
Data from: Integrating statistical writing in an applied regression course...
tandf.figshare.com
pdf
Updated Jun 30, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Laura A. Hildreth; Ella M. Burnham (2025). Integrating statistical writing in an applied regression course using small-scale writing projects [Dataset]. http://doi.org/10.6084/m9.figshare.29438235.v1
Explore at:
pdfAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.29438235.v1
Dataset updated
Jun 30, 2025
Dataset provided by
Taylor & Francis
Authors
Laura A. Hildreth; Ella M. Burnham
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Effective communication skills, both written and oral, are considered core skills for statisticians. This article presents five small-scale writing projects that were developed for an applied regression course, including the specific writing skills emphasized in each project and what each project entails. We also present and discuss results from surveys on changes in writing attitudes throughout the course and student feedback on the projects. The results indicate improved attitudes toward writing and a positive experience for students. Recommendations for incorporating the writing projects based on our observations of implementing them and potential changes are also provided. Materials for all projects are available in the online supplemental materials.
d
Multivariate regression model for predicting oxygen reduction rates in...
catalog.data.gov
data.usgs.gov
+1more
Updated Jul 20, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Geological Survey (2024). Multivariate regression model for predicting oxygen reduction rates in groundwater for the State of Wisconsin [Dataset]. https://catalog.data.gov/dataset/multivariate-regression-model-for-predicting-oxygen-reduction-rates-in-groundwater-for-the
Explore at:
Dataset updated
Jul 20, 2024
Dataset provided by
United States Geological Surveyhttp://www.usgs.gov/
Description
A multivariate regression model was developed to predict zero-order oxygen reduction rates (mg/L/yr) in aquifers across the State of Wisconsin. The model used a combination of dissolved oxygen concentrations and mean groundwater ages estimated with sampled age tracers from wells in the U.S. Geological Survey National Water Information System and previously published project reports from state agencies and universities. The multivariate regression model was solved using the Microsoft Excel solver, with 461 wells used for training and 46 wells held-out for validation. A total of 31 predictor variables were used for model development (56 were tested), including basic well characteristics, soil properties, aquifer properties, hydrologic position on the landscape, recharge and evapotranspiration rates, and land use characteristics. Model results indicate that the mean oxygen reduction rate for the training wells is 0.15 mg/L/yr (ranges from 0.07 to 0.59 mg/L/yr), with a root mean weighted square error of 3.13 mg/L/yr and Coefficient of Correlation (r^2) of 0.49 for the holdout validation data. This data release includes the Microsoft Excel file that represents the final solved regression model, as well as an Excel file that describes all of the predictor variables that were tested with the model.
Datasets for linear regression on Swedish Motor Insurance
zenodo.org
csv
Updated Apr 13, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sibincic Aleksandar; Sibincic Aleksandar (2020). Datasets for linear regression on Swedish Motor Insurance [Dataset]. http://doi.org/10.5281/zenodo.3749812
Explore at:
csvAvailable download formats
Unique identifier
https://doi.org/10.5281/zenodo.3749812
Dataset updated
Apr 13, 2020
Dataset provided by
Zenodohttp://zenodo.org/
Authors
Sibincic Aleksandar; Sibincic Aleksandar
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
In this project we have 3 datasets. Training Set and Test set consists of the input data from Swedish Motor Insurance dataset which is dividen in ratio 80%-20%. Third dataset consists of our predictions for Sum of payments using linear regression.

Code and data from simulations that apply multiple regression analysis...

zenodo.org

bin, csv

Updated Jun 12, 2025

Facebook

Twitter

Click to copy link

Link copied

Cite

Takeharu SEKI; Takeharu SEKI (2025). Code and data from simulations that apply multiple regression analysis models to biased occurrence data to detect thermophilization. [Dataset]. http://doi.org/10.5281/zenodo.13431533

Explore at:

csv, binAvailable download formats

Unique identifier

https://doi.org/10.5281/zenodo.13431533

Dataset updated

Jun 12, 2025

Dataset provided by

Zenodohttp://zenodo.org/

Authors

Takeharu SEKI; Takeharu SEKI

License

Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically

Description

READ ME

Description of this repository

This repository houses the code and data for simulations that apply multiple regression analysis models to biased occurrence data to detect thermophilization.

Explanation of each file

SimulationCode.R

This R code simulates the application of a multiple regression analysis model to biased occurrence data to detect thermophilization.

Note: To save the running time, we used a parallel computation approach (run time of approximately 30 minutes). Since seven CPUs were used, an equal or greater number of CPUs would be required to reproduce the same results.

01_GeneratedDistributionData.csv

Simulation-generated distribution data of fictitious biota species. The column names are explained below.

Column Names	Explanation
IndID	Unique individual identification number
SpeciesID	Unique identification number for the species to witch the individual belongs.
Step	Steps in which the individual exists.
LTI	Local Temperature Index (LTI) of the location where the individual occurred.
SpeciesLTICenter	Central value of the species-specific LTI at the time of its Step
Prob.BiasToWarm	Value of weighting sampled when Bias to Warm is present.
Prob.BiasToCold	Value of weighting sampled when Bias to Cold is present.

02_ExtractedBiasedOccurrenceData.csv

The result of extracting 2,000 biased occurrences data ofrom the Distribution data.

Column Names	Explanation
IndID	Unique identification number of the extracted individual.
SpeciesID	Unique identification number for the species to witch the individual belongs.
Step	Steps in which the individual is extracted
LTI	Local Temperature Index (LTI) of the location where the individual occurred.
EstSTI	Species Temperature Index (STI) of the record species calculated on the basis of the occurrence data.
BiasType	The type of bias
iter	The number of iteration

Reference

This simulation code uses the following packages.

{tidyverse} package,

 Wickham H, Averick M, Bryan J, Chang W, McGowan LD, François R, Grolemund G, Hayes A, Henry L, Hester J, Kuhn M, Pedersen TL, Miller E, Bache SM, Müller K, Ooms J, Robinson D, Seidel DP, Spinu V, Takahashi K, Vaughan D, Wilke C, Woo K, Yutani H (2019). “Welcome to the tidyverse.” _Journal of Open Source Software_, *4*(43), 1686. doi:10.21105/joss.01686 <https://doi.org/10.21105/joss.01686>.

{broom} package,

Robinson D, Hayes A, Couch S (2024). broom: Convert Statistical Objects into Tidy Tibbles. R package version 1.0.7, https://github.com/tidymodels/broom,

{rlist} package.

Ren K (2021). _rlist: A Toolbox for Non-Tabular Data Manipulation_. R package version 0.4.6.2, <https://CRAN.R-project.org/package=rlist>.

{data.table} package

Barrett T, Dowle M, Srinivasan A, Gorecki J, Chirico M, Hocking T (2024). _data.table: Extension of `data.frame`_. R package version 1.15.4, <https://CRAN.R-project.org/package=data.table>.

{snowfall} package

Knaus J (2023). _snowfall: Easier Cluster Computing (Based on 'snow')_. R package version 1.84-6.3, <https://CRAN.R-project.org/package=snowfall>.

{magrittr} package

Bache S, Wickham H (2022). _magrittr: A Forward-Pipe Operator for R_. R package version 2.0.3, <https://CRAN.R-project.org/package=magrittr>.

{ggpmisc} package

Aphalo P (2024). _ggpmisc: Miscellaneous Extensions to 'ggplot2'_. R package version 0.5.6, <https://CRAN.R-project.org/package=ggpmisc>.

{effsize} package

Torchiano M (2020). _effsize: Efficient Effect Size Computation_. doi:10.5281/zenodo.1480624 <https://doi.org/10.5281/zenodo.1480624>, R package version 0.8.1, <https://CRAN.R-project.org/package=effsize>.

{conflicted] package

Wickham H (2023). _conflicted: An Alternative Conflict Resolution Strategy_. R package version 1.2.0, <https://CRAN.R-project.org/package=conflicted>.

f
Multiple regression results for health outcomes—VADER.
figshare.com
xls
Updated Jun 1, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Joseph Gibbons; Robert Malouf; Brian Spitzberg; Lourdes Martinez; Bruce Appleyard; Caroline Thompson; Atsushi Nara; Ming-Hsiang Tsou (2023). Multiple regression results for health outcomes—VADER. [Dataset]. http://doi.org/10.1371/journal.pone.0219550.t003
Explore at:
xlsAvailable download formats
Unique identifier
https://doi.org/10.1371/journal.pone.0219550.t003
Dataset updated
Jun 1, 2023
Dataset provided by
PLOS ONE
Authors
Joseph Gibbons; Robert Malouf; Brian Spitzberg; Lourdes Martinez; Bruce Appleyard; Caroline Thompson; Atsushi Nara; Ming-Hsiang Tsou
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Multiple regression results for health outcomes—VADER.
d
Digital Shoreline Analysis System version 4.3 Transects with Long-Term...
catalog.data.gov
data.usgs.gov
+2more
Updated Jul 7, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Geological Survey (2024). Digital Shoreline Analysis System version 4.3 Transects with Long-Term Linear Regression Rate Calculations for Alabama [Dataset]. https://catalog.data.gov/dataset/digital-shoreline-analysis-system-version-4-3-transects-with-long-term-linear-regression-r-a6df9
Explore at:
Dataset updated
Jul 7, 2024
Dataset provided by
United States Geological Surveyhttp://www.usgs.gov/
Area covered
Alabama
Description
Sandy ocean beaches are a popular recreational destination, often surrounded by communities containing valuable real estate. Development is on the rise despite the fact that coastal infrastructure is subjected to flooding and erosion. As a result, there is an increased demand for accurate information regarding past and present shoreline changes. To meet these national needs, the Coastal and Marine Geology Program of the U.S. Geological Survey (USGS) is compiling existing reliable historical shoreline data along open-ocean sandy shores of the conterminous United States and parts of Alaska and Hawaii under the National Assessment of Shoreline Change project. There is no widely accepted standard for analyzing shoreline change. Existing shoreline data measurements and rate calculation methods vary from study to study and prevent combining results into state-wide or regional assessments. The impetus behind the National Assessment project was to develop a standardized method of measuring changes in shoreline position that is consistent from coast to coast. The goal was to facilitate the process of periodically and systematically updating the results in an internally consistent manner.
f
Data from: Learning While Learning: Psychology Case Studies for Teaching...
tandf.figshare.com
bin
Updated Apr 1, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ciaran Evans; Alex Reinhart; Erin Cooley; William Cipolli (2025). Learning While Learning: Psychology Case Studies for Teaching Regression [Dataset]. http://doi.org/10.6084/m9.figshare.28127458.v2
Explore at:
binAvailable download formats
Unique identifier
https://doi.org/10.6084/m9.figshare.28127458.v2
Dataset updated
Apr 1, 2025
Dataset provided by
Taylor & Francis
Authors
Ciaran Evans; Alex Reinhart; Erin Cooley; William Cipolli
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
In this article, we explore the use of two published datasets for teaching a wide range of students about regression models, with a particular focus on interaction terms. The two datasets come from recent psychology studies on beliefs about poverty and welfare, and about the dynamics of groups projects. Both datasets (and their original research papers) are accessible to students, and because of their context, students can learn about data collection, measurement, and the use of statistics when studying complex social topics, while using the data to learn about regression analysis. We have used these data for a range of in-class activities, journal paper discussions, exams, and extended projects, at the undergraduate, master’s, and doctoral levels. Supplementary materials for this article are available online.
d
Digital Shoreline Analysis System version 4.3 Transects with Short-Term...
datadiscoverystudio.org
search.dataone.org
+4more
Updated Jun 8, 2018
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2018). Digital Shoreline Analysis System version 4.3 Transects with Short-Term Linear Regression Rate Calculations for Texas west (TXwest). [Dataset]. http://datadiscoverystudio.org/geoportal/rest/metadata/item/cd4a68bf7fd74762bf0e15af1d38fa74/html
Explore at:
Dataset updated
Jun 8, 2018
Description
description: Sandy ocean beaches are a popular recreational destination, often surrounded by communities containing valuable real estate. Development is on the rise despite the fact that coastal infrastructure is subjected to flooding and erosion. As a result, there is an increased demand for accurate information regarding past and present shoreline changes. To meet these national needs, the Coastal and Marine Geology Program of the U.S. Geological Survey (USGS) is compiling existing reliable historical shoreline data along open-ocean sandy shores of the conterminous United States and parts of Alaska and Hawaii under the National Assessment of Shoreline Change project. There is no widely accepted standard for analyzing shoreline change. Existing shoreline data measurements and rate calculation methods vary from study to study and prevent combining results into state-wide or regional assessments. The impetus behind the National Assessment project was to develop a standardized method of measuring changes in shoreline position that is consistent from coast to coast. The goal was to facilitate the process of periodically and systematically updating the results in an internally consistent manner.; abstract: Sandy ocean beaches are a popular recreational destination, often surrounded by communities containing valuable real estate. Development is on the rise despite the fact that coastal infrastructure is subjected to flooding and erosion. As a result, there is an increased demand for accurate information regarding past and present shoreline changes. To meet these national needs, the Coastal and Marine Geology Program of the U.S. Geological Survey (USGS) is compiling existing reliable historical shoreline data along open-ocean sandy shores of the conterminous United States and parts of Alaska and Hawaii under the National Assessment of Shoreline Change project. There is no widely accepted standard for analyzing shoreline change. Existing shoreline data measurements and rate calculation methods vary from study to study and prevent combining results into state-wide or regional assessments. The impetus behind the National Assessment project was to develop a standardized method of measuring changes in shoreline position that is consistent from coast to coast. The goal was to facilitate the process of periodically and systematically updating the results in an internally consistent manner.
c
Digital Shoreline Analysis System version 4.2 Transects with Long-Term...
s.cnmilf.com
catalog.data.gov
+1more
Updated Jul 6, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
U.S. Geological Survey (2024). Digital Shoreline Analysis System version 4.2 Transects with Long-Term Linear Regression Rate Calculations for Oregon (OR_transects_LT.shp) [Dataset]. https://s.cnmilf.com/user74170196/https/catalog.data.gov/dataset/digital-shoreline-analysis-system-version-4-2-transects-with-long-term-linear-regression-r-972c0
Explore at:
Dataset updated
Jul 6, 2024
Dataset provided by
United States Geological Surveyhttp://www.usgs.gov/
Description
Sandy ocean beaches are a popular recreational destination, often surrounded by communities containing valuable real estate. Development is on the rise despite the fact that coastal infrastructure is subjected to flooding and erosion. As a result, there is an increased demand for accurate information regarding past and present shoreline changes. To meet these national needs, the Coastal and Marine Geology Program of the U.S. Geological Survey (USGS) is compiling existing reliable historical shoreline data along open-ocean sandy shores of the conterminous United States and parts of Alaska and Hawaii under the National Assessment of Shoreline Change project. There is no widely accepted standard for analyzing shoreline change. Existing shoreline data measurements and rate calculation methods vary from study to study and prevent combining results into state-wide or regional assessments. The impetus behind the National Assessment project was to develop a standardized method of measuring changes in shoreline position that is consistent from coast to coast. The goal was to facilitate the process of periodically and systematically updating the results in an internally consistent manner.
u
Boston Short-term Linear Regression Change Rates
marine.usgs.gov
Updated Jun 14, 2016
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2016). Boston Short-term Linear Regression Change Rates [Dataset]. https://marine.usgs.gov/coastalchangehazardsportal/ui/info/item/Evr6tXs2
Explore at:
Dataset updated
Jun 14, 2016
Area covered

Description
This dataset consists of short-term (1970-2009) linear regression shoreline change rates for the Boston region of Massachusetts. Rates of short-term shoreline change were computed within a GIS using the Digital Shoreline Analysis System (DSAS) version 4.3, an ArcGIS extension developed by the U.S. Geological Survey. The baseline is used as a reference line for the transects cast by the DSAS software. The transects intersect each shoreline at the measurement points, which are then used to calculate the short-term rates. Due to continued coastal population growth and increased threats of erosion, current data on trends and rates of shoreline movement are required to inform shoreline and floodplain management. The Massachusetts Office of Coastal Zone Management launched the Shoreline Change Project in 1989 to identify erosion-prone areas of the coast. In 2001, a 1994 shoreline was added to calculate both long- and short-term shoreline change rates at 40-meter intervals along ocean-facing sections of the Massachusetts coast. The Coastal and Marine Geology Program of the U.S. Geological Survey (USGS) in cooperation with the Massachusetts Office of Coastal Zone Management, has compiled reliable historical shoreline data along open-facing sections of the Massachusetts coast under the Massachusetts Shoreline Change Mapping and Analysis Project 2013 Update. Two oceanfront shorelines for Massachusetts (approximately 1,800 km) were (1) delineated using 2008/09 color aerial orthoimagery, and (2) extracted from topographic LIDAR datasets (2007) obtained from NOAA's Ocean Service, Coastal Services Center. The new shorelines were integrated with existing Massachusetts Office of Coastal Zone Management and USGS historical shoreline data in order to compute long- and short-term rates using the latest version of the Digital Shoreline Analysis System (DSAS).
g
Digital Shoreline Analysis System version 4.3 Transects with Long-Term...
gimi9.com
Updated Dec 9, 2024
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
(2024). Digital Shoreline Analysis System version 4.3 Transects with Long-Term Linear Regression Rate Calculations for Louisiana | gimi9.com [Dataset]. https://gimi9.com/dataset/data-gov_045d66eace2fbab609d46886e6a8bc16b33fd2e0/
Explore at:
Dataset updated
Dec 9, 2024
Area covered
Louisiana
Description
Sandy ocean beaches are a popular recreational destination, often surrounded by communities containing valuable real estate. Development is on the rise despite the fact that coastal infrastructure is subjected to flooding and erosion. As a result, there is an increased demand for accurate information regarding past and present shoreline changes. To meet these national needs, the Coastal and Marine Geology Program of the U.S. Geological Survey (USGS) is compiling existing reliable historical shoreline data along open-ocean sandy shores of the conterminous United States and parts of Alaska and Hawaii under the National Assessment of Shoreline Change project. There is no widely accepted standard for analyzing shoreline change. Existing shoreline data measurements and rate calculation methods vary from study to study and prevent combining results into state-wide or regional assessments. The impetus behind the National Assessment project was to develop a standardized method of measuring changes in shoreline position that is consistent from coast to coast. The goal was to facilitate the process of periodically and systematically updating the results in an internally consistent manner.

Facebook

Twitter

Click to copy link

Link copied

Cite

U.S. Geological Survey (2024). An example data set for exploration of Multiple Linear Regression [Dataset]. https://catalog.data.gov/dataset/an-example-data-set-for-exploration-of-multiple-linear-regression

An example data set for exploration of Multiple Linear Regression

Explore at:

Dataset updated

Jul 6, 2024

Dataset provided by

United States Geological Surveyhttp://www.usgs.gov/

Description

This data set contains example data for exploration of the theory of regression based regionalization. The 90th percentile of annual maximum streamflow is provided as an example response variable for 293 streamgages in the conterminous United States. Several explanatory variables are drawn from the GAGES-II data base in order to demonstrate how multiple linear regression is applied. Example scripts demonstrate how to collect the original streamflow data provided and how to recreate the figures from the associated Techniques and Methods chapter.

Clear search

Close search

Google apps

Main menu

An example data set for exploration of Multiple Linear Regression

Stonybrook_AMS578_Multiple_Regression_Dataset

Context

Content

Data from: Multiple linear regression model to evaluate the market value of...

Linear Regression E-commerce Dataset

Weighted Linear Regression - Dataset - Open Data NI

TV Sales Regression

Dataset

Contents

AirQualityCOVID-dataset

Dataset for the mechanical performance prediction of asphalt mixtures: a...

Early Software Size Estimation using Weighted Analysis Class Diagram Metrics...

Data from: Integrating statistical writing in an applied regression course...

Multivariate regression model for predicting oxygen reduction rates in...

Datasets for linear regression on Swedish Motor Insurance

Code and data from simulations that apply multiple regression analysis...

READ ME

Description of this repository

Explanation of each file

SimulationCode.R

01_GeneratedDistributionData.csv

02_ExtractedBiasedOccurrenceData.csv

Reference

Multiple regression results for health outcomes—VADER.

Digital Shoreline Analysis System version 4.3 Transects with Long-Term...

Data from: Learning While Learning: Psychology Case Studies for Teaching...

Digital Shoreline Analysis System version 4.3 Transects with Short-Term...

Digital Shoreline Analysis System version 4.2 Transects with Long-Term...

Boston Short-term Linear Regression Change Rates

Digital Shoreline Analysis System version 4.3 Transects with Long-Term...

An example data set for exploration of Multiple Linear Regression