96 datasets found
  1. U

    An example data set for exploration of Multiple Linear Regression

    • data.usgs.gov
    • catalog.data.gov
    Updated Feb 24, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    William Farmer (2024). An example data set for exploration of Multiple Linear Regression [Dataset]. http://doi.org/10.5066/P9T5ZEXV
    Explore at:
    Dataset updated
    Feb 24, 2024
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Authors
    William Farmer
    License

    U.S. Government Workshttps://www.usa.gov/government-works
    License information was derived automatically

    Time period covered
    1956 - 2016
    Description

    This data set contains example data for exploration of the theory of regression based regionalization. The 90th percentile of annual maximum streamflow is provided as an example response variable for 293 streamgages in the conterminous United States. Several explanatory variables are drawn from the GAGES-II data base in order to demonstrate how multiple linear regression is applied. Example scripts demonstrate how to collect the original streamflow data provided and how to recreate the figures from the associated Techniques and Methods chapter.

  2. House Price Regression Dataset

    • kaggle.com
    zip
    Updated Sep 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Prokshitha Polemoni (2024). House Price Regression Dataset [Dataset]. https://www.kaggle.com/datasets/prokshitha/home-value-insights
    Explore at:
    zip(27045 bytes)Available download formats
    Dataset updated
    Sep 6, 2024
    Authors
    Prokshitha Polemoni
    Description

    Home Value Insights: A Beginner's Regression Dataset

    This dataset is designed for beginners to practice regression problems, particularly in the context of predicting house prices. It contains 1000 rows, with each row representing a house and various attributes that influence its price. The dataset is well-suited for learning basic to intermediate-level regression modeling techniques.

    Features:

    1. Square_Footage: The size of the house in square feet. Larger homes typically have higher prices.
    2. Num_Bedrooms: The number of bedrooms in the house. More bedrooms generally increase the value of a home.
    3. Num_Bathrooms: The number of bathrooms in the house. Houses with more bathrooms are typically priced higher.
    4. Year_Built: The year the house was built. Older houses may be priced lower due to wear and tear.
    5. Lot_Size: The size of the lot the house is built on, measured in acres. Larger lots tend to add value to a property.
    6. Garage_Size: The number of cars that can fit in the garage. Houses with larger garages are usually more expensive.
    7. Neighborhood_Quality: A rating of the neighborhood’s quality on a scale of 1-10, where 10 indicates a high-quality neighborhood. Better neighborhoods usually command higher prices.
    8. House_Price (Target Variable): The price of the house, which is the dependent variable you aim to predict.

    Potential Uses:

    1. Beginner Regression Projects: This dataset can be used to practice building regression models such as Linear Regression, Decision Trees, or Random Forests. The target variable (house price) is continuous, making this an ideal problem for supervised learning techniques.

    2. Feature Engineering Practice: Learners can create new features by combining existing ones, such as the price per square foot or age of the house, providing an opportunity to experiment with feature transformations.

    3. Exploratory Data Analysis (EDA): You can explore how different features (e.g., square footage, number of bedrooms) correlate with the target variable, making it a great dataset for learning about data visualization and summary statistics.

    4. Model Evaluation: The dataset allows for various model evaluation techniques such as cross-validation, R-squared, and Mean Absolute Error (MAE). These metrics can be used to compare the effectiveness of different models.

    Versatility:

    • The dataset is highly versatile for a range of machine learning tasks. You can apply simple linear models to predict house prices based on one or two features, or use more complex models like Random Forest or Gradient Boosting Machines to understand interactions between variables.

    • It can also be used for dimensionality reduction techniques like PCA or to practice handling categorical variables (e.g., neighborhood quality) through encoding techniques like one-hot encoding.

    • This dataset is ideal for anyone wanting to gain practical experience in building regression models while working with real-world features.

  3. q

    Linear Regression (Excel) and Cellular Respiration for Biology, Chemistry...

    • qubeshub.org
    Updated Jan 11, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Irene Corriette; Beatriz Gonzalez; Daniela Kitanska; Henriette Mozsolits; Sheela Vemu (2022). Linear Regression (Excel) and Cellular Respiration for Biology, Chemistry and Mathematics [Dataset]. http://doi.org/10.25334/5PX5-H796
    Explore at:
    Dataset updated
    Jan 11, 2022
    Dataset provided by
    QUBES
    Authors
    Irene Corriette; Beatriz Gonzalez; Daniela Kitanska; Henriette Mozsolits; Sheela Vemu
    Description

    Students typically find linear regression analysis of data sets in a biology classroom challenging. These activities could be used in a Biology, Chemistry, Mathematics, or Statistics course. The collection provides student activity files with Excel instructions and Instructor Activity files with Excel instructions and solutions to problems.

    Students will be able to perform linear regression analysis, find correlation coefficient, create a scatter plot and find the r-square using MS Excel 365. Students will be able to interpret data sets, describe the relationship between biological variables, and predict the value of an output variable based on the input of an predictor variable.

  4. Data from: Regression with Empirical Variable Selection: Description of a...

    • plos.figshare.com
    txt
    Updated Jun 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Anne E. Goodenough; Adam G. Hart; Richard Stafford (2023). Regression with Empirical Variable Selection: Description of a New Method and Application to Ecological Datasets [Dataset]. http://doi.org/10.1371/journal.pone.0034338
    Explore at:
    txtAvailable download formats
    Dataset updated
    Jun 8, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Anne E. Goodenough; Adam G. Hart; Richard Stafford
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Despite recent papers on problems associated with full-model and stepwise regression, their use is still common throughout ecological and environmental disciplines. Alternative approaches, including generating multiple models and comparing them post-hoc using techniques such as Akaike's Information Criterion (AIC), are becoming more popular. However, these are problematic when there are numerous independent variables and interpretation is often difficult when competing models contain many different variables and combinations of variables. Here, we detail a new approach, REVS (Regression with Empirical Variable Selection), which uses all-subsets regression to quantify empirical support for every independent variable. A series of models is created; the first containing the variable with most empirical support, the second containing the first variable and the next most-supported, and so on. The comparatively small number of resultant models (n = the number of predictor variables) means that post-hoc comparison is comparatively quick and easy. When tested on a real dataset – habitat and offspring quality in the great tit (Parus major) – the optimal REVS model explained more variance (higher R2), was more parsimonious (lower AIC), and had greater significance (lower P values), than full, stepwise or all-subsets models; it also had higher predictive accuracy based on split-sample validation. Testing REVS on ten further datasets suggested that this is typical, with R2 values being higher than full or stepwise models (mean improvement = 31% and 7%, respectively). Results are ecologically intuitive as even when there are several competing models, they share a set of “core” variables and differ only in presence/absence of one or two additional variables. We conclude that REVS is useful for analysing complex datasets, including those in ecology and environmental disciplines.

  5. Data from: AgroReg: main regression models in agricultural sciences...

    • scielo.figshare.com
    jpeg
    Updated Jul 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gabriel Danilo Shimizu; Leandro Simões Azeredo Gonçalves (2023). AgroReg: main regression models in agricultural sciences implemented as an R Package [Dataset]. http://doi.org/10.6084/m9.figshare.23648270.v1
    Explore at:
    jpegAvailable download formats
    Dataset updated
    Jul 8, 2023
    Dataset provided by
    SciELOhttp://www.scielo.org/
    Authors
    Gabriel Danilo Shimizu; Leandro Simões Azeredo Gonçalves
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    ABSTRACT Regression analysis is highly relevant to agricultural sciences since many of the factors studied are quantitative. Researchers have generally used polynomial models to explain their experimental results, mainly because much of the existing software perform this analysis and a lack of knowledge of other models. On the other hand, many of the natural phenomena do not present such behavior; nevertheless, the use of non-linear models is costly and requires advanced knowledge of language programming such as R. Thus, this work presents several regression models found in scientific studies, implementing them in the form of an R package called AgroReg. The package comprises 44 analysis functions with 66 regression models such as polynomial, non-parametric (loess), segmented, logistic, exponential, and logarithmic, among others. The functions provide the coefficient of determination (R2), model coefficients and the respective p-values from the t-test, root mean square error (RMSE), Akaike’s information criterion (AIC), Bayesian information criterion (BIC), maximum and minimum predicted values, and the regression plot. Furthermore, other measures of model quality and graphical analysis of residuals are also included. The package can be downloaded from the CRAN repository using the command: install.packages(“AgroReg”). AgroReg is a promising analysis tool in agricultural research on account of its user-friendly and straightforward functions that allow for fast and efficient data processing with greater reliability and relevant information.

  6. Multiple regression analysis for log HOMA-R.

    • plos.figshare.com
    • datasetcatalog.nlm.nih.gov
    xls
    Updated Jun 4, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shutaro Ishimura; Masato Furuhashi; Yuki Watanabe; Kyoko Hoshina; Takahiro Fuseya; Tomohiro Mita; Yusuke Okazaki; Masayuki Koyama; Marenao Tanaka; Hiroshi Akasaka; Hirofumi Ohnishi; Hideaki Yoshida; Shigeyuki Saitoh; Tetsuji Miura (2023). Multiple regression analysis for log HOMA-R. [Dataset]. http://doi.org/10.1371/journal.pone.0081318.t005
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 4, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Shutaro Ishimura; Masato Furuhashi; Yuki Watanabe; Kyoko Hoshina; Takahiro Fuseya; Tomohiro Mita; Yusuke Okazaki; Masayuki Koyama; Marenao Tanaka; Hiroshi Akasaka; Hirofumi Ohnishi; Hideaki Yoshida; Shigeyuki Saitoh; Tetsuji Miura
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Multiple regression analysis for log HOMA-R.

  7. f

    Multiple regression analysis.

    • datasetcatalog.nlm.nih.gov
    • figshare.com
    Updated Aug 25, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kimber, Allysa J.; Meyer, Kacie J.; Anderson, Michael G.; Larson, Demelza R. (2023). Multiple regression analysis. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0000973289
    Explore at:
    Dataset updated
    Aug 25, 2023
    Authors
    Kimber, Allysa J.; Meyer, Kacie J.; Anderson, Michael G.; Larson, Demelza R.
    Description

    Anterior chamber depth (ACD) is a quantitative trait associated with primary angle closure glaucoma (PACG). Although ACD is highly heritable, known genetic variations explain a small fraction of the phenotypic variability. The purpose of this study was to identify additional ACD-influencing loci using strains of mice. Cohorts of 86 N2 and 111 F2 mice were generated from crosses between recombinant inbred BXD24/TyJ and wild-derived CAST/EiJ mice. Using anterior chamber optical coherence tomography, mice were phenotyped at 10–12 weeks of age, genotyped based on 93 genome-wide SNPs, and subjected to quantitative trait locus (QTL) analysis. In an analysis of ACD among all mice, six loci passed the significance threshold of p = 0.05 and persisted after multiple regression analysis. These were on chromosomes 6, 7, 11, 12, 15 and 17 (named Acdq6, Acdq7, Acdq11, Acdq12, Acdq15, and Acdq17, respectively). Our findings demonstrate a quantitative multi-genic pattern of ACD inheritance in mice and identify six previously unrecognized ACD-influencing loci. We have taken a unique approach to studying the anterior chamber depth phenotype by using mice as genetic tool to examine this continuously distributed trait.

  8. d

    Digital Shoreline Analysis System version 4.3 Transects with Long-Term...

    • catalog.data.gov
    • search.dataone.org
    • +1more
    Updated Nov 18, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Digital Shoreline Analysis System version 4.3 Transects with Long-Term Linear Regression Rate Calculations for southern North Carolina (NCsouth) [Dataset]. https://catalog.data.gov/dataset/digital-shoreline-analysis-system-version-4-3-transects-with-long-term-linear-regression-r
    Explore at:
    Dataset updated
    Nov 18, 2025
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Area covered
    North Carolina
    Description

    Sandy ocean beaches are a popular recreational destination, often surrounded by communities containing valuable real estate. Development is on the rise despite the fact that coastal infrastructure is subjected to flooding and erosion. As a result, there is an increased demand for accurate information regarding past and present shoreline changes. To meet these national needs, the Coastal and Marine Geology Program of the U.S. Geological Survey (USGS) is compiling existing reliable historical shoreline data along open-ocean sandy shores of the conterminous United States and parts of Alaska and Hawaii under the National Assessment of Shoreline Change project. There is no widely accepted standard for analyzing shoreline change. Existing shoreline data measurements and rate calculation methods vary from study to study and prevent combining results into state-wide or regional assessments. The impetus behind the National Assessment project was to develop a standardized method of measuring changes in shoreline position that is consistent from coast to coast. The goal was to facilitate the process of periodically and systematically updating the results in an internally consistent manner.

  9. KC_House Dataset -Linear Regression of Home Prices

    • kaggle.com
    zip
    Updated May 15, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    vikram amin (2023). KC_House Dataset -Linear Regression of Home Prices [Dataset]. https://www.kaggle.com/datasets/vikramamin/kc-house-dataset-home-prices
    Explore at:
    zip(776807 bytes)Available download formats
    Dataset updated
    May 15, 2023
    Authors
    vikram amin
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description
    1. Dataset: House pricing dataset containing 21 columns and 21613 rows.
    2. Programming Language : R
    3. Objective : To predict house prices by creating a model
    4. Steps : A) Import the dataset B) Install and run libraries C) Data Cleaning - Remove Null Values , Change Data Types , Dropping of Columns which are not important D) Data Analysis - (i)Linear Regression Model was used to establish the relationship between the dependent variable (price) and other independent variable (ii) Outliers were identified and removed (iii) Regression model was run once again after removing the outliers (iv) Multiple R- squared was calculated which indicated the independent variables can explain 73% change/ variation in the dependent variable (v) P value was less than that of alpha 0.05 which shows it is statistically significant. (vi) Interpreting the meaning of the results of the coefficients (vii) Checked the assumption of multicollinearity (viii) VIF(Variance inflation factor) was calculated for all the independent variables and their absolute value was found to be less than 5. Hence, there is not threat of multicollinearity and that we can proceed with the independent variables specified.
  10. c

    Data from: Data for Regression Models to Estimate Water Use in Providence,...

    • s.cnmilf.com
    • data.usgs.gov
    • +2more
    Updated Oct 1, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Data for Regression Models to Estimate Water Use in Providence, Rhode Island, 2014-2021 [Dataset]. https://s.cnmilf.com/user74170196/https/catalog.data.gov/dataset/data-for-regression-models-to-estimate-water-use-in-providence-rhode-island-2014-2021
    Explore at:
    Dataset updated
    Oct 1, 2025
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Area covered
    Rhode Island, Providence
    Description

    This data release contains input data and programs (scripts) used to estimate monthly water demand for retail customers of Providence Water, located in Providence, Rhode Island. Explanatory data and model outputs are from July 2014 through June 2021. Models of per capita (for single-family residential customers) or per connection (for multi-family residential, commercial, and industrial customers) water use were developed using multiple linear regression. The dependent variables, provided by Providence Water, are the monthly number of connections and gallons of water delivered to single- and multi-family residential, commercial, and industrial connections. Potential independent variables (from online sources) are climate variables (temperature and precipitation), economic statistics, and a drought statistic. Not all independent variables were used in all of the models. The data are provided in data tables and model files. The data table RIWaterUseVariableExplanation.csv describes the explanatory variables and their data sources. The data table ProvModelInputData.csv provides the monthly water-use data that are the independent variables and the monthly climatic and economic data that are the dependent variables. The data table DroughtInputData.csv provides the weekly U.S. drought monitor index values that were processed to formulate a potential independent variable. The R script model_water_use.R runs the models that predict water use. The other two R scripts (load_preprocess_input_data.R and model_water_use_functions.R) are not run explicitly but are called from the primary script model_water_use.R. Regression equations produced by the models can be used to predict water demand throughout Rhode Island.

  11. m

    Datasets and R Markdown files for the article "Survey on critical results...

    • data.mendeley.com
    Updated Jul 21, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alan Carvalho Dias (2025). Datasets and R Markdown files for the article "Survey on critical results management in Brazilian clinical laboratories: Profiling practices through multivariate analysis and a “New Statistics” approach" submitted to Clinica Chimica Acta [Dataset]. http://doi.org/10.17632/nbntxphxkw.2
    Explore at:
    Dataset updated
    Jul 21, 2025
    Authors
    Alan Carvalho Dias
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    This repository contains supplementary materials related to the study "𝐒𝐮𝐫𝐯𝐞𝐲 𝐨𝐧 𝐜𝐫𝐢𝐭𝐢𝐜𝐚𝐥 𝐫𝐞𝐬𝐮𝐥𝐭𝐬 𝐦𝐚𝐧𝐚𝐠𝐞𝐦𝐞𝐧𝐭 𝐢𝐧 𝐁𝐫𝐚𝐳𝐢𝐥𝐢𝐚𝐧 𝐜𝐥𝐢𝐧𝐢𝐜𝐚𝐥 𝐥𝐚𝐛𝐨𝐫𝐚𝐭𝐨𝐫𝐢𝐞𝐬: 𝐏𝐫𝐨𝐟𝐢𝐥𝐢𝐧𝐠 𝐩𝐫𝐚𝐜𝐭𝐢𝐜𝐞𝐬 𝐭𝐡𝐫𝐨𝐮𝐠𝐡 𝐦𝐮𝐥𝐭𝐢𝐯𝐚𝐫𝐢𝐚𝐭𝐞 𝐚𝐧𝐚𝐥𝐲𝐬𝐢𝐬 𝐚𝐧𝐝 𝐚 '𝐍𝐞𝐰 𝐒𝐭𝐚𝐭𝐢𝐬𝐭𝐢𝐜𝐬' 𝐚𝐩𝐩𝐫𝐨𝐚𝐜𝐡". The dataset, figures, exported results, and analysis scripts are included to ensure full transparency and reproducibility of the research findings.

  12. f

    Multiple regression models through stepwise addition of independent...

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    Updated Feb 21, 2012
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Thompson, Cristiane C.; Cabral, Anderson S.; Valentin, Jean L.; Nascimento, Juliana R.; Thompson, Fabiano L.; Gregoracci, Gustavo B.; Paranhos, Rodolfo (2012). Multiple regression models through stepwise addition of independent environmental variables. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001158155
    Explore at:
    Dataset updated
    Feb 21, 2012
    Authors
    Thompson, Cristiane C.; Cabral, Anderson S.; Valentin, Jean L.; Nascimento, Juliana R.; Thompson, Fabiano L.; Gregoracci, Gustavo B.; Paranhos, Rodolfo
    Description

    Multiple regression models through stepwise addition of independent environmental variables.

  13. Study Hours ,Student Scores for Linear Regression

    • kaggle.com
    Updated Sep 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    douaa bennoune (2024). Study Hours ,Student Scores for Linear Regression [Dataset]. https://www.kaggle.com/datasets/douaabennoune/study-hours-student-scores-for-linear-regression
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 23, 2024
    Dataset provided by
    Kaggle
    Authors
    douaa bennoune
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This dataset contains a collection of 100 randomly generated data points representing the relationship between the number of hours a student spends studying and their corresponding performance, measured as a score. The data has been generated to simulate a real-world scenario where study hours are assumed to influence academic outcomes, making it an excellent resource for linear regression analysis and other machine learning tasks.

    Each row in the dataset consists of:

    Hours: The number of hours a student dedicates to studying, ranging between 0 and 10 hours. Scores: The student's performance score, represented as a percentage, ranging from 0 to 100. Use Cases: This dataset is particularly useful for:

    Linear Regression: Exploring how study hours influence student performance, fitting a regression line to predict scores based on study time. Data Science & Machine Learning: Practicing regression analysis, training models, and applying other predictive algorithms. Educational Research: Simulating data-driven insights into student behavior and performance metrics. Features: 100 rows of data. Continuous numerical variables suitable for regression tasks. Generated for educational purposes, making it ideal for students, teachers, and beginners in machine learning and data science. Potential Applications: Build a linear regression model to predict student scores. Investigate the correlation between study time and performance. Apply data visualization techniques to better understand the data. Use the dataset to experiment with model evaluation metrics like Mean Squared Error (MSE) and R-squared.

  14. E

    Data from: STAD-R Descriptive statistics for experimental designs

    • data.moa.gov.et
    html
    Updated Jan 20, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    CIMMYT Ethiopia (2025). STAD-R Descriptive statistics for experimental designs [Dataset]. https://data.moa.gov.et/dataset/hdl-11529-10853
    Explore at:
    htmlAvailable download formats
    Dataset updated
    Jan 20, 2025
    Dataset provided by
    CIMMYT Ethiopia
    Description

    STAD-R is a set of R programs that performs descriptive statistics, in order to make boxplots and histograms. STAD-R was designed because is necessary before than the thing, check if the dataset have the same number of repetitions, blocks, genotypes, environments, if we have missing values, where and how many, review the distributions and outliers, because is important to be sure that the dataset is complete and have the correct structure for do and other kind of analysis.

  15. f

    A multiple linear regression of the US News and World Report Score and its...

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    Updated Sep 23, 2014
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Goldstein, Jeffrey A.; Prasad, Vinay (2014). A multiple linear regression of the US News and World Report Score and its contributors. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001258514
    Explore at:
    Dataset updated
    Sep 23, 2014
    Authors
    Goldstein, Jeffrey A.; Prasad, Vinay
    Description

    *Standardized units.r-squared = 0.94.A multiple linear regression of the US News and World Report Score and its contributors.

  16. m

    Data from: Persistent B-cell memory after SARS-CoV-2 vaccination is...

    • data.mendeley.com
    Updated Jan 7, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Eva Piano Mortari (2022). Persistent B-cell memory after SARS-CoV-2 vaccination is functional during breakthrough infections. [Dataset]. http://doi.org/10.17632/3yxkps6msr.1
    Explore at:
    Dataset updated
    Jan 7, 2022
    Authors
    Eva Piano Mortari
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    We reported the R code used to study the relationship between variables using a simple linear regression model in the software R (R Core Team (2021). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL https://www.R-project.org/. Accessed 24/09/2021).

  17. d

    Digital Shoreline Analysis System version 4.3 Transects with Long-Term...

    • catalog.data.gov
    • datasets.ai
    • +1more
    Updated Oct 22, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Digital Shoreline Analysis System version 4.3 Transects with Long-Term Linear Regression Rate Calculations for Florida west (FLwest) [Dataset]. https://catalog.data.gov/dataset/digital-shoreline-analysis-system-version-4-3-transects-with-long-term-linear-regression-r-405ba
    Explore at:
    Dataset updated
    Oct 22, 2025
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Area covered
    Florida
    Description

    Sandy ocean beaches are a popular recreational destination, often surrounded by communities containing valuable real estate. Development is on the rise despite the fact that coastal infrastructure is subjected to flooding and erosion. As a result, there is an increased demand for accurate information regarding past and present shoreline changes. To meet these national needs, the Coastal and Marine Geology Program of the U.S. Geological Survey (USGS) is compiling existing reliable historical shoreline data along open-ocean sandy shores of the conterminous United States and parts of Alaska and Hawaii under the National Assessment of Shoreline Change project. There is no widely accepted standard for analyzing shoreline change. Existing shoreline data measurements and rate calculation methods vary from study to study and prevent combining results into state-wide or regional assessments. The impetus behind the National Assessment project was to develop a standardized method of measuring changes in shoreline position that is consistent from coast to coast. The goal was to facilitate the process of periodically and systematically updating the results in an internally consistent manner.

  18. Multiple Linear Regression R2 Values.

    • figshare.com
    • plos.figshare.com
    xls
    Updated Jun 2, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Luis F. H. Basile; João R. Sato; Milkes Y. Alvarenga; Nelson Henrique Jr.; Henrique A. Pasquini; William Alfenas; Sérgio Machado; Bruna Velasques; Pedro Ribeiro; Roberto Piedade; Renato Anghinah; Renato T. Ramos (2023). Multiple Linear Regression R2 Values. [Dataset]. http://doi.org/10.1371/journal.pone.0059595.t002
    Explore at:
    xlsAvailable download formats
    Dataset updated
    Jun 2, 2023
    Dataset provided by
    PLOShttp://plos.org/
    Authors
    Luis F. H. Basile; João R. Sato; Milkes Y. Alvarenga; Nelson Henrique Jr.; Henrique A. Pasquini; William Alfenas; Sérgio Machado; Bruna Velasques; Pedro Ribeiro; Roberto Piedade; Renato Anghinah; Renato T. Ramos
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Examples of adjusted R2 linear regression values obtained for the other subject who presented 9 good ICA components for beta activity.

  19. f

    Multiple regression model summary of market groups and time periods.

    • datasetcatalog.nlm.nih.gov
    • plos.figshare.com
    Updated Nov 20, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Woloszynek, Stephen; Gonder, Mary Katherine; O’Connor, Michael P.; Zhao, Zhengqiao; Mitchell, Dana Venditti; Meñe, Maximiliano Fero; Cronin, Drew T.; Mitchell, Matthew W.; Rosen, Gail R. (2024). Multiple regression model summary of market groups and time periods. [Dataset]. https://datasetcatalog.nlm.nih.gov/dataset?q=0001498848
    Explore at:
    Dataset updated
    Nov 20, 2024
    Authors
    Woloszynek, Stephen; Gonder, Mary Katherine; O’Connor, Michael P.; Zhao, Zhengqiao; Mitchell, Dana Venditti; Meñe, Maximiliano Fero; Cronin, Drew T.; Mitchell, Matthew W.; Rosen, Gail R.
    Description

    Multiple regression model summary of market groups and time periods.

  20. d

    Digital Shoreline Analysis System version 4.2 Transects with Long-Term...

    • catalog.data.gov
    • data.usgs.gov
    • +1more
    Updated Nov 20, 2025
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    U.S. Geological Survey (2025). Digital Shoreline Analysis System version 4.2 Transects with Long-Term Linear Regression Rate Calculations for Washington (WA_transects_LT.shp) [Dataset]. https://catalog.data.gov/dataset/digital-shoreline-analysis-system-version-4-2-transects-with-long-term-linear-regression-r
    Explore at:
    Dataset updated
    Nov 20, 2025
    Dataset provided by
    United States Geological Surveyhttp://www.usgs.gov/
    Area covered
    Washington
    Description

    Sandy ocean beaches are a popular recreational destination, often surrounded by communities containing valuable real estate. Development is on the rise despite the fact that coastal infrastructure is subjected to flooding and erosion. As a result, there is an increased demand for accurate information regarding past and present shoreline changes. To meet these national needs, the Coastal and Marine Geology Program of the U.S. Geological Survey (USGS) is compiling existing reliable historical shoreline data along open-ocean sandy shores of the conterminous United States and parts of Alaska and Hawaii under the National Assessment of Shoreline Change project. There is no widely accepted standard for analyzing shoreline change. Existing shoreline data measurements and rate calculation methods vary from study to study and prevent combining results into state-wide or regional assessments. The impetus behind the National Assessment project was to develop a standardized method of measuring changes in shoreline position that is consistent from coast to coast. The goal was to facilitate the process of periodically and systematically updating the results in an internally consistent manner.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
William Farmer (2024). An example data set for exploration of Multiple Linear Regression [Dataset]. http://doi.org/10.5066/P9T5ZEXV

An example data set for exploration of Multiple Linear Regression

Explore at:
Dataset updated
Feb 24, 2024
Dataset provided by
United States Geological Surveyhttp://www.usgs.gov/
Authors
William Farmer
License

U.S. Government Workshttps://www.usa.gov/government-works
License information was derived automatically

Time period covered
1956 - 2016
Description

This data set contains example data for exploration of the theory of regression based regionalization. The 90th percentile of annual maximum streamflow is provided as an example response variable for 293 streamgages in the conterminous United States. Several explanatory variables are drawn from the GAGES-II data base in order to demonstrate how multiple linear regression is applied. Example scripts demonstrate how to collect the original streamflow data provided and how to recreate the figures from the associated Techniques and Methods chapter.

Search
Clear search
Close search
Google apps
Main menu