100+ datasets found

Linear Models Practice (Linear/Log Regression)
kaggle.com
zip
Updated Jun 16, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Andrew K (2023). Linear Models Practice (Linear/Log Regression) [Dataset]. https://www.kaggle.com/datasets/kornilovag94/linear-models-practice-linearlog-regression
Explore at:
zip(236924 bytes)Available download formats
Dataset updated
Jun 16, 2023
Authors
Andrew K
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
In order to access the building, one must pass through a gate. To gain entry to the car park, the barrier needs to be opened. To reach a specific floor, a 'tablet' must be utilized. Every entry is recorded, noting the individuals, the dates, and the times of their arrival.

Is it possible to analyze the available data and ascertain the arrival methods of each office visitor, allowing for predictions? For instance, "User 5 arrives at gate 4 every Monday at 8 am," or "User 18 visits at 11 am on Saturdays, except for the last day of the month." Are there patterns within the current data?

This is a synthetic learning dataset for linear models' practice.
Synthetic Dataset for Linear Regression
kaggle.com
zip
Updated Mar 23, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Stable Space (2025). Synthetic Dataset for Linear Regression [Dataset]. https://www.kaggle.com/datasets/sharmajicoder/synthetic-dataset-for-linear-regression/data
Explore at:
zip(9111 bytes)Available download formats
Dataset updated
Mar 23, 2025
Authors
Stable Space
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
The file contain dataset with two variables (x & y). The dataset is for Linear regression ML Models. The dataset can be used for Testing purpose. The x variable is the independent variable, and y is the dependent variable. The dataset has a correlation of 0.9981 showing the dataset is best suited for linear models and can be used for the testing purpose.
ESS Linear Models
kaggle.com
zip
Updated Oct 12, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Oleksiy Kononenko (2023). ESS Linear Models [Dataset]. https://www.kaggle.com/datasets/kononenko/ess-linear-models
Explore at:
zip(3534 bytes)Available download formats
Dataset updated
Oct 12, 2023
Authors
Oleksiy Kononenko
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by Oleksiy Kononenko

Released under Apache 2.0

Contents
Salary Dataset for Simple Linear regression model
kaggle.com
zip
Updated Dec 4, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Abhishek Kumar (2023). Salary Dataset for Simple Linear regression model [Dataset]. https://www.kaggle.com/datasets/abhishek121212/salary-dataset-for-simple-linear-regression-model
Explore at:
zip(457 bytes)Available download formats
Dataset updated
Dec 4, 2023
Authors
Abhishek Kumar
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by Abhishek Kumar

Released under Apache 2.0

Contents
unet-num-2.linear model
kaggle.com
zip
Updated Aug 26, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Esraa_Fouad777 (2025). unet-num-2.linear model [Dataset]. https://www.kaggle.com/datasets/esraafouad777/unet-num-2-linear-model
Explore at:
zip(1288122266 bytes)Available download formats
Dataset updated
Aug 26, 2025
Authors
Esraa_Fouad777
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by Esraa_Fouad777

Released under Apache 2.0

Contents
Call Center Simulated Data
kaggle.com
zip
Updated Mar 28, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Pablo Sebastián Campos Ortiz (2023). Call Center Simulated Data [Dataset]. https://www.kaggle.com/datasets/scss17/call-center-simulated-data
Explore at:
zip(3098 bytes)Available download formats
Dataset updated
Mar 28, 2023
Authors
Pablo Sebastián Campos Ortiz
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
The aim of this data set is to be used along with my notebook Linear Regression Notes which provides a guideline for applying correlation analysis and linear regression models from a statistical approach.

A fictional call center is interested in knowing the relationship between the number of personnel and some variables that measure their performance such as average answer time, average calls per hour, and average time per call. Data were simulated to represent 200 shifts.
XGBR-linear-model.h5
kaggle.com
zip
Updated Jun 27, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
EESHAKHANZADI (2024). XGBR-linear-model.h5 [Dataset]. https://www.kaggle.com/datasets/eeshakhanzadi/xgbr-linear-model-h5/data
Explore at:
zip(7618021 bytes)Available download formats
Dataset updated
Jun 27, 2024
Authors
EESHAKHANZADI
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by EESHAKHANZADI

Released under Apache 2.0

Contents
tfidf model and linear regression model
kaggle.com
zip
Updated May 15, 2019
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Terry guo (2019). tfidf model and linear regression model [Dataset]. https://www.kaggle.com/terry578868722/tfidf-model-and-linear-regression-model
Explore at:
zip(11070 bytes)Available download formats
Dataset updated
May 15, 2019
Authors
Terry guo
Description
Context

There's a story behind every dataset and here's your opportunity to share yours.

Content

What's inside is more than just rows and columns. Make it easy for others to get started by describing how you acquired the data and what time period it represents, too.

Acknowledgements

We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.

Inspiration

Your data will be in front of the world's largest data science community. What questions do you want to see answered?
Predictive analytics: Linear regression model
kaggle.com
zip
Updated Jun 12, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Yerzat Tursunkulov (2024). Predictive analytics: Linear regression model [Dataset]. https://www.kaggle.com/datasets/yerzattursunkulov/linear-regression-model
Explore at:
zip(3488404 bytes)Available download formats
Dataset updated
Jun 12, 2024
Authors
Yerzat Tursunkulov
Description
Dataset

This dataset was created by Yerzat Tursunkulov

Contents
Linear Regression Model
kaggle.com
zip
Updated Jul 24, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Suhail Sajid (2023). Linear Regression Model [Dataset]. https://www.kaggle.com/datasets/shoyoisgenki/admittance
Explore at:
zip(47485 bytes)Available download formats
Dataset updated
Jul 24, 2023
Authors
Suhail Sajid
Description
Dataset

This dataset was created by Suhail Sajid

Contents
Weight-Height Polynomial Dataset
kaggle.com
zip
Updated Oct 26, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Muhammad Afaq Bhatti (2024). Weight-Height Polynomial Dataset [Dataset]. https://www.kaggle.com/datasets/mafaqbhatti/weight-height-polynomial-dataset
Explore at:
zip(1181 bytes)Available download formats
Dataset updated
Oct 26, 2024
Authors
Muhammad Afaq Bhatti
License
Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
License information was derived automatically
Description
This dataset represents a synthetic relationship between weight and height, designed to exhibit a non-linear, polynomial trend suitable for polynomial regression analysis. The dataset includes 50 data points where "Weight" is the independent variable and "Height" is the dependent variable. Unlike a simple linear trend, this dataset's pattern follows a curved path, making it ideal for demonstrating polynomial regression models and machine learning techniques that address non-linear relationships. Suggested Applications: Polynomial regression modeling Non-linear data visualization Machine learning algorithm experimentation
Study Hours ,Student Scores for Linear Regression
kaggle.com
Updated Sep 23, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
douaa bennoune (2024). Study Hours ,Student Scores for Linear Regression [Dataset]. https://www.kaggle.com/datasets/douaabennoune/study-hours-student-scores-for-linear-regression
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Sep 23, 2024
Dataset provided by
Kaggle
Authors
douaa bennoune
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
This dataset contains a collection of 100 randomly generated data points representing the relationship between the number of hours a student spends studying and their corresponding performance, measured as a score. The data has been generated to simulate a real-world scenario where study hours are assumed to influence academic outcomes, making it an excellent resource for linear regression analysis and other machine learning tasks.

Each row in the dataset consists of:

Hours: The number of hours a student dedicates to studying, ranging between 0 and 10 hours. Scores: The student's performance score, represented as a percentage, ranging from 0 to 100. Use Cases: This dataset is particularly useful for:

Linear Regression: Exploring how study hours influence student performance, fitting a regression line to predict scores based on study time. Data Science & Machine Learning: Practicing regression analysis, training models, and applying other predictive algorithms. Educational Research: Simulating data-driven insights into student behavior and performance metrics. Features: 100 rows of data. Continuous numerical variables suitable for regression tasks. Generated for educational purposes, making it ideal for students, teachers, and beginners in machine learning and data science. Potential Applications: Build a linear regression model to predict student scores. Investigate the correlation between study time and performance. Apply data visualization techniques to better understand the data. Use the dataset to experiment with model evaluation metrics like Mean Squared Error (MSE) and R-squared.
Dataset for demonstrating simple linear Regression
kaggle.com
zip
Updated Jul 3, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Aaditya Gupta (2024). Dataset for demonstrating simple linear Regression [Dataset]. https://www.kaggle.com/datasets/aadityagupta11/data-for-demonstrating-basic-linear-regression
Explore at:
zip(2132 bytes)Available download formats
Dataset updated
Jul 3, 2024
Authors
Aaditya Gupta
Description
This dataset has been created to demonstrate the use of a simple linear regression model. It includes two variables: an independent variable and a dependent variable. The data can be used for training, testing, and validating a simple linear regression model, making it ideal for educational purposes, tutorials, and basic predictive analysis projects. The dataset consists of 100 observations with no missing values, and it follows a linear relationship
Energy Consumption Dataset - Linear Regression
kaggle.com
Updated Jan 6, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
GOVINDARAM SRIRAM (2025). Energy Consumption Dataset - Linear Regression [Dataset]. https://www.kaggle.com/datasets/govindaramsriram/energy-consumption-dataset-linear-regression
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jan 6, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
GOVINDARAM SRIRAM
License
Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
License information was derived automatically
Description
Description: This dataset is designed for predicting energy consumption based on various building features and environmental factors. It contains data for multiple building types, square footage, the number of occupants, appliances used, average temperature, and the day of the week. The goal is to build a predictive model to estimate energy consumption using these attributes.

The dataset can be used for training machine learning models such as linear regression to forecast energy needs based on the building's characteristics. This is useful for understanding energy demand patterns and optimizing energy consumption in different building types and environmental conditions.
Linear-Regression
kaggle.com
zip
Updated Oct 2, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
COLD (2025). Linear-Regression [Dataset]. https://www.kaggle.com/datasets/maulikgajera/linear-regression/discussion?sort=undefined
Explore at:
zip(7223 bytes)Available download formats
Dataset updated
Oct 2, 2025
Authors
COLD
Description
Based on the file snippets, your dataset, "Advertising.csv," is a classic dataset used for marketing and data analysis. It explores the relationship between advertising budgets and product sales.

Here is a breakdown of the columns:

TV: This column represents the advertising budget (likely in thousands of dollars) spent on television commercials.

Radio: This column shows the advertising budget allocated to radio broadcasts.

Newspaper: This column contains the advertising budget for newspaper ads.

Sales: This is your target variable. It represents the sales of a product (likely in thousands of units) for a given advertising budget.

In essence, this dataset is designed to help you answer questions like:

Which advertising channel (TV, Radio, or Newspaper) is the most effective at driving sales?

How does spending on each channel affect overall sales?

Can we predict future sales based on a planned advertising budget?

This dataset is frequently used to practice and demonstrate concepts in linear regression and marketing mix modeling.
House Price Regression Dataset
kaggle.com
zip
Updated Sep 6, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Prokshitha Polemoni (2024). House Price Regression Dataset [Dataset]. https://www.kaggle.com/datasets/prokshitha/home-value-insights
Explore at:
zip(27045 bytes)Available download formats
Dataset updated
Sep 6, 2024
Authors
Prokshitha Polemoni
Description
Home Value Insights: A Beginner's Regression Dataset

This dataset is designed for beginners to practice regression problems, particularly in the context of predicting house prices. It contains 1000 rows, with each row representing a house and various attributes that influence its price. The dataset is well-suited for learning basic to intermediate-level regression modeling techniques.

Features:

Square_Footage: The size of the house in square feet. Larger homes typically have higher prices.

Num_Bedrooms: The number of bedrooms in the house. More bedrooms generally increase the value of a home.

Num_Bathrooms: The number of bathrooms in the house. Houses with more bathrooms are typically priced higher.

Year_Built: The year the house was built. Older houses may be priced lower due to wear and tear.

Lot_Size: The size of the lot the house is built on, measured in acres. Larger lots tend to add value to a property.

Garage_Size: The number of cars that can fit in the garage. Houses with larger garages are usually more expensive.

Neighborhood_Quality: A rating of the neighborhood’s quality on a scale of 1-10, where 10 indicates a high-quality neighborhood. Better neighborhoods usually command higher prices.

House_Price (Target Variable): The price of the house, which is the dependent variable you aim to predict.

Potential Uses:

Beginner Regression Projects: This dataset can be used to practice building regression models such as Linear Regression, Decision Trees, or Random Forests. The target variable (house price) is continuous, making this an ideal problem for supervised learning techniques.

Feature Engineering Practice: Learners can create new features by combining existing ones, such as the price per square foot or age of the house, providing an opportunity to experiment with feature transformations.

Exploratory Data Analysis (EDA): You can explore how different features (e.g., square footage, number of bedrooms) correlate with the target variable, making it a great dataset for learning about data visualization and summary statistics.

Model Evaluation: The dataset allows for various model evaluation techniques such as cross-validation, R-squared, and Mean Absolute Error (MAE). These metrics can be used to compare the effectiveness of different models.

Versatility:

The dataset is highly versatile for a range of machine learning tasks. You can apply simple linear models to predict house prices based on one or two features, or use more complex models like Random Forest or Gradient Boosting Machines to understand interactions between variables.

It can also be used for dimensionality reduction techniques like PCA or to practice handling categorical variables (e.g., neighborhood quality) through encoding techniques like one-hot encoding.

This dataset is ideal for anyone wanting to gain practical experience in building regression models while working with real-world features.
ML model practice (linear regression )
kaggle.com
zip
Updated Dec 19, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shubham Singh (2022). ML model practice (linear regression ) [Dataset]. https://www.kaggle.com/datasets/shubhamsingh57/ml-model-practice-linear-regression
Explore at:
zip(49292 bytes)Available download formats
Dataset updated
Dec 19, 2022
Authors
Shubham Singh
Description
Dataset

This dataset was created by Shubham Singh

Contents
Hosuing Data set
kaggle.com
zip
Updated Nov 27, 2017
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
RakeshSk (2017). Hosuing Data set [Dataset]. https://www.kaggle.com/skrakesh5/hosuing-data-set
Explore at:
zip(4748 bytes)Available download formats
Dataset updated
Nov 27, 2017
Authors
RakeshSk
Description
Dataset

This dataset was created by RakeshSk

Contents
Advance Store Sales by Time Series
kaggle.com
zip
Updated Jul 7, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
xodeum (2022). Advance Store Sales by Time Series [Dataset]. https://www.kaggle.com/datasets/xodeum/advance-store-sales-by-time-series/code
Explore at:
zip(78584 bytes)Available download formats
Dataset updated
Jul 7, 2022
Authors
xodeum
Description
I simply motivated by the competition posted by Kaggle. This Dataset has beautiful visualization created with the help of matplotlib and Seaborn. I also used the pandas and Numpy
Simulation Linear Regression
kaggle.com
zip
Updated Dec 17, 2016
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ricky (2016). Simulation Linear Regression [Dataset]. https://www.kaggle.com/zurfer/simulation-linear-regression
Explore at:
zip(405 bytes)Available download formats
Dataset updated
Dec 17, 2016
Authors
Ricky
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Context

There's a story behind every dataset and here's your opportunity to share yours.

Content

What's inside is more than just rows and columns. Make it easy for others to get started by describing how you acquired the data and what time period it represents, too.

Acknowledgements

We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.

Inspiration

Your data will be in front of the world's largest data science community. What questions do you want to see answered?

Facebook

Twitter

Click to copy link

Link copied

Cite

Andrew K (2023). Linear Models Practice (Linear/Log Regression) [Dataset]. https://www.kaggle.com/datasets/kornilovag94/linear-models-practice-linearlog-regression

Linear Models Practice (Linear/Log Regression)

This is a synthetic learning dataset for linear models' practice.

Explore at:

zip(236924 bytes)Available download formats

Dataset updated

Jun 16, 2023

Authors

Andrew K

License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

In order to access the building, one must pass through a gate. To gain entry to the car park, the barrier needs to be opened. To reach a specific floor, a 'tablet' must be utilized. Every entry is recorded, noting the individuals, the dates, and the times of their arrival.

Is it possible to analyze the available data and ascertain the arrival methods of each office visitor, allowing for predictions? For instance, "User 5 arrives at gate 4 every Monday at 8 am," or "User 18 visits at 11 am on Saturdays, except for the last day of the month." Are there patterns within the current data?

This is a synthetic learning dataset for linear models' practice.

Clear search

Close search

Google apps

Main menu

Linear Models Practice (Linear/Log Regression)

Synthetic Dataset for Linear Regression

ESS Linear Models

Dataset

Contents

Salary Dataset for Simple Linear regression model

Dataset

Contents

unet-num-2.linear model

Dataset

Contents

Call Center Simulated Data

XGBR-linear-model.h5

Dataset

Contents

tfidf model and linear regression model

Context

Content

Acknowledgements

Inspiration

Predictive analytics: Linear regression model

Dataset

Contents

Linear Regression Model

Dataset

Contents

Weight-Height Polynomial Dataset

Study Hours ,Student Scores for Linear Regression

Dataset for demonstrating simple linear Regression

Energy Consumption Dataset - Linear Regression

Linear-Regression

This dataset is frequently used to practice and demonstrate concepts in linear regression and marketing mix modeling.

House Price Regression Dataset

Home Value Insights: A Beginner's Regression Dataset

Features:

Potential Uses:

Versatility:

ML model practice (linear regression )

Dataset

Contents

Hosuing Data set

Dataset

Contents

Advance Store Sales by Time Series

Simulation Linear Regression

Context

Content

Acknowledgements

Inspiration

Linear Models Practice (Linear/Log Regression)

This is a synthetic learning dataset for linear models' practice.