100+ datasets found
  1. Linear Models Practice (Linear/Log Regression)

    • kaggle.com
    zip
    Updated Jun 16, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Andrew K (2023). Linear Models Practice (Linear/Log Regression) [Dataset]. https://www.kaggle.com/datasets/kornilovag94/linear-models-practice-linearlog-regression
    Explore at:
    zip(236924 bytes)Available download formats
    Dataset updated
    Jun 16, 2023
    Authors
    Andrew K
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    In order to access the building, one must pass through a gate. To gain entry to the car park, the barrier needs to be opened. To reach a specific floor, a 'tablet' must be utilized. Every entry is recorded, noting the individuals, the dates, and the times of their arrival.

    Is it possible to analyze the available data and ascertain the arrival methods of each office visitor, allowing for predictions? For instance, "User 5 arrives at gate 4 every Monday at 8 am," or "User 18 visits at 11 am on Saturdays, except for the last day of the month." Are there patterns within the current data?

    This is a synthetic learning dataset for linear models' practice.

  2. Synthetic Dataset for Linear Regression

    • kaggle.com
    zip
    Updated Mar 23, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stable Space (2025). Synthetic Dataset for Linear Regression [Dataset]. https://www.kaggle.com/datasets/sharmajicoder/synthetic-dataset-for-linear-regression/data
    Explore at:
    zip(9111 bytes)Available download formats
    Dataset updated
    Mar 23, 2025
    Authors
    Stable Space
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    The file contain dataset with two variables (x & y). The dataset is for Linear regression ML Models. The dataset can be used for Testing purpose. The x variable is the independent variable, and y is the dependent variable. The dataset has a correlation of 0.9981 showing the dataset is best suited for linear models and can be used for the testing purpose.

  3. ESS Linear Models

    • kaggle.com
    zip
    Updated Oct 12, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Oleksiy Kononenko (2023). ESS Linear Models [Dataset]. https://www.kaggle.com/datasets/kononenko/ess-linear-models
    Explore at:
    zip(3534 bytes)Available download formats
    Dataset updated
    Oct 12, 2023
    Authors
    Oleksiy Kononenko
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Oleksiy Kononenko

    Released under Apache 2.0

    Contents

  4. Salary Dataset for Simple Linear regression model

    • kaggle.com
    zip
    Updated Dec 4, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Abhishek Kumar (2023). Salary Dataset for Simple Linear regression model [Dataset]. https://www.kaggle.com/datasets/abhishek121212/salary-dataset-for-simple-linear-regression-model
    Explore at:
    zip(457 bytes)Available download formats
    Dataset updated
    Dec 4, 2023
    Authors
    Abhishek Kumar
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Abhishek Kumar

    Released under Apache 2.0

    Contents

  5. unet-num-2.linear model

    • kaggle.com
    zip
    Updated Aug 26, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Esraa_Fouad777 (2025). unet-num-2.linear model [Dataset]. https://www.kaggle.com/datasets/esraafouad777/unet-num-2-linear-model
    Explore at:
    zip(1288122266 bytes)Available download formats
    Dataset updated
    Aug 26, 2025
    Authors
    Esraa_Fouad777
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Esraa_Fouad777

    Released under Apache 2.0

    Contents

  6. Call Center Simulated Data

    • kaggle.com
    zip
    Updated Mar 28, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pablo Sebastián Campos Ortiz (2023). Call Center Simulated Data [Dataset]. https://www.kaggle.com/datasets/scss17/call-center-simulated-data
    Explore at:
    zip(3098 bytes)Available download formats
    Dataset updated
    Mar 28, 2023
    Authors
    Pablo Sebastián Campos Ortiz
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    The aim of this data set is to be used along with my notebook Linear Regression Notes which provides a guideline for applying correlation analysis and linear regression models from a statistical approach.

    A fictional call center is interested in knowing the relationship between the number of personnel and some variables that measure their performance such as average answer time, average calls per hour, and average time per call. Data were simulated to represent 200 shifts.

  7. XGBR-linear-model.h5

    • kaggle.com
    zip
    Updated Jun 27, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    EESHAKHANZADI (2024). XGBR-linear-model.h5 [Dataset]. https://www.kaggle.com/datasets/eeshakhanzadi/xgbr-linear-model-h5/data
    Explore at:
    zip(7618021 bytes)Available download formats
    Dataset updated
    Jun 27, 2024
    Authors
    EESHAKHANZADI
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by EESHAKHANZADI

    Released under Apache 2.0

    Contents

  8. tfidf model and linear regression model

    • kaggle.com
    zip
    Updated May 15, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Terry guo (2019). tfidf model and linear regression model [Dataset]. https://www.kaggle.com/terry578868722/tfidf-model-and-linear-regression-model
    Explore at:
    zip(11070 bytes)Available download formats
    Dataset updated
    May 15, 2019
    Authors
    Terry guo
    Description

    Context

    There's a story behind every dataset and here's your opportunity to share yours.

    Content

    What's inside is more than just rows and columns. Make it easy for others to get started by describing how you acquired the data and what time period it represents, too.

    Acknowledgements

    We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.

    Inspiration

    Your data will be in front of the world's largest data science community. What questions do you want to see answered?

  9. Predictive analytics: Linear regression model

    • kaggle.com
    zip
    Updated Jun 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yerzat Tursunkulov (2024). Predictive analytics: Linear regression model [Dataset]. https://www.kaggle.com/datasets/yerzattursunkulov/linear-regression-model
    Explore at:
    zip(3488404 bytes)Available download formats
    Dataset updated
    Jun 12, 2024
    Authors
    Yerzat Tursunkulov
    Description

    Dataset

    This dataset was created by Yerzat Tursunkulov

    Contents

  10. Linear Regression Model

    • kaggle.com
    zip
    Updated Jul 24, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Suhail Sajid (2023). Linear Regression Model [Dataset]. https://www.kaggle.com/datasets/shoyoisgenki/admittance
    Explore at:
    zip(47485 bytes)Available download formats
    Dataset updated
    Jul 24, 2023
    Authors
    Suhail Sajid
    Description

    Dataset

    This dataset was created by Suhail Sajid

    Contents

  11. Weight-Height Polynomial Dataset

    • kaggle.com
    zip
    Updated Oct 26, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Muhammad Afaq Bhatti (2024). Weight-Height Polynomial Dataset [Dataset]. https://www.kaggle.com/datasets/mafaqbhatti/weight-height-polynomial-dataset
    Explore at:
    zip(1181 bytes)Available download formats
    Dataset updated
    Oct 26, 2024
    Authors
    Muhammad Afaq Bhatti
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    This dataset represents a synthetic relationship between weight and height, designed to exhibit a non-linear, polynomial trend suitable for polynomial regression analysis. The dataset includes 50 data points where "Weight" is the independent variable and "Height" is the dependent variable. Unlike a simple linear trend, this dataset's pattern follows a curved path, making it ideal for demonstrating polynomial regression models and machine learning techniques that address non-linear relationships. Suggested Applications: Polynomial regression modeling Non-linear data visualization Machine learning algorithm experimentation

  12. Study Hours ,Student Scores for Linear Regression

    • kaggle.com
    Updated Sep 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    douaa bennoune (2024). Study Hours ,Student Scores for Linear Regression [Dataset]. https://www.kaggle.com/datasets/douaabennoune/study-hours-student-scores-for-linear-regression
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 23, 2024
    Dataset provided by
    Kaggle
    Authors
    douaa bennoune
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    This dataset contains a collection of 100 randomly generated data points representing the relationship between the number of hours a student spends studying and their corresponding performance, measured as a score. The data has been generated to simulate a real-world scenario where study hours are assumed to influence academic outcomes, making it an excellent resource for linear regression analysis and other machine learning tasks.

    Each row in the dataset consists of:

    Hours: The number of hours a student dedicates to studying, ranging between 0 and 10 hours. Scores: The student's performance score, represented as a percentage, ranging from 0 to 100. Use Cases: This dataset is particularly useful for:

    Linear Regression: Exploring how study hours influence student performance, fitting a regression line to predict scores based on study time. Data Science & Machine Learning: Practicing regression analysis, training models, and applying other predictive algorithms. Educational Research: Simulating data-driven insights into student behavior and performance metrics. Features: 100 rows of data. Continuous numerical variables suitable for regression tasks. Generated for educational purposes, making it ideal for students, teachers, and beginners in machine learning and data science. Potential Applications: Build a linear regression model to predict student scores. Investigate the correlation between study time and performance. Apply data visualization techniques to better understand the data. Use the dataset to experiment with model evaluation metrics like Mean Squared Error (MSE) and R-squared.

  13. Dataset for demonstrating simple linear Regression

    • kaggle.com
    zip
    Updated Jul 3, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Aaditya Gupta (2024). Dataset for demonstrating simple linear Regression [Dataset]. https://www.kaggle.com/datasets/aadityagupta11/data-for-demonstrating-basic-linear-regression
    Explore at:
    zip(2132 bytes)Available download formats
    Dataset updated
    Jul 3, 2024
    Authors
    Aaditya Gupta
    Description

    This dataset has been created to demonstrate the use of a simple linear regression model. It includes two variables: an independent variable and a dependent variable. The data can be used for training, testing, and validating a simple linear regression model, making it ideal for educational purposes, tutorials, and basic predictive analysis projects. The dataset consists of 100 observations with no missing values, and it follows a linear relationship

  14. Energy Consumption Dataset - Linear Regression

    • kaggle.com
    Updated Jan 6, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    GOVINDARAM SRIRAM (2025). Energy Consumption Dataset - Linear Regression [Dataset]. https://www.kaggle.com/datasets/govindaramsriram/energy-consumption-dataset-linear-regression
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 6, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    GOVINDARAM SRIRAM
    License

    Attribution 4.0 (CC BY 4.0)https://creativecommons.org/licenses/by/4.0/
    License information was derived automatically

    Description

    Description: This dataset is designed for predicting energy consumption based on various building features and environmental factors. It contains data for multiple building types, square footage, the number of occupants, appliances used, average temperature, and the day of the week. The goal is to build a predictive model to estimate energy consumption using these attributes.

    The dataset can be used for training machine learning models such as linear regression to forecast energy needs based on the building's characteristics. This is useful for understanding energy demand patterns and optimizing energy consumption in different building types and environmental conditions.

  15. Linear-Regression

    • kaggle.com
    zip
    Updated Oct 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    COLD (2025). Linear-Regression [Dataset]. https://www.kaggle.com/datasets/maulikgajera/linear-regression/discussion?sort=undefined
    Explore at:
    zip(7223 bytes)Available download formats
    Dataset updated
    Oct 2, 2025
    Authors
    COLD
    Description

    Based on the file snippets, your dataset, "Advertising.csv," is a classic dataset used for marketing and data analysis. It explores the relationship between advertising budgets and product sales.

    Here is a breakdown of the columns:

    • TV: This column represents the advertising budget (likely in thousands of dollars) spent on television commercials.
    • Radio: This column shows the advertising budget allocated to radio broadcasts.
    • Newspaper: This column contains the advertising budget for newspaper ads.
    • Sales: This is your target variable. It represents the sales of a product (likely in thousands of units) for a given advertising budget.

    In essence, this dataset is designed to help you answer questions like:

    • Which advertising channel (TV, Radio, or Newspaper) is the most effective at driving sales?
    • How does spending on each channel affect overall sales?
    • Can we predict future sales based on a planned advertising budget?
    This dataset is frequently used to practice and demonstrate concepts in linear regression and marketing mix modeling.
  16. House Price Regression Dataset

    • kaggle.com
    zip
    Updated Sep 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Prokshitha Polemoni (2024). House Price Regression Dataset [Dataset]. https://www.kaggle.com/datasets/prokshitha/home-value-insights
    Explore at:
    zip(27045 bytes)Available download formats
    Dataset updated
    Sep 6, 2024
    Authors
    Prokshitha Polemoni
    Description

    Home Value Insights: A Beginner's Regression Dataset

    This dataset is designed for beginners to practice regression problems, particularly in the context of predicting house prices. It contains 1000 rows, with each row representing a house and various attributes that influence its price. The dataset is well-suited for learning basic to intermediate-level regression modeling techniques.

    Features:

    1. Square_Footage: The size of the house in square feet. Larger homes typically have higher prices.
    2. Num_Bedrooms: The number of bedrooms in the house. More bedrooms generally increase the value of a home.
    3. Num_Bathrooms: The number of bathrooms in the house. Houses with more bathrooms are typically priced higher.
    4. Year_Built: The year the house was built. Older houses may be priced lower due to wear and tear.
    5. Lot_Size: The size of the lot the house is built on, measured in acres. Larger lots tend to add value to a property.
    6. Garage_Size: The number of cars that can fit in the garage. Houses with larger garages are usually more expensive.
    7. Neighborhood_Quality: A rating of the neighborhood’s quality on a scale of 1-10, where 10 indicates a high-quality neighborhood. Better neighborhoods usually command higher prices.
    8. House_Price (Target Variable): The price of the house, which is the dependent variable you aim to predict.

    Potential Uses:

    1. Beginner Regression Projects: This dataset can be used to practice building regression models such as Linear Regression, Decision Trees, or Random Forests. The target variable (house price) is continuous, making this an ideal problem for supervised learning techniques.

    2. Feature Engineering Practice: Learners can create new features by combining existing ones, such as the price per square foot or age of the house, providing an opportunity to experiment with feature transformations.

    3. Exploratory Data Analysis (EDA): You can explore how different features (e.g., square footage, number of bedrooms) correlate with the target variable, making it a great dataset for learning about data visualization and summary statistics.

    4. Model Evaluation: The dataset allows for various model evaluation techniques such as cross-validation, R-squared, and Mean Absolute Error (MAE). These metrics can be used to compare the effectiveness of different models.

    Versatility:

    • The dataset is highly versatile for a range of machine learning tasks. You can apply simple linear models to predict house prices based on one or two features, or use more complex models like Random Forest or Gradient Boosting Machines to understand interactions between variables.

    • It can also be used for dimensionality reduction techniques like PCA or to practice handling categorical variables (e.g., neighborhood quality) through encoding techniques like one-hot encoding.

    • This dataset is ideal for anyone wanting to gain practical experience in building regression models while working with real-world features.

  17. ML model practice (linear regression )

    • kaggle.com
    zip
    Updated Dec 19, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shubham Singh (2022). ML model practice (linear regression ) [Dataset]. https://www.kaggle.com/datasets/shubhamsingh57/ml-model-practice-linear-regression
    Explore at:
    zip(49292 bytes)Available download formats
    Dataset updated
    Dec 19, 2022
    Authors
    Shubham Singh
    Description

    Dataset

    This dataset was created by Shubham Singh

    Contents

  18. Hosuing Data set

    • kaggle.com
    zip
    Updated Nov 27, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    RakeshSk (2017). Hosuing Data set [Dataset]. https://www.kaggle.com/skrakesh5/hosuing-data-set
    Explore at:
    zip(4748 bytes)Available download formats
    Dataset updated
    Nov 27, 2017
    Authors
    RakeshSk
    Description

    Dataset

    This dataset was created by RakeshSk

    Contents

  19. Advance Store Sales by Time Series

    • kaggle.com
    zip
    Updated Jul 7, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    xodeum (2022). Advance Store Sales by Time Series [Dataset]. https://www.kaggle.com/datasets/xodeum/advance-store-sales-by-time-series/code
    Explore at:
    zip(78584 bytes)Available download formats
    Dataset updated
    Jul 7, 2022
    Authors
    xodeum
    Description

    I simply motivated by the competition posted by Kaggle. This Dataset has beautiful visualization created with the help of matplotlib and Seaborn. I also used the pandas and Numpy

  20. Simulation Linear Regression

    • kaggle.com
    zip
    Updated Dec 17, 2016
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ricky (2016). Simulation Linear Regression [Dataset]. https://www.kaggle.com/zurfer/simulation-linear-regression
    Explore at:
    zip(405 bytes)Available download formats
    Dataset updated
    Dec 17, 2016
    Authors
    Ricky
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    There's a story behind every dataset and here's your opportunity to share yours.

    Content

    What's inside is more than just rows and columns. Make it easy for others to get started by describing how you acquired the data and what time period it represents, too.

    Acknowledgements

    We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.

    Inspiration

    Your data will be in front of the world's largest data science community. What questions do you want to see answered?

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Andrew K (2023). Linear Models Practice (Linear/Log Regression) [Dataset]. https://www.kaggle.com/datasets/kornilovag94/linear-models-practice-linearlog-regression
Organization logo

Linear Models Practice (Linear/Log Regression)

This is a synthetic learning dataset for linear models' practice.

Explore at:
zip(236924 bytes)Available download formats
Dataset updated
Jun 16, 2023
Authors
Andrew K
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

In order to access the building, one must pass through a gate. To gain entry to the car park, the barrier needs to be opened. To reach a specific floor, a 'tablet' must be utilized. Every entry is recorded, noting the individuals, the dates, and the times of their arrival.

Is it possible to analyze the available data and ascertain the arrival methods of each office visitor, allowing for predictions? For instance, "User 5 arrives at gate 4 every Monday at 8 am," or "User 18 visits at 11 am on Saturdays, except for the last day of the month." Are there patterns within the current data?

This is a synthetic learning dataset for linear models' practice.

Search
Clear search
Close search
Google apps
Main menu