100+ datasets found
  1. Hello world

    • kaggle.com
    zip
    Updated Aug 1, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pashito (2022). Hello world [Dataset]. https://www.kaggle.com/pashito/hello-world
    Explore at:
    zip(365735 bytes)Available download formats
    Dataset updated
    Aug 1, 2022
    Authors
    Pashito
    Area covered
    World
    Description

    Beginner projects made with help of YouTube tutorials and Kaggle.

  2. Sales Data Analysis Project

    • kaggle.com
    zip
    Updated Jun 1, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Stina Tonia (2024). Sales Data Analysis Project [Dataset]. https://www.kaggle.com/datasets/stinatonia/2019-project-on-sales
    Explore at:
    zip(3818151 bytes)Available download formats
    Dataset updated
    Jun 1, 2024
    Authors
    Stina Tonia
    Description

    This project was done to analyze sales data: to identify trends, top-selling products, and revenue metrics for business decision-making. I did this project offered by MeriSKILL, to learn more and be exposed to real-world projects and challenges that will provide me with valuable industry experience and help me develop my data analytical skills.https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F20837845%2Fe3561db319392bf9cc8b7d3fcc7ed94d%2F2019%20Sales%20dashboard.png?generation=1717273572595587&alt=media" alt=""> More on this project is on Medium

  3. To-Do-List Project

    • kaggle.com
    zip
    Updated Aug 19, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Santtosh Muniyandy (2023). To-Do-List Project [Dataset]. https://www.kaggle.com/datasets/santtoshmuniyandy/to-do-list-project
    Explore at:
    zip(43156 bytes)Available download formats
    Dataset updated
    Aug 19, 2023
    Authors
    Santtosh Muniyandy
    Description

    Hi, i just finished my second project to improve my coding skills which is a simple to-do-list program. It runs perfectly on my visual studios, although it's quite problematic to run the output in Kaggle. As a beginner, i used the file input output, looping, string-formatting, conditions and much more information learned and implemented in it. I hope i can receive ideas or opinions on how to improve it hehe. Please take your time to use it at your leisure.

  4. Machine Learning Projects

    • kaggle.com
    zip
    Updated Apr 18, 2024
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Deeksha3@ (2024). Machine Learning Projects [Dataset]. https://www.kaggle.com/datasets/deekshaa1/machine-learning-projects
    Explore at:
    zip(43093 bytes)Available download formats
    Dataset updated
    Apr 18, 2024
    Authors
    Deeksha3@
    Description

    Dataset

    This dataset was created by Deeksha3@

    Contents

  5. Analysis of small businesses in Michigan

    • kaggle.com
    zip
    Updated Oct 12, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Maooz Abdullah (2024). Analysis of small businesses in Michigan [Dataset]. https://www.kaggle.com/datasets/maoozabdullah/analysis-of-small-businesses-in-michigan
    Explore at:
    zip(334456 bytes)Available download formats
    Dataset updated
    Oct 12, 2024
    Authors
    Maooz Abdullah
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Area covered
    Michigan
    Description

    The objective of this report is to analyze the role of small businesses in the Michigan job market using the provided dataset. We aim to understand the impact of small businesses on employment, sales, and other economic factors. This analysis will help in identifying trends and patterns that can inform policy decisions and support for small businesses.

  6. OpenStreetMap Data - North Bangalore, India

    • kaggle.com
    zip
    Updated Jun 13, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Priya_ds (2017). OpenStreetMap Data - North Bangalore, India [Dataset]. https://www.kaggle.com/priya2908/openstreetmap-data-north-bangalore-india
    Explore at:
    zip(15364108 bytes)Available download formats
    Dataset updated
    Jun 13, 2017
    Authors
    Priya_ds
    Area covered
    Bangalore North, India, Bengaluru
    Description

    Dataset

    This dataset was created by Priya_ds

    Released under Other (specified in description)

    Contents

    a

  7. Store Data Analysis using MS excel

    • kaggle.com
    zip
    Updated Mar 10, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NisshaaChoudhary (2024). Store Data Analysis using MS excel [Dataset]. https://www.kaggle.com/datasets/nisshaachoudhary/store-data-analysis-using-ms-excel/discussion
    Explore at:
    zip(13048217 bytes)Available download formats
    Dataset updated
    Mar 10, 2024
    Authors
    NisshaaChoudhary
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Vrinda Store: Interactive Ms Excel dashboardVrinda Store: Interactive Ms Excel dashboard Feb 2024 - Mar 2024Feb 2024 - Mar 2024 The owner of Vrinda store wants to create an annual sales report for 2022. So that their employees can understand their customers and grow more sales further. Questions asked by Owner of Vrinda store are as follows:- 1) Compare the sales and orders using single chart. 2) Which month got the highest sales and orders? 3) Who purchased more - women per men in 2022? 4) What are different order status in 2022?

    And some other questions related to business. The owner of Vrinda store wanted a visual story of their data. Which can depict all the real time progress and sales insight of the store. This project is a Ms Excel dashboard which presents an interactive visual story to help the Owner and employees in increasing their sales. Task performed : Data cleaning, Data processing, Data analysis, Data visualization, Report. Tool used : Ms Excel The owner of Vrinda store wants to create an annual sales report for 2022. So that their employees can understand their customers and grow more sales further. Questions asked by Owner of Vrinda store are as follows:- 1) Compare the sales and orders using single chart. 2) Which month got the highest sales and orders? 3) Who purchased more - women per men in 2022? 4) What are different order status in 2022? And some other questions related to business. The owner of Vrinda store wanted a visual story of their data. Which can depict all the real time progress and sales insight of the store. This project is a Ms Excel dashboard which presents an interactive visual story to help the Owner and employees in increasing their sales. Task performed : Data cleaning, Data processing, Data analysis, Data visualization, Report. Tool used : Ms Excel Skills: Data Analysis · Data Analytics · ms excel · Pivot Tables

  8. Data Science Tutorial for Beginners

    • kaggle.com
    zip
    Updated Jun 4, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Naimul Hasan Shadesh (2024). Data Science Tutorial for Beginners [Dataset]. https://www.kaggle.com/datasets/shadesh/data-science-tutorial-for-beginners/code
    Explore at:
    zip(848077 bytes)Available download formats
    Dataset updated
    Jun 4, 2024
    Authors
    Naimul Hasan Shadesh
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Naimul Hasan Shadesh

    Released under Apache 2.0

    Contents

  9. Electronics Project(2600+ projects)

    • kaggle.com
    zip
    Updated Nov 13, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    NICK-2908 (2025). Electronics Project(2600+ projects) [Dataset]. https://www.kaggle.com/datasets/nick2908/electronics-project2600-projects
    Explore at:
    zip(274002 bytes)Available download formats
    Dataset updated
    Nov 13, 2025
    Authors
    NICK-2908
    Description

    **Summary ** This dataset contains over 2,600 circuit projects scraped from Instructables, focusing on the "Circuits" category. It includes project titles, authors, engagement metrics (views, likes), and the primary component used (Instruments).

    ** How This Data Was Collected**

    I built a web scraper using Python and Selenium to gather all project links (over 2,600 of them) by handling the "Load All" button. The full page source was saved, and I then used BeautifulSoup to parse the HTML and extract the raw data for each project.

    Data Cleaning (The Important Part!)

    The raw data was very messy. I performed a full data cleaning pipeline in a Colab notebook using Pandas.

    • Converted Text to Numbers: Views and Likes were text fields (object).
    • Handled "K" Values: Found and converted "K" values (e.g., "2.2K") into proper numbers (2200).
    • Handled Missing Data: Replaced all "N/A" strings with null values.
    • Mean Imputation: To keep the dataset complete, I filled all missing Likes and Views with the mean (average) of the respective column.

    Key Insights & Analysis

    1. "Viral" Effect (High Skew): The Views and Likes data is highly right-skewed (skewness of ~9.5). This shows a "viral" effect where a tiny number of superstar projects get the vast majority of all views and likes.

    [](url)

    1. Log-Transformation: Because of the skew, I created log_Views and log_Likes columns. A 2D density plot of these log-transformed columns shows a strong positive correlation (as likes increase, views increase) and that the most "typical" project gets around 30-40 likes and 4,000-5,000 views. https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F29431778%2Fd90e2039f1be11b53308ab7191b10954%2Fdownload%20(1).png?generation=1763013545903998&alt=media" alt="">

    2. Top Instruments: I've also analyzed the most popular instruments to see which ones get the most engagement. https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F29431778%2F19fca1ce142ddddc1e16a5319a1f4fc5%2Fdownload%20(2).png?generation=1763013562400830&alt=media" alt="">

    Column Descriptions

    • Title: The name of the project.
    • Project_Admin: The author/creator of the project.
    • Image_URL: The URL for the project's cover image.
    • Views: The total number of views (cleaned and imputed).
    • Likes: The total number of likes/favorites (cleaned and imputed).
    • Instruments: The main component or category tag (e.g., "Arduino", "Raspberry Pi").
  10. PROJECT

    • kaggle.com
    zip
    Updated Aug 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ibritics (2023). PROJECT [Dataset]. https://www.kaggle.com/datasets/ibritics/project
    Explore at:
    zip(242 bytes)Available download formats
    Dataset updated
    Aug 8, 2023
    Authors
    Ibritics
    Description

    Dataset

    This dataset was created by Ibritics

    Contents

  11. PrimeEstate

    • kaggle.com
    zip
    Updated Mar 21, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Dennis Ponce Tagamolila (2023). PrimeEstate [Dataset]. https://www.kaggle.com/datasets/dtagamolila/primeestate
    Explore at:
    zip(1195689 bytes)Available download formats
    Dataset updated
    Mar 21, 2023
    Authors
    Dennis Ponce Tagamolila
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Description

    Real Estate Database

    This is a mock-up of a real estate company, this is based on an actual company that had a number of challenges - collection and revenue is the biggest issue. A deep dive into the available data will provide the possible reasons and is the purpose of the data analytics project.

    Here's the fictional business scenario:

    Ms. Aurora Sanchez, the Chief Operations Officer (COO) of Prime Estate talked to the operations data analyst team to discuss a couple of her requirements. Ms. Sanchez is responsible for sales, property and project management, customer service, collections, and several other operations departments under her umbrella. When she joined the organization in late 2018, she quickly got several escalations from buyers who were complaining about units, properties that were not turned over on time, and delays in the projects. Ms. Sanchez also noted problems with collections not meeting the targets, and inconsistent sales performance.

    As the COO, Ms. Sanchez wants to identify and validate the history of these problems as well as see if there have been improvements in these pain points ever since she joined Prime Estate. Her focus points are Collections, Project Management, Customer Service, Collections, and Sales.

    As the Business/Data Analyst Lead, your responsibility is to gather the performance data related to this part of operations, find trends, present findings, and provide recommendations that will help the organization improve the pain points of operations. You must work with the manager of customer service and collections, and the project and property management managers for this undertaking.

    The data that is available is an inventory database that includes a listing of all projects, properties, their cost, package price, current status, and sales date. Another database provided is the project management database that tracks the construction initiation, time lapsed till the project is at 90% completion, and another date that tags it at 100% completed. Lastly, the collections database includes a listing of all units that are tagged as sold and tracks the turnover date (the date that the unit was turned over to the owner), collection date (the date that the full amount was based on the package price and all other charges) was collected from the buyer through multiple channels.

  12. Coffee Sales Excel Project

    • kaggle.com
    Updated Nov 13, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Nuha Zahidi (2024). Coffee Sales Excel Project [Dataset]. https://www.kaggle.com/datasets/nuhazahidi/coffee-sales-excel-project
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 13, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Nuha Zahidi
    Description

    Tool: Microsoft Excel

    Dataset: Coffee Sales

    Process: 1. Data Cleaning: • Remove duplicates and blanks. • Standardize date and currency formats.

    1. Data Manipulation: • Sorting and filtering function to work
      with interest subsets of data. • Use XLOOKUP, INDEX-MATCH and IF
      formula for efficient data manipulation, such as retrieving, matching and organising information in spreadsheets

    2. Data Analysis: • Create Pivot Tables and Pivot Charts with the formatting to visualize trends.

    3. Dashboard Development: • Insert Slicers with the formatting for easy filtering and dynamic updates.

    Highlights: This project aims to understand coffee sales trends by country, roast type, and year, which could help identify marketing opportunities and customer segments.

  13. Data for Pandas Tutorial for Beginners

    • kaggle.com
    zip
    Updated Feb 28, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Le Liem (2022). Data for Pandas Tutorial for Beginners [Dataset]. https://www.kaggle.com/datasets/thanhlimlk/data-for-pandas-tutorial-for-beginners
    Explore at:
    zip(1212 bytes)Available download formats
    Dataset updated
    Feb 28, 2022
    Authors
    Le Liem
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    This dataset is used to practice Pandas for beginners

    Content

    This dataset is presented with some errors which is needed to be fixed. You can use this dataset to practice: Cleaning NaN values with basic Pandas techniques.

    Acknowledgements

    I have this dataset from w3school

  14. Housing Dataset

    • kaggle.com
    zip
    Updated Aug 31, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Himani Rana2004 (2024). Housing Dataset [Dataset]. https://www.kaggle.com/datasets/himanirana2004/housing-dataset
    Explore at:
    zip(409382 bytes)Available download formats
    Dataset updated
    Aug 31, 2024
    Authors
    Himani Rana2004
    Description

    heyy this is beginner project of python .This dataset help you to practise your projects and you can learn all . Actually i am also practise this So,you can also take the help from Youtube to learn more. Thankyou

  15. House Price Regression Dataset

    • kaggle.com
    zip
    Updated Sep 6, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Prokshitha Polemoni (2024). House Price Regression Dataset [Dataset]. https://www.kaggle.com/datasets/prokshitha/home-value-insights
    Explore at:
    zip(27045 bytes)Available download formats
    Dataset updated
    Sep 6, 2024
    Authors
    Prokshitha Polemoni
    Description

    Home Value Insights: A Beginner's Regression Dataset

    This dataset is designed for beginners to practice regression problems, particularly in the context of predicting house prices. It contains 1000 rows, with each row representing a house and various attributes that influence its price. The dataset is well-suited for learning basic to intermediate-level regression modeling techniques.

    Features:

    1. Square_Footage: The size of the house in square feet. Larger homes typically have higher prices.
    2. Num_Bedrooms: The number of bedrooms in the house. More bedrooms generally increase the value of a home.
    3. Num_Bathrooms: The number of bathrooms in the house. Houses with more bathrooms are typically priced higher.
    4. Year_Built: The year the house was built. Older houses may be priced lower due to wear and tear.
    5. Lot_Size: The size of the lot the house is built on, measured in acres. Larger lots tend to add value to a property.
    6. Garage_Size: The number of cars that can fit in the garage. Houses with larger garages are usually more expensive.
    7. Neighborhood_Quality: A rating of the neighborhood’s quality on a scale of 1-10, where 10 indicates a high-quality neighborhood. Better neighborhoods usually command higher prices.
    8. House_Price (Target Variable): The price of the house, which is the dependent variable you aim to predict.

    Potential Uses:

    1. Beginner Regression Projects: This dataset can be used to practice building regression models such as Linear Regression, Decision Trees, or Random Forests. The target variable (house price) is continuous, making this an ideal problem for supervised learning techniques.

    2. Feature Engineering Practice: Learners can create new features by combining existing ones, such as the price per square foot or age of the house, providing an opportunity to experiment with feature transformations.

    3. Exploratory Data Analysis (EDA): You can explore how different features (e.g., square footage, number of bedrooms) correlate with the target variable, making it a great dataset for learning about data visualization and summary statistics.

    4. Model Evaluation: The dataset allows for various model evaluation techniques such as cross-validation, R-squared, and Mean Absolute Error (MAE). These metrics can be used to compare the effectiveness of different models.

    Versatility:

    • The dataset is highly versatile for a range of machine learning tasks. You can apply simple linear models to predict house prices based on one or two features, or use more complex models like Random Forest or Gradient Boosting Machines to understand interactions between variables.

    • It can also be used for dimensionality reduction techniques like PCA or to practice handling categorical variables (e.g., neighborhood quality) through encoding techniques like one-hot encoding.

    • This dataset is ideal for anyone wanting to gain practical experience in building regression models while working with real-world features.

  16. Google Data Analytics Capstone

    • kaggle.com
    zip
    Updated Aug 9, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Reilly McCarthy (2022). Google Data Analytics Capstone [Dataset]. https://www.kaggle.com/datasets/reillymccarthy/google-data-analytics-capstone/discussion
    Explore at:
    zip(67456 bytes)Available download formats
    Dataset updated
    Aug 9, 2022
    Authors
    Reilly McCarthy
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Hello! Welcome to the Capstone project I have completed to earn my Data Analytics certificate through Google. I chose to complete this case study through RStudio desktop. The reason I did this is that R is the primary new concept I learned throughout this course. I wanted to embrace my curiosity and learn more about R through this project. In the beginning of this report I will provide the scenario of the case study I was given. After this I will walk you through my Data Analysis process based on the steps I learned in this course:

    1. Ask
    2. Prepare
    3. Process
    4. Analyze
    5. Share
    6. Act

    The data I used for this analysis comes from this FitBit data set: https://www.kaggle.com/datasets/arashnic/fitbit

    " This dataset generated by respondents to a distributed survey via Amazon Mechanical Turk between 03.12.2016-05.12.2016. Thirty eligible Fitbit users consented to the submission of personal tracker data, including minute-level output for physical activity, heart rate, and sleep monitoring. "

  17. fdu-nlp-beginner-task4-embedding-vectors

    • kaggle.com
    zip
    Updated Feb 21, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    acmgyt (2024). fdu-nlp-beginner-task4-embedding-vectors [Dataset]. https://www.kaggle.com/datasets/acmgyt/fdu-nlp-beginner-task4-embedding-vectors
    Explore at:
    zip(1918844 bytes)Available download formats
    Dataset updated
    Feb 21, 2024
    Authors
    acmgyt
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset

    This dataset was created by acmgyt

    Released under MIT

    Contents

    from https://nlp.stanford.edu/projects/glove/

  18. Beginner Projects - Analyse subtitles for a movie

    • kaggle.com
    zip
    Updated Jun 13, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Priya_ds (2017). Beginner Projects - Analyse subtitles for a movie [Dataset]. https://www.kaggle.com/priya2908/beginner-projects-analyse-subtitles-for-a-movie
    Explore at:
    zip(5169303 bytes)Available download formats
    Dataset updated
    Jun 13, 2017
    Authors
    Priya_ds
    Description

    Context

    There's a story behind every dataset and here's your opportunity to share yours.

    Content

    What's inside is more than just rows and columns. Make it easy for others to get started by describing how you acquired the data and what time period it represents, too.

    Acknowledgements

    We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.

    Inspiration

    Your data will be in front of the world's largest data science community. What questions do you want to see answered?

  19. Power BI dataset

    • kaggle.com
    zip
    Updated Oct 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ahmadali Jamali (2023). Power BI dataset [Dataset]. https://www.kaggle.com/datasets/ahmadalijamali/dataset
    Explore at:
    zip(1642 bytes)Available download formats
    Dataset updated
    Oct 31, 2023
    Authors
    Ahmadali Jamali
    License

    https://www.licenses.ai/ai-licenseshttps://www.licenses.ai/ai-licenses

    Description

    Tabular dataset for data analysis and machine learning practice. The dataset is about the market and is usable for Power BI practice and data science.

  20. Titanic Data Simple EDA with Logistic Regression

    • kaggle.com
    zip
    Updated Aug 12, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Vicky Nayak (2020). Titanic Data Simple EDA with Logistic Regression [Dataset]. https://www.kaggle.com/vickynayak9/titanic-dataset
    Explore at:
    zip(22558 bytes)Available download formats
    Dataset updated
    Aug 12, 2020
    Authors
    Vicky Nayak
    Description

    Dataset

    This dataset was created by Vicky Nayak

    Contents

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Pashito (2022). Hello world [Dataset]. https://www.kaggle.com/pashito/hello-world
Organization logo

Hello world

My first projects on data analysis

Explore at:
zip(365735 bytes)Available download formats
Dataset updated
Aug 1, 2022
Authors
Pashito
Area covered
World
Description

Beginner projects made with help of YouTube tutorials and Kaggle.

Search
Clear search
Close search
Google apps
Main menu