72 datasets found
  1. tutorial

    • kaggle.com
    Updated Aug 20, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    chewytteok (2020). tutorial [Dataset]. https://www.kaggle.com/chewytteok/tutorial/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 20, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    chewytteok
    Description

    Dataset

    This dataset was created by chewytteok

    Contents

  2. Bikes For Tutorial

    • kaggle.com
    Updated Jan 23, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    deluXe (2020). Bikes For Tutorial [Dataset]. https://www.kaggle.com/johannesbernhard/bikesfortutorial/notebooks
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 23, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    deluXe
    Description

    Dataset for my (German) Python Data Science Tutorial on YouTube.

    Playlist: https://www.youtube.com/playlist?list=PLW4WJMmOF9juA1Ebs1vNwTBuF7ck6YCT7

    My version of: 'Bike Share Daily Data' (https://www.kaggle.com/contactprad/bike-share-daily-data)

    Data used in this competition: https://www.kaggle.com/c/bike-sharing-demand

    Use of this dataset in publications must be cited to the following publication:

    [1] Fanaee-T, Hadi, and Gama, Joao, "Event labeling combining ensemble detectors and background knowledge", Progress in Artificial Intelligence (2013): pp. 1-15, Springer Berlin Heidelberg, doi:10.1007/s13748-013-0040-3.

    @article{ year={2013}, issn={2192-6352}, journal={Progress in Artificial Intelligence}, doi={10.1007/s13748-013-0040-3}, title={Event labeling combining ensemble detectors and background knowledge}, url={http://dx.doi.org/10.1007/s13748-013-0040-3}, publisher={Springer Berlin Heidelberg}, keywords={Event labeling; Event detection; Ensemble learning; Background knowledge}, author={Fanaee-T, Hadi and Gama, Joao}, pages={1-15} }

  3. tutorial dataset

    • kaggle.com
    Updated Aug 24, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    rahul nakka (2023). tutorial dataset [Dataset]. https://www.kaggle.com/datasets/rahulnakka/tutorial-dataset/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 24, 2023
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    rahul nakka
    Description

    Dataset

    This dataset was created by Rahul Nakka

    Contents

  4. Pandas Tutorial Example Dataset - 2

    • kaggle.com
    Updated May 1, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Şükrü Yusuf Kaya (2025). Pandas Tutorial Example Dataset - 2 [Dataset]. https://www.kaggle.com/datasets/kryusufkaya/pandas-tutorial-example-dataset-2/versions/1
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    May 1, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Şükrü Yusuf Kaya
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Şükrü Yusuf Kaya

    Released under MIT

    Contents

  5. numpy-tutorial-seattle

    • kaggle.com
    Updated Jul 8, 2019
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chao CHEN (2019). numpy-tutorial-seattle [Dataset]. https://www.kaggle.com/datasets/monkeyboard568/numpytutorialseattle/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 8, 2019
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Chao CHEN
    Area covered
    Seattle
    Description

    Dataset

    This dataset was created by Chao CHEN

    Contents

  6. q_funcs

    • kaggle.com
    Updated Jul 4, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Joe Fitzgerald (2020). q_funcs [Dataset]. https://www.kaggle.com/datasets/jsfitz/q-funcs
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 4, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Joe Fitzgerald
    Description

    Dataset

    This dataset was created by Joe Fitzgerald

    Contents

  7. Roblox Studio Tutorial

    • kaggle.com
    Updated Jul 14, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Kristo Radion Purba (2024). Roblox Studio Tutorial [Dataset]. https://www.kaggle.com/datasets/krpurba/roblox-studio-tutorial/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 14, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Kristo Radion Purba
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Kristo Radion Purba

    Released under Apache 2.0

    Contents

  8. Recommender Systems Tutorial

    • kaggle.com
    Updated Sep 16, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Daniil Barysevich (2018). Recommender Systems Tutorial [Dataset]. https://www.kaggle.com/devvindan/recommender-systems-tutorial/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 16, 2018
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Daniil Barysevich
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by Daniil Barysevich

    Released under CC0: Public Domain

    Contents

  9. Vowpal Wabbit tutorial

    • kaggle.com
    Updated Apr 2, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Yury Kashnitsky (2018). Vowpal Wabbit tutorial [Dataset]. https://www.kaggle.com/kashnitsky/spooky-vw-tutorial/kernels
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 2, 2018
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Yury Kashnitsky
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    This dataset is comprised of several others and is collected specially for the Vowpal Wabbit tutorial, Kernel. The tutorial covers (both theoretically and in practice) two reasons of Vowpal Wabbit's exceptional training speed, namely, online learning and hashing trick. We'll try it out with the Spooky Author Identification dataset as well as with news, letters, movie reviews datasets and gigabytes of StackOverflow questions.

    Content

    The included datasets are:

    • SOCR weights and heights, link (CC-BY licensed)
    • UCI bikes sharing demand, link (no license)
    • the 20 newsgroups text dataset, link (BSD licensed)
    • pickled VW-versions of IMDB movie reviews (no license)
    • a sample from the 10mln StackOverflow dataset (preprocessed and shared under the cc-by-sa 3.0 license)
  10. vit-tutorial-illustrations

    • kaggle.com
    Updated Dec 7, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Abhinand (2020). vit-tutorial-illustrations [Dataset]. https://www.kaggle.com/datasets/abhinand05/vittutorialillustrations
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 7, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Abhinand
    Description

    Dataset

    This dataset was created by Abhinand

    Contents

  11. TF Tutorial: PTB Dataset

    • kaggle.com
    Updated Jan 2, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    pistachio_overlord (2018). TF Tutorial: PTB Dataset [Dataset]. https://www.kaggle.com/myqrizzo/tf-tutorial-ptb-dataset
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jan 2, 2018
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    pistachio_overlord
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by pistachio_overlord

    Released under CC0: Public Domain

    Contents

  12. numpy-tutorial-president

    • kaggle.com
    Updated Jul 8, 2019
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Chao CHEN (2019). numpy-tutorial-president [Dataset]. https://www.kaggle.com/monkeyboard568/numpytutorial/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jul 8, 2019
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Chao CHEN
    Description

    Dataset

    This dataset was created by Chao CHEN

    Contents

  13. tutorial_srgan

    • kaggle.com
    Updated Nov 23, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    annie2509 (2024). tutorial_srgan [Dataset]. https://www.kaggle.com/datasets/annie2509/tutorial-srgan
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Nov 23, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    annie2509
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset

    This dataset was created by annie2509

    Released under MIT

    Contents

  14. Cats and Dogs Sentdex Tutorial

    • kaggle.com
    Updated Oct 8, 2018
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sherpa (2018). Cats and Dogs Sentdex Tutorial [Dataset]. https://www.kaggle.com/thesherpafromalabama/cats-and-dogs-sentdex-tutorial/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 8, 2018
    Dataset provided by
    Kaggle
    Authors
    Sherpa
    Description
  15. complete pandas tutorial

    • kaggle.com
    Updated Aug 24, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Pritam Purohit (2020). complete pandas tutorial [Dataset]. https://www.kaggle.com/pritampurohit/complete-pandas-tutorial/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 24, 2020
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Pritam Purohit
    Description

    Dataset

    This dataset was created by Pritam Purohit

    Contents

  16. margherita pizza tutorial

    • kaggle.com
    Updated Dec 1, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Guangyu Song (2024). margherita pizza tutorial [Dataset]. https://www.kaggle.com/datasets/gysong/margherita-pizza-tutorial/discussion
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Dec 1, 2024
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Guangyu Song
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Dataset

    This dataset was created by Guangyu Song

    Released under CC0: Public Domain

    Contents

  17. name from which language

    • kaggle.com
    Updated Apr 3, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Andrew Tu (2021). name from which language [Dataset]. https://www.kaggle.com/tusonggao/name-from-which-language
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 3, 2021
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Andrew Tu
    Description
  18. Big Data Certification KR

    • kaggle.com
    zip
    Updated Nov 29, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    KIM TAE HEON (2021). Big Data Certification KR [Dataset]. https://www.kaggle.com/agileteam/bigdatacertificationkr
    Explore at:
    zip(15840 bytes)Available download formats
    Dataset updated
    Nov 29, 2021
    Authors
    KIM TAE HEON
    License

    Attribution-NoDerivs 4.0 (CC BY-ND 4.0)https://creativecommons.org/licenses/by-nd/4.0/
    License information was derived automatically

    Description

    빅데이터 분석기사 실기 준비 놀이터

    함께 놀아볼까요? 무궁화 꽃이 피었습니다 😜 빅데이터 분석기사 실기 준비를 위한 데이터 셋입니다. 더 좋은 코드를 만든다면 많은 공유 부탁드려요🎉 (Python과 R모두 환영합니다.)

    4회 기출 유형

    3회 기출 유형 및 심화 학습자료

    🆕 New 문제 업데이트 2022.6

    🎁 빅데이터 분식기사 실기 입문 강의 Open 🎁

    • https://class101.page.link/tp9k
    • 입문자를 위한 강의 오픈 했어요 👍
    • 파이썬-판다스-머신러닝-모의문제(작업형1,2)-꿀팁 등을 실기 준비에 필요한 내용만 친절하게 알려드려요🎉
    • 머신러닝을 해보신 분이라면 수강 할 필요 없을 것 같아요, 바로 모의 문제를 풀기 힘든 설명이 필요한 찐 입문자에게 추천드려요!

    📌작업형1 예상문제 (P:파이썬, R)

    Tasks 탭에서 문제 및 코드 확인

    📌작업형2 예상문제

    Tasks 탭에서 문제 및 코드 확인 - [3회차 기출유형 작업형2] : 여행 보험 패키지 상품 (데이터를 조금 어렵게 변경함) P: https://www.kaggle.com/code/agileteam/3rd-type2-3-2-baseline

    📌6 주 완성 코스 (아래 표 참고)

    주차유형(에디터)번호
    6주 전작업형1(노트북)T1-1~5
    5주 전작업형1(노트북)T1-6~9, T1 EQ(기출),
    4주 전작업형1(스크립트), 작업형2(노트북)T1-10~13, T1.Ex, T2EQ, T2-1
    3주 전작업형1(스크립트), 작업형2(노트북)T1-14~19, T2-2~3
    2주 전작업형1(스크립트), 작업형2(스크립트)T1-20~21, T2-4~6, 복습
    1주 전작업형1, 작업형2(스크립트), 단답형T1-22~24, 모의고사, 복습, 응시환경 체험, 단답

    📌입문자를 위한 머신러닝 튜토리얼 (공유해주신 노트북 중 선정하였음👍)

    - https://www.kaggle.com/ohseokkim/t2-2-pima-indians-diabetes 작성자: @ohseokkim 😆

  19. NYC Jobs Dataset (Filtered Columns)

    • kaggle.com
    Updated Oct 5, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jeffery Mandrake (2022). NYC Jobs Dataset (Filtered Columns) [Dataset]. https://www.kaggle.com/datasets/jefferymandrake/nyc-jobs-filtered-cols
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Oct 5, 2022
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Jeffery Mandrake
    License

    Attribution-NonCommercial-ShareAlike 4.0 (CC BY-NC-SA 4.0)https://creativecommons.org/licenses/by-nc-sa/4.0/
    License information was derived automatically

    Area covered
    New York
    Description

    Use this dataset with Misra's Pandas tutorial: How to use the Pandas GroupBy function | Pandas tutorial

    The original dataset came from this site: https://data.cityofnewyork.us/City-Government/NYC-Jobs/kpav-sd4t/data

    I used Google Colab to filter the columns with the following Pandas commands. Here's a Colab Notebook you can use with the commands listed below: https://colab.research.google.com/drive/17Jpgeytc075CpqDnbQvVMfh9j-f4jM5l?usp=sharing

    Once the csv file is uploaded to Google Colab, use these commands to process the file.

    import pandas as pd # load the file and create a pandas dataframe df = pd.read_csv('/content/NYC_Jobs.csv') # keep only these columns df = df[['Job ID', 'Civil Service Title', 'Agency', 'Posting Type', 'Job Category', 'Salary Range From', 'Salary Range To' ]] # save the csv file without the index column df.to_csv('/content/NYC_Jobs_filtered_cols.csv', index=False)

  20. Data from: Bag of Words Meets Bags of Popcorn

    • kaggle.com
    zip
    Updated May 18, 2017
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    rocha (2017). Bag of Words Meets Bags of Popcorn [Dataset]. https://www.kaggle.com/rochachan/bag-of-words-meets-bags-of-popcorn
    Explore at:
    zip(13788314 bytes)Available download formats
    Dataset updated
    May 18, 2017
    Authors
    rocha
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Context

    The competition is over 2 yrs ago. I just wanna play around the dataset.

    Content

    The labeled data set consists of 50,000 IMDB movie reviews, specially selected for sentiment analysis. The sentiment of reviews is binary, meaning the IMDB rating < 5 results in a sentiment score of 0, and rating >=7 have a sentiment score of 1. No individual movie has more than 30 reviews. The 25,000 review labeled training set does not include any of the same movies as the 25,000 review test set. In addition, there are another 50,000 IMDB reviews provided without any rating labels.

    • id - Unique ID of each review
    • sentiment - Sentiment of the review; 1 for positive reviews and 0 for negative reviews
    • review - Text of the review

    Acknowledgements

    The origin place is here. Awesome tutorial is here, we can play with it.

    Inspiration

    Just for study and learning

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
chewytteok (2020). tutorial [Dataset]. https://www.kaggle.com/chewytteok/tutorial/code
Organization logo

tutorial

Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 20, 2020
Dataset provided by
Kagglehttp://kaggle.com/
Authors
chewytteok
Description

Dataset

This dataset was created by chewytteok

Contents

Search
Clear search
Close search
Google apps
Main menu