100+ datasets found
  1. Hackathon Dataset

    • kaggle.com
    zip
    Updated Jun 17, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Abhiraj Mandal (2024). Hackathon Dataset [Dataset]. https://www.kaggle.com/datasets/abhirajmandal/hackathon-dataset
    Explore at:
    zip(1271001 bytes)Available download formats
    Dataset updated
    Jun 17, 2024
    Authors
    Abhiraj Mandal
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Abhiraj Mandal

    Released under Apache 2.0

    Contents

  2. Fraud Detection Hackathon

    • kaggle.com
    zip
    Updated Nov 30, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Ar2197 (2024). Fraud Detection Hackathon [Dataset]. https://www.kaggle.com/datasets/ar2197/fraud-detection-hackathon
    Explore at:
    zip(121782590 bytes)Available download formats
    Dataset updated
    Nov 30, 2024
    Authors
    Ar2197
    Description

    Sample hackathon data to practice fraud detection . It has multiple files which will require some thinking to structure and the type of dataset will challenge to find ways to get good accuracy

  3. Game of Deep Learning: Computer Vision Hackathon

    • kaggle.com
    Updated Aug 12, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gaurav Dutta (2022). Game of Deep Learning: Computer Vision Hackathon [Dataset]. https://www.kaggle.com/datasets/gauravduttakiit/game-of-deep-learning-computer-vision-hackathon
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Aug 12, 2022
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Gaurav Dutta
    Description

    Ship or vessel detection has a wide range of applications, in the areas of maritime safety, fisheries management, marine pollution, defence and maritime security, protection from piracy, illegal migration, etc. Keeping this in mind, a Governmental Maritime and Coastguard Agency is planning to deploy a computer vision based automated system to identify ship type only from the images taken by the survey boats. You have been hired as a consultant to build an efficient model for this project.

  4. Meta kaggle Hackathon

    • kaggle.com
    zip
    Updated Jul 17, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    KHUSHI YADAV (2025). Meta kaggle Hackathon [Dataset]. https://www.kaggle.com/datasets/khushiyadav34/meta-kaggle-hackathon
    Explore at:
    zip(4619 bytes)Available download formats
    Dataset updated
    Jul 17, 2025
    Authors
    KHUSHI YADAV
    License

    Attribution-NonCommercial-ShareAlike 3.0 (CC BY-NC-SA 3.0)https://creativecommons.org/licenses/by-nc-sa/3.0/
    License information was derived automatically

    Description

    Dataset

    This dataset was created by KHUSHI YADAV

    Released under Attribution-NonCommercial-ShareAlike 3.0 IGO (CC BY-NC-SA 3.0 IGO)

    Contents

  5. Kaggle events and collaborations

    • kaggle.com
    zip
    Updated Nov 10, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    BwandoWando (2025). Kaggle events and collaborations [Dataset]. https://www.kaggle.com/datasets/bwandowando/kaggle-staff-forum-topic-posts
    Explore at:
    zip(1627 bytes)Available download formats
    Dataset updated
    Nov 10, 2025
    Authors
    BwandoWando
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Context

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F1842206%2Fd9e95121cb5e00a0c6ef76b3f2039470%2F_6ff9a514-feae-4016-a680-5e674c943d14.jpeg?generation=1752462551017569&alt=media" alt="">

    Events in & outside Kaggle coinciding +/- 2 days within user registration spikes. Used for MetaKaggle Hackathon

  6. Hackathon Participants Data

    • kaggle.com
    zip
    Updated Jun 25, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Priyanshu Sethi (2023). Hackathon Participants Data [Dataset]. https://www.kaggle.com/datasets/priyanshusethi/high-school-hackathon-data/code
    Explore at:
    zip(1293 bytes)Available download formats
    Dataset updated
    Jun 25, 2023
    Authors
    Priyanshu Sethi
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    Hackathons are a great way for people to not only learn more about technology but also showcase their existing skills by making projects often in a few hours. This dataset contains data collected from 200 participants of a hackathon conducted for high school students. A lot of columns have been deleted but the remaining columns can be useful to understand the demographic and interests of someone participating in these kind of events.

  7. Hackathon-2025-Big-Data

    • kaggle.com
    zip
    Updated Sep 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Bard2024 (2025). Hackathon-2025-Big-Data [Dataset]. https://www.kaggle.com/datasets/bard2024/hackathon-2025-big-data/code
    Explore at:
    zip(120593430 bytes)Available download formats
    Dataset updated
    Sep 16, 2025
    Authors
    Bard2024
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Bard2024

    Released under MIT

    Contents

  8. MetaKaggle Forum Data Stella Embeddings

    • kaggle.com
    zip
    Updated Jun 2, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    BwandoWando (2025). MetaKaggle Forum Data Stella Embeddings [Dataset]. https://www.kaggle.com/datasets/bwandowando/metakaggle-forum-data-embeddings-stella-en-1-5b-v5
    Explore at:
    zip(14369578283 bytes)Available download formats
    Dataset updated
    Jun 2, 2025
    Authors
    BwandoWando
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Context

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F1842206%2Fe27c4ece1c20108bff7baf4f8dc5a37e%2F_d0311f97-3d66-461c-9af5-ba20d8a9da6f-small.jpeg?generation=1748786488909664&alt=media" alt="">

    These are NovaSearch/stella_en_1.5B_v5 embeddings of the Meta Kaggle ForumTopics.csv and ForumMessages.csv

    Intended purpose

    This is a supplemental dataset for the Meta Kaggle Hackathon

    How I preprocessed the text data

    1. I removed html elements using BeautifulSoup
    2. I replaced any URL value with a placeholder <url> value
    3. I removed emojis and symbols
    4. I replaced 1 or more carriage returns with just a single white space
    5. NovaSearch/stella_en_1.5B_v5 was set to 2048 tokens context size and normalize_embeddings is set to true

    Sample Data

    The actual text data that I fed into the embedding model can be seen in this dataset

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F1842206%2Fe6a47fb262445e7dfefbb7be71d14565%2FScreenshot%20from%202025-06-01%2021-44-28.png?generation=1748785487135090&alt=media" alt="">

    How to use

    • Download the original csvs from Meta Kaggle dataset so that you can see the original text values and compare it to the preprocessed values.
    • You can also just download the samples in the ./sample/*.parquet folder to see how the data looks like, before you download the whole dataset (16GB)
    • These are normalized embeddings that you can use with Cosine Similarity

    See Related Datasets

    Image

    Generated with Bing Image Generator

  9. MetaKaggle Forum Data Qwen2 Embeddings

    • kaggle.com
    zip
    Updated Jun 3, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    BwandoWando (2025). MetaKaggle Forum Data Qwen2 Embeddings [Dataset]. https://www.kaggle.com/datasets/bwandowando/metakaggle-forum-data-qwen2-1-5-embeddings
    Explore at:
    zip(21475584602 bytes)Available download formats
    Dataset updated
    Jun 3, 2025
    Authors
    BwandoWando
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Context

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F1842206%2Fef7fe21fba54bb94bff875f3f9820ea5%2F_9e90e8e2-5caf-4214-8726-77afecdaafc1-small.jpeg?generation=1748913590396040&alt=media" alt="">

    These are Qwen/Qwen2-1.5B-Instruct embeddings of the Meta Kaggle ForumTopics.csv and ForumMessages.csv

    Intended purpose

    This is a supplemental dataset for the Meta Kaggle Hackathon

    How I preprocessed the text data

    1. I removed html elements using BeautifulSoup
    2. I replaced any URL value with a placeholder <url> value
    3. I removed emojis and symbols
    4. I replaced 1 or more carriage returns with just a single white space
    5. Qwen/Qwen2-1.5B-Instruct was set to 2048 tokens context size and normalize_embeddings is set to true

    Sample Data

    The actual text data that I fed into the embedding model can be seen in this dataset

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F1842206%2F155f60ac3a36046a5d546283bad80368%2FScreenshot%20from%202025-06-03%2009-20-35.png?generation=1748913654315054&alt=media" alt="">

    How to use

    • Download the original csvs from Meta Kaggle dataset so that you can see the original text values and compare it to the preprocessed values.
    • You can also just download the samples in the ./sample/*.parquet folder to see how the data looks like, before you download the whole dataset (23GB)
    • These are normalized embeddings that you can use with Cosine Similarity

    See Related Datasets

    Image

    Generated with Bing Image Generator

  10. MACHINE LEARNING HACKATHON

    • kaggle.com
    zip
    Updated Oct 31, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Shreya Halgeri (2023). MACHINE LEARNING HACKATHON [Dataset]. https://www.kaggle.com/datasets/shreyahalgeri/machine-learning-hackathon
    Explore at:
    zip(4874543 bytes)Available download formats
    Dataset updated
    Oct 31, 2023
    Authors
    Shreya Halgeri
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Shreya Halgeri

    Released under Apache 2.0

    Contents

  11. MetaKaggle Forum Data Jina-Small-Eng-V1 Embeddings

    • kaggle.com
    zip
    Updated Jun 4, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    BwandoWando (2025). MetaKaggle Forum Data Jina-Small-Eng-V1 Embeddings [Dataset]. https://www.kaggle.com/datasets/bwandowando/metakaggle-forum-data-jina-small-eng-v1-embeddings
    Explore at:
    zip(7290271023 bytes)Available download formats
    Dataset updated
    Jun 4, 2025
    Authors
    BwandoWando
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Context

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F1842206%2Fa052924b64c6ae6cd801dcc917067ea2%2F_ae937b93-d6fb-4985-b526-f6a31c2970c0-small.jpeg?generation=1749029365407793&alt=media" alt="">

    These are jinaai/jina-embedding-s-en-v1 embeddings of the Meta Kaggle ForumTopics.csv and ForumMessages.csv

    Intended purpose

    This is a supplemental dataset for the Meta Kaggle Hackathon

    How I preprocessed the text data

    1. I removed html elements using BeautifulSoup
    2. I replaced any URL value with a placeholder <url> value
    3. I removed emojis and symbols
    4. I replaced 1 or more carriage returns with just a single white space
    5. jinaai/jina-embedding-s-en-v1 was set to 512 tokens context size and normalize_embeddings is set to true

    Sample Data

    The actual text data that I fed into the embedding model can be seen in this dataset

    https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F1842206%2F155f60ac3a36046a5d546283bad80368%2FScreenshot%20from%202025-06-03%2009-20-35.png?generation=1748913654315054&alt=media" alt="">

    How to use

    • Download the original csvs from Meta Kaggle dataset so that you can see the original text values and compare it to the preprocessed values.
    • You can also just download the samples in the ./sample/*.parquet folder to see how the data looks like, before you download the whole dataset (23GB)
    • These are normalized embeddings that you can use with Cosine Similarity

    See Related Datasets

    Image

    Generated with Bing Image Generator

  12. data-for-hackathon

    • kaggle.com
    zip
    Updated Oct 27, 2024
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    HoangTran223 (2024). data-for-hackathon [Dataset]. https://www.kaggle.com/datasets/hoangtran223/data-for-hackathon/code
    Explore at:
    zip(338 bytes)Available download formats
    Dataset updated
    Oct 27, 2024
    Authors
    HoangTran223
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by HoangTran223

    Released under Apache 2.0

    Contents

  13. GenAI hackathon

    • kaggle.com
    Updated Apr 16, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sagar Mondal (2025). GenAI hackathon [Dataset]. https://www.kaggle.com/datasets/phenomenalsagar/genai-hackathon
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Apr 16, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Sagar Mondal
    License

    MIT Licensehttps://opensource.org/licenses/MIT
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Sagar Mondal

    Released under MIT

    Contents

  14. Football Hackathon

    • kaggle.com
    zip
    Updated Jun 10, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Gaurav Dutta (2022). Football Hackathon [Dataset]. https://www.kaggle.com/datasets/gauravduttakiit/football-hackathon
    Explore at:
    zip(68510595 bytes)Available download formats
    Dataset updated
    Jun 10, 2022
    Authors
    Gaurav Dutta
    Description

    Dataset

    This dataset was created by Gaurav Dutta

    Contents

  15. Amazon business analytics hackathon

    • kaggle.com
    zip
    Updated Sep 1, 2022
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sarth Mirashi (2022). Amazon business analytics hackathon [Dataset]. https://www.kaggle.com/datasets/sarthmirashi07/amazon-train
    Explore at:
    zip(1972072 bytes)Available download formats
    Dataset updated
    Sep 1, 2022
    Authors
    Sarth Mirashi
    Description

    Dataset

    This dataset was created by Sarth Mirashi

    Contents

  16. Community Hackathon

    • kaggle.com
    zip
    Updated Oct 4, 2020
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sahabudin Ali (2020). Community Hackathon [Dataset]. https://www.kaggle.com/sahabudin9/community-hackathon
    Explore at:
    zip(21795226 bytes)Available download formats
    Dataset updated
    Oct 4, 2020
    Authors
    Sahabudin Ali
    Description

    Dataset

    This dataset was created by Sahabudin Ali

    Contents

  17. hackathons

    • kaggle.com
    zip
    Updated May 19, 2023
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Alexander Nolte (2023). hackathons [Dataset]. https://www.kaggle.com/datasets/alexandernolte/hackathons
    Explore at:
    zip(5531 bytes)Available download formats
    Dataset updated
    May 19, 2023
    Authors
    Alexander Nolte
    Description

    Dataset

    This dataset was created by Alexander Nolte

    Contents

  18. Hackathon Competition

    • kaggle.com
    zip
    Updated Feb 20, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mariusz Bronowicki (2021). Hackathon Competition [Dataset]. https://www.kaggle.com/godzill22/hackathon-competittion
    Explore at:
    zip(5013870 bytes)Available download formats
    Dataset updated
    Feb 20, 2021
    Authors
    Mariusz Bronowicki
    Description

    Context

    This dataset comes from Hackathon Competition: https://tournament.datacrunch.com/how-to-get-started

    Content

    What's inside is more than just rows and columns. Make it easy for others to get started by describing how you acquired the data and what time period it represents, too.

    Acknowledgements

    We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.

    Inspiration

    Your data will be in front of the world's largest data science community. What questions do you want to see answered?

  19. H2O AI - AQI Hackathon

    • kaggle.com
    zip
    Updated Apr 8, 2023
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Rajat Ranjan (2023). H2O AI - AQI Hackathon [Dataset]. https://www.kaggle.com/datasets/rajatranjan/h2o-ai-aqi-hackathon
    Explore at:
    zip(591350 bytes)Available download formats
    Dataset updated
    Apr 8, 2023
    Authors
    Rajat Ranjan
    Description

    Dataset

    This dataset was created by Rajat Ranjan

    Contents

  20. Hackathon

    • kaggle.com
    Updated Jun 11, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Mukuliitg (2025). Hackathon [Dataset]. https://www.kaggle.com/datasets/mukuliitg/hackathon/code
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Jun 11, 2025
    Dataset provided by
    Kagglehttp://kaggle.com/
    Authors
    Mukuliitg
    License

    Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
    License information was derived automatically

    Description

    Dataset

    This dataset was created by Mukuliitg

    Released under Apache 2.0

    Contents

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Abhiraj Mandal (2024). Hackathon Dataset [Dataset]. https://www.kaggle.com/datasets/abhirajmandal/hackathon-dataset
Organization logo

Hackathon Dataset

Explore at:
zip(1271001 bytes)Available download formats
Dataset updated
Jun 17, 2024
Authors
Abhiraj Mandal
License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

Dataset

This dataset was created by Abhiraj Mandal

Released under Apache 2.0

Contents

Search
Clear search
Close search
Google apps
Main menu