100+ datasets found

Hackathon Dataset
kaggle.com
zip
Updated Jun 17, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Abhiraj Mandal (2024). Hackathon Dataset [Dataset]. https://www.kaggle.com/datasets/abhirajmandal/hackathon-dataset
Explore at:
zip(1271001 bytes)Available download formats
Dataset updated
Jun 17, 2024
Authors
Abhiraj Mandal
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by Abhiraj Mandal

Released under Apache 2.0

Contents
Fraud Detection Hackathon
kaggle.com
zip
Updated Nov 30, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Ar2197 (2024). Fraud Detection Hackathon [Dataset]. https://www.kaggle.com/datasets/ar2197/fraud-detection-hackathon
Explore at:
zip(121782590 bytes)Available download formats
Dataset updated
Nov 30, 2024
Authors
Ar2197
Description
Sample hackathon data to practice fraud detection . It has multiple files which will require some thinking to structure and the type of dataset will challenge to find ways to get good accuracy
Game of Deep Learning: Computer Vision Hackathon
kaggle.com
Updated Aug 12, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gaurav Dutta (2022). Game of Deep Learning: Computer Vision Hackathon [Dataset]. https://www.kaggle.com/datasets/gauravduttakiit/game-of-deep-learning-computer-vision-hackathon
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Aug 12, 2022
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Gaurav Dutta
Description
Ship or vessel detection has a wide range of applications, in the areas of maritime safety, fisheries management, marine pollution, defence and maritime security, protection from piracy, illegal migration, etc. Keeping this in mind, a Governmental Maritime and Coastguard Agency is planning to deploy a computer vision based automated system to identify ship type only from the images taken by the survey boats. You have been hired as a consultant to build an efficient model for this project.
Meta kaggle Hackathon
kaggle.com
zip
Updated Jul 17, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
KHUSHI YADAV (2025). Meta kaggle Hackathon [Dataset]. https://www.kaggle.com/datasets/khushiyadav34/meta-kaggle-hackathon
Explore at:
zip(4619 bytes)Available download formats
Dataset updated
Jul 17, 2025
Authors
KHUSHI YADAV
License
Attribution-NonCommercial-ShareAlike 3.0 (CC BY-NC-SA 3.0)https://creativecommons.org/licenses/by-nc-sa/3.0/
License information was derived automatically
Description
Dataset

This dataset was created by KHUSHI YADAV

Released under Attribution-NonCommercial-ShareAlike 3.0 IGO (CC BY-NC-SA 3.0 IGO)

Contents
Kaggle events and collaborations
kaggle.com
zip
Updated Nov 10, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
BwandoWando (2025). Kaggle events and collaborations [Dataset]. https://www.kaggle.com/datasets/bwandowando/kaggle-staff-forum-topic-posts
Explore at:
zip(1627 bytes)Available download formats
Dataset updated
Nov 10, 2025
Authors
BwandoWando
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Context

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F1842206%2Fd9e95121cb5e00a0c6ef76b3f2039470%2F_6ff9a514-feae-4016-a680-5e674c943d14.jpeg?generation=1752462551017569&alt=media" alt="">

Events in & outside Kaggle coinciding +/- 2 days within user registration spikes. Used for MetaKaggle Hackathon
Hackathon Participants Data
kaggle.com
zip
Updated Jun 25, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Priyanshu Sethi (2023). Hackathon Participants Data [Dataset]. https://www.kaggle.com/datasets/priyanshusethi/high-school-hackathon-data/code
Explore at:
zip(1293 bytes)Available download formats
Dataset updated
Jun 25, 2023
Authors
Priyanshu Sethi
License
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
Description
Hackathons are a great way for people to not only learn more about technology but also showcase their existing skills by making projects often in a few hours. This dataset contains data collected from 200 participants of a hackathon conducted for high school students. A lot of columns have been deleted but the remaining columns can be useful to understand the demographic and interests of someone participating in these kind of events.
Hackathon-2025-Big-Data
kaggle.com
zip
Updated Sep 16, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Bard2024 (2025). Hackathon-2025-Big-Data [Dataset]. https://www.kaggle.com/datasets/bard2024/hackathon-2025-big-data/code
Explore at:
zip(120593430 bytes)Available download formats
Dataset updated
Sep 16, 2025
Authors
Bard2024
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Dataset

This dataset was created by Bard2024

Released under MIT

Contents
MetaKaggle Forum Data Stella Embeddings
kaggle.com
zip
Updated Jun 2, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
BwandoWando (2025). MetaKaggle Forum Data Stella Embeddings [Dataset]. https://www.kaggle.com/datasets/bwandowando/metakaggle-forum-data-embeddings-stella-en-1-5b-v5
Explore at:
zip(14369578283 bytes)Available download formats
Dataset updated
Jun 2, 2025
Authors
BwandoWando
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Context

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F1842206%2Fe27c4ece1c20108bff7baf4f8dc5a37e%2F_d0311f97-3d66-461c-9af5-ba20d8a9da6f-small.jpeg?generation=1748786488909664&alt=media" alt="">

These are NovaSearch/stella_en_1.5B_v5 embeddings of the Meta Kaggle ForumTopics.csv and ForumMessages.csv

Intended purpose

This is a supplemental dataset for the Meta Kaggle Hackathon

How I preprocessed the text data

I removed html elements using BeautifulSoup

I replaced any URL value with a placeholder <url> value

I removed emojis and symbols

I replaced 1 or more carriage returns with just a single white space

NovaSearch/stella_en_1.5B_v5 was set to 2048 tokens context size and normalize_embeddings is set to true

Sample Data

The actual text data that I fed into the embedding model can be seen in this dataset

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F1842206%2Fe6a47fb262445e7dfefbb7be71d14565%2FScreenshot%20from%202025-06-01%2021-44-28.png?generation=1748785487135090&alt=media" alt="">

How to use

Download the original csvs from Meta Kaggle dataset so that you can see the original text values and compare it to the preprocessed values.

You can also just download the samples in the ./sample/*.parquet folder to see how the data looks like, before you download the whole dataset (16GB)

These are normalized embeddings that you can use with Cosine Similarity

See Related Datasets

👉 MetaKaggle Forum Data ALL-MINILM-L12-v2 Embeddings (256 context size| 384 dimensions)

👉 MetaKaggle Forum Data BAAI/bge-m3 Embeddings (2048 context size| 1024 dimensions)

👉 MetaKaggle Forum Data BGE BASE-EN v1.5 Embeddings (512 context size| 768 dimensions)

👉 MetaKaggle Forum Data Jina-Small-Eng-V1 Embeddings (512 context size| 512 dimensions)

👉 MetaKaggle Forum Data Qwen2 Embeddings (2048 context size| 1536 dimensions)

👉 MetaKaggle Forum Data Stella Embeddings (2048 context size| 1024 dimensions)

Image

Generated with Bing Image Generator
MetaKaggle Forum Data Qwen2 Embeddings
kaggle.com
zip
Updated Jun 3, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
BwandoWando (2025). MetaKaggle Forum Data Qwen2 Embeddings [Dataset]. https://www.kaggle.com/datasets/bwandowando/metakaggle-forum-data-qwen2-1-5-embeddings
Explore at:
zip(21475584602 bytes)Available download formats
Dataset updated
Jun 3, 2025
Authors
BwandoWando
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Context

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F1842206%2Fef7fe21fba54bb94bff875f3f9820ea5%2F_9e90e8e2-5caf-4214-8726-77afecdaafc1-small.jpeg?generation=1748913590396040&alt=media" alt="">

These are Qwen/Qwen2-1.5B-Instruct embeddings of the Meta Kaggle ForumTopics.csv and ForumMessages.csv

Intended purpose

This is a supplemental dataset for the Meta Kaggle Hackathon

How I preprocessed the text data

I removed html elements using BeautifulSoup

I replaced any URL value with a placeholder <url> value

I removed emojis and symbols

I replaced 1 or more carriage returns with just a single white space

Qwen/Qwen2-1.5B-Instruct was set to 2048 tokens context size and normalize_embeddings is set to true

Sample Data

The actual text data that I fed into the embedding model can be seen in this dataset

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F1842206%2F155f60ac3a36046a5d546283bad80368%2FScreenshot%20from%202025-06-03%2009-20-35.png?generation=1748913654315054&alt=media" alt="">

How to use

Download the original csvs from Meta Kaggle dataset so that you can see the original text values and compare it to the preprocessed values.

You can also just download the samples in the ./sample/*.parquet folder to see how the data looks like, before you download the whole dataset (23GB)

These are normalized embeddings that you can use with Cosine Similarity

See Related Datasets

👉 MetaKaggle Forum Data ALL-MINILM-L12-v2 Embeddings (256 context size| 384 dimensions)

👉 MetaKaggle Forum Data BAAI/bge-m3 Embeddings (2048 context size| 1024 dimensions)

👉 MetaKaggle Forum Data BGE BASE-EN v1.5 Embeddings (512 context size| 768 dimensions)

👉 MetaKaggle Forum Data Jina-Small-Eng-V1 Embeddings (512 context size| 512 dimensions)

👉 MetaKaggle Forum Data Qwen2 Embeddings (2048 context size| 1536 dimensions)

👉 MetaKaggle Forum Data Stella Embeddings (2048 context size| 1024 dimensions)

Image

Generated with Bing Image Generator
MACHINE LEARNING HACKATHON
kaggle.com
zip
Updated Oct 31, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Shreya Halgeri (2023). MACHINE LEARNING HACKATHON [Dataset]. https://www.kaggle.com/datasets/shreyahalgeri/machine-learning-hackathon
Explore at:
zip(4874543 bytes)Available download formats
Dataset updated
Oct 31, 2023
Authors
Shreya Halgeri
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by Shreya Halgeri

Released under Apache 2.0

Contents
MetaKaggle Forum Data Jina-Small-Eng-V1 Embeddings
kaggle.com
zip
Updated Jun 4, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
BwandoWando (2025). MetaKaggle Forum Data Jina-Small-Eng-V1 Embeddings [Dataset]. https://www.kaggle.com/datasets/bwandowando/metakaggle-forum-data-jina-small-eng-v1-embeddings
Explore at:
zip(7290271023 bytes)Available download formats
Dataset updated
Jun 4, 2025
Authors
BwandoWando
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Context

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F1842206%2Fa052924b64c6ae6cd801dcc917067ea2%2F_ae937b93-d6fb-4985-b526-f6a31c2970c0-small.jpeg?generation=1749029365407793&alt=media" alt="">

These are jinaai/jina-embedding-s-en-v1 embeddings of the Meta Kaggle ForumTopics.csv and ForumMessages.csv

Intended purpose

This is a supplemental dataset for the Meta Kaggle Hackathon

How I preprocessed the text data

I removed html elements using BeautifulSoup

I replaced any URL value with a placeholder <url> value

I removed emojis and symbols

I replaced 1 or more carriage returns with just a single white space

jinaai/jina-embedding-s-en-v1 was set to 512 tokens context size and normalize_embeddings is set to true

Sample Data

The actual text data that I fed into the embedding model can be seen in this dataset

https://www.googleapis.com/download/storage/v1/b/kaggle-user-content/o/inbox%2F1842206%2F155f60ac3a36046a5d546283bad80368%2FScreenshot%20from%202025-06-03%2009-20-35.png?generation=1748913654315054&alt=media" alt="">

How to use

Download the original csvs from Meta Kaggle dataset so that you can see the original text values and compare it to the preprocessed values.

You can also just download the samples in the ./sample/*.parquet folder to see how the data looks like, before you download the whole dataset (23GB)

These are normalized embeddings that you can use with Cosine Similarity

See Related Datasets

👉 MetaKaggle Forum Data ALL-MINILM-L12-v2 Embeddings (256 context size| 384 dimensions)

👉 MetaKaggle Forum Data BAAI/bge-m3 Embeddings (2048 context size| 1024 dimensions)

👉 MetaKaggle Forum Data BGE BASE-EN v1.5 Embeddings (512 context size| 768 dimensions)

👉 MetaKaggle Forum Data Jina-Small-Eng-V1 Embeddings (512 context size| 512 dimensions)

👉 MetaKaggle Forum Data Qwen2 Embeddings (2048 context size| 1536 dimensions)

👉 MetaKaggle Forum Data Stella Embeddings (2048 context size| 1024 dimensions)

Image

Generated with Bing Image Generator
data-for-hackathon
kaggle.com
zip
Updated Oct 27, 2024
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
HoangTran223 (2024). data-for-hackathon [Dataset]. https://www.kaggle.com/datasets/hoangtran223/data-for-hackathon/code
Explore at:
zip(338 bytes)Available download formats
Dataset updated
Oct 27, 2024
Authors
HoangTran223
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by HoangTran223

Released under Apache 2.0

Contents
GenAI hackathon
kaggle.com
Updated Apr 16, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sagar Mondal (2025). GenAI hackathon [Dataset]. https://www.kaggle.com/datasets/phenomenalsagar/genai-hackathon
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Apr 16, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Sagar Mondal
License
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Description
Dataset

This dataset was created by Sagar Mondal

Released under MIT

Contents
Football Hackathon
kaggle.com
zip
Updated Jun 10, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Gaurav Dutta (2022). Football Hackathon [Dataset]. https://www.kaggle.com/datasets/gauravduttakiit/football-hackathon
Explore at:
zip(68510595 bytes)Available download formats
Dataset updated
Jun 10, 2022
Authors
Gaurav Dutta
Description
Dataset

This dataset was created by Gaurav Dutta

Contents
Amazon business analytics hackathon
kaggle.com
zip
Updated Sep 1, 2022
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sarth Mirashi (2022). Amazon business analytics hackathon [Dataset]. https://www.kaggle.com/datasets/sarthmirashi07/amazon-train
Explore at:
zip(1972072 bytes)Available download formats
Dataset updated
Sep 1, 2022
Authors
Sarth Mirashi
Description
Dataset

This dataset was created by Sarth Mirashi

Contents
Community Hackathon
kaggle.com
zip
Updated Oct 4, 2020
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Sahabudin Ali (2020). Community Hackathon [Dataset]. https://www.kaggle.com/sahabudin9/community-hackathon
Explore at:
zip(21795226 bytes)Available download formats
Dataset updated
Oct 4, 2020
Authors
Sahabudin Ali
Description
Dataset

This dataset was created by Sahabudin Ali

Contents
hackathons
kaggle.com
zip
Updated May 19, 2023
+ more versions
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Alexander Nolte (2023). hackathons [Dataset]. https://www.kaggle.com/datasets/alexandernolte/hackathons
Explore at:
zip(5531 bytes)Available download formats
Dataset updated
May 19, 2023
Authors
Alexander Nolte
Description
Dataset

This dataset was created by Alexander Nolte

Contents
Hackathon Competition
kaggle.com
zip
Updated Feb 20, 2021
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mariusz Bronowicki (2021). Hackathon Competition [Dataset]. https://www.kaggle.com/godzill22/hackathon-competittion
Explore at:
zip(5013870 bytes)Available download formats
Dataset updated
Feb 20, 2021
Authors
Mariusz Bronowicki
Description
Context

This dataset comes from Hackathon Competition: https://tournament.datacrunch.com/how-to-get-started

Content

What's inside is more than just rows and columns. Make it easy for others to get started by describing how you acquired the data and what time period it represents, too.

Acknowledgements

We wouldn't be here without the help of others. If you owe any attributions or thanks, include them here along with any citations of past research.

Inspiration

Your data will be in front of the world's largest data science community. What questions do you want to see answered?
H2O AI - AQI Hackathon
kaggle.com
zip
Updated Apr 8, 2023
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Rajat Ranjan (2023). H2O AI - AQI Hackathon [Dataset]. https://www.kaggle.com/datasets/rajatranjan/h2o-ai-aqi-hackathon
Explore at:
zip(591350 bytes)Available download formats
Dataset updated
Apr 8, 2023
Authors
Rajat Ranjan
Description
Dataset

This dataset was created by Rajat Ranjan

Contents
Hackathon
kaggle.com
Updated Jun 11, 2025
Share
Facebook
Twitter
Email
Click to copy link
Link copied
Cite
Mukuliitg (2025). Hackathon [Dataset]. https://www.kaggle.com/datasets/mukuliitg/hackathon/code
Explore at:
CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
Dataset updated
Jun 11, 2025
Dataset provided by
Kagglehttp://kaggle.com/
Authors
Mukuliitg
License
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Description
Dataset

This dataset was created by Mukuliitg

Released under Apache 2.0

Contents

Facebook

Twitter

Click to copy link

Link copied

Cite

Abhiraj Mandal (2024). Hackathon Dataset [Dataset]. https://www.kaggle.com/datasets/abhirajmandal/hackathon-dataset

Hackathon Dataset

Explore at:

zip(1271001 bytes)Available download formats

Dataset updated

Jun 17, 2024

Authors

Abhiraj Mandal

License

Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically

Description

Dataset

This dataset was created by Abhiraj Mandal

Released under Apache 2.0

Clear search

Close search

Google apps

Main menu

Hackathon Dataset

Dataset

Contents

Fraud Detection Hackathon

Game of Deep Learning: Computer Vision Hackathon

Meta kaggle Hackathon

Dataset

Contents

Kaggle events and collaborations

Context

Hackathon Participants Data

Hackathon-2025-Big-Data

Dataset

Contents

MetaKaggle Forum Data Stella Embeddings

Context

Intended purpose

How I preprocessed the text data

Sample Data

How to use

See Related Datasets

Image

MetaKaggle Forum Data Qwen2 Embeddings

Context

Intended purpose

How I preprocessed the text data

Sample Data

How to use

See Related Datasets

Image

MACHINE LEARNING HACKATHON

Dataset

Contents

MetaKaggle Forum Data Jina-Small-Eng-V1 Embeddings

Context

Intended purpose

How I preprocessed the text data

Sample Data

How to use

See Related Datasets

Image

data-for-hackathon

Dataset

Contents

GenAI hackathon

Dataset

Contents

Football Hackathon

Dataset

Contents

Amazon business analytics hackathon

Dataset

Contents

Community Hackathon

Dataset

Contents

hackathons

Dataset

Contents

Hackathon Competition

Context

Content

Acknowledgements

Inspiration

H2O AI - AQI Hackathon

Dataset

Contents

Hackathon

Dataset

Contents

Hackathon Dataset

Dataset

Contents