mochi-skz/twt-kaggle-data dataset hosted on Hugging Face and contributed by the HF Datasets community
Gholamreza/test-dataset-kaggle dataset hosted on Hugging Face and contributed by the HF Datasets community
kaggle-map/data dataset hosted on Hugging Face and contributed by the HF Datasets community
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
This dataset was created by Mohammad Reza Mashoufi
Released under MIT
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Juliekyungyoon/plant-kaggle-seg-data dataset hosted on Hugging Face and contributed by the HF Datasets community
This dataset was created by HinePo
Kaggle toxic dataset annotated with gpt-4o-mini with the same prompt used to annotate Toxic-Commons Celadon
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
This dataset was created by ahmadreza rostamani
Released under Apache 2.0
https://choosealicense.com/licenses/odbl/https://choosealicense.com/licenses/odbl/
Date: 2022-07-10 Files: ner_dataset.csv Source: Kaggle entity annotated corpus notes: The dataset only contains the tokens and ner tag labels. Labels are uppercase.
About Dataset
from Kaggle Datasets
Context
Annotated Corpus for Named Entity Recognition using GMB(Groningen Meaning Bank) corpus for entity classification with enhanced and popular features by Natural Language Processing applied to the data set. Tip: Use Pandas Dataframe to load dataset if using Python forโฆ See the full description on the dataset page: https://huggingface.co/datasets/rjac/kaggle-entity-annotated-corpus-ner-dataset.
Apache License, v2.0https://www.apache.org/licenses/LICENSE-2.0
License information was derived automatically
Personality Dataset
Essays https://huggingface.co/datasets/jingjietan/essays-big5 MBTI https://huggingface.co/datasets/jingjietan/kaggle-mbti Pandora https://huggingface.co/datasets/jingjietan/pandora-big5 Please contact jingjietan.com for another dataset. Cite: @software{jingjietan-apr-dataset, author = {Jing Jie, Tan}, title = {Personality Kaggle Dataset Splitting}, url = {https://huggingface.co/datasets/jingjietan/kaggle-mbti}, version = {1.0.0}, year = {2024} }
This dataset was created by Inhoi
GitHub Issues & Kaggle Notebooks
Description
GitHub Issues & Kaggle Notebooks is a collection of two code datasets intended for language models training, they are sourced from GitHub issues and notebooks in Kaggle platform. These datasets are a modified part of the StarCoder2 model training corpus, precisely the bigcode/StarCoder2-Extras dataset. We reformat the samples to remove StarCoder2's special tokens and use natural text to delimit comments in issues and displayโฆ See the full description on the dataset page: https://huggingface.co/datasets/HuggingFaceTB/issues-kaggle-notebooks.
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
This dataset provides a comprehensive overview of online sales transactions across different product categories. Each row represents a single transaction with detailed information such as the order ID, date, category, product name, quantity sold, unit price, total price, region, and payment method.
babs/kinyarwada-kaggle dataset hosted on Hugging Face and contributed by the HF Datasets community
https://choosealicense.com/licenses/pddl/https://choosealicense.com/licenses/pddl/
Dataset Card for MergedDataset
Dataset Summary
Supported Tasks and Leaderboards
[More Information Needed]
Languages
[More Information Needed]
Dataset Structure
Data Instances
[More Information Needed]
Data Fields
[More Information Needed]
Data Splits
[More Information Needed]
Dataset Creation
Curation Rationale
[More Information Needed]
Source Data
Initial Dataโฆ See the full description on the dataset page: https://huggingface.co/datasets/ahmadkhan1022/kaggle.
sflagg/Kaggle-Mental-Health-Survey-Data dataset hosted on Hugging Face and contributed by the HF Datasets community
gayanin/kaggle-native dataset hosted on Hugging Face and contributed by the HF Datasets community
https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
The dataset contains the Timezone for 90 countries.
Column Description:-
Source Timezone - This contains the information such as name of the country, time zone & its associated information. We will be working mostly with this column.
I have uploaded this dataset as many a times we face problem when we work with datetime, timezone. Please make use of this dataste & learn from it , share with your peers to grow our community.
gayanin/kaggle-native-v8-vocab-noised dataset hosted on Hugging Face and contributed by the HF Datasets community
kaggle-aimo/aime_filtered dataset hosted on Hugging Face and contributed by the HF Datasets community
mochi-skz/twt-kaggle-data dataset hosted on Hugging Face and contributed by the HF Datasets community