4 datasets found
  1. Minecraft Composting Dataset

    • kaggle.com
    zip
    Updated Nov 27, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Brandon Conrady (2021). Minecraft Composting Dataset [Dataset]. https://www.kaggle.com/datasets/brandonconrady/minecraft-composting-dataset
    Explore at:
    zip(42528 bytes)Available download formats
    Dataset updated
    Nov 27, 2021
    Authors
    Brandon Conrady
    License

    https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

    Description

    ### Introduction

    Hello. My name is Brandon Conrady and I am currently early on in my data science studies in college. This is my first data set, so enjoy!

    ### Context

    I am currently taking a statistics course and this got me curious as to finding distributions from samples gathered in my day to day life. Since I play video games, I turned to Minecraft. For those who don't know, Minecraft has a block called the composter which allows you to input an item such as wheat. The item disappears, and has a percent chance of raising the compost level within the composter. When the compost level reaches 7, it creates another item called bone meal, which can act as fertilizer to grow plants. I wanted to collect this data and throw it onto Kaggle to see what people could come up with using it.

    ### Content

    Each csv file contains samples from when the item specified was used on the composter. Most contain 2000 entries. However, the cookies dataset contains 3000 since it is more efficient at creating bone meal. I may update to add further entries to each csv file, but seeing as the current data already approximates a distribution I am currently unsure if any more entries would be useful.

    ### Acknowledgements

    Minecraft is the intellectual property of Microsoft, although the datasets themselves don't involve any direct usage of the product itself, rather records of observations gathered playing the game. However I should state the obvious that I don't own the game itself.

    ### Inspiration

    I wanted to see if, based on the data provided, people could estimate the probability that for a given item, adding one of it to the composter will raise the compost level. I am also just generally curious as to what applications people can come up with given the data provided. By all means take it and run with it!

  2. h

    Minecraft-Server-Chat

    • huggingface.co
    Updated Sep 2, 2020
    + more versions
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jackson (2020). Minecraft-Server-Chat [Dataset]. https://huggingface.co/datasets/declip/Minecraft-Server-Chat
    Explore at:
    CroissantCroissant is a format for machine-learning datasets. Learn more about this at mlcommons.org/croissant.
    Dataset updated
    Sep 2, 2020
    Authors
    Jackson
    License

    https://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/

    Description

    Minecraft Server Chat

    Important Info: This dataset contains swears. I filtered out as much racism as possible. People who were racist were banned from the server. I am not affiliated with the server in any way. A collection of 2,000,000 messages said across two years in a minecraft server. The minecraft semi-anarchy server logged all of its messages to discord between 2020 and 2023. I downloaded all of them and made them into a json in chronological order. I also cleaned the… See the full description on the dataset page: https://huggingface.co/datasets/declip/Minecraft-Server-Chat.

  3. Indonesian Chat Dataset

    • kaggle.com
    zip
    Updated May 21, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Jasson Prestiliano (2025). Indonesian Chat Dataset [Dataset]. https://www.kaggle.com/datasets/jprestiliano/indonesian-chat-dataset/code
    Explore at:
    zip(335131 bytes)Available download formats
    Dataset updated
    May 21, 2025
    Authors
    Jasson Prestiliano
    License

    https://cdla.io/sharing-1-0/https://cdla.io/sharing-1-0/

    Description

    Indonesian Chat Dataset, including around 10,702 meticulously edited chats among users of Roblox and Minecraft. Certain chats use conventional terminology, while others utilize colloquial expressions. Slang phrases emerged because younger gamers often incorporate them into their regular conversations. The author personally categorizes the chats under four classifications: neutral, violent, racist, and harassing.

    Classification details: Neutral: no violent sentences, casual chat without any means to harm someone Violence: swearing sentences, threats, incitement to harm others, or associating people with some animal or creature Racist: discriminate against or demean people based on race, religion, ethnicity, or nationality, including slurs, hate speech, or promoting racial superiority Harassment: porn, sexual abuse words, body shaming, derogation

    An example of the dataset's content and the preprocessing methodology is the sentence: “Pada bisa diem ga sih, 4nj1n9 semua,” which translates to “Can you shut up, all of you are dogs,” where numbers substitute certain letters in the word 'dog,' categorized as 'violence.' The preprocessing procedure involves converting the phrase to lowercase, normalizing it by eliminating punctuation, and substituting some numerals with their nearest alphabetic equivalents, for as replacing the numeral 4 with 'a' and 1 with 'i'. The preprocessed sentence is: "pada bisa diem ga sih anjing semua."

  4. Minecraft Player Faces

    • kaggle.com
    zip
    Updated Mar 21, 2021
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Sebastian Ponce (2021). Minecraft Player Faces [Dataset]. https://www.kaggle.com/sebastianponce/minecraft-player-faces
    Explore at:
    zip(192263 bytes)Available download formats
    Dataset updated
    Mar 21, 2021
    Authors
    Sebastian Ponce
    License

    http://www.gnu.org/licenses/old-licenses/gpl-2.0.en.htmlhttp://www.gnu.org/licenses/old-licenses/gpl-2.0.en.html

    Description

    Why?

    After learning some popular image generative algorithms (DCGANs, WGANs, CGANs), I tried to take a shot on a dataset created by me. However, because of the size of this dataset, the results weren't as good. That's why I made this dataset available to everyone as a reminder that, even after learning a lot, you can still learn from people who have a lot more experience than you. (And because Minecraft!)

    Content

    The dataset has 300+ front faces of popular minecraft skins from famous youtubers. All of these images are in .png format with the same image size (190*190)

    Inspiration

    This dataset was inspired by other minecraft skins datasets available in Kaggle. However, they include the whole skin, which, personally, isn't as good looking as only the front face of the Minecraft skin.

  5. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Brandon Conrady (2021). Minecraft Composting Dataset [Dataset]. https://www.kaggle.com/datasets/brandonconrady/minecraft-composting-dataset
Organization logo

Minecraft Composting Dataset

A dataset recording items used in a composter per bone meal created.

Explore at:
zip(42528 bytes)Available download formats
Dataset updated
Nov 27, 2021
Authors
Brandon Conrady
License

https://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/

Description

### Introduction

Hello. My name is Brandon Conrady and I am currently early on in my data science studies in college. This is my first data set, so enjoy!

### Context

I am currently taking a statistics course and this got me curious as to finding distributions from samples gathered in my day to day life. Since I play video games, I turned to Minecraft. For those who don't know, Minecraft has a block called the composter which allows you to input an item such as wheat. The item disappears, and has a percent chance of raising the compost level within the composter. When the compost level reaches 7, it creates another item called bone meal, which can act as fertilizer to grow plants. I wanted to collect this data and throw it onto Kaggle to see what people could come up with using it.

### Content

Each csv file contains samples from when the item specified was used on the composter. Most contain 2000 entries. However, the cookies dataset contains 3000 since it is more efficient at creating bone meal. I may update to add further entries to each csv file, but seeing as the current data already approximates a distribution I am currently unsure if any more entries would be useful.

### Acknowledgements

Minecraft is the intellectual property of Microsoft, although the datasets themselves don't involve any direct usage of the product itself, rather records of observations gathered playing the game. However I should state the obvious that I don't own the game itself.

### Inspiration

I wanted to see if, based on the data provided, people could estimate the probability that for a given item, adding one of it to the composter will raise the compost level. I am also just generally curious as to what applications people can come up with given the data provided. By all means take it and run with it!

Search
Clear search
Close search
Google apps
Main menu