Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
### Introduction
Hello. My name is Brandon Conrady and I am currently early on in my data science studies in college. This is my first data set, so enjoy!
### Context
I am currently taking a statistics course and this got me curious as to finding distributions from samples gathered in my day to day life. Since I play video games, I turned to Minecraft. For those who don't know, Minecraft has a block called the composter which allows you to input an item such as wheat. The item disappears, and has a percent chance of raising the compost level within the composter. When the compost level reaches 7, it creates another item called bone meal, which can act as fertilizer to grow plants. I wanted to collect this data and throw it onto Kaggle to see what people could come up with using it.
### Content
Each csv file contains samples from when the item specified was used on the composter. Most contain 2000 entries. However, the cookies dataset contains 3000 since it is more efficient at creating bone meal. I may update to add further entries to each csv file, but seeing as the current data already approximates a distribution I am currently unsure if any more entries would be useful.
### Acknowledgements
Minecraft is the intellectual property of Microsoft, although the datasets themselves don't involve any direct usage of the product itself, rather records of observations gathered playing the game. However I should state the obvious that I don't own the game itself.
### Inspiration
I wanted to see if, based on the data provided, people could estimate the probability that for a given item, adding one of it to the composter will raise the compost level. I am also just generally curious as to what applications people can come up with given the data provided. By all means take it and run with it!
Facebook
Twitterhttps://choosealicense.com/licenses/cc0-1.0/https://choosealicense.com/licenses/cc0-1.0/
Minecraft Server Chat
Important Info: This dataset contains swears. I filtered out as much racism as possible. People who were racist were banned from the server. I am not affiliated with the server in any way. A collection of 2,000,000 messages said across two years in a minecraft server. The minecraft semi-anarchy server logged all of its messages to discord between 2020 and 2023. I downloaded all of them and made them into a json in chronological order. I also cleaned the… See the full description on the dataset page: https://huggingface.co/datasets/declip/Minecraft-Server-Chat.
Facebook
Twitterhttps://cdla.io/sharing-1-0/https://cdla.io/sharing-1-0/
Indonesian Chat Dataset, including around 10,702 meticulously edited chats among users of Roblox and Minecraft. Certain chats use conventional terminology, while others utilize colloquial expressions. Slang phrases emerged because younger gamers often incorporate them into their regular conversations. The author personally categorizes the chats under four classifications: neutral, violent, racist, and harassing.
Classification details: Neutral: no violent sentences, casual chat without any means to harm someone Violence: swearing sentences, threats, incitement to harm others, or associating people with some animal or creature Racist: discriminate against or demean people based on race, religion, ethnicity, or nationality, including slurs, hate speech, or promoting racial superiority Harassment: porn, sexual abuse words, body shaming, derogation
An example of the dataset's content and the preprocessing methodology is the sentence: “Pada bisa diem ga sih, 4nj1n9 semua,” which translates to “Can you shut up, all of you are dogs,” where numbers substitute certain letters in the word 'dog,' categorized as 'violence.' The preprocessing procedure involves converting the phrase to lowercase, normalizing it by eliminating punctuation, and substituting some numerals with their nearest alphabetic equivalents, for as replacing the numeral 4 with 'a' and 1 with 'i'. The preprocessed sentence is: "pada bisa diem ga sih anjing semua."
Facebook
Twitterhttp://www.gnu.org/licenses/old-licenses/gpl-2.0.en.htmlhttp://www.gnu.org/licenses/old-licenses/gpl-2.0.en.html
After learning some popular image generative algorithms (DCGANs, WGANs, CGANs), I tried to take a shot on a dataset created by me. However, because of the size of this dataset, the results weren't as good. That's why I made this dataset available to everyone as a reminder that, even after learning a lot, you can still learn from people who have a lot more experience than you. (And because Minecraft!)
The dataset has 300+ front faces of popular minecraft skins from famous youtubers. All of these images are in .png format with the same image size (190*190)
This dataset was inspired by other minecraft skins datasets available in Kaggle. However, they include the whole skin, which, personally, isn't as good looking as only the front face of the Minecraft skin.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Facebook
Twitterhttps://creativecommons.org/publicdomain/zero/1.0/https://creativecommons.org/publicdomain/zero/1.0/
### Introduction
Hello. My name is Brandon Conrady and I am currently early on in my data science studies in college. This is my first data set, so enjoy!
### Context
I am currently taking a statistics course and this got me curious as to finding distributions from samples gathered in my day to day life. Since I play video games, I turned to Minecraft. For those who don't know, Minecraft has a block called the composter which allows you to input an item such as wheat. The item disappears, and has a percent chance of raising the compost level within the composter. When the compost level reaches 7, it creates another item called bone meal, which can act as fertilizer to grow plants. I wanted to collect this data and throw it onto Kaggle to see what people could come up with using it.
### Content
Each csv file contains samples from when the item specified was used on the composter. Most contain 2000 entries. However, the cookies dataset contains 3000 since it is more efficient at creating bone meal. I may update to add further entries to each csv file, but seeing as the current data already approximates a distribution I am currently unsure if any more entries would be useful.
### Acknowledgements
Minecraft is the intellectual property of Microsoft, although the datasets themselves don't involve any direct usage of the product itself, rather records of observations gathered playing the game. However I should state the obvious that I don't own the game itself.
### Inspiration
I wanted to see if, based on the data provided, people could estimate the probability that for a given item, adding one of it to the composter will raise the compost level. I am also just generally curious as to what applications people can come up with given the data provided. By all means take it and run with it!