Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
This dataset provides detailed information on 1007 Pokémon from the popular mobile game Pokémon GO. It includes a wide range of attributes such as base stats, move sets, rarity, and acquisition methods. The data was collected using the RapidAPI Pokémon GO API, offering researchers and data enthusiasts a rich resource for analysis, machine learning projects, and game strategy development.
Key features of this dataset include:
This dataset is ideal for:
Whether you're a data scientist, game developer, or Pokémon enthusiast, this dataset offers a wealth of information to explore and analyze the world of Pokémon GO.
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
In the bustling world of Kanto, where Pokémon battles shape destinies, crime lurks in the shadows. Detective Kotso, the sharpest mind in Pokémon crime investigations, has been tasked with an urgent mission. The mayor suspects that Team Rocket has infiltrated the city, disguising themselves as ordinary citizens.
But Kotso doesn’t work alone—he relies on you, a brilliant data scientist, to uncover the truth. Your job? Analyze the data of 5,000 residents to predict which of the 1,000 unclassified individuals are secretly part of Team Rocket.
Can you spot the hidden patterns? Can Machine Learning crack the case where traditional detective work fails? The fate of Kanto depends on your skills.
This dataset holds the key to exposing Team Rocket’s operatives. Below is a breakdown of the features at your disposal:
Column Name | Description |
---|---|
ID | Unique identifier for each citizen |
Age | Age of the citizen |
City | City the citizen is from |
Economic Status | Low, Medium, High |
Occupation | Profession in the Pokémon world |
Most Frequent Pokémon Type | The type of Pokémon most frequently used |
Average Pokémon Level | Average level of owned Pokémon |
Criminal Record | Clean (0) or Dirty (1) |
Pokéball Usage | Preferred Pokéball type (e.g., DarkBall, UltraBall) |
Winning Percentage | Battle win rate (e.g., 64%, 88%) |
Gym Badges | Number of gym badges collected (0 to 8) |
Is Pokémon Champion | True if the citizen has defeated the Pokémon Elite Four |
Battle Strategy | Defensive, Aggressive, Unpredictable |
City Movement Frequency | Number of times the citizen moved between cities in the last year |
Possession of Rare Items | Yes or No |
Debts to the Kanto System | Amount of debt (e.g., 20,000) |
Charitable Activities | Yes or No |
Team Rocket Membership | Yes or No (target variable) |
This dataset is not just about numbers—it’s a criminal investigation. Hidden patterns lurk beneath the surface, waiting to be uncovered.
This isn’t just another classification task—it’s a race against time to stop Team Rocket before they take control of Kanto!
Detective Kotso is counting on you. Will you rise to the challenge? 🕵️♂️🔎
1️⃣ Do certain Pokémon types indicate suspicious behavior?
- 📈 Graph: Stacked bar chart comparing Pokémon type distribution between Rocket & non-Rocket members.
- 🎯 Test: Chi-square test for correlation.
2️⃣ Is economic status a reliable predictor of criminal affiliation?
- 📊 Graph: Box plot of debt and economic status per Team Rocket status.
- 🏦 Test: ANOVA test for group differences.
3️⃣ Do Team Rocket members have a preference for specific PokéBalls?
- 🎨 Graph: Heatmap of PokéBall usage vs. Team Rocket status.
- ⚡ Test: Chi-square test for independence.
4️⃣ Does a high battle win ratio correlate with Team Rocket membership?
- 📉 Graph: KDE plot of win ratio distribution for both classes.
- 🏆 Test: T-test for mean differences.
5️⃣ Are migration patterns different for Team Rocket members?
- 📈 Graph: Violin plot of migration counts per group.
- 🌍 Test: Mann-Whitney U test.
6️⃣ Do Rocket members tend to avoid charity participation?
- 📊 Graph: Grouped bar chart of charity participation rates.
- 🕵️♂️ Test: Fisher’s Exact Test for small sample sizes.
7️⃣ Do Rocket members disguise themselves in certain professions?
- 📊 Graph: Horizontal bar chart of profession frequency per group.
- 🕵️♂️ Test: Chi-square test for profession-Team Rocket relationship.
8️⃣ Is there an unusual cluster of Rocket members in specific cities?
- 🗺 Graph: Geographic heatmap of city distributions.
- 📌 Test: Spatial autocorrelation test.
9️⃣ How does badge count affect the likelihood of being a Rocket member?
- 📉 Graph: Histogram of gym badge distributions.
- 🏅 Test: Kruskal-Wallis test.
🔟 **Are there any multi-feature interactions that reve...
This dataset provides detailed information for Pokémon from 1 to 1045, as listed in the National Pokédex. It includes fundamental Pokédex entries such as their names, types, and physical attributes, alongside more in-depth data like move sets, type effectiveness, abilities with full descriptions, and battle strategies sourced from Smogon. Additionally, the dataset contains brief descriptions from Bulbapedia. A distinct text corpus file is also included, offering a textual representation for each Pokémon, compiled from all the details present in the main Pokédex file.
The main Pokémon file features 56 columns, providing extensive details for each creature. Key columns include: * pokédex number: The official National Pokédex identification number. * name: The English name of the Pokémon. * japanese name: The Japanese name of the Pokémon. * generation: The generation number the Pokémon originates from. * status: Indicates if the Pokémon is Legendary. * species: The specific species of the Pokémon. * type number: How many elemental types the Pokémon possesses. * type 1: The primary elemental type. * type 2: The secondary elemental type, if applicable. * height: The Pokémon's height in metres. * weight: The Pokémon's weight in kilograms. * abilities number: The count of abilities it can have. * total points: The sum of all base stats. * stats: Individual columns for key battle statistics: HP, attack, defence, special attack, special defence, and speed. * catch rate: The Pokémon's catch rate. * base friendship: The base friendship value. * base experience: The base experience yield. * growth rate: The growth rate category. * egg type number: The number of egg groups it belongs to. * egg type 1: The primary egg group. * egg type 2: The secondary egg group, if applicable. * percentage male: The likelihood of the Pokémon being male. * egg cycles: The number of steps required to hatch an egg. * type effectiveness: Columns detailing effectiveness against various types (e.g., normal, fire, water, grass, electric, flying, ground, rock, fighting, psychic, dark, ghost, dragon, ice, fairy, poison, bug, steel). * Smogon description: Battle strategies primarily from SM Pokédex, or other generations if more relevant. * Bulba description: Initial sentences from the Pokémon's Bulbapedia page. * moves: A dictionary detailing moves the Pokémon learns by levelling up, including name, type, damage type, power, accuracy, PP, level learned, secondary effect chance, and description. * ability 1, ability 2, hidden ability: The names of the Pokémon's abilities. * ability 1 description, ability 2 description, hidden ability description: Descriptions for each of the Pokémon's abilities.
The accompanying Poké corpus file contains a text corpus for each Pokémon, generated by consolidating all the information from the Pokédex file.
This dataset encompasses information for Pokémon numbered 1 through 1045. The primary Pokémon data file contains 56 distinct columns for each entry. While specific row counts are not provided, there are 1045 unique Pokémon entries detailed. Data files are typically provided in CSV format.
This dataset is ideally suited for a variety of applications, particularly in the fields of artificial intelligence, machine learning, and data analysis related to gaming and entertainment. * Building AI Chatbots: Useful for creating conversational agents, such as a Pokémon chatbot, through retrieval-augmented generation (RAG) pipelines. * Game Development: Provides extensive data for developers creating Pokémon-inspired games or applications. * Data Analysis: Researchers and enthusiasts can analyse Pokémon stats, moves, and abilities for competitive strategy or general insights. * Natural Language Processing (NLP): The text corpus can be used for text generation, entity recognition, and other NLP tasks related to Pokémon lore.
The dataset covers Pokémon from number 1 to 1045 in the National Pokédex. Its scope is global, providing information relevant to all regions where Pokémon are known. There are no specific notes on data availability for certain groups or years beyond the stated Pokédex range.
CC BY-SA
PokemonGo is a mobile augmented reality game developed by Niantic inc. for iOS, Android, and Apple Watch devices. It was initially released in selected countries in July 2016. In the game, players use a mobile device's GPS capability to locate, capture, battle, and train virtual creatures, called Pokémon, who appear on the screen as if they were in the same real-world location as the player.
Dataset consists of roughly 293,000 pokemon sightings (historical appearances of Pokemon), having coordinates, time, weather, population density, distance to pokestops/ gyms etc. as features. The target is to train a machine learning algorithm so that it can predict where pokemon appear in future. So, can you predict'em all?)
yyyy-mm-dd'T'hh-mm-ss.ms'Z'
(nominal)All pokemon sightings (in JSON file, without features) can be found in Discussion "Datadump"
MIT Licensehttps://opensource.org/licenses/MIT
License information was derived automatically
Step into the Golden Era of Pokémon and relive the magic from Kanto to Hoenn — the first three generations that defined childhoods, sparked rivalries, and ignited a global phenomenon. Whether you started with Bulbasaur, bonded with Cyndaquil, or battled through Hoenn with Mudkip, this dataset is a tribute to the most iconic 386 Pokémon that shaped an era.
As a lifelong Pokémon fan and a data enthusiast, I created this dataset to bridge nostalgia with numbers, giving fellow data scientists, analysts, and fans a structured, rich, and clean dataset that’s more than just stats, it's a journey through the classics.
A complete profile for 386 Pokémon from Gen I–III (Kanto, Johto, Hoenn) featuring:
Battle Stats
Base HP
: Health PointsBase Attack
: Physical attack strengthBase Defense
: Resistance to physical attacksBase Special Attack
: Strength of special moves (e.g., Flamethrower)Base Special Defense
: Resistance to special attacksBase Speed
: Determines move order in battleTotal Base Stats
: Sum of all base stats (useful for comparing overall strength)Type Information
Type 1
: Primary elemental type (e.g., Fire, Water, Grass)Type 2
: Secondary type if applicable (e.g., Flying, Poison)Evolution Chain
Evolution Stage
: Stage in the evolution line (1 = basic, 2 = mid, 3 = final)Hidden Ability
: Special ability that’s not always visible or available by defaultAbilities
Breeding Data
Gender Ratio
: Distribution of male/female genders (or Genderless)Catch Method
: How the Pokémon is typically acquired (wild, starter, evolution, etc.)Base Friendship
: Starting friendship level, used in evolution or move effectivenessCapture Mechanics
Capture Rate
: Integer from 3–255, higher means easier to catchIs Legendary
: TRUE/FALSE flag to identify rare, one-of-a-kind PokémonVisual Traits (Great for Clustering & Modeling)
Color
: UI categorization from Pokédex (used in official classifications)Shape
: Visual silhouette group (bipedal, quadruped, etc.)Height (m)
: Official height in metersWeight (kg)
: Official weight in kilogramsMeta Info
ID
: Unique internal identifier (can be dropped in most analyses)Name
: Pokémon's English nameGeneration
: Number indicating the generation (1 = Kanto, etc.)Region
: Region the Pokémon originated from (Kanto, Johto, Hoenn)Number
: National Pokédex number (missing in a few cases, may need fixing)Data compiled from trusted community resources:
If you're a fan of Pokémon and data, I hope this brings you joy and insights. Let’s catch 'em all — one data point at a time.
Not seeing a result you expected?
Learn how you can add new datasets to our index.
Attribution-ShareAlike 4.0 (CC BY-SA 4.0)https://creativecommons.org/licenses/by-sa/4.0/
License information was derived automatically
This dataset provides detailed information on 1007 Pokémon from the popular mobile game Pokémon GO. It includes a wide range of attributes such as base stats, move sets, rarity, and acquisition methods. The data was collected using the RapidAPI Pokémon GO API, offering researchers and data enthusiasts a rich resource for analysis, machine learning projects, and game strategy development.
Key features of this dataset include:
This dataset is ideal for:
Whether you're a data scientist, game developer, or Pokémon enthusiast, this dataset offers a wealth of information to explore and analyze the world of Pokémon GO.