Pokemon Analysis

1. Introduction

The following analysis uses a dataset created by Mario Tormo Romero and can be found here. The dataset provides a list of the 890 known Pokemon until 8th Generation, their varieties and various features.

We will perform an exploratory analysis and gather some insights based on our Pokemon knowledge. We will look at the number and types of Pokemon, their heights and weights, and their battle stats. We start by removing the columns we're not interested in.

2. Pokemon through the Generations

We first see the number of Pokemon that were added in each generation. Generation I pokemon were released in the 1990s and the latest generation (VIII) were released in 2019-2020.

Note that the cumulative count here is more than the 890 unique Pokemons as pokemon like Meowth and Alalon Meowth count as 2 varieties under the same Pokedex number.

We also see a distribution of the types of Pokemon and note that Water-type pokemon are most common (13% of all pokemon) followed by Normal-type and Grass-type pokemon.

3. Pokemon BMI (Body Mass Index)

Next we will calculate the BMI for various Pokemon types.

Review of Heights and Weights

Before we do that, let's compare the heights and weights with a simple scatter plot to see if we can find any correlation between the two for Pokemons. Generally speaking, for humans, the taller you are, the heavier you're expected to be.

Review Outliers

We notice the positive corelation, which can be expected but we also observe some extremes values of weight - particularly one with height close to 0m but weight at 1000kg. Upon closer review of the entry, we find it's for Cosmoem which is a ProtoStar pokemon, which makes sense since a star is supposed to be extremely dense. Note the Type for this pokemon is Psychic.

BMI

As expected, the BMI for Psychic type is an outlier and we exclude it to get a better view of the BMIs of the other pokemon types. We observe Steel and Rock types of Pokemon have higher BMIs and Fairy type Pokemons have the lowest BMI.

4. Pokemon Battle Stats

Now let's look at the battle stats like HP, Attack, Defense, Special Attack, Special Defense and Speed.

Range and Corelation

We first look at a whisker plot for a quick analysis of the range of each of the stats. We also look at the correlation of each field with the others. We note that while all battle stats have a positive correlation, it is weak to moderate at best. The highest correlation (0.54) is observed between Defense and Special Defense while Speed and Defense are not correlated at all.

Max Battle Stats

Next we look at which Pokemon is strongest for each battle stat. When we pick the record with the Max value, we observe that Blissey has the highest HP and that's why it's the most popular choice for defending gyms.

5. Conclusion

We did an exploratory analysis of the Pokemon data and figured out the number of pokemons through the generations and determined that Water-type pokemon are the most popular. We found that while Pokemon BMI is generally consistent with human BMI, there are outliers like Cosmoen which are observed due to the nature of certain pokemon like those based on proton stars. We also looked at the battle stats and found that Blissey rightfully deserves its tag as the best gym defender due to it's high HP.