Statistical Analysis of Penguin Dataset
tatistical Analysis of Penguin Body Mass Using One-Way ANOVA
Introduction:
In this analysis, we examine the body mass data of three penguin species: Adelie, Chinstrap, and
Gentoo. The goal is to determine whether there is a statistically significant difference in the mean
body mass among these species.
Dataset Overview:
Total records: 344
Records after removing missing values: 334
Features used:
species: Species of penguin
body_mass_g: Body mass in grams
Preprocessing Steps:
Rows with missing values were removed.
The remaining dataset includes the following species:
o Adelie
o Chinstrap
o Gentoo
Statistical Test:
A one-way ANOVA test was conducted to compare the mean body mass between the three
species.
Results of ANOVA:
F-statistic: 343.64
P-value: 1.69e-81
Interpretation:
Since the p-value is much less than 0.05, we reject the null hypothesis. This indicates that there is
a statistically significant difference in the average body mass among the three penguin species.
Visualization:
The boxplot below shows the distribution of body mass for each species:
Conclusion:
The one-way ANOVA analysis reveals that the mean body mass differs significantly between the
penguin species. This insight can be useful for ecological studies, species classification, or
understanding the adaptation of each species to its environment.