ggplot2 style plotting in Python

R is my language of choice for data science but a good data scientist should have some knowledge of all of the great tools available to them. Recently, I have been gleefully using Python for machine learning problems (specifically pandas and the wonderful scikit-learn). However, for all its greatness, I couldn't help but feel it… Continue reading ggplot2 style plotting in Python

Naive Bayes Classification in R (Part 2)

Following on from Part 1 of this two-part post, I would now like to explain how the Naive Bayes classifier works before applying it to a classification problem involving breast cancer data. The dataset is sourced from Matjaz Zwitter and Milan Soklic from the Institute of Oncology, University Medical Center in Ljubljana, Slovenia (formerly Yugoslavia) and… Continue reading Naive Bayes Classification in R (Part 2)

Naive Bayes Classification in R (Part 1)

Introduction A very useful machine learning method which, for its simplicity, is incredibly successful in many real world applications is the Naive Bayes classifier. I am currently taking a machine learning module as part of my data science college course and this week's practical work involved a classification problem using the Naive Bayes method. I… Continue reading Naive Bayes Classification in R (Part 1)

Predicting the Willingen 2017 men’s ski jumping competition

In an earlier post of mine, I carried out an analysis on ski jumping data for Zakopane, Poland and attempted to predict which athletes would end up on the podium. I also created a classification tree and tested it on the 2017 competition data with good results. For this side project of mine, I hope… Continue reading Predicting the Willingen 2017 men’s ski jumping competition

Predicting the Zakopane 2017 men’s ski jumping competition

When I was a young boy with a wild imagination, I used to try my hand at numerous sports ranging from tennis to gaelic footbal to soccer, each with varying degrees of success. Living in the countryside throughout my childhood, a big garden allowed me to construct vivid simulations of soccer championships (crowd and all)… Continue reading Predicting the Zakopane 2017 men’s ski jumping competition

An Analysis of Global Shark Attacks between 1543-2016

Travelling to a warm country and plan on hitting the beach? "Beware of the sharks!!" While the expression has almost become a cliché at this stage, shark attacks do occur and the fact that the global ocean is heating up potentially facilitates the occurrence of attacks (I would refer to them as interactions but we… Continue reading An Analysis of Global Shark Attacks between 1543-2016

The other carbon dioxide problem

Researchers say ocean acidification may have dramatic effects on phytoplankton. Edit section Phytoplankton, tiny plants which live in the ocean, form the foundation of the marine food web and play an important role in the regulation of Earth's climate. These wonderful organisms create their own food using sunlight, nutrients and carbon dioxide drawn down from… Continue reading The other carbon dioxide problem