How the Poisson Distribution Can Help Predict Video Game Combat: A Toy Example
The Poisson distribution is a powerful tool that allows us to understand the probabilities associated with X number of events occurring…
Feb 21, 2023

Published in Towards Data Science
A Case for Heuristics: Why Simple Solutions Often Win in Data Science
In this defence of heuristics, I examine how simple solutions can often be the best port of call when looking to ship data science…
Nov 23, 2022

Published in Towards Data Science
Quantum Deep Learning: A Quick Guide to Quantum Convolutional Neural Networks
Everything you need to know about quantum convolutional neural networks (QCNNs), including the benefits and limitations of these…
Oct 4, 2022

Published in Towards Data Science
Are you Scared, VADER? Understanding how NLP Pre-Processing Impacts VADER Scoring
Why common pre-processing activities can actually harm the power of VADER and why it’s important to consider your NLP pipeline carefully
Jul 18, 2021

Kaplan-Meier Survival Analysis in Python
Survival analysis is a relatively under-utilised range of statistical methods that are highly applicable in a range of fields including…
Jan 3, 2021

Published in Towards Data Science
Easily Query ORC Data in Python with PySpark
Optimized Row Columnar, or ORC, is a column-oriented data storage format that is part of the Apache Hadoop family. While ORC files and…
Aug 12, 2019

Published in HackerNoon.com
Under the Hood of AdaBoost
A short introduction to the AdaBoost algorithm
Jan 7, 2019

Published in Coinmonks
Creating factors and re-ordering factor levels in R
When working with data in R, you might want to convert numeric values into factors for easier exploratory data analysis or model building…
Oct 30, 2018

Published in Towards Data Science
Build & Deploy Data Science Projects at Lightspeed
(or thereabouts).
Sep 3, 2018