Machine Learning. Pipeline.

A machine learning pipeline is the end-to-end process that runs from data processing through to model prediction. It can be viewed as a sequence of steps (e.g. data imputation, data scaling, encoding of categorical values, hyperparameter tuning, model selection) that can be broken down into smaller ‘components’ or functions. […]
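
As a rough illustration of the "sequence of components" idea, the sketch below chains two small preprocessing functions in plain Python. The names `impute_mean`, `standardise` and `make_pipeline` are hypothetical helpers made up for this example, not part of any library, and the input is a single toy feature column.

```python
from functools import reduce
from statistics import mean, pstdev


def impute_mean(column):
    """Replace missing values (None) with the mean of the observed values."""
    observed = [x for x in column if x is not None]
    fill = mean(observed)
    return [fill if x is None else x for x in column]


def standardise(column):
    """Scale to zero mean and unit population standard deviation.

    Assumes the column is not constant (otherwise sigma would be 0).
    """
    mu, sigma = mean(column), pstdev(column)
    return [(x - mu) / sigma for x in column]


def make_pipeline(*steps):
    """Chain steps so that the output of each one feeds the next."""
    return lambda data: reduce(lambda acc, step: step(acc), steps, data)


# Hypothetical toy column with one missing value.
preprocess = make_pipeline(impute_mean, standardise)
scaled = preprocess([1.0, None, 3.0])
print(scaled)
```

Each step is an ordinary function, so a step can be swapped, reordered or tested on its own without touching the rest of the chain. Libraries such as scikit-learn offer a similar idea through their own pipeline objects.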

Machine Learning. Classification with kNN.

This post looks at using the k-Nearest Neighbour (kNN) algorithm for a classification problem. Suppose a new data point needs to be classified into one of the known categories present in the data. The algorithm looks at the k closest points (where k is an integer specified by […]
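
To make the idea of "looking at the k closest points" concrete, here is a minimal sketch in plain Python. It assumes the usual formulation: Euclidean distance, and the new point takes the most common label among its k nearest neighbours. The function name `knn_classify` and the toy data are made up for illustration.

```python
from collections import Counter
from math import dist  # Euclidean distance, Python 3.8+


def knn_classify(train_points, train_labels, query, k=3):
    """Return the most common label among the k training points nearest to query."""
    # Pair each training label with its distance to the query, nearest first.
    distances = sorted(
        (dist(point, query), label)
        for point, label in zip(train_points, train_labels)
    )
    k_nearest_labels = [label for _, label in distances[:k]]
    # Majority vote over the k nearest labels (assumed decision rule).
    return Counter(k_nearest_labels).most_common(1)[0][0]


# Hypothetical toy data: two well-separated groups in 2D.
points = [(1.0, 1.0), (1.5, 2.0), (2.0, 1.0), (6.0, 6.0), (7.0, 6.5), (6.5, 7.0)]
labels = ["A", "A", "A", "B", "B", "B"]

prediction = knn_classify(points, labels, query=(2.0, 2.0), k=3)
print(prediction)
```

An odd k is a common choice for two-class problems because it avoids tied votes. In practice the features are usually scaled first, since distances are sensitive to the units of each feature.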