Project - Building a Trainer and a Binary Classifier Based on the Naive Bayes Algorithm

This solution was attempted with 3 major aims:
a. Generalize the Naive Bayes algorithm so that it applies to any binary classification task, not just spam SMS detection, using any training dataset in the minimal format ['Label', 'Information'].
b. Reduce the code to just two code cells: one containing the 'training algorithm' and the other the 'classifier algorithm'. The notebook also includes explanations and secondary tasks, but the solution rests on these two algorithms alone, which can handle binary classification tasks other than SMS classification without any additional code.
c. Unit test the code, since a number of complex calculations needed to be verified for accuracy.
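
To give a rough idea of the two-cell design described in the aims above, here is a minimal sketch. It is not the code from the notebook. The function names `train` and `classify`, the model dictionary layout, the regex tokenizer, and the Laplace smoothing parameter `alpha` are all assumptions made for illustration. The input is assumed to be a list of (label, text) pairs with exactly two distinct labels.

```python
import math
import re
from collections import Counter


def train(rows, alpha=1.0):
    """Hypothetical trainer: rows is an iterable of (label, text) pairs."""
    word_counts = {}        # label -> Counter of word frequencies
    doc_counts = Counter()  # label -> number of training messages
    for label, text in rows:
        words = re.findall(r"\w+", text.lower())
        word_counts.setdefault(label, Counter()).update(words)
        doc_counts[label] += 1
    if len(doc_counts) != 2:
        raise ValueError("expected exactly two distinct labels")

    vocab = set().union(*word_counts.values())
    total_docs = sum(doc_counts.values())
    model = {"log_prior": {}, "log_likelihood": {}}
    for label, counts in word_counts.items():
        # Laplace smoothing: add alpha to every count so unseen words
        # never get probability zero.
        denom = sum(counts.values()) + alpha * len(vocab)
        model["log_prior"][label] = math.log(doc_counts[label] / total_docs)
        model["log_likelihood"][label] = {
            w: math.log((counts[w] + alpha) / denom) for w in vocab
        }
    return model


def classify(model, text):
    """Hypothetical classifier: returns the label with the higher log score."""
    words = re.findall(r"\w+", text.lower())
    scores = {}
    for label, log_prior in model["log_prior"].items():
        likelihood = model["log_likelihood"][label]
        # Words outside the training vocabulary are ignored.
        scores[label] = log_prior + sum(
            likelihood[w] for w in words if w in likelihood
        )
    return max(scores, key=scores.get)


# Toy data in the minimal [Label, Information] format (made up for this sketch).
rows = [
    ("spam", "win money now"),
    ("spam", "free money prize"),
    ("ham", "see you at lunch"),
    ("ham", "are you free for lunch"),
]
model = train(rows)
print(classify(model, "win free money"))
print(classify(model, "see you at lunch"))
```

Because the labels are read from the data rather than hard-coded, the same two functions should work for any two-class dataset in that format. Summing log probabilities instead of multiplying raw probabilities is a design choice here to avoid underflow on long messages.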

Feedback on methodology and application will be appreciated.

https://app.dataquest.io/m/433/guided-project%3A-building-a-spam-filter-with-naive-bayes/10/next-steps

Project- Naive Bayes Trainer and Binary Classifier.ipynb (137.2 KB)

