Building real-time phishing attack detectors using different machine learning models
In the next sections, we will learn how to build machine learning phishing detectors. We will cover the following two methods:
- Phishing detection with logistic regression
- Phishing detection with decision trees
Phishing detection with logistic regression
In this section, we are going to build a phishing detector from scratch with a logistic regression algorithm. Logistic regression is a well-known statistical technique used to make binary predictions, that is, to assign each sample to one of two classes (here, phishing or legitimate).
As in every machine learning project, we will need data to feed our machine learning model. For our model, we are going to use the Phishing Websites Data Set from the UCI Machine Learning Repository. You can check it out at https://archive.ics.uci.edu/ml/datasets/Phishing+Websites:
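To make the idea of a two-class prediction concrete, the following is a minimal sketch of how a logistic regression model turns a feature vector into a class label. The weights, bias, and feature values are hypothetical and chosen only for illustration; they are not learned from the dataset used in this chapter:

```python
import math


def sigmoid(z):
    """Map any real number to the open interval (0, 1), avoiding overflow."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def predict_proba(weights, bias, features):
    """Return the estimated probability that the sample belongs to class 1."""
    z = bias + sum(w * x for w, x in zip(weights, features))
    return sigmoid(z)


def predict(weights, bias, features, threshold=0.5):
    """Return 1 if the probability reaches the threshold, otherwise 0."""
    return 1 if predict_proba(weights, bias, features) >= threshold else 0


# Hypothetical parameters for a model with two features (illustration only).
weights = [1.5, -2.0]
bias = 0.25

print(predict_proba(weights, bias, [1, 1]))
print(predict(weights, bias, [1, -1]))
```

Training a real model means finding the weights and bias that best fit the labeled samples; the sketch only shows the prediction step once those values are known.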

The dataset is provided as an arff file:

The following is a snapshot from the dataset:

For easier manipulation, we have organized the dataset into a csv file:
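One possible way to perform this kind of arff-to-csv conversion is sketched below, using only the Python standard library. The function name arff_to_csv, the attribute names, and the sample rows are assumptions made for illustration; they are not taken from the real dataset, and the sketch only handles simple dense arff files with unquoted attribute names:

```python
import csv
import io


def arff_to_csv(arff_text):
    """Convert simple dense ARFF text into CSV text with a header row."""
    names = []
    rows = []
    in_data = False
    for raw in arff_text.splitlines():
        line = raw.strip()
        # Skip blank lines and ARFF comments, which start with '%'.
        if not line or line.startswith("%"):
            continue
        lower = line.lower()
        if in_data:
            rows.append([value.strip() for value in line.split(",")])
        elif lower.startswith("@attribute"):
            # The attribute name is the second whitespace-separated token.
            names.append(line.split()[1])
        elif lower.startswith("@data"):
            in_data = True
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(names)
    writer.writerows(rows)
    return out.getvalue()


# Toy ARFF content (hypothetical attributes and values, illustration only).
SAMPLE = """\
@relation toy_phishing
@attribute has_ip { -1,1 }
@attribute url_length { 1,0,-1 }
@attribute result { -1,1 }
@data
-1,1,-1
1,0,1
"""

print(arff_to_csv(SAMPLE))
```

For real files, a dedicated reader such as scipy.io.arff.loadarff is likely a more robust choice than a hand-written parser, since it also handles cases this sketch ignores.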

As you probably...