
Detection and categorization of malicious URLs

Identify and classify malicious URLs.

URL dataset (ISCX-URL2016)

Challenge

The individual features are not documented.

Correlations are difficult to determine because of the large number of features.

The dataset contains Null, NaN and Infinity values.
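
One common way to handle the Null, NaN and Infinity values is to treat infinities as missing and drop the incomplete rows. The sketch below is only an illustration of that idea, not the project's actual preprocessing: the helper `clean_features` and the column names `url_length`, `digit_ratio` and `label` are hypothetical, and the toy frame stands in for the real ISCX-URL2016 data.

```python
import numpy as np
import pandas as pd


def clean_features(df: pd.DataFrame) -> pd.DataFrame:
    """Return a copy with +/-Infinity treated as missing and incomplete rows dropped."""
    # Map both infinities to NaN so a single dropna() handles every bad value.
    cleaned = df.replace([np.inf, -np.inf], np.nan)
    return cleaned.dropna().reset_index(drop=True)


# Toy frame with hypothetical column names (not the real dataset).
toy = pd.DataFrame(
    {
        "url_length": [54.0, np.nan, 120.0, 33.0],
        "digit_ratio": [0.10, 0.25, np.inf, 0.05],
        "label": ["benign", "phishing", "malware", "benign"],
    }
)

cleaned = clean_features(toy)
print(cleaned)
```

Dropping rows is the simplest choice; imputing a column median instead may be preferable when too many rows would otherwise be lost.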

Unsupervised model: Isolation Forest for anomaly detection.
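
As a rough illustration of how an Isolation Forest flags anomalies, the sketch below fits scikit-learn's `IsolationForest` on synthetic data. The data, the `contamination` value and the other hyperparameters are assumptions chosen for the example, not the settings used in this project.

```python
import numpy as np
from sklearn.ensemble import IsolationForest

# Synthetic stand-in data: 200 "normal" points plus 10 far-away points.
rng = np.random.default_rng(0)
normal = rng.normal(loc=0.0, scale=1.0, size=(200, 4))
outliers = rng.normal(loc=8.0, scale=1.0, size=(10, 4))
X = np.vstack([normal, outliers])

# contamination is an assumed fraction of anomalies, not a value from the project.
forest = IsolationForest(n_estimators=100, contamination=0.05, random_state=0)
pred = forest.fit_predict(X)           # +1 = inlier, -1 = anomaly
scores = forest.decision_function(X)   # lower score = more anomalous

print("flagged as anomalous:", int((pred == -1).sum()))
```

No labels are used during fitting; the model isolates points with random splits, and points that need fewer splits to isolate receive lower scores.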

Supervised models: Random Forest, Decision Tree, Logistic Regression, AdaBoost, and Naive Bayes.
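
The five supervised models can be compared with a loop like the one sketched below, using scikit-learn. This is a minimal example under stated assumptions: the data comes from `make_classification` rather than the ISCX-URL2016 dataset, the hyperparameters are near defaults, and `GaussianNB` is assumed as the Naive Bayes variant since the page does not name one.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier, RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.tree import DecisionTreeClassifier

# Synthetic multi-class data standing in for the URL feature table.
X, y = make_classification(
    n_samples=600, n_features=20, n_informative=8, n_classes=3, random_state=0
)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Hyperparameters are illustrative defaults, not the project's settings.
models = {
    "Random Forest": RandomForestClassifier(n_estimators=100, random_state=0),
    "Decision Tree": DecisionTreeClassifier(random_state=0),
    "Logistic Regression": LogisticRegression(max_iter=1000),
    "AdaBoost": AdaBoostClassifier(random_state=0),
    "Naive Bayes": GaussianNB(),  # assumed variant
}

results = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    results[name] = accuracy_score(y_test, model.predict(X_test))

for name, acc in results.items():
    print(f"{name}: accuracy = {acc:.3f}")
```

Accuracy alone can mislead when classes are imbalanced, so per-class precision and recall are worth reporting alongside it.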


See the results for each model: