Countering malicious URLs in Internet-of-Thing (IoT) using a knowledge-based approach and simulated expert

Anwar, S., Al-Obeidat, Feras, Tubaishat, A., Din, S., Ahmad, A., Khan, A.F., Jeo, G. and Loo, Jonathan ORCID: (2019) Countering malicious URLs in Internet-of-Thing (IoT) using a knowledge-based approach and simulated expert. IEEE Internet of Things Journal.

[thumbnail of Loo_etal_JIOT_IEEE_2019_Countering_malicious_URLs_in_Internet-of-Thing_(IoT)_using_a_knowledge-based_approach_and_simulated_expert.pdf] PDF
Loo_etal_JIOT_IEEE_2019_Countering_malicious_URLs_in_Internet-of-Thing_(IoT)_using_a_knowledge-based_approach_and_simulated_expert.pdf - Submitted Version
Restricted to Repository staff only

Download (706kB)


This study proposes a novel methodology to detect malicious URLs using simulated expert (SE) and knowledge base system (KBS). The proposed study not only efficiently detects known malicious URLs, but also adapt countermeasure against the newly generated malicious URLs. Moreover, this study also explored which lexical features are more contributing in final decision using a factor analysis method and thus helped in avoiding involvement of human expert. Further, we applied the following state-of-the-art ML algorithms, i.e., Naïve Bayes (NB), Decision Tree (DT), Gradient Boosted Trees (GBT), Generalized Linear Model (GLM), Logistic Regression (LR), Deep Learning (DL), and Random rest (RF), and evaluated the performance of these algorithms on a large-scale real data set of data-driven Web application. The experimental results clearly demonstrated the efficiency of NB in the proposed model as NB outperformed when compared to the rest of aforementioned algorithms in term of average minimum execution time (i.e., 3 seconds) and was able to accurately classify the 107586 URLs with 0.2% error rate and 99.8% accuracy rate.

Item Type: Article
Identifier: 10.1109/JIOT.2019.2954919
Additional Information: © 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Keywords: Malicious URLs, URLs Classification, Naïve Bayes, Simulated Experts, Feature Extraction.
Subjects: Computing > Information security > Cyber security
Related URLs:
Depositing User: Jonathan Loo
Date Deposited: 24 Nov 2019 19:39
Last Modified: 28 Aug 2021 07:12

Actions (login required)

View Item View Item