Is this URL Safe: Detection of Malicious URLs Using Global Vector for Word Representation

Bhatia, Ashutosh; Tiwari, Kamlesh

Is this URL Safe: Detection of Malicious URLs Using Global Vector for Word Representation

dc.contributor.author	Bhatia, Ashutosh
dc.contributor.author	Tiwari, Kamlesh
dc.date.accessioned	2024-10-14T10:34:35Z
dc.date.available	2024-10-14T10:34:35Z
dc.date.issued	2022
dc.description.abstract	Users are frequently exposed to many unknown links through advertisements and emails. These links may contain URLs to mount targeted attacks like spamming, phishing, and malware installation. Using blacklist of URLs is the most widely used defense mechanism to detect a malicious URLs. However, automatically generating such a list for fresh malicious URLs is challenging. Detecting a URL as malicious using the lexicographical approach is an important research problem. This paper proposes a malicious URL detection mechanism using natural language processing. We use features including word vector representation obtained through GloVe along with statistical cues and n-gram on blacklist words. The proposed approach is efficient, and it does not require inputs from external servers to identify malicious URLs. Experiments are performed on 227,909 size database containing 80,128 benign and 147,781 malicious URLs. Proposed system has achieved an accuracy of 89% for ANN model with GloVe based features.	en_US
dc.identifier.uri	https://ieeexplore.ieee.org/abstract/document/9687204
dc.identifier.uri	https://dspace.bits-pilani.ac.in/handle/123456789/16082
dc.language.iso	en	en_US
dc.publisher	IEEE	en_US
dc.subject	Computer Science	en_US
dc.subject	Machine learning (ML)	en_US
dc.subject	URL Classification	en_US
dc.subject	GloVe embedding model	en_US
dc.title	Is this URL Safe: Detection of Malicious URLs Using Global Vector for Word Representation	en_US
dc.type	Article	en_US

Files

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Department of Computer Science and Information Systems