DSpace logo

Please use this identifier to cite or link to this item: http://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/16082
Full metadata record
DC FieldValueLanguage
dc.contributor.authorBhatia, Ashutosh-
dc.contributor.authorTiwari, Kamlesh-
dc.date.accessioned2024-10-14T10:34:35Z-
dc.date.available2024-10-14T10:34:35Z-
dc.date.issued2022-
dc.identifier.urihttps://ieeexplore.ieee.org/abstract/document/9687204-
dc.identifier.urihttp://dspace.bits-pilani.ac.in:8080/jspui/handle/123456789/16082-
dc.description.abstractUsers are frequently exposed to many unknown links through advertisements and emails. These links may contain URLs to mount targeted attacks like spamming, phishing, and malware installation. Using blacklist of URLs is the most widely used defense mechanism to detect a malicious URLs. However, automatically generating such a list for fresh malicious URLs is challenging. Detecting a URL as malicious using the lexicographical approach is an important research problem. This paper proposes a malicious URL detection mechanism using natural language processing. We use features including word vector representation obtained through GloVe along with statistical cues and n-gram on blacklist words. The proposed approach is efficient, and it does not require inputs from external servers to identify malicious URLs. Experiments are performed on 227,909 size database containing 80,128 benign and 147,781 malicious URLs. Proposed system has achieved an accuracy of 89% for ANN model with GloVe based features.en_US
dc.language.isoenen_US
dc.publisherIEEEen_US
dc.subjectComputer Scienceen_US
dc.subjectMachine learning (ML)en_US
dc.subjectURL Classificationen_US
dc.subjectGloVe embedding modelen_US
dc.titleIs this URL Safe: Detection of Malicious URLs Using Global Vector for Word Representationen_US
dc.typeArticleen_US
Appears in Collections:Department of Computer Science and Information Systems

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.