Skip to main content
Exit comparison
Removed
Added

Datasets Comparison

Version 1

PhiUSIIL Phishing URL Dataset

Published:29 August 2023|Version 1|DOI:10.17632/shwpxscxy2.1
Contributors:Arvind Prasad, Shalini Chandra

Description

PhiUSIIL Phishing URL Dataset is a substantial dataset comprising 134,850 legitimate and 100,945 phishing URLs. Most of the URLs we analyzed while constructing the dataset are latest URLs, mostly from 2023.

Categories

Computer Science, Cybersecurity, Machine Learning, Cyber Attack

Licence

Creative Commons Attribution 4.0 International

Version 2

PhiUSIIL Phishing URL Dataset

Published:15 November 2023|Version 2|DOI:10.17632/shwpxscxy2.2
Contributors:Arvind Prasad, Shalini Chandra

Description

PhiUSIIL Phishing URL Dataset is a substantial dataset comprising 134,850 legitimate and 100,945 phishing URLs. Most of the URLs we analyzed while constructing the dataset are the latest URLs. Features are extracted from the source code of the webpage and URL. Features such as CharContinuationRate, URLTitleMatchScore, URLCharProb, and TLDLegitimateProb are derived from existing features. Citation: Prasad, A., & Chandra, S. (2023). PhiUSIIL: A diverse security profile empowered phishing URL detection framework based on similarity index and incremental learning. Computers & Security, 103545. doi: https://doi.org/10.1016/j.cose.2023.103545

Steps to reproduce

Refer to https://doi.org/10.1016/j.cose.2023.103545

Categories

Computer Science, Cybersecurity, Machine Learning, Cyber Attack

Related Links

Licence

Creative Commons Attribution 4.0 International