StealthPhisher
Description
The StealthPhisher dataset is a large, diverse, and up-to-date resource tailored to address the evolving nature of phishing attacks. It contains over 336,749 records, comprising 160,943 legitimate URLs and 175,806 phishing URLs, sourced from platforms like PhishTank, spam email repositories, and user submissions. This dataset reflects recent phishing tactics, making it invaluable for training AI models to detect modern threats. Key features include URL-based attributes (length, TLD type, IP presence), statistical metrics (Shannon Entropy, Kolmogorov Complexity, Fractal Dimension), and HTML/interaction-based data (popups, redirects, forms). These features provide comprehensive insights into phishing behaviors, enabling precise detection. Designed to capture real-world scenarios, the dataset equips AI models with the ability to identify both traditional phishing strategies and advanced, evolving attacks. Its scale and focus on recent trends make it an essential tool for advancing AI-driven cybersecurity solutions.