Detection of Phished URLs Using Machine Learning


Year : 2024 | Volume : 11 | Issue : 03 | Page : 1-7
    By

    Shilpa Mundodagi,

  • Nyamatulla Patel,

  • Ziaullah Choudhari,

  1. Student, Department of Master of Computer Application, SECAB Institute of Engineering and Technology, Vijayapur, Karnataka, India
  2. Associate Professor, Department of Master of Computer Application, SECAB Institute of Engineering and Technology, Vijayapur, Karnataka, India
  3. Associate Professor, Department of Electronics and Communication Engineering, SECAB Institute of Engineering and Technology, Vijayapur, Karnataka, India

Abstract

Phishing attacks remain a significant cybersecurity challenge, requiring innovative detection strategies. This study investigates the use of machine learning to detect phishing URLs, to improve the accuracy and reliability of detection systems. Utilizing a diverse dataset of legitimate and phishing URLs we extracted the features such as lexical properties, domain-specific details, and HTML content to train various machine learning models. Algorithms including Random Forest, support vector machine (SVM), and gradient boosting were evaluated for their effectiveness. The results demonstrate that these models can reliably identify phishing URLs with high accuracy and low false-positive rates. This work underscores the potential of machine learning to improve phishing detection mechanisms, contributing to stronger cybersecurity measures. Future work will involve refining these models, investigating deep learning approaches, and integrating real-time detection capabilities.

Keywords: Phishing detection, machine learning, URL analysis, classification algorithms, cybersecurity

[This article belongs to Journal of Web Engineering & Technology (jowet)]

How to cite this article:
Shilpa Mundodagi, Nyamatulla Patel, Ziaullah Choudhari. Detection of Phished URLs Using Machine Learning. Journal of Web Engineering & Technology. 2024; 11(03):1-7.
How to cite this URL:
Shilpa Mundodagi, Nyamatulla Patel, Ziaullah Choudhari. Detection of Phished URLs Using Machine Learning. Journal of Web Engineering & Technology. 2024; 11(03):1-7. Available from: https://journals.stmjournals.com/jowet/article=2024/view=180547


References

  1. Sheng S, Wardman B, Warner G, Cranor LF, Hong J, Zhang C. Empirical analysis of phishing blacklists. In: 6th International Conference on Email and Anti Spam; 2009 Jul 6; Mountain View, California. p. 16–17.
  2. Khonji M, Iraqi Y, Jones A. Phishing detection: A literature survey. IEEE Commun Surv Tutor. 2013;15:2091–2121. doi:10.1109/SURV.2013.032213.00009.
  3. Acquisti A, Adjerid I, Balebako R, Brandimarte L, Cranor LF, Komanduri S, Leon PG, Sadeh N, Schaub F, Sleeper M, Wang Y, Wilson S. Nudges for privacy and security: understanding and assisting users’ choices online. ACM Comput Surv. 2018;50:1–41. doi:10.1145/3054926.
  4. Junger M, Montoya L, Overink FJ. Priming and warnings are not effective to prevent social engineering attacks. Comput Hum Behav. 2017;66:75–87. doi:10.1016/j.chb.2016.09.012.
  5. El-Alfy ESM. Detection of phishing websites based on probabilistic neural networks and K-medoids clustering. Comput J. 2017;60:1745–1759. doi:10.1093/comjnl/bxx035.
  6. Moreno-Fernández MM, Blanco F, Garaizar P, Matute H. Fishing for phishers: improving internet users’ sensitivity to visual deception cues to prevent electronic fraud. Comput Hum Behav. 2017;69:421–436. doi:10.1016/j.chb.2016.12.044.
  7. Kamalam GK, Suresh P, Nivash R, Ramya A, Raviprasath G. Detection of phishing websites using machine learning. In: 2022 International Conference on Computer Communication and Informatics (ICCCI); 2022. p. 1–4. doi:10.1109/ICCCI54379.2022.9740763.
  8. Beknazarova SS, Latipova NM, Maxmudova MJ, Alekseeva VS, Turakulova AS. Machine learning algorithms are used to detect and track objects on video images. In: Third International Conference on Optics, Computer Applications, and Materials Science (CMSD-III 2023); 2024. Vol. 13065. SPIE. doi:10.1117/12.3025015.
  9. Deshpande A, Pedamkar O, Chaudhary N, Borde S. Detection of phishing websites using machine learning. Int J Eng Res Technol (IJERT). 2021;10.
  10. Wu L, Du X, Wu J. Effective defense schemes for phishing attacks on mobile computing platforms. IEEE Trans Veh Technol. 2015;65:6678–6691. doi:10.1109/TVT.2015.2472993.
  11. Gowtham R, Krishnamurthi I. A comprehensive and efficacious architecture for detecting phishing webpages. Comput Secur. 2014;40:23–37. doi:10.1016/j.cose.2013.10.004.
  12. Xiang G, Hong J, Rose CP, Cranor L. CANTINA+: A feature-rich machine learning framework for detecting phishing web sites. ACM Trans Inf Syst Secur. 2011;14:1–28. doi:10.1145/2019599.2019606.
  13. Zhu E, Chen Y, Ye C, Li X, Liu F. OFS-NN: An effective phishing websites detection model based on optimal feature selection and neural network. IEEE Access. 2019;7:73271–73284. doi:10.1109/ACCESS.2019.2920655.
  14. Dou Z, Khalil I, Khreishah A, Al-Fuqaha A, Guizani M. Systematization of knowledge (SoK): A systematic review of software-based web phishing detection. IEEE Commun Surv Tutor. 2017;19:2797–2819. doi:10.1109/COMST.2017.2752087.
  15. Cova M, Kruegel C, Vigna G. Detection and analysis of drive-by-download attacks and malicious JavaScript code. In: Proceedings of the 19th International Conference on World Wide Web; 2010 Apr 26. p. 281–290. doi:10.1145/1772690.1772720.
  16. Tan CL, Chiew KL, Sze SN. Phishing website detection using URL-assisted brand name weighting system. In: Proceedings of the 2014 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS); 2014. p. 54–59. doi:10.1109/ISPACS.2014.7024424.
  17. Basnet RB, Sung AH. Mining web to detect phishing URLs. In: 11th International Conference on Machine Learning and Applications; 2012 Dec 12; Vol. 1. IEEE Publications. p. 568–573. doi:10.1109/ICMLA.2012.104.
  18. Kurniabudi K, Purnama B, Sharipuddin S, Darmawijoyo D, Stiawan D, Samsuryadi S, Heryanto A, Budiarto R. Network anomaly detection research: A survey. Indones J Electr Eng Inform. 2019;7:37–50. doi:10.52549/ijeei.v7i1.773.
  19. Nguyen LA, To BL, Nguyen HK, Nguyen MH. A novel approach for phishing detection using URL-based heuristic. In: 2014 International Conference on Computing, Management and Telecommunications (ComManTel); 2014. p. 298–303.

Regular Issue Subscription Review Article
Volume 11
Issue 03
Received 12/09/2024
Accepted 23/09/2024
Published 29/10/2024


Loading citations…