Detection of Phished URL’s Using Machine Learning

Notice

This is an unedited manuscript accepted for publication and provided as an Article in Press for early access at the author’s request. The article will undergo copyediting, typesetting, and galley proof review before final publication. Please be aware that errors may be identified during production that could affect the content. All legal disclaimers of the journal apply.

Year : 2024 | Volume :11 | Issue : 03 | Page : –
By

Shilpa Mundodagi,

Nyamatulla Patel,

Ziaullah Choudhari,

  1. Student, Department of Master of Computer Application, Secab Institute of Engineering and Technology, Vijayapur, Karnataka, India
  2. Associate Professor, Department of Master of Computer Application, Secab Institute of Engineering and Technology, Vijayapur, Karnataka, India
  3. Associate Professor, Department of Electronics and Communication Engineering, Secab Institute of Engineering and Technology, Vijayapur, Karnataka, India

Abstract

Phishing attacks remain a significant cybersecurity challenge, requiring innovative detection strategies. This study investigates the use of machine learning to detect phishing URLs, with the goal of improving the accuracy and reliability of detection systems. Utilizing a diverse dataset of legitimate and phishing URLs we extracted the features such as lexical properties, domain-specific details, and HTML content to train various machine learning models. Algorithms including Random Forest, Support Vector Machine (SVM), and Gradient Boosting were evaluated for their effectiveness. The results demonstrate that these models can reliably identify phishing URLs with high accuracy and low false positive rates. This research underscores the potential of machine learning to improve phishing detection mechanisms, contributing to stronger cybersecurity measures. Future work will involve refining these models, investigating deep learning approaches, and integrating real-time detection capabilities.

Keywords: Phishing Detection, Machine Learning, URL Analysis, Classification Algorithms, Cybersecurity

[This article belongs to Journal of Web Engineering & Technology (jowet)]

How to cite this article:
Shilpa Mundodagi, Nyamatulla Patel, Ziaullah Choudhari. Detection of Phished URL’s Using Machine Learning. Journal of Web Engineering & Technology. 2024; 11(03):-.
How to cite this URL:
Shilpa Mundodagi, Nyamatulla Patel, Ziaullah Choudhari. Detection of Phished URL’s Using Machine Learning. Journal of Web Engineering & Technology. 2024; 11(03):-. Available from: https://journals.stmjournals.com/jowet/article=2024/view=180547

References

  1. Sheng S, Wardman B, Warner G, Cranor LF, Hong J, Zhang C. EmpericalAnalysisOf Phishing BlackLists” at 6th International Conference on Email and Anti Spam CEAS. Mounatin View California July. 2009:16-7.
  2. Khonji M, Iraqi Y, Jones A. Phishing detection: a literature survey. IEEE Communications Surveys & Tutorials. 2013 Apr 15;15(4):2091-121.
  3. Acquisti A, Adjerid I, Balebako R, Brandimarte L, Cranor LF, Komanduri S, Leon PG, Sadeh N, Schaub F, Sleeper M, Wang Y. Nudges for privacy and security: Understanding and assisting users’ choices online. ACM Computing Surveys (CSUR). 2017 Aug 8;50(3):1-41.
  4. Junger M, Montoya L, Overink FJ. Priming and warnings are not effective to prevent social engineering attacks. Computers in human behavior. 2017 Jan 1;66:75-87.
  5. El-Alfy ES. Detection of phishing websites based on probabilistic neural networks and K-medoids clustering. The Computer Journal. 2017 Dec 1;60(12):1745-59.
  6. Moreno-Fernández MM, Blanco F, Garaizar P, Matute H. Fishing for phishers. Improving Internet users’ sensitivity to visual deception cues to prevent electronic fraud. Computers in Human Behavior. 2017 Apr 1;69:421-36.
  7. Kamalam GK, Suresh P, Nivash R, Ramya A, Raviprasath G. Detection of phishing websites using machine learning. In2022 International Conference on Computer Communication and Informatics (ICCCI) 2022 Jan 25 (pp. 1-4). IEEE.
  8. Beknazarova SS, Latipova NM, Maxmudova MJ, Alekseeva VS, Turakulova AS. Machine learning algorithms are used to detect and track objects on video images. InThird International Conference on Optics, Computer Applications, and Materials Science (CMSD-III 2023) 2024 Feb 20 (Vol. 13065, pp. 118-127). SPIE.
  9. Deshpande A, Pedamkar O, Chaudhary N, Borde S. Detection of phishing websites using Machine Learning. International Journal of Engineering Research & Technology (IJERT). 2021 May;10(05).
  10. Wu L, Du X, Wu J. Effective defense schemes for phishing attacks on mobile computing platforms. IEEE Transactions on Vehicular Technology. 2015 Aug 25;65(8):6678-91.
  11. Gowtham R, Krishnamurthi I. A comprehensive and efficacious architecture for detecting phishing webpages. Computers & Security. 2014 Feb 1;40:23-37.
  12. Xiang G, Hong J, Rose CP, Cranor L. Cantina+ a feature-rich machine learning framework for detecting phishing web sites. ACM Transactions on Information and System Security (TISSEC). 2011 Sep 1;14(2):1-28.
  13. Zhu E, Chen Y, Ye C, Li X, Liu F. OFS-NN: an effective phishing websites detection model based on optimal feature selection and neural network. Ieee Access. 2019 Jun 4;7:73271-84.
  14. Dou Z, Khalil I, Khreishah A, Al-Fuqaha A, Guizani M. Systematization of knowledge (sok): A systematic review of software-based web phishing detection. IEEE Communications Surveys & Tutorials. 2017 Sep 13;19(4):2797-819.
  15. Cova M, Kruegel C, Vigna G. Detection and analysis of drive-by-download attacks and malicious JavaScript code. InProceedings of the 19th international conference on World wide web 2010 Apr 26 (pp. 281-290).
  16. Tan CL, Chiew KL. Phishing website detection using URL-assisted brand name weighting system. In2014 International Symposium on Intelligent Signal Processing and Communication Systems (ISPACS) 2014 Dec 1 (pp. 054-059). IEEE.
  17. Basnet RB, Sung AH. Mining web to detect phishing URLs. In2012 11th International Conference on Machine Learning and Applications 2012 Dec 12 (Vol. 1, pp. 568-573). IEEE.
  18. Kurniabudi K, Purnama B, Sharipuddin S, Darmawijoyo D, Stiawan D, Samsuryadi S, Heryanto A, Budiarto R. Network anomaly detection research: a survey. Indonesian Journal of Electrical Engineering and Informatics (IJEEI). 2019 Mar 25;7(1):37-50.
  19. Nguyen LA, To BL, Nguyen HK, Nguyen MH. A novel approach for phishing detection using URL-based heuristic. In2014 international conference on computing, management and telecommunications (ComManTel) 2014 Apr 27 (pp. 298-303). IEEE.

Regular Issue Subscription Review Article
Volume 11
Issue 03
Received 12/09/2024
Accepted 23/09/2024
Published 29/10/2024