INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND MATHEMATICAL THEORY (IJCSMT )

E-ISSN 2545-5699
P-ISSN 2695-1924
VOL. 11 NO. 1 2025
DOI: 10.56201/ijcsmt.v11.no1.2025.pg84.96


Development of a Predictive Model for Phishing and Ransomware Detection in Financial Institution Using a Random Forest Classifier and Outlier Technique

K.C. PAUL & V. T. EMMAH


Abstract


Phishing and ransomware attacks are significant cyber threats that target financial institutions, aiming to deceive users and exploit vulnerabilities for malicious gains. Phishing attacks often involve fraudulent emails or websites that trick users into revealing sensitive information, while ransomware encrypts a victim's data and demands payment for its release. Both types of attacks pose severe risks to financial institutions, potentially leading to data breaches, financial losses, and reputational damage. To combat these threats, this paper proposed an advanced detection system using machine learning techniques. The proposed system focused on feature engineering and training a Random Forest classifier to detect phishing and ransomware attacks based on key attributes like URL structure and file characteristics. For phishing detection, features such as URL length, subdomains, and the presence of secure HTTPS protocols were extracted, while for ransomware detection, file name length, the presence of executable extensions, and suspicious


keywords:

Phishing, ransomware, attacks, cyber, institutions, detection, techniques


References:


Abdelkader, W., Navarro, T., Parrish, R., Cotoi, C., Germini, F., Iorio, A., … & Lokker, C. (2021).
Machine learning approaches to retrieve high-quality, clinically relevant evidence from the
biomedical literature: systematic review. Jmir Medical Informatics, 9(9), e30401.
https://doi.org/10.2196/30401
Angelopoulos, A., Michailidis, E., ???????, ?., Trakadas, P., Hatziefremidis, A., Voliotis, S., … &
Zahariadis, T. (2019). Tackling faults in the industry 4.0 era—a survey of machine-learning
solutions and key aspects. Sensors, 20(1), 109. https://doi.org/10.3390/s20010109
Asiimwe, I., Zhang, E., Osanlou, R., Jorgensen, A., & Pirmohamed, M. (2020). Warfarin dosing
algorithms: a systematic review. British Journal of Clinical Pharmacology, 87(4), 1717-1729.
https://doi.org/10.1111/bcp.14608.
Vox Sanguinis, 113(8), 737-749. https://doi.org/10.1111/vox.12708
Brasil, S., Pascoal, C., Francisco, R., Ferreira, V., Videira, P., & Valadão, G. (2019). Artificial
intelligence (ai) in rare diseases: is the future brighter? Genes, 10(12), 978. s
Brown, G., Ross, S., & Kirchhübel, C. (2021). Voicing concerns: the balance between data protection
principles and research developments in forensic speech science. Science & Justice, 61(4), 311-
https://doi.org/10.1016/j.scijus.2021.05.006.
Chen, C., Hasan, M., & Mohan, S. (2018). Securing real-time internet-of-things. Sensors, 18(12),
https://doi.org/10.3390/s18124356.
Gai, K., Qiu, M., & Elnagdy, S. A. (2016, April). Security-aware information classifications using
supervised learning for cloud-based cyber risk management in financial big data. In 2016 IEEE
2nd International Conference on Big Data Security on Cloud (BigDataSecurity), IEEE
International Conference on High Performance and Smart Computing (HPSC), and IEEE
International Conference on Intelligent Data and Security (IDS) (pp. 197-202). IEEE.
Gao, B. (2021). The use of machine learning combined with data mining technology in financial risk
prevention. Computational Economics, 1-21.
Islam, U., Muhammad, A., Mansoor, R., Hossain, M. S., Ahmad, I., Eldin, E. T., ... & Shafiq, M.
(2022). Detection of distributed denial of service (DDoS) attacks in IOT based monitoring
system of banking sector using machine learning models. Sustainability, 14(14), 8374.
Karimi, D., Dou, H., Warfield, S., & Gholipour, A. (2020). Deep learning with noisy labels: exploring
techniques and remedies in medical image analysis. Medical Image Analysis, 65, 101759.
https://doi.org/10.1016/j.media.2020.101759
King, Z., Henshel, D., Flora, L., Cains, M., Hoffman, B., & Sample, C. (2018). Characterizing and
measuring maliciousness for cybersecurity risk assessment. Frontiers in Psychology, 9.
https://doi.org/10.3389/fpsyg.2018.00039
Mugarza, I., Flores, J., & Montero, J. (2020). Security issues and software update management in the
industrial internet of things (iiot) era. Sensors, 20(24), 7160. https://doi.org/10.3390/s20247160
Nifakos, S., Chandramouli, K., Nikolaou, C., Papachristou, P., Koch, S., Panaousis, E., … & Bonacina,
S. (2021). Influence of human factors on cyber security within healthcare organizations: a
systematic review. Sensors, 21(15), 5119. https://doi.org/10.3390/s21155119



Noor, U., Anwar, Z., Amjad, T., & Choo, K. K. R. (2019). A machine learning-based FinTech cyber
threat attribution framework using high-level indicators of compromise. Future Generation
Computer Systems, 96, 227-242.
Pascarella, G., Rossi, M., Montella, E., Capasso, A., Feo, G., Botti, G., … & Morabito, A. (2021). Risk
analysis in healthcare organizations: methodological framework and critical variables. Risk
Management
and
Healthcare
Policy,
Volume
14,
2897-2911.
https://doi.org/10.2147/rmhp.s309098
Pemmasani, S., Raman, R., Mohapatra, R., Vidyasagar, M., & Acharya, A. (2020). A review on the
challenges in Indian genomics research for variant identification and interpretation. Frontiers in
Genetics, 11. https://doi.org/10.3389/fgene.2020.00753
Pesapane, F., Volonté, C., Codari, M., & Sardanelli, F. (2018). Artificial intelligence as a medical
device in radiology: ethical and regulatory issues in Europe and the United States. Insights Into
Imaging, 9(5), 745-753. https://doi.org/10.1007/s13244-018-0645-y
Petersilge, C. (2019). The enterprise imaging value proposition. Journal of Digital Imaging, 33(1), 37-
https://doi.org/10.1007/s10278-019-00293-1
Qayyum, A., Qadir, J., Bilal, M., & Al-Fuqaha, A. (2021). Secure and robust machine learning for
healthcare:
a
survey.
Ieee
Reviews
in
Biomedical
Engineering,
14,
156-180.
https://doi.org/10.1109/rbme.2020.3013489
Radanliev, P., Roure, D., Walton, R., Kleek, M., Montalvo, R., Santos, O., … & Cannady, S. (2020).
Covid-19 what have we learned? the rise of social machines and connected devices in pandemic
management following the concepts of predictive, preventive, and personalized medicine. The
Epma Journal, 11(3), 311-332. https://doi.org/10.1007/s13167-020-00218-x
Roy, N. C., & Prabhakaran, S. (2022). Sustainable response system building against insider-led cyber
frauds in banking sector: a machine learning approach. Journal of Financial Crime, 30(1), 48-
Sarker, I. (2021). Machine learning: algorithms, real-world applications, and research directions. Sn
Computer Science, 2(3). https://doi.org/10.1007/s42979-021-00592-x


DOWNLOAD PDF

Back


Google Scholar logo
Crossref logo
ResearchGate logo
Open Access logo
Google logo