Predictive Machine Learning Models for Financial Fraud Detection Leveraging Big Data Analysis
DOI:
https://doi.org/10.63282/3050-9246.IJETCSIT-V4I1P105Keywords:
Big Data Analytics, Financial Fraud, Fraud Detection, Machine Learning (ML), Predictive Analytics, Fraud Prevention, Transaction DataAbstract
Maintaining the integrity of financial systems and preventing people and organizations from suffering financial losses depends heavily on the ability to spot fraudulent financial transactions. Recognizing complicated patterns and developing fraud methods is a challenge for conventional rule-based fraud detection methods. Using the Credit Card Fraud Detection (CCFD) dataset, this research aims to compare and analyze different prediction models to accurately identify fraudulent transactions. The anonymized dataset was preprocessed thoroughly, i.e., missing values, outlier elimination, min-max normalization, and relevant feature selection were addressed. A modified Deep Neural Network (DNN) model, Multi-Layer Perceptron (MLP), Naive Bayes (NB), as well as Decision Tree (DT), were among the classification models that were trained and evaluated. The suggested DNN model performed better than the others, achieving 99.89% accuracy, a 99.87% F1-score, 99.99% recall, as well as 99% precision. These results demonstrate that deep learning, when combined with an efficient preprocessing pipeline, may greatly improve fraud detection in extremely unbalanced financial data, and they also show that the DNN model is capable of learning complicated, non-linear patterns in the data
Downloads
References
[1] S. Zhang et al., “HiDDen: Hierarchical Dense Subgraph Detection with Application to Financial Fraud Detection,” in Proceedings of the 2017 SIAM International Conference on Data Mining, Philadelphia, PA: Society for Industrial and Applied Mathematics, 2017, pp. 570–578. doi: 10.1137/1.9781611974973.64.
[2] N. B. Harikrishnan, R. Vinayakumar, K. P. Soman, and Prabaharan Poornachandran, “Time split based pre-processing with a data-driven approach for malicious URL detection,” in Advanced Sciences and Technologies for Security Applications, 2019. doi: 10.1007/978-3-030-16837-7_4.
[3] V. Kolluri, “A Thorough Examination of Fortifying Cyber Defenses: AI in Real Time Driving Cyber Defence Strategies Today,” Int. J. Emerg. Technol. Adv. Appl., 2018.
[4] D. Zhang and L. Zhou, “Discovering golden nuggets: Data mining in financial application,” IEEE Trans. Syst. Man Cybern. Part C Appl. Rev., 2004, doi: 10.1109/TSMCC.2004.829279.
[5] A. Singh and A. Jain, “An empirical study of AML approach for credit card fraud detection-financial transactions,” Int. J. Comput. Commun. Control, 2019, doi: 10.15837/ijccc.2019.6.3498.
[6] V. Kolluri, “An In-Depth Exploration of Unveiling Vulnerabilities: Exploring Risks in AI Models and Algorithms,” Int. J. Res. Anal. Rev., vol. 1, no. 3, pp. 910–913, 2014.
[7] S. Wang, “A comprehensive survey of data mining-based accounting-fraud detection research,” in 2010 International Conference on Intelligent Computation Technology and Automation, ICICTA 2010, 2010. doi: 10.1109/ICICTA.2010.831.
[8] J. West and M. Bhattacharya, “Intelligent financial fraud detection: A comprehensive review,” 2016. doi: 10.1016/j.cose.2015.09.005.
[9] S. Garg, “Predictive Analytics and Auto Remediation using Artificial Inteligence and Machine learning in Cloud Computing Operations,” Int. J. Innov. Res. Eng. Multidiscip. Phys. Sci., vol. 7, no. 2, 2019.
[10] C. Gardner, D. C.-T. Lo, J.-C. Chern, P. Paschos, and C. Ng, “Tiered Financial Fraud Detection Utilizing Precision Stratified Random Forest Assembly,” in 2019 IEEE 5th International Conference on Big Data Intelligence and Computing (DATACOM), 2019, pp. 254–257. doi: 10.1109/DataCom.2019.00047.
[11] O. Adepoju, J. Wosowei, S. Lawte, and H. Jaiman, “Comparative Evaluation of Credit Card Fraud Detection Using Machine Learning Techniques,” in 2019 Global Conference for Advancement in Technology, GCAT 2019, 2019. doi: 10.1109/GCAT47503.2019.8978372.
[12] A. M. Mubalaike and E. Adali, “Deep Learning Approach for Intelligent Financial Fraud Detection System,” in UBMK 2018 - 3rd International Conference on Computer Science and Engineering, 2018. doi: 10.1109/UBMK.2018.8566574.
[13] P. Shiguihara-Juarez and N. Murrugarra-Llerena, “A Bayesian Classifier Based on Constraints of Ordering of Variables for Fraud Detection,” in 2018 Congreso Internacional de Innovacion y Tendencias en Ingenieria, CONIITI 2018 - Proceedings, 2018. doi: 10.1109/CONIITI.2018.8587081.
[14] A. Chouiekh and E. H. I. EL Haj, “ConvNets for Fraud Detection analysis,” Procedia Comput. Sci., vol. 127, pp. 133–138, 2018, doi: 10.1016/j.procs.2018.01.107.
[15] J. O. Awoyemi, A. O. Adetunmbi, and S. A. Oluwadare, “Credit card fraud detection using machine learning techniques: A comparative analysis,” in Proceedings of the IEEE International Conference on Computing, Networking and Informatics, ICCNI 2017, 2017. doi: 10.1109/ICCNI.2017.8123782.
[16] C. C. Lin, A. A. Chiu, S. Y. Huang, and D. C. Yen, “Detecting the financial statement fraud: The analysis of the differences between data mining techniques and experts’ judgments,” Knowledge-Based Syst., 2015, doi: 10.1016/j.knosys.2015.08.011.
[17] S. Patil, V. Nemade, and P. K. Soni, “Predictive Modelling for Credit Card Fraud Detection Using Data Analytics,” in Procedia Computer Science, 2018. doi: 10.1016/j.procs.2018.05.199.
[18] A. Thompson, L. Aborisade, O. Oyinloye, and E. Odeniyi, “A Fraud Detection Framework using Machine Learning Approach,” Fourth Int. Conf. Cyber Technol. Cyber Syst. 2019., pp. 12–18, 2019.
[19] Polu, A. R., Buddula, D. V. K. R., Narra, B., Gupta, A., Vattikonda, N., & Patchipulusu, H. (2021). Evolution of AI in Software Development and Cybersecurity: Unifying Automation, Innovation, and Protection in the Digital Age. Available at SSRN 5266517.
[20] Chinta, P. C. R., Katnapally, N., Ja, K., Bodepudi, V., Babu, S., & Boppana, M. S. (2022). Exploring the role of neural networks in big data-driven ERP systems for proactive cybersecurity management. Kurdish Studies.
[21] Routhu, K., Bodepudi, V., Jha, K. M., & Chinta, P. C. R. (2020). A Deep Learning Architectures for Enhancing Cyber Security Protocols in Big Data Integrated ERP Systems. Available at SSRN 5102662.
[22] Chinta, P. C. R., & Katnapally, N. (2021). Neural Network-Based Risk Assessment for Cybersecurity in Big Data-Oriented ERP Infrastructures. Neural Network-Based Risk Assessment for Cybersecurity in Big Data-Oriented ERP Infrastructures.
[23] Katnapally, N., Chinta, P. C. R., Routhu, K. K., Velaga, V., Bodepudi, V., & Karaka, L. M. (2021). Leveraging Big Data Analytics and Machine Learning Techniques for Sentiment Analysis of Amazon Product Reviews in Business Insights. American Journal of Computing and Engineering, 4(2), 35-51.
[24] Kalla, D. (2022). AI-Powered Driver Behavior Analysis and Accident Prevention Systems for Advanced Driver Assistance. International Journal of Scientific Research and Modern Technology (IJSRMT) Volume, 1.
[25] Chinta, P. C. R. (2022). Enhancing Supply Chain Efficiency and Performance Through ERP Optimisation Strategies. Journal of Artificial Intelligence & Cloud Computing, 1(4), 10-47363.
[26] Kuraku, D. S., Kalla, D., & Samaah, F. (2022). Navigating the link between internet user attitudes and cybersecurity awareness in the era of phishing challenges. International Advanced Research Journal in Science, Engineering and Technology, 9(12).
[27] Sadaram, G., Sakuru, M., Karaka, L. M., Reddy, M. S., Bodepudi, V., Boppana, S. B., & Maka, S. R. (2022). Internet of Things (IoT) Cybersecurity Enhancement through Artificial Intelligence: A Study on Intrusion Detection Systems. Universal Library of Engineering Technology, (2022).
[28] Karaka, L. M. (2021). Optimising Product Enhancements Strategic Approaches to Managing Complexity. Available at SSRN 5147875.
[29] Polu, A. R., Vattikonda, N., Buddula, D. V. K. R., Narra, B., Patchipulusu, H., & Gupta, A. (2021). Integrating AI-Based Sentiment Analysis With Social Media Data For Enhanced Marketing Insights. Available at SSRN 5266555.
[30] Jha, K. M., Bodepudi, V., Boppana, S. B., Katnapally, N., Maka, S. R., & Sakuru, M. Deep Learning-Enabled Big Data Analytics for Cybersecurity Threat Detection in ERP Ecosystems.
[31] Kalla, D., Smith, N., Samaah, F., & Polimetla, K. (2022). Enhancing Early Diagnosis: Machine Learning Applications in Diabetes Prediction. Journal of Artificial Intelligence & Cloud Computing. SRC/JAICC-205. DOI: doi. org/10.47363/JAICC/2022 (1), 191, 2-7.
[32] Kalla, D., Kuraku, D. S., & Samaah, F. (2021). Enhancing cyber security by predicting malwares using supervised machine learning models. International Journal of Computing and Artificial Intelligence, 2(2), 55-62.
[33] Katari, A., & Kalla, D. (2021). Cost Optimization in Cloud-Based Financial Data Lakes: Techniques and Case Studies. ESP Journal of Engineering & Technology Advancements (ESP-JETA), 1(1), 150-157.
[34] Kalla, D., Smith, N., Samaah, F., & Polimetla, K. (2021). Facial Emotion and Sentiment Detection Using Convolutional Neural Network. Indian Journal of Artificial Intelligence Research (INDJAIR), 1(1), 1-13.