Big Data Analytics and Machine Learning Techniques for Real-Time Credit Card Fraud Detection

  • Sadia Abbas Shah School of System and Technology, Department of Software Engineering, University of Management and Technology, Lahore, Pakistan
  • Rabia Javed Department of Computer Science, Lahore College for Women University, Lahore, Pakistan
  • Fahima Tahir Department of Computer Science, Lahore College for Women University, Lahore, Pakistan
  • Khansa Aatif Department of Computer Science, Lahore College for Women University, Lahore, Pakistan
  • Wajeeha Malik Department of Computer Science, Lahore College for Women University, Lahore, Pakistan
Keywords: Anomaly Detection, Machine Learning Algorithms, Big Data, Credit Card Fraud Detection, XGBoost and KNN

Abstract

Big Data is commonly characterized by the 4 V's: Volume, Variety, Velocity, and Veracity. In today’s digital age, data is generated in terabytes and petabytes, far exceeding the storage capabilities of a single machine. With data constantly circulating across cloud platforms, the risk of leakage and fraud has increased significantly credit card fraud being one of the most pressing global concerns. As numerous shopping platforms and businesses operate around us, each domain generates vast amounts of data, often reaching into yottabytes. Manually handling, analyzing, or detecting anomalies in such large-scale data is extremely challenging. However, with the advancement of computing and emerging technologies, detecting fraud has become much more efficient and scalable. This study examines the application of big data in analysing credit card consumer behaviour, specifically in the context of online transactions, password creation, age, income, and other relevant factors. The focus is on identifying anomalies in these data points to detect potentially fraudulent activities quantitative approach is employed to identify statistical patterns, and the performance of seven different machine learning algorithms, such as Logistic Regression, K-Nearest Neighbours (KNN), and XGBoost, is evaluated for their effectiveness. As technology advances, factors such as age and increasing reliance on online transactions, e-commerce, and digital banking contribute to rising vulnerabilities, making fraud detection more critical than ever. Result: In the Real-time credit card fraud detection using big data, different algorithms are discussed and implemented, so KNN & XGBOOST give better results than other ML algorithms. The impact of compliance on sophisticated data-based security systems will be examined in a later study, which can make use of historical fraud typologies and trends to comprehend potential shifts over time.

Published
2025-10-15
How to Cite
Abbas Shah, S., Javed, R., Tahir, F., Aatif, K., & Malik, W. (2025). Big Data Analytics and Machine Learning Techniques for Real-Time Credit Card Fraud Detection. International Journal for Electronic Crime Investigation, 9(2). https://doi.org/10.54692/ijeci.2025.0902/255