TY - GEN
T1 - Federated Gradient Boosting for Financial Fraud Detection
T2 - 34th ACM International Conference on Information and Knowledge Management, CIKM 2025
AU - Park, Dae Young
AU - Ko, In Young
AU - Lee, Taek Ho
AU - Lee, Junghye
N1 - Publisher Copyright:
© 2025 Copyright held by the owner/author(s).
PY - 2025/11/10
Y1 - 2025/11/10
N2 - The development of effective fraud detection systems (FDS) is hindered by strict data privacy regulations that prevent centralized data sharing. Federated learning (FL) has emerged as a promising alternative, enabling collaborative model training without exposing sensitive data. While FL has been explored in the healthcare domain, research on its application to financial fraud detection remains relatively limited. Specifically, FL research on real-world banking fraud types-with detailed customer, account, and transaction data-remains underexplored. We present the first empirical study of federated gradient boosting models for financial fraud detection in the banking sector, motivated by their superior performance over deep learning models on tabular fraud data. We evaluate and compare four representative federated gradient boosting models using both a private multi-fraud banking dataset from the Financial Security Institute (FSI) and a publicly available banking dataset, under various scenarios. Key findings include the consistent superiority of FedXGBBagging (a federated gradient boosting model), general vulnerability to data quantity skew, performance instability under bank join/dropout, and limitations in detecting localized banking fraud types such as ATM skimming. The findings from our empirical study highlight challenges and design considerations for deploying FL-based FDSs in the banking sector.
AB - The development of effective fraud detection systems (FDS) is hindered by strict data privacy regulations that prevent centralized data sharing. Federated learning (FL) has emerged as a promising alternative, enabling collaborative model training without exposing sensitive data. While FL has been explored in the healthcare domain, research on its application to financial fraud detection remains relatively limited. Specifically, FL research on real-world banking fraud types-with detailed customer, account, and transaction data-remains underexplored. We present the first empirical study of federated gradient boosting models for financial fraud detection in the banking sector, motivated by their superior performance over deep learning models on tabular fraud data. We evaluate and compare four representative federated gradient boosting models using both a private multi-fraud banking dataset from the Financial Security Institute (FSI) and a publicly available banking dataset, under various scenarios. Key findings include the consistent superiority of FedXGBBagging (a federated gradient boosting model), general vulnerability to data quantity skew, performance instability under bank join/dropout, and limitations in detecting localized banking fraud types such as ATM skimming. The findings from our empirical study highlight challenges and design considerations for deploying FL-based FDSs in the banking sector.
KW - federated gradient boosting model
KW - financial fraud detection
UR - https://www.scopus.com/pages/publications/105023149359
U2 - 10.1145/3746252.3760891
DO - 10.1145/3746252.3760891
M3 - Conference contribution
AN - SCOPUS:105023149359
T3 - CIKM 2025 - Proceedings of the 34th ACM International Conference on Information and Knowledge Management
SP - 5089
EP - 5093
BT - CIKM 2025 - Proceedings of the 34th ACM International Conference on Information and Knowledge Management
PB - Association for Computing Machinery, Inc
Y2 - 10 November 2025 through 14 November 2025
ER -