On importance weighting for electric fraud detection with dataset shifts
Title | On importance weighting for electric fraud detection with dataset shifts |
Publication Type | Conference Proceedings |
Year of Conference | 2019 |
Authors | Antonelo EA, State R |
Conference Name | IEEE International Conference on Systems, Man, and Cybernetics (SMC) |
Pagination | 3215-3222 |
Date Published | 2019 |
Keywords | covariate shift, electric fraud detection, importance weighting |
Abstract | Covariate shift and imbalanced datasets are common in real-world scenarios. Usually, the probability distribution for the collected data is non-stationary due to the incremental and endless process of sequential data collection, which is influenced by actions and predictions of human experts, predictive models, set of rules or other unknown external factors (i.e., user interaction on a website, seasonal/cyclic or geographical factors). Thus, a predictive model may be suboptimal in terms of generalization performance under a shift in the test input. In this work, we evaluate the importance-weighted fisher discriminant analysis (FDA) classifier in an electric fraud detection task with dataset shift, where the goal is to detect customers with frauds or irregular eletricity meters, also called nontechnical loss detection in the literature. |