Abstract:Because the credit history data of small and micro enterprises are small and the problem of class imbalance is more serious, this paper proposes a Smote XGboost-Bayes Minimum Risk (SXG-BMR) model based on the sample-dependent cost matrix. The whole sample is oversampled at a low rate to weaken the problem of class imbalance and reduce the risk of model overfitting. The model combines the integrated learning model with the minimum risk Bayes decision to realize the cost sensitivity. At the same time, this paper introduces the sample-dependent cost matrix into the model. The cost matrix is related not only to the category, but also to the attributes of the sample.Therefore ,it can characterize the cost more accurately. In the empirical study,this paper uses a standard credit dataset and a real credit dataset of small and micro enterprises in Shanghai. Besides,it compares and analzes of various algorithms. The results show that the SXG-BMR model proposed in this paper has a good performance.