TY - GEN
T1 - Impact of low class prevalence on the performance evaluation of neural network based classifiers
T2 - 2007 International Joint Conference on Neural Networks, IJCNN 2007
AU - Mazurowski, Maciej A.
AU - Habas, Piotr A.
AU - Zurada, Jacek M.
AU - Tourassi, Georgia D.
PY - 2007
Y1 - 2007
N2 - This paper presents an experimental study on the impact of low class prevalence on the neural network based classifier performance as measured using Receiver Operator Characteristic (ROC) analysis. Two methods of dealing with the problem are investigated: oversampling and undersampling in the context of varying the class prevalence and the size of training datasets with uncorrelated and correlated features. The results show that the class imbalance can significantly decrease the classifier performance especially in the case of small training datasets. Furthermore, the oversampling method is shown to be more effective than the undersampling method in compensating the class imbalance. Statistically significant differences, however, are observed only in the cases with large total number of samples and very low prevalence.
AB - This paper presents an experimental study on the impact of low class prevalence on the neural network based classifier performance as measured using Receiver Operator Characteristic (ROC) analysis. Two methods of dealing with the problem are investigated: oversampling and undersampling in the context of varying the class prevalence and the size of training datasets with uncorrelated and correlated features. The results show that the class imbalance can significantly decrease the classifier performance especially in the case of small training datasets. Furthermore, the oversampling method is shown to be more effective than the undersampling method in compensating the class imbalance. Statistically significant differences, however, are observed only in the cases with large total number of samples and very low prevalence.
UR - http://www.scopus.com/inward/record.url?scp=51749123990&partnerID=8YFLogxK
U2 - 10.1109/IJCNN.2007.4371266
DO - 10.1109/IJCNN.2007.4371266
M3 - Conference contribution
AN - SCOPUS:51749123990
SN - 142441380X
SN - 9781424413805
T3 - IEEE International Conference on Neural Networks - Conference Proceedings
SP - 2005
EP - 2009
BT - The 2007 International Joint Conference on Neural Networks, IJCNN 2007 Conference Proceedings
Y2 - 12 August 2007 through 17 August 2007
ER -