Abstract
Ensemble classifiers have been shown efficient in multiple applications. In this article, the authors explore the effectiveness of ensemble classifiers in a case-based computer-aided diagnosis system for detection of masses in mammograms. They evaluate two general ways of constructing subclassifiers by resampling of the available development dataset: Random division and random selection. Furthermore, they discuss the problem of selecting the ensemble size and propose two adaptive incremental techniques that automatically select the size for the problem at hand. All the techniques are evaluated with respect to a previously proposed information-theoretic CAD system (IT-CAD). The experimental results show that the examined ensemble techniques provide a statistically significant improvement (AUC=0.905±0.024) in performance as compared to the original IT-CAD system (AUC=0.865±0.029). Some of the techniques allow for a notable reduction in the total number of examples stored in the case base (to 1.3% of the original size), which, in turn, results in lower storage requirements and a shorter response time of the system. Among the methods examined in this article, the two proposed adaptive techniques are by far the most effective for this purpose. Furthermore, the authors provide some discussion and guidance for choosing the ensemble parameters.
Original language | English |
---|---|
Pages (from-to) | 2976-2984 |
Number of pages | 9 |
Journal | Medical Physics |
Volume | 36 |
Issue number | 7 |
DOIs | |
State | Published - 2009 |
Externally published | Yes |
Funding
This work was supported in part by Grant No. R01 CA101911 from the National Cancer Institute and the University of Louisville Grosscurth Fellowship.
Funders | Funder number |
---|---|
National Cancer Institute | R01CA101911 |
University of Louisville |