Random forest classifier combined with feature selection for breast cancer diagnosis and prognostic
Author affiliations: College of Technology, Vietnam National University, Hanoi, Vietnam; School of Business and Administration, Chongqing University, Chongqing, China
Publication: Journal of Biomedical Science and Engineering
Year, volume, issue: 2013, Vol. 6, No. 5
Pages: 551-560
Subject classification: 1002 [Medicine: Clinical Medicine]; 100214 [Medicine: Oncology]; 10 [Medicine]
Topics: Breast Cancer; Diagnosis; Prognosis; Feature Selection; Random Forest
Abstract: As the incidence of breast cancer has increased significantly in recent years, expert systems and machine learning techniques for this problem have attracted considerable attention from many scholars. This study aims at diagnosing and prognosticating breast cancer with a machine learning method based on a random forest classifier and a feature selection technique. By weighting features, keeping the useful ones, and removing the redundant ones from the datasets, the method was applied to the diagnosis problem by classifying the Wisconsin Breast Cancer Diagnosis Dataset and to the prognosis problem by classifying the Wisconsin Breast Cancer Prognostic Dataset. On these datasets we obtained a classification accuracy of 100% in the best case and around 99.8% on average. This is very promising compared with previously reported results. Although these results are for the Wisconsin Breast Cancer datasets, they suggest that the method can be used with confidence for other breast cancer diagnosis problems as well.
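The general idea in the abstract (weight the features, keep the useful ones, then classify with a random forest) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say which feature selection technique or settings were used, so the sketch assumes scikit-learn, its bundled copy of the Wisconsin diagnostic dataset, importance-based selection with a median threshold, 100 trees, and 5-fold cross-validation. It should not be expected to reproduce the accuracies reported above.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.feature_selection import SelectFromModel
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline

# Wisconsin diagnostic data as shipped with scikit-learn
# (assumption: used here as a stand-in for the diagnosis dataset in the abstract).
X, y = load_breast_cancer(return_X_y=True)

# Step 1 (assumed technique): weight features by random-forest importance
# and keep only those at or above the median importance.
selector = SelectFromModel(
    RandomForestClassifier(n_estimators=100, random_state=0),
    threshold="median",
)

# Step 2: classify with a second random forest trained on the kept features.
# Putting both steps in one pipeline keeps the selection inside each CV fold.
model = make_pipeline(
    selector,
    RandomForestClassifier(n_estimators=100, random_state=0),
)

# Illustrative evaluation: mean accuracy over 5 stratified folds.
scores = cross_val_score(model, X, y, cv=5)
mean_accuracy = scores.mean()

# Refit on all data only to report how many features the selector retained.
model.fit(X, y)
n_kept = int(model.named_steps["selectfrommodel"].get_support().sum())

print(f"features kept: {n_kept} of {X.shape[1]}")
print(f"mean CV accuracy: {mean_accuracy:.3f}")
```

Wrapping the selector and the classifier in a single pipeline is a deliberate choice in this sketch: selecting features on the full dataset before cross-validation would leak information from the held-out folds and inflate the accuracy estimate.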