Diabetes Prediction Algorithm Using Recursive Ridge Regression L2
作者机构:Department of Computer ScienceSingidunum UniversityBelgrade11000Serbia Department of Computer science and EngineeringK.Ramakrishnan College of TechnologyTrichy621112India Department of Applied CyberneticsFaculty of ScienceUniversity of Hradec Králové50003Hradec KrálovéCzech Republic
出 版 物:《Computers, Materials & Continua》 (计算机、材料和连续体(英文))
年 卷 期:2022年第71卷第4期
页 面:457-471页
核心收录:
学科分类:0711[理学-系统科学] 07[理学] 08[工学] 081101[工学-控制理论与控制工程] 0811[工学-控制科学与工程] 071102[理学-系统分析与集成] 081103[工学-系统工程]
主 题:Ridge regression recursive feature elimination random forest machine learning feature selection
摘 要:At present,the prevalence of diabetes is increasing because the human body cannot metabolize the glucose *** prediction of diabetes patients is an important research *** researchers have proposed techniques to predict this disease through data mining and machine learning *** prediction,feature selection is a key concept in ***,the features that are relevant to the disease are used for *** condition improves the prediction *** the right features in the whole feature set is a complicated process,and many researchers are concentrating on it to produce a predictive model with high *** this work,a wrapper-based feature selection method called recursive feature elimination is combined with ridge regression(L2)to form a hybrid L2 regulated feature selection algorithm for overcoming the overfitting problem of data *** is a major problem in feature selection,where the new data are unfit to the model because the training data are *** regression is mainly used to overcome the overfitting *** features are selected by using the proposed feature selection method,and random forest classifier is used to classify the data on the basis of the selected *** work uses the Pima Indians Diabetes data set,and the evaluated results are compared with the existing algorithms to prove the accuracy of the proposed *** accuracy of the proposed algorithm in predicting diabetes is 100%,and its area under the curve is 97%.The proposed algorithm outperforms existing algorithms.