Optimized Decision Tree and Black Box Learners for Revealing Genetic Causes of Bladder Cancer
作者机构:Computer Engineering DepartmentCanakkale Onsekiz Mart UniversityCanakkale17100Turkey
出 版 物:《Intelligent Automation & Soft Computing》 (智能自动化与软计算(英文))
年 卷 期:2023年第37卷第7期
页 面:49-71页
核心收录:
学科分类:1002[医学-临床医学] 100214[医学-肿瘤学] 0811[工学-控制科学与工程] 0812[工学-计算机科学与技术(可授工学、理学学位)] 10[医学]
主 题:Random forest neural network deep learning hyper-parameter optimization bladder cancer single nucleotide polymorphism
摘 要:The number of studies in the literature that diagnose cancer with machine learning using genome data is quite *** studies focus on the prediction performance,and the extraction of genomic factors that cause disease is often ***,finding underlying genetic causes is very important in terms of early diagnosis,development of diagnostic kits,preventive medicine,*** motivation of our study was to diagnose bladder cancer(BCa)based on genetic data and to reveal underlying genetic factors by using machine-learning *** addition,conducting hyper-parameter optimization to get the best performance from different models,which is overlooked in most studies,was another objective of the *** the framework of these motivations,C4.5,random forest(RF),artificial neural networks(ANN),and deep learning(DL)were *** this way,the diagnostic performance of decision tree(DT)-based models and black box models on BCa was also *** most successful model,DL,yielded an area under the curve(AUC)of 0.985 and a mean square error(MSE)of *** each model,hyper-parameters were optimized by an evolutionary *** average,hyper-parameter optimization increased MSE,root mean square error(RMSE),LogLoss,and AUC by 30%,17.5%,13%,and 6.75%,*** features causing BCa were *** this purpose,entropy and Gini coefficients were used for DT-based methods,and the Gedeon variable importance was used for black box *** single nucleotide polymorphisms(SNPs)rs197412,rs2275928,rs12479919,rs798766 and rs2275928,whose BCa relations were proven in the literature,were found to be closely related to *** addition,rs1994624 and rs2241766 susceptibility loci were proposed to be examined in future studies.