An Ensemble Learning Based Approach for Detecting and Tracking COVID19 Rumors
作者机构:Computer Science DepartmentCollege of Computer and Information SciencesAl Imam Mohammad Ibn Saud Islamic University(IMSIU)Riyadh11432Saudi Arabia Computer Science DepartmentFaculty of Applied ScienceTaiz UniversityTaiz6803Yemen College of Computer Science and EngineeringTaibah UniversityMedinaSaudi Arabia Information System DepartmentSaba’a Region UniversityMareebYemen
出 版 物:《Computers, Materials & Continua》 (计算机、材料和连续体(英文))
年 卷 期:2022年第70卷第1期
页 面:1721-1747页
核心收录:
学科分类:0502[文学-外国语言文学] 050201[文学-英语语言文学] 05[文学]
基 金:This research was funded by the Deanship of Scientific Research Imam Mohammad Ibn Saud Islamic University Saudi Arabia Grant No.(20-12-18-013)
主 题:Rumor detection rumor tracking similarity techniques COVID-19 social media analytics
摘 要:Rumors regarding epidemic diseases such as COVID 19,medicines and treatments,diagnostic methods and public emergencies can have harmful impacts on health and political,social and other aspects of people’s lives,especially during emergency situations and health *** huge amounts of content being posted to social media every second during these situations,it becomes very difficult to detect fake news(rumors)that poses threats to the stability and sustainability of the healthcare sector.A rumor is defined as a statement for which truthfulness has not been *** COVID 19,people found difficulty in obtaining the most truthful news easily because of the huge amount of unverified information on social *** methods have been applied for detecting rumors and tracking their sources for COVID 19-related ***,very few studies have been conducted for this purpose for the Arabic language,which has unique ***,this paper proposes a comprehensive approach which includes two phases:detection and *** the detection phase of the study carried out,several standalone and ensemble machine learning methods were applied on the Arcov-19 dataset.A new detection model was used which combined two models:The Genetic Algorithm Based Support Vector Machine(that works on users’and tweets’features)and the stacking ensemble method(that works on tweets’texts).In the tracking phase,several similarity-based techniques were used to obtain the top 1%of similar tweets to a target tweet/post,which helped to find the source of the *** experiments showed interesting results in terms of accuracy,precision,recall and F1-Score for rumor detection(the accuracy reached 92.63%),and showed interesting findings in the tracking phase,in terms of ROUGE L precision,recall and F1-Score for similarity techniques.