Analyzing the Dissemination of News by Model Averaging and Subsampling

Author: ZOU Jiahui

Affiliation: School of Statistics, Capital University of Economics and Business

Publication: Journal of Systems Science & Complexity (English edition)

Year, volume, issue: 2024, Vol. 37, No. 5

Pages: 2104-2131

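Subject classification: 050302 [Literature - Communication Studies]; 02 [Economics]; 0202 [Economics - Applied Economics]; 020208 [Economics - Statistics]; 05 [Literature]; 07 [Science]; 0714 [Science - Statistics (science or economics degrees may be awarded)]; 070103 [Science - Probability Theory and Mathematical Statistics]; 0503 [Literature - Journalism and Communication]; 0701 [Science - Mathematics]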

Funding: Supported by the National Natural Science Foundation of China under Grant No. 12201431 and the Young Teacher Foundation of Capital University of Economics and Business under Grant Nos. XRZ2022-070 and 00592254413070.

Keywords: Asymptotic optimality; dissemination of news; linear regression models; model averaging; optimal subsampling

Abstract: The dissemination of news is a vital topic in management science, social science, and data science. With the development of technology, the sample sizes and dimensions of digital news data have increased remarkably. To alleviate the computational burden of big data, this paper proposes a method for handling massive, moderate-dimensional data in linear regression models by combining model averaging and subsampling methodologies. The author first draws a subsample from the full data according to some special probabilities and splits the covariates into several groups to construct candidate models. Then, the author solves each candidate model and calculates the model-averaging weights to combine these estimators based on this subsample. Additionally, the asymptotic optimality in subsampling form is proved and the way to calculate optimal subsampling probabilities is *** author also illustrates the proposed method via simulations, which show that it takes less running time than using the full data and generates more accurate estimates than uniform subsampling. Finally, the author applies the proposed method to analyze and predict the number of shares of news, and finds that topic, vocabulary, and dissemination time are the determinants.
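The pipeline outlined in the abstract (subsample, split covariates into groups, fit each candidate model, combine with weights) can be illustrated with a minimal Python sketch. Everything specific here is an assumption, since the abstract does not give the details: the sampling probabilities are taken proportional to row norms as a placeholder for the paper's "special probabilities", each candidate model is assumed to use one covariate group, and smoothed-AIC weights are swapped in for the paper's weight criterion. The function name `subsample_model_average` and the simulated data are hypothetical.

```python
import numpy as np

def subsample_model_average(X, y, groups, r, rng):
    """Illustrative only: subsample, fit one candidate per group, average."""
    n, p = X.shape
    # Assumed probabilities: proportional to row norms (placeholder choice).
    norms = np.linalg.norm(X, axis=1)
    probs = norms / norms.sum()
    idx = rng.choice(n, size=r, replace=True, p=probs)
    Xs, ys = X[idx], y[idx]
    # Inverse-probability weights correct for the non-uniform sampling.
    w_obs = 1.0 / (r * probs[idx])
    sw = np.sqrt(w_obs)

    betas, aics = [], []
    for cols in groups:
        # Weighted least squares on the covariates of this candidate model.
        coef, *_ = np.linalg.lstsq(Xs[:, cols] * sw[:, None], ys * sw, rcond=None)
        beta = np.zeros(p)
        beta[cols] = coef
        resid = ys - Xs @ beta
        sigma2 = np.sum(w_obs * resid**2) / np.sum(w_obs)
        aics.append(r * np.log(sigma2) + 2 * len(cols))
        betas.append(beta)

    # Smoothed-AIC weights (a stand-in, not the paper's criterion).
    aics = np.array(aics)
    weights = np.exp(-0.5 * (aics - aics.min()))
    weights /= weights.sum()
    return weights @ np.array(betas), weights

# Simulated data, purely for demonstration.
rng = np.random.default_rng(0)
beta_true = np.array([1.0, -2.0, 0.5, 3.0])
X = rng.normal(size=(5000, 4))
y = X @ beta_true + 0.5 * rng.normal(size=5000)
groups = [[0, 1], [2, 3], [0, 1, 2, 3]]
beta_avg, weights = subsample_model_average(X, y, groups, r=500, rng=rng)
```

Only 500 of the 5000 rows enter the fits, which is where the computational saving would come from. The averaged coefficient vector `beta_avg` has one entry per covariate, and `weights` has one entry per candidate model.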
