A Novel Self-Supervised Learning Network for Binocular Disparity Estimation
作者机构:Department of Computer Science and EngineeringMajor in Bio Artificial IntelligenceHanyang UniversityAnsan-si15577Republic of Korea School of Electrical and Computer EngineeringLouisiana State UniversityBaton RougeLA 70803USA Department of Computer EngineeringCollege of Computer and Information SciencesKing Saud UniversityRiyadh11574Saudi Arabia School of AutomationUniversity of Electronic Science and Technology of ChinaChengdu610054China
出 版 物:《Computer Modeling in Engineering & Sciences》 (工程与科学中的计算机建模(英文))
年 卷 期:2025年第142卷第1期
页 面:209-229页
核心收录:
学科分类:08[工学] 0812[工学-计算机科学与技术(可授工学、理学学位)]
基 金:Supported by Sichuan Science and Technology Program(2023YFSY0026,2023YFH0004) Supported by the Institute of Information&Communications Technology Planning&Evaluation(IITP)grant funded by the Korean government(MSIT)(No.RS-2022-00155885,Artificial Intelligence Convergence Innovation Human Resources Development(Hanyang University ERICA))
主 题:Parallax estimation parallax regression model self-supervised learning Pseudo-Siamese neural network pyramid dilated convolution binocular disparity estimation
摘 要:Two-dimensional endoscopic images are susceptible to interferences such as specular reflections and monotonous texture illumination,hindering accurate three-dimensional lesion reconstruction by surgical *** study proposes a novel end-to-end disparity estimation model to address these *** approach combines a Pseudo-Siamese neural network architecture with pyramid dilated convolutions,integrating multi-scale image information to enhance robustness against lighting *** study introduces a Pseudo-Siamese structure-based disparity regression model that simplifies left-right image comparison,improving accuracy and *** model was evaluated using a dataset of stereo endoscopic videos captured by the Da Vinci surgical robot,comprising simulated silicone heart sequences and real heart video *** results demonstrate significant improvement in the network’s resistance to lighting interference without substantially increasing ***,the model exhibited faster convergence during training,contributing to overall performance *** study advances endoscopic image processing accuracy and has potential implications for surgical robot applications in complex environments.