基于结构光和深度神经网络的3维面形重建

代金科; 郑素珍; 苏娟

doi:10.7510/jgjs.issn.1001-3806.2023.06.015

基于结构光和深度神经网络的3维面形重建

1.
西南石油大学电气信息学院, 成都 610500
2.
西南石油大学理学院, 成都 610500

作者简介: 代金科(1995-), 男, 硕士研究生, 主要研究方向为光学3维传感和单光子3维成像.

通讯作者: 郑素珍, suzhen317@swpu.edu.cn ;

基金项目:
国家自然科学基金资助项目 11804285

四川省自然科学基金资助项目 2022NSFSC0884
中图分类号: TN247

3-D surface reconstruction based on structured light and deep neural network

1.
School of Electrical Engineering and Information, Southwest Petroleum University, Chengdu 610500, China
2.
School of Sciences, Southwest Petroleum University, Chengdu 610500, China

Corresponding author: ZHENG Suzhen, suzhen317@swpu.edu.cn ;

CLC number: TN247

摘要: 为了提高基于结构光法的3维重建精度, 采用机器学习中的回归模型对物体进行了3维形貌测量, 通过以单目式获取对象高度点不同方向的光强信息簇样本, 将其作为回归模型的训练集, 在训练好回归模型后, 直接建立起条纹图案的光强信息分布与对象高度之间的映射函数关系, 完成对目标的3维测量; 将调制条纹光数值信息以特征形式导入回归模型, 获得端到端高度信息, 验证了机器学习的神经网络回归模型在3维面形重建上的可行性。结果表明, 该模型即使在投影特征模糊或噪音较大的情况也能较精确地重建3维面形, 平均重建误差为1.40×10^-4 mm, 优于一般面形重建方法的数据。该研究为物体在强干扰条件下的单目式高精度3维面形重建提供了参考, 简化了繁琐的计算过程和测量过程, 提高了测量精度。
- 信息光学 /
- 高精度3维面形重建 /
- 深度神经网络 /
- 结构光 /
- 单目式 /
- 形变条纹
Abstract: For the purpose of enhancing the precision of 3-D reconstruction based on the structured light method, the regression model in machine learning was used to measure the 3-D topography of objects. The light intensity information cluster samples in different directions of object height points were obtained monocular as the training set of the regression model. After the regression model was trained, the mapping function relationship between the illumination intensity information distribution of the modulation diagram and the height of the object can be directly established to complete the three-dimensional measurement of the object. The numerical information of modulated fringe light was introduced into the regression model in the form of characteristics. 3-D surface of the object was accurately reconstructed, and the purpose of obtaining the height information from end to end was realized. The feasibility of the neural network regression model based on machine learning in 3-D surface reconstruction was verified. The results show that the model can reconstruct the 3-D surface accurately even when the projection features are fuzzy or the noise is large. The average reconstruction error is 1.40×10^-4 mm, which is better than the data of the general reconstruction method. This study provides a reference for the high-precision 3-D surface reconstruction of monocular objects under strong interference conditions, effectively simplifies the tedious calculation and measurement process, and improves measurement accuracy.
- information optics /
- high precision 3-D surface reconstruction /
- deep neural network /
- structured light /
- monocular /
- stripe of deformation

图 1 结构光法面形恢复原理

Figure 1. Structured light method surface restoration principle

下载: 全尺寸图片幻灯片

图 2 3步相移调制

Figure 2. 3 -step phase shift modulation

下载: 全尺寸图片幻灯片

图 3 深度神经网络拓扑图

Figure 3. Deep neural network topology

下载: 全尺寸图片幻灯片

图 4 多层感知器结构

Figure 4. MLP structure

下载: 全尺寸图片幻灯片

图 5 DNN 3维重建实现过程

Figure 5. 3 -D reconstruction process with DNN

下载: 全尺寸图片幻灯片

图 6 DNN训练流程图

Figure 6. DNN training flowchart

下载: 全尺寸图片幻灯片

图 7 条纹投影调制模式

Figure 7. Stripe projection modulation mode

下载: 全尺寸图片幻灯片

图 8 单点光照强度的样本采集

Figure 8. Sample collection of light intensity at a single point

下载: 全尺寸图片幻灯片

图 9 简化的节点信息传递图

Figure 9. Simplified node information transfer diagram

下载: 全尺寸图片幻灯片

图 10 计算机内存使用示意

Figure 10. Computer memory usage schematic

下载: 全尺寸图片幻灯片

图 11 不同类型的仿真训练物体

a—刀锋状物体 b—波纹状物体 c—随机噪点物体

Figure 11. Different types of simulation training objects

a—knife-like object b—corrugated object c—random noise object

下载: 全尺寸图片幻灯片

图 12 阶梯状物体

Figure 12. Stepped object

下载: 全尺寸图片幻灯片

图 13 倒置共轭阶梯

Figure 13. Inverted conjugate stepped object

下载: 全尺寸图片幻灯片

图 14 MSE收敛过程

Figure 14. MSE convergence process

下载: 全尺寸图片幻灯片

图 15 其它参数变化

Figure 15. Other parameters changes

下载: 全尺寸图片幻灯片

图 16 山峰绝对值函数重建

Figure 16. Peak absolute value function reconstruction

下载: 全尺寸图片幻灯片

图 17 重建误差分布

Figure 17. Reconstruction error distribution

下载: 全尺寸图片幻灯片

图 18 铁质校徽对照重建实验

Figure 18. Iron school emblem contrast reconstruction experiment

下载: 全尺寸图片幻灯片

图 19 太阳神鸟仿品重建

Figure 19. Reconstruction of the Sunbird imitation

下载: 全尺寸图片幻灯片

图 20 观音饰品重建

Figure 20. Reconstruction of Guanyin ornaments

下载: 全尺寸图片幻灯片

表 1 本文中的DNN训练参数与实验配置

Table 1. Training parameters and experimental configuration of DNN

experimental configuration table		network training parameters
inclination angle of projector	15°	initial learning rate	0.01
rotation angle of projector	0°, 45°, 90°	weight decay	open
distance between projector and base plane	0.966 m	optimizer	Adam
distance between camera and base plane	1.0 m	number of learning rounds	193 epochs
program platform	MATLAB/Python	number of hidden layers	32
model of projector	XGIMI NEW Z6X	number of neurons per layer	18
GPU model of computer	GTX 1650	activation function	tanh
CUDA model of computer	896	interlayer connection	dropout
experimental environment	chamber	penalty function	elastic net

下载: 导出CSV

表 2 不同条件下重建能力对比

Table 2. Comparison of reconstruction capabilities under different conditions

measuring distance/mm	light interference	average error/mm	maximum error /mm	reconstruction time/s
1.0×10³	no interference	1.40×10^-4	1.42×10^-4	0.27
1.0×10³	bright light	1.93×10^-4	1.94×10^-4	0.30
2.0×10³	no interference	1.62×10^-4	1.65×10^-4	0.27
2.0×10³	bright light	2.82×10^-4	2.84×10^-4	0.31
3.0×10³	no interference	2.16×10^-4	2.29×10^-4	0.26
3.0×10³	bright light	3.72×10^-4	3.81×10^-4	0.31
4.0×10³	no interference	3.46×10^-4	3.72×10^-4	0.25
4.0×10³	bright light	4.21×10^-4	4.84×10^-4	0.32
5.0×10³	no interference	3.84×10^-4	4.04×10^-4	0.25
5.0×10³	bright light	5.39×10^-4	5.77×10^-4	0.33
6.0×10³	no interference	4.34×10^-4	4.72×10^-4	0.24
6.0×10³	bright light	5.79×10^-4	6.00×10^-4	0.35
7.0×10³	no interference	5.57×10^-4	5.83×10^-4	0.23
7.0×10³	bright light	6.83×10^-4	6.93×10^-4	0.37
8.0×10³	no interference	6.26×10^-4	6.63×10^-4	0.20
8.0×10³	bright light	7.55×10^-4	7.75×10^-4	0.41
9.0×10³	no interference	7.54×10^-4	7.94×10^-4	0.19
9.0×10³	bright light	8.40×10^-4	8.75×10^-4	0.43
10.0×10³	no interference	7.81×10^-4	8.51×10^-4	0.18
10.0×10³	bright light	9.89×10^-4	9.96×10^-4	0.51

下载: 导出CSV

表 3 不同方法重建阶梯状物体性能对照

Table 3. Comparison of properties of step objects reconstructed by different methods

methods	sample size	training time/ (s/batch)	MSE	maximum error/mm	reconstruction time/s
PMP/FTP	—	—	＞1.00	＞1.00	＞1
MLP	unknow	unknow	＞1.00	1.00×10^-2	＞1
BP neural network	32768000	3.9	9.27×10^-4	3.40×10^-3	3.8
our approach	2621440	1.11	4.12×10^-5	1.42×10^-4	0.27

下载: 导出CSV

[1]	MIN L, LI D, DONG Sh. 3D surface roughness measurement based on SFS method[C]//2017 8th International Conference on Intelligent Human-Machine Systems and Cybernetics(IHMSC). Hangzhou, Ch-ina: IEEE, 2017: 484-488.
[2]	郭小凡, 张启灿. 应用BP神经网络重建物体3维面形[J]. 激光杂志, 2019, 40(1): 40-41. GUO X F, ZHANG Q C. Three-dimensional shape reconstruction based on BP neural network[J]. Laser Journal, 2019, 40(1): 40-41(in Chinese).
[3]	GERON A. Hands-on machine learning with scikit-learn, keras, and tensorflow: Concepts, tools, and techniques to build intelligent systems[M]. 2th ed. Sebastopol, USA: O'Reilly Media, 2019: 220-226.
[4]	THEOBALD O. Machine learning for absolute beginners: A plain english introduction[M]. Washington DC, USA: Amazon Publishing, 2019: 66-68.
[5]	LIU Y Sh, WANG R M, ZHAO J J, et al. A novel robust variable selection algorithm for multilayer perceptron[C]//2022 13th Asian Control Conference(ASCC). Jeju, Korea: IEEE, 2022: 470-475.
[6]	NIELSEN M. Neural networks and deep learning[M]. Berlin, Germany: Springer Publishing, 2019: 113-116.
[7]	周志华. 机器学习[M]. 北京: 清华大学出版社, 2017: 35-48. ZHOU Zh H. Machine learning[M]. Beijing: Tsinghua University Press, 2017: 35-48(in Chinese).
[8]	KO B S, KIM H G, OH K J, et al. Controlled dropout: A different approach to using dropout on deep neural network[C]//2017 IEEE International Conference on Big Data and Smart Computing (BigComp). New York, USA: IEEE, 2017: 358-362.
[9]	XIE Sh J, LI L. Improvement and application of deep belief network based on sparrow search algorithm[C]//2021 IEEE International Conference on Advances in Electrical Engineering and Computer A-pplications (AEECA). New York, USA: IEEE, 2021: 705-708.
[10]	李蒙, 张翠, 童杏林. 基于BP算法和FBG传感的复合材料冲击定位检测技术[J]. 激光技术, 2022, 46(3): 320-325. LI M, ZHANG C, TONG X L. Composite material impact location detection technology based on BP algorithm and FBG sensing[J]. Laser Technology, 2022, 46(3): 320-325(in Chinese).
[11]	AYHAN T, ALTUN M. Approximate fully connected neural network generation[C]//2018 15th International Conference on Synthesis, Modeling, Analysis and Simulation Methods and Applications to Circuit Design (SMACD). New York, USA: IEEE, 2018: 93-96.
[12]	YAN D X, AN Y, LI G H, et al. High-resolution reconstruction of FMT based on elastic net optimized by relaxed ADMM[J]. IEEE Transactions on Biomedical Engineering(Early Access), 2022, 10(11): 1-10.
[13]	GONG F X, GONG T R, YU Y, et al. An electricity load forecasting algorithm based on kernel lasso regression[C]//2021 IEEE 4th International Electrical and Energy Conference (CIEEC). New York, USA: IEEE, 2021: 1-4.
[14]	LI D, GE Q F, ZHANG P Ch, et al. Ridge regression with high order truncated gradient descent method[C]//2020 12th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC). New York, USA: IEEE, 2020: 252-255.
[15]	LIU L, LUO Y H, SHEN X, et al. β-Dropout: A unified dropout[J]. IEEE Access, 2019, 7(3): 36140-36153.
[16]	SMITH R, KANDIMALLA V A K, REDDY G D. Predicting diabetes using outlier detection and multilayer perceptron with optimal stochastic gradient descent[C]//2020 IEEE India Council International Subsections Conference (INDISCON). New York, USA: IEEE, 2022: 51-56.
[17]	PATTERSON J, GIBSON A. Deep learning: A practitioners a-pproach[M]. Sebastopol, USA: O'Reilly Published, 2019: 402-406.
[18]	KHANIKI M A L, HADI M B, MANTHOURI M. Feedback error learning controller based on RMSprop and salp swarm algorithm for automatic voltage regulator system[C]//2020 10th International Conference on Computer and Knowledge Engineering (ICCKE). New York, USA: IEEE, 2020: 425-430.
[19]	古德费洛I, 本吉奥Y. 深度学习[M]. 北京: 人民邮电出版社, 2017: 53-79. GOODFELLOW I, BENGIO Y. Deep learning[M]. Beijing: Posts Telecom Press, 2017: 53-79(in Chinese).
[20]	马园园, 王立地. 神经网络的光电测量系统畸变校正和优化研究[J]. 激光杂志, 2017, 37(11): 42-45. MA Y Y, WANG L D. Study on distortion correction and optimization of optical measurement system based on neural network[J]. Laser Journal, 2017, 37(11): 42-45(in Chinese).
[21]	GERON A. Hands-on machine learning with scikit-learn and tensorflow[M]. 2th ed. Sebastopol, USA: O'Reilly Media, 2020: 576-579.

[1]	张志俊 , 吴庆阳 , 邓亦锋 , 蒋逸凡 , 郑国梁 , 翟剑庞 . 基于霍夫变换的结构光场3维成像方法. 激光技术, 2023, 47(4): 492-499. doi: 10.7510/jgjs.issn.1001-3806.2023.04.008
[2]	闫乾宏 , 李勇 , 江溢腾 , 黄凯 , 周星灿 , 陈晓鹏 . 条纹投影动态3维测量中相位高精度估计. 激光技术, 2019, 43(5): 619-623. doi: 10.7510/jgjs.issn.1001-3806.2019.05.006
[3]	刘顺涛 , 骆华芬 , 陈雪梅 , 徐静 . 结构光测量系统的标定方法综述. 激光技术, 2015, 39(2): 252-258. doi: 10.7510/jgjs.issn.1001-3806.2015.02.023
[4]	杨初平 , 翁嘉文 , 杨玲玲 , 张子邦 . 2维载频条纹傅里叶变换轮廓术. 激光技术, 2010, 34(4): 493-496,501. doi: 10.3969/j.issn.1001-3806.2010.04.017
[5]	杨初平 . 高斯光束荧光共焦显微镜的三维光学传递函数. 激光技术, 2005, 29(5): 552-554.
[6]	梁宇龙 , 段发阶 . 基于密度聚类的光条中心线提取方法. 激光技术, 2020, 44(4): 459-465. doi: 10.7510/jgjs.issn.1001-3806.2020.04.011
[7]	曹森鹏 , 王伟锋 , 薛喜昌 . 基于傅里叶变换去隔行图像的动态3维面形测量. 激光技术, 2013, 37(6): 736-741. doi: 10.7510/jgjs.issn.1001-3806.2013.06.007
[8]	蔡振华 , 陈文静 , 钟敏 . 几种小波在3维面形测量中的应用研究. 激光技术, 2015, 39(5): 610-616. doi: 10.7510/jgjs.issn.1001-3806.2015.05.006
[9]	邵珺 , 华文深 , 周中亮 , 高鸿启 . 神经网络和遗传算法在相关峰判读中的应用. 激光技术, 2009, 33(4): 422-425. doi: 10.3969/j.issn.1001-3806.2009.04.026
[10]	朱清溢 , 苏显渝 , 肖焱山 , 向立群 . 基于最大色差彩色组合编码的三维面形测量方法. 激光技术, 2006, 30(4): 340-343.
[11]	雍汉华 , 曹益平 . 基于光栅调制的归一化频谱三维识别. 激光技术, 2008, 32(2): 218-221.
[12]	冯伟 , 张启灿 . 基于结构光投影的薄膜振动模式分析. 激光技术, 2015, 39(4): 446-449. doi: 10.7510/jgjs.issn.1001-3806.2015.04.003
[13]	杨初平 , 纪婧如 , 谭穗妍 , 林盈洪 . 条纹相位分析小区域平整度检测. 激光技术, 2011, 35(6): 784-786,836. doi: 10.3969/j.issn.1001-3806.2011.06.017
[14]	李俊昌 , 马琨 , 樊则宾 , 伏云昌 , 凌东雄 . 三维轴对称场的一种代数层析重建方法. 激光技术, 2004, 28(6): 588-590.
[15]	肖焱山 , 苏显渝 , 张启灿 , 朱清溢 . 动态过程中破裂表面的三维重建. 激光技术, 2006, 30(3): 258-261.
[16]	宋旸 , 张斌 , 贺安之 . 包含遮挡物的三维流场莫尔层析重建. 激光技术, 2007, 31(2): 153-155,159.
[17]	熊润华 , 张启灿 . 基于优先度排序的3维数据缺失快速插补法. 激光技术, 2014, 38(1): 30-34. doi: 10.7510/jgjs.issn.1001-3806.2014.01.007
[18]	郝劲波 , 王良甚 , 忽满利 . 微透镜阵列实现3维物体旋转不变实时识别. 激光技术, 2009, 33(1): 8-11,41.
[19]	王鹏 , 张亚萍 , 张建强 , 吴上 , 陈伟 . 基于数字微镜器件的计算全息3维显示. 激光技术, 2013, 37(4): 483-486. doi: 10.7510/jgjs.issn.1001-3806.2013.04.015
[20]	杨恒 , 黄佐华 , 刘云 . 背景光存在下的相衬法原理分析. 激光技术, 2011, 35(5): 696-698. doi: 10.3969/j.issn.1001-3806.2011.05.032

点击查看大图

图(20) / 表(3)

计量

文章访问数: 1180
HTML全文浏览量: 869
PDF下载量: 18
被引次数: 0

全文HTML

引言

在测量与恢复物体3维面形的领域，以传统结构光法为原理的3维测量有着重要意义，如位相测量轮廓术法(phase measurement profilometry, PMP)和傅里叶变换轮廓术法(Fourier transform profilometry, FTP)等等。这些经典方法对于3维面形的重建已经有了非常大的成效，PMP法能够进行针对于点的初相位求解，解决了被测对象面形的非均匀反射引起的偏差，测量的准确度为等效波长的1%~10%；FTP法只需获取单幅调制图就可以复刻出被测对象的3维面形，并且含有高速和高精度恢复的优点。但是PMP法对正弦光栅的稳定性和相移设备的精密性要求都较高，否则就会存在比较大的测量误差，其测量目标也只适合于小尺寸物体；而FTP法存在频谱混叠现象且缺乏迁移适用性，采用FTP法也有可能因为基准频和零频频谱掺杂导致无法进行准确的窗口式波过滤，从而无法正确恢复被测对象的面形结构。此类经典方法在解相位步骤时都会过于繁琐，亦或是噪声过大光源混杂，或在测量断崖式高度陡变物体等病态问题时，效果往往不尽如人意。

随着人工智能时代的到来，算法图像处理和神经网络于众多方面迅速兴起，其中非常多的算法和框架都被引入到了3维面形重建之中。MIN等人^[1]利用最小二乘法测量3维物体面形粗糙度。GUO等人^[2]以改进简化的反向传播(back propagation, BP)神经网络模型提高3维面形测量精度。本文作者通过对样本数据特征的采集和算法优化，训练一种深度神经网络模型来恢复物体的面形信息。

4. 结论

本文中采用了以较多维度的相关参量作训练集来表征物体面形高度，即GUO等人^[2]提出的进一步优化的方法之一。根据深度信念网络理念，只要模型训练的批次取得合适范围，训练后期步长足够小，并且整个模型有更多的神经结点和隐藏层层数，拟合任意连续超曲面都是可行的，甚至可以让损失无限趋近于0。特征维度增加之后，需要更高质量的样本集来训练神经网络，需要更精准的仪器和低噪声的光照环境中所收集的样本集来进一步提高面形恢复精确度。作者将相位测量轮廓术中的相移思想运用到神经网模型的样本集收集中，通过不同角度的条纹增加了单点光强信息的特征数，同时防止了神经网模型欠拟合。本文中的方法省去了传统相位测量轮廓术和傅里叶变换轮廓术的繁琐计算过程，优化了神经网络模型的训练样本采集方式，在实践中可降低计算时间和设备成本，为实时面形复原提供了一定的可行性。

参考文献 (21)

姓名
邮箱
手机号码
标题
留言内容
验证码

留言板

基于结构光和深度神经网络的3维面形重建

作者简介: 代金科(1995-), 男, 硕士研究生, 主要研究方向为光学3维传感和单光子3维成像.

通讯作者: 郑素珍, suzhen317@swpu.edu.cn ;

3-D surface reconstruction based on structured light and deep neural network

Corresponding author: ZHENG Suzhen, suzhen317@swpu.edu.cn ;

计量

基于结构光和深度神经网络的3维面形重建

通讯作者: 郑素珍, suzhen317@swpu.edu.cn;

作者简介: 代金科(1995-), 男, 硕士研究生, 主要研究方向为光学3维传感和单光子3维成像

English Abstract

3-D surface reconstruction based on structured light and deep neural network

Corresponding author: ZHENG Suzhen, suzhen317@swpu.edu.cn

全文HTML

2.1. 深度神经网络的基本结构

2.2. DNN训练数据采集与预处理

2.3. DNN的前向传播与代价函数

2.4. 随机梯度下降迭代过程

2.5. 反向传播求取梯度

2.6. 多次训练至网络收敛

目录

留言板

基于结构光和深度神经网络的3维面形重建

作者简介: 代金科(1995-), 男, 硕士研究生, 主要研究方向为光学3维传感和单光子3维成像.

通讯作者: 郑素珍, suzhen317@swpu.edu.cn ;

3-D surface reconstruction based on structured light and deep neural network

Corresponding author: ZHENG Suzhen, suzhen317@swpu.edu.cn ;

计量

出版历程

基于结构光和深度神经网络的3维面形重建

通讯作者: 郑素珍, suzhen317@swpu.edu.cn;

作者简介: 代金科(1995-), 男, 硕士研究生, 主要研究方向为光学3维传感和单光子3维成像

English Abstract

3-D surface reconstruction based on structured light and deep neural network

Corresponding author: ZHENG Suzhen, suzhen317@swpu.edu.cn

全文HTML

2.1. 深度神经网络的基本结构

2.2. DNN训练数据采集与预处理

2.3. DNN的前向传播与代价函数

2.4. 随机梯度下降迭代过程

2.5. 反向传播求取梯度

2.6. 多次训练至网络收敛

目录