基于监督下降法的大地电磁二维反演及应用研究
Application of supervised descent method for 2D magnetotelluric inversion and its application
通讯作者: 谭捍东(1966-),男,教授,地球探测与信息技术专业,主要从事电法勘探理论及应用研究工作。Email:thd@cugb.edu.cn
责任编辑: 王萌
收稿日期: 2023-10-13 修回日期: 2023-10-20
基金资助: |
|
Received: 2023-10-13 Revised: 2023-10-20
作者简介 About authors
付兴(1999-),男,硕士研究生,地球探测与信息技术专业,主要研究方向为机器学习与电法勘探算法。Email:
传统的大地电磁二维反演方法较为成熟,但仍存在反演结果依赖初始模型和正则化参数选取、容易陷入局部极小值等问题。监督下降法是一种学习平均下降方向来预测数据残差的机器学习算法。本文尝试采用监督下降法解决传统的大地电磁二维反演存在的问题,基于监督下降法理论开发了大地电磁二维反演算法,设计理论模型合成算例验证了算法的正确性,并对西藏高原实测数据进行反演,检验了监督下降法的实用性。理论模型合成数据和实测数据反演结果表明,相较于传统的非线性共轭梯度反演,基于监督下降法的反演具有收敛速度快、反演效果好、抗噪能力强等特点。
关键词:
Traditional two-dimensional inversion methods of magnetotelluric are mature, but there are still some problems, such as reliance on the initial model, reliance on regularization parameter selection, and easy to fall into local minimum. In order to solve the above problems, this paper adopts the supervised descent method to improve the effect of two-dimensional inversion of magnetotelluric. The supervised descent method is a machine learning algorithm that learns the average descending direction to predict the residual of data. Based on the theory of supervised descent method, this paper develops the two-dimensional inversion algorithm of magnetotelluric, designs the theoretical model synthesis example to verify the correctness of the algorithm, and inverts the measured data on the Tibet Plateau to test the practicability of the supervised descent method. The inversion results of the theoretical model synthesis data and the measured data show that, compared with the traditional nonlinear conjugate gradient inversion, the inversion based on the supervised descent method has the characteristics of fast convergence speed, good inversion effect, and strong anti-noise ability.
Keywords:
本文引用格式
付兴, 谭捍东, 董岩, 汪茂.
FU Xing, TAN Han-Dong, DONG Yan, WANG Mao.
0 引言
大地电磁测深法(MT)是一种通过观测天然交变电磁场来对地电构造进行研究的物探方法,大地电磁法的频域很宽,其探测深度距地表有几百米到数百公里不等,具有分辨率高,穿透能力强,勘探深度大以及效率高等实际优点,被广泛应用于金属矿产普查、勘探地热和油气田等领域[1]。
本文采用监督下降法理论改进大地电磁二维数据反演算法,编写相关代码并设计理论模型验证算法的正确性,对其抗噪能力及实用性进行检验。相较于传统的非线性共轭梯度反演,基于监督下降法的反演具有收敛速度快、反演效果好、抗噪能力强等特点。
1 大地电磁二维正反演简介
1.1 大地电磁二维有限单元法正演
忽略位移电流和磁导率的影响,时间因子为e-iωt,大地电磁场满足麦克斯韦方程组:
式中:σ为介质的电导率;μ0为介质的磁导率;E为电场强度;H为磁感应强度。由式(1)可推导出赫姆霍兹方程组:
式中:k为波数,k=
式中:AB为上边界,CD为下边界。
对图1所有单元进行离散,式(3)变为:
式中:K为总体系数矩阵,求取式(5)的极值,可得
可得出各节点的u,相应的视电阻率及阻抗相位为:
图1
1.2 大地电磁二维非线性共轭梯度反演
常规的大地电磁二维反演方法有高斯牛顿法、OCCAM法、非线性共轭梯度法。本文将采用非线性共轭梯度反演与基于监督下降法的反演进行对比,下面概要介绍非线性共轭梯度反演方法。
定义大地电磁二维反演目标函数如下[15]:
式中:m为模型参数;F(m)为求取阻抗张量的正演函数;dobs为模型正演响应或实测数据;V为协方差矩阵;λ为正则化参数;L为二次差分拉普拉斯算子;m0为先验信息,目标函数具有数据拟合差最小和模型最光滑的双重约束。
目标函数的梯度为:
其中:A表示雅可比矩阵,数据误差向量:e=dobs-F(m)。
1)i=0时,设定初始模型mi,计算梯度ri=gi;
2)取Mi=
3)求解ri=0,得出最优搜索步长αi;
4)mi+1=mi+αiui,ri+1=gi+1;
5)若ri+1小于预设的最小值,则反演结束,否则令ui+1=
6)取i=i+1,回到步骤3),直至收敛,反演结束。
2 基于监督下降法的大地电磁二维反演
监督下降法运用机器学习理论,采取半监督学习方式,分为训练和预测两个阶段。训练阶段通过对训练集的学习,得到一组平均下降方向。预测阶段运用平均下降方向迭代计算数据残差,进而进行迭代。监督下降法的流程如图2。
图2
2.1 训练阶段
在进行大地电磁反演计算时,定义目标函数如下:
对式(10)的正演函数F(m)进行一阶Taylor展开,求取对m的极小值,可得:
其中:J为雅可比矩阵;Δm=m-m1;Δd=dobs-F(m);α=
此时,相较于每次迭代均需计算雅可比矩阵或海森矩阵且每步均需搜索下降步长的常规大地电磁反演,引入机器学习的思想,通过学习的平均下降方向,使目标函数快速趋于极小值的方法可以节约大量计算时间以及大量内存空间,同时避免目标函数陷入局部极小值。
合成一组训练集(标签集MP和数据集DP),为了计算下降方向,求下式极小值:
其中:
其中:m1为初始模型(通常为均匀半空间);
式中:i为迭代次数;λ为阻尼系数(可与相加的内容呈现比例关系),
Mi+1=Mi+ΔDiαi, (16)
在此阶段,数据拟合差RMSD与模型拟合差RMSM为:
当模型拟合差趋于稳定、足够小或者达到了预设的最大迭代次数时,训练阶段结束。
2.2 预测阶段
将上阶段的平均步长用于该阶段的计算,具体过程如下:
从i=1开始计算,m1与训练过程的初始模型相同。当F(mi)与dobs间的数据残差为0时,停止迭代,输出反演结果。
为解决反演的多解性问题,考虑在预测阶段加入吉洪诺夫正则化来解决这一问题。
式(19)为进行正则化的目标函数,其中υv、υh为垂直方向以及水平方向的正则化系数。
式中:
式中:υd=1/
求取式(19)的极小值进行偏导处理,可得下式,化简后,可得
求解式(22),解出的mi,利用mi进行下次迭代,直至数据拟合差停止变小或小于一定数值,数据拟合差公式如下:
最大训练次数的步长预测完后,得出的数据拟合差未达到标准或者预测结果未达到期望,采取重启步长算法,即从第一个步长开始,进行新一轮的预测,直至数据拟合差足够小或不再减少。
3 理论模型合成数据二维反演算例
大地电磁的二维正演采用有限单元法,将地下介质剖分成33×26的网格单元,横向网格中心区域均匀剖分,两侧不断变大。纵向为26个不断增大的非均匀网格,设计40个频点。
将部分不同电阻率的规则形状的异常体在背景电阻率为100 Ω·m的均匀半空间内移动,形成了4 355个不同模型,如图3所示。
图3
将上述模型分别进行正演计算,使用正演结果中TM模式的数据信息,将上述模型与正演响应构成一个训练集,进行离线训练,本阶段采用matlab及C++混合编程的方法,模型试算工作采用的CPU型号为i5-12500H,开启10线程并行,训练共耗时2 h。
图4
图4
迭代过程的模型拟合差
Fig.4
The normalized model misfit in each iteration during training
图5
3.1 理论模型1
模型1(图6)包含4个异常体,背景电阻率为100 Ω·m,异常体1是大小为800 m×800 m、埋深为400 m、电阻率值为1 000 Ω·m的高阻异常体,异常体2是大小为800 m×700 m、埋深为300 m、电阻率值为10 Ω·m的低阻异常体,异常体3是大小为600 m×700 m、埋深为300 m、电阻率值为1 000 Ω·m的高阻异常体,异常体4是大小为1 000 m×1 000 m、埋深为400 m、电阻率值为10 Ω·m的低阻异常体。
图6
图6
理论模型1示意(从左到右分别为异常体1、2、3、4)
Fig.6
Schematic diagram of model 1 (abnormal bodies 1, 2, 3 and 4 are arranged from left to right)
图7
图7
预测结果抗噪能力对比
a—高斯误差为0%的预测结果 ;b—高斯误差为10%的预测结果
Fig.7
Comparison of anti-interference ability
a—inversion results with 0% random noise;b—inversion results with 10% random noise
图8
图9a为非线性共轭梯度法的二维反演结果,用时5.695 s。高阻异常体1、3均未能恢复理论模型电阻率,能还原异常体大致位置;低阻异常体2、4未能恢复理论模型电阻率,大致还原异常体位置。
图9
图9
理论模型1反演结果对比
a—非线性共轭梯度反演结果;b—监督下降法反演结果
Fig.9
Comparison of inversion results of model 1
a—inversion results of using NLCG scheme;b——inversion results of using SDM scheme
3.2 理论模型2
模型2(图10)包含4个异常体,背景电阻率为100 Ω·m,异常体1是大小为1 400 m×1 000 m、埋深为1 200 m、电阻率值为1 000 Ω·m的高阻异常体,异常体2是埋深300 m、电阻率值为10 Ω·m的阶梯状低阻异常体,异常体3是大小为1 200 m×260 m、埋深为40 m、电阻率值为10 Ω·m的低阻异常体,异常体4是大小为600 m×1 300 m、埋深为850 m、电阻率值为1 000 Ω·m的高阻异常体。
图10
图10
理论模型2示意(从左到右分别为异常体1、2、3、4)
Fig.10
Schematic diagram of model 2 (abnormal bodies 1, 2, 3 and 4 are arranged from left to right)
图11a为非线性共轭梯度法的二维反演的结果,用时5.6 s。高阻异常体1未能恢复理论模型的电阻率,也未能还原异常体位置;非线性共轭梯度反演方法仅恢复了低阻异常体2地下1 000 m以上的信息,但基本无法得出1 000 m以下的有效信息,可能受低阻异常体3的影响较大;低阻异常体3大致恢复理论模型的电阻率,并还原异常体位置;高阻异常体4未恢复理论模型的电阻率,也未还原异常体大致位置。
图11
图11
理论模型2反演结果对比
a—非线性共轭梯度反演结果;b—监督下降法反演结果
Fig.11
Comparison of inversion results of model 2
a—inversion results of using NLCG scheme;b—inversion results of using SDM scheme
图11b为监督下降算法反演的结果,初始迭代误差为0.546,迭代10次后迭代误差为0.063 1,用时2.8 s。高阻异常体1理论模型电阻率的恢复效果优于11a,且能还原异常体位置;监督下降算法可以反演出低阻阶梯状异常体2的地下2 000 m以上的异常体信息,并恢复异常体理论模型电阻率,2 000~3 000 m的部分没有很好的体现,可能是低阻异常体3屏蔽的影响;低阻异常体3能恢复理论模型的电阻率,也能还原异常体位置;高阻异常体4理论模型电阻率恢复效果优于11a,可能是低阻异常体3的影响,也未还原异常体位置。
3.3 理论模型3
模型3(图12)为复杂地层模型,背景电阻率为100 Ω·m,存在地层平均层厚500 m,顶部埋深100 m,电阻率值10 Ω·m的中部突起地层异常体。
图12
图13
图13
反演结果对比
a—非线性共轭梯度反演结果;b—监督下降法反演结果
Fig.13
Comparison of inversion results
a—inversion results of using NLCG scheme;b—inversion results of using SDM scheme
总体上,监督下降法的反演效果优于非线性共轭梯度法的反演效果。训练集中并没有训练较复杂的模型,但我们仍可以反演理论模型的高阻异常体、复杂异常体以及复杂地层异常模型,说明监督下降法具有处理较复杂模型的能力。
4 实测数据应用
为验证算法的实用性,将本文开发的反演算法应用于西藏高原南部雅鲁藏布江缝合带地区实测数据的反演。测线南起错那县,终止于墨竹工卡县,全长约240 km,测点29个,观测频率范围为0.000 1~250 Hz。测线横跨雅鲁藏布江缝合带,缝合带附近存在大规模的冈底斯花岗岩[18]。
图14为原始数据拟断面图,本文主要对TM模式的数据进行反演,对比监督下降算法及非线性共轭梯度算法反演结果,以此验证监督下降法的实用性。
图14
图14
原始数据拟断面(空白区域为剔除的坏点)
a—TM模式视电阻率拟断面;b—TM模式相位拟断面;c—TE模式视电阻率拟断面;d—TE模式相位拟断面
Fig.14
Pseudosection of field data (blank areas indicate removed bad points)
a—apparent resistivity of TM mode;b—impedance phase of TM mode;c—apparent resistivity of TE mode;d—impedance phase of TE mode
图15
图15
实测数据反演结果对比
a—非线性共轭梯度法反演结果;b—监督下降法的反演结果
Fig.15
Comparison of inversion results of field data
a—inversion results using NLCG scheme;b—inversion results using SDM scheme
图16
图16
反演结果与实测数据对比
a—原始数据TM模式视电阻率拟断面;b—原始数据TM模式相位拟断面;c—SDM结果TM模式视电阻率拟断面;d—SDM结果TM模式相位拟断面
Fig.16
Comparison of model data and Field data
a—field data apparent resistivity of TM mode;b—field data impedance phase of TM mode;c—SDM inversion results apparent resistivity of TM mode;d—SDM inversion results impedance phase of TM mode
5 结论
本文实现了基于监督下降法的大地电磁二维反演,学习了一组训练集,对三组理论模型和实测数据进行了反演,检验了反演算法的可行性和实用性。反演结果表明基于监督下降法的大地电磁二维反演收敛速度、反演效果优于非线性共轭梯度反演,具有较好的抗噪能力及实用性。
1)训练得到的平均步长对异常体的预测起了指导性作用,避免了雅可比矩阵及海森矩阵的计算,大大降低了陷入局部极小值的风险,减少反演多解性。
2)就预测阶段来说,需要的时间小于常规反演所需的时间,对于批量数据的处理具有一定优势。
3)监督下降法具有较高泛化能力,可以用于更多地球物理勘探方法。
监督下降法总体耗时(训练+预测)较常规反演略长,同时平均步长的储存对硬盘的需求较大,并行降低耗时与减少储存是其未来的研究方向。
参考文献
Occam’s inversion to generate smooth,two-dimensional models from magnetotelluric data
[J].
DOI:10.1190/1.1442813
URL
[本文引用: 1]
Magnetotelluric (MT) data are inverted for smooth 2-D models using an extension of the existing 1-D algorithm, Occam’s inversion. Since an MT data set consists of a finite number of imprecise data, an infinity of solutions to the inverse problem exists. Fitting field or synthetic electromagnetic data as closely as possible results in theoretical models with a maximum amount of roughness, or structure. However, by relaxing the misfit criterion only a small amount, models which are maximally smooth may be generated. Smooth models are less likely to result in overinterpretation of the data and reflect the true resolving power of the MT method. The models are composed of a large number of rectangular prisms, each having a constant conductivity. [Formula: see text] information, in the form of boundary locations only or both boundary locations and conductivity, may be included, providing a powerful tool for improving the resolving power of the data. Joint inversion of TE and TM synthetic data generated from known models allows comparison of smooth models with the true structure. In most cases, smoothed versions of the true structure may be recovered in 12–16 iterations. However, resistive features with a size comparable to depth of burial are poorly resolved. Real MT data present problems of non‐Gaussian data errors, the breakdown of the two‐dimensionality assumption and the large number of data in broadband soundings; nevertheless, real data can be inverted using the algorithm.
Computational recipes for electromagnetic inverse problems
[J].DOI:10.1111/gji.2012.189.issue-1 URL [本文引用: 1]
Deep learning technology for construction machinery and robotics
[J].DOI:10.1016/j.autcon.2023.104852 URL [本文引用: 1]
Machine learning for molecular thermodynamics
[J].DOI:10.1016/j.cjche.2020.10.044 URL [本文引用: 1]
Supervised descent method and its applications to face alignment
[C]//
Supervised descent method(Conference Paper)
[J].
Image human thorax using ultrasound traveltime tomography with supervised descent method
[J].
DOI:10.3390/app12136763
URL
[本文引用: 1]
The change of acoustic velocity in the human thorax reflects the functional status of the respiratory system. Imaging the thorax’s acoustic velocity distribution can be used to monitor the respiratory system. In this paper, the feasibility of imaging the human thorax using ultrasound traveltime tomography with a supervised descent method (SDM) is studied. The forward modeling is computed using the shortest path ray tracing (SPR) method. The training model is composed of homogeneous acoustic velocity background and a high-velocity rectangular block moving in the domain of interest (DoI). The average descent direction is learned from the training set. Numerical experiments are conducted to verify the method’s feasibility. Normal thorax model experiment proves that SDM traveltime tomography can efficiently reconstruct thorax acoustic velocity distribution. Numerical experiments based on synthetic thorax model of pleural effusion and pneumothorax show that SDM traveltime tomography has good generalization ability and can detect the change of acoustic velocity in human thorax. This method might be helpful for the diagnosis and evaluation of respiratory diseases.
Guided wave tomography based on supervised descent method for quantitative corrosion imaging
[J].DOI:10.1109/TUFFC.2021.3097080 URL [本文引用: 1]
Pixel- and model-based microwave inversion with supervised descent method for dielectric targets
[J].DOI:10.1109/TAP.8 URL [本文引用: 1]
Application of supervised descent method for 2D magnetotelluric data inversion
[J].
DOI:10.1190/geo2019-0409.1
URL
[本文引用: 1]
The supervised descent method (SDM) is applied to 2D magnetotellurics (MT) data inversion. SDM contains offline training and online prediction. The training set is composed of the models generated according to prior knowledge and the data simulated by MT forward modeling. In the training process, a set of descent directions from an initial model to the training models is learned. In the prediction, model reconstruction is achieved by optimizing an online regularized objective function with a restart scheme, where the learned descent directions and the computed data residual are involved. SDM inversion has the advantages of (1) being more efficient than traditional gradient-descent methods because the computation of local derivatives of the objective function is avoided, (2) incorporating prior uncertain knowledge easier than deterministic inversion approach by generating training models flexibly, and (3) having high generalization ability because the physical modeling can guide the online model reconstruction. Furthermore, a way of designing general training set is introduced, which can be used for training when the prior knowledge is weak. The efficiency and accuracy of this method are validated by two numerical examples. The results indicate that the reconstructed models are consistent with prior information, and the simulated responses agree well with the data. This method also shows good potential to improve the accuracy and efficiency in field MT data inversion.
Application of supervised descent method to transient electromagnetic data inversion
[J].
DOI:10.1190/geo2018-0129.1
URL
[本文引用: 1]
Inversion plays an important role in transient electromagnetic (TEM) data interpretation. This problem is highly nonlinear and severely ill posed. Gradient-descent methods have been widely used to invert TEM data, and regularization schemes containing prior information are applied to reduce the nonuniqueness and stabilize the inversion. During the inversion, the partial derivatives are repeatedly computed, which is time and memory consuming. Furthermore, regularization schemes can only provide limited prior information. Much prior information from knowledge and experience cannot be directly used in inversion. In this work, we applied the supervised descent method (SDM) to TEM data inversion. This method contains an offline training stage and an online prediction stage. In the training stage, a training data set is generated according to prior information. Then, the average descent direction between a fixed initial model and the training models can be learned by iterative schemes. In the online stage of prediction, the learned descent directions are applied directly into the inversion to update the models. In this manner, one can select models satisfying the data and model misfit. In this study, SDM is applied to model- and pixel-based inversion schemes. Synthetic examples indicate that SDM inversion can not only enhance the accuracy of inversion due to the incorporation of prior information but also largely accelerate the inversion procedure because it avoids the online computation of derivatives.
Three-dimensional magnetotelluric inversion using non-linear conjugate gradients
[J].DOI:10.1046/j.1365-246x.2000.00007.x URL [本文引用: 1]
地面可控源频率测深三维非线性共轭梯度反演
[J].
Three-dimensional controlled source electromagnetic inversion using non-linear conjugate gradients
[J].
西藏高原南部雅鲁藏布江缝合带地区地壳电性结构研究
[J].
Crustal electrical conductivity structure beneath the Yarlung Zangbo Jiang suture in the southern Xizang Plateau
[J].
/
〈 |
|
〉 |
