融合光学和声学特征的岛礁周边海底底质GA-XGBoost分类方法

GA-XGBoost classification method for seabed sediment around islands and reefs based on optical and acoustic features

  • 摘要: 海底底质类型的精确识别对了解底栖海洋群落的分布和规划海洋资源可持续开发至关重要,机器学习算法是识别底质类型的有效手段。针对岛礁单一声学数据底质分类局限性,融合多光谱遥感数据为解决该局限性提供了新思路。本研究提出了一种融合多光谱遥感数据和多波束数据、基于特征选择和遗传算法——极限梯度提升算法(Genetic Algorithm-Extreme Gradient Boosting, GA-XGBoost)的多源数据海底底质分类方法。首先对WorldView-2多光谱数据和多波束数据进行预处理,统一地理坐标系统并进行空间分辨率配准;然后提取多光谱影像的光谱特征、测深数据的地形特征及反向散射强度纹理特征,组成18维特征参数,基于XGBoost(Extreme Gradient Boosting)算法结合向前逐步特征选择从18维特征中选出12维最优特征子集;之后构建GA-XGBoost分类模型,分别使用为解决该局限性提供了新思路及多源数据训练和测试模型,与BP(Back Propagation neural network)、GA-BP(Genetic Algorithm- Back Propagation neural network)和XGBoost分类算法的精度对比分析;最后,应用最优的GA-XGBoost模型对整个研究区底质进行分类并可视化。实验结果表明,该方法在海底底质分类中的总体精度达91.23%,Kappa系数为0.87,F1分数为91.18%,显著优于单一数据源输入及对比算法。表明GA-XGBoost为海底底质快速、准确分类的一种新的有效解决方案。

     

    Abstract: The accurate identification of seabed substrate types is essential for understanding the distribution of benthic marine communities and planning the sustainable development of marine resources, and machine learning algorithms are an effective means to obtain substrate types. In view of the limitations of substrate classification of single acoustic data of islands and reefs, fusion of multispectral remote sensing data provides a new idea to solve the limitation. In this study, a multi-source seafloor fusion of multispectral remote sensing data and multibeam data based on feature selection and Genetic Algorithm Extreme Gradient Boosting (GA-XGBoost) is proposed as a seafloor classification method. Firstly, the WorldView-2 multispectral data and multibeam data are preprocessed to unify the geographic coordinate system and align the spatial resolution; then the spectral features of the multispectral image, the topographic features of the bathymetry data, and the texture features of the backscatter intensity are extracted to form an 18-dimensional feature parameters, and the 12-dimensional optimal feature subsets are selected from the 18-dimensional features based on the XGBoost (Extreme Gradient Boosting) algorithm combined with forward stepwise feature selection. Then, the GA-XGBoost classification model was established, and the model was trained and tested using the data from multiple sources that provide a new idea to solve the limitation, respectively, and is compared with the BP (Back Propagation neural network (BP), GA-BP (Genetic Algorithm- Back Propagation neural network, GA-BP) and XGBoost classification algorithms. Finally, the optimal GA-XGBoost classification model was applied to classify and visualize the substrate in the whole study area. The experimental results show that the overall accuracy of the method in seabed substrate classification reaches 91.23%, with a Kappa coefficient of 0.87 and an F1 score of 91.18%, which is significantly better than the single data source input and the comparison algorithm. Indicating that GA-XGBoost can provide a new way for the rapid and accurate classification of seafloor sediment.

     

/

返回文章
返回