Journal of Henan Agricultural Sciences ›› 2024, Vol. 53 ›› Issue (6): 144-153.DOI: 10.15933/j.cnki.1004-3268.2024.06.016

• Agricultural Information and Engineering and Agricultural Product Processing • Previous Articles     Next Articles

Effects of Different Sample Set Partition Strategies on Crop Remote Sensing Classification Accuracy

LIU Yang1,LI Qiangzi1,2,DU Xin1,2,WANG Hongyan1,2,ZHANG Yuan1,2,ZHANG Xiwang2,3,SHEN Yunqi1,2,ZHANG Sichen1,2,YU Shiqi3   

  1. (1.Aerospace Information Research Institute,Chinese Academy of Sciences,Beijing 100101,China;2.University of Chinese Academy of Sciences,Beijing 100049,China;3.Henan University,Zhengzhou 475001,China)
  • Received:2024-01-08 Published:2024-06-15 Online:2024-07-11

不同样本集划分策略对农作物遥感分类精度的影响

刘洋1,李强子1,2,杜鑫1,2,王红岩1,2,张源1,2,张喜旺2,3,沈云祺1,2,张思宸1,2,余仕奇3
  

  1. (1.中国科学院空天信息创新研究院,北京 100101;2.中国科学院大学,北京 100049;3.河南大学,河南 郑州 475001)
  • 通讯作者: 李强子(1970-),男,河南洛阳人,研究员,主要从事农业遥感研究。E-mail:liqz@aircas.ac.cn 杜鑫(1982-),男,河北邯郸人,副研究员,主要从事农业遥感研究。E-mail:duxin@aircas.ac.cn
  • 作者简介:刘洋(1997-),男,河南郑州人,在读硕士研究生,研究方向:农业遥感。E-mail:1051126985@qq.com
  • 基金资助:
    国家重点研发计划项目(2021YFD1500103);中国科学院战略性先导科技专项(XDA28070504);国家自然基金面上项目(42071403);高分辨率对地观测系统国家科技重大专项(71-Y50G10-9001-22/23)

Abstract: The extraction accuracy of crop distribution has a profound impact on the subsequent inversion of farmland parameters and estimation of crop yield per unit area.In the process of crop classification and recognition,the accuracy and number of training samples are crucial to the final classification results.Aiming at the problem of small number of samples and uneven distribution,the crop classification sample data set was constructed by two ways of field identification and visual interpretation,and five sample data set construction schemes were designed:①all the measured sample points(70% training,30% verification)were used in the scheme;②all visual interpretation sample points were used(70% training,30% verification);③ the same proportion of training samples and verification samples were selected from the measured sample points and visual sample points respectively,and then the training sample set and verification sample set were constructed combinedly(70% training,30% verification);④the visual interpretation sample points were used as training samples,and the measured sample points were used as verification samples;⑤the same number of samples were selected from the visual sample points and the measured sample points to construct a sample set(70% training,30% verification).The accuracy of crop remote sensing classification was studied under different schemes.The results showed that except④,the overall accuracy of ① ② ③ ⑤ four sample data set partition schemes was more than 95%,and the classification results were good. Using visual interpretation to supplement sample points could effectively solve the problem of fewer sample points and uneven distribution.As the best classification scheme for crop recognition and extraction in the study area,scheme③had an overall accuracy of 97.6% and a Kappa coefficient of 0.970,and the accuracy of corn,rice and soybean was all more than 97%,indicating that the combination of training samples and validation samples selected from visual interpretation samples and measured samples to construct training and validation sample sets can not only improve the accuracy of classification results,but also improve the authenticity and accuracy of classification results.

Key words: Remote sensing, Crop classification, Visual interpretation, Sample set constraction

摘要: 农作物分布的提取精度对于后续的农田参数反演和作物单产估算等均具有深刻的影响,而农作物分类识别过程中,训练样本的准确性和数量对其最终分类结果的影响是至关重要的。针对样本较少且分布不均匀的问题,通过实地标识和目视解译2种方式构建农作物分类样本数据集,设计5种样本数据集构建方案:方案①全部采用实测样点(70%训练、30%验证);方案②全部采用目视解译样点(70%训练、30%验证);方案③实测样点与目视样点分别选取相同比例的训练样本与验证样本,再结合构建训练样本集与验证样本集(70%训练、30%验证);方案④目视解译样点作为训练样本,实测样点作为验证样本;方案⑤目视样点与实测样点选取相同数量的样本进行结合构建样本集(70%训练、30%验证)。研究不同方案下的农作物遥感分类精度。结果表明,除方案④外,①②③⑤4种样本数据集划分方案的整体精度均在95%以上,分类结果较好,表明采用目视解译补充样本点,可以有效解决样本点较少和分布不均匀的情况。方案③作为研究区作物识别提取的最佳分类方案,总体精度达到97.6%,Kappa系数达到0.970,且玉米、水稻、大豆单类精度均超过97%,表明目视解译样点与实测样点分别选取训练样本与验证样本再结合构建训练样本集与验证样本集,不仅可以提升分类结果的精度,而且可以提高分类结果的真实性、准确性。

关键词: 遥感, 农作物分类, 目视解译, 样本集划分

CLC Number: