基于改进YOLOX 的自然环境下核桃识别算法研究

doi:10.15933/j.cnki.1004-3268.2024.01.017

河南农业科学 ›› 2024, Vol. 53 ›› Issue (1): 152-161.DOI: 10.15933/j.cnki.1004-3268.2024.01.017

• 农业信息与工程·农产品加工 • 上一篇下一篇

基于改进YOLOX 的自然环境下核桃识别算法研究

钟正扬^1,2，云利军^1,2，杨璇玺³，陈载清^1,2

（1.云南师范大学信息学院，云南昆明 650500；2. 云南省教育厅计算机视觉与智能控制技术工程研究中心，云南昆明 650500；3. 云南省林业调查规划院生态分院卫星林业应用中心，云南昆明 650500）

收稿日期:2023-08-10 出版日期:2024-01-15 发布日期:2024-02-27
通讯作者: 云利军（1973-），男，内蒙古呼和浩特人，教授，博士，主要从事视频图像处理、物联网技术研究。E-mail：yunlijun@ynnu.edu.cn
作者简介:钟正扬（1999-），男，安徽马鞍山人，在读硕士研究生，研究方向：视频图像处理。E-mail：zhongzheng11111@163.com
基金资助:
云南省教育厅科学研究基金项目（2023Y0533）

Research on Walnut Recognition Algorithm in Natural Environment Based on Improved YOLOX

ZHONG Zhengyang^1,2，YUN Lijun^1,2，YANG Xuanxi³，CHEN Zaiqing^1,2

（1.School of Information，Yunnan Normal University，Kunming 650500，China；2. Yunnan Provincial Department of Education Computer Vision and Intelligent Control Technology Engineering Research Center，Kunming 650500，China；3. Satellite Forestry Application Center，Ecological Branch of Yunnan Forestry Survey and Planning Institute，Kunming 650500，China）

Received:2023-08-10 Published:2024-01-15 Online:2024-02-27

摘要/Abstract

摘要： ：针对现有目标检测算法对自然环境下核桃识别存在漏检、误检等问题，提出了一种基于Swin Transformer 多层特征融合改进的YOLOX-S 核桃识别算法。首先，在主干特征提取网络中引入基于Swin Transformer的多层特征融合模块，借助Swin Transformer的多头注意力机制对小目标的特征信息进行提取并与特征图进行融合，可以有效解决因网络层数加深导致的高层特征图中小目标特征信息丢失问题；其次，为了提高算法的检测精度，引入更高效的Repblock模块对原网络中的CSP模块进行替换；最后，为了提高下采样效果，使用更为优秀的Transition Block模块作为主干特征提取网络的下采样模块。结果表明，改进后的YOLOX-S模型在采集的自然环境下核桃数据集上平均精度AP50达到96.72%，分别比Faster-RCNN、YOLOv5-S、YOLOX-S算法提高7.36、1.38、0.62百分点，检测速度达到46 f/s，模型参数大小为20.55 M。改进后的YOLOX-S算法具有更好的精度，改善了漏检和误检问题，对自然环境下的核桃有更好的识别效果。

关键词: 核桃识别, Swin Transformer, 多层特征融合模块, YOLOX-S, 深度学习

Abstract: Aiming to address the issues of missed detection and false detection of walnut recognition in natural environments using existing target detection algorithms，we proposed an improved YOLOX‑S walnut recognition algorithm based on Swin Transformer multi‑layer feature fusion. First of all，a multi‑layer feature fusion module based on Swin Transformer was introduced into the backbone feature extraction network，and the multi‑head attention mechanism of Swin Transformer was used to extract the feature information of small targets and fuse them with feature maps，which could effectively resolve the issue of losing feature information related to smaller targets within the higher‑level feature map as a result of deepening network layers. Secondly，to enhance the detection accuracy of the algorithm，we introduced a more efficient Repblock module to replace the CSP module in the original network. Finally，to enhance the down‑sampling effect，we employed the Transition Block module as the down‑sampling module of the backbone feature extraction network. The results showed that the improved YOLOX‑S algorithm demonstrated an average accuracy of 96.72% on the walnut datasets，which was higher than the accuracy achieved by the Faster R‑CNN，YOLOv5‑S，and YOLOX‑S algorithms，with improvements of 7.36，1.38，and 0.62 percentage points respectively. The detection speed of the algorithm reached 46 f/s，while the model parameter size was 20.55 M. The improved YOLOX‑S algorithm exhibited superior average precision，thereby addressing the issues of missed detection and false detection effectively. It had a better recognition effect on walnuts in the natural environment.

Key words: Walnut detection, Swin Transformer, Multi?layer feature fusion module, YOLOX?S, Deep learning

中图分类号:

S664.1
S126

钟正扬, 云利军, 杨璇玺, 陈载清. 基于改进YOLOX 的自然环境下核桃识别算法研究[J]. 河南农业科学, 2024, 53(1): 152-161.

ZHONG Zhengyang, YUN Lijun, YANG Xuanxi, CHEN Zaiqing. Research on Walnut Recognition Algorithm in Natural Environment Based on Improved YOLOX[J]. Journal of Henan Agricultural Sciences, 2024, 53(1): 152-161.

参考文献

［1］徐雨欣，李娅，韩长志. 云南核桃产业践行“两山理论”探析［J］. 西南林业大学学报（社会科学），2023，7（1）：34‑38.
XU Y X，LI Y，HAN C Z，et al. Study on the practice of 'Two mountains' in Yunnan walnut industry［J］. Journal
of Southwest Forestry University（Social Sciences），2023，7（1）：34‑38.
［2］袁国勇，张铁中. 温室黄瓜果实的模式识别与分割：利用Bayes分类判别模型［J］. 农机化研究，2006，28（7）：150‑153.
YUAN G Y，ZHANG T Z. Pattern recognition of cucumber image by Bayes sort discrimination model［J］. Journal of

Agricultural Mechanization Research，2006，28（7）：150‑153.

［3］袁挺，许晨光，任永新，等. 基于近红外图像的温室环境下黄瓜果实信息获取［J］. 光谱学与光谱分析，2009，29（8）：2054‑2058.

YUAN T，XU C G，REN Y X，et al. Detecting the information of cucumber in greenhouse for picking based on NIR

image［J］. Spectroscopy and Spectral Analysis，2009，29（8）：2054‑2058.

［4］章云，周素茵，周竹. 自然环境下山核桃果实的图像识别提取方法［J］. 湖北农业科学，2016，55（21）：5643‑5645.

ZHANG Y，ZHOU S Y，ZHOU Z. The method for recognition and extraction of pecan fruit in natural environment［J］.Hubei Agricultural Sciences，2016，55（21）：5643‑5645.

［5］樊湘鹏，许燕，周建平，等. 基于改进Faster R-CNN的核桃识别和定位［J］. 燕山大学学报，2021，45（6）：544‑551.

FAN X P，XU Y，ZHOU J P，et al. Green walnuts recognition and localization based on updated Faster R‑CNN［J］.

Journal of Yanshan University，2021，45（6）：544‑551.

［6］王梁，侯义锋，贺杰. 基于Mask-RCNN的自然场景下油茶果目标识别与检测［J］. 中国农机化学报，2022，43（12）：148‑154.

WANG L，HOU Y F，HE J. Target recognition and detection of Camellia oleifera fruit in natural scene based on Mask‑RCNN ［J］. Journal of Chinese Agricultural Mechanization，2022，43（12）：148‑154.

［7］郝建军，邴振凯，杨淑华，等. 采用改进YOLOv3算法检测青皮核桃［J］. 农业工程学报，2022，38（14）：183‑190.

HAO J J，BING Z K，YANG S H，et al. Detection of green walnut by improved YOLOv3［J］. Transactions of the

Chinese Society of Agricultural Engineering，2022，38（14）：183‑190.

［8］ REN S Q，HE K，GIRSHICK R，et al. Faster R‑CNN：Towards real‑time object detection with region proposal networks［J］. IEEE Transactions on Pattern Analysis and Machine Intelligence，2017，39（6）：1137‑1149.

［9］ LIU W，ANGUELOV D，ERHAN D，et al. SSD：Single shot MultiBox detector［M］//Computer vision‑ECCV 2016. Cham：Springer International Publishing，2016：21‑37.

［10］ GE Z，LIU S T，WANG F，et al. YOLOX：Exceeding YOLO series in 2021 ［EB/OL］. （2021‑06‑18）［2023‑08‑08］. https：//arxiv. org/abs/2107. 08430. pdf.

［11］ BOCHKOVSKIY A，WANG C Y，LIAO H Y M. YOLOv4：Optimal speed and accuracy of object detection［EB/OL］.（2020‑04‑23）［2023‑08‑08］.https：//arxiv.org/abs/2004.10934.pdf.

［12］ ZHANG H Y，CISSE M，DAUPHIN Y N，et al. Mixup：Beyond empirical risk minimization ［EB/OL］.（2017‑10‑25）［2023‑08‑08］. https：//arxiv. org/abs/1710.09412.pdf.

［13］ LIU S，QI L，QIN H F，et al. Path aggregation network for instance segmentation ［C］//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. Salt Lake City，UT，USA：IEEE，2018：8759‑8768.

［14］ LIU Z，LIN Y T，CAO Y，et al. Swin transformer：Hierarchical vision transformer using shifted windows［C］//2021 IEEE/CVF International Conference on Computer Vision（ICCV）. Montreal，QC，Canada：IEEE，2021：9992‑10002.

［15］ LIN T Y，DOLLÁR P，GIRSHICK R，et al. Feature pyramid networks for object detection［C］//2017 IEEE

Conference on Computer Vision and Pattern Recognition（CVPR）. Honolulu，HI，USA：IEEE，2017：936‑944.

［16］ HE K M，ZHANG X Y，REN S Q，et al. Deep residual learning for image recognition ［C］//2016 IEEE Conference on Computer Vision and Pattern Recognition（CVPR）. Las Vegas，NV，USA：IEEE，2016：770‑778.

［17］ LI C Y，LI L L，JIANG H L，et al. YOLOv6：A single‑stage object detection framework for industrial applications ［EB/OL］.（2022‑09‑07）［2023‑08‑08］.https：//arxiv. org/abs/2209.02976.pdf.

［18］ DING X H，ZHANG X Y，MA N N，et al. Repvgg：Making VGG‑style ConvNets great again［C］//2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition（CVPR）. Nashville，TN，USA：IEEE，2021：13728‑13737.

［19］ WANG C Y，BOCHKOVSKIY A，LIAO H Y M. YOLOv7：Trainable bag‑of‑freebies sets new state‑of‑the‑art for

real‑time object detectors ［C］//2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition（CVPR）.Vancouver，BC，Canada：IEEE，2023：7464‑7475.

［20］ LOSHCHILOV I，HUTTER F. SGDR：Stochastic gradient descent with warm restarts［EB/OL］.（2016‑08‑13）［2023‑08‑08］. https：//arxiv. org/abs/1608.03983.pdf.

[1]	于雁南, 莫泳彬, 严继池, 熊春林, 窦世卿, 杨荣峰. 基于改进ShuffleNet V2 的柑橘病害识别研究[J]. 河南农业科学, 2024, 53(1): 142-151.
[2]	陈聪, 于啸, 宫琪. 基于改进残差网络的苹果叶片病害识别研究[J]. 河南农业科学, 2023, 52(4): 152-161.
[3]	臧贺藏, 王从胜, 赵巧丽, 赵晴, 张杰, 李国强, 郑国清. 基于深度学习的小麦倒伏自动分类方法研究[J]. 河南农业科学, 2023, 52(11): 167-173.
[4]	杨文庆, 刘天霞, 唐兴萍, 徐国富, 马喆, 杨贺凯, 吴文斗. 智慧农业背景下的植物表型组学研究进展[J]. 河南农业科学, 2022, 51(7): 1-12.
[5]	刁智华, 闫娇楠, 赵素娜, 贺振东. 基于图像处理的作物行识别算法研究进展[J]. 河南农业科学, 2022, 51(3): 12-19.
[6]	董燕, 李环宇, 李卫杰, 李春雷, 刘洲峰. 基于联合剪枝深度模型压缩的种子分选方法研究[J]. 河南农业科学, 2022, 51(1): 162-170.
[7]	郑二功，田迎芳，陈涛. 基于深度学习的无人机影像玉米倒伏区域提取[J]. 河南农业科学, 2018, 47(8): 155-160.

基于改进YOLOX 的自然环境下核桃识别算法研究

Research on Walnut Recognition Algorithm in Natural Environment Based on Improved YOLOX

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 7

编辑推荐

Metrics