河南农业科学 ›› 2023, Vol. 52 ›› Issue (7): 163-171.DOI: 10.15933/j.cnki.1004-3268.2023.07.017

• 农业信息与工程·农产品加工 • 上一篇    下一篇

基于近红外光谱的烤烟香型分类模型研究

付博1,2,杨永锋1,刘向真1,赵森森1,刘茂林1,贾国涛1,牛洋洋1,张坤芳1,于建军2,彭桂新1,姬小明2
  

  1. (1.河南中烟工业有限责任公司技术中心,河南 郑州 450016;2.河南农业大学 烟草学院,河南 郑州 450002)
  • 收稿日期:2022-12-16 出版日期:2023-07-15 发布日期:2023-08-10
  • 通讯作者: 彭桂新(1961-),男,河南西平人,高级工程师,本科,主要从事烟草加工研究。E-mail:penggx2012@126.com 姬小明(1972-),女,河南舞钢人,教授,博士,主要从事烟草化学研究。E-mail:xiaomingji@henau.edu.cn
  • 作者简介:付博(1984-),男,河南夏邑人,副教授,博士,主要从事烟草质量评价与烟草代谢研究。E-mail:fubo@henau.edu.cn
  • 基金资助:
    河南中烟工业有限责任公司科技项目(C202023)

Study on Aroma Type Classification Model Based on Near Infrared Spectra of Flue⁃cured Tobacco

FU Bo1,2,YANG Yongfeng1,LIU Xiangzhen1,ZHAO Sensen1,LIU Maolin1,JIA Guotao1,NIU Yangyang1,ZHANG Kunfang1,YU Jianjun2,PENG Guixin1,JI Xiaoming2   

  1. (1.Technology Center,China Tobacco Henan Industrial Co.,Ltd.,Zhengzhou 450016,China;2.College of Tobacco,Henan Agricultural University,Zhengzhou 450002,China)
  • Received:2022-12-16 Published:2023-07-15 Online:2023-08-10

摘要: 为了快速无损地判别国产烤烟香型,指导卷烟配方原料利用,对1 383份烟叶样品进行近红外光谱采集,基于八大香型区划结果进行香型分类,选用各香型稳定区的样品构建香型分类模型。结果显示,基于近红外原始光谱数据利用随机森林构建的香型分类模型准确率仅为48.64%,光谱数据经过SG滤波一阶或二阶导数和多元散射校正预处理后,模型准确率提高29.54个百分点,然后经因子分析降维处理(45个因子)模型准确率提高到85.91%,最后对模型关键参数进行优化,当评估器数量为500、随机种子为9时,模型准确率最高为90.45%。利用建立的分类模型对预测集进行预测,清甜香型、焦甜焦香型、清甜蜜甜香型和木香蜜甜香型的召回率均达到90.00%以上,召回率最低的为蜜甜香型和焦甜醇甜香型(66.67%)。以上结果表明,利用近红外光谱技术能够有效鉴别烤烟八大香型。

关键词: 烤烟, 近红外光谱, 香型, 分类模型, 随机森林

Abstract: In order to quickly and non⁃destructively discriminate the aroma type of domestic flue⁃cured tobacco and guide the utilization of raw materials for cigarette formulation,near infrared spectra were collected from 1 383 tobacco samples,the aroma was classified based on the results of eight aroma zones,and the samples from each stable aroma zone were selected to build the aroma classification model.The results showed that the accuracy of the fragrance classification model constructed by using random forest based on the NIR raw spectral data was only 48.64%;the model accuracy was improved by 29.54 percentage points after the spectral data were pre⁃processed by SG filtering first or second derivatives and multiple scattering correction;then the model accuracy was improved to 85.91% by factor analysis and dimensionality reduction(45 factors);finally,the model key parameters were optimized,the highest model accuracy of 90.45% was achieved when the number of base evaluators was 500 and the random seed was 9.This classification model was used to predict the prediction set,the recall rates of fresh⁃sweetness type,burnt⁃sweetness⁃burnt type,fresh⁃honey⁃sweetness type and woody⁃honey⁃sweetness type all reached more than 90.00%.The lowest recall rate was for honey⁃sweetness type and burnt⁃honey⁃sweetness type(66.67%).The results show that the use of NIR spectroscopy can effectively identify the eight aroma types of flue⁃cured tobacco,which provides new ideas and technical support for the rapid identification of flue⁃cured tobacco aroma types and digital evaluation of tobacco styles.

Key words: Flue?cured tobacco, Near infrared spectroscopy, Aroma type, Classification model, Random forest

中图分类号: