A Pig Cough Sound Recognition Method Based on CNN-BiLSTM

doi:10.15933/j.cnki.1004-3268.2026.02.016

Journal of Henan Agricultural Sciences, 2026, Vol. 55, Issue 2: 144-155.

Section: Agricultural Information and Engineering · Agricultural Product Processing

FU Xiaopeng¹,², ZHOU Xin², WANG Xingbo², XU Xing², WU Yue², XIE Ronghui³, SHAN Ying⁴, YE Chunlin¹, ZHOU Weidong²

(1. School of Biological & Chemical Engineering, Zhejiang University of Science and Technology, Hangzhou 310023, China; 2. Institute of Animal Husbandry and Veterinary, Zhejiang Academy of Agricultural Sciences, Hangzhou 310021, China; 3. Zhejiang Center of Animal Disease Control and Prevention, Hangzhou 311119, China; 4. College of Animal Sciences, Zhejiang University, Hangzhou 310058, China)

Received: 2025-10-09 Accepted: 2025-11-28 Published: 2026-02-15 Online: 2026-03-10
Corresponding authors: YE Chunlin, professor, Ph.D., mainly engaged in research on targeted tumor therapy. E-mail: chlye2005@126.com; ZHOU Weidong, research fellow, Ph.D., mainly engaged in research on animal nutrition and feed. E-mail: zhouwd@zaas.ac.cn
About the first author: FU Xiaopeng, master's student; research interest: livestock farming machinery and equipment. E-mail: 2171794909@qq.com
Funding: Key Research and Development Program of Zhejiang Province (2021C02050); Zhejiang Province Integrated Agricultural Machinery R&D, Manufacturing and Extension Project (10401110124KH5601G)

Abstract

Respiratory diseases are among the most common and frequently occurring diseases on large-scale pig farms. Timely and accurate detection of typical clinical signs such as coughing is crucial for early warning and prevention. This study took the coughing, squealing, and snoring sounds of mid-pregnancy sows as the research object and proposed a pig cough sound recognition model that fuses a convolutional neural network with a bidirectional long short-term memory network (CNN-BiLSTM). The pig sound data were preprocessed by fourth-order Butterworth band-pass filtering for noise reduction, first-order high-pass filtering for pre-emphasis, and short-time energy endpoint detection. Mel-frequency cepstral coefficient (MFCC) feature parameters were then extracted from the preprocessed sound data by framing, windowing, and fast Fourier transform, and the recognition performance of the model was evaluated. The results showed that fourth-order Butterworth band-pass filtering markedly reduced the background noise in the coughing, squealing, and snoring sounds without distorting the waveform, and retained the main energy of the pig sound signal. First-order high-pass pre-emphasis markedly enhanced the energy in the high-frequency region, weakened the energy in the low-frequency region, and narrowed the frequency range. Endpoint detection quickly marked the effective sound segments of the pig sounds, reducing the interference of irrelevant information with the recognition model. The MFCC feature parameters extracted from the preprocessed sound data reflected the acoustic characteristics of the pig sounds well, and the MFCC coefficients were used as the feature input to the model. The CNN-BiLSTM model showed good convergence. The confusion matrix showed that the correct recognition rates for pig coughing, squealing, and snoring sounds were 83.67%, 85.19%, and 81.58%, respectively, indicating good generalization ability. Five-fold cross-validation gave an average accuracy of 84.03% (82.79% to 85.31%). The accuracy of the CNN-BiLSTM model on the test set was 83.93%, higher than that of the Transformer, CNN, LSTM, and BiLSTM models. Therefore, the CNN-BiLSTM model proposed in this study performs well in recognizing pig coughing sounds and can provide a new method for the early detection of respiratory diseases in pigs.
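As a rough illustration of two of the preprocessing steps named in the abstract (first-order high-pass pre-emphasis and short-time energy endpoint detection), the following Python sketch may be helpful. It is not the authors' implementation. The pre-emphasis coefficient `alpha = 0.97`, the frame length and hop, and the relative energy threshold `ratio` are assumptions chosen for illustration; the abstract does not state these values, and the function names are hypothetical.

```python
from typing import List, Tuple


def pre_emphasis(signal: List[float], alpha: float = 0.97) -> List[float]:
    """First-order high-pass filter: y[n] = x[n] - alpha * x[n-1].

    alpha = 0.97 is a common textbook default, assumed here.
    """
    if not signal:
        return []
    out = [signal[0]]  # the first sample has no predecessor
    for n in range(1, len(signal)):
        out.append(signal[n] - alpha * signal[n - 1])
    return out


def short_time_energy(signal: List[float], frame_len: int, hop: int) -> List[float]:
    """Sum of squared samples per frame; a trailing partial frame is dropped."""
    energies = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = signal[start:start + frame_len]
        energies.append(sum(s * s for s in frame))
    return energies


def detect_endpoints(energies: List[float], ratio: float = 0.1) -> List[Tuple[int, int]]:
    """Return (first_frame, last_frame) pairs for runs of frames whose
    energy exceeds ratio * max(energies). The relative threshold is an
    assumed, simplified rule, not the one used in the paper.
    """
    if not energies:
        return []
    threshold = ratio * max(energies)
    segments = []
    start = None
    for i, e in enumerate(energies):
        if e > threshold and start is None:
            start = i  # a sound segment begins
        elif e <= threshold and start is not None:
            segments.append((start, i - 1))  # the segment ended on the previous frame
            start = None
    if start is not None:
        segments.append((start, len(energies) - 1))
    return segments


# Synthetic toy signal (not pig audio): silence, a burst, then silence.
toy = [0.0] * 40 + [1.0, -1.0] * 20 + [0.0] * 40
toy_energies = short_time_energy(pre_emphasis(toy), frame_len=10, hop=10)
toy_segments = detect_endpoints(toy_energies)
```

In this toy case the burst occupies frames 4 to 7, and the small residual that pre-emphasis leaves in frame 8 falls below the threshold. Real recordings would usually need a threshold tuned to the barn's noise floor.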

Key words: Pig cough sounds, CNN-BiLSTM recognition model, Characteristic parameters, Confusion matrix, Five-fold cross-validation
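The two evaluation tools listed in the key words, the confusion matrix and five-fold cross-validation, can be sketched in plain Python as follows. This is only an illustrative sketch. It assumes that the "correct recognition rate" of a class is the diagonal entry of the confusion matrix divided by its row total (rows being true classes), and that folds are contiguous and unshuffled; the abstract confirms neither detail. The labels and predictions below are synthetic and are not the study's data.

```python
from typing import Dict, List, Sequence, Tuple


def confusion_matrix(y_true: Sequence[str], y_pred: Sequence[str],
                     labels: Sequence[str]) -> List[List[int]]:
    """matrix[i][j] counts samples of true class i predicted as class j."""
    index = {label: i for i, label in enumerate(labels)}
    matrix = [[0] * len(labels) for _ in labels]
    for t, p in zip(y_true, y_pred):
        matrix[index[t]][index[p]] += 1
    return matrix


def per_class_recall(matrix: List[List[int]], labels: Sequence[str]) -> Dict[str, float]:
    """Diagonal entry divided by the row total, for each class."""
    recalls = {}
    for i, label in enumerate(labels):
        total = sum(matrix[i])
        recalls[label] = matrix[i][i] / total if total else 0.0
    return recalls


def k_fold_indices(n_samples: int, k: int = 5) -> List[Tuple[List[int], List[int]]]:
    """Split range(n_samples) into k contiguous folds (no shuffling).

    Returns a list of (train_indices, test_indices) pairs; the first
    n_samples % k folds receive one extra sample.
    """
    sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    folds = []
    start = 0
    for size in sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        folds.append((train, test))
        start += size
    return folds


# Synthetic example labels, for illustration only.
labels = ["cough", "squeal", "snore"]
y_true = ["cough", "cough", "squeal", "snore", "snore", "snore"]
y_pred = ["cough", "squeal", "squeal", "snore", "snore", "cough"]
cm = confusion_matrix(y_true, y_pred, labels)
recalls = per_class_recall(cm, labels)
folds = k_fold_indices(12, k=5)
```

In practice the samples would normally be shuffled, and often stratified by class, before splitting, so that each fold contains all three sound types.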

CLC number: S126

FU Xiaopeng, ZHOU Xin, WANG Xingbo, XU Xing, WU Yue, XIE Ronghui, SHAN Ying, YE Chunlin, ZHOU Weidong. A Pig Cough Sound Recognition Method Based on CNN-BiLSTM[J]. Journal of Henan Agricultural Sciences, 2026, 55(2): 144-155.

Figures/Tables: 12

