Method for Counting Wheat Ears in UAV Images Based on FE-P2Pnet

doi:10.6041/j.issn.1000-1298.2024.04.015

2024, Vol. 55, No. 4, pp. 155-164, 289

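Funding: Anhui Provincial Natural Science Foundation (2208085MC60), University Scientific Research Program of the Anhui Provincial Department of Science and Technology (2023AH050084), and National Natural Science Foundation of China (62273001, 32372632)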

Abstract:

Ear counting is a key step in wheat yield estimation. With the rapid development of unmanned aerial vehicle (UAV) and computer vision technology, wheat ears can be counted automatically, quickly, and efficiently. To address the complex backgrounds, dense planting, small ear targets, and varying ear sizes found in UAV wheat images, an automatic ear counting method based on FE-P2Pnet (feature enhance-point to point) was proposed. Firstly, the brightness and contrast of the UAV images were enhanced to increase the difference between the wheat ears and the background and to reduce the influence of complex background factors such as leaves and stems. Secondly, the point-annotation-based network P2Pnet was introduced as the baseline network to handle densely packed ears. Because small ear targets carry little feature information, a Triplet module was added to the VGG16 backbone of P2Pnet; it lets information interact across the C (channel), H (height), and W (width) dimensions, so that the backbone extracts more target-related feature information. To handle varying ear sizes, a feature enhancement module (FEM) and a squeeze excitation (SE) module were added to the feature pyramid network (FPN), enabling it to process feature information and fuse multi-scale information more effectively. To classify targets better, the cross-entropy loss function was replaced with the Focal Loss function, which weights background and target feature information differently to further highlight the target features. On the self-built UAV wheat image dataset (Wheat-ZWF), the mean absolute error (MAE), mean square error (MSE), and average accuracy (ACC) of wheat ear counting reached 3.77, 5.13, and 90.87%, respectively.
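The abstract does not give the Focal Loss formulation or hyperparameters used in FE-P2Pnet. As a rough illustration only, the sketch below implements the standard binary focal loss, FL(p_t) = -alpha_t (1 - p_t)^gamma log(p_t), next to plain cross entropy. The function names and the values alpha = 0.25 and gamma = 2.0 are assumptions (commonly used defaults), not settings taken from the paper.

```python
import math

def cross_entropy(p, y, eps=1e-7):
    """Binary cross entropy for one prediction.

    p: predicted probability that the point is a wheat ear (target).
    y: 1 for target, 0 for background.
    """
    p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
    return -math.log(p if y == 1 else 1.0 - p)

def binary_focal_loss(p, y, alpha=0.25, gamma=2.0, eps=1e-7):
    """Standard binary focal loss for one prediction.

    alpha and gamma are assumed defaults, not values from the paper.
    alpha balances target vs. background; gamma down-weights examples
    the classifier already gets right with high confidence.
    """
    p = min(max(p, eps), 1.0 - eps)
    p_t = p if y == 1 else 1.0 - p              # probability of the true class
    alpha_t = alpha if y == 1 else 1.0 - alpha  # class-dependent weight
    return -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)

if __name__ == "__main__":
    # Hypothetical predictions: an easy background point and a hard target point.
    for name, p, y in [("easy background", 0.05, 0), ("hard target", 0.20, 1)]:
        print(name, cross_entropy(p, y), binary_focal_loss(p, y))
```

With gamma = 0 and alpha = 0.5 the focal loss reduces to half the cross entropy, so the two functions can be checked against each other. For larger gamma, confidently classified background points contribute far less to the total loss than hard targets do.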
Compared with other target counting regression methods such as MCNN (multi-column convolutional neural network), CSRnet (congested scene recognition network), and WHCNETs (wheat head counting networks), FE-P2Pnet performed best. Compared with the baseline network P2Pnet, MAE and MSE decreased by 23.2% and 16.6%, respectively, and ACC increased by 2.67 percentage points. To further validate the proposed algorithm, experiments were conducted on collected images of four other wheat varieties (AK1009, AK1401, AK1706, and YKM222). The average MAE and MSE of wheat ear counting were 5.10 and 6.17, with an ACC of 89.69%, indicating that the proposed model generalizes well. The research can provide support for related studies on wheat ear counting.
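The abstract reports MAE, MSE, and ACC without defining them. The sketch below shows one common way such counting metrics are computed from per-image predicted and ground-truth counts. Two points are assumptions rather than facts from the paper. First, counting work frequently reports the square root of the mean squared error under the name MSE; since a plain mean of squared errors can never be smaller than the square of the MAE on the same images (3.77 squared is about 14.2, while the reported MSE is 5.13), the root form is used here. Second, ACC is taken as one minus the mean relative counting error. The function name and the sample counts are hypothetical.

```python
import math

def counting_metrics(pred_counts, true_counts):
    """Return (mae, rmse, acc) for per-image ear counts.

    Assumed definitions (not confirmed by the abstract):
      mae  = mean(|pred - true|)
      rmse = sqrt(mean((pred - true)^2))   # often reported as "MSE"
      acc  = 1 - mean(|pred - true| / true)
    """
    if len(pred_counts) != len(true_counts) or not true_counts:
        raise ValueError("need two non-empty sequences of equal length")
    if any(t <= 0 for t in true_counts):
        raise ValueError("ground-truth counts must be positive")

    errors = [p - t for p, t in zip(pred_counts, true_counts)]
    n = len(errors)
    mae = sum(abs(e) for e in errors) / n
    rmse = math.sqrt(sum(e * e for e in errors) / n)
    acc = 1.0 - sum(abs(e) / t for e, t in zip(errors, true_counts)) / n
    return mae, rmse, acc

if __name__ == "__main__":
    # Hypothetical counts for three images, for illustration only.
    predicted = [98, 105, 87]
    annotated = [100, 100, 90]
    mae, rmse, acc = counting_metrics(predicted, annotated)
    print(f"MAE={mae:.2f}  RMSE={rmse:.2f}  ACC={acc:.2%}")
```

Under these definitions the root-mean-square value is always at least as large as the MAE, which matches the ordering of the figures reported in the abstract.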

Cite this article

BAO Wenxia, SU Biaobiao, HU Gensheng, HUANG Chengpei, LIANG Dong. Method for Counting Wheat Ears in UAV Images Based on FE-P2Pnet[J]. Transactions of the Chinese Society for Agricultural Machinery, 2024, 55(4): 155-164, 289.

History

Received: 2023-08-16
Published online: 2024-04-10