融合注意力機(jī)制的枸杞蟲(chóng)害圖文跨模態(tài)檢索方法

doi:10.6041/j.issn.1000-1298.2022.02.032

首頁(yè) > 過(guò)刊瀏覽>2022年第53卷第2期 >299-308. DOI:10.6041/j.issn.1000-1298.2022.02.032

融合注意力機(jī)制的枸杞蟲(chóng)害圖文跨模態(tài)檢索方法
DOI:
                        10.6041/j.issn.1000-1298.2022.02.032
                    
CSTR:
                        
                    
作者:
                        
                        
                    
作者單位:
作者簡(jiǎn)介:
通訊作者:
中圖分類號(hào):
基金項(xiàng)目:國(guó)家自然科學(xué)基金項(xiàng)目（61862050）和寧夏自然科學(xué)基金項(xiàng)目（2020AAC03031）

Cross-modal Image and Text Retrieval Method for Lycium Barbarum- Pests by Integrating Attention Mechanism

Author:

Affiliation:

Fund Project:

摘要

圖/表

訪問(wèn)統(tǒng)計(jì)

參考文獻(xiàn)

相似文獻(xiàn)

引證文獻(xiàn)

資源附件

文章評(píng)論

摘要:

針對(duì)現(xiàn)有農(nóng)作物病蟲(chóng)害檢索模態(tài)較為單一問(wèn)題，以17種常見(jiàn)的枸杞蟲(chóng)害圖像和文本描述為研究對(duì)象，將跨模態(tài)檢索引入枸杞蟲(chóng)害檢索領(lǐng)域，提出一種融合注意力機(jī)制的枸杞蟲(chóng)害圖文跨模態(tài)檢索方法。首先，借助Transformer模型和循環(huán)神經(jīng)網(wǎng)絡(luò)分別獲取帶有上下文信息的細(xì)粒度圖像和文本特征序列；然后，利用注意力機(jī)制對(duì)特征序列進(jìn)行聚合以挖掘圖像和文本的顯著性語(yǔ)義信息；最后，為了深入挖掘不同模態(tài)間語(yǔ)義關(guān)聯(lián)，采用跨媒體聯(lián)合損失函數(shù)對(duì)模型進(jìn)行約束。試驗(yàn)結(jié)果表明，本文方法在自建的枸杞蟲(chóng)害圖文跨模態(tài)數(shù)據(jù)集上平均精度均值平均值達(dá)到了0.458。與現(xiàn)有的8種方法相比，平均精度均值平均值提高了0.011~0.195，優(yōu)于所有對(duì)比方法，可為農(nóng)作物病蟲(chóng)害多樣化檢索提供技術(shù)支撐和算法參考。

Abstract:

In recent years, with the change of climatic conditions and the introduction of cultivation techniques, the planting area of Lycium has gradually expanded. It has become one of the important economic crops in Ningxia and even the entire northwestern region. Lycium is a multi-insect host and has poor resistance to insect pests. It is very susceptible to insect infestation, which has a huge impact on yield and quality, causing serious economic losses. Therefore, it is very important to quickly and accurately retrieve and obtain various information about Lycium pests and provide timely and accurate control for the development of the industry. To address the problem that the present retrieval system on crop pests owns only the single mode, the crossmodal retrieval for images and texts in Lycium pest dataset was introduced, which had 17 kinds of common pests, and a cross-modal image and text retrieval method with the attention mechanism was proposed. Firstly, the transformer and the LSTM were used to obtain text and image fine-grained feature sequences with the context information, respectively. Then, the attention mechanism was leveraged to aggregate feature sequences to capture the salient semantic information in texts and images. Finally, in order to explore the semantic correlation between different modalities, the cross-media joint loss was used to constrain the proposed model. The experiment showed that the averaged MAP of the proposed method in the self-built Lycium pest dataset achieved 0.458. Compared with the existing eight methods, the averaged MAP of the method was improved by 0.011~0.195, outperforming all these methods. The proposed method can provide technical support and algorithm reference for diversified retrieval requirements of crop pests.

參考文獻(xiàn)

相似文獻(xiàn)

引證文獻(xiàn)

引用本文

劉立波,趙斐斐.融合注意力機(jī)制的枸杞蟲(chóng)害圖文跨模態(tài)檢索方法[J].農(nóng)業(yè)機(jī)械學(xué)報(bào),2022,53(2):299-308. LIU Libo, ZHAO Feifei. Cross-modal Image and Text Retrieval Method for Lycium Barbarum- Pests by Integrating Attention Mechanism[J]. Transactions of the Chinese Society for Agricultural Machinery,2022,53(2):299-308.

復(fù)制

文章指標(biāo)

點(diǎn)擊次數(shù):
下載次數(shù):
HTML閱讀次數(shù):
引用次數(shù):

歷史

收稿日期:2020-12-28
最后修改日期:
錄用日期:
在線發(fā)布日期: 2021-07-09
出版日期:

期刊瀏覽

EI收錄結(jié)果

引用本文

相關(guān)視頻

分享

文章指標(biāo)

歷史

文章二維碼