Lie Guo, Tuanshan Zhang, Weizhen Sun, Jielong Guo. Image Semantic Description Algorithm with Integrated Spatial Attention Mechanism[J]. Laser & Optoelectronics Progress, 2021, 58(12): 1210030

Search by keywords or author
- Laser & Optoelectronics Progress
- Vol. 58, Issue 12, 1210030 (2021)

Fig. 1. Encoder-decoder model with integrated spatial attention mechanism

Fig. 2. Diagram of spatial attention module

Fig. 3. Diagram of encoder-decoder network with integrated spatial attention mechanism

Fig. 4. Experimental data loss curves of spatial attention mechanism in VGG network. (a) VGG(MSCOCO); (b) VGG(Flickr30k)

Fig. 5. Experimental data loss curves of spatial attention mechanism in ResNet network. (a) ResNet-50(MSCOCO); (b) ResNet-50(Flickr30k)

Fig. 6. Comparison of visualization results. (a) Test set; (b) SAT model visualization results; (c) proposed model visualization results

Fig. 7. Comparison of visualization results. (a) Test set; (b) SAT model visualization results; (c) proposed model visualization results

Fig. 8. Comparison of visualization results. (a) Test set; (b) SAT model visualization results; (c) proposed model visualization results

Fig. 9. Comparison of visualization results. (a) Test set; (b) SAT model visualization results; (c) proposed model visualization results
|
Table 1. Server configuration used for the experiment
|
Table 2. Experimental server configuration
|
Table 3. Experimental comparison of spatial attention mechanism in VGG network
|
Table 4. Experimental comparison of spatial attention mechanism in ResNet network

Set citation alerts for the article
Please enter your email address