

Feature pyramid network with multi-scale prediction fusion for real-time semantic segmentation
Year of publication: 2023
Authors: Toan Van Quyen, Min Young Kim
Journal: Neurocomputing
Volume: 519
Issue: 28
Pages: 104-113

A feature pyramid network (FPN) is constructed from a bottom-up pathway and a top-down pathway. The method involves multi-scale features, so it can obtain rich contextual information from the lower scales and high resolution from the largest scale. Additionally, the different receptive fields help capture both thin and large objects in image scenes. All feature maps are concatenated to predict the targets. However, the average pooling method has the drawback of combining the best predictions with poorer ones. In this paper, we propose a dual prediction that leverages the useful characteristics of each FPN feature map. A low-scale prediction attains good precision for large objects, while the other prediction suitably segments narrow objects. Finally, a multi-scale fusion is deployed with an attention module. The attention module finds low-scale pixels that have a high probability of being wrongly labeled and supplements them with the high-scale prediction. The multi-scale fusion allows the network to learn across the different scales of predictions. We achieve 77.9% mIoU at 62 FPS on Cityscapes and 44.1% mIoU on Mapillary Vistas.
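
The fusion idea in the abstract can be illustrated with a small NumPy sketch. It is not the authors' implementation: the paper's attention module is learned, whereas here the attention map is assumed to be simply one minus the low-scale prediction's top class probability, so that uncertain low-scale pixels draw more on the high-scale prediction. The function names (`fuse_predictions`, `upsample_nearest`) and the nearest-neighbour upsampling are hypothetical choices made for illustration only.

```python
import numpy as np


def softmax(logits, axis=0):
    """Numerically stable softmax over the class axis."""
    shifted = logits - logits.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def upsample_nearest(x, factor):
    """Nearest-neighbour upsampling of a (C, H, W) array by an integer factor."""
    return x.repeat(factor, axis=1).repeat(factor, axis=2)


def fuse_predictions(low_logits, high_logits, factor):
    """Blend a low-scale and a high-scale prediction per pixel.

    low_logits:  (C, H, W) class scores from the coarse prediction.
    high_logits: (C, H * factor, W * factor) class scores from the fine one.
    Assumption: attention = 1 - max class probability of the low-scale
    prediction, a hand-written stand-in for the paper's learned attention.
    """
    low_prob = softmax(upsample_nearest(low_logits, factor))
    high_prob = softmax(high_logits)
    # High attention where the low-scale prediction is unsure of its label.
    attention = 1.0 - low_prob.max(axis=0, keepdims=True)
    # Convex combination, so the result is still a probability distribution.
    fused = (1.0 - attention) * low_prob + attention * high_prob
    return fused, attention


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    low = rng.normal(size=(3, 2, 2))    # 3 classes, coarse 2x2 map
    high = rng.normal(size=(3, 4, 4))   # same classes, fine 4x4 map
    fused, attention = fuse_predictions(low, high, factor=2)
    print(fused.shape, attention.shape)
```

With this weighting, a pixel where the low-scale prediction is confident keeps the low-scale label, and a pixel where it is unsure leans on the high-scale prediction, which mirrors the behaviour the abstract describes for large versus narrow objects.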


