6533b823fe1ef96bd127eab4

RESEARCH PRODUCT

Multiscale Attention-Based Prototypical Network For Few-Shot Semantic Segmentation

Yifei ZhangDésiré SidibéFabrice MeriaudeauOlivier Morel

subject

business.industryComputer scienceDeep learningFeature extractionComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION[INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]Pattern recognition02 engineering and technologyImage segmentation010501 environmental sciencesSemantics01 natural sciencesImage (mathematics)[INFO.INFO-CV] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]Minimum bounding boxFeature (computer vision)0202 electrical engineering electronic engineering information engineering020201 artificial intelligence & image processingSegmentationArtificial intelligencebusiness0105 earth and related environmental sciences

description

International audience; Deep learning-based image understanding techniques require a large number of labeled images for training. Few-shot semantic segmentation, on the contrary, aims at generalizing the segmentation ability of the model to new categories given only a few labeled samples. To tackle this problem, we propose a novel prototypical network (MAPnet) with multiscale feature attention. To fully exploit the representative features of target classes, we firstly extract rich contextual information of labeled support images via a multiscale feature enhancement module. The learned prototypes from support features provide further semantic guidance on the query image. Then we adaptively integrate multiple similarity-guided probability maps by attention mechanism, yielding an optimal pixel-wise prediction. Furthermore, the proposed method was validated on the PASCAL-5 i dataset in terms of 1-way N-shot evaluation. We also test the model with weak annotations, including scribble and bounding box annotations. Both the qualitative and quantitative results demonstrate the advantages of our approach over other state-of-the-art methods.

https://hal-univ-evry.archives-ouvertes.fr/hal-02977830