An EM Framework for Online Incremental Learning of Semantic Segmentation

Shipeng Yan, Jiale Zhou, Jiangwei Xie, Songyang Zhang, Xuming He

August 2021

PDF

Model Overview

Abstract

Incremental learning of semantic segmentation has emerged as a promising strategy for visual scene interpretation in the openworld setting. However, it remains challenging to acquire novel classes in an online fashion for the segmentation task, mainly due to its continuously-evolving semantic label space, partial pixelwise ground-truth annotations, and constrained data availability. To address this, we propose an incremental learning strategy that can fast adapt deep segmentation models without catastrophic forgetting, using a streaming input data with pixel annotations on the novel classes only. To this end, we develop a unified learning strategy based on the Expectation-Maximization (EM) framework, which integrates an iterative relabeling strategy that fills in the missing labels and a rehearsal-based incremental learning step that balances the stability-plasticity of the model. Moreover, our EM algorithm adopts an adaptive sampling method to select informative training data and a class-balancing training strategy in the incremental model updates, both improving the efficacy of model learning. We validate our approach on the PASCAL VOC 2012 and ADE20K datasets, and the results demonstrate its superior performance over the existing incremental methods.

Type

Conference paper

Publication

In The 29th ACM International Conference on Multimedia