Indexed on: 08 Jul '19Published on: 11 Apr '19Published in: IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
In this paper, we propose a design scheme for deep learning networks in face parsing task with promising accuracy and real-time inference speed. By analyzing the differences between general image parsing task and face parsing task, we first revisit the structure of traditional FCN and make improvements to adapt to the unique properties of face parsing task. Especially, the concept of Normalized Receptive Field is proposed to give more insights on designing the network. Then a novel loss function called Statistical Contextual Loss is introduced, which integrates richer contextual information and regularizes features during training. For further model acceleration, we propose a semi-supervised distillation scheme that effectively transfers the learned knowledge to a lighter network. Extensive experiments on LFW and Helen dataset demonstrate the significant superiority of the new design scheme on both efficacy and efficiency.