RED-Net: A Recurrent Encoder-Decoder Network for Video-based Face Alignment

Peng, Xi; Feris, Rogerio S.; Wang, Xiaoyu; Metaxas, Dimitris N.

Computer Science > Computer Vision and Pattern Recognition

arXiv:1801.06066 (cs)

[Submitted on 17 Jan 2018]

Title:RED-Net: A Recurrent Encoder-Decoder Network for Video-based Face Alignment

Authors:Xi Peng, Rogerio S. Feris, Xiaoyu Wang, Dimitris N. Metaxas

View PDF

Abstract:We propose a novel method for real-time face alignment in videos based on a recurrent encoder-decoder network model. Our proposed model predicts 2D facial point heat maps regularized by both detection and regression loss, while uniquely exploiting recurrent learning at both spatial and temporal dimensions. At the spatial level, we add a feedback loop connection between the combined output response map and the input, in order to enable iterative coarse-to-fine face alignment using a single network model, instead of relying on traditional cascaded model ensembles. At the temporal level, we first decouple the features in the bottleneck of the network into temporal-variant factors, such as pose and expression, and temporal-invariant factors, such as identity information. Temporal recurrent learning is then applied to the decoupled temporal-variant features. We show that such feature disentangling yields better generalization and significantly more accurate results at test time. We perform a comprehensive experimental analysis, showing the importance of each component of our proposed model, as well as superior results over the state of the art and several variations of our method in standard datasets.

Comments:	International Journal of Computer Vision. arXiv admin note: text overlap with arXiv:1608.05477
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1801.06066 [cs.CV]
	(or arXiv:1801.06066v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1801.06066

Submission history

From: Xi Peng [view email]
[v1] Wed, 17 Jan 2018 04:29:44 UTC (4,498 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2018-01

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Xi Peng
Rogério Schmidt Feris
Xiaoyu Wang
Dimitris N. Metaxas

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:RED-Net: A Recurrent Encoder-Decoder Network for Video-based Face Alignment

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:RED-Net: A Recurrent Encoder-Decoder Network for Video-based Face Alignment

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators