Facial Landmark Detection is a computer vision topic and it deals with the problem of detecting distinctive features in human faces automatically.
- Tip of the nose
- Corners of the eyes
- Corners of the eyebrows
- Corners of the mouth
- Eye pupils
The detected landmarks are used in several different applications.
Introduction
The following quote introduces to facial landmarks detection: "As computer vision engineers and researchers we have been trying to understand the human face since the very early days. The most obvious application of facial analysis is Face Recognition. But to be able to identify a person in an image we first need to find where in the image a face is located. Therefore, face detection — locating a face in an image and returning a bounding rectangle / square that contains the face — was a hot research area. In 2001, Paul Viola and Michael Jones pretty much nailed the problem with their seminal paper titled “Rapid Object Detection using a Boosted Cascade of Simple Features.” In the early days of OpenCV and to some extent even now, the killer application of OpenCV was a good implementation of the Viola and Jones face detector." (1)
Facial landmark detection is also referred to as “facial feature detection”, “facial keypoint detection” and “face alignment” in the literature. Today more and more approaches with neural networks are developed and outperform classical approaches.
Metric
There are three different metric which are commonly used to measure the performance of facial landmark detection.
- Mean Distance: The mean distance of detected landmarks compared to the ground truth landmarks.
- Mean Error Rate: The mean distance of detected landmarks compared to the ground truth landmarks and divided by the ocular distance of the image.
- Mean Failure Rate: A failure rate for detection is also an important metric. This rate is calculated as the percentage of the detected landmarks within a certain threshold of error or mean distance.
Network Structures
One typical State-of-the-Art structure for a facial landmark detection neural network is shown below.
Figure 1: Network architecture of a facial landmark detection neuronal network. First convolution are applied to the input image, then fully connected layers are applied. (2)
The first part contains convolutions and max pooling operations. This can be interpreted as searching key features inside the input image. Subsequently, a fully connected layer is applied. Neurons which represent the location of each landmark are activated respectively. In the present day, these networks can achieve a mean error rate of 8% (2) .
Literature
- http://www.learnopencv.com/facial-landmark-detection/ (Last visited: 23.01.2017)
- http://mmlab.ie.cuhk.edu.hk/projects/TCDCN.html (Facial Landmark Detection by Deep Multi-task Learning; Zhanpeng Zhang, Ping Luo, Chen Change Loy, Xiaoou Tang)
- https://computingforpsychologists.files.wordpress.com/2011/09/bush-arnie-morph.jpg
- http://www.learnopencv.com/wp-content/uploads/2015/10/taaz-before-after.jpg
- http://hight3ch.com/wp-content/uploads/2011/12/video-face-replacement.jpg
Weblinks
- https://arxiv.org/pdf/1511.04031v2.pdf (Facial Landmark Detection with Tweaked Convolutional Neural Networks; Yue Wu, Tal Hassner, KangGeon Kim, Gerard Medioni, Prem Natarajan)
- https://www.ics.uci.edu/~xzhu/paper/face-cvpr12.pdf (Face Detection, Pose Estimation, and Landmark Localization in the Wild; Xiangxin Zhu, Deva Ramanan)
- http://mmlab.ie.cuhk.edu.hk/projects/TCDCN.html (Facial Landmark Detection by Deep Multi-task Learning; Zhanpeng Zhang, Ping Luo, Chen Change Loy, Xiaoou Tang)
- http://www.cv-foundation.org/openaccess/content_iccv_2015/papers/Wu_Robust_Facial_Landmark_ICCV_2015_paper.pdf (Robust Facial Landmark Detection under Significant Head Poses and Occlusion; Yue Wu Qiang Ji)
- http://www.learnopencv.com/face-swap-using-opencv-c-python/ (Face Swapping Example)
- http://www.taaz.com/ (Virtual Makeover - Application Examples)
- http://jivp.eurasipjournals.springeropen.com/articles/10.1186/s13640-016-0103-z (Head Pose Estimation - Application Example)
- https://www.kaggle.com/competitions (One of the most famous facial landmark detection competition; Kaggle)
2 Kommentare
Unbekannter Benutzer (ga69taq) sagt:
30. Januar 2017General problems and suggestions:
Corrections (note that some of the corrections have more than one changes):
Final comments:
Unbekannter Benutzer (ga63muv) sagt:
31. Januar 2017Hi, Martin,
All the following arguments are highly subjective!
Aykin's comments covered almost all the shortcomings of you wiki, so I just added something from my view.
Suggestions:
Corrections:
The detected landmarks can be used to the calculate contours of the mouth, eyes etc. to render makeup virtually.
The detected landmarks can be used to the calculate contours of, for instance, mouth, eyes and so on, to render makeup virtually.
Final Comments: