Improved AdaBoost Algorithm for Face Detection and Its Application
Xin-chao ZHAO
1, Jia-zheng YUAN
2,*, Xian-kai HUANG
3and Hong-zhe LIU
11Beijing Key Laboratory of Information Service Engineering, Beijing Union University, Beijing
100101, China
2Institute Institute of Computer Technology, Beijing Union University, Beijing 100101, China 3Beijing Open University, Beijing 100101, China
*Corresponding author
Keywords: Face detection, Improved adaBoost algorithm, Real-time, Multi-face.
Abstract. Face detection plays a crucial role in developing human-robot interaction (HRI) for Intelligent Educational Robot (IER) to recognize users or speakers. In this paper, we introduce an intelligent vision algorithm that is able to detect human face from complex scene and filter out all the non-face but face-like images. The human face is detected in real time using the approach called AdaBoost-based Haar-Cascade Classifier[1-2], and the real human face detection is improved to implement from single-face detection to multi-face detection. Furthermore, variable head pose is taken into account, such as pitch, roll, yaw, etc. The proposed robot vision algorithm for human detection is tested to be effective and robust through real-time experiments.
Introduction
It is essential for intelligent robots to have the ability of recognizing people during communication and cooperation between robots and people [3-4]. In this paper, we focus on one of the significant human-robot interaction technologies, the intelligent facial vision algorithm to robustly detect human face from a variety of complex scenes. Facial vision algorithm has attracted considerable attention in numerous practical applications [5-7] in multiple areas, since it is a vital part in the intelligent human-robot interaction and directly leads to whether robots are capable of interactive communication with people in the educational environment. Imprecise human detection will lead to the insufficient interactions between humans and IER. Therefore, it is indispensable for the IER to detect human faces with a high accuracy.
Related Work
In the recently literatures, many experts research in the intelligent interaction robot. Bernhard Froba et al address in developing intelligent system of face tracking of mobile robot using Kalman filter[8]. Kwang Ho An et al. pay attention to use relatively small amounts of critical rectangle features selected and trained by AdaBoost learning algorithm, and they detect the initial position, size and view of a face correctly [9]. Paul Viola et al present the robust real-time face detection with the integral image method and AdaBoost training algorithm [10]. The Xangdong Xie et al present a real-time tracking algorithm on eye feature tracking [11].
Proposed Method
Original Method
Various algorithms are proposed for face detection in the real-time application such as robot system, the existing deficiencies which often limit the application are usually discussed [12].
ο· Scene problem
How does the detector response when a new object enters the field of view scene and therefore the object should be detected.
ο· Head pose problem
There are two conditions which make the tracker lost track. One is that the object left the scene, and the other is losing tracking due to the failure of the tracker.
[image:2.595.203.394.269.518.2]According to the deficiencies mentioned above, we proposed the algorithm of face detection to resolve those drawbacks. The whole system flow chart is illustrated schematically in Figure 1.
Figure 1. The whole system flow chart.
Improved Method
The AdaBoost algorithm [13] is adopted to perform the face detection in image sequences. We denote it as Global AdaBoost Face Detection algorithm (GAFD), since its huge time consuming and running loading to the system, it is executed in low frequency, especially for intelligent robot system running in real-time. The proposed diagram is shown in Figure 2.
[image:2.595.125.474.630.755.2]time by utilizing the state of the tracked face. We call the algorithm as Local AdaBoost Face Detection algorithm (LAFD). The LAFD presumes that there is single face in the image and its previous state of the tracked face is presented. The tracker tracks and predicts the new state of the face between the sequential image frames using the Kalman Filter [14, 15]. Partial image which is called region of interest (ROI) is obtained for LAFD, according to the prediction of the tracker. The LAFD is launched in high frequency up to real-time. The tracker controller controls the running timing both GAFD and LAFD and maintains the states of tracked faces.
Some improvements are proposed in this paper, aiming at the defects of the traditional AdaBoost algorithm. The improved algorithm is mainly to provide face relevance, which is used to the ROI selection, the approach reduces the redundant information to minimum, that is unnecessary for face detection, among selected ROI. A certain number of samples, labeled positive or negative, are selected as training set, then the algorithm is used for feature selection in ROI. The weak classifiers are boosted into a stronger classifier by the following steps:
(1) The given sample images of (π₯π, π¦π)π=1π as input, where N is number of samples, π¦π =
0 , 1 represent negative and positive samples, respectively. (2) Initialize the weights π1,π= 1
2π ,
1
2π for π¦π = 0 , 1, where m and l are the number of the
negative and positive samples, respectively.
(3) Normalize the weights ππ‘ for t = 1, Β· Β· Β· , T , so that it is a probability distribution.
ππ‘,π = ππ‘,π βπ‘ ππ‘,π
π=1
Then select the best weak classifier with respect to the weighted error:
ππ‘ = min
π,π,πβ ππ|β(π₯π, π, π, π) β π¦π|
π
where a weak classifier β(x, π, π, π) consists of a feature f, a threshold ΞΈ and a polarity p indicate the direction of the inequality:
β(x, π, π, π) = {1, ππ(π₯) < ππ 0, ππ‘βππ
where x is a 24 Γ 24 pixel sub-window of an image. After that, choose the classifier:
βπ‘(x) = β(x, ππ‘, ππ‘, ππ‘)
where ππ‘, ππ‘ and ππ‘ are the minimizers of ππ‘.
Defining π½π‘ = ππ‘
1βππ‘, and update the weights:
ππ‘+1,π= ππ‘,ππ½π‘1βππ
where ππ = 0 if sample π₯π is classified correctly and ππ = 1 otherwise.
(4) By defining Ξ±t= log 1
Ξ²t, the final strong classifier is:
πΆ(π₯) = {1, β πΌπ π
π‘=1
βπ‘(π₯) β₯1
2β πΌπ‘
π
π‘=1
0, ππ‘βππ
Experimental Results
faces are contained in each image, with different lighting conditions and complex scenes. The faces are different in size, pose, location and facial expression. Especially, most of the faces are multi-pose face which include rotated frontal and profile face.
Figure 3 shows a face detection result for single face image, we mark the face area detected using a green rectangle bounding box. Figure 4 shows a result for multi-face detection in the images with complex and confusing background. Figure 5 shows a result for multi-face detection, the head pose of the people in the candidate images are various and different.
[image:4.595.92.273.185.324.2]
Figure 3. Face detection result for single face image. Figure 4. Face detection result for multi-face image with confusing scene.
Figure 5. Face detection result for multi-face image with varied pose.
[image:4.595.298.506.185.325.2]Comparison results of face detection are shown in Table 1. Experiments demonstrate that the proposed algorithm is feasible and robust. Furthermore, it performs at 26 frames/s speed, the frame's size is 560 x 420 pixel, and the configuration of the testing computer is Intel Core 2 Dual 2.8 GHz and 4G RAM. Therefore, this approach can provide good inputs for face recognition and facial expression recognition.
Table 1. This is the comparison results of face detection.
Detection
Algorithm Testing Dataset
Missing
Rate(%) Correct Rate(%)
Average Speed(ms)
Viola Jones
MIT+CMU 31.3 72.3 121
Our Dataset 35.6 68.2 135
AdaBoost
MIT+CMU 21.8 87.5 76
Our Dataset 24.1 84.6 84
Our Method
MIT+CMU 6.9 93.8 51
[image:4.595.182.412.366.512.2] [image:4.595.136.462.627.760.2]Conclusion and Future Work
In this paper we have implemented a novel methodology for robust multi-face detection in image sequences, intended for HRI in IER applications. Experimental results have confirmed the effectiveness and the increased computational efficiency of the proposed approach, proving that the characteristic advantages are maintained, leading to merits that combine efficiency, accuracy and robustness simultaneously.
We intend to use the proposed approach to support natural interaction with autonomously navigating robots that guide visitors in exhibition centers and museums. To be more specific, the proposed method will provide input for the recognition and analysis of facial expressions that people utilize when engaged in various conversational states.
Future work includes extension of the face detection algorithm to handle stereo vision by exploiting epipolar constraints. Moreover, the approach presented in the paper will be widely employed in an integrated system for naturalistic human-robot interaction.
Acknowledgments
This paper is supported by the following projects: The National Natural Science Foundation of China (No. 61271369, No. 61372148, No. 61571045); Beijing Natural Science Foundation (No. 4152016, No. 4152018); Beijing Advanced Innovation Center for Imaging Technology (No. BAICIT-2016002); The National Key Technology R&D Program (No. 2014BAK08B02, No. 2015BAH55F03).
The authors would like to thank CASIA for providing the databases used in the experiments related in this paper, and the anonymous reviewers for their constructive comments.
References
[1]J Zhu, Z Chen. Real Time Face detection System Using Adaboost and Haar-like Features. International Conference on Information Science & Control Engineering, 2015:404-407.
[2]S Sun, Z Xu, X Wang, G Huang. Real-time Vehicle Detection using Haar-SURF Mixed Features and Gentle AdaBoost Classifier. Control & Decision Conference, 2015:1888-1894.
[3]S. S. Ge and C. H. Fua. Queues and artificial potential trenches for multi-robot formations. IEEE Transactions on Robotics, vol. 21, no. 4, pp. 646-656, August 2005.
[4]S. S. Ge. Social robotics: Integrating advances in engineering and computer science. Proc. of the 2007 Electrical Engineering/Electronics, Computer, Telecommunications and Information Technology (ECTI) Int. Conf., pp. xviiβxxvi, May 9-12 2007, keynote Speech.
[5]M. H. Yang, D. J. Kriegman, and N. Ahuja. Detecting faces in images: a survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 24, no. 1, pp. 34-58, January 2002.
[6]W. Zhao, R. Chellappa, P. J. Phillips, and A. Rosenfeld. Face recognition: A literature survey. ACM Computing Surveys, vol. 35, no. 4, pp. 399-458, 2003.
[7]M. Osadchy, M. Miller, and Y. LeCun. Synergistic face detection and pose estimation with energy-based model. Advances in Neural Information Processing Systems (NIPS 2004). MIT Press, 2005.
[8]Bernhard Froba and Christian Kiiblbeck. Face Tracking by Means of Continuous Detection. Proceeding of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW 2004).
[10] P. Viola and M. Jones. Robust Real-time Object Detection. Proceeding of IEEE Workshop on Statistical and Computational Theories of Vision (SCTV 2001).
[11] Xangdong Xie, Sudhakar, R., Hanqi Zhuang. Real-time eye feature tracking from a video image sequence using Kalman filter. IEEE Transactions on Systems, Man and Cybernetics, Volume 25, Issue 12, Dec. 1995 pp. 1568-1577.
[12] Bernhard Froba and Christian Kiiblbeck. Face Tracking by Means of Continuous Detection. Proceeding of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW 2004).
[13] F Blais. Review of 20 Years of Ranges Sensor Development. Videometrics VII, Proc. of SPIE-IS&T Electronic Imaging, SPIE, Vol. 5013, pp. 62-76, 2003.
[14] R. E. Kalman. A New Approach to Linear Filtering and Prediction Problems. J. Basic Eng. Trans. ASME, 2015, 82D(1):35-45.
[15] Greg Welch, Gary Bishop. An Introduction to the Kalman Filter. University of North Carolina at Chapel Hill, 1995(7):127-132.
[16] S Yin, P Ouyang, X Dai, L Liu. An AdaBoost-Based Face Detection System Using Parallel Configurable Architecture with Optimized Computation. IEEE Systems Journal, 2015:1-12.
[17] M Abualkibash, A Mahmood, S Moslehpour. A near real-time, parallel and distributed adaptive object detection and retraining framework based on AdaBoost algorithm. High Performance Extreme Computing Conference, 2015:1-8.
[18] W Liu, J Lv, B Yu, W Shang. Multi-type road marking recognition using adaboost detection and extreme learning machine classification. 2015 IEEE Intelligent Vehicles Symposium (IV), 2015:41-46.