Applied Artificial Intelligence

An International Journal

Volume 34, 2020 - Issue 6

Driver Fatigue Detection Using Viola Jones and Principal Component Analysis

Bahjat Fatima, COMSATS Institute of Information Technology, Islamabad, Pakistan (corresponding author)

Ahmad R. Shahid, COMSATS Institute of Information Technology, Islamabad, Pakistan

Sheikh Ziauddin, COMSATS Institute of Information Technology, Islamabad, Pakistan

Asad Ali Safi, COMSATS Institute of Information Technology, Islamabad, Pakistan

Huma Ramzan, COMSATS Institute of Information Technology, Islamabad, Pakistan

Pages 456-483 | Published online: 10 Feb 2020

https://doi.org/10.1080/08839514.2020.1723875

ABSTRACT

In this paper, we propose a low-cost solution for driver fatigue detection based on micro-sleep patterns. Contrary to conventional methods, we acquired images by placing a camera on the extreme left side of the driver and proposed two algorithms that facilitate accurate face and eye detection, even when the driver is not facing the camera or the driver’s eyes are closed. The eye state (open or closed) is classified on the right eye only, using SVM and Adaboost. Based on the eye states, micro-sleep patterns are determined and an alarm is triggered to warn the driver when needed. In our dataset, we considered multiple subjects from both genders, having different appearances and under different lighting conditions. The proposed scheme achieves 99.9% and 98.7% accuracy for face and eye detection, respectively. Across all subjects, the average accuracy of SVM and Adaboost is 96.5% and 95.4%, respectively.

Introduction

In the past few years, there has been a tremendous increase in the number of road accidents due to driver fatigue (Dhivya and Suresh Babu 2015). According to a report published by the European Union, driver fatigue is a notable factor in about 20% of road crashes. Drivers with a diminished vigilance level experience a notable drop in their capacity for recognition, response, and control, which increases the risk of an accident. This increases the need for systems that can detect symptoms of fatigue and warn the driver in advance (Commission 2016). More recently, a study by the National Sleep Foundation in the US showed that more than 51% of the adult drivers with fatigue symptoms had driven a vehicle and 17% had momentarily fallen asleep while driving. It is estimated that 76,000 injuries and 1,200 deaths are due to fatigue-induced accidents annually (National Sleep Foundation 2016). Due to the recent attention to fatigue-related crashes, fatigue detection systems have become an active area of research.

Driver Fatigue Detection Systems

Monitoring driver fatigue is important to prevent road accidents and to improve road safety. A fatigued driver shows particular signs (e.g. yawning, eyes closed for relatively long periods, and head bouncing) that can be observed easily. Moreover, fatigue can also be determined by monitoring the driving behavior, e.g. steering movements, lane keeping, car speed, gear changing, acceleration, and braking (Wierwille 1995). A driver fatigue detection system automatically analyzes the driving or the driver’s behavior and, if the driver is fatigued, generates an alarm that can wake up the driver.

Symptoms of Fatigue

When a driver is fatigued, he/she shows multiple nonvisual and visual signs that can be measured to detect fatigue, as shown in Figure 1.

Figure 1. Symptoms of fatigue.

Besides the physical state, the mental state of the driver also affects driving. Hence, physiological variables that give information about stress, heart rate, brain activity, etc., can also be used to analyze the driver’s attentiveness. Eye closure is a widely used fatigue measure, and micro-sleep patterns can be determined from eye behavior. A micro-sleep (MS) is a brief, unintended, and temporary episode of loss of attention or sleep which may occur when a driver is fatigued. It may be associated with prolonged eye closure, which may last from a fraction of a second up to 30 s.

Fatigue Detection Techniques

Driver fatigue detection techniques can be broadly divided into two categories (Figure 2). The first detects driving behavior and the second detects the driver’s behavior, which further includes two approaches, i.e. a physiological feature-based approach and a visual feature-based approach. The former is based on heart rate, pulse rate, brain activity, etc. The latter tracks the driver’s eye and head movements, yawning, and facial expressions (Kong et al. 2015).

Figure 2. Classification of fatigue detection techniques.

Methods based on driving behavior information are affected by individual variation in driving behavior and by the vehicle type. Physiological feature-based approaches usually result in good fatigue detection performance. Electroencephalography (EEG) for brain waves, electrocardiography (ECG) for heart rate, heart rate variability (HRV), and electrooculography (EOG) for eye movements can give good results; however, these methods are intrusive because the sensors must be in contact with the driver’s body (Kong et al. 2015). Moreover, various specialized hardware devices are also used for monitoring driver fatigue; however, they are costly in terms of both money and energy usage.

Visual feature-based approaches are mostly preferred because they are nonintrusive to the driver. Among the visual features, head bouncing can be misleading, as it can have more than one cause, e.g. a bumpy road or music. Similarly, yawning is not a reliable measure for driver fatigue detection, as one’s mouth may be open for multiple reasons, e.g. talking to someone or a habit of keeping the mouth open ().

Eye movements correlate best with the most obvious symptom of fatigue, i.e. micro-sleep. In this research work, we focus on the visual feature-based approach to monitor the driver’s attentiveness, and micro-sleep patterns are determined from the driver’s eye behavior.

Contributions

Though a lot of research work has been done in this area, there is still scope for improvement in face detection, eye detection, and eye-state classification results. In addition, for fatigue detection, images are mostly acquired by placing the camera in front of the driver. This makes eye closure detection relatively easy but is not practical, as the camera can block the driver’s view. The contributions of this research work are as follows:

First, we introduced a new idea for video acquisition in which the camera does not distract the driver or block his/her view. Three camera locations with/without zoom were tested with two subjects and, after detailed analysis, we concluded that the most suitable location for the camera is on the extreme left of the dashboard without zoom.
Second, we generated a dataset in which we considered multiple subjects from both genders, having different appearances, in different cars and under different lighting conditions.
Third, we proposed face and eye-detection algorithms that facilitate accurate face and eye detection, even when the driver is not facing the camera or the driver’s eyes are closed.
Fourth, the achieved results are comparable with the existing literature.
Fifth, the overall cost of the proposed system is very low as there is no specialized experimental setup used.

The rest of the paper is organized as follows: The “Review of Related Literature” section reviews the related literature. In the “Proposed Scheme” section, an overall framework of the proposed scheme is explained in detail. In the “Results” section, experimental results of the proposed scheme and its performance evaluation are presented and discussed. The “Conclusion” section concludes the paper and future research suggestions are briefly presented in the “Future Work” section.

Review of Related Literature

Extensive research has been carried out in the domain of driver fatigue detection, and visual features (e.g. eye movements, yawning, and head nodding) are considered the key symptoms of fatigue. We have reviewed the existing literature and categorized the schemes by the approach used, i.e. whether a scheme is based on eye behaviors, uses yawn analysis, or is a hybrid.

Eye Behaviors

In the literature, eye behaviors, particularly eye closure, blink rate, and blink frequency, have mostly been used to determine fatigue (AL-Anizy 2015; Bergasa et al. 2006; Budiharto, Putra, and Putra 2013; Bharambe and Mahajan 2015; Batchu and Kumar 2015; Diddi and Jamge 2015; Devi, Choudhari, and Bajaj 2011; Dasgupta et al. 2013; Fazli and Esfehani 2012; Flores et al. 2010a; Jo, Lee, and Jung 2011; Jayanthi and Bommy 2012; Jiménez-Pinto and Torres-Torriti 2013; Karchani, Mazloumi, and Saraji 2015; Kong et al. 2015; Kumari 2014; Lenskiy and Lee 2012; Liu et al. 2010; Liu et al. 2010; Patel et al. 2010; Patel, Patel, and Sharma 2015; Punitha and Geetha 2015; Sugur, Hemadri, and Kulkarni 2013; Suryaprasad et al. 2013; Sacco and Farrugia 2012a; Shewale and Pranita 2014; Shen and Xu 2015; Tang, Fang, and Hu 2010; Vikash and Barwar 2014; Zhang, Cheng, and Lin 2012).

Patel et al. (2010) presented a driver fatigue detection system that uses a JVC color camcorder to capture the video. A skin segmentation model based on a Gaussian distribution was used for face detection, followed by iris-based eye detection on binary images. Finally, fatigue was detected by assessing the eye blink rate. The authors claim good performance; however, there are some limitations, such as false detection of eyes and iris detection in partially opened eyes. Moreover, the proposed system tracks the eyes in every frame, which increases the processing load.

Lenskiy and Lee (2012) proposed a novel skin-color segmentation algorithm based on an NN and a texture segmentation algorithm based on SURF features for eye detection. Blink frequency and eye closure duration were then estimated by tracking the localized eyes with an extended Kalman filter.

Jo, Lee, and Jung (2011) used NIR LEDs with a narrow band-pass filter to illuminate the driver’s face and then acquired filtered images. The proposed eye-detection algorithm combines Adaboost, template matching, and blob detection, along with eye validation using SVM, PCA, and LDA. The eye-state detection algorithm combines statistical features with appearance features and, finally, an SVM with RBF kernels was used for classification.

Zhang, Cheng, and Lin (2012) presented an eye-detection algorithm to overcome the limitations caused by changes in driver pose and lighting conditions. Six measures were used to evaluate drowsiness, and Fisher’s linear discriminant functions were then used for classification. Experimental evaluation was performed with six participants and a high-fidelity simulator.

Fazli and Esfehani (2012) proposed a novel eye-state tracking technique for drowsiness detection. A CCD camera was used to capture the video, followed by conversion of the images to the YCbCr color space. First, the face is detected and cropped from the image. Then, the eyes are localized using a Canny edge detector. Finally, drowsiness is detected based on the white pixel count of the remaining region.

In Jayanthi and Bommy (2012), face detection is done using a skin color model, followed by eye tracking using dynamic template matching. Templates were trained using an artificial neural network (ANN). The authors claim good overall performance; however, there are some limitations related to quick head rotation, large head movement, and distance from the camera. Jiménez-Pinto and Torres-Torriti (2013) presented a novel technique for drowsiness detection which uses the driver’s kinematic model and optical flow analysis (Lucas and Kanade 1981).

Batchu and Kumar (2015) proposed another drowsiness monitoring system, where the face and eyes are detected using the built-in Haar classifier cascades in OpenCV. An eye camera was used to take facial images of the driver and, to speed up processing, only the open eye state is detected. Vikash and Barwar (2014) used a similar approach with the motivation of developing an efficient yet non-obtrusive drowsiness detection system.

Contrary to the previous techniques, Sugur, Hemadri, and Kulkarni (2013) argued that the Haar cascade classifier is inefficient when working with different types of facial views and multiple image frames. As an alternative, local Successive Mean Quantization Transform (SMQT) features and the split up SNoW classifier algorithm are used for face detection. Then, the eye region is extracted and converted to binary, followed by eye blink detection using shape measurement. Finally, drowsiness is detected using three different criteria.

The Viola–Jones algorithm (Viola and Jones 2004) is one of the most efficient tools for object detection and is widely used to detect faces (Lopar and Slobodan 2013). Several studies have used the Viola–Jones algorithm for driver drowsiness detection (AL-Anizy 2015; Bharambe and Mahajan 2015; Diddi and Jamge 2015; Flores et al. 2010a; Karchani, Mazloumi, and Saraji 2015; Kong et al. 2015; Kumari 2014; Liu et al. 2010; Liu et al. 2010; Patel, Patel, and Sharma 2015; Punitha and Geetha 2015; Suryaprasad et al. 2013; Shewale and Pranita 2014; Shen and Xu 2015).

Suryaprasad et al. (2013) presented a drowsiness detection system composed of five modules, from video acquisition to fatigue detection. First, the region of interest is marked, followed by face and eye detection using predefined Haar cascade samples. Likewise, Shewale and Pranita (2014) presented a fatigue monitoring system based on the Viola–Jones algorithm, which uses Haar-like features to detect the face and eyes. However, the paper lacks an experimental evaluation.

Karchani, Mazloumi, and Saraji (2015) proposed a drowsiness detection system design employing a virtual-reality driving simulator. Twenty professional urban male bus drivers with normal eyesight (without glasses) participated in the study, and images of the driver’s face were taken with a CC camera placed in front of the driver. Image processing techniques (i.e. the Viola–Jones algorithm, binary and histogram methods) were employed to detect the eyes. Once the eyes are extracted, frames are transformed into grayscale. Finally, the level of drowsiness was determined from eye closure, blink duration, and blink frequency using an MLP neural network.

Patel, Patel, and Sharma (2015) used a standard webcam to continuously capture images of the driver, followed by face detection and eye tracking using Viola–Jones. An alarm is issued if the eyes are found to be closed or the blinking rate is abnormal for 4–5 consecutive frames. However, the paper lacks an experimental evaluation.

Kong et al. (2015) proposed an improved drowsiness detection strategy where the Viola–Jones Adaboost algorithm is trained for both frontal and deflected faces. First, images were converted to grayscale and enhanced by histogram equalization. Second, the modified algorithm is used for face detection and eye-state recognition. Finally, fatigue is determined on the basis of eye closure.

The idea behind Kumari’s work (Kumari 2014) was to develop an economical yet efficient drowsiness detection system. A low-quality webcam was used to capture the video, and the Viola–Jones algorithm was used for face detection and facial feature extraction. Unlike previous approaches, average intensity variations were used to determine the eye state (open/closed). However, the paper does not include any experimental results.

Bharambe and Mahajan (2015) proposed another real-time vision-based driver drowsiness detection system which adopts the Viola–Jones classifier. A webcam was used to capture videos of eight drivers in normal lighting conditions. First, the images were preprocessed to remove noise using histogram equalization. Second, the face and eyes were detected using a Haar-like feature-based object detection algorithm and a template matching method, respectively. Finally, drowsiness is detected on the basis of the eye state (open/closed) in consecutive frames. Diddi and Jamge (2015) used a similar approach for drowsiness detection. The authors claim that the proposed scheme is low cost and able to detect all types of eyes of any gender.

Liu et al. (2010) proposed another fatigue detection algorithm based on the Viola–Jones cascaded classifier algorithm and the diamond search algorithm. A simple feature is extracted from a temporal difference image, and the distance between the lower and upper eyelids is then used to analyze the eye state. Three criteria were used to determine the driver’s attentiveness. Similarly, Liu et al. (2010) proposed a novel algorithm to detect eye closure using an infrared camera. The Viola–Jones framework was used to localize the face and eyes, followed by extraction of the eye region. Then, the proposed algorithm detects the eye corners and eyelid movement, based on which the eye state (open/closed) is classified.

Several studies used a combination of the Viola–Jones framework and a support vector machine (SVM) for image processing and eye-state classification, respectively (AL-Anizy 2015; Flores et al. 2010b; Punitha and Geetha 2015; Shen and Xu 2015).

Flores et al. (2010b) presented a real-time fatigue detection system that works under different illuminations. An IR camera was used for image acquisition, followed by face detection using the Viola–Jones framework. A Gabor filter and the condensation algorithm were applied for feature extraction. Finally, drowsiness is detected using an SVM based on the eye state (open/closed).

The drowsiness detection scheme proposed in Punitha and Geetha (2015) uses the Viola–Jones face cascade of classifiers to detect the face, and then the eyes are detected heuristically (i.e. with respect to the height and width of the face). Eye images were preprocessed using histogram equalization, and minimum intensity projection is used to detect the eye state, which is then fed to an SVM to classify it as open/closed. Finally, drowsiness is detected on the basis of the eye state (open/closed) in consecutive frames.

Shen and Xu (2015) proposed an embedded driver drowsiness monitoring system that contains four modules, i.e. image detection, image processing, fatigue detection, and display. In order to minimize lighting effects, an infrared camera is used. The underlying algorithm is based on the Viola–Jones framework, and an SVM is used to determine the eye state. Finally, fatigue is detected on the basis of eye closure.

Similarly, in AL-Anizy (2015) a two-phase scheme is proposed where image processing is based on the Viola–Jones Haar face detection algorithm and an SVM is used as the classifier. Six drivers participated in the study and a standard HP laptop webcam was used to capture images. Unlike previous approaches, template matching was replaced by histogram equalization to speed up processing.

Yawn Analysis

Several studies have proposed methods for driver drowsiness detection based on yawn analysis (Abtahi 2012; Saradadevi and Bajaj 2008).

Saradadevi and Bajaj (2008) used the Viola–Jones framework for face detection and mouth region extraction. For fatigue detection, an SVM was trained to recognize yawning.

Abtahi (2012) introduced three different methods for driver fatigue detection based on yawn analysis, namely, color segmentation, an active contour model, and the Viola–Jones method. Participants with varied characteristics (e.g. male/female, with/without glasses, with/without beard) were chosen for the study, and a Canon A720Is digital camera was used for video acquisition.

Hybrid

Some researchers have also proposed hybrid schemes for drowsiness detection that combine two or more facial features (Churiwala et al. 2012; Sacco and Farrugia 2012a; Sigari, Fathy, and Soryani 2013).

Churiwala et al. (2012) combined eye movement, yawn detection, and head rotation for driver drowsiness detection. An algorithm based on the Haar classifier was used to compute four parameters (i.e. duration of eye closure, eye blink frequency, yawning, and head rotation), and their outputs are combined to obtain the result. The authors claim good performance; however, the paper does not include test results.

Sacco and Farrugia (2012a) used the Viola–Jones object detection framework for detecting facial features, along with correlation coefficient template matching to determine the state of the features. Finally, the overall fatigue level is determined by applying an SVM to the combination of three features.

Sigari, Fathy, and Soryani (2013) proposed an adaptive fuzzy expert system that uses a combination of eye region-related symptoms (i.e. distance between eyelids and eye closure) and face region-related symptoms (i.e. head rotation) for drowsiness detection. A digital camera is used for image acquisition, followed by face detection using the Viola–Jones method. The conventional eye detection step was bypassed; instead, the horizontal projection of the top half of the face is used to extract eye region-related symptoms, while template-based matching is used for the face region. Finally, the fuzzy expert system is used for drowsiness detection with a short training phase.

From the cited literature review, we observed that mostly eye behaviors, particularly eye closure, have been used to determine fatigue by applying a combination of Viola–Jones and Support Vector Machine (SVM) for eye detection and eye-state classification, respectively. The issues pertaining to the real-time driver fatigue detection problem are as follows:

Dataset collection is the biggest challenge since there are no standard datasets available to study fatigue detection problem.
Determine a suitable camera location for video acquisition in such a way that the camera will not distract the driver or block his/her view.
Detect face and eyes:
- When the driver is not facing the camera.
- When driver’s eyes are closed.
- Across gender with variations in physical appearances.
- Under different lighting conditions.
- In the presence of interference and noise while driving.
Devise a low-cost solution for fatigue detection.

Proposed Scheme

Workflow

First, the driver’s video is captured and the input frames are preprocessed, followed by face and eye detection. After localizing the right eye, the eye state is determined and, if the eye is found closed for a certain number (i.e. threshold) of consecutive frames, the episode is marked as a micro-sleep. Once a micro-sleep pattern is detected, an alarm is triggered to warn the driver. Pseudo-code for the basic workflow of the proposed system is given in Figure 3 and the major phases of the system are depicted in the block diagram given in Figure 4.

Figure 3. Workflow of the proposed scheme.

Figure 4. Block diagram of the proposed system.
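
The micro-sleep rule in the workflow above can be sketched in a few lines. This is a minimal illustration in Python (the authors' implementation is in MATLAB), assuming a per-frame closed/open decision is already available; the function name `detect_microsleeps` and the threshold value of 3 are hypothetical and not taken from the paper.

```python
def detect_microsleeps(eye_closed, threshold):
    """Return the frame indices at which an alarm would be raised.

    eye_closed: iterable of booleans, one per processed frame (True = closed).
    threshold:  number of consecutive closed frames treated as a micro-sleep
                (hypothetical parameter; the paper does not state its value here).
    """
    if threshold < 1:
        raise ValueError("threshold must be at least 1")
    alarms = []
    run = 0  # length of the current run of closed-eye frames
    for i, closed in enumerate(eye_closed):
        run = run + 1 if closed else 0
        if run == threshold:  # fire once per closed-eye episode
            alarms.append(i)
    return alarms


# Illustrative eye states: a short blink followed by a longer closure.
states = [False, True, True, False, True, True, True, True, False]
print(detect_microsleeps(states, threshold=3))
```

The alarm fires only once per episode, when the run of closed frames first reaches the threshold, so a long closure does not raise repeated alarms.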

Performance evaluation will be done in terms of face and eye detection accuracies along with their respective miss rates, eye-state classification accuracy, and overall fatigue recognition accuracy along with the number of missed/false alarms.
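
As a rough illustration of these measures, the sketch below computes detection accuracy with its miss rate, classification accuracy, and missed/false alarm counts. The definitions are assumptions made for illustration (in particular, the miss rate assumes the face or eye is present in every evaluated frame), and all function names are hypothetical.

```python
def detection_rates(n_frames, n_detected):
    """Accuracy and miss rate, assuming the target is present in every frame."""
    if n_frames <= 0:
        raise ValueError("n_frames must be positive")
    accuracy = n_detected / n_frames
    miss_rate = (n_frames - n_detected) / n_frames
    return accuracy, miss_rate


def classification_accuracy(predicted, actual):
    """Fraction of eye-state labels that match the ground truth."""
    correct = sum(p == a for p, a in zip(predicted, actual))
    return correct / len(actual)


def alarm_errors(raised, expected):
    """Count (missed, false) alarms over paired per-episode booleans."""
    missed = sum(1 for r, e in zip(raised, expected) if e and not r)
    false = sum(1 for r, e in zip(raised, expected) if r and not e)
    return missed, false


# Made-up counts, for illustration only.
print(detection_rates(1000, 987))
print(classification_accuracy(["open", "closed", "open", "open"],
                              ["open", "closed", "closed", "open"]))
```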

Method Description

Face and Eye Detection

The Viola–Jones framework (Viola and Jones 2004), proposed in 2001 by Paul Viola and Michael Jones, was the first object detection framework to provide competitive detection rates in real time. It remains a widely used object detector for locating faces, noses, eyes, mouths, or upper bodies based on their Haar-like features.
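
To make the notion of Haar-like features concrete, the following sketch computes an integral image (summed-area table) and evaluates one two-rectangle feature on a toy patch. It is an illustrative pure-Python example of the general technique, not the detector used in this work; the helper names and the pixel values are hypothetical.

```python
def integral_image(img):
    """Summed-area table with an extra leading row and column of zeros."""
    h, w = len(img), len(img[0])
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row_sum = 0
        for x in range(w):
            row_sum += img[y][x]
            ii[y + 1][x + 1] = ii[y][x + 1] + row_sum
    return ii


def rect_sum(ii, x, y, w, h):
    """Sum of the pixels in the rectangle with top-left (x, y), width w, height h."""
    return ii[y + h][x + w] - ii[y][x + w] - ii[y + h][x] + ii[y][x]


def two_rect_vertical(ii, x, y, w, h):
    """Two-rectangle Haar-like feature: top half minus bottom half (h even)."""
    half = h // 2
    return rect_sum(ii, x, y, w, half) - rect_sum(ii, x, y + half, w, half)


# Toy 4x4 patch: a bright band above a dark band.
patch = [
    [200, 200, 200, 200],
    [200, 200, 200, 200],
    [50, 50, 50, 50],
    [50, 50, 50, 50],
]
ii = integral_image(patch)
print(rect_sum(ii, 0, 0, 4, 4))           # total intensity of the patch
print(two_rect_vertical(ii, 0, 0, 4, 4))  # large value: strong bright/dark contrast
```

Once the integral image is built, any rectangle sum costs four table lookups regardless of its size, which is what makes evaluating many such features per window feasible.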

Features Extraction

Features extraction is the process of selecting relevant attributes in the data that are used for the construction of the model. This phase has two main objectives:

Enhance the dataset quality and classifier performance by eliminating the redundant/irrelevant features.
Decrease the size of the input data.

Principal Component Analysis (PCA) is used for feature extraction. PCA extracts the features of an image that best differentiate among the input images. It uses an orthogonal transformation to convert a set of observations of possibly correlated variables into a set of values of uncorrelated variables called principal components. The extracted feature set is then passed to the classifiers for training and testing.
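
The sketch below illustrates the idea on toy data: it centers the samples, builds the covariance matrix, and finds the leading principal axes by power iteration with deflation. This is an assumption-laden illustration, not the authors' MATLAB code; in practice each sample would be a flattened eye image and a numerical library would be used instead of pure Python. The names `pca` and `project` are hypothetical.

```python
import math


def pca(samples, n_components, n_iter=200):
    """Return (mean, components); components are unit-length principal axes."""
    n, d = len(samples), len(samples[0])
    mean = [sum(s[j] for s in samples) / n for j in range(d)]
    centered = [[s[j] - mean[j] for j in range(d)] for s in samples]
    # Sample covariance matrix (d x d).
    cov = [[sum(c[i] * c[j] for c in centered) / (n - 1) for j in range(d)]
           for i in range(d)]
    components = []
    for _ in range(n_components):
        # Deterministic start vector; power iteration fails in the rare case
        # where it is exactly orthogonal to the leading eigenvector.
        v = [1.0 / math.sqrt(d)] * d
        for _ in range(n_iter):
            w = [sum(cov[i][j] * v[j] for j in range(d)) for i in range(d)]
            norm = math.sqrt(sum(x * x for x in w))
            if norm == 0.0:
                break
            v = [x / norm for x in w]
        eigval = sum(v[i] * sum(cov[i][j] * v[j] for j in range(d))
                     for i in range(d))
        components.append(v)
        # Deflation: remove the variance explained by the component just found.
        cov = [[cov[i][j] - eigval * v[i] * v[j] for j in range(d)]
               for i in range(d)]
    return mean, components


def project(sample, mean, components):
    """Coordinates of one sample in the principal-component basis."""
    centered = [x - m for x, m in zip(sample, mean)]
    return [sum(c * x for c, x in zip(comp, centered)) for comp in components]


# Toy data lying on the line y = 2x, so one component captures all variance.
data = [[0.0, 0.0], [1.0, 2.0], [2.0, 4.0], [3.0, 6.0]]
mean, comps = pca(data, n_components=1)
print(comps[0])                        # direction proportional to (1, 2)
print(project([3.0, 6.0], mean, comps))
```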

Classification Methods

Two classification methods which have been used for the eye-state classification are briefly described in the following subsections.

Support Vector Machine (SVM)

A support vector machine (SVM) is a supervised learning model that analyzes data and recognizes patterns, and is used for classification. SVM is based on the principle of maximum margin. It represents the samples as points in a space and maps them so that the points of separate classes are divided by as wide a gap as possible. New test samples are then mapped into the same space and classified based on the side of the boundary on which they fall. SVM uses two datasets, i.e. a training dataset and a testing dataset.

Given a set of training samples, each belonging to one of the two classes, SVM training algorithm builds a model that assigns new test samples into one class or the other.
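
As a minimal sketch of this training and prediction cycle, the code below fits a linear SVM by batch sub-gradient descent on the regularized hinge loss. The linear kernel, the hyperparameters, and the toy two-dimensional points are assumptions chosen for illustration; the paper does not specify the kernel or solver used, and in the proposed scheme the inputs would be PCA features of eye images.

```python
def train_linear_svm(X, y, lam=0.01, lr=0.1, epochs=200):
    """Minimize lam/2 * ||w||^2 + mean(hinge loss); labels must be +1 or -1."""
    n, d = len(X), len(X[0])
    w = [0.0] * d
    b = 0.0
    for _ in range(epochs):
        grad_w = [lam * wj for wj in w]  # gradient of the regularizer
        grad_b = 0.0
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            if margin < 1:  # sample violates the margin: hinge loss is active
                for j in range(d):
                    grad_w[j] -= yi * xi[j] / n
                grad_b -= yi / n
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


def svm_predict(w, b, x):
    """Classify by the side of the separating hyperplane."""
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0 else -1


# Hypothetical, linearly separable toy data (+1 and -1 are the two classes).
X_train = [[-2, -1], [-1, -2], [-2, -2], [2, 1], [1, 2], [2, 2]]
y_train = [-1, -1, -1, 1, 1, 1]
w, b = train_linear_svm(X_train, y_train)
print([svm_predict(w, b, x) for x in X_train])
```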

Adaboost

Adaboost is an ensemble learning algorithm used for classification, which creates a strong learner by iteratively adding weak learners. In each training round, a new weak learner is added to the ensemble and the sample weights are adjusted to focus on the samples that were previously misclassified.
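
The reweighting loop can be sketched as follows, using one-feature threshold classifiers (decision stumps) as the weak learners. The choice of stumps, the function names, and the one-dimensional toy data are assumptions for illustration only; the paper does not state which weak learner was used.

```python
import math


def stump_predict(x, feature, threshold, polarity):
    """Weak learner: predict `polarity` below the threshold, its negation above."""
    return polarity if x[feature] < threshold else -polarity


def train_adaboost(X, y, n_rounds):
    """Return a list of (alpha, feature, threshold, polarity); labels are +1/-1."""
    n, d = len(X), len(X[0])
    weights = [1.0 / n] * n
    ensemble = []
    for _ in range(n_rounds):
        best = None
        for f in range(d):
            values = sorted(set(x[f] for x in X))
            # Candidate thresholds: midpoints between consecutive distinct values.
            for t in [(a + b) / 2 for a, b in zip(values, values[1:])]:
                for polarity in (1, -1):
                    err = sum(w for w, x, yi in zip(weights, X, y)
                              if stump_predict(x, f, t, polarity) != yi)
                    if best is None or err < best[0]:
                        best = (err, f, t, polarity)
        err, f, t, polarity = best
        err = min(max(err, 1e-10), 1 - 1e-10)  # guard against log(0)
        alpha = 0.5 * math.log((1 - err) / err)
        ensemble.append((alpha, f, t, polarity))
        # Increase the weight of misclassified samples, decrease the others.
        weights = [w * math.exp(-alpha * yi * stump_predict(x, f, t, polarity))
                   for w, x, yi in zip(weights, X, y)]
        total = sum(weights)
        weights = [w / total for w in weights]
    return ensemble


def adaboost_predict(ensemble, x):
    """Sign of the alpha-weighted vote of all weak learners."""
    score = sum(a * stump_predict(x, f, t, p) for a, f, t, p in ensemble)
    return 1 if score >= 0 else -1


# Toy data that no single stump separates, but three boosted stumps do.
X_toy = [[v] for v in range(10)]
y_toy = [1, 1, 1, -1, -1, -1, 1, 1, 1, -1]
model = train_adaboost(X_toy, y_toy, n_rounds=3)
print([adaboost_predict(model, x) for x in X_toy])
```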

Dataset Collection

In our dataset, we considered multiple subjects from both genders, having different appearances, in different cars and under different lighting conditions ().

We placed a mobile camera on the extreme left of the dashboard and made videos of each driver in different vehicles, covering multiple scenarios, i.e. parked car, while driving, in shade, in sunlight, etc ().

The datasets used in the research papers we studied are not standardized and there is a lot of variation in the experimental setup as well ().

Implementation Steps

The proposed scheme is implemented in MATLAB and is based on several sequential steps, which have been described in the following sections.

Video Acquisition

Initially, we performed an exercise to find the most suitable camera location for video acquisition, such that the camera does not distract the driver or block his/her view (Figure 5). Three camera locations with/without zoom were tested with two subjects:

Extreme left of the dashboard
Middle of the dashboard
Under the rear-view mirror

First, the frames extracted from the videos were preprocessed, followed by face and eye detection. Second, we asked the drivers which camera location was least distracting while driving. After analyzing the face and eye detection results and the drivers’ opinions, we finalized the camera location (i.e. on the extreme left of the dashboard without zoom).

Figure 5. Camera locations tested for video acquisition (extreme left of the dashboard, middle of the dashboard, under the rear-view mirror).

For video acquisition in our proposed system, we placed a mobile camera on the extreme left of the dashboard and made videos of each driver in different vehicles, covering multiple scenarios, i.e. parked car, while driving, in shade, in sunlight.

Preprocessing

In this step, we extract frames from the video and take every 10th frame as input. Each input frame is preprocessed by converting it to grayscale, followed by gamma correction. Gamma correction controls the overall lightness or darkness of the image and is used to improve the image quality (Figure 6).

Figure 6. Preprocessing (grayscale conversion and gamma correction).
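
The preprocessing steps above can be sketched as follows. This is an illustrative pure-Python version operating on nested lists; the luminance weights are the common ITU-R BT.601 values, and the gamma value of 0.5 is a hypothetical choice, since the paper does not report the value used. All function names are assumptions.

```python
def select_frames(frames, step=10):
    """Keep every `step`-th frame (the 10th, 20th, ... for step=10)."""
    return frames[step - 1::step]


def to_grayscale(rgb_image):
    """Convert rows of (r, g, b) tuples in 0-255 to gray levels in 0-255."""
    return [[round(0.299 * r + 0.587 * g + 0.114 * b) for r, g, b in row]
            for row in rgb_image]


def gamma_correct(gray_image, gamma):
    """Apply out = 255 * (in / 255) ** gamma; gamma < 1 brightens the image."""
    lut = [round(255 * (v / 255) ** gamma) for v in range(256)]  # lookup table
    return [[lut[v] for v in row] for row in gray_image]


frame_ids = list(range(1, 31))            # stand-in for 30 decoded frames
print(select_frames(frame_ids))           # every 10th frame
gray = to_grayscale([[(255, 255, 255), (0, 0, 0), (64, 64, 64)]])
print(gamma_correct(gray, gamma=0.5))     # mid-dark pixel becomes brighter
```

Using a lookup table keeps the gamma step cheap, because the power function is evaluated only 256 times per frame rather than once per pixel.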

Face Detection

After preprocessing the input frame, the face is detected. In our face-detection algorithm, we use the Viola–Jones “ProfileFace” model, since the video is captured from the left side of the driver (Figures 7, 8, and 9).

Figure 7. Flowchart for face-detection algorithm.

Figure 8. Face detection in input frame (open eyes, closed eyes).

Figure 9. Extracted face images after face detection (open eyes, closed eyes).

Eye Detection

After extracting the face image, the right eye is detected in each frame. In our eye-detection algorithm, we use the Viola–Jones “RightEyeCART” model, since the video is captured from the left side of the driver (Figures 10, 11, and 12).

Figure 10. Flowchart for eye-detection algorithm.

Figure 11. Right eye detection in input face image (open eye, closed eye).

Figure 12. Extracted right eye images after eye detection (open eye, closed eye).
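
For readers working outside MATLAB, the face and eye detection steps above can be approximated with OpenCV's bundled Haar cascades. The sketch below is an assumption-based analogue, not the authors' algorithm: it treats `haarcascade_profileface.xml` and `haarcascade_righteye_2splits.xml` as rough counterparts of the “ProfileFace” and “RightEyeCART” models, uses illustrative `scaleFactor` and `minNeighbors` values, and omits the fallback logic of the flowcharts. Depending on which way the profile faces, the frame may need to be flipped horizontally before detection.

```python
def largest_box(boxes):
    """Pick the detection with the largest area from (x, y, w, h) boxes, or None."""
    boxes = [tuple(int(v) for v in b) for b in boxes]
    return max(boxes, key=lambda b: b[2] * b[3]) if boxes else None


def detect_right_eye(gray_frame):
    """Return (face_box, eye_box) in frame coordinates; either may be None.

    gray_frame: 2-D uint8 NumPy array (a preprocessed grayscale frame).
    """
    import cv2  # requires the opencv-python package

    face_cascade = cv2.CascadeClassifier(
        cv2.data.haarcascades + "haarcascade_profileface.xml")
    eye_cascade = cv2.CascadeClassifier(
        cv2.data.haarcascades + "haarcascade_righteye_2splits.xml")

    faces = face_cascade.detectMultiScale(gray_frame, scaleFactor=1.1,
                                          minNeighbors=5)
    face = largest_box(faces)
    if face is None:
        return None, None

    # Search for the eye only inside the detected face region.
    fx, fy, fw, fh = face
    face_roi = gray_frame[fy:fy + fh, fx:fx + fw]
    eyes = eye_cascade.detectMultiScale(face_roi, scaleFactor=1.1,
                                        minNeighbors=5)
    eye = largest_box(eyes)
    if eye is None:
        return face, None

    # Convert the eye box from face-relative to frame coordinates.
    ex, ey, ew, eh = eye
    return face, (fx + ex, fy + ey, ew, eh)
```

Restricting the eye search to the face region reduces both computation and false detections elsewhere in the frame.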