Abstract
Fatigue detection plays a critical role in mission-critical environments such as defense operations, transportation, and industrial control, where sustained attention and alertness are paramount for safety, operational efficiency, and human performance. This study introduces a real-time, non-intrusive fatigue detection system that employs Vision Transformers (ViTs) to identify subtle facial cues associated with fatigue. Unlike traditional methods that rely primarily on overt indicators such as yawning, eye closure, or head nodding, our approach leverages advanced deep learning techniques to capture nuanced micro-expressions and subtle behavioral patterns that are often overlooked. By applying transfer learning from the FER-2013 dataset to the NTHU-DDD dataset, we achieve enhanced model generalization, with ViT-L16 and R50+ViT-B16 architectures attaining classification accuracies of 70.31% and 72.79%, respectively. Visual attention maps reveal that FER-FFR models focus more effectively on critical facial regions, enabling precise feature extraction and interpretability. Furthermore, we introduce a fatigue level indicator that quantifies fatigue progression over consecutive video frames, demonstrating behavior that closely aligns with human fatigue dynamics. The proposed system is robust, scalable, and suitable for deployment in real-world operational settings, providing an automated, reliable, and objective solution for continuous fatigue monitoring, potentially enhancing safety, productivity, and decision-making in high-stakes environments.
Keywords
Facial Expression Recognition (FER), Fatigue Detection, CNN, Resnet, Attention Based Models,Downloads
References
- C. Sutherland, A. Smallwood, T. Wootten, N. Redfern. Fatigue and its impact on performance and health. British Journal of Hospital Medicine, 84(2), (2023) 1-8. https://doi.org/10.12968/hmed.2022.0548
- D.N. Reinken, J. Rizek. Strategies to Prevent and Effectively Respond to Compassion Fatigue and Burnout. Journal of Emergency Nursing, 51(2), (2025) 205-210. https://doi.org/10.1016/j.jen.2024.10.004
- Y. Li, J. He. A review of strategies to detect fatigue and sleep problems in aviation: Insights from artificial intelligence. Archives of Computational Methods in Engineering, 31(8), (2024) 4655-4672. https://doi.org/10.1007/s11831-024-10123-5
- H. Pan, Y. Hu, Y. Wang, V. Duong. Fatigue Detection in Air Traffic Controllers: A Comprehensive Review. IEEE Access, 12, (2024) 185869 – 185880. https://doi.org/10.1109/ACCESS.2024.3513292
- P. Ertl, A. Kruse, M. Tilp. Detecting fatigue thresholds from electromyographic signals: A systematic review on approaches and methodologies. Journal of Electromyography and Kinesiology, 30, (2016) 216-230. https://doi.org/10.1016/j.jelekin.2016.08.002
- Y. Albadawi, M. Takruri, M. Awad. A review of recent developments in driver drowsiness detection systems. Sensors, 22(5), (2022) 2069. https://doi.org/10.3390/s22052069
- L. Dziuda, P. Baran, P. Zieliński, K. Murawski, M. Dziwosz, M. Krej, M. Piotrowski, R. Stablewski, A. Wojdas, W. Strus, H. Gasiul, M. Kosobudzki, A. Bortkiewicz. Evaluation of a fatigue detector using eye closure-associated indicators acquired from truck drivers in a simulator study. Sensors, 21(19), (2021) 6449. https://doi.org/10.3390/s21196449
- Z. Zhao, N. Zhou, L. Zhang, H. Yan, Y. Xu, Z. Zhang. Driver fatigue detection based on convolutional neural networks using em‐CNN. Computational intelligence and neuroscience, 2020(1), (2020) 7251280. https://doi.org/10.1155/2020/7251280
- H.A. ELLaban, A.A. Ewees, A.E. Elsaeed. A real-time system for facial expression recognition using support vector machines and k-nearest neighbor classifier. International Journal of Computer Applications, 159(8), (2017) 23-29.
- J.J. Lee, M.Z. Uddin, T.S. Kim. (2008) Spatiotemporal human facial expression recognition using fisher independent component analysis and Hidden Markov Model. In 2008 30th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, IEEE, Vancouver, BC, Canada, 2546-2549. https://doi.org/10.1109/IEMBS.2008.4649719
- S. Datta, D. Sen, R. Balasubramanian. (2017) Integrating geometric and textural features for facial emotion classification using SVM frameworks. In Proceedings of International Conference on Computer Vision and Image Processing: CVIP 2016, Springer Singapore, 1, 619-628. https://doi.org/10.1007/978-981-10-2104-6_55
- H. Ding, S.K. Zhou, R. Chellappa. (2017) Facenet2expnet: Regularizing a deep face recognition net for expression recognition. In 2017 12th IEEE international conference on automatic face & gesture recognition (FG 2017), IEEE, Washington, DC, USA. https://doi.org/10.1109/FG.2017.23
- T. Zhang, W. Zheng, Z. Cui, Y. Zong, J. Yan, K. Yan. A deep neural network-driven feature learning method for multi-view facial expression recognition. IEEE Transactions on Multimedia, IEEE, 18(12), (2016) 2528-2536. https://doi.org/10.1109/TMM.2016.2598092
- V. Mavani, S.Raman, K.P. Miyapuram. (2017) Facial expression recognition using visual saliency and deep learning. In Proceedings of the IEEE international conference on computer vision workshops,IEEE, Venice, Italy, 2783-2788. https://doi.org/10.1109/ICCVW.2017.327
- P.D. Marrero Fernandez, F.A. Guerrero Pena, T. Ren, A. Cunha. (2019) Feratt: Facial expression recognition with attention net. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, IEEE, Long Beach, CA, USA, https://doi.org/10.1109/CVPRW.2019.00112
- K. Wang, X. Peng, J. Yang, D. Meng, Y. Qiao. Region attention networks for pose and occlusion robust facial expression recognition. IEEE Transactions on Image Processing, IEEE, 29, (2020) 4057-4069. https://doi.org/10.1109/TIP.2019.2956143
- A. Jarndal, H. Tawfik, A.I. Siam, I. Alsyouf, A. Cheaitou. A real-time vision transformers-based system for enhanced driver drowsiness detection and vehicle safety. IEEE Access, IEEE, 13, (2024) 1790-1803. https://doi.org/10.1109/ACCESS.2024.3522111
- Z. Liu, Y. Peng, W. Hu. Driver fatigue detection based on deeply-learned facial expression representation. Journal of Visual Communication and Image Representation, 71, (2020) 102723. https://doi.org/10.1016/j.jvcir.2019.102723
- B.K. Savaş, Y. Becerikli. 2020. Real time driver fatigue detection system based on multi-task ConNN. IEEE, Access, 8, (2020)12491-12498. https://doi.org/10.1109/ACCESS.2020.2963960
- Y. Ji, S. Wang, Y. Zhao, J. Wei, Y. Lu. Fatigue state detection based on multi-index fusion and state recognition network. IEEE Access, 7, (2019) 64136-64147. https://doi.org/10.1109/ACCESS.2019.2917382
- K. Dwivedi, K. Biswaranjan, A. Sethi. (2014) Drowsy driver detection using representation learning. In 2014 IEEE international advance computing conference (IACC), IEEE, Gurgaon, India, 995-999. https://doi.org/10.1109/IAdCC.2014.6779459
- S. Park, F. Pan, S. Kang, C.D. Yoo. (2016) Driver drowsiness detection system based on feature representation learning using various deep networks. In Asian conference on computer vision Cham: Springer International Publishing, 154-164. https://doi.org/10.1007/978-3-319-54526-4_12
- H. Ding, S.K. Zhou, R. Chellappa. (2017) Facenet2expnet: Regularizing a deep face recognition net for expression recognition. In 2017 12th IEEE international conference on automatic face & gesture recognition (FG 2017), IEEE, Washington, DC, USA. https://doi.org/10.1109/FG.2017.23
- H. Gu, Q. Ji. (2004) An automated face reader for fatigue detection. In Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings. IEEE, 111-116.
- E. Vural, M. Bartlett, G. Littlewort, M. Cetin, A. Ercil, J. Movellan. (2010) Discrimination of moderate and acute drowsiness based on spontaneous facial expressions. In 2010 20th International Conference on Pattern Recognition IEEE, Istanbul, Turkey, 3874-3877. https://doi.org/10.1109/ICPR.2010.943
- S. Cao, P. Feng, W. Kang, Z. Chen, B. Wang. Optimized driver fatigue detection method using multimodal neural networks. Scientific Reports, 15(1), (2025) 12240. https://doi.org/10.1038/s41598-025-86709-1
- J. Bai, W. Zhu, S. Liu, C. Ye, P. Zheng, X. Wang. A Temporal Convolutional Network–Bidirectional Long Short-Term Memory (TCN-BiLSTM) Prediction Model for Temporal Faults in Industrial Equipment. Applied Sciences, 15(4), (2025) 1702. https://doi.org/10.3390/app15041702
- T.K. Soon, N.K. Ibrahim, B. Hussin, A. Idris, N.M. Yaacob, M.A. Algaet, N.A. Jalil. (2018) The utilization of feature based Viola-Jones method for face detection in invariant rotation. International Journal of Advanced Computer Science and Applications, 9(12). https://dx.doi.org/10.14569/IJACSA.2018.091211
- R. Jin, H. Li, J. Pan, W. Ma, J. Lin. (2021) Face recognition based on MTCNN and Facenet. In AAAI Conference on Artificial Intelligence.
- R.Y. Patil, Y.H. Patil, S.U. Bhandari. FER to FFR: a deep-learning-based approach for robust fatigue detection. International Journal of Computer Applications in Technology, 72(3), (2023) 203-211. https://dx.doi.org/10.1504/IJCAT.2023.133292
- F. Wang, M. Jiang, C. Qian, S. Yang, C. Li, H. Zhang, X. Wang, X. Tang. (2017) Residual attention network for image classification. In Proceedings of the IEEE conference on computer vision and pattern recognition, IEEE, Honolulu, HI, USA. https://doi.org/10.1109/CVPR.2017.683
Articles

