An Enhanced Lightweight Convolutional Cascade Network Based on MobileNetV3 and MTCNN for Robust Face Detection in Complex Environments

Linnea Rourke

doi:10.5281/zenodo.17566873

pdf

Published: 2025-01-01

DOI: https://doi.org/10.5281/zenodo.17566873

Linnea Rourke

University of Central Missouri

Abstract

In order to improve the detection accuracy of small-size and multi-angle faces in complex environments while maintaining efficiency for deployment on edge devices, this paper proposes an enhanced neural network model—MobileNetV3-MTCNN. Based on the classical MTCNN architecture, the proposed model reconstructs the feature extraction network by integrating the MobileNetV3 Bottleneck module, optimizing the balance between computational efficiency and detection precision. The improved architecture enables more effective multi-scale feature extraction and reduces redundancy in network operations. Experimental results on the WIDER_FACE dataset demonstrate that MobileNetV3-MTCNN achieves substantial gains in average precision (AP) and recall rate across easy, medium, and hard subsets compared to the original MTCNN, with increases of 4.8%, 6.2%, and 7.5% in AP respectively. Although the model parameters slightly increase, the proposed method demonstrates superior robustness and adaptability in detecting small and occluded faces under diverse lighting and pose conditions. The study confirms that the MobileNetV3-MTCNN model provides an effective and lightweight solution for real-world face detection applications, especially in densely populated and dynamic environments.

Issue

Vol. 1 No. 1 (2025)

Section

Articles

Article Sidebar

Main Article Content

Abstract

Article Details

Issue

Section