Sex estimation is an important part of skeletal analysis and forensic identification. Traditionally pelvic traits are utilized for accurate sex estimation. However, the long bones, especially humerus, have been proved to be as effective for determine the sex of the individual.The aim of this study was to compare the predictive accuracy of seven statistical modelling techniques including classical statistical methods and machine learning algorithms, to assess the sexual dimorphism of humerus on a French sample based on a metric analysis of 26 measurements. A total of 98 humeral bones (divided in two samples) were measured. Seven statistical models were compared: Linear Discriminant Analysis (LDA), Regularized Discriminant Analysis (RDA), Penalized Logistic Regression (PLR), Flexible Discriminant Analysis (FDA), Support Vector Machine (SVM), and Artificial Neural Network (ANN) and Random Forest (RF).With cross validation, classification accuracy was greater than 90% (ranges between 92% and 98%) for all models without variable selection methods. The simplification of the models has improved the accuracy between 98% and 100% and also a reduction of the number of variables to 6 or less. Penalized logistic regression (PLR), Random Forest (RF) and Linear discriminant analysis (LDA) were the best accuracy models.The measurements made at the proximal part of the humerus (WTT, CSD), at distal part (BEW, WT, MAW, THT) and of the entire bone (PLCT) stand out among the various models.The present study suggests that the humerus is an interesting alternative for sex estimation and that non-classical statistical models can provide a new approach.
Keywords: Forensic anthropology; Humerus bone; Machine learning; Sex estimation; Sexual dimorphism; Statistical models.
© 2025. The Author(s).