Relationships between demographic, anthropometric, inflammatory, lipid and glucose tolerance markers in connection with the fat but fit paradigm were investigated by supervised and unsupervised learning. Data from 81 apparently healthy participants (87% females) were used to generate four classes of fatness and fitness. Principal Component Analysis (PCA) revealed that the principal component was preponderantly composed of glucose tolerance parameters. IL-10 and high-density lipoprotein, low-density lipoprotein (LDL), and total cholesterol, along with body mass index (BMI), were the most important features according to Random Forest based recursive feature elimination. Decision Tree classification showed that these play a key role into assigning each individual in one of the four classes, with 70% accuracy, and acceptable classification agreement, κ = 0.54. However, the best classifier with 88% accuracy and κ = 0.79 was the Naïve Bayes. LDL and BMI partially mediated the relationship between fitness and fatness. Although unsupervised learning showed that the glucose tolerance cluster explains the highest quote of the variance, supervised learning revealed that the importance of IL-10, cholesterol levels and BMI was greater than the glucose tolerance PCA cluster. These results suggest that fitness and fatness may be interconnected by anti-inflammatory responses and cholesterol levels. Randomized controlled trials are needed to confirm these preliminary outcomes.
Keywords: PCA; VO2max; anti-inflammatory; machine learning.