Dimension reduction has been used to visualise the distribution of multidimensional microbiome data, but the composite variables calculated by the dimension reduction methods have not been widely used to investigate the relationship of the human gut microbiome with lifestyle and disease. In the present study, we applied several dimension reduction methods, including principal component analysis, principal coordinate analysis (PCoA), non-metric multidimensional scaling (NMDS), and non-negative matrix factorization, to a microbiome dataset from 186 subjects with symptoms of allergic rhinitis (AR) and 106 controls. All the dimension reduction methods supported that the distribution of microbial data points appeared to be continuous rather than discrete. Comparison of the composite variables calculated from the different dimension reduction methods showed that the characteristics of the composite variables differed depending on the distance matrices and the dimension reduction methods. The first composite variables calculated from PCoA and NMDS with the UniFrac distance were strongly associated with AR (FDR adjusted P = 2.4 × 10-4 for PCoA and P = 2.8 × 10-4 for NMDS), and also with the relative abundance of Bifidobacterium and Prevotella. The abundance of Bifidobacterium was also linked to intake of several nutrients, including carbohydrate, saturated fat, and alcohol via composite variables. Notably, the association between the composite variables and AR was much stronger than the association between the relative abundance of individual genera and AR. Our results highlight the usefulness of the dimension reduction methods for investigating the association of microbial composition with lifestyle and disease in clinical research.
© 2024. The Author(s).