Ruminant microbiome data are skewed and unFAIR, undermining their usefulness for sustainable production improvement

Anim Microbiome. 2024 Oct 25;6(1):61. doi: 10.1186/s42523-024-00348-x.

Abstract

The ruminant microbiome plays a key role in the health, feed utilization and environmental impact of ruminant production systems. Microbiome research provides insights to reduce the environmental footprint and improve meat and milk production from ruminants. However, the microbiome composition depends on the ruminant species, habitat and diet, highlighting the importance of having a good representation of ruminant microbiomes in their local environment to translate research findings into beneficial approaches. This information is currently lacking. In this study, we examined the metadata of farmed ruminant microbiome studies to determine global representativeness and summarized information by ruminant species, geographic location, body site, and host information. We accessed data from the International Nucleotide Sequence Database Collaboration via the National Center for Biotechnology Information database. We retrieved 47,628 sample metadata, with cattle accounting for more than two-thirds of the samples. In contrast, goats, which have a similar global population to cattle, were underrepresented with less than 4% of the total samples. Most samples originated in Western Europe, North America, Australasia and China but countries with large ruminant populations in South America, Africa, Asia, and Eastern Europe were underrepresented. Microbiomes from the gastrointestinal tract were the most frequently studied, comprising about 87% of all samples. Additionally, the number of samples from other body sites such as the respiratory tract, milk, skin, reproductive tract, and fetal tissue, has markedly increased over the past decade. More than 40% of the samples lacked basic information and many were retrieved from generic taxonomic classifications where the ruminant species was manually recovered. The lack of basic information such as age, breed or sex can limit the reusability of the data for further analysis and follow-up studies. This requires correct taxonomic assignment of the ruminant host and basic metadata information using accepted ontologies adapted to host-associated microbiomes. Repositories should require this information as a condition of acceptance. The results of this survey highlight the need to encourage studies of the ruminant microbiome from underrepresented ruminant species and countries worldwide. This shortfall in information poses a challenge for the development of microbiome-based strategies to meet sustainability requirements, particularly in areas with expanding livestock production systems.

Keywords: Global representativeness; Metadata; Metagenome; Ontology; Ruminant microbiome.