Gene expression predictions and networks in natural populations supports the omnigenic theory

Aurélien Chateigner; Marie-Claude Lesage-Descauses; Odile Rogier; Véronique Jorge; Jean-Charles Leplé; Véronique Brunaud; Christine Paysant-Le Roux; Ludivine Soubigou-Taconnat; Marie-Laure Martin-Magniette; Leopoldo Sanchez; Vincent Segura

doi:10.1186/s12864-020-06809-2

Gene expression predictions and networks in natural populations supports the omnigenic theory

BMC Genomics. 2020 Jun 22;21(1):416. doi: 10.1186/s12864-020-06809-2.

Authors

Aurélien Chateigner¹, Marie-Claude Lesage-Descauses¹, Odile Rogier¹, Véronique Jorge¹, Jean-Charles Leplé², Véronique Brunaud^{3

4}, Christine Paysant-Le Roux^{3

4}, Ludivine Soubigou-Taconnat^{3

4}, Marie-Laure Martin-Magniette^{3

4

5}, Leopoldo Sanchez¹, Vincent Segura^{6

7}

Affiliations

¹ BioForA, INRAE, ONF, Orléans, France.
² BIOGECO, INRAE, Univ. Bordeaux, Cestas, France.
³ Institute of Plant Sciences Paris-Saclay (IPS2), CNRS, INRAE, Université Paris-Sud, Université d'Evry, Université Paris-Saclay, Gif sur Yvette, France.
⁴ Institute of Plant Sciences Paris-Saclay (IPS2), CNRS, INRAE, Université Paris-Diderot, Sorbonne Paris-Cité, Gif sur Yvette, France.
⁵ MIA-Paris, AgroParisTech, INRAE, Paris, France.
⁶ BioForA, INRAE, ONF, Orléans, France. [email protected].
⁷ AGAP, Université Montpellier, CIRAD, INRAE, Montpellier SupAgro, Montpellier, France. [email protected].

Abstract

Background: Recent literature on the differential role of genes within networks distinguishes core from peripheral genes. If previous works have shown contrasting features between them, whether such categorization matters for phenotype prediction remains to be studied.

Results: We measured 17 phenotypic traits for 241 cloned genotypes from a Populus nigra collection, covering growth, phenology, chemical and physical properties. We also sequenced RNA for each genotype and built co-expression networks to define core and peripheral genes. We found that cores were more differentiated between populations than peripherals while being less variable, suggesting that they have been constrained through potentially divergent selection. We also showed that while cores were overrepresented in a subset of genes statistically selected for their capacity to predict the phenotypes (by Boruta algorithm), they did not systematically predict better than peripherals or even random genes.

Conclusion: Our work is the first attempt to assess the importance of co-expression network connectivity in phenotype prediction. While highly connected core genes appear to be important, they do not bear enough information to systematically predict better quantitative traits than other gene sets.

Keywords: Boruta; Core; Machine learning; Peripheral; Populus nigra.

MeSH terms

Computational Biology / methods*
Gene Expression Profiling / methods*
Gene Expression Regulation, Developmental
Gene Expression Regulation, Plant
Gene Regulatory Networks*
Genotype
Machine Learning
Phenotype
Plant Proteins / genetics
Populus / genetics
Populus / growth & development*
Quantitative Trait Loci
Sequence Analysis, RNA

Substances

Plant Proteins