The Influence of Matrix Size on Statistical Properties of Co-Occurrence and Limiting Similarity Null Models

Thomas Michael Lavender; Brandon S Schamp; Eric G Lamb

doi:10.1371/journal.pone.0151146

The Influence of Matrix Size on Statistical Properties of Co-Occurrence and Limiting Similarity Null Models

PLoS One. 2016 Mar 4;11(3):e0151146. doi: 10.1371/journal.pone.0151146. eCollection 2016.

Authors

Thomas Michael Lavender¹, Brandon S Schamp², Eric G Lamb¹

Affiliations

¹ Dept. of Plant Sciences, University of Saskatchewan, Saskatoon, Saskatchewan, Canada.
² Dept. of Biology, Algoma University, Sault Ste. Marie, Ontario, Canada.

Abstract

Null models exploring species co-occurrence and trait-based limiting similarity are increasingly used to explore the influence of competition on community assembly; however, assessments of common models have not thoroughly explored the influence of variation in matrix size on error rates, in spite of the fact that studies have explored community matrices that vary considerably in size. To determine how smaller matrices, which are of greatest concern, perform statistically, we generated biologically realistic presence-absence matrices ranging in size from 3-50 species and sites, as well as associated trait matrices. We examined co-occurrence tests using the C-Score statistic and independent swap algorithm. For trait-based limiting similarity null models, we used the mean nearest neighbour trait distance (NN) and the standard deviation of nearest neighbour distances (SDNN) as test statistics, and considered two common randomization algorithms: abundance independent trait shuffling (AITS), and abundance weighted trait shuffling (AWTS). Matrices as small as three × three resulted in acceptable type I error rates (p < 0.05) for both the co-occurrence and trait-based limiting similarity null models when exclusive p-values were used. The commonly used inclusive p-value (≤ or ≥, as opposed to exclusive p-values; < or >) was associated with increased type I error rates, particularly for matrices with fewer than eight species. Type I error rates increased for limiting similarity tests using the AWTS randomization scheme when community matrices contained more than 35 sites; a similar randomization used in null models of phylogenetic dispersion has previously been viewed as robust. Notwithstanding other potential deficiencies related to the use of small matrices to represent communities, the application of both classes of null model should be restricted to matrices with 10 or more species to avoid the possibility of type II errors. Additionally, researchers should restrict the use of the AWTS randomization to matrices with fewer than 35 sites to avoid type I errors when testing for trait-based limiting similarity. The AITS randomization scheme performed better in terms of type I error rates, and therefore may be more appropriate when considering systems for which traits are not clustered by abundance.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Ecosystem*
Models, Statistical*
Models, Theoretical*

Grants and funding

This study was funded by Natural Sciences and Engineering Research Council of Canada grants to EGL (Discovery grant: Plant-soil interactions and plant community structure) and BSS (RGPIN-371849-2009), and a University of Saskatchewan Department of Plant Sciences scholarship to TML. Computing resources were provided by WestGrid (www.westgrid.ca), Compute Canada / Calcul Canada (www.computecanada.ca) and the University of Saskatchewan High Performance Computing Research Facility (HPCRF). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.