Volume 11, Issue 4
RESEARCH ARTICLE

hyperoverlap: Detecting biological overlap in n‐dimensional space

Matilda J. M. Brown

Corresponding Author

E-mail address: matilda.brown@utas.edu.au

School of Natural Sciences, University of Tasmania, Hobart, TAS, Australia

Correspondence

Matilda J. M. Brown

Email: matilda.brown@utas.edu.au

Search for more papers by this author
Barbara R. Holland

School of Natural Sciences, University of Tasmania, Hobart, TAS, Australia

Search for more papers by this author
Greg J. Jordan

School of Natural Sciences, University of Tasmania, Hobart, TAS, Australia

Search for more papers by this author
First published: 30 January 2020
The peer review history for this article is available at https://publons.com/publon/10.1111/2041-210X.13363

Abstract

  1. Comparative biological studies often investigate the morphological, physiological or ecological divergence (or overlap) between entities such as species or populations. Here we discuss the weaknesses of using existing methods to analyse patterns of phenotypic overlap and present a novel method to analyse co‐occurrence in multidimensional space.
  2. We propose a ‘hyperoverlap’ framework to detect qualitative overlap (or divergence) between point datasets and present the hyperoverlap r package which implements this framework, including functions for visualization. hyperoverlap uses support vector machines (SVMs) to train a classifier based on point data (such as morphological or ecological data) for two entities. This classifier finds the optimal boundary between the two sets of data and compares the predictions to the original labels. Misclassification is an evidence of overlap between the two entities. We demonstrate the theoretical and practical advantages of this method compared to existing approaches (e.g. single‐entity hypervolume models) using the bioclimatic data extracted from global occurrence records of conifers.
  3. We find that there are instances where single‐entity hypervolume models predict overlap, but there are no observations of either entity in the shared hypervolume. In these instances, hyperoverlap reports nonoverlap. We show that our method is stable and less likely to be affected by sampling biases than current approaches. We also find that hyperoverlap is particularly effective for situations involving entities with a small number of data points (e.g. narrowly endemic species) for which single‐entity models cannot be reliably constructed.
  4. We argue that overlap can be reliably detected using hyperoverlap, particularly for descriptive studies. The method proposed here is a valuable tool for studying patterns of overlap in a multidimensional space.

DATA AVAILABILITY STATEMENT

The r package code, conifer data and script used to for evaluations are available on GitHub and deposited in Zenodo (https://doi.org/10.5281/zenodo.3628890) (Brown, 2020).