Volume 4, Issue 8
Application
Free Access

diveRsity: An R package for the estimation and exploration of population genetics parameters and their associated errors

Kevin Keenan

Institute for Global Food Security, School of Biological Science, Medical Biology Centre, Queen's University, Belfast, BT9 7BL Northern Ireland

Search for more papers by this author
Philip McGinnity

Aquaculture & Fisheries Development Centre, School of Biological, Earth & Environmental Sciences, University College Cork, Cork, Ireland

Search for more papers by this author
Tom F. Cross

Aquaculture & Fisheries Development Centre, School of Biological, Earth & Environmental Sciences, University College Cork, Cork, Ireland

Search for more papers by this author
Walter W. Crozier

Agri‐Food and Biosciences Institute, Newforge Lane, Belfast, Northern Ireland

Search for more papers by this author
Paulo A. Prodöhl

Corresponding Author

Institute for Global Food Security, School of Biological Science, Medical Biology Centre, Queen's University, Belfast, BT9 7BL Northern Ireland

Correspondence author. E‐mail: p.prodohl@qub.ac.ukSearch for more papers by this author
First published: 11 May 2013
Citations: 454

Summary

  1. We present a new R package, diveRsity, for the calculation of various diversity statistics, including common diversity partitioning statistics (θ, GST) and population differentiation statistics (DJost, urn:x-wiley:2041210X:media:mee312067:mee312067-math-0001, χ2 test for population heterogeneity), among others. The package calculates these estimators along with their respective bootstrapped confidence intervals for loci, sample population pairwise and global levels. Various plotting tools are also provided for a visual evaluation of estimated values, allowing users to critically assess the validity and significance of statistical tests from a biological perspective.
  2. diveRsity has a set of unique features, which facilitate the use of an informed framework for assessing the validity of the use of traditional F‐statistics for the inference of demography, with reference to specific marker types, particularly focusing on highly polymorphic microsatellite loci. However, the package can be readily used for other co‐dominant marker types (e.g. allozymes, SNPs).
  3. Detailed examples of usage and descriptions of package capabilities are provided. The examples demonstrate useful strategies for the exploration of data and interpretation of results generated by diveRsity. Additional online resources for the package are also described, including a GUI web app version intended for those with more limited experience using R for statistical analysis.

Introduction

As a consequence of the growing suite of statistical genetics tools, which are often tailored to particular marker types, the analyses of population genetic data are becoming an increasingly complex task (Excoffier & Heckel 2006). For instance, F‐statistics is a commonly used framework for the description of genetic diversity partitioning within and among populations. F‐statistics estimators (e.g. θ, GST) suffer from an incompatibility when applied to highly polymorphic microsatellite markers (Hedrick 1999; Jost 2008), as a result of their negative dependence on within subpopulation heterozygosity (Jost 2008). Thus, for loci with many alleles (e.g. >10), within subpopulation, heterozygosity will invariably be high, and as a consequence, ‘traditional’ F‐statistics will have a theoretical maximum well below the expected FST = 1. Attempts have been made to overcome this issue, most notably by Hedrick (2005), with the development of urn:x-wiley:2041210X:media:mee312067:mee312067-math-0002 and more recently, Jost (2008) with the development of DJost. However, much confusion still exists about what these ‘new’ statistics should actually be used for (Gerlach et al. 2010). It is not the purpose of this study to elaborate on such issues; however, interested readers are encouraged to see Jost (2008), Meirmans & Hedrick (2011) and Whitlock (2011) for useful reviews.

To add to the complexity, recent advances in molecular screening methodologies have greatly facilitated the ease with which genetic data can be generated. As a consequence, an increasing number of researchers, often with a limited background in statistical genetics analyses (Karl et al. 2012), face the difficult task of analysing and interpreting such data. Thus, software tools that facilitate this task, by providing suitable frameworks to allow for informed analysis pipelines, are essential. To this end, we present the software diveRsity. This R package allows the estimation of various population genetic summary statistics including the two ‘traditional’ F‐statistics analogues; θ (Weir & Cockerham 1984) and GST (Nei & Chesser 1983), and the two ‘new’ differentiation statistics; urn:x-wiley:2041210X:media:mee312067:mee312067-math-0003 (Hedrick 2005) and DJost (Jost 2008), as well as their unbiased/nearly unbiased estimators. Each statistic can be estimated for locus, global and sample pairwise comparisons. The package also provides functionality for the estimation of 95% confidence intervals at all relevant levels, through an integrated bootstrapping procedure. Uniquely to diveRsity, various plotting functions, designed to allow researchers to assess the validity of using their particular data set (or suite of marker loci) for the inference of geneflow using the F‐statistics framework, are also provided, as well as visualisation tools for large pairwise matrices of genetic differentiation and parameter confidence intervals. Furthermore, diveRsity also provides a range of other statistical tools, which are commonly used in population genetic analyses pipelines, but are rarely integrated into a single software package.

Another major advantage of using diveRsity is that it produces summary data structures, which are very close to publication‐ready formats (e.g. Fig. 1). Given that the compilation of such summary data is time consuming and often involves the use of several software packages, diveRsity offers a valuable addition to the molecular ecologist's statistical toolkit. Its implementation as an R package also makes diveRsity ideal for easy incorporation into analysis pipelines where batch processing of files/data is required, as is often the case in simulation‐based studies.

image
A screenshot of the results output format from the function divBasic. This table format is commonly seen in journal articles when presenting basic population genetic parameters. However, the parameters often have to be calculated in separate software packages and tabulated by authors. diveRsity aims to reduce this requirement for authors. The parameter calculated in this table are; N = Number of individuals per population sample genotyped per locus, A = Total number of alleles observed per population sample per locus, % = Percentage of total alleles observed across population samples per population sample per locus, Ar = Allelic richness per locus, Ho = observed heterozygosity per locus, He = expected heterozygosity per locus, HWE = Hardy–Weinberg Equilibrium P‐value from the χ2 goodness‐of‐fit tests per locus.

This package is intended to promote a more considered and simplified approach to frequentist population genetic structure analyses. Through the inclusion of diversity partitioning statistics (e.g. θ & GST), differentiation statistics (e.g. urn:x-wiley:2041210X:media:mee312067:mee312067-math-0004 & DJost), as well as functionality to assess the behaviour of these statistics across loci and population samples, we hope to give researchers the necessary tools to make educated decisions about the statistical and biological validity of their analyses with relative ease. Following this rationale, we have also opted to omit the option for users to carry out P‐value null hypothesis testing in relation to F‐statistics and population sample differentiation estimators. This decision was taken given the lack of meaningful information conveyed through the use of P‐values in this context, as well as the many misconceptions that exist regarding the biological interpretation of P‐values in relation to these statistics (Wagenmakers 2007). We have instead provided functions to allow users to estimate 95% confidence intervals (calculated as the 2·5% and 97·5% quantiles of a bootstrap distribution), for a range of statistical estimators calculated by the package, thus, leading to more reliable conclusions about the biological significance of trends in the data, (see Fig. 2 in du Prel et al. 2009), leaving less room for erroneous interpretation.

image
Visualisation of pairwise DJost (estimator), for = 50 populations. Total pairwise comparisons = 1225. This figure is returned from the difPlot function, which will plot diversity partitioning and differentiation estimators returned by divPart. Regions of dark blue represent low genetic differentiation, while light blue/white represents high differentiation. The text box caption is an example of the tooltip information associated with each pairwise population comparison.

Description

diveRsity is a package written for use in R (R Development Core Team 2012). It is primarily designed for the estimation, exploration and validation of genetic differentiation/structure indices. The package aims to consolidate under the same work environment, many of the most popular population genetic statistics such as those mentioned above, in order to provide researchers with a simplified way in which to calculate and compare these statistics. This strategy is particularly useful for the identification of polymorphism‐based biases mentioned previously. This information can be subsequently used, along with additional exploration tools implemented in the package, to make informed decisions about which statistical measures or molecular markers can be appropriately applied to address a particular question.

diveRsity also calculates a plethora of other statistics and has various other population genetics applications. Table 1 provides a list of functions along with brief descriptions of their specific purposes. The package accepts raw genotype data for any group of co‐dominant molecular markers in the genepop file format (Raymond & Rousset 1995). There is no limit to the size of the accepted input file other than the amount of random access memory (RAM) available to users. In addition to providing users with the ability to efficiently estimate an array of population genetic statistics, diveRsity is also particularly flexible in terms of return result formats (e.g. text files, excel workbooks and native R objects such as matrices and data frames). This flexibility facilitates subsequent downstream analysis (e.g. incorporation into simulation or approximate bayesian computation (ABC) pipelines as the summary statistic calculation software). A list of specific output formats is also summarised in Table 1.

Table 1. Functions of the diveRsity package
Function Returned objects Description
chiCalc R character matrix, optional.txt file Test for genetic heterogeneity between population samples using the chi‐square distribution. The function provides the unique option to disregard alleles of very low frequencies using the argument minFreq
corPlot R graphics plot (not automatically written to file) Correlation plotting of diversity statistics against the number of alleles per locus. The function is intended to aid in the assessment of marker suitability for the estimation of geneflow
divPart .html,.png,.txt,.xlsx, R data object A function for the calculation of diversity partition statistics and their associated variance through bootstrapping. Global, locus and pairwise levels are addressed
divOnline NA This function launches the web app version of divPart. Local resources are used when running analyses. The system default web browser is used to host the application
difPlot .html,.png Provides visualisation and exploration of pairwise genetic differentiation. The function is particularly useful for data sets containing a large number of population samples.
inCalc .png,.txt,.xlsx, R data object A function for the calculation of allele and locus informativeness for the inference of ancestry. Bootstrap confidence intervals are also calculated.
readGenepop R data object A general purpose function designed to calculate basic descriptive parameters from raw genetic data. This function is intended as a tool for developers of population genetics software in R.
divRatio R data object,.txt, or.xlsx This function calculates the diversity ratio statistics presented in (Skrbinšek et al. 2012)
bigDivPart R data object,.txt, or.xlsx This function is identical to divPart except for its lack of bootstrapping functionality. It is coded in a specific way to allow the sequential analysis of large number of markers (e.g. <100 000)
fstOnly R data object,.txt, or.xlsx This function calculates only Weir & Cockerham's 1984 F‐statistics. The function is slightly faster than divPart, which also calculates these statistics
divBasic R data object,.txt, or.xlsx This function calculates basic population bases statistics such as Allelic richness, Hardy–Weinberg equilibrium and locus expected and observed heterozygosis

Dependencies and suggested packages

In general, diveRsity can be used with a standard R installation and two additional extension packages (plotrix and shiny). The functions divPart, inCalc, chiCalc and readGenepop, divBasic, bigDivPart and divRatio, (i.e. the major analytical functions), can all operate independently of nonstandard packages. The only disadvantages of this approach are slower execution times (i.e. parallel computation is not available) and a limited number of formats available for returned results. To fully capitalise on the additional features of diveRsity (listed in Table 1), the installation of all suggested packages is recommended. Details of these packages are given in Table 2.

Table 2. Additional packages used by the diveRsity package, along with their implementations
Package Implementation Status Citation
Xlsx Used in divPart and inCalc to return multisheet.xlsx workbooks Suggested Dragulescu (2012)
sendplot Used in divPart, divPlot and inCalc to produce tooltips for data visualisation Suggested Gaile et al. (2012)
doParallel Used in divPart and inCalc for parallel computation Suggested Revolution Analytics (2012a)
parallel Used in divPart and inCalc for parallel computation Suggested R Development Core Team (2012)
foreach Used in divPart and inCalc for parallel computation Suggested Revolution Analytics (2012b)
iterators Used in divPart and inCalc for parallel computation Suggested Revolution Analytics (2012c)
plotrix Used in difPlot for additional plotting features Dependency Lemon (2006)
shiny Used to build and run the web app version of the divPart function Dependency RStudio & Inc (2012)

Comparisons with other software

The main motivation behind the development of diveRsity was to provide a cross‐platform software, which allows comprehensive and fast frequentist analysis of co‐dominant molecular data, while maintaining usability and convenient result formats. On each of these aims, diveRsity performs comparatively better in relation to other similar software.

Comprehensiveness

When compared with other software which estimates similar statistics, diveRsity generally provides a more comprehensive range of parameter calculation options. In terms of the total number of available population genetics statistics, with the possible exception of the Mac OS X only program, GenoDive (Meirmans & Van Tienderen 2004), diveRsity estimates many more than DEMEtics (Gerlach et al. 2010), SMOGD (Crawford 2010), mmod (Winter 2012), hierfstat (Goudet 2004) or SPADE (Chao & Shen 2003).

Focusing only on diversity partitioning/differentiation statistics, diveRsity overlaps in its calculation of DJost with all of the above‐mentioned software. However, diveRsity is the only package that allows the estimation of 95% confidence intervals, globally (i.e. for all samples and loci), per locus (i.e. over all samples) and for all pairwise sample comparisons (i.e. over all loci per population pair). SMOGD, for example, which is perhaps the most popular of these applications (with over 212 citations according to Google scholar), calculates bootstrapped confidence intervals for DJost at the locus level across all population samples, but does not provide this estimation for either the global or pairwise levels.

Despite the focus of this study on diversity partition/differentiation statistics, diveRsity also estimates many other useful population genetics statistics. These include, χ2 tests of Hardy–Weinberg equilibrium (HWE), Allelic richness (Ar), Chi‐square tests for sample homogeneity, ‘Yardstick’ diversity standardised ratios (Skrbinšek et al. 2012) and locus informativeness for the inference of ancestry (Rosenberg et al. 2003). Contrary to other similar programs, diveRsity also provides various exploratory plotting tools, which can be very useful for the identification of meaningful trends within results with minimal effort (e.g. Example 1). Typically, this task would involve the compilation of output results from various programs and subsequent visualisation in an independent software package (e.g. Microsoft Excel). A full description of diveRsity's functionality can be found by typing either of the following commands into the R console:

# diveRsity must be installed

# 1) package help pages

help(package =”diveRsity”)

# 2) package user manual

vignette(“diveRsity”)

Speed

Given the different analytical focuses of distinct softwares, performance comparisons in terms of speed are not straightforward. For example, while in one software, a given test statistic might be estimated using a maximum likelihood procedure, in another, a more computational intensive procedure (e.g. bootstrapping) may be used. For the purposes of this study, comparisons were restricted to instances were distinct softwares implemented similar computational processes to calculate a similar suit of statistical parameters. Based on these criteria, only two truly comparable speed comparisons were possible between diveRsity and any of the above listed software.

The first is a comparison of locus confidence interval estimation using bootstrapping with SMOGD. The reproducible code used to run diveRsity is as follows:

system.time({

# load diveRsity

library(“diveRsity”)

# load Test_data

data(Test_data)

# run the analysis

x <‐ divPart(infile = Test_data, outfile = NULL, gp = 3,

pairwise = TRUE, WC_Fst = FALSE, bs_locus = TRUE,

bs_pairwise = FALSE, bootstraps = 1000, plot = FALSE,

parallel = TRUE)

})

When running SMOGD on the example data set Test_data (see Keenan et al. in press for details on these data), with bootstraps set to 1000, the time taken to return results to the web browser is 2 min 34·1 s, while diveRsity takes only 1 min 17·3 s to carry out the same calculations on a laptop with an Intel Core i5‐2435 CPU @ 2·49GHz. It is also relevant to note that diveRsity's performance can be significantly increased with the use of additional CPUs.

The second comparison involves the calculation of diversity partitioning statistics per locus for large data sets (e.g. RAD‐seq derived SNP genotypes). This comparison was carried out between the diveRsity function bigDivPart and the hierfstat function basic.stats. For this test, a simulated data set of 268 individuals across four population samples genotyped for 55 200 bi‐allelic SNP loci was used. To complete the entire analysis, diveRsity took 3 min 20·1 s, while hierfstat took 6 min 44·8 sec, using the same laptop as described above. Such speed differences become even more important with the increasing rate at which large arrays of loci can be genotyped for large numbers of individuals.

Usability & convenience

Similar to other R packages, to fully benefit from all features built into diveRsity, a reasonable level of expertise in R is required. However, diveRsity has been designed so that even R beginners or those with very limited expertise can easily carry out comprehensive analysis of their data, including results being written to file, in many cases with a single command line. This is in contrast to other packages such as mmod and hierfstat, which invariably require users to export their own result from the R environment, as well as execute more functions to calculate fewer parameters than diveRsity. An example of the convenient results formats returned by diveRsity is shown in Fig. 1.

In keeping with the focus on ease of use, diveRsity also includes a web application, which provides a browser based user interface for the estimation of the most popular statistics implemented in the command line version of the package. This application was built using the framework provided by the R package, shiny (RStudio & Inc 2012), and provides users with a range of benefits including an easy to use interface and downloadable result files. The browser user interface also allows users to run their analyses on a remote server; thus, local system resources are not consumed. The application can be accessed at: http://glimmer.rstudio.com/kkeenan/diveRsity-online/.

Users can also run this application locally by executing the following command in the R console:

# after loading diveRsity

divOnline()

Despite an emphasis on simplicity, diveRsity still retains all of the functionality and flexibility provided by the R environment (i.e. all results are returned to the current session workspace). Thus, users with more experience can easily pipe results from their analyses into downstream custom analyses (e.g. ABC).

Accessing the package

The diveRsity package is hosted on the Comprehensive R Archive Network (CRAN), and can be downloaded using the install.packages function in R. Simply type the following command into the R console:

install.packages(“diveRsity”, dependencies = TRUE)

Providing the user has a working internet connection, and following the selection of a suitable CRAN repository mirror, the package will download and install automatically.

Ongoing development of diveRsity can also be tracked at: http://diversityinlife.weebly.com/software.html

This web page contains the latest developmental versions of the package as well as an update log.

Examples

As a demonstration of some of the envisaged applications of diveRsity, two reproducible examples are provided below. These examples assume that the diveRsity, shiny, doParallel, sendplot and plotrix packages have been installed as well as their dependencies. For additional examples, users are encouraged to read the package manual.

Example 1. Using visualisation tools to investigate large genetic differentiation matrices

Pairwise genetic differentiation is an important parameter in the assessment of relationships among populations within a geographical context. To date, the true potential of pairwise genetic differentiation statistics has not been fully realised, owing mainly to difficulties in identifying meaningful trends in often very large numbers of population comparisons.

However, using both the divPart and difPlot functions, diveRsity allows users to visualise large pairwise matrices of genetic differentiation, making the identification of particularly differentiated population samples relatively straightforward. This procedure is demonstrated below.

Load diveRsity into the current R session:

# Load the diveRsity package

require(“diveRsity”)

In this example, the Big_data data set (distributed with diveRsity) will be used. The data were simulated under a hierarchical island model (i.e. five island groups with 10 subpopulations each allowing high geneflow within island groups and low geneflow among island groups), using the software EASYPOP v1.7 (Balloux 2001). Population samples within the Big_data data file were arranged in order of geographical proximity for the purpose of demonstrating how diveRsity can be used to identify broad‐scale geographical trends from genetic data.

# Load'Big_data'

data(Big_data, package =”diveRsity”)

The divPart function is first used to calculate the required pairwise statistics matrices. In this example, the argument parallel will be set to TRUE as a large number of comparisons have to be computed (i.e. urn:x-wiley:2041210X:media:mee312067:mee312067-math-0005 for N = 50).

# Assign the results to the variable'pwStats'

# (i.e. pw = pairwise)

pwStats <‐ divPart(infile = Big_data, outfile =”Big_results”,

gp = 2, WC_Fst = TRUE, bs_locus = FALSE,

bs_pairwise = FALSE, bootstraps = 0,

Plot = FALSE, parallel = TRUE)

The resulting R object, pwStats contains the required pairwise statistics, which can be passed to the function difPlot for visualisation.

difPlot(x = pwStats, outfile =”Big_results”,

interactive = TRUE)

This command will write four.png files (one for each estimated statistic) and four.html files to the folder Big_results under the current R working directory. An example of the functionality of the.html tooltips is given in Fig. 2. From this figure, it is clear that the data are represented by five distinct genetic groups, which correlates with the simulation conditions described above. There are clearly high levels of differentiation among island groups (light blue/white) and low levels of differentiation within island groups (dark blue). This graphical representation perfectly relays what is known to be genetically/evolutionarily true (though natural population systems will rarely be so ideal).

Figure 2 also illustrates the ability to rapidly identify population pairs of interest by simply positioning the mouse pointer over a particular comparison square/pixel. In this example, the pairwise comparison between populations 18 vs. 23, (GST = 0·8883, θ = 0·9408, urn:x-wiley:2041210X:media:mee312067:mee312067-math-0006 = 0·9927 and DJost = 0·8802), indicates that these two populations are highly differentiated from one another.

Example 2. Assessing polymorphism bias in diversity partitioning estimators

As discussed above, diversity partitioning statistics such as GST and θ are negatively dependent on within subpopulation heterozygosity. Where this negative dependence is present (e.g. when using highly polymorphic microsatellites), it is important to ensure that inferences made from calculated values do not violate important assumptions. Using the functions divPart, readGenepop and corPlot, it is possible to carry out an ad hoc assessment of polymorphism bias in diversity statistics, thus allowing users to make informed decisions about whether to proceed with inference of demographic processes for example. A reproducible example is given below:

# Load the diveRsity package

require(“diveRsity”)

Next, an example data set (Test_data) provided with diveRsity should be loaded into the R session.

# Load'Test_data'

data(Test_data, package =”diveRsity”)

Initially, Test_data is analysed by the function divPart to calculate locus θ, GST, urn:x-wiley:2041210X:media:mee312067:mee312067-math-0007 and DJost estimators.

# Assign the results to the variable ‘difStats

difStats <‐ divPart(infile = Test_data, outfile =”Test”,

gp = 3, WC_Fst = TRUE, bs_locus = TRUE,

bs_pairwise = FALSE, bootstraps = 1000,

plot = TRUE, parallel = TRUE)

Next, Test_data is analysed by readGenepop to count the total number of alleles per locus.

# Assign the result to the variable ‘numAlleles

numAlleles <‐ readGenepop(infile = Test_data, gp = 3,

bootstrap = FALSE)

The package has now generated two results objects in the R environment: difStats and numAlleles. These objects can be passed to the function corPlot.

corPlot(x = numAlleles, y = difStats)

Figure 3 provides an example of the output from this analysis. As can be seen in this example, both θ and GST are negatively correlated with the number of alleles per locus, while urn:x-wiley:2041210X:media:mee312067:mee312067-math-0008 and DJost are strongly positively correlated. This discordance is indicative of a case where the mutation rate is likely to obscure past demographic processes (e.g. geneflow); thus, such a data set is unsuitable for addressing such questions.

image
Correlation assessment of locus estimators θ, GST, urn:x-wiley:2041210X:media:mee312067:mee312067-math-0009 and Dest (DJost unbiased estimator), with locus polymorphism (total number of alleles), returned from the corPlot function. Red lines represent the line of best fit and r values are Pearson product moment correlation coefficients.

Users executing the above code will also see a range of other graphical outputs in a folder named ‘Test’ within their working directory. These plots allow users to assess the variability of parameter estimation for individual loci, which can in turn be incorporated into decisions about ‘misbehaving’ loci for example.

Acknowledgements

The authors would like to thank J.J. Magee, M.S.P Ravinet, J. Coughlan and C. Johnston for testing the diveRsity package and R. Hynes for proofreading the manuscript. We would also like to express our gratitude to MEE executive editor Dr. Robert B. O'Hara and two anonymous reviewers, whose comments greatly improved the manuscript and the diveRsity package. K.K. was supported by a PhD studentship from the Beaufort Marine Research Award in Fish Population Genetics funded by the Irish Government under the Sea Change programme. P.A.P, T.F.C, W.W.C and P.McG were also supported by this award.

      Number of times cited according to CrossRef: 454

      • Rejection of the genetic implications of the “Abundant Centre Hypothesis” in marine mussels, Scientific Reports, 10.1038/s41598-020-57474-0, 10, 1, (2020).
      • RAD‐Seq Refines Previous Estimates of Genetic Structure in Lake Erie Walleye, Transactions of the American Fisheries Society, 10.1002/tafs.10215, 149, 2, (159-173), (2020).
      • Large-scale assessment of genetic diversity and population connectivity of Amazonian jaguars (Panthera onca) provides a baseline for their conservation and monitoring in fragmented landscapes, Biological Conservation, 10.1016/j.biocon.2020.108417, 242, (108417), (2020).
      • Multiple decades of stocking has resulted in limited hatchery introgression in wild brook trout (Salvelinus fontinalis) populations of Nova Scotia, Evolutionary Applications, 10.1111/eva.12923, 13, 5, (1069-1089), (2020).
      • Genotyping‐by‐sequencing illuminates high levels of divergence among sympatric forms of coregonines in the Laurentian Great Lakes, Evolutionary Applications, 10.1111/eva.12919, 13, 5, (1037-1054), (2020).
      • Genetic versus demographic stock structure of rainbow smelt in a large fragmented lake, Journal of Great Lakes Research, 10.1016/j.jglr.2020.02.009, (2020).
      • Intercontinental long‐distance seed dispersal across the Mediterranean Basin explains population genetic structure of a bird‐dispersed shrub, Molecular Ecology, 10.1111/mec.15413, 29, 8, (1408-1420), (2020).
      • Dwarfism in close continental amphibian populations despite lack of genetic isolation, Oikos, 10.1111/oik.07086, 129, 8, (1243-1256), (2020).
      • Genetic structure among morphotypes of the endangered Brazilian palm Euterpe edulis Mart (Arecaceae), Ecology and Evolution, 10.1002/ece3.6348, 10, 12, (6039-6048), (2020).
      • Bighorn Sheep Genetic Structure in Wyoming Reflects Geography and Management, The Journal of Wildlife Management, 10.1002/jwmg.21882, 84, 6, (1072-1090), (2020).
      • Genetic Structure of Smallmouth Bass in the Lake Michigan and Upper Mississippi River Drainages Relates to Habitat, Distance, and Drainage Boundaries, Transactions of the American Fisheries Society, 10.1002/tafs.10238, 149, 4, (383-397), (2020).
      • Historical demography and climate driven distributional changes in a widespread Neotropical freshwater species with high economic importance, Ecography, 10.1111/ecog.04874, 43, 9, (1291-1304), (2020).
      • The Andaman day gecko paradox: an ancient endemic without pronounced phylogeographic structure, Scientific Reports, 10.1038/s41598-020-68402-7, 10, 1, (2020).
      • Genetic diversity and structure of circumtropical almaco jack, : tool for conservation and management, Journal of Fish Biology, 10.1111/jfb.14450, 97, 3, (882-894), (2020).
      • Genome-wide analysis of diamondback moth, Plutella xylostella L., from Brassica crops and wild host plants reveals no genetic structure in Australia, Scientific Reports, 10.1038/s41598-020-68140-w, 10, 1, (2020).
      • Anthropogenic pressures negatively impact genomic diversity of the vulnerable seagrass Zostera capensis, Journal of Environmental Management, 10.1016/j.jenvman.2019.109831, 255, (109831), (2020).
      • Modular chromosome rearrangements reveal parallel and nonparallel adaptation in a marine fish, Ecology and Evolution, 10.1002/ece3.5828, 10, 2, (638-653), (2020).
      • The Transformative Impact of Genomics on Sage-Grouse Conservation and Management, , 10.1007/13836_2019_65, (2020).
      • Development of 18 microsatellite markers for Salvia pratensis, Applications in Plant Sciences, 10.1002/aps3.11316, 8, 1, (2020).
      • Telemetry and genetics reveal asymmetric dispersal of a lake‐feeding salmonid between inflow and outflow spawning streams at a microgeographic scale, Ecology and Evolution, 10.1002/ece3.5937, 10, 4, (1762-1783), (2020).
      • Genetic methods reveal high diversity and no evidence of stock structure among cuckoo rays (Leucoraja naevus) in the northern part of Northeast Atlantic, Fisheries Research, 10.1016/j.fishres.2020.105715, 232, (105715), (2020).
      • Origins and insights into the historic Judean date palm based on genetic analysis of germinated ancient seeds and morphometric studies, Science Advances, 10.1126/sciadv.aax0384, 6, 6, (eaax0384), (2020).
      • Haplotype Block Analysis Reveals Candidate Genes and QTLs for Meat Quality and Disease Resistance in Chinese Jiangquhai Pig Breed, Frontiers in Genetics, 10.3389/fgene.2020.00752, 11, (2020).
      • Population genetics of the European rabbit along a rural-to-urban gradient, Scientific Reports, 10.1038/s41598-020-57962-3, 10, 1, (2020).
      • Microsatellite markers for Anthericum ramosum: Development, characterization, and cross‐species amplification, Applications in Plant Sciences, 10.1002/aps3.11323, 8, 2, (2020).
      • Nickel‐Catalyzed ortho‐Acyloxylation of Benzamides and Acrylamides with Carboxylic Acids, ChemistrySelect, 10.1002/slct.201904651, 5, 6, (1925-1928), (2020).
      • Decimated little brown bats show potential for adaptive change, Scientific Reports, 10.1038/s41598-020-59797-4, 10, 1, (2020).
      • Congruent population genetic structures and divergence histories in anther‐smut fungi and their host plants Silene italica and the Silene nutans species complex, Molecular Ecology, 10.1111/mec.15387, 29, 6, (1154-1172), (2020).
      • The genetic legacy of extreme exploitation in a polar vertebrate, Scientific Reports, 10.1038/s41598-020-61560-8, 10, 1, (2020).
      • The intensity of sexual selection, body size and reproductive success in a mating system with male–male combat: is bigger better?, Oikos, 10.1111/oik.07223, 129, 7, (998-1011), (2020).
      • Comparative Genetic Characteristics of the Russian and Belarusian Populations of Wisent (Bison bonasus), North American Bison (Bison bison) and Cattle (Bos taurus), Cytology and Genetics, 10.3103/S0095452720020085, 54, 2, (116-123), (2020).
      • Is Niagara Falls a barrier to gene flow in riverine fishes? A test using genome‐wide SNP data from seven native species, Molecular Ecology, 10.1111/mec.15406, 29, 7, (1235-1249), (2020).
      • High Connectivity Among Breeding Populations of the Elegant Tern (Thalasseus elegans) in Mexico and Southern California Revealed Through Population Genomic Analysis, Waterbirds, 10.1675/063.043.0102, 43, 1, (17), (2020).
      • An empirical comparison of population genetic analyses using microsatellite and SNP data for a species of conservation concern, BMC Genomics, 10.1186/s12864-020-06783-9, 21, 1, (2020).
      • Negative dominance and dominance-by-dominance epistatic effects reduce grain-yield heterosis in wide crosses in wheat, Science Advances, 10.1126/sciadv.aay4897, 6, 24, (eaay4897), (2020).
      • adiv: An r package to analyse biodiversity in ecology, Methods in Ecology and Evolution, 10.1111/2041-210X.13430, 11, 9, (1106-1112), (2020).
      • Fine-scale population structure and evidence for local adaptation in Australian giant black tiger shrimp (Penaeus monodon) using SNP analysis, BMC Genomics, 10.1186/s12864-020-07084-x, 21, 1, (2020).
      • Using genomics to design and evaluate the performance of underwater forest restoration, Journal of Applied Ecology, 10.1111/1365-2664.13707, 57, 10, (1988-1998), (2020).
      • Integrating genetics, biophysical, and demographic insights identifies critical sites for seagrass conservation, Ecological Applications, 10.1002/eap.2121, 30, 6, (2020).
      • Genetic evaluation of migratory fish: Implications for conservation and stocking programs, Ecology and Evolution, 10.1002/ece3.6231, 10, 19, (10314-10324), (2020).
      • Association of genetic and climatic variability in giant sequoia, Sequoiadendron giganteum, reveals signatures of local adaptation along moisture‐related gradients, Ecology and Evolution, 10.1002/ece3.6716, 10, 19, (10619-10632), (2020).
      • Cryptic species and genetic connectivity among populations of the coral Pocillopora damicornis (Scleractinia) in the tropical southwestern Pacific, Marine Biology, 10.1007/s00227-020-03757-z, 167, 10, (2020).
      • Shifts in coral clonality along a gradient of disturbance: insights on reproduction and dispersal of Pocillopora acuta, Marine Biology, 10.1007/s00227-020-03777-9, 167, 11, (2020).
      • Using Genetic Data to Estimate Capture Rate of Wisconsin and Leech Lake Strains of Muskellunge Stocked in Four Wisconsin Lakes, North American Journal of Fisheries Management, 10.1002/nafm.10502, 40, 5, (1302-1312), (2020).
      • On the relationships between plant species richness and the environment: a case study in Eastern Ghats, India, Environmental Monitoring and Assessment, 10.1007/s10661-019-7686-7, 191, S3, (2020).
      • Differing, multiscale landscape effects on genetic diversity and differentiation in eastern chipmunks, Heredity, 10.1038/s41437-020-0293-0, (2020).
      • Paths for colonization or exodus? New insights from the brown bear (Ursus arctos) population of the Cantabrian Mountains, PLOS ONE, 10.1371/journal.pone.0227302, 15, 1, (e0227302), (2020).
      • Molecular and paleo‐climatic data uncover the impact of an ancient bottleneck on the demographic history and contemporary genetic structure of endangered Pinus uliginosa, Journal of Systematics and Evolution, 10.1111/jse.12573, 0, 0, (2020).
      • Genetic analysis of red deer (Cervus elaphus) administrative management units in a human-dominated landscape, Conservation Genetics, 10.1007/s10592-020-01248-8, (2020).
      • Landscape genetics of northern crested newt Triturus cristatus populations in a contrasting natural and human-impacted boreal forest, Conservation Genetics, 10.1007/s10592-020-01266-6, (2020).
      • Population Genetic Structure Is Unrelated to Shell Shape, Thickness and Organic Content in European Populations of the Soft-Shell Clam Mya Arenaria, Genes, 10.3390/genes11030298, 11, 3, (298), (2020).
      • Genetic signature of disease epizootic and reintroduction history in an endangered carnivore, Journal of Mammalogy, 10.1093/jmammal/gyaa043, (2020).
      • Restored river habitat provides a natural spawning area for a critically endangered landlocked Atlantic salmon population, PLOS ONE, 10.1371/journal.pone.0232723, 15, 5, (e0232723), (2020).
      • Identification of Quantitative Trait Loci for Altitude Adaptation of Tree Leaf Shape With Populus szechuanica in the Qinghai-Tibetan Plateau, Frontiers in Plant Science, 10.3389/fpls.2020.00632, 11, (2020).
      • Infection dynamics, dispersal, and adaptation: understanding the lack of recovery in a remnant frog population following a disease outbreak, Heredity, 10.1038/s41437-020-0324-x, (2020).
      • Restricted connectivity and population genetic fragility in a globally endangered Hammerhead Shark, Reviews in Fish Biology and Fisheries, 10.1007/s11160-020-09607-x, (2020).
      • Disagreement in FST estimators: A case study from sex chromosomes, Molecular Ecology Resources, 10.1111/1755-0998.13210, 0, 0, (2020).
      • Population Genomic Analyses of Wild and Farmed Striped Catfish Pangasianodon Hypophthalmus in the Lower Mekong River, Journal of Marine Science and Engineering, 10.3390/jmse8060471, 8, 6, (471), (2020).
      • Genetic structure of the endemic Papaver occidentale indicates survival and immigration in the Western Prealps, Alpine Botany, 10.1007/s00035-020-00238-3, (2020).
      • Insight into the Current Genetic Diversity and Population Structure of Domestic Reindeer (Rangifer tarandus) in Russia, Animals, 10.3390/ani10081309, 10, 8, (1309), (2020).
      • Genetic melting pot in blacklegged ticks at the northern edge of their expansion front, Journal of Heredity, 10.1093/jhered/esaa017, (2020).
      • Genetic Diversity of Historical and Modern Populations of Russian Cattle Breeds Revealed by Microsatellite Analysis, Genes, 10.3390/genes11080940, 11, 8, (940), (2020).
      • Population genomics reveals a mismatch between management and biological units in green abalone ( Haliotis fulgens ) , PeerJ, 10.7717/peerj.9722, 8, (e9722), (2020).
      • Temporal landscape genetic data indicate an ongoing disruption of gene flow in a relict bird species, Conservation Genetics, 10.1007/s10592-020-01253-x, (2020).
      • Genetic Structure of Invasive Baby’s Breath (Gypsophila paniculata L.) Populations in a Michigan Dune System, Plants, 10.3390/plants9091123, 9, 9, (1123), (2020).
      • Range reduction of Oblong Rocksnail, Leptoxis compacta , shapes riverscape genetic patterns , PeerJ, 10.7717/peerj.9789, 8, (e9789), (2020).
      • Using genetics to inform restoration and predict resilience in declining populations of a keystone marine sponge, Biodiversity and Conservation, 10.1007/s10531-020-01941-7, (2020).
      • Multi-targeted management of upland game birds at the agroecosystem interface in midwestern North America, PLOS ONE, 10.1371/journal.pone.0230735, 15, 4, (e0230735), (2020).
      • Phylogeography of the iconic Australian red-tailed black-cockatoo (Calyptorhynchus banksii) and implications for its conservation, Heredity, 10.1038/s41437-020-0315-y, (2020).
      • Genetic variability of blood groups in southern Brazil, Genetics and Molecular Biology, 10.1590/1678-4685-gmb-2018-0327, 43, 2, (2020).
      • Genetic structure of amphi-Atlantic Laminaria digitata (Laminariales, Phaeophyceae) reveals a unique range-edge gene pool and suggests post-glacial colonization of the NW Atlantic , European Journal of Phycology, 10.1080/09670262.2020.1750058, (1-12), (2020).
      • A GT‐seq panel for walleye (Sander vitreus) provides important insights for efficient development and implementation of amplicon panels in non‐model organisms, Molecular Ecology Resources, 10.1111/1755-0998.13226, 0, 0, (2020).
      • Perceptions of Similarity Can Mislead Provenancing Strategies—An Example from Five Co-Distributed Acacia Species, Diversity, 10.3390/d12080306, 12, 8, (306), (2020).
      • Genetic diversity, relatedness and inbreeding of ranched and fragmented Cape buffalo populations in southern Africa, PLOS ONE, 10.1371/journal.pone.0236717, 15, 8, (e0236717), (2020).
      • Microsatellite Diversity and Phylogenetic Relationships among East Eurasian Bos taurus Breeds with an Emphasis on Rare and Ancient Local Cattle, Animals, 10.3390/ani10091493, 10, 9, (1493), (2020).
      • Multi-level patterns of genetic structure and isolation by distance in the widespread plant Mimulus guttatus, Heredity, 10.1038/s41437-020-0335-7, (2020).
      • Microsatellite multiplex assay for sable (Martes zibellina) and pine marten (Martes martes), Mammal Research, 10.1007/s13364-020-00529-4, (2020).
      • Genetic diversity assessment of sorghum ( Sorghum bicolor (L.) Moench) landraces using SNP markers , South African Journal of Plant and Soil, 10.1080/02571862.2020.1736346, (1-7), (2020).
      • Supporting Fisheries Management With Genomic Tools: A Case Study of Kingklip (Genypterus capensis) Off Southern Africa, Frontiers in Marine Science, 10.3389/fmars.2020.557146, 7, (2020).
      • Variable clonality and genetic structure among disjunct populations of Banksia mimica, Conservation Genetics, 10.1007/s10592-020-01288-0, (2020).
      • The Influence of Environmental Variation on the Genetic Structure of a Poison Frog Distributed Across Continuous Amazonian Rainforest, Journal of Heredity, 10.1093/jhered/esaa034, (2020).
      • Seascape genetics of the stalked kelp Pterygophora californica and comparative population genetics in the Santa Barbara Channel, Journal of Phycology, 10.1111/jpy.12918, 56, 1, (110-120), (2019).
      • Genetic Management of Captive and Reintroduced Bilby Populations, The Journal of Wildlife Management, 10.1002/jwmg.21777, 84, 1, (20-32), (2019).
      • Genome‐wide SNPs resolve spatiotemporal patterns of connectivity within striped marlin (Kajikia audax), a broadly distributed and highly migratory pelagic species, Evolutionary Applications, 10.1111/eva.12892, 13, 4, (677-698), (2019).
      • Rapid and repeatable host plant shifts drive reproductive isolation following a recent human‐mediated introduction of the apple maggot fly, Rhagoletis pomonella, Evolution, 10.1111/evo.13882, 74, 1, (156-168), (2019).
      • Contrasting patterns of population structure at large and fine geographical scales in a migratory avian disturbance specialist of braided river ecosystems, Diversity and Distributions, 10.1111/ddi.12994, 26, 1, (16-33), (2019).
      • Genetic evidence of a northward range expansion in the eastern Bering Sea stock of Pacific cod, Evolutionary Applications, 10.1111/eva.12874, 13, 2, (362-375), (2019).
      • A population genomics appraisal suggests independent dispersals for bitter and sweet manioc in Brazilian Amazonia, Evolutionary Applications, 10.1111/eva.12873, 13, 2, (342-361), (2019).
      • Regional genetic structure of sandfish Holothuria (Metriatyla) scabra populations across the Philippine archipelago, Fisheries Research, 10.1016/j.fishres.2018.09.021, 209, (143-155), (2019).
      • Reproductive philopatry in a coastal shark drives age-related population structure, Marine Biology, 10.1007/s00227-019-3467-7, 166, 3, (2019).
      • Effective population size of the critically endangered east Australian grey nurse shark Carcharias taurus, Marine Ecology Progress Series, 10.3354/meps12850, 610, (137-148), (2019).
      • The genetic structure of flax illustrates environmental and anthropogenic selections that gave rise to its eco-geographical adaptation, Molecular Phylogenetics and Evolution, 10.1016/j.ympev.2019.04.010, (2019).
      • Genetic diversity and differentiation among provenances of Prosopis flexuosa DC (Leguminosae) in a progeny trial: Implications for arid land restoration, Forest Ecology and Management, 10.1016/j.foreco.2019.04.016, 443, (59-68), (2019).
      • Genome-wide patterns of population structure and association mapping of nut-related traits in Persian walnut populations from Iran using the Axiom J. regia 700K SNP array, Scientific Reports, 10.1038/s41598-019-42940-1, 9, 1, (2019).
      • Elucidating the Clusia criuva species ‘complex’: cryptic taxa can exhibit great genetic and geographical variation, Botanical Journal of the Linnean Society, 10.1093/botlinnean/boz004, 190, 1, (67-82), (2019).
      • Novel multimarker comparisons address the genetic population structure of silvertip sharks (Carcharhinus albimarginatus), Marine and Freshwater Research, 10.1071/MF18296, 70, 7, (1007), (2019).
      • Feral swine harming insular sea turtle reproduction: The origin, impacts, behavior and elimination of an invasive species, Acta Oecologica, 10.1016/j.actao.2019.103442, 99, (103442), (2019).
      • Early genetic outcomes of American black bear reintroductions in the Central Appalachians, USA, Ursus, 10.2192/URSU-D-18-00011.1, 29, 2, (119), (2019).
      • New guidance for ex situ gene conservation: Sampling realistic population systems and accounting for collection attrition, Biological Conservation, 10.1016/j.biocon.2019.04.013, 235, (199-208), (2019).
      • Exploring the genetic diversity within traditional Philippine pigmented Rice, Rice, 10.1186/s12284-019-0281-2, 12, 1, (2019).
      • See more