A new measure of ecological network structure based on node overlap and segregation
Summary
- Despite substantial recent progress, ecologists continue to search for methods of measuring the structure of ecological networks. Several studies have focused on nestedness, a pattern reflecting the tendency of network nodes to share interaction partners. Here, we introduce a new statistical procedure to measure both this kind of structure and the opposite one (i.e. species' tendency against sharing interacting partners) that we call ‘node segregation’. In addition, our procedure provides also a straightforward measure of modularity, that is, the tendency of a network to be compartmented into separated clusters of interacting nodes.
- This new analytical measure of network structure assesses the average deviation between the observed number of neighbours shared by any pair of nodes (species), and the expected number, that is computed using a probabilistic approach based on simple combinatorics. The measure can be applied to both bipartite networks (such as plant–pollinators) and unimode networks (such as food webs). We tested our approach on several sets of hypothetical and real‐world networks.
- We demonstrate that our approach makes it possible to identify different kinds of non‐random network configurations (nestedness, node segregation and modularity). In addition, we show that nestedness in ecological networks is less common than previously thought, and that most ecological networks (including the majority of mutualistic ones) tend towards patterns of segregated associations.
- Our analyses show that the new measure of node overlap and segregation can efficiently identify different structural patterns. The results of our analyses conducted on real networks highlight the need to carefully reconsider the assumption that ecological networks are stable due to their inherent nestedness.
Introduction
Representing species interactions in the form of networks has become a common approach to investigate the patterns of coexistence and community structure (Proulx, Promislow & Phillips 2005). Investigating ecological networks makes it possible to summarize complex patterns into simple statistics. This task is particularly compelling in a context where the challenge for ecologists is to progressively move from collecting large amounts of data to analysing them (Hampton et al. 2013).
Some properties of ecological networks can be inferred by focusing on individual nodes (or species). For example, one may randomly select species from a food web and count their number of prey and predators in order to obtain the network degree distribution. In this way, one would not have any information on the way nodes are connected one to another (i.e. who eats whom), but would still obtain insight into the mechanisms responsible for network structure (Newman 2003). Other measures require taking into account larger fragments of the network. Among these, clustering measures (such as the local, global and average clustering coefficients) are of great interest, because they focus on ‘triangles’ (i.e. two interacting nodes sharing a neighbour) (Watts & Strogatz 1998), which can represent many common real‐world interactions. For example, in a trophic web, triangles can indicate several different situations (see Stouffer 2010), such as two predators sharing a prey and also feeding on one another (e.g. intraguild predation; Polis, Myers & Holt 1989) or a parasite using two species that are predator and prey (Strona 2014).
Besides food webs, investigating patterns of node sharing have gained attention in the study of mutualistic networks (Bascompte et al. 2003; Johnson, Dominguez‐Garcia & Munoz 2013). Researchers have hypothesized that competition between two species (A and B) could be reduced if the number of shared mutualistic partners is maximized (Bastolla et al. 2009; Johnson, Dominguez‐Garcia & Munoz 2013). Typically, an incremental increase in the population of either species A or B would have a negative effect on the other species. However, if the two species share a mutualistic partner (species C), then growth of A could have a positive effect on C, that, in turn, could positively affect B thereby offsetting some of the direct negative effect of A on B. Extended to the whole network, this scenario implies that each node pair could benefit (i.e. have a reduced chance of extinction) from sharing as many neighbouring nodes as possible. This gives the network a nested structure; that is, the set of neighbours connected to any node is a subset of the set of neighbours connected to any node with higher degree (Atmar & Patterson 1993; Kondoh, Kato & Sakato 2010; Johnson, Dominguez‐Garcia & Munoz 2013; Staniczenko, Kopp & Allesina 2013).
Several authors have suggested that a nested structure can promote network stability and persistence (Bastolla et al. 2009; Thébault & Fontaine 2010). The coexistence of numerous interacting species might be obtained if the species are all part of a highly stable and primarily mutualistic network (Montoya, Pimm & Solé 2006). Although these hypotheses are mostly speculative, they have been used as the basis for a growing corpus of literature that is currently trying to explain why ecological networks are nested (Rohr, Saavedra & Bascompte 2014). Apart from nestedness, there are other network structures that could contribute to network stability. For example, the presence of trophically based subwebs in a food web might confer greater stability by isolating perturbations from the rest of the web (Kondoh, Kato & Sakato 2010).
However, several authors (see, for example, Joppa et al. 2010; Johnson, Dominguez‐Garcia & Munoz 2013) have shown that the commonness of nestedness in ecological networks could be in part due by methodological issues, which casts some shadows on the robustness of the observed patterns. Here, we provide a new powerful statistical procedure (applicable to both unimode and bipartite networks) that could help clarifying this issue, and we use it to re‐examine the structure of a large set of ecological networks. Following previous studies (Almeida‐Neto, Guimarães & Lewinsohn 2007; Fortuna et al. 2010), we suggest that nestedness represents only one potential outcome, since situations where nodes tend to share few neighbours (herein referred to as segregation), or where high node overlap can be identified within particular groups of nodes (modularity), also constitute meaningful ecological patterns (Podani & Schmera 2012; Podani, Jordán & Schmera 2014).
Materials and methods
It has been suggested that the average number of shared nodes is a natural measure of nestedness (Bascompte et al. 2003; Johnson, Dominguez‐Garcia & Munoz 2013). This is consistent with the original concept of nestedness (Patterson & Atmar 1986; Johnson, Dominguez‐Garcia & Munoz 2013; Staniczenko, Kopp & Allesina 2013) and with the formulation of one of the most popular nestedness measures, that is NODF (Almeida‐Neto et al. 2008); in a perfectly nested pattern, each pair of nodes always shares a number of neighbours equal to the degree of the less connected node (Fig. 1a). In other words, the species with lower degree is not linked to any species that the species with higher degree is not also linked to. In any network, there is a finite number of ways that nodes can be connected for a given number of links (connectance) among the nodes. As such, combinatorics can be used to calculate the expected amount of node sharing.

In the context of species co‐occurrence, Veech (2013) has shown how the expected number of shared localities between two species can be computed analytically using simple combinatorics. The probabilities given in Veech (2013) can also be obtained through the hypergeometric distribution as described recently in Griffith, Veech & Marsh (2015). Here, we extend the probabilistic species co‐occurrence model of Veech (2013) to network node sharing wherein a neighbour node is equivalent to a ‘site’, and the task is to determine whether the two focal nodes (or species) ‘co‐occur’ at or share a number of neighbours that is different from the random (probabilistic) expectation. This allows detection of cases where the number of neighbours shared by two nodes is significantly greater or less than expected (Veech 2013 and Appendix S1). Therefore, for individual species pairs, the probabilistic approach can distinguish significant node overlap from significant node segregation (Fig. 1b,c). Moreover, this is accomplished without having to create a randomized network for comparison. In most other approaches (see below) to measuring network structure, it is necessary to construct randomized networks against which to compare the observed network. Furthermore, in these other methods, the amount of node sharing (whether greater or less than expected) is only assessed at the level of the entire network, not individual species pairs. The probabilistic approach that we propose determines whether the observed amount of node overlap (sharing) between a pair of nodes could be due to chance alone (Veech 2014). The probabilistic model of species co‐occurrence (Veech 2013; Griffith, Veech & Marsh 2015) has a historical predecessor in the Raup–Crick metric of similarity (Raup & Crick 1979) in that both approaches use combinatorics to analytically derive exact probabilities. The main difference is that the Raup–Crick metric is applied to test for non‐random sharing of species among two assemblages (sampling localities), whereas the former tests for non‐random sharing of sites by a pair of species. Further, at the time, Raup & Crick (1979) did not recognize the link to the hypergeometric distribution, although their metric has since been linked to that sampling distribution (Chase et al. 2011).
Node overlap and segregation measure (
)
(eqn 1)
(eqn 2)We note that Equation 2 derives from Veech (2013) which provides a detailed mathematical (probabilistic) description for species co‐occurrence among sites, but not node sharing. Because Sij and Pij must be less than or equal to min(di, dj), ωij ranges from −1 to 1, but it is not symmetrical around zero in the sense that a given positive and negative magnitude (e.g. ωij = 0·5 and ωij = −0·5) do not represent an equivalent amount of node sharing and node segregation, respectively. In order to get symmetry and retain scaling from −1 to 1, each ωij is divided by a factor, Ωij, that is the maximum possible value that a given pair of nodes could have.
(eqn 3)
(eqn 4)
(eqn 5)
(eqn 6)Some examples of computation of Ɲij values are provided in Appendix S2. Ɲij can be computed for each pair of nodes in a network and then averaged to obtain an overall measure of node overlap
. Because Ɲij is constrained to be between −1 and 1,
also varies between −1 and 1.
values near −1 represent a network with less than expected node overlap, whereas
values near 1 represent greater than expected node overlap. The expected value of
is zero under a condition of no overall structure of node overlap in the network. A
value around zero can also occur when positive and negative Ɲij values cancel one another (which sometimes indicates a situation of network modularity, see next section) or when all Ɲij are near zero.
Computing
for directed networks
In a directed network, we can identify two categories of nodes, that is those having at least one incoming link (heads), and those having at least one outgoing link (tails). Thus, we can compute a separate Ɲij value for each set, by evaluating, respectively, the overlap in head nodes for any pair of tail nodes, and the overlap in tail nodes for any pair of head nodes. Obviously, this requires the n parameter in equations 2, 4 and 5 to be properly adjusted as, respectively, the size of the head set and the size of the tail set. For example, in a food web, Ɲij values are computed first focusing on the overlap in food items for each pair of consumers (setting n equal to the total number of food items), and then by focusing on the overlap in consumers for each pair of food items (setting n equal to the total number of consumers). Then, Ɲij values belonging to the two sets can be averaged together to compute
, or separately, providing two distinct measures of node overlap and segregation (
in and
out). Differences between
in and
out values (and in the respective distributions of Ɲij values) can provide interesting insights into the structure of directed networks, making it possible to detect asymmetric patterns of interactions.
According to the above criterion, node overlap is evaluated within heads and tails but not between them, ensuring, for example, that when evaluating the overlap in the consumers of a pair of plants, we do not include other plants in the computation of Ɲij. This derives directly from network structure, however, specific rules can be applied on a per‐case basis to improve ecological realism of
, by identifying a particular set of nodes to be used in the computation of each Ɲij value (adjusting n as the size of that set). In a trophic web, one may take into account trophic levels to discriminate between permitted and forbidden links, setting n for any Ɲij as the actual number of nodes that can be shared by Vi and Vj according to their trophic ecology. Similarly, one could decide to exclude cannibalistic interactions by excluding Vi and Vj from the computation of Ɲij, and thus replacing the parameter n with (n − 2) .
These kinds of adjustments are, in general, not needed for directed bipartite networks because the set of tails is disjoint from the set of heads, that is links are allowed between but not within two different, well‐distinguished node categories (e.g. plants and pollinators). However, adjustments can be made to the sets of nodes involved in the computation of each Ɲij value, by identifying all nodes that Vi and Vj can share according to a specific hypothesis. Since all of these possibilities, however, go well beyond the demonstrative scope of our analyses, we will demonstrate here the use of
in its basic formulation. Thus, for all the analyses on simulated and real bipartite networks, we will compute
as the average of all Ɲij values (calculated separately for tail and head nodes). For the analyses on food webs, we will compute
as the average of all Ɲij values (again, computed separately for tail and head nodes, that is food items and consumers), with the only exclusion of cannibalistic interactions (see previous paragraph).
Significance testing
Assessing significance of
is complicated by the fact that the distribution of Ɲij values is truncated, often non‐normal, and the values themselves are not completely independent of one another. However, we propose a test that is able to overcome these difficulties. First, the truncation and non‐normality of the Ɲij distribution can be overcome by invoking the central limit theorem (CLT), which has been recently demonstrated to be effective also for truncated distributions (Cha & Cho 2014). According to the CLT, the distribution of
as a random expectation (i.e. a set of
values from random networks) should be approximately normal and centred on zero regardless of the underlying form of each Ɲij distribution. Therefore, the significance of
can be assessed with the test statistic, z = (
− μ)/SE. However, because the Ɲij values are not independent, the standard error (SE) is underestimated and the true or effective sample size (n) cannot be known. Instead of using the typical SE = σ/√n, where n = the number of node pairs (or Ɲij values), we conservatively use n = 1. Therefore, our test statistic becomes ZƝ = (
− μ)/σ with μ = 0 and σ = the standard deviation of the observed
, with the P‐value derived from the standard normal distribution as usual. An observed network (or matrix) has non‐random structure if ZƝ is significantly different from zero.
Our test for significance of
is similar to the test typically used in nestedness analyses involving data randomization (e.g. Ulrich, Almeida‐Neto & Gotelli 2009; Strona & Fattorini 2014). In those analyses, a Z‐value is calculated as Z = [Nobs − mean(Nsim)]/σ(Nsim) where Nobs is the observed value of a nestedness metric and Nsim represent values from randomized or null species presence–absence matrices (or networks). In our approach to testing network structure, we do not have randomized networks but rather our test invokes the CLT. A key feature of our analytical approach is the desire to avoid data randomization, given that randomization is often sensitive to the particular algorithm (i.e. null model) employed (Gotelli 2000; Ulrich & Gotelli 2007; Strona & Fattorini 2014).
Modularity
As mentioned above,
values close to 0 could arise from different scenarios. A case of great interest is that of modularity; that is, a situation where a group(s) of nodes can be identified in a network where node overlap is substantially higher than expected between nodes belonging to the same group, and lower than expected between nodes belonging to different groups (Fig. 1d). Investigating the distribution of Ɲij values provides an immediate visual way to distinguish between random and modular networks having 
, as the first would have an unimodal distribution of Ɲij values with a peak centred at around 0, whereas the latter would have a bimodal distribution of Ɲij values, with a peak at near −1, arising from node segregation between different modules, and another peak near 1 arising from node overlap within modules.
The most straightforward way to quantify the bimodal pattern is evaluating the statistical dispersion of Ɲij values around the mean. Thus, we suggest the standard deviation of the Ɲij values as a simple measure of modularity (Mod). Although modularity measured in this way provides only a qualitative descriptor (i.e. it is not paired to a significance test), it has the convenient property of being mathematically related to the z‐test for the significance of
. That is, a large standard deviation might prevent an observed
value from being significant; nonetheless, the network might have structure in the form of modules even though the mean amount of node sharing throughout the entire network is not significant.
Test on simulated networks
We tested the ability of our approach to correctly identify non‐random patterns of network nestedness, segregation and modularity on a set of simulated networks with different amounts and types of structure. To allow comparisons with previous nestedness measures, we created all networks (1000 per type of structure) as bipartite.
To obtain different amounts of order in the simulated networks, we followed Strona & Fattorini (2014). For this, first we generated perfectly nested/segregated/modular networks of various sizes, and then we rewired edges at random, quantifying order based on the percentage of edges rewired. To create nested networks, we constrained the set of neighbours connected to any node to be a subsample of any other set of neighbours connected to a node having the same or a higher degree. To create segregated networks, we constrained each node to share no neighbouring node with any other node in the network (this also forced the in‐ and out‐degree of each node to be equal to 1). For both the nested and the segregated networks, we randomly varied the number of nodes between 20 and 100. To create a different amount of order, for each network, we reiterated r × E times the procedure of extracting two nodes at random and changing their status from connected to disconnected or vice versa. The parameter E indicates the total number of possible edges between all nodes in the network (which, in a bipartite network, corresponds to the size of the corresponding binary matrix). We let r vary randomly between 0 (maximum order) and 1 (maximum disorder), as we empirically observed that a number of randomization steps equal to E were enough to approximate the maximum perturbation status of the network (Strona et al. 2014).
For each of these nested/segregated random networks, we computed
and, additionally, we quantified nestedness using both NODF (Almeida‐Neto et al. 2008) and the spectral radius of the network adjacency matrix, ρ (Staniczenko, Kopp & Allesina 2013). To evaluate significance of NODF and ρ, we created two independent sets of null networks using different randomization rules. For ρ, we used the null model recommended by Staniczenko, Kopp & Allesina (2013), which creates randomly connected networks having the same number of tails, heads and edges of the original one. For NODF, we used the proportional null model proposed by Bascompte et al. (2003), which starts from the set of nodes of the original network and then connects two nodes with a probability proportional to their in‐ and out‐degree. In all cases, we quantified nestedness using both P and Z values (see Strona & Fattorini 2014). Finally, we performed a set of pairwise comparisons between the amount of network structure, network quantitative properties,
values and the different raw and standardized nestedness measures. To make clear the distinction between node overlap and segregation, instead of quantifying order using the parameter r (that ranges between 0, maximum order, and 1, maximum disorder), we transformed it to 1−r for nested networks and to r−1 for segregated networks. In this way, we obtained a symmetric measure of order ranging from −1 (complete segregation) to 1 (perfect nestedness), with 0 indicating randomness.
We also used simulated networks to conduct a power analysis of our test based on ZƝ. For a specified effect size, we calculated type I error probability as the number of
values that were less than the effect size but still statistically significant (α = 0·05) expressed as a proportion of all
values less than the effect size. Type II error probability was calculated as the number of
values that were greater than the effect size but not statistically significant and expressed as a proportion of all
values greater than the effect size. Power was calculated as 1 − P (Type II error).
To evaluate our measure of modularity, we created (bipartite) networks with different degrees of modularity by selecting a priori a random number of nodes (varying from 100 to 200) and a random number of modules (varying between 2 and 10). Then, we randomly assigned each node to a module. Finally, for each possible node pair within a module, with a probability P, we created an edge between the node pair, and with probability 1−p, we created an edge between two nodes picked at random from the whole node set. We let p vary between 0 (complete randomness) and 1 (maximum modularity). Finally, we assessed modularity in each network as
(Ɲij), and we compared each Mod value with the corresponding P.
Test on real networks
We measured
in a large set of real ecological networks featuring different kinds of interactions, and particularly on: (i) all the mutualistic networks available from Web of Life data set (www.web-of-life.es), including 59 plant/pollinator networks and 30 plant/seed disperser networks; (ii) two different sets of host/parasite networks, the first including the 50 largest networks compiled by grouping per country all the fish parasite records included in the Natural History Museum (London, UK) host/parasite data base (http://www.nhm.ac.uk), and the second including 26 mammal/flea networks provided by Krasnov et al. (2012) (available at http://datadryad.org/handle/10255/dryad.36193); (iii) all the community food webs (n = 179) available from the ECO‐WeB data base (Cohen 2010), as an example of application of
to unimode networks. It should be pointed out that many of the food webs in the last data set include ecological functional groups in place of, or together with taxonomic units. Consequently, any interpretation of results requires caveats, since the causes of node overlap and segregation are intimately related to node types, and should therefore be evaluated on a per‐case basis.
Results
Comparison with simulated networks
Our new measure,
, performed well in quantifying the amount of order or structure in the simulated networks. As the amount of order increased from −0·5 to −1,
rapidly approached −1 indicating increasing node segregation, while when order increased from 0·5 to 1,
rapidly approached 1 indicating increasing node overlap (Fig. 2a). As intended,
gradually declined to near zero for networks, in which an increasing number of connections among nodes was random. With 50% or more of the node links being random (order between −0·5 and 0·5),
was almost always between −0·1 and 0·1 (Fig. 2a) indicating that networks with a majority of random links have
values near zero. ZƝ and P‐values of
effectively distinguished between highly ordered and random networks (Fig. 2b,c). The proposed modularity measure (i.e. the standard deviation of Ɲij values) was positively correlated with modularity degree (Spearman's rho = 0·77, P < 0·0001) and made it possible to discriminate between modular and non‐modular networks (Fig. 3).

(a–c), ρ (d–f) and NODF (g–i).

The Z‐test had relatively good power (>0·7) when effect size was
> 0·7 and
< −0·8 for nested and segregated networks respectively (Fig. 4). Our Z‐test is inherently conservative in that we base the SE on an effective sample size of n = 1. However, the power can be increased by using any number between 1 and 3 (as a power adjustment factor) in the calculation of SE and subsequently ZƝ. Using a power adjustment factor of n = 3, the test had good power even for effect sizes as low as
> 0·5 and
< −0·5 (Fig. 4). In any test of significance, there is generally a trade‐off between power and type I error. As power increases, type I error also increases. This trade‐off was found for our Z‐test, although type I error probabilities generally remained at a reasonably low level typically not exceeding the specified significance level (Fig. 4). We also examined power adjustment = 5 (not shown in Fig. 4) and found that type I error probabilities became intolerably high.

ranging between −1 and 1 (effect size). Lines represent different power adjustment settings (see text for details).
Comparison between
and other nestedness measures
The spectral radii (ρ) computed for the set of simulated networks were poorly correlated with the corresponding amount of order (Fig. 2d). Conversely, comparison of NODF values with order of the simulated networks produced results similar to those obtained using
, that is NODF was able to detect quite well variation in network structure, both for nested (as in Strona & Fattorini 2014) and for segregated networks (Fig. 2g). However, the correlation between order and
was more tight than that between order and NODF, that is a given amount of structure was typically indexed by a smaller range in
values than in NODF values (cf. Fig. 2a,g).
Z‐ and P‐values of ρ performed better than the spectral radius in assessing network structure (Fig. 2e,f). Conversely, Z‐ and P‐values of NODF were less correlated with the amount of structure than the raw NODF value (Fig. 2h,i). In general, Z‐ and P‐values of
(Fig. 2b,c) were more efficient than those of ρ (Fig. 2e,f) and NODF (Fig. 2h,i) in measuring segregation.
Independence of
from network properties
The simulated networks were much variable in their structure and symmetry, with number of edges ranging between 10 and 1375 (mean = 246·6 ± 283·4), connectance ranging between 0·02 and 0·75 (mean = 0·28 ± 0·24) and symmetry (ratio between tails and heads) varying between 0·2 and 4·9 (mean = 0·95 ± 0·70). Neither
nor the other investigated nestedness measures (i.e. NODF and the spectral radius, ρ) were completely independent from network properties (Table 1). The number of network nodes was strongly correlated only with spectral radius ρ, but
, NODF and ρ were all correlated with the number of edges and network connectance, that is the fraction of existing edges over the possible total number. However, both edge number and connectance were strongly correlated with the amount of simulated order (with rs respectively equal to 0·751 and 0·752). Therefore, to ‘factor out’ this intermediary effect of order, we obtained the residuals from regressions of number of edges vs. order and connectance vs. order. When using the residuals, none of the measures were correlated with edge number and connectance (Table 1), with the exception of ρ, which was still moderately correlated with the number of edges (rs = 0·28).
, ρ, NODF and their respective Z values, and network properties measured in a set of 1000 nested and 1000 segregated simulated networks. Correlations for the residuals are indicated by ‘ε’
| Edges | Nodes | Connectance | ε (Edges vs. Order) | ε (Connectance vs. Order) | Order | |
|---|---|---|---|---|---|---|
![]() |
0·74 | 0·00 | 0·73 | −0·12 | −0·02 | 0·98 |
| ZƝ | 0·74 | −0·01 | 0·75 | −0·08 | 0·01 | 0·98 |
| ρ | 0·82 | 0·63 | 0·33 | 0·28 | −0·08 | 0·67 |
| Z (ρ) | 0·73 | 0·09 | 0·67 | 0·11 | 0·08 | 0·85 |
| NODF | 0·74 | −0·03 | 0·78 | −0·10 | 0·03 | 0·98 |
| Z (NODF) | 0·68 | 0·00 | 0·69 | 0·02 | 0·03 | 0·87 |
Comparison with real networks
We found that situations of overall node segregation and strong differences in overlap between node categories (e.g. plants vs. pollinators, host vs. parasites, etc.) are common in both mutualistic and antagonistic networks. In most of the investigated networks (72%), node pairs tended to share fewer neighbours than expected by chance, leading to negative
values (Fig. 5). However, among all the investigated ecological networks (353), only 44 showed significant structure (P < 0·05, with ZƝ computed using an adjustment factor n = 3), 33 of which had
. These findings contrast with the fact that, consistent with previous studies (Bascompte et al. 2003; Staniczenko, Kopp & Allesina 2013), most of these networks are significantly nested when examined with other procedures such as the spectral radius approach and NODF (see Table S1).

values computed on the different categories of ecological networks, and particularly plant–pollinator, seed–disperser and host–parasite networks and food webs.
As mentioned,
values near zero could indicate modularity, that is situations where many highly positive and negative Ɲij values cancel one another in the averaging procedure. We found that most networks (including those not showing strong patterns as revealed by
) were highly modular (average Mod = 0·78 ± 0·18), thus indicating that overall randomness is indeed rare in ecological networks.
Discussion
By using a new approach to measuring network structure, our results indicate that nestedness is probably not as widespread as currently thought. Among the network typologies we investigated, patterns of substantial node overlap were moderately common only in seed–disperser networks (Fig. 5). This is consistent with the fact that most granivores and frugivores (particularly mammals and birds) tend to be dietary generalists. (Wheelwright 1985). In the other kinds of networks, including the plant–pollinator ones, we found an overall tendency towards node segregation (i.e. against sharing interacting partners), which supports the idea that specialization may be the key to species coexistence (Pauw 2013).
Investigating how the taxonomic, biogeographical and ecological relatedness between species pairs affects their respective Ɲij values could provide fundamental insights into these issues, possibly leading to a better understanding of the determinants of ecological network structure. Although this kind of detailed analysis was far beyond the scope of this paper, we hope that this approach will be pursued by future researchers.
Our findings are consistent with the possibility that nested networks are no more likely to be stable than are unstructured ones, and that the strength of interactions may play a fundamental role in system stability (Allesina & Tang 2014). Thus, the controversial notion that nestedness promotes the stability of ecological networks (Bastolla et al. 2009; Thébault & Fontaine 2010; James, Pitchford & Plank 2012; Rohr, Saavedra & Bascompte 2014) should not be taken without questioning. Although our study is not the first casting doubt on the ubiquity of nestedness (see Joppa et al. 2010; Johnson, Dominguez‐Garcia & Munoz 2013), it is the first providing analytical evidence that most species tend to share fewer partners than expected by chance.
By offering a synthetic measure capable of simultaneously quantifying node overlap and segregation,
provides various advances to the field of ecological network analysis. A fundamental aspect of our method is that the deviation (positive or negative) between the observed and the expected overlap for any pair of nodes in a network is computed by using a probabilistic approach based on combinatorics (as in Veech 2013). This makes it possible to obtain analytically the expected number of shared partners between two nodes while controlling their respective number of edges and the total number of nodes in the network. In this sense, our approach is substantially different from other nestedness measures that assess the deviation between expected and observed patterns of node overlap a posteriori (using null model analysis) and only at the level of the whole network (Ulrich & Gotelli 2007).
We have proposed
as a unique measure to assess the overall tendency of a network towards nestedness or segregation. Although we have shown with our simulations that this measure is a faithful representation of the amount of structure of a network, it should be emphasized that the core information provided by our analytical approach lays in the distribution of Ɲij values that can offer a detailed picture of how the tendency towards nestedness and/or segregation is distributed across the network. Researchers can rely on
or on the distribution of Ɲij values (or both) for their analysis depending on their particular purposes. Moreover, substantial information can be obtained by comparing the distributions of expected and observed node overlap values; we hope this will be investigated in more detail in future studies of real‐world networks.
Similarly to NODF and spectral radius, our measure was not completely independent from connectance, with weakly connected networks tending towards low
values, due to the fact that networks of this kind are constrained towards a segregated edge configuration. The effect of connectance on nestedness measures is often considered undesirable. A possible solution to this issue is using null model analysis, that is comparing the observed nestedness with that simulated in a set of null networks having the same (or comparable) connectance of the target network (Ulrich & Gotelli 2007; Almeida‐Neto et al. 2008; Staniczenko, Kopp & Allesina 2013).
Our approach, in principle, does not prevent the use of data randomization that is common to most null models; it is permissible to compare
of a given network to that of any other network built according to any criteria. Nevertheless, we have provided an alternative test of significance that has sufficient power, and that is free from the potential issues related to traditional null model approaches and data randomization (Gotelli 2000; Ulrich & Gotelli 2007; Strona & Fattorini 2014). However, we are confident that users might find even better ways to use Ɲij values (as well as the distributions of observed and expected node overlap values) to analyse ecological networks. Our hope is that the probabilistic approach will lead to a more thorough and comprehensive analysis and thereby a more in‐depth understanding of ecological network structure.
Acknowledgements
We are indebted with four anonymous Reviewers for their useful comments and suggestions on a previous version of the manuscript.
Data accessibility
Mutualistic networks are available from Web of Life data set (www.web-of-life.es). Data necessary to build fish host/parasite networks grouped by country can be obtained from the Natural History Museum (London, UK) host/parasite data base (http://www.nhm.ac.uk). The 26 mammal/flea networks provided by Krasnov et al. (2012) are available at http://datadryad.org/handle/10255/dryad.36193. All the food webs used in this study are included in the ECO‐WeB data base (Cohen 2010), which can be downloaded at http://hdl.handle.net/10209/306.
References
Citing Literature
Number of times cited according to CrossRef: 21
- Kristiina Mark, Lauri Laanisto, C. Guillermo Bueno, Ülo Niinemets, Christine Keller, Christoph Scheidegger, Contrasting co‐occurrence patterns of photobiont and cystobasidiomycete yeast associated with common epiphytic lichen species, New Phytologist, 10.1111/nph.16475, 227, 5, (1362-1375), (2020).
- Nicholas M. Fountain‐Jones, Craig Packer, Maude Jacquot, F. Guillaume Blanchet, Karen Terio, Meggan E. Craft, Endemic infection can shape exposure to novel pathogens: Pathogen co‐occurrence networks in the Serengeti lions, Ecology Letters, 10.1111/ele.13250, 22, 6, (904-913), (2019).
- Colton Brehm, Astrid Layton, Designing eco-industrial parks in a nested structure to mimic mutualistic ecological networks, Procedia CIRP, 10.1016/j.procir.2018.12.011, 80, (590-595), (2019).
- M. J. Palazzi, J. Borge-Holthoefer, C. J. Tessone, A. Solé-Ribalta, Macro- and mesoscale pattern interdependencies in complex networks, Journal of The Royal Society Interface, 10.1098/rsif.2019.0553, 16, 159, (20190553), (2019).
- Matthew J. Michalska-Smith, Stefano Allesina, Telling ecological networks apart by their structure: A computational challenge, PLOS Computational Biology, 10.1371/journal.pcbi.1007076, 15, 6, (e1007076), (2019).
- Daniela N. López, Patricio A. Camus, Nelson Valdivia, Sergio A. Estay, Integrating species and interactions into similarity metrics: a graph theory-based approach to understanding community similarity, PeerJ, 10.7717/peerj.7013, 7, (e7013), (2019).
- Simone Cenci, Chuliang Song, Serguei Saavedra, Rethinking the importance of the structure of ecological networks under an environment‐dependent framework, Ecology and Evolution, 10.1002/ece3.4252, 8, 14, (6852-6859), (2018).
- Corrie Jacobien Carstens, Annabell Berger, Giovanni Strona, A unifying framework for fast randomization of ecological networks with fixed (node) degrees, MethodsX, 10.1016/j.mex.2018.06.018, 5, (773-780), (2018).
- Daniel J. McGarvey, Joseph A. Veech, Modular structure in fish co-occurrence networks: A comparison across spatial scales and grouping methodologies, PLOS ONE, 10.1371/journal.pone.0208720, 13, 12, (e0208720), (2018).
- Giovanni Strona, Werner Ulrich, Nicholas J. Gotelli, Bi‐dimensional null model analysis of presence‐absence binary matrices, Ecology, 10.1002/ecy.2043, 99, 1, (103-115), (2017).
- Giovanni Strona, Thomas J. Matthews, Susanne Kortsch, Joseph A. Veech, NOS: a software suite to compute node overlap and segregation in ecological networks, Ecography, 10.1111/ecog.03447, 41, 3, (558-566), (2017).
- Giovanni Strona, Joseph A. Veech, Forbidden versus permitted interactions: Disentangling processes from patterns in ecological network analysis, Ecology and Evolution, 10.1002/ece3.3102, 7, 14, (5476-5481), (2017).
- Chuliang Song, Rudolf P. Rohr, Serguei Saavedra, Why are some plant–pollinator networks more nested than others?, Journal of Animal Ecology, 10.1111/1365-2656.12749, 86, 6, (1417-1424), (2017).
- Mauricio Cantor, Mathias M. Pires, Flavia M. D. Marquitti, Rafael L. G. Raimundo, Esther Sebastián-González, Patricia P. Coltri, S. Ivan Perez, Diego R. Barneche, Débora Y. C. Brandt, Kelly Nunes, Fábio G. Daura-Jorge, Sergio R. Floeter, Paulo R. Guimarães, Nestedness across biological scales, PLOS ONE, 10.1371/journal.pone.0171691, 12, 2, (e0171691), (2017).
- Elisavet Georgopoulou, Thomas A. Neubauer, Giovanni Strona, Andreas Kroh, Oleg Mandic, Mathias Harzhauser, Beginning of a new age: How did freshwater gastropods respond to the Quaternary climate change in Europe?, Quaternary Science Reviews, 10.1016/j.quascirev.2016.07.034, 149, (269-278), (2016).
- Youhua Chen, Newly proposed node segregation index is not suitable for analyzing unimode food webs, Ecological Complexity, 10.1016/j.ecocom.2016.04.001, 26, (89-94), (2016).
- Giovanni Strona, Kevin D. Lafferty, Environmental change makes robust ecological networks fragile, Nature Communications, 10.1038/ncomms12462, 7, 1, (2016).
- Yve Eligiêr Alves Gadelha, Wesley Dáttilo, Olivia Evangelista, Benedito Cortês Lopes, Structure of mutualistic ant–treehopper interactions in the Brazilian Atlantic Forest, Journal of Tropical Ecology, 10.1017/S0266467416000183, 32, 3, (250-259), (2016).
- Giovanni Strona, Achille Mauri, Joseph A. Veech, Günther Seufert, Jesus San-Miguel Ayanz, Simone Fattorini, Far from Naturalness: How Much Does Spatial Ecological Structure of European Tree Assemblages Depart from Potential Natural Vegetation?, PLOS ONE, 10.1371/journal.pone.0165178, 11, 12, (e0165178), (2016).
- Giovanni Strona, Past, present and future of host–parasite co-extinctions, International Journal for Parasitology: Parasites and Wildlife, 10.1016/j.ijppaw.2015.08.007, 4, 3, (431-441), (2015).
- Colton Brehm, Astrid Layton, Nestedness of eco‐industrial networks: Exploring linkage distribution to promote sustainable industrial growth, Journal of Industrial Ecology, 10.1111/jiec.13057, 0, 0, (undefined).




, ρ, NODF (and their respective Z and P values), Mod and quantitative properties of all the ecological networks included in the analyses.

