A generic structure for plant trait databases
Correspondence site: http://www.respond2articles.com/MEE/
Summary
1. Plant traits are fundamental for understanding and predicting vegetation responses to global changes, and they provide a promising basis towards a more quantitative and predictive approach to ecology. As a consequence, information on plant traits is rapidly accumulating, and there is a growing need for efficient database tools that enable the assembly and synthesis of trait data.
2. Plant traits are highly heterogeneous, exhibit a low degree of standardization and are linked and interdependent at various levels of biological organization: tissue, organ, plant and population. Therefore, they often require ancillary data for interpretation, including descriptors of the biotic and abiotic environment, methods and taxonomic relationships.
3. We introduce a generic database structure that is tailored to accommodate plant trait complexity and is consistent with current theoretical approaches to characterize the structure of observational data. The over‐arching utility of the proposed database structure is illustrated based on two independent plant trait database projects.
4. The generic database structure proposed here is meant to serve as a flexible blueprint for future plant trait databases, improving data discovery, and ensuring compatibility among them.
Introduction
There is a critical need for integrated analyses in ecology to better understand and manage Earth’s biological resources (Clark et al. 2001). This raises significant challenges in accessing relevant data, including the development of global data information systems (Scholes et al. 2008), of which integrated plant trait databases must be a keystone. Plant traits – morphological, anatomical, physiological or phenological features measurable at the individual level (Violle et al. 2007) – reflect the outcome of evolutionary processes in the context of abiotic and biotic environmental constraints (Grime et al. 1997; Westoby et al. 2002; Díaz et al. 2004; Valladares, Gianoli & Gomez 2007). Information on a set of traits may therefore be a more objective predictor of ecosystem dynamics and functioning than, for example, species identity or functional group classification (McGill et al. 2006).
Plant trait data have been used in studies covering a diversity of topics, including plant functional ecology (Wright et al. 2004; Reich, Wright & Lusk 2007; Sperry, 2008), community ecology (Lavorel & Garnier 2002; Ackerly & Cornwell 2007; Messier, McGill & Lechowicz 2010), plant evolution (Moles et al. 2005; Cavender‐Bares et al. 2009), macroecological theory (Enquist et al. 2007), palaeobiology (Barboni et al. 2004; Royer et al. 2007), disturbance ecology (Wirth 2005; Diaz et al. 2007), plant migration and invasion (Schurr et al. 2005; Tackenberg & Stocklin 2008), conservation biology (Kahmen, Poschlod & Schreiber 2002) and – more recently – plant geography (Swenson & Enquist 2007; Swenson & Weiser in press). Plant trait data are also critical for parameterizing vegetation characteristics in models of ecosystem dynamics (White et al. 2000; Kattge et al. 2009) and individual‐based models of plant growth and mortality (Ogle & Pacala 2009; Wirth & Lichstein 2009). Furthermore, plant traits provide complementary information to earth observations like remote sensing (Ollinger et al. 2008), measurements of atmospheric CO2 fluxes and concentrations (e.g. FluxNet11
FluxNet: http://daac.ornl.gov/FLUXNET/index.cfm.
and GlobalView22
Globalview: http://www.esrl.noaa.gov/gmd/ccgg/globalview.
), forest inventories (e.g. the US Forest Inventory33
US Forest Inventory and Analysis National Program: http://www.fia.fs.fed.us.
), and global biodiversity assessments (Scholes et al. 2008).
The scientific focus determines the database requirements: the structure of a database should be as complex as necessary but as simple as possible, because the data compilation itself tends to introduce its own level of complexity. Major compilations of plant trait data have been, and are still being developed for different purposes: to characterize the species–trait matrix for specific geographical regions or selected traits, to compile trait measurements in the context of their environmental or experimental conditions, or to analyse correlations between a limited number of different traits accounting for intraspecific variability in an environmental context (for references see Table 1). Data compilations to characterize species–trait matrices compile one to several trait values per species and trait, but do not need to characterize the intraspecific variability in an environmental context in detail, as they are focussed on the species’ mean state. Compilations focussed on trait measurements in the context of environmental or experimental information account for intra‐ and interspecific variability, but they do not necessarily support the correlation analysis between different traits, as they only allow indirect deducing if different measurements had been conducted on the same object from information about the environmental/experimental context (location, plot, experimental treatment, individual, etc.). This indirect identification easily becomes ambiguous and may complicate analyses of trait correlations (see Appendix S1). Data compilations that account for the fact that different traits have been measured on the same object are often two‐dimensional spreadsheets (tables) with traits and additional information (ancillary data) in separate columns and observations in rows. These two‐dimensional spreadsheets are convenient for the compilation of a limited number of traits and ancillary data, but they become increasingly inconvenient as the number of traits and/or ancillary variables increase (Madin et al. 2007). This limitation leads to researchers often only including a limited amount of ancillary information and important details of the data collection reside in written notes or the researcher’s memory (Michener et al. 1997).
In the past few years, the focus of plant trait research included the characterization of ecological strategies and biodiversity in a quantitative, functional context as a function of the environment and phylogenetic constraints, which provide conditions for a more quantitative and predictive community ecology (McGill et al. 2006) and for a more realistic representation of vegetation in earth system models (Lavorel et al. 2007). This requires the compilation of several traits in combination with a detailed characterization of their intra‐ and interspecific variability, trait–trait correlations, abiotic and biotic environment, and phylogeny. Statistical tools and modelling concepts to accomplish these demands are being developed with strong momentum, e.g. the adaptation of hierarchical Bayesian concepts to ecological analyses (Ogle & Barber 2008), or the implementation of adaptive strategies to vegetation modelling (Scheiter & Higgins 2009), while dedicated structures for plant trait databases have not been published so far, and ecological researchers had to develop them for each individual application.
The need for integrated analyses poses another requirement for the development of database structures: interoperability – the seamless merging of disparate information from different databases into an integrated form for analysis. Here, ontologies, formal models that use mathematical logics to define concepts and their relationships (Madin et al. 2008), provide a mechanism for creating a common computer‐interpretable basis for interoperability. Ontologies have been successfully used in genetics, microbiology, and medicine, especially when used for interoperability among different databases and organizations. In ecology, where data structures are more complex, framework ontologies have only recently been laid out, which show considerable overlap (Madin et al. 2008). Consistency with these ontology schemata should ensure compatibility between different datasets and therefore be another requirement for future trait databases. However, simultaneously accounting for the various aspects of plant traits and the need for database interoperability poses a new challenge to the development of plant trait databases.
In the following, we analyse the structure of plant trait data to derive the key requirements for the development of plant trait databases (‘entity‐relationship analysis’, Chen 1976). Based on this we propose a generic structure, which is supposed to serve as a blueprint to help ecologists develop plant trait databases according to their individual needs. Finally, we illustrate consistency with current framework ontologies and improvements in access and interoperability of disparate plant trait data by two recently developed trait databases using the generic database structure presented here.
Entity‐relationship analysis of plant traits
Plant traits are a heterogeneous group of data with low degree of standardization
The term ‘plant trait’ is used in a wide range of contexts and there have been numerous attempts to define, conceptualize and categorize plant traits. Violle et al. (2007) propose a general definition of plant traits: ‘Any morphological, physiological or phenological feature measurable at the individual level, from the cell to the whole‐organism level,…’ According to this definition, plant traits may characterize anything from a leaf dark respiration rate to a maximum plant height, referring to an organ vs. the whole plant, measured with different instruments (e.g. gas exchange instrument vs. hypsometer), reported in different units (e.g. μmol CO2 m2 s−1 vs. m), and with different modes of standardization (e.g. at 25 °C in the dark vs. mean of tallest 5% of individuals ever measured), and so on. There have been substantial efforts to develop standardized trait definitions and measurement protocols, e.g. Cornelissen et al. (2003) and in the context of the LEDA project (Knevel et al. 2003; Kleyer et al. 2008), but these standards are only available for a limited set of traits, are often ignored, and do not help when it comes to the vast literature dating prior to 2003. Some traits cannot be measured but need to be estimated by fitting models to raw data (parameter‐based traits, e.g. Vcmax,25, the maximum carboxylation rate at 25 °C or parameters defining seed dispersal kernels). In summary, plant traits can be seen as a heterogeneous group of information with relatively low degree of standardization (Table 2).
| Definition | Example | Typical sources | Application | |
|---|---|---|---|---|
| Categorical – phylogenetic | Taxonomical descriptor | SpeciesGenus | FlorasSpecies monographs | Grouping variableRandom or fixed factor in mixed effects model |
| Categorical – nominal | Qualitative feature that cannot be expressed numerically | Photosynthetic pathwayComposite leavesDeciduousWeedAnimal dispersed | FlorasSpecies monographsPhysiological literatureTrait databases | GroupingDefinition of plant functional types |
| Categorical – ordinal | Qualitative feature that describes a performance intensity relative to other plants | Shade toleranceFrost toleranceResprouting capacity | Literature on plant indicator valuesSpecialized databasesApplied sciences (forestry, agriculture) | GroupingClassificationCovariateDefinition of plant functional types |
| Categorical – environment | Qualitative feature describing a plant’s growth environment | Slope exposureGrowth chamberSoil classification | Vegetation descriptionsOriginal publications | FilteringClassificationRandom factor in mixed effects model |
| Quantitative – environment | Quantitative measure describing a plant’s growth environment | Annual precipitationFertilizer applicationClay contentLAI of community | Original publicationsClimate/Soil databasesExperimental setup | CovariateDriver in model inversion |
| Quantitative plant state | Quantity characterizing the (often transient) state of a plant | HeightGrowth rateSpecific leaf area | Original publicationsInventories | CovariateModel validation |
| Quantitative plant trait | Quantity characterizing a typical (non‐transient) feature of a taxonomic unit (e.g. family, species) | Maximum heightSpecific leaf area of sun leavesStem wood density | Original publicationTrait databases | Dependent variableCovariateModel parameter |
| Parameter‐based trait | Statistical parameter of a function relating a plant state to an environmental variable | V cmax Rate constant of leaf litter decompositionQ10 of leaf respiration | Original publicationsMeta‐analyses | Parameter in process model |
Variability as signal and noise in comparative trait analysis: the need for ancillary data
Understanding and quantifying the variability of plant traits at the species or functional group level in response to changes in environmental drivers are of utmost importance in global change research and evolutionary ecology. Traits that are treated as qualitative, like leaf habit (e.g. needle‐leaved, broad‐leaved) or wood porosity type (e.g. ring‐porous, diffuse‐porous) are often nearly invariant within species, even though in some cases they are much more variable than studies suggest (e.g. flower colour, dispersal mode). Traits that are treated as quantitative, like leaf mass per area or leaf dry matter content vary substantially between and within individuals of a species (Messier, McGill & Lechowicz 2010). This intraspecific trait variability can optimize plant performance and fitness in response to abiotic and biotic constraints and is the consequence of genetic variation and phenotypic plasticity, the latter being the environmentally contingent trait expression of a given genotype (Fig. 1). The capacity for expressing trait variability may differ between species and developmental stages and may be constrained by trade‐offs between different traits (Valladares, Gianoli & Gomez 2007). This variability may be the signal of interest, but may also represent unwanted ‘noise’ in a comparative analysis (Fig. 1).

Different levels of variability of plant traits and statistical treatment. No variability: qualitative traits invariable at the respective level of phylogeny (e.g. leaf habit at species‐level); each species i is assigned its trait value θi. Low variability – quantitative traits with a low degree of variability (e.g. lignin content of bark): calculate the mean trait value
from several measurements for species i. High variability – quantitative traits with a high degree of variability (e.g. photosynthetic capacity): model the plastic response in relation to factors that affect it (cf. ‘covariates’); the result, θi*, is the standardized (predicted) trait value at a reference state of the covariates. Finally, a comparative analysis requires phylogenetic (or taxonomic) information as a predictor or grouping variable. Frequency indicates the relative occurrence of the different types of traits: most traits are characterized by a high variability and only a few show no variability.
For qualitative traits, which are almost invariant at the species‐level (e.g. wood porosity type), we can assign each species i its unambiguous trait value θi. For quantitative traits with a low degree of variability (e.g. density of cell walls), it is often sufficient to calculate the mean trait value
if several measurements per species are available. In this case
is a stochastic (or uncertain) quantity, and this uncertainty should be propagated into a comparative analysis. However, the majority of quantitative traits, including many of the most relevant traits for ecosystem functioning, are strongly modulated by the environment. In this case, the absence of quantifying the variability in the trait substantially reduces the information content in the data and can lead to inference problems (e.g. ecological fallacy, Robinson 1950). Using the mean value will yield an unrealistic point estimate contingent to the particular individuals and localities that happen to be represented in the dataset (Albert et al. in press). Here, we need to model the variable trait in response to ‘driving variables’ (covariates). For example, estimates of maximum photosynthesis rates should account for the influence of irradiance, temperature, water availability, and air humidity during the measurement period as well as plant and leaf developmental stage (Kattge & Knorr 2007). The result is a standardized rate θi*, i.e. a predicted rate at some reference state of the covariates (e.g. maximum photosynthesis at light saturation and 25 °C), coefficients that quantify the influence of the respective covariates (variability coefficients), and measures of their uncertainty and covariance. Comparing between species requires a standardization of traits, which in turn requires ancillary information about the environmental context, the type and amount of which depends on the trait and its inherent variability.
Moreover, ancillary data are often used for filtering, classification, and for accounting for data heterogeneity (Table 2). For example, filter variables may be used to tailor the dataset to the research questions. An analysis only interested in the traits of mature trees under field conditions might filter the data by ‘tree age’ and ‘growth environment’. Ecological data are inherently stochastic and often structured by processes (e.g. ‘random effects’) that do not necessarily reflect the impacts of treatments or covariates of interest. For example, in a study on specific leaf area, the data may come from 15 different laboratories and these laboratories may have employed slightly different methods. In this case, it is not desirable to estimate separate parameters for each laboratory, and yet we must account for this extra variability to obtain accurate parameter estimates for the quantities of interest. Thus, providing information on laboratories or research groups allows one to treat them as random effects. It should be noted that traits might themselves be used for filtering, classification and grouping of another trait. For example, the variable ‘photosynthetic pathway’ may be used as a grouping variable in a study on leaf nitrogen concentrations or as a response variable in a study on the distribution of photosynthetic pathways.
Trait correlations and the identity principle
Correlations between traits are important as they often reflect optimality principles (Wright et al. 2004; Reich et al. 2006, 2008) and allow us to draw conclusions about the underlying evolutionary forces (Reich et al. 2003). Modellers employ them to exclude non‐reasonable parameter combinations (Moorcroft, Hurtt & Pacala 2001) and for improved parameterization (Kattge et al. 2009). Most traits involved in these correlations and trade‐offs vary from organ to organ, from individual to individual, and they covary over time in response to ontogenetic processes and environmental fluctuations. Correlations will appear progressively weaker if the different traits have been measured at different points in time or on different organs of the same individual, different individuals of the same population, etc. (Fig. 2). Keeping track of the identities of the objects and the temporal context associated with a trait measurement (identity principle) within a database is therefore important if one wishes to control for the ‘degree of relatedness’ when analysing correlations.

Representation of the identity principle and hierarchical structure. T, vector of traits. The example traits are specific leaf area (SLA), nitrogen concentration (N%), maximum photosynthesis rate (Amax), and wood density (ρ). C, vectors of covariates (or ancillary data). Covariates either characterize trait measurements directly (trait covariates, Ct) or the hierarchical context of trait measurements, i.e. organ covariates (Co), individual tree (or plant) covariates (Ci), stand covariates (Cs), location covariates (Cl) and time covariates (not shown). This scheme allows the measured objects to be unambiguously characterized and various types of identities specified (examples in italics).
Hierarchical structure
The objects from which traits are measured are often nested in an observational hierarchy (Madin et al. 2007; Messier, McGill, & Lechowicz 2010). Each level of the hierarchy can be characterized by a unique set of traits and ancillary data that provide important context for other levels in the hierarchy (Fig. 2). An example is the hierarchy of ecological organization: the leaf is part of a branch, characterized by its length and particular position in the tree canopy; the branch is part of a tree, which may be healthy or damaged, young or old, and so on; the tree is part of a stand, which could be, for example, an uneven‐aged mixed or mono‐specific even‐aged stand. The stand is part of a larger landscape unit, which may be characterized by shallow or deep soils. Since all the levels of this ecological hierarchy are connected, the influence of landscape (or site) conditions propagates along the hierarchy from the level of the stand to the individual leaf and cells.
Trait information is only as good as the respective taxonomic information
Finally, the trait information is only as good as the taxonomic information. This poses an additional challenge to the development of a database structure. One problem that is very difficult to resolve is the mis‐identification of species. Here, the collection of specimen of each studied individual may help and the plant trait database should facilitate the link from species name to the respective specimen. Additional problems are different species concepts used by different floras, the synonymy of plant names, and the ongoing development and updating of species names and the deep taxonomy (Berendsohn & Geoffroy 2007). Assuming a good representation according to the current taxonomic concepts, what happens to the database 6 months, 2 years, a decade from now, when many of those species have been lumped, split, renamed, synonymized, etc.? Names are not static. The generic database structure cannot solve these problems, but it has to provide the respective concepts to enable the ecologist to treat these problems appropriately, e.g. by introducing a versioning system and facilitating links to specimen compilations.
A generic structure for plant trait databases
As a consequence of the above arguments, a plant trait database needs to provide the appropriate structure to (i) characterize each trait entry in detail, which is necessary due to the heterogeneity and relatively little standardization of plant traits, and (ii) place it in its specific biotic and abiotic context, accommodating ancillary data, the degree of relatedness of different measurements, inherent hierarchical structures and taxonomic specifications.
Characterizing trait and ancillary data as measurements on specific objects
Despite their heterogeneity, all plant trait data can be characterized as being measurable characteristics of specific objects: e.g. the length of a leaf or the height of a plant (cf. Madin et al. 2007). This is also true for ancillary data, like latitude and longitude of a location or the name of the person that has conducted the measurements. In this context, even the taxonomic classification can be addressed as measurable characteristics of specific objects: e.g. ‘Quercus robur L.’ is the binomial expression of the characteristic ‘species’, like ‘tree’ is an expression of the categorical trait ‘growth form’. In terms of data structure, there is hence no principle difference between trait data and ancillary data, including the taxonomic specification, and we propose to treat them identically as measurements of specific objects (cf. Madin et al. 2007).
Measurements are aggregated to observations
All measurements that have been taken on the same object for the same time are directly related to each other. We consider this aspect of different measurements being ‘related to the same object and time’ as the most important relationship among traits and between traits and ancillary data (the identity principle). We therefore propose to directly keep track of this relationship in the database and link all individual measurements taken at the same time on the same object to a unique ‘observation’ identifier.
In accordance with the hierarchical structure of traits and ancillary data we propose observations to be hierarchically nested, and influences on a higher level of the hierarchy, like stand, are propagated along the hierarchy to the lower levels, like individual leaf and cells (Fig. 2). Due to this hierarchically nested structure, different observations provide context for each other, and thus facilitate the comprehensive description of abiotic and biotic environmental conditions.
The database structure
The enfolding database structure is characterized by two key aspects: ‘measurement’ and ‘observation’ (Fig. 3). Measurement integrates all information directly related to a specific measurement, like name of trait or ancillary data, measurement standard, value, unit and precision. Relating all of this information to a measurement facilitates the detailed characterization of each database entry of a trait or ancillary data. The aggregation of different measurements to observations facilitates the realization of the most important relationship between traits and ancillary data: ‘being related to the same object in time’. Finally, observations are hierarchically nested, which facilitates the comprehensive characterization of the abiotic and biotic context of each measurement, accounting for the degree of relatedness.

Core tables (boxes) and relationships (connectors) of the proposed generic structure for plant trait databases: a dimensional data model realized in a relational database. ‘Observation’ is the central table of this conceptual framework, indicated by the 1:n (one to many) relationship between Observation and Measurement table: each observation can be characterized by measurements in n dimensions (traits and ancillary data). Each measurement is characterized by a value, precision (if appropriate), characteristic, and a measurement standard. Traits and ancillary data are defined in the Characteristic table. Measurements on the same object in time are aggregated to an observation. Observations are embedded in a hierarchy of observations on different levels, which is realized via the link within the observation table. Each Observation is related to a real world object, here called entity. No specification into traits, ancillary data or taxonomy here. In this generic representation of the database structure, all of these data are identically treated as measurable characteristics of specific objects. Bold italics: primary key (uniquely identifies each row in a table); italics: foreign keys (a key stored in one table which refers to a primary key in another table, used to establish a relationship between two tables); plain text: data entries.
Due to the aggregation of different measurements to observations and their hierarchical nesting, observation becomes the central element of the database structure. Such a centralized structure is called a ‘dimensional’ database, because the central element is characterized by attributes in different dimensions (Kimball & Ross 2002). In our case, the central element ‘observation’ is characterized by measurements of traits and ancillary data, where each trait and each kind of ancillary data represent a dimension, such that this results in a database structure with n dimensions. Dimensional databases with one central element are called ‘star‐schema’ (Kimball & Ross 2002).
Realized as a relational database, the dimensional structure is appropriate to compile diverse data, like plant traits and their ancillary data, because a relational database is highly flexible and can be easily adapted to the required level of complexity (e.g. adding new dimensions or subcentres to characterize the measurements). Finally, the proposed generic structure for plant trait databases can be characterized as a hierarchically nested relational‐dimensional database structure.
Two example databases using the generic database structure
The proposed generic database structure, designed to be a blueprint for the development of plant trait databases for individual purposes, has so far been used in the development of three plant trait databases. Here, we will present two contrasting databases – the Functional Ecology of Trees (FET) database and the TRY database – and illustrate how the generic database structure ensures data interoperability.
The FET44
The FET database is being collaboratively developed by the Organismic Biogeochemistry group at the Max‐Planck‐Institute for Biogeochemistry in Jena (Germany), the Department of Special Botany and Functional Biodiversity Research at the University of Leipzig (Germany) and the Ogle‐Labat the University of Wyoming, Laramie (USA).
database (Fig. 4) was collaboratively developed to address questions in the field of comparative trait research, functional biodiversity, and biogeochemical and demographic modelling. It was designed to compile data for a wide range of tree physiological, morphological, anatomical, and life‐history traits accompanied by a wide range of ancillary data related to several hierarchical levels of organization (organ, individual, forest stand, site), with the goal to disentangle phylogenetic, environmental, and disturbance‐related influences on plant trait values (Kattge et al. 2009; Kutsch et al. 2009). Data are primarily derived from the literature (data summaries) and direct experiments (raw data). The database supports the standardization of numerical and textual data during manual data entry.

Simplified relationships in the FET database. All information is linked through the central Observation table. Measurements are separated into traits (Trait data and Measurement standard tables) and tables providing information about the context of trait measurements with an inbuilt hierarchical structure (Entity, Site, Treatment, Taxonomy/phylogeny, and Source information). For each trait value entered in the trait measurement tables, information is entered, when available, on study site details, manipulative treatments and other study/sampling information. The trait data are linked to this information and a species, which is linked to external taxonomy datasets, to a position in the phylogenetic relationships, and to categorical traits that are invariable within species. Each box represents a core table in the database, and the current numbers of fields in each table are indicated.
The TRY55
The TRY initiative (http://try‐db.org) has been developed under the framework of the International Geosphere–Biosphere Programme (IGBP) in the context of the Fast‐Track Initiative ‘Refining Plant Functional Classifications for Earth System Modelling’ (http://www.igbp.kva.se/page.php?pid=369).
database (Fig. 5) is a communal effort to merge information about ecological traits of plant species from different existing datasets at a global scale (Lavorel et al. 2007). The specific traits, ancillary data and formats of data to be compiled within the TRY database were not known during the development of the database. The objective was therefore to design a database structure that is flexible enough to compile data for any plant trait and ancillary data contributed in any format without compromising the integrity of the contributed data.

Core tables, relationships and data entry‐types of the TRY database (each box represents a table). Each observation is characterized by one to several measurements, the respective dataset and the name of a species. Each measurement is linked to a characteristic (either trait or ancillary data). The measurement standard is specified in the Characteristic table. In the DataSet table each contributed dataset is at least characterized by its name and the names of contributors. The additional DataSet Characteristic table facilitates the import of the original name of each trait and ancillary data, and a specific characterization of measurement details for each dataset. The automated import of contributed data as original entries realizes data integrity (original value, original precision, original unit, original species name and original characteristic name). Standardized values and species names are added to the original entries.
Specific aspects of the two databases
The FET database provides a modular organization of templates (groups of tables with preset formats) to compile trait and context‐specific data. Each dimension of the traits and their context is characterized by a separate template. The modular organization conserves the transparency of the database and allows for exchanging or adding of templates. For example, it is easy to add a new trait with its own set of trait‐specific ancillary data. The users can therefore easily adapt the database according to their specific needs and it still is compatible with other users’ databases. The modular organization facilitates highly standardized data input via specific entry forms. These entry forms support the consistent design of experiments and extraction of data from literature sources and facilitate the standardized manual input of data into the FET database. An inbuilt hierarchical structure promotes the quantification of plant traits or ancillary data at the prescribed levels of organs, individuals, stands and sites (Figs 1 and 5 and Appendix S2). There are sets of ancillary data at each of these levels, which may be continuous variables or binary, categorical, or ordinal variables, in which case, the possible entries are predefined.
The major requirement of the TRY database was to respect the integrity of the contributed data, such that they can be reproduced from the database in their originally contributed form. This requirement was addressed through the use of an automated algorithm for importation of the original data values with minimal manual interference. By assembling numerous databases, the TRY project revealed a vast heterogeneity of the contributed trait information. For example, up to six different units were provided per trait; 138 different categories were provided for the categorical trait plant growth form; leaf toughness was provided as three different measures and as a categorical trait; and, identical traits come with different names (e.g. specific leaf area and leaf mass area). The database structure proved to be appropriate to deal with this heterogeneity and to support its standardization.
Treatment of taxonomic specification
A separate ‘Species’ table is used in FET and TRY to organize the taxonomy. This table contains fields for the ‘original species name’ and ‘accepted species’ identifier. An entry in the field ‘original species name’ consists of the binomial species name, which should be accompanied by the authority and, if possible, a reference to the regional flora. The original species name is on the one hand linked via the observation table to the data source, which is either a literature source or a specimen, and on the one hand to an accepted species identifier, which is linked to an official plant name index, e.g. IPNI.66
IPNI: International Plant Names Index, http://www.ipni.org.
Conserving the originally contributed species name and linking it to accepted species lists ensures that the species classification can be adapted to changes of the official taxonomy without loosing the original information.
Interoperability of different datasets
Interoperability ensures that information – data and their relationships – from different datasets can seamlessly be merged for subsequent integrated analysis. Ontologies provide a formal mechanism for defining terms and their relationships using mathematical logic, to facilitate the interoperability of different datasets. One ontology scheme that is gathering support in ecology and environmental science is the Extensible Observation Ontology (OBOE, see Appendix S3), developed by the SEEK project.77
Science Environment for Ecological Knowledge (SEEK): http://seek.ecoinformatics.org.
Like in the generic database structure presented here, in OBOE the information content of observational data is structured along ‘Measurement’ and ‘Observation’: measurements are aggregated to observations, which in turn are hierarchically nested. The generic database structure proposed here and the two databases presented are therefore consistent with this general structure of the OBOE ontology scheme, which facilitates interoperability, the seamless merging of disparate information from different databases into an integrated form for analysis (details see Appendix S4).
Discussion
In this article, we analysed the inherent structure of plant trait data to determine an appropriate database structure for capturing existing and future plant trait data. This entity‐relationship analysis uncovered several key aspects that must be considered to develop successfully an appropriate and interoperable database structure: plant traits are a heterogeneous group of data with a relatively low degree of standardization; a wide range of equally heterogeneous ancillary data are required to characterize the biotic and abiotic environment and the measurement methods; it is important to keep track of object identities in order to specify the ‘degree of relatedness’ relevant for the analysis of trait correlation, of the hierarchical structure of biological systems and of the taxonomy, which is constantly under revision. The resulting generic database structure for plant trait data followed three principles: (i) traits and ancillary data are treated identically as measurements of objects, (ii) measurements are aggregated to observations, and (iii) observations are hierarchically nested along the hierarchy of biological integration. These principles resulted in a hierarchically nested relational‐dimensional database structure. Finally, we illustrate the realization of this structure with two plant trait databases developed for different purposes, and we show that the new structure is consistent with the existing ontology schemes in ecology, which facilitate interoperability among plant traits and other ecological and environmental databases. The realized databases may be very detailed (e.g. FET) or comparatively simple (e.g. TRY). However, the underlying database structure is straightforward and is based on a dimensional data model that facilitates easy and unambiguous data entry, queries, and data quality assurance (Kimball & Ross 2002).
The database software needs not be specified, because the structure can be implemented in any software that facilitates the construction of a relational database. The appropriate software depends on user preference and application, and will probably depend on whether the database shall facilitate Internet‐based user access. The expected size of the database will in most cases not be an issue, as all common database programmes provide sufficient storage capacity for plant trait databases. Compared to other ecological data streams, the quantity of plant trait and ancillary data to be stored is relatively small (e.g. 750 megabytes in the case of the TRY data base, realized in Microsoft Office Access 2007, version 12.0.6535.5005; Microsoft Corporation, Redmond, Washington, USA), because the individual trait measurements are often manual, and therefore relatively time‐consuming and expensive. This is in contrast to other ecological databases, e.g. remote sensing or eddy covariance data, which contain data that are sampled automatically and may be weighed in the order of giga‐ to terabytes.
We are not aware of any other trait database that simultaneously accounts for (i) different measurements being taken on the same object and (ii) the hierarchical nesting of information. Two‐dimensional spreadsheet datasets often account for different measurements being taken on the same object (observation), which obviously has been identified by ecologists as being a pivotal characteristic of plant trait data. Yet, spreadsheet datasets are limited with respect to the number of different traits and ancillary data, and the representation of the hierarchical structure. During the transition from two‐dimensional spreadsheets to a relational database the characteristic ‘being measured on the same object’ gets lost if the relational database does not provide the respective structure.
Identical treatment of traits, taxonomy and ancillary data
The proposed generic structure for plant trait databases is not defined explicitly in terms of plant traits, taxonomy and ancillary data, but it is based on measurements of objects. Traits and covariates are only defined a posteriori as specific characteristics of the measurements. This is a major advantage because only in some cases are traits well defined (Cornelissen et al. 2003), while in most cases their definition is either vague or deliberately modified to address a specific scientific question. For example, in the literature, leaf nitrogen content could be reported as the average leaf nitrogen content of all leaves of an individual plant, the nitrogen content of a single leaf, the nitrogen content of a single leaf per dry mass or leaf area, the nitrogen content per leaf area of a sun‐exposed leaf, or the nitrogen content per leaf area of a sun‐exposed leaf during the peak growth period. The compilation of the individual measurements together with key ancillary data exactly describing the measurement provides the opportunity to make this decision and select the relevant data a posteriori, as dictated by the ecological question and subsequent statistical analysis (see also Appendix S5: The representation of parameter‐based traits). The transparent aggregation of measurements to observations and the hierarchical structure of observations avoids duplicating information and allows tracking post‐sampling data processing (Ellison et al. 2006).
Even the treatment of taxonomy fits into this concept of ‘being a measurement of a specific object’, although the classification systems are not always unambiguous (e.g. synonymy), different classification systems exist in parallel and are overlapping (regional floras, global name indices), and these classification systems are constantly changing (Berendsohn & Geoffroy 2007). Here, the compilation of the original species name related to an accepted species name has proven convenient and flexible. The original names are unambiguously linked via the Observation table to the data source (literature source, specimen), while the accepted names are linked to ‘official’ lists of species names, and thus make use of the treatment of synonyms in these lists and are able to follow changes. Dealing with species information in a few specific tables is a major advantage over using species names independently in several separate spreadsheets. Thus, the database structure supports the adaptation of taxonomy within the database to keep track with changes in external taxonomy sources.
Definition of observation: weakness or strength?
The central element of the proposed database structure is the observation. An observation is defined by measurements on the same object in space and time. Deciding which measured values belong to the same observation is flexible and can be subjective. The decision to be made is: what is to be considered an object in space and time? Two measurements on one leaf may be considered to belong to the same or different observations, depending on the perspective of the researcher (they will still be related on a higher level of the hierarchy). This subjectivity does not present a weakness but a strength of this approach, because the decision of what is to be considered a group of information that belongs to the same observation is most vivid at the time of data acquisition. Thus, the aggregation of measurements to observations already constitutes a knowledge component (Baumeister et al. 2007) stored in the database and ready to be reused by later projects.
Due to the separation of measurements, the aggregation of measurements to observations and the hierarchical arrangement of observations, the generic database structure realized in an relational database is extremely flexible, and facilitates the consistent compilation of data on higher levels of the biological organization, e.g. community‐level data, in combination with plant trait data. This flexibility in consistence with the major ontology schemes (e.g. OBOE) makes the generic structure not only appropriate for plant trait databases, but also applicable in other contexts (e.g. databases to compile data for scientific projects in general), where different kinds of data are to be compiled in combination with several ancillary data. First applications of the generic database structure for such project databases are currently being tested.
Conclusions and perspectives
Based on a comprehensive examination of plant traits with respect to data compilation, we have developed a generic dimensional database structure that follows three key principles: (i) traits and ancillary data are identically treated as measurements of specific objects; (ii) measurements related to the same object and time are aggregated to observations; and (iii) observations are hierarchically nested from organ to ecosystem. This database structure is consistent with main ontology frameworks (e.g. OBOE) that are currently being developed in ecology for improving data interoperability among research efforts. We illustrate the over‐arching utility of the proposed database structure using two independent plant trait database projects. The generic database structure will serve as a flexible blueprint for future plant trait databases, improving data discovery, and ensuring compatibility among them.
Acknowledgements
The authors wish to thank Jens Nieschulze and Brian McGill for essential input in the context of eco‐informatics, the anonymous referees and the handling editor for valuable comments that helped to substantially improve the manuscript. K.O. and K.S. were supported by National Science Foundation (NSF) grants awarded to K.O. in 2003 and 2006 (#0630119) and by NSF grant #EPS‐0447681. The FET database was supported by the German Science Foundation (DFG) through the BEAM project of C.W. within the Biodiversity Exploratories. The development of the TRY database was supported by IGBP, GLP, DIVERSITAS, QUEST and the French GIS Climate Environment and Society consortium.
References
Citing Literature
Number of times cited according to CrossRef: 53
- Carlos Cano‐Barbacil, Johannes Radinger, Emili García‐Berthou, Reliability analysis of fish traits reveals discrepancies among databases, Freshwater Biology, 10.1111/fwb.13469, 65, 5, (863-877), (2020).
- Patrick Weigelt, Christian König, Holger Kreft, GIFT – A Global Inventory of Floras and Traits for macroecology and biogeography, Journal of Biogeography, 10.1111/jbi.13623, 47, 1, (16-43), (2019).
- Jens Kattge, Gerhard Bönisch, Sandra Díaz, Sandra Lavorel, Iain Colin Prentice, Paul Leadley, Susanne Tautenhahn, Gijsbert D. A. Werner, Tuomas Aakala, Mehdi Abedi, Alicia T. R. Acosta, George C. Adamidis, Kairi Adamson, Masahiro Aiba, Cécile H. Albert, Julio M. Alcántara, Carolina Alcázar C, Izabela Aleixo, Hamada Ali, Bernard Amiaud, Christian Ammer, Mariano M. Amoroso, Madhur Anand, Carolyn Anderson, Niels Anten, Joseph Antos, Deborah Mattos Guimarães Apgaua, Tia‐Lynn Ashman, Degi Harja Asmara, Gregory P. Asner, Michael Aspinwall, Owen Atkin, Isabelle Aubin, Lars Baastrup‐Spohr, Khadijeh Bahalkeh, Michael Bahn, Timothy Baker, William J. Baker, Jan P. Bakker, Dennis Baldocchi, Jennifer Baltzer, Arindam Banerjee, Anne Baranger, Jos Barlow, Diego R. Barneche, Zdravko Baruch, Denis Bastianelli, John Battles, William Bauerle, Marijn Bauters, Erika Bazzato, Michael Beckmann, Hans Beeckman, Carl Beierkuhnlein, Renee Bekker, Gavin Belfry, Michael Belluau, Mirela Beloiu, Raquel Benavides, Lahcen Benomar, Mary Lee Berdugo‐Lattke, Erika Berenguer, Rodrigo Bergamin, Joana Bergmann, Marcos Bergmann Carlucci, Logan Berner, Markus Bernhardt‐Römermann, Christof Bigler, Anne D. Bjorkman, Chris Blackman, Carolina Blanco, Benjamin Blonder, Dana Blumenthal, Kelly T. Bocanegra‐González, Pascal Boeckx, Stephanie Bohlman, Katrin Böhning‐Gaese, Laura Boisvert‐Marsh, William Bond, Ben Bond‐Lamberty, Arnoud Boom, Coline C. F. Boonman, Kauane Bordin, Elizabeth H. Boughton, Vanessa Boukili, David M. J. S. Bowman, Sandra Bravo, Marco Richard Brendel, Martin R. Broadley, Kerry A. Brown, Helge Bruelheide, Federico Brumnich, Hans Henrik Bruun, David Bruy, Serra W. Buchanan, Solveig Franziska Bucher, Nina Buchmann, Robert Buitenwerf, Daniel E. Bunker, Jana Bürger, Sabina Burrascano, David F. R. P. Burslem, Bradley J. Butterfield, Chaeho Byun, Marcia Marques, Marina C. Scalon, Marco Caccianiga, Marc Cadotte, Maxime Cailleret, James Camac, Jesús Julio Camarero, Courtney Campany, Giandiego Campetella, Juan Antonio Campos, Laura Cano‐Arboleda, Roberto Canullo, Michele Carbognani, Fabio Carvalho, Fernando Casanoves, Bastien Castagneyrol, Jane A. Catford, Jeannine Cavender‐Bares, Bruno E. L. Cerabolini, Marco Cervellini, Eduardo Chacón‐Madrigal, Kenneth Chapin, F. Stuart Chapin, Stefano Chelli, Si‐Chong Chen, Anping Chen, Paolo Cherubini, Francesco Chianucci, Brendan Choat, Kyong‐Sook Chung, Milan Chytrý, Daniela Ciccarelli, Lluís Coll, Courtney G. Collins, Luisa Conti, David Coomes, Johannes H. C. Cornelissen, William K. Cornwell, Piermaria Corona, Marie Coyea, Joseph Craine, Dylan Craven, Joris P. G. M. Cromsigt, Anikó Csecserits, Katarina Cufar, Matthias Cuntz, Ana Carolina Silva, Kyla M. Dahlin, Matteo Dainese, Igor Dalke, Michele Dalle Fratte, Anh Tuan Dang‐Le, Jirí Danihelka, Masako Dannoura, Samantha Dawson, Arend Jacobus Beer, Angel De Frutos, Jonathan R. De Long, Benjamin Dechant, Sylvain Delagrange, Nicolas Delpierre, Géraldine Derroire, Arildo S. Dias, Milton Hugo Diaz‐Toribio, Panayiotis G. Dimitrakopoulos, Mark Dobrowolski, Daniel Doktor, Pavel Dřevojan, Ning Dong, John Dransfield, Stefan Dressler, Leandro Duarte, Emilie Ducouret, Stefan Dullinger, Walter Durka, Remko Duursma, Olga Dymova, Anna E‐Vojtkó, Rolf Lutz Eckstein, Hamid Ejtehadi, James Elser, Thaise Emilio, Kristine Engemann, Mohammad Bagher Erfanian, Alexandra Erfmeier, Adriane Esquivel‐Muelbert, Gerd Esser, Marc Estiarte, Tomas F. Domingues, William F. Fagan, Jaime Fagúndez, Daniel S. Falster, Ying Fan, Jingyun Fang, Emmanuele Farris, Fatih Fazlioglu, Yanhao Feng, Fernando Fernandez‐Mendez, Carlotta Ferrara, Joice Ferreira, Alessandra Fidelis, Bryan Finegan, Jennifer Firn, Timothy J. Flowers, Dan F. B. Flynn, Veronika Fontana, Estelle Forey, Cristiane Forgiarini, Louis François, Marcelo Frangipani, Dorothea Frank, Cedric Frenette‐Dussault, Grégoire T. Freschet, Ellen L. Fry, Nikolaos M. Fyllas, Guilherme G. Mazzochini, Sophie Gachet, Rachael Gallagher, Gislene Ganade, Francesca Ganga, Pablo García‐Palacios, Verónica Gargaglione, Eric Garnier, Jose Luis Garrido, André Luís Gasper, Guillermo Gea‐Izquierdo, David Gibson, Andrew N. Gillison, Aelton Giroldo, Mary‐Claire Glasenhardt, Sean Gleason, Mariana Gliesch, Emma Goldberg, Bastian Göldel, Erika Gonzalez‐Akre, Jose L. Gonzalez‐Andujar, Andrés González‐Melo, Ana González‐Robles, Bente Jessen Graae, Elena Granda, Sarah Graves, Walton A. Green, Thomas Gregor, Nicolas Gross, Greg R. Guerin, Angela Günther, Alvaro G. Gutiérrez, Lillie Haddock, Anna Haines, Jefferson Hall, Alain Hambuckers, Wenxuan Han, Sandy P. Harrison, Wesley Hattingh, Joseph E. Hawes, Tianhua He, Pengcheng He, Jacob Mason Heberling, Aveliina Helm, Stefan Hempel, Jörn Hentschel, Bruno Hérault, Ana‐Maria Hereş, Katharina Herz, Myriam Heuertz, Thomas Hickler, Peter Hietz, Pedro Higuchi, Andrew L. Hipp, Andrew Hirons, Maria Hock, James Aaron Hogan, Karen Holl, Olivier Honnay, Daniel Hornstein, Enqing Hou, Nate Hough‐Snee, Knut Anders Hovstad, Tomoaki Ichie, Boris Igić, Estela Illa, Marney Isaac, Masae Ishihara, Leonid Ivanov, Larissa Ivanova, Colleen M. Iversen, Jordi Izquierdo, Robert B. Jackson, Benjamin Jackson, Hervé Jactel, Andrzej M. Jagodzinski, Ute Jandt, Steven Jansen, Thomas Jenkins, Anke Jentsch, Jens Rasmus Plantener Jespersen, Guo‐Feng Jiang, Jesper Liengaard Johansen, David Johnson, Eric J. Jokela, Carlos Alfredo Joly, Gregory J. Jordan, Grant Stuart Joseph, Decky Junaedi, Robert R. Junker, Eric Justes, Richard Kabzems, Jeffrey Kane, Zdenek Kaplan, Teja Kattenborn, Lyudmila Kavelenova, Elizabeth Kearsley, Anne Kempel, Tanaka Kenzo, Andrew Kerkhoff, Mohammed I. Khalil, Nicole L. Kinlock, Wilm Daniel Kissling, Kaoru Kitajima, Thomas Kitzberger, Rasmus Kjøller, Tamir Klein, Michael Kleyer, Jitka Klimešová, Joice Klipel, Brian Kloeppel, Stefan Klotz, Johannes M. H. Knops, Takashi Kohyama, Fumito Koike, Johannes Kollmann, Benjamin Komac, Kimberly Komatsu, Christian König, Nathan J. B. Kraft, Koen Kramer, Holger Kreft, Ingolf Kühn, Dushan Kumarathunge, Jonas Kuppler, Hiroko Kurokawa, Yoko Kurosawa, Shem Kuyah, Jean‐Paul Laclau, Benoit Lafleur, Erik Lallai, Eric Lamb, Andrea Lamprecht, Daniel J. Larkin, Daniel Laughlin, Yoann Le Bagousse‐Pinguet, Guerric Maire, Peter C. Roux, Elizabeth Roux, Tali Lee, Frederic Lens, Simon L. Lewis, Barbara Lhotsky, Yuanzhi Li, Xine Li, Jeremy W. Lichstein, Mario Liebergesell, Jun Ying Lim, Yan‐Shih Lin, Juan Carlos Linares, Chunjiang Liu, Daijun Liu, Udayangani Liu, Stuart Livingstone, Joan Llusià, Madelon Lohbeck, Álvaro López‐García, Gabriela Lopez‐Gonzalez, Zdeňka Lososová, Frédérique Louault, Balázs A. Lukács, Petr Lukeš, Yunjian Luo, Michele Lussu, Siyan Ma, Camilla Maciel Rabelo Pereira, Michelle Mack, Vincent Maire, Annikki Mäkelä, Harri Mäkinen, Ana Claudia Mendes Malhado, Azim Mallik, Peter Manning, Stefano Manzoni, Zuleica Marchetti, Luca Marchino, Vinicius Marcilio‐Silva, Eric Marcon, Michela Marignani, Lars Markesteijn, Adam Martin, Cristina Martínez‐Garza, Jordi Martínez‐Vilalta, Tereza Mašková, Kelly Mason, Norman Mason, Tara Joy Massad, Jacynthe Masse, Itay Mayrose, James McCarthy, M. Luke McCormack, Katherine McCulloh, Ian R. McFadden, Brian J. McGill, Mara Y. McPartland, Juliana S. Medeiros, Belinda Medlyn, Pierre Meerts, Zia Mehrabi, Patrick Meir, Felipe P. L. Melo, Maurizio Mencuccini, Céline Meredieu, Julie Messier, Ilona Mészáros, Juha Metsaranta, Sean T. Michaletz, Chrysanthi Michelaki, Svetlana Migalina, Ruben Milla, Jesse E. D. Miller, Vanessa Minden, Ray Ming, Karel Mokany, Angela T. Moles, Attila Molnár, Jane Molofsky, Martin Molz, Rebecca A. Montgomery, Arnaud Monty, Lenka Moravcová, Alvaro Moreno‐Martínez, Marco Moretti, Akira S. Mori, Shigeta Mori, Dave Morris, Jane Morrison, Ladislav Mucina, Sandra Mueller, Christopher D. Muir, Sandra Cristina Müller, François Munoz, Isla H. Myers‐Smith, Randall W. Myster, Masahiro Nagano, Shawna Naidu, Ayyappan Narayanan, Balachandran Natesan, Luka Negoita, Andrew S. Nelson, Eike Lena Neuschulz, Jian Ni, Georg Niedrist, Jhon Nieto, Ülo Niinemets, Rachael Nolan, Henning Nottebrock, Yann Nouvellon, Alexander Novakovskiy, Kristin Odden Nystuen, Anthony O'Grady, Kevin O'Hara, Andrew O'Reilly‐Nugent, Simon Oakley, Walter Oberhuber, Toshiyuki Ohtsuka, Ricardo Oliveira, Kinga Öllerer, Mark E. Olson, Vladimir Onipchenko, Yusuke Onoda, Renske E. Onstein, Jenny C. Ordonez, Noriyuki Osada, Ivika Ostonen, Gianluigi Ottaviani, Sarah Otto, Gerhard E. Overbeck, Wim A. Ozinga, Anna T. Pahl, C. E. Timothy Paine, Robin J. Pakeman, Aristotelis C. Papageorgiou, Evgeniya Parfionova, Meelis Pärtel, Marco Patacca, Susana Paula, Juraj Paule, Harald Pauli, Juli G. Pausas, Begoña Peco, Josep Penuelas, Antonio Perea, Pablo Luis Peri, Ana Carolina Petisco‐Souza, Alessandro Petraglia, Any Mary Petritan, Oliver L. Phillips, Simon Pierce, Valério D. Pillar, Jan Pisek, Alexandr Pomogaybin, Hendrik Poorter, Angelika Portsmuth, Peter Poschlod, Catherine Potvin, Devon Pounds, A. Shafer Powell, Sally A. Power, Andreas Prinzing, Giacomo Puglielli, Petr Pyšek, Valerie Raevel, Anja Rammig, Johannes Ransijn, Courtenay A. Ray, Peter B. Reich, Markus Reichstein, Douglas E. B. Reid, Maxime Réjou‐Méchain, Victor Resco Dios, Sabina Ribeiro, Sarah Richardson, Kersti Riibak, Matthias C. Rillig, Fiamma Riviera, Elisabeth M. R. Robert, Scott Roberts, Bjorn Robroek, Adam Roddy, Arthur Vinicius Rodrigues, Alistair Rogers, Emily Rollinson, Victor Rolo, Christine Römermann, Dina Ronzhina, Christiane Roscher, Julieta A. Rosell, Milena Fermina Rosenfield, Christian Rossi, David B. Roy, Samuel Royer‐Tardif, Nadja Rüger, Ricardo Ruiz‐Peinado, Sabine B. Rumpf, Graciela M. Rusch, Masahiro Ryo, Lawren Sack, Angela Saldaña, Beatriz Salgado‐Negret, Roberto Salguero‐Gomez, Ignacio Santa‐Regina, Ana Carolina Santacruz‐García, Joaquim Santos, Jordi Sardans, Brandon Schamp, Michael Scherer‐Lorenzen, Matthias Schleuning, Bernhard Schmid, Marco Schmidt, Sylvain Schmitt, Julio V. Schneider, Simon D. Schowanek, Julian Schrader, Franziska Schrodt, Bernhard Schuldt, Frank Schurr, Galia Selaya Garvizu, Marina Semchenko, Colleen Seymour, Julia C. Sfair, Joanne M. Sharpe, Christine S. Sheppard, Serge Sheremetiev, Satomi Shiodera, Bill Shipley, Tanvir Ahmed Shovon, Alrun Siebenkäs, Carlos Sierra, Vasco Silva, Mateus Silva, Tommaso Sitzia, Henrik Sjöman, Martijn Slot, Nicholas G. Smith, Darwin Sodhi, Pamela Soltis, Douglas Soltis, Ben Somers, Grégory Sonnier, Mia Vedel Sørensen, Enio Egon Sosinski, Nadejda A. Soudzilovskaia, Alexandre F. Souza, Marko Spasojevic, Marta Gaia Sperandii, Amanda B. Stan, James Stegen, Klaus Steinbauer, Jörg G. Stephan, Frank Sterck, Dejan B. Stojanovic, Tanya Strydom, Maria Laura Suarez, Jens‐Christian Svenning, Ivana Svitková, Marek Svitok, Miroslav Svoboda, Emily Swaine, Nathan Swenson, Marcelo Tabarelli, Kentaro Takagi, Ulrike Tappeiner, Rubén Tarifa, Simon Tauugourdeau, Cagatay Tavsanoglu, Mariska Beest, Leho Tedersoo, Nelson Thiffault, Dominik Thom, Evert Thomas, Ken Thompson, Peter E. Thornton, Wilfried Thuiller, Lubomír Tichý, David Tissue, Mark G. Tjoelker, David Yue Phin Tng, Joseph Tobias, Péter Török, Tonantzin Tarin, José M. Torres‐Ruiz, Béla Tóthmérész, Martina Treurnicht, Valeria Trivellone, Franck Trolliet, Volodymyr Trotsiuk, James L. Tsakalos, Ioannis Tsiripidis, Niklas Tysklind, Toru Umehara, Vladimir Usoltsev, Matthew Vadeboncoeur, Jamil Vaezi, Fernando Valladares, Jana Vamosi, Peter M. Bodegom, Michiel Breugel, Elisa Van Cleemput, Martine Weg, Stephni Merwe, Fons Plas, Masha T. Sande, Mark Kleunen, Koenraad Van Meerbeek, Mark Vanderwel, Kim André Vanselow, Angelica Vårhammar, Laura Varone, Maribel Yesenia Vasquez Valderrama, Kiril Vassilev, Mark Vellend, Erik J. Veneklaas, Hans Verbeeck, Kris Verheyen, Alexander Vibrans, Ima Vieira, Jaime Villacís, Cyrille Violle, Pandi Vivek, Katrin Wagner, Matthew Waldram, Anthony Waldron, Anthony P. Walker, Martyn Waller, Gabriel Walther, Han Wang, Feng Wang, Weiqi Wang, Harry Watkins, James Watkins, Ulrich Weber, James T. Weedon, Liping Wei, Patrick Weigelt, Evan Weiher, Aidan W. Wells, Camilla Wellstein, Elizabeth Wenk, Mark Westoby, Alana Westwood, Philip John White, Mark Whitten, Mathew Williams, Daniel E. Winkler, Klaus Winter, Chevonne Womack, Ian J. Wright, S. Joseph Wright, Justin Wright, Bruno X. Pinho, Fabiano Ximenes, Toshihiro Yamada, Keiko Yamaji, Ruth Yanai, Nikolay Yankov, Benjamin Yguel, Kátia Janaina Zanini, Amy E. Zanne, David Zelený, Yun‐Peng Zhao, Jingming Zheng, Ji Zheng, Kasia Ziemińska, Chad R. Zirbel, Georg Zizka, Irié Casimir Zo‐Bi, Gerhard Zotz, Christian Wirth, TRY plant trait database – enhanced coverage and open access, Global Change Biology, 10.1111/gcb.14904, 26, 1, (119-188), (2019).
- Sumira Jan, Parvaiz Ahmad, Climatic Change and Metabolome Fluxes, Ecometabolomics, 10.1016/B978-0-12-814872-3.00004-7, (179-237), (2019).
- A.L. Gomes, R. Revermann, F.M.P. Gonçalves, F. Lages, M.P.M. Aidar, M. Finckh, N. Jürgens, Tree or not a tree: Differences in plant functional traits among geoxyles and closely related tree species, South African Journal of Botany, 10.1016/j.sajb.2019.08.044, 127, (176-187), (2019).
- Florian D. Schneider, David Fichtmueller, Martin M. Gossner, Anton Güntsch, Malte Jochum, Birgitta König‐Ries, Gaëtane Le Provost, Peter Manning, Andreas Ostrowski, Caterina Penone, Nadja K. Simons, Towards an ecological trait‐data standard, Methods in Ecology and Evolution, 10.1111/2041-210X.13288, 10, 12, (2006-2019), (2019).
- Christian König, Patrick Weigelt, Julian Schrader, Amanda Taylor, Jens Kattge, Holger Kreft, Biodiversity data integration--The significance of data resolution and domain, PLOS Biology, 10.1371/journal.pbio.3000183, 17, 3, (e3000183), (2019).
- Christoph U. Germeier, Stefan Unger, Modeling Crop Genetic Resources Phenotyping Information Systems, Frontiers in Plant Science, 10.3389/fpls.2019.00728, 10, (2019).
- Norbert Kunert, Joseph Zailaa, Specific leaf area for five tropical tree species growing in different tree species mixtures in Central Panama, New Forests, 10.1007/s11056-019-09706-z, (2019).
- Ewa Jabłońska, Dierk Michaelis, Marlena Tokarska, Klara Goldstein, Mateusz Grygoruk, Mateusz Wilk, Tomasz Wyszomirski, Wiktor Kotowski, Alleviation of Plant Stress Precedes Termination of Rich Fen Stages in Peat Profiles of Lowland Mires, Ecosystems, 10.1007/s10021-019-00437-y, (2019).
- Elias C. Massoud, Chonggang Xu, Rosie A. Fisher, Ryan G. Knox, Anthony P. Walker, Shawn P. Serbin, Bradley O. Christoffersen, Jennifer A. Holm, Lara M. Kueppers, Daniel M. Ricciuto, Liang Wei, Daniel J. Johnson, Jeffrey Q. Chambers, Charlie D. Koven, Nate G. McDowell, Jasper A. Vrugt, Identification of key parameters controlling demographically structured vegetation dynamics in a land surface model: CLM4.5(FATES), Geoscientific Model Development, 10.5194/gmd-12-4133-2019, 12, 9, (4133-4164), (2019).
- W. Daniel Kissling, Ramona Walls, Anne Bowser, Matthew O. Jones, Jens Kattge, Donat Agosti, Josep Amengual, Alberto Basset, Peter M. van Bodegom, Johannes H. C. Cornelissen, Ellen G. Denny, Salud Deudero, Willi Egloff, Sarah C. Elmendorf, Enrique Alonso García, Katherine D. Jones, Owen R. Jones, Sandra Lavorel, Dan Lear, Laetitia M. Navarro, Samraat Pawar, Rebecca Pirzl, Nadja Rüger, Sofia Sal, Roberto Salguero-Gómez, Dmitry Schigel, Katja-Sabine Schulz, Andrew Skidmore, Robert P. Guralnick, Towards global data products of Essential Biodiversity Variables on species traits, Nature Ecology & Evolution, 10.1038/s41559-018-0667-3, 2, 10, (1531-1540), (2018).
- Michael Fell, Jarrett Barber, Jeremy W. Lichstein, Kiona Ogle, Multidimensional trait space informed by a mechanistic model of tree growth and carbon allocation, Ecosphere, 10.1002/ecs2.2060, 9, 1, (2018).
- P. Török, G. Matus, E. Tóth, M. Papp, A. Kelemen, J. Sonkoly, B. Tóthmérész, Both trait-neutrality and filtering effects are validated by the vegetation patterns detected in the functional recovery of sand grasslands, Scientific Reports, 10.1038/s41598-018-32078-x, 8, 1, (2018).
- Hanna Ćwiek-Kupczyńska, Striving for Semantics of Plant Phenotyping Data, Semantics, Analytics, Visualization, 10.1007/978-3-030-01379-0_12, (161-169), (2018).
- Adugna Abdi Woldesemayat, David M. Modise, Junaid Gemeildien, Bongani K. Ndimba, Alan Christoffels, Cross-species multiple environmental stress responses: An integrated approach to identify candidate genes for multiple stress tolerance in sorghum (Sorghum bicolor (L.) Moench) and related model species, PLOS ONE, 10.1371/journal.pone.0192678, 13, 3, (e0192678), (2018).
- Peter Török, Aveliina Helm, Ecological theory provides strong support for habitat restoration, Biological Conservation, 10.1016/j.biocon.2016.12.024, 206, (85-91), (2017).
- Alice Nunes, Melanie Köbel, Pedro Pinho, Paula Matos, Francesco de Bello, Otília Correia, Cristina Branquinho, Which plant traits respond to aridity? A critical step to assess functional diversity in Mediterranean drylands, Agricultural and Forest Meteorology, 10.1016/j.agrformet.2017.03.007, 239, (176-184), (2017).
- Shafique Matin, Mukunda Dev Behera, Partha Sarathi Roy, Demonstrating Surrogacy of Animal Diversity with Plant Diversity and Their Integration to Assess Inclusive Biodiversity: A Geoinformatics Basis, Proceedings of the National Academy of Sciences, India Section A: Physical Sciences, 10.1007/s40010-017-0459-1, 87, 4, (911-925), (2017).
- Débora S. G. Henriques, Claudine Ah-Peng, Rosalina Gabriel, Structure and Applications of BRYOTRAIT-AZO, a Trait Database for Azorean Bryophytes, Cryptogamie, Bryologie, 10.7872/cryb/v38.iss2.2017.137, 38, 2, (137-152), (2017).
- Ilaria Rosati, Caterina Bergami, Elena Stanca, Leonilde Roselli, Paolo Tagliolato, Alessandro Oggioni, Nicola Fiore, Alessandra Pugnetti, Adriana Zingone, Angela Boggero, Alberto Basset, A thesaurus for phytoplankton trait-based approaches: Development and applicability, Ecological Informatics, 10.1016/j.ecoinf.2017.10.014, 42, (129-138), (2017).
- Cornelia B .Krug, Michael E Schaepman, Lynne J Shannon, Jeannine Cavender-Bares, William Cheung, Peter B McIntyre, Jean Paul Metzger, Ülo Niinemets, David O Obura, Bernhard Schmid, Bernardo BN Strassburg, Astrid JA Van Teeffelen, Olaf LF Weyl, Moriaki Yasuhara, Paul W Leadley, Observations, indicators and scenarios of biodiversity and ecosystem services change — a framework to support policy and decision-making, Current Opinion in Environmental Sustainability, 10.1016/j.cosust.2018.04.001, 29, (198-206), (2017).
- Philipp Brun, Mark R. Payne, Thomas Kiørboe, A trait database for marine copepods, Earth System Science Data, 10.5194/essd-9-99-2017, 9, 1, (99-113), (2017).
- Eric Garnier, Ulrike Stahl, Marie‐Angélique Laporte, Jens Kattge, Isabelle Mougenot, Ingolf Kühn, Baptiste Laporte, Bernard Amiaud, Farshid S. Ahrestani, Gerhard Bönisch, Daniel E. Bunker, J. Hans C. Cornelissen, Sandra Díaz, Brian J. Enquist, Sophie Gachet, Pedro Jaureguiberry, Michael Kleyer, Sandra Lavorel, Lutz Maicher, Natalia Pérez‐Harguindeguy, Hendrik Poorter, Mark Schildhauer, Bill Shipley, Cyrille Violle, Evan Weiher, Christian Wirth, Ian J. Wright, Stefan Klotz, Towards a thesaurus of plant characteristics: an ecological contribution, Journal of Ecology, 10.1111/1365-2745.12698, 105, 2, (298-309), (2016).
- Eric M. Lind, Unified data management for distributed experiments: A model for collaborative grassroots scientific networks, Ecological Informatics, 10.1016/j.ecoinf.2016.08.002, 36, (231-236), (2016).
- Cédric Scherer, Florian Jeltsch, Volker Grimm, Niels Blaum, Merging trait-based and individual-based modelling: An animal functional type approach to explore the responses of birds to climatic and land use changes in semi-arid African savannas, Ecological Modelling, 10.1016/j.ecolmodel.2015.07.005, 326, (75-89), (2016).
- Daniel Sebastian Goll, Coupled Cycling of Carbon, Nitrogen, and Phosphorus, Soil Phosphorus, 10.1201/9781315372327-4, (43-63), (2016).
- Robert P. Guralnick, Paula F. Zermoglio, John Wieczorek, Raphael LaFrance, David Bloom, Laura Russell, The importance of digitized biocollections as a source of trait data and a new VertNet resource, Database, 10.1093/database/baw158, 2016, (baw158), (2016).
- Joshua S. Madin, Mia O. Hoogenboom, Sean R. Connolly, Emily S. Darling, Daniel S. Falster, Danwei Huang, Sally A. Keith, Toni Mizerek, John M. Pandolfi, Hollie M. Putnam, Andrew H. Baird, A Trait-Based Approach to Advance Coral Reef Science, Trends in Ecology & Evolution, 10.1016/j.tree.2016.02.012, 31, 6, (419-428), (2016).
- Nicholas R. Record, David Talmy, Selina Våge, Quantifying Tradeoffs for Marine Viruses, Frontiers in Marine Science, 10.3389/fmars.2016.00251, 3, (2016).
- Lucas Enrico, Sandra Díaz, Mark Westoby, Barbara L. Rice, Leaf mechanical resistance in plant trait databases: comparing the results of two common measurement methods, Annals of Botany, 10.1093/aob/mcv149, 117, 1, (209-214), (2015).
- Joaquín Hortal, Francesco de Bello, José Alexandre F. Diniz-Filho, Thomas M. Lewinsohn, Jorge M. Lobo, Richard J. Ladle, Seven Shortfalls that Beset Large-Scale Knowledge of Biodiversity, Annual Review of Ecology, Evolution, and Systematics, 10.1146/annurev-ecolsys-112414-054400, 46, 1, (523-549), (2015).
- Cyrille Violle, Philippe Choler, Benjamin Borgy, Eric Garnier, Bernard Amiaud, Guilhem Debarros, Sylvain Diquelou, Sophie Gachet, Claudy Jolivet, Jens Kattge, Sandra Lavorel, Servane Lemauviel-Lavenant, Jessy Loranger, Alexis Mikolajczak, François Munoz, Jean Olivier, Nicolas Viovy, Vegetation ecology meets ecosystem science: Permanent grasslands as a functional biogeography case study, Science of The Total Environment, 10.1016/j.scitotenv.2015.03.141, 534, (43-51), (2015).
- Arun Prabhu Dhanapal, Mahalingam Govindaraj, Unlimited Thirst for Genome Sequencing, Data Interpretation, and Database Usage in Genomic Era: The Road towards Fast-Track Crop Plant Improvement, Genetics Research International, 10.1155/2015/684321, 2015, (1-15), (2015).
- Adam R. Martin, Marney E. Isaac, REVIEW: Plant functional traits in agroecosystems: a blueprint for research, Journal of Applied Ecology, 10.1111/1365-2664.12526, 52, 6, (1425-1435), (2015).
- Miguel Cifuentes Jara, Matieu Henry, Maxime Réjou Méchain, Omar R. Lopez, Craig Wayson, José María Michel Fuentes, Edwin Castellanos, Mauricio Zapata-Cuartas, Daniel Piotto, Federico Alice Guier, Héctor Castañeda Lombis, Ruby Cuenca Lara, Kelvin Cueva Rojas, Jhon del Águila Pasquel, Álvaro Duque Montoya, Javier Fernández Vega, Abner Jiménez Galo, Lars Gunnar Marklund, Fabián Milla, José de Jesús Návar Chaidez, Edgar Ortiz Malavassi, Johnny Pérez, Carla Ramírez Zea, Luis Rangel García, Rafael Rubilar Pons, Laurent Saint-André, Carlos Sanquetta, Charles Scott, James Westfall, Overcoming obstacles to sharing data on tree allometric equations, Annals of Forest Science, 10.1007/s13595-015-0467-8, 72, 6, (789-794), (2015).
- Javad Chamanara, Birgitta König-Ries, A conceptual model for data management in the field of ecology, Ecological Informatics, 10.1016/j.ecoinf.2013.12.003, 24, (261-272), (2014).
- C. Violle, P. B. Reich, S. W. Pacala, B. J. Enquist, J. Kattge, The emergence and promise of functional biogeography, Proceedings of the National Academy of Sciences, 10.1073/pnas.1415442111, 111, 38, (13690-13696), (2014).
- Mark van Kleunen, Wayne Dawson, Oliver Bossdorf, Markus Fischer, The more the merrier: Multi-species experiments in ecology, Basic and Applied Ecology, 10.1016/j.baae.2013.10.006, 15, 1, (1-9), (2014).
- Christine Granier, Vincent Nègre, Fabio Fiorani, Systematic Phenotyping of Plant Development in Arabidopsis thaliana, Phenomics, 10.1201/b16437, (111-141), (2014).
- Marie-Angélique Laporte, Isabelle Mougenot, Eric Garnier, Ulrike Stahl, Lutz Maicher, Jens Kattge, A Semantic Web Faceted Search System for Facilitating Building of Biodiversity and Ecosystems Services, Data Integration in the Life Sciences, 10.1007/978-3-319-08590-6_5, (50-57), (2014).
- H. Ferguson-Gow, S. Sumner, A. F. G. Bourke, K. E. Jones, Colony size predicts division of labour in attine ants, Proceedings of the Royal Society B: Biological Sciences, 10.1098/rspb.2014.1411, 281, 1793, (20141411-20141411), (2014).
- Kiona Ogle, Sharmila Pathikonda, Karla Sartor, Jeremy W. Lichstein, Jeanne L. D. Osnas, Stephen W. Pacala, A model‐based meta‐analysis for estimating species‐specific wood density and identifying potential sources of variation, Journal of Ecology, 10.1111/1365-2745.12178, 102, 1, (194-208), (2013).
- Isabelle Aubin, Lisa Venier, Jennie Pearce, Marco Moretti, Can a trait-based multi-taxa approach improve our assessment of forest management impact on biodiversity?, Biodiversity and Conservation, 10.1007/s10531-013-0565-6, 22, 12, (2957-2975), (2013).
- Christian Mulder, Farshid S. Ahrestani, Michael Bahn, David A. Bohan, Michael Bonkowski, Bryan S. Griffiths, Rannveig Anna Guicharnaud, Jens Kattge, Paul Henning Krogh, Sandra Lavorel, Owen T. Lewis, Giorgio Mancinelli, Shahid Naeem, Josep Peñuelas, Hendrik Poorter, Peter B. Reich, Loreto Rossi, Graciela M. Rusch, Jordi Sardans, Ian J. Wright, Connecting the Green and Brown Worlds, Ecological Networks in an Agricultural World, 10.1016/B978-0-12-420002-9.00002-0, (69-175), (2013).
- Chaeho Byun, Sylvie Blois, Jacques Brisson, Plant functional group identity and diversity determine biotic resistance to invasion by an exotic grass, Journal of Ecology, 10.1111/1365-2745.12016, 101, 1, (128-139), (2012).
- Karin Nadrowski, Sophia Ratcliffe, Gerhard Bönisch, Helge Bruelheide, Jens Kattge, Xiaojuan Liu, Lutz Maicher, Xiangcheng Mi, Michael Prilop, Daniel Seifarth, Karl Welter, Sven Windisch, Christian Wirth, Harmonizing, annotating and sharing data in biodiversity–ecosystem functioning research, Methods in Ecology and Evolution, 10.1111/2041-210x.12009, 4, 2, (201-205), (2012).
- Kenny Billiau, Heike Sprenger, Christian Schudoma, Dirk Walther, Karin I. Köhl, Data management pipeline for plant phenotyping in a multisite project, Functional Plant Biology, 10.1071/FP12009, 39, 11, (948), (2012).
- Laura Pla, Fernando Casanoves, Julio Di Rienzo, Laura Pla, Fernando Casanoves, Julio Di Rienzo, Introduction, Quantifying Functional Biodiversity, 10.1007/978-94-007-2648-2_1, (1-8), (2012).
- M. D. A. Rounsevell, D. T. Robinson, D. Murray-Rust, From actors to agents in socio-ecological systems models, Philosophical Transactions of the Royal Society B: Biological Sciences, 10.1098/rstb.2011.0187, 367, 1586, (259-269), (2012).
- V. Brovkin, P. M. van Bodegom, T. Kleinen, C. Wirth, W. K. Cornwell, J. H. C. Cornelissen, J. Kattge, Plant-driven variation in decomposition rates improves projections of global litter stock distribution, Biogeosciences, 10.5194/bg-9-565-2012, 9, 1, (565-576), (2012).
- Cécile H. Albert, Fabrice Grassein, Frank M. Schurr, Ghislain Vieilledent, Cyrille Violle, When and how should intraspecific variability be considered in trait-based plant ecology?, Perspectives in Plant Ecology, Evolution and Systematics, 10.1016/j.ppees.2011.04.003, 13, 3, (217-225), (2011).
- Sandra Lavorel, Karl Grigulis, Pénélope Lamarque, Marie‐Pascale Colace, Denys Garden, Jacky Girel, Gilles Pellet, Rolland Douzet, Using plant functional traits to understand the landscape distribution of multiple ecosystem services, Journal of Ecology, 10.1111/j.1365-2745.2010.01753.x, 99, 1, (135-147), (2010).




