We reviewed the availability of cytochrome
COI; invertebrates; biomonitoring; high-throughput sequencing; DNA metabarcoding; identification; genus; Biomonitoring 2.0
Rapid advances in our ability to obtain biodiversity information by sequencing genetic material from environmental samples, such as homogenized bulk tissue samples (Hajibabaei et al. [
Biodiversity measurement and biomonitoring traditionally have relied on the collection of samples from local biotic assemblages. Invertebrates are often the target taxon for biomonitoring studies in freshwater habitats because of their high species richness, ubiquity, and range of responses to anthropogenic stressors (Rosenberg and Resh [
HTS can provide taxonomic information at greater resolution, depth, and consistency, and at lower cost than morphologically identified samples (Gibson et al. [
Numerous campaigns exist to complete barcode libraries for specific groups (e.g., Trichoptera; www.trichopterabol.org), or regions (e.g., Arctic; Zhou et al. [
The primary objectives of our study were to assess the state of publicly available COI reference sequences for North American freshwater invertebrate genera and to estimate COI library completeness for the most commonly encountered genera in CABIN samples. Other genetic markers are available for species identification (e.g., 16S ribosomal RNA), but COI has been used extensively for metazoan barcoding because it consistently discriminates among closely related species (Sweeney et al. [
A full survey of the genetic diversity of North American freshwater invertebrates represented at the genus level in COI libraries was beyond the scope of our study. However, we saw value in illustrating some potential pitfalls created by incomplete reference libraries by looking in detail at sequence representation for contrasting genera: 1 with many sequences, broad geographic coverage, and numerous species (Ephemeroptera:Baetis) and 1 with many sequences but restricted geographic coverage (Diptera:Eukiefferiella). Genera with few or no associated sequences can be assumed to have poor species and geographic coverage in COI libraries, reflecting incomplete knowledge of COI sequence diversity. The purpose of our analysis is to inform discussion of COI sequence library-building activities moving forward.
We compiled lists of freshwater invertebrate genera from a wide variety of sources. We compiled arthropod lists primarily from taxonomic keys, specifically those by Merritt et al. ([
We searched GenBank (Benson et al. [
We used the Integrated Taxonomic Information System (ITIS) as a taxonomic standard when preparing our genus lists. ITIS provides lists of synonyms where possible, but these lists cannot be considered exhaustive. Where searches based on ITIS taxonomy did not yield any sequence records, we conducted searches based on known synonyms. Searches were conducted between 12 April and 4 May 2016.
We recorded the number of sequences returned by each query separately for both databases. Given the overlap in results between BOLD and GenBank, we included the maximum number of sequences returned from either query when combining results. We calculated the percentage of genera with >0, 1–10, 11–25, and >25 COI sequences for each database and the combined data set, and we performed these calculations at phylum/subphylum and class levels.
Relatively uncommon taxa often are downweighted in biomonitoring studies because their occurrence is difficult to predict in most monitoring and reference-condition models (Cao et al. [
We downloaded COI sequences for Baetis and Eukiefferiella from BOLD on 18 May 2016. We included only sequences >350 bp long, and we excluded flagged sequences, misidentifications, or sequences containing stop codons. We aligned sequences for each genus (Baetis and Eukiefferiella) manually with MEGA5 software (Tamura et al. [
Across both databases, 61.2% of genera were associated with COI sequence records (Fig. 3), but either database on its own contained COI sequences for ~51% of genera. Non-crustacean arthropods were the most diverse group of genera, with 1500 genera recorded in North America. Across both BOLD and GenBank, 69.2% of these genera had COI reference sequences recorded, but only 53.9% of genera were represented in GenBank and 56.1% in BOLD. This result included particularly low representation for class Arachnida (principally aquatic mites, 171 genera), where only 43.4% of genera were represented in either database. Henceforth, we report only combined results for GenBank and BOLD.
For subphylum Crustacea (409 genera), the 2
Insects were by far the most diverse class of invertebrates, represented by 1305 genera in our study (Fig. 5). COI sequences were associated with 72.4% of aquatic insect genera. This representation was not evenly distributed among insect orders, but ≥50% of genera within each order except Orthoptera (4/9 genera) had associated COI sequence records. Within the orders Diptera, Ephemeroptera, Hymenoptera, and Lepidoptera, ≥50% of genera with COI sequence records had >10 associated sequences. Trichoptera, a key group for freshwater biomonitoring, had the greatest sequence representation, with associated COI sequence records for 87.9% of genera. However, only 41.4% of genera had >25 associated sequence records. Plecoptera and Odonata had the lowest genus-level representation (60.0 and 62.2%, respectively) among other orders routinely scrutinized in freshwater biomonitoring.
Taxonomic resolution varied greatly across samples and phyla. The CABIN assessment protocol is focused on running waters in wadeable rivers, so certain groups (e.g., Crustacea) usually are excluded from totals, whereas others (e.g., Annelida) typically are not identified below a coarse level of resolution. Across the entire data set, only 57.7% of individuals were identified to genus level.
Of the 804 unique genera identified in this data set, 206 occurred in >1% of samples. Figure 6 illustrates that more frequently encountered genera are more likely to be associated with COI sequence records. Across all genera 688 (85.4%) had associated COI sequence records in either database, and 395 of these had >25 associated sequences. For genera occurring in >1% of samples, 197 genera (95.2%) had records and 146 of these had >25 associated sequences. In the subset of 3279 samples in which >70% of individuals were identified to genus based on morphology, an average of 94.1% of genera/sample had associated COI sequence data. The most frequently encountered genus lacking associated COI sequence records was the balloon fly genus Metachela (Diptera:Empididae), the 58
The genus Baetis was represented by 2750 sequences. Of these sequences, 2262 represented 29 unique species, whereas the remaining species were identified only to genus. The average genetic difference between sequence pairs was 16.2%, and the minimum difference was 0% (115,257 pairs, 3.06% of all comparisons). The maximum distance between pairs was 28.2%. Figure 7A illustrates the broad geographic distribution of Baetis COI sequences in North America, and Fig. 8 demonstrates the high variability in the number of sequences across species within the genus.
The genus Eukiefferiella was represented by 1008 sequences and 1 species in BOLD. Most sequences were from specimens identified only at the genus level. The average genetic difference between sequence pairs was 8.6%, and the minimum difference was 0% (56,753 sequence pairs, 11.2% of all comparisons). The maximum difference between pairs was 19.6%. Figure 7B illustrates the comparatively restricted geographic distribution of Eukiefferiella sequences in North America.
As freshwater biomonitoring moves toward a diagnostic, trait-based framework (Poff et al. [
Representation for certain invertebrate groups in public DNA COI-sequence libraries is poor and may limit the applicability of DNA barcode analysis in certain habitats or contexts. For example, Maxillopoda and Ostracoda are important arthropod components of the food web in lentic systems, but <<50% of the genera in these groups are included in public DNA libraries. Furthermore, many of the DNA COI-sequence records available in public databases are not identified to the species or genus level, but merely family or order (Kwong et al. [
Some groups may have relatively high representation at the genus level, but these genera may be represented by few total associated sequences. Having a greater number of sequences for a given genus, especially from geographically distinct populations, will improve confidence in identifications, particularly for speciose genera (Lou and Golding [
The robustness of COI libraries for identification purposes depends on several factors. Many genera contain large numbers of species. For instance, the Trichoptera genus Limnephilus is represented by ≥64 species in Canada alone (CJC, unpublished data), so one might expect to see greater sequence variation at the genus level for Limnephilus than for a genus containing 1 or just a few species. Our analysis of COI sequence diversity for a well-studied genus (Baetis) highlighted a large degree of COI sequence diversity (average genetic distance: 16.2%). Therefore, one can be confident that an unknown sequence that matches at, e.g., 90 to 95% similarity is a true Baetis sequence. The high level of genetic diversity in Baetis probably is caused by its high species diversity (19 species recognized by ITIS; most of these included in BOLD) and its ancient origins. Fossils from the family Baetidae are known from the lower Cretaceous (120–135 mya; McCafferty [
Variability in reference sequences at the genus level may be underestimated if sequences represent a single species or specimens collected from a restricted location. Eukiefferiella is one such example. Although represented by >1000 sequences in BOLD, records for this genus include just 1 definitively identified species. ITIS recognizes 15 species. Many unique sites are represented, but >60% of the records come from 4 localities (Fig. 7B). Thus, COI diversity within the genus may be underestimated, although sequences listed as Eukiefferiella sp. may represent additional species. Were additional library building to be pursued for this genus (particularly increasing the number of species represented within the genus), the diversity of COI sequences in the library probably would increase and further our ability to assign unknown sequences to it with confidence.
Recent work on invertebrates suggests that taxa can be assigned correctly to genus or higher taxonomic levels with a high degree of confidence (Wilson et al. [
Both BOLD and GenBank take measures to ensure that voucher material is correctly identified and associated with accurate reference sequences. However, clerical and identification mistakes can still occur. Moreover, HTS is likely to reveal a large amount of cryptic diversity at both the species and genus level. As the state of phylogenetic and taxonomic knowledge changes (e.g., species within 1 genus are split into different genera), samples should be revisited to ensure that identifications reflect the current state of knowledge. In practice this rarely happens because retrieving and re-identifying many archived specimens is prohibitively time consuming. Sequence information is stored digitally and easily searched, so a further advantage of a DNA-based approach is that identifications can be updated rapidly to reflect current knowledge, provided changes to taxonomic knowledge are accompanied by barcode sequence information. Revisiting identifications based on genomics tools (and any resulting data analysis) should be considered a part of quality assurance/quality control workflows for biomonitoring studies.
In the short term, groups that are routinely used for biomonitoring activities and considered responsive to environmental stress should be the focus of library-building activities. For riverine ecosystems, efforts could be directed toward expanding and completing the reference library for EPTO orders and Diptera. EPTO are widely considered to be sensitive to pollution and hydrological alteration (Compin and Céréghino [
A further consequence of higher-resolution biodiversity information obtained through DNA-based approaches is that descriptions of aquatic invertebrate assemblages will be radically altered. Insects are numerically dominant and speciose and often constitute the bulk of taxa identified in biomonitoring samples, particularly in lotic systems (Cushing and Allan [
The models and analytical approaches currently used in biomonitoring and bioassessment studies must be adapted to use the information provided by genomics tools fully. Traditional approaches based on taxonomic composition have been used to provide pass/fail assessments of sites or to position sites within deviance envelopes from expected conditions, but users struggle to provide mechanistic explanations for observed effects. Ecosystem response to stress is mediated through species interactions, so investigating how networks of ecological interactions and their properties change along stressor gradients may allow more robust and mechanistically grounded assessments (Gray et al. [
The monumental task of distinguishing the effects of anthropogenic stress from natural variation in biodiversity in freshwater ecosystems is difficult because of inadequate sample size or limited capacity to gather information from remote areas and difficult-to-sample habitats. This difficulty is compounded by the scarcity of taxonomic expertise, which results in coarse-resolution taxonomy across a narrow section of the aquatic food web as standard practice. Genomics approaches can address many of these problems, but in the short term must rely upon publicly available reference sequence information for taxon identification. Kvist ([
Author contributions: CJC, JFG, DJB, and MH conceived the study. SS provided data collection and analysis guidance. CJC and JFG collected and analyzed the data and prepared the manuscript.
This work was supported by Environment and Climate Change Canada-funded Natural Sciences and Engineering Research Council Visiting Fellowships to CJC and JFG.
1 Baird, D. J., and M. Hajibabaei. 2012. Biomonitoring 2.0: a new paradigm in ecosystem assessment made possible by next-generation DNA sequencing. Molecular Ecology 21:2039–2044.
- 2 Balian, E. V., H. Segers, C. Lévêque, and K. Martens. 2008. An introduction to the Freshwater Animal Diversity Assessment (FADA) project. Hydrobiologia 595:3–8.
- 3 Benson, D. A., K. Clark, I. Karsch-Mizrachi, D. J. Lipman, J. Ostell, and E. W. Sayers. 2015. GenBank. Nucleic Acids Research 43:D30–D35.
- 4 Blaxter, M., J. Mann, T. Chapman, F. Thomas, C. Whitton, R. Floyd, and E. Abebe. 2005. Defining operational taxonomic units using DNA barcode data. Philosophical Transactions of the Royal Society of London Series B: Biological Sciences 360:1935–1943.
- 5 Bongers, T., and H. Ferris. 1999. Nematode community structure as a bioindicator in biomonitoring. Trends in Ecology and Evolution 14:224–228.
- 6 Cao, Y., D. P. Larsen, and R. St-J. Thorne. 2001. Rare species in multivariate analysis for bioassessment: some considerations. Journal of the North American Benthological Society 20:144–153.
- 7 Carew, M. E., S. J. Nichols, J. Batovska, R. St Clair, N. P. Murphy, M. J. Blacket, and M. E. Shackleton. 2017. A DNA barcode database of Australia’s freshwater macroinvertebrate fauna. Marine and Freshwater Research 68:1788–1802.
- 8 CESCC (Canadian Endangered Species Conservation Council). 2011. Wild species 2010: the general status of species in Canada. National General Status Working Group, Ottawa, Ontario.
- 9 Chariton, A. A., M. Sun, J. Gibson, J. A. Webb, K. M. Y. Leung, C. W. Hickey, and G. C. Hose. 2015. Emergent technologies and analytical approaches for understanding the effects of multiple stressors in aquatic environments. Marine and Freshwater Research 67:414–428.
- 10 Compin, A., and R. Céréghino. 2003. Sensitivity of aquatic insect species richness to disturbance in the Adour–Garonne stream system (France). Ecological Indicators 3:135–142.
- 11 Courtemanch, D. L. 1996. Commentary on the subsampling procedures used for rapid bioassessments. Journal of the North American Benthological Society 15:381–385.
- 12 Curry, C. J., X. Zhou, and D. J. Baird. 2012. Congruence of biodiversity measures among larval dragonflies and caddisflies from three Canadian rivers. Freshwater Biology 57:628–639.
- 13 Cushing, C. E., and J. D. Allan. 2001. Streams: their ecology and life. Academic Press, San Diego, California.
- 14 Davidson, N. C. 2014. How much wetland has the world lost? Long-term and recent trends in global wetland area. Marine and Freshwater Research 65:935–941.
- 15 Ekrem, T., E. Willassen, and E. Stur. 2007. A comprehensive DNA sequence library is essential for identification with DNA barcodes. Molecular Phylogenetics and Evolution 43:530–542.
- 16 Environment Canada. 2012. Canadian Aquatic Biomonitoring Network laboratory methods: processing, taxonomy, and quality control of benthic macroinvertebrate samples. Environment Canada, Ottawa, Ontario.
- 17 Epp, L. S., S. Boessenkool, E. P. Bellemain, J. Haile, A. Esposito, T. Riaz, C. Erséus, V. I. Gusarov, M. E. Edwards, A. Johnsen, H. K. Stenøien, K. Hassel, H. Kauserud, N. G. Yoccoz, K. A. Brathen, E. Willerslev, P. Taberlet, E. Coissac, and C. Brochmann. 2012. New environmental barcodes for analysing soil DNA: potential for studying past and present ecosystems. Molecular Ecology 21:1821–1833.
- 18 Fahner, N. A., S. Shokralla, D. J. Baird, and M. Hajibabaei. 2016. Large-scale monitoring of plants through environmental DNA metabarcoding of soil: recovery, resolution, and annotation of four DNA markers. PLoS ONE 11:e0157505.
- 19 Ficetola, G. F., C. Miaud, F. Pompanon, and P. Taberlet. 2008. Species detection using environmental DNA from water samples. Biology Letters 4:423–425.
- 20 Gibson, J., S. Shokralla, T. M. Porter, I. King, S. van Konynenburg, D. H. Janzen, W. Hallwachs, and M. Hajibabaei. 2014. Simultaneous assessment of the macrobiome and microbiome in a bulk sample of tropical arthropods through DNA metasystematics. Proceedings of the National Academy of Sciences of the United States of America 111:8007–8012.
- 21 Gibson, J. F., S. Shokralla, C. Curry, D. J. Baird, W. A. Monk, I. King, and M. Hajibabaei. 2015. Large-scale biomonitoring of remote and threatened ecosystems via high-throughput sequencing. PLoS ONE 10:e0138432.
- 23 Gray, C., D. J. Baird, S. Baumgartner, U. Jacob, G. B. Jenkins, E. J. O’Gorman, X. Lu, A. Ma, M. J. O. Pocock, N. Schuwirth, M. Thompson, and G. Woodward. 2014. Ecological networks: the missing links in biomonitoring science. Journal of Applied Ecology 51:1444–1449.
- 24 Hajibabaei, M., D. J. Baird, N. A. Fahner, R. Beiko and G. B. Golding. 2016. A new way to contemplate Darwin’s tangled bank: how DNA barcodes are reconnecting biodiversity science and biomonitoring. Philosophical Transactions of the Royal Society of London Series B: Biological Sciences 371:20150330.
- 25 Hajibabaei, M., S. Shokralla, X. Zhou, G. A. C. Singer, and D. J. Baird. 2011. Environmental barcoding: a next-generation sequencing approach for biomonitoring applications using river benthos. PLoS ONE 6:e17497.
- 26 Jackson, J. K., J. M. Battle, B. P. White, E. M. Pilgrim, E. D. Stein, P. E. Miller, and B. W. Sweeney. 2014. Cryptic biodiversity in streams: a comparison of macroinvertebrate communities based on morphological and DNA barcode identifications. Freshwater Science 33:312–324.
- 27 Kvist, S. 2013. Barcoding in the dark? A critical view of the sufficiency of zoological DNA barcoding databases and a plea for broader integration of taxonomic knowledge. Molecular Phylogenetics and Evolution 69:39–45.
- 28 Kwong, S., A. Srivathsan, and R. Meier. 2012. An update on DNA barcoding: low species coverage and numerous unidentified sequences. Cladistics 28:639–644.
- 29 Lou, M., and G. B. Golding. 2012. The effect of sampling from subdivided populations on species identification with DNA barcodes using a Bayesian statistical approach. Molecular Phylogenetics and Evolution 65:765–773.
- 30 Majdi, N., and W. Traunspurger. 2015. Free-living nematodes in the freshwater food web: a review. Journal of Nematology 47:28–44.
- 31 Marchant, R. 1989. A subsampler for samples of benthic invertebrates. Bulletin of the Australian Society for Limnology 12:49–52.
- 32 Marcogliese, D. J., and M. Pietrock. 2011. Combined effects of parasites and contaminants on animal health: parasites do matter. Trends in Parasitology 27:123–130.
- 33 McCafferty, W. P. 1997. Discovery and analysis of the oldest mayflies (Insecta, Ephemeroptera) known from amber. Bulletin de la Société d’Histoire Naturelle de Toulouse 133:77–82.
- 34 Merritt, R. W., K. W. Cummins, and M. B. Berg (editors). 2008. An introduction to the aquatic insects of North America. 4th edition. Kendall/Hunt Publishing, Dubuque, Iowa.
- 35 Meusnier, I., G. A. C. Singer, J.-F. Landry, D. A. Hickey, P. D. N. Hebert, and M. Hajibabaei. 2008. A universal DNA mini-barcode for biodiversity analysis. BMC Genomics 9:214.
- 36 Nicacio, G., and L. Juen. 2015. Chironomids as indicators in freshwater ecosystems: an assessment of the literature. Insect Conservation and Diversity 8:393–403.
- 37 Orlofske, J. M., and D. J. Baird 2014. The tiny mayfly in the room: implications of size-dependent invertebrate taxonomic identification for biomonitoring data properties. Aquatic Ecology 47:481–494.
- 38 Pilgrim, E. M., S. A. Jackson, S. Swenson, I. Turcsanyi, E. Friedman, L. Weigt, and M. J. Bagley. 2011. Incorporation of DNA barcoding into a large-scale biomonitoring program: opportunities and pitfalls. Journal of the North American Benthological Society 30:217–231.
- 39 Poff, N. L., J. D. Olden, N. K. M. Vieira, D. S. Finn, M. P. Simmons, and B. C. Kondratieff. 2006. Functional trait niches of North American lotic insects: traits-based ecological applications in light of phylogenetic relationships. Journal of the North American Benthological Society 25:730–755.
- 40 Porter, T. M., J. F. Gibson, S. Shokralla, D. J. Baird, G. B. Golding, and M. Hajibabaei. 2014. Rapid and accurate taxonomic classification of insect (class Insecta) cytochrome c oxidase subunit 1 (COI) DNA barcode sequences using a naïve Bayesian classifier. Molecular Ecology Resources 14:929–942.
- 41 Ratnasingham, S., and P. D. N. Hebert. 2007. BOLD: the barcode of life data system ( www.barcodinglife.org). Molecular Ecology Notes 7:355–364.
- 42 Resh, V. H. 2008. Which group is best? Attributes of different biological assemblages used in freshwater biomonitoring programs. Environmental Monitoring and Assessment 138:131–138.
- 43 Reynoldson, T. B., R. C. Bailey, K. E. Day, and R. H. Norris. 1995. Biological guidelines for freshwater sediment based on BEnthic Assessment of SedimenT (the BEAST) using a multivariate approach for predicting biological state. Australian Journal of Ecology 20:198–219.
- 44 Reynoldson, T. B., D. M. Rosenberg, and V. H. Resh. 2001. Comparison of models predicting invertebrate assemblages for biomonitoring in the Fraser River catchment, British Columbia. Canadian Journal of Fisheries and Aquatic Sciences 58:1395–1410.
- 45 Rosenberg, D. M., and V. H. Resh (editors). 1993. Freshwater biomonitoring and benthic macroinvertebrates. Chapman and Hall, New York.
- 46 Segers, H. 2008. Global diversity of rotifers in freshwaters. Hydrobiologia 595:49–59.
- 47 Shokralla, S., J. F. Gibson, H. Nikbakht, D. H. Janzen, W. Hallwachs, and M. Hajibabaei. 2014. Next-generation DNA barcoding: using next-generation sequencing to enhance and accelerate DNA barcode capture from single specimens. Molecular Ecology Resources 14:892–901.
- 48 Shokralla, S., T. M. Porter, J. F. Gibson, R. Dobosz, D. H. Janzen, W. Hallwachs, G. B. Golding, and M. Hajibabaei. 2015. Massively parallel multiplex DNA sequencing for specimen identification using an Illumina MiSeq platform. Scientific Reports 5:9687.
- 49 Shokralla, S., J. Spall, J. F. Gibson, and M. Hajibabaei. 2012. Next-generation DNA sequencing technologies for environmental DNA research. Molecular Ecology 21:1794–1805.
- 50 Smith, M. J., W. R. Kay, D. H. D. Edward, P. J. Papas, K. S. J. Richardson, J. C. Simpson, A. M. Pinder, D. J. Cale, P. H. J. Horwitz, J. A. Davis, F. H. Yung, R. H. Norris, and S. A. Halse. 1999. AusRivAS: using macroinvertebrates to assess ecological condition of rivers in Western Australia. Freshwater Biology 41:269–282.
- 51 Stein, E. D., B. P. White, R. D. Mazor, J. K. Jackson, J. M. Battle, P. E. Miller, E. M. Pilgrim, and B. W. Sweeney. 2014. Does DNA barcoding improve performance of traditional stream bioassessment metrics? Freshwater Science 33:302–311.
- 52 Stribling, J. B., K. L. Pavlik, S. M. Holdsworth, and E. W. Leppo. 2008. Data quality, performance, and uncertainty in taxonomic identification for biological assessments. Journal of the North American Benthological Society 27:906–919.
- 53 Sweeney, B. W., J. M. Battle, J. K. Jackson, and T. Dapkey. 2011. Can DNA barcodes of stream macroinvertebrates improve descriptions of community structure and water quality? Journal of the North American Benthological Society 30:195–216.
- 54 Tamura, K., D. Peterson, N. Peterson, G. Stecher, M. Nei, and S. Kumar. 2011. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Molecular Biology and Evolution 28:2731–2739.
- 55 Thorp, J. H., and A. P. Covich (editors). 2010. Ecology and classification of North American freshwater invertebrates. Academic Press, San Diego, California.
- 56 Traunspurger, W. 2000. The biology and ecology of lotic nematodes. Freshwater Biology 44:29–45.
- 57 Trebitz, A. S., J. C. Hoffman, G. W. Grant, T. M. Billehus, and E. M. Pilgrim. 2015. Potential for DNA-based identification of Great Lakes fauna: match and mismatch between taxa inventories and DNA barcode libraries. Scientific Reports 5:12162.
- 58 Vinson, M. R., and C. P. Hawkins. 1996. Effects of sampling area and subsampling procedure on comparisons of taxa richness among streams. Journal of the North American Benthological Society 15:392–399.
- 59 Vivien, R., S. Wyler, M. Lafont, and J. Pawlowski. 2015. Molecular barcoding of aquatic oligochaetes: implications for biomonitoring. PLoS ONE 10:e0125485.
- 60 Walsh, C. J. 1997. A multivariate method for determining optimal subsample size in the analysis of macroinvertebrate samples. Marine and Freshwater Research 48:241–248.
- 61 Watson, N. T. N., I. C. Duggan, and I. D. Hogg. 2015. Assessing the diversity of New Zealand freshwater harpacticoid copepods (Crustacea: Copepoda) using mitochondrial DNA (COI) barcodes. New Zealand Journal of Zoology 42:57–67.
- 62 Webb, J. M., L. M. Jacobus, D. H. Funk, X. Zhou, B. Kondratieff, C. J. Geraci, R. E. DeWalt, D. J. Baird, R. Barton, I. Phillips, and P. D. N. Hebert. 2012. A DNA barcode library for North American mayflies: progress and prospects. PLoS ONE 7:e38063.
- 63 White, B. P., E. Pilgrim, L. M. Boykin, E. D. Stein, and R. D. Mazor. 2014. Comparison of four species-delimitation methods applied to a DNA barcode data set of insect larvae for use in routine bioassessment. Freshwater Science 33:338–348.
- 64 Wilson, J. J., R. Rougerie, J. Schonfeld, D. H. Janzen, W. Hallwachs, M. Hajibabaei, I. J. Kitching, J. Haxaire, and P. D. N. Hebert. 2011. When species matches are unavailable are DNA barcodes correctly assigned to higher taxa? An assessment using sphingid moths. BMC Ecology 11:18.
- 65 Wright, J. F., M. T. Furse, and D. Moss. 1998. River classification using invertebrates: RIVPACS applications. Aquatic Conservation: Marine and Freshwater Ecosystems 8:617–631.
- 66 Zhou, X., S. J. Adamowicz, L. M. Jacobus, R. E. DeWalt, and P. D. N. Hebert. 2009. Towards a comprehensive barcode library for Arctic life—Ephemeroptera, Plecoptera, and Trichoptera of Churchill, Manitoba, Canada. Frontiers in Zoology 6:30.
PHOTO (COLOR): A typical high-throughput sequencing for biomonitoring workflow.
PHOTO (COLOR): Distribution of the 16,671 Canadian Aquatic Biomonitoring Network (CABIN) samples collected between 1993 and 2013 that were considered for this analysis.
PHOTO (COLOR): Number of genera with associated cytochrome c oxidase subunit I (COI) reference sequences and distribution of COI sequence abundance for freshwater invertebrate genera across all phyla and for noncrustacean arthropods.
PHOTO (COLOR): Number of genera with associated cytochrome c oxidase subunit I (COI) reference sequences and distribution of COI sequence abundance for freshwater invertebrate genera across remaining phyla/subphyla.
PHOTO (COLOR): Number of genera with associated cytochrome c oxidase subunit I (COI) reference sequences associated with genera and distribution of COI sequence abundance for freshwater insect orders.
PHOTO (COLOR): Cumulative proportion of Canadian Aquatic Biomonitoring Network (CABIN) genera present in public cytochrome c oxidase subunit I (COI) sequence libraries. Genera were ranked from most (
PHOTO (COLOR): Distribution and relative abundance of cytochrome c oxidase subunit I (COI) sequence records extracted from the Barcode of Life Data systems for specimens within the genera (A) Baetis and (B) Eukiefferiella. The size of circles reflects the number of sequence records from a locality.
PHOTO (COLOR): Relative abundance of cytochrome c oxidase subunit I (COI) sequence records among different Baetis species from North America. An equivalent plot is not shown for Eukiefferiella because only 1 species was represented among the sequences.
By Colin J. Curry; Joel F. Gibson; Shadi Shokralla; Mehrdad Hajibabaei and Donald J. Baird