Multiple full-length variants of the mitochondrial COI DNA barcode region are prevalent in north European sawflies
Prous, Marko; Urpilainen, Santtu; Hebert, Paul D.N.; Zakharov, Evgeny; Kiljunen, Niina; Mutanen, Marko (2026-02-16)
Prous, Marko
Urpilainen, Santtu
Hebert, Paul D.N.
Zakharov, Evgeny
Kiljunen, Niina
Mutanen, Marko
John Wiley & Sons
16.02.2026
Prous, M., Urpilainen, S., Hebert, P.D.N., Zakharov, E., Kiljunen, N. & Mutanen, M. (2026) Multiple full-length variants of the mitochondrial COI DNA barcode region are prevalent in north European sawflies. Systematic Entomology, 51(1), e70031. Available from: https://doi.org/10.1111/syen.70031
https://creativecommons.org/licenses/by/4.0/
© 2026 The Author(s). Systematic Entomology published by John Wiley & Sons Ltd on behalf of Royal Entomological Society. This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
https://creativecommons.org/licenses/by/4.0/
© 2026 The Author(s). Systematic Entomology published by John Wiley & Sons Ltd on behalf of Royal Entomological Society. This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
https://creativecommons.org/licenses/by/4.0/
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:oulu-202602261980
https://urn.fi/URN:NBN:fi:oulu-202602261980
Tiivistelmä
Abstract
DNA barcoding, the use of a standard DNA fragment for species identification, has emerged as a major field of biodiversity research. The effectiveness of this approach rests on the premise that much less variation exists within species than between them. While exceptions occur, this has been demonstrated in many animal taxa where the COI gene is effective in species discrimination. Sawflies are an exception to this pattern because DNA barcodes often fail to distinguish congeneric species. Using high-throughput single-molecule DNA sequencing to recover COI sequences from thousands of sawflies, we found that single individuals often possess multiple, seemingly functional, full-length DNA barcodes (i.e., unrecognizable as nuclear pseudogenes)—a phenomenon not documented at similar prevalence in any animal taxon. While the evolutionary causes of multiple variants require further investigation, our observation is remarkable as it violates the one-barcode-one-specimen assumption. The presence of multiple variants of barcodes within individuals does not jeopardize the concept, but it introduces a complexity for species inventories based on metabarcoding. They will overestimate the species count when barcode-based operational species units are used as species proxies. Similarly, DNA barcode reference libraries must consider how best to deal with the high frequency of multiple intra-individual variants.
DNA barcoding, the use of a standard DNA fragment for species identification, has emerged as a major field of biodiversity research. The effectiveness of this approach rests on the premise that much less variation exists within species than between them. While exceptions occur, this has been demonstrated in many animal taxa where the COI gene is effective in species discrimination. Sawflies are an exception to this pattern because DNA barcodes often fail to distinguish congeneric species. Using high-throughput single-molecule DNA sequencing to recover COI sequences from thousands of sawflies, we found that single individuals often possess multiple, seemingly functional, full-length DNA barcodes (i.e., unrecognizable as nuclear pseudogenes)—a phenomenon not documented at similar prevalence in any animal taxon. While the evolutionary causes of multiple variants require further investigation, our observation is remarkable as it violates the one-barcode-one-specimen assumption. The presence of multiple variants of barcodes within individuals does not jeopardize the concept, but it introduces a complexity for species inventories based on metabarcoding. They will overestimate the species count when barcode-based operational species units are used as species proxies. Similarly, DNA barcode reference libraries must consider how best to deal with the high frequency of multiple intra-individual variants.
Kokoelmat
- Avoin saatavuus [42834]

