Austin Baker

and 93 more

Insects are the most diverse terrestrial organisms, yet they are underrepresented in large-scale conservation assessments. To address this gap, the California Insect Barcoding Initiative is developing a publicly accessible, statewide DNA-barcode reference library for California insects to support scalable surveys, establish baseline biodiversity measurements, and generate potential distribution maps that inform conservation planning. Specimens are collected with a hybrid strategy that combines standardized Malaise trapping, to enable replicable sampling, and opportunistic collecting, to maximize taxonomic coverage. To date, we have barcoded over one million specimens; preliminary completeness analyses suggest that current sampling captures roughly 65–69% of the fauna, implying a conservative minimum of circa 61,000 insect species in California. Using all sequenced specimen records, we generated rule-based spatial range interpolations constrained by ecoregion and vegetation type, and used these to infer spatial patterns in insect species richness across the state. We identify areas of both high and low potential species richness, with current peaks in the Southern California Mountains ecoregion and the Mojave Basin and Range ecoregion. Our species richness estimates and spatial patterns are explicitly provisional and are expected to evolve as sampling gaps are addressed. Finally, we make all sequence data, specimen images, and occurrence records publicly available via the Barcode of Life Datasystem. This ongoing effort constitutes the first large-scale DNA barcode-based survey for California insects, providing an expandable foundation for tracking temporal change, testing drivers of insect diversity, and prioritizing regions for conservation and targeted inventory.

Paul Hebert

and 2 more

Jessica Schultz

and 1 more

Tomas Roslin

and 96 more

To associate specimens identified by molecular characters to other biological knowledge, we need reference sequences annotated by Linnaean taxonomy. In this paper, we 1) report the creation of a comprehensive reference library of DNA barcodes for the arthropods of an entire country (Finland), 2) publish this library, and 3) deliver a new identification tool based on this resource. The reference library contains mtDNA COI barcodes for 11,275 (43%) of 26,437 arthropod species known from Finland, including 10,811 (45%) of 23,956 insect species. To quantify the improvement in identification accuracy enabled by the current reference library, we ran 1,000 Finnish insect and spider species through the Barcode of Life Data system (BOLD) identification engine. Of these, 91% were correctly assigned to a unique species when compared to the new reference library alone, 85% were correctly identified when compared to BOLD with the new material included, and 75% with the new material excluded. To capitalize on this resource, we used the new reference material to train a probabilistic taxonomic assignment tool, FinPROTAX, scoring high success. For the full-length barcode region, the accuracy of taxonomic assignments at the level of classes, orders, families, subfamilies, tribes, genera, and species reached 99.9%, 99.9%, 99.8%, 99.7%, 99.4%, 96.8%, and 88.5%, respectively. The FinBOL arthropod reference library and FinPROTAX are available through the Finnish Biodiversity Information Facility (www.laji.fi). Overall, the FinBOL investment represents a massive capacity-transfer from the taxonomic community of Finland to all sectors of society.