Comparison of short and long-read metabarcoding sequencing: challenges and solutions for plastid read removal and microbial community exploration of seaweed samples

Coralie Rousseau; Nicolas Henry; Sylvie Rousvoal; Gwenn Tanguy; Erwan Legeay; Catherine Leblanc; Simon Dittami

doi:10.22541/au.172675397.78191661/v1

loading page

Comparison of short and long-read metabarcoding sequencing: challenges and solutions for plastid read removal and microbial community exploration of seaweed samples

Coralie Rousseau,
Nicolas Henry,
Sylvie Rousvoal,
Gwenn Tanguy,
Erwan Legeay,
Catherine Leblanc,
Simon Dittami

Abstract

Short-read metabarcoding analysis is the gold standard to access to partial 16S and ITS genes with high read quality. With the advent of long-read sequencing, the amplification of full-length target genes is possible but with low read accuracy. Moreover, the amplification of 16S rDNA genes in seaweed or plant samples results in a large proportion of plastid reads, which are directly or indirectly derived from cyanobacteria. Primers designed not to amplify plastid sequences are available for short-read sequencing, while Oxford Nanopore Technology offers adaptive sampling, a unique way to remove reads in real-time. In this study, we compare three options to address the plastid read issue: deleting plastid reads with adaptative sampling, using optimized primers with Illumina MiSeq technology, and sequencing large numbers of reads with Illumina NovaSeq technology with universal primers. We showed that adaptive sampling using default settings of the MinKNOW software was ineffective for plastid depletion. We also demonstrated with a mock community that the SAMBA workflow provided the most accurate taxonomic assignment at the bacterial genus level compared to the IDTAXA and KRAKEN2 pipelines, but many false positives were generated at species level. Although NovaSeq sequencing with universal primer stood out for studying the algal bacterial community due to its deep coverage, the inclusion of eukaryotes and bacteria in the same sequencing run, and the low error rate. The combination of Illumina and ONT sequencing helped us explore the fungal diversity and allowed for the retrieval taxonomic information for genera poorly represented in the sequence databases.

18 Sep 2024Submitted to Molecular Ecology Resources

Show details

Hide details

19 Sep 2024Submission Checks Completed

19 Sep 2024Assigned to Editor

19 Sep 2024Review(s) Completed, Editorial Evaluation Pending

27 Oct 2024Reviewer(s) Assigned

Abstract

Peer review status:UNDER REVIEW