3.3 Genome annotation
Repetitive sequences made up 57.73% of the F. chinensis genome (Table S9). Three methods (de novo prediction, homology-based prediction, and RNA-seq-based prediction) together predicted 25,026 genes, with an average gene length of 11,290 bp (including untranslated regions [UTRs]), an average coding sequence (CDS) length of 1,230 bp, and an average of 5.94 exons per gene (Table S10). Functions were assigned to 76.7% of the predicted genes using multiple databases (Table S11). Comparison against the non-coding RNA (ncRNA) database identified 72,517 ncRNA genes, comprising 59,026 miRNA genes, 2,592 tRNA genes, 24 rRNA genes, and 10,875 snRNA genes (Table S12).
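The ncRNA category counts reported above can be checked against the stated total with a short tally. The following is a minimal sketch, not part of the annotation pipeline; the names `ncrna_counts`, `reported_total`, and `tally` are hypothetical and introduced only for illustration, and the numbers are copied from the text.

```python
# ncRNA gene counts as reported in the text (Table S12).
ncrna_counts = {
    "miRNA": 59_026,
    "tRNA": 2_592,
    "rRNA": 24,
    "snRNA": 10_875,
}
reported_total = 72_517  # total number of ncRNA genes stated in the text


def tally(counts):
    """Return the summed count and each category's share of that sum (percent)."""
    total = sum(counts.values())
    shares = {name: 100.0 * n / total for name, n in counts.items()}
    return total, shares


total, shares = tally(ncrna_counts)

# The four categories should add up to the reported total.
assert total == reported_total

for name, pct in shares.items():
    print(f"{name}: {ncrna_counts[name]:,} ({pct:.2f}%)")
print(f"total: {total:,}")
```

The same helper could be applied to other per-category counts in the supplementary tables, assuming they are entered by hand in the same dictionary form.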