References
Altschul, S. F. et al. (1997). Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Research , 25, 3389–3402.
Ashburner, M. et al. (2000). Gene Ontology: tool for the unification of biology. Nature Genetics , 25, 25–29.
Bao, Z. R. & Eddy, S. R. (2002). Automated de novo identification of repeat sequence families in sequenced genomes. Genome Research , 12, 1269–1276.
Belaghzal, H., Dekker, J., & Gibcus, J. H. (2017). Hi-C 2.0: An optimized hi-c procedure for high-resolution genome-wide mapping of chromosome conformation. Methods , 123, 56-65.
Benson, G. (1999). Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res , 27, 573–580.
Boeckmann, B. et al. (2003). The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Research , 31, 365–370.
Burge, C. & Karlin, S. (1997). Prediction of complete gene structures in human genomic DNA. Journal of molecular biology , 268, 78–94.
Burton J N, Adey A, Patwardhan R P, et al. (2013). Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions[J]. Nature biotechnology , 31(12): 1119.
Camacho, C. et al. (2009). BLAST plus: architecture and applications.BMC Bioinformatics , 10.
Cantarel, B. L. et al. (2008). MAKER: An easy-to-use annotation pipeline designed for emerging model organism genomes. Genome Res earch, 18, 188–196.
Conesa, A. & Gotz, S. (2008). Blast2GO: a comprehensive suite for functional analysis in plant genomics. International Journal of Plant Genomics , 1–12.
Durand, N.C., Shamim, M.S., Machol, I., Rao S.S.P., Huntley, M.H., Lander, E.S., and Aiden E.L. (2016). Juicer provides a one-click system for analyzing loop-resolution Hi-C experiments. Cell Syst , 3, 95-98.
Edgar R C. (2004). MUSCLE: multiple sequence alignment with high accuracy and high throughput[J]. Nucleic acids research , 32(5): 1792-1797.
Emms D M, Kelly S. (2015). OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy[J]. Genome biology , 16 (157):157.
Finn, R. D. et al. (2008). The Pfam protein families database.Nucleic Acids Research , 36, D281– D288.
Flicek, P., Amode, M. R., Barrell, D., Beal, K., Billis, K., Brent, S., … Fitzgerald, S. (2014). Ensembl 2014. Nucleic Acids Research , 42 (Database issue), D749-D755.
Galal-Khallaf, A., Osman, A. G. M., El-Ganainy, A., Farrag, M. M., Mohammed-Abdallah, E., & Moustafa, M. A., et al. (2018). Mitochondrial genetic markers for authentication of major red sea grouper species (Perciformes: Serranidae) in egypt: a tool for enhancing fisheries management and species conservation. Gene .
Gaither, M. R., Bowen, B. W., Bordenave, T. R., Rocha, L. A., Newman, S. J., & Gomez, J. A., et al. (2011). Phylogeography of the reef fishCephalopholis argus (Epinephelidae) indicates pleistocene isolation across the indo-pacific barrier with contemporary overlap in the coral triangle. Bmc Evolutionary Biology,  11(1), 189-189.
Ge H, Lin K, Shen M, Wu S, Wang Y, Zhang Z, Wang Z, Zhang Y, Huang Z, Zhou C, Lin Q, Wu J, Liu L, Hu J, Huang Z, Zheng L. (2019). De novo assembly of a chromosome-level reference genome of red-spotted grouper (Epinephelus akaara ) using nanopore sequencing and Hi-C.Molecular ecology resources , 19 (6).
Han MV, Thomas GW, Lugo-Martinez J, Hahn MW. (2013). Estimating gene
gain and loss rates in the presence of error in genome assembly and annotation using CAFE 3. Molecular Biology and Evolution , 30(8):1987–1997.
Jones, P. et al. (2014). InterProScan 5: genome-scale protein function classification. Bioinformatics 30, 1236–1240.
Jurka, J. (2000). Repbase Update -a database and an electronic journal of repetitive elements. Trends Genetics , 16, 418–420.
Jurka, J. et al. (2005). Repbase update, a database of eukaryotic repetitive elements. Cytogenet & Genome Research, 110, 462–467.
Kanehisa, M., Goto, S., Sato, Y., Furumichi, M. & Tanabe, M. (2012). KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Research , 40, D109–D114. Return to ref 41 in article
Kasahara, M., Naruse, K., Sasaki, S., Nakatani, Y., Wei, Q., Ahsan, B., . . . Kasai, Y. (2007). The medaka draft genome and insights into vertebrate genome evolution. Nature , 447(7145), 714-719.
Liu, B., Shi, Y., Yuan, J., Hu, X., Zhang, H., Li, N., Li, Z., Chen, Y., Mu, D. and Fan, W. (2013). Estimation of genomic characteristics by analyzing k-mer frequency in de novo genome projects. arXiv preprint arXiv:1308.2012.
Lowe, T. M., & Eddy, S. R. (1997). tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic acids research , 25(5), 955-964.
Love, M. I., Huber, W., & Anders, S. (2014). Moderated estimation of
fold change and dispersion for RNA-seq data with DESeq2. Genome
Biology , 15(12), 550. https://doi.org/10.1186/s1305 9-014-0550-8
Liu., Shi, Y., Yuan, J., Hu, X., & Wei, F. (2013). Estimation of genomic characteristics by analyzing k-mer frequency in de novo genome projects. Quantitative Biology , 35 (s 1–3), 62-67.
Meyer, A. L. (2008). An ecological comparison of Cephalopholis argus between native and introduced populations. PhD Thesis, University of Hawaii. Available at http://www.fpir.noaa.gov/Library/HCD/Master%20dissertation%205-31-08.pdf
Mitchell, A. et al. (2015). The InterPro protein families database: the classification resource after 15 years. Nucleic Acids Research , 43, D213–D221.
Mistry, J., Bateman, A. & Finn, R. D. (2007). Predicting active site residue annotations in the Pfam database. BMC Bioinformatics , 8, 298. Return
Morris, A. V., Roberts, C. M., & Hawkins, J. P. (2000). The threatened status of groupers (epinephelinae). Biodiversity & Conservation , 9 (7), 919-942.
Nawrocki, E. P., Kolbe, D. L. & Eddy, S. R. (2009). Infernal 1.0: inference of RNA alignments. Bioinformatics , 25, 1335–1337.
Rhoads, A., & Au, K. F. (2015). Pacbio sequencing and its applications. Genomics, Proteomics & Bioinformatics,  13 (5), 278-289.
Roberts, H. C. M. (1994). The growth of coastal tourism in the red sea: present and future effects on coral reefs. Ambio, 23(8), 503-508.
Price, A. L., Jones, N. C. & Pevzner, P. A. (2005). De novo identification of repeat families in large genomes.Bioinformatics , 21, I351–I358.
Shpigel, M., & Fishelson, L. (2010). Territoriality and associated behaviour in three species of the genus Cephalopholis (Pisces, Serranidae) in the gulf of aqaba, red sea. Journal of Fish Biology,  38(6).
Shpigel, M. (1985). Aspects of the biology and ecology of the Red Sea groupers of the genus Cephalopholis (Serranidae, Teleostei). PhD Dissertation, Tel Aviv University (in Hebrew, summary in English).
Shpigel, M. & Fishelson, L. (1989a). Food habits and prey selection of three species of groupers from the genus Cephalopholis(Serranidae, Teleostei). Environmental Biology of Fishes24,67-73.
Shpigel, M. & Fishelson, L. (1989b). Habitat partitioning between species of the genus Cephalopholis (Pisces, Serranidae) across the fringing reef of the Gulf of Aqaba (Red Sea). Marine Ecology Progress Series 58, 17–22.
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V., & Zdobnov, E. M. (2015). BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics , 31(19), 3210-3212.
Stamatakis, A. (2014). RAxML version 8: a tool for phylogenetic analysis and post analysis of large phylogenies. Bioinformatics , 30, 1312-1313.
Stanke, M., Steinkamp, R., Waack, S. & Morgenstern, B. (2004). AUGUSTUS: a web server for gene finding in eukaryotes. Nucleic Acids Res , 32, W309-W312.
Tarailo-Graovac, M. & Chen, N. (2009). Using Repeat Masker to identify repetitive elements in genomic sequences. Curr. Protoc . Bioinformatics Chapter 4, Unit 4.10.
Walker, B. J. et al. (2014). Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. Plos One , 9, e112963.
Wu, T.D. and Watanabe, C.K. (2005). GMAP: a genomic mapping and alignment program for mRNA and EST sequences. Bioinformatics , 21(9), pp.1859-1875.
Xiao, C. et al. (2017). MECAT2: fast mapping, error correction, and de novo assembly for single-moecule sequencing reads. Nature methods , 14, 1072.
Xu, Z. &Wang, H. (2007). LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons. Nucleic Acids Research , 35, W265–W268.
Yang, Z. (2007). PAML 4: Phylogenetic Analysis by Maximum Likelihood.
Molecular Biology and Evolution , 24(8), 1586–1591. https://doi. org/10.1093/molbe v/msm088
Yang., Liu, D., Liu, F., Wu, J., Zou, J., Xiao, X., Zhu, B. (2013). HTQC: a fast quality control toolkit for Illumina sequencing data.BMC Bioinformatics , 14(1), 1-4.
Ze-Gang, W., & Shao-Wu, Z.. (2018). Npbss: a new pacbio sequencing simulator for generating the continuous long reads with an empirical model. Bmc Bioinformatics,  19 (1), 177.
Zhang, Xuan, Qu, Meng, Zhang, & Xiang, et al. (2013). A Comprehensive Description and Evolutionary Analysis of 22 Grouper (Perciformes, Epinephelidae) Mitochondrial Genomes with Emphasis on Two Novel Genome Organizations. (Doctoral dissertation, PUBLIC LIBRARY SCIENCE).
Zhou Q, Gao H, Zhang Y, Fan G, Xu H, Zhai J, Xu W, Chen Z, Zhang H, Liu S, Niu Y, Li W, Li W, Lin H, Chen S. (2019). A chromosome-level genome assembly of the giant grouper (Epinephelus lanceolatus ) provides insights into its innate immunity and rapid growth. Molecular ecology
resources .
Zhou, Q., Guo, X., Huang, Y., Gao, H., & Chen, S. (2020). De novo sequencing and chromosomal-scale genome assembly of leopard coral grouper, Plectropomus leopardusMolecular Ecology Resources .
Table 1. Sequencing data for the C. sonnerati genome assembly.