References
Altschul, S. F. et al. (1997). Gapped BLAST and PSI-BLAST: a new
generation of protein database search programs. Nucleic Acids
Research , 25, 3389–3402.
Ashburner, M. et al. (2000). Gene Ontology: tool for the unification of
biology. Nature Genetics , 25, 25–29.
Bao, Z. R. & Eddy, S. R. (2002). Automated de novo identification of
repeat sequence families in sequenced genomes. Genome Research ,
12, 1269–1276.
Belaghzal, H., Dekker, J., & Gibcus, J. H. (2017). Hi-C 2.0: An
optimized hi-c procedure for high-resolution genome-wide mapping of
chromosome conformation. Methods , 123, 56-65.
Benson, G. (1999). Tandem repeats finder: a program to analyze DNA
sequences. Nucleic Acids Res , 27, 573–580.
Boeckmann, B. et al. (2003). The SWISS-PROT protein knowledgebase and
its supplement TrEMBL in 2003. Nucleic Acids Research , 31,
365–370.
Burge, C. & Karlin, S. (1997). Prediction of complete gene structures
in human genomic DNA. Journal of molecular biology , 268, 78–94.
Burton J N, Adey A, Patwardhan R P, et al. (2013). Chromosome-scale
scaffolding of de novo genome assemblies based on chromatin
interactions[J]. Nature biotechnology , 31(12): 1119.
Camacho, C. et al. (2009). BLAST plus: architecture and applications.BMC Bioinformatics , 10.
Cantarel, B. L. et al. (2008). MAKER: An easy-to-use annotation pipeline
designed for emerging model organism genomes. Genome Res earch,
18, 188–196.
Conesa, A. & Gotz, S. (2008). Blast2GO: a comprehensive suite for
functional analysis in plant genomics. International Journal of
Plant Genomics , 1–12.
Durand, N.C., Shamim, M.S., Machol, I., Rao S.S.P., Huntley, M.H.,
Lander, E.S., and Aiden E.L. (2016). Juicer provides a one-click system
for analyzing loop-resolution Hi-C experiments. Cell Syst , 3,
95-98.
Edgar R C. (2004). MUSCLE: multiple sequence alignment with high
accuracy and high throughput[J]. Nucleic acids research ,
32(5): 1792-1797.
Emms D M, Kelly S. (2015). OrthoFinder: solving fundamental biases in
whole genome comparisons dramatically improves orthogroup inference
accuracy[J]. Genome biology , 16 (157):157.
Finn, R. D. et al. (2008). The Pfam protein families database.Nucleic Acids Research , 36, D281– D288.
Flicek, P., Amode, M. R., Barrell, D., Beal, K., Billis, K., Brent, S.,
… Fitzgerald, S. (2014). Ensembl 2014. Nucleic Acids
Research , 42 (Database issue), D749-D755.
Galal-Khallaf, A., Osman, A. G. M., El-Ganainy, A., Farrag, M. M.,
Mohammed-Abdallah, E., & Moustafa, M. A., et al. (2018). Mitochondrial
genetic markers for authentication of major red sea grouper species
(Perciformes: Serranidae) in egypt: a tool for enhancing fisheries
management and species conservation. Gene .
Gaither, M. R., Bowen, B. W., Bordenave, T. R., Rocha, L. A., Newman, S.
J., & Gomez, J. A., et al. (2011). Phylogeography of the reef fishCephalopholis argus (Epinephelidae) indicates pleistocene
isolation across the indo-pacific barrier with contemporary overlap in
the coral triangle. Bmc Evolutionary Biology, 11(1), 189-189.
Ge H, Lin K, Shen M, Wu S, Wang Y, Zhang Z, Wang Z, Zhang Y, Huang Z,
Zhou C, Lin Q, Wu J, Liu L, Hu J, Huang Z, Zheng L. (2019). De novo
assembly of a chromosome-level reference genome of red-spotted grouper
(Epinephelus akaara ) using nanopore sequencing and Hi-C.Molecular ecology resources , 19 (6).
Han MV, Thomas GW, Lugo-Martinez J, Hahn MW. (2013). Estimating gene
gain and loss rates in the presence of error in genome assembly and
annotation using CAFE 3. Molecular Biology and Evolution ,
30(8):1987–1997.
Jones, P. et al. (2014). InterProScan 5: genome-scale protein function
classification. Bioinformatics 30, 1236–1240.
Jurka, J. (2000). Repbase Update -a database and an electronic journal
of repetitive elements. Trends Genetics , 16, 418–420.
Jurka, J. et al. (2005). Repbase update, a database of eukaryotic
repetitive elements. Cytogenet & Genome Research, 110, 462–467.
Kanehisa, M., Goto, S., Sato, Y., Furumichi, M. & Tanabe, M. (2012).
KEGG for integration and interpretation of large-scale molecular data
sets. Nucleic Acids Research , 40, D109–D114. Return to ref 41 in
article
Kasahara, M., Naruse, K., Sasaki, S., Nakatani, Y., Wei, Q., Ahsan, B.,
. . . Kasai, Y. (2007). The medaka draft genome and insights into
vertebrate genome evolution. Nature , 447(7145), 714-719.
Liu, B., Shi, Y., Yuan, J., Hu, X., Zhang, H., Li, N., Li, Z., Chen, Y.,
Mu, D. and Fan, W. (2013). Estimation of genomic characteristics by
analyzing k-mer frequency in de novo genome projects. arXiv
preprint arXiv:1308.2012.
Lowe, T. M., & Eddy, S. R. (1997). tRNAscan-SE: a program for improved
detection of transfer RNA genes in genomic sequence. Nucleic acids
research , 25(5), 955-964.
Love, M. I., Huber, W., & Anders, S. (2014). Moderated estimation of
fold change and dispersion for RNA-seq data with DESeq2. Genome
Biology , 15(12), 550. https://doi.org/10.1186/s1305 9-014-0550-8
Liu., Shi, Y., Yuan, J., Hu, X., & Wei, F. (2013). Estimation of
genomic characteristics by analyzing k-mer frequency in de novo genome
projects. Quantitative Biology , 35 (s 1–3), 62-67.
Meyer, A. L. (2008). An ecological comparison of Cephalopholis
argus between native and introduced populations. PhD Thesis, University
of Hawaii. Available at
http://www.fpir.noaa.gov/Library/HCD/Master%20dissertation%205-31-08.pdf
Mitchell, A. et al. (2015). The InterPro protein families database: the
classification resource after 15 years. Nucleic Acids Research ,
43, D213–D221.
Mistry, J., Bateman, A. & Finn, R. D. (2007). Predicting active site
residue annotations in the Pfam database. BMC Bioinformatics , 8,
298. Return
Morris, A. V., Roberts, C. M., & Hawkins, J. P. (2000). The threatened
status of groupers (epinephelinae). Biodiversity & Conservation ,
9 (7), 919-942.
Nawrocki, E. P., Kolbe, D. L. & Eddy, S. R. (2009). Infernal 1.0:
inference of RNA alignments. Bioinformatics , 25, 1335–1337.
Rhoads, A., & Au, K. F. (2015). Pacbio sequencing and its
applications. Genomics, Proteomics &
Bioinformatics, 13 (5), 278-289.
Roberts, H. C. M. (1994). The growth of coastal tourism in the red sea:
present and future effects on coral reefs. Ambio, 23(8), 503-508.
Price, A. L., Jones, N. C. & Pevzner, P. A. (2005). De novo
identification of repeat families in large genomes.Bioinformatics , 21, I351–I358.
Shpigel, M., & Fishelson, L. (2010). Territoriality and associated
behaviour in three species of the genus Cephalopholis (Pisces,
Serranidae) in the gulf of aqaba, red sea. Journal of Fish
Biology, 38(6).
Shpigel, M. (1985). Aspects of the biology and ecology of the Red Sea
groupers of the genus Cephalopholis (Serranidae, Teleostei). PhD
Dissertation, Tel Aviv University (in Hebrew, summary in English).
Shpigel, M. & Fishelson, L. (1989a). Food habits and prey selection of
three species of groupers from the genus Cephalopholis(Serranidae, Teleostei). Environmental Biology of Fishes24,67-73.
Shpigel, M. & Fishelson, L. (1989b). Habitat partitioning between
species of the genus Cephalopholis (Pisces, Serranidae) across
the fringing reef of the Gulf of Aqaba (Red Sea). Marine Ecology
Progress Series 58, 17–22.
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V., &
Zdobnov, E. M. (2015). BUSCO: assessing genome assembly and annotation
completeness with single-copy orthologs. Bioinformatics , 31(19),
3210-3212.
Stamatakis, A. (2014). RAxML version 8: a tool for phylogenetic analysis
and post analysis of large phylogenies. Bioinformatics , 30,
1312-1313.
Stanke, M., Steinkamp, R., Waack, S. & Morgenstern, B. (2004).
AUGUSTUS: a web server for gene finding in eukaryotes. Nucleic
Acids Res , 32, W309-W312.
Tarailo-Graovac, M. & Chen, N. (2009). Using Repeat Masker to identify
repetitive elements in genomic sequences. Curr. Protoc .
Bioinformatics Chapter 4, Unit 4.10.
Walker, B. J. et al. (2014). Pilon: an integrated tool for comprehensive
microbial variant detection and genome assembly improvement. Plos
One , 9, e112963.
Wu, T.D. and Watanabe, C.K. (2005). GMAP: a genomic mapping and
alignment program for mRNA and EST sequences. Bioinformatics ,
21(9), pp.1859-1875.
Xiao, C. et al. (2017). MECAT2: fast mapping, error correction, and de
novo assembly for single-moecule sequencing reads. Nature
methods , 14, 1072.
Xu, Z. &Wang, H. (2007). LTR_FINDER: an efficient tool for the
prediction of full-length LTR retrotransposons. Nucleic Acids
Research , 35, W265–W268.
Yang, Z. (2007). PAML 4: Phylogenetic Analysis by Maximum Likelihood.
Molecular Biology and Evolution , 24(8), 1586–1591. https://doi.
org/10.1093/molbe v/msm088
Yang., Liu, D., Liu, F., Wu, J., Zou, J., Xiao, X., Zhu, B. (2013).
HTQC: a fast quality control toolkit for Illumina sequencing data.BMC Bioinformatics , 14(1), 1-4.
Ze-Gang, W., & Shao-Wu, Z.. (2018). Npbss: a new pacbio sequencing
simulator for generating the continuous long reads with an empirical
model. Bmc Bioinformatics, 19 (1), 177.
Zhang, Xuan, Qu, Meng, Zhang, & Xiang, et al. (2013). A Comprehensive
Description and Evolutionary Analysis of 22 Grouper (Perciformes,
Epinephelidae) Mitochondrial Genomes with Emphasis on Two Novel Genome
Organizations. (Doctoral dissertation, PUBLIC LIBRARY SCIENCE).
Zhou Q, Gao H, Zhang Y, Fan G, Xu H, Zhai J, Xu W, Chen Z, Zhang H, Liu
S, Niu Y, Li W, Li W, Lin H, Chen S. (2019). A chromosome-level genome
assembly of the giant grouper (Epinephelus lanceolatus ) provides
insights into its innate immunity and rapid growth. Molecular
ecology
resources .
Zhou, Q., Guo, X., Huang, Y., Gao, H., & Chen, S. (2020). De
novo sequencing and chromosomal-scale genome assembly of leopard coral
grouper, Plectropomus leopardus . Molecular Ecology
Resources .
Table 1. Sequencing data for the C. sonnerati genome assembly.