Beyond standard metrics like Contig N50 and Scaffold N50, rigorous quality assessment is essential to ensure assembly reliability. This involves:
Transcriptome Validation: Utilizing EST and RNA-seq data to assess the completeness of the assembled gene models.
Physical Map Verification: Using BAC data to verify the assembly and detect potential misassemblies or breaks.
Conserved Gene Analysis: Employing benchmarking tools like CEGMA or BUSCO to evaluate the overall completeness of the genome assembly.
Ideally, the DNA used for both the Survey and de novo sequencing phases should be derived from the same individual to ensure data consistency. However, if the DNA quantity is insufficient for the entire de novo project, we recommend the following strategy:
Small-fragment Libraries: DNA must be sourced from the same individual used for the Survey.
Large-fragment Libraries: DNA can be sourced from another individual within the same population.