Release of a 6.1-Fold Assembly of the Culex pipiens quinquefasciatus JHB Genome
The National Institute of Allergy and Infectious Diseases (NIAID), National Institutes of Health, has funded the sequencing and annotation of the Culex pipiens quinquefasciatus JHB genome through its Microbial Sequencing Centers (the J. Craig Venter Institute (JCVI) and the Broad Institute, Massachusetts Institute of Technology) and the NIAID-funded Bioinformatics Resource Center, VectorBase, at the University of Notre Dame. Random sequence data representing 6.1-fold sequence coverage of the genome (5.3X in Q20 bases) have now been generated, assembled, and deposited at DDBJ/EMBL/GenBank under the project accession AAWU00000000. A preliminary annotation of the genome is available at www.vectorbase.org, and a final annotation is expected in Spring 2008.
Assembly statistics
Total number of sequence reads: 7.4 million
Total number of sequence reads placed in the assembly: 4.3 million
Unplaced reads: 3.1 million
Estimated fold coverage of genome: 6.14 fold (5.3X in Q20 bases)
Number of contigs: 48,671
Number of scaffolds: 3,171
Total length of combined contigs: 0.54 Gb
N50 contig length: 28.5 kb
Total length of combined scaffolds (including gaps): 0.58 Gb
N50 scaffold length: 475.6 kb
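For readers unfamiliar with the N50 statistic reported above, the following is a minimal sketch of how it is commonly computed: sort the sequence lengths from longest to shortest and report the length at which the running total first reaches half of the combined length. The function name n50 and the toy contig lengths are illustrative assumptions and are not taken from the actual assembly. The two ratios at the end are derived only from the rounded figures in the list, so they are approximate.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Common definition (assumed here): the length L such that sequences
    of length >= L together cover at least half of the total length.
    """
    if not lengths:
        raise ValueError("need at least one length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical contig lengths in kb, NOT data from this assembly.
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_contigs))  # 70 (80 + 70 = 150, half of the 300 kb total)

# Ratios derived from the rounded statistics listed above.
placed_fraction = 4.3 / 7.4          # reads placed / total reads
gap_fraction = (0.58 - 0.54) / 0.58  # scaffold length not covered by contigs
print(f"{placed_fraction:.1%}")  # 58.1%
print(f"{gap_fraction:.1%}")     # 6.9%
```

Under this definition, an N50 contig length of 28.5 kb means that half of the combined contig length lies in contigs of 28.5 kb or longer.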
Complementary to the sequencing and annotation project described above, the National Institute of Allergy and Infectious Diseases, National Institutes of Health, has also funded the generation of additional C. pipiens genome-related resources. These efforts have produced 109,152 C. pipiens EST sequences from a variety of libraries provided by the community. All sequences are publicly available through the NCBI Web site.