HomoloGene Release 64 Statistics
Initial numbers of genes from complete genomes, numbers of genes placed in a homology group, and the numbers of groups for each species.
|
Species |
Number of Genes |
HomoloGene |
|
Input |
Grouped |
|
groups |
|
Homo sapiens |
22,165* |
19,571 |
|
18,876 |
Pan troglodytes |
25,096 |
17,243 |
|
16,375 |
Canis lupus familiaris |
19,766 |
16,789 |
|
15,996 |
Bos taurus |
22,049* |
19,803 |
|
16,276 |
Mus musculus |
25,388 |
21,786 |
|
19,026 |
Rattus norvegicus |
21,991 |
19,267 |
|
17,512 |
Gallus gallus |
17,959 |
13,207 |
|
11,969 |
Danio rerio |
26,288 |
20,764 |
|
13,900 |
Drosophila melanogaster |
14,085 |
9,315 |
|
7,796 |
Anopheles gambiae |
12,460* |
8,944 |
|
7,618 |
Caenorhabditis elegans |
20,155* |
8,685 |
|
4,829 |
Schizosaccharomyces pombe |
5,043 |
3,237 |
|
2,949 |
Saccharomyces cerevisiae |
5,880 |
4,854 |
|
4,373 |
Kluyveromyces lactis |
5,335 |
4,462 |
|
4,385 |
Eremothecium gossypii |
4,722 |
3,933 |
|
3,889 |
Magnaporthe grisea |
12,832* |
7,295 |
|
6,364 |
Neurospora crassa |
10,079 |
6,175 |
|
6,039 |
Arabidopsis thaliana |
27,165* |
19,850 |
|
11,226 |
Oryza sativa |
26,887 |
17,330 |
|
10,674 |
Plasmodium falciparum |
5,266 |
2,440 |
|
1,130 |
'*' indicates organisms where new genome annotation data is used in this build.
Last updated on: Sat Aug 8 2009
We have recently adopted a new build procedure that makes use of amino acid sequence searching (blastp) to find more distant relationships, but the procedure still refers to the DNA sequence for computation of some of the statistics. The matching strategy is guided by the taxonomic tree such that more closely related organisms are compared first. Moreover, HomoloGene entries now include paralogs in addition to orthologs.
Sources of Additional Information
HomoloGene entries have been augumented with homology and phenotype information drawn from the following sources. Online Mendelian Inheritance in Man (OMIM)
Mouse Genome Informatics (MGI)
Zebrafish Information Network (ZFIN)
Saccharomyces Genome Database (SGD)
Clusters of Orthologous Groups (COG)
FlyBase
|
|
|
HomoloGene release 64 is now public. It includes updated annotations for the following species: Homo sapiens (NCBI release 37.1), Caenorhabditis elegans (WS190, NCBI release 8.1), Anopheles gambiae (AgamP3.3, NCBI release 3.1), Arabidopsis thaliana (NCBI release 8.1), Bos taurus (NCBI release 3.1), and Magnaporthe grisea (NCBI release 3.1).
|
| |
COGs
Phylogenetic classification of proteins encoded in complete genomes.
|
|
|