Your browser version may not work well with NCBI's Web applications. More information here...
HomoloGene is a system for automated detection of homologs among the annotated genes of several completely sequenced eukaryotic genomes.
HomoloGene Release 64 Statistics



Initial numbers of genes from complete genomes, numbers of genes placed in a homology group, and the numbers of groups for each species.

Species   Number of Genes   HomoloGene
  Input Grouped   groups
Homo sapiens 22,165* 19,571   18,876
Pan troglodytes 25,096  17,243   16,375
Canis lupus familiaris 19,766  16,789   15,996
Bos taurus 22,049* 19,803   16,276
Mus musculus 25,388  21,786   19,026
Rattus norvegicus 21,991  19,267   17,512
Gallus gallus 17,959  13,207   11,969
Danio rerio 26,288  20,764   13,900
Drosophila melanogaster 14,085  9,315   7,796
Anopheles gambiae 12,460* 8,944   7,618
Caenorhabditis elegans 20,155* 8,685   4,829
Schizosaccharomyces pombe 5,043  3,237   2,949
Saccharomyces cerevisiae 5,880  4,854   4,373
Kluyveromyces lactis 5,335  4,462   4,385
Eremothecium gossypii 4,722  3,933   3,889
Magnaporthe grisea 12,832* 7,295   6,364
Neurospora crassa 10,079  6,175   6,039
Arabidopsis thaliana 27,165* 19,850   11,226
Oryza sativa 26,887  17,330   10,674
Plasmodium falciparum 5,266  2,440   1,130


'*' indicates organisms where new genome annotation data is used in this build.


Last updated on: Sat Aug 8 2009



We have recently adopted a new build procedure that makes use of amino acid sequence searching (blastp) to find more distant relationships, but the procedure still refers to the DNA sequence for computation of some of the statistics. The matching strategy is guided by the taxonomic tree such that more closely related organisms are compared first. Moreover, HomoloGene entries now include paralogs in addition to orthologs.




Sources of Additional Information



HomoloGene entries have been augumented with homology and phenotype information drawn from the following sources.

Online Mendelian Inheritance in Man (OMIM)

Mouse Genome Informatics (MGI)

Zebrafish Information Network (ZFIN)

Saccharomyces Genome Database (SGD)

Clusters of Orthologous Groups (COG)

FlyBase

 

What's New
HomoloGene release 64 is now public. It includes updated annotations for the following species: Homo sapiens (NCBI release 37.1), Caenorhabditis elegans (WS190, NCBI release 8.1), Anopheles gambiae (AgamP3.3, NCBI release 3.1), Arabidopsis thaliana (NCBI release 8.1), Bos taurus (NCBI release 3.1), and Magnaporthe grisea (NCBI release 3.1).



Tip of The Day




Related Resources


Entrez Genomes


A collection of complete genome sequences that includes more than 1000 viruses and over hundred microbes

  Archaea

  Bacteria

  Eukaryota

  Viruses



  COGs

Phylogenetic classification of proteins encoded in complete genomes.