Scientific Supercomputing at the NIH

HMMER on Helix

Profile hidden Markov models for biological sequence analysis

Profile hidden Markov models (profile HMMs) can be used to do sensitive database searching using statistical descriptions of a sequence family's consensus. HMMER uses profile HMMs, and can be useful in situations like:

HMMER (pronounced 'hammer', as in a more precise mining tool than BLAST) was developed by Sean Eddy at Washington University in St. Louis. The HMMER website is hmmer.janelia.org.

HMMER User Guide (PDF)

HMMER programs

The best way to understand HMMER is to read and follow the Tutorial in the HMMER User Guide.

Version

The output of the HMMER programs include the version number at the top.

Running a HMMER program on Helix

HMMER programs can be run interactively on Helix.
Sample run (user input in bold):

[user@helix mydir]$ hmmpfam /fdb/fastadb/pfam/Pfam_ls myseq 
hmmpfam - search one or more sequences against HMM database
HMMER 2.3.2 (Oct 2003)
Copyright (C) 1992-2003 HHMI/Washington University School of Medicine
Freely distributed under the GNU General Public License (GPL)
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
-
HMM file:                 /fdb/fastadb/pfam/Pfam_ls
Sequence file:            myseq
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Query sequence: 7LES_DROME
Accession:      P13368
Description:    SEVENLESS PROTEIN (EC 2.7.1.112).

Scores for sequence family classification (score includes all domains):
Model           Description                             Score E-value    N 
--------        -----------                             ----- --------- ---
Pkinase_Tyr     Protein tyrosine kinase                 487.3 1.8e-143   1
fn3             Fibronectin type III domain             177.0 4.9e-50    6
Pkinase         Protein kinase domain                   134.0 4.3e-37    1
Ldl_recept_b    Low-density lipoprotein receptor repe     9.0 0.18       1
Vicilin_N       Vicilin N terminal region               -89.9 1.7        1 
DUF1478         Protein of unknown function (DUF1478)   -86.2 1.8        1
Cys_knot        Cystine-knot domain                     -39.0 2          1
NHL             NHL repeat                                7.7 2          1
LSR             Lipolysis stimulated receptor (LSR)     -19.2 2          1
APH             Phosphotransferase enzyme family        -35.1 2.1        1
Fzo_mitofusin   fzo-like conserved region              -100.4 3.2        1
[...]