Search Search Status Download FAQ Help

Sequence Libraries

 

OMSSA searches protein libraries formatted for BLAST.  A precompiled set of popular libraries such as nr can be found here (note that Swiss-Prot is implemented as a "mask" on nr, so you need to download both sets of files).  You can also use the program formatdb to create your own protein sequence libraries from various formats, such as fasta.  formatdb can be found in the BLAST executable distribution -- look for an archive that begins with "blast" and is specific to the operating system that you use. A sample command line is "formatdb -i sequences.fasta -o F -p T ". This will will create a set of files that contain the sequences and other information for speedy access.

The library files can be placed in the same folder as the OMSSACL executable, but ideally should be kept in a separate folder.  If you wish to keep the libraries in another folder, you need to tell OMSSA where to find the blast libraries. The simplest way is to specify the folder when specifying the library, e.g. "omssacl -d /path/to/your/blast/libraries/library_name" where library_name is the name of the library without the ".p*" extensions.

Another way to use another folder is to specify the follow the operating system-specific instructions below:

 1. On Linux and MacOS X you need to have a file called .ncbirc in your home folder which points to the blast library folder.  Example file contents:

[BLAST]
BLASTDB=/path/to/your/blast/libraries

Please do not put quotes around the path name. 


 2. For Windows you need to have a file called ncbi.ini in your WINDOWS or WINNT folder which points to the BLAST library folder.  Example file contents:

[BLAST]
BLASTDB=c:\\path\\to\\your\\blast\\libraries

Please do not put quotes around the path name. Note that backslashes are repeated.

Note that some libraries are implemented as a "mask" (aka subset) on another library.  You can tell if a library is a mask if there is a file that ends with .msk.  In this case, you will need the library that is being masked.  For example, in the precompiled set, swissprot is a mask on nr, which means you need to download both swissprot and nr to search swissprot.

Write to the Help Desk | Disclaimer | Privacy statement | Accessibility |
NCBI Home NCBI Search NCBI SiteMap