Debian Clusters for Education and Research: The Missing Manual

MpiBLAST: Nucleotide/Protein Searching

From Debian Clusters

Jump to: navigation, search

About mpiBLAST

mpiBLAST is a parallel implementation of the Blast algorithm. BLAST, or the Basic Local Alignment Search Tool, is used to search large databases. "Local alignment" refers to the way BLAST compares sequences. When dealing with a potentially large sequences, rather than looking for a match in entirety, it looks for smaller patches that match. When it finds a match, it evaluates the probability that this match would occur by chance. If it's highly likely the match occurred by chance, it probably isn't very statistically significant. These matches can be searched for in either a string of nucleotides (which make up DNA) or amino acids (which make up proteins). When running BLAST, you select which type your sequence is and choose the databases of known sequences you're interested in comparing it against.

mpiBlast divides the work of searching these databases up amongst multiple processes (and processors, if available) by using MPI. First, the database to be searched is split into as many pieces are there will be processes. Then, the processes communicate amongst themselves using MPI. You'll need to have MPICH or another implementation of MPI installed in order for mpiblast to work.

There are two pages about mpiblast:

References

Personal tools