Biotech > FAQ > BioPerl FAQ (Frequently Asked Questions)

How do I do motif searches with BioPerl? Can I do "find all sequences that are 75% identical" to a given motif?

There are several approaches. Within BioPerl, take a look at Bio::Tools::SeqPattern. Alternatively, consider the TFBS package, a BioPerl-compliant package that specializes in searching nucleotide sequences for patterns using matrices.
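To make the idea of pattern searching concrete, here is a minimal sketch of exact motif matching with IUPAC ambiguity codes. It is written in Python purely for illustration and is not the Bio::Tools::SeqPattern or TFBS API; the names `IUPAC`, `iupac_to_regex`, and `find_motif` are hypothetical helpers invented for this example.

```python
import re

# IUPAC nucleotide ambiguity codes mapped to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]",
    "B": "[CGT]", "D": "[AGT]", "H": "[ACT]", "V": "[ACG]",
    "N": "[ACGT]",
}

def iupac_to_regex(motif):
    """Translate an IUPAC motif (hypothetical helper) into a regex string."""
    return "".join(IUPAC[base] for base in motif.upper())

def find_motif(motif, sequence):
    """Return 0-based start positions of all matches, including overlapping ones."""
    # Wrapping the pattern in a lookahead lets overlapping matches be reported.
    pattern = re.compile("(?=(" + iupac_to_regex(motif) + "))")
    return [m.start() for m in pattern.finditer(sequence.upper())]

if __name__ == "__main__":
    print(find_motif("TATAWA", "GGTATAAATATATAGG"))
```

This only handles exact matches against the expanded pattern; it does not score partial matches or use position weight matrices.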

A combination of BioPerl and Perl's regular expressions might also do the trick. For the percent-match query specifically, consider the CPAN module String::Approx, which performs approximate matching. Note, however, that experienced users question whether its distance estimates are correct, and the Unix agrep command is thought to be faster and more accurate. Finally, you could use EMBOSS, as discussed in the previous question (or use Pise to run EMBOSS applications). The relevant programs are fuzzpro for protein sequences and fuzznuc for nucleotide sequences.
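For the "75% identical" part of the question, the simplest interpretation can be sketched without any of the tools above: slide a window the length of the motif along the sequence and count matching positions. The following Python sketch assumes ungapped comparison (substitutions only, no insertions or deletions), which may differ from how String::Approx, agrep, or the EMBOSS programs define a mismatch. The function name `find_approximate` and the parameter `min_identity` are hypothetical.

```python
def find_approximate(motif, sequence, min_identity=0.75):
    """Return (start, identity) for each window at least min_identity identical to motif.

    Identity is the fraction of positions where the window and the motif
    carry the same character; gaps are not considered.
    """
    motif = motif.upper()
    sequence = sequence.upper()
    width = len(motif)
    hits = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        matches = sum(1 for a, b in zip(motif, window) if a == b)
        identity = matches / width
        if identity >= min_identity:
            hits.append((start, identity))
    return hits

if __name__ == "__main__":
    seq = "TTACGTACGTTTACGAACCTTT"
    for start, identity in find_approximate("ACGTACGT", seq):
        print(start, seq[start:start + 8], identity)
```

The scan costs time proportional to sequence length times motif length, which is fine for short motifs but may be slow on whole genomes; dedicated tools are likely a better fit there.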
