thanks for all your help people.
here is a (perhaps) better description of what i want to do:
first, I want to allow the user to input a literal sequence to match such
as 'ACCCUCUAUUCUC', and also allow the user to match arbitrary seqeunce
elements of any length and then be able to match that pattern again, the
reverse, or the reverse complement of that sequence. I also want to allow
the user to input his own pairing rules for revcomp matches. For example,
if you would want to match an RNA hairpin structure of 10-20 bases with a
4 base loop, allowing for non watson and crick pairing, I would think of
it this way:
match any sequence that is a minimum length of 10 to a maximum length of
20, followed by any 4 bases, followed by the reverse complement of what
was matched in the first part as defined by the pairing rules A-U, C-G,
G-C, U-A, U-G, G-U.
in my program this would be:
(AU, CG, GC, UA, UG, GU)
1s=10-20
4-4
1r
of course, i don't want any of these particular values hard wired, I want
them all to be input by the user. the elements I want to include (so far)
are these:
-arbitrary sequence of length N to M for matching later: 1s=N-M
-literal: some string of ACGU such as: ACCCUAUA
-literal to match and remember for later: 1s=ACCCUA (for example)
-match a remembered sequence again: 1s (for example to find tandem
repeats 1 to 10 bases long I would use:
1s=1-10
1s
-match the reverse of a sequence already matched: 1p (for example to
find inverted repeats 1 to 10 bases long I would use:
1s=1-10
1p
-match the revcomp of a sequence already matched: 1r (for example to
find hairpins:
1s=1-10
1r
-pairing rules: (AU, CG, UA, GC, AG, GA, UG, GU)
I also want to eventually do this with approximate matching to allow
mismatches, insertions, and deletions.
I am essentially trying to duplicate the PatSearch program (Nucleic Acids
research 2003, 31(13):360

that is available on the web, but not as an
executable I can install locally.
Thanks for the offer to help and let me know what more information you
need from me.