Biosequences - general
Basic alphabet ? = { a, t/u, c, g} (DNA/RNA)? = {A, C, .., Y} (Protein sequence)
Character group alphabet ? = {g1…gn} (e.g. amino-acid class)
Wild card X = { x(n1,n2) | n1<n2 ? N}
V(x(c1,c2)) set of all words over ? of length between c1 and c2
Pattern P = p1…pn , pi ?? ? ? ? X
? character & position constraints ?