Page tree

Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

To search for multiple residues simultaneously enter residues in advanced search box separated with the pipe (|) symbol.
Example: atgttc|atggc
This will return a search for atgttc and atggc separately.

Regular expression

...

, wild cards, and nucleotide symbols

IGB searching supports regular expressions and wild cards. This is especially useful when searching for sequence motifs, such as transcription factor binding sites.

Searching by nucleotide symbols is available in IGB versions 9.1.12 and above.

Example queries:

Pattern

Represents

Example

Finds

.

any single nucleotide

ACCT.T

ACCTTT, ACCTAT, ACCTGT, and ACCTCT (4 possibilities)

..

any two nucleotides

ACCT..T

ACCTAAT, ACCTATT, ACTAGT, Etc. (4 x 4 possibilities)

[CG]

C or G

ACCT[CG]TC

ACCTCTC and ACCTGTC

T{1,n}

1 to n T's

ACGGT{1,3}C

ACGGTC, ACGGTTC, ACGGTTTC

T*

Zero or more T's

ACGGT*C

ACGGC, ACGGTC, ACGGTTC, ACGGTTTC, ACGGTTTTTTTTTTTTTTTTTTTTTTTTTTTC, Etc.

.*?

a string of any length containing any nucleotides

TCGGGGTTAA.*?CTGGACTC

Many possibilities.

Because this allows for so many possibilities, it only recommended with a limited scope of search and/or with very specific (several specified base pairs) on both ends.

.*

the longest possible string of any length containing any nucleotides

TCGGGGTTAA.*CTGGACTC

Differs from the search above in that the longest possible result(s) will be found.

Bear in mind that the result returned from this search with depend on the scope of the search, ie how much of the genomic sequence has been loaded and is available for searching.

RA or GGCCRGCCA, GCCG
YC or TAGCYAGCC, AGCT
G or CAGCSAGCG, AGCC
WA or TAWGCAGCWAAGCAGCA, ATGCAGCT
KG or TAKGCAGCKAGGCAGCG, ATGCAGCT
A or CAMGCAGCMAAGCAGCA, ACGCAGCC
BC or G or TABGCAGCBACGCAGCC, AGGCAGCG, ATGCAGCT
DA or G or TADGCAGCDAAGCAGCA, AGGCAGCG, ATGCAGCT
A or C or TAGHCAGCHAGACAGCA, AGCC, AGTCAGCT
A or C or GAGVCAGCVAGACAGCA, AGCC, AGGCAGCG
Nany base i.e. A or G or T or CAGCNAGCA, AGCG, AGCT, AGCC
\QN\ENAGC\QNNN\EAGCNNN

More information about regular expressions is available from http://docs.oracle.com/javase/7/docs/index.html.