                         SEQUENCE LISTING

<110>  Salk Institute for Biological Studies
 
<120>  METHODS AND COMPOSITIONS FOR EXPRESSION OF EDITING PROTEINS

<130>  7158-102574-22

<150>  63/189,048
<151>  2021-05-14

<160>  226

<170>  PatentIn version 3.5

<210>  1
<211>  1491
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  1
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      120

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      180

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      240

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac      300

catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg      360

atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg      420

ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt      480

acggtgggag gtctatataa gcagagcttg gatgttgcct ttacttctag gcgcgccgcc      540

accatggtga gcaagggcga ggagctgttc accggggtgg tgcccatcct ggtcgagctg      600

gacggcgacg taaacggcca caagttcagc gtgtccggcg agggcgaggg cgatgccacc      660

tacggcaagc tgaccctgaa gttcatctgc accaccggca agctgcccgt gccctggccc      720

accctcgtga ccaccttcgg ctacggcctg atgtgcttcg cccgctaccc cgaccacatg      780

aagcagcacg acttcttcaa gtccgccatg cccgaaggct acgtccagga gcgcaccatc      840

ttcttcaagg acgacggcaa ctacaagacc cgcgccgagg tgaagttcga gggcgacacc      900

ctggtgaacc gcatcgagct gaagggcatc gacttcaagg aggacggcaa catcctgggg      960

cacaagctgg agtacaacta caacagccac aacgtctata tcatggccga caagcagaag     1020

aacggcatca aggtaagtat tagctctttc tttccatggg ttggcctcgc cgcgtgggct     1080

gagggaagga ctgtcctggg actggacagg cgggttatgg gacctgaaaa gcggccctga     1140

aaaagggccg cgatgaaaac gaagcgagct aaagcctcct ctctcttctt cagaactcct     1200

ctcttttctc tcctccagga gttcttcctc tctcccttct tctcaaatgc tttctccctc     1260

tctcctgcat ttgagctcct tctttcctct ctcgacaatc cccttttctc cctcttgatt     1320

gtcgactagc tcgcaatcat cgcggtatca aaaagcggtc aggcagctaa accaaaaggt     1380

ttagcaattg cctctgatga gtcgctgaaa tgcgacgaaa accgcttttt ggtaccaata     1440

aaatatcttt attttcatta catctgtgtg ttggtttttt gtgtgactag t              1491


<210>  2
<211>  1302
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  2
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      120

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      180

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      240

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac      300

catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg      360

atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg      420

ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt      480

acggtgggag gtctatataa gcagagcttg gatgttgcct ttacttctag gcgcgccgcg      540

gaaaaccgcg ggataccgcg atgattgcga gctagtcgac aatcaagagg gagaaaaggg      600

gattgtcgag agaggaaaga aggagctcaa atgcaggaga gagggagaaa gcatttgaga      660

agaagggaga gaggaacaac tcgtggagga gagaaaagag acgagttgtg aagaagagag      720

aggaggcttt agctcgcttc gttttcatca ttattgcggc cctgaaaaag ggccgcttat      780

aacgttgctc gaattcgggt tatgggacca gtgaaggctg agggaaggac tgtcctggga      840

ctggacaggc gggttatggg acctgaaaat actaacaatc gatttttttt cccttttttt      900

ccaggtgaac ttcaagatcc gccacaacat cgaggacggc agcgtgcagc tcgccgacca      960

ctaccagcag aacaccccca tcggcgacgg ccccgtgctg ctgcccgaca accactacct     1020

gagctaccag tccgccctga gcaaagaccc caacgagaag cgcgatcaca tggtcctgct     1080

ggagttcgtg accgccgccg ggatcactct cggcatggac gagctgtaca aggacctttg     1140

agaattcctc acctgcgatc tcgatgcttt atttgtgaaa tttgtgatgc tattgcttta     1200

tttgtaacca ttataagctg caataaacaa gttaacaaca acaattgcat tcattttatg     1260

tttcaggttc agggggaggt gtgggaggtt ttttaaacta gt                        1302


<210>  3
<211>  404
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  3
gtaagtatta gctctttctt tccatgggtt ggcctcgccg cgtgggctga gggaaggact       60

gtcctgggac tggacaggcg ggttatggga cctgaaaagc ggccctgaaa aagggccgcg      120

atgaaaacga agcgagctaa agcctcctct ctcttcttca gaactcctct cttttctctc      180

ctccaggagt tcttcctctc tcccttcttc tcaaatgctt tctccctctc tcctgcattt      240

gagctccttc tttcctctct cgacaatccc cttttctccc tcttgattgt cgactagctc      300

gcaatcatcg cggtatcaaa aagcggtcag gcagctaaac caaaaggttt agcaattgcc      360

tctgatgagt cgctgaaatg cgacgaaaac cgctttttgg tacc                       404


<210>  4
<211>  382
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  4
acttctaggc gcgccgcgga aaaccgcggg ataccgcgat gattgcgagc tagtcgacaa       60

tcaagaggga gaaaagggga ttgtcgagag aggaaagaag gagctcaaat gcaggagaga      120

gggagaaagc atttgagaag aagggagaga ggaacaactc gtggaggaga gaaaagagac      180

gagttgtgaa gaagagagag gaggctttag ctcgcttcgt tttcatcatt attgcggccc      240

tgaaaaaggg ccgcttataa cgttgctcga attcgggtta tgggaccagt gaaggctgag      300

ggaaggactg tcctgggact ggacaggcgg gttatgggac ctgaaaatac taacaatcga      360

ttttttttcc ctttttttcc ag                                               382


<210>  5
<211>  489
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  5
atggtgagca agggcgagga gctgttcacc ggggtggtgc ccatcctggt cgagctggac       60

ggcgacgtaa acggccacaa gttcagcgtg tccggcgagg gcgagggcga tgccacctac      120

ggcaagctga ccctgaagtt catctgcacc accggcaagc tgcccgtgcc ctggcccacc      180

ctcgtgacca ccttcggcta cggcctgatg tgcttcgccc gctaccccga ccacatgaag      240

cagcacgact tcttcaagtc cgccatgccc gaaggctacg tccaggagcg caccatcttc      300

ttcaaggacg acggcaacta caagacccgc gccgaggtga agttcgaggg cgacaccctg      360

gtgaaccgca tcgagctgaa gggcatcgac ttcaaggagg acggcaacat cctggggcac      420

aagctggagt acaactacaa cagccacaac gtctatatca tggccgacaa gcagaagaac      480

ggcatcaag                                                              489


<210>  6
<211>  237
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  6
gtgaacttca agatccgcca caacatcgag gacggcagcg tgcagctcgc cgaccactac       60

cagcagaaca cccccatcgg cgacggcccc gtgctgctgc ccgacaacca ctacctgagc      120

taccagtccg ccctgagcaa agaccccaac gagaagcgcg atcacatggt cctgctggag      180

ttcgtgaccg ccgccgggat cactctcggc atggacgagc tgtacaagga cctttga         237


<210>  7
<211>  382
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  7
acttctaggc gcgccgcgga aaaccgcggg ataccgcgat gattgcgagc tagggagaga       60

gaggggaaag aaaagagaaa gaggaggagg aaagagggga gagaggggag ggaaaggaga      120

gaagggagga agggaagaaa gaaagaagag gaaaagaggg gaggaggagg agaaaggaga      180

aaaaaagaag ggaagggaga aaggctttag ctcgcttcgt tttcatcatt attgcggccc      240

tgaaaaaggg ccgcttataa cgttgctcga attcgggtta tgggaccagt gaaggctgag      300

ggaaggactg tcctgggact ggacaggcgg gttatgggac ctgaaaatac taacaatcga      360

ttttttttcc ctttttttcc ag                                               382


<210>  8
<211>  301
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  8
gtaagtgtcc cgcggaacat tattataacg ttgctcgaag atatcagatg gtgcgctcct       60

ggacgtagcc ttcgggcatg gcggacttga agaagtcgtg ctgcttcatg tggtcggggt      120

agcggctgaa gcactgcacg ccgtaggtca gggtggtcac gagggtgggc cagggcacgg      180

gcagcttgcc ggtggtgcag atgaacttca gggtcagctt gccgtaggtg gcatcgccct      240

cgccctcgcc ggacacgctg aacttgtggc cgtttacgtc gccgtccagc tcgactctag      300

a                                                                      301


<210>  9
<211>  326
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  9
gctagcgtcg agctggacgg cgacgtaaac ggccacaagt tcagcgtgtc cggcgagggc       60

gagggcgatg ccacctacgg caagctgacc ctgaagttca tctgcaccac cggcaagctg      120

cccgtgccct ggcccaccct cgtgaccacc ctgacctacg gcgtgcagtg cttcagccgc      180

taccccgacc acatgaagca gcacgacttc ttcaagtccg ccatgcccga aggctacgtc      240

caggagcgca ccatctccgc ggaacattat tataacgttg ctcgaatact aactggtacc      300

tcttcttttt tttttgatat ctgcag                                           326


<210>  10
<211>  278
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  10
gttgccttta cttctggcgc gccaaaaggc gtgccagaag taccgggcta ataatgtttc       60

gcggtcctct taaatctgcc taaatacgta taaatttgat cgccctgaaa aagggcgatc      120

aaagccctga aaaagggcat acgtagccct gaaaaagggc aggcagagcc ctgaaaaagg      180

gcaagaggac cgcggaacat tattagccgc caccatggac aggcgggtta tgggacctga      240

aaatactaac aatcgatttt ttttcccttt ttttccag                              278


<210>  11
<211>  190
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  11
acttctaggc gcgccgcgga aaaccgcggg atatcattat tgcggccctg aaaaagggcc       60

gcttataacg ttgctcgaat tcgggttatg ggaccagtga aggctgaggg aaggactgtc      120

ctgggactgg acaggcgggt tatgggacct gaaaatacta acaatcgatt ttttttccct      180

ttttttccag                                                             190


<210>  12
<211>  459
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  12
gtaagtatta gctctttctt tccatgggtt ggcctcgccg cgtgggctga gggaaggact       60

gtcctgggac tggacaggcg ggttatggga cctgaaaagc ggccctgaaa aagggccgcg      120

atgaaaacga agcgagctaa agcctcctct ctcttcttca gaactcctct cttttctctc      180

ctccaggagt tcttcctctc tcccttcttc tcaaatgctt tctccctctc tcctgcattt      240

gagctccttc tttcctctct cgacaatccc cttttctccc tcttgattgt cgactagctc      300

gcaatcatcg cggtatcaaa aagcggtcag gcagctaaac caaaaggttt agcaattgcc      360

tctgatgagt cgctgaaatg cgacgaaaac cgctttttgg taccaataaa atatctttat      420

tttcattaca tctgtgtgtt ggttttttgt gtgactagt                             459


<210>  13
<211>  382
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  13
acttctaggc gcgccgcgga aaaccgcggg ataccgcgat gattgcgagc tagtcgacaa       60

tcaagaggga gaaaagggga ttgtcgagag aggaaagaag gagctcaaat gcaggagaga      120

gggagaaagc atttgagaag aagggagaga ggaagaactc ctggaggaga gaaaagagag      180

gagttctgaa gaagagagag gaggctttag ctcgcttcgt tttcatcatt attgcggccc      240

tgaaaaaggg ccgcttataa cgttgctcga attcgggtta tgggaccagt gaaggctgag      300

ggaaggactg tcctgggact ggacaggcgg gttatgggac ctgaaaatac taacaatcga      360

ttttttttcc ctttttttcc ag                                               382


<210>  14
<211>  372
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  14
gtaagtatta agcggccctg aaaaagggcc gcgatgaaaa cgaagcgagc taaagcctcc       60

tctctcttct tcagaactcc tctcttttct ctcctccagg agttcttcct ctctcccttc      120

ttctcaaatg ctttctccct ctctcctgca tttgagctcc ttctttcctc tctcgacaat      180

ccccttttct ccctcttgat tgtcgactag ctcgcaatca tcgcggtatc aaaaagcggt      240

caggcagcta aaccaaaagg tttagcaatt gcctctgatg agtcgctgaa atgcgacgaa      300

aaccgctttt tggtaccaat aaaatatctt tattttcatt acatctgtgt gttggttttt      360

tgtgtgacta gt                                                          372


<210>  15
<211>  407
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  15
gtaagtatta gctctttctt tccatgggtt ggcctcgccg cgtgaagcgg ccctgaaaaa       60

gggccgcgat gaaaacgaag cgagctaaag cctcctctct cttcttcaga actcctctct      120

tttctctcct ccaggagttc ttcctctctc ccttcttctc aaatgctttc tccctctctc      180

ctgcatttga gctccttctt tcctctctcg acaatcccct tttctccctc ttgattgtcg      240

actagctcgc aatcatcgcg gtatcaaaaa gcggtcaggc agctaaacca aaaggtttag      300

caattgcctc tgatgagtcg ctgaaatgcg acgaaaaccg ctttttggta ccaataaaat      360

atctttattt tcattacatc tgtgtgttgg ttttttgtgt gactagt                    407


<210>  16
<211>  378
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  16
gtaagtatta gctctttctt tccatgggtt ggcctcgccg cgtgggctga gggaaggact       60

gtcctgggac tggacaggcg ggttatggga cctgaaaagc ggccctgaaa aagggccgcg      120

atgaaaacga agcgagctaa agcctcctct ctcttcttca gaactcctct cttttctctc      180

ctccaggagt tcttcctctc tcccttcttc tcaaatgctt tctccctctc tcctgcattt      240

gagctccttc tttcctctct cgacaatccc cttttctccc tcttgattgt cgactagctc      300

gcaatcatcg cggtatcggt accaataaaa tatctttatt ttcattacat ctgtgtgttg      360

gttttttgtg tgactagt                                                    378


<210>  17
<211>  309
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  17
acttctaggc gcgccgcgga aaaccgcggg ataccgcgat gattgcgagc tagtcgacaa       60

tcaagaggga gaaaagggga ttgtcgagag aggaaagaag gagctcaaat gcaggagaga      120

gggagaaagc atttgagaag aagggagaga ggaagaactc ctggaggaga gaaaagagag      180

gagttctgaa gaagagagag gaggctttag ctcgcttcgt tttcatcatt attgcggccc      240

tgaaaaaggg ccgcttataa cgttgctcga attctactaa caatcgattt tttttccctt      300

tttttccag                                                              309


<210>  18
<211>  419
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  18
atatcctttt agggcagagt gaagagttag gaggaaggtg gttgggagag ggatttccag       60

gccttaggac atcatgacag atgaaaacga agcgagctaa agcctcctct ctcttcttca      120

gaactcctct cttttctctc ctccaggagt tcttcctctc tcccttcttc tcaaatgctt      180

tctccctctc tcctgcattt gagctccttc tttcctctct cgacaatccc cttttctccc      240

tcttgattgt cgactagctc gcaatcatcg cggtatcaaa aagcggtcag gcagctaaac      300

caaaaggttt agcaattgcc tctgatgagt cgctgaaatg cgacgaaaac cgctttttgg      360

taccaataaa atatctttat tttcattaca tctgtgtgtt ggttttttgt gtgactagt       419


<210>  19
<211>  275
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  19
acttctaggc gcgccgcgga aaaccgcggg ataccgcgat gattgcgagc tagtcgacaa       60

tcaagaggga gaaaagggga ttgtcgagag aggaaagaag gagctcaaat gcaggagaga      120

gggagaaagc atttgagaag aagggagaga ggaacaactc gtggaggaga gaaaagagac      180

gagttgtgaa gaagagagag gaggctttag ctcgcttcgt tttcatcatt tccaggcctt      240

aggacatcat gacatttttc cttaactttg ctcac                                 275


<210>  20
<211>  3975
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  20
acttctaggc gcgccgccac catgggattc gtgcggcaga ttcagctgct gctgtggaag       60

aactggaccc tgcggaagcg gcagaaaatc agattcgtgg tggaactcgt gtggcccctg      120

agcctgtttc tggtgctgat ctggctgcgg aacgccaatc ctctgtacag ccaccacgag      180

tgtcacttcc ccaacaaggc catgccttct gccggaatgc tgccttggct gcagggcatc      240

ttctgcaacg tgaacaaccc ctgctttcaa agccccacac ctggcgaaag ccctggcatc      300

gtgtccaact acaacaacag catcctggcc agagtgtacc gggacttcca agagctgctg      360

atgaacgccc ctgagtctca gcacctgggc agaatctgga ccgagctgca catcctgagc      420

cagttcatgg acaccctgag aacacacccc gagagaatcg ccggcagggg catcagaatc      480

cgggacatcc tgaaggacga ggaaaccctg acactgttcc tcatcaagaa catcggcctg      540

agcgacagcg tggtgtacct gctgatcaac agccaagtgc ggcccgagca gtttgctcat      600

ggcgtgccag atctcgccct gaaggatatc gcctgttctg aggccctgct ggaacggttc      660

atcatcttca gccagcggag aggcgccaag accgtcagat atgccctgtg cagtctgagc      720

cagggaaccc tgcagtggat cgaggatacc ctgtacgcca acgtggactt cttcaagctg      780

ttccgggtgc tgcccacact gctggattct cggtcccaag gcatcaacct gagaagctgg      840

ggcggcatcc tgtccgacat gagcccaaga atccaagagt tcatccaccg gcctagcatg      900

caggacctgc tgtgggttac cagacctctg atgcagaacg gcggacccga gacattcacc      960

aagctgatgg gcattctgag cgatctgctg tgcggctacc ctgaaggcgg aggatctaga     1020

gtgctgagct tcaattggta cgaggacaac aactacaagg ccttcctggg catcgactcc     1080

accagaaagg accccatcta cagctacgac cggcggacaa ccagcttctg caatgccctg     1140

atccagagcc tggaaagcaa ccctctgacc aagatcgctt ggagggccgc caaacctctg     1200

ctgatgggaa agatcctgta cacccctgac agccctgccg ccagaagaat cctgaagaac     1260

gccaacagca ccttcgagga actggaacac gtgcgcaagc tggtcaaggc ctgggaagaa     1320

gtgggacctc agatctggta cttcttcgac aatagcaccc agatgaacat gatcagagac     1380

accctgggca accctaccgt gaaggacttc ctgaacagac agctgggcga agagggcatt     1440

accgccgagg ccatcctgaa ctttctgtac aagggcccca gagagtccca ggccgacgac     1500

atggccaact tcgattggcg ggacatcttc aacatcaccg acagaaccct gcggctggtc     1560

aaccagtacc tggaatgcct ggtgctggac aagttcgaga gctacaacga cgagacacag     1620

ctgacccaga gagccctgtc tctgctggaa gagaatatgt tctgggctgg cgtggtgttc     1680

cccgacatgt acccttggac aagcagcctg cctcctcacg tgaagtacaa gatccggatg     1740

gacatcgacg tggtcgaaaa gaccaacaag atcaaggacc ggtactggga cagcggccct     1800

agagctgatc ccgtggaaga ttttcgctac atctggggcg gattcgcata cctgcaggac     1860

atggtggaac agggaatcac acggtcccag gtgcaggctg aagctcctgt gggaatctac     1920

ctgcagcaga tgccttatcc ttgcttcgtg gacgacagct tcatgatcat cctgaatcgg     1980

tgcttcccca tcttcatggt gctggcctgg atctactccg tgtctatgac cgtgaagtcc     2040

atcgtgctgg aaaaagagct gcggctgaaa gagacactga agaaccaggg cgtgtccaat     2100

gccgtgatct ggtgcacctg gtttctggac agcttctcca ttatgagcat gagcatcttt     2160

ctgctgacga tcttcatcat gcacggccgg atcctgcact acagcgaccc ctttatcctc     2220

ttcctgttcc tgctggcctt ctccaccgct acaatcatgc tgtgttttct gctgtccacc     2280

ttcttctcca aagcctctct ggccgctgct tgtagcggcg tgatctactt caccctgtac     2340

ctgcctcaca tcctgtgctt cgcatggcag gacagaatga ccgccgagct gaagaaagct     2400

gtgtccctgc tgagccctgt ggcctttggc tttggcaccg agtacctcgt cagatttgag     2460

gaacaaggac tgggactgca gtggtccaac atcggcaata gccctacaga gggcgacgag     2520

ttcagcttcc tgctgtctat gcaaatgatg ctgctggacg ccgccgtgta tggactgctg     2580

gcttggtatc tggaccaggt gttccctgcc gattacggca ctcctctgcc ttggtatttc     2640

ctgctgcaag agagctactg gctcggcggc gagggatgta gcaccagaga agaaagagcc     2700

ctggaaaaga ccgagcctct gaccgaggaa acagaggacc ctgaacaccc agagggcatc     2760

cacgatagct ttttcgagag agaacacccc ggctgggtgc caggcgtgtg tgtgaagaat     2820

ctggtcaaga tcttcgagcc ctgcggcaga cctgccgtgg acagactgaa catcaccttc     2880

tacgagaacc agattaccgc ctttctgggc cacaacggcg ctggcaagac aaccacactg     2940

agcatcctca ccggcctgct gcctccaaca agcggcacag ttctcgttgg cggcagagac     3000

atcgagacaa gcctggatgc cgtcagacag tccctgggca tgtgccctca gcacaacatc     3060

ctgtttcacc acctgaccgt ggccgagcac atgctgtttt atgcccagct gaagggcaag     3120

agccaagaag aggctcagct ggaaatggaa gccatgctcg aggacaccgg cctgcaccac     3180

aagagaaatg aggaagccca ggatctgagc ggcggcatgc agagaaaact gagcgtggcc     3240

attgccttcg tgggcgacgc caaggttgtg atcctggatg agcctacaag cggcgtggac     3300

ccttacagca gaagatccat ctgggatctg ctgctgaagt acagaagcgg ccggaccatc     3360

atcatgagca cccaccacat ggacgaggcc gatctgctcg gagacagaat cgccatcatt     3420

gctcagggca gactgtactg cagcggcacc ccactgtttc tgaagaactg tttcggcacc     3480

ggactgtatc tgaccctcgt gcggaagatg aagaacatcc agtctcagcg gaagggcagc     3540

gagggcacct gtagctgttc tagcaagggc tttagcacca cctgtccagc tcacgtggac     3600

gatctgaccc ctgaacaggt gctggatggc gacgtgaacg agctgatgga cgtggtgctg     3660

caccatgtgc ctgaggccaa gctggtggaa tgcatcggcc aggtaagtat tagctctttc     3720

tttccatggg ttggcctcgc cgcgtgggct gagggaagga ctgtcctggg actggacagg     3780

cgggttatgg gacctgaagc gataaaaggc atgcacgttt gcggctacgt gcatgccaaa     3840

aggagtcggg cttgcctccg tgcccgactc caaaagacct gctcgaggag gtggacgagc     3900

aggtcaaaaa tccgggtacc aataaaatat ctttattttc attacatctg tgtgttggtt     3960

ttttgtgtga ctagt                                                      3975


<210>  21
<211>  3611
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  21
aggatttttg acctgctcga ttgtccactg cgagcaggtc ttttggagtc gggcgaggcg       60

gaagcccgac tccttttggc atgcacgcta gccgcgtcgt gcatgccttt tatcgaattc      120

gggttatggg accagtgaag gctgagggaa ggactgtcct gggactggac aggcgggtta      180

tgggacctga aaatactaac aatcgatttt ttttcccttt ttttccagga actgattttt      240

ctgctcccga acaagaactt caagcaccgg gcctacgcca gcctgttcag agagctggaa      300

gaaaccctgg ccgacctggg cctgtctagc tttggcatca gcgacacccc tctcgaagag      360

atcttcctga aagtgacaga ggacagcgat agcggccctc tgtttgctgg cggagcacag      420

caaaagcgcg agaacgtgaa ccctagacac ccctgtctgg gcccaagaga gaaagccgga      480

cagacccctc aggacagcaa tgtgtgctct cctggtgctc ctgccgctca tcctgaggga      540

caacctccac ctgaacctga gtgtcctgga cctcagctga acaccggaac acagctggtt      600

ctgcagcacg tgcaggctct gctcgtgaag agattccagc acaccatcag aagccacaag      660

gactttctgg cccagatcgt gctgcccgcc acctttgttt ttctggctct gatgctgagc      720

atcgtgatcc ctccattcgg cgagtacccc gctctgacac tgcacccttg gatctacggc      780

cagcagtaca cctttttctc catggacgaa cccggcagcg agcagttcac agtgctggct      840

gatgtcctgc tgaacaagcc cggcttcggc aaccggtgtc tgaaagaagg atggctgcct      900

gagtaccctt gcggcaacag cacaccttgg aaaaccccta gcgtgtcccc taacatcacc      960

cagctgttcc aaaagcagaa atggacccaa gtgaacccct ctccatcctg ccggtgctcc     1020

acaagggaaa agctgaccat gctgcccgag tgtccagaag gcgctggcgg acttcctcca     1080

cctcagagaa cacagagatc caccgagatt ctccaggacc tgaccgaccg gaatatcagc     1140

gacttcctgg ttaagacata ccccgcactg atccggtcca gcctgaagtc caagttctgg     1200

gtcaacgaac agagatacgg cggcatcagc atcggcggaa aactgcctgt ggtgcctatc     1260

acaggcgagg cccttgtggg ctttctgtcc gatctgggga gaatcatgaa cgtgtccggc     1320

ggacctatca ccagggaagc cagcaaagag atccccgatt tcctgaagca cctggaaacc     1380

gaggacaata tcaaagtgtg gttcaacaac aaaggatggc acgccctcgt gtcttttctg     1440

aacgtggccc acaatgccat cctgcgggct agcctgccta aggacagaag ccctgaggaa     1500

tacggcatca ccgtgatctc ccagcctctg aatctgacca aagagcagct gagcgagatc     1560

accgtgctga ccacctctgt ggatgctgtg gtggccatct gcgtgatctt cagcatgagc     1620

ttcgtgcccg cctccttcgt gctgtacctg attcaagaga gagtgaacaa gagcaagcac     1680

ctccagttca tctccggggt gtccccaacc acctactggg tcaccaattt tctgtgggac     1740

atcatgaact acagcgtgtc agccggcctg gtcgtgggca tctttatcgg ctttcaaaag     1800

aaggcctaca cgagccccga gaacctgcct gctttggttg ctctgctgct cctgtatggc     1860

tgggccgtga ttcccatgat gtaccccgcc agctttctgt ttgacgtgcc cagcacagcc     1920

tacgtggccc tgtcttgcgc caatctgttc atcggcatca acagcagcgc catcacattc     1980

atcctggaac tgttcgagaa caacaggacc ctgctgcggt tcaacgccgt gctgcggaaa     2040

ctgctgatcg tgttccctca cttctgtctc ggccggggcc tgatcgacct ggctctgtct     2100

caagccgtga ccgatgtgta cgccagattt ggcgaggaac actccgccaa tccattccac     2160

tgggacctga tcggcaagaa cctgttcgcc atggtggtgg aaggcgtcgt gtacttcctg     2220

ctcactctgc tggtgcagag acactttttt ctgtcccaat ggatcgccga gcctaccaaa     2280

gaacccattg tggacgagga cgacgatgtg gccgaggaaa gacagagaat catcaccggc     2340

ggcaacaaga ccgatatcct gagactgcac gagctgacaa agatctaccc cggcacaagc     2400

tccccagccg tggataggct ttgtgtggga gttagacccg gcgagtgctt tggcctgctg     2460

ggagttaatg gcgccggaaa gaccaccacc ttcaagatgc tgaccggcga caccacagtg     2520

acaagcggag atgctacagt ggccggcaag agcatcctga ccaacatcag cgaagtgcat     2580

cagaacatgg gctactgccc tcagttcgac gccatcgacg aactgctgac aggccgcgaa     2640

cacctgtatc tgtatgccag actgagaggc gtgcccgctg aagagatcga gaaggtggcc     2700

aactggtcca tcaagtctct gggcctgaca gtgtacgccg actgtctggc cggaacatac     2760

agcggaggaa acaagcggaa gctgagcacc gccattgctc tgatcggatg cccacctctg     2820

gtcctgctgg atgaacccac caccggaatg gatccccagg ctagaagaat gctctggaac     2880

gtgatcgtgt ctatcatccg cgagggcaga gctgtggtgc tgacctctca ctccatggaa     2940

gagtgcgagg ctctgtgtac ccggctggcc attatggtca agggcgcctt cagatgcatg     3000

ggcaccattc agcatctgaa aagcaagttc ggcgacggct acatcgtgac aatgaagatc     3060

aagagcccca aggacgacct cctgcctgat ctgaaccccg tggaacagtt ttttcagggc     3120

aacttccccg gctccgtgca gcgggaaaga cactataaca tgctgcagtt tcaggtgtcc     3180

tcctccagcc tggctcggat ctttcaactg ctgctctctc acaaggacag cctgctgatt     3240

gaagagtaca gcgtgacaca gaccacactc gaccaggttt tcgtgaactt cgccaagcag     3300

cagaccgaga gccacgacct gcctctgcat cctcgggccg ctggtgcctc tagacaagct     3360

caggacggcg ctcgggctga ctacaaagac catgacggtg attataaaga tcatgacatc     3420

gactataagg atgacgatga caaatgaggt accaattcct cacctgcgat ctcgagcttt     3480

atttgtgaaa tttgtgatgc tattgcttta tttgtaacca ttataagctg caataaacaa     3540

gttaacaaca acaattgcat tcattttatg tttcaggttc agggggaggt gtgggaggtt     3600

ttttaaacta g                                                          3611


<210>  22
<211>  3975
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  22
acttctaggc gcgccgccac catggcccca aagaagaagc ggaaggtcgg tatccacgga       60

gtcccagcag ccaagcggaa ctacatcctg ggcctggaca tcggcatcac cagcgtgggc      120

tacggcatca tcgactacga gacacgggac gtgatcgatg ccggcgtgcg gctgttcaaa      180

gaggccaacg tggaaaacaa cgagggcagg cggagcaaga gaggcgccag aaggctgaag      240

cggcggaggc ggcatagaat ccagagagtg aagaagctgc tgttcgacta caacctgctg      300

accgaccaca gcgagctgag cggcatcaac ccctacgagg ccagagtgaa gggcctgagc      360

cagaagctga gcgaggaaga gttctctgcc gccctgctgc acctggccaa gagaagaggc      420

gtgcacaacg tgaacgaggt ggaagaggac accggcaacg agctgtccac caaagagcag      480

atcagccgga acagcaaggc cctggaagag aaatacgtgg ccgaactgca gctggaacgg      540

ctgaagaaag acggcgaagt gcggggcagc atcaacagat tcaagaccag cgactacgtg      600

aaagaagcca aacagctgct gaaggtgcag aaggcctacc accagctgga ccagagcttc      660

atcgacacct acatcgacct gctggaaacc cggcggacct actatgaggg acctggcgag      720

ggcagcccct tcggctggaa ggacatcaaa gaatggtacg agatgctgat gggccactgc      780

acctacttcc ccgaggaact gcggagcgtg aagtacgcct acaacgccga cctgtacaac      840

gccctgaacg acctgaacaa tctcgtgatc accagggacg agaacgagaa gctggaatat      900

tacgagaagt tccagatcat cgagaacgtg ttcaagcaga agaagaagcc caccctgaag      960

cagatcgcca aagaaatcct cgtgaacgaa gaggatatta agggctacag agtgaccagc     1020

accggcaagc ccgagttcac caacctgaag gtgtaccacg acatcaagga cattaccgcc     1080

cggaaagaga ttattgagaa cgccgagctg ctggatcaga ttgccaagat cctgaccatc     1140

taccagagca gcgaggacat ccaggaagaa ctgaccaatc tgaactccga gctgacccag     1200

gaagagatcg agcagatctc taatctgaag ggctataccg gcacccacaa cctgagcctg     1260

aaggccatca acctgatcct ggacgagctg tggcacacca acgacaacca gatcgctatc     1320

ttcaaccggc tgaagctggt gcccaagaag gtggacctgt cccagcagaa agagatcccc     1380

accaccctgg tggacgactt catcctgagc cccgtcgtga agagaagctt catccagagc     1440

atcaaagtga tcaacgccat catcaagaag tacggcctgc ccaacgacat cattatcgag     1500

ctggcccgcg agaagaactc caaggacgcc cagaaaatga tcaacgagat gcagaagcgg     1560

aaccggcaga ccaacgagcg gatcgaggaa atcatccgga ccaccggcaa agagaacgcc     1620

aagtacctga tcgagaagat caagctgcac gacatgcagg aaggcaagtg cctgtacagc     1680

ctggaagcca tccctctgga agatctgctg aacaacccct tcaactatga ggtggaccac     1740

atcatcccca gaagcgtgtc cttcgacaac agcttcaaca acaaggtgct cgtgaagcag     1800

gaagaaaaca gcaagaaggg caaccggacc ccattccagt acctgagcag cagcgacagc     1860

aagatcagct acgaaacctt caagaagcac atcctgaatc tggccaaggg caagggcaga     1920

atcagcaaga ccaagaaaga gtatctgctg gaagaacggg acatcaacag gttctccgtg     1980

cagaaagact tcatcaaccg gaacctggtg gataccagat acgccaccag aggcctgatg     2040

aacctgctgc ggagctactt cagagtgaac aacctggacg tgaaagtgaa gtccatcaat     2100

ggcggcttca ccagctttct gcggcggaag tggaagttta agaaagagcg gaacaagggg     2160

tacaagcacc acgccgagga cgccctgatc attgccaacg ccgatttcat cttcaaagag     2220

tggaagaaac tggacaaggc caaaaaagtg atggaaaacc agatgttcga ggaaaagcag     2280

gccgagagca tgcccgagat cgaaaccgag caggagtaca aagagatctt catcaccccc     2340

caccagatca agcacattaa ggacttcaag gactacaagt acagccaccg ggtggacaag     2400

aagcctaata gagagctgat taacgacacc ctgtactcca cccggaagga cgacaagggc     2460

aacaccctga tcgtgaacaa tctgaacggc ctgtacgaca aggacaatga caagctgaaa     2520

aagctgatca acaagagccc cgaaaagctg ctgatgtacc accacgaccc ccagacctac     2580

cagaaactga agctgattat ggaacagtac ggcgacgaga agaatcccct gtacaagtac     2640

tacgaggaaa ccgggaacta cctgaccaag tactccaaaa aggacaacgg ccccgtgatc     2700

aagaagatta agtattacgg caacaaactg aacgcccatc tggacatcac cgacgactac     2760

cccaacagca gaaacaaggt cgtgaagctg tccctgaagc cctacagatt cgacgtgtac     2820

ctggacaatg gcgtgtacaa gttcgtgacc gtgaagaatc tggatgtgat caaaaaagaa     2880

aactactacg aagtgaatag caagtgctat gaggaagcta agaagctgaa gaagatcagc     2940

aaccaggccg agtttatcgc ctccttctac aacaacgatc tgatcaagat caacggcgag     3000

ctgtatagag tgatcggcgt gaacaacgac ctgctgaacc ggatcgaagt gaacatgatc     3060

gacatcacct accgcgagta cctggaaaac atgaacgaca agaggccccc caggatcatt     3120

aagacaatcg ccggaagcgg agctactaac ttcagcctgc tgaagcaggc tggagacgtg     3180

gaggagaacc ctggacctag gcgcgccgcc accatggtga gcaagggcga ggagctgttc     3240

accggggtgg tgcccatcct ggtcgagctg gacggcgacg taaacggcca caagttcagc     3300

gtgtccggcg agggcgaggg cgatgccacc tacggcaagc tgaccctgaa gttcatctgc     3360

accaccggca agctgcccgt gccctggccc accctcgtga ccaccttcgg ctacggcctg     3420

atgtgcttcg cccgctaccc cgaccacatg aagcagcacg acttcttcaa gtccgccatg     3480

cccgaaggct acgtccagga gcgcaccatc ttcttcaagg acgacggcaa ctacaagacc     3540

cgcgccgagg tgaagttcga gggcgacacc ctggtgaacc gcatcgagct gaagggcatc     3600

gacttcaagg aggacggcaa catcctgggg cacaagctgg agtacaacta caacagccac     3660

aacgtctata tcatggccga caagcagaag aacggcatca aggtaagtat tagctctttc     3720

tttccatggg ttggcctcgc cgcgtgggct gagggaagga ctgtcctggg actggacagg     3780

cgggttatgg gacctgaagc gataaaaggc atgcacgttt gcggctacgt gcatgccaaa     3840

aggagtcggg cttgcctccg tgcccgactc caaaagacct gctcgaggag gtggacgagc     3900

aggtcaaaaa tccgggtacc aataaaatat ctttattttc attacatctg tgtgttggtt     3960

ttttgtgtga ctagt                                                      3975


<210>  23
<211>  3912
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  23
aggatttttg acctgctcga ttgtccactg cgagcaggtc ttttggagtc gggcgaggcg       60

gaagcccgac tccttttggc atgcacgcta gccgcgtcgt gcatgccttt tatcttcggg      120

ttatgggacc agtgaaggct gagggaagga ctgtcctggg actggacagg cgggttatgg      180

gacctgaaaa tactaacaat cgattttttt tccctttttt tccaggtgaa cttcaagatc      240

cgccacaaca tcgaggacgg cagcgtgcag ctcgccgacc actaccagca gaacaccccc      300

atcggcgacg gccccgtgct gctgcccgac aaccactacc tgagctacca gtccgccctg      360

agcaaagacc ccaacgagaa gcgcgatcac atggtcctgc tggagttcgt gaccgccgcc      420

gggatcactc tcggcatgga cgagctgtac aaggaccttg gaagcggagc tactaacttc      480

agcctgctga agcaggctgg agacgtggag gagaaccctg gacctatcac aaagaagcac      540

acagcccact tctccaagaa gggcgaagag gaaaacctgg aaggcctggg caatcagacc      600

aagcagatcg tcgagaagta cgcctgcacc accagaatca gccccaacac aagccagcag      660

aacttcgtga cccagcggag caaaagagcc ctgaagcagt ttcggctgcc cctggaagaa      720

accgagctgg aaaagcggat catcgtggac gacaccagca cacagtggtc caagaacatg      780

aagcacttga cccctagcac actgacccag atcgactaca acgagaaaga gaagggcgct      840

atcacacaga gcccactgag cgactgtctg accagaagcc acagcatccc tcaggccaac      900

agatcccctc tgccaatcgc caaagtgtct agcttcccca gcatcagacc catctacctg      960

accagagtgc tgttccagga caacagcagc catctgccag ccgccagcta ccggaagaaa     1020

gatagcggcg tgcaagagtc cagccacttt ctgcaaggcg ctaagaagaa caatctgagc     1080

ctggctattc tgaccctgga aatgaccggc gatcagagag aagtcggctc tctgggcacc     1140

agcgccacaa atagcgtgac ctacaaaaag gtggaaaaca ccgtgctgcc taagcctgac     1200

ctgccaaaga caagcggcaa ggtggaactg ctgccaaagg tgcacatcta ccagaaggac     1260

ctgtttccta ccgagacaag caacggctct cccggccatc tggatctggt ggaaggatct     1320

ctgctgcagg gaaccgaggg cgccatcaag tggaacgagg ccaatagacc tggcaaggtg     1380

cccttcctga gagtggccac agagtctagc gccaagacac cctccaaact gctggatccc     1440

ctggcctggg ataaccacta cggcactcag atccccaaag aggaatggaa gtcccaagag     1500

aagtcccctg aaaagaccgc cttcaagaag aaggacacca ttctgtccct gaatgcctgc     1560

gagagcaacc acgccattgc cgccatcaat gagggccaga acaagcccga gatcgaagtg     1620

acctgggcca agcagggaag aaccgagaga ctgtgctccc agaatcctcc tgtgctgaag     1680

cggcaccaga gagaaatcac ccggaccaca ctgcagagcg accaagaaga gatcgattac     1740

gacgatacca tcagcgtcga gatgaagaaa gaagatttcg acatctacga cgaggacgag     1800

aatcagagcc ctcggagctt ccagaagaaa accaggcact actttattgc cgccgtcgag     1860

cggctgtggg actacggaat gtctagctct cctcacgtgc tgcggaatag agcccagtct     1920

ggtagcgtgc cccagttcaa aaaggtcgtg ttccaagagt tcaccgacgg cagcttcacc     1980

cagccactgt atagaggcga gctgaacgag catctgggcc tgctgggccc ttatatcaga     2040

gccgaagtgg aagataacat catggtcacc ttccggaatc aggcctctcg gccctacagc     2100

ttctacagct ccctgatctc ctacgaagag gaccagagac agggcgcaga gccccggaag     2160

aatttcgtga agcccaacga gactaagacc tacttttgga aggtgcagca ccatatggcc     2220

cctacaaagg acgagttcga ctgcaaagcc tgggcctact tctccgatgt ggacctcgag     2280

aaggatgtgc acagcggact catcggccca ctgcttgtgt gccacaccaa cacactgaac     2340

cccgctcacg gcagacaagt gacagtgcaa gaattcgccc tgtttttcac catcttcgac     2400

gaaacgaagt cctggtactt caccgaaaac atggaaagaa actgcagggc cccttgcaac     2460

attcagatgg aagatcccac cttcaaagag aactaccggt tccacgccat caacggctac     2520

atcatggaca cactgcccgg cctggttatg gctcaggatc agagaatccg gtggtatctg     2580

ctgtccatgg gctccaacga gaatatccac tccatccact tctccggcca cgtgttcacc     2640

gtgcggaaaa aagaagagta caaaatggcc ctgtacaatc tgtaccctgg ggtgttcgaa     2700

accgttgaga tgctgcctag caaggccgga atttggagag tggaatgtct gattggagag     2760

cacctccacg ccgggatgag caccctgttt ctggtgtact ccaacaagtg tcagacccct     2820

ctcggcatgg cctctggcca cattagagac ttccagatca ccgccagcgg acagtatgga     2880

cagtgggccc ctaaactggc cagactgcac tactccggca gcatcaatgc ctggtccacc     2940

aaagagcctt tcagctggat caaagtggac ctgctggctc ccatgatcat ccacggaatc     3000

aagacccagg gcgccagaca aaagttcagc agcctgtaca tcagccagtt catcatcatg     3060

tacagcctgg acggaaagaa gtggcagacc taccggggca atagcaccgg cacactgatg     3120

gtgttcttcg gcaacgtgga ctccagcggc attaagcaca acatcttcaa ccctccaatc     3180

attgcccgat acatccggct gcaccccaca cactacagca tcaggtctac cctgagaatg     3240

gaactgatgg gctgcgacct gaacagctgc agcatgcccc tcggaatgga aagcaaggcc     3300

atcagcgacg cccagatcac agcctctagc tacttcacca acatgttcgc cacttggagc     3360

ccctctaagg cccggcttca tctgcaaggc agaagcaacg cttggaggcc ccaagtgaac     3420

aaccccaaag aatggctgca ggtcgacttt cagaaaacca tgaaagtgac aggcgtgacc     3480

acacagggcg tcaagtccct gctgacctct atgtacgtga aagagtttct gatcagctcc     3540

agccaggacg gccaccagtg gaccctgttc ttccaaaacg gcaaagtgaa agtgttccag     3600

ggaaatcagg acagcttcac acccgtggtc aactccctgg atcctccact gctgacaaga     3660

tacctgcgga ttcaccctca gtcttgggtg caccagattg ccctgcggat ggaagtgctg     3720

ggctgtgaag ctcaggacct ctactgaggt accaattcct cacctgcgat ctcgatgctt     3780

tatttgtgaa atttgtgatg ctattgcttt atttgtaacc attataagct gcaataaaca     3840

agttaacaac aacaattgca ttcattttat gtttcaggtt cagggggagg tgtgggaggt     3900

tttttaaact ag                                                         3912


<210>  24
<211>  3828
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  24
acttctaggc gcgccgccac catgtaccca tacgatgttc cagattacgc ttatccttat       60

gacgtgcctg actacgccta tccctacgac gtccccgact atgcagtgta caagaaaacc      120

ctgttcgtgg aattcaccga ccacctgttc aatatcgcca agcctcggcc tccttggatg      180

ggactgctgg gacctacaat tcaggccgag gtgtacgaca ccgtggtcat caccctgaag      240

aacatggcca gccatcctgt gtctctgcac gccgtgggag tgtcttactg gaaggcttct      300

gagggcgccg agtacgacga tcagacaagc cagagagaga aagaggacga caaggttttc      360

cctggcggca gccacaccta tgtctggcaa gtcctgaaag aaaacggccc tatggcctcc      420

gatcctctgt gcctgacata cagctacctg agccacgtgg acctggtcaa ggacctgaat      480

tctggcctga tcggagccct gctcgtgtgt agagaaggca gcctggccaa agagaaaacc      540

cagacactgc acaagttcat cctgctgttc gccgtgttcg acgagggcaa gagctggcac      600

agcgagacaa agaacagcct gatgcaggac agggatgccg cctctgctcg ggcttggcct      660

aagatgcaca ccgtgaacgg ctacgtgaac agaagcctgc ctggactgat cggctgccac      720

agaaagtccg tgtactggca cgtgatcggc atgggcacaa cacctgaggt gcacagcatc      780

tttctggaag gacacacctt cctcgtgcgg aaccatagac aggccagcct ggaaatcagc      840

cctatcacct tcctgaccgc tcagaccctg ctgatggatc tgggccagtt tctgctgttc      900

tgccacatca gctcccacca gcacgatggc atggaagcct acgtgaaggt ggacagctgc      960

cccgaagaac cccagctgcg gatgaagaac aacgaggaag ccgaggacta cgacgacgac     1020

ctgaccgact ctgagatgga cgtcgtcaga ttcgacgacg ataacagccc cagcttcatc     1080

caaatcagaa gcgtggccaa gaagcacccc aagacctggg tgcactatat cgccgccgag     1140

gaagaggact gggattacgc tcctctggtg ctggcccctg acgacagaag ctacaagagc     1200

cagtacctga acaacggccc tcagcggatc ggccggaagt ataagaaagt gcggttcatg     1260

gcctacaccg acgagacatt caagaccaga gaggccatcc agcacgagag cggaattctg     1320

ggccctctgc tgtatggcga agtgggcgat acactgctga tcatcttcaa gaaccaggcc     1380

agcagaccct acaacatcta ccctcacggc atcaccgatg tgcggcccct gtattctaga     1440

aggctgccca agggcgtgaa gcacctgaag gacttcccta tcctgcctgg cgagatcttc     1500

aagtacaagt ggaccgtgac cgtggaagat ggccccacca agagcgaccc tagatgtctg     1560

acacggtact acagcagctt cgtgaacatg gaacgcgacc tggccagcgg cctgattgga     1620

cctctgctga tctgctacaa agaaagcgtg gaccagcggg gcaaccagat catgagcgac     1680

aagcggaacg tgatcctgtt tagcgtgttc gatgagaacc ggtcctggta tctgaccgag     1740

aacatccagc ggtttctgcc caatcctgct ggcgtgcagc tggaagatcc tgagttccag     1800

gcctccaaca tcatgcactc catcaatggc tatgtgttcg acagcctgca gctgagcgtg     1860

tgcctgcacg aagtggccta ctggtacatc ctgagcattg gcgcccagac cgacttcctg     1920

tccgtgttct tttccggcta caccttcaag cacaagatgg tgtacgagga taccctgaca     1980

ctgttcccat tctccggcga gacagtgttc atgagcatgg aaaaccccgg cctgtggatc     2040

ctgggctgtc acaacagcga cttccggaac agaggcatga cagccctgct gaaggtgtcc     2100

agctgcgaca agaacaccgg cgactactac gaggacagct atgaggacat cagcgcctac     2160

ctgctgagca agaacaatgc catcgagccc agaagcttca gccagaatag cagacacccc     2220

tccaccagac agaagcagtt caacgccaca acaatccccg agaacgacat cgagaaaacc     2280

gatccttggt ttgcccaccg gacccctatg cctaagatcc agaacgtgtc ctccagcgat     2340

ctgctgatgc tcctgagaca gagccctaca cctcacggac tgagcctgtc cgatctgcaa     2400

gaggccaaat acgaaacctt cagcgacgac ccttctcctg gcgccatcga cagcaacaat     2460

agcctgagcg agatgaccca cttcagacca cagctgcacc acagcggcga catggtgttt     2520

acacctgaga gcggcctcca gctgagactg aatgagaagc tgggaaccac cgccgccacc     2580

gagctgaaga aactggactt caaggtgtcc tctaccagca acaacctgat cagcacaatc     2640

ccctccgaca acctggctgc cggcaccgac aacacatctt ctctgggccc acctagcatg     2700

cccgtgcact acgatagcca gctggatacc acactgttcg gcaagaagtc tagccctctg     2760

acagagtctg gcggccctct gtctctgagc gaggaaaaca acgacagcaa gctgctggaa     2820

tccggcctga tgaacagcca agagtcctcc tggggcaaga atgtgtccag caccgagtcc     2880

ggcagactgt tcaagggaaa gagagcccac ggacctgctc tgctgaccaa ggataacgcc     2940

ctgttcaaag tgtccatcag cctgctcaag accaacaaga cctccaacaa ctccgccacc     3000

aacagaaaga cccacatcga cggccctagc ctgctgatcg agaatagccc tagcgtctgg     3060

cagaatatcc tggaaagcga caccgagttc aagaaagtga cccctctgat ccacgaccgg     3120

atgctcatgg acaagaacgc caccgctctg cggctgaacc acatgagcaa caagacaacc     3180

agcagcaaga atatggaaat ggtgcagcag aagaaagagg gccccattcc tccagacgct     3240

cagaaccccg atatgagctt cttcaagatg ctctttctgc ccgagagcgc ccggtggatc     3300

cagagaacac acggcaagaa ctccctgaac tccggccagg gaccttctcc aaagcagctg     3360

gtttccctgg gacctgagaa gtccgtggaa ggccagaact tcctgagcga aaagaacaaa     3420

gtggtcgtcg gcaagggcga gttcaccaag gatgtgggcc tgaaagagat ggtctttccc     3480

agcagccgga acctgttcct gaccaacctg gacaacctgc acgagaacaa cacccacaat     3540

caagagaaga agatccaaga ggtaagtatt agctctttct ttccatgggt tggcctcgcc     3600

gcgtgggctg agggaaggac tgtcctggga ctggacaggc gggttatggg acctgaagcg     3660

ataaaaggca tgcacgtttg cggctacgtg catgccaaaa ggagtcgggc ttgcctccgt     3720

gcccgactcc aaaagacctg ctcgaggagg tggacgagca ggtcaaaaat ccgggtacca     3780

ataaaatatc tttattttca ttacatctgt gtgttggttt tttgtgtg                  3828


<210>  25
<211>  3802
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  25
aggatttttg acctgctcga ttgtccactg cgagcaggtc ttttggagtc gggcgaggcg       60

gaagcccgac tccttttggc atgcacgcta gccgcgtcgt gcatgccttt tatcttcggg      120

ttatgggacc agtgaaggct gagggaagga ctgtcctggg actggacagg cgggttatgg      180

gacctgaaaa tactaacaat cgattttttt tccctttttt tccaggaaat cgaaaagaaa      240

gagacactca tccaagagaa cgtggtgctg cctcagatcc acacagtgac cggcaccaag      300

aactttatga agaatctgtt cctgctgagt acccggcaga atgtggaagg cagctacgac      360

ggcgcttatg cccctgtgct gcaagacttc agatccctga acgactccac caatcggaca      420

aagaagcaca cagcccactt ctccaagaag ggcgaagagg aaaacctgga aggcctgggc      480

aatcagacca agcagatcgt cgagaagtac gcctgcacca ccagaatcag ccccaacaca      540

agccagcaga acttcgtgac ccagcggagc aaaagagccc tgaagcagtt tcggctgccc      600

ctggaagaaa ccgagctgga aaagcggatc atcgtggacg acaccagcac acagtggtcc      660

aagaacatga agcacttgac ccctagcaca ctgacccaga tcgactacaa cgagaaagag      720

aagggcgcta tcacacagag cccactgagc gactgtctga ccagaagcca cagcatccct      780

caggccaaca gatcccctct gccaatcgcc aaagtgtcta gcttccccag catcagaccc      840

atctacctga ccagagtgct gttccaggac aacagcagcc atctgccagc cgccagctac      900

cggaagaaag atagcggcgt gcaagagtcc agccactttc tgcaaggcgc taagaagaac      960

aatctgagcc tggctattct gaccctggaa atgaccggcg atcagagaga agtcggctct     1020

ctgggcacca gcgccacaaa tagcgtgacc tacaaaaagg tggaaaacac cgtgctgcct     1080

aagcctgacc tgccaaagac aagcggcaag gtggaactgc tgccaaaggt gcacatctac     1140

cagaaggacc tgtttcctac cgagacaagc aacggctctc ccggccatct ggatctggtg     1200

gaaggatctc tgctgcaggg aaccgagggc gccatcaagt ggaacgaggc caatagacct     1260

ggcaaggtgc ccttcctgag agtggccaca gagtctagcg ccaagacacc ctccaaactg     1320

ctggatcccc tggcctggga taaccactac ggcactcaga tccccaaaga ggaatggaag     1380

tcccaagaga agtcccctga aaagaccgcc ttcaagaaga aggacaccat tctgtccctg     1440

aatgcctgcg agagcaacca cgccattgcc gccatcaatg agggccagaa caagcccgag     1500

atcgaagtga cctgggccaa gcagggaaga accgagagac tgtgctccca gaatcctcct     1560

gtgctgaagc ggcaccagag agaaatcacc cggaccacac tgcagagcga ccaagaagag     1620

atcgattacg acgataccat cagcgtcgag atgaagaaag aagatttcga catctacgac     1680

gaggacgaga atcagagccc tcggagcttc cagaagaaaa ccaggcacta ctttattgcc     1740

gccgtcgagc ggctgtggga ctacggaatg tctagctctc ctcacgtgct gcggaataga     1800

gcccagtctg gtagcgtgcc ccagttcaaa aaggtcgtgt tccaagagtt caccgacggc     1860

agcttcaccc agccactgta tagaggcgag ctgaacgagc atctgggcct gctgggccct     1920

tatatcagag ccgaagtgga agataacatc atggtcacct tccggaatca ggcctctcgg     1980

ccctacagct tctacagctc cctgatctcc tacgaagagg accagagaca gggcgcagag     2040

ccccggaaga atttcgtgaa gcccaacgag actaagacct acttttggaa ggtgcagcac     2100

catatggccc ctacaaagga cgagttcgac tgcaaagcct gggcctactt ctccgatgtg     2160

gacctcgaga aggatgtgca cagcggactc atcggcccac tgcttgtgtg ccacaccaac     2220

acactgaacc ccgctcacgg cagacaagtg acagtgcaag aattcgccct gtttttcacc     2280

atcttcgacg aaacgaagtc ctggtacttc accgaaaaca tggaaagaaa ctgcagggcc     2340

ccttgcaaca ttcagatgga agatcccacc ttcaaagaga actaccggtt ccacgccatc     2400

aacggctaca tcatggacac actgcccggc ctggttatgg ctcaggatca gagaatccgg     2460

tggtatctgc tgtccatggg ctccaacgag aatatccact ccatccactt ctccggccac     2520

gtgttcaccg tgcggaaaaa agaagagtac aaaatggccc tgtacaatct gtaccctggg     2580

gtgttcgaaa ccgttgagat gctgcctagc aaggccggaa tttggagagt ggaatgtctg     2640

attggagagc acctccacgc cgggatgagc accctgtttc tggtgtactc caacaagtgt     2700

cagacccctc tcggcatggc ctctggccac attagagact tccagatcac cgccagcgga     2760

cagtatggac agtgggcccc taaactggcc agactgcact actccggcag catcaatgcc     2820

tggtccacca aagagccttt cagctggatc aaagtggacc tgctggctcc catgatcatc     2880

cacggaatca agacccaggg cgccagacaa aagttcagca gcctgtacat cagccagttc     2940

atcatcatgt acagcctgga cggaaagaag tggcagacct accggggcaa tagcaccggc     3000

acactgatgg tgttcttcgg caacgtggac tccagcggca ttaagcacaa catcttcaac     3060

cctccaatca ttgcccgata catccggctg caccccacac actacagcat caggtctacc     3120

ctgagaatgg aactgatggg ctgcgacctg aacagctgca gcatgcccct cggaatggaa     3180

agcaaggcca tcagcgacgc ccagatcaca gcctctagct acttcaccaa catgttcgcc     3240

acttggagcc cctctaaggc ccggcttcat ctgcaaggca gaagcaacgc ttggaggccc     3300

caagtgaaca accccaaaga atggctgcag gtcgactttc agaaaaccat gaaagtgaca     3360

ggcgtgacca cacagggcgt caagtccctg ctgacctcta tgtacgtgaa agagtttctg     3420

atcagctcca gccaggacgg ccaccagtgg accctgttct tccaaaacgg caaagtgaaa     3480

gtgttccagg gaaatcagga cagcttcaca cccgtggtca actccctgga tcctccactg     3540

ctgacaagat acctgcggat tcaccctcag tcttgggtgc accagattgc cctgcggatg     3600

gaagtgctgg gctgtgaagc tcaggacctc tactgaggta ccaattcctc acctgcgatc     3660

tcgatgcttt atttgtgaaa tttgtgatgc tattgcttta tttgtaacca ttataagctg     3720

caataaacaa gttaacaaca acaattgcat tcattttatg tttcaggttc agggggaggt     3780

gtgggaggtt ttttaaacta gt                                              3802


<210>  26
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  26
tggggggagg                                                              10


<210>  27
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  27
gtagtgaggg                                                              10


<210>  28
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  28
gttggtggtt                                                              10


<210>  29
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  29
agttgtggtt                                                              10


<210>  30
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  30
gtattgggtc                                                              10


<210>  31
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  31
agtgtgaggg                                                              10


<210>  32
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  32
gggtaatggg                                                              10


<210>  33
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  33
tcattggggt                                                              10


<210>  34
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  34
ggtgggggtc                                                              10


<210>  35
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  35
ggttttgttg                                                              10


<210>  36
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  36
tatactcccg                                                              10


<210>  37
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  37
gtattcgatc                                                              10


<210>  38
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  38
gtagttccct                                                              10


<210>  39
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  39
gttaatagta                                                              10


<210>  40
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  40
tgctggttag                                                              10


<210>  41
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  41
ataggtaacg                                                              10


<210>  42
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  42
tctgaattgc                                                              10


<210>  43
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  43
tctgggtttg                                                              10


<210>  44
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  44
cattctcttt                                                              10


<210>  45
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  45
gtattggtgt                                                              10


<210>  46
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  46
tttagatttg                                                              10


<210>  47
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  47
ataagtactg                                                              10


<210>  48
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  48
tagtctatta                                                              10


<210>  49
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  49
aggtattgca                                                              10


<210>  50
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  50
gtagattacg                                                              10


<210>  51
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  51
gggcgggtgc                                                              10


<210>  52
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  52
cgtttacaat                                                              10


<210>  53
<211>  11
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  53
gtacagggat g                                                            11


<210>  54
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  54
aatcagggga                                                              10


<210>  55
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  55
ggaggttttg                                                              10


<210>  56
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  56
gtattccctg                                                              10


<210>  57
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  57
tggtaagatc                                                              10


<210>  58
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  58
gtagttaagt                                                              10


<210>  59
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  59
gttggtttgg                                                              10


<210>  60
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  60
gtatttactt                                                              10


<210>  61
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  61
gtaacggggt                                                              10


<210>  62
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  62
tttttttctg                                                              10


<210>  63
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  63
ggggaaggga                                                              10


<210>  64
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  64
ttaccccggt                                                              10


<210>  65
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  65
gtattctatg                                                              10


<210>  66
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  66
aggtattgtg                                                              10


<210>  67
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  67
tttggggggg                                                              10


<210>  68
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  68
gttgttagcg                                                              10


<210>  69
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  69
ggtagttggg                                                              10


<210>  70
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  70
ctaagtactg                                                              10


<210>  71
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  71
aaccatcttc                                                              10


<210>  72
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  72
gtacctgggt                                                              10


<210>  73
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  73
gtatctcatt                                                              10


<210>  74
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  74
aaataaaatt                                                              10


<210>  75
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  75
ggtgggttat                                                              10


<210>  76
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  76
taagggaggg                                                              10


<210>  77
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  77
tatgggaggg                                                              10


<210>  78
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  78
gatgggaggg                                                              10


<210>  79
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  79
tggggggggt                                                              10


<210>  80
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  80
ggggaagggg                                                              10


<210>  81
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  81
tggtaagagg                                                              10


<210>  82
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  82
gggttagggt                                                              10


<210>  83
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  83
gtatcggggg                                                              10


<210>  84
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  84
ggttttgctg                                                              10


<210>  85
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  85
tgggggtgga                                                              10


<210>  86
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  86
acttttagag                                                              10


<210>  87
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  87
gtaacgggtt                                                              10


<210>  88
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  88
gtttggggga                                                              10


<210>  89
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  89
atttttagag                                                              10


<210>  90
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  90
ttaaagtagg                                                              10


<210>  91
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  91
gtattaatat                                                              10


<210>  92
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  92
ggtttgggtg                                                              10


<210>  93
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  93
tatgggaaag                                                              10


<210>  94
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  94
ggttgggagg                                                              10


<210>  95
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  95
gtatttagtg                                                              10


<210>  96
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  96
gagttaaatg                                                              10


<210>  97
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  97
ttgtaagttg                                                              10


<210>  98
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  98
tgggggtagg                                                              10


<210>  99
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  99
gttcttaggg                                                              10


<210>  100
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  100
gtattctaag                                                              10


<210>  101
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  101
ggaggttttg                                                              10


<210>  102
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  102
agaatatgta                                                              10


<210>  103
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  103
atctttcggg                                                              10


<210>  104
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  104
ttgcattgaa                                                              10


<210>  105
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  105
ggtgggattt                                                              10


<210>  106
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  106
tttatctaat                                                              10


<210>  107
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  107
gcgggtggtg                                                              10


<210>  108
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  108
ggtttagata                                                              10


<210>  109
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  109
tttatgcgtt                                                              10


<210>  110
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  110
tgggtaaggc                                                              10


<210>  111
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  111
gggggtggtc                                                              10


<210>  112
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  112
gtagtatatt                                                              10


<210>  113
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  113
ggaggtattt                                                              10


<210>  114
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  114
gtattgtaag                                                              10


<210>  115
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  115
tttacgggag                                                              10


<210>  116
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  116
tagttctggg                                                              10


<210>  117
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  117
ccacgtctat                                                              10


<210>  118
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  118
agtgggtagg                                                              10


<210>  119
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  119
caatttttac                                                              10


<210>  120
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  120
ggtctggggg                                                              10


<210>  121
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  121
atcaagattg                                                              10


<210>  122
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  122
gttagctaaa                                                              10


<210>  123
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  123
agtgtggggt                                                              10


<210>  124
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  124
ggtatgtggg                                                              10


<210>  125
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  125
gtagtgtggg                                                              10


<210>  126
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  126
aggaggtgtt                                                              10


<210>  127
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  127
gttggtaggt                                                              10


<210>  128
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  128
gtaggtggtt                                                              10


<210>  129
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  129
aggtgttggt                                                              10


<210>  130
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  130
tatggttgtg                                                              10


<210>  131
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  131
ttaggttagt                                                              10


<210>  132
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  132
gattggagtt                                                              10


<210>  133
<211>  10
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  133
gtagagtgga                                                              10


<210>  134
<211>  24
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  134
cucuuucuuu uccauggguu ggcu                                              24


<210>  135
<211>  24
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  135
ggcugaggga aggacugucc uggg                                              24


<210>  136
<211>  13
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  136
ggguuauggg acc                                                          13


<210>  137
<211>  12
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  137
auauccuuuu ua                                                           12


<210>  138
<211>  12
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  138
guauccuuuu ua                                                           12


<210>  139
<211>  33
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  139
aggcuucgga gcaaggaggc agcuccgaag ccu                                    33


<210>  140
<211>  33
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  140
aggcuucgga gcaagccucc agcuccgaag ccu                                    33


<210>  141
<211>  29
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  141
gucgaggccg agcgggcaaa ggccucgac                                         29


<210>  142
<211>  29
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  142
gucgaggccg agcccgcaaa ggccucgac                                         29


<210>  143
<211>  10
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence


<220>
<221>  misc_feature
<222>  (1)..(3)
<223>  n is a, c, g, or u

<220>
<221>  misc_feature
<222>  (8)..(10)
<223>  n is a, c, g, or u

<400>  143
nnnaggunnn                                                              10


<210>  144
<211>  12
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  144
uuuuccuuaa cu                                                           12


<210>  145
<211>  1305
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  145
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      120

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      180

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      240

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac      300

catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg      360

atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg      420

ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt      480

acggtgggag gtctatataa gcagagcttg gatgttgcct ttacttctag gcgcgccgcc      540

accatggtga gcaagggcga ggagctgttc accggggtgg tgcccatcct ggtcgagctg      600

gacggcgacg taaacggcca caagttcagc gtgtccggcg agggcgaggg cgatgccacc      660

tacggcaagc tgaccctgaa gttcatctgc accaccggca agctgcccgt gccctggccc      720

accctcgtga ccaccttcgg ctacggcctg atgtgcttcg cccgctaccc cgaccacatg      780

aagcagcacg acttcttcaa gtccgccatg cccgaaggct acgtccagga gcgcaccatc      840

ttcttcaagg taagtattag ctctttcttt ccatgggttg gcctcgccgc gtgggctgag      900

ggaaggactg tcctgggact ggacaggcgg gttatgggac ctgaaaagcg gccctgaaaa      960

agggccgcga tctgtagaaa gcgagctagt gccggacagt tagaggaaaa ggggaagaac     1020

tgtccgaaaa aaggggggga agacagtgac tagaaaggga agggagaagt cactgtagag     1080

gggaaggaaa aggctagcta gaggagaagg aaagaggcta gctagcagag gagaaggaaa     1140

ggcgccagca gttcggtgct atcaaaaagc ggtcaggcag ctaaaccaaa aggtttagca     1200

attgcctctg atgagtcgct gaaatgcgac gaaaaccgct ttttggtacc aataaaatat     1260

ctttattttc attacatctg tgtgttggtt ttttgtgtga ctagt                     1305


<210>  146
<211>  1543
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  146
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      120

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      180

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      240

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac      300

catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg      360

atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg      420

ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt      480

acggtgggag gtctatataa gcagagcttg gatgttgcct ttacttctag gcgcgccgcg      540

gaaaaccgcg ggatagcacc gaactgctgg cgcctttcct tctcctctgc tagctagcct      600

ctttccttct cctctagcta gccttttcct tcccctctac agtgacttct cccttccctt      660

tctagtcact gtcttccccc ccttttttcg gacagttctt ccccttttcc tctaactgtc      720

cggcactagc tcgctttcta cagatcatta ttgcggccct gaaaaagggc cgcttataac      780

gttgctcgaa ttcgggttat gggaccagtg aaggctgagg gaaggactgt cctgggactg      840

gacaggcggg ttatgggacc tgaaaatact aacaatcgat tttttttccc tttttttcca      900

ggacgacggc aactacaaga cccgcgccga ggtgaagttc gagggcgaca ccctggtgaa      960

ccgcatcgag ctgaagggca tcgacttcaa ggaggacggc aacatcctgg ggcacaagct     1020

ggagtacaac tacaacagcc acaacgtcta tatcatggcc gacaagcaga agaacggcat     1080

caaggtaagt attagctctt tctttccatg ggttggcctc gccgcgtggg ctgagggaag     1140

gactgtcctg ggactggaca ggcgggttat gggacctgaa aagcggccct gaaaaagggc     1200

cgcagcgaaa acgaagcgag ctaaagcctc ctctctcttc ttcagaactc ctctcttttc     1260

tctcctccag gagttcttcc tctctccctt cttctcaaat gctttctccc tctctcctgc     1320

atttgagctc cttctttcct ctctcgacaa tccccttttc tccctcttga ttgtcgacta     1380

gctcgcaatc atcgcggtgc taaaaagcgg tcaggcagct aaaccaaaag gtttagcaat     1440

tgcctctgat gagtcgctga aatgcgacga aaaccgcttt ttggtaccaa taaaatatct     1500

ttattttcat tacatctgtg tgttggtttt ttgtgtgact agt                       1543


<210>  147
<211>  1571
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  147
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      120

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      180

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      240

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac      300

catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg      360

atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg      420

ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt      480

acggtgggag gtctatataa gcagagcttg gatgttgcct ttacttctag gcgcgccacc      540

atgagccagt tcgacatcct gtgcaagacc ccccccaagg tgctggtgcg gcagttcgtg      600

gagagattcg agaggcccag cggcgagaag atcgccagct gtgccgccga gctgacctac      660

ctgtgctgga tgatcaccca caacggcacc gccatcaaga gggccacctt catgagctac      720

aacaccatca tcagcaacag cctgagcttc gacatcgtga acaagagcct gcagttcaag      780

tacaagaccc agaaggccac catcctggag gccagcctga agaagctgat ccccgcctgg      840

gagttcacca tcatccctta caacggccag aagcaccaga gcgacatcac cgacatcgtg      900

tccagcctgc agctgcagtt cgagagcagc gaggaggccg acaagggcaa cagccacagc      960

aagaagatgc tgaaggccct gctgtccgag ggcgagagca tctgggagat caccgagaag     1020

atcctgaaca gcttcgagta caccagcagg ttcaccaaga ccaagaccct gtaccagttc     1080

ctgttcctgg ccacattcat caactgcggc aggtaagtat tagctctttc tttccatggg     1140

ttggcctcgc cgcgtgggct gagggaagga ctgtcctggg actggacagg cgggttatgg     1200

gacctgaaaa gcggccctga aaaagggccg cgatgaaaac gaagcgagct aaagcctcct     1260

ctctcttctt cagaactcct ctcttttctc tcctccagga gttcttcctc tctcccttct     1320

tctcaaatgc tttctccctc tctcctgcat ttgagctcct tctttcctct ctcgacaatc     1380

cccttttctc cctcttgatt gtcgactagc tcgcaatcat cgcggtatca aaaagcggtc     1440

aggcagctaa accaaaaggt ttagcaattg cctctgatga gtcgctgaaa tgcgacgaaa     1500

accgcttttt ggtaccaata aaatatcttt attttcatta catctgtgtg ttggtttttt     1560

gtgtgactag t                                                          1571


<210>  148
<211>  1765
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  148
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      120

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      180

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      240

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac      300

catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg      360

atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg      420

ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt      480

acggtgggag gtctatataa gcagagcttg gatgttgcct ttacttctag gcgcgccgcg      540

gaaaaccgcg ggataccgcg atgattgcga gctagtcgac aatcaagagg gagaaaaggg      600

gattgtcgag agaggaaaga aggagctcaa atgcaggaga gagggagaaa gcatttgaga      660

agaagggaga gaggaagaac tcctggagga gagaaaagag aggagttctg aagaagagag      720

aggaggcttt agctcgcttc gttttcatca ttattgcggc cctgaaaaag ggccgcttat      780

aacgttgctc gaattcgggt tatgggacca gtgaaggctg agggaaggac tgtcctggga      840

ctggacaggc gggttatggg acctgaaaat actaacaatc gatttttttt cccttttttt      900

ccaggttcag cgacatcaag aacgtggacc ccaagagctt caagctggtg cagaacaagt      960

acctgggcgt gatcattcag tgcctggtga ccgagaccaa gacaagcgtg tccaggcaca     1020

tctacttttt cagcgccaga ggcaggatcg accccctggt gtacctggac gagttcctga     1080

ggaacagcga gcccgtgctg aagagagtga acaggaccgg caacagcagc agcaacaagc     1140

aggagtacca gctgctgaag gacaacctgg tgcgcagcta caacaaggcc ctgaagaaga     1200

acgcccccta ccccatcttc gctatcaaga acggccctaa gagccacatc ggcaggcacc     1260

tgatgaccag ctttctgagc atgaagggcc tgaccgagct gacaaacgtg gtgggcaact     1320

ggagcgacaa gagggcctcc gccgtggcca ggaccaccta cacccaccag atcaccgcca     1380

tccccgacca ctacttcgcc ctggtgtcca ggtactacgc ctacgacccc atcagcaagg     1440

agatgatcgc cctgaaggac gagaccaacc ccatcgagga gtggcagcac atcgagcagc     1500

tgaagggcag cgccgagggc agcatcagat accccgcctg gaacggcatc atcagccagg     1560

aggtgctgga ctacctgagc agctacatca acaggcggat ctgagaattc ctcacctgcg     1620

atctcgatgc tttatttgtg aaatttgtga tgctattgct ttatttgtaa ccattataag     1680

ctgcaataaa caagttaaca acaacaattg cattcatttt atgtttcagg ttcaggggga     1740

ggtgtgggag gttttttaaa ctagt                                           1765


<210>  149
<211>  10
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  149
aaagaaggaa                                                              10


<210>  150
<211>  12
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  150
cuuucuuuuc uu                                                           12


<210>  151
<211>  11
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence


<220>
<221>  misc_feature
<222>  (1)..(3)
<223>  n is a, c, g, or u

<220>
<221>  misc_feature
<222>  (8)..(11)
<223>  n is a, c, g, or u

<400>  151
nnnaggunnn n                                                            11


<210>  152
<211>  11
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence


<220>
<221>  misc_feature
<222>  (1)..(3)
<223>  n is a, c, g, or u

<220>
<221>  misc_feature
<222>  (8)..(11)
<223>  n is a, c, g, or u

<400>  152
nnnuggunnn n                                                            11


<210>  153
<211>  11
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence


<220>
<221>  misc_feature
<222>  (3)..(8)
<223>  n is a, c, g, or u

<400>  153
gannnnnnaa a                                                            11


<210>  154
<211>  12
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  154
gccgccacca tg                                                           12


<210>  155
<211>  4311
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  155
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      120

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      180

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      240

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac      300

catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg      360

atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg      420

ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt      480

acggtgggag gtctatataa gcagagcttg gatgttgcct ttacttctag gcgcgccgcc      540

gccaccatgg ctctgatcgt gcacctgaaa accgtgtccg agctgagagg caagggcgac      600

agaatcgcca aagtgacctt cagaggccag agcttctaca gcagagtgct ggaaaactgc      660

gaaggcgtgg ccgacttcga cgagacattc agatggcctg tggccagcag catcgacaga      720

aacgaggtgc tcgagatcca gatcttcaac tacagcaagg tgttcagcaa caagctgatc      780

gggaccttct gcatggtgct gcagaaagtg gtggaagaga accgcgtgga agtgaccgac      840

acactgatgg acgacagcaa cgccatcatc aagaccagcc tgagcatgga agtgcgctac      900

caggccacag atggcacagt cggaccttgg gacgatggcg atttcctggg agatgagagc      960

ctgcaagagg aaaaggacag ccaagagaca gacggcctgc tgcctggctc tcggcctagc     1020

acaagaatca gcggcgagaa gtccttcaga agcaagggca gagaaaagac caaaggcggc     1080

agagatggcg agcacaaggc tggcagatct gtgttcagcg ccatgaagct gggcaagacc     1140

agaagccaca aagaggaacc ccagagacag gacgagccag ccgttctgga aatggaagat     1200

ctcgaccatc tggccatcca gctcggcgac ggacttgacc ctgattctgt gtctctggcc     1260

agcgtgacag ccctgacaag caacgtgtcc aacaagagaa gcaagcccga catcaagatg     1320

gaacccagcg ccggcagacc catggattac caggtgtcca tcaccgtgat cgaggccaga     1380

cagctcgtgg gcctgaacat ggatcctgtc gtgtgtgtgg aagtgggcga cgacaaaaag     1440

tacaccagca tgaaggaaag caccaactgt ccctactaca acgagtactt cgtgttcgac     1500

ttccacgtgt ccccagacgt gatgttcgac aagatcatta agatcagcgt gatccacagc     1560

aagaacctgc tgagaagcgg cacactcgtg ggcagcttta agatggacgt gggcaccgtg     1620

tacagccagc cagagcacca gtttcaccac aagtgggcca tcctgagcga ccccgatgat     1680

atctctgctg gcctgaaggg ctacgtgaag tgtgatgtgg ctgtcgtcgg caaaggcgac     1740

aacatcaaga caccccacaa ggccaacgag actgacgagg acgatatcga gggcaacctg     1800

ctgctgccag aaggcgtgcc accagaaaga cagtgggcca gattctatgt gaagatctac     1860

agagccgagg gcctgcctag aatgaacaca agcctgatgg ccaacgtgaa gaaggctttc     1920

atcggcgaga acaaggacct ggtggacccc tacgtccagg tgttcttcgc tggacagaaa     1980

ggcaagacct ccgtgcagaa gtccagctac gagcccctgt ggaacgaaca ggtggtgttc     2040

accgatctgt tccctccact gtgcaagaga atgaaggtgc agatccggga cagcgacaaa     2100

gtgaacgatg tggccatcgg cacccacttc atcgacctga gaaagatcag caacgacggc     2160

gacaagggct tcctgcctac acttggacct gcctgggtca acatgtacgg cagcaccaga     2220

aactacaccc tgctggacga gcaccaggac ctgaacgaag gactcggaga gggcgtgtcc     2280

ttccgggcta gactgatgct gggactcgcc gtggaaatcc tggacacaag caaccctgag     2340

ctgaccagca gcacagaggt gcaggttgaa caggccacac ctgtgtctga gagctgcacc     2400

ggcagaatgg aagagttctt cctgttcggc gccttcctgg aagcctccat gatcgataga     2460

aagaacggcg ataagcccat caccttcgaa gtgaccatcg gcaactacgg caacgaggtg     2520

gacggcatgt ctagacccct ccggcctaga ccaagaaaag agcccggcga cgaggaagag     2580

gtggacctga tccagaacag cagcgacgat gagggcgacg aagctggcga tctggcaagc     2640

gttagcagca cccctcctat gaggccccag atcaccgacc ggaactactt tcatctgccc     2700

tacctggaaa gaaagccctg catctacatc aagagctggt ggcctgacca gagaaggcgg     2760

ctgtacaacg ctaacatcat ggaccatatc gccgacaagc tggaagaggg actgaacgac     2820

gtccaagaga tgatcaagac cgagaagtct taccccgaga gaaggctgag gggcgtgctc     2880

gaggaactga gctgtggatg ccacagattt ctgagcctgt ccgacaagga ccagggcaga     2940

agcagcagaa ccagactgga tagagagcgg ctgaagtcct gcatgcgcga gctggaatct     3000

atgggccagc aggccaagag cctgagagcc caagtgaaga gacacaccgt gcgggacaag     3060

ctgagatcct gccagaactt cctgcagaag ctgcggttcc tggccgatga gcctcagcac     3120

tctatccccg acgtgttcat ctggatgatg agcaacaaca agaggatcgc ctacgccaga     3180

gtgcccagca aggatctgct gtttagcatc gtggaagagg aactcggcaa ggactgcgcc     3240

aaagtcaaga ccctgttcct gaagctgcca ggcaagagag gcttcggctc tgctggatgg     3300

acagtgcagg ctaagctgga actgtacctg tggctgggcc tgagcaagca gagaaaggac     3360

ttcctgtgcg gcctgccttg cggcttcgaa gaagtgaagg ctgctcaagg cctgggcctg     3420

cacagcttcc ctccaatctc tctggtgtac acaaagaagc aggccttcca gctgagggcc     3480

cacatgtacc aggctagatc tctgttcgcc gccgactcta gcggcctgtc tgatcctttc     3540

gctcgggtgt tcttcatcaa ccagagccag tgcaccgagg tgctgaacga gacactgtgt     3600

cctacctggg accagatgct ggtctttgac aacctcgagc tgtacggcga ggctcacgaa     3660

ctgagagatg accctcctat catcgtcatc gagatctacg accaggacag catgggcaaa     3720

gccgacttca tgggcagaac cttcgccaag cctctggtca agatggccga cgaggcttac     3780

tgccctcctc ggttcccacc tcagctcgag tactaccaga tctaccgggg ctctgctaca     3840

gccggcgatc tgctggctgc ttttgagctg ctgcaaatcg gccctagcgg caaggctgat     3900

ctgcctccaa tcaacggccc tgtggacatg gacagaggcc ccattatgcc tgtgcctgtg     3960

ggcatcagac ccgtgctgag caagtacaga gtggaagtgc tgttttgggg cctgcgcgac     4020

ctgaagagag tgaacctggc tcaggtaagt attagctctt tctttccatg ggttggcctc     4080

gccgcgtggg ctgagggaag gactgtcctg ggactggaca ggcgggttat gggacctgaa     4140

gcgataaaag gcatgcacgt ttgcggctac gtgcatgcca aaaggagtcg ggcttgcctc     4200

cgtgcccgac tccaaaagac ctgctcgagg aggtggacga gcaggtcaaa aatccgggta     4260

ccaataaaat atctttattt tcattacatc tgtgtgttgg ttttttgtgt g              4311


<210>  156
<211>  3467
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  156
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      120

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      180

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      240

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac      300

catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg      360

atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg      420

ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt      480

acggtgggag gtctatataa gcagagcttg gatgttgcct ttaggatttt tgacctgctc      540

gattgtccac tgcgagcagg tcttttggag tcgggcgagg cggaagcccg actccttttg      600

gcatgcacgc tagccgcgtc gtgcatgcct tttatcttcg ggttatggga ccagtgaagg      660

ctgagggaag gactgtcctg ggactggaca ggcgggttat gggacctgaa aatactaaca      720

atcgattttt tttccctttt tttccaggtg gacagaccca gagtggatat cgagtgtgct      780

ggcaaagggg tgcagagcag cctgatccat aactacaaga agaaccccaa cttcaacacc      840

ctggtcaagt ggttcgaagt ggatctgccc gagaacgaac tgctgcaccc acctctgaac      900

atcagagtgg tggactgcag agccttcggc agatacaccc tcgtgggatc tcacgccgtg      960

tctagcctga gaagattcat ctacagacct ccagacagaa gcgcccctaa ctggaacaca     1020

acaggcgagg tggtggtgtc catggaaccc gaggaacccg tgaagaaact ggaaaccatg     1080

gtcaagctgg acgccacctc cgatgctgtc gtgaaagtgg acgtggccga ggacgagaaa     1140

gagcgcaaga agaagaaaaa gaagggcccc agcgaggaac ctgaagagga agaacctgac     1200

gagagcatgc tggactggtg gtccaagtac ttcgcctcca tcgacacaat gaaggaacag     1260

ctgagacagc acgagacaag cggcaccgac ctcgaagaga aagaagagat ggaatccgcc     1320

gaaggactga agggccctat gaagtccaaa gagaagtcta gggccgccaa agaagagaaa     1380

aaaaagaaga accagtctcc tggaccaggc cagggatctg aggctcccga aaagaaaaag     1440

gccaagatcg acgagctgaa ggtgtacccc aaagagctgg aaagcgagtt cgacagcttc     1500

gaggactggc tgcacacctt caatctgctg agaggaaaga caggcgacga cgaggatggc     1560

agcactgaag aagagagaat cgtcggcaga ttcaagggca gcctgtgcgt gtacaaggtg     1620

ccactgcctg aggacgtgtc cagagaggct ggctacgatc ctacctacgg catgttccaa     1680

ggcatcccta gcaacgaccc catcaatgtg ctcgtgcgga tctatgtcgt gcgggccact     1740

gatctgcatc ccgccgatat caacggcaag gcagacccct atatcgctat caagctgggg     1800

aaaaccgaca tcagggacaa agagaactac atcagcaagc agctgaaccc cgtgttcggc     1860

aagagcttcg acatcgaggc tagcttcccc atggaatcca tgctgaccgt ggccgtgtac     1920

gactgggatc tcgtgggaac agacgacctg atcggagaga caaagattga cctggaaaac     1980

cggttctact ccaagcaccg ggccacctgt ggaatcgccc agacctactc tatccacggc     2040

tacaacatct ggcgggaccc catgaagcct agccagatcc tgaccaggct gtgcaaagaa     2100

ggcaaggtcg acggccctca ctttggacct cacggccggg tcagagtggc caacagagtg     2160

ttcacaggcc cctccgagat cgaggatgag aacggccaga gaaagcccac cgatgagcat     2220

gtggctctga gcgctctgag acactgggaa gatatcccta gagtgggctg cagactggtg     2280

cccgagcacg tggaaacaag acccctgctg aacccagaca agcccggaat cgaacagggc     2340

agactcgaac tgtgggtcga catgttccct atggacatgc ccgcacctgg cacaccactg     2400

gacatcagcc ctaggaagcc caagaaatac gagctgcgcg tgatcgtgtg gaacaccgac     2460

gaagtggtgc tggaagatga cgacttcttc accggcgaaa agtccagcga catcttcgtc     2520

agaggatggc tgaagggaca gcaagaggat aagcaggaca ccgacgtgca ctaccacagc     2580

cttacaggcg aaggcaactt taactggcgc tacctgtttc ctttcgacta cctggccgcc     2640

gaagagaaga tcgtgatgtc caagaaagaa tctatgttca gctgggacga gacagagtac     2700

aagatccccg ccagactgac cctgcagatc tgggatgccg atcacttcag cgccgacgac     2760

tttctgggag ccatcgagct ggacctgaat agattcccca gaggcgccaa gaccgccaag     2820

cagtgcacaa tggaaatggc cactggcgag gtcgacgtgc cactggtgtc tatcttcaag     2880

cagaagcgcg tcaaaggctg gtggcccctg ctggctagaa acgagaacga cgagttcgag     2940

ctgaccggaa aggtggaagc cgagctgcat ctgctgacag ctgaagaggc cgagaagaat     3000

cctgtgggcc tcgctaggaa tgagcccgat cctctggaaa agcccaacag acccgatacc     3060

gccttcgtgt ggtttctgaa cccactgaag tccatcaagt acctgatctg tacccggtac     3120

aagtggctga ttatcaagat cgtgctggcc ctgctggggc tgctgatgct tgctctgttc     3180

ctgtactccc tgcctggcta tatggtcaag aagctgctgg gcgccggcgc tcgggctgac     3240

tacaaagacc atgacggtga ttataaagat catgacatcg actataagga tgacgatgac     3300

aaatgaggta ccaattcctc acctgcgatc tcgatgcttt atttgtgaaa tttgtgatgc     3360

tattgcttta tttgtaacca ttataagctg caataaacaa gttaacaaca acaattgcat     3420

tcattttatg tttcaggttc agggggaggt gtgggaggtt ttttaaa                   3467


<210>  157
<211>  4392
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  157
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      120

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      180

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      240

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac      300

catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg      360

atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg      420

ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt      480

acggtgggag gtctatataa gcagagcttg gatgttgcct ttacttctag gcgcgccgcc      540

accatggtca ttctgcagca gggcgaccac gtgtggatgg atctgagact gggccaagag      600

ttcgacgtgc caatcggcgc cgtggtcaag ctgtgtgatt ctggccaggt gcaagtcgtg      660

gacgacgagg ataatgagca ctggatcagc cctcagaacg ccacacacat caagcctatg      720

caccccacat ctgtgcacgg cgtggaagat atgatccggc tgggcgatct gaacgaggcc      780

ggcatcctga gaaacctgct gatcagatac cgggaccacc tgatctacac ctacaccggc      840

tctatcctgg tggccgtgaa tccctaccag ctgctgagca tctacagccc cgagcacatc      900

cggcagtaca ccaacaagaa aatcggcgag atgcctcctc acatcttcgc cattgccgac      960

aactgctact tcaacatgaa gcggaacagc cgggaccagt gctgcatcat ctctggcgaa     1020

tctggcgccg gaaagaccga gagcacaaag ctgatcctgc agttcctggc cgccatcagc     1080

ggacagcact cttggattga gcagcaggtc ctggaagcca cacctattct ggaagccttc     1140

ggcaacgcca agaccatccg gaacgacaac agcagcagat tcggcaaata catcgacatc     1200

cacttcaaca agagaggcgc cattgagggc gccaagatcg agcagtacct gctggaaaag     1260

tccagagtgt gcagacaggc cctggacgag agaaactacc acgtgttcta ctgcatgctg     1320

gaaggcatga gcgaggacca gaagaagaag ctcggactcg gccaggccag cgactacaat     1380

tatctggcca tgggcaactg catcacatgc gagggcagag tggacagcca agagtacgcc     1440

aacatccgca gcgccatgaa ggtgctgatg ttcaccgaca ccgagaactg ggagatcagc     1500

aaactgctgg ccgctatcct gcatctgggc aacctgcagt acgaggccag aaccttcgag     1560

aacctggatg cctgcgaggt gctgttctct ccttccctgg ctaccgccgc ctctctgctg     1620

gaagtgaacc ctcctgatct gatgagctgc ctgaccagca gaaccctgat caccagaggc     1680

gagacagtgt ctacccctct gagcagagaa caggctctgg atgtgcggga cgccttcgtg     1740

aagggcatct acggcagact gttcgtgtgg atcgtggaca agatcaacgc cgccatctac     1800

aagcctccaa gccaggacgt gaagaacagc agaagatcca tcggcctgct ggacatcttc     1860

ggcttcgaga atttcgccgt gaacagcttc gagcagctgt gcatcaactt cgccaacgag     1920

cacctccagc agttcttcgt gcggcacgtg ttcaagctgg aacaagagga atacgacctg     1980

gaatccatcg actggctgca catcgagttc accgataacc aggacgccct ggacatgatc     2040

gccaacaagc ccatgaacat catcagcctg atcgacgagg aaagcaagtt ccccaagggc     2100

accgatacca ccatgctgca caagctgaac agccagcaca aactgaatgc caactacatc     2160

ccgcctaaga acaaccacga gacacagttc ggcatcaacc acttcgccgg catcgtgtac     2220

tacgaaaccc agggctttct ggaaaagaac cgggacaccc tgcacggcga catcattcag     2280

ctggtgcaca gcagccggaa caagttcatc aagcagatct tccaggccga cgtcgccatg     2340

ggagccgaga caagaaagag aagccccaca ctgagcagcc agttcaagcg gagtctggaa     2400

ctgctgatga gaaccctggg agcctgccag cctttctttg tgcggtgcat caagcccaac     2460

gagttcaaga aacccatgct gttcgaccgg cacctgtgtg tgcggcagct gagatacagc     2520

ggcatgatgg aaaccatcag gattcggaga gccggctatc ccatccggta cagcttcgtg     2580

gaattcgtcg agcggtacag agtgctgctg cctggcgtga agcctgccta caaacagggc     2640

gatctcagag gcacctgtca gagaatggcc gaagccgtgc tgggcaccca tgacgattgg     2700

cagatcggaa agacaaagat cttcctgaag gaccaccacg acatgctgct cgaggtggaa     2760

agagacaagg ccatcaccga cagagtgatc ctgctccaga aagtgatccg gggcttcaag     2820

gacagaagca atttcctgaa gctgaagaat gccgccactc tgatccagag acactggcgg     2880

ggacacaact gccggaagaa ctacggcctg atgaggctgg gcttcctgag actgcaggcc     2940

ctgcacagaa gcagaaagct gcaccagcag tacagactgg cccggcagcg gatcatccag     3000

tttcaagcca gatgtcgggc ctacctcgtg cgcaaggcct tcagacatag actgtgggcc     3060

gtgctgaccg tgcaggccta tgccagagga atgattgccc gcagactgca ccagagactg     3120

agagccgagt atctgtggcg gctggaagcc gagaaaatgc ggctggccga ggaagagaag     3180

ctgcggaaag agatgagcgc caagaaggcc aaagaagagg ccgagcggaa gcaccaagag     3240

agactggctc aactggccag agaggacgcc gagagagagc tgaaagagaa agaggccgcc     3300

agacggaaga aagaactcct ggaacagatg gaacgggcca gacacgagcc cgtgaaccac     3360

agcgatatgg tggataagat gttcggcttc ctgggcacct ctggcggact gcctggacaa     3420

gaaggacagg cccctagcgg ctttgaggac ctggaacgtg ggagaagaga aatggtggaa     3480

gaggatctgg acgccgctct gcctctgcct gacgaggatg aagaagatct gagcgagtac     3540

aagttcgcca agtttgccgc cacctacttt caaggcacca ccacacacag ctacaccaga     3600

aggcctctga agcagcccct gctgtaccac gatgatgagg gcgatcaact ggcagccctg     3660

gccgtgtgga ttaccatcct cagattcatg ggcgacctgc ctgagcctaa gtaccacacc     3720

gccatgtctg acggctccga gaagatcccc gtgatgacca agatctacga gactctgggc     3780

aagaaaacct acaagcgcga gctgcaggct ctccaaggcg aaggcgaagc tcaactgcct     3840

gagggccaga aaaagtcctc tgtgcgccac aaactggtgc acctgacact gaagaagaaa     3900

agcaagctga cagaggaagt gaccaagcgg ctgcacgatg gcgagtctac agtgcagggc     3960

aacagcatgc tcgaggacag acccaccagc aacctggaaa aactgcactt catcatcggc     4020

aacggaatcc tgcggcctgc tctgagggat gagatctact gccagatctc caagcagctg     4080

acacacaacc ccagcaagag cagctacgcc agaggctgga ttctggtaag tattagctct     4140

ttctttccat gggttggcct cgccgcgtgg gctgagggaa ggactgtcct gggactggac     4200

aggcgggtta tgggacctga agcgataaaa ggcatgcacg tttgcggcta cgtgcatgcc     4260

aaaaggagtc gggcttgcct ccgtgcccga ctccaaaaga cctgctcgag gaggtggacg     4320

agcaggtcaa aaatccgggt accaataaaa tatctttatt ttcattacat ctgtgtgttg     4380

gttttttgtg tg                                                         4392


<210>  158
<211>  4055
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  158
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      120

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      180

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      240

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac      300

catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg      360

atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg      420

ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt      480

acggtgggag gtctatataa gcagagcttg gatgttgcct ttaggatttt tgacctgctc      540

gattgtccac tgcgagcagg tcttttggag tcgggcgagg cggaagcccg actccttttg      600

gcatgcacgc tagccgcgtc gtgcatgcct tttatcttcg ggttatggga ccagtgaagg      660

ctgagggaag gactgtcctg ggactggaca ggcgggttat gggacctgaa aatactaaca      720

atcgattttt tttccctttt tttccaggtg tctctgtgcg tgggctgttt cgccccaagc      780

gagaagttcg tgaagtacct gaggaacttc atccacggcg gacctccagg ctacgcccct      840

tactgtgaag agaggctgag aaggaccttt gtgaacggca cccggacaca gcctccatcc      900

tggctggaac tccaggccac caagagcaaa aagcccatca tgctgcccgt gacctttatg      960

gatggcacca caaagaccct gctgaccgat agcgccacca ccgccaaaga gctgtgtaac     1020

gccctggctg acaagattag cctgaaggat agattcggct tcagcctgta cattgccctg     1080

ttcgacaagg tgtccagcct cggctctggc tctgaccatg tgatggatgc catcagccag     1140

tgcgagcagt atgccaaaga acagggcgcc caagagagga acgctccttg gcggctgttc     1200

tttcggaaag aggtgttcac cccttggcac agccccagcg aagataacgt ggccaccaat     1260

ctgatctacc agcaagttgt gcggggcgtg aagttcggcg agtacagatg cgaaaaagag     1320

gacgatctgg ccgagctggc ctctcagcag tactttgtgg actacggcag cgagatgatc     1380

ctggaacggc tgctgaatct ggtgcccacc tacattcccg atcgggagat caccccactg     1440

aaaaccctcg agaagtgggc ccagctggcc attgctgccc acaagaaagg catctatgcc     1500

cagcggagaa cagacgccca gaaagtcaaa gaggatgtcg ttagctacgc ccggttcaag     1560

tggcctctgc tgtttagccg gttctacgag gcctacaagt tcagcggccc cagtctgccc     1620

aagaacgatg tgatcgtggc tgtgaactgg accggcgtgt acttcgtgga tgagcaagaa     1680

caagtgctgc ttgagctgag cttccccgag atcatggccg tgtccagctc cagagaatgc     1740

agagtgtggc tgagcctggg ctgtagcgat ctgggatgtg ccgctcctca ttctggatgg     1800

gctggactga caccagccgg accttgtagc ccttgttggt cttgccgggg ggccaagaca     1860

acagccccta gctttaccct ggccaccatt aagggcgacg agtacacctt caccagcagc     1920

aacgccgagg acatcagaga tctggtcgtg accttcctgg aaggcctgcg gaagcggagc     1980

aaatatgtgg tggccctgca ggacaacccc aatcctgctg gcgaggaatc cggctttctg     2040

agctttgcca aaggcgacct gatcatcctg gaccacgaca ccggcgagca agtgatgaat     2100

agcggctggg ccaacggcat caatgagcgg acaaagcagc ggggcgactt ccctaccgat     2160

agcgtgtacg tgatgcccac cgtgaccatg cctccaaggg aaatcgtggc cctggtcacc     2220

atgacacccg accagagaca ggatgttgtg cggctgctgc agctgaggac agccgaacca     2280

gaagtgcggg ccaagcctta cacactggaa gagttcagct acgactactt ccggcctcct     2340

ccaaagcaca ccctgtctag agtgatggtg tccaaggcca gaggcaagga taggctgtgg     2400

tcccacacaa gagagcccct gaaacaggca ctgctgaaaa agctgctggg cagcgaggaa     2460

ctgagccaag aagcctgtct ggcctttatc gccgtgctga agtacatggg cgattacccc     2520

tccaagcgga ccagatccgt gaacgaactg accgaccaga ttttcgaggg cccactgaag     2580

gccgagcctc tgaaagatga ggcctacgtg cagattctga aacagctgac cgacaaccac     2640

atccgctaca gcgaggaacg cggatgggaa ctgctgtggc tgtgtaccgg actgttccca     2700

cctagcaaca ttctgctgcc ccacgtgcag cggtttctgc agtctagaaa gcactgccct     2760

ctggccatcg attgcctgca gaggctgcaa aaggccctga gaaatggctc ccggaagtac     2820

cctcctcacc tggtggaagt ggaagccatc cagcacaaga ccacacagat ctttcacaag     2880

gtctacttcc ccgacgacac agacgaggcc tttgaggtgg aatcctctac caaggccaag     2940

gacttctgcc agaatatcgc caccaggctg ctgctgaagt ccagcgaagg ctttagcctg     3000

tttgtgaaga tcgccgacaa agtgctgagc gtgcccgaga acgacttctt tttcgatttt     3060

gtgcgccatc tgaccgactg gattaagaag gctagaccca tcaaggatgg catcgtgccc     3120

agcctgacct atcaggtgtt ctttatgaag aagctgtgga cgaccaccgt gcctggcaag     3180

gatcctatgg ccgacagcat cttccactac taccaagagc tgcccaagta cctgcggggc     3240

taccacaagt gtaccagaga agaggtcctg cagctgggag ccctgatcta tagagtgaag     3300

tttgaagagg acaagagcta cttccctagc atccccaagc tgctgcgcga actggttccc     3360

caggatctga tccggcaagt gtcccctgat gactggaagc ggtctatcgt ggcctacttt     3420

aacaagcacg ccggcaagag taaagaggaa gccaagctgg cctttctgaa gctcatcttt     3480

aagtggccta ccttcggctc cgccttcttc gaagtgaagc agaccaccga gcctaacttc     3540

cctgagattc tgctgatcgc catcaacaaa tacggcgtgt ccctgatcga tcccaagaca     3600

aaggacatcc tgacaacaca ccccttcacc aaaatcagca actggtccag cggcaacacc     3660

tacttccaca tcaccatcgg caatctcgtg cggggctcta agctgctgtg tgaaaccagc     3720

ctgggataca agatggacga cctgctgaca agctacatct cccagatgct gaccgccatg     3780

agcaaacaga gaggctctcg gagcggcaag tggggcgctc gggctgacta caaagaccat     3840

gacggtgatt ataaagatca tgacatcgac tataaggatg acgatgacaa atgaggtacc     3900

aattcctcac ctgcgatctc gatgctttat ttgtgaaatt tgtgatgcta ttgctttatt     3960

tgtaaccatt ataagctgca ataaacaagt taacaacaac aattgcattc attttatgtt     4020

tcaggttcag ggggaggtgt gggaggtttt ttaaa                                4055


<210>  159
<211>  4161
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  159
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      120

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      180

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      240

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac      300

catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg      360

atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg      420

ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt      480

acggtgggag gtctatataa gcagagcttg gatgttgcct ttacttctag gcgcgccgcc      540

accatgaaga gaaccgccga cggcagcgag ttcgagagcc ctaagaaaaa gcggaaggtg      600

gacaagaagt acagcatcgg cctggctatc ggcaccaatt ctgttggctg ggccgtgatc      660

accgacgagt acaaggtgcc cagcaagaaa ttcaaggtgc tgggcaacac cgaccggcac      720

agcatcaaga agaatctgat cggcgccctg ctgttcgact ctggcgaaac agccgaagcc      780

accagactga agaggacagc cagacggcgg tacaccagaa gaaagaaccg gatctgctac      840

ctgcaagaga tcttcagcaa cgagatggcc aaggtggacg acagcttctt ccaccggctg      900

gaagagtcct tcctggtgga agaggataag aagcacgagc ggcaccccat cttcggcaac      960

atcgtggatg aggtggccta ccacgagaag taccccacca tctaccacct gagaaagaaa     1020

ctggtggaca gcaccgacaa ggccgacctg agactgatct atctggccct ggctcacatg     1080

atcaagttcc ggggccactt cctgatcgag ggcgacctga atcctgacaa cagcgacgtg     1140

gacaagctgt tcatccagct ggtgcagacc tacaaccagc tgttcgagga aaaccccatc     1200

aacgccagcg gagtggatgc caaggccatc ctgtctgccc ggctgagcaa gagcagacgg     1260

ctggaaaacc tgatcgctca gctgcccggc gagaagaaga atggcctgtt cggcaacctg     1320

attgccctga gcctgggcct gacacctaac ttcaagagca acttcgacct ggccgaggac     1380

gccaaactgc agctgtccaa ggacacctac gacgacgacc tggacaatct gctggcccag     1440

atcggcgatc agtacgccga cttgtttctg gccgccaaga acctgtccga cgccatcctg     1500

ctgagcgaca tcctgagagt gaacaccgag atcacaaagg cccctctgag cgcctctatg     1560

atcaagagat acgacgagca ccaccaggat ctgaccctgc tgaaggccct cgttagacag     1620

cagctgcctg agaagtacaa agagattttc ttcgaccaga gcaagaacgg ctacgccggc     1680

tacattgatg gcggagccag ccaagaggaa ttctacaagt tcatcaagcc catcctcgag     1740

aagatggacg gcaccgagga actgctggtc aagctgaaca gagaggacct gctgcggaag     1800

cagcggacct tcgacaatgg ctctatccct caccaaatcc acctgggaga gctgcacgcc     1860

attctgcgga gacaagagga cttttaccca ttcctgaagg acaaccggga aaagattgag     1920

aagatcctga ccttcaggat cccctactac gtgggaccac tggccagagg caatagcaga     1980

ttcgcctgga tgaccagaaa gagcgaggaa accatcacac cctggaactt cgaggaagtg     2040

gtggataagg gcgccagcgc tcagtccttc atcgagcgga tgaccaactt cgataagaac     2100

ctgcctaacg agaaggtgct gcccaagcac agcctgctgt acgagtactt caccgtgtac     2160

aacgagctga ccaaagtgaa atacgtgacc gagggaatga gaaagcccgc ctttctgagc     2220

ggcgagcaga aaaaggccat tgtggatctg ctgttcaaga ccaaccggaa agtgaccgtg     2280

aagcagctga aagaggacta cttcaagaaa atcgagtgct tcgacagcgt ggaaatcagc     2340

ggcgtggaag atcggttcaa tgccagcctg ggcacatacc acgacctgct gaaaattatc     2400

aaggacaagg acttcctgga caacgaagag aacgaggaca tcctggaaga tatcgtgctg     2460

accctgacac tgtttgagga cagagagatg atcgaggaac ggctgaaaac atacgcccac     2520

ctgttcgacg acaaagtgat gaagcaactg aagcggcgga gatacaccgg ctggggcaga     2580

ctgtctcgga agctgatcaa cggcatccgg gataagcagt ccggcaagac catcctggac     2640

tttctgaagt ccgacggctt cgccaatcgg aacttcatgc agctgatcca cgacgacagc     2700

ctgaccttta aagaggatat ccagaaagcc caggtgtccg gccagggcga ttctctgcat     2760

gagcacattg ccaacctggc cggctctccc gccattaaga agggcattct gcagacagtg     2820

aaggtggtgg acgagctggt caaagtcatg ggcagacaca agcccgagaa catcgtgatc     2880

gaaatggcca gagagaacca gaccacacag aagggccaga agaacagccg cgagagaatg     2940

aagcggatcg aagagggcat caaagagctg ggcagccaga tcctgaaaga acaccccgtg     3000

gaaaacaccc agctgcagaa cgagaagctg tacctgtact acctccaaaa cggccgggat     3060

atgtatgtgg accaagagct ggacatcaac cggctgtccg actacgatgt ggacgctatc     3120

gtgccccagt cttttctgaa agacgactcc atcgacaaca aggtcctgac cagaagcgac     3180

aagaaccggg gcaagagcga taacgtgccc tccgaagagg tcgtgaagaa gatgaagaac     3240

tactggcgac agctgctgaa cgccaagctg attacccagc ggaagttcga taacctgacc     3300

aaggccgaga gaggcggcct gtctgaactg gataaggccg gcttcatcaa gagacagctg     3360

gtggaaaccc ggcagatcac caaacacgtg gcacagattc tggactcccg gatgaacact     3420

aagtacgacg agaatgacaa gctgatccgg gaagtgaaag tgatcaccct gaagtccaag     3480

ctggtgtccg atttccggaa ggatttccag ttctacaaag tgcgcgagat caacaactac     3540

catcacgccc acgacgccta cctgaatgcc gttgttggaa cagccctgat caagaagtat     3600

cccaagctgg aatccgagtt cgtgtacggc gactacaagg tgtacgacgt gcggaagatg     3660

atcgccaaga gcgagcaaga gattggcaag gctaccgcca agtacttctt ctacagcaac     3720

atcatgaact ttttcaagac cgagattacc ctggccaacg gcgagatcag aaagcggcct     3780

ctgatcgaga caaacggcga aaccggcgag attgtgtggg acaagggcag agattttgcc     3840

accgtgcgga aagtgctgag catgccccaa gtgaatatcg tgaaaaagac cgaggtaagt     3900

attagctctt tctttccatg ggttggcctc gccgcgtggg ctgagggaag gactgtcctg     3960

ggactggaca ggcgggttat gggacctgaa gcgataaaag gcatgcacgt ttgcggctac     4020

gtgcatgcca aaaggagtcg ggcttgcctc cgtgcccgac tccaaaagac ctgctcgagg     4080

aggtggacga gcaggtcaaa aatccgggta ccaataaaat atctttattt tcattacatc     4140

tgtgtgttgg ttttttgtgt g                                               4161


<210>  160
<211>  3410
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  160
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      120

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      180

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      240

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac      300

catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg      360

atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg      420

ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt      480

acggtgggag gtctatataa gcagagcttg gatgttgcct ttaggatttt tgacctgctc      540

gattgtccac tgcgagcagg tcttttggag tcgggcgagg cggaagcccg actccttttg      600

gcatgcacgc tagccgcgtc gtgcatgcct tttatcttcg ggttatggga ccagtgaagg      660

ctgagggaag gactgtcctg ggactggaca ggcgggttat gggacctgaa aatactaaca      720

atcgattttt tttccctttt tttccaggtg cagacaggcg gcttcagcaa agagtccatt      780

ctgcccaaga gaaacagcga taagctgatc gcccggaaga aggactggga ccctaagaag      840

tacggcggct tcgatagccc taccgtggcc tattctgtgc tggtggtggc caaagtggaa      900

aagggcaagt ccaagaaact caagagcgtg aaagagctgc tggggatcac catcatggaa      960

agaagcagct tcgagaagaa tcctatcgat ttcctcgagg ccaagggcta caaagaagtg     1020

aaaaaggacc tgatcatcaa gctccccaag tactccctgt tcgagctgga aaatggccgg     1080

aagcggatgc tggcttctgc tggcgaactg cagaagggaa acgaactggc cctgcctagc     1140

aaatatgtga acttcctgta cctggccagc cactatgaga agctgaaggg cagccccgag     1200

gacaatgagc aaaagcagct gtttgtggaa cagcacaagc actacctgga cgagatcatc     1260

gagcagatct ccgagttctc caagagagtg atcctggccg acgctaatct ggacaaagtg     1320

ctgtccgcct acaacaagca ccgggacaag cctatcagag agcaggccga gaatatcatc     1380

cacctgttta ccctgaccaa tctgggagcc cctgccgcct tcaagtactt tgacaccacc     1440

atcgaccgga agcgctacac cagcaccaaa gaggtgctgg acgccacact gatccaccag     1500

tctatcaccg gcctgtacga gacacggatc gacctgtctc agctcggagg cgatagcagg     1560

gctgacccca agaagaagag gaaggtgtcg ccagggatcc gtcgacttga cgcgttgata     1620

tcaacaagtt tgtacaaaaa agcaggctac aaagaggcca gcggttccgg acgggctgac     1680

gcattggacg attttgatct ggatatgctg ggaagtgacg ccctcgatga ttttgacctt     1740

gacatgcttg gttcggatgc ccttgatgac tttgacctcg acatgctcgg cagtgacgcc     1800

cttgatgatt tcgacctgga catgctgatt aactctagaa gttccggatc tccgaaaaag     1860

aaacgcaaag ttggtagcca gtacctgccc gacaccgacg accggcaccg gatcgaggaa     1920

aagcggaagc ggacctacga gacattcaag agcatcatga agaagtcccc cttcagcggc     1980

cccaccgacc ctagacctcc acctagaaga atcgccgtgc ccagcagatc cagcgccagc     2040

gtgccaaaac ctgcccccca gccttacccc ttcaccagca gcctgagcac catcaactac     2100

gacgagttcc ctaccatggt gttccccagc ggccagatct ctcaggcctc tgctctggct     2160

ccagcccctc ctcaggtgct gcctcaggct cctgctcctg caccagctcc agccatggtg     2220

tctgcactgg ctcaggcacc agcacccgtg cctgtgctgg ctcctggacc tccacaggct     2280

gtggctccac cagcccctaa acctacacag gccggcgagg gcacactgtc tgaagctctg     2340

ctgcagctgc agttcgacga cgaggatctg ggagccctgc tgggaaacag caccgatcct     2400

gccgtgttca ccgacctggc cagcgtggac aacagcgagt tccagcagct gctgaaccag     2460

ggcatccctg tggcccctca caccaccgag cccatgctga tggaataccc cgaggccatc     2520

acccggctcg tgacaggcgc tcagaggcct cctgatccag ctcctgcccc tctgggagca     2580

ccaggcctgc ctaatggact gctgtctggc gacgaggact tcagctctat cgccgatatg     2640

gatttctcag ccttgctggg ctctggcagc ggcagccggg attccaggga agggatgttt     2700

ttgccgaagc ctgaggccgg ctccgctatt agtgacgtgt ttgagggccg cgaggtgtgc     2760

cagccaaaac gaatccggcc atttcatcct ccaggaagtc catgggccaa ccgcccactc     2820

cccgccagcc tcgcaccaac accaaccggt ccagtacatg agccagtcgg gtcactgacc     2880

ccggcaccag tccctcagcc actggatcca gcgcccgcag tgactcccga ggccagtcac     2940

ctgttggagg atcccgatga agagacgagc caggctgtca aagcccttcg ggagatggcc     3000

gatactgtga ttccccagaa ggaagaggct gcaatctgtg gccaaatgga cctttcccat     3060

ccgcccccaa ggggccatct ggatgagctg acaaccacac ttgagtccat gaccgaggat     3120

ctgaacctgg actcacccct gaccccggaa ttgaacgaga ttctggatac cttcctgaac     3180

gacgagtgcc tcttgcatgc catgcatatc agcacaggac tgtccatctt cgacacatct     3240

ctgttttgag gtaccaattc ctcacctgcg atctcgatgc tttatttgtg aaatttgtga     3300

tgctattgct ttatttgtaa ccattataag ctgcaataaa caagttaaca acaacaattg     3360

cattcatttt atgtttcagg ttcaggggga ggtgtgggag gttttttaaa                3410


<210>  161
<211>  4161
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  161
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      120

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      180

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      240

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac      300

catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg      360

atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg      420

ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt      480

acggtgggag gtctatataa gcagagcttg gatgttgcct ttacttctag gcgcgccgcc      540

accatgaaga gaaccgccga cggcagcgag ttcgagagcc ctaagaaaaa gcggaaggtg      600

gacaagaagt acagcatcgg cctggacatc ggcaccaatt ctgttggctg ggccgtgatc      660

accgacgagt acaaggtgcc cagcaagaaa ttcaaggtgc tgggcaacac cgaccggcac      720

agcatcaaga agaatctgat cggcgccctg ctgttcgact ctggcgaaac agccgaagcc      780

accagactga agaggacagc cagacggcgg tacaccagaa gaaagaaccg gatctgctac      840

ctgcaagaga tcttcagcaa cgagatggcc aaggtggacg acagcttctt ccaccggctg      900

gaagagtcct tcctggtgga agaggataag aagcacgagc ggcaccccat cttcggcaac      960

atcgtggatg aggtggccta ccacgagaag taccccacca tctaccacct gagaaagaaa     1020

ctggtggaca gcaccgacaa ggccgacctg agactgatct atctggccct ggctcacatg     1080

atcaagttcc ggggccactt cctgatcgag ggcgacctga atcctgacaa cagcgacgtg     1140

gacaagctgt tcatccagct ggtgcagacc tacaaccagc tgttcgagga aaaccccatc     1200

aacgccagcg gagtggatgc caaggccatc ctgtctgccc ggctgagcaa gagcagacgg     1260

ctggaaaacc tgatcgctca gctgcccggc gagaagaaga atggcctgtt cggcaacctg     1320

attgccctga gcctgggcct gacacctaac ttcaagagca acttcgacct ggccgaggac     1380

gccaaactgc agctgtccaa ggacacctac gacgacgacc tggacaatct gctggcccag     1440

atcggcgatc agtacgccga cttgtttctg gccgccaaga acctgtccga cgccatcctg     1500

ctgagcgaca tcctgagagt gaacaccgag atcacaaagg cccctctgag cgcctctatg     1560

atcaagagat acgacgagca ccaccaggat ctgaccctgc tgaaggccct cgttagacag     1620

cagctgcctg agaagtacaa agagattttc ttcgaccaga gcaagaacgg ctacgccggc     1680

tacattgatg gcggagccag ccaagaggaa ttctacaagt tcatcaagcc catcctcgag     1740

aagatggacg gcaccgagga actgctggtc aagctgaaca gagaggacct gctgcggaag     1800

cagcggacct tcgacaatgg ctctatccct caccaaatcc acctgggaga gctgcacgcc     1860

attctgcgga gacaagagga cttttaccca ttcctgaagg acaaccggga aaagattgag     1920

aagatcctga ccttcaggat cccctactac gtgggaccac tggccagagg caatagcaga     1980

ttcgcctgga tgaccagaaa gagcgaggaa accatcacac cctggaactt cgaggaagtg     2040

gtggataagg gcgccagcgc tcagtccttc atcgagcgga tgaccaactt cgataagaac     2100

ctgcctaacg agaaggtgct gcccaagcac agcctgctgt acgagtactt caccgtgtac     2160

aacgagctga ccaaagtgaa atacgtgacc gagggaatga gaaagcccgc ctttctgagc     2220

ggcgagcaga aaaaggccat tgtggatctg ctgttcaaga ccaaccggaa agtgaccgtg     2280

aagcagctga aagaggacta cttcaagaaa atcgagtgct tcgacagcgt ggaaatcagc     2340

ggcgtggaag atcggttcaa tgccagcctg ggcacatacc acgacctgct gaaaattatc     2400

aaggacaagg acttcctgga caacgaagag aacgaggaca tcctggaaga tatcgtgctg     2460

accctgacac tgtttgagga cagagagatg atcgaggaac ggctgaaaac atacgcccac     2520

ctgttcgacg acaaagtgat gaagcaactg aagcggcgga gatacaccgg ctggggcaga     2580

ctgtctcgga agctgatcaa cggcatccgg gataagcagt ccggcaagac catcctggac     2640

tttctgaagt ccgacggctt cgccaatcgg aacttcatgc agctgatcca cgacgacagc     2700

ctgaccttta aagaggatat ccagaaagcc caggtgtccg gccagggcga ttctctgcat     2760

gagcacattg ccaacctggc cggctctccc gccattaaga agggcattct gcagacagtg     2820

aaggtggtgg acgagctggt caaagtcatg ggcagacaca agcccgagaa catcgtgatc     2880

gaaatggcca gagagaacca gaccacacag aagggccaga agaacagccg cgagagaatg     2940

aagcggatcg aagagggcat caaagagctg ggcagccaga tcctgaaaga acaccccgtg     3000

gaaaacaccc agctgcagaa cgagaagctg tacctgtact acctccaaaa cggccgggat     3060

atgtatgtgg accaagagct ggacatcaac cggctgtccg actacgatgt ggacgctatc     3120

gtgccccagt cttttctgaa agacgactcc atcgacaaca aggtcctgac cagaagcgac     3180

aagaaccggg gcaagagcga taacgtgccc tccgaagagg tcgtgaagaa gatgaagaac     3240

tactggcgac agctgctgaa cgccaagctg attacccagc ggaagttcga taacctgacc     3300

aaggccgaga gaggcggcct gtctgaactg gataaggccg gcttcatcaa gagacagctg     3360

gtggaaaccc ggcagatcac caaacacgtg gcacagattc tggactcccg gatgaacact     3420

aagtacgacg agaatgacaa gctgatccgg gaagtgaaag tgatcaccct gaagtccaag     3480

ctggtgtccg atttccggaa ggatttccag ttctacaaag tgcgcgagat caacaactac     3540

catcacgccc acgacgccta cctgaatgcc gttgttggaa cagccctgat caagaagtat     3600

cccaagctgg aatccgagtt cgtgtacggc gactacaagg tgtacgacgt gcggaagatg     3660

atcgccaaga gcgagcaaga gattggcaag gctaccgcca agtacttctt ctacagcaac     3720

atcatgaact ttttcaagac cgagattacc ctggccaacg gcgagatcag aaagcggcct     3780

ctgatcgaga caaacggcga aaccggcgag attgtgtggg acaagggcag agattttgcc     3840

accgtgcgga aagtgctgag catgccccaa gtgaatatcg tgaaaaagac cgaggtaagt     3900

attagctctt tctttccatg ggttggcctc gccgcgtggg ctgagggaag gactgtcctg     3960

ggactggaca ggcgggttat gggacctgaa gcgataaaag gcatgcacgt ttgcggctac     4020

gtgcatgcca aaaggagtcg ggcttgcctc cgtgcccgac tccaaaagac ctgctcgagg     4080

aggtggacga gcaggtcaaa aatccgggta ccaataaaat atctttattt tcattacatc     4140

tgtgtgttgg ttttttgtgt g                                               4161


<210>  162
<211>  3911
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  162
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      120

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      180

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      240

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac      300

catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg      360

atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg      420

ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt      480

acggtgggag gtctatataa gcagagcttg gatgttgcct ttaggatttt tgacctgctc      540

gattgtccac tgcgagcagg tcttttggag tcgggcgagg cggaagcccg actccttttg      600

gcatgcacgc tagccgcgtc gtgcatgcct tttatcttcg ggttatggga ccagtgaagg      660

ctgagggaag gactgtcctg ggactggaca ggcgggttat gggacctgaa aatactaaca      720

atcgattttt tttccctttt tttccaggtg cagacaggcg gcttcagcaa agagtccatt      780

ctgcccaaga gaaacagcga taagctgatc gcccggaaga aggactggga ccctaagaag      840

tacggcggct tcgatagccc taccgtggcc tattctgtgc tggtggtggc caaagtggaa      900

aagggcaagt ccaagaaact caagagcgtg aaagagctgc tggggatcac catcatggaa      960

agaagcagct tcgagaagaa tcctatcgat ttcctcgagg ccaagggcta caaagaagtg     1020

aaaaaggacc tgatcatcaa gctccccaag tactccctgt tcgagctgga aaatggccgg     1080

aagcggatgc tggcttctgc tggcgaactg cagaagggaa acgaactggc cctgcctagc     1140

aaatatgtga acttcctgta cctggccagc cactatgaga agctgaaggg cagccccgag     1200

gacaatgagc aaaagcagct gtttgtggaa cagcacaagc actacctgga cgagatcatc     1260

gagcagatct ccgagttctc caagagagtg atcctggccg acgctaatct ggacaaagtg     1320

ctgtccgcct acaacaagca ccgggacaag cctatcagag agcaggccga gaatatcatc     1380

cacctgttta ccctgaccaa tctgggagcc cctgccgcct tcaagtactt tgacaccacc     1440

atcgaccgga agcgctacac cagcaccaaa gaggtgctgg acgccacact gatccaccag     1500

tctatcaccg gcctgtacga gacacggatc gacctgtctc agctcggagg cgattctggc     1560

ggatctagcg gtggaagctc tggctctgag acacctggca caagcgagtc tgccacacct     1620

gagtctagcg gcggatcttc aggcggcagc agcaccctga atatcgagga tgagtacaga     1680

ctgcacgaga caagcaaaga acccgacgtg tccctgggct ctacctggct gtctgatttt     1740

cctcaagcct gggccgaaac aggcggaatg ggacttgctg ttagacaggc tcccctgatc     1800

attcccctga aggccacaag cacccctgtg tccatcaagc agtaccccat gtctcaagag     1860

gcccggctgg gaatcaagcc ccacattcag agactgctgg accagggcat cctggtgcct     1920

tgtcaaagcc cttggaatac ccctctgctg cctgtgaaga agcccggcac caacgactac     1980

agacccgtgc aggatctgcg cgaagtgaac aagagagtcg aggacattca ccccaccgtg     2040

cctaatcctt acaacctgct gtctggcctg cctccttccc accaatggta cacagtgctg     2100

gacctgaagg atgccttctt ctgcctgcgg ctgcacccta caagccagcc tctgtttgcc     2160

ttcgagtggc gggatccaga gatgggcatt agcggacagc tgacctggac cagactgccc     2220

cagggcttca agaatagccc cacactgttc aacgaggccc tgcacaggga cctcgccgac     2280

tttagaattc agcaccccga cctgattctg ctgcagtatg tggatgatct gctgctggcc     2340

gctaccagcg agctggattg tcagcaggga acaagagccc tgctgcagac cctgggcaat     2400

ctgggctata gagcctctgc caagaaggcc cagatttgcc agaagcaagt taagtacctg     2460

ggctacctgc tcaaagaagg ccagcgttgg ctgaccgagg ccagaaaaga aaccgtgatg     2520

ggccagccta cacctaagac acccagacag ctgagagagt tcctgggcaa agccggattc     2580

tgcaggctgt ttatccctgg cttcgccgag atggctgccc ctctgtatcc tctgacaaag     2640

cccggaactc tgttcaactg gggcccagac cagcagaaag cctaccaaga gatcaagcag     2700

gctctgctga cagcccctgc tctgggactg cctgatctga ccaagccttt cgagctgttc     2760

gtggacgaga agcagggcta tgccaagggc gtgctgacac agaaactcgg cccttggaga     2820

aggcccgtgg cttacctgag caaaaagctg gatcctgtgg ccgctggctg gcctccttgt     2880

ctgagaatgg tggccgctat cgccgtgctg actaaggatg ccggcaagct gacaatggga     2940

cagcctctgg ttattctggc ccctcatgcc gtggaagccc tcgtgaaaca gcctcctgat     3000

cggtggctga gcaacgccag aatgacccac taccaggcac tgctgctcga caccgacaga     3060

gtgcaatttg gccctgtggt ggccctgaat ccagccacat tgctgcctct gcctgaggag     3120

ggactgcagc acaactgcct cgatatcctg gctgaggccc acggcacaag acccgatctg     3180

acagatcagc cactgcctga cgccgaccac acctggtata cagatggcag ctctctgctg     3240

caagagggcc agagaaaagc tggcgccgct gtgaccacag agacagaagt gatttgggcc     3300

aaagctctgc ctgccggcac atctgcccaa agagccgaac tgatcgcact gacacaggcc     3360

ctgaagatgg ccgagggcaa gaaactgaac gtgtacaccg actccagata cgccttcgcc     3420

accgctcaca tccacggcga aatctacaga cgcagaggat ggctgaccag cgagggaaaa     3480

gagattaaga acaaggacga gattctcgcc ctcctcaagg ccctgttcct gcctaagcgg     3540

ctgagcatca tccactgtcc tggccaccag aagggacact ctgccgaggc tagaggcaac     3600

agaatggccg atcaggctgc cagaaaggcc gccattaccg agacacccga taccagcaca     3660

ctgctgattg agaacagcag cccttccggc ggctccaaaa gaacagctga cggctccgag     3720

tttgagccca aaaagaaacg gaaagtgtga ggtaccaatt cctcacctgc gatctcgatg     3780

ctttatttgt gaaatttgtg atgctattgc tttatttgta accattataa gctgcaataa     3840

acaagttaac aacaacaatt gcattcattt tatgtttcag gttcaggggg aggtgtggga     3900

ggttttttaa a                                                          3911


<210>  163
<211>  3159
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  163
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      120

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      180

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      240

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac      300

catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg      360

atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg      420

ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt      480

acggtgggag gtctatataa gcagagcttg gatgttgcct ttacttctag gcgcgccacc      540

atgaaacgga cagccgacgg aagcgagttc gagtcaccaa agaagaagcg gaaagtcagc      600

agtgaaaccg gaccagtggc agtggaccca accctgagga gacggattga gccccatgaa      660

tttgaagtgt tctttgaccc aagggagctg aggaaggaga catgcctgct gtacgagatc      720

aagtggggca caagccacaa gatctggcgc cacagctcca agaacaccac aaagcacgtg      780

gaagtgaatt tcatcgagaa gtttacctcc gagcggcact tctgcccctc taccagctgt      840

tccatcacat ggtttctgtc ttggagccct tgcggcgagt gttccaaggc catcaccgag      900

ttcctgtctc agcaccctaa cgtgaccctg gtcatctacg tggcccggct gtatcaccac      960

atggaccagc agaacaggca gggcctgcgc gatctggtga attctggcgt gaccatccag     1020

atcatgacag ccccagagta cgactattgc tggcggaact tcgtgaatta tccacctggc     1080

aaggaggcac actggccaag atacccaccc ctgtggatga agctgtatgc actggagctg     1140

cacgcaggaa tcctgggcct gcctccatgt ctgaatatcc tgcggagaaa gcagccccag     1200

ctgacatttt tcaccattgc tctgcaatct tgtcactatc agcggctgcc tcctcatatt     1260

ctgtgggcta ccggcctgaa gtctggagga tctagcggag gatcctctgg cagcgagaca     1320

ccaggaacaa gcgagtcagc aacaccagag agcagtggcg gcagcagcgg cggcagcgac     1380

aagaagtaca gcatcggcct ggccatcggc accaattctg ttggctgggc cgtgatcacc     1440

gacgagtaca aggtgcccag caagaaattc aaggtgctgg gcaacaccga ccggcacagc     1500

atcaagaaga atctgatcgg cgccctgctg ttcgactctg gcgaaacagc cgaagccacc     1560

agactgaaga ggacagccag acggcggtac accagaagaa agaaccggat ctgctacctg     1620

caagagatct tcagcaacga gatggccaag gtggacgaca gcttcttcca ccggctggaa     1680

gagtccttcc tggtggaaga ggataagaag cacgagcggc accccatctt cggcaacatc     1740

gtggatgagg tggcctacca cgagaagtac cccaccatct accacctgag aaagaaactg     1800

gtggacagca ccgacaaggc cgacctgaga ctgatctatc tggccctggc tcacatgatc     1860

aagttccggg gccacttcct gatcgagggc gacctgaatc ctgacaacag cgacgtggac     1920

aagctgttca tccagctggt gcagacctac aaccagctgt tcgaggaaaa ccccatcaac     1980

gccagcggag tggatgccaa ggccatcctg tctgcccggc tgagcaagag cagacggctg     2040

gaaaacctga tcgctcagct gcccggcgag aagaagaatg gcctgttcgg caacctgatt     2100

gccctgagcc tgggcctgac acctaacttc aagagcaact tcgacctggc cgaggacgcc     2160

aaactgcagc tgtccaagga cacctacgac gacgacctgg acaatctgct ggcccagatc     2220

ggcgatcagt acgccgactt gtttctggcc gccaagaacc tgtccgacgc catcctgctg     2280

agcgacatcc tgagagtgaa caccgagatc acaaaggccc ctctgagcgc ctctatgatc     2340

aagagatacg acgagcacca ccaggatctg accctgctga aggccctcgt tagacagcag     2400

ctgcctgaga agtacaaaga gattttcttc gaccagagca agaacggcta cgccggctac     2460

attgatggcg gagccagcca agaggaattc tacaagttca tcaagcccat cctcgagaag     2520

atggacggca ccgaggaact gctggtcaag ctgaacagag aggacctgct gcggaagcag     2580

cggaccttcg acaatggctc tatccctcac caaatccacc tgggagagct gcacgccatt     2640

ctgcggagac aagaggactt ttacccattc ctgaaggaca accgggaaaa gattgagaag     2700

atcctgacct tcaggatccc ctactacgtg ggaccactgg ccagaggcaa tagcagattc     2760

gcctggatga ccagaaagag cgaggaaacc atcacaccct ggaacttcga ggaagtggtg     2820

gataagggcg ccagcgctca gtccttcatc gagcggatga ccaacttcga taagaacctg     2880

cctaacgaga aggtaagtat tagctctttc tttccatggg ttggcctcgc cgcgtgggct     2940

gagggaagga ctgtcctggg actggacagg cgggttatgg gacctgaagc gataaaaggc     3000

atgcacgttt gcggctacgt gcatgccaaa aggagtcggg cttgcctccg tgcccgactc     3060

caaaagacct gctcgaggag gtggacgagc aggtcaaaaa tccgggtacc aataaaatat     3120

ctttattttc attacatctg tgtgttggtt ttttgtgtg                            3159


<210>  164
<211>  4115
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  164
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      120

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      180

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      240

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac      300

catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg      360

atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg      420

ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt      480

acggtgggag gtctatataa gcagagcttg gatgttgcct ttaggatttt tgacctgctc      540

gattgtccac tgcgagcagg tcttttggag tcgggcgagg cggaagcccg actccttttg      600

gcatgcacgc tagccgcgtc gtgcatgcct tttatcttcg ggttatggga ccagtgaagg      660

ctgagggaag gactgtcctg ggactggaca ggcgggttat gggacctgaa aatactaaca      720

atcgattttt tttccctttt tttccaggtg ctgcccaagc acagcctgct gtacgagtac      780

ttcaccgtgt acaacgagct gaccaaagtg aaatacgtga ccgagggaat gagaaagccc      840

gcctttctga gcggcgagca gaaaaaggcc attgtggatc tgctgttcaa gaccaaccgg      900

aaagtgaccg tgaagcagct gaaagaggac tacttcaaga aaatcgagtg cttcgacagc      960

gtggaaatca gcggcgtgga agatcggttc aatgccagcc tgggcacata ccacgacctg     1020

ctgaaaatta tcaaggacaa ggacttcctg gacaacgaag agaacgagga catcctggaa     1080

gatatcgtgc tgaccctgac actgtttgag gacagagaga tgatcgagga acggctgaaa     1140

acatacgccc acctgttcga cgacaaagtg atgaagcaac tgaagcggcg gagatacacc     1200

ggctggggca gactgtctcg gaagctgatc aacggcatcc gggataagca gtccggcaag     1260

accatcctgg actttctgaa gtccgacggc ttcgccaatc ggaacttcat gcagctgatc     1320

cacgacgaca gcctgacctt taaagaggat atccagaaag cccaggtgtc cggccagggc     1380

gattctctgc atgagcacat tgccaacctg gccggctctc ccgccattaa gaagggcatt     1440

ctgcagacag tgaaggtggt ggacgagctg gtcaaagtca tgggcagaca caagcccgag     1500

aacatcgtga tcgaaatggc cagagagaac cagaccacac agaagggcca gaagaacagc     1560

cgcgagagaa tgaagcggat cgaagagggc atcaaagagc tgggcagcca gatcctgaaa     1620

gaacaccccg tggaaaacac ccagctgcag aacgagaagc tgtacctgta ctacctccaa     1680

aacggccggg atatgtatgt ggaccaagag ctggacatca accggctgtc cgactacgat     1740

gtggaccata tcgtgcccca gtcttttctg aaagacgact ccatcgacaa caaggtcctg     1800

accagaagcg acaagaaccg gggcaagagc gataacgtgc cctccgaaga ggtcgtgaag     1860

aagatgaaga actactggcg acagctgctg aacgccaagc tgattaccca gcggaagttc     1920

gataacctga ccaaggccga gagaggcggc ctgtctgaac tggataaggc cggcttcatc     1980

aagagacagc tggtggaaac ccggcagatc accaaacacg tggcacagat tctggactcc     2040

cggatgaaca ctaagtacga cgagaatgac aagctgatcc gggaagtgaa agtgatcacc     2100

ctgaagtcca agctggtgtc cgatttccgg aaggatttcc agttctacaa agtgcgcgag     2160

atcaacaact accatcacgc ccacgacgcc tacctgaatg ccgttgttgg aacagccctg     2220

atcaagaagt atcccaagct ggaatccgag ttcgtgtacg gcgactacaa ggtgtacgac     2280

gtgcggaaga tgatcgccaa gagcgagcaa gagattggca aggctaccgc caagtacttc     2340

ttctacagca acatcatgaa ctttttcaag accgagatta ccctggccaa cggcgagatc     2400

agaaagcggc ctctgatcga gacaaacggc gaaaccggcg agattgtgtg ggacaagggc     2460

agagattttg ccaccgtgcg gaaagtgctg agcatgcccc aagtgaatat cgtgaaaaag     2520

accgaggtgc agacaggcgg cttcagcaaa gagtccattc tgcccaagag aaacagcgat     2580

aagctgatcg cccggaagaa ggactgggac cctaagaagt acggcggctt cgatagccct     2640

accgtggcct attctgtgct ggtggtggcc aaagtggaaa agggcaagtc caagaaactc     2700

aagagcgtga aagagctgct ggggatcacc atcatggaaa gaagcagctt cgagaagaat     2760

cctatcgatt tcctcgaggc caagggctac aaagaagtga aaaaggacct gatcatcaag     2820

ctccccaagt actccctgtt cgagctggaa aatggccgga agcggatgct ggcttctgct     2880

ggcgaactgc agaagggaaa cgaactggcc ctgcctagca aatatgtgaa cttcctgtac     2940

ctggccagcc actatgagaa gctgaagggc agccccgagg acaatgagca aaagcagctg     3000

tttgtggaac agcacaagca ctacctggac gagatcatcg agcagatctc cgagttctcc     3060

aagagagtga tcctggccga cgctaatctg gacaaagtgc tgtccgccta caacaagcac     3120

cgggacaagc ctatcagaga gcaggccgag aatatcatcc acctgtttac cctgaccaat     3180

ctgggagccc ctgccgcctt caagtacttt gacaccacca tcgaccggaa gcgctacacc     3240

agcaccaaag aggtgctgga cgccacactg atccaccagt ctatcaccgg cctgtacgag     3300

acacggatcg acctgtctca gctcggaggc gatagcggcg ggagcggcgg gagcgggggg     3360

agcactaatc tgagcgacat cattgagaag gagactggga aacagctggt cattcaggag     3420

tccatcctga tgctgcctga ggaggtggag gaagtgatcg gcaacaagcc agagtctgac     3480

atcctggtgc acaccgccta cgacgagtcc acagatgaga atgtgatgct gctgacctct     3540

gacgcccccg agtataagcc ttgggccctg gtcatccagg attctaacgg cgagaataag     3600

atcaagatgc tgagcggagg atccggagga tctggaggca gcaccaacct gtctgacatc     3660

atcgagaagg agacaggcaa gcagctggtc atccaggaga gcatcctgat gctgcccgaa     3720

gaagtcgaag aagtgatcgg aaacaagcct gagagcgata tcctggtcca taccgcctac     3780

gacgagagta ccgacgaaaa tgtgatgctg ctgacatccg acgccccaga gtataagccc     3840

tgggctctgg tcatccagga ttccaacgga gagaacaaaa tcaaaatgct gtctggcggc     3900

tcaaaaagaa ccgccgacgg cagcgaattc gagcccaaga agaagaggaa agtctaaacc     3960

aattcctcac ctgcgatctc gatgctttat ttgtgaaatt tgtgatgcta ttgctttatt     4020

tgtaaccatt ataagctgca ataaacaagt taacaacaac aattgcattc attttatgtt     4080

tcaggttcag ggggaggtgt gggaggtttt ttaaa                                4115


<210>  165
<211>  2973
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  165
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      120

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      180

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      240

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac      300

catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg      360

atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg      420

ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt      480

acggtgggag gtctatataa gcagagcttg gatgttgcct ttacttctag gcgcgccacc      540

atgaaacgga cagccgacgg aagcgagttc gagtcaccaa agaagaagcg gaaagtctct      600

gaggtggagt tttcccacga gtactggatg agacatgccc tgaccctggc caagagggca      660

cgggatgaga gggaggtgcc tgtgggagcc gtgctggtgc tgaacaatag agtgatcggc      720

gagggctgga acagagccat cggcctgcac gacccaacag cccatgccga aattatggcc      780

ctgagacagg gcggcctggt catgcagaac tacagactga ttgacgccac cctgtacgtg      840

acattcgagc cttgcgtgat gtgcgccggc gccatgatcc actctaggat cggccgcgtg      900

gtgtttggcg tgaggaactc aaaaagaggc gccgcaggct ccctgatgaa cgtgctgaac      960

taccccggca tgaatcaccg cgtcgaaatt accgagggaa tcctggcaga tgaatgtgcc     1020

gccctgctgt gcgatttcta tcggatgcct agacaggtgt tcaatgctca gaagaaggcc     1080

cagagctcca tcaactccgg aggatctagc ggaggctcct ctggctctga gacacctggc     1140

acaagcgaga gcgcaacacc tgaaagcagc gggggcagca gcggggggtc agacaagaag     1200

tacagcatcg gcctggccat cggcaccaat tctgttggct gggccgtgat caccgacgag     1260

tacaaggtgc ccagcaagaa attcaaggtg ctgggcaaca ccgaccggca cagcatcaag     1320

aagaatctga tcggcgccct gctgttcgac tctggcgaaa cagccgaagc caccagactg     1380

aagaggacag ccagacggcg gtacaccaga agaaagaacc ggatctgcta cctgcaagag     1440

atcttcagca acgagatggc caaggtggac gacagcttct tccaccggct ggaagagtcc     1500

ttcctggtgg aagaggataa gaagcacgag cggcacccca tcttcggcaa catcgtggat     1560

gaggtggcct accacgagaa gtaccccacc atctaccacc tgagaaagaa actggtggac     1620

agcaccgaca aggccgacct gagactgatc tatctggccc tggctcacat gatcaagttc     1680

cggggccact tcctgatcga gggcgacctg aatcctgaca acagcgacgt ggacaagctg     1740

ttcatccagc tggtgcagac ctacaaccag ctgttcgagg aaaaccccat caacgccagc     1800

ggagtggatg ccaaggccat cctgtctgcc cggctgagca agagcagacg gctggaaaac     1860

ctgatcgctc agctgcccgg cgagaagaag aatggcctgt tcggcaacct gattgccctg     1920

agcctgggcc tgacacctaa cttcaagagc aacttcgacc tggccgagga cgccaaactg     1980

cagctgtcca aggacaccta cgacgacgac ctggacaatc tgctggccca gatcggcgat     2040

cagtacgccg acttgtttct ggccgccaag aacctgtccg acgccatcct gctgagcgac     2100

atcctgagag tgaacaccga gatcacaaag gcccctctga gcgcctctat gatcaagaga     2160

tacgacgagc accaccagga tctgaccctg ctgaaggccc tcgttagaca gcagctgcct     2220

gagaagtaca aagagatttt cttcgaccag agcaagaacg gctacgccgg ctacattgat     2280

ggcggagcca gccaagagga attctacaag ttcatcaagc ccatcctcga gaagatggac     2340

ggcaccgagg aactgctggt caagctgaac agagaggacc tgctgcggaa gcagcggacc     2400

ttcgacaatg gctctatccc tcaccaaatc cacctgggag agctgcacgc cattctgcgg     2460

agacaagagg acttttaccc attcctgaag gacaaccggg aaaagattga gaagatcctg     2520

accttcagga tcccctacta cgtgggacca ctggccagag gcaatagcag attcgcctgg     2580

atgaccagaa agagcgagga aaccatcaca ccctggaact tcgaggaagt ggtggataag     2640

ggcgccagcg ctcagtcctt catcgagcgg atgaccaact tcgataagaa cctgcctaac     2700

gagaaggtaa gtattagctc tttctttcca tgggttggcc tcgccgcgtg ggctgaggga     2760

aggactgtcc tgggactgga caggcgggtt atgggacctg aagcgataaa aggcatgcac     2820

gtttgcggct acgtgcatgc caaaaggagt cgggcttgcc tccgtgcccg actccaaaag     2880

acctgctcga ggaggtggac gagcaggtca aaaatccggg taccaataaa atatctttat     2940

tttcattaca tctgtgtgtt ggttttttgt gtg                                  2973


<210>  166
<211>  3560
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  166
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      120

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      180

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      240

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac      300

catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg      360

atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg      420

ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt      480

acggtgggag gtctatataa gcagagcttg gatgttgcct ttaggatttt tgacctgctc      540

gattgtccac tgcgagcagg tcttttggag tcgggcgagg cggaagcccg actccttttg      600

gcatgcacgc tagccgcgtc gtgcatgcct tttatcttcg ggttatggga ccagtgaagg      660

ctgagggaag gactgtcctg ggactggaca ggcgggttat gggacctgaa aatactaaca      720

atcgattttt tttccctttt tttccaggtg ctgcccaagc acagcctgct gtacgagtac      780

ttcaccgtgt acaacgagct gaccaaagtg aaatacgtga ccgagggaat gagaaagccc      840

gcctttctga gcggcgagca gaaaaaggcc attgtggatc tgctgttcaa gaccaaccgg      900

aaagtgaccg tgaagcagct gaaagaggac tacttcaaga aaatcgagtg cttcgacagc      960

gtggaaatca gcggcgtgga agatcggttc aatgccagcc tgggcacata ccacgacctg     1020

ctgaaaatta tcaaggacaa ggacttcctg gacaacgaag agaacgagga catcctggaa     1080

gatatcgtgc tgaccctgac actgtttgag gacagagaga tgatcgagga acggctgaaa     1140

acatacgccc acctgttcga cgacaaagtg atgaagcaac tgaagcggcg gagatacacc     1200

ggctggggca gactgtctcg gaagctgatc aacggcatcc gggataagca gtccggcaag     1260

accatcctgg actttctgaa gtccgacggc ttcgccaatc ggaacttcat gcagctgatc     1320

cacgacgaca gcctgacctt taaagaggat atccagaaag cccaggtgtc cggccagggc     1380

gattctctgc atgagcacat tgccaacctg gccggctctc ccgccattaa gaagggcatt     1440

ctgcagacag tgaaggtggt ggacgagctg gtcaaagtca tgggcagaca caagcccgag     1500

aacatcgtga tcgaaatggc cagagagaac cagaccacac agaagggcca gaagaacagc     1560

cgcgagagaa tgaagcggat cgaagagggc atcaaagagc tgggcagcca gatcctgaaa     1620

gaacaccccg tggaaaacac ccagctgcag aacgagaagc tgtacctgta ctacctccaa     1680

aacggccggg atatgtatgt ggaccaagag ctggacatca accggctgtc cgactacgat     1740

gtggaccata tcgtgcccca gtcttttctg aaagacgact ccatcgacaa caaggtcctg     1800

accagaagcg acaagaaccg gggcaagagc gataacgtgc cctccgaaga ggtcgtgaag     1860

aagatgaaga actactggcg acagctgctg aacgccaagc tgattaccca gcggaagttc     1920

gataacctga ccaaggccga gagaggcggc ctgtctgaac tggataaggc cggcttcatc     1980

aagagacagc tggtggaaac ccggcagatc accaaacacg tggcacagat tctggactcc     2040

cggatgaaca ctaagtacga cgagaatgac aagctgatcc gggaagtgaa agtgatcacc     2100

ctgaagtcca agctggtgtc cgatttccgg aaggatttcc agttctacaa agtgcgcgag     2160

atcaacaact accatcacgc ccacgacgcc tacctgaatg ccgttgttgg aacagccctg     2220

atcaagaagt atcccaagct ggaatccgag ttcgtgtacg gcgactacaa ggtgtacgac     2280

gtgcggaaga tgatcgccaa gagcgagcaa gagattggca aggctaccgc caagtacttc     2340

ttctacagca acatcatgaa ctttttcaag accgagatta ccctggccaa cggcgagatc     2400

agaaagcggc ctctgatcga gacaaacggc gaaaccggcg agattgtgtg ggacaagggc     2460

agagattttg ccaccgtgcg gaaagtgctg agcatgcccc aagtgaatat cgtgaaaaag     2520

accgaggtgc agacaggcgg cttcagcaaa gagtccattc tgcccaagag aaacagcgat     2580

aagctgatcg cccggaagaa ggactgggac cctaagaagt acggcggctt cgatagccct     2640

accgtggcct attctgtgct ggtggtggcc aaagtggaaa agggcaagtc caagaaactc     2700

aagagcgtga aagagctgct ggggatcacc atcatggaaa gaagcagctt cgagaagaat     2760

cctatcgatt tcctcgaggc caagggctac aaagaagtga aaaaggacct gatcatcaag     2820

ctccccaagt actccctgtt cgagctggaa aatggccgga agcggatgct ggcttctgct     2880

ggcgaactgc agaagggaaa cgaactggcc ctgcctagca aatatgtgaa cttcctgtac     2940

ctggccagcc actatgagaa gctgaagggc agccccgagg acaatgagca aaagcagctg     3000

tttgtggaac agcacaagca ctacctggac gagatcatcg agcagatctc cgagttctcc     3060

aagagagtga tcctggccga cgctaatctg gacaaagtgc tgtccgccta caacaagcac     3120

cgggacaagc ctatcagaga gcaggccgag aatatcatcc acctgtttac cctgaccaat     3180

ctgggagccc ctgccgcctt caagtacttt gacaccacca tcgaccggaa gcgctacacc     3240

agcaccaaag aggtgctgga cgccacactg atccaccagt ctatcaccgg cctgtacgag     3300

acacggatcg acctgtctca gctcggaggc gattctggcg gctcaaaaag aaccgccgac     3360

ggcagcgaat tcgagcccaa gaagaagagg aaagtctaag gtaccaattc ctcacctgcg     3420

atctcgatgc tttatttgtg aaatttgtga tgctattgct ttatttgtaa ccattataag     3480

ctgcaataaa caagttaaca acaacaattg cattcatttt atgtttcagg ttcaggggga     3540

ggtgtgggag gttttttaaa                                                 3560


<210>  167
<211>  112
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  167
gatttttgac ctgctcgatt gtccactgcg agcaggtctt ttggagtcgg gcgaggcgga       60

agcccgactc cttttggcat gcacgctagc cgcgtcgtgc atgcctttta tc              112


<210>  168
<211>  13
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  168
gggttatggg acc                                                          13


<210>  169
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  169
ggctgaggga aggactgtcc tggg                                              24


<210>  170
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  170
ctctttcttt ccatgggttg gcct                                              24


<210>  171
<211>  4463
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence


<220>
<221>  misc_feature
<222>  (4225)..(4294)
<223>  n is a, c, g, or t

<400>  171
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      120

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      180

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      240

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac      300

catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg      360

atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg      420

ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt      480

acggtgggag gtctatataa gcagagcttg gatgttgcct ttacttctag gcgcgccgcc      540

accatggccc caaagaagaa gcggaaggtc ggtatccacg gagtcccagc agccaagcgg      600

aactacatcc tgggcctgga catcggcatc accagcgtgg gctacggcat catcgactac      660

gagacacggg acgtgatcga tgccggcgtg cggctgttca aagaggccaa cgtggaaaac      720

aacgagggca ggcggagcaa gagaggcgcc agaaggctga agcggcggag gcggcataga      780

atccagagag tgaagaagct gctgttcgac tacaacctgc tgaccgacca cagcgagctg      840

agcggcatca acccctacga ggccagagtg aagggcctga gccagaagct gagcgaggaa      900

gagttctctg ccgccctgct gcacctggcc aagagaagag gcgtgcacaa cgtgaacgag      960

gtggaagagg acaccggcaa cgagctgtcc accaaagagc agatcagccg gaacagcaag     1020

gccctggaag agaaatacgt ggccgaactg cagctggaac ggctgaagaa agacggcgaa     1080

gtgcggggca gcatcaacag attcaagacc agcgactacg tgaaagaagc caaacagctg     1140

ctgaaggtgc agaaggccta ccaccagctg gaccagagct tcatcgacac ctacatcgac     1200

ctgctggaaa cccggcggac ctactatgag ggacctggcg agggcagccc cttcggctgg     1260

aaggacatca aagaatggta cgagatgctg atgggccact gcacctactt ccccgaggaa     1320

ctgcggagcg tgaagtacgc ctacaacgcc gacctgtaca acgccctgaa cgacctgaac     1380

aatctcgtga tcaccaggga cgagaacgag aagctggaat attacgagaa gttccagatc     1440

atcgagaacg tgttcaagca gaagaagaag cccaccctga agcagatcgc caaagaaatc     1500

ctcgtgaacg aagaggatat taagggctac agagtgacca gcaccggcaa gcccgagttc     1560

accaacctga aggtgtacca cgacatcaag gacattaccg cccggaaaga gattattgag     1620

aacgccgagc tgctggatca gattgccaag atcctgacca tctaccagag cagcgaggac     1680

atccaggaag aactgaccaa tctgaactcc gagctgaccc aggaagagat cgagcagatc     1740

tctaatctga agggctatac cggcacccac aacctgagcc tgaaggccat caacctgatc     1800

ctggacgagc tgtggcacac caacgacaac cagatcgcta tcttcaaccg gctgaagctg     1860

gtgcccaaga aggtggacct gtcccagcag aaagagatcc ccaccaccct ggtggacgac     1920

ttcatcctga gccccgtcgt gaagagaagc ttcatccaga gcatcaaagt gatcaacgcc     1980

atcatcaaga agtacggcct gcccaacgac atcattatcg agctggcccg cgagaagaac     2040

tccaaggacg cccagaaaat gatcaacgag atgcagaagc ggaaccggca gaccaacgag     2100

cggatcgagg aaatcatccg gaccaccggc aaagagaacg ccaagtacct gatcgagaag     2160

atcaagctgc acgacatgca ggaaggcaag tgcctgtaca gcctggaagc catccctctg     2220

gaagatctgc tgaacaaccc cttcaactat gaggtggacc acatcatccc cagaagcgtg     2280

tccttcgaca acagcttcaa caacaaggtg ctcgtgaagc aggaagaaaa cagcaagaag     2340

ggcaaccgga ccccattcca gtacctgagc agcagcgaca gcaagatcag ctacgaaacc     2400

ttcaagaagc acatcctgaa tctggccaag ggcaagggca gaatcagcaa gaccaagaaa     2460

gagtatctgc tggaagaacg ggacatcaac aggttctccg tgcagaaaga cttcatcaac     2520

cggaacctgg tggataccag atacgccacc agaggcctga tgaacctgct gcggagctac     2580

ttcagagtga acaacctgga cgtgaaagtg aagtccatca atggcggctt caccagcttt     2640

ctgcggcgga agtggaagtt taagaaagag cggaacaagg ggtacaagca ccacgccgag     2700

gacgccctga tcattgccaa cgccgatttc atcttcaaag agtggaagaa actggacaag     2760

gccaaaaaag tgatggaaaa ccagatgttc gaggaaaagc aggccgagag catgcccgag     2820

atcgaaaccg agcaggagta caaagagatc ttcatcaccc cccaccagat caagcacatt     2880

aaggacttca aggactacaa gtacagccac cgggtggaca agaagcctaa tagagagctg     2940

attaacgaca ccctgtactc cacccggaag gacgacaagg gcaacaccct gatcgtgaac     3000

aatctgaacg gcctgtacga caaggacaat gacaagctga aaaagctgat caacaagagc     3060

cccgaaaagc tgctgatgta ccaccacgac ccccagacct accagaaact gaagctgatt     3120

atggaacagt acggcgacga gaagaatccc ctgtacaagt actacgagga aaccgggaac     3180

tacctgacca agtactccaa aaaggacaac ggccccgtga tcaagaagat taagtattac     3240

ggcaacaaac tgaacgccca tctggacatc accgacgact accccaacag cagaaacaag     3300

gtcgtgaagc tgtccctgaa gccctacaga ttcgacgtgt acctggacaa tggcgtgtac     3360

aagttcgtga ccgtgaagaa tctggatgtg atcaaaaaag aaaactacta cgaagtgaat     3420

agcaagtgct atgaggaagc taagaagctg aagaagatca gcaaccaggc cgagtttatc     3480

gcctccttct acaacaacga tctgatcaag atcaacggcg agctgtatag agtgatcggc     3540

gtgaacaacg acctgctgaa ccggatcgaa gtgaacatga tcgacatcac ctaccgcgag     3600

tacctggaaa acatgaacga caagaggccc cccaggatca ttaagacaat cgccggaagc     3660

ggagctacta acttcagcct gctgaagcag gctggagacg tggaggagaa ccctggacct     3720

aggcgcgccg ccaccatggt gagcaagggc gaggagctgt tcaccggggt ggtgcccatc     3780

ctggtcgagc tggacggcga cgtaaacggc cacaagttca gcgtgtccgg cgagggcgag     3840

ggcgatgcca cctacggcaa gctgaccctg aagttcatct gcaccaccgg caagctgccc     3900

gtgccctggc ccaccctcgt gaccaccttc ggctacggcc tgatgtgctt cgcccgctac     3960

cccgaccaca tgaagcagca cgacttcttc aagtccgcca tgcccgaagg ctacgtccag     4020

gagcgcacca tcttcttcaa ggacgacggc aactacaaga cccgcgccga ggtgaagttc     4080

gagggcgaca ccctggtgaa ccgcatcgag ctgaagggca tcgacttcaa ggaggacggc     4140

aacatcctgg ggcacaagct ggagtacaac tacaacagcc acaacgtcta tatcatggcc     4200

gacaagcaga agaacggcat caagnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn     4260

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnngataaa aggcatgcac gtttgcggct     4320

acgtgcatgc caaaaggagt cgggcttgcc tccgtgcccg actccaaaag acctgctcga     4380

ggaggtggac gagcaggtca aaaatccggg taccaataaa atatctttat tttcattaca     4440

tctgtgtgtt ggttttttgt gtg                                             4463


<210>  172
<211>  3467
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  172
cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt       60

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      120

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      180

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      240

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac      300

catggtgatg cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg      360

atttccaagt ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg      420

ggactttcca aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt      480

acggtgggag gtctatataa gcagagcttg gatgttgcct ttaggatttt tgacctgctc      540

gattgtccac tgcgagcagg tcttttggag tcgggcgagg cggaagcccg actccttttg      600

gcatgcacgc tagccgcgtc gtgcatgcct tttatcttcg ggttatggga ccagtgaagg      660

ctgagggaag gactgtcctg ggactggaca ggcgggttat gggacctgaa aatactaaca      720

atcgattttt tttccctttt tttccaggtg gacagaccca gagtggatat cgagtgtgct      780

ggcaaagggg tgcagagcag cctgatccat aactacaaga agaaccccaa cttcaacacc      840

ctggtcaagt ggttcgaagt ggatctgccc gagaacgaac tgctgcaccc acctctgaac      900

atcagagtgg tggactgcag agccttcggc agatacaccc tcgtgggatc tcacgccgtg      960

tctagcctga gaagattcat ctacagacct ccagacagaa gcgcccctaa ctggaacaca     1020

acaggcgagg tggtggtgtc catggaaccc gaggaacccg tgaagaaact ggaaaccatg     1080

gtcaagctgg acgccacctc cgatgctgtc gtgaaagtgg acgtggccga ggacgagaaa     1140

gagcgcaaga agaagaaaaa gaagggcccc agcgaggaac ctgaagagga agaacctgac     1200

gagagcatgc tggactggtg gtccaagtac ttcgcctcca tcgacacaat gaaggaacag     1260

ctgagacagc acgagacaag cggcaccgac ctcgaagaga aagaagagat ggaatccgcc     1320

gaaggactga agggccctat gaagtccaaa gagaagtcta gggccgccaa agaagagaaa     1380

aaaaagaaga accagtctcc tggaccaggc cagggatctg aggctcccga aaagaaaaag     1440

gccaagatcg acgagctgaa ggtgtacccc aaagagctgg aaagcgagtt cgacagcttc     1500

gaggactggc tgcacacctt caatctgctg agaggaaaga caggcgacga cgaggatggc     1560

agcactgaag aagagagaat cgtcggcaga ttcaagggca gcctgtgcgt gtacaaggtg     1620

ccactgcctg aggacgtgtc cagagaggct ggctacgatc ctacctacgg catgttccaa     1680

ggcatcccta gcaacgaccc catcaatgtg ctcgtgcgga tctatgtcgt gcgggccact     1740

gatctgcatc ccgccgatat caacggcaag gcagacccct atatcgctat caagctgggg     1800

aaaaccgaca tcagggacaa agagaactac atcagcaagc agctgaaccc cgtgttcggc     1860

aagagcttcg acatcgaggc tagcttcccc atggaatcca tgctgaccgt ggccgtgtac     1920

gactgggatc tcgtgggaac agacgacctg atcggagaga caaagattga cctggaaaac     1980

cggttctact ccaagcaccg ggccacctgt ggaatcgccc agacctactc tatccacggc     2040

tacaacatct ggcgggaccc catgaagcct agccagatcc tgaccaggct gtgcaaagaa     2100

ggcaaggtcg acggccctca ctttggacct cacggccggg tcagagtggc caacagagtg     2160

ttcacaggcc cctccgagat cgaggatgag aacggccaga gaaagcccac cgatgagcat     2220

gtggctctga gcgctctgag acactgggaa gatatcccta gagtgggctg cagactggtg     2280

cccgagcacg tggaaacaag acccctgctg aacccagaca agcccggaat cgaacagggc     2340

agactcgaac tgtgggtcga catgttccct atggacatgc ccgcacctgg cacaccactg     2400

gacatcagcc ctaggaagcc caagaaatac gagctgcgcg tgatcgtgtg gaacaccgac     2460

gaagtggtgc tggaagatga cgacttcttc accggcgaaa agtccagcga catcttcgtc     2520

agaggatggc tgaagggaca gcaagaggat aagcaggaca ccgacgtgca ctaccacagc     2580

cttacaggcg aaggcaactt taactggcgc tacctgtttc ctttcgacta cctggccgcc     2640

gaagagaaga tcgtgatgtc caagaaagaa tctatgttca gctgggacga gacagagtac     2700

aagatccccg ccagactgac cctgcagatc tgggatgccg atcacttcag cgccgacgac     2760

tttctgggag ccatcgagct ggacctgaat agattcccca gaggcgccaa gaccgccaag     2820

cagtgcacaa tggaaatggc cactggcgag gtcgacgtgc cactggtgtc tatcttcaag     2880

cagaagcgcg tcaaaggctg gtggcccctg ctggctagaa acgagaacga cgagttcgag     2940

ctgaccggaa aggtggaagc cgagctgcat ctgctgacag ctgaagaggc cgagaagaat     3000

cctgtgggcc tcgctaggaa tgagcccgat cctctggaaa agcccaacag acccgatacc     3060

gccttcgtgt ggtttctgaa cccactgaag tccatcaagt acctgatctg tacccggtac     3120

aagtggctga ttatcaagat cgtgctggcc ctgctggggc tgctgatgct tgctctgttc     3180

ctgtactccc tgcctggcta tatggtcaag aagctgctgg gcgccggcgc tcgggctgac     3240

tacaaagacc atgacggtga ttataaagat catgacatcg actataagga tgacgatgac     3300

aaatgaggta ccaattcctc acctgcgatc tcgatgcttt atttgtgaaa tttgtgatgc     3360

tattgcttta tttgtaacca ttataagctg caataaacaa gttaacaaca acaattgcat     3420

tcattttatg tttcaggttc agggggaggt gtgggaggtt ttttaaa                   3467


<210>  173
<211>  33
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  173
gtaagtattg ctttcatttt tgtctttttt taa                                    33


<210>  174
<211>  30
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  174
gtaagttctt gctttgttca aactgtctat                                        30


<210>  175
<211>  27
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  175
gtaagtattc ttttgttctt cactcat                                           27


<210>  176
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  176
gtaagtattt ttttactcct catttttact cc                                     32


<210>  177
<211>  36
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  177
gtaagtattt ttttacggtt atattctcct ttcccc                                 36


<210>  178
<211>  28
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  178
gtaagtattt tctgttgttt attttcag                                          28


<210>  179
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  179
gtaagtattg gggttgatta tgtgtgggac ggtgtaagg                              39


<210>  180
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  180
gtaagtattt cctctttctt tccatgggtt ggcct                                  35


<210>  181
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  181
gtaagtatta ccagagattc gtagacctgc ttgac                                  35


<210>  182
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  182
tggggctggg cagagggttg aggggagagg gtcctgggg                              39


<210>  183
<211>  28
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  183
tcatgggtgg gttcattggg tgggttca                                          28


<210>  184
<211>  23
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  184
tagggcgcag tagtccaggg ttt                                               23


<210>  185
<211>  30
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  185
ttctctgtgg ggtggcattc tctgctctct                                        30


<210>  186
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  186
gggttatggg acctcaggga taagggacc                                         29


<210>  187
<211>  15
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  187
cggggatggg ggtca                                                        15


<210>  188
<211>  23
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  188
tggggggagg tcatgggggg agg                                               23


<210>  189
<211>  24
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  189
gttggtggtt tcatgttggt ggtt                                              24


<210>  190
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  190
gggtttcggg ttttcaggtg gtcgttggt                                         29


<210>  191
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  191
ggtggtcgtt ggttcatttg ggctattgg                                         29


<210>  192
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  192
tttgggctat tggtcaaggg ggcgagggg                                         29


<210>  193
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  193
agggggcgag gggtcaggta ttcggtatt                                         29


<210>  194
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  194
ggtattcggt atttcaaggt aacaggtaa                                         29


<210>  195
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  195
aggtaacagg taatcagggt ttcgggttt                                         29


<210>  196
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  196
tcttactttt gtaaacttta tggtttgtg                                         29


<210>  197
<211>  28
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  197
cacgtattct cggtacggac gttacaga                                          28


<210>  198
<211>  13
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  198
taagctggta tcc                                                          13


<210>  199
<211>  34
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  199
cactaactct ttttcccccc tttttttttt acag                                   34


<210>  200
<211>  36
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  200
tactaactct ttcttttttc ctttccttct tcacag                                 36


<210>  201
<211>  43
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  201
cactaactct gtcatactta tcctgtccct tttttttcca cag                         43


<210>  202
<211>  45
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  202
cactaactct ctttcttttt cttccctcct ctcccccaac tgcag                       45


<210>  203
<211>  38
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  203
cactaactct tttttttttt tttttttttt tacagcag                               38


<210>  204
<211>  13
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic nucleic acid sequence

<400>  204
taagctggta tcc                                                          13


<210>  205
<211>  8
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  branch point sequence

<400>  205
tactaaca                                                                 8


<210>  206
<211>  49
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  polyadenylation signal

<400>  206
aataaaatat ctttattttc attacatctg tgtgttggtt ttttgtgtg                   49


<210>  207
<211>  4104
<212>  DNA
<213>  Streptococcus pyogenes


<220>
<221>  CDS
<222>  (1)..(4104)

<400>  207
atg gac aag aag tac agc atc ggc ctg gac atc ggc acc aac tct gtg         48
Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val           
1               5                   10                  15                

ggc tgg gcc gtg atc acc gac gag tac aag gtg ccc agc aag aaa ttc         96
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe           
            20                  25                  30                    

aag gtg ctg ggc aac acc gac cgg cac agc atc aag aag aac ctg atc        144
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile           
        35                  40                  45                        

gga gcc ctg ctg ttc gac agc ggc gaa aca gcc gag gcc acc cgg ctg        192
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu           
    50                  55                  60                            

aag aga acc gcc aga aga aga tac acc aga cgg aag aac cgg atc tgc        240
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys           
65                  70                  75                  80            

tat ctg caa gag atc ttc agc aac gag atg gcc aag gtg gac gac agc        288
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser           
                85                  90                  95                

ttc ttc cac aga ctg gaa gag tcc ttc ctg gtg gaa gag gat aag aag        336
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys           
            100                 105                 110                   

cac gag cgg cac ccc atc ttc ggc aac atc gtg gac gag gtg gcc tac        384
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr           
        115                 120                 125                       

cac gag aag tac ccc acc atc tac cac ctg aga aag aaa ctg gtg gac        432
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp           
    130                 135                 140                           

agc acc gac aag gcc gac ctg cgg ctg atc tat ctg gcc ctg gcc cac        480
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His           
145                 150                 155                 160           

atg atc aag ttc cgg ggc cac ttc ctg atc gag ggc gac ctg aac ccc        528
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro           
                165                 170                 175               

gac aac agc gac gtg gac aag ctg ttc atc cag ctg gtg cag acc tac        576
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr           
            180                 185                 190                   

aac cag ctg ttc gag gaa aac ccc atc aac gcc agc ggc gtg gac gcc        624
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala           
        195                 200                 205                       

aag gcc atc ctg tct gcc aga ctg agc aag agc aga cgg ctg gaa aat        672
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn           
    210                 215                 220                           

ctg atc gcc cag ctg ccc ggc gag aag aag aat ggc ctg ttc gga aac        720
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn           
225                 230                 235                 240           

ctg att gcc ctg agc ctg ggc ctg acc ccc aac ttc aag agc aac ttc        768
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe           
                245                 250                 255               

gac ctg gcc gag gat gcc aaa ctg cag ctg agc aag gac acc tac gac        816
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp           
            260                 265                 270                   

gac gac ctg gac aac ctg ctg gcc cag atc ggc gac cag tac gcc gac        864
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp           
        275                 280                 285                       

ctg ttt ctg gcc gcc aag aac ctg tcc gac gcc atc ctg ctg agc gac        912
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp           
    290                 295                 300                           

atc ctg aga gtg aac acc gag atc acc aag gcc ccc ctg agc gcc tct        960
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser           
305                 310                 315                 320           

atg atc aag aga tac gac gag cac cac cag gac ctg acc ctg ctg aaa       1008
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys           
                325                 330                 335               

gct ctc gtg cgg cag cag ctg cct gag aag tac aaa gag att ttc ttc       1056
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe           
            340                 345                 350                   

gac cag agc aag aac ggc tac gcc ggc tac att gac ggc gga gcc agc       1104
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser           
        355                 360                 365                       

cag gaa gag ttc tac aag ttc atc aag ccc atc ctg gaa aag atg gac       1152
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp           
    370                 375                 380                           

ggc acc gag gaa ctg ctc gtg aag ctg aac aga gag gac ctg ctg cgg       1200
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg           
385                 390                 395                 400           

aag cag cgg acc ttc gac aac ggc agc atc ccc cac cag atc cac ctg       1248
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu           
                405                 410                 415               

gga gag ctg cac gcc att ctg cgg cgg cag gaa gat ttt tac cca ttc       1296
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe           
            420                 425                 430                   

ctg aag gac aac cgg gaa aag atc gag aag atc ctg acc ttc cgc atc       1344
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile           
        435                 440                 445                       

ccc tac tac gtg ggc cct ctg gcc agg gga aac agc aga ttc gcc tgg       1392
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp           
    450                 455                 460                           

atg acc aga aag agc gag gaa acc atc acc ccc tgg aac ttc gag gaa       1440
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu           
465                 470                 475                 480           

gtg gtg gac aag ggc gct tcc gcc cag agc ttc atc gag cgg atg acc       1488
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr           
                485                 490                 495               

aac ttc gat aag aac ctg ccc aac gag aag gtg ctg ccc aag cac agc       1536
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser           
            500                 505                 510                   

ctg ctg tac gag tac ttc acc gtg tat aac gag ctg acc aaa gtg aaa       1584
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys           
        515                 520                 525                       

tac gtg acc gag gga atg aga aag ccc gcc ttc ctg agc ggc gag cag       1632
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln           
    530                 535                 540                           

aaa aag gcc atc gtg gac ctg ctg ttc aag acc aac cgg aaa gtg acc       1680
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr           
545                 550                 555                 560           

gtg aag cag ctg aaa gag gac tac ttc aag aaa atc gag tgc ttc gac       1728
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp           
                565                 570                 575               

tcc gtg gaa atc tcc ggc gtg gaa gat cgg ttc aac gcc tcc ctg ggc       1776
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly           
            580                 585                 590                   

aca tac cac gat ctg ctg aaa att atc aag gac aag gac ttc ctg gac       1824
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp           
        595                 600                 605                       

aat gag gaa aac gag gac att ctg gaa gat atc gtg ctg acc ctg aca       1872
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr           
    610                 615                 620                           

ctg ttt gag gac aga gag atg atc gag gaa cgg ctg aaa acc tat gcc       1920
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala           
625                 630                 635                 640           

cac ctg ttc gac gac aaa gtg atg aag cag ctg aag cgg cgg aga tac       1968
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr           
                645                 650                 655               

acc ggc tgg ggc agg ctg agc cgg aag ctg atc aac ggc atc cgg gac       2016
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp           
            660                 665                 670                   

aag cag tcc ggc aag aca atc ctg gat ttc ctg aag tcc gac ggc ttc       2064
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe           
        675                 680                 685                       

gcc aac aga aac ttc atg cag ctg atc cac gac gac agc ctg acc ttt       2112
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe           
    690                 695                 700                           

aaa gag gac atc cag aaa gcc cag gtg tcc ggc cag ggc gat agc ctg       2160
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu           
705                 710                 715                 720           

cac gag cac att gcc aat ctg gcc ggc agc ccc gcc att aag aag ggc       2208
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly           
                725                 730                 735               

atc ctg cag aca gtg aag gtg gtg gac gag ctc gtg aaa gtg atg ggc       2256
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly           
            740                 745                 750                   

cgg cac aag ccc gag aac atc gtg atc gaa atg gcc aga gag aac cag       2304
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln           
        755                 760                 765                       

acc acc cag aag gga cag aag aac agc cgc gag aga atg aag cgg atc       2352
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile           
    770                 775                 780                           

gaa gag ggc atc aaa gag ctg ggc agc cag atc ctg aaa gaa cac ccc       2400
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro           
785                 790                 795                 800           

gtg gaa aac acc cag ctg cag aac gag aag ctg tac ctg tac tac ctg       2448
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu           
                805                 810                 815               

cag aat ggg cgg gat atg tac gtg gac cag gaa ctg gac atc aac cgg       2496
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg           
            820                 825                 830                   

ctg tcc gac tac gat gtg gac cat atc gtg cct cag agc ttt ctg aag       2544
Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys           
        835                 840                 845                       

gac gac tcc atc gac aac aag gtg ctg acc aga agc gac aag aac cgg       2592
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg           
    850                 855                 860                           

ggc aag agc gac aac gtg ccc tcc gaa gag gtc gtg aag aag atg aag       2640
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys           
865                 870                 875                 880           

aac tac tgg cgg cag ctg ctg aac gcc aag ctg att acc cag aga aag       2688
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys           
                885                 890                 895               

ttc gac aat ctg acc aag gcc gag aga ggc ggc ctg agc gaa ctg gat       2736
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp           
            900                 905                 910                   

aag gcc ggc ttc atc aag aga cag ctg gtg gaa acc cgg cag atc aca       2784
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr           
        915                 920                 925                       

aag cac gtg gca cag atc ctg gac tcc cgg atg aac act aag tac gac       2832
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp           
    930                 935                 940                           

gag aat gac aag ctg atc cgg gaa gtg aaa gtg atc acc ctg aag tcc       2880
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser           
945                 950                 955                 960           

aag ctg gtg tcc gat ttc cgg aag gat ttc cag ttt tac aaa gtg cgc       2928
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg           
                965                 970                 975               

gag atc aac aac tac cac cac gcc cac gac gcc tac ctg aac gcc gtc       2976
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val           
            980                 985                 990                   

gtg gga acc gcc ctg atc aaa aag  tac cct aag ctg gaa  agc gag ttc     3024
Val Gly Thr Ala Leu Ile Lys Lys  Tyr Pro Lys Leu Glu  Ser Glu Phe         
        995                 1000                 1005                     

gtg tac  ggc gac tac aag gtg  tac gac gtg cgg aag  atg atc gcc        3069
Val Tyr  Gly Asp Tyr Lys Val  Tyr Asp Val Arg Lys  Met Ile Ala            
    1010                 1015                 1020                        

aag agc  gag cag gaa atc ggc  aag gct acc gcc aag  tac ttc ttc        3114
Lys Ser  Glu Gln Glu Ile Gly  Lys Ala Thr Ala Lys  Tyr Phe Phe            
    1025                 1030                 1035                        

tac agc  aac atc atg aac ttt  ttc aag acc gag att  acc ctg gcc        3159
Tyr Ser  Asn Ile Met Asn Phe  Phe Lys Thr Glu Ile  Thr Leu Ala            
    1040                 1045                 1050                        

aac ggc  gag atc cgg aag cgg  cct ctg atc gag aca  aac ggc gaa        3204
Asn Gly  Glu Ile Arg Lys Arg  Pro Leu Ile Glu Thr  Asn Gly Glu            
    1055                 1060                 1065                        

acc ggg  gag atc gtg tgg gat  aag ggc cgg gat ttt  gcc acc gtg        3249
Thr Gly  Glu Ile Val Trp Asp  Lys Gly Arg Asp Phe  Ala Thr Val            
    1070                 1075                 1080                        

cgg aaa  gtg ctg agc atg ccc  caa gtg aat atc gtg  aaa aag acc        3294
Arg Lys  Val Leu Ser Met Pro  Gln Val Asn Ile Val  Lys Lys Thr            
    1085                 1090                 1095                        

gag gtg  cag aca ggc ggc ttc  agc aaa gag tct atc  ctg ccc aag        3339
Glu Val  Gln Thr Gly Gly Phe  Ser Lys Glu Ser Ile  Leu Pro Lys            
    1100                 1105                 1110                        

agg aac  agc gat aag ctg atc  gcc aga aag aag gac  tgg gac cct        3384
Arg Asn  Ser Asp Lys Leu Ile  Ala Arg Lys Lys Asp  Trp Asp Pro            
    1115                 1120                 1125                        

aag aag  tac ggc ggc ttc gac  agc ccc acc gtg gcc  tat tct gtg        3429
Lys Lys  Tyr Gly Gly Phe Asp  Ser Pro Thr Val Ala  Tyr Ser Val            
    1130                 1135                 1140                        

ctg gtg  gtg gcc aaa gtg gaa  aag ggc aag tcc aag  aaa ctg aag        3474
Leu Val  Val Ala Lys Val Glu  Lys Gly Lys Ser Lys  Lys Leu Lys            
    1145                 1150                 1155                        

agt gtg  aaa gag ctg ctg ggg  atc acc atc atg gaa  aga agc agc        3519
Ser Val  Lys Glu Leu Leu Gly  Ile Thr Ile Met Glu  Arg Ser Ser            
    1160                 1165                 1170                        

ttc gag  aag aat ccc atc gac  ttt ctg gaa gcc aag  ggc tac aaa        3564
Phe Glu  Lys Asn Pro Ile Asp  Phe Leu Glu Ala Lys  Gly Tyr Lys            
    1175                 1180                 1185                        

gaa gtg  aaa aag gac ctg atc  atc aag ctg cct aag  tac tcc ctg        3609
Glu Val  Lys Lys Asp Leu Ile  Ile Lys Leu Pro Lys  Tyr Ser Leu            
    1190                 1195                 1200                        

ttc gag  ctg gaa aac ggc cgg  aag aga atg ctg gcc  tct gcc ggc        3654
Phe Glu  Leu Glu Asn Gly Arg  Lys Arg Met Leu Ala  Ser Ala Gly            
    1205                 1210                 1215                        

gaa ctg  cag aag gga aac gaa  ctg gcc ctg ccc tcc  aaa tat gtg        3699
Glu Leu  Gln Lys Gly Asn Glu  Leu Ala Leu Pro Ser  Lys Tyr Val            
    1220                 1225                 1230                        

aac ttc  ctg tac ctg gcc agc  cac tat gag aag ctg  aag ggc tcc        3744
Asn Phe  Leu Tyr Leu Ala Ser  His Tyr Glu Lys Leu  Lys Gly Ser            
    1235                 1240                 1245                        

ccc gag  gat aat gag cag aaa  cag ctg ttt gtg gaa  cag cac aag        3789
Pro Glu  Asp Asn Glu Gln Lys  Gln Leu Phe Val Glu  Gln His Lys            
    1250                 1255                 1260                        

cac tac  ctg gac gag atc atc  gag cag atc agc gag  ttc tcc aag        3834
His Tyr  Leu Asp Glu Ile Ile  Glu Gln Ile Ser Glu  Phe Ser Lys            
    1265                 1270                 1275                        

aga gtg  atc ctg gcc gac gct  aat ctg gac aaa gtg  ctg tcc gcc        3879
Arg Val  Ile Leu Ala Asp Ala  Asn Leu Asp Lys Val  Leu Ser Ala            
    1280                 1285                 1290                        

tac aac  aag cac cgg gat aag  ccc atc aga gag cag  gcc gag aat        3924
Tyr Asn  Lys His Arg Asp Lys  Pro Ile Arg Glu Gln  Ala Glu Asn            
    1295                 1300                 1305                        

atc atc  cac ctg ttt acc ctg  acc aat ctg gga gcc  cct gcc gcc        3969
Ile Ile  His Leu Phe Thr Leu  Thr Asn Leu Gly Ala  Pro Ala Ala            
    1310                 1315                 1320                        

ttc aag  tac ttt gac acc acc  atc gac cgg aag agg  tac acc agc        4014
Phe Lys  Tyr Phe Asp Thr Thr  Ile Asp Arg Lys Arg  Tyr Thr Ser            
    1325                 1330                 1335                        

acc aaa  gag gtg ctg gac gcc  acc ctg atc cac cag  agc atc acc        4059
Thr Lys  Glu Val Leu Asp Ala  Thr Leu Ile His Gln  Ser Ile Thr            
    1340                 1345                 1350                        

ggc ctg  tac gag aca cgg atc  gac ctg tct cag ctg  gga ggc gac        4104
Gly Leu  Tyr Glu Thr Arg Ile  Asp Leu Ser Gln Leu  Gly Gly Asp            
    1355                 1360                 1365                        


<210>  208
<211>  1368
<212>  PRT
<213>  Streptococcus pyogenes

<400>  208

Met Asp Lys Lys Tyr Ser Ile Gly Leu Asp Ile Gly Thr Asn Ser Val 
1               5                   10                  15      


Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe 
            20                  25                  30          


Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile 
        35                  40                  45              


Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu 
    50                  55                  60                  


Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys 
65                  70                  75                  80  


Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser 
                85                  90                  95      


Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys 
            100                 105                 110         


His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr 
        115                 120                 125             


His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp 
    130                 135                 140                 


Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His 
145                 150                 155                 160 


Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro 
                165                 170                 175     


Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr 
            180                 185                 190         


Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala 
        195                 200                 205             


Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn 
    210                 215                 220                 


Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn 
225                 230                 235                 240 


Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe 
                245                 250                 255     


Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp 
            260                 265                 270         


Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp 
        275                 280                 285             


Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp 
    290                 295                 300                 


Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser 
305                 310                 315                 320 


Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys 
                325                 330                 335     


Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe 
            340                 345                 350         


Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser 
        355                 360                 365             


Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp 
    370                 375                 380                 


Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg 
385                 390                 395                 400 


Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu 
                405                 410                 415     


Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe 
            420                 425                 430         


Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile 
        435                 440                 445             


Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp 
    450                 455                 460                 


Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu 
465                 470                 475                 480 


Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr 
                485                 490                 495     


Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser 
            500                 505                 510         


Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys 
        515                 520                 525             


Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln 
    530                 535                 540                 


Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr 
545                 550                 555                 560 


Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp 
                565                 570                 575     


Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly 
            580                 585                 590         


Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp 
        595                 600                 605             


Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr 
    610                 615                 620                 


Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala 
625                 630                 635                 640 


His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr 
                645                 650                 655     


Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp 
            660                 665                 670         


Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe 
        675                 680                 685             


Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe 
    690                 695                 700                 


Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu 
705                 710                 715                 720 


His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly 
                725                 730                 735     


Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly 
            740                 745                 750         


Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln 
        755                 760                 765             


Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile 
    770                 775                 780                 


Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro 
785                 790                 795                 800 


Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu 
                805                 810                 815     


Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg 
            820                 825                 830         


Leu Ser Asp Tyr Asp Val Asp His Ile Val Pro Gln Ser Phe Leu Lys 
        835                 840                 845             


Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Asn Arg 
    850                 855                 860                 


Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys 
865                 870                 875                 880 


Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys 
                885                 890                 895     


Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp 
            900                 905                 910         


Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr 
        915                 920                 925             


Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp 
    930                 935                 940                 


Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser 
945                 950                 955                 960 


Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg 
                965                 970                 975     


Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val 
            980                 985                 990         


Val Gly Thr Ala Leu Ile Lys Lys  Tyr Pro Lys Leu Glu  Ser Glu Phe 
        995                 1000                 1005             


Val Tyr  Gly Asp Tyr Lys Val  Tyr Asp Val Arg Lys  Met Ile Ala 
    1010                 1015                 1020             


Lys Ser  Glu Gln Glu Ile Gly  Lys Ala Thr Ala Lys  Tyr Phe Phe 
    1025                 1030                 1035             


Tyr Ser  Asn Ile Met Asn Phe  Phe Lys Thr Glu Ile  Thr Leu Ala 
    1040                 1045                 1050             


Asn Gly  Glu Ile Arg Lys Arg  Pro Leu Ile Glu Thr  Asn Gly Glu 
    1055                 1060                 1065             


Thr Gly  Glu Ile Val Trp Asp  Lys Gly Arg Asp Phe  Ala Thr Val 
    1070                 1075                 1080             


Arg Lys  Val Leu Ser Met Pro  Gln Val Asn Ile Val  Lys Lys Thr 
    1085                 1090                 1095             


Glu Val  Gln Thr Gly Gly Phe  Ser Lys Glu Ser Ile  Leu Pro Lys 
    1100                 1105                 1110             


Arg Asn  Ser Asp Lys Leu Ile  Ala Arg Lys Lys Asp  Trp Asp Pro 
    1115                 1120                 1125             


Lys Lys  Tyr Gly Gly Phe Asp  Ser Pro Thr Val Ala  Tyr Ser Val 
    1130                 1135                 1140             


Leu Val  Val Ala Lys Val Glu  Lys Gly Lys Ser Lys  Lys Leu Lys 
    1145                 1150                 1155             


Ser Val  Lys Glu Leu Leu Gly  Ile Thr Ile Met Glu  Arg Ser Ser 
    1160                 1165                 1170             


Phe Glu  Lys Asn Pro Ile Asp  Phe Leu Glu Ala Lys  Gly Tyr Lys 
    1175                 1180                 1185             


Glu Val  Lys Lys Asp Leu Ile  Ile Lys Leu Pro Lys  Tyr Ser Leu 
    1190                 1195                 1200             


Phe Glu  Leu Glu Asn Gly Arg  Lys Arg Met Leu Ala  Ser Ala Gly 
    1205                 1210                 1215             


Glu Leu  Gln Lys Gly Asn Glu  Leu Ala Leu Pro Ser  Lys Tyr Val 
    1220                 1225                 1230             


Asn Phe  Leu Tyr Leu Ala Ser  His Tyr Glu Lys Leu  Lys Gly Ser 
    1235                 1240                 1245             


Pro Glu  Asp Asn Glu Gln Lys  Gln Leu Phe Val Glu  Gln His Lys 
    1250                 1255                 1260             


His Tyr  Leu Asp Glu Ile Ile  Glu Gln Ile Ser Glu  Phe Ser Lys 
    1265                 1270                 1275             


Arg Val  Ile Leu Ala Asp Ala  Asn Leu Asp Lys Val  Leu Ser Ala 
    1280                 1285                 1290             


Tyr Asn  Lys His Arg Asp Lys  Pro Ile Arg Glu Gln  Ala Glu Asn 
    1295                 1300                 1305             


Ile Ile  His Leu Phe Thr Leu  Thr Asn Leu Gly Ala  Pro Ala Ala 
    1310                 1315                 1320             


Phe Lys  Tyr Phe Asp Thr Thr  Ile Asp Arg Lys Arg  Tyr Thr Ser 
    1325                 1330                 1335             


Thr Lys  Glu Val Leu Asp Ala  Thr Leu Ile His Gln  Ser Ile Thr 
    1340                 1345                 1350             


Gly Leu  Tyr Glu Thr Arg Ile  Asp Leu Ser Gln Leu  Gly Gly Asp 
    1355                 1360                 1365             


<210>  209
<211>  4104
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  dCas9


<220>
<221>  CDS
<222>  (1)..(4104)

<400>  209
atg gac aag aag tac tcc att ggg ctc gct atc ggc aca aac agc gtc         48
Met Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val           
1               5                   10                  15                

ggc tgg gcc gtc att acg gac gag tac aag gtg ccg agc aaa aaa ttc         96
Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe           
            20                  25                  30                    

aaa gtt ctg ggc aat acc gat cgc cac agc ata aag aag aac ctc att        144
Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile           
        35                  40                  45                        

ggc gcc ctc ctg ttc gac tcc ggg gag acg gcc gaa gcc acg cgg ctc        192
Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu           
    50                  55                  60                            

aaa aga aca gca cgg cgc aga tat acc cgc aga aag aat cgg atc tgc        240
Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys           
65                  70                  75                  80            

tac ctg cag gag atc ttt agt aat gag atg gct aag gtg gat gac tct        288
Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser           
                85                  90                  95                

ttc ttc cat agg ctg gag gag tcc ttt ttg gtg gag gag gat aaa aag        336
Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys           
            100                 105                 110                   

cac gag cgc cac cca atc ttt ggc aat atc gtg gac gag gtg gcg tac        384
His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr           
        115                 120                 125                       

cat gaa aag tac cca acc ata tat cat ctg agg aag aag ctt gta gac        432
His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp           
    130                 135                 140                           

agt act gat aag gct gac ttg cgg ttg atc tat ctc gcg ctg gcg cat        480
Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His           
145                 150                 155                 160           

atg atc aaa ttt cgg gga cac ttc ctc atc gag ggg gac ctg aac cca        528
Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro           
                165                 170                 175               

gac aac agc gat gtc gac aaa ctc ttt atc caa ctg gtt cag act tac        576
Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr           
            180                 185                 190                   

aat cag ctt ttc gaa gag aac ccg atc aac gca tcc gga gtt gac gcc        624
Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala           
        195                 200                 205                       

aaa gca atc ctg agc gct agg ctg tcc aaa tcc cgg cgg ctc gaa aac        672
Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn           
    210                 215                 220                           

ctc atc gca cag ctc cct ggg gag aag aag aac ggc ctg ttt ggt aat        720
Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn           
225                 230                 235                 240           

ctt atc gcc ctg tca ctc ggg ctg acc ccc aac ttt aaa tct aac ttc        768
Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe           
                245                 250                 255               

gac ctg gcc gaa gat gcc aag ctt caa ctg agc aaa gac acc tac gat        816
Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp           
            260                 265                 270                   

gat gat ctc gac aat ctg ctg gcc cag atc ggc gac cag tac gca gac        864
Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp           
        275                 280                 285                       

ctt ttt ttg gcg gca aag aac ctg tca gac gcc att ctg ctg agt gat        912
Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp           
    290                 295                 300                           

att ctg cga gtg aac acg gag atc acc aaa gct ccg ctg agc gct agt        960
Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser           
305                 310                 315                 320           

atg atc aag cgc tat gat gag cac cac caa gac ttg act ttg ctg aag       1008
Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys           
                325                 330                 335               

gcc ctt gtc aga cag caa ctg cct gag aag tac aag gaa att ttc ttc       1056
Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe           
            340                 345                 350                   

gat cag tct aaa aat ggc tac gcc gga tac att gac ggc gga gca agc       1104
Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser           
        355                 360                 365                       

cag gag gaa ttt tac aaa ttt att aag ccc atc ttg gaa aaa atg gac       1152
Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp           
    370                 375                 380                           

ggc acc gag gag ctg ctg gta aag ctt aac aga gaa gat ctg ttg cgc       1200
Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg           
385                 390                 395                 400           

aaa cag cgc act ttc gac aat gga agc atc ccc cac cag att cac ctg       1248
Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu           
                405                 410                 415               

ggc gaa ctg cac gct atc ctc agg cgg caa gag gat ttc tac ccc ttt       1296
Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe           
            420                 425                 430                   

ttg aaa gat aac agg gaa aag att gag aaa atc ctc aca ttt cgg ata       1344
Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile           
        435                 440                 445                       

ccc tac tat gta ggc ccc ctc gcc cgg gga aat tcc aga ttc gcg tgg       1392
Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp           
    450                 455                 460                           

atg act cgc aaa tca gaa gag acc atc act ccc tgg aac ttc gag gaa       1440
Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu           
465                 470                 475                 480           

gtc gtg gat aag ggg gcc tct gcc cag tcc ttc atc gaa agg atg act       1488
Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr           
                485                 490                 495               

aac ttt gat aaa aat ctg cct aac gaa aag gtg ctt cct aaa cac tct       1536
Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser           
            500                 505                 510                   

ctg ctg tac gag tac ttc aca gtt tat aac gag ctc acc aag gtc aaa       1584
Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys           
        515                 520                 525                       

tac gtc aca gaa ggg atg aga aag cca gca ttc ctg tct gga gag cag       1632
Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln           
    530                 535                 540                           

aag aaa gct atc gtg gac ctc ctc ttc aag acg aac cgg aaa gtt acc       1680
Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr           
545                 550                 555                 560           

gtg aaa cag ctc aaa gaa gac tat ttc aaa aag att gaa tgt ttc gac       1728
Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp           
                565                 570                 575               

tct gtt gaa atc agc gga gtg gag gat cgc ttc aac gca tcc ctg gga       1776
Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly           
            580                 585                 590                   

acg tat cac gat ctc ctg aaa atc att aaa gac aag gac ttc ctg gac       1824
Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp           
        595                 600                 605                       

aat gag gag aac gag gac att ctt gag gac att gtc ctc acc ctt acg       1872
Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr           
    610                 615                 620                           

ttg ttt gaa gat agg gag atg att gaa gaa cgc ttg aaa act tac gct       1920
Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala           
625                 630                 635                 640           

cat ctc ttc gac gac aaa gtc atg aaa cag ctc aag agg cgc cga tat       1968
His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr           
                645                 650                 655               

aca gga tgg ggg cgg ctg tca aga aaa ctg atc aat ggg atc cga gac       2016
Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp           
            660                 665                 670                   

aag cag agt gga aag aca atc ctg gat ttt ctt aag tcc gat gga ttt       2064
Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe           
        675                 680                 685                       

gcc aac cgg aac ttc atg cag ttg atc cat gat gac tct ctc acc ttt       2112
Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe           
    690                 695                 700                           

aag gag gac atc cag aaa gca caa gtt tct ggc cag ggg gac agt ctt       2160
Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu           
705                 710                 715                 720           

cac gag cac atc gct aat ctt gca ggt agc cca gct atc aaa aag gga       2208
His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly           
                725                 730                 735               

ata ctg cag acc gtt aag gtc gtg gat gaa ctc gtc aaa gta atg gga       2256
Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly           
            740                 745                 750                   

agg cat aag ccc gag aat atc gtt atc gag atg gcc cga gag aac caa       2304
Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln           
        755                 760                 765                       

act acc cag aag gga cag aag aac agt agg gaa agg atg aag agg att       2352
Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile           
    770                 775                 780                           

gaa gag ggt ata aaa gaa ctg ggg tcc caa atc ctt aag gaa cac cca       2400
Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro           
785                 790                 795                 800           

gtt gaa aac acc cag ctt cag aat gag aag ctc tac ctg tac tac ctg       2448
Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu           
                805                 810                 815               

cag aac ggc agg gac atg tac gtg gat cag gaa ctg gac atc aat cgg       2496
Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg           
            820                 825                 830                   

ctc tcc gac tac gac gtg gct gct atc gtg ccc cag tct ttt ctc aaa       2544
Leu Ser Asp Tyr Asp Val Ala Ala Ile Val Pro Gln Ser Phe Leu Lys           
        835                 840                 845                       

gat gat tct att gat aat aaa gtg ttg aca aga tcc gat aaa gct aga       2592
Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Ala Arg           
    850                 855                 860                           

ggg aag agt gat aac gtc ccc tca gaa gaa gtt gtc aag aaa atg aaa       2640
Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys           
865                 870                 875                 880           

aat tat tgg cgg cag ctg ctg aac gcc aaa ctg atc aca caa cgg aag       2688
Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys           
                885                 890                 895               

ttc gat aat ctg act aag gct gaa cga ggt ggc ctg tct gag ttg gat       2736
Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp           
            900                 905                 910                   

aaa gcc ggc ttc atc aaa agg cag ctt gtt gag aca cgc cag atc acc       2784
Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr           
        915                 920                 925                       

aag cac gtg gcc caa att ctc gat tca cgc atg aac acc aag tac gat       2832
Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp           
    930                 935                 940                           

gaa aat gac aaa ctg att cga gag gtg aaa gtt att act ctg aag tct       2880
Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser           
945                 950                 955                 960           

aag ctg gtc tca gat ttc aga aag gac ttt cag ttt tat aag gtg aga       2928
Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg           
                965                 970                 975               

gag atc aac aat tac cac cat gcg cat gat gcc tac ctg aat gca gtg       2976
Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val           
            980                 985                 990                   

gta ggc act gca ctt atc aaa aaa  tat ccc aag ctt gaa  tct gaa ttt     3024
Val Gly Thr Ala Leu Ile Lys Lys  Tyr Pro Lys Leu Glu  Ser Glu Phe         
        995                 1000                 1005                     

gtt tac  gga gac tat aaa gtg  tac gat gtt agg aaa  atg atc gca        3069
Val Tyr  Gly Asp Tyr Lys Val  Tyr Asp Val Arg Lys  Met Ile Ala            
    1010                 1015                 1020                        

aag tct  gag cag gaa ata ggc  aag gcc acc gct aag  tac ttc ttt        3114
Lys Ser  Glu Gln Glu Ile Gly  Lys Ala Thr Ala Lys  Tyr Phe Phe            
    1025                 1030                 1035                        

tac agc  aat att atg aat ttt  ttc aag acc gag att  aca ctg gcc        3159
Tyr Ser  Asn Ile Met Asn Phe  Phe Lys Thr Glu Ile  Thr Leu Ala            
    1040                 1045                 1050                        

aat gga  gag att cgg aag cga  cca ctt atc gaa aca  aac gga gaa        3204
Asn Gly  Glu Ile Arg Lys Arg  Pro Leu Ile Glu Thr  Asn Gly Glu            
    1055                 1060                 1065                        

aca gga  gaa atc gtg tgg gac  aag ggt agg gat ttc  gcg aca gtc        3249
Thr Gly  Glu Ile Val Trp Asp  Lys Gly Arg Asp Phe  Ala Thr Val            
    1070                 1075                 1080                        

cgg aag  gtc ctg tcc atg ccg  cag gtg aac atc gtt  aaa aag acc        3294
Arg Lys  Val Leu Ser Met Pro  Gln Val Asn Ile Val  Lys Lys Thr            
    1085                 1090                 1095                        

gaa gta  cag acc gga ggc ttc  tcc aag gaa agt atc  ctc ccg aaa        3339
Glu Val  Gln Thr Gly Gly Phe  Ser Lys Glu Ser Ile  Leu Pro Lys            
    1100                 1105                 1110                        

agg aac  agc gac aag ctg atc  gca cgc aaa aaa gat  tgg gac ccc        3384
Arg Asn  Ser Asp Lys Leu Ile  Ala Arg Lys Lys Asp  Trp Asp Pro            
    1115                 1120                 1125                        

aag aaa  tac ggc gga ttc gat  tct cct aca gtc gct  tac agt gta        3429
Lys Lys  Tyr Gly Gly Phe Asp  Ser Pro Thr Val Ala  Tyr Ser Val            
    1130                 1135                 1140                        

ctg gtt  gtg gcc aaa gtg gag  aaa ggg aag tct aaa  aaa ctc aaa        3474
Leu Val  Val Ala Lys Val Glu  Lys Gly Lys Ser Lys  Lys Leu Lys            
    1145                 1150                 1155                        

agc gtc  aag gaa ctg ctg ggc  atc aca atc atg gag  cga tca agc        3519
Ser Val  Lys Glu Leu Leu Gly  Ile Thr Ile Met Glu  Arg Ser Ser            
    1160                 1165                 1170                        

ttc gaa  aaa aac ccc atc gac  ttt ctc gag gcg aaa  gga tat aaa        3564
Phe Glu  Lys Asn Pro Ile Asp  Phe Leu Glu Ala Lys  Gly Tyr Lys            
    1175                 1180                 1185                        

gag gtc  aaa aaa gac ctc atc  att aag ctt ccc aag  tac tct ctc        3609
Glu Val  Lys Lys Asp Leu Ile  Ile Lys Leu Pro Lys  Tyr Ser Leu            
    1190                 1195                 1200                        

ttt gag  ctt gaa aac ggc cgg  aaa cga atg ctc gct  agt gcg ggc        3654
Phe Glu  Leu Glu Asn Gly Arg  Lys Arg Met Leu Ala  Ser Ala Gly            
    1205                 1210                 1215                        

gag ctg  cag aaa ggt aac gag  ctg gca ctg ccc tct  aaa tac gtt        3699
Glu Leu  Gln Lys Gly Asn Glu  Leu Ala Leu Pro Ser  Lys Tyr Val            
    1220                 1225                 1230                        

aat ttc  ttg tat ctg gcc agc  cac tat gaa aag ctc  aaa ggg tct        3744
Asn Phe  Leu Tyr Leu Ala Ser  His Tyr Glu Lys Leu  Lys Gly Ser            
    1235                 1240                 1245                        

ccc gaa  gat aat gag cag aag  cag ctg ttc gtg gaa  caa cac aaa        3789
Pro Glu  Asp Asn Glu Gln Lys  Gln Leu Phe Val Glu  Gln His Lys            
    1250                 1255                 1260                        

cac tac  ctt gat gag atc atc  gag caa ata agc gaa  ttc tcc aaa        3834
His Tyr  Leu Asp Glu Ile Ile  Glu Gln Ile Ser Glu  Phe Ser Lys            
    1265                 1270                 1275                        

aga gtg  atc ctc gcc gac gct  aac ctc gat aag gtg  ctt tct gct        3879
Arg Val  Ile Leu Ala Asp Ala  Asn Leu Asp Lys Val  Leu Ser Ala            
    1280                 1285                 1290                        

tac aat  aag cac agg gat aag  ccc atc agg gag cag  gca gaa aac        3924
Tyr Asn  Lys His Arg Asp Lys  Pro Ile Arg Glu Gln  Ala Glu Asn            
    1295                 1300                 1305                        

att atc  cac ttg ttt act ctg  acc aac ttg ggc gcg  cct gca gcc        3969
Ile Ile  His Leu Phe Thr Leu  Thr Asn Leu Gly Ala  Pro Ala Ala            
    1310                 1315                 1320                        

ttc aag  tac ttc gac acc acc  ata gac aga aag cgg  tac acc tct        4014
Phe Lys  Tyr Phe Asp Thr Thr  Ile Asp Arg Lys Arg  Tyr Thr Ser            
    1325                 1330                 1335                        

aca aag  gag gtc ctg gac gcc  aca ctg att cat cag  tca att acg        4059
Thr Lys  Glu Val Leu Asp Ala  Thr Leu Ile His Gln  Ser Ile Thr            
    1340                 1345                 1350                        

ggg ctc  tat gaa aca aga atc  gac ctc tct cag ctc  ggt gga gac        4104
Gly Leu  Tyr Glu Thr Arg Ile  Asp Leu Ser Gln Leu  Gly Gly Asp            
    1355                 1360                 1365                        


<210>  210
<211>  1368
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  210

Met Asp Lys Lys Tyr Ser Ile Gly Leu Ala Ile Gly Thr Asn Ser Val 
1               5                   10                  15      


Gly Trp Ala Val Ile Thr Asp Glu Tyr Lys Val Pro Ser Lys Lys Phe 
            20                  25                  30          


Lys Val Leu Gly Asn Thr Asp Arg His Ser Ile Lys Lys Asn Leu Ile 
        35                  40                  45              


Gly Ala Leu Leu Phe Asp Ser Gly Glu Thr Ala Glu Ala Thr Arg Leu 
    50                  55                  60                  


Lys Arg Thr Ala Arg Arg Arg Tyr Thr Arg Arg Lys Asn Arg Ile Cys 
65                  70                  75                  80  


Tyr Leu Gln Glu Ile Phe Ser Asn Glu Met Ala Lys Val Asp Asp Ser 
                85                  90                  95      


Phe Phe His Arg Leu Glu Glu Ser Phe Leu Val Glu Glu Asp Lys Lys 
            100                 105                 110         


His Glu Arg His Pro Ile Phe Gly Asn Ile Val Asp Glu Val Ala Tyr 
        115                 120                 125             


His Glu Lys Tyr Pro Thr Ile Tyr His Leu Arg Lys Lys Leu Val Asp 
    130                 135                 140                 


Ser Thr Asp Lys Ala Asp Leu Arg Leu Ile Tyr Leu Ala Leu Ala His 
145                 150                 155                 160 


Met Ile Lys Phe Arg Gly His Phe Leu Ile Glu Gly Asp Leu Asn Pro 
                165                 170                 175     


Asp Asn Ser Asp Val Asp Lys Leu Phe Ile Gln Leu Val Gln Thr Tyr 
            180                 185                 190         


Asn Gln Leu Phe Glu Glu Asn Pro Ile Asn Ala Ser Gly Val Asp Ala 
        195                 200                 205             


Lys Ala Ile Leu Ser Ala Arg Leu Ser Lys Ser Arg Arg Leu Glu Asn 
    210                 215                 220                 


Leu Ile Ala Gln Leu Pro Gly Glu Lys Lys Asn Gly Leu Phe Gly Asn 
225                 230                 235                 240 


Leu Ile Ala Leu Ser Leu Gly Leu Thr Pro Asn Phe Lys Ser Asn Phe 
                245                 250                 255     


Asp Leu Ala Glu Asp Ala Lys Leu Gln Leu Ser Lys Asp Thr Tyr Asp 
            260                 265                 270         


Asp Asp Leu Asp Asn Leu Leu Ala Gln Ile Gly Asp Gln Tyr Ala Asp 
        275                 280                 285             


Leu Phe Leu Ala Ala Lys Asn Leu Ser Asp Ala Ile Leu Leu Ser Asp 
    290                 295                 300                 


Ile Leu Arg Val Asn Thr Glu Ile Thr Lys Ala Pro Leu Ser Ala Ser 
305                 310                 315                 320 


Met Ile Lys Arg Tyr Asp Glu His His Gln Asp Leu Thr Leu Leu Lys 
                325                 330                 335     


Ala Leu Val Arg Gln Gln Leu Pro Glu Lys Tyr Lys Glu Ile Phe Phe 
            340                 345                 350         


Asp Gln Ser Lys Asn Gly Tyr Ala Gly Tyr Ile Asp Gly Gly Ala Ser 
        355                 360                 365             


Gln Glu Glu Phe Tyr Lys Phe Ile Lys Pro Ile Leu Glu Lys Met Asp 
    370                 375                 380                 


Gly Thr Glu Glu Leu Leu Val Lys Leu Asn Arg Glu Asp Leu Leu Arg 
385                 390                 395                 400 


Lys Gln Arg Thr Phe Asp Asn Gly Ser Ile Pro His Gln Ile His Leu 
                405                 410                 415     


Gly Glu Leu His Ala Ile Leu Arg Arg Gln Glu Asp Phe Tyr Pro Phe 
            420                 425                 430         


Leu Lys Asp Asn Arg Glu Lys Ile Glu Lys Ile Leu Thr Phe Arg Ile 
        435                 440                 445             


Pro Tyr Tyr Val Gly Pro Leu Ala Arg Gly Asn Ser Arg Phe Ala Trp 
    450                 455                 460                 


Met Thr Arg Lys Ser Glu Glu Thr Ile Thr Pro Trp Asn Phe Glu Glu 
465                 470                 475                 480 


Val Val Asp Lys Gly Ala Ser Ala Gln Ser Phe Ile Glu Arg Met Thr 
                485                 490                 495     


Asn Phe Asp Lys Asn Leu Pro Asn Glu Lys Val Leu Pro Lys His Ser 
            500                 505                 510         


Leu Leu Tyr Glu Tyr Phe Thr Val Tyr Asn Glu Leu Thr Lys Val Lys 
        515                 520                 525             


Tyr Val Thr Glu Gly Met Arg Lys Pro Ala Phe Leu Ser Gly Glu Gln 
    530                 535                 540                 


Lys Lys Ala Ile Val Asp Leu Leu Phe Lys Thr Asn Arg Lys Val Thr 
545                 550                 555                 560 


Val Lys Gln Leu Lys Glu Asp Tyr Phe Lys Lys Ile Glu Cys Phe Asp 
                565                 570                 575     


Ser Val Glu Ile Ser Gly Val Glu Asp Arg Phe Asn Ala Ser Leu Gly 
            580                 585                 590         


Thr Tyr His Asp Leu Leu Lys Ile Ile Lys Asp Lys Asp Phe Leu Asp 
        595                 600                 605             


Asn Glu Glu Asn Glu Asp Ile Leu Glu Asp Ile Val Leu Thr Leu Thr 
    610                 615                 620                 


Leu Phe Glu Asp Arg Glu Met Ile Glu Glu Arg Leu Lys Thr Tyr Ala 
625                 630                 635                 640 


His Leu Phe Asp Asp Lys Val Met Lys Gln Leu Lys Arg Arg Arg Tyr 
                645                 650                 655     


Thr Gly Trp Gly Arg Leu Ser Arg Lys Leu Ile Asn Gly Ile Arg Asp 
            660                 665                 670         


Lys Gln Ser Gly Lys Thr Ile Leu Asp Phe Leu Lys Ser Asp Gly Phe 
        675                 680                 685             


Ala Asn Arg Asn Phe Met Gln Leu Ile His Asp Asp Ser Leu Thr Phe 
    690                 695                 700                 


Lys Glu Asp Ile Gln Lys Ala Gln Val Ser Gly Gln Gly Asp Ser Leu 
705                 710                 715                 720 


His Glu His Ile Ala Asn Leu Ala Gly Ser Pro Ala Ile Lys Lys Gly 
                725                 730                 735     


Ile Leu Gln Thr Val Lys Val Val Asp Glu Leu Val Lys Val Met Gly 
            740                 745                 750         


Arg His Lys Pro Glu Asn Ile Val Ile Glu Met Ala Arg Glu Asn Gln 
        755                 760                 765             


Thr Thr Gln Lys Gly Gln Lys Asn Ser Arg Glu Arg Met Lys Arg Ile 
    770                 775                 780                 


Glu Glu Gly Ile Lys Glu Leu Gly Ser Gln Ile Leu Lys Glu His Pro 
785                 790                 795                 800 


Val Glu Asn Thr Gln Leu Gln Asn Glu Lys Leu Tyr Leu Tyr Tyr Leu 
                805                 810                 815     


Gln Asn Gly Arg Asp Met Tyr Val Asp Gln Glu Leu Asp Ile Asn Arg 
            820                 825                 830         


Leu Ser Asp Tyr Asp Val Ala Ala Ile Val Pro Gln Ser Phe Leu Lys 
        835                 840                 845             


Asp Asp Ser Ile Asp Asn Lys Val Leu Thr Arg Ser Asp Lys Ala Arg 
    850                 855                 860                 


Gly Lys Ser Asp Asn Val Pro Ser Glu Glu Val Val Lys Lys Met Lys 
865                 870                 875                 880 


Asn Tyr Trp Arg Gln Leu Leu Asn Ala Lys Leu Ile Thr Gln Arg Lys 
                885                 890                 895     


Phe Asp Asn Leu Thr Lys Ala Glu Arg Gly Gly Leu Ser Glu Leu Asp 
            900                 905                 910         


Lys Ala Gly Phe Ile Lys Arg Gln Leu Val Glu Thr Arg Gln Ile Thr 
        915                 920                 925             


Lys His Val Ala Gln Ile Leu Asp Ser Arg Met Asn Thr Lys Tyr Asp 
    930                 935                 940                 


Glu Asn Asp Lys Leu Ile Arg Glu Val Lys Val Ile Thr Leu Lys Ser 
945                 950                 955                 960 


Lys Leu Val Ser Asp Phe Arg Lys Asp Phe Gln Phe Tyr Lys Val Arg 
                965                 970                 975     


Glu Ile Asn Asn Tyr His His Ala His Asp Ala Tyr Leu Asn Ala Val 
            980                 985                 990         


Val Gly Thr Ala Leu Ile Lys Lys  Tyr Pro Lys Leu Glu  Ser Glu Phe 
        995                 1000                 1005             


Val Tyr  Gly Asp Tyr Lys Val  Tyr Asp Val Arg Lys  Met Ile Ala 
    1010                 1015                 1020             


Lys Ser  Glu Gln Glu Ile Gly  Lys Ala Thr Ala Lys  Tyr Phe Phe 
    1025                 1030                 1035             


Tyr Ser  Asn Ile Met Asn Phe  Phe Lys Thr Glu Ile  Thr Leu Ala 
    1040                 1045                 1050             


Asn Gly  Glu Ile Arg Lys Arg  Pro Leu Ile Glu Thr  Asn Gly Glu 
    1055                 1060                 1065             


Thr Gly  Glu Ile Val Trp Asp  Lys Gly Arg Asp Phe  Ala Thr Val 
    1070                 1075                 1080             


Arg Lys  Val Leu Ser Met Pro  Gln Val Asn Ile Val  Lys Lys Thr 
    1085                 1090                 1095             


Glu Val  Gln Thr Gly Gly Phe  Ser Lys Glu Ser Ile  Leu Pro Lys 
    1100                 1105                 1110             


Arg Asn  Ser Asp Lys Leu Ile  Ala Arg Lys Lys Asp  Trp Asp Pro 
    1115                 1120                 1125             


Lys Lys  Tyr Gly Gly Phe Asp  Ser Pro Thr Val Ala  Tyr Ser Val 
    1130                 1135                 1140             


Leu Val  Val Ala Lys Val Glu  Lys Gly Lys Ser Lys  Lys Leu Lys 
    1145                 1150                 1155             


Ser Val  Lys Glu Leu Leu Gly  Ile Thr Ile Met Glu  Arg Ser Ser 
    1160                 1165                 1170             


Phe Glu  Lys Asn Pro Ile Asp  Phe Leu Glu Ala Lys  Gly Tyr Lys 
    1175                 1180                 1185             


Glu Val  Lys Lys Asp Leu Ile  Ile Lys Leu Pro Lys  Tyr Ser Leu 
    1190                 1195                 1200             


Phe Glu  Leu Glu Asn Gly Arg  Lys Arg Met Leu Ala  Ser Ala Gly 
    1205                 1210                 1215             


Glu Leu  Gln Lys Gly Asn Glu  Leu Ala Leu Pro Ser  Lys Tyr Val 
    1220                 1225                 1230             


Asn Phe  Leu Tyr Leu Ala Ser  His Tyr Glu Lys Leu  Lys Gly Ser 
    1235                 1240                 1245             


Pro Glu  Asp Asn Glu Gln Lys  Gln Leu Phe Val Glu  Gln His Lys 
    1250                 1255                 1260             


His Tyr  Leu Asp Glu Ile Ile  Glu Gln Ile Ser Glu  Phe Ser Lys 
    1265                 1270                 1275             


Arg Val  Ile Leu Ala Asp Ala  Asn Leu Asp Lys Val  Leu Ser Ala 
    1280                 1285                 1290             


Tyr Asn  Lys His Arg Asp Lys  Pro Ile Arg Glu Gln  Ala Glu Asn 
    1295                 1300                 1305             


Ile Ile  His Leu Phe Thr Leu  Thr Asn Leu Gly Ala  Pro Ala Ala 
    1310                 1315                 1320             


Phe Lys  Tyr Phe Asp Thr Thr  Ile Asp Arg Lys Arg  Tyr Thr Ser 
    1325                 1330                 1335             


Thr Lys  Glu Val Leu Asp Ala  Thr Leu Ile His Gln  Ser Ile Thr 
    1340                 1345                 1350             


Gly Leu  Tyr Glu Thr Arg Ile  Asp Leu Ser Gln Leu  Gly Gly Asp 
    1355                 1360                 1365             


<210>  211
<211>  2904
<212>  DNA
<213>  Ruminococcus flavefaciens


<220>
<221>  CDS
<222>  (1)..(2904)

<400>  211
atg atc gaa aag aag aag tca ttt gca aag ggc atg gga gta aaa tca         48
Met Ile Glu Lys Lys Lys Ser Phe Ala Lys Gly Met Gly Val Lys Ser           
1               5                   10                  15                

aca ctt gta tcc ggt tca aag gta tac atg acg acg ttc gca gaa gga         96
Thr Leu Val Ser Gly Ser Lys Val Tyr Met Thr Thr Phe Ala Glu Gly           
            20                  25                  30                    

agc gat gcc aga ctt gaa aag atc gtt gaa ggc gat tct atc aga tct        144
Ser Asp Ala Arg Leu Glu Lys Ile Val Glu Gly Asp Ser Ile Arg Ser           
        35                  40                  45                        

gtc aac gaa gga gaa gcg ttc tca gct gaa atg gct gat aag aat gca        192
Val Asn Glu Gly Glu Ala Phe Ser Ala Glu Met Ala Asp Lys Asn Ala           
    50                  55                  60                            

ggc tac aag atc ggt aac gca aag ttc agc cac cca aag ggc tat gct        240
Gly Tyr Lys Ile Gly Asn Ala Lys Phe Ser His Pro Lys Gly Tyr Ala           
65                  70                  75                  80            

gta gtt gca aac aac ccc tta tac acc gga ccg gta cag cag gat atg        288
Val Val Ala Asn Asn Pro Leu Tyr Thr Gly Pro Val Gln Gln Asp Met           
                85                  90                  95                

ctc ggt ctg aag gaa acg ctt gaa aag aga tat ttt gga gag tct gcc        336
Leu Gly Leu Lys Glu Thr Leu Glu Lys Arg Tyr Phe Gly Glu Ser Ala           
            100                 105                 110                   

gac gga aat gat aat atc tgt att cag gtc atc cat aat atc ctc gat        384
Asp Gly Asn Asp Asn Ile Cys Ile Gln Val Ile His Asn Ile Leu Asp           
        115                 120                 125                       

atc gaa aag atc ctc gct gaa tat ata acc aat gct gct tat gcg gta        432
Ile Glu Lys Ile Leu Ala Glu Tyr Ile Thr Asn Ala Ala Tyr Ala Val           
    130                 135                 140                           

aac aat att tcc ggt ctt gat aag gat atc atc ggt ttt ggt aag ttc        480
Asn Asn Ile Ser Gly Leu Asp Lys Asp Ile Ile Gly Phe Gly Lys Phe           
145                 150                 155                 160           

agt acg gtc tat act tat gat gag ttc aag gat cct gaa cat cac aga        528
Ser Thr Val Tyr Thr Tyr Asp Glu Phe Lys Asp Pro Glu His His Arg           
                165                 170                 175               

gca gct ttc aac aat aac gat aag tta att aat gcc atc aag gca cag        576
Ala Ala Phe Asn Asn Asn Asp Lys Leu Ile Asn Ala Ile Lys Ala Gln           
            180                 185                 190                   

tat gat gaa ttt gac aat ttc ctt gat aat cct cgt ctc ggc tac ttt        624
Tyr Asp Glu Phe Asp Asn Phe Leu Asp Asn Pro Arg Leu Gly Tyr Phe           
        195                 200                 205                       

gga cag gct ttt ttc agt aag gaa ggc aga aat tac att atc aat tac        672
Gly Gln Ala Phe Phe Ser Lys Glu Gly Arg Asn Tyr Ile Ile Asn Tyr           
    210                 215                 220                           

ggc aac gag tgt tat gat att ctt gct tta ctc agc gga ttg cgt cac        720
Gly Asn Glu Cys Tyr Asp Ile Leu Ala Leu Leu Ser Gly Leu Arg His           
225                 230                 235                 240           

tgg gta gta cat aat aat gag gaa gaa tca agg att tcc cgt aca tgg        768
Trp Val Val His Asn Asn Glu Glu Glu Ser Arg Ile Ser Arg Thr Trp           
                245                 250                 255               

ctt tat aat ctc gac aag aat ctt gac aac gaa tat atc tct act ctc        816
Leu Tyr Asn Leu Asp Lys Asn Leu Asp Asn Glu Tyr Ile Ser Thr Leu           
            260                 265                 270                   

aat tat ctg tat gat aga att aca aac gaa tta aca aat tcc ttc tca        864
Asn Tyr Leu Tyr Asp Arg Ile Thr Asn Glu Leu Thr Asn Ser Phe Ser           
        275                 280                 285                       

aag aat agt gca gcc aac gta aac tat atc gct gaa acc ctt ggt att        912
Lys Asn Ser Ala Ala Asn Val Asn Tyr Ile Ala Glu Thr Leu Gly Ile           
    290                 295                 300                           

aat cct gct gaa ttt gca gag cag tat ttc aga ttc agt atc atg aag        960
Asn Pro Ala Glu Phe Ala Glu Gln Tyr Phe Arg Phe Ser Ile Met Lys           
305                 310                 315                 320           

gaa cag aag aat ctc ggt ttc aat att act aag ctg aga gaa gta atg       1008
Glu Gln Lys Asn Leu Gly Phe Asn Ile Thr Lys Leu Arg Glu Val Met           
                325                 330                 335               

ctt gac aga aag gat atg tct gag atc cgt aaa aat cat aag gtc ttt       1056
Leu Asp Arg Lys Asp Met Ser Glu Ile Arg Lys Asn His Lys Val Phe           
            340                 345                 350                   

gat tca atc cgt act aag gtc tat act atg atg gat ttc gtt atc tac       1104
Asp Ser Ile Arg Thr Lys Val Tyr Thr Met Met Asp Phe Val Ile Tyr           
        355                 360                 365                       

aga tat tac att gaa gag gat gca aag gtt gct gct gcc aac aag tct       1152
Arg Tyr Tyr Ile Glu Glu Asp Ala Lys Val Ala Ala Ala Asn Lys Ser           
    370                 375                 380                           

ctg ccg gat aac gaa aaa agc ctc agt gaa aag gat atc ttt gtt ata       1200
Leu Pro Asp Asn Glu Lys Ser Leu Ser Glu Lys Asp Ile Phe Val Ile           
385                 390                 395                 400           

aat ctc aga gga agc ttt aac gat gat cag aag gat gcc ctt tat tat       1248
Asn Leu Arg Gly Ser Phe Asn Asp Asp Gln Lys Asp Ala Leu Tyr Tyr           
                405                 410                 415               

gat gag gcc aat cgt att tgg aga aag ctc gaa aac att atg cac aat       1296
Asp Glu Ala Asn Arg Ile Trp Arg Lys Leu Glu Asn Ile Met His Asn           
            420                 425                 430                   

atc aag gaa ttc aga ggc aat aag aca cgt gaa tac aag aag aag gat       1344
Ile Lys Glu Phe Arg Gly Asn Lys Thr Arg Glu Tyr Lys Lys Lys Asp           
        435                 440                 445                       

gct cca aga ctc ccc aga att ctt cct gcc gga agg gat gtt tcc gcg       1392
Ala Pro Arg Leu Pro Arg Ile Leu Pro Ala Gly Arg Asp Val Ser Ala           
    450                 455                 460                           

ttc tca aag ttg atg tac gct ctt acc atg ttc ctt gat ggt aag gag       1440
Phe Ser Lys Leu Met Tyr Ala Leu Thr Met Phe Leu Asp Gly Lys Glu           
465                 470                 475                 480           

atc aat gat ctt ctc acc acg ctc atc aat aag ttc gat aac atc cag       1488
Ile Asn Asp Leu Leu Thr Thr Leu Ile Asn Lys Phe Asp Asn Ile Gln           
                485                 490                 495               

agt ttc ctc aag gta atg cct ctt atc gga gtg aat gca aag ttt gtt       1536
Ser Phe Leu Lys Val Met Pro Leu Ile Gly Val Asn Ala Lys Phe Val           
            500                 505                 510                   

gag gaa tat gcc ttc ttc aag gac agc gca aag att gct gac gaa ctc       1584
Glu Glu Tyr Ala Phe Phe Lys Asp Ser Ala Lys Ile Ala Asp Glu Leu           
        515                 520                 525                       

agg ctg att aag agc ttt gcc aga atg gga gaa cct atc gca gat gca       1632
Arg Leu Ile Lys Ser Phe Ala Arg Met Gly Glu Pro Ile Ala Asp Ala           
    530                 535                 540                           

aga cgt gct atg tat atc gat gct atc agg att ctc gga aca aac ctc       1680
Arg Arg Ala Met Tyr Ile Asp Ala Ile Arg Ile Leu Gly Thr Asn Leu           
545                 550                 555                 560           

agc tat gat gag ctt aag gcc ctt gcc gat act ttt tcg ctt gat gaa       1728
Ser Tyr Asp Glu Leu Lys Ala Leu Ala Asp Thr Phe Ser Leu Asp Glu           
                565                 570                 575               

aac ggc aac aag ctt aag aag ggc aag cac ggc atg aga aac ttc atc       1776
Asn Gly Asn Lys Leu Lys Lys Gly Lys His Gly Met Arg Asn Phe Ile           
            580                 585                 590                   

att aat aat gta atc agt aac aag cgc ttc cat tat ctc att cgt tac       1824
Ile Asn Asn Val Ile Ser Asn Lys Arg Phe His Tyr Leu Ile Arg Tyr           
        595                 600                 605                       

ggt gat cct gca cat ctc cat gag atc gcc aag aat gaa gct gtt gta       1872
Gly Asp Pro Ala His Leu His Glu Ile Ala Lys Asn Glu Ala Val Val           
    610                 615                 620                           

aag ttc gtc ctc ggc agg ata gct gat atc cag aag aag cag gga cag       1920
Lys Phe Val Leu Gly Arg Ile Ala Asp Ile Gln Lys Lys Gln Gly Gln           
625                 630                 635                 640           

aac gga aag aat cag atc gac agg tac tat gag acc tgt atc ggc aag       1968
Asn Gly Lys Asn Gln Ile Asp Arg Tyr Tyr Glu Thr Cys Ile Gly Lys           
                645                 650                 655               

gac aag ggc aag tct gtc tcc gaa aag gtt gat gcc ctc aca aag att       2016
Asp Lys Gly Lys Ser Val Ser Glu Lys Val Asp Ala Leu Thr Lys Ile           
            660                 665                 670                   

atc acc ggt atg aac tac gat cag ttc gat aag aag aga agc gtt att       2064
Ile Thr Gly Met Asn Tyr Asp Gln Phe Asp Lys Lys Arg Ser Val Ile           
        675                 680                 685                       

gag gat act gga aga gaa aac gct gag aga gaa aag ttc aag aag atc       2112
Glu Asp Thr Gly Arg Glu Asn Ala Glu Arg Glu Lys Phe Lys Lys Ile           
    690                 695                 700                           

atc agc ctc tat ctt act gtc att tat cac atc ctt aag aat att gtt       2160
Ile Ser Leu Tyr Leu Thr Val Ile Tyr His Ile Leu Lys Asn Ile Val           
705                 710                 715                 720           

aat atc aat gcg cgt tac gtt atc ggc ttc cat tgc gtt gag cgt gat       2208
Asn Ile Asn Ala Arg Tyr Val Ile Gly Phe His Cys Val Glu Arg Asp           
                725                 730                 735               

gca cag ctc tat aag gaa aag ggc tat gat atc aac ctc aag aag ctc       2256
Ala Gln Leu Tyr Lys Glu Lys Gly Tyr Asp Ile Asn Leu Lys Lys Leu           
            740                 745                 750                   

gaa gaa aag ggg ttt tca tca gtc aca aag ctg tgt gca ggt att gat       2304
Glu Glu Lys Gly Phe Ser Ser Val Thr Lys Leu Cys Ala Gly Ile Asp           
        755                 760                 765                       

gag act gct cct gac aag cgt aag gat gtt gaa aag gaa atg gct gag       2352
Glu Thr Ala Pro Asp Lys Arg Lys Asp Val Glu Lys Glu Met Ala Glu           
    770                 775                 780                           

cgt gca aag gaa tct atc gat agc ctt gaa tct gca aat cct aag ctt       2400
Arg Ala Lys Glu Ser Ile Asp Ser Leu Glu Ser Ala Asn Pro Lys Leu           
785                 790                 795                 800           

tac gca aac tat atc aag tat tct gac gag aag aag gct gag gaa ttt       2448
Tyr Ala Asn Tyr Ile Lys Tyr Ser Asp Glu Lys Lys Ala Glu Glu Phe           
                805                 810                 815               

act aga cag atc aac cgt gag aag gca aag acc gct ctg aat gca tat       2496
Thr Arg Gln Ile Asn Arg Glu Lys Ala Lys Thr Ala Leu Asn Ala Tyr           
            820                 825                 830                   

ctc aga aat act aag tgg aat gtg ata atc agg gaa gat ctt ctt aga       2544
Leu Arg Asn Thr Lys Trp Asn Val Ile Ile Arg Glu Asp Leu Leu Arg           
        835                 840                 845                       

atc gat aat aag aca tgt acg ctc ttt aga aat aag gcc gtt cat ctt       2592
Ile Asp Asn Lys Thr Cys Thr Leu Phe Arg Asn Lys Ala Val His Leu           
    850                 855                 860                           

gaa gtt gca aga tat gtt cat gca tat atc aac gat att gcc gaa gta       2640
Glu Val Ala Arg Tyr Val His Ala Tyr Ile Asn Asp Ile Ala Glu Val           
865                 870                 875                 880           

aac agc tat ttc cag ctt tat cat tac atc atg cag aga atc atc atg       2688
Asn Ser Tyr Phe Gln Leu Tyr His Tyr Ile Met Gln Arg Ile Ile Met           
                885                 890                 895               

aac gaa aga tat gaa aag tct tct gga aag gta agc gaa tac ttc gat       2736
Asn Glu Arg Tyr Glu Lys Ser Ser Gly Lys Val Ser Glu Tyr Phe Asp           
            900                 905                 910                   

gct gtg aac gat gaa aag aag tac aac gac agg ctt ctg aag ctg ttg       2784
Ala Val Asn Asp Glu Lys Lys Tyr Asn Asp Arg Leu Leu Lys Leu Leu           
        915                 920                 925                       

tgc gtt cca ttt ggt tac tgc atc ccg aga ttc aag aat ctc tcc att       2832
Cys Val Pro Phe Gly Tyr Cys Ile Pro Arg Phe Lys Asn Leu Ser Ile           
    930                 935                 940                           

gaa gct ttg ttc gac agg aac gaa gca gct aag ttt gac aag gaa aag       2880
Glu Ala Leu Phe Asp Arg Asn Glu Ala Ala Lys Phe Asp Lys Glu Lys           
945                 950                 955                 960           

aag aaa gta tca ggt aat tca tag                                       2904
Lys Lys Val Ser Gly Asn Ser                                               
                965                                                       


<210>  212
<211>  967
<212>  PRT
<213>  Ruminococcus flavefaciens

<400>  212

Met Ile Glu Lys Lys Lys Ser Phe Ala Lys Gly Met Gly Val Lys Ser 
1               5                   10                  15      


Thr Leu Val Ser Gly Ser Lys Val Tyr Met Thr Thr Phe Ala Glu Gly 
            20                  25                  30          


Ser Asp Ala Arg Leu Glu Lys Ile Val Glu Gly Asp Ser Ile Arg Ser 
        35                  40                  45              


Val Asn Glu Gly Glu Ala Phe Ser Ala Glu Met Ala Asp Lys Asn Ala 
    50                  55                  60                  


Gly Tyr Lys Ile Gly Asn Ala Lys Phe Ser His Pro Lys Gly Tyr Ala 
65                  70                  75                  80  


Val Val Ala Asn Asn Pro Leu Tyr Thr Gly Pro Val Gln Gln Asp Met 
                85                  90                  95      


Leu Gly Leu Lys Glu Thr Leu Glu Lys Arg Tyr Phe Gly Glu Ser Ala 
            100                 105                 110         


Asp Gly Asn Asp Asn Ile Cys Ile Gln Val Ile His Asn Ile Leu Asp 
        115                 120                 125             


Ile Glu Lys Ile Leu Ala Glu Tyr Ile Thr Asn Ala Ala Tyr Ala Val 
    130                 135                 140                 


Asn Asn Ile Ser Gly Leu Asp Lys Asp Ile Ile Gly Phe Gly Lys Phe 
145                 150                 155                 160 


Ser Thr Val Tyr Thr Tyr Asp Glu Phe Lys Asp Pro Glu His His Arg 
                165                 170                 175     


Ala Ala Phe Asn Asn Asn Asp Lys Leu Ile Asn Ala Ile Lys Ala Gln 
            180                 185                 190         


Tyr Asp Glu Phe Asp Asn Phe Leu Asp Asn Pro Arg Leu Gly Tyr Phe 
        195                 200                 205             


Gly Gln Ala Phe Phe Ser Lys Glu Gly Arg Asn Tyr Ile Ile Asn Tyr 
    210                 215                 220                 


Gly Asn Glu Cys Tyr Asp Ile Leu Ala Leu Leu Ser Gly Leu Arg His 
225                 230                 235                 240 


Trp Val Val His Asn Asn Glu Glu Glu Ser Arg Ile Ser Arg Thr Trp 
                245                 250                 255     


Leu Tyr Asn Leu Asp Lys Asn Leu Asp Asn Glu Tyr Ile Ser Thr Leu 
            260                 265                 270         


Asn Tyr Leu Tyr Asp Arg Ile Thr Asn Glu Leu Thr Asn Ser Phe Ser 
        275                 280                 285             


Lys Asn Ser Ala Ala Asn Val Asn Tyr Ile Ala Glu Thr Leu Gly Ile 
    290                 295                 300                 


Asn Pro Ala Glu Phe Ala Glu Gln Tyr Phe Arg Phe Ser Ile Met Lys 
305                 310                 315                 320 


Glu Gln Lys Asn Leu Gly Phe Asn Ile Thr Lys Leu Arg Glu Val Met 
                325                 330                 335     


Leu Asp Arg Lys Asp Met Ser Glu Ile Arg Lys Asn His Lys Val Phe 
            340                 345                 350         


Asp Ser Ile Arg Thr Lys Val Tyr Thr Met Met Asp Phe Val Ile Tyr 
        355                 360                 365             


Arg Tyr Tyr Ile Glu Glu Asp Ala Lys Val Ala Ala Ala Asn Lys Ser 
    370                 375                 380                 


Leu Pro Asp Asn Glu Lys Ser Leu Ser Glu Lys Asp Ile Phe Val Ile 
385                 390                 395                 400 


Asn Leu Arg Gly Ser Phe Asn Asp Asp Gln Lys Asp Ala Leu Tyr Tyr 
                405                 410                 415     


Asp Glu Ala Asn Arg Ile Trp Arg Lys Leu Glu Asn Ile Met His Asn 
            420                 425                 430         


Ile Lys Glu Phe Arg Gly Asn Lys Thr Arg Glu Tyr Lys Lys Lys Asp 
        435                 440                 445             


Ala Pro Arg Leu Pro Arg Ile Leu Pro Ala Gly Arg Asp Val Ser Ala 
    450                 455                 460                 


Phe Ser Lys Leu Met Tyr Ala Leu Thr Met Phe Leu Asp Gly Lys Glu 
465                 470                 475                 480 


Ile Asn Asp Leu Leu Thr Thr Leu Ile Asn Lys Phe Asp Asn Ile Gln 
                485                 490                 495     


Ser Phe Leu Lys Val Met Pro Leu Ile Gly Val Asn Ala Lys Phe Val 
            500                 505                 510         


Glu Glu Tyr Ala Phe Phe Lys Asp Ser Ala Lys Ile Ala Asp Glu Leu 
        515                 520                 525             


Arg Leu Ile Lys Ser Phe Ala Arg Met Gly Glu Pro Ile Ala Asp Ala 
    530                 535                 540                 


Arg Arg Ala Met Tyr Ile Asp Ala Ile Arg Ile Leu Gly Thr Asn Leu 
545                 550                 555                 560 


Ser Tyr Asp Glu Leu Lys Ala Leu Ala Asp Thr Phe Ser Leu Asp Glu 
                565                 570                 575     


Asn Gly Asn Lys Leu Lys Lys Gly Lys His Gly Met Arg Asn Phe Ile 
            580                 585                 590         


Ile Asn Asn Val Ile Ser Asn Lys Arg Phe His Tyr Leu Ile Arg Tyr 
        595                 600                 605             


Gly Asp Pro Ala His Leu His Glu Ile Ala Lys Asn Glu Ala Val Val 
    610                 615                 620                 


Lys Phe Val Leu Gly Arg Ile Ala Asp Ile Gln Lys Lys Gln Gly Gln 
625                 630                 635                 640 


Asn Gly Lys Asn Gln Ile Asp Arg Tyr Tyr Glu Thr Cys Ile Gly Lys 
                645                 650                 655     


Asp Lys Gly Lys Ser Val Ser Glu Lys Val Asp Ala Leu Thr Lys Ile 
            660                 665                 670         


Ile Thr Gly Met Asn Tyr Asp Gln Phe Asp Lys Lys Arg Ser Val Ile 
        675                 680                 685             


Glu Asp Thr Gly Arg Glu Asn Ala Glu Arg Glu Lys Phe Lys Lys Ile 
    690                 695                 700                 


Ile Ser Leu Tyr Leu Thr Val Ile Tyr His Ile Leu Lys Asn Ile Val 
705                 710                 715                 720 


Asn Ile Asn Ala Arg Tyr Val Ile Gly Phe His Cys Val Glu Arg Asp 
                725                 730                 735     


Ala Gln Leu Tyr Lys Glu Lys Gly Tyr Asp Ile Asn Leu Lys Lys Leu 
            740                 745                 750         


Glu Glu Lys Gly Phe Ser Ser Val Thr Lys Leu Cys Ala Gly Ile Asp 
        755                 760                 765             


Glu Thr Ala Pro Asp Lys Arg Lys Asp Val Glu Lys Glu Met Ala Glu 
    770                 775                 780                 


Arg Ala Lys Glu Ser Ile Asp Ser Leu Glu Ser Ala Asn Pro Lys Leu 
785                 790                 795                 800 


Tyr Ala Asn Tyr Ile Lys Tyr Ser Asp Glu Lys Lys Ala Glu Glu Phe 
                805                 810                 815     


Thr Arg Gln Ile Asn Arg Glu Lys Ala Lys Thr Ala Leu Asn Ala Tyr 
            820                 825                 830         


Leu Arg Asn Thr Lys Trp Asn Val Ile Ile Arg Glu Asp Leu Leu Arg 
        835                 840                 845             


Ile Asp Asn Lys Thr Cys Thr Leu Phe Arg Asn Lys Ala Val His Leu 
    850                 855                 860                 


Glu Val Ala Arg Tyr Val His Ala Tyr Ile Asn Asp Ile Ala Glu Val 
865                 870                 875                 880 


Asn Ser Tyr Phe Gln Leu Tyr His Tyr Ile Met Gln Arg Ile Ile Met 
                885                 890                 895     


Asn Glu Arg Tyr Glu Lys Ser Ser Gly Lys Val Ser Glu Tyr Phe Asp 
            900                 905                 910         


Ala Val Asn Asp Glu Lys Lys Tyr Asn Asp Arg Leu Leu Lys Leu Leu 
        915                 920                 925             


Cys Val Pro Phe Gly Tyr Cys Ile Pro Arg Phe Lys Asn Leu Ser Ile 
    930                 935                 940                 


Glu Ala Leu Phe Asp Arg Asn Glu Ala Ala Lys Phe Asp Lys Glu Lys 
945                 950                 955                 960 


Lys Lys Val Ser Gly Asn Ser 
                965         


<210>  213
<211>  2769
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Ruminococcus_bicirculans


<220>
<221>  CDS
<222>  (1)..(2769)

<400>  213
atg gca aaa aag aat aaa atg aag cct aga gag ctg cgt gag gct cag         48
Met Ala Lys Lys Asn Lys Met Lys Pro Arg Glu Leu Arg Glu Ala Gln           
1               5                   10                  15                

aaa aaa gcc aga cag ctc aaa gcg gct gag ata aat aat aac gct gct         96
Lys Lys Ala Arg Gln Leu Lys Ala Ala Glu Ile Asn Asn Asn Ala Ala           
            20                  25                  30                    

cct gcg atc gct gcc atg cct gct gca gag gtc att gca cct gtg gca        144
Pro Ala Ile Ala Ala Met Pro Ala Ala Glu Val Ile Ala Pro Val Ala           
        35                  40                  45                        

gag aag aaa aaa tcc tcc gta aag gcg gca gga atg aag tct att ctt        192
Glu Lys Lys Lys Ser Ser Val Lys Ala Ala Gly Met Lys Ser Ile Leu           
    50                  55                  60                            

gtc agc gaa aat aaa atg tac ata acc tct ttc ggc aag ggc aat tct        240
Val Ser Glu Asn Lys Met Tyr Ile Thr Ser Phe Gly Lys Gly Asn Ser           
65                  70                  75                  80            

gct gtg ctt gaa tat gag gtg gac aat aat gac tac aac aaa act cag        288
Ala Val Leu Glu Tyr Glu Val Asp Asn Asn Asp Tyr Asn Lys Thr Gln           
                85                  90                  95                

ctt tct tca aag gac aac agc aat atc gag ctt ggt gat gta aac gag        336
Leu Ser Ser Lys Asp Asn Ser Asn Ile Glu Leu Gly Asp Val Asn Glu           
            100                 105                 110                   

gta aac atc act ttt tca agc aag cat ggc ttt ggg agc gga gtg gag        384
Val Asn Ile Thr Phe Ser Ser Lys His Gly Phe Gly Ser Gly Val Glu           
        115                 120                 125                       

ata aat act tca aac cct act cac aga agc ggt gaa agc tcg cct gta        432
Ile Asn Thr Ser Asn Pro Thr His Arg Ser Gly Glu Ser Ser Pro Val           
    130                 135                 140                           

aga ggg gat atg ctg ggg ctt aaa tcg gag ctt gaa aag cgc ttt ttc        480
Arg Gly Asp Met Leu Gly Leu Lys Ser Glu Leu Glu Lys Arg Phe Phe           
145                 150                 155                 160           

ggc aaa act ttt gat gat aat ata cat atc cag ctt att tac aac att        528
Gly Lys Thr Phe Asp Asp Asn Ile His Ile Gln Leu Ile Tyr Asn Ile           
                165                 170                 175               

ctg gat atc gaa aag ata ctt gcg gtg tat gta acg aat atc gtt tat        576
Leu Asp Ile Glu Lys Ile Leu Ala Val Tyr Val Thr Asn Ile Val Tyr           
            180                 185                 190                   

gcg ctg aac aat atg ctt ggt ata aag gat tct gaa agt tat gat gat        624
Ala Leu Asn Asn Met Leu Gly Ile Lys Asp Ser Glu Ser Tyr Asp Asp           
        195                 200                 205                       

ttt atg ggg tat ctt tct gca aga aat act tat gaa gtt ttt act cac        672
Phe Met Gly Tyr Leu Ser Ala Arg Asn Thr Tyr Glu Val Phe Thr His           
    210                 215                 220                           

cct gac aaa agc aat ctt tcc gat aag gta aag ggt aat atc aag aaa        720
Pro Asp Lys Ser Asn Leu Ser Asp Lys Val Lys Gly Asn Ile Lys Lys           
225                 230                 235                 240           

agc ctt agc aag ttt aat gac ttg ctg aaa act aag cgc ctt ggc tat        768
Ser Leu Ser Lys Phe Asn Asp Leu Leu Lys Thr Lys Arg Leu Gly Tyr           
                245                 250                 255               

ttc ggc ctt gaa gag cca aag aca aaa gac aca aga gct tcg gaa gca        816
Phe Gly Leu Glu Glu Pro Lys Thr Lys Asp Thr Arg Ala Ser Glu Ala           
            260                 265                 270                   

tac aaa aag cgt gtt tat cat atg ctt gca att gtg ggg cag ata aga        864
Tyr Lys Lys Arg Val Tyr His Met Leu Ala Ile Val Gly Gln Ile Arg           
        275                 280                 285                       

cag tgt gtt ttt cat gat aaa tcg ggt gca aaa aga ttt gac ctt tac        912
Gln Cys Val Phe His Asp Lys Ser Gly Ala Lys Arg Phe Asp Leu Tyr           
    290                 295                 300                           

agt ttt att aac aat att gat ccc gaa tac aga gat act ctt gac tat        960
Ser Phe Ile Asn Asn Ile Asp Pro Glu Tyr Arg Asp Thr Leu Asp Tyr           
305                 310                 315                 320           

ctt gtt gag gag cgt tta aag tcc ata aac aag gac ttt atc gag ggt       1008
Leu Val Glu Glu Arg Leu Lys Ser Ile Asn Lys Asp Phe Ile Glu Gly           
                325                 330                 335               

aac aag gtc aat atc agc ctg ctt att gat atg atg aaa ggc tat gag       1056
Asn Lys Val Asn Ile Ser Leu Leu Ile Asp Met Met Lys Gly Tyr Glu           
            340                 345                 350                   

gct gat gat atc ata cgc ctt tat tac gat ttc att gtg ctt aaa tct       1104
Ala Asp Asp Ile Ile Arg Leu Tyr Tyr Asp Phe Ile Val Leu Lys Ser           
        355                 360                 365                       

cag aaa aat ctc ggc ttt tct atc aaa aag ctt cgt gag aaa atg ctg       1152
Gln Lys Asn Leu Gly Phe Ser Ile Lys Lys Leu Arg Glu Lys Met Leu           
    370                 375                 380                           

gag gaa tac ggt ttc aga ttt aag gac aag caa tat gac tct gtg cgc       1200
Glu Glu Tyr Gly Phe Arg Phe Lys Asp Lys Gln Tyr Asp Ser Val Arg           
385                 390                 395                 400           

tca aag atg tac aag ctt atg gat ttc ctg ctt ttc tgc aac tac tac       1248
Ser Lys Met Tyr Lys Leu Met Asp Phe Leu Leu Phe Cys Asn Tyr Tyr           
                405                 410                 415               

aga aat gac gtt gcc gca ggc gaa gct ctt gtg cgt aaa ctg cgt ttt       1296
Arg Asn Asp Val Ala Ala Gly Glu Ala Leu Val Arg Lys Leu Arg Phe           
            420                 425                 430                   

tca atg acc gat gat gaa aaa gag ggg ata tat gct gat gaa gcg gca       1344
Ser Met Thr Asp Asp Glu Lys Glu Gly Ile Tyr Ala Asp Glu Ala Ala           
        435                 440                 445                       

aag ctt tgg ggc aaa ttc agg aat gat ttt gaa aat atc gcc gac cac       1392
Lys Leu Trp Gly Lys Phe Arg Asn Asp Phe Glu Asn Ile Ala Asp His           
    450                 455                 460                           

atg aac ggt gac gtt atc aag gag ctt ggc aag gct gac atg gat ttt       1440
Met Asn Gly Asp Val Ile Lys Glu Leu Gly Lys Ala Asp Met Asp Phe           
465                 470                 475                 480           

gat gag aaa att ctt gac agt gaa aag aag aat gcg tct gac ctt ttg       1488
Asp Glu Lys Ile Leu Asp Ser Glu Lys Lys Asn Ala Ser Asp Leu Leu           
                485                 490                 495               

tat ttc tcc aaa atg ata tat atg ctc aca tat ttt ctt gac ggc aag       1536
Tyr Phe Ser Lys Met Ile Tyr Met Leu Thr Tyr Phe Leu Asp Gly Lys           
            500                 505                 510                   

gag ata aac gat ctt ctt aca acg ctt atc agc aag ttt gat aac atc       1584
Glu Ile Asn Asp Leu Leu Thr Thr Leu Ile Ser Lys Phe Asp Asn Ile           
        515                 520                 525                       

aag gag ttt ttg aag ata atg aaa agc tct gct gtt gat gtt gag tgt       1632
Lys Glu Phe Leu Lys Ile Met Lys Ser Ser Ala Val Asp Val Glu Cys           
    530                 535                 540                           

gag ctt acg gcg ggc tac aag ctg ttc aat gac agc cag agg ata acc       1680
Glu Leu Thr Ala Gly Tyr Lys Leu Phe Asn Asp Ser Gln Arg Ile Thr           
545                 550                 555                 560           

aac gag ctt ttt atc gta aag aac att gct tcc atg aga aag cct gcg       1728
Asn Glu Leu Phe Ile Val Lys Asn Ile Ala Ser Met Arg Lys Pro Ala           
                565                 570                 575               

gct tca gcg aag ctt acg atg ttc cgt gac gca ctg act ata ctc ggt       1776
Ala Ser Ala Lys Leu Thr Met Phe Arg Asp Ala Leu Thr Ile Leu Gly           
            580                 585                 590                   

ata gac gac aat atc acg gac gat agg ata agc gag att cta aaa ctt       1824
Ile Asp Asp Asn Ile Thr Asp Asp Arg Ile Ser Glu Ile Leu Lys Leu           
        595                 600                 605                       

aaa gaa aaa ggc aag ggc ata cat ggt ctg aga aat ttt ata aca aac       1872
Lys Glu Lys Gly Lys Gly Ile His Gly Leu Arg Asn Phe Ile Thr Asn           
    610                 615                 620                           

aat gtt atc gag tcc tct cgg ttt gta tac ctt atc aag tat gcg aac       1920
Asn Val Ile Glu Ser Ser Arg Phe Val Tyr Leu Ile Lys Tyr Ala Asn           
625                 630                 635                 640           

gct cag aag ata aga gaa gtg gct aag gat gag aaa gtt gtc atg ttt       1968
Ala Gln Lys Ile Arg Glu Val Ala Lys Asp Glu Lys Val Val Met Phe           
                645                 650                 655               

gtt ctt ggg ggt atc cct gac acg cag ata gag cgt tat tac aag agt       2016
Val Leu Gly Gly Ile Pro Asp Thr Gln Ile Glu Arg Tyr Tyr Lys Ser           
            660                 665                 670                   

tgt gtg gag ttt cct gac atg aat agt tct ttg gaa gca aag cgc agt       2064
Cys Val Glu Phe Pro Asp Met Asn Ser Ser Leu Glu Ala Lys Arg Ser           
        675                 680                 685                       

gag ctt gcg aga atg ata aag aac atc agc ttt gat gat ttc aaa aat       2112
Glu Leu Ala Arg Met Ile Lys Asn Ile Ser Phe Asp Asp Phe Lys Asn           
    690                 695                 700                           

gtg aaa cag cag gca aag ggc aga gaa aac gtg gct aag gag agg gca       2160
Val Lys Gln Gln Ala Lys Gly Arg Glu Asn Val Ala Lys Glu Arg Ala           
705                 710                 715                 720           

aag gct gtt atc ggg ctt tat ctt acg gtc atg tat ctg ctg gtg aaa       2208
Lys Ala Val Ile Gly Leu Tyr Leu Thr Val Met Tyr Leu Leu Val Lys           
                725                 730                 735               

aat ctt gtg aat gtc aat gca agg tat gtt att gcg ata cac tgc ctt       2256
Asn Leu Val Asn Val Asn Ala Arg Tyr Val Ile Ala Ile His Cys Leu           
            740                 745                 750                   

gaa cgt gat ttt ggg ctg tat aag gag ata att cct gag ttg gct tca       2304
Glu Arg Asp Phe Gly Leu Tyr Lys Glu Ile Ile Pro Glu Leu Ala Ser           
        755                 760                 765                       

aag aac ttg aaa aat gac tac agg ata ctt tca cag acg ctt tgt gaa       2352
Lys Asn Leu Lys Asn Asp Tyr Arg Ile Leu Ser Gln Thr Leu Cys Glu           
    770                 775                 780                           

ctt tgt gat gat cgt aat gag tcg tcg aat ttg ttc ttg aaa aag aac       2400
Leu Cys Asp Asp Arg Asn Glu Ser Ser Asn Leu Phe Leu Lys Lys Asn           
785                 790                 795                 800           

aag cgg ctg cgc aag tgc gtt gaa gtt gat atc aat aat gca gac agc       2448
Lys Arg Leu Arg Lys Cys Val Glu Val Asp Ile Asn Asn Ala Asp Ser           
                805                 810                 815               

agc atg aca aga aaa tac cgc aac tgt att gct cat ctt act gta gtt       2496
Ser Met Thr Arg Lys Tyr Arg Asn Cys Ile Ala His Leu Thr Val Val           
            820                 825                 830                   

cgt gaa ctg aaa gaa tac ata gga gat att cgt aca gtg gat tct tac       2544
Arg Glu Leu Lys Glu Tyr Ile Gly Asp Ile Arg Thr Val Asp Ser Tyr           
        835                 840                 845                       

ttc tcc att tat cat tat gtt atg cag cgt tgt atc acg aaa agg gga       2592
Phe Ser Ile Tyr His Tyr Val Met Gln Arg Cys Ile Thr Lys Arg Gly           
    850                 855                 860                           

gat gac aca aag caa gaa gag aaa ata aag tat gag gac gat ctt tta       2640
Asp Asp Thr Lys Gln Glu Glu Lys Ile Lys Tyr Glu Asp Asp Leu Leu           
865                 870                 875                 880           

aaa aat cac ggc tat acg aaa gac ttt gta aag gct ctc aac tcg ccg       2688
Lys Asn His Gly Tyr Thr Lys Asp Phe Val Lys Ala Leu Asn Ser Pro           
                885                 890                 895               

ttt gga tac aac att ccg agg ttt aaa aat ctt tca att gag cag ttg       2736
Phe Gly Tyr Asn Ile Pro Arg Phe Lys Asn Leu Ser Ile Glu Gln Leu           
            900                 905                 910                   

ttt gac aga aat gaa tat ctt act gaa aag tag                           2769
Phe Asp Arg Asn Glu Tyr Leu Thr Glu Lys                                   
        915                 920                                           


<210>  214
<211>  922
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  214

Met Ala Lys Lys Asn Lys Met Lys Pro Arg Glu Leu Arg Glu Ala Gln 
1               5                   10                  15      


Lys Lys Ala Arg Gln Leu Lys Ala Ala Glu Ile Asn Asn Asn Ala Ala 
            20                  25                  30          


Pro Ala Ile Ala Ala Met Pro Ala Ala Glu Val Ile Ala Pro Val Ala 
        35                  40                  45              


Glu Lys Lys Lys Ser Ser Val Lys Ala Ala Gly Met Lys Ser Ile Leu 
    50                  55                  60                  


Val Ser Glu Asn Lys Met Tyr Ile Thr Ser Phe Gly Lys Gly Asn Ser 
65                  70                  75                  80  


Ala Val Leu Glu Tyr Glu Val Asp Asn Asn Asp Tyr Asn Lys Thr Gln 
                85                  90                  95      


Leu Ser Ser Lys Asp Asn Ser Asn Ile Glu Leu Gly Asp Val Asn Glu 
            100                 105                 110         


Val Asn Ile Thr Phe Ser Ser Lys His Gly Phe Gly Ser Gly Val Glu 
        115                 120                 125             


Ile Asn Thr Ser Asn Pro Thr His Arg Ser Gly Glu Ser Ser Pro Val 
    130                 135                 140                 


Arg Gly Asp Met Leu Gly Leu Lys Ser Glu Leu Glu Lys Arg Phe Phe 
145                 150                 155                 160 


Gly Lys Thr Phe Asp Asp Asn Ile His Ile Gln Leu Ile Tyr Asn Ile 
                165                 170                 175     


Leu Asp Ile Glu Lys Ile Leu Ala Val Tyr Val Thr Asn Ile Val Tyr 
            180                 185                 190         


Ala Leu Asn Asn Met Leu Gly Ile Lys Asp Ser Glu Ser Tyr Asp Asp 
        195                 200                 205             


Phe Met Gly Tyr Leu Ser Ala Arg Asn Thr Tyr Glu Val Phe Thr His 
    210                 215                 220                 


Pro Asp Lys Ser Asn Leu Ser Asp Lys Val Lys Gly Asn Ile Lys Lys 
225                 230                 235                 240 


Ser Leu Ser Lys Phe Asn Asp Leu Leu Lys Thr Lys Arg Leu Gly Tyr 
                245                 250                 255     


Phe Gly Leu Glu Glu Pro Lys Thr Lys Asp Thr Arg Ala Ser Glu Ala 
            260                 265                 270         


Tyr Lys Lys Arg Val Tyr His Met Leu Ala Ile Val Gly Gln Ile Arg 
        275                 280                 285             


Gln Cys Val Phe His Asp Lys Ser Gly Ala Lys Arg Phe Asp Leu Tyr 
    290                 295                 300                 


Ser Phe Ile Asn Asn Ile Asp Pro Glu Tyr Arg Asp Thr Leu Asp Tyr 
305                 310                 315                 320 


Leu Val Glu Glu Arg Leu Lys Ser Ile Asn Lys Asp Phe Ile Glu Gly 
                325                 330                 335     


Asn Lys Val Asn Ile Ser Leu Leu Ile Asp Met Met Lys Gly Tyr Glu 
            340                 345                 350         


Ala Asp Asp Ile Ile Arg Leu Tyr Tyr Asp Phe Ile Val Leu Lys Ser 
        355                 360                 365             


Gln Lys Asn Leu Gly Phe Ser Ile Lys Lys Leu Arg Glu Lys Met Leu 
    370                 375                 380                 


Glu Glu Tyr Gly Phe Arg Phe Lys Asp Lys Gln Tyr Asp Ser Val Arg 
385                 390                 395                 400 


Ser Lys Met Tyr Lys Leu Met Asp Phe Leu Leu Phe Cys Asn Tyr Tyr 
                405                 410                 415     


Arg Asn Asp Val Ala Ala Gly Glu Ala Leu Val Arg Lys Leu Arg Phe 
            420                 425                 430         


Ser Met Thr Asp Asp Glu Lys Glu Gly Ile Tyr Ala Asp Glu Ala Ala 
        435                 440                 445             


Lys Leu Trp Gly Lys Phe Arg Asn Asp Phe Glu Asn Ile Ala Asp His 
    450                 455                 460                 


Met Asn Gly Asp Val Ile Lys Glu Leu Gly Lys Ala Asp Met Asp Phe 
465                 470                 475                 480 


Asp Glu Lys Ile Leu Asp Ser Glu Lys Lys Asn Ala Ser Asp Leu Leu 
                485                 490                 495     


Tyr Phe Ser Lys Met Ile Tyr Met Leu Thr Tyr Phe Leu Asp Gly Lys 
            500                 505                 510         


Glu Ile Asn Asp Leu Leu Thr Thr Leu Ile Ser Lys Phe Asp Asn Ile 
        515                 520                 525             


Lys Glu Phe Leu Lys Ile Met Lys Ser Ser Ala Val Asp Val Glu Cys 
    530                 535                 540                 


Glu Leu Thr Ala Gly Tyr Lys Leu Phe Asn Asp Ser Gln Arg Ile Thr 
545                 550                 555                 560 


Asn Glu Leu Phe Ile Val Lys Asn Ile Ala Ser Met Arg Lys Pro Ala 
                565                 570                 575     


Ala Ser Ala Lys Leu Thr Met Phe Arg Asp Ala Leu Thr Ile Leu Gly 
            580                 585                 590         


Ile Asp Asp Asn Ile Thr Asp Asp Arg Ile Ser Glu Ile Leu Lys Leu 
        595                 600                 605             


Lys Glu Lys Gly Lys Gly Ile His Gly Leu Arg Asn Phe Ile Thr Asn 
    610                 615                 620                 


Asn Val Ile Glu Ser Ser Arg Phe Val Tyr Leu Ile Lys Tyr Ala Asn 
625                 630                 635                 640 


Ala Gln Lys Ile Arg Glu Val Ala Lys Asp Glu Lys Val Val Met Phe 
                645                 650                 655     


Val Leu Gly Gly Ile Pro Asp Thr Gln Ile Glu Arg Tyr Tyr Lys Ser 
            660                 665                 670         


Cys Val Glu Phe Pro Asp Met Asn Ser Ser Leu Glu Ala Lys Arg Ser 
        675                 680                 685             


Glu Leu Ala Arg Met Ile Lys Asn Ile Ser Phe Asp Asp Phe Lys Asn 
    690                 695                 700                 


Val Lys Gln Gln Ala Lys Gly Arg Glu Asn Val Ala Lys Glu Arg Ala 
705                 710                 715                 720 


Lys Ala Val Ile Gly Leu Tyr Leu Thr Val Met Tyr Leu Leu Val Lys 
                725                 730                 735     


Asn Leu Val Asn Val Asn Ala Arg Tyr Val Ile Ala Ile His Cys Leu 
            740                 745                 750         


Glu Arg Asp Phe Gly Leu Tyr Lys Glu Ile Ile Pro Glu Leu Ala Ser 
        755                 760                 765             


Lys Asn Leu Lys Asn Asp Tyr Arg Ile Leu Ser Gln Thr Leu Cys Glu 
    770                 775                 780                 


Leu Cys Asp Asp Arg Asn Glu Ser Ser Asn Leu Phe Leu Lys Lys Asn 
785                 790                 795                 800 


Lys Arg Leu Arg Lys Cys Val Glu Val Asp Ile Asn Asn Ala Asp Ser 
                805                 810                 815     


Ser Met Thr Arg Lys Tyr Arg Asn Cys Ile Ala His Leu Thr Val Val 
            820                 825                 830         


Arg Glu Leu Lys Glu Tyr Ile Gly Asp Ile Arg Thr Val Asp Ser Tyr 
        835                 840                 845             


Phe Ser Ile Tyr His Tyr Val Met Gln Arg Cys Ile Thr Lys Arg Gly 
    850                 855                 860                 


Asp Asp Thr Lys Gln Glu Glu Lys Ile Lys Tyr Glu Asp Asp Leu Leu 
865                 870                 875                 880 


Lys Asn His Gly Tyr Thr Lys Asp Phe Val Lys Ala Leu Asn Ser Pro 
                885                 890                 895     


Phe Gly Tyr Asn Ile Pro Arg Phe Lys Asn Leu Ser Ile Glu Gln Leu 
            900                 905                 910         


Phe Asp Arg Asn Glu Tyr Leu Thr Glu Lys 
        915                 920         


<210>  215
<211>  954
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Eubacterium siraeum with mutated HEPN domain

<400>  215

Met Gly Lys Lys Ile His Ala Arg Asp Leu Arg Glu Gln Arg Lys Thr 
1               5                   10                  15      


Asp Arg Thr Glu Lys Phe Ala Asp Gln Asn Lys Lys Arg Glu Ala Glu 
            20                  25                  30          


Arg Ala Val Pro Lys Lys Asp Ala Ala Val Ser Val Lys Ser Val Ser 
        35                  40                  45              


Ser Val Ser Ser Lys Lys Asp Asn Val Thr Lys Ser Met Ala Lys Ala 
    50                  55                  60                  


Ala Gly Val Lys Ser Val Phe Ala Val Gly Asn Thr Val Tyr Met Thr 
65                  70                  75                  80  


Ser Phe Gly Arg Gly Asn Asp Ala Val Leu Glu Gln Lys Ile Val Asp 
                85                  90                  95      


Thr Ser His Glu Pro Leu Asn Ile Asp Asp Pro Ala Tyr Gln Leu Asn 
            100                 105                 110         


Val Val Thr Met Asn Gly Tyr Ser Val Thr Gly His Arg Gly Glu Thr 
        115                 120                 125             


Val Ser Ala Val Thr Asp Asn Pro Leu Arg Arg Phe Asn Gly Arg Lys 
    130                 135                 140                 


Lys Asp Glu Pro Glu Gln Ser Val Pro Thr Asp Met Leu Cys Leu Lys 
145                 150                 155                 160 


Pro Thr Leu Glu Lys Lys Phe Phe Gly Lys Glu Phe Asp Asp Asn Ile 
                165                 170                 175     


His Ile Gln Leu Ile Tyr Asn Ile Leu Asp Ile Glu Lys Ile Leu Ala 
            180                 185                 190         


Val Tyr Ser Thr Asn Ala Ile Tyr Ala Leu Asn Asn Met Ser Ala Asp 
        195                 200                 205             


Glu Asn Ile Glu Asn Ser Asp Phe Phe Met Lys Arg Thr Thr Asp Glu 
    210                 215                 220                 


Thr Phe Asp Asp Phe Glu Lys Lys Lys Glu Ser Thr Asn Ser Arg Glu 
225                 230                 235                 240 


Lys Ala Asp Phe Asp Ala Phe Glu Lys Phe Ile Gly Asn Tyr Arg Leu 
                245                 250                 255     


Ala Tyr Phe Ala Asp Ala Phe Tyr Val Asn Lys Lys Asn Pro Lys Gly 
            260                 265                 270         


Lys Ala Lys Asn Val Leu Arg Glu Asp Lys Glu Leu Tyr Ser Val Leu 
        275                 280                 285             


Thr Leu Ile Gly Lys Leu Ala His Trp Cys Val Ala Ser Glu Glu Gly 
    290                 295                 300                 


Arg Ala Glu Phe Trp Leu Tyr Lys Leu Asp Glu Leu Lys Asp Asp Phe 
305                 310                 315                 320 


Lys Asn Val Leu Asp Val Val Tyr Asn Arg Pro Val Glu Glu Ile Asn 
                325                 330                 335     


Asn Arg Phe Ile Glu Asn Asn Lys Val Asn Ile Gln Ile Leu Gly Ser 
            340                 345                 350         


Val Tyr Lys Asn Thr Asp Ile Ala Glu Leu Val Arg Ser Tyr Tyr Glu 
        355                 360                 365             


Phe Leu Ile Thr Lys Lys Tyr Lys Asn Met Gly Phe Ser Ile Lys Lys 
    370                 375                 380                 


Leu Arg Glu Ser Met Leu Glu Gly Lys Gly Tyr Ala Asp Lys Glu Tyr 
385                 390                 395                 400 


Asp Ser Val Arg Asn Lys Leu Tyr Gln Met Thr Asp Phe Ile Leu Tyr 
                405                 410                 415     


Thr Gly Tyr Ile Asn Glu Asp Ser Asp Arg Ala Asp Asp Leu Val Asn 
            420                 425                 430         


Thr Leu Arg Ser Ser Leu Lys Glu Asp Asp Lys Thr Thr Val Tyr Cys 
        435                 440                 445             


Lys Glu Ala Asp Tyr Leu Trp Lys Lys Tyr Arg Glu Ser Ile Arg Glu 
    450                 455                 460                 


Val Ala Asp Ala Leu Asp Gly Asp Asn Ile Lys Lys Leu Ser Lys Ser 
465                 470                 475                 480 


Asn Ile Glu Ile Gln Glu Asp Lys Leu Arg Lys Cys Phe Ile Ser Tyr 
                485                 490                 495     


Ala Asp Ser Val Ser Glu Phe Thr Lys Leu Ile Tyr Leu Leu Thr Arg 
            500                 505                 510         


Phe Leu Ser Gly Lys Glu Ile Asn Asp Leu Val Thr Thr Leu Ile Asn 
        515                 520                 525             


Lys Phe Asp Asn Ile Arg Ser Phe Leu Glu Ile Met Asp Glu Leu Gly 
    530                 535                 540                 


Leu Asp Arg Thr Phe Thr Ala Glu Tyr Ser Phe Phe Glu Gly Ser Thr 
545                 550                 555                 560 


Lys Tyr Leu Ala Glu Leu Val Glu Leu Asn Ser Phe Val Lys Ser Cys 
                565                 570                 575     


Ser Phe Asp Ile Asn Ala Lys Arg Thr Met Tyr Arg Asp Ala Leu Asp 
            580                 585                 590         


Ile Leu Gly Ile Glu Ser Asp Lys Thr Glu Glu Asp Ile Glu Lys Met 
        595                 600                 605             


Ile Asp Asn Ile Leu Gln Ile Asp Ala Asn Gly Asp Lys Lys Leu Lys 
    610                 615                 620                 


Lys Asn Asn Gly Leu Arg Asn Phe Ile Ala Ser Asn Val Ile Asp Ser 
625                 630                 635                 640 


Asn Arg Phe Lys Tyr Leu Val Arg Tyr Gly Asn Pro Lys Lys Ile Arg 
                645                 650                 655     


Glu Thr Ala Lys Cys Lys Pro Ala Val Arg Phe Val Leu Asn Glu Ile 
            660                 665                 670         


Pro Asp Ala Gln Ile Glu Arg Tyr Tyr Glu Ala Cys Cys Pro Lys Asn 
        675                 680                 685             


Thr Ala Leu Cys Ser Ala Asn Lys Arg Arg Glu Lys Leu Ala Asp Met 
    690                 695                 700                 


Ile Ala Glu Ile Lys Phe Glu Asn Phe Ser Asp Ala Gly Asn Tyr Gln 
705                 710                 715                 720 


Lys Ala Asn Val Thr Ser Arg Thr Ser Glu Ala Glu Ile Lys Arg Lys 
                725                 730                 735     


Asn Gln Ala Ile Ile Arg Leu Tyr Leu Thr Val Met Tyr Ile Met Leu 
            740                 745                 750         


Lys Asn Leu Val Asn Val Asn Ala Arg Tyr Val Ile Ala Phe His Cys 
        755                 760                 765             


Val Glu Arg Asp Thr Lys Leu Tyr Ala Glu Ser Gly Leu Glu Val Gly 
    770                 775                 780                 


Asn Ile Glu Lys Asn Lys Thr Asn Leu Thr Met Ala Val Met Gly Val 
785                 790                 795                 800 


Lys Leu Glu Asn Gly Ile Ile Lys Thr Glu Phe Asp Lys Ser Phe Ala 
                805                 810                 815     


Glu Asn Ala Ala Asn Arg Tyr Leu Arg Asn Ala Arg Trp Tyr Lys Leu 
            820                 825                 830         


Ile Leu Asp Asn Leu Lys Lys Ser Glu Arg Ala Val Val Asn Glu Phe 
        835                 840                 845             


Ala Asn Thr Val Cys Ala Leu Asn Ala Ile Arg Asn Ile Asn Ile Asn 
    850                 855                 860                 


Ile Lys Glu Ile Lys Glu Val Glu Asn Tyr Phe Ala Leu Tyr His Tyr 
865                 870                 875                 880 


Leu Ile Gln Lys His Leu Glu Asn Arg Phe Ala Asp Lys Lys Val Glu 
                885                 890                 895     


Arg Asp Thr Gly Asp Phe Ile Ser Lys Leu Glu Glu His Lys Thr Tyr 
            900                 905                 910         


Cys Lys Asp Phe Val Lys Ala Tyr Cys Thr Pro Phe Gly Tyr Asn Leu 
        915                 920                 925             


Val Arg Tyr Lys Asn Leu Thr Ile Asp Gly Leu Phe Asp Lys Asn Tyr 
    930                 935                 940                 


Pro Gly Lys Asp Asp Ser Asp Glu Gln Lys 
945                 950                 


<210>  216
<211>  919
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Uncultured Ruminococcus sp. with mutated HEPN site

<400>  216

Met Ala Lys Lys Asn Lys Met Lys Pro Arg Glu Leu Arg Glu Ala Gln 
1               5                   10                  15      


Lys Lys Ala Arg Gln Leu Lys Ala Ala Glu Ile Asn Asn Asn Ala Ala 
            20                  25                  30          


Pro Ala Ile Ala Ala Met Pro Ala Ala Glu Val Ile Ala Pro Ala Ala 
        35                  40                  45              


Glu Lys Lys Lys Ser Ser Val Lys Ala Ala Gly Met Lys Ser Ile Leu 
    50                  55                  60                  


Val Ser Glu Asn Lys Met Tyr Ile Thr Ser Phe Gly Lys Gly Asn Ser 
65                  70                  75                  80  


Ala Val Leu Glu Tyr Glu Val Asp Asn Asn Asp Tyr Asn Gln Thr Gln 
                85                  90                  95      


Leu Ser Ser Lys Asp Asn Ser Asn Ile Gln Leu Gly Gly Val Asn Glu 
            100                 105                 110         


Val Asn Ile Thr Phe Ser Ser Lys His Gly Phe Glu Ser Gly Val Glu 
        115                 120                 125             


Ile Asn Thr Ser Asn Pro Thr His Arg Ser Gly Glu Ser Ser Pro Val 
    130                 135                 140                 


Arg Gly Asp Met Leu Gly Leu Lys Ser Glu Leu Glu Lys Arg Phe Phe 
145                 150                 155                 160 


Gly Lys Thr Phe Asp Asp Asn Ile His Ile Gln Leu Ile Tyr Asn Ile 
                165                 170                 175     


Leu Asp Ile Glu Lys Ile Leu Ala Val Tyr Val Thr Asn Ile Val Tyr 
            180                 185                 190         


Ala Leu Asn Asn Met Leu Gly Val Lys Gly Ser Glu Ser His Asp Asp 
        195                 200                 205             


Phe Ile Gly Tyr Leu Ser Thr Asn Asn Ile Tyr Asp Val Phe Ile Asp 
    210                 215                 220                 


Pro Asp Asn Ser Ser Leu Ser Asp Asp Lys Lys Ala Asn Val Arg Lys 
225                 230                 235                 240 


Ser Leu Ser Lys Phe Asn Ala Leu Leu Lys Thr Lys Arg Leu Gly Tyr 
                245                 250                 255     


Phe Gly Leu Glu Glu Pro Lys Thr Lys Asp Asn Arg Val Ser Gln Ala 
            260                 265                 270         


Tyr Lys Lys Arg Val Tyr His Met Leu Ala Ile Val Gly Gln Ile Ala 
        275                 280                 285             


Gln Cys Val Phe Ala Asp Lys Ser Gly Ala Lys Arg Phe Asp Leu Tyr 
    290                 295                 300                 


Ser Phe Ile Asn Asn Ile Asp Pro Glu Tyr Arg Asp Thr Leu Asp Tyr 
305                 310                 315                 320 


Leu Val Glu Glu Arg Leu Lys Ser Ile Asn Lys Asp Phe Ile Glu Asp 
                325                 330                 335     


Asn Lys Val Asn Ile Ser Leu Leu Ile Asp Met Met Lys Gly Tyr Glu 
            340                 345                 350         


Ala Asp Asp Ile Ile Arg Leu Tyr Tyr Asp Phe Ile Val Leu Lys Ser 
        355                 360                 365             


Gln Lys Asn Leu Gly Phe Ser Ile Lys Lys Leu Arg Glu Lys Met Leu 
    370                 375                 380                 


Asp Glu Tyr Gly Phe Arg Phe Lys Asp Lys Gln Tyr Asp Ser Val Arg 
385                 390                 395                 400 


Ser Lys Met Tyr Lys Leu Met Asp Phe Leu Leu Phe Cys Asn Tyr Tyr 
                405                 410                 415     


Arg Asn Asp Ile Ala Ala Gly Glu Ser Leu Val Arg Lys Leu Arg Phe 
            420                 425                 430         


Ser Met Thr Asp Asp Glu Lys Glu Gly Ile Tyr Ala Asp Glu Ala Ala 
        435                 440                 445             


Lys Leu Trp Gly Lys Phe Arg Asn Asp Phe Glu Asn Ile Ala Asp His 
    450                 455                 460                 


Met Asn Gly Asp Val Ile Lys Glu Leu Gly Lys Ala Asp Met Asp Phe 
465                 470                 475                 480 


Asp Glu Lys Ile Leu Asp Ser Glu Lys Lys Asn Ala Ser Asp Leu Leu 
                485                 490                 495     


Tyr Phe Ser Lys Met Ile Tyr Met Leu Thr Tyr Phe Leu Asp Gly Lys 
            500                 505                 510         


Glu Ile Asn Asp Leu Leu Thr Thr Leu Ile Ser Lys Phe Asp Asn Ile 
        515                 520                 525             


Lys Glu Phe Leu Lys Ile Met Lys Ser Ser Ala Val Asp Val Glu Cys 
    530                 535                 540                 


Glu Leu Thr Ala Gly Tyr Lys Leu Phe Asn Asp Ser Gln Arg Ile Thr 
545                 550                 555                 560 


Asn Glu Leu Phe Ile Val Lys Asn Ile Ala Ser Met Arg Lys Pro Ala 
                565                 570                 575     


Ala Ser Ala Lys Leu Thr Met Phe Arg Asp Ala Leu Thr Ile Leu Gly 
            580                 585                 590         


Ile Asp Asp Lys Ile Thr Asp Asp Arg Ile Ser Gly Ile Leu Lys Leu 
        595                 600                 605             


Lys Glu Lys Gly Lys Gly Ile His Gly Leu Arg Asn Phe Ile Thr Asn 
    610                 615                 620                 


Asn Val Ile Glu Ser Ser Arg Phe Val Tyr Leu Ile Lys Tyr Ala Asn 
625                 630                 635                 640 


Ala Gln Lys Ile Arg Glu Val Ala Lys Asn Glu Lys Val Val Met Phe 
                645                 650                 655     


Val Leu Gly Gly Ile Pro Asp Thr Gln Ile Glu Arg Tyr Tyr Lys Ser 
            660                 665                 670         


Cys Val Glu Phe Pro Asp Met Asn Ser Ser Leu Gly Val Lys Arg Ser 
        675                 680                 685             


Glu Leu Ala Arg Met Ile Lys Asn Ile Ser Phe Asp Asp Phe Lys Asn 
    690                 695                 700                 


Val Lys Gln Gln Ala Lys Gly Arg Glu Asn Val Ala Lys Glu Arg Ala 
705                 710                 715                 720 


Lys Ala Val Ile Gly Leu Tyr Leu Thr Val Met Tyr Leu Leu Val Lys 
                725                 730                 735     


Asn Leu Val Asn Val Asn Ala Arg Tyr Val Ile Ala Ile His Cys Leu 
            740                 745                 750         


Glu Arg Asp Phe Gly Leu Tyr Lys Glu Ile Ile Pro Glu Leu Ala Ser 
        755                 760                 765             


Lys Asn Leu Lys Asn Asp Tyr Arg Ile Leu Ser Gln Thr Leu Cys Glu 
    770                 775                 780                 


Leu Cys Asp Lys Ser Pro Asn Leu Phe Leu Lys Lys Asn Glu Arg Leu 
785                 790                 795                 800 


Arg Lys Cys Val Glu Val Asp Ile Asn Asn Ala Asp Ser Ser Met Thr 
                805                 810                 815     


Arg Lys Tyr Ala Asn Cys Ile Ala Ala Leu Thr Val Val Arg Glu Leu 
            820                 825                 830         


Lys Glu Tyr Ile Gly Asp Ile Cys Thr Val Asp Ser Tyr Phe Ser Ile 
        835                 840                 845             


Tyr His Tyr Val Met Gln Arg Cys Ile Thr Lys Arg Glu Asn Asp Thr 
    850                 855                 860                 


Lys Gln Glu Glu Lys Ile Lys Tyr Glu Asp Asp Leu Leu Lys Asn His 
865                 870                 875                 880 


Gly Tyr Thr Lys Asp Phe Val Lys Ala Leu Asn Ser Pro Phe Gly Tyr 
                885                 890                 895     


Asn Ile Pro Arg Phe Lys Asn Leu Ser Ile Glu Gln Leu Phe Asp Arg 
            900                 905                 910         


Asn Glu Tyr Leu Thr Glu Lys 
        915                 


<210>  217
<211>  6
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  native HEPN domain


<220>
<221>  MISC_FEATURE
<222>  (2)..(5)
<223>  Xaa is any amino acid

<400>  217

Arg Xaa Xaa Xaa Xaa His 
1               5       


<210>  218
<211>  33
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  NLS signal


<220>
<221>  CDS
<222>  (1)..(33)

<400>  218
agc ccc aag aag aag aga aag gtg gag gcc agc                             33
Ser Pro Lys Lys Lys Arg Lys Val Glu Ala Ser                               
1               5                   10                                    


<210>  219
<211>  11
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  219

Ser Pro Lys Lys Lys Arg Lys Val Glu Ala Ser 
1               5                   10      


<210>  220
<211>  33
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  NLS signal


<220>
<221>  CDS
<222>  (1)..(33)

<400>  220
gga cct aag aaa aag agg aag gtg gcg gcc gct                             33
Gly Pro Lys Lys Lys Arg Lys Val Ala Ala Ala                               
1               5                   10                                    


<210>  221
<211>  11
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  221

Gly Pro Lys Lys Lys Arg Lys Val Ala Ala Ala 
1               5                   10      


<210>  222
<211>  1020
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  contig e-k87_11092736

<400>  222

Met Lys Arg Gln Lys Thr Phe Ala Lys Arg Ile Gly Ile Lys Ser Thr 
1               5                   10                  15      


Val Ala Tyr Gly Gln Gly Lys Tyr Ala Ile Thr Thr Phe Gly Lys Gly 
            20                  25                  30          


Ser Lys Ala Glu Ile Ala Val Arg Ser Ala Asp Pro Pro Glu Glu Thr 
        35                  40                  45              


Leu Pro Thr Glu Ser Asp Ala Thr Leu Ser Ile His Ala Lys Phe Ala 
    50                  55                  60                  


Lys Ala Gly Arg Asp Gly Arg Glu Phe Lys Cys Gly Asp Val Asp Glu 
65                  70                  75                  80  


Thr Arg Ile His Thr Ser Arg Ser Glu Tyr Glu Ser Leu Ile Ser Asn 
                85                  90                  95      


Pro Ala Glu Ser Pro Arg Glu Asp Tyr Leu Gly Leu Lys Gly Thr Leu 
            100                 105                 110         


Glu Arg Lys Phe Phe Gly Asp Glu Tyr Pro Lys Asp Asn Leu Arg Ile 
        115                 120                 125             


Gln Ile Ile Tyr Ser Ile Leu Asp Ile Gln Lys Ile Leu Gly Leu Tyr 
    130                 135                 140                 


Val Glu Asp Ile Leu His Phe Val Asp Gly Leu Gln Asp Glu Pro Glu 
145                 150                 155                 160 


Asp Leu Val Gly Leu Gly Leu Gly Asp Glu Lys Met Gln Lys Leu Leu 
                165                 170                 175     


Ser Lys Ala Leu Pro Tyr Met Gly Phe Phe Gly Ser Thr Asp Val Phe 
            180                 185                 190         


Lys Val Thr Lys Lys Arg Glu Glu Arg Ala Ala Ala Asp Glu His Asn 
        195                 200                 205             


Ala Lys Val Phe Arg Ala Leu Gly Ala Ile Arg Gln Lys Leu Ala His 
    210                 215                 220                 


Phe Lys Trp Lys Glu Ser Leu Ala Ile Phe Gly Ala Asn Ala Asn Met 
225                 230                 235                 240 


Pro Ile Arg Phe Phe Gln Gly Ala Thr Gly Gly Arg Gln Leu Trp Asn 
                245                 250                 255     


Asp Val Ile Ala Pro Leu Trp Lys Lys Arg Ile Glu Arg Val Arg Lys 
            260                 265                 270         


Ser Phe Leu Ser Asn Ser Ala Lys Asn Leu Trp Val Leu Tyr Gln Val 
        275                 280                 285             


Phe Lys Asp Asp Thr Asp Glu Lys Lys Lys Ala Arg Ala Arg Gln Tyr 
    290                 295                 300                 


Tyr His Phe Ser Val Leu Lys Glu Gly Lys Asn Leu Gly Phe Asn Leu 
305                 310                 315                 320 


Thr Lys Thr Arg Glu Tyr Phe Leu Asp Lys Phe Phe Pro Ile Phe His 
                325                 330                 335     


Ser Ser Ala Pro Asp Val Lys Arg Lys Val Asp Thr Phe Arg Ser Lys 
            340                 345                 350         


Phe Tyr Ala Ile Leu Asp Phe Ile Ile Tyr Glu Ala Ser Val Ser Val 
        355                 360                 365             


Ala Asn Ser Gly Gln Met Gly Lys Val Ala Pro Trp Lys Gly Ala Ile 
    370                 375                 380                 


Asp Asn Ala Leu Val Lys Leu Arg Glu Ala Pro Asp Glu Glu Ala Lys 
385                 390                 395                 400 


Glu Lys Ile Tyr Asn Val Leu Ala Ala Ser Ile Arg Asn Asp Ser Leu 
                405                 410                 415     


Phe Leu Arg Leu Lys Ser Ala Cys Asp Lys Phe Gly Ala Glu Gln Asn 
            420                 425                 430         


Arg Pro Val Phe Pro Asn Glu Leu Arg Asn Asn Arg Asp Ile Arg Asn 
        435                 440                 445             


Val Arg Ser Glu Trp Leu Glu Ala Thr Gln Asp Val Asp Ala Ala Ala 
    450                 455                 460                 


Phe Val Gln Leu Ile Ala Phe Leu Cys Asn Phe Leu Glu Gly Lys Glu 
465                 470                 475                 480 


Ile Asn Glu Leu Val Thr Ala Leu Ile Lys Lys Phe Glu Gly Ile Gln 
                485                 490                 495     


Ala Leu Ile Asp Leu Leu Arg Asn Leu Glu Gly Val Asp Ser Ile Arg 
            500                 505                 510         


Phe Glu Asn Glu Phe Ala Leu Phe Asn Asp Asp Lys Gly Asn Met Ala 
        515                 520                 525             


Gly Arg Ile Ala Arg Gln Leu Arg Leu Leu Ala Ser Val Gly Lys Met 
    530                 535                 540                 


Lys Pro Asp Met Thr Asp Ala Lys Arg Val Leu Tyr Lys Ser Ala Leu 
545                 550                 555                 560 


Glu Ile Leu Gly Ala Pro Pro Asp Glu Val Ser Asp Glu Trp Leu Ala 
                565                 570                 575     


Glu Asn Ile Leu Leu Asp Lys Ser Asn Asn Asp Tyr Gln Lys Ala Lys 
            580                 585                 590         


Lys Thr Val Asn Pro Phe Arg Asn Tyr Ile Ala Lys Asn Val Ile Thr 
        595                 600                 605             


Ser Arg Ser Phe Tyr Tyr Leu Val Arg Tyr Ala Lys Pro Thr Ala Val 
    610                 615                 620                 


Arg Lys Leu Met Ser Asn Pro Lys Ile Val Arg Tyr Val Leu Lys Arg 
625                 630                 635                 640 


Leu Pro Glu Lys Gln Val Ala Ser Tyr Tyr Ser Ala Ile Trp Thr Gln 
                645                 650                 655     


Ser Glu Ser Asn Ser Asn Glu Met Val Lys Leu Ile Glu Met Ile Asp 
            660                 665                 670         


Arg Leu Thr Thr Glu Ile Ala Gly Phe Ser Phe Ala Val Leu Lys Asp 
        675                 680                 685             


Lys Lys Asp Ser Ile Val Ser Ala Ser Arg Glu Ser Arg Ala Val Asn 
    690                 695                 700                 


Leu Glu Val Glu Arg Leu Lys Lys Leu Thr Thr Leu Tyr Met Ser Ile 
705                 710                 715                 720 


Ala Tyr Ile Ala Val Lys Ser Leu Val Lys Val Asn Ala Arg Tyr Phe 
                725                 730                 735     


Ile Ala Tyr Ser Ala Leu Glu Arg Asp Leu Tyr Phe Phe Asn Glu Lys 
            740                 745                 750         


Tyr Gly Glu Glu Phe Arg Leu His Phe Ile Pro Tyr Glu Leu Asn Gly 
        755                 760                 765             


Lys Thr Cys Gln Phe Glu Tyr Leu Ala Ile Leu Lys Tyr Tyr Leu Ala 
    770                 775                 780                 


Arg Asp Glu Glu Thr Leu Lys Arg Lys Cys Glu Ile Cys Glu Glu Ile 
785                 790                 795                 800 


Lys Val Gly Cys Glu Lys His Lys Lys Asn Ala Asn Pro Pro Tyr Glu 
                805                 810                 815     


Tyr Asp Gln Glu Trp Ile Asp Lys Lys Lys Ala Leu Asn Ser Glu Arg 
            820                 825                 830         


Lys Ala Cys Glu Arg Arg Leu His Phe Ser Thr His Trp Ala Gln Tyr 
        835                 840                 845             


Ala Thr Lys Arg Asp Glu Asn Met Ala Lys His Pro Gln Lys Trp Tyr 
    850                 855                 860                 


Asp Ile Leu Ala Ser His Tyr Asp Glu Leu Leu Ala Leu Gln Ala Thr 
865                 870                 875                 880 


Gly Trp Leu Ala Thr Gln Ala Arg Asn Asp Ala Glu His Leu Asn Pro 
                885                 890                 895     


Val Asn Glu Phe Asp Val Tyr Ile Glu Asp Leu Arg Arg Tyr Pro Glu 
            900                 905                 910         


Gly Thr Pro Lys Asn Lys Asp Tyr His Ile Gly Ser Tyr Phe Glu Ile 
        915                 920                 925             


Tyr His Tyr Ile Arg Gln Arg Ala Tyr Leu Glu Glu Val Leu Ala Lys 
    930                 935                 940                 


Arg Lys Glu Tyr Arg Asp Ser Gly Ser Phe Thr Asp Glu Gln Leu Asp 
945                 950                 955                 960 


Lys Leu Gln Lys Ile Leu Asp Asp Ile Arg Ala Arg Gly Ser Tyr Asp 
                965                 970                 975     


Lys Asn Leu Leu Lys Leu Glu Tyr Leu Pro Phe Ala Tyr Asn Leu Pro 
            980                 985                 990         


Arg Tyr Lys Asn Leu Thr Thr Glu  Ala Leu Phe Asp Asp  Asp Ser Val 
        995                 1000                 1005             


Ser Gly  Lys Lys Arg Val Ala  Glu Trp Arg Glu Arg  
    1010                 1015                 1020 


<210>  223
<211>  36
<212>  RNA
<213>  Ruminococcus flavefaciens

<400>  223
caaguaaacc ccuaccaacu ggucgggguu ugaaac                                 36


<210>  224
<211>  36
<212>  RNA
<213>  Artificial Sequence

<220>
<223>  Ruminococcus bicirculans

<400>  224
cuacuacacu ggugcgaauu ugcacuaguc uaaaac                                 36


<210>  225
<211>  3965
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV2 transfer plasmid encoding N-terminal ABE8e with two sgRNA 
       expression cassettes

<400>  225
cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc       60

gggcgacctt tggtcgcccg gcctcagtga gcgagcgagc gcgcagagag ggagtggcca      120

actccatcac taggggttcc tgcggccgcg gaaaaaaagc accgactcgg tgccactttt      180

tcaagttgat aacggactag ccttatttta acttgctatt tctagctcta aaacgctgca      240

gaacaggaga taaccggtgt ttcgtccttt ccacaagata tataaagcca agaaatcgaa      300

atactttcaa gttacggtaa gcatatgata gtccatttta aaacataatt ttaaaactgc      360

aaactaccca agaaattatt actttctacg tcacgtattt tgtactaata tctttgtgtt      420

tacagtcaaa ttaattccaa ttatctctct aacagccttg tatcgtatat gcaaatatga      480

aggaatcatg ggaaataggc cctcacctag gcgttacata acttacggta aatggcccgc      540

ctggctgacc gcccaacgac ccccgcccat tgacgtcaat aatgacgtat gttcccatag      600

taacgccaat agggactttc cattgacgtc aatgggtgga gtatttacgg taaactgccc      660

acttggcagt acatcaagtg tatcatatgc caagtacgcc ccctattgac gtcaatgacg      720

gtaaatggcc cgcctggcat tatgcccagt acatgacctt atgggacttt cctacttggc      780

agtacatcta cgtattagtc atcgctatta ccatggtgat gcggttttgg cagtacatca      840

atgggcgtgg atagcggttt gactcacggg gatttccaag tctccacccc attgacgtca      900

atgggagttt gttttggcac caaaatcaac gggactttcc aaaatgtcgt aacaactccg      960

ccccattgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata agcagagctt     1020

ggatgttgcc tttacttcta ggcgcgccac catgaaacgg acagccgacg gaagcgagtt     1080

cgagtcacca aagaagaagc ggaaagtctc tgaggtggag ttttcccacg agtactggat     1140

gagacatgcc ctgaccctgg ccaagagggc acgggatgag agggaggtgc ctgtgggagc     1200

cgtgctggtg ctgaacaata gagtgatcgg cgagggctgg aacagagcca tcggcctgca     1260

cgacccaaca gcccatgccg aaattatggc cctgagacag ggcggcctgg tcatgcagaa     1320

ctacagactg attgacgcca ccctgtacgt gacattcgag ccttgcgtga tgtgcgccgg     1380

cgccatgatc cactctagga tcggccgcgt ggtgtttggc gtgaggaact caaaaagagg     1440

cgccgcaggc tccctgatga acgtgctgaa ctaccccggc atgaatcacc gcgtcgaaat     1500

taccgaggga atcctggcag atgaatgtgc cgccctgctg tgcgatttct atcggatgcc     1560

tagacaggtg ttcaatgctc agaagaaggc ccagagctcc atcaactccg gaggatctag     1620

cggaggctcc tctggctctg agacacctgg cacaagcgag agcgcaacac ctgaaagcag     1680

cgggggcagc agcggggggt cagacaagaa gtacagcatc ggcctggcca tcggcaccaa     1740

ttctgttggc tgggccgtga tcaccgacga gtacaaggtg cccagcaaga aattcaaggt     1800

gctgggcaac accgaccggc acagcatcaa gaagaatctg atcggcgccc tgctgttcga     1860

ctctggcgaa acagccgaag ccaccagact gaagaggaca gccagacggc ggtacaccag     1920

aagaaagaac cggatctgct acctgcaaga gatcttcagc aacgagatgg ccaaggtgga     1980

cgacagcttc ttccaccggc tggaagagtc cttcctggtg gaagaggata agaagcacga     2040

gcggcacccc atcttcggca acatcgtgga tgaggtggcc taccacgaga agtaccccac     2100

catctaccac ctgagaaaga aactggtgga cagcaccgac aaggccgacc tgagactgat     2160

ctatctggcc ctggctcaca tgatcaagtt ccggggccac ttcctgatcg agggcgacct     2220

gaatcctgac aacagcgacg tggacaagct gttcatccag ctggtgcaga cctacaacca     2280

gctgttcgag gaaaacccca tcaacgccag cggagtggat gccaaggcca tcctgtctgc     2340

ccggctgagc aagagcagac ggctggaaaa cctgatcgct cagctgcccg gcgagaagaa     2400

gaatggcctg ttcggcaacc tgattgccct gagcctgggc ctgacaccta acttcaagag     2460

caacttcgac ctggccgagg acgccaaact gcagctgtcc aaggacacct acgacgacga     2520

cctggacaat ctgctggccc agatcggcga tcagtacgcc gacttgtttc tggccgccaa     2580

gaacctgtcc gacgccatcc tgctgagcga catcctgaga gtgaacaccg agatcacaaa     2640

ggcccctctg agcgcctcta tgatcaagag atacgacgag caccaccagg atctgaccct     2700

gctgaaggcc ctcgttagac agcagctgcc tgagaagtac aaagagattt tcttcgacca     2760

gagcaagaac ggctacgccg gctacattga tggcggagcc agccaagagg aattctacaa     2820

gttcatcaag cccatcctcg agaagatgga cggcaccgag gaactgctgg tcaagctgaa     2880

cagagaggac ctgctgcgga agcagcggac cttcgacaat ggctctatcc ctcaccaaat     2940

ccacctggga gagctgcacg ccattctgcg gagacaagag gacttttacc cattcctgaa     3000

ggacaaccgg gaaaagattg agaagatcct gaccttcagg atcccctact acgtgggacc     3060

actggccaga ggcaatagca gattcgcctg gatgaccaga aagagcgagg aaaccatcac     3120

accctggaac ttcgaggaag tggtggataa gggcgccagc gctcagtcct tcatcgagcg     3180

gatgaccaac ttcgataaga acctgcctaa cgagaaggta agtattagct ctttctttcc     3240

atgggttggc ctcgccgcgt gggctgaggg aaggactgtc ctgggactgg acaggcgggt     3300

tatgggacct gaagcgataa aaggcatgca cgtttgcggc tacgtgcatg ccaaaaggag     3360

tcgggcttgc ctccgtgccc gactccaaaa gacctgctcg aggaggtgga cgagcaggtc     3420

aaaaatccgg gtaccaataa aatatcttta ttttcattac atctgtgtgt tggttttttg     3480

tgtgactagt tgaacgctga cgtcatcaac ccgctccaag gaatcgcggg cccagtgtca     3540

ctaggcggga acacccagcg cgcgtgcgcc ctggcaggaa gatggctgtg agggacaggg     3600

gagtggcgcc ctgcaatatt tgcatgtcgc tatgtgttct gggaaatcac cataaacgtg     3660

aaatgtcttt ggatttggga atcgtataag aactgtatga gaccacggtt atctcctgtt     3720

ctgcagcgtt ttagagctag aaatagcaag ttaaaataag gctagtccgt tatcaacttg     3780

aaaaagtggc accgagtcgg tgcttttttt actagtgcgg ccgcaggaac ccctagtgat     3840

ggagttggcc actccctctc tgcgcgctcg ctcgctcact gaggccgggc gaccaaaggt     3900

cgcccgacgc ccgggctttg cccgggcggc ctcagtgagc gagcgagcgc gcagctgcct     3960

gcagg                                                                 3965


<210>  226
<211>  4551
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV2 transfer plasmid encoding C-terminal ABE8e with two sgRNA
       expression cassettes

<400>  226
cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc       60

gggcgacctt tggtcgcccg gcctcagtga gcgagcgagc gcgcagagag ggagtggcca      120

actccatcac taggggttcc tgcggccgcg gaaaaaaagc accgactcgg tgccactttt      180

tcaagttgat aacggactag ccttatttta acttgctatt tctagctcta aaacgctgca      240

gaacaggaga taaccggtgt ttcgtccttt ccacaagata tataaagcca agaaatcgaa      300

atactttcaa gttacggtaa gcatatgata gtccatttta aaacataatt ttaaaactgc      360

aaactaccca agaaattatt actttctacg tcacgtattt tgtactaata tctttgtgtt      420

tacagtcaaa ttaattccaa ttatctctct aacagccttg tatcgtatat gcaaatatga      480

aggaatcatg ggaaataggc cctcacctag gcgttacata acttacggta aatggcccgc      540

ctggctgacc gcccaacgac ccccgcccat tgacgtcaat aatgacgtat gttcccatag      600

taacgccaat agggactttc cattgacgtc aatgggtgga gtatttacgg taaactgccc      660

acttggcagt acatcaagtg tatcatatgc caagtacgcc ccctattgac gtcaatgacg      720

gtaaatggcc cgcctggcat tatgcccagt acatgacctt atgggacttt cctacttggc      780

agtacatcta cgtattagtc atcgctatta ccatggtgat gcggttttgg cagtacatca      840

atgggcgtgg atagcggttt gactcacggg gatttccaag tctccacccc attgacgtca      900

atgggagttt gttttggcac caaaatcaac gggactttcc aaaatgtcgt aacaactccg      960

ccccattgac gcaaatgggc ggtaggcgtg tacggtggga ggtctatata agcagagctt     1020

ggatgttgcc tttaggattt ttgacctgct cgattgtcca ctgcgagcag gtcttttgga     1080

gtcgggcgag gcggaagccc gactcctttt ggcatgcacg ctagccgcgt cgtgcatgcc     1140

ttttatcttc gggttatggg accagtgaag gctgagggaa ggactgtcct gggactggac     1200

aggcgggtta tgggacctga aaatactaac aatcgatttt ttttcccttt ttttccaggt     1260

gctgcccaag cacagcctgc tgtacgagta cttcaccgtg tacaacgagc tgaccaaagt     1320

gaaatacgtg accgagggaa tgagaaagcc cgcctttctg agcggcgagc agaaaaaggc     1380

cattgtggat ctgctgttca agaccaaccg gaaagtgacc gtgaagcagc tgaaagagga     1440

ctacttcaag aaaatcgagt gcttcgacag cgtggaaatc agcggcgtgg aagatcggtt     1500

caatgccagc ctgggcacat accacgacct gctgaaaatt atcaaggaca aggacttcct     1560

ggacaacgaa gagaacgagg acatcctgga agatatcgtg ctgaccctga cactgtttga     1620

ggacagagag atgatcgagg aacggctgaa aacatacgcc cacctgttcg acgacaaagt     1680

gatgaagcaa ctgaagcggc ggagatacac cggctggggc agactgtctc ggaagctgat     1740

caacggcatc cgggataagc agtccggcaa gaccatcctg gactttctga agtccgacgg     1800

cttcgccaat cggaacttca tgcagctgat ccacgacgac agcctgacct ttaaagagga     1860

tatccagaaa gcccaggtgt ccggccaggg cgattctctg catgagcaca ttgccaacct     1920

ggccggctct cccgccatta agaagggcat tctgcagaca gtgaaggtgg tggacgagct     1980

ggtcaaagtc atgggcagac acaagcccga gaacatcgtg atcgaaatgg ccagagagaa     2040

ccagaccaca cagaagggcc agaagaacag ccgcgagaga atgaagcgga tcgaagaggg     2100

catcaaagag ctgggcagcc agatcctgaa agaacacccc gtggaaaaca cccagctgca     2160

gaacgagaag ctgtacctgt actacctcca aaacggccgg gatatgtatg tggaccaaga     2220

gctggacatc aaccggctgt ccgactacga tgtggaccat atcgtgcccc agtcttttct     2280

gaaagacgac tccatcgaca acaaggtcct gaccagaagc gacaagaacc ggggcaagag     2340

cgataacgtg ccctccgaag aggtcgtgaa gaagatgaag aactactggc gacagctgct     2400

gaacgccaag ctgattaccc agcggaagtt cgataacctg accaaggccg agagaggcgg     2460

cctgtctgaa ctggataagg ccggcttcat caagagacag ctggtggaaa cccggcagat     2520

caccaaacac gtggcacaga tcctggactc ccggatgaac actaagtacg acgagaatga     2580

caagctgatc cgggaagtga aagtgatcac cctgaagtcc aagctggtgt ccgatttccg     2640

gaaggatttc cagttttaca aagtgcgcga gatcaacaac taccaccacg cccacgacgc     2700

ctacctgaac gccgtcgtgg gaaccgccct gatcaaaaag taccctaagc tggaaagcga     2760

gttcgtgtac ggcgactaca aggtgtacga cgtgcggaag atgatcgcca agagcgagca     2820

ggaaatcggc aaggctaccg ccaagtactt cttctacagc aacatcatga actttttcaa     2880

gaccgagatt accctggcca acggcgagat ccggaagcgg cctctgatcg agacaaacgg     2940

cgaaaccggg gagatcgtgt gggataaggg ccgggatttt gccaccgtgc ggaaagtgct     3000

gagcatgccc caagtgaata tcgtgaaaaa gaccgaggtg cagacaggcg gcttcagcaa     3060

agagtctatc aggcccaaga ggaacagcga taagctgatc gccagaaaga aggactggga     3120

ccctaagaag tacggcggct tcgtcagccc caccgtggcc tattctgtgc tggtggtggc     3180

caaagtggaa aagggcaagt ccaagaaact gaagagtgtg aaagagctgc tggggatcac     3240

catcatggaa agaagcagct tcgagaagaa tcccatcgac tttctggaag ccaagggcta     3300

caaagaagtg aaaaaggacc tgatcatcaa gctgcctaag tactccctgt tcgagctgga     3360

aaacggccgg aagagaatgc tggcctctgc cagattcctg cagaagggaa acgaactggc     3420

cctgccctcc aaatatgtga acttcctgta cctggccagc cactatgaga agctgaaggg     3480

ctcccccgag gataatgagc agaaacagct gtttgtggaa cagcacaagc actacctgga     3540

cgagatcatc gagcagatca gcgagttctc caagagagtg atcctggccg acgctaatct     3600

ggacaaagtg ctgtccgcct acaacaagca ccgggataag cccatcagag agcaggccga     3660

gaatatcatc cacctgttta ccctgaccaa tctgggagcc cctagggcct tcaagtactt     3720

tgacaccacc atcgaccgga aggtgtacag gagcaccaaa gaggtgctgg acgccaccct     3780

gatccaccag agcatcaccg gcctgtacga gacacggatc gacctgtctc agctgggagg     3840

tgactctggc ggctcaaaaa gaaccgccga cggcagcgaa ttcgagccca agaagaagag     3900

gaaagtctaa ggtaccaatt cctcacctgc gatctcgatg ctttatttgt gaaatttgtg     3960

atgctattgc tttatttgta accattataa gctgcaataa acaagttaac aacaacaatt     4020

gcattcattt tatgtttcag gttcaggggg aggtgtggga ggttttttaa actagttgaa     4080

cgctgacgtc atcaacccgc tccaaggaat cgcgggccca gtgtcactag gcgggaacac     4140

ccagcgcgcg tgcgccctgg caggaagatg gctgtgaggg acaggggagt ggcgccctgc     4200

aatatttgca tgtcgctatg tgttctggga aatcaccata aacgtgaaat gtctttggat     4260

ttgggaatcg tataagaact gtatgagacc acggttatct cctgttctgc agcgttttag     4320

agctagaaat agcaagttaa aataaggcta gtccgttatc aacttgaaaa agtggcaccg     4380

agtcggtgct ttttttacta gtgcggccgc aggaacccct agtgatggag ttggccactc     4440

cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc cgacgcccgg     4500

gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag ctgcctgcag g              4551


