                         SEQUENCE LISTING

<110>  Oxford Nanopore Technologies Limited
 
<120>  Method

<130>  N404112WO

<140>  GB1418159.8
<141>  2014-10-14

<160>  37    

<170>  PatentIn version 3.5

<210>  1
<211>  558
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Mycobacterium smegmatis porin A mutant 
       (D90N/D91N/D93N/D118R/D134R/E139K)

<400>  1
atgggtctgg ataatgaact gagcctggtg gacggtcaag atcgtaccct gacggtgcaa       60

caatgggata cctttctgaa tggcgttttt ccgctggatc gtaatcgcct gacccgtgaa      120

tggtttcatt ccggtcgcgc aaaatatatc gtcgcaggcc cgggtgctga cgaattcgaa      180

ggcacgctgg aactgggtta tcagattggc tttccgtggt cactgggcgt tggtatcaac      240

ttctcgtaca ccacgccgaa tattctgatc aacaatggta acattaccgc accgccgttt      300

ggcctgaaca gcgtgattac gccgaacctg tttccgggtg ttagcatctc tgcccgtctg      360

ggcaatggtc cgggcattca agaagtggca acctttagtg tgcgcgtttc cggcgctaaa      420

ggcggtgtcg cggtgtctaa cgcccacggt accgttacgg gcgcggccgg cggtgtcctg      480

ctgcgtccgt tcgcgcgcct gattgcctct accggcgaca gcgttacgac ctatggcgaa      540

ccgtggaata tgaactaa                                                    558


<210>  2
<211>  184
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Mycobacterium smegmatis porin A mutant
       (D90N/D91N/D93N/D118R/D134R/E139K)

<400>  2

Gly Leu Asp Asn Glu Leu Ser Leu Val Asp Gly Gln Asp Arg Thr Leu 
1               5                   10                  15      


Thr Val Gln Gln Trp Asp Thr Phe Leu Asn Gly Val Phe Pro Leu Asp 
            20                  25                  30          


Arg Asn Arg Leu Thr Arg Glu Trp Phe His Ser Gly Arg Ala Lys Tyr 
        35                  40                  45              


Ile Val Ala Gly Pro Gly Ala Asp Glu Phe Glu Gly Thr Leu Glu Leu 
    50                  55                  60                  


Gly Tyr Gln Ile Gly Phe Pro Trp Ser Leu Gly Val Gly Ile Asn Phe 
65                  70                  75                  80  


Ser Tyr Thr Thr Pro Asn Ile Leu Ile Asn Asn Gly Asn Ile Thr Ala 
                85                  90                  95      


Pro Pro Phe Gly Leu Asn Ser Val Ile Thr Pro Asn Leu Phe Pro Gly 
            100                 105                 110         


Val Ser Ile Ser Ala Arg Leu Gly Asn Gly Pro Gly Ile Gln Glu Val 
        115                 120                 125             


Ala Thr Phe Ser Val Arg Val Ser Gly Ala Lys Gly Gly Val Ala Val 
    130                 135                 140                 


Ser Asn Ala His Gly Thr Val Thr Gly Ala Ala Gly Gly Val Leu Leu 
145                 150                 155                 160 


Arg Pro Phe Ala Arg Leu Ile Ala Ser Thr Gly Asp Ser Val Thr Thr 
                165                 170                 175     


Tyr Gly Glu Pro Trp Asn Met Asn 
            180                 


<210>  3
<211>  885
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  alpha-hemolysin mutant (E111N/K147N)

<400>  3
atggcagatt ctgatattaa tattaaaacc ggtactacag atattggaag caatactaca       60

gtaaaaacag gtgatttagt cacttatgat aaagaaaatg gcatgcacaa aaaagtattt      120

tatagtttta tcgatgataa aaatcacaat aaaaaactgc tagttattag aacaaaaggt      180

accattgctg gtcaatatag agtttatagc gaagaaggtg ctaacaaaag tggtttagcc      240

tggccttcag cctttaaggt acagttgcaa ctacctgata atgaagtagc tcaaatatct      300

gattactatc caagaaattc gattgataca aaaaactata tgagtacttt aacttatgga      360

ttcaacggta atgttactgg tgatgataca ggaaaaattg gcggccttat tggtgcaaat      420

gtttcgattg gtcatacact gaactatgtt caacctgatt tcaaaacaat tttagagagc      480

ccaactgata aaaaagtagg ctggaaagtg atatttaaca atatggtgaa tcaaaattgg      540

ggaccatacg atcgagattc ttggaacccg gtatatggca atcaactttt catgaaaact      600

agaaatggtt ctatgaaagc agcagataac ttccttgatc ctaacaaagc aagttctcta      660

ttatcttcag ggttttcacc agacttcgct acagttatta ctatggatag aaaagcatcc      720

aaacaacaaa caaatataga tgtaatatac gaacgagttc gtgatgatta ccaattgcat      780

tggacttcaa caaattggaa aggtaccaat actaaagata aatggacaga tcgttcttca      840

gaaagatata aaatcgattg ggaaaaagaa gaaatgacaa attaa                      885


<210>  4
<211>  293
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  alpha-hemolysin mutant (E111N/K147N)

<400>  4

Ala Asp Ser Asp Ile Asn Ile Lys Thr Gly Thr Thr Asp Ile Gly Ser 
1               5                   10                  15      


Asn Thr Thr Val Lys Thr Gly Asp Leu Val Thr Tyr Asp Lys Glu Asn 
            20                  25                  30          


Gly Met His Lys Lys Val Phe Tyr Ser Phe Ile Asp Asp Lys Asn His 
        35                  40                  45              


Asn Lys Lys Leu Leu Val Ile Arg Thr Lys Gly Thr Ile Ala Gly Gln 
    50                  55                  60                  


Tyr Arg Val Tyr Ser Glu Glu Gly Ala Asn Lys Ser Gly Leu Ala Trp 
65                  70                  75                  80  


Pro Ser Ala Phe Lys Val Gln Leu Gln Leu Pro Asp Asn Glu Val Ala 
                85                  90                  95      


Gln Ile Ser Asp Tyr Tyr Pro Arg Asn Ser Ile Asp Thr Lys Asn Tyr 
            100                 105                 110         


Met Ser Thr Leu Thr Tyr Gly Phe Asn Gly Asn Val Thr Gly Asp Asp 
        115                 120                 125             


Thr Gly Lys Ile Gly Gly Leu Ile Gly Ala Asn Val Ser Ile Gly His 
    130                 135                 140                 


Thr Leu Asn Tyr Val Gln Pro Asp Phe Lys Thr Ile Leu Glu Ser Pro 
145                 150                 155                 160 


Thr Asp Lys Lys Val Gly Trp Lys Val Ile Phe Asn Asn Met Val Asn 
                165                 170                 175     


Gln Asn Trp Gly Pro Tyr Asp Arg Asp Ser Trp Asn Pro Val Tyr Gly 
            180                 185                 190         


Asn Gln Leu Phe Met Lys Thr Arg Asn Gly Ser Met Lys Ala Ala Asp 
        195                 200                 205             


Asn Phe Leu Asp Pro Asn Lys Ala Ser Ser Leu Leu Ser Ser Gly Phe 
    210                 215                 220                 


Ser Pro Asp Phe Ala Thr Val Ile Thr Met Asp Arg Lys Ala Ser Lys 
225                 230                 235                 240 


Gln Gln Thr Asn Ile Asp Val Ile Tyr Glu Arg Val Arg Asp Asp Tyr 
                245                 250                 255     


Gln Leu His Trp Thr Ser Thr Asn Trp Lys Gly Thr Asn Thr Lys Asp 
            260                 265                 270         


Lys Trp Thr Asp Arg Ser Ser Glu Arg Tyr Lys Ile Asp Trp Glu Lys 
        275                 280                 285             


Glu Glu Met Thr Asn 
    290             


<210>  5
<211>  184
<212>  PRT
<213>  Mycobacterium smegmatis

<400>  5

Gly Leu Asp Asn Glu Leu Ser Leu Val Asp Gly Gln Asp Arg Thr Leu 
1               5                   10                  15      


Thr Val Gln Gln Trp Asp Thr Phe Leu Asn Gly Val Phe Pro Leu Asp 
            20                  25                  30          


Arg Asn Arg Leu Thr Arg Glu Trp Phe His Ser Gly Arg Ala Lys Tyr 
        35                  40                  45              


Ile Val Ala Gly Pro Gly Ala Asp Glu Phe Glu Gly Thr Leu Glu Leu 
    50                  55                  60                  


Gly Tyr Gln Ile Gly Phe Pro Trp Ser Leu Gly Val Gly Ile Asn Phe 
65                  70                  75                  80  


Ser Tyr Thr Thr Pro Asn Ile Leu Ile Asp Asp Gly Asp Ile Thr Ala 
                85                  90                  95      


Pro Pro Phe Gly Leu Asn Ser Val Ile Thr Pro Asn Leu Phe Pro Gly 
            100                 105                 110         


Val Ser Ile Ser Ala Asp Leu Gly Asn Gly Pro Gly Ile Gln Glu Val 
        115                 120                 125             


Ala Thr Phe Ser Val Asp Val Ser Gly Pro Ala Gly Gly Val Ala Val 
    130                 135                 140                 


Ser Asn Ala His Gly Thr Val Thr Gly Ala Ala Gly Gly Val Leu Leu 
145                 150                 155                 160 


Arg Pro Phe Ala Arg Leu Ile Ala Ser Thr Gly Asp Ser Val Thr Thr 
                165                 170                 175     


Tyr Gly Glu Pro Trp Asn Met Asn 
            180                 


<210>  6
<211>  184
<212>  PRT
<213>  Mycobacterium smegmatis

<400>  6

Gly Leu Asp Asn Glu Leu Ser Leu Val Asp Gly Gln Asp Arg Thr Leu 
1               5                   10                  15      


Thr Val Gln Gln Trp Asp Thr Phe Leu Asn Gly Val Phe Pro Leu Asp 
            20                  25                  30          


Arg Asn Arg Leu Thr Arg Glu Trp Phe His Ser Gly Arg Ala Lys Tyr 
        35                  40                  45              


Ile Val Ala Gly Pro Gly Ala Asp Glu Phe Glu Gly Thr Leu Glu Leu 
    50                  55                  60                  


Gly Tyr Gln Ile Gly Phe Pro Trp Ser Leu Gly Val Gly Ile Asn Phe 
65                  70                  75                  80  


Ser Tyr Thr Thr Pro Asn Ile Leu Ile Asp Asp Gly Asp Ile Thr Gly 
                85                  90                  95      


Pro Pro Phe Gly Leu Glu Ser Val Ile Thr Pro Asn Leu Phe Pro Gly 
            100                 105                 110         


Val Ser Ile Ser Ala Asp Leu Gly Asn Gly Pro Gly Ile Gln Glu Val 
        115                 120                 125             


Ala Thr Phe Ser Val Asp Val Ser Gly Pro Ala Gly Gly Val Ala Val 
    130                 135                 140                 


Ser Asn Ala His Gly Thr Val Thr Gly Ala Ala Gly Gly Val Leu Leu 
145                 150                 155                 160 


Arg Pro Phe Ala Arg Leu Ile Ala Ser Thr Gly Asp Ser Val Thr Thr 
                165                 170                 175     


Tyr Gly Glu Pro Trp Asn Met Asn 
            180                 


<210>  7
<211>  183
<212>  PRT
<213>  Mycobacterium smegmatis

<400>  7

Val Asp Asn Gln Leu Ser Val Val Asp Gly Gln Gly Arg Thr Leu Thr 
1               5                   10                  15      


Val Gln Gln Ala Glu Thr Phe Leu Asn Gly Val Phe Pro Leu Asp Arg 
            20                  25                  30          


Asn Arg Leu Thr Arg Glu Trp Phe His Ser Gly Arg Ala Thr Tyr His 
        35                  40                  45              


Val Ala Gly Pro Gly Ala Asp Glu Phe Glu Gly Thr Leu Glu Leu Gly 
    50                  55                  60                  


Tyr Gln Val Gly Phe Pro Trp Ser Leu Gly Val Gly Ile Asn Phe Ser 
65                  70                  75                  80  


Tyr Thr Thr Pro Asn Ile Leu Ile Asp Gly Gly Asp Ile Thr Gln Pro 
                85                  90                  95      


Pro Phe Gly Leu Asp Thr Ile Ile Thr Pro Asn Leu Phe Pro Gly Val 
            100                 105                 110         


Ser Ile Ser Ala Asp Leu Gly Asn Gly Pro Gly Ile Gln Glu Val Ala 
        115                 120                 125             


Thr Phe Ser Val Asp Val Lys Gly Ala Lys Gly Ala Val Ala Val Ser 
    130                 135                 140                 


Asn Ala His Gly Thr Val Thr Gly Ala Ala Gly Gly Val Leu Leu Arg 
145                 150                 155                 160 


Pro Phe Ala Arg Leu Ile Ala Ser Thr Gly Asp Ser Val Thr Thr Tyr 
                165                 170                 175     


Gly Glu Pro Trp Asn Met Asn 
            180             


<210>  8
<211>  1830
<212>  DNA
<213>  Bacillus subtilis phage phi29

<400>  8
atgaaacaca tgccgcgtaa aatgtatagc tgcgcgtttg aaaccacgac caaagtggaa       60

gattgtcgcg tttgggccta tggctacatg aacatcgaag atcattctga atacaaaatc      120

ggtaacagtc tggatgaatt tatggcatgg gtgctgaaag ttcaggcgga tctgtacttc      180

cacaacctga aatttgatgg cgcattcatt atcaactggc tggaacgtaa tggctttaaa      240

tggagcgcgg atggtctgcc gaacacgtat aataccatta tctctcgtat gggccagtgg      300

tatatgattg atatctgcct gggctacaaa ggtaaacgca aaattcatac cgtgatctat      360

gatagcctga aaaaactgcc gtttccggtg aagaaaattg cgaaagattt caaactgacg      420

gttctgaaag gcgatattga ttatcacaaa gaacgtccgg ttggttacaa aatcaccccg      480

gaagaatacg catacatcaa aaacgatatc cagatcatcg cagaagcgct gctgattcag      540

tttaaacagg gcctggatcg catgaccgcg ggcagtgata gcctgaaagg tttcaaagat      600

atcatcacga ccaaaaaatt caaaaaagtg ttcccgacgc tgagcctggg tctggataaa      660

gaagttcgtt atgcctaccg cggcggtttt acctggctga acgatcgttt caaagaaaaa      720

gaaattggcg agggtatggt gtttgatgtt aatagtctgt atccggcaca gatgtacagc      780

cgcctgctgc cgtatggcga accgatcgtg ttcgagggta aatatgtttg ggatgaagat      840

tacccgctgc atattcagca catccgttgt gaatttgaac tgaaagaagg ctatattccg      900

accattcaga tcaaacgtag tcgcttctat aagggtaacg aatacctgaa aagctctggc      960

ggtgaaatcg cggatctgtg gctgagtaac gtggatctgg aactgatgaa agaacactac     1020

gatctgtaca acgttgaata catcagcggc ctgaaattta aagccacgac cggtctgttc     1080

aaagatttca tcgataaatg gacctacatc aaaacgacct ctgaaggcgc gattaaacag     1140

ctggccaaac tgatgctgaa cagcctgtat ggcaaattcg cctctaatcc ggatgtgacc     1200

ggtaaagttc cgtacctgaa agaaaatggc gcactgggtt ttcgcctggg cgaagaagaa     1260

acgaaagatc cggtgtatac cccgatgggt gttttcatta cggcctgggc acgttacacg     1320

accatcaccg cggcccaggc atgctatgat cgcattatct actgtgatac cgattctatt     1380

catctgacgg gcaccgaaat cccggatgtg attaaagata tcgttgatcc gaaaaaactg     1440

ggttattggg cccacgaaag tacgtttaaa cgtgcaaaat acctgcgcca gaaaacctac     1500

atccaggata tctacatgaa agaagtggat ggcaaactgg ttgaaggttc tccggatgat     1560

tacaccgata tcaaattcag tgtgaaatgc gccggcatga cggataaaat caaaaaagaa     1620

gtgaccttcg aaaacttcaa agttggtttc agccgcaaaa tgaaaccgaa accggtgcag     1680

gttccgggcg gtgtggttct ggtggatgat acgtttacca ttaaatctgg cggtagtgcg     1740

tggagccatc cgcagttcga aaaaggcggt ggctctggtg gcggttctgg cggtagtgcc     1800

tggagccacc cgcagtttga aaaataataa                                      1830


<210>  9
<211>  608
<212>  PRT
<213>  Bacillus subtilis phage phi29

<400>  9

Met Lys His Met Pro Arg Lys Met Tyr Ser Cys Ala Phe Glu Thr Thr 
1               5                   10                  15      


Thr Lys Val Glu Asp Cys Arg Val Trp Ala Tyr Gly Tyr Met Asn Ile 
            20                  25                  30          


Glu Asp His Ser Glu Tyr Lys Ile Gly Asn Ser Leu Asp Glu Phe Met 
        35                  40                  45              


Ala Trp Val Leu Lys Val Gln Ala Asp Leu Tyr Phe His Asn Leu Lys 
    50                  55                  60                  


Phe Asp Gly Ala Phe Ile Ile Asn Trp Leu Glu Arg Asn Gly Phe Lys 
65                  70                  75                  80  


Trp Ser Ala Asp Gly Leu Pro Asn Thr Tyr Asn Thr Ile Ile Ser Arg 
                85                  90                  95      


Met Gly Gln Trp Tyr Met Ile Asp Ile Cys Leu Gly Tyr Lys Gly Lys 
            100                 105                 110         


Arg Lys Ile His Thr Val Ile Tyr Asp Ser Leu Lys Lys Leu Pro Phe 
        115                 120                 125             


Pro Val Lys Lys Ile Ala Lys Asp Phe Lys Leu Thr Val Leu Lys Gly 
    130                 135                 140                 


Asp Ile Asp Tyr His Lys Glu Arg Pro Val Gly Tyr Lys Ile Thr Pro 
145                 150                 155                 160 


Glu Glu Tyr Ala Tyr Ile Lys Asn Asp Ile Gln Ile Ile Ala Glu Ala 
                165                 170                 175     


Leu Leu Ile Gln Phe Lys Gln Gly Leu Asp Arg Met Thr Ala Gly Ser 
            180                 185                 190         


Asp Ser Leu Lys Gly Phe Lys Asp Ile Ile Thr Thr Lys Lys Phe Lys 
        195                 200                 205             


Lys Val Phe Pro Thr Leu Ser Leu Gly Leu Asp Lys Glu Val Arg Tyr 
    210                 215                 220                 


Ala Tyr Arg Gly Gly Phe Thr Trp Leu Asn Asp Arg Phe Lys Glu Lys 
225                 230                 235                 240 


Glu Ile Gly Glu Gly Met Val Phe Asp Val Asn Ser Leu Tyr Pro Ala 
                245                 250                 255     


Gln Met Tyr Ser Arg Leu Leu Pro Tyr Gly Glu Pro Ile Val Phe Glu 
            260                 265                 270         


Gly Lys Tyr Val Trp Asp Glu Asp Tyr Pro Leu His Ile Gln His Ile 
        275                 280                 285             


Arg Cys Glu Phe Glu Leu Lys Glu Gly Tyr Ile Pro Thr Ile Gln Ile 
    290                 295                 300                 


Lys Arg Ser Arg Phe Tyr Lys Gly Asn Glu Tyr Leu Lys Ser Ser Gly 
305                 310                 315                 320 


Gly Glu Ile Ala Asp Leu Trp Leu Ser Asn Val Asp Leu Glu Leu Met 
                325                 330                 335     


Lys Glu His Tyr Asp Leu Tyr Asn Val Glu Tyr Ile Ser Gly Leu Lys 
            340                 345                 350         


Phe Lys Ala Thr Thr Gly Leu Phe Lys Asp Phe Ile Asp Lys Trp Thr 
        355                 360                 365             


Tyr Ile Lys Thr Thr Ser Glu Gly Ala Ile Lys Gln Leu Ala Lys Leu 
    370                 375                 380                 


Met Leu Asn Ser Leu Tyr Gly Lys Phe Ala Ser Asn Pro Asp Val Thr 
385                 390                 395                 400 


Gly Lys Val Pro Tyr Leu Lys Glu Asn Gly Ala Leu Gly Phe Arg Leu 
                405                 410                 415     


Gly Glu Glu Glu Thr Lys Asp Pro Val Tyr Thr Pro Met Gly Val Phe 
            420                 425                 430         


Ile Thr Ala Trp Ala Arg Tyr Thr Thr Ile Thr Ala Ala Gln Ala Cys 
        435                 440                 445             


Tyr Asp Arg Ile Ile Tyr Cys Asp Thr Asp Ser Ile His Leu Thr Gly 
    450                 455                 460                 


Thr Glu Ile Pro Asp Val Ile Lys Asp Ile Val Asp Pro Lys Lys Leu 
465                 470                 475                 480 


Gly Tyr Trp Ala His Glu Ser Thr Phe Lys Arg Ala Lys Tyr Leu Arg 
                485                 490                 495     


Gln Lys Thr Tyr Ile Gln Asp Ile Tyr Met Lys Glu Val Asp Gly Lys 
            500                 505                 510         


Leu Val Glu Gly Ser Pro Asp Asp Tyr Thr Asp Ile Lys Phe Ser Val 
        515                 520                 525             


Lys Cys Ala Gly Met Thr Asp Lys Ile Lys Lys Glu Val Thr Phe Glu 
    530                 535                 540                 


Asn Phe Lys Val Gly Phe Ser Arg Lys Met Lys Pro Lys Pro Val Gln 
545                 550                 555                 560 


Val Pro Gly Gly Val Val Leu Val Asp Asp Thr Phe Thr Ile Lys Ser 
                565                 570                 575     


Gly Gly Ser Ala Trp Ser His Pro Gln Phe Glu Lys Gly Gly Gly Ser 
            580                 585                 590         


Gly Gly Gly Ser Gly Gly Ser Ala Trp Ser His Pro Gln Phe Glu Lys 
        595                 600                 605             


<210>  10
<211>  1390
<212>  DNA
<213>  Escherichia coli

<400>  10
atgatgaacg atggcaaaca gcagagcacc ttcctgtttc atgattatga aaccttcggt       60

acccatccgg ccctggatcg tccggcgcag tttgcggcca ttcgcaccga tagcgaattc      120

aatgtgattg gcgaaccgga agtgttttat tgcaaaccgg ccgatgatta tctgccgcag      180

ccgggtgcgg tgctgattac cggtattacc ccgcaggaag cgcgcgcgaa aggtgaaaac      240

gaagcggcgt ttgccgcgcg cattcatagc ctgtttaccg tgccgaaaac ctgcattctg      300

ggctataaca atgtgcgctt cgatgatgaa gttacccgta atatctttta tcgtaacttt      360

tatgatccgt atgcgtggag ctggcagcat gataacagcc gttgggatct gctggatgtg      420

atgcgcgcgt gctatgcgct gcgcccggaa ggcattaatt ggccggaaaa cgatgatggc      480

ctgccgagct ttcgtctgga acatctgacc aaagccaacg gcattgaaca tagcaatgcc      540

catgatgcga tggccgatgt ttatgcgacc attgcgatgg cgaaactggt taaaacccgt      600

cagccgcgcc tgtttgatta tctgtttacc caccgtaaca aacacaaact gatggcgctg      660

attgatgttc cgcagatgaa accgctggtg catgtgagcg gcatgtttgg cgcctggcgc      720

ggcaacacca gctgggtggc cccgctggcc tggcacccgg aaaatcgtaa cgccgtgatt      780

atggttgatc tggccggtga tattagcccg ctgctggaac tggatagcga taccctgcgt      840

gaacgcctgt ataccgccaa aaccgatctg ggcgataatg ccgccgtgcc ggtgaaactg      900

gttcacatta acaaatgccc ggtgctggcc caggcgaaca ccctgcgccc ggaagatgcg      960

gatcgtctgg gtattaatcg ccagcattgt ctggataatc tgaaaatcct gcgtgaaaac     1020

ccgcaggtgc gtgaaaaagt ggtggcgatc ttcgcggaag cggaaccgtt caccccgagc     1080

gataacgtgg atgcgcagct gtataacggc ttctttagcg atgccgatcg cgcggcgatg     1140

aaaatcgttc tggaaaccga accgcgcaat ctgccggcgc tggatattac ctttgttgat     1200

aaacgtattg aaaaactgct gtttaattat cgtgcgcgca attttccggg taccctggat     1260

tatgccgaac agcagcgttg gctggaacat cgtcgtcagg ttttcacccc ggaatttctg     1320

cagggttatg cggatgaact gcagatgctg gttcagcagt atgccgatga taaagaaaaa     1380

gtggcgctgc                                                            1390


<210>  11
<211>  485
<212>  PRT
<213>  Escherichia coli

<400>  11

Met Met Asn Asp Gly Lys Gln Gln Ser Thr Phe Leu Phe His Asp Tyr 
1               5                   10                  15      


Glu Thr Phe Gly Thr His Pro Ala Leu Asp Arg Pro Ala Gln Phe Ala 
            20                  25                  30          


Ala Ile Arg Thr Asp Ser Glu Phe Asn Val Ile Gly Glu Pro Glu Val 
        35                  40                  45              


Phe Tyr Cys Lys Pro Ala Asp Asp Tyr Leu Pro Gln Pro Gly Ala Val 
    50                  55                  60                  


Leu Ile Thr Gly Ile Thr Pro Gln Glu Ala Arg Ala Lys Gly Glu Asn 
65                  70                  75                  80  


Glu Ala Ala Phe Ala Ala Arg Ile His Ser Leu Phe Thr Val Pro Lys 
                85                  90                  95      


Thr Cys Ile Leu Gly Tyr Asn Asn Val Arg Phe Asp Asp Glu Val Thr 
            100                 105                 110         


Arg Asn Ile Phe Tyr Arg Asn Phe Tyr Asp Pro Tyr Ala Trp Ser Trp 
        115                 120                 125             


Gln His Asp Asn Ser Arg Trp Asp Leu Leu Asp Val Met Arg Ala Cys 
    130                 135                 140                 


Tyr Ala Leu Arg Pro Glu Gly Ile Asn Trp Pro Glu Asn Asp Asp Gly 
145                 150                 155                 160 


Leu Pro Ser Phe Arg Leu Glu His Leu Thr Lys Ala Asn Gly Ile Glu 
                165                 170                 175     


His Ser Asn Ala His Asp Ala Met Ala Asp Val Tyr Ala Thr Ile Ala 
            180                 185                 190         


Met Ala Lys Leu Val Lys Thr Arg Gln Pro Arg Leu Phe Asp Tyr Leu 
        195                 200                 205             


Phe Thr His Arg Asn Lys His Lys Leu Met Ala Leu Ile Asp Val Pro 
    210                 215                 220                 


Gln Met Lys Pro Leu Val His Val Ser Gly Met Phe Gly Ala Trp Arg 
225                 230                 235                 240 


Gly Asn Thr Ser Trp Val Ala Pro Leu Ala Trp His Pro Glu Asn Arg 
                245                 250                 255     


Asn Ala Val Ile Met Val Asp Leu Ala Gly Asp Ile Ser Pro Leu Leu 
            260                 265                 270         


Glu Leu Asp Ser Asp Thr Leu Arg Glu Arg Leu Tyr Thr Ala Lys Thr 
        275                 280                 285             


Asp Leu Gly Asp Asn Ala Ala Val Pro Val Lys Leu Val His Ile Asn 
    290                 295                 300                 


Lys Cys Pro Val Leu Ala Gln Ala Asn Thr Leu Arg Pro Glu Asp Ala 
305                 310                 315                 320 


Asp Arg Leu Gly Ile Asn Arg Gln His Cys Leu Asp Asn Leu Lys Ile 
                325                 330                 335     


Leu Arg Glu Asn Pro Gln Val Arg Glu Lys Val Val Ala Ile Phe Ala 
            340                 345                 350         


Glu Ala Glu Pro Phe Thr Pro Ser Asp Asn Val Asp Ala Gln Leu Tyr 
        355                 360                 365             


Asn Gly Phe Phe Ser Asp Ala Asp Arg Ala Ala Met Lys Ile Val Leu 
    370                 375                 380                 


Glu Thr Glu Pro Arg Asn Leu Pro Ala Leu Asp Ile Thr Phe Val Asp 
385                 390                 395                 400 


Lys Arg Ile Glu Lys Leu Leu Phe Asn Tyr Arg Ala Arg Asn Phe Pro 
                405                 410                 415     


Gly Thr Leu Asp Tyr Ala Glu Gln Gln Arg Trp Leu Glu His Arg Arg 
            420                 425                 430         


Gln Val Phe Thr Pro Glu Phe Leu Gln Gly Tyr Ala Asp Glu Leu Gln 
        435                 440                 445             


Met Leu Val Gln Gln Tyr Ala Asp Asp Lys Glu Lys Val Ala Leu Leu 
    450                 455                 460                 


Lys Ala Leu Trp Gln Tyr Ala Glu Glu Ile Val Ser Gly Ser Gly His 
465                 470                 475                 480 


His His His His His 
                485 


<210>  12
<211>  804
<212>  DNA
<213>  Escherichia coli

<400>  12
atgaaatttg tctcttttaa tatcaacggc ctgcgcgcca gacctcacca gcttgaagcc       60

atcgtcgaaa agcaccaacc ggatgtgatt ggcctgcagg agacaaaagt tcatgacgat      120

atgtttccgc tcgaagaggt ggcgaagctc ggctacaacg tgttttatca cgggcagaaa      180

ggccattatg gcgtggcgct gctgaccaaa gagacgccga ttgccgtgcg tcgcggcttt      240

cccggtgacg acgaagaggc gcagcggcgg attattatgg cggaaatccc ctcactgctg      300

ggtaatgtca ccgtgatcaa cggttacttc ccgcagggtg aaagccgcga ccatccgata      360

aaattcccgg caaaagcgca gttttatcag aatctgcaaa actacctgga aaccgaactc      420

aaacgtgata atccggtact gattatgggc gatatgaata tcagccctac agatctggat      480

atcggcattg gcgaagaaaa ccgtaagcgc tggctgcgta ccggtaaatg ctctttcctg      540

ccggaagagc gcgaatggat ggacaggctg atgagctggg ggttggtcga taccttccgc      600

catgcgaatc cgcaaacagc agatcgtttc tcatggtttg attaccgctc aaaaggtttt      660

gacgataacc gtggtctgcg catcgacctg ctgctcgcca gccaaccgct ggcagaatgt      720

tgcgtagaaa ccggcatcga ctatgaaatc cgcagcatgg aaaaaccgtc cgatcacgcc      780

cccgtctggg cgaccttccg ccgc                                             804


<210>  13
<211>  268
<212>  PRT
<213>  Escherichia coli

<400>  13

Met Lys Phe Val Ser Phe Asn Ile Asn Gly Leu Arg Ala Arg Pro His 
1               5                   10                  15      


Gln Leu Glu Ala Ile Val Glu Lys His Gln Pro Asp Val Ile Gly Leu 
            20                  25                  30          


Gln Glu Thr Lys Val His Asp Asp Met Phe Pro Leu Glu Glu Val Ala 
        35                  40                  45              


Lys Leu Gly Tyr Asn Val Phe Tyr His Gly Gln Lys Gly His Tyr Gly 
    50                  55                  60                  


Val Ala Leu Leu Thr Lys Glu Thr Pro Ile Ala Val Arg Arg Gly Phe 
65                  70                  75                  80  


Pro Gly Asp Asp Glu Glu Ala Gln Arg Arg Ile Ile Met Ala Glu Ile 
                85                  90                  95      


Pro Ser Leu Leu Gly Asn Val Thr Val Ile Asn Gly Tyr Phe Pro Gln 
            100                 105                 110         


Gly Glu Ser Arg Asp His Pro Ile Lys Phe Pro Ala Lys Ala Gln Phe 
        115                 120                 125             


Tyr Gln Asn Leu Gln Asn Tyr Leu Glu Thr Glu Leu Lys Arg Asp Asn 
    130                 135                 140                 


Pro Val Leu Ile Met Gly Asp Met Asn Ile Ser Pro Thr Asp Leu Asp 
145                 150                 155                 160 


Ile Gly Ile Gly Glu Glu Asn Arg Lys Arg Trp Leu Arg Thr Gly Lys 
                165                 170                 175     


Cys Ser Phe Leu Pro Glu Glu Arg Glu Trp Met Asp Arg Leu Met Ser 
            180                 185                 190         


Trp Gly Leu Val Asp Thr Phe Arg His Ala Asn Pro Gln Thr Ala Asp 
        195                 200                 205             


Arg Phe Ser Trp Phe Asp Tyr Arg Ser Lys Gly Phe Asp Asp Asn Arg 
    210                 215                 220                 


Gly Leu Arg Ile Asp Leu Leu Leu Ala Ser Gln Pro Leu Ala Glu Cys 
225                 230                 235                 240 


Cys Val Glu Thr Gly Ile Asp Tyr Glu Ile Arg Ser Met Glu Lys Pro 
                245                 250                 255     


Ser Asp His Ala Pro Val Trp Ala Thr Phe Arg Arg 
            260                 265             


<210>  14
<211>  1275
<212>  DNA
<213>  Thermus thermophilus

<400>  14
atgtttcgtc gtaaagaaga tctggatccg ccgctggcac tgctgccgct gaaaggcctg       60

cgcgaagccg ccgcactgct ggaagaagcg ctgcgtcaag gtaaacgcat tcgtgttcac      120

ggcgactatg atgcggatgg cctgaccggc accgcgatcc tggttcgtgg tctggccgcc      180

ctgggtgcgg atgttcatcc gtttatcccg caccgcctgg aagaaggcta tggtgtcctg      240

atggaacgcg tcccggaaca tctggaagcc tcggacctgt ttctgaccgt tgactgcggc      300

attaccaacc atgcggaact gcgcgaactg ctggaaaatg gcgtggaagt cattgttacc      360

gatcatcata cgccgggcaa aacgccgccg ccgggtctgg tcgtgcatcc ggcgctgacg      420

ccggatctga aagaaaaacc gaccggcgca ggcgtggcgt ttctgctgct gtgggcactg      480

catgaacgcc tgggcctgcc gccgccgctg gaatacgcgg acctggcagc cgttggcacc      540

attgccgacg ttgccccgct gtggggttgg aatcgtgcac tggtgaaaga aggtctggca      600

cgcatcccgg cttcatcttg ggtgggcctg cgtctgctgg ctgaagccgt gggctatacc      660

ggcaaagcgg tcgaagtcgc tttccgcatc gcgccgcgca tcaatgcggc ttcccgcctg      720

ggcgaagcgg aaaaagccct gcgcctgctg ctgacggatg atgcggcaga agctcaggcg      780

ctggtcggcg aactgcaccg tctgaacgcc cgtcgtcaga ccctggaaga agcgatgctg      840

cgcaaactgc tgccgcaggc cgacccggaa gcgaaagcca tcgttctgct ggacccggaa      900

ggccatccgg gtgttatggg tattgtggcc tctcgcatcc tggaagcgac cctgcgcccg      960

gtctttctgg tggcccaggg caaaggcacc gtgcgttcgc tggctccgat ttccgccgtc     1020

gaagcactgc gcagcgcgga agatctgctg ctgcgttatg gtggtcataa agaagcggcg     1080

ggtttcgcaa tggatgaagc gctgtttccg gcgttcaaag cacgcgttga agcgtatgcc     1140

gcacgtttcc cggatccggt tcgtgaagtg gcactgctgg atctgctgcc ggaaccgggc     1200

ctgctgccgc aggtgttccg tgaactggca ctgctggaac cgtatggtga aggtaacccg     1260

gaaccgctgt tcctg                                                      1275


<210>  15
<211>  425
<212>  PRT
<213>  Thermus thermophilus

<400>  15

Met Phe Arg Arg Lys Glu Asp Leu Asp Pro Pro Leu Ala Leu Leu Pro 
1               5                   10                  15      


Leu Lys Gly Leu Arg Glu Ala Ala Ala Leu Leu Glu Glu Ala Leu Arg 
            20                  25                  30          


Gln Gly Lys Arg Ile Arg Val His Gly Asp Tyr Asp Ala Asp Gly Leu 
        35                  40                  45              


Thr Gly Thr Ala Ile Leu Val Arg Gly Leu Ala Ala Leu Gly Ala Asp 
    50                  55                  60                  


Val His Pro Phe Ile Pro His Arg Leu Glu Glu Gly Tyr Gly Val Leu 
65                  70                  75                  80  


Met Glu Arg Val Pro Glu His Leu Glu Ala Ser Asp Leu Phe Leu Thr 
                85                  90                  95      


Val Asp Cys Gly Ile Thr Asn His Ala Glu Leu Arg Glu Leu Leu Glu 
            100                 105                 110         


Asn Gly Val Glu Val Ile Val Thr Asp His His Thr Pro Gly Lys Thr 
        115                 120                 125             


Pro Pro Pro Gly Leu Val Val His Pro Ala Leu Thr Pro Asp Leu Lys 
    130                 135                 140                 


Glu Lys Pro Thr Gly Ala Gly Val Ala Phe Leu Leu Leu Trp Ala Leu 
145                 150                 155                 160 


His Glu Arg Leu Gly Leu Pro Pro Pro Leu Glu Tyr Ala Asp Leu Ala 
                165                 170                 175     


Ala Val Gly Thr Ile Ala Asp Val Ala Pro Leu Trp Gly Trp Asn Arg 
            180                 185                 190         


Ala Leu Val Lys Glu Gly Leu Ala Arg Ile Pro Ala Ser Ser Trp Val 
        195                 200                 205             


Gly Leu Arg Leu Leu Ala Glu Ala Val Gly Tyr Thr Gly Lys Ala Val 
    210                 215                 220                 


Glu Val Ala Phe Arg Ile Ala Pro Arg Ile Asn Ala Ala Ser Arg Leu 
225                 230                 235                 240 


Gly Glu Ala Glu Lys Ala Leu Arg Leu Leu Leu Thr Asp Asp Ala Ala 
                245                 250                 255     


Glu Ala Gln Ala Leu Val Gly Glu Leu His Arg Leu Asn Ala Arg Arg 
            260                 265                 270         


Gln Thr Leu Glu Glu Ala Met Leu Arg Lys Leu Leu Pro Gln Ala Asp 
        275                 280                 285             


Pro Glu Ala Lys Ala Ile Val Leu Leu Asp Pro Glu Gly His Pro Gly 
    290                 295                 300                 


Val Met Gly Ile Val Ala Ser Arg Ile Leu Glu Ala Thr Leu Arg Pro 
305                 310                 315                 320 


Val Phe Leu Val Ala Gln Gly Lys Gly Thr Val Arg Ser Leu Ala Pro 
                325                 330                 335     


Ile Ser Ala Val Glu Ala Leu Arg Ser Ala Glu Asp Leu Leu Leu Arg 
            340                 345                 350         


Tyr Gly Gly His Lys Glu Ala Ala Gly Phe Ala Met Asp Glu Ala Leu 
        355                 360                 365             


Phe Pro Ala Phe Lys Ala Arg Val Glu Ala Tyr Ala Ala Arg Phe Pro 
    370                 375                 380                 


Asp Pro Val Arg Glu Val Ala Leu Leu Asp Leu Leu Pro Glu Pro Gly 
385                 390                 395                 400 


Leu Leu Pro Gln Val Phe Arg Glu Leu Ala Leu Leu Glu Pro Tyr Gly 
                405                 410                 415     


Glu Gly Asn Pro Glu Pro Leu Phe Leu 
            420                 425 


<210>  16
<211>  738
<212>  DNA
<213>  Bacteriophage lambda

<400>  16
tccggaagcg gctctggtag tggttctggc atgacaccgg acattatcct gcagcgtacc       60

gggatcgatg tgagagctgt cgaacagggg gatgatgcgt ggcacaaatt acggctcggc      120

gtcatcaccg cttcagaagt tcacaacgtg atagcaaaac cccgctccgg aaagaagtgg      180

cctgacatga aaatgtccta cttccacacc ctgcttgctg aggtttgcac cggtgtggct      240

ccggaagtta acgctaaagc actggcctgg ggaaaacagt acgagaacga cgccagaacc      300

ctgtttgaat tcacttccgg cgtgaatgtt actgaatccc cgatcatcta tcgcgacgaa      360

agtatgcgta ccgcctgctc tcccgatggt ttatgcagtg acggcaacgg ccttgaactg      420

aaatgcccgt ttacctcccg ggatttcatg aagttccggc tcggtggttt cgaggccata      480

aagtcagctt acatggccca ggtgcagtac agcatgtggg tgacgcgaaa aaatgcctgg      540

tactttgcca actatgaccc gcgtatgaag cgtgaaggcc tgcattatgt cgtgattgag      600

cgggatgaaa agtacatggc gagttttgac gagatcgtgc cggagttcat cgaaaaaatg      660

gacgaggcac tggctgaaat tggttttgta tttggggagc aatggcgatc tggctctggt      720

tccggcagcg gttccgga                                                    738


<210>  17
<211>  226
<212>  PRT
<213>  Bacteriophage lambda

<400>  17

Met Thr Pro Asp Ile Ile Leu Gln Arg Thr Gly Ile Asp Val Arg Ala 
1               5                   10                  15      


Val Glu Gln Gly Asp Asp Ala Trp His Lys Leu Arg Leu Gly Val Ile 
            20                  25                  30          


Thr Ala Ser Glu Val His Asn Val Ile Ala Lys Pro Arg Ser Gly Lys 
        35                  40                  45              


Lys Trp Pro Asp Met Lys Met Ser Tyr Phe His Thr Leu Leu Ala Glu 
    50                  55                  60                  


Val Cys Thr Gly Val Ala Pro Glu Val Asn Ala Lys Ala Leu Ala Trp 
65                  70                  75                  80  


Gly Lys Gln Tyr Glu Asn Asp Ala Arg Thr Leu Phe Glu Phe Thr Ser 
                85                  90                  95      


Gly Val Asn Val Thr Glu Ser Pro Ile Ile Tyr Arg Asp Glu Ser Met 
            100                 105                 110         


Arg Thr Ala Cys Ser Pro Asp Gly Leu Cys Ser Asp Gly Asn Gly Leu 
        115                 120                 125             


Glu Leu Lys Cys Pro Phe Thr Ser Arg Asp Phe Met Lys Phe Arg Leu 
    130                 135                 140                 


Gly Gly Phe Glu Ala Ile Lys Ser Ala Tyr Met Ala Gln Val Gln Tyr 
145                 150                 155                 160 


Ser Met Trp Val Thr Arg Lys Asn Ala Trp Tyr Phe Ala Asn Tyr Asp 
                165                 170                 175     


Pro Arg Met Lys Arg Glu Gly Leu His Tyr Val Val Ile Glu Arg Asp 
            180                 185                 190         


Glu Lys Tyr Met Ala Ser Phe Asp Glu Ile Val Pro Glu Phe Ile Glu 
        195                 200                 205             


Lys Met Asp Glu Ala Leu Ala Glu Ile Gly Phe Val Phe Gly Glu Gln 
    210                 215                 220                 


Trp Arg 
225     


<210>  18
<211>  760
<212>  PRT
<213>  Methanococcoides burtonii

<400>  18

Met Met Ile Arg Glu Leu Asp Ile Pro Arg Asp Ile Ile Gly Phe Tyr 
1               5                   10                  15      


Glu Asp Ser Gly Ile Lys Glu Leu Tyr Pro Pro Gln Ala Glu Ala Ile 
            20                  25                  30          


Glu Met Gly Leu Leu Glu Lys Lys Asn Leu Leu Ala Ala Ile Pro Thr 
        35                  40                  45              


Ala Ser Gly Lys Thr Leu Leu Ala Glu Leu Ala Met Ile Lys Ala Ile 
    50                  55                  60                  


Arg Glu Gly Gly Lys Ala Leu Tyr Ile Val Pro Leu Arg Ala Leu Ala 
65                  70                  75                  80  


Ser Glu Lys Phe Glu Arg Phe Lys Glu Leu Ala Pro Phe Gly Ile Lys 
                85                  90                  95      


Val Gly Ile Ser Thr Gly Asp Leu Asp Ser Arg Ala Asp Trp Leu Gly 
            100                 105                 110         


Val Asn Asp Ile Ile Val Ala Thr Ser Glu Lys Thr Asp Ser Leu Leu 
        115                 120                 125             


Arg Asn Gly Thr Ser Trp Met Asp Glu Ile Thr Thr Val Val Val Asp 
    130                 135                 140                 


Glu Ile His Leu Leu Asp Ser Lys Asn Arg Gly Pro Thr Leu Glu Val 
145                 150                 155                 160 


Thr Ile Thr Lys Leu Met Arg Leu Asn Pro Asp Val Gln Val Val Ala 
                165                 170                 175     


Leu Ser Ala Thr Val Gly Asn Ala Arg Glu Met Ala Asp Trp Leu Gly 
            180                 185                 190         


Ala Ala Leu Val Leu Ser Glu Trp Arg Pro Thr Asp Leu His Glu Gly 
        195                 200                 205             


Val Leu Phe Gly Asp Ala Ile Asn Phe Pro Gly Ser Gln Lys Lys Ile 
    210                 215                 220                 


Asp Arg Leu Glu Lys Asp Asp Ala Val Asn Leu Val Leu Asp Thr Ile 
225                 230                 235                 240 


Lys Ala Glu Gly Gln Cys Leu Val Phe Glu Ser Ser Arg Arg Asn Cys 
                245                 250                 255     


Ala Gly Phe Ala Lys Thr Ala Ser Ser Lys Val Ala Lys Ile Leu Asp 
            260                 265                 270         


Asn Asp Ile Met Ile Lys Leu Ala Gly Ile Ala Glu Glu Val Glu Ser 
        275                 280                 285             


Thr Gly Glu Thr Asp Thr Ala Ile Val Leu Ala Asn Cys Ile Arg Lys 
    290                 295                 300                 


Gly Val Ala Phe His His Ala Gly Leu Asn Ser Asn His Arg Lys Leu 
305                 310                 315                 320 


Val Glu Asn Gly Phe Arg Gln Asn Leu Ile Lys Val Ile Ser Ser Thr 
                325                 330                 335     


Pro Thr Leu Ala Ala Gly Leu Asn Leu Pro Ala Arg Arg Val Ile Ile 
            340                 345                 350         


Arg Ser Tyr Arg Arg Phe Asp Ser Asn Phe Gly Met Gln Pro Ile Pro 
        355                 360                 365             


Val Leu Glu Tyr Lys Gln Met Ala Gly Arg Ala Gly Arg Pro His Leu 
    370                 375                 380                 


Asp Pro Tyr Gly Glu Ser Val Leu Leu Ala Lys Thr Tyr Asp Glu Phe 
385                 390                 395                 400 


Ala Gln Leu Met Glu Asn Tyr Val Glu Ala Asp Ala Glu Asp Ile Trp 
                405                 410                 415     


Ser Lys Leu Gly Thr Glu Asn Ala Leu Arg Thr His Val Leu Ser Thr 
            420                 425                 430         


Ile Val Asn Gly Phe Ala Ser Thr Arg Gln Glu Leu Phe Asp Phe Phe 
        435                 440                 445             


Gly Ala Thr Phe Phe Ala Tyr Gln Gln Asp Lys Trp Met Leu Glu Glu 
    450                 455                 460                 


Val Ile Asn Asp Cys Leu Glu Phe Leu Ile Asp Lys Ala Met Val Ser 
465                 470                 475                 480 


Glu Thr Glu Asp Ile Glu Asp Ala Ser Lys Leu Phe Leu Arg Gly Thr 
                485                 490                 495     


Arg Leu Gly Ser Leu Val Ser Met Leu Tyr Ile Asp Pro Leu Ser Gly 
            500                 505                 510         


Ser Lys Ile Val Asp Gly Phe Lys Asp Ile Gly Lys Ser Thr Gly Gly 
        515                 520                 525             


Asn Met Gly Ser Leu Glu Asp Asp Lys Gly Asp Asp Ile Thr Val Thr 
    530                 535                 540                 


Asp Met Thr Leu Leu His Leu Val Cys Ser Thr Pro Asp Met Arg Gln 
545                 550                 555                 560 


Leu Tyr Leu Arg Asn Thr Asp Tyr Thr Ile Val Asn Glu Tyr Ile Val 
                565                 570                 575     


Ala His Ser Asp Glu Phe His Glu Ile Pro Asp Lys Leu Lys Glu Thr 
            580                 585                 590         


Asp Tyr Glu Trp Phe Met Gly Glu Val Lys Thr Ala Met Leu Leu Glu 
        595                 600                 605             


Glu Trp Val Thr Glu Val Ser Ala Glu Asp Ile Thr Arg His Phe Asn 
    610                 615                 620                 


Val Gly Glu Gly Asp Ile His Ala Leu Ala Asp Thr Ser Glu Trp Leu 
625                 630                 635                 640 


Met His Ala Ala Ala Lys Leu Ala Glu Leu Leu Gly Val Glu Tyr Ser 
                645                 650                 655     


Ser His Ala Tyr Ser Leu Glu Lys Arg Ile Arg Tyr Gly Ser Gly Leu 
            660                 665                 670         


Asp Leu Met Glu Leu Val Gly Ile Arg Gly Val Gly Arg Val Arg Ala 
        675                 680                 685             


Arg Lys Leu Tyr Asn Ala Gly Phe Val Ser Val Ala Lys Leu Lys Gly 
    690                 695                 700                 


Ala Asp Ile Ser Val Leu Ser Lys Leu Val Gly Pro Lys Val Ala Tyr 
705                 710                 715                 720 


Asn Ile Leu Ser Gly Ile Gly Val Arg Val Asn Asp Lys His Phe Asn 
                725                 730                 735     


Ser Ala Pro Ile Ser Ser Asn Thr Leu Asp Thr Leu Leu Asp Lys Asn 
            740                 745                 750         


Gln Lys Thr Phe Asn Asp Phe Gln 
        755                 760 


<210>  19
<211>  707
<212>  PRT
<213>  Cenarchaeum symbiosum

<400>  19

Met Arg Ile Ser Glu Leu Asp Ile Pro Arg Pro Ala Ile Glu Phe Leu 
1               5                   10                  15      


Glu Gly Glu Gly Tyr Lys Lys Leu Tyr Pro Pro Gln Ala Ala Ala Ala 
            20                  25                  30          


Lys Ala Gly Leu Thr Asp Gly Lys Ser Val Leu Val Ser Ala Pro Thr 
        35                  40                  45              


Ala Ser Gly Lys Thr Leu Ile Ala Ala Ile Ala Met Ile Ser His Leu 
    50                  55                  60                  


Ser Arg Asn Arg Gly Lys Ala Val Tyr Leu Ser Pro Leu Arg Ala Leu 
65                  70                  75                  80  


Ala Ala Glu Lys Phe Ala Glu Phe Gly Lys Ile Gly Gly Ile Pro Leu 
                85                  90                  95      


Gly Arg Pro Val Arg Val Gly Val Ser Thr Gly Asp Phe Glu Lys Ala 
            100                 105                 110         


Gly Arg Ser Leu Gly Asn Asn Asp Ile Leu Val Leu Thr Asn Glu Arg 
        115                 120                 125             


Met Asp Ser Leu Ile Arg Arg Arg Pro Asp Trp Met Asp Glu Val Gly 
    130                 135                 140                 


Leu Val Ile Ala Asp Glu Ile His Leu Ile Gly Asp Arg Ser Arg Gly 
145                 150                 155                 160 


Pro Thr Leu Glu Met Val Leu Thr Lys Leu Arg Gly Leu Arg Ser Ser 
                165                 170                 175     


Pro Gln Val Val Ala Leu Ser Ala Thr Ile Ser Asn Ala Asp Glu Ile 
            180                 185                 190         


Ala Gly Trp Leu Asp Cys Thr Leu Val His Ser Thr Trp Arg Pro Val 
        195                 200                 205             


Pro Leu Ser Glu Gly Val Tyr Gln Asp Gly Glu Val Ala Met Gly Asp 
    210                 215                 220                 


Gly Ser Arg His Glu Val Ala Ala Thr Gly Gly Gly Pro Ala Val Asp 
225                 230                 235                 240 


Leu Ala Ala Glu Ser Val Ala Glu Gly Gly Gln Ser Leu Ile Phe Ala 
                245                 250                 255     


Asp Thr Arg Ala Arg Ser Ala Ser Leu Ala Ala Lys Ala Ser Ala Val 
            260                 265                 270         


Ile Pro Glu Ala Lys Gly Ala Asp Ala Ala Lys Leu Ala Ala Ala Ala 
        275                 280                 285             


Lys Lys Ile Ile Ser Ser Gly Gly Glu Thr Lys Leu Ala Lys Thr Leu 
    290                 295                 300                 


Ala Glu Leu Val Glu Lys Gly Ala Ala Phe His His Ala Gly Leu Asn 
305                 310                 315                 320 


Gln Asp Cys Arg Ser Val Val Glu Glu Glu Phe Arg Ser Gly Arg Ile 
                325                 330                 335     


Arg Leu Leu Ala Ser Thr Pro Thr Leu Ala Ala Gly Val Asn Leu Pro 
            340                 345                 350         


Ala Arg Arg Val Val Ile Ser Ser Val Met Arg Tyr Asn Ser Ser Ser 
        355                 360                 365             


Gly Met Ser Glu Pro Ile Ser Ile Leu Glu Tyr Lys Gln Leu Cys Gly 
    370                 375                 380                 


Arg Ala Gly Arg Pro Gln Tyr Asp Lys Ser Gly Glu Ala Ile Val Val 
385                 390                 395                 400 


Gly Gly Val Asn Ala Asp Glu Ile Phe Asp Arg Tyr Ile Gly Gly Glu 
                405                 410                 415     


Pro Glu Pro Ile Arg Ser Ala Met Val Asp Asp Arg Ala Leu Arg Ile 
            420                 425                 430         


His Val Leu Ser Leu Val Thr Thr Ser Pro Gly Ile Lys Glu Asp Asp 
        435                 440                 445             


Val Thr Glu Phe Phe Leu Gly Thr Leu Gly Gly Gln Gln Ser Gly Glu 
    450                 455                 460                 


Ser Thr Val Lys Phe Ser Val Ala Val Ala Leu Arg Phe Leu Gln Glu 
465                 470                 475                 480 


Glu Gly Met Leu Gly Arg Arg Gly Gly Arg Leu Ala Ala Thr Lys Met 
                485                 490                 495     


Gly Arg Leu Val Ser Arg Leu Tyr Met Asp Pro Met Thr Ala Val Thr 
            500                 505                 510         


Leu Arg Asp Ala Val Gly Glu Ala Ser Pro Gly Arg Met His Thr Leu 
        515                 520                 525             


Gly Phe Leu His Leu Val Ser Glu Cys Ser Glu Phe Met Pro Arg Phe 
    530                 535                 540                 


Ala Leu Arg Gln Lys Asp His Glu Val Ala Glu Met Met Leu Glu Ala 
545                 550                 555                 560 


Gly Arg Gly Glu Leu Leu Arg Pro Val Tyr Ser Tyr Glu Cys Gly Arg 
                565                 570                 575     


Gly Leu Leu Ala Leu His Arg Trp Ile Gly Glu Ser Pro Glu Ala Lys 
            580                 585                 590         


Leu Ala Glu Asp Leu Lys Phe Glu Ser Gly Asp Val His Arg Met Val 
        595                 600                 605             


Glu Ser Ser Gly Trp Leu Leu Arg Cys Ile Trp Glu Ile Ser Lys His 
    610                 615                 620                 


Gln Glu Arg Pro Asp Leu Leu Gly Glu Leu Asp Val Leu Arg Ser Arg 
625                 630                 635                 640 


Val Ala Tyr Gly Ile Lys Ala Glu Leu Val Pro Leu Val Ser Ile Lys 
                645                 650                 655     


Gly Ile Gly Arg Val Arg Ser Arg Arg Leu Phe Arg Gly Gly Ile Lys 
            660                 665                 670         


Gly Pro Gly Asp Leu Ala Ala Val Pro Val Glu Arg Leu Ser Arg Val 
        675                 680                 685             


Glu Gly Ile Gly Ala Thr Leu Ala Asn Asn Ile Lys Ser Gln Leu Arg 
    690                 695                 700                 


Lys Gly Gly 
705         


<210>  20
<211>  720
<212>  PRT
<213>  Thermococcus gammatolerans

<400>  20

Met Lys Val Asp Glu Leu Pro Val Asp Glu Arg Leu Lys Ala Val Leu 
1               5                   10                  15      


Lys Glu Arg Gly Ile Glu Glu Leu Tyr Pro Pro Gln Ala Glu Ala Leu 
            20                  25                  30          


Lys Ser Gly Ala Leu Glu Gly Arg Asn Leu Val Leu Ala Ile Pro Thr 
        35                  40                  45              


Ala Ser Gly Lys Thr Leu Val Ser Glu Ile Val Met Val Asn Lys Leu 
    50                  55                  60                  


Ile Gln Glu Gly Gly Lys Ala Val Tyr Leu Val Pro Leu Lys Ala Leu 
65                  70                  75                  80  


Ala Glu Glu Lys Tyr Arg Glu Phe Lys Glu Trp Glu Lys Leu Gly Leu 
                85                  90                  95      


Lys Val Ala Ala Thr Thr Gly Asp Tyr Asp Ser Thr Asp Asp Trp Leu 
            100                 105                 110         


Gly Arg Tyr Asp Ile Ile Val Ala Thr Ala Glu Lys Phe Asp Ser Leu 
        115                 120                 125             


Leu Arg His Gly Ala Arg Trp Ile Asn Asp Val Lys Leu Val Val Ala 
    130                 135                 140                 


Asp Glu Val His Leu Ile Gly Ser Tyr Asp Arg Gly Ala Thr Leu Glu 
145                 150                 155                 160 


Met Ile Leu Thr His Met Leu Gly Arg Ala Gln Ile Leu Ala Leu Ser 
                165                 170                 175     


Ala Thr Val Gly Asn Ala Glu Glu Leu Ala Glu Trp Leu Asp Ala Ser 
            180                 185                 190         


Leu Val Val Ser Asp Trp Arg Pro Val Gln Leu Arg Arg Gly Val Phe 
        195                 200                 205             


His Leu Gly Thr Leu Ile Trp Glu Asp Gly Lys Val Glu Ser Tyr Pro 
    210                 215                 220                 


Glu Asn Trp Tyr Ser Leu Val Val Asp Ala Val Lys Arg Gly Lys Gly 
225                 230                 235                 240 


Ala Leu Val Phe Val Asn Thr Arg Arg Ser Ala Glu Lys Glu Ala Leu 
                245                 250                 255     


Ala Leu Ser Lys Leu Val Ser Ser His Leu Thr Lys Pro Glu Lys Arg 
            260                 265                 270         


Ala Leu Glu Ser Leu Ala Ser Gln Leu Glu Asp Asn Pro Thr Ser Glu 
        275                 280                 285             


Lys Leu Lys Arg Ala Leu Arg Gly Gly Val Ala Phe His His Ala Gly 
    290                 295                 300                 


Leu Ser Arg Val Glu Arg Thr Leu Ile Glu Asp Ala Phe Arg Glu Gly 
305                 310                 315                 320 


Leu Ile Lys Val Ile Thr Ala Thr Pro Thr Leu Ser Ala Gly Val Asn 
                325                 330                 335     


Leu Pro Ser Phe Arg Val Ile Ile Arg Asp Thr Lys Arg Tyr Ala Gly 
            340                 345                 350         


Phe Gly Trp Thr Asp Ile Pro Val Leu Glu Ile Gln Gln Met Met Gly 
        355                 360                 365             


Arg Ala Gly Arg Pro Arg Tyr Asp Lys Tyr Gly Glu Ala Ile Ile Val 
    370                 375                 380                 


Ala Arg Thr Asp Glu Pro Gly Lys Leu Met Glu Arg Tyr Ile Arg Gly 
385                 390                 395                 400 


Lys Pro Glu Lys Leu Phe Ser Met Leu Ala Asn Glu Gln Ala Phe Arg 
                405                 410                 415     


Ser Gln Val Leu Ala Leu Ile Thr Asn Phe Gly Ile Arg Ser Phe Pro 
            420                 425                 430         


Glu Leu Val Arg Phe Leu Glu Arg Thr Phe Tyr Ala His Gln Arg Lys 
        435                 440                 445             


Asp Leu Ser Ser Leu Glu Tyr Lys Ala Lys Glu Val Val Tyr Phe Leu 
    450                 455                 460                 


Ile Glu Asn Glu Phe Ile Asp Leu Asp Leu Glu Asp Arg Phe Ile Pro 
465                 470                 475                 480 


Leu Pro Phe Gly Lys Arg Thr Ser Gln Leu Tyr Ile Asp Pro Leu Thr 
                485                 490                 495     


Ala Lys Lys Phe Lys Asp Ala Phe Pro Ala Ile Glu Arg Asn Pro Asn 
            500                 505                 510         


Pro Phe Gly Ile Phe Gln Leu Ile Ala Ser Thr Pro Asp Met Ala Thr 
        515                 520                 525             


Leu Thr Ala Arg Arg Arg Glu Met Glu Asp Tyr Leu Asp Leu Ala Tyr 
    530                 535                 540                 


Glu Leu Glu Asp Lys Leu Tyr Ala Ser Ile Pro Tyr Tyr Glu Asp Ser 
545                 550                 555                 560 


Arg Phe Gln Gly Phe Leu Gly Gln Val Lys Thr Ala Lys Val Leu Leu 
                565                 570                 575     


Asp Trp Ile Asn Glu Val Pro Glu Ala Arg Ile Tyr Glu Thr Tyr Ser 
            580                 585                 590         


Ile Asp Pro Gly Asp Leu Tyr Arg Leu Leu Glu Leu Ala Asp Trp Leu 
        595                 600                 605             


Met Tyr Ser Leu Ile Glu Leu Tyr Lys Leu Phe Glu Pro Lys Glu Glu 
    610                 615                 620                 


Ile Leu Asn Tyr Leu Arg Asp Leu His Leu Arg Leu Arg His Gly Val 
625                 630                 635                 640 


Arg Glu Glu Leu Leu Glu Leu Val Arg Leu Pro Asn Ile Gly Arg Lys 
                645                 650                 655     


Arg Ala Arg Ala Leu Tyr Asn Ala Gly Phe Arg Ser Val Glu Ala Ile 
            660                 665                 670         


Ala Asn Ala Lys Pro Ala Glu Leu Leu Ala Val Glu Gly Ile Gly Ala 
        675                 680                 685             


Lys Ile Leu Asp Gly Ile Tyr Arg His Leu Gly Ile Glu Lys Arg Val 
    690                 695                 700                 


Thr Glu Glu Lys Pro Lys Arg Lys Gly Thr Leu Glu Asp Phe Leu Arg 
705                 710                 715                 720 


<210>  21
<211>  799
<212>  PRT
<213>  Methanospirillum hungatei

<400>  21

Met Glu Ile Ala Ser Leu Pro Leu Pro Asp Ser Phe Ile Arg Ala Cys 
1               5                   10                  15      


His Ala Lys Gly Ile Arg Ser Leu Tyr Pro Pro Gln Ala Glu Cys Ile 
            20                  25                  30          


Glu Lys Gly Leu Leu Glu Gly Lys Asn Leu Leu Ile Ser Ile Pro Thr 
        35                  40                  45              


Ala Ser Gly Lys Thr Leu Leu Ala Glu Met Ala Met Trp Ser Arg Ile 
    50                  55                  60                  


Ala Ala Gly Gly Lys Cys Leu Tyr Ile Val Pro Leu Arg Ala Leu Ala 
65                  70                  75                  80  


Ser Glu Lys Tyr Asp Glu Phe Ser Lys Lys Gly Val Ile Arg Val Gly 
                85                  90                  95      


Ile Ala Thr Gly Asp Leu Asp Arg Thr Asp Ala Tyr Leu Gly Glu Asn 
            100                 105                 110         


Asp Ile Ile Val Ala Thr Ser Glu Lys Thr Asp Ser Leu Leu Arg Asn 
        115                 120                 125             


Arg Thr Pro Trp Leu Ser Gln Ile Thr Cys Ile Val Leu Asp Glu Val 
    130                 135                 140                 


His Leu Ile Gly Ser Glu Asn Arg Gly Ala Thr Leu Glu Met Val Ile 
145                 150                 155                 160 


Thr Lys Leu Arg Tyr Thr Asn Pro Val Met Gln Ile Ile Gly Leu Ser 
                165                 170                 175     


Ala Thr Ile Gly Asn Pro Ala Gln Leu Ala Glu Trp Leu Asp Ala Thr 
            180                 185                 190         


Leu Ile Thr Ser Thr Trp Arg Pro Val Asp Leu Arg Gln Gly Val Tyr 
        195                 200                 205             


Tyr Asn Gly Lys Ile Arg Phe Ser Asp Ser Glu Arg Pro Ile Gln Gly 
    210                 215                 220                 


Lys Thr Lys His Asp Asp Leu Asn Leu Cys Leu Asp Thr Ile Glu Glu 
225                 230                 235                 240 


Gly Gly Gln Cys Leu Val Phe Val Ser Ser Arg Arg Asn Ala Glu Gly 
                245                 250                 255     


Phe Ala Lys Lys Ala Ala Gly Ala Leu Lys Ala Gly Ser Pro Asp Ser 
            260                 265                 270         


Lys Ala Leu Ala Gln Glu Leu Arg Arg Leu Arg Asp Arg Asp Glu Gly 
        275                 280                 285             


Asn Val Leu Ala Asp Cys Val Glu Arg Gly Ala Ala Phe His His Ala 
    290                 295                 300                 


Gly Leu Ile Arg Gln Glu Arg Thr Ile Ile Glu Glu Gly Phe Arg Asn 
305                 310                 315                 320 


Gly Tyr Ile Glu Val Ile Ala Ala Thr Pro Thr Leu Ala Ala Gly Leu 
                325                 330                 335     


Asn Leu Pro Ala Arg Arg Val Ile Ile Arg Asp Tyr Asn Arg Phe Ala 
            340                 345                 350         


Ser Gly Leu Gly Met Val Pro Ile Pro Val Gly Glu Tyr His Gln Met 
        355                 360                 365             


Ala Gly Arg Ala Gly Arg Pro His Leu Asp Pro Tyr Gly Glu Ala Val 
    370                 375                 380                 


Leu Leu Ala Lys Asp Ala Pro Ser Val Glu Arg Leu Phe Glu Thr Phe 
385                 390                 395                 400 


Ile Asp Ala Glu Ala Glu Arg Val Asp Ser Gln Cys Val Asp Asp Ala 
                405                 410                 415     


Ser Leu Cys Ala His Ile Leu Ser Leu Ile Ala Thr Gly Phe Ala His 
            420                 425                 430         


Asp Gln Glu Ala Leu Ser Ser Phe Met Glu Arg Thr Phe Tyr Phe Phe 
        435                 440                 445             


Gln His Pro Lys Thr Arg Ser Leu Pro Arg Leu Val Ala Asp Ala Ile 
    450                 455                 460                 


Arg Phe Leu Thr Thr Ala Gly Met Val Glu Glu Arg Glu Asn Thr Leu 
465                 470                 475                 480 


Ser Ala Thr Arg Leu Gly Ser Leu Val Ser Arg Leu Tyr Leu Asn Pro 
                485                 490                 495     


Cys Thr Ala Arg Leu Ile Leu Asp Ser Leu Lys Ser Cys Lys Thr Pro 
            500                 505                 510         


Thr Leu Ile Gly Leu Leu His Val Ile Cys Val Ser Pro Asp Met Gln 
        515                 520                 525             


Arg Leu Tyr Leu Lys Ala Ala Asp Thr Gln Leu Leu Arg Thr Phe Leu 
    530                 535                 540                 


Phe Lys His Lys Asp Asp Leu Ile Leu Pro Leu Pro Phe Glu Gln Glu 
545                 550                 555                 560 


Glu Glu Glu Leu Trp Leu Ser Gly Leu Lys Thr Ala Leu Val Leu Thr 
                565                 570                 575     


Asp Trp Ala Asp Glu Phe Ser Glu Gly Met Ile Glu Glu Arg Tyr Gly 
            580                 585                 590         


Ile Gly Ala Gly Asp Leu Tyr Asn Ile Val Asp Ser Gly Lys Trp Leu 
        595                 600                 605             


Leu His Gly Thr Glu Arg Leu Val Ser Val Glu Met Pro Glu Met Ser 
    610                 615                 620                 


Gln Val Val Lys Thr Leu Ser Val Arg Val His His Gly Val Lys Ser 
625                 630                 635                 640 


Glu Leu Leu Pro Leu Val Ala Leu Arg Asn Ile Gly Arg Val Arg Ala 
                645                 650                 655     


Arg Thr Leu Tyr Asn Ala Gly Tyr Pro Asp Pro Glu Ala Val Ala Arg 
            660                 665                 670         


Ala Gly Leu Ser Thr Ile Ala Arg Ile Ile Gly Glu Gly Ile Ala Arg 
        675                 680                 685             


Gln Val Ile Asp Glu Ile Thr Gly Val Lys Arg Ser Gly Ile His Ser 
    690                 695                 700                 


Ser Asp Asp Asp Tyr Gln Gln Lys Thr Pro Glu Leu Leu Thr Asp Ile 
705                 710                 715                 720 


Pro Gly Ile Gly Lys Lys Met Ala Glu Lys Leu Gln Asn Ala Gly Ile 
                725                 730                 735     


Ile Thr Val Ser Asp Leu Leu Thr Ala Asp Glu Val Leu Leu Ser Asp 
            740                 745                 750         


Val Leu Gly Ala Ala Arg Ala Arg Lys Val Leu Ala Phe Leu Ser Asn 
        755                 760                 765             


Ser Glu Lys Glu Asn Ser Ser Ser Asp Lys Thr Glu Glu Ile Pro Asp 
    770                 775                 780                 


Thr Gln Lys Ile Arg Gly Gln Ser Ser Trp Glu Asp Phe Gly Cys 
785                 790                 795                 


<210>  22
<211>  1756
<212>  PRT
<213>  Escherichia coli

<400>  22

Met Met Ser Ile Ala Gln Val Arg Ser Ala Gly Ser Ala Gly Asn Tyr 
1               5                   10                  15      


Tyr Thr Asp Lys Asp Asn Tyr Tyr Val Leu Gly Ser Met Gly Glu Arg 
            20                  25                  30          


Trp Ala Gly Lys Gly Ala Glu Gln Leu Gly Leu Gln Gly Ser Val Asp 
        35                  40                  45              


Lys Asp Val Phe Thr Arg Leu Leu Glu Gly Arg Leu Pro Asp Gly Ala 
    50                  55                  60                  


Asp Leu Ser Arg Met Gln Asp Gly Ser Asn Lys His Arg Pro Gly Tyr 
65                  70                  75                  80  


Asp Leu Thr Phe Ser Ala Pro Lys Ser Val Ser Met Met Ala Met Leu 
                85                  90                  95      


Gly Gly Asp Lys Arg Leu Ile Asp Ala His Asn Gln Ala Val Asp Phe 
            100                 105                 110         


Ala Val Arg Gln Val Glu Ala Leu Ala Ser Thr Arg Val Met Thr Asp 
        115                 120                 125             


Gly Gln Ser Glu Thr Val Leu Thr Gly Asn Leu Val Met Ala Leu Phe 
    130                 135                 140                 


Asn His Asp Thr Ser Arg Asp Gln Glu Pro Gln Leu His Thr His Ala 
145                 150                 155                 160 


Val Val Ala Asn Val Thr Gln His Asn Gly Glu Trp Lys Thr Leu Ser 
                165                 170                 175     


Ser Asp Lys Val Gly Lys Thr Gly Phe Ile Glu Asn Val Tyr Ala Asn 
            180                 185                 190         


Gln Ile Ala Phe Gly Arg Leu Tyr Arg Glu Lys Leu Lys Glu Gln Val 
        195                 200                 205             


Glu Ala Leu Gly Tyr Glu Thr Glu Val Val Gly Lys His Gly Met Trp 
    210                 215                 220                 


Glu Met Pro Gly Val Pro Val Glu Ala Phe Ser Gly Arg Ser Gln Ala 
225                 230                 235                 240 


Ile Arg Glu Ala Val Gly Glu Asp Ala Ser Leu Lys Ser Arg Asp Val 
                245                 250                 255     


Ala Ala Leu Asp Thr Arg Lys Ser Lys Gln His Val Asp Pro Glu Ile 
            260                 265                 270         


Arg Met Ala Glu Trp Met Gln Thr Leu Lys Glu Thr Gly Phe Asp Ile 
        275                 280                 285             


Arg Ala Tyr Arg Asp Ala Ala Asp Gln Arg Thr Glu Ile Arg Thr Gln 
    290                 295                 300                 


Ala Pro Gly Pro Ala Ser Gln Asp Gly Pro Asp Val Gln Gln Ala Val 
305                 310                 315                 320 


Thr Gln Ala Ile Ala Gly Leu Ser Glu Arg Lys Val Gln Phe Thr Tyr 
                325                 330                 335     


Thr Asp Val Leu Ala Arg Thr Val Gly Ile Leu Pro Pro Glu Asn Gly 
            340                 345                 350         


Val Ile Glu Arg Ala Arg Ala Gly Ile Asp Glu Ala Ile Ser Arg Glu 
        355                 360                 365             


Gln Leu Ile Pro Leu Asp Arg Glu Lys Gly Leu Phe Thr Ser Gly Ile 
    370                 375                 380                 


His Val Leu Asp Glu Leu Ser Val Arg Ala Leu Ser Arg Asp Ile Met 
385                 390                 395                 400 


Lys Gln Asn Arg Val Thr Val His Pro Glu Lys Ser Val Pro Arg Thr 
                405                 410                 415     


Ala Gly Tyr Ser Asp Ala Val Ser Val Leu Ala Gln Asp Arg Pro Ser 
            420                 425                 430         


Leu Ala Ile Val Ser Gly Gln Gly Gly Ala Ala Gly Gln Arg Glu Arg 
        435                 440                 445             


Val Ala Glu Leu Val Met Met Ala Arg Glu Gln Gly Arg Glu Val Gln 
    450                 455                 460                 


Ile Ile Ala Ala Asp Arg Arg Ser Gln Met Asn Leu Lys Gln Asp Glu 
465                 470                 475                 480 


Arg Leu Ser Gly Glu Leu Ile Thr Gly Arg Arg Gln Leu Leu Glu Gly 
                485                 490                 495     


Met Ala Phe Thr Pro Gly Ser Thr Val Ile Val Asp Gln Gly Glu Lys 
            500                 505                 510         


Leu Ser Leu Lys Glu Thr Leu Thr Leu Leu Asp Gly Ala Ala Arg His 
        515                 520                 525             


Asn Val Gln Val Leu Ile Thr Asp Ser Gly Gln Arg Thr Gly Thr Gly 
    530                 535                 540                 


Ser Ala Leu Met Ala Met Lys Asp Ala Gly Val Asn Thr Tyr Arg Trp 
545                 550                 555                 560 


Gln Gly Gly Glu Gln Arg Pro Ala Thr Ile Ile Ser Glu Pro Asp Arg 
                565                 570                 575     


Asn Val Arg Tyr Ala Arg Leu Ala Gly Asp Phe Ala Ala Ser Val Lys 
            580                 585                 590         


Ala Gly Glu Glu Ser Val Ala Gln Val Ser Gly Val Arg Glu Gln Ala 
        595                 600                 605             


Ile Leu Thr Gln Ala Ile Arg Ser Glu Leu Lys Thr Gln Gly Val Leu 
    610                 615                 620                 


Gly His Pro Glu Val Thr Met Thr Ala Leu Ser Pro Val Trp Leu Asp 
625                 630                 635                 640 


Ser Arg Ser Arg Tyr Leu Arg Asp Met Tyr Arg Pro Gly Met Val Met 
                645                 650                 655     


Glu Gln Trp Asn Pro Glu Thr Arg Ser His Asp Arg Tyr Val Ile Asp 
            660                 665                 670         


Arg Val Thr Ala Gln Ser His Ser Leu Thr Leu Arg Asp Ala Gln Gly 
        675                 680                 685             


Glu Thr Gln Val Val Arg Ile Ser Ser Leu Asp Ser Ser Trp Ser Leu 
    690                 695                 700                 


Phe Arg Pro Glu Lys Met Pro Val Ala Asp Gly Glu Arg Leu Arg Val 
705                 710                 715                 720 


Thr Gly Lys Ile Pro Gly Leu Arg Val Ser Gly Gly Asp Arg Leu Gln 
                725                 730                 735     


Val Ala Ser Val Ser Glu Asp Ala Met Thr Val Val Val Pro Gly Arg 
            740                 745                 750         


Ala Glu Pro Ala Ser Leu Pro Val Ser Asp Ser Pro Phe Thr Ala Leu 
        755                 760                 765             


Lys Leu Glu Asn Gly Trp Val Glu Thr Pro Gly His Ser Val Ser Asp 
    770                 775                 780                 


Ser Ala Thr Val Phe Ala Ser Val Thr Gln Met Ala Met Asp Asn Ala 
785                 790                 795                 800 


Thr Leu Asn Gly Leu Ala Arg Ser Gly Arg Asp Val Arg Leu Tyr Ser 
                805                 810                 815     


Ser Leu Asp Glu Thr Arg Thr Ala Glu Lys Leu Ala Arg His Pro Ser 
            820                 825                 830         


Phe Thr Val Val Ser Glu Gln Ile Lys Ala Arg Ala Gly Glu Thr Leu 
        835                 840                 845             


Leu Glu Thr Ala Ile Ser Leu Gln Lys Ala Gly Leu His Thr Pro Ala 
    850                 855                 860                 


Gln Gln Ala Ile His Leu Ala Leu Pro Val Leu Glu Ser Lys Asn Leu 
865                 870                 875                 880 


Ala Phe Ser Met Val Asp Leu Leu Thr Glu Ala Lys Ser Phe Ala Ala 
                885                 890                 895     


Glu Gly Thr Gly Phe Thr Glu Leu Gly Gly Glu Ile Asn Ala Gln Ile 
            900                 905                 910         


Lys Arg Gly Asp Leu Leu Tyr Val Asp Val Ala Lys Gly Tyr Gly Thr 
        915                 920                 925             


Gly Leu Leu Val Ser Arg Ala Ser Tyr Glu Ala Glu Lys Ser Ile Leu 
    930                 935                 940                 


Arg His Ile Leu Glu Gly Lys Glu Ala Val Thr Pro Leu Met Glu Arg 
945                 950                 955                 960 


Val Pro Gly Glu Leu Met Glu Thr Leu Thr Ser Gly Gln Arg Ala Ala 
                965                 970                 975     


Thr Arg Met Ile Leu Glu Thr Ser Asp Arg Phe Thr Val Val Gln Gly 
            980                 985                 990         


Tyr Ala Gly Val Gly Lys Thr Thr  Gln Phe Arg Ala Val  Met Ser Ala 
        995                 1000                 1005             


Val Asn  Met Leu Pro Ala Ser  Glu Arg Pro Arg Val  Val Gly Leu 
    1010                 1015                 1020             


Gly Pro  Thr His Arg Ala Val  Gly Glu Met Arg Ser  Ala Gly Val 
    1025                 1030                 1035             


Asp Ala  Gln Thr Leu Ala Ser  Phe Leu His Asp Thr  Gln Leu Gln 
    1040                 1045                 1050             


Gln Arg  Ser Gly Glu Thr Pro  Asp Phe Ser Asn Thr  Leu Phe Leu 
    1055                 1060                 1065             


Leu Asp  Glu Ser Ser Met Val  Gly Asn Thr Glu Met  Ala Arg Ala 
    1070                 1075                 1080             


Tyr Ala  Leu Ile Ala Ala Gly  Gly Gly Arg Ala Val  Ala Ser Gly 
    1085                 1090                 1095             


Asp Thr  Asp Gln Leu Gln Ala  Ile Ala Pro Gly Gln  Ser Phe Arg 
    1100                 1105                 1110             


Leu Gln  Gln Thr Arg Ser Ala  Ala Asp Val Val Ile  Met Lys Glu 
    1115                 1120                 1125             


Ile Val  Arg Gln Thr Pro Glu  Leu Arg Glu Ala Val  Tyr Ser Leu 
    1130                 1135                 1140             


Ile Asn  Arg Asp Val Glu Arg  Ala Leu Ser Gly Leu  Glu Ser Val 
    1145                 1150                 1155             


Lys Pro  Ser Gln Val Pro Arg  Leu Glu Gly Ala Trp  Ala Pro Glu 
    1160                 1165                 1170             


His Ser  Val Thr Glu Phe Ser  His Ser Gln Glu Ala  Lys Leu Ala 
    1175                 1180                 1185             


Glu Ala  Gln Gln Lys Ala Met  Leu Lys Gly Glu Ala  Phe Pro Asp 
    1190                 1195                 1200             


Ile Pro  Met Thr Leu Tyr Glu  Ala Ile Val Arg Asp  Tyr Thr Gly 
    1205                 1210                 1215             


Arg Thr  Pro Glu Ala Arg Glu  Gln Thr Leu Ile Val  Thr His Leu 
    1220                 1225                 1230             


Asn Glu  Asp Arg Arg Val Leu  Asn Ser Met Ile His  Asp Ala Arg 
    1235                 1240                 1245             


Glu Lys  Ala Gly Glu Leu Gly  Lys Glu Gln Val Met  Val Pro Val 
    1250                 1255                 1260             


Leu Asn  Thr Ala Asn Ile Arg  Asp Gly Glu Leu Arg  Arg Leu Ser 
    1265                 1270                 1275             


Thr Trp  Glu Lys Asn Pro Asp  Ala Leu Ala Leu Val  Asp Asn Val 
    1280                 1285                 1290             


Tyr His  Arg Ile Ala Gly Ile  Ser Lys Asp Asp Gly  Leu Ile Thr 
    1295                 1300                 1305             


Leu Gln  Asp Ala Glu Gly Asn  Thr Arg Leu Ile Ser  Pro Arg Glu 
    1310                 1315                 1320             


Ala Val  Ala Glu Gly Val Thr  Leu Tyr Thr Pro Asp  Lys Ile Arg 
    1325                 1330                 1335             


Val Gly  Thr Gly Asp Arg Met  Arg Phe Thr Lys Ser  Asp Arg Glu 
    1340                 1345                 1350             


Arg Gly  Tyr Val Ala Asn Ser  Val Trp Thr Val Thr  Ala Val Ser 
    1355                 1360                 1365             


Gly Asp  Ser Val Thr Leu Ser  Asp Gly Gln Gln Thr  Arg Val Ile 
    1370                 1375                 1380             


Arg Pro  Gly Gln Glu Arg Ala  Glu Gln His Ile Asp  Leu Ala Tyr 
    1385                 1390                 1395             


Ala Ile  Thr Ala His Gly Ala  Gln Gly Ala Ser Glu  Thr Phe Ala 
    1400                 1405                 1410             


Ile Ala  Leu Glu Gly Thr Glu  Gly Asn Arg Lys Leu  Met Ala Gly 
    1415                 1420                 1425             


Phe Glu  Ser Ala Tyr Val Ala  Leu Ser Arg Met Lys  Gln His Val 
    1430                 1435                 1440             


Gln Val  Tyr Thr Asp Asn Arg  Gln Gly Trp Thr Asp  Ala Ile Asn 
    1445                 1450                 1455             


Asn Ala  Val Gln Lys Gly Thr  Ala His Asp Val Leu  Glu Pro Lys 
    1460                 1465                 1470             


Pro Asp  Arg Glu Val Met Asn  Ala Gln Arg Leu Phe  Ser Thr Ala 
    1475                 1480                 1485             


Arg Glu  Leu Arg Asp Val Ala  Ala Gly Arg Ala Val  Leu Arg Gln 
    1490                 1495                 1500             


Ala Gly  Leu Ala Gly Gly Asp  Ser Pro Ala Arg Phe  Ile Ala Pro 
    1505                 1510                 1515             


Gly Arg  Lys Tyr Pro Gln Pro  Tyr Val Ala Leu Pro  Ala Phe Asp 
    1520                 1525                 1530             


Arg Asn  Gly Lys Ser Ala Gly  Ile Trp Leu Asn Pro  Leu Thr Thr 
    1535                 1540                 1545             


Asp Asp  Gly Asn Gly Leu Arg  Gly Phe Ser Gly Glu  Gly Arg Val 
    1550                 1555                 1560             


Lys Gly  Ser Gly Asp Ala Gln  Phe Val Ala Leu Gln  Gly Ser Arg 
    1565                 1570                 1575             


Asn Gly  Glu Ser Leu Leu Ala  Asp Asn Met Gln Asp  Gly Val Arg 
    1580                 1585                 1590             


Ile Ala  Arg Asp Asn Pro Asp  Ser Gly Val Val Val  Arg Ile Ala 
    1595                 1600                 1605             


Gly Glu  Gly Arg Pro Trp Asn  Pro Gly Ala Ile Thr  Gly Gly Arg 
    1610                 1615                 1620             


Val Trp  Gly Asp Ile Pro Asp  Asn Ser Val Gln Pro  Gly Ala Gly 
    1625                 1630                 1635             


Asn Gly  Glu Pro Val Thr Ala  Glu Val Leu Ala Gln  Arg Gln Ala 
    1640                 1645                 1650             


Glu Glu  Ala Ile Arg Arg Glu  Thr Glu Arg Arg Ala  Asp Glu Ile 
    1655                 1660                 1665             


Val Arg  Lys Met Ala Glu Asn  Lys Pro Asp Leu Pro  Asp Gly Lys 
    1670                 1675                 1680             


Thr Glu  Leu Ala Val Arg Asp  Ile Ala Gly Gln Glu  Arg Asp Arg 
    1685                 1690                 1695             


Ser Ala  Ile Ser Glu Arg Glu  Thr Ala Leu Pro Glu  Ser Val Leu 
    1700                 1705                 1710             


Arg Glu  Ser Gln Arg Glu Arg  Glu Ala Val Arg Glu  Val Ala Arg 
    1715                 1720                 1725             


Glu Asn  Leu Leu Gln Glu Arg  Leu Gln Gln Met Glu  Arg Asp Met 
    1730                 1735                 1740             


Val Arg  Asp Leu Gln Lys Glu  Lys Thr Leu Gly Gly  Asp 
    1745                 1750                 1755     


<210>  23
<211>  726
<212>  PRT
<213>  Methanococcoides burtonii

<400>  23

Met Ser Asp Lys Pro Ala Phe Met Lys Tyr Phe Thr Gln Ser Ser Cys 
1               5                   10                  15      


Tyr Pro Asn Gln Gln Glu Ala Met Asp Arg Ile His Ser Ala Leu Met 
            20                  25                  30          


Gln Gln Gln Leu Val Leu Phe Glu Gly Ala Cys Gly Thr Gly Lys Thr 
        35                  40                  45              


Leu Ser Ala Leu Val Pro Ala Leu His Val Gly Lys Met Leu Gly Lys 
    50                  55                  60                  


Thr Val Ile Ile Ala Thr Asn Val His Gln Gln Met Val Gln Phe Ile 
65                  70                  75                  80  


Asn Glu Ala Arg Asp Ile Lys Lys Val Gln Asp Val Lys Val Ala Val 
                85                  90                  95      


Ile Lys Gly Lys Thr Ala Met Cys Pro Gln Glu Ala Asp Tyr Glu Glu 
            100                 105                 110         


Cys Ser Val Lys Arg Glu Asn Thr Phe Glu Leu Met Glu Thr Glu Arg 
        115                 120                 125             


Glu Ile Tyr Leu Lys Arg Gln Glu Leu Asn Ser Ala Arg Asp Ser Tyr 
    130                 135                 140                 


Lys Lys Ser His Asp Pro Ala Phe Val Thr Leu Arg Asp Glu Leu Ser 
145                 150                 155                 160 


Lys Glu Ile Asp Ala Val Glu Glu Lys Ala Arg Gly Leu Arg Asp Arg 
                165                 170                 175     


Ala Cys Asn Asp Leu Tyr Glu Val Leu Arg Ser Asp Ser Glu Lys Phe 
            180                 185                 190         


Arg Glu Trp Leu Tyr Lys Glu Val Arg Ser Pro Glu Glu Ile Asn Asp 
        195                 200                 205             


His Ala Ile Lys Asp Gly Met Cys Gly Tyr Glu Leu Val Lys Arg Glu 
    210                 215                 220                 


Leu Lys His Ala Asp Leu Leu Ile Cys Asn Tyr His His Val Leu Asn 
225                 230                 235                 240 


Pro Asp Ile Phe Ser Thr Val Leu Gly Trp Ile Glu Lys Glu Pro Gln 
                245                 250                 255     


Glu Thr Ile Val Ile Phe Asp Glu Ala His Asn Leu Glu Ser Ala Ala 
            260                 265                 270         


Arg Ser His Ser Ser Leu Ser Leu Thr Glu His Ser Ile Glu Lys Ala 
        275                 280                 285             


Ile Thr Glu Leu Glu Ala Asn Leu Asp Leu Leu Ala Asp Asp Asn Ile 
    290                 295                 300                 


His Asn Leu Phe Asn Ile Phe Leu Glu Val Ile Ser Asp Thr Tyr Asn 
305                 310                 315                 320 


Ser Arg Phe Lys Phe Gly Glu Arg Glu Arg Val Arg Lys Asn Trp Tyr 
                325                 330                 335     


Asp Ile Arg Ile Ser Asp Pro Tyr Glu Arg Asn Asp Ile Val Arg Gly 
            340                 345                 350         


Lys Phe Leu Arg Gln Ala Lys Gly Asp Phe Gly Glu Lys Asp Asp Ile 
        355                 360                 365             


Gln Ile Leu Leu Ser Glu Ala Ser Glu Leu Gly Ala Lys Leu Asp Glu 
    370                 375                 380                 


Thr Tyr Arg Asp Gln Tyr Lys Lys Gly Leu Ser Ser Val Met Lys Arg 
385                 390                 395                 400 


Ser His Ile Arg Tyr Val Ala Asp Phe Met Ser Ala Tyr Ile Glu Leu 
                405                 410                 415     


Ser His Asn Leu Asn Tyr Tyr Pro Ile Leu Asn Val Arg Arg Asp Met 
            420                 425                 430         


Asn Asp Glu Ile Tyr Gly Arg Val Glu Leu Phe Thr Cys Ile Pro Lys 
        435                 440                 445             


Asn Val Thr Glu Pro Leu Phe Asn Ser Leu Phe Ser Val Ile Leu Met 
    450                 455                 460                 


Ser Ala Thr Leu His Pro Phe Glu Met Val Lys Lys Thr Leu Gly Ile 
465                 470                 475                 480 


Thr Arg Asp Thr Cys Glu Met Ser Tyr Gly Thr Ser Phe Pro Glu Glu 
                485                 490                 495     


Lys Arg Leu Ser Ile Ala Val Ser Ile Pro Pro Leu Phe Ala Lys Asn 
            500                 505                 510         


Arg Asp Asp Arg His Val Thr Glu Leu Leu Glu Gln Val Leu Leu Asp 
        515                 520                 525             


Ser Ile Glu Asn Ser Lys Gly Asn Val Ile Leu Phe Phe Gln Ser Ala 
    530                 535                 540                 


Phe Glu Ala Lys Arg Tyr Tyr Ser Lys Ile Glu Pro Leu Val Asn Val 
545                 550                 555                 560 


Pro Val Phe Leu Asp Glu Val Gly Ile Ser Ser Gln Asp Val Arg Glu 
                565                 570                 575     


Glu Phe Phe Ser Ile Gly Glu Glu Asn Gly Lys Ala Val Leu Leu Ser 
            580                 585                 590         


Tyr Leu Trp Gly Thr Leu Ser Glu Gly Ile Asp Tyr Arg Asp Gly Arg 
        595                 600                 605             


Gly Arg Thr Val Ile Ile Ile Gly Val Gly Tyr Pro Ala Leu Asn Asp 
    610                 615                 620                 


Arg Met Asn Ala Val Glu Ser Ala Tyr Asp His Val Phe Gly Tyr Gly 
625                 630                 635                 640 


Ala Gly Trp Glu Phe Ala Ile Gln Val Pro Thr Ile Arg Lys Ile Arg 
                645                 650                 655     


Gln Ala Met Gly Arg Val Val Arg Ser Pro Thr Asp Tyr Gly Ala Arg 
            660                 665                 670         


Ile Leu Leu Asp Gly Arg Phe Leu Thr Asp Ser Lys Lys Arg Phe Gly 
        675                 680                 685             


Lys Phe Ser Val Phe Glu Val Phe Pro Pro Ala Glu Arg Ser Glu Phe 
    690                 695                 700                 


Val Asp Val Asp Pro Glu Lys Val Lys Tyr Ser Leu Met Asn Phe Phe 
705                 710                 715                 720 


Met Asp Asn Asp Glu Gln 
                725     


<210>  24
<211>  439
<212>  PRT
<213>  Enterobacteria phage T4

<400>  24

Met Thr Phe Asp Asp Leu Thr Glu Gly Gln Lys Asn Ala Phe Asn Ile 
1               5                   10                  15      


Val Met Lys Ala Ile Lys Glu Lys Lys His His Val Thr Ile Asn Gly 
            20                  25                  30          


Pro Ala Gly Thr Gly Lys Thr Thr Leu Thr Lys Phe Ile Ile Glu Ala 
        35                  40                  45              


Leu Ile Ser Thr Gly Glu Thr Gly Ile Ile Leu Ala Ala Pro Thr His 
    50                  55                  60                  


Ala Ala Lys Lys Ile Leu Ser Lys Leu Ser Gly Lys Glu Ala Ser Thr 
65                  70                  75                  80  


Ile His Ser Ile Leu Lys Ile Asn Pro Val Thr Tyr Glu Glu Asn Val 
                85                  90                  95      


Leu Phe Glu Gln Lys Glu Val Pro Asp Leu Ala Lys Cys Arg Val Leu 
            100                 105                 110         


Ile Cys Asp Glu Val Ser Met Tyr Asp Arg Lys Leu Phe Lys Ile Leu 
        115                 120                 125             


Leu Ser Thr Ile Pro Pro Trp Cys Thr Ile Ile Gly Ile Gly Asp Asn 
    130                 135                 140                 


Lys Gln Ile Arg Pro Val Asp Pro Gly Glu Asn Thr Ala Tyr Ile Ser 
145                 150                 155                 160 


Pro Phe Phe Thr His Lys Asp Phe Tyr Gln Cys Glu Leu Thr Glu Val 
                165                 170                 175     


Lys Arg Ser Asn Ala Pro Ile Ile Asp Val Ala Thr Asp Val Arg Asn 
            180                 185                 190         


Gly Lys Trp Ile Tyr Asp Lys Val Val Asp Gly His Gly Val Arg Gly 
        195                 200                 205             


Phe Thr Gly Asp Thr Ala Leu Arg Asp Phe Met Val Asn Tyr Phe Ser 
    210                 215                 220                 


Ile Val Lys Ser Leu Asp Asp Leu Phe Glu Asn Arg Val Met Ala Phe 
225                 230                 235                 240 


Thr Asn Lys Ser Val Asp Lys Leu Asn Ser Ile Ile Arg Lys Lys Ile 
                245                 250                 255     


Phe Glu Thr Asp Lys Asp Phe Ile Val Gly Glu Ile Ile Val Met Gln 
            260                 265                 270         


Glu Pro Leu Phe Lys Thr Tyr Lys Ile Asp Gly Lys Pro Val Ser Glu 
        275                 280                 285             


Ile Ile Phe Asn Asn Gly Gln Leu Val Arg Ile Ile Glu Ala Glu Tyr 
    290                 295                 300                 


Thr Ser Thr Phe Val Lys Ala Arg Gly Val Pro Gly Glu Tyr Leu Ile 
305                 310                 315                 320 


Arg His Trp Asp Leu Thr Val Glu Thr Tyr Gly Asp Asp Glu Tyr Tyr 
                325                 330                 335     


Arg Glu Lys Ile Lys Ile Ile Ser Ser Asp Glu Glu Leu Tyr Lys Phe 
            340                 345                 350         


Asn Leu Phe Leu Gly Lys Thr Ala Glu Thr Tyr Lys Asn Trp Asn Lys 
        355                 360                 365             


Gly Gly Lys Ala Pro Trp Ser Asp Phe Trp Asp Ala Lys Ser Gln Phe 
    370                 375                 380                 


Ser Lys Val Lys Ala Leu Pro Ala Ser Thr Phe His Lys Ala Gln Gly 
385                 390                 395                 400 


Met Ser Val Asp Arg Ala Phe Ile Tyr Thr Pro Cys Ile His Tyr Ala 
                405                 410                 415     


Asp Val Glu Leu Ala Gln Gln Leu Leu Tyr Val Gly Val Thr Arg Gly 
            420                 425                 430         


Arg Tyr Asp Val Phe Tyr Val 
        435                 


<210>  25
<211>  970
<212>  PRT
<213>  Clostridium botulinum

<400>  25

Met Leu Ser Val Ala Asn Val Arg Ser Pro Ser Ala Ala Ala Ser Tyr 
1               5                   10                  15      


Phe Ala Ser Asp Asn Tyr Tyr Ala Ser Ala Asp Ala Asp Arg Ser Gly 
            20                  25                  30          


Gln Trp Ile Gly Asp Gly Ala Lys Arg Leu Gly Leu Glu Gly Lys Val 
        35                  40                  45              


Glu Ala Arg Ala Phe Asp Ala Leu Leu Arg Gly Glu Leu Pro Asp Gly 
    50                  55                  60                  


Ser Ser Val Gly Asn Pro Gly Gln Ala His Arg Pro Gly Thr Asp Leu 
65                  70                  75                  80  


Thr Phe Ser Val Pro Lys Ser Trp Ser Leu Leu Ala Leu Val Gly Lys 
                85                  90                  95      


Asp Glu Arg Ile Ile Ala Ala Tyr Arg Glu Ala Val Val Glu Ala Leu 
            100                 105                 110         


His Trp Ala Glu Lys Asn Ala Ala Glu Thr Arg Val Val Glu Lys Gly 
        115                 120                 125             


Met Val Val Thr Gln Ala Thr Gly Asn Leu Ala Ile Gly Leu Phe Gln 
    130                 135                 140                 


His Asp Thr Asn Arg Asn Gln Glu Pro Asn Leu His Phe His Ala Val 
145                 150                 155                 160 


Ile Ala Asn Val Thr Gln Gly Lys Asp Gly Lys Trp Arg Thr Leu Lys 
                165                 170                 175     


Asn Asp Arg Leu Trp Gln Leu Asn Thr Thr Leu Asn Ser Ile Ala Met 
            180                 185                 190         


Ala Arg Phe Arg Val Ala Val Glu Lys Leu Gly Tyr Glu Pro Gly Pro 
        195                 200                 205             


Val Leu Lys His Gly Asn Phe Glu Ala Arg Gly Ile Ser Arg Glu Gln 
    210                 215                 220                 


Val Met Ala Phe Ser Thr Arg Arg Lys Glu Val Leu Glu Ala Arg Arg 
225                 230                 235                 240 


Gly Pro Gly Leu Asp Ala Gly Arg Ile Ala Ala Leu Asp Thr Arg Ala 
                245                 250                 255     


Ser Lys Glu Gly Ile Glu Asp Arg Ala Thr Leu Ser Lys Gln Trp Ser 
            260                 265                 270         


Glu Ala Ala Gln Ser Ile Gly Leu Asp Leu Lys Pro Leu Val Asp Arg 
        275                 280                 285             


Ala Arg Thr Lys Ala Leu Gly Gln Gly Met Glu Ala Thr Arg Ile Gly 
    290                 295                 300                 


Ser Leu Val Glu Arg Gly Arg Ala Trp Leu Ser Arg Phe Ala Ala His 
305                 310                 315                 320 


Val Arg Gly Asp Pro Ala Asp Pro Leu Val Pro Pro Ser Val Leu Lys 
                325                 330                 335     


Gln Asp Arg Gln Thr Ile Ala Ala Ala Gln Ala Val Ala Ser Ala Val 
            340                 345                 350         


Arg His Leu Ser Gln Arg Glu Ala Ala Phe Glu Arg Thr Ala Leu Tyr 
        355                 360                 365             


Lys Ala Ala Leu Asp Phe Gly Leu Pro Thr Thr Ile Ala Asp Val Glu 
    370                 375                 380                 


Lys Arg Thr Arg Ala Leu Val Arg Ser Gly Asp Leu Ile Ala Gly Lys 
385                 390                 395                 400 


Gly Glu His Lys Gly Trp Leu Ala Ser Arg Asp Ala Val Val Thr Glu 
                405                 410                 415     


Gln Arg Ile Leu Ser Glu Val Ala Ala Gly Lys Gly Asp Ser Ser Pro 
            420                 425                 430         


Ala Ile Thr Pro Gln Lys Ala Ala Ala Ser Val Gln Ala Ala Ala Leu 
        435                 440                 445             


Thr Gly Gln Gly Phe Arg Leu Asn Glu Gly Gln Leu Ala Ala Ala Arg 
    450                 455                 460                 


Leu Ile Leu Ile Ser Lys Asp Arg Thr Ile Ala Val Gln Gly Ile Ala 
465                 470                 475                 480 


Gly Ala Gly Lys Ser Ser Val Leu Lys Pro Val Ala Glu Val Leu Arg 
                485                 490                 495     


Asp Glu Gly His Pro Val Ile Gly Leu Ala Ile Gln Asn Thr Leu Val 
            500                 505                 510         


Gln Met Leu Glu Arg Asp Thr Gly Ile Gly Ser Gln Thr Leu Ala Arg 
        515                 520                 525             


Phe Leu Gly Gly Trp Asn Lys Leu Leu Asp Asp Pro Gly Asn Val Ala 
    530                 535                 540                 


Leu Arg Ala Glu Ala Gln Ala Ser Leu Lys Asp His Val Leu Val Leu 
545                 550                 555                 560 


Asp Glu Ala Ser Met Val Ser Asn Glu Asp Lys Glu Lys Leu Val Arg 
                565                 570                 575     


Leu Ala Asn Leu Ala Gly Val His Arg Leu Val Leu Ile Gly Asp Arg 
            580                 585                 590         


Lys Gln Leu Gly Ala Val Asp Ala Gly Lys Pro Phe Ala Leu Leu Gln 
        595                 600                 605             


Arg Ala Gly Ile Ala Arg Ala Glu Met Ala Thr Asn Leu Arg Ala Arg 
    610                 615                 620                 


Asp Pro Val Val Arg Glu Ala Gln Ala Ala Ala Gln Ala Gly Asp Val 
625                 630                 635                 640 


Arg Lys Ala Leu Arg His Leu Lys Ser His Thr Val Glu Ala Arg Gly 
                645                 650                 655     


Asp Gly Ala Gln Val Ala Ala Glu Thr Trp Leu Ala Leu Asp Lys Glu 
            660                 665                 670         


Thr Arg Ala Arg Thr Ser Ile Tyr Ala Ser Gly Arg Ala Ile Arg Ser 
        675                 680                 685             


Ala Val Asn Ala Ala Val Gln Gln Gly Leu Leu Ala Ser Arg Glu Ile 
    690                 695                 700                 


Gly Pro Ala Lys Met Lys Leu Glu Val Leu Asp Arg Val Asn Thr Thr 
705                 710                 715                 720 


Arg Glu Glu Leu Arg His Leu Pro Ala Tyr Arg Ala Gly Arg Val Leu 
                725                 730                 735     


Glu Val Ser Arg Lys Gln Gln Ala Leu Gly Leu Phe Ile Gly Glu Tyr 
            740                 745                 750         


Arg Val Ile Gly Gln Asp Arg Lys Gly Lys Leu Val Glu Val Glu Asp 
        755                 760                 765             


Lys Arg Gly Lys Arg Phe Arg Phe Asp Pro Ala Arg Ile Arg Ala Gly 
    770                 775                 780                 


Lys Gly Asp Asp Asn Leu Thr Leu Leu Glu Pro Arg Lys Leu Glu Ile 
785                 790                 795                 800 


His Glu Gly Asp Arg Ile Arg Trp Thr Arg Asn Asp His Arg Arg Gly 
                805                 810                 815     


Leu Phe Asn Ala Asp Gln Ala Arg Val Val Glu Ile Ala Asn Gly Lys 
            820                 825                 830         


Val Thr Phe Glu Thr Ser Lys Gly Asp Leu Val Glu Leu Lys Lys Asp 
        835                 840                 845             


Asp Pro Met Leu Lys Arg Ile Asp Leu Ala Tyr Ala Leu Asn Val His 
    850                 855                 860                 


Met Ala Gln Gly Leu Thr Ser Asp Arg Gly Ile Ala Val Met Asp Ser 
865                 870                 875                 880 


Arg Glu Arg Asn Leu Ser Asn Gln Lys Thr Phe Leu Val Thr Val Thr 
                885                 890                 895     


Arg Leu Arg Asp His Leu Thr Leu Val Val Asp Ser Ala Asp Lys Leu 
            900                 905                 910         


Gly Ala Ala Val Ala Arg Asn Lys Gly Glu Lys Ala Ser Ala Ile Glu 
        915                 920                 925             


Val Thr Gly Ser Val Lys Pro Thr Ala Thr Lys Gly Ser Gly Val Asp 
    930                 935                 940                 


Gln Pro Lys Ser Val Glu Ala Asn Lys Ala Glu Lys Glu Leu Thr Arg 
945                 950                 955                 960 


Ser Lys Ser Lys Thr Leu Asp Phe Gly Ile 
                965                 970 


<210>  26
<211>  50
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Example MuA substrate of the invention.

<400>  26
gttttcgcat ttatcgtgaa acgctttcgc gtttttcgtg cgccgcttca                  50


<210>  27
<211>  50
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Example MuA substrate of the invention.

<400>  27
caaaagcgta aatagcactt tgcgaaagcg caaaaagcac gcggcgaagt                  50


<210>  28
<211>  54
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Example MuA substrate of the invention.

<400>  28
caaaagcgta aatagcactt tgcgaaagcg caaaaagcac gcggcgaagt ctag             54


<210>  29
<211>  100
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 1.

<400>  29
gcgttctgtt tcggatgtat gttttcatac atccgaaaca gaacgctttt gttttcgcat       60

ttatcgtgaa acgctttcgc gtttttcgtg cgccgcttca                            100


<210>  30
<211>  48
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 1.

<400>  30
gaagcggcgc acgaaaaacg cgaaagcgtt tcacgataat gcgaaaac                    48


<210>  31
<211>  48502
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 1.

<400>  31
gggcggcgac ctcgcgggtt ttcgctattt atgaaaattt tccggtttaa ggcgtttccg       60

ttcttcttcg tcataactta atgtttttat ttaaaatacc ctctgaaaag aaaggaaacg      120

acaggtgctg aaagcgaggc tttttggcct ctgtcgtttc ctttctctgt ttttgtccgt      180

ggaatgaaca atggaagtca acaaaaagca gctggctgac attttcggtg cgagtatccg      240

taccattcag aactggcagg aacagggaat gcccgttctg cgaggcggtg gcaagggtaa      300

tgaggtgctt tatgactctg ccgccgtcat aaaatggtat gccgaaaggg atgctgaaat      360

tgagaacgaa aagctgcgcc gggaggttga agaactgcgg caggccagcg aggcagatct      420

ccagccagga actattgagt acgaacgcca tcgacttacg cgtgcgcagg ccgacgcaca      480

ggaactgaag aatgccagag actccgctga agtggtggaa accgcattct gtactttcgt      540

gctgtcgcgg atcgcaggtg aaattgccag tattctcgac gggctccccc tgtcggtgca      600

gcggcgtttt ccggaactgg aaaaccgaca tgttgatttc ctgaaacggg atatcatcaa      660

agccatgaac aaagcagccg cgctggatga actgataccg gggttgctga gtgaatatat      720

cgaacagtca ggttaacagg ctgcggcatt ttgtccgcgc cgggcttcgc tcactgttca      780

ggccggagcc acagaccgcc gttgaatggg cggatgctaa ttactatctc ccgaaagaat      840

ccgcatacca ggaagggcgc tgggaaacac tgccctttca gcgggccatc atgaatgcga      900

tgggcagcga ctacatccgt gaggtgaatg tggtgaagtc tgcccgtgtc ggttattcca      960

aaatgctgct gggtgtttat gcctacttta tagagcataa gcagcgcaac acccttatct     1020

ggttgccgac ggatggtgat gccgagaact ttatgaaaac ccacgttgag ccgactattc     1080

gtgatattcc gtcgctgctg gcgctggccc cgtggtatgg caaaaagcac cgggataaca     1140

cgctcaccat gaagcgtttc actaatgggc gtggcttctg gtgcctgggc ggtaaagcgg     1200

caaaaaacta ccgtgaaaag tcggtggatg tggcgggtta tgatgaactt gctgcttttg     1260

atgatgatat tgaacaggaa ggctctccga cgttcctggg tgacaagcgt attgaaggct     1320

cggtctggcc aaagtccatc cgtggctcca cgccaaaagt gagaggcacc tgtcagattg     1380

agcgtgcagc cagtgaatcc ccgcatttta tgcgttttca tgttgcctgc ccgcattgcg     1440

gggaggagca gtatcttaaa tttggcgaca aagagacgcc gtttggcctc aaatggacgc     1500

cggatgaccc ctccagcgtg ttttatctct gcgagcataa tgcctgcgtc atccgccagc     1560

aggagctgga ctttactgat gcccgttata tctgcgaaaa gaccgggatc tggacccgtg     1620

atggcattct ctggttttcg tcatccggtg aagagattga gccacctgac agtgtgacct     1680

ttcacatctg gacagcgtac agcccgttca ccacctgggt gcagattgtc aaagactgga     1740

tgaaaacgaa aggggatacg ggaaaacgta aaaccttcgt aaacaccacg ctcggtgaga     1800

cgtgggaggc gaaaattggc gaacgtccgg atgctgaagt gatggcagag cggaaagagc     1860

attattcagc gcccgttcct gaccgtgtgg cttacctgac cgccggtatc gactcccagc     1920

tggaccgcta cgaaatgcgc gtatggggat gggggccggg tgaggaaagc tggctgattg     1980

accggcagat tattatgggc cgccacgacg atgaacagac gctgctgcgt gtggatgagg     2040

ccatcaataa aacctatacc cgccggaatg gtgcagaaat gtcgatatcc cgtatctgct     2100

gggatactgg cgggattgac ccgaccattg tgtatgaacg ctcgaaaaaa catgggctgt     2160

tccgggtgat ccccattaaa ggggcatccg tctacggaaa gccggtggcc agcatgccac     2220

gtaagcgaaa caaaaacggg gtttacctta ccgaaatcgg tacggatacc gcgaaagagc     2280

agatttataa ccgcttcaca ctgacgccgg aaggggatga accgcttccc ggtgccgttc     2340

acttcccgaa taacccggat atttttgatc tgaccgaagc gcagcagctg actgctgaag     2400

agcaggtcga aaaatgggtg gatggcagga aaaaaatact gtgggacagc aaaaagcgac     2460

gcaatgaggc actcgactgc ttcgtttatg cgctggcggc gctgcgcatc agtatttccc     2520

gctggcagct ggatctcagt gcgctgctgg cgagcctgca ggaagaggat ggtgcagcaa     2580

ccaacaagaa aacactggca gattacgccc gtgccttatc cggagaggat gaatgacgcg     2640

acaggaagaa cttgccgctg cccgtgcggc actgcatgac ctgatgacag gtaaacgggt     2700

ggcaacagta cagaaagacg gacgaagggt ggagtttacg gccacttccg tgtctgacct     2760

gaaaaaatat attgcagagc tggaagtgca gaccggcatg acacagcgac gcaggggacc     2820

tgcaggattt tatgtatgaa aacgcccacc attcccaccc ttctggggcc ggacggcatg     2880

acatcgctgc gcgaatatgc cggttatcac ggcggtggca gcggatttgg agggcagttg     2940

cggtcgtgga acccaccgag tgaaagtgtg gatgcagccc tgttgcccaa ctttacccgt     3000

ggcaatgccc gcgcagacga tctggtacgc aataacggct atgccgccaa cgccatccag     3060

ctgcatcagg atcatatcgt cgggtctttt ttccggctca gtcatcgccc aagctggcgc     3120

tatctgggca tcggggagga agaagcccgt gccttttccc gcgaggttga agcggcatgg     3180

aaagagtttg ccgaggatga ctgctgctgc attgacgttg agcgaaaacg cacgtttacc     3240

atgatgattc gggaaggtgt ggccatgcac gcctttaacg gtgaactgtt cgttcaggcc     3300

acctgggata ccagttcgtc gcggcttttc cggacacagt tccggatggt cagcccgaag     3360

cgcatcagca acccgaacaa taccggcgac agccggaact gccgtgccgg tgtgcagatt     3420

aatgacagcg gtgcggcgct gggatattac gtcagcgagg acgggtatcc tggctggatg     3480

ccgcagaaat ggacatggat accccgtgag ttacccggcg ggcgcgcctc gttcattcac     3540

gtttttgaac ccgtggagga cgggcagact cgcggtgcaa atgtgtttta cagcgtgatg     3600

gagcagatga agatgctcga cacgctgcag aacacgcagc tgcagagcgc cattgtgaag     3660

gcgatgtatg ccgccaccat tgagagtgag ctggatacgc agtcagcgat ggattttatt     3720

ctgggcgcga acagtcagga gcagcgggaa aggctgaccg gctggattgg tgaaattgcc     3780

gcgtattacg ccgcagcgcc ggtccggctg ggaggcgcaa aagtaccgca cctgatgccg     3840

ggtgactcac tgaacctgca gacggctcag gatacggata acggctactc cgtgtttgag     3900

cagtcactgc tgcggtatat cgctgccggg ctgggtgtct cgtatgagca gctttcccgg     3960

aattacgccc agatgagcta ctccacggca cgggccagtg cgaacgagtc gtgggcgtac     4020

tttatggggc ggcgaaaatt cgtcgcatcc cgtcaggcga gccagatgtt tctgtgctgg     4080

ctggaagagg ccatcgttcg ccgcgtggtg acgttacctt caaaagcgcg cttcagtttt     4140

caggaagccc gcagtgcctg ggggaactgc gactggatag gctccggtcg tatggccatc     4200

gatggtctga aagaagttca ggaagcggtg atgctgatag aagccggact gagtacctac     4260

gagaaagagt gcgcaaaacg cggtgacgac tatcaggaaa tttttgccca gcaggtccgt     4320

gaaacgatgg agcgccgtgc agccggtctt aaaccgcccg cctgggcggc tgcagcattt     4380

gaatccgggc tgcgacaatc aacagaggag gagaagagtg acagcagagc tgcgtaatct     4440

cccgcatatt gccagcatgg cctttaatga gccgctgatg cttgaacccg cctatgcgcg     4500

ggttttcttt tgtgcgcttg caggccagct tgggatcagc agcctgacgg atgcggtgtc     4560

cggcgacagc ctgactgccc aggaggcact cgcgacgctg gcattatccg gtgatgatga     4620

cggaccacga caggcccgca gttatcaggt catgaacggc atcgccgtgc tgccggtgtc     4680

cggcacgctg gtcagccgga cgcgggcgct gcagccgtac tcggggatga ccggttacaa     4740

cggcattatc gcccgtctgc aacaggctgc cagcgatccg atggtggacg gcattctgct     4800

cgatatggac acgcccggcg ggatggtggc gggggcattt gactgcgctg acatcatcgc     4860

ccgtgtgcgt gacataaaac cggtatgggc gcttgccaac gacatgaact gcagtgcagg     4920

tcagttgctt gccagtgccg cctcccggcg tctggtcacg cagaccgccc ggacaggctc     4980

catcggcgtc atgatggctc acagtaatta cggtgctgcg ctggagaaac agggtgtgga     5040

aatcacgctg atttacagcg gcagccataa ggtggatggc aacccctaca gccatcttcc     5100

ggatgacgtc cgggagacac tgcagtcccg gatggacgca acccgccaga tgtttgcgca     5160

gaaggtgtcg gcatataccg gcctgtccgt gcaggttgtg ctggataccg aggctgcagt     5220

gtacagcggt caggaggcca ttgatgccgg actggctgat gaacttgtta acagcaccga     5280

tgcgatcacc gtcatgcgtg atgcactgga tgcacgtaaa tcccgtctct caggagggcg     5340

aatgaccaaa gagactcaat caacaactgt ttcagccact gcttcgcagg ctgacgttac     5400

tgacgtggtg ccagcgacgg agggcgagaa cgccagcgcg gcgcagccgg acgtgaacgc     5460

gcagatcacc gcagcggttg cggcagaaaa cagccgcatt atggggatcc tcaactgtga     5520

ggaggctcac ggacgcgaag aacaggcacg cgtgctggca gaaacccccg gtatgaccgt     5580

gaaaacggcc cgccgcattc tggccgcagc accacagagt gcacaggcgc gcagtgacac     5640

tgcgctggat cgtctgatgc agggggcacc ggcaccgctg gctgcaggta acccggcatc     5700

tgatgccgtt aacgatttgc tgaacacacc agtgtaaggg atgtttatga cgagcaaaga     5760

aacctttacc cattaccagc cgcagggcaa cagtgacccg gctcataccg caaccgcgcc     5820

cggcggattg agtgcgaaag cgcctgcaat gaccccgctg atgctggaca cctccagccg     5880

taagctggtt gcgtgggatg gcaccaccga cggtgctgcc gttggcattc ttgcggttgc     5940

tgctgaccag accagcacca cgctgacgtt ctacaagtcc ggcacgttcc gttatgagga     6000

tgtgctctgg ccggaggctg ccagcgacga gacgaaaaaa cggaccgcgt ttgccggaac     6060

ggcaatcagc atcgtttaac tttacccttc atcactaaag gccgcctgtg cggctttttt     6120

tacgggattt ttttatgtcg atgtacacaa ccgcccaact gctggcggca aatgagcaga     6180

aatttaagtt tgatccgctg tttctgcgtc tctttttccg tgagagctat cccttcacca     6240

cggagaaagt ctatctctca caaattccgg gactggtaaa catggcgctg tacgtttcgc     6300

cgattgtttc cggtgaggtt atccgttccc gtggcggctc cacctctgaa tttacgccgg     6360

gatatgtcaa gccgaagcat gaagtgaatc cgcagatgac cctgcgtcgc ctgccggatg     6420

aagatccgca gaatctggcg gacccggctt accgccgccg tcgcatcatc atgcagaaca     6480

tgcgtgacga agagctggcc attgctcagg tcgaagagat gcaggcagtt tctgccgtgc     6540

ttaagggcaa atacaccatg accggtgaag ccttcgatcc ggttgaggtg gatatgggcc     6600

gcagtgagga gaataacatc acgcagtccg gcggcacgga gtggagcaag cgtgacaagt     6660

ccacgtatga cccgaccgac gatatcgaag cctacgcgct gaacgccagc ggtgtggtga     6720

atatcatcgt gttcgatccg aaaggctggg cgctgttccg ttccttcaaa gccgtcaagg     6780

agaagctgga tacccgtcgt ggctctaatt ccgagctgga gacagcggtg aaagacctgg     6840

gcaaagcggt gtcctataag gggatgtatg gcgatgtggc catcgtcgtg tattccggac     6900

agtacgtgga aaacggcgtc aaaaagaact tcctgccgga caacacgatg gtgctgggga     6960

acactcaggc acgcggtctg cgcacctatg gctgcattca ggatgcggac gcacagcgcg     7020

aaggcattaa cgcctctgcc cgttacccga aaaactgggt gaccaccggc gatccggcgc     7080

gtgagttcac catgattcag tcagcaccgc tgatgctgct ggctgaccct gatgagttcg     7140

tgtccgtaca actggcgtaa tcatggccct tcggggccat tgtttctctg tggaggagtc     7200

catgacgaaa gatgaactga ttgcccgtct ccgctcgctg ggtgaacaac tgaaccgtga     7260

tgtcagcctg acggggacga aagaagaact ggcgctccgt gtggcagagc tgaaagagga     7320

gcttgatgac acggatgaaa ctgccggtca ggacacccct ctcagccggg aaaatgtgct     7380

gaccggacat gaaaatgagg tgggatcagc gcagccggat accgtgattc tggatacgtc     7440

tgaactggtc acggtcgtgg cactggtgaa gctgcatact gatgcacttc acgccacgcg     7500

ggatgaacct gtggcatttg tgctgccggg aacggcgttt cgtgtctctg ccggtgtggc     7560

agccgaaatg acagagcgcg gcctggccag aatgcaataa cgggaggcgc tgtggctgat     7620

ttcgataacc tgttcgatgc tgccattgcc cgcgccgatg aaacgatacg cgggtacatg     7680

ggaacgtcag ccaccattac atccggtgag cagtcaggtg cggtgatacg tggtgttttt     7740

gatgaccctg aaaatatcag ctatgccgga cagggcgtgc gcgttgaagg ctccagcccg     7800

tccctgtttg tccggactga tgaggtgcgg cagctgcggc gtggagacac gctgaccatc     7860

ggtgaggaaa atttctgggt agatcgggtt tcgccggatg atggcggaag ttgtcatctc     7920

tggcttggac ggggcgtacc gcctgccgtt aaccgtcgcc gctgaaaggg ggatgtatgg     7980

ccataaaagg tcttgagcag gccgttgaaa acctcagccg tatcagcaaa acggcggtgc     8040

ctggtgccgc cgcaatggcc attaaccgcg ttgcttcatc cgcgatatcg cagtcggcgt     8100

cacaggttgc ccgtgagaca aaggtacgcc ggaaactggt aaaggaaagg gccaggctga     8160

aaagggccac ggtcaaaaat ccgcaggcca gaatcaaagt taaccggggg gatttgcccg     8220

taatcaagct gggtaatgcg cgggttgtcc tttcgcgccg caggcgtcgt aaaaaggggc     8280

agcgttcatc cctgaaaggt ggcggcagcg tgcttgtggt gggtaaccgt cgtattcccg     8340

gcgcgtttat tcagcaactg aaaaatggcc ggtggcatgt catgcagcgt gtggctggga     8400

aaaaccgtta ccccattgat gtggtgaaaa tcccgatggc ggtgccgctg accacggcgt     8460

ttaaacaaaa tattgagcgg atacggcgtg aacgtcttcc gaaagagctg ggctatgcgc     8520

tgcagcatca actgaggatg gtaataaagc gatgaaacat actgaactcc gtgcagccgt     8580

actggatgca ctggagaagc atgacaccgg ggcgacgttt tttgatggtc gccccgctgt     8640

ttttgatgag gcggattttc cggcagttgc cgtttatctc accggcgctg aatacacggg     8700

cgaagagctg gacagcgata cctggcaggc ggagctgcat atcgaagttt tcctgcctgc     8760

tcaggtgccg gattcagagc tggatgcgtg gatggagtcc cggatttatc cggtgatgag     8820

cgatatcccg gcactgtcag atttgatcac cagtatggtg gccagcggct atgactaccg     8880

gcgcgacgat gatgcgggct tgtggagttc agccgatctg acttatgtca ttacctatga     8940

aatgtgagga cgctatgcct gtaccaaatc ctacaatgcc ggtgaaaggt gccgggacca     9000

ccctgtgggt ttataagggg agcggtgacc cttacgcgaa tccgctttca gacgttgact     9060

ggtcgcgtct ggcaaaagtt aaagacctga cgcccggcga actgaccgct gagtcctatg     9120

acgacagcta tctcgatgat gaagatgcag actggactgc gaccgggcag gggcagaaat     9180

ctgccggaga taccagcttc acgctggcgt ggatgcccgg agagcagggg cagcaggcgc     9240

tgctggcgtg gtttaatgaa ggcgataccc gtgcctataa aatccgcttc ccgaacggca     9300

cggtcgatgt gttccgtggc tgggtcagca gtatcggtaa ggcggtgacg gcgaaggaag     9360

tgatcacccg cacggtgaaa gtcaccaatg tgggacgtcc gtcgatggca gaagatcgca     9420

gcacggtaac agcggcaacc ggcatgaccg tgacgcctgc cagcacctcg gtggtgaaag     9480

ggcagagcac cacgctgacc gtggccttcc agccggaggg cgtaaccgac aagagctttc     9540

gtgcggtgtc tgcggataaa acaaaagcca ccgtgtcggt cagtggtatg accatcaccg     9600

tgaacggcgt tgctgcaggc aaggtcaaca ttccggttgt atccggtaat ggtgagtttg     9660

ctgcggttgc agaaattacc gtcaccgcca gttaatccgg agagtcagcg atgttcctga     9720

aaaccgaatc atttgaacat aacggtgtga ccgtcacgct ttctgaactg tcagccctgc     9780

agcgcattga gcatctcgcc ctgatgaaac ggcaggcaga acaggcggag tcagacagca     9840

accggaagtt tactgtggaa gacgccatca gaaccggcgc gtttctggtg gcgatgtccc     9900

tgtggcataa ccatccgcag aagacgcaga tgccgtccat gaatgaagcc gttaaacaga     9960

ttgagcagga agtgcttacc acctggccca cggaggcaat ttctcatgct gaaaacgtgg    10020

tgtaccggct gtctggtatg tatgagtttg tggtgaataa tgcccctgaa cagacagagg    10080

acgccgggcc cgcagagcct gtttctgcgg gaaagtgttc gacggtgagc tgagttttgc    10140

cctgaaactg gcgcgtgaga tggggcgacc cgactggcgt gccatgcttg ccgggatgtc    10200

atccacggag tatgccgact ggcaccgctt ttacagtacc cattattttc atgatgttct    10260

gctggatatg cacttttccg ggctgacgta caccgtgctc agcctgtttt tcagcgatcc    10320

ggatatgcat ccgctggatt tcagtctgct gaaccggcgc gaggctgacg aagagcctga    10380

agatgatgtg ctgatgcaga aagcggcagg gcttgccgga ggtgtccgct ttggcccgga    10440

cgggaatgaa gttatccccg cttccccgga tgtggcggac atgacggagg atgacgtaat    10500

gctgatgaca gtatcagaag ggatcgcagg aggagtccgg tatggctgaa ccggtaggcg    10560

atctggtcgt tgatttgagt ctggatgcgg ccagatttga cgagcagatg gccagagtca    10620

ggcgtcattt ttctggtacg gaaagtgatg cgaaaaaaac agcggcagtc gttgaacagt    10680

cgctgagccg acaggcgctg gctgcacaga aagcggggat ttccgtcggg cagtataaag    10740

ccgccatgcg tatgctgcct gcacagttca ccgacgtggc cacgcagctt gcaggcgggc    10800

aaagtccgtg gctgatcctg ctgcaacagg gggggcaggt gaaggactcc ttcggcggga    10860

tgatccccat gttcaggggg cttgccggtg cgatcaccct gccgatggtg ggggccacct    10920

cgctggcggt ggcgaccggt gcgctggcgt atgcctggta tcagggcaac tcaaccctgt    10980

ccgatttcaa caaaacgctg gtcctttccg gcaatcaggc gggactgacg gcagatcgta    11040

tgctggtcct gtccagagcc gggcaggcgg cagggctgac gtttaaccag accagcgagt    11100

cactcagcgc actggttaag gcgggggtaa gcggtgaggc tcagattgcg tccatcagcc    11160

agagtgtggc gcgtttctcc tctgcatccg gcgtggaggt ggacaaggtc gctgaagcct    11220

tcgggaagct gaccacagac ccgacgtcgg ggctgacggc gatggctcgc cagttccata    11280

acgtgtcggc ggagcagatt gcgtatgttg ctcagttgca gcgttccggc gatgaagccg    11340

gggcattgca ggcggcgaac gaggccgcaa cgaaagggtt tgatgaccag acccgccgcc    11400

tgaaagagaa catgggcacg ctggagacct gggcagacag gactgcgcgg gcattcaaat    11460

ccatgtggga tgcggtgctg gatattggtc gtcctgatac cgcgcaggag atgctgatta    11520

aggcagaggc tgcgtataag aaagcagacg acatctggaa tctgcgcaag gatgattatt    11580

ttgttaacga tgaagcgcgg gcgcgttact gggatgatcg tgaaaaggcc cgtcttgcgc    11640

ttgaagccgc ccgaaagaag gctgagcagc agactcaaca ggacaaaaat gcgcagcagc    11700

agagcgatac cgaagcgtca cggctgaaat ataccgaaga ggcgcagaag gcttacgaac    11760

ggctgcagac gccgctggag aaatataccg cccgtcagga agaactgaac aaggcactga    11820

aagacgggaa aatcctgcag gcggattaca acacgctgat ggcggcggcg aaaaaggatt    11880

atgaagcgac gctgaaaaag ccgaaacagt ccagcgtgaa ggtgtctgcg ggcgatcgtc    11940

aggaagacag tgctcatgct gccctgctga cgcttcaggc agaactccgg acgctggaga    12000

agcatgccgg agcaaatgag aaaatcagcc agcagcgccg ggatttgtgg aaggcggaga    12060

gtcagttcgc ggtactggag gaggcggcgc aacgtcgcca gctgtctgca caggagaaat    12120

ccctgctggc gcataaagat gagacgctgg agtacaaacg ccagctggct gcacttggcg    12180

acaaggttac gtatcaggag cgcctgaacg cgctggcgca gcaggcggat aaattcgcac    12240

agcagcaacg ggcaaaacgg gccgccattg atgcgaaaag ccgggggctg actgaccggc    12300

aggcagaacg ggaagccacg gaacagcgcc tgaaggaaca gtatggcgat aatccgctgg    12360

cgctgaataa cgtcatgtca gagcagaaaa agacctgggc ggctgaagac cagcttcgcg    12420

ggaactggat ggcaggcctg aagtccggct ggagtgagtg ggaagagagc gccacggaca    12480

gtatgtcgca ggtaaaaagt gcagccacgc agacctttga tggtattgca cagaatatgg    12540

cggcgatgct gaccggcagt gagcagaact ggcgcagctt cacccgttcc gtgctgtcca    12600

tgatgacaga aattctgctt aagcaggcaa tggtggggat tgtcgggagt atcggcagcg    12660

ccattggcgg ggctgttggt ggcggcgcat ccgcgtcagg cggtacagcc attcaggccg    12720

ctgcggcgaa attccatttt gcaaccggag gatttacggg aaccggcggc aaatatgagc    12780

cagcggggat tgttcaccgt ggtgagtttg tcttcacgaa ggaggcaacc agccggattg    12840

gcgtggggaa tctttaccgg ctgatgcgcg gctatgccac cggcggttat gtcggtacac    12900

cgggcagcat ggcagacagc cggtcgcagg cgtccgggac gtttgagcag aataaccatg    12960

tggtgattaa caacgacggc acgaacgggc agataggtcc ggctgctctg aaggcggtgt    13020

atgacatggc ccgcaagggt gcccgtgatg aaattcagac acagatgcgt gatggtggcc    13080

tgttctccgg aggtggacga tgaagacctt ccgctggaaa gtgaaacccg gtatggatgt    13140

ggcttcggtc ccttctgtaa gaaaggtgcg ctttggtgat ggctattctc agcgagcgcc    13200

tgccgggctg aatgccaacc tgaaaacgta cagcgtgacg ctttctgtcc cccgtgagga    13260

ggccacggta ctggagtcgt ttctggaaga gcacgggggc tggaaatcct ttctgtggac    13320

gccgccttat gagtggcggc agataaaggt gacctgcgca aaatggtcgt cgcgggtcag    13380

tatgctgcgt gttgagttca gcgcagagtt tgaacaggtg gtgaactgat gcaggatatc    13440

cggcaggaaa cactgaatga atgcacccgt gcggagcagt cggccagcgt ggtgctctgg    13500

gaaatcgacc tgacagaggt cggtggagaa cgttattttt tctgtaatga gcagaacgaa    13560

aaaggtgagc cggtcacctg gcaggggcga cagtatcagc cgtatcccat tcaggggagc    13620

ggttttgaac tgaatggcaa aggcaccagt acgcgcccca cgctgacggt ttctaacctg    13680

tacggtatgg tcaccgggat ggcggaagat atgcagagtc tggtcggcgg aacggtggtc    13740

cggcgtaagg tttacgcccg ttttctggat gcggtgaact tcgtcaacgg aaacagttac    13800

gccgatccgg agcaggaggt gatcagccgc tggcgcattg agcagtgcag cgaactgagc    13860

gcggtgagtg cctcctttgt actgtccacg ccgacggaaa cggatggcgc tgtttttccg    13920

ggacgtatca tgctggccaa cacctgcacc tggacctatc gcggtgacga gtgcggttat    13980

agcggtccgg ctgtcgcgga tgaatatgac cagccaacgt ccgatatcac gaaggataaa    14040

tgcagcaaat gcctgagcgg ttgtaagttc cgcaataacg tcggcaactt tggcggcttc    14100

ctttccatta acaaactttc gcagtaaatc ccatgacaca gacagaatca gcgattctgg    14160

cgcacgcccg gcgatgtgcg ccagcggagt cgtgcggctt cgtggtaagc acgccggagg    14220

gggaaagata tttcccctgc gtgaatatct ccggtgagcc ggaggctatt tccgtatgtc    14280

gccggaagac tggctgcagg cagaaatgca gggtgagatt gtggcgctgg tccacagcca    14340

ccccggtggt ctgccctggc tgagtgaggc cgaccggcgg ctgcaggtgc agagtgattt    14400

gccgtggtgg ctggtctgcc gggggacgat tcataagttc cgctgtgtgc cgcatctcac    14460

cgggcggcgc tttgagcacg gtgtgacgga ctgttacaca ctgttccggg atgcttatca    14520

tctggcgggg attgagatgc cggactttca tcgtgaggat gactggtggc gtaacggcca    14580

gaatctctat ctggataatc tggaggcgac ggggctgtat caggtgccgt tgtcagcggc    14640

acagccgggc gatgtgctgc tgtgctgttt tggttcatca gtgccgaatc acgccgcaat    14700

ttactgcggc gacggcgagc tgctgcacca tattcctgaa caactgagca aacgagagag    14760

gtacaccgac aaatggcagc gacgcacaca ctccctctgg cgtcaccggg catggcgcgc    14820

atctgccttt acggggattt acaacgattt ggtcgccgca tcgaccttcg tgtgaaaacg    14880

ggggctgaag ccatccgggc actggccaca cagctcccgg cgtttcgtca gaaactgagc    14940

gacggctggt atcaggtacg gattgccggg cgggacgtca gcacgtccgg gttaacggcg    15000

cagttacatg agactctgcc tgatggcgct gtaattcata ttgttcccag agtcgccggg    15060

gccaagtcag gtggcgtatt ccagattgtc ctgggggctg ccgccattgc cggatcattc    15120

tttaccgccg gagccaccct tgcagcatgg ggggcagcca ttggggccgg tggtatgacc    15180

ggcatcctgt tttctctcgg tgccagtatg gtgctcggtg gtgtggcgca gatgctggca    15240

ccgaaagcca gaactccccg tatacagaca acggataacg gtaagcagaa cacctatttc    15300

tcctcactgg ataacatggt tgcccagggc aatgttctgc ctgttctgta cggggaaatg    15360

cgcgtggggt cacgcgtggt ttctcaggag atcagcacgg cagacgaagg ggacggtggt    15420

caggttgtgg tgattggtcg ctgatgcaaa atgttttatg tgaaaccgcc tgcgggcggt    15480

tttgtcattt atggagcgtg aggaatgggt aaaggaagca gtaaggggca taccccgcgc    15540

gaagcgaagg acaacctgaa gtccacgcag ttgctgagtg tgatcgatgc catcagcgaa    15600

gggccgattg aaggtccggt ggatggctta aaaagcgtgc tgctgaacag tacgccggtg    15660

ctggacactg aggggaatac caacatatcc ggtgtcacgg tggtgttccg ggctggtgag    15720

caggagcaga ctccgccgga gggatttgaa tcctccggct ccgagacggt gctgggtacg    15780

gaagtgaaat atgacacgcc gatcacccgc accattacgt ctgcaaacat cgaccgtctg    15840

cgctttacct tcggtgtaca ggcactggtg gaaaccacct caaagggtga caggaatccg    15900

tcggaagtcc gcctgctggt tcagatacaa cgtaacggtg gctgggtgac ggaaaaagac    15960

atcaccatta agggcaaaac cacctcgcag tatctggcct cggtggtgat gggtaacctg    16020

ccgccgcgcc cgtttaatat ccggatgcgc aggatgacgc cggacagcac cacagaccag    16080

ctgcagaaca aaacgctctg gtcgtcatac actgaaatca tcgatgtgaa acagtgctac    16140

ccgaacacgg cactggtcgg cgtgcaggtg gactcggagc agttcggcag ccagcaggtg    16200

agccgtaatt atcatctgcg cgggcgtatt ctgcaggtgc cgtcgaacta taacccgcag    16260

acgcggcaat acagcggtat ctgggacgga acgtttaaac cggcatacag caacaacatg    16320

gcctggtgtc tgtgggatat gctgacccat ccgcgctacg gcatggggaa acgtcttggt    16380

gcggcggatg tggataaatg ggcgctgtat gtcatcggcc agtactgcga ccagtcagtg    16440

ccggacggct ttggcggcac ggagccgcgc atcacctgta atgcgtacct gaccacacag    16500

cgtaaggcgt gggatgtgct cagcgatttc tgctcggcga tgcgctgtat gccggtatgg    16560

aacgggcaga cgctgacgtt cgtgcaggac cgaccgtcgg ataagacgtg gacctataac    16620

cgcagtaatg tggtgatgcc ggatgatggc gcgccgttcc gctacagctt cagcgccctg    16680

aaggaccgcc ataatgccgt tgaggtgaac tggattgacc cgaacaacgg ctgggagacg    16740

gcgacagagc ttgttgaaga tacgcaggcc attgcccgtt acggtcgtaa tgttacgaag    16800

atggatgcct ttggctgtac cagccggggg caggcacacc gcgccgggct gtggctgatt    16860

aaaacagaac tgctggaaac gcagaccgtg gatttcagcg tcggcgcaga agggcttcgc    16920

catgtaccgg gcgatgttat tgaaatctgc gatgatgact atgccggtat cagcaccggt    16980

ggtcgtgtgc tggcggtgaa cagccagacc cggacgctga cgctcgaccg tgaaatcacg    17040

ctgccatcct ccggtaccgc gctgataagc ctggttgacg gaagtggcaa tccggtcagc    17100

gtggaggttc agtccgtcac cgacggcgtg aaggtaaaag tgagccgtgt tcctgacggt    17160

gttgctgaat acagcgtatg ggagctgaag ctgccgacgc tgcgccagcg actgttccgc    17220

tgcgtgagta tccgtgagaa cgacgacggc acgtatgcca tcaccgccgt gcagcatgtg    17280

ccggaaaaag aggccatcgt ggataacggg gcgcactttg acggcgaaca gagtggcacg    17340

gtgaatggtg tcacgccgcc agcggtgcag cacctgaccg cagaagtcac tgcagacagc    17400

ggggaatatc aggtgctggc gcgatgggac acaccgaagg tggtgaaggg cgtgagtttc    17460

ctgctccgtc tgaccgtaac agcggacgac ggcagtgagc ggctggtcag cacggcccgg    17520

acgacggaaa ccacataccg cttcacgcaa ctggcgctgg ggaactacag gctgacagtc    17580

cgggcggtaa atgcgtgggg gcagcagggc gatccggcgt cggtatcgtt ccggattgcc    17640

gcaccggcag caccgtcgag gattgagctg acgccgggct attttcagat aaccgccacg    17700

ccgcatcttg ccgtttatga cccgacggta cagtttgagt tctggttctc ggaaaagcag    17760

attgcggata tcagacaggt tgaaaccagc acgcgttatc ttggtacggc gctgtactgg    17820

atagccgcca gtatcaatat caaaccgggc catgattatt acttttatat ccgcagtgtg    17880

aacaccgttg gcaaatcggc attcgtggag gccgtcggtc gggcgagcga tgatgcggaa    17940

ggttacctgg attttttcaa aggcaagata accgaatccc atctcggcaa ggagctgctg    18000

gaaaaagtcg agctgacgga ggataacgcc agcagactgg aggagttttc gaaagagtgg    18060

aaggatgcca gtgataagtg gaatgccatg tgggctgtca aaattgagca gaccaaagac    18120

ggcaaacatt atgtcgcggg tattggcctc agcatggagg acacggagga aggcaaactg    18180

agccagtttc tggttgccgc caatcgtatc gcatttattg acccggcaaa cgggaatgaa    18240

acgccgatgt ttgtggcgca gggcaaccag atattcatga acgacgtgtt cctgaagcgc    18300

ctgacggccc ccaccattac cagcggcggc aatcctccgg ccttttccct gacaccggac    18360

ggaaagctga ccgctaaaaa tgcggatatc agtggcagtg tgaatgcgaa ctccgggacg    18420

ctcagtaatg tgacgatagc tgaaaactgt acgataaacg gtacgctgag ggcggaaaaa    18480

atcgtcgggg acattgtaaa ggcggcgagc gcggcttttc cgcgccagcg tgaaagcagt    18540

gtggactggc cgtcaggtac ccgtactgtc accgtgaccg atgaccatcc ttttgatcgc    18600

cagatagtgg tgcttccgct gacgtttcgc ggaagtaagc gtactgtcag cggcaggaca    18660

acgtattcga tgtgttatct gaaagtactg atgaacggtg cggtgattta tgatggcgcg    18720

gcgaacgagg cggtacaggt gttctcccgt attgttgaca tgccagcggg tcggggaaac    18780

gtgatcctga cgttcacgct tacgtccaca cggcattcgg cagatattcc gccgtatacg    18840

tttgccagcg atgtgcaggt tatggtgatt aagaaacagg cgctgggcat cagcgtggtc    18900

tgagtgtgtt acagaggttc gtccgggaac gggcgtttta ttataaaaca gtgagaggtg    18960

aacgatgcgt aatgtgtgta ttgccgttgc tgtctttgcc gcacttgcgg tgacagtcac    19020

tccggcccgt gcggaaggtg gacatggtac gtttacggtg ggctattttc aagtgaaacc    19080

gggtacattg ccgtcgttgt cgggcgggga taccggtgtg agtcatctga aagggattaa    19140

cgtgaagtac cgttatgagc tgacggacag tgtgggggtg atggcttccc tggggttcgc    19200

cgcgtcgaaa aagagcagca cagtgatgac cggggaggat acgtttcact atgagagcct    19260

gcgtggacgt tatgtgagcg tgatggccgg accggtttta caaatcagta agcaggtcag    19320

tgcgtacgcc atggccggag tggctcacag tcggtggtcc ggcagtacaa tggattaccg    19380

taagacggaa atcactcccg ggtatatgaa agagacgacc actgccaggg acgaaagtgc    19440

aatgcggcat acctcagtgg cgtggagtgc aggtatacag attaatccgg cagcgtccgt    19500

cgttgttgat attgcttatg aaggctccgg cagtggcgac tggcgtactg acggattcat    19560

cgttggggtc ggttataaat tctgattagc caggtaacac agtgttatga cagcccgccg    19620

gaaccggtgg gcttttttgt ggggtgaata tggcagtaaa gatttcagga gtcctgaaag    19680

acggcacagg aaaaccggta cagaactgca ccattcagct gaaagccaga cgtaacagca    19740

ccacggtggt ggtgaacacg gtgggctcag agaatccgga tgaagccggg cgttacagca    19800

tggatgtgga gtacggtcag tacagtgtca tcctgcaggt tgacggtttt ccaccatcgc    19860

acgccgggac catcaccgtg tatgaagatt cacaaccggg gacgctgaat gattttctct    19920

gtgccatgac ggaggatgat gcccggccgg aggtgctgcg tcgtcttgaa ctgatggtgg    19980

aagaggtggc gcgtaacgcg tccgtggtgg cacagagtac ggcagacgcg aagaaatcag    20040

ccggcgatgc cagtgcatca gctgctcagg tcgcggccct tgtgactgat gcaactgact    20100

cagcacgcgc cgccagcacg tccgccggac aggctgcatc gtcagctcag gaagcgtcct    20160

ccggcgcaga agcggcatca gcaaaggcca ctgaagcgga aaaaagtgcc gcagccgcag    20220

agtcctcaaa aaacgcggcg gccaccagtg ccggtgcggc gaaaacgtca gaaacgaatg    20280

ctgcagcgtc acaacaatca gccgccacgt ctgcctccac cgcggccacg aaagcgtcag    20340

aggccgccac ttcagcacga gatgcggtgg cctcaaaaga ggcagcaaaa tcatcagaaa    20400

cgaacgcatc atcaagtgcc ggtcgtgcag cttcctcggc aacggcggca gaaaattctg    20460

ccagggcggc aaaaacgtcc gagacgaatg ccaggtcatc tgaaacagca gcggaacgga    20520

gcgcctctgc cgcggcagac gcaaaaacag cggcggcggg gagtgcgtca acggcatcca    20580

cgaaggcgac agaggctgcg ggaagtgcgg tatcagcatc gcagagcaaa agtgcggcag    20640

aagcggcggc aatacgtgca aaaaattcgg caaaacgtgc agaagatata gcttcagctg    20700

tcgcgcttga ggatgcggac acaacgagaa aggggatagt gcagctcagc agtgcaacca    20760

acagcacgtc tgaaacgctt gctgcaacgc caaaggcggt taaggtggta atggatgaaa    20820

cgaacagaaa agcccactgg acagtccggc actgaccgga acgccaacag caccaaccgc    20880

gctcagggga acaaacaata cccagattgc gaacaccgct tttgtactgg ccgcgattgc    20940

agatgttatc gacgcgtcac ctgacgcact gaatacgctg aatgaactgg ccgcagcgct    21000

cgggaatgat ccagattttg ctaccaccat gactaacgcg cttgcgggta aacaaccgaa    21060

gaatgcgaca ctgacggcgc tggcagggct ttccacggcg aaaaataaat taccgtattt    21120

tgcggaaaat gatgccgcca gcctgactga actgactcag gttggcaggg atattctggc    21180

aaaaaattcc gttgcagatg ttcttgaata ccttggggcc ggtgagaatt cggcctttcc    21240

ggcaggtgcg ccgatcccgt ggccatcaga tatcgttccg tctggctacg tcctgatgca    21300

ggggcaggcg tttgacaaat cagcctaccc aaaacttgct gtcgcgtatc catcgggtgt    21360

gcttcctgat atgcgaggct ggacaatcaa ggggaaaccc gccagcggtc gtgctgtatt    21420

gtctcaggaa caggatggaa ttaagtcgca cacccacagt gccagtgcat ccggtacgga    21480

tttggggacg aaaaccacat cgtcgtttga ttacgggacg aaaacaacag gcagtttcga    21540

ttacggcacc aaatcgacga ataacacggg ggctcatgct cacagtctga gcggttcaac    21600

aggggccgcg ggtgctcatg cccacacaag tggtttaagg atgaacagtt ctggctggag    21660

tcagtatgga acagcaacca ttacaggaag tttatccaca gttaaaggaa ccagcacaca    21720

gggtattgct tatttatcga aaacggacag tcagggcagc cacagtcact cattgtccgg    21780

tacagccgtg agtgccggtg cacatgcgca tacagttggt attggtgcgc accagcatcc    21840

ggttgttatc ggtgctcatg cccattcttt cagtattggt tcacacggac acaccatcac    21900

cgttaacgct gcgggtaacg cggaaaacac cgtcaaaaac attgcattta actatattgt    21960

gaggcttgca taatggcatt cagaatgagt gaacaaccac ggaccataaa aatttataat    22020

ctgctggccg gaactaatga atttattggt gaaggtgacg catatattcc gcctcatacc    22080

ggtctgcctg caaacagtac cgatattgca ccgccagata ttccggctgg ctttgtggct    22140

gttttcaaca gtgatgaggc atcgtggcat ctcgttgaag accatcgggg taaaaccgtc    22200

tatgacgtgg cttccggcga cgcgttattt atttctgaac tcggtccgtt accggaaaat    22260

tttacctggt tatcgccggg aggggaatat cagaagtgga acggcacagc ctgggtgaag    22320

gatacggaag cagaaaaact gttccggatc cgggaggcgg aagaaacaaa aaaaagcctg    22380

atgcaggtag ccagtgagca tattgcgccg cttcaggatg ctgcagatct ggaaattgca    22440

acgaaggaag aaacctcgtt gctggaagcc tggaagaagt atcgggtgtt gctgaaccgt    22500

gttgatacat caactgcacc tgatattgag tggcctgctg tccctgttat ggagtaatcg    22560

ttttgtgata tgccgcagaa acgttgtatg aaataacgtt ctgcggttag ttagtatatt    22620

gtaaagctga gtattggttt atttggcgat tattatcttc aggagaataa tggaagttct    22680

atgactcaat tgttcatagt gtttacatca ccgccaattg cttttaagac tgaacgcatg    22740

aaatatggtt tttcgtcatg ttttgagtct gctgttgata tttctaaagt cggttttttt    22800

tcttcgtttt ctctaactat tttccatgaa atacattttt gattattatt tgaatcaatt    22860

ccaattacct gaagtctttc atctataatt ggcattgtat gtattggttt attggagtag    22920

atgcttgctt ttctgagcca tagctctgat atccaaatga agccataggc atttgttatt    22980

ttggctctgt cagctgcata acgccaaaaa atatatttat ctgcttgatc ttcaaatgtt    23040

gtattgatta aatcaattgg atggaattgt ttatcataaa aaattaatgt ttgaatgtga    23100

taaccgtcct ttaaaaaagt cgtttctgca agcttggctg tatagtcaac taactcttct    23160

gtcgaagtga tatttttagg cttatctacc agttttagac gctctttaat atcttcagga    23220

attattttat tgtcatattg tatcatgcta aatgacaatt tgcttatgga gtaatctttt    23280

aattttaaat aagttattct cctggcttca tcaaataaag agtcgaatga tgttggcgaa    23340

atcacatcgt cacccattgg attgtttatt tgtatgccaa gagagttaca gcagttatac    23400

attctgccat agattatagc taaggcatgt aataattcgt aatcttttag cgtattagcg    23460

acccatcgtc tttctgattt aataatagat gattcagtta aatatgaagg taatttcttt    23520

tgtgcaagtc tgactaactt ttttatacca atgtttaaca tactttcatt tgtaataaac    23580

tcaatgtcat tttcttcaat gtaagatgaa ataagagtag cctttgcctc gctatacatt    23640

tctaaatcgc cttgtttttc tatcgtattg cgagaatttt tagcccaagc cattaatgga    23700

tcatttttcc atttttcaat aacattattg ttataccaaa tgtcatatcc tataatctgg    23760

tttttgtttt tttgaataat aaatgttact gttcttgcgg tttggaggaa ttgattcaaa    23820

ttcaagcgaa ataattcagg gtcaaaatat gtatcaatgc agcatttgag caagtgcgat    23880

aaatctttaa gtcttctttc ccatggtttt ttagtcataa aactctccat tttgataggt    23940

tgcatgctag atgctgatat attttagagg tgataaaatt aactgcttaa ctgtcaatgt    24000

aatacaagtt gtttgatctt tgcaatgatt cttatcagaa accatatagt aaattagtta    24060

cacaggaaat ttttaatatt attattatca ttcattatgt attaaaatta gagttgtggc    24120

ttggctctgc taacacgttg ctcataggag atatggtaga gccgcagaca cgtcgtatgc    24180

aggaacgtgc tgcggctggc tggtgaactt ccgatagtgc gggtgttgaa tgatttccag    24240

ttgctaccga ttttacatat tttttgcatg agagaatttg taccacctcc caccgaccat    24300

ctatgactgt acgccactgt ccctaggact gctatgtgcc ggagcggaca ttacaaacgt    24360

ccttctcggt gcatgccact gttgccaatg acctgcctag gaattggtta gcaagttact    24420

accggatttt gtaaaaacag ccctcctcat ataaaaagta ttcgttcact tccgataagc    24480

gtcgtaattt tctatctttc atcatattct agatccctct gaaaaaatct tccgagtttg    24540

ctaggcactg atacataact cttttccaat aattggggaa gtcattcaaa tctataatag    24600

gtttcagatt tgcttcaata aattctgact gtagctgctg aaacgttgcg gttgaactat    24660

atttccttat aacttttacg aaagagtttc tttgagtaat cacttcactc aagtgcttcc    24720

ctgcctccaa acgatacctg ttagcaatat ttaatagctt gaaatgatga agagctctgt    24780

gtttgtcttc ctgcctccag ttcgccgggc attcaacata aaaactgata gcacccggag    24840

ttccggaaac gaaatttgca tatacccatt gctcacgaaa aaaaatgtcc ttgtcgatat    24900

agggatgaat cgcttggtgt acctcatcta ctgcgaaaac ttgacctttc tctcccatat    24960

tgcagtcgcg gcacgatgga actaaattaa taggcatcac cgaaaattca ggataatgtg    25020

caataggaag aaaatgatct atattttttg tctgtcctat atcaccacaa aatggacatt    25080

tttcacctga tgaaacaagc atgtcatcgt aatatgttct agcgggtttg tttttatctc    25140

ggagattatt ttcataaagc ttttctaatt taacctttgt caggttacca actactaagg    25200

ttgtaggctc aagagggtgt gtcctgtcgt aggtaaataa ctgacctgtc gagcttaata    25260

ttctatattg ttgttctttc tgcaaaaaag tggggaagtg agtaatgaaa ttatttctaa    25320

catttatctg catcatacct tccgagcatt tattaagcat ttcgctataa gttctcgctg    25380

gaagaggtag ttttttcatt gtactttacc ttcatctctg ttcattatca tcgcttttaa    25440

aacggttcga ccttctaatc ctatctgacc attataattt tttagaatgg tttcataaga    25500

aagctctgaa tcaacggact gcgataataa gtggtggtat ccagaatttg tcacttcaag    25560

taaaaacacc tcacgagtta aaacacctaa gttctcaccg aatgtctcaa tatccggacg    25620

gataatattt attgcttctc ttgaccgtag gactttccac atgcaggatt ttggaacctc    25680

ttgcagtact actggggaat gagttgcaat tattgctaca ccattgcgtg catcgagtaa    25740

gtcgcttaat gttcgtaaaa aagcagagag caaaggtgga tgcagatgaa cctctggttc    25800

atcgaataaa actaatgact tttcgccaac gacatctact aatcttgtga tagtaaataa    25860

aacaattgca tgtccagagc tcattcgaag cagatatttc tggatattgt cataaaacaa    25920

tttagtgaat ttatcatcgt ccacttgaat ctgtggttca ttacgtctta actcttcata    25980

tttagaaatg aggctgatga gttccatatt tgaaaagttt tcatcactac ttagtttttt    26040

gatagcttca agccagagtt gtctttttct atctactctc atacaaccaa taaatgctga    26100

aatgaattct aagcggagat cgcctagtga ttttaaacta ttgctggcag cattcttgag    26160

tccaatataa aagtattgtg taccttttgc tgggtcaggt tgttctttag gaggagtaaa    26220

aggatcaaat gcactaaacg aaactgaaac aagcgatcga aaatatccct ttgggattct    26280

tgactcgata agtctattat tttcagagaa aaaatattca ttgttttctg ggttggtgat    26340

tgcaccaatc attccattca aaattgttgt tttaccacac ccattccgcc cgataaaagc    26400

atgaatgttc gtgctgggca tagaattaac cgtcacctca aaaggtatag ttaaatcact    26460

gaatccggga gcactttttc tattaaatga aaagtggaaa tctgacaatt ctggcaaacc    26520

atttaacaca cgtgcgaact gtccatgaat ttctgaaaga gttacccctc taagtaatga    26580

ggtgttaagg acgctttcat tttcaatgtc ggctaatcga tttggccata ctactaaatc    26640

ctgaatagct ttaagaaggt tatgtttaaa accatcgctt aatttgctga gattaacata    26700

gtagtcaatg ctttcaccta aggaaaaaaa catttcaggg agttgactga attttttatc    26760

tattaatgaa taagtgctta cttcttcttt ttgacctaca aaaccaattt taacatttcc    26820

gatatcgcat ttttcaccat gctcatcaaa gacagtaaga taaaacattg taacaaagga    26880

atagtcattc caaccatctg ctcgtaggaa tgccttattt ttttctactg caggaatata    26940

cccgcctctt tcaataacac taaactccaa catatagtaa cccttaattt tattaaaata    27000

accgcaattt atttggcggc aacacaggat ctctctttta agttactctc tattacatac    27060

gttttccatc taaaaattag tagtattgaa cttaacgggg catcgtattg tagttttcca    27120

tatttagctt tctgcttcct tttggataac ccactgttat tcatgttgca tggtgcactg    27180

tttataccaa cgatatagtc tattaatgca tatatagtat cgccgaacga ttagctcttc    27240

aggcttctga agaagcgttt caagtactaa taagccgata gatagccacg gacttcgtag    27300

ccatttttca taagtgttaa cttccgctcc tcgctcataa cagacattca ctacagttat    27360

ggcggaaagg tatgcatgct gggtgtgggg aagtcgtgaa agaaaagaag tcagctgcgt    27420

cgtttgacat cactgctatc ttcttactgg ttatgcaggt cgtagtgggt ggcacacaaa    27480

gctttgcact ggattgcgag gctttgtgct tctctggagt gcgacaggtt tgatgacaaa    27540

aaattagcgc aagaagacaa aaatcacctt gcgctaatgc tctgttacag gtcactaata    27600

ccatctaagt agttgattca tagtgactgc atatgttgtg ttttacagta ttatgtagtc    27660

tgttttttat gcaaaatcta atttaatata ttgatattta tatcatttta cgtttctcgt    27720

tcagcttttt tatactaagt tggcattata aaaaagcatt gcttatcaat ttgttgcaac    27780

gaacaggtca ctatcagtca aaataaaatc attatttgat ttcaattttg tcccactccc    27840

tgcctctgtc atcacgatac tgtgatgcca tggtgtccga cttatgcccg agaagatgtt    27900

gagcaaactt atcgcttatc tgcttctcat agagtcttgc agacaaactg cgcaactcgt    27960

gaaaggtagg cggatcccct tcgaaggaaa gacctgatgc ttttcgtgcg cgcataaaat    28020

accttgatac tgtgccggat gaaagcggtt cgcgacgagt agatgcaatt atggtttctc    28080

cgccaagaat ctctttgcat ttatcaagtg tttccttcat tgatattccg agagcatcaa    28140

tatgcaatgc tgttgggatg gcaattttta cgcctgtttt gctttgctcg acataaagat    28200

atccatctac gatatcagac cacttcattt cgcataaatc accaactcgt tgcccggtaa    28260

caacagccag ttccattgca agtctgagcc aacatggtga tgattctgct gcttgataaa    28320

ttttcaggta ttcgtcagcc gtaagtcttg atctccttac ctctgatttt gctgcgcgag    28380

tggcagcgac atggtttgtt gttatatggc cttcagctat tgcctctcgg aatgcatcgc    28440

tcagtgttga tctgattaac ttggctgacg ccgccttgcc ctcgtctatg tatccattga    28500

gcattgccgc aatttctttt gtggtgatgt cttcaagtgg agcatcaggc agacccctcc    28560

ttattgcttt aattttgctc atgtaattta tgagtgtctt ctgcttgatt cctctgctgg    28620

ccaggatttt ttcgtagcga tcaagccatg aatgtaacgt aacggaatta tcactgttga    28680

ttctcgctgt cagaggcttg tgtttgtgtc ctgaaaataa ctcaatgttg gcctgtatag    28740

cttcagtgat tgcgattcgc ctgtctctgc ctaatccaaa ctctttaccc gtccttgggt    28800

ccctgtagca gtaatatcca ttgtttctta tataaaggtt agggggtaaa tcccggcgct    28860

catgacttcg ccttcttccc atttctgatc ctcttcaaaa ggccacctgt tactggtcga    28920

tttaagtcaa cctttaccgc tgattcgtgg aacagatact ctcttccatc cttaaccgga    28980

ggtgggaata tcctgcattc ccgaacccat cgacgaactg tttcaaggct tcttggacgt    29040

cgctggcgtg cgttccactc ctgaagtgtc aagtacatcg caaagtctcc gcaattacac    29100

gcaagaaaaa accgccatca ggcggcttgg tgttctttca gttcttcaat tcgaatattg    29160

gttacgtctg catgtgctat ctgcgcccat atcatccagt ggtcgtagca gtcgttgatg    29220

ttctccgctt cgataactct gttgaatggc tctccattcc attctcctgt gactcggaag    29280

tgcatttatc atctccataa aacaaaaccc gccgtagcga gttcagataa aataaatccc    29340

cgcgagtgcg aggattgtta tgtaatattg ggtttaatca tctatatgtt ttgtacagag    29400

agggcaagta tcgtttccac cgtactcgtg ataataattt tgcacggtat cagtcatttc    29460

tcgcacattg cagaatgggg atttgtcttc attagactta taaaccttca tggaatattt    29520

gtatgccgac tctatatcta taccttcatc tacataaaca ccttcgtgat gtctgcatgg    29580

agacaagaca ccggatctgc acaacattga taacgcccaa tctttttgct cagactctaa    29640

ctcattgata ctcatttata aactccttgc aatgtatgtc gtttcagcta aacggtatca    29700

gcaatgttta tgtaaagaaa cagtaagata atactcaacc cgatgtttga gtacggtcat    29760

catctgacac tacagactct ggcatcgctg tgaagacgac gcgaaattca gcattttcac    29820

aagcgttatc ttttacaaaa ccgatctcac tctcctttga tgcgaatgcc agcgtcagac    29880

atcatatgca gatactcacc tgcatcctga acccattgac ctccaacccc gtaatagcga    29940

tgcgtaatga tgtcgatagt tactaacggg tcttgttcga ttaactgccg cagaaactct    30000

tccaggtcac cagtgcagtg cttgataaca ggagtcttcc caggatggcg aacaacaaga    30060

aactggtttc cgtcttcacg gacttcgttg ctttccagtt tagcaatacg cttactccca    30120

tccgagataa caccttcgta atactcacgc tgctcgttga gttttgattt tgctgtttca    30180

agctcaacac gcagtttccc tactgttagc gcaatatcct cgttctcctg gtcgcggcgt    30240

ttgatgtatt gctggtttct ttcccgttca tccagcagtt ccagcacaat cgatggtgtt    30300

accaattcat ggaaaaggtc tgcgtcaaat ccccagtcgt catgcattgc ctgctctgcc    30360

gcttcacgca gtgcctgaga gttaatttcg ctcacttcga acctctctgt ttactgataa    30420

gttccagatc ctcctggcaa cttgcacaag tccgacaacc ctgaacgacc aggcgtcttc    30480

gttcatctat cggatcgcca cactcacaac aatgagtggc agatatagcc tggtggttca    30540

ggcggcgcat ttttattgct gtgttgcgct gtaattcttc tatttctgat gctgaatcaa    30600

tgatgtctgc catctttcat taatccctga actgttggtt aatacgcttg agggtgaatg    30660

cgaataataa aaaaggagcc tgtagctccc tgatgatttt gcttttcatg ttcatcgttc    30720

cttaaagacg ccgtttaaca tgccgattgc caggcttaaa tgagtcggtg tgaatcccat    30780

cagcgttacc gtttcgcggt gcttcttcag tacgctacgg caaatgtcat cgacgttttt    30840

atccggaaac tgctgtctgg ctttttttga tttcagaatt agcctgacgg gcaatgctgc    30900

gaagggcgtt ttcctgctga ggtgtcattg aacaagtccc atgtcggcaa gcataagcac    30960

acagaatatg aagcccgctg ccagaaaaat gcattccgtg gttgtcatac ctggtttctc    31020

tcatctgctt ctgctttcgc caccatcatt tccagctttt gtgaaaggga tgcggctaac    31080

gtatgaaatt cttcgtctgt ttctactggt attggcacaa acctgattcc aatttgagca    31140

aggctatgtg ccatctcgat actcgttctt aactcaacag aagatgcttt gtgcatacag    31200

cccctcgttt attatttatc tcctcagcca gccgctgtgc tttcagtgga tttcggataa    31260

cagaaaggcc gggaaatacc cagcctcgct ttgtaacgga gtagacgaaa gtgattgcgc    31320

ctacccggat attatcgtga ggatgcgtca tcgccattgc tccccaaata caaaaccaat    31380

ttcagccagt gcctcgtcca ttttttcgat gaactccggc acgatctcgt caaaactcgc    31440

catgtacttt tcatcccgct caatcacgac ataatgcagg ccttcacgct tcatacgcgg    31500

gtcatagttg gcaaagtacc aggcattttt tcgcgtcacc cacatgctgt actgcacctg    31560

ggccatgtaa gctgacttta tggcctcgaa accaccgagc cggaacttca tgaaatcccg    31620

ggaggtaaac gggcatttca gttcaaggcc gttgccgtca ctgcataaac catcgggaga    31680

gcaggcggta cgcatacttt cgtcgcgata gatgatcggg gattcagtaa cattcacgcc    31740

ggaagtgaat tcaaacaggg ttctggcgtc gttctcgtac tgttttcccc aggccagtgc    31800

tttagcgtta acttccggag ccacaccggt gcaaacctca gcaagcaggg tgtggaagta    31860

ggacattttc atgtcaggcc acttctttcc ggagcggggt tttgctatca cgttgtgaac    31920

ttctgaagcg gtgatgacgc cgagccgtaa tttgtgccac gcatcatccc cctgttcgac    31980

agctctcaca tcgatcccgg tacgctgcag gataatgtcc ggtgtcatgc tgccaccttc    32040

tgctctgcgg ctttctgttt caggaatcca agagctttta ctgcttcggc ctgtgtcagt    32100

tctgacgatg cacgaatgtc gcggcgaaat atctgggaac agagcggcaa taagtcgtca    32160

tcccatgttt tatccagggc gatcagcaga gtgttaatct cctgcatggt ttcatcgtta    32220

accggagtga tgtcgcgttc cggctgacgt tctgcagtgt atgcagtatt ttcgacaatg    32280

cgctcggctt catccttgtc atagatacca gcaaatccga aggccagacg ggcacactga    32340

atcatggctt tatgacgtaa catccgtttg ggatgcgact gccacggccc cgtgatttct    32400

ctgccttcgc gagttttgaa tggttcgcgg cggcattcat ccatccattc ggtaacgcag    32460

atcggatgat tacggtcctt gcggtaaatc cggcatgtac aggattcatt gtcctgctca    32520

aagtccatgc catcaaactg ctggttttca ttgatgatgc gggaccagcc atcaacgccc    32580

accaccggaa cgatgccatt ctgcttatca ggaaaggcgt aaatttcttt cgtccacgga    32640

ttaaggccgt actggttggc aacgatcagt aatgcgatga actgcgcatc gctggcatca    32700

cctttaaatg ccgtctggcg aagagtggtg atcagttcct gtgggtcgac agaatccatg    32760

ccgacacgtt cagccagctt cccagccagc gttgcgagtg cagtactcat tcgttttata    32820

cctctgaatc aatatcaacc tggtggtgag caatggtttc aaccatgtac cggatgtgtt    32880

ctgccatgcg ctcctgaaac tcaacatcgt catcaaacgc acgggtaatg gattttttgc    32940

tggccccgtg gcgttgcaaa tgatcgatgc atagcgattc aaacaggtgc tggggcaggc    33000

ctttttccat gtcgtctgcc agttctgcct ctttctcttc acgggcgagc tgctggtagt    33060

gacgcgccca gctctgagcc tcaagacgat cctgaatgta ataagcgttc atggctgaac    33120

tcctgaaata gctgtgaaaa tatcgcccgc gaaatgccgg gctgattagg aaaacaggaa    33180

agggggttag tgaatgcttt tgcttgatct cagtttcagt attaatatcc attttttata    33240

agcgtcgacg gcttcacgaa acatcttttc atcgccaata aaagtggcga tagtgaattt    33300

agtctggata gccataagtg tttgatccat tctttgggac tcctggctga ttaagtatgt    33360

cgataaggcg tttccatccg tcacgtaatt tacgggtgat tcgttcaagt aaagattcgg    33420

aagggcagcc agcaacaggc caccctgcaa tggcatattg catggtgtgc tccttattta    33480

tacataacga aaaacgcctc gagtgaagcg ttattggtat gcggtaaaac cgcactcagg    33540

cggccttgat agtcatatca tctgaatcaa atattcctga tgtatcgata tcggtaattc    33600

ttattccttc gctaccatcc attggaggcc atccttcctg accatttcca tcattccagt    33660

cgaactcaca cacaacacca tatgcattta agtcgcttga aattgctata agcagagcat    33720

gttgcgccag catgattaat acagcattta atacagagcc gtgtttattg agtcggtatt    33780

cagagtctga ccagaaatta ttaatctggt gaagtttttc ctctgtcatt acgtcatggt    33840

cgatttcaat ttctattgat gctttccagt cgtaatcaat gatgtatttt ttgatgtttg    33900

acatctgttc atatcctcac agataaaaaa tcgccctcac actggagggc aaagaagatt    33960

tccaataatc agaacaagtc ggctcctgtt tagttacgag cgacattgct ccgtgtattc    34020

actcgttgga atgaatacac agtgcagtgt ttattctgtt atttatgcca aaaataaagg    34080

ccactatcag gcagctttgt tgttctgttt accaagttct ctggcaatca ttgccgtcgt    34140

tcgtattgcc catttatcga catatttccc atcttccatt acaggaaaca tttcttcagg    34200

cttaaccatg cattccgatt gcagcttgca tccattgcat cgcttgaatt gtccacacca    34260

ttgattttta tcaatagtcg tagtcatacg gatagtcctg gtattgttcc atcacatcct    34320

gaggatgctc ttcgaactct tcaaattctt cttccatata tcaccttaaa tagtggattg    34380

cggtagtaaa gattgtgcct gtcttttaac cacatcaggc tcggtggttc tcgtgtaccc    34440

ctacagcgag aaatcggata aactattaca acccctacag tttgatgagt atagaaatgg    34500

atccactcgt tattctcgga cgagtgttca gtaatgaacc tctggagaga accatgtata    34560

tgatcgttat ctgggttgga cttctgcttt taagcccaga taactggcct gaatatgtta    34620

atgagagaat cggtattcct catgtgtggc atgttttcgt ctttgctctt gcattttcgc    34680

tagcaattaa tgtgcatcga ttatcagcta ttgccagcgc cagatataag cgatttaagc    34740

taagaaaacg cattaagatg caaaacgata aagtgcgatc agtaattcaa aaccttacag    34800

aagagcaatc tatggttttg tgcgcagccc ttaatgaagg caggaagtat gtggttacat    34860

caaaacaatt cccatacatt agtgagttga ttgagcttgg tgtgttgaac aaaacttttt    34920

cccgatggaa tggaaagcat atattattcc ctattgagga tatttactgg actgaattag    34980

ttgccagcta tgatccatat aatattgaga taaagccaag gccaatatct aagtaactag    35040

ataagaggaa tcgattttcc cttaattttc tggcgtccac tgcatgttat gccgcgttcg    35100

ccaggcttgc tgtaccatgt gcgctgattc ttgcgctcaa tacgttgcag gttgctttca    35160

atctgtttgt ggtattcagc cagcactgta aggtctatcg gatttagtgc gctttctact    35220

cgtgatttcg gtttgcgatt cagcgagaga atagggcggt taactggttt tgcgcttacc    35280

ccaaccaaca ggggatttgc tgctttccat tgagcctgtt tctctgcgcg acgttcgcgg    35340

cggcgtgttt gtgcatccat ctggattctc ctgtcagtta gctttggtgg tgtgtggcag    35400

ttgtagtcct gaacgaaaac cccccgcgat tggcacattg gcagctaatc cggaatcgca    35460

cttacggcca atgcttcgtt tcgtatcaca caccccaaag ccttctgctt tgaatgctgc    35520

ccttcttcag ggcttaattt ttaagagcgt caccttcatg gtggtcagtg cgtcctgctg    35580

atgtgctcag tatcaccgcc agtggtattt atgtcaacac cgccagagat aatttatcac    35640

cgcagatggt tatctgtatg ttttttatat gaatttattt tttgcagggg ggcattgttt    35700

ggtaggtgag agatctgaat tgctatgttt agtgagttgt atctatttat ttttcaataa    35760

atacaattgg ttatgtgttt tgggggcgat cgtgaggcaa agaaaacccg gcgctgaggc    35820

cgggttattc ttgttctctg gtcaaattat atagttggaa aacaaggatg catatatgaa    35880

tgaacgatgc agaggcaatg ccgatggcga tagtgggtat catgtagccg cttatgctgg    35940

aaagaagcaa taacccgcag aaaaacaaag ctccaagctc aacaaaacta agggcataga    36000

caataactac cgatgtcata tacccatact ctctaatctt ggccagtcgg cgcgttctgc    36060

ttccgattag aaacgtcaag gcagcaatca ggattgcaat catggttcct gcatatgatg    36120

acaatgtcgc cccaagacca tctctatgag ctgaaaaaga aacaccagga atgtagtggc    36180

ggaaaaggag atagcaaatg cttacgataa cgtaaggaat tattactatg taaacaccag    36240

gcatgattct gttccgcata attactcctg ataattaatc cttaactttg cccacctgcc    36300

ttttaaaaca ttccagtata tcacttttca ttcttgcgta gcaatatgcc atctcttcag    36360

ctatctcagc attggtgacc ttgttcagag gcgctgagag atggcctttt tctgatagat    36420

aatgttctgt taaaatatct ccggcctcat cttttgcccg caggctaatg tctgaaaatt    36480

gaggtgacgg gttaaaaata atatccttgg caaccttttt tatatccctt ttaaattttg    36540

gcttaatgac tatatccaat gagtcaaaaa gctccccttc aatatctgtt gcccctaaga    36600

cctttaatat atcgccaaat acaggtagct tggcttctac cttcaccgtt gttcggccga    36660

tgaaatgcat atgcataaca tcgtctttgg tggttcccct catcagtggc tctatctgaa    36720

cgcgctctcc actgcttaat gacattcctt tcccgattaa aaaatctgtc agatcggatg    36780

tggtcggccc gaaaacagtt ctggcaaaac caatggtgtc gccttcaaca aacaaaaaag    36840

atgggaatcc caatgattcg tcatctgcga ggctgttctt aatatcttca actgaagctt    36900

tagagcgatt tatcttctga accagactct tgtcatttgt tttggtaaag agaaaagttt    36960

ttccatcgat tttatgaata tacaaataat tggagccaac ctgcaggtga tgattatcag    37020

ccagcagaga attaaggaaa acagacaggt ttattgagcg cttatctttc cctttatttt    37080

tgctgcggta agtcgcataa aaaccattct tcataattca atccatttac tatgttatgt    37140

tctgagggga gtgaaaattc ccctaattcg atgaagattc ttgctcaatt gttatcagct    37200

atgcgccgac cagaacacct tgccgatcag ccaaacgtct cttcaggcca ctgactagcg    37260

ataactttcc ccacaacgga acaactctca ttgcatggga tcattgggta ctgtgggttt    37320

agtggttgta aaaacacctg accgctatcc ctgatcagtt tcttgaaggt aaactcatca    37380

cccccaagtc tggctatgca gaaatcacct ggctcaacag cctgctcagg gtcaacgaga    37440

attaacattc cgtcaggaaa gcttggcttg gagcctgttg gtgcggtcat ggaattacct    37500

tcaacctcaa gccagaatgc agaatcactg gcttttttgg ttgtgcttac ccatctctcc    37560

gcatcacctt tggtaaaggt tctaagctta ggtgagaaca tccctgcctg aacatgagaa    37620

aaaacagggt actcatactc acttctaagt gacggctgca tactaaccgc ttcatacatc    37680

tcgtagattt ctctggcgat tgaagggcta aattcttcaa cgctaacttt gagaattttt    37740

gtaagcaatg cggcgttata agcatttaat gcattgatgc cattaaataa agcaccaacg    37800

cctgactgcc ccatccccat cttgtctgcg acagattcct gggataagcc aagttcattt    37860

ttcttttttt cataaattgc tttaaggcga cgtgcgtcct caagctgctc ttgtgttaat    37920

ggtttctttt ttgtgctcat acgttaaatc tatcaccgca agggataaat atctaacacc    37980

gtgcgtgttg actattttac ctctggcggt gataatggtt gcatgtacta aggaggttgt    38040

atggaacaac gcataaccct gaaagattat gcaatgcgct ttgggcaaac caagacagct    38100

aaagatctcg gcgtatatca aagcgcgatc aacaaggcca ttcatgcagg ccgaaagatt    38160

tttttaacta taaacgctga tggaagcgtt tatgcggaag aggtaaagcc cttcccgagt    38220

aacaaaaaaa caacagcata aataaccccg ctcttacaca ttccagccct gaaaaagggc    38280

atcaaattaa accacaccta tggtgtatgc atttatttgc atacattcaa tcaattgtta    38340

tctaaggaaa tacttacata tggttcgtgc aaacaaacgc aacgaggctc tacgaatcga    38400

gagtgcgttg cttaacaaaa tcgcaatgct tggaactgag aagacagcgg aagctgtggg    38460

cgttgataag tcgcagatca gcaggtggaa gagggactgg attccaaagt tctcaatgct    38520

gcttgctgtt cttgaatggg gggtcgttga cgacgacatg gctcgattgg cgcgacaagt    38580

tgctgcgatt ctcaccaata aaaaacgccc ggcggcaacc gagcgttctg aacaaatcca    38640

gatggagttc tgaggtcatt actggatcta tcaacaggag tcattatgac aaatacagca    38700

aaaatactca acttcggcag aggtaacttt gccggacagg agcgtaatgt ggcagatctc    38760

gatgatggtt acgccagact atcaaatatg ctgcttgagg cttattcggg cgcagatctg    38820

accaagcgac agtttaaagt gctgcttgcc attctgcgta aaacctatgg gtggaataaa    38880

ccaatggaca gaatcaccga ttctcaactt agcgagatta caaagttacc tgtcaaacgg    38940

tgcaatgaag ccaagttaga actcgtcaga atgaatatta tcaagcagca aggcggcatg    39000

tttggaccaa ataaaaacat ctcagaatgg tgcatccctc aaaacgaggg aaaatcccct    39060

aaaacgaggg ataaaacatc cctcaaattg ggggattgct atccctcaaa acagggggac    39120

acaaaagaca ctattacaaa agaaaaaaga aaagattatt cgtcagagaa ttctggcgaa    39180

tcctctgacc agccagaaaa cgacctttct gtggtgaaac cggatgctgc aattcagagc    39240

ggcagcaagt gggggacagc agaagacctg accgccgcag agtggatgtt tgacatggtg    39300

aagactatcg caccatcagc cagaaaaccg aattttgctg ggtgggctaa cgatatccgc    39360

ctgatgcgtg aacgtgacgg acgtaaccac cgcgacatgt gtgtgctgtt ccgctgggca    39420

tgccaggaca acttctggtc cggtaacgtg ctgagcccgg ccaaactccg cgataagtgg    39480

acccaactcg aaatcaaccg taacaagcaa caggcaggcg tgacagccag caaaccaaaa    39540

ctcgacctga caaacacaga ctggatttac ggggtggatc tatgaaaaac atcgccgcac    39600

agatggttaa ctttgaccgt gagcagatgc gtcggatcgc caacaacatg ccggaacagt    39660

acgacgaaaa gccgcaggta cagcaggtag cgcagatcat caacggtgtg ttcagccagt    39720

tactggcaac tttcccggcg agcctggcta accgtgacca gaacgaagtg aacgaaatcc    39780

gtcgccagtg ggttctggct tttcgggaaa acgggatcac cacgatggaa caggttaacg    39840

caggaatgcg cgtagcccgt cggcagaatc gaccatttct gccatcaccc gggcagtttg    39900

ttgcatggtg ccgggaagaa gcatccgtta ccgccggact gccaaacgtc agcgagctgg    39960

ttgatatggt ttacgagtat tgccggaagc gaggcctgta tccggatgcg gagtcttatc    40020

cgtggaaatc aaacgcgcac tactggctgg ttaccaacct gtatcagaac atgcgggcca    40080

atgcgcttac tgatgcggaa ttacgccgta aggccgcaga tgagcttgtc catatgactg    40140

cgagaattaa ccgtggtgag gcgatccctg aaccagtaaa acaacttcct gtcatgggcg    40200

gtagacctct aaatcgtgca caggctctgg cgaagatcgc agaaatcaaa gctaagttcg    40260

gactgaaagg agcaagtgta tgacgggcaa agaggcaatt attcattacc tggggacgca    40320

taatagcttc tgtgcgccgg acgttgccgc gctaacaggc gcaacagtaa ccagcataaa    40380

tcaggccgcg gctaaaatgg cacgggcagg tcttctggtt atcgaaggta aggtctggcg    40440

aacggtgtat taccggtttg ctaccaggga agaacgggaa ggaaagatga gcacgaacct    40500

ggtttttaag gagtgtcgcc agagtgccgc gatgaaacgg gtattggcgg tatatggagt    40560

taaaagatga ccatctacat tactgagcta ataacaggcc tgctggtaat cgcaggcctt    40620

tttatttggg ggagagggaa gtcatgaaaa aactaacctt tgaaattcga tctccagcac    40680

atcagcaaaa cgctattcac gcagtacagc aaatccttcc agacccaacc aaaccaatcg    40740

tagtaaccat tcaggaacgc aaccgcagct tagaccaaaa caggaagcta tgggcctgct    40800

taggtgacgt ctctcgtcag gttgaatggc atggtcgctg gctggatgca gaaagctgga    40860

agtgtgtgtt taccgcagca ttaaagcagc aggatgttgt tcctaacctt gccgggaatg    40920

gctttgtggt aataggccag tcaaccagca ggatgcgtgt aggcgaattt gcggagctat    40980

tagagcttat acaggcattc ggtacagagc gtggcgttaa gtggtcagac gaagcgagac    41040

tggctctgga gtggaaagcg agatggggag acagggctgc atgataaatg tcgttagttt    41100

ctccggtggc aggacgtcag catatttgct ctggctaatg gagcaaaagc gacgggcagg    41160

taaagacgtg cattacgttt tcatggatac aggttgtgaa catccaatga catatcggtt    41220

tgtcagggaa gttgtgaagt tctgggatat accgctcacc gtattgcagg ttgatatcaa    41280

cccggagctt ggacagccaa atggttatac ggtatgggaa ccaaaggata ttcagacgcg    41340

aatgcctgtt ctgaagccat ttatcgatat ggtaaagaaa tatggcactc catacgtcgg    41400

cggcgcgttc tgcactgaca gattaaaact cgttcccttc accaaatact gtgatgacca    41460

tttcgggcga gggaattaca ccacgtggat tggcatcaga gctgatgaac cgaagcggct    41520

aaagccaaag cctggaatca gatatcttgc tgaactgtca gactttgaga aggaagatat    41580

cctcgcatgg tggaagcaac aaccattcga tttgcaaata ccggaacatc tcggtaactg    41640

catattctgc attaaaaaat caacgcaaaa aatcggactt gcctgcaaag atgaggaggg    41700

attgcagcgt gtttttaatg aggtcatcac gggatcccat gtgcgtgacg gacatcggga    41760

aacgccaaag gagattatgt accgaggaag aatgtcgctg gacggtatcg cgaaaatgta    41820

ttcagaaaat gattatcaag ccctgtatca ggacatggta cgagctaaaa gattcgatac    41880

cggctcttgt tctgagtcat gcgaaatatt tggagggcag cttgatttcg acttcgggag    41940

ggaagctgca tgatgcgatg ttatcggtgc ggtgaatgca aagaagataa ccgcttccga    42000

ccaaatcaac cttactggaa tcgatggtgt ctccggtgtg aaagaacacc aacaggggtg    42060

ttaccactac cgcaggaaaa ggaggacgtg tggcgagaca gcgacgaagt atcaccgaca    42120

taatctgcga aaactgcaaa taccttccaa cgaaacgcac cagaaataaa cccaagccaa    42180

tcccaaaaga atctgacgta aaaaccttca actacacggc tcacctgtgg gatatccggt    42240

ggctaagacg tcgtgcgagg aaaacaaggt gattgaccaa aatcgaagtt acgaacaaga    42300

aagcgtcgag cgagctttaa cgtgcgctaa ctgcggtcag aagctgcatg tgctggaagt    42360

tcacgtgtgt gagcactgct gcgcagaact gatgagcgat ccgaatagct cgatgcacga    42420

ggaagaagat gatggctaaa ccagcgcgaa gacgatgtaa aaacgatgaa tgccgggaat    42480

ggtttcaccc tgcattcgct aatcagtggt ggtgctctcc agagtgtgga accaagatag    42540

cactcgaacg acgaagtaaa gaacgcgaaa aagcggaaaa agcagcagag aagaaacgac    42600

gacgagagga gcagaaacag aaagataaac ttaagattcg aaaactcgcc ttaaagcccc    42660

gcagttactg gattaaacaa gcccaacaag ccgtaaacgc cttcatcaga gaaagagacc    42720

gcgacttacc atgtatctcg tgcggaacgc tcacgtctgc tcagtgggat gccggacatt    42780

accggacaac tgctgcggca cctcaactcc gatttaatga acgcaatatt cacaagcaat    42840

gcgtggtgtg caaccagcac aaaagcggaa atctcgttcc gtatcgcgtc gaactgatta    42900

gccgcatcgg gcaggaagca gtagacgaaa tcgaatcaaa ccataaccgc catcgctgga    42960

ctatcgaaga gtgcaaggcg atcaaggcag agtaccaaca gaaactcaaa gacctgcgaa    43020

atagcagaag tgaggccgca tgacgttctc agtaaaaacc attccagaca tgctcgttga    43080

aacatacgga aatcagacag aagtagcacg cagactgaaa tgtagtcgcg gtacggtcag    43140

aaaatacgtt gatgataaag acgggaaaat gcacgccatc gtcaacgacg ttctcatggt    43200

tcatcgcgga tggagtgaaa gagatgcgct attacgaaaa aattgatggc agcaaatacc    43260

gaaatatttg ggtagttggc gatctgcacg gatgctacac gaacctgatg aacaaactgg    43320

atacgattgg attcgacaac aaaaaagacc tgcttatctc ggtgggcgat ttggttgatc    43380

gtggtgcaga gaacgttgaa tgcctggaat taatcacatt cccctggttc agagctgtac    43440

gtggaaacca tgagcaaatg atgattgatg gcttatcaga gcgtggaaac gttaatcact    43500

ggctgcttaa tggcggtggc tggttcttta atctcgatta cgacaaagaa attctggcta    43560

aagctcttgc ccataaagca gatgaacttc cgttaatcat cgaactggtg agcaaagata    43620

aaaaatatgt tatctgccac gccgattatc cctttgacga atacgagttt ggaaagccag    43680

ttgatcatca gcaggtaatc tggaaccgcg aacgaatcag caactcacaa aacgggatcg    43740

tgaaagaaat caaaggcgcg gacacgttca tctttggtca tacgccagca gtgaaaccac    43800

tcaagtttgc caaccaaatg tatatcgata ccggcgcagt gttctgcgga aacctaacat    43860

tgattcaggt acagggagaa ggcgcatgag actcgaaagc gtagctaaat ttcattcgcc    43920

aaaaagcccg atgatgagcg actcaccacg ggccacggct tctgactctc tttccggtac    43980

tgatgtgatg gctgctatgg ggatggcgca atcacaagcc ggattcggta tggctgcatt    44040

ctgcggtaag cacgaactca gccagaacga caaacaaaag gctatcaact atctgatgca    44100

atttgcacac aaggtatcgg ggaaataccg tggtgtggca aagcttgaag gaaatactaa    44160

ggcaaaggta ctgcaagtgc tcgcaacatt cgcttatgcg gattattgcc gtagtgccgc    44220

gacgccgggg gcaagatgca gagattgcca tggtacaggc cgtgcggttg atattgccaa    44280

aacagagctg tgggggagag ttgtcgagaa agagtgcgga agatgcaaag gcgtcggcta    44340

ttcaaggatg ccagcaagcg cagcatatcg cgctgtgacg atgctaatcc caaaccttac    44400

ccaacccacc tggtcacgca ctgttaagcc gctgtatgac gctctggtgg tgcaatgcca    44460

caaagaagag tcaatcgcag acaacatttt gaatgcggtc acacgttagc agcatgattg    44520

ccacggatgg caacatatta acggcatgat attgacttat tgaataaaat tgggtaaatt    44580

tgactcaacg atgggttaat tcgctcgttg tggtagtgag atgaaaagag gcggcgctta    44640

ctaccgattc cgcctagttg gtcacttcga cgtatcgtct ggaactccaa ccatcgcagg    44700

cagagaggtc tgcaaaatgc aatcccgaaa cagttcgcag gtaatagtta gagcctgcat    44760

aacggtttcg ggatttttta tatctgcaca acaggtaaga gcattgagtc gataatcgtg    44820

aagagtcggc gagcctggtt agccagtgct ctttccgttg tgctgaatta agcgaatacc    44880

ggaagcagaa ccggatcacc aaatgcgtac aggcgtcatc gccgcccagc aacagcacaa    44940

cccaaactga gccgtagcca ctgtctgtcc tgaattcatt agtaatagtt acgctgcggc    45000

cttttacaca tgaccttcgt gaaagcgggt ggcaggaggt cgcgctaaca acctcctgcc    45060

gttttgcccg tgcatatcgg tcacgaacaa atctgattac taaacacagt agcctggatt    45120

tgttctatca gtaatcgacc ttattcctaa ttaaatagag caaatcccct tattgggggt    45180

aagacatgaa gatgccagaa aaacatgacc tgttggccgc cattctcgcg gcaaaggaac    45240

aaggcatcgg ggcaatcctt gcgtttgcaa tggcgtacct tcgcggcaga tataatggcg    45300

gtgcgtttac aaaaacagta atcgacgcaa cgatgtgcgc cattatcgcc tagttcattc    45360

gtgaccttct cgacttcgcc ggactaagta gcaatctcgc ttatataacg agcgtgttta    45420

tcggctacat cggtactgac tcgattggtt cgcttatcaa acgcttcgct gctaaaaaag    45480

ccggagtaga agatggtaga aatcaataat caacgtaagg cgttcctcga tatgctggcg    45540

tggtcggagg gaactgataa cggacgtcag aaaaccagaa atcatggtta tgacgtcatt    45600

gtaggcggag agctatttac tgattactcc gatcaccctc gcaaacttgt cacgctaaac    45660

ccaaaactca aatcaacagg cgccggacgc taccagcttc tttcccgttg gtgggatgcc    45720

taccgcaagc agcttggcct gaaagacttc tctccgaaaa gtcaggacgc tgtggcattg    45780

cagcagatta aggagcgtgg cgctttacct atgattgatc gtggtgatat ccgtcaggca    45840

atcgaccgtt gcagcaatat ctgggcttca ctgccgggcg ctggttatgg tcagttcgag    45900

cataaggctg acagcctgat tgcaaaattc aaagaagcgg gcggaacggt cagagagatt    45960

gatgtatgag cagagtcacc gcgattatct ccgctctggt tatctgcatc atcgtctgcc    46020

tgtcatgggc tgttaatcat taccgtgata acgccattac ctacaaagcc cagcgcgaca    46080

aaaatgccag agaactgaag ctggcgaacg cggcaattac tgacatgcag atgcgtcagc    46140

gtgatgttgc tgcgctcgat gcaaaataca cgaaggagtt agctgatgct aaagctgaaa    46200

atgatgctct gcgtgatgat gttgccgctg gtcgtcgtcg gttgcacatc aaagcagtct    46260

gtcagtcagt gcgtgaagcc accaccgcct ccggcgtgga taatgcagcc tccccccgac    46320

tggcagacac cgctgaacgg gattatttca ccctcagaga gaggctgatc actatgcaaa    46380

aacaactgga aggaacccag aagtatatta atgagcagtg cagatagagt tgcccatatc    46440

gatgggcaac tcatgcaatt attgtgagca atacacacgc gcttccagcg gagtataaat    46500

gcctaaagta ataaaaccga gcaatccatt tacgaatgtt tgctgggttt ctgttttaac    46560

aacattttct gcgccgccac aaattttggc tgcatcgaca gttttcttct gcccaattcc    46620

agaaacgaag aaatgatggg tgatggtttc ctttggtgct actgctgccg gtttgttttg    46680

aacagtaaac gtctgttgag cacatcctgt aataagcagg gccagcgcag tagcgagtag    46740

catttttttc atggtgttat tcccgatgct ttttgaagtt cgcagaatcg tatgtgtaga    46800

aaattaaaca aaccctaaac aatgagttga aatttcatat tgttaatatt tattaatgta    46860

tgtcaggtgc gatgaatcgt cattgtattc ccggattaac tatgtccaca gccctgacgg    46920

ggaacttctc tgcgggagtg tccgggaata attaaaacga tgcacacagg gtttagcgcg    46980

tacacgtatt gcattatgcc aacgccccgg tgctgacacg gaagaaaccg gacgttatga    47040

tttagcgtgg aaagatttgt gtagtgttct gaatgctctc agtaaatagt aatgaattat    47100

caaaggtata gtaatatctt ttatgttcat ggatatttgt aacccatcgg aaaactcctg    47160

ctttagcaag attttccctg tattgctgaa atgtgatttc tcttgatttc aacctatcat    47220

aggacgtttc tataagatgc gtgtttcttg agaatttaac atttacaacc tttttaagtc    47280

cttttattaa cacggtgtta tcgttttcta acacgatgtg aatattatct gtggctagat    47340

agtaaatata atgtgagacg ttgtgacgtt ttagttcaga ataaaacaat tcacagtcta    47400

aatcttttcg cacttgatcg aatatttctt taaaaatggc aacctgagcc attggtaaaa    47460

ccttccatgt gatacgaggg cgcgtagttt gcattatcgt ttttatcgtt tcaatctggt    47520

ctgacctcct tgtgttttgt tgatgattta tgtcaaatat taggaatgtt ttcacttaat    47580

agtattggtt gcgtaacaaa gtgcggtcct gctggcattc tggagggaaa tacaaccgac    47640

agatgtatgt aaggccaacg tgctcaaatc ttcatacaga aagatttgaa gtaatatttt    47700

aaccgctaga tgaagagcaa gcgcatggag cgacaaaatg aataaagaac aatctgctga    47760

tgatccctcc gtggatctga ttcgtgtaaa aaatatgctt aatagcacca tttctatgag    47820

ttaccctgat gttgtaattg catgtataga acataaggtg tctctggaag cattcagagc    47880

aattgaggca gcgttggtga agcacgataa taatatgaag gattattccc tggtggttga    47940

ctgatcacca taactgctaa tcattcaaac tatttagtct gtgacagagc caacacgcag    48000

tctgtcactg tcaggaaagt ggtaaaactg caactcaatt actgcaatgc cctcgtaatt    48060

aagtgaattt acaatatcgt cctgttcgga gggaagaacg cgggatgttc attcttcatc    48120

acttttaatt gatgtatatg ctctcttttc tgacgttagt ctccgacggc aggcttcaat    48180

gacccaggct gagaaattcc cggacccttt ttgctcaaga gcgatgttaa tttgttcaat    48240

catttggtta ggaaagcgga tgttgcgggt tgttgttctg cgggttctgt tcttcgttga    48300

catgaggttg ccccgtattc agtgtcgctg atttgtattg tctgaagttg tttttacgtt    48360

aagttgatgc agatcaatta atacgatacc tgcgtcataa ttgattattt gacgtggttt    48420

gatggcctcc acgcacgttg tgatatgtag atgataatca ttatcacttt acgggtcctt    48480

tccggtgatc cgacaggtta cg                                             48502


<210>  32
<211>  12
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 1.

<400>  32
tttttttttt tt                                                           12


<210>  33
<211>  51
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 1.

<400>  33
ggttgtttct gttggtgctg atattgcggc gtctgcttgg gtgtttaacc t                51


<210>  34
<211>  68
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 1.

<400>  34
ggttaaacac ccaagcagac gccgcaatat cagcaccaac agaaacaacc tttgaggcga       60

gcggtcaa                                                                68


<210>  35
<211>  15
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 1.

<400>  35
ttgaccgctc gcctc                                                        15


<210>  36
<211>  53
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 2.

<400>  36
gatctgaagc ggcgcacgaa aaacgcgaaa gcgtttcacg ataatgcgaa aac              53


<210>  37
<211>  54
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Sequence used in Example 2.

<400>  37
ttttgttttc gcatttatcg tgaaacgctt tcgcgttttt cgtgcgccgc ttca             54


