                         SEQUENCE LISTING

<110>  Romesberg, Floyd
       Malyshev, Denis
 
<120>  IMPORT OF UNNATURAL OR MODIFIED NUCLEOSIDE TRIPHOSPHATES INTO 
       CELLS VIA NUCLEIC ACID TRIPHOSPHATE TRANSPORTERS

<130>  46085-701.601

<150>  US 61/977,439
<151>  2014-04-09

<150>  US 61/977,430
<151>  2014-04-09

<160>  50    

<170>  PatentIn version 3.5

<210>  1
<211>  575
<212>  PRT
<213>  Phaeodactylum tricornutum

<400>  1

Met Arg Pro Tyr Pro Thr Ile Ala Leu Ile Ser Val Phe Leu Ser Ala 
1               5                   10                  15      


Ala Thr Arg Ile Ser Ala Thr Ser Ser His Gln Ala Ser Ala Leu Pro 
            20                  25                  30          


Val Lys Lys Gly Thr His Val Pro Asp Ser Pro Lys Leu Ser Lys Leu 
        35                  40                  45              


Tyr Ile Met Ala Lys Thr Lys Ser Val Ser Ser Ser Phe Asp Pro Pro 
    50                  55                  60                  


Arg Gly Gly Ser Thr Val Ala Pro Thr Thr Pro Leu Ala Thr Gly Gly 
65                  70                  75                  80  


Ala Leu Arg Lys Val Arg Gln Ala Val Phe Pro Ile Tyr Gly Asn Gln 
                85                  90                  95      


Glu Val Thr Lys Phe Leu Leu Ile Gly Ser Ile Lys Phe Phe Ile Ile 
            100                 105                 110         


Leu Ala Leu Thr Leu Thr Arg Asp Thr Lys Asp Thr Leu Ile Val Thr 
        115                 120                 125             


Gln Cys Gly Ala Glu Ala Ile Ala Phe Leu Lys Ile Tyr Gly Val Leu 
    130                 135                 140                 


Pro Ala Ala Thr Ala Phe Ile Ala Leu Tyr Ser Lys Met Ser Asn Ala 
145                 150                 155                 160 


Met Gly Lys Lys Met Leu Phe Tyr Ser Thr Cys Ile Pro Phe Phe Thr 
                165                 170                 175     


Phe Phe Gly Leu Phe Asp Val Phe Ile Tyr Pro Asn Ala Glu Arg Leu 
            180                 185                 190         


His Pro Ser Leu Glu Ala Val Gln Ala Ile Leu Pro Gly Gly Ala Ala 
        195                 200                 205             


Ser Gly Gly Met Ala Val Leu Ala Lys Ile Ala Thr His Trp Thr Ser 
    210                 215                 220                 


Ala Leu Phe Tyr Val Met Ala Glu Ile Tyr Ser Ser Val Ser Val Gly 
225                 230                 235                 240 


Leu Leu Phe Trp Gln Phe Ala Asn Asp Val Val Asn Val Asp Gln Ala 
                245                 250                 255     


Lys Arg Phe Tyr Pro Leu Phe Ala Gln Met Ser Gly Leu Ala Pro Val 
            260                 265                 270         


Leu Ala Gly Gln Tyr Val Val Arg Phe Ala Ser Lys Ala Val Asn Phe 
        275                 280                 285             


Glu Ala Ser Met His Arg Leu Thr Ala Ala Val Thr Phe Ala Gly Ile 
    290                 295                 300                 


Met Ile Cys Ile Phe Tyr Gln Leu Ser Ser Ser Tyr Val Glu Arg Thr 
305                 310                 315                 320 


Glu Ser Ala Lys Pro Ala Ala Asp Asn Glu Gln Ser Ile Lys Pro Lys 
                325                 330                 335     


Lys Lys Lys Pro Lys Met Ser Met Val Glu Ser Gly Lys Phe Leu Ala 
            340                 345                 350         


Ser Ser Gln Tyr Leu Arg Leu Ile Ala Met Leu Val Leu Gly Tyr Gly 
        355                 360                 365             


Leu Ser Ile Asn Phe Thr Glu Ile Met Trp Lys Ser Leu Val Lys Lys 
    370                 375                 380                 


Gln Tyr Pro Asp Pro Leu Asp Tyr Gln Arg Phe Met Gly Asn Phe Ser 
385                 390                 395                 400 


Ser Ala Val Gly Leu Ser Thr Cys Ile Val Ile Phe Phe Gly Val His 
                405                 410                 415     


Val Ile Arg Leu Leu Gly Trp Lys Val Gly Ala Leu Ala Thr Pro Gly 
            420                 425                 430         


Ile Met Ala Ile Leu Ala Leu Pro Phe Phe Ala Cys Ile Leu Leu Gly 
        435                 440                 445             


Leu Asp Ser Pro Ala Arg Leu Glu Ile Ala Val Ile Phe Gly Thr Ile 
    450                 455                 460                 


Gln Ser Leu Leu Ser Lys Thr Ser Lys Tyr Ala Leu Phe Asp Pro Thr 
465                 470                 475                 480 


Thr Gln Met Ala Tyr Ile Pro Leu Asp Asp Glu Ser Lys Val Lys Gly 
                485                 490                 495     


Lys Ala Ala Ile Asp Val Leu Gly Ser Arg Ile Gly Lys Ser Gly Gly 
            500                 505                 510         


Ser Leu Ile Gln Gln Gly Leu Val Phe Val Phe Gly Asn Ile Ile Asn 
        515                 520                 525             


Ala Ala Pro Val Val Gly Val Val Tyr Tyr Ser Val Leu Val Ala Trp 
    530                 535                 540                 


Met Ser Ala Ala Gly Arg Leu Ser Gly Leu Phe Gln Ala Gln Thr Glu 
545                 550                 555                 560 


Met Asp Lys Ala Asp Lys Met Glu Ala Lys Thr Asn Lys Glu Lys 
                565                 570                 575 


<210>  2
<211>  1728
<212>  DNA
<213>  Phaeodactylum tricornutum

<400>  2
atgagaccat atccgacgat tgccttgatt tcggtttttc tttcggcggc gactcgtatt       60

tcggccactt cctctcatca agcaagtgca cttcctgtca aaaagggaac gcatgtcccg      120

gactctccga agttgtcaaa gctatatatc atggccaaaa ccaagagtgt atcctcgtcc      180

ttcgaccccc ctcggggagg cagtactgtt gcgccaacta caccgttggc aaccggcggt      240

gcgctccgca aagtgcgaca agccgtcttt cccatctacg gaaaccaaga agtcaccaaa      300

tttctgctca tcggatccat taaattcttt ataatcttgg cactcacgct cacgcgtgat      360

accaaggaca cgttgattgt cacgcaatgt ggtgccgaag cgattgcctt tctcaaaata      420

tacggggtgc tacccgcagc gaccgcattt atcgcgctct attccaaaat gtccaacgcc      480

atgggcaaaa aaatgctatt ttattccact tgcattcctt tctttacctt tttcgggctg      540

tttgatgttt tcatttaccc gaacgcggag cgactgcacc ctagtttgga agccgtgcag      600

gcaattctcc cgggcggtgc cgcatctggc ggcatggcgg ttctggccaa gattgcgaca      660

cactggacat ccgccttatt ttacgtcatg gcggaaatat attcttccgt atcggtgggg      720

ctattgtttt ggcagtttgc gaacgacgtc gtcaacgtgg atcaggccaa gcgcttttat      780

ccattatttg ctcaaatgag tggcctcgct ccagttttag cgggccagta tgtggtacgg      840

tttgccagca aagcggtcaa ctttgaggca tccatgcatc gactcacggc ggccgtaaca      900

tttgctggta ttatgatttg catcttttac caactcagtt cgtcatatgt ggagcgaacg      960

gaatcagcaa agccagcggc agataacgag cagtctatca aaccgaaaaa gaagaaaccc     1020

aaaatgtcca tggttgaatc ggggaaattt ctcgcgtcaa gtcagtacct gcgtctaatt     1080

gccatgctgg tgctgggata cggcctcagt attaacttta ccgaaatcat gtggaaaagc     1140

ttggtgaaga aacaatatcc agacccgcta gattatcaac gatttatggg taacttctcg     1200

tcagcggttg gtttgagcac atgcattgtt attttcttcg gtgtgcacgt gatccgtttg     1260

ttggggtgga aagtcggagc gttggctaca cctgggatca tggccattct agcgttaccc     1320

ttttttgctt gcattttgtt gggtttggat agtccagcac gattggagat cgccgtaatc     1380

tttggaacaa ttcagagttt gctgagcaaa acctccaagt atgccctttt cgaccctacc     1440

acacaaatgg cttatattcc tctggacgac gaatcaaagg tcaaaggaaa agcggcaatt     1500

gatgttttgg gatcgcggat tggaaagagt ggaggctcac tgatccagca gggcttggtc     1560

tttgtttttg gaaatatcat taatgccgca cctgtagtag gggttgtcta ctacagtgtc     1620

cttgttgcgt ggatgagcgc agctggccga ctaagtgggc tttttcaagc acaaacagaa     1680

atggataagg ccgacaaaat ggaggcaaag accaacaaag aaaagtag                  1728


<210>  3
<211>  515
<212>  PRT
<213>  Protochlamydia amoebophila

<400>  3

Met Ser Gln Gln Glu Ser Glu Phe Gly Lys Leu Arg Ala Phe Phe Trp 
1               5                   10                  15      


Pro Ile His Gly His Glu Val Lys Lys Val Leu Pro Met Met Leu Met 
            20                  25                  30          


Leu Phe Leu Ile Cys Phe Asn Tyr Ser Ile Leu Arg Asn Val Lys Asp 
        35                  40                  45              


Ala Ile Val Val Thr Ala Lys Ala Ser Gly Ala Glu Val Ile Pro Phe 
    50                  55                  60                  


Ile Lys Val Trp Val Leu Leu Pro Thr Ala Val Leu Phe Thr Leu Ile 
65                  70                  75                  80  


Phe Thr Lys Leu Ser Asn Arg Phe Ser Gln Glu Lys Val Phe Tyr Ile 
                85                  90                  95      


Val Ile Ser Thr Phe Leu Leu Phe Phe Gly Ser Phe Thr Tyr Ile Phe 
            100                 105                 110         


Tyr Pro Leu Arg Asp Val Leu His Pro His Gln Leu Cys Asp Tyr Leu 
        115                 120                 125             


Glu Thr Ile Leu Pro Ala Gly Phe Lys Gly Leu Ile Ala Met Phe Arg 
    130                 135                 140                 


Asn Trp Ser Phe Thr Leu Phe Tyr Val Ile Cys Glu Leu Trp Gly Ser 
145                 150                 155                 160 


Ile Val Leu Thr Val Leu Phe Trp Gly Phe Ala Asn Glu Ile Thr Lys 
                165                 170                 175     


Met Thr Glu Ala Arg Arg Phe Tyr Ser Met Leu Gly Val Ile Ala Ser 
            180                 185                 190         


Phe Ala Ala Thr Ile Ala Gly Ile Ile Ala Asn Leu Leu Ser Asn Asp 
        195                 200                 205             


Gln Ser Trp Glu Gln Thr Leu Asn Ile Leu Met Val Ala Val Ile Val 
    210                 215                 220                 


Ser Gly Thr Ile Ala Met Val Ile Phe Arg Trp Met Asn Lys Asn Val 
225                 230                 235                 240 


Asn Gly Pro Glu Phe Gln Glu Phe His Glu Ala Lys Arg Ile Gln Lys 
                245                 250                 255     


Met Lys Lys Arg Leu Ser Ile Arg Glu Ser Phe Thr Tyr Leu Ala Asn 
            260                 265                 270         


Ser Lys Tyr Leu Ile Cys Ile Ala Val Leu Val Ile Ser Tyr Asn Leu 
        275                 280                 285             


Val Ile Asn Leu Val Glu Ile Val Trp Lys Asp Gln Leu Arg Gln Leu 
    290                 295                 300                 


Tyr Ser Ser Ala Leu Asp Tyr Asn Arg Tyr Met Asn Asn Met Thr Ser 
305                 310                 315                 320 


Ala Val Gly Ile Ile Ala Thr Ile Thr Ser Leu Phe Met Ser Thr Met 
                325                 330                 335     


Ile Thr Arg Phe Gly Trp Thr Arg Thr Ala Leu Val Thr Pro Thr Ile 
            340                 345                 350         


Met Leu Val Thr Ser Val Gly Phe Phe Ala Phe Met Leu Phe Arg Asn 
        355                 360                 365             


Asp Leu Ala Asp Pro Val Tyr Ile Leu Thr Gly Thr Thr Pro Leu Thr 
    370                 375                 380                 


Ile Ala Val Phe Phe Gly Ala Ala Gln Val Cys Met Ser Lys Ala Cys 
385                 390                 395                 400 


Lys Tyr Ser Val Phe Asp Ser Thr Lys Glu Met Ala Phe Ile Pro Leu 
                405                 410                 415     


Asp Tyr Glu Ser Lys Leu Lys Gly Lys Ala Ala Ile Asp Gly Val Gly 
            420                 425                 430         


Ser Arg Leu Gly Lys Ser Gly Gly Ser Leu Ile His Gln Ser Leu Leu 
        435                 440                 445             


Met Ile Phe Ala Thr Val Ser Ser Ser Ala Pro Tyr Val Ala Val Ile 
    450                 455                 460                 


Leu Ile Gly Val Ile Ile Val Trp Met Leu Cys Val Arg Ser Leu Gly 
465                 470                 475                 480 


Lys Gln Phe Ala Ala Ile Ile Gly Glu Lys Ala Arg Glu Asp Ile Gly 
                485                 490                 495     


Glu Ser Thr Pro Arg Thr Ser Glu Glu Gln Val Leu His Pro Leu Lys 
            500                 505                 510         


Ala Ala Ser 
        515 


<210>  4
<211>  1551
<212>  DNA
<213>  Protochlamydia amoebophila

<400>  4
atgtctcaac aagaatcaga gtttggtaaa ttgagggcat ttttttggcc tattcacggc       60

catgaagtca aaaaagtgct gccgatgatg ttgatgctat ttttgatttg tttcaactat      120

agtattttac gcaatgttaa agatgctatt gttgtgactg ctaaggcttc aggggctgaa      180

gttattccat ttattaaagt atgggtgctg ttacccacgg cagtcttatt tactttaatt      240

tttactaagt tgtctaaccg ttttagccaa gaaaaagttt tttatattgt catttctaca      300

tttttgctat tttttggttc gtttacttat attttttatc ctttacgtga cgtactacat      360

cctcatcaac tatgcgatta cttagaaacg attttaccag cgggatttaa aggattaatt      420

gccatgttcc gtaattggtc atttactttg ttttatgtaa tttgtgaact ttggggcagt      480

attgttttaa ctgtcctttt ttggggattt gcgaatgaaa tcacaaaaat gactgaagct      540

cgtcgttttt atagtatgct tggtgtcatt gcaagttttg ccgcgacgat agcaggaatc      600

atagccaatc ttctttctaa tgatcaaagt tgggaacaga ctttaaatat tctcatggtt      660

gctgtaattg taagtggaac gatagccatg gttatttttc gttggatgaa taaaaatgta      720

ctcaatggcc cagaattcca agaattccat gaagcaaaac gcattcaaaa aatgaaaaaa      780

agattatcga tccgagaaag ttttacctat ctcgctaatt ctaaatatct tatttgtatt      840

gcagttttag ttatttctta taatcttgtc attaacttag ttgaaattgt atggaaagac      900

cagcttcgcc aactttattc gtcagccctt gattataatc gctatatgaa taacatgaca      960

tcagcagtcg gaattattgc cacaatcaca tccttattta tgtctacaat gattactcgg     1020

tttggatgga cacggacagc tctagtaaca ccgactatta tgcttgtcac aagtgtggga     1080

ttttttgctt ttatgctatt tcgaaatgat ttggctgatc ctgtttatat attaacagga     1140

acgacacctt taactatagc cgtctttttt ggtgcagctc aagtctgcat gagtaaagcc     1200

tgtaagtatt ctgtttttga ttctacaaaa gaaatggctt ttatccctct ggattatgaa     1260

agtaaattga aaggaaaagc tgcgattgat ggtgtgggtt ctcgtcttgg taaatcgggc     1320

ggttccttaa ttcatcaaag tttattgatg atttttgcaa ctgttagctc cagcgctcct     1380

tatgtagctg tgatcttaat tggcgttatc attgtttgga tgctctgcgt acgttcatta     1440

ggtaagcaat ttgctgctat tattggggaa aaggctcgag aagatattgg tgaatctact     1500

ccaagaacga gtgaagagca agttttacat cccttaaaag ctgcatctta a              1551


<210>  5
<211>  536
<212>  PRT
<213>  Protochlamydia amoebophila

<400>  5

Met Ser Gln Thr Pro Thr Gly Ser Arg Glu Phe Ser Pro Trp Arg Ser 
1               5                   10                  15      


Asn Leu Trp Pro Val His Arg Tyr Glu Leu Lys Lys Leu Ile Pro Met 
            20                  25                  30          


Leu Leu Ile Phe Phe Phe Ile Ser Phe Asp Tyr Asn Ile Leu Arg Thr 
        35                  40                  45              


Leu Lys Asp Ser Leu Leu Ile Thr Ala Lys Ser Ser Gly Ala Glu Val 
    50                  55                  60                  


Ile Pro Phe Val Lys Val Trp Ala Met Phe Pro Gly Ala Ile Leu Met 
65                  70                  75                  80  


Thr Leu Leu Phe Thr Trp Leu Ser Asn Arg Leu Ser Arg Glu Ile Val 
                85                  90                  95      


Phe Tyr Leu Ile Thr Ser Leu Phe Leu Ser Tyr Phe Phe Ile Phe Thr 
            100                 105                 110         


Phe Ile Leu Tyr Pro Ile Arg Asp Ile Ile His Pro His Ala Thr Ala 
        115                 120                 125             


Asp Tyr Leu Glu Thr Ile Leu Pro Ile Gly Phe Lys Gly Leu Val Ala 
    130                 135                 140                 


Met Phe Arg Tyr Trp Thr Phe Thr Ile Phe Tyr Val Met Ser Glu Leu 
145                 150                 155                 160 


Trp Gly Ser Thr Val Leu Phe Val Leu Phe Trp Gly Phe Ala Asn Gln 
                165                 170                 175     


Val Thr Lys Ile Ser Glu Ala Lys Arg Phe Tyr Gly Leu Phe Gly Val 
            180                 185                 190         


Gly Ala Asn Leu Ser Gly Ile Phe Ala Gly Gln Ala Ser Val Tyr Cys 
        195                 200                 205             


Cys Gln Phe Asn Lys Gln Asn Asp Leu Gly Ile Leu Gly Ser Asp Pro 
    210                 215                 220                 


Trp Tyr Gln Ser Leu Val Met Met Val Ser Leu Ile Leu Leu Ser Gly 
225                 230                 235                 240 


Ala Leu Val Leu Ala Leu Phe Arg Trp Met Asn Val Glu Val Leu Thr 
                245                 250                 255     


Asp Lys Arg Phe Tyr Asp Pro Ser Ser Val Lys Thr Glu Gly Glu Ala 
            260                 265                 270         


Lys Gly Lys Leu Ser Leu Lys Gln Ser Phe Ser Tyr Leu Leu Arg Ser 
        275                 280                 285             


Asn Tyr Leu Leu Cys Ile Ala Leu Ile Val Ile Ser Tyr Asn Leu Val 
    290                 295                 300                 


Ile Asn Leu Thr Glu Val Leu Trp Lys His Gln Val Arg Glu Leu Tyr 
305                 310                 315                 320 


Pro Asp Pro Asn Asp Tyr Thr Leu Tyr Met Asn His Ile Val Ser Ile 
                325                 330                 335     


Ile Gly Val Val Ala Thr Leu Ser Ser Leu Phe Val Ser Gly Asn Ala 
            340                 345                 350         


Ile Arg Lys Phe Gly Trp Thr Thr Thr Ala Leu Ile Thr Pro Ile Ile 
        355                 360                 365             


Leu Ala Val Thr Ser Leu Gly Phe Phe Ser Phe Phe Phe Leu Lys Lys 
    370                 375                 380                 


Ala Ser Pro Glu Ile Phe Leu Ser Phe Ser Gly Val Thr Pro Leu Val 
385                 390                 395                 400 


Leu Val Val Phe Phe Gly Thr Ala Gln Asn Ile Leu Ser Arg Gly Ala 
                405                 410                 415     


Lys Tyr Ser Val Phe Asp Ala Thr Lys Glu Met Ser Phe Val Pro Leu 
            420                 425                 430         


Asn Pro Glu Ser Lys Leu Val Gly Lys Ala Ala Ile Asp Gly Val Cys 
        435                 440                 445             


Ser Arg Leu Gly Lys Ser Gly Gly Ser Val Val His Gln Ser Leu Leu 
    450                 455                 460                 


Leu Leu Phe Ser Thr Ile Asn Ala Ser Ala Pro Tyr Val Ala Ile Val 
465                 470                 475                 480 


Leu Phe Ala Val Ile Leu Val Trp Ala Met Ala Ile Arg Val Leu Gly 
                485                 490                 495     


Lys Gln Phe Asn Glu Leu Thr Ser Gln Val Glu Asn Asn Glu Thr Ser 
            500                 505                 510         


Gly Thr Leu Met Thr Pro Ile Arg Ala Val Asn Ile Leu Ser Asp Thr 
        515                 520                 525             


Ile Leu Lys Glu Gln Lys Ala Val 
    530                 535     


<210>  6
<211>  1611
<212>  DNA
<213>  Protochlamydia amoebophila

<400>  6
atgtcacaga caccaacagg gtcccgtgaa tttagtccat ggcggagcaa tctttggccc       60

gttcatcgct atgagcttaa aaaactcatc ccaatgttgt taatattctt ttttatttct      120

tttgattaca acatattacg tactttaaaa gactcactac ttataactgc aaaatcttca      180

ggtgctgagg tcattccttt tgtaaaggtt tgggctatgt tccctggagc tattttaatg      240

acccttttgt tcacttggtt gtctaatcgc ctgtcaagag aaatcgtttt ttaccttatc      300

acttctcttt ttttatctta tttttttatt ttcactttta ttctctatcc tattcgagat      360

attatccatc ctcacgcaac tgctgactat cttgaaacaa ttttaccgat tggatttaaa      420

gggctagttg cgatgtttcg ttactggact tttactattt tctatgtgat gtcagaactt      480

tggggaagta ctgttttatt tgtcttattt tggggttttg ctaatcaagt gactaaaatt      540

agtgaggcaa aaagatttta cggtctgttt ggggtaggtg ctaatctttc gggtattttc      600

gcaggacaag cttctgtgta ctgttgtcaa tttaataagc agaacgattt gggaatcctt      660

ggtagtgatc catggtatca atcattagtg atgatggttt ctttaatttt attatcgggt      720

gctttagttt tagctttatt tcgttggatg aatgtagaag tcttaaccga taaacgtttt      780

tatgatcctt cttcggttaa aacagaagga gaagctaaag gtaagctttc tctaaagcaa      840

agcttttcct atcttcttcg ctctaattac ttactttgta ttgctcttat tgttatttct      900

tataacctag ttattaacct cacagaagtt ttatggaaac atcaagtccg agagctatat      960

cctgatccta atgattatac tttatatatg aatcatatcg tatccattat tggggtagta     1020

gcgaccttaa gttccctttt cgtatcagga aatgcgattc gcaaatttgg gtggaccact     1080

actgctttaa ttacacctat cattttagct gtaacaagtt tgggcttttt ctcctttttc     1140

ttccttaaaa aggcatctcc cgaaattttc ttatcttttt ccggagtaac tcctttggtt     1200

ttagtggttt tctttggaac tgctcaaaac atattgagtc gaggagctaa atactctgta     1260

tttgatgcca ctaaagaaat gagttttgtt cctttaaatc ctgaatccaa actcgttgga     1320

aaagcggcga ttgatggagt ttgttctcgc ctcggaaaat cgggtggatc tgtggttcat     1380

cagagccttc tacttttgtt ttctacaatt aatgcaagtg ccccttatgt agctatcgtc     1440

ttgttcgccg taattctagt ctgggcaatg gcaattcgcg ttttaggtaa acaatttaat     1500

gaattgacaa gtcaggtaga aaacaatgaa acttctggga cattgatgac tcctattcga     1560

gctgttaata ttctttcaga cacaattttg aaagaacaga aagctgtata a              1611


<210>  7
<211>  489
<212>  PRT
<213>  Protochlamydia amoebophila

<400>  7

Met Lys Asn Gln Gln Asn Ser Val Ser Ser Thr Leu Leu Ile Leu Lys 
1               5                   10                  15      


Lys Arg Ser Leu Ile Leu Phe Gln Phe Phe Leu Ile Ile Ile Val Tyr 
            20                  25                  30          


His Thr Leu Lys Asp Leu Lys Asp Thr Ile Val Ile Thr Ala Ser Asp 
        35                  40                  45              


Ala Gly Ala Glu Ile Ile Pro Phe Ile Lys Ile Trp Gly Met Leu Pro 
    50                  55                  60                  


Leu Ala Ile Cys Ala Ser Tyr Phe Phe Ala Lys Phe Tyr Asn Lys Phe 
65                  70                  75                  80  


Gly Arg Glu Lys Thr Phe Tyr Ile Phe Ser Ser Phe Leu Leu Val Asn 
                85                  90                  95      


Tyr Leu Phe Phe Ala Phe Val Leu Tyr Pro Phe Arg Lys Phe Phe Tyr 
            100                 105                 110         


Leu Glu Asn Val Ala Asp Tyr Leu His Met Ile Leu Pro Val Gly Ala 
        115                 120                 125             


Lys Gly Phe Val Ala Met Val Ser Tyr Trp His Tyr Thr Leu Phe Tyr 
    130                 135                 140                 


Leu Thr Ala Glu Leu Trp Ser Met Leu Ile Leu Ser Ile Leu Phe Trp 
145                 150                 155                 160 


Gly Tyr Val Ser Asp Thr Thr Ser Leu Val Glu Ala Lys Lys Phe Tyr 
                165                 170                 175     


Pro Leu Cys Met Phe Val Gly Asn Met Ala Gly Ile Ile Ser Gly Gln 
            180                 185                 190         


Leu Ser His Phe Leu Cys Gln His Leu Ser Asp Phe Met Ser Trp Glu 
        195                 200                 205             


Arg Thr Leu Gln Trp Met Ile Gly Ile Val Cys Val Cys Gly Leu Leu 
    210                 215                 220                 


Ile Met Ile Ile Asn Arg Arg Leu Ala Leu Thr Thr Asp Phe Ser Ala 
225                 230                 235                 240 


Ile Lys Gln Lys Val Lys Lys Gln Ile Ala Pro Ser Ser Phe Lys Asp 
                245                 250                 255     


Asn Val Met Asp Val Leu Arg Thr Gly Pro Leu Leu Cys Ile Ala Val 
            260                 265                 270         


Leu Val Val Gly Phe Gly Leu Thr Thr Asn Leu Ile Glu Val Ile Trp 
        275                 280                 285             


Lys Glu Asn Ile Arg Gln Leu His Pro Thr Pro Gln Ala Tyr Asn Ala 
    290                 295                 300                 


Tyr Ile Asn Gln Leu Thr Ser Leu Ile Gly Thr Gly Ala Val Cys Ile 
305                 310                 315                 320 


Ala Leu Leu Ser Ser Trp Ile Phe Arg Lys Phe Thr Trp Thr Gln Ile 
                325                 330                 335     


Ala Leu Thr Thr Pro Leu Cys Leu Leu Ile Thr Ser Ser Ala Phe Phe 
            340                 345                 350         


Ser Ser Leu Leu Met Pro Lys Glu Leu Leu Ala Glu Ile Ala Ser Phe 
        355                 360                 365             


Phe Gln Phe Ser Pro Thr Gln Leu Ile Val Thr Leu Gly Ser Ile Cys 
    370                 375                 380                 


Tyr Val Phe Ser Met Ser Ala Lys Tyr Thr Ile Phe Asp Thr Ser Lys 
385                 390                 395                 400 


Glu Ile Ala Phe Leu Ser Ile Glu Thr Glu Lys Arg Thr Tyr Ala Lys 
                405                 410                 415     


Ser Val Ile Asp Ser Ile Gly Ser Arg Leu Gly Lys Ser Gly Ala Ser 
            420                 425                 430         


Cys Phe Tyr Gln Phe Leu Leu Ile Ala Phe Gly Ile Ala Ser Glu His 
        435                 440                 445             


Ile Leu Leu Ile Gly Val Val Ser Ile Ile Met Ile Gly Ile Ser Ile 
    450                 455                 460                 


Phe Ala Thr Lys Lys Leu Gly Gly Gln Leu Ser Gly Lys Asn Glu Asn 
465                 470                 475                 480 


His Arg Phe Ile Glu Ala Ser His Gly 
                485                 


<210>  8
<211>  1470
<212>  DNA
<213>  Protochlamydia amoebophila

<400>  8
atgaaaaatc aacaaaattc tgtatcttct accttactaa tcttaaaaaa gcgtagcttg       60

atcctatttc aattttttct aattatcatt gtttatcata cattaaaaga cctcaaagat      120

acgattgtta tcacagcaag tgatgcaggt gcagagatca ttccttttat taaaatttgg      180

ggaatgcttc ctcttgccat ttgtgctagt tatttttttg ctaaatttta taataaattt      240

ggaagagaaa aaacatttta tatttttagc tctttcttac tagttaacta tcttttcttt      300

gcttttgtat tatatccatt ccgcaagttt ttttatttag aaaatgttgc agattattta      360

catatgattt tacctgttgg agcgaaaggg tttgttgcca tggtaagcta ttggcattac      420

actctatttt atttaacggc agaattatgg tcgatgctca ttctatctat ccttttttgg      480

ggttatgtga gtgatacgac ttctttagta gaagccaaaa aattttaccc cctctgtatg      540

ttcgttggaa atatggcagg aattatttct ggtcagctct ctcatttctt atgtcaacat      600

ttgtctgatt tcatgtcatg ggaaagaacc ctgcaatgga tgattggtat tgtctgtgtt      660

tgcggccttt taattatgat tattaataga cggctggctc ttacaactga tttttcggca      720

attaaacaaa aagtaaaaaa acaaatagct ccctcttctt tcaaagataa tgttatggat      780

gttttaagaa caggtccctt actttgtata gctgtattgg tagtggggtt tggactgaca      840

acgaatctaa ttgaagttat ttggaaagaa aatattaggc aactacaccc gacacctcaa      900

gcctacaatg cttatattaa tcaattgact tctttaattg ggactggtgc tgtttgtata      960

gccttgttat caagctggat ttttagaaag tttacttgga cgcaaattgc cctcacaacc     1020

cctttatgtt tattaatcac aagctctgct tttttttcat cgcttcttat gcctaaagag     1080

ctgttagcgg aaattgcttc tttttttcag ttttccccaa ctcaattgat agtgacacta     1140

ggatctattt gctatgtttt tagcatgtct gcgaagtaca caatttttga tactagtaaa     1200

gaaatagctt ttctttctat tgaaacagaa aaaagaacgt atgctaaatc tgtaattgat     1260

agcattggct ctcgtttggg aaaatctggc gcttcttgtt tttatcaatt tcttcttatt     1320

gcctttggaa ttgcttccga acatatttta ttaattggag ttgtatccat tataatgatt     1380

ggaatttcga tttttgctac gaaaaaattg ggtgggcagc tgtctggtaa aaatgaaaac     1440

catcgcttta tagaagcttc ccatggataa                                      1470


<210>  9
<211>  632
<212>  PRT
<213>  Thalassiosira pseudonana

<400>  9

Met Lys Thr Ser Cys Thr Ile Gln Arg Arg Val Lys Ser Ile Ser Ser 
1               5                   10                  15      


Lys His Ser Ile Ile Asp Thr His His Ser Thr Ser Arg Arg Leu Ser 
            20                  25                  30          


Val Ile Leu Leu Phe Phe Leu Leu His Ser Ser Ala Glu Met Leu Phe 
        35                  40                  45              


Ala Ser Ala Thr Gly Asn His Asn Ala Asn Thr Ser Pro Pro Pro Ala 
    50                  55                  60                  


Asn Ile Pro Met Ile Ser Thr Asn Asn Lys Ser Cys Met Met Arg Arg 
65                  70                  75                  80  


Thr Arg Ser Gln Ser Arg Arg Asp Ser Ser Arg Ser Pro Asp Ser Val 
                85                  90                  95      


Ala Ser Ala Asn Val Val Gly Arg Gly Gly Asp Gly Gly Thr Ile Met 
            100                 105                 110         


Gly Ala Lys Ser Val Phe Gln Thr Ala Ser Lys Ala Leu Pro Pro Asn 
        115                 120                 125             


Thr Val Ser Ser Thr Ala Ser Gly Ser Val Ser Lys Ala Ser Arg Leu 
    130                 135                 140                 


Arg Thr Val Leu Phe Pro Ile Gln Asn Asp Glu Met Lys Lys Phe Leu 
145                 150                 155                 160 


Leu Ile Gly Ser Ile Lys Phe Phe Val Ile Leu Ala Leu Thr Leu Thr 
                165                 170                 175     


Arg Asp Asn Lys Asp Thr Met Val Val Thr Glu Cys Gly Ala Glu Ala 
            180                 185                 190         


Ile Ala Phe Leu Lys Ile Tyr Gly Val Leu Pro Ser Ala Thr Leu Phe 
        195                 200                 205             


Ile Ala Leu Tyr Ser Lys Met Ala Thr Ile Phe Asp Lys Lys Thr Leu 
    210                 215                 220                 


Phe Tyr Ala Thr Cys Ile Pro Phe Phe Ala Phe Phe Phe Leu Phe Asp 
225                 230                 235                 240 


Ala Ile Ile Tyr Pro Asn Arg Asn Val Ile Gln Pro Ser Leu Glu Ser 
                245                 250                 255     


Val Gln Arg Val Met Arg Ile Thr Ala Asp Ser Ser Gly Ala Met Ser 
            260                 265                 270         


Ile Phe Ala Lys Leu Phe Ala Asn Trp Thr Ser Ala Leu Phe Tyr Ile 
        275                 280                 285             


Val Ala Glu Val Tyr Ser Ser Val Ser Val Gly Ile Leu Phe Trp Gln 
    290                 295                 300                 


Tyr Ala Asn Asp Val Val Ser Val Ser Gln Ala Lys Arg Phe Tyr Pro 
305                 310                 315                 320 


Leu Phe Ala Gln Met Ser Gly Leu Ala Pro Ile Val Ala Gly Gln Tyr 
                325                 330                 335     


Val Val Arg Tyr Ala Ser Arg Ala Asn Asp Phe Glu Glu Ser Leu His 
            340                 345                 350         


Arg Leu Thr Trp Met Val Ser Phe Ser Gly Val Met Ile Cys Leu Phe 
        355                 360                 365             


Tyr Lys Trp Ser Asn Glu Tyr Asn Asp Gln Thr Ser Gly Gly Leu Asn 
    370                 375                 380                 


Gly Gly Ile Glu Asp Gly Val Lys Glu Thr Lys Val Val Lys Lys Lys 
385                 390                 395                 400 


Lys Ala Lys Met Ser Met Arg Asp Ser Ala Lys Phe Leu Ala Ser Ser 
                405                 410                 415     


Glu Tyr Leu Arg Leu Ile Ala Ala Leu Val Val Gly Tyr Gly Leu Ser 
            420                 425                 430         


Ile Asn Phe Thr Asp Ile Met Trp Lys Ser Ile Val Lys Arg Gln Tyr 
        435                 440                 445             


Pro Asp Pro Leu Asp Tyr Gln Arg Phe Met Gly Asn Phe Ser Ser Val 
    450                 455                 460                 


Val Gly Leu Ser Thr Cys Ile Val Ile Phe Leu Gly Val His Ala Ile 
465                 470                 475                 480 


Arg Ile Leu Gly Trp Arg Met Gly Ala Leu Ala Thr Pro Ala Val Met 
                485                 490                 495     


Ala Ile Leu Ala Phe Pro Tyr Phe Ser Ser Ile Leu Val Gly Leu Asp 
            500                 505                 510         


Ser Pro Gly Ser Leu Arg Ile Ala Val Ile Phe Gly Thr Ile Gln Cys 
        515                 520                 525             


Leu Leu Ser Lys Thr Ala Lys Tyr Ala Leu Phe Asp Pro Thr Thr Gln 
    530                 535                 540                 


Met Ala Tyr Ile Pro Leu Asp Asp Glu Ser Lys Ile Lys Gly Lys Ala 
545                 550                 555                 560 


Ala Ile Glu Val Leu Gly Ser Arg Ile Gly Lys Ser Gly Gly Ser Leu 
                565                 570                 575     


Ile Gln Gln Gly Leu Val Leu Val Phe Gly Asn Ile Ile Asn Ala Ala 
            580                 585                 590         


Pro Ala Leu Val Val Leu Tyr Tyr Ser Val Leu Ala Trp Trp Val Tyr 
        595                 600                 605             


Ser Ala Asn Arg Leu Gly Ser Leu Phe Leu Ala Lys Thr Ala Met Gln 
    610                 615                 620                 


Glu Glu Thr Lys Glu His Gln Lys 
625                 630         


<210>  10
<211>  1899
<212>  DNA
<213>  Thalassiosira pseudonana

<400>  10
atgaaaacat cttgtacaat ccaacgtcgt gtcaaatcca tctcatccaa acacagtatc       60

atcgacacac accactctac ttctcgccgt ttaagtgtca tcctactctt ctttctacta      120

cactcctcgg cagagatgct atttgcttcc gccacgggca atcacaacgc caatacatca      180

ccaccacctg cgaatattcc catgattagc actaacaaca aatcatgtat gatgcgacga      240

accaggagtc aatcacgacg agatagcagc cgttcgcctg attcggtggc ctcggccaat      300

gttgttggga ggggcggcga tgggggtacc attatgggtg ccaagagtgt cttccagact      360

gcttcgaaag cattacctcc caacactgtg tcgtccacag caagcggcag tgtatccaaa      420

gcatcgcgcc tacgaacggt cctcttcccc attcaaaatg acgagatgaa gaagtttctc      480

ttgattggaa gtatcaagtt ctttgtaatt ctagcgttga cactcacgag agataataag      540

gatacaatgg tggttaccga gtgtggagct gaggccatcg cttttctaaa gatctacgga      600

gtactaccat ccgccacact cttcatagca ctctactcga aaatggccac tatctttgac      660

aaaaagacct tattctacgc cacgtgcatt ccattctttg cattcttctt cttattcgat      720

gcaatcatct atcctaaccg gaatgtcatt cagccttcct tagagagtgt tcagcgtgtc      780

atgagaatca cagccgattc atcgggtgcc atgtccatct ttgcaaagtt gttcgccaat      840

tggacgtcgg ccttgtttta tattgtagca gaggtatact cgtctgtttc agtggggata      900

ttgttctggc agtatgccaa tgatgtggtg tctgtctcgc aagcaaaacg attttaccca      960

ctctttgcac agatgagtgg acttgccccc attgtggctg gacagtatgt ggtacgatat     1020

gctagtagag ccaatgactt tgaagaatca ttgcataggt tgacgtggat ggtatccttt     1080

tcgggagtga tgatttgtct gttttacaag tggagcaatg agtacaatga tcagacgtct     1140

ggagggttaa atgggggaat tgaggatgga gtaaaagaga cgaaggtggt gaagaaaaag     1200

aaagccaaaa tgtcaatgag ggattcagcc aagtttttgg cttcatccga gtatttgaga     1260

ctgattgctg ctttggttgt gggatatggt ctgtcgatca actttacaga tataatgtgg     1320

aaatcaatcg tcaagagaca atatcccgat cctctcgact atcaacgttt catggggaac     1380

ttttcatcag tagttggatt gtctacgtgc atcgttatct ttctcggtgt acatgctatt     1440

cgtatactag gctggcgaat gggtgcccta gcgactccag ccgtcatggc aatcttggca     1500

ttcccttact tctcgagcat tctcgttggg ttggacagtc caggtagttt acgaattgca     1560

gtgatctttg gtactattca atgcctgctt agtaagacag caaagtatgc cctgttcgat     1620

ccgacaactc aaatggccta cattcctttg gatgacgaat caaagatcaa gggaaaggca     1680

gcaatagaag tacttggttc tcggattgga aaaagtggtg gttcgttgat acaacaaggt     1740

cttgtgttgg tgtttgggaa cattatcaat gctgctcccg cgttggttgt tctttactac     1800

tcagtgttgg cgtggtgggt gtactcagca aatcggctcg gatcattgtt cttggcaaag     1860

acagctatgc aagaggaaac aaaagagcac cagaagtag                            1899


<210>  11
<211>  517
<212>  PRT
<213>  Simkania negevensis

<400>  11

Met Ser Ser Thr Glu Tyr Glu Lys Ser Thr Trp Thr Gln Lys Ile Trp 
1               5                   10                  15      


Pro Ile Arg Arg Phe Glu Leu Lys Lys Val Leu Pro Leu Leu Ile Leu 
            20                  25                  30          


Lys Phe Leu Val Ser Met Val Tyr Ala Thr Leu Thr Leu Ile Lys Asp 
        35                  40                  45              


Pro Leu Val Val Thr Ala Lys His Ser Gly Ala Glu Val Ile Pro Val 
    50                  55                  60                  


Leu Lys Gly Trp Ile Val Phe Pro Leu Ser Ile Leu Cys Ala Ile Gly 
65                  70                  75                  80  


Tyr Ser Lys Leu Ser Asn His Phe Lys Arg Ser Thr Leu Phe Tyr Gly 
                85                  90                  95      


Ile Ile Thr Ala Phe Leu Ala Ile Val Leu Ile Tyr Gly Phe Val Leu 
            100                 105                 110         


Tyr Pro Asn Met Gly Ile Leu Thr Pro Ser Asp Ser Ala Asn Leu Leu 
        115                 120                 125             


Thr Ala Lys Phe Gly Glu Lys Tyr Thr His Trp Ile Ala Val Tyr Arg 
    130                 135                 140                 


Asn Trp Ile His Ser Leu Phe Phe Val Thr Thr Glu Leu Trp Gly Gln 
145                 150                 155                 160 


Val Val Ile Phe Leu Leu Tyr Trp Gly Phe Ala Asn His Ile Cys Gln 
                165                 170                 175     


Val Lys Glu Ala Lys Arg Ser Tyr Thr Leu Phe Ile Ala Ala Gly Asp 
            180                 185                 190         


Leu Ala Thr Ile Leu Ala Gly Pro Leu Thr Tyr Tyr Tyr Gly Lys Lys 
        195                 200                 205             


Phe Leu Gly Gln Ser Tyr Ala Leu Thr Leu Gln Ser Leu Leu Gly Tyr 
    210                 215                 220                 


Val Leu Val Cys Gly Leu Leu Ile Met Ala Val Tyr Trp Trp Met Asn 
225                 230                 235                 240 


Arg Tyr Val Leu Thr Asp Lys Arg Tyr Tyr Asp Pro Ser Val Thr Lys 
                245                 250                 255     


Gln Thr Val Asn Gln Lys Thr Lys Leu Ser Leu Arg Asp Ser Ile Arg 
            260                 265                 270         


His Ile Phe Ser Ser Lys Tyr Leu Leu Ala Ile Ala Val Leu Val Val 
        275                 280                 285             


Gly Cys Ala Leu Thr Ile Asn Met Val Glu Val Thr Trp Lys Ala His 
    290                 295                 300                 


Leu Lys Met Gln Tyr Pro Thr Thr Ala Asp Tyr Gln Met Phe Met Gly 
305                 310                 315                 320 


Arg Val Thr Thr Ile Val Gly Val Val Ala Leu Ile Thr Val Phe Phe 
                325                 330                 335     


Leu Gly Gly Asn Phe Leu Arg Arg Phe Gly Trp His Phe Ser Ala Gln 
            340                 345                 350         


Ile Thr Pro Trp Ala Ile Gly Ile Thr Gly Gly Val Phe Phe Leu Leu 
        355                 360                 365             


Cys Leu Leu Lys Pro Tyr Leu Gly Ser Phe Ala His Tyr Val Gly Leu 
    370                 375                 380                 


Thr Pro Leu Met Met Ile Val Ile Phe Gly Ala Phe Gln Asn Ile Thr 
385                 390                 395                 400 


Ser Lys Val Val Lys Tyr Ser Phe Phe Asp Ser Thr Lys Glu Met Ala 
                405                 410                 415     


Tyr Ile Pro Leu Asp Pro Glu Ser Lys Val Lys Gly Lys Ala Ala Ile 
            420                 425                 430         


Asp Met Val Gly Ser Arg Leu Gly Lys Ser Ser Ser Ser Trp Leu Gln 
        435                 440                 445             


Ile Gly Leu Ile Glu Leu Val Gly Thr Gly Ser Val Ile Ser Ile Thr 
    450                 455                 460                 


Pro Tyr Leu Leu Pro Ile Val Leu Gly Ala Ala Leu Tyr Trp Ser Tyr 
465                 470                 475                 480 


Ser Val Arg Tyr Leu Asn Lys Glu Leu Ser Val Arg Glu Glu Thr Leu 
                485                 490                 495     


Leu Glu Glu Glu Glu Ala Lys Lys Arg Ala Gly Glu Leu Gln Pro Glu 
            500                 505                 510         


Pro Glu Pro Ala Thr 
        515         


<210>  12
<211>  1554
<212>  DNA
<213>  Simkania negevensis

<400>  12
atgagtagta ccgaatatga gaagtccaca tggactcaaa aaatctggcc aataaggcgc       60

tttgaactta agaaagtcct tcctctttta atccttaaat ttctagtctc tatggtttat      120

gccactctca ccttaatcaa ggatcccctt gtggtgacgg caaaacattc tggagcagaa      180

gtcattccag ttctaaaagg ttggattgtt ttccccttat cgattctttg tgctattggt      240

tactcaaagt taagcaacca cttcaaacgt tccaccctct tttacggaat cattacagct      300

ttcctagcta ttgttcttat ctacggcttc gttttgtatc ccaatatggg aattctcaca      360

ccaagcgact ctgcaaactt gttaacagct aaatttgggg aaaaatacac acactggatt      420

gcagtttatc ggaattggat ccattctctc tttttcgtca ccacagagct ttgggggcaa      480

gttgtcattt tcctcctcta ctggggattt gccaaccaca tttgccaagt gaaagaagct      540

aaaagatctt acactctttt catcgctgca ggcgatttag caacgatctt ggctggtcca      600

cttacctatt actacggaaa aaagtttcta ggacaaagct atgctctcac tcttcaatcc      660

ctactaggat atgtcttagt ctgcgggcta ctcatcatgg cagtctattg gtggatgaat      720

cgatatgtcc taacagacaa acggtactac gatccatcag tgacgaagca aacagtcaac      780

caaaagacca aactctctct gcgtgatagt atccggcata tcttttcatc aaagtatctc      840

cttgctattg cggtcctcgt tgtcggttgc gctctcacca tcaacatggt agaagtcacc      900

tggaaagctc acttaaagat gcaataccca acaactgctg attaccaaat gttcatgggg      960

cgagtcacaa ctattgttgg agttgttgcc ctcatcactg tattcttctt aggaggaaac     1020

ttcctgagac ggtttggatg gcacttcagt gctcaaatca ccccatgggc gattggaatc     1080

acaggtggtg ttttcttttt actctgcctt ttgaagccct atctcgggtc tttcgctcat     1140

tatgttggac tcacccctct tatgatgatt gtcatctttg gagccttcca aaatatcact     1200

agtaaagtcg tcaaatactc gttctttgat tcgacgaaag aaatggctta tattccacta     1260

gaccctgaat ctaaagtgaa aggaaaagca gccatcgaca tggtcggttc aagattgggt     1320

aagtcgagct cctcctggct acaaattggc ttgattgaac tagttgggac tggttcggtg     1380

atctcaatca ctccttatct actgcctatc gttctaggtg ccgccctcta ttggagctac     1440

tctgtacgct acctcaataa agagctttct gtgcgtgaag aaacactcct cgaggaagaa     1500

gaagctaaga aaagagcggg agagcttcag cctgaacctg agcctgccac ttga           1554


<210>  13
<211>  517
<212>  PRT
<213>  Simkania negevensis

<400>  13

Met Ser Ser Thr Glu Tyr Glu Lys Ser Thr Trp Thr Gln Lys Ile Trp 
1               5                   10                  15      


Pro Ile Arg Arg Phe Glu Leu Lys Lys Val Leu Pro Leu Leu Ile Leu 
            20                  25                  30          


Lys Phe Leu Val Ser Met Val Tyr Ala Thr Leu Thr Leu Ile Lys Asp 
        35                  40                  45              


Pro Leu Val Val Thr Ala Lys His Ser Gly Ala Glu Val Ile Pro Val 
    50                  55                  60                  


Leu Lys Gly Trp Ile Val Phe Pro Leu Ser Ile Leu Cys Ala Ile Gly 
65                  70                  75                  80  


Tyr Ser Lys Leu Ser Asn His Phe Lys Arg Ser Thr Leu Phe Tyr Gly 
                85                  90                  95      


Ile Ile Thr Ala Phe Leu Ala Ile Val Leu Ile Tyr Gly Phe Val Leu 
            100                 105                 110         


Tyr Pro Asn Met Gly Ile Leu Thr Pro Ser Asp Ser Ala Asn Leu Leu 
        115                 120                 125             


Thr Ala Lys Phe Gly Glu Lys Tyr Thr His Trp Ile Ala Val Tyr Arg 
    130                 135                 140                 


Asn Trp Ile His Ser Leu Phe Phe Val Thr Thr Glu Leu Trp Gly Gln 
145                 150                 155                 160 


Val Val Ile Phe Leu Leu Tyr Trp Gly Phe Ala Asn His Ile Cys Gln 
                165                 170                 175     


Val Lys Glu Ala Lys Arg Ser Tyr Thr Leu Phe Ile Ala Ala Gly Asp 
            180                 185                 190         


Leu Ala Thr Ile Leu Ala Gly Pro Leu Thr Tyr Tyr Tyr Gly Lys Lys 
        195                 200                 205             


Phe Leu Gly Gln Ser Tyr Ala Leu Thr Leu Gln Ser Leu Leu Gly Tyr 
    210                 215                 220                 


Val Leu Val Cys Gly Leu Leu Ile Met Ala Val Tyr Trp Trp Met Asn 
225                 230                 235                 240 


Arg Tyr Val Leu Thr Asp Lys Trp Tyr Tyr Asp Pro Ser Val Thr Lys 
                245                 250                 255     


Gln Thr Val Asn Gln Lys Thr Lys Leu Ser Leu Arg Asp Ser Ile Arg 
            260                 265                 270         


His Ile Phe Ser Ser Lys Tyr Leu Leu Ala Ile Ala Val Leu Val Val 
        275                 280                 285             


Gly Cys Ala Leu Thr Ile Asn Met Val Glu Val Thr Trp Lys Ala His 
    290                 295                 300                 


Leu Lys Met Gln Tyr Pro Thr Thr Ala Asp Tyr Gln Met Phe Met Gly 
305                 310                 315                 320 


Arg Val Thr Thr Ile Val Gly Val Val Ala Leu Ile Thr Val Phe Phe 
                325                 330                 335     


Leu Gly Gly Asn Phe Leu Arg Arg Phe Gly Trp His Phe Ser Ala Gln 
            340                 345                 350         


Ile Thr Pro Trp Ala Ile Gly Ile Thr Gly Gly Val Phe Phe Leu Leu 
        355                 360                 365             


Cys Leu Leu Lys Pro Tyr Leu Gly Ser Phe Ala His Tyr Val Gly Leu 
    370                 375                 380                 


Thr Pro Leu Met Met Ile Val Ile Phe Gly Ala Phe Gln Asn Ile Thr 
385                 390                 395                 400 


Ser Lys Val Val Lys Tyr Ser Phe Phe Asp Ser Thr Lys Glu Met Ala 
                405                 410                 415     


Tyr Ile Pro Leu Asp Pro Glu Ser Lys Val Lys Gly Lys Ala Ala Ile 
            420                 425                 430         


Asp Met Val Gly Ser Arg Leu Gly Lys Ser Ser Ser Ser Trp Leu Gln 
        435                 440                 445             


Ile Gly Leu Ile Glu Leu Val Gly Thr Gly Ser Val Ile Ser Ile Thr 
    450                 455                 460                 


Pro Tyr Leu Leu Pro Ile Val Leu Gly Ala Ala Leu Tyr Trp Ser Tyr 
465                 470                 475                 480 


Ser Val Arg Tyr Leu Asn Lys Glu Leu Ser Val Arg Glu Glu Thr Leu 
                485                 490                 495     


Leu Glu Glu Glu Glu Ala Lys Lys Arg Ala Gly Glu Leu Gln Pro Glu 
            500                 505                 510         


Pro Glu Pro Ala Thr 
        515         


<210>  14
<211>  526
<212>  PRT
<213>  Simkania negevensis

<400>  14

Met Ser Thr Gln Thr Asp Val Ser Phe Ser Lys Trp Arg Ser Phe Leu 
1               5                   10                  15      


Trp Pro Ile Gln Gly Arg Glu Ile Lys Lys Phe Leu Pro Leu Leu Leu 
            20                  25                  30          


Ile Tyr Ala Leu Ile Cys Leu Asn Tyr Ser Val Leu Lys Val Ala Lys 
        35                  40                  45              


Asp Thr Leu Val Ile Thr Ala Pro Gly Ser Gly Ala Glu Ala Ile Pro 
    50                  55                  60                  


Phe Ile Lys Val Trp Val Ile Leu Pro Met Ala Leu Leu Val Thr Tyr 
65                  70                  75                  80  


Leu Phe Thr Arg Leu Phe Asn Arg Phe Ser Gln Glu Gln Val Phe Tyr 
                85                  90                  95      


Ile Met Ile Gly Ser Phe Ile Ser Phe Phe Ala Leu Phe Ala Phe Val 
            100                 105                 110         


Leu Tyr Pro Leu Arg Asp Phe Leu His Pro His Asp Thr Ala Asp Lys 
        115                 120                 125             


Leu Gln Ala Met Leu Pro Gln Gly Phe Gln Gly Leu Ile Ala Ile Phe 
    130                 135                 140                 


Arg Asn Trp Ser Tyr Thr Leu Phe Tyr Val Met Ser Glu Leu Trp Gly 
145                 150                 155                 160 


Thr Ala Ile Met Ser Val Leu Phe Trp Gly Phe Thr Asn Glu Ile Ile 
                165                 170                 175     


Ser Val Gly Glu Ala Lys Arg Tyr Tyr Gly Ile Leu Ser Val Gly Ala 
            180                 185                 190         


Asn Ile Ala Thr Ile Phe Ser Gly Tyr Ile Thr Thr Phe Leu Ser Leu 
        195                 200                 205             


Gln Val Ile Asp Met Ser Phe Ile Phe Gly Pro Asp Arg Trp Gly Gln 
    210                 215                 220                 


Ser Leu Gly Leu Val Thr Cys Val Val Val Ala Ala Gly Leu Leu Ile 
225                 230                 235                 240 


Met Ala Leu Phe Arg Trp Tyr Asn Lys Arg Val Ile Asn Arg Asp Ala 
                245                 250                 255     


Val Leu Leu Lys Met Lys Gln Asp His Thr Glu Thr Lys Lys Thr Met 
            260                 265                 270         


Lys Met Gly Met Arg Lys Asn Phe Ala Tyr Leu Ala Lys Ser Lys Tyr 
        275                 280                 285             


Leu Ile Cys Ile Ala Val Leu Val Val Ala Phe Asn Val Gly Ile Asn 
    290                 295                 300                 


Met Val Glu Ile Ile Trp Lys Asp Gln Ile Lys Glu Leu Tyr Pro Asn 
305                 310                 315                 320 


Pro Asn Asp Phe Ile Val Tyr Met Gly Lys Val Met Ser Ala Ile Gly 
                325                 330                 335     


Trp Val Ala Thr Phe Val Gly Leu Phe Leu Ser Ser Asn Leu Ile Arg 
            340                 345                 350         


Arg Leu Gly Trp Thr Val Ser Ala Leu Ile Thr Pro Val Ala Leu Leu 
        355                 360                 365             


Val Thr Gly Val Phe Phe Phe Gly Phe Ile Leu Phe Lys Asn Asn Pro 
    370                 375                 380                 


Thr Leu Val Gly Trp Thr Ala Ala Ile Gly Phe Thr Pro Leu Ala Leu 
385                 390                 395                 400 


Gly Val Leu Phe Gly Thr Ile Gln Asn Val Met Ser Arg Ala Cys Lys 
                405                 410                 415     


Tyr Thr Leu Phe Asp Ser Thr Lys Glu Ile Ala Phe Ile Pro Leu Ser 
            420                 425                 430         


Pro Glu Ser Lys Leu Lys Gly Lys Ala Ala Ile Asp Gly Val Gly Ser 
        435                 440                 445             


Arg Val Gly Lys Ser Gly Gly Ser Ile Val His Gly Gly Leu Leu Met 
    450                 455                 460                 


Leu Phe Gly Ser Val Ser Leu Ser Ala Pro Tyr Val Gly Leu Ile Leu 
465                 470                 475                 480 


Leu Ala Val Val Phe Gly Trp Ile Gly Ala Ala Arg Ser Leu Gly Arg 
                485                 490                 495     


Gln Phe Asn Leu Leu Thr Thr His His Glu Lys Leu Glu Ile Asn Glu 
            500                 505                 510         


Glu Ala Gln Pro Ser Glu Lys Lys Pro Leu Leu Glu Ser Val 
        515                 520                 525     


<210>  15
<211>  1581
<212>  DNA
<213>  Simkania negevensis

<400>  15
atgtcaacac agactgatgt gagtttcagt aaatggcgct catttttgtg gccaattcaa       60

ggaagagaaa ttaaaaaatt tcttcctctt ctcctgattt acgctctcat ttgtcttaac      120

tatagcgtct taaaagtcgc aaaagacaca cttgtcatta cagcccctgg atcaggcgca      180

gaagcaatcc cgtttatcaa ggtctgggtc attctcccca tggcactcct cgtaacttat      240

ctctttactc gcctcttcaa tcgatttagc caagaacaag tgttttacat catgatcggg      300

agcttcattt cgtttttcgc tctatttgca tttgtcctct accccttgcg agattttctt      360

catcctcatg acacagctga taaattacaa gccatgcttc cacagggatt ccaagggctc      420

atagccattt tccgtaactg gtcctatacc ctcttttatg tgatgtctga gctatgggga      480

accgctatta tgtctgtcct cttttgggga ttcacaaatg aaattatttc tgtaggtgag      540

gccaaaaggt attatggaat tctcagtgta ggggccaata ttgcaactat tttttcaggg      600

tacatcacca cctttctctc tttgcaagtg attgacatgt cattcatttt tggacctgac      660

cgctggggac aatcattagg tcttgtgaca tgtgttgttg ttgcagcagg tctccttatt      720

atggcccttt tcagatggta caacaagcga gtcattaatc gtgatgcagt actattaaaa      780

atgaaacaag accacacaga aacgaagaag accatgaaaa tgggaatgcg taagaatttt      840

gcttaccttg caaaatcaaa gtatttaatt tgcattgcag ttctggttgt tgcattcaat      900

gttggaatca acatggtcga aattatctgg aaagatcaaa tcaaagaact gtatcccaat      960

cccaacgatt ttattgttta tatggggaag gtcatgagtg caattggttg ggttgcaaca     1020

tttgtcggac tatttctcag tagtaattta atcaggcgct taggatggac tgtcagcgcc     1080

ttaatcactc ctgttgctct cctcgtaaca ggtgttttct ttttcggatt cattctcttt     1140

aaaaacaacc ctacattagt gggttggaca gccgccatag gatttacacc tcttgcacta     1200

ggggttctct ttgggacaat ccaaaatgtg atgtctcgag catgtaaata caccttattt     1260

gactctacaa aagaaatagc gtttatcccc cttagccctg agtccaagct aaaaggaaaa     1320

gctgcaattg atggagtagg ctctcgcgtt ggaaagtccg gagggtcgat tgttcatggt     1380

ggactactga tgctcttcgg ctccgtttct ctcagcgcac cttacgtcgg cttgatctta     1440

ctcgccgttg ttttcggttg gattggtgca gctcgttcac tcgggagaca atttaatctc     1500

cttacgacgc atcatgaaaa actcgagatt aacgaggagg cacagccctc cgaaaaaaag     1560

cccttacttg aatccgttta a                                               1581


<210>  16
<211>  498
<212>  PRT
<213>  Rickettsia prowazekii

<400>  16

Met Ser Thr Ser Lys Ser Glu Asn Tyr Leu Ser Glu Leu Arg Lys Ile 
1               5                   10                  15      


Ile Trp Pro Ile Glu Gln Tyr Glu Asn Lys Lys Phe Leu Pro Leu Ala 
            20                  25                  30          


Phe Met Met Phe Cys Ile Leu Leu Asn Tyr Ser Thr Leu Arg Ser Ile 
        35                  40                  45              


Lys Asp Gly Phe Val Val Thr Asp Ile Gly Thr Glu Ser Ile Ser Phe 
    50                  55                  60                  


Leu Lys Thr Tyr Ile Val Leu Pro Ser Ala Val Ile Ala Met Ile Ile 
65                  70                  75                  80  


Tyr Val Lys Leu Cys Asp Ile Leu Lys Gln Glu Asn Val Phe Tyr Val 
                85                  90                  95      


Ile Thr Ser Phe Phe Leu Gly Tyr Phe Ala Leu Phe Ala Phe Val Leu 
            100                 105                 110         


Tyr Pro Tyr Pro Asp Leu Val His Pro Asp His Lys Thr Ile Glu Ser 
        115                 120                 125             


Leu Ser Leu Ala Tyr Pro Asn Phe Lys Trp Phe Ile Lys Ile Val Gly 
    130                 135                 140                 


Lys Trp Ser Phe Ala Ser Phe Tyr Thr Ile Ala Glu Leu Trp Gly Thr 
145                 150                 155                 160 


Met Met Leu Ser Leu Leu Phe Trp Gln Phe Ala Asn Gln Ile Thr Lys 
                165                 170                 175     


Ile Ala Glu Ala Lys Arg Phe Tyr Ser Met Phe Gly Leu Leu Ala Asn 
            180                 185                 190         


Leu Ala Leu Pro Val Thr Ser Val Val Ile Gly Tyr Phe Leu His Glu 
        195                 200                 205             


Lys Thr Gln Ile Val Ala Glu His Leu Lys Phe Val Pro Leu Phe Val 
    210                 215                 220                 


Ile Met Ile Thr Ser Ser Phe Leu Ile Ile Leu Thr Tyr Arg Trp Met 
225                 230                 235                 240 


Asn Lys Asn Val Leu Thr Asp Pro Arg Leu Tyr Asp Pro Ala Leu Val 
                245                 250                 255     


Lys Glu Lys Lys Thr Lys Ala Lys Leu Ser Phe Ile Glu Ser Leu Lys 
            260                 265                 270         


Met Ile Phe Thr Ser Lys Tyr Val Gly Tyr Ile Ala Leu Leu Ile Ile 
        275                 280                 285             


Ala Tyr Gly Val Ser Val Asn Leu Val Glu Gly Val Trp Lys Ser Lys 
    290                 295                 300                 


Val Lys Glu Leu Tyr Pro Thr Lys Glu Ala Tyr Thr Ile Tyr Met Gly 
305                 310                 315                 320 


Gln Phe Gln Phe Tyr Gln Gly Trp Val Ala Ile Ala Phe Met Leu Ile 
                325                 330                 335     


Gly Ser Asn Ile Leu Arg Lys Val Ser Trp Leu Thr Ala Ala Met Ile 
            340                 345                 350         


Thr Pro Leu Met Met Phe Ile Thr Gly Ala Ala Phe Phe Ser Phe Ile 
        355                 360                 365             


Phe Phe Asp Ser Val Ile Ala Met Asn Leu Thr Gly Ile Leu Ala Ser 
    370                 375                 380                 


Ser Pro Leu Thr Leu Ala Val Met Ile Gly Met Ile Gln Asn Val Leu 
385                 390                 395                 400 


Ser Lys Gly Val Lys Tyr Ser Leu Phe Asp Ala Thr Lys Asn Met Ala 
                405                 410                 415     


Tyr Ile Pro Leu Asp Lys Asp Leu Arg Val Lys Gly Gln Ala Ala Val 
            420                 425                 430         


Glu Val Ile Gly Gly Arg Leu Gly Lys Ser Gly Gly Ala Ile Ile Gln 
        435                 440                 445             


Ser Thr Phe Phe Ile Leu Phe Pro Val Phe Gly Phe Ile Glu Ala Thr 
    450                 455                 460                 


Pro Tyr Phe Ala Ser Ile Phe Phe Ile Ile Val Ile Leu Trp Ile Phe 
465                 470                 475                 480 


Ala Val Lys Gly Leu Asn Lys Glu Tyr Gln Val Leu Val Asn Lys Asn 
                485                 490                 495     


Glu Lys 
        


<210>  17
<211>  1497
<212>  DNA
<213>  Rickettsia prowazekii

<400>  17
atgagtactt ccaaaagtga aaattatctt tcagaactaa gaaagataat ttggcctata       60

gaacaatatg aaaataagaa gtttttgcca cttgcattta tgatgttctg tattttatta      120

aactactcaa ctcttcgttc aattaaagac ggttttgtag taacagatat aggtacagaa      180

tcgataagtt ttttaaaaac atatatagta ctaccttctg ctgtaattgc tatgataatt      240

tatgttaagc tatgtgatat tttaaagcaa gaaaacgtat tttatgttat tacttcattt      300

tttttagggt attttgcatt atttgccttt gttctttacc catatcctga tttagtccac      360

cctgatcata aaactataga atctttaagt ttagcttatc ctaatttcaa atggtttata      420

aaaatagttg gtaaatggag ttttgcatct ttttatacta ttgccgagct ttggggaaca      480

atgatgctta gtttattatt ttggcaattt gctaatcaaa ttactaaaat cgctgaagct      540

aaacgtttct actcaatgtt tggtttactt gcgaatttag cattgcctgt aacatcagtg      600

gttattggat attttctaca cgaaaaaact caaatagttg cagaacattt aaaatttgta      660

cctttatttg ttataatgat aacaagtagt ttcttaataa tattaacata tagatggatg      720

aataaaaatg ttctaactga tcctagacta tatgatccag cattagtaaa agaaaaaaaa      780

actaaagcta aattgtcgtt catagaaagt ttaaaaatga tctttacttc gaaatatgta      840

ggttatattg cattattaat tattgcttat ggtgtttcag taaatttagt tgaaggtgtt      900

tggaaatcca aagtaaaaga attatatccg acaaaggagg cttataccat atatatgggt      960

cagttccaat tttatcaggg ttgggttgca attgctttta tgctgatagg tagtaatatt     1020

ttaagaaaag tatcatggct aactgcagct atgatcactc cattaatgat gttcataaca     1080

ggtgcggcat ttttttcatt tatatttttt gatagcgtta ttgcaatgaa tttaaccggc     1140

atccttgctt caagtccttt aacacttgct gttatgatcg gtatgattca aaatgtttta     1200

agtaaaggtg tgaaatattc tttatttgat gcaactaaaa atatggcgta tattccactt     1260

gataaggatt tacgagtcaa agggcaagct gccgttgaag ttatcggagg aaggctcggt     1320

aaatcaggcg gtgctattat tcaatctaca ttctttattt tatttcctgt atttggtttt     1380

atagaggcga ctccttattt tgcttctata ttctttataa tagtaatatt atggatattt     1440

gcagttaaag gtttaaataa agagtatcaa gttttggtaa ataaaaatga aaaatag        1497


<210>  18
<211>  507
<212>  PRT
<213>  Rickettsia prowazekii

<400>  18

Met Asn Ile Val Asp Ser Asn Cys Thr Ile Trp His Lys Ala Arg Asn 
1               5                   10                  15      


Ser Lys Phe Arg His Ile Val Trp Pro Ile Arg Ser Tyr Glu Leu Thr 
            20                  25                  30          


Lys Phe Ile Pro Met Thr Leu Leu Met Phe Phe Ile Leu Leu Asn Gln 
        35                  40                  45              


Asn Leu Val Arg Ser Ile Lys Asp Ser Phe Val Val Thr Leu Ile Ser 
    50                  55                  60                  


Ser Glu Val Leu Ser Phe Ile Lys Leu Trp Gly Glu Met Pro Met Gly 
65                  70                  75                  80  


Val Leu Phe Val Ile Leu Tyr Ser Lys Leu Cys Asn Ile Met Thr Thr 
                85                  90                  95      


Glu Gln Val Phe Arg Ile Ile Thr Ser Thr Phe Leu Phe Phe Phe Ala 
            100                 105                 110         


Ile Phe Gly Phe Ile Leu Phe Pro Tyr Lys Glu Phe Phe His Pro Asn 
        115                 120                 125             


Pro Glu Leu Ile Asn Gln Tyr Ile Ile Val Leu Pro His Leu Lys Trp 
    130                 135                 140                 


Phe Leu Ile Ile Trp Gly Gln Trp Ser Leu Val Leu Phe Tyr Ile Met 
145                 150                 155                 160 


Gly Glu Leu Trp Pro Val Ile Val Phe Thr Leu Leu Tyr Trp Gln Leu 
                165                 170                 175     


Ala Asn Lys Ile Thr Lys Val Glu Glu Ala Pro Arg Phe Tyr Ser Phe 
            180                 185                 190         


Phe Thr Leu Phe Gly Gln Thr Asn Leu Leu Phe Ser Gly Thr Val Ile 
        195                 200                 205             


Ile Tyr Phe Ala Lys Ser Glu His Phe Leu Leu Pro Leu Phe Ala His 
    210                 215                 220                 


Leu Asn Asp Thr Asn Glu Ile Leu Leu Lys Ser Phe Ile Thr Val Ile 
225                 230                 235                 240 


Leu Ile Ser Gly Leu Ile Cys Leu Ala Leu His Lys Leu Ile Asp Lys 
                245                 250                 255     


Ser Val Val Glu Ala Asp Lys Asn Ile Lys Phe Lys Asn Gln Arg Thr 
            260                 265                 270         


Asp Ile Leu Lys Leu Ser Leu Leu Glu Ser Ala Lys Ile Ile Leu Thr 
        275                 280                 285             


Ser Arg Tyr Leu Gly Phe Ile Cys Leu Leu Val Met Ser Tyr Ser Met 
    290                 295                 300                 


Ser Ile Asn Leu Ile Glu Gly Leu Trp Met Ser Lys Val Lys Gln Leu 
305                 310                 315                 320 


Tyr Pro Ala Thr Lys Asp Phe Ile Ser Tyr His Gly Glu Val Leu Phe 
                325                 330                 335     


Trp Thr Gly Val Leu Thr Leu Val Ser Ala Phe Leu Gly Ser Ser Leu 
            340                 345                 350         


Ile Arg Ile Tyr Gly Trp Phe Trp Gly Ala Ile Ile Thr Pro Ile Met 
        355                 360                 365             


Met Phe Val Ala Gly Val Met Phe Phe Ser Phe Thr Ile Phe Glu Gln 
    370                 375                 380                 


His Leu Gly Asn Ile Val Asn Thr Leu Gly Tyr Ser Ser Pro Leu Val 
385                 390                 395                 400 


Ile Ile Val Phe Ile Gly Gly Leu Trp His Val Phe Ala Lys Ser Val 
                405                 410                 415     


Lys Tyr Ser Leu Phe Asp Ala Thr Lys Glu Met Val Tyr Ile Pro Leu 
            420                 425                 430         


Asp Asn Glu Ile Lys Thr Lys Gly Lys Ala Ala Val Asp Val Met Gly 
        435                 440                 445             


Ala Lys Ile Gly Lys Ser Ile Gly Ala Ile Ile Gln Phe Ile Ser Phe 
    450                 455                 460                 


Ser Ile Phe Pro Asn Ala Val His Asn Asp Ile Ala Gly Leu Leu Met 
465                 470                 475                 480 


Val Thr Phe Ile Ile Val Cys Ile Leu Trp Leu Tyr Gly Val Lys Val 
                485                 490                 495     


Leu Ser Gln Asn Tyr Asn Lys Met Ile Lys Arg 
            500                 505         


<210>  19
<211>  1524
<212>  DNA
<213>  Rickettsia prowazekii

<400>  19
atgaatatag tagattctaa ctgtacaatt tggcataaag caagaaatag taaatttagg       60

catatagtat ggccaattag atcgtatgaa ttaacaaaat tcatcccgat gactttatta      120

atgtttttta ttttacttaa tcaaaattta gtgcgtagta ttaaagatag ttttgttgtt      180

acattaatta gttcagaagt attaagtttt ataaaacttt ggggtgaaat gccgatgggg      240

gttttatttg ttattcttta ttctaaactc tgtaatatta tgaccacaga gcaagttttt      300

aggataatta ccagtacctt tttatttttc tttgcaattt ttggttttat tttattccca      360

tacaaagagt tttttcatcc taaccctgaa ttaattaatc aatatatcat tgttctgcct      420

cacttaaagt ggtttttaat aatttgggga caatggagtt tagtattatt ttatataatg      480

ggtgagttat ggcctgttat agtttttact cttttatatt ggcagcttgc aaataaaatc      540

accaaagtcg aagaagcacc aagattttac tcatttttta ctttatttgg acaaactaat      600

ttgctcttct caggcactgt aattatttat tttgctaaga gcgaacattt tttattacct      660

ttatttgctc atttaaatga cacaaatgaa attcttttaa aatcattcat cacagttatt      720

ttaatatcag gattaatttg tttagctctc cataagctaa ttgataaatc agttgtagaa      780

gctgataaaa atataaaatt taaaaaccaa agaacagata tattaaaatt aagcttgctc      840

gaaagtgcaa aaataatctt aacgtctaga tatcttggtt ttatttgtct tctcgtaatg      900

tcttattcta tgagtattaa cctaatagaa ggattgtgga tgtcaaaagt aaaacaactc      960

tatcctgcta caaaggattt tatatcatat cacggtgaag tattgttttg gactggagtg     1020

ttaactttag ttagtgcatt tttaggcagt agtttaatta gaatttatgg ctggttttgg     1080

ggggctatta taacaccgat tatgatgttt gtagcagggg ttatgttttt ttcattcaca     1140

atttttgaac aacacttagg aaatatagta aatactcttg gctatagttc tccacttgtc     1200

attatagttt ttattggtgg actttggcat gtatttgcta aatctgtaaa gtattccctt     1260

ttcgatgcta ctaaagaaat ggtgtatatt ccactagata atgaaattaa gactaaaggt     1320

aaagcagcag ttgatgttat gggtgctaaa attggtaagt caataggtgc tattattcaa     1380

ttcatatcct ttagtatctt tccaaatgct gtacataacg acatagcagg cttattgatg     1440

gttactttta ttatcgtatg tatattatgg ctatatggag tgaaagtttt atcacaaaat     1500

tataataaaa tgataaaacg ttaa                                            1524


<210>  20
<211>  501
<212>  PRT
<213>  Rickettsia prowazekii

<400>  20

Met Leu Pro Pro Lys Ile Phe Phe Glu Lys Val Lys Glu Ile Ile Trp 
1               5                   10                  15      


Pro Ile Glu Arg Lys Glu Leu Lys Leu Phe Ile Pro Met Ala Leu Met 
            20                  25                  30          


Met Leu Cys Ile Leu Phe Asn Phe Gly Ala Leu Arg Ser Ile Lys Asp 
        35                  40                  45              


Ser Leu Val Val Pro Ser Met Gly Ala Glu Ile Ile Ser Phe Leu Lys 
    50                  55                  60                  


Leu Trp Leu Val Leu Pro Ser Cys Val Ile Phe Thr Ile Leu Tyr Val 
65                  70                  75                  80  


Lys Leu Ser Asn Lys Leu Asn Phe Glu Tyr Ile Phe Tyr Ser Ile Val 
                85                  90                  95      


Gly Thr Phe Leu Leu Phe Phe Leu Leu Phe Ala Tyr Ile Ile Tyr Pro 
            100                 105                 110         


Asn Gln Asp Ile Tyr His Pro Asn Asp Ala Met Ile Asn Asn Leu Ile 
        115                 120                 125             


Ala Ser Tyr Pro Asn Leu Lys Trp Phe Ile Lys Ile Gly Ser Lys Trp 
    130                 135                 140                 


Ser Tyr Ala Leu Met Tyr Ile Phe Ser Glu Leu Trp Ser Ala Val Val 
145                 150                 155                 160 


Ile Asn Leu Met Phe Trp Gln Phe Ala Asn His Ile Phe Asp Thr Ala 
                165                 170                 175     


Lys Ala Lys Arg Phe Tyr Pro Val Leu Gly Met Val Gly Asn Ile Gly 
            180                 185                 190         


Leu Ile Ile Ala Gly Ser Val Leu Val Phe Phe Ser Ser Gly Gln Tyr 
        195                 200                 205             


Ile Ile Asp Ser Glu Leu Leu Thr Asp Ser Tyr Asn Ser Ser Ser Asn 
    210                 215                 220                 


Asn Ser Ile Met Leu Gln Pro Ile Ile Ser Ile Ile Val Thr Ala Gly 
225                 230                 235                 240 


Ile Ile Ala Met Phe Leu Phe Arg Ile Ile Asn Lys Phe Ile Leu Thr 
                245                 250                 255     


Asn Ser Ile Asn Val Leu Asp Val Lys Lys Val Ala Ala Lys Thr Lys 
            260                 265                 270         


Thr Lys Leu Ala Leu Ile Glu Ser Ile Lys Leu Ile Ile His Ser Lys 
        275                 280                 285             


Tyr Ile Gly Arg Ile Ala Leu Leu Ile Ile Cys Tyr Gly Leu Leu Ile 
    290                 295                 300                 


Asn Ile Val Glu Gly Pro Trp Lys Ala Lys Ile Lys Glu Leu His Pro 
305                 310                 315                 320 


Asn Thr Val Asp Tyr Val Asn Phe Met Gly Met Phe Asn Ile Trp Met 
                325                 330                 335     


Gly Ile Ser Cys Val Thr Phe Met Ile Ile Gly Ser Asn Ile Leu Arg 
            340                 345                 350         


Arg Leu Gly Trp Leu Ile Ser Ala Leu Leu Thr Pro Ile Met Leu Ser 
        355                 360                 365             


Ile Thr Gly Phe Met Phe Phe Ile Phe Ile Ile Phe Ile Glu Glu Ile 
    370                 375                 380                 


Gly Thr Cys Phe Gly Asp Phe Asn Leu Leu Tyr Val Ala Ile Ile Val 
385                 390                 395                 400 


Gly Ala Ile Gln Asn Ile Leu Ser Lys Ser Ser Lys Tyr Ser Leu Phe 
                405                 410                 415     


Asp Ser Thr Lys Glu Met Ala Tyr Ile Pro Leu Ser Leu Glu Leu Arg 
            420                 425                 430         


Thr Lys Gly Lys Ala Ala Val Glu Val Ile Gly Thr Lys Phe Gly Lys 
        435                 440                 445             


Ser Leu Gly Ala Phe Ile Gln Ser Leu Ile Phe Ile Ile Ile Pro Thr 
    450                 455                 460                 


Ala Thr Phe Asp Ser Ile Ile Ile Tyr Leu Leu Val Ile Phe Ile Val 
465                 470                 475                 480 


Met Met Asn Leu Trp Ile Trp Asn Ile Ile Lys Leu Asn Lys Glu Tyr 
                485                 490                 495     


Ile Lys Leu Cys Gln 
            500     


<210>  21
<211>  1506
<212>  DNA
<213>  Rickettsia prowazekii

<400>  21
atgttaccgc ctaaaatttt ctttgaaaaa gttaaagaaa taatttggcc tatagaaagg       60

aaagaattaa agctatttat accaatggct ttaatgatgt tatgtatcct gtttaatttt      120

ggggctttaa gatctattaa agatagttta gtagtaccct ctatgggggc tgaaattatt      180

agtttcttaa aattatggtt agtgctaccc tcgtgcgtaa tttttacgat actttacgtt      240

aaacttagta ataaattaaa ttttgaatat attttctata gtatagtcgg tactttttta      300

ctatttttct tattatttgc ctatattatt tatccaaatc aagatattta tcatcctaat      360

gatgcaatga taaataattt aattgcttca taccctaatt taaagtggtt tattaaaata      420

ggtagtaaat ggagttatgc gctaatgtat attttctcag aattatggag tgcagtagtt      480

ataaacttaa tgttttggca atttgctaat cacatttttg atactgctaa agctaaacga      540

ttttatcctg ttcttgggat ggttggtaat atcggtctta taatagcagg cagcgtactt      600

gttttttttt caagtgggca gtacatcatt gattcagaat tattaacgga ttcttataat      660

tcatcttcta acaattctat catgcttcag ccaatcatat caattattgt tactgcagga      720

ataattgcta tgtttttatt tagaataata aataaattta ttttaactaa ttctataaat      780

gttttagatg taaaaaaagt tgctgctaaa acaaaaacaa aacttgcatt aattgaaagt      840

ataaaattaa taattcattc aaaatatata ggtcgtattg cattattaat aatctgttat      900

ggattactaa taaatatagt tgaaggacct tggaaagcga aaataaaaga attacatcca      960

aatactgtag attatgttaa ttttatgggc atgtttaata tttggatggg gatctcatgt     1020

gttactttca tgataatagg tagtaatatt cttagaaggc ttggttggct catttctgca     1080

ttattaactc ctattatgtt atctattaca ggcttcatgt tttttatctt tataattttt     1140

attgaagaaa taggtacatg ttttggtgat tttaatcttc tatatgtagc gattattgtc     1200

ggagcaattc agaatatact tagtaaatcg tctaaatatt cattattcga ttcaacaaaa     1260

gaaatggcat atattccttt atctttagaa ctgagaacta agggaaaagc cgctgtagag     1320

gtaataggaa cgaaatttgg taaatcactt ggagcattta tccagtcttt gatatttatt     1380

attattccaa cggctacctt tgattctatt ataatatatt tactagtaat ttttatagtg     1440

atgatgaatt tatggatttg gaatattata aaattaaata aggaatatat aaagctgtgt     1500

caataa                                                                1506


<210>  22
<211>  512
<212>  PRT
<213>  Rickettsia prowazekii

<400>  22

Met Thr Ile Asn Ala Ser Asn Ile Glu Asn Ser Phe Ser Lys Ile Asn 
1               5                   10                  15      


Ser His Phe Ser Lys Leu Thr Asp Tyr Ile Trp Pro Ile Lys Arg His 
            20                  25                  30          


Glu Ile Ser Lys Phe Leu Phe Ile Thr Leu Leu Met Phe Cys Ile Leu 
        35                  40                  45              


Phe Ile Gln Asn Leu Ile Arg Ala Leu Lys Asp Ser Ile Val Thr Thr 
    50                  55                  60                  


Met Ile Gly Ala Glu Thr Ile Ser Phe Leu Lys Phe Trp Gly Val Met 
65                  70                  75                  80  


Pro Ser Ala Phe Leu Ile Thr Val Ile Tyr Val Lys Leu Val Asn Arg 
                85                  90                  95      


Met Lys Ala Glu Asn Ile Phe Tyr Leu Ile Ile Ser Ile Phe Leu Thr 
            100                 105                 110         


Phe Phe Ala Leu Phe Ala Tyr Val Ile Phe Pro Asn His Glu Met Leu 
        115                 120                 125             


His Leu Arg Pro Val Thr Val His Asn Leu Thr Ala Ser Leu Pro Asn 
    130                 135                 140                 


Leu Lys Trp Phe Ile Leu Leu Leu Ser Lys Trp Ser Phe Ser Leu Phe 
145                 150                 155                 160 


Tyr Ile Ile Ala Glu Leu Trp Pro Asn Val Val Phe Ala Leu Leu Phe 
                165                 170                 175     


Trp Gln Phe Val Asn Asn Ile Thr Thr Val Glu Glu Ser Lys Arg Phe 
            180                 185                 190         


Tyr Pro Leu Phe Gly Leu Leu Ser Gln Thr Gly Ile Tyr Leu Ala Gly 
        195                 200                 205             


His Phe Leu Glu Asn Leu Ser Asn Ile Asn Tyr Tyr Val Thr Asn Lys 
    210                 215                 220                 


Phe Ala Leu Gln Ser Ser Phe His Thr Leu Ser Ile Gln Ile Ile Leu 
225                 230                 235                 240 


Thr Ile Val Leu Ile Leu Gly Ile Val Ser Ile Lys Thr Phe Trp Leu 
                245                 250                 255     


Leu Asn His Lys Val Leu Asp Lys Lys His Met Ala Leu Leu Arg Phe 
            260                 265                 270         


Lys Thr Lys Asn Lys Ser Ile Thr Ile Ala Lys Ser Phe Gln Met Ile 
        275                 280                 285             


Leu Ser Ser Arg His Ile Arg Leu Ile Ala Thr Leu Leu Ile Cys Tyr 
    290                 295                 300                 


Gly Ile Ala Ile Asn Leu Val Glu Gly Pro Trp Lys Ala Ala Ala Thr 
305                 310                 315                 320 


Lys Ile Tyr Lys Thr Pro Thr Glu Tyr Ala Ala Phe Ile Gly Ser Tyr 
                325                 330                 335     


Leu Ser Tyr Thr Gly Val Phe Thr Ile Phe Phe Val Leu Leu Gly Ser 
            340                 345                 350         


Asn Ile Val Arg Arg Met Gly Trp Phe Thr Ser Ala Val Ile Thr Pro 
        355                 360                 365             


Ser Ile Val Phe Ile Thr Gly Ile Leu Phe Phe Ala Val Asn Asn Phe 
    370                 375                 380                 


Glu Gly Phe Ala Gly Leu Ile Ile Ala Asn Phe Ile Leu Thr Asp Pro 
385                 390                 395                 400 


Ala Leu Val Ala Ile Thr Ile Gly Ala Ile Gln Asn Val Leu Ser Lys 
                405                 410                 415     


Ser Ser Lys Tyr Thr Leu Phe Asp Ser Thr Lys Glu Met Ala Tyr Val 
            420                 425                 430         


Pro Leu Glu Pro Glu Ile Lys Ile Ser Gly Lys Ala Ala Ala Asp Val 
        435                 440                 445             


Ile Gly Thr Lys Leu Gly Lys Ser Gly Ser Ala Phe Leu Gln Ser Leu 
    450                 455                 460                 


Ile Phe Ile Ile Leu Pro Ser Ala Ser Tyr Gln Ser Ile Ser Ile Cys 
465                 470                 475                 480 


Leu Met Ile Ile Phe Ile Leu Thr Cys Val Thr Trp Ile Trp Ala Thr 
                485                 490                 495     


Lys Glu Leu Asn Lys Glu Tyr Lys Asn Ser Ile Lys Phe Ser Gln Lys 
            500                 505                 510         


<210>  23
<211>  1539
<212>  DNA
<213>  Rickettsia prowazekii

<400>  23
atgacgatta acgccagtaa tatagaaaat tctttttcta aaatcaatag ccatttttct       60

aagcttacag attatatctg gcctataaaa cgccacgaaa tttctaagtt tttattcatt      120

acattattaa tgttctgtat tttatttatt caaaatctca tcagagcttt aaaagatagt      180

attgttacta ctatgatagg tgctgagact atatcatttt tgaaattttg gggcgtgatg      240

ccgtcagcat tcttaataac tgttatatat gttaaacttg tcaataggat gaaagcagaa      300

aatatatttt atcttattat atcaattttt ttaacattct ttgctttgtt tgcatacgtt      360

attttcccaa atcatgaaat gctgcattta aggcctgtaa ccgtgcataa tttaacggca      420

agtttaccga atttaaaatg gtttatactt cttttatcaa aatggagttt ttcactattt      480

tatataatag ccgaattatg gccaaatgta gtttttgcat tactgttttg gcagtttgtg      540

aataatatta ctacagtaga agaatcgaaa agattttatc cattatttgg tttacttagt      600

caaacaggta tttatttagc aggacatttt ttagaaaatc taagtaatat aaattattat      660

gtcactaata aatttgcatt gcaatcgtct tttcatacac tttctataca aattatacta      720

actatagtat taattttagg catagtatcg ataaaaactt tttggttact taatcataaa      780

gtactagaca aaaagcatat ggcattactc aggttcaaaa caaaaaataa atctattact      840

attgctaaaa gttttcagat gattctatcg tcaagacaca ttagattaat tgcaactttg      900

cttatctgct atggcattgc aattaattta gtagaaggcc cttggaaagc agcagcaact      960

aaaatttata aaactccaac cgaatatgca gcttttatag gaagttattt aagctacact     1020

ggagtattta ctattttctt tgttctactt ggttccaata tagttagaag aatgggctgg     1080

tttacttcag ctgtgatcac accttcaata gtttttatta ccggtatatt attttttgct     1140

gttaataatt ttgaaggctt tgctggctta ataatagcaa attttatttt gaccgatcct     1200

gctttagttg ctataacaat aggtgctatt caaaatgtac ttagtaaatc aagcaaatat     1260

actttatttg attctacaaa agaaatggct tatgttcctt tagaaccaga aatcaaaata     1320

agtggtaagg ctgctgccga cgttataggt acaaaactcg gtaaatccgg tagtgcattt     1380

ttacaatcat taatatttat aatattacct tctgctagtt atcaatctat ttcaatctgt     1440

ttaatgatta tatttatcct cacttgcgta acttggattt gggctactaa agaactaaat     1500

aaagaatata aaaattctat taaattttct caaaaataa                            1539


<210>  24
<211>  500
<212>  PRT
<213>  Rickettsia prowazekii

<400>  24

Met Leu Ser Thr Ser Pro Ser Arg Ser Phe Lys Asn Lys Phe Arg Ala 
1               5                   10                  15      


Ala Phe Trp Pro Val His Asn Tyr Glu Leu Gly Lys Phe Ile Pro Ile 
            20                  25                  30          


Ser Ala Leu Met Phe Cys Ile Leu Phe Asn Gln Asn Ile Leu Arg Ile 
        35                  40                  45              


Leu Lys Asp Ser Ile Leu Ile Ser Glu Ile Ser Ala Glu Ile Ala Gly 
    50                  55                  60                  


Phe Ala Lys Val Tyr Cys Val Thr Pro Val Ala Ala Leu Phe Val Ile 
65                  70                  75                  80  


Ile Tyr Ala Lys Met Ile Asn His Leu Thr Phe Glu Lys Ile Phe Tyr 
                85                  90                  95      


Tyr Leu Ser Ala Phe Phe Ile Ser Cys Phe Ile Leu Phe Ala Phe Val 
            100                 105                 110         


Ile Tyr Pro Asn Ile His Ile Phe His Val His Pro Asp Thr Leu Ser 
        115                 120                 125             


Asp Trp Met Asn Lys Tyr Pro His Phe Lys Trp Tyr Ile Ser Leu Val 
    130                 135                 140                 


Gly Asn Trp Gly Tyr Ile Val Tyr Tyr Ser Leu Ala Glu Leu Trp Pro 
145                 150                 155                 160 


Asn Ile Phe Tyr Val Leu Leu Phe Trp Gln Phe Thr Asn Glu Leu Thr 
                165                 170                 175     


Thr Thr Glu Glu Ala Lys Arg Phe Tyr Thr Leu Phe Ser Leu Phe Gly 
            180                 185                 190         


Asn Ser Ser Leu Ile Leu Val Gly Phe Leu Met Met Asn Leu Ser Ser 
        195                 200                 205             


Glu Asp Thr Ile Ile Lys Lys Phe Ile Ser Ile Ser Asp Ser Lys Ile 
    210                 215                 220                 


Thr Leu Val Gln Val Ser Thr Thr Ile Ile Ala Ile Val Ala Ile Ile 
225                 230                 235                 240 


Cys Cys Leu Leu Val Arg Phe Ile Ser Lys Tyr Ile Phe Thr Asn Pro 
                245                 250                 255     


Leu Phe Tyr His Lys Thr Lys Ser Ser Arg Ser Thr Ala Gln Arg Met 
            260                 265                 270         


Gly Leu Ile Lys Ser Phe Lys Tyr Ile Val Lys Ser Lys Tyr Leu Trp 
        275                 280                 285             


Leu Leu Leu Ile Cys Ser Ala Ala Phe Gly Phe Ala Ile Asn Leu Val 
    290                 295                 300                 


Glu Ala Val Trp Lys Ala Lys Ile Lys Glu Leu Tyr Pro Thr Val Asn 
305                 310                 315                 320 


Thr Tyr Ala Glu Phe Asn Ser Leu Tyr Ile Leu Trp Thr Gly Val Ala 
                325                 330                 335     


Ile Ile Val Met Thr Ile Ile Gly Asn Asn Val Met Arg Met His Asn 
            340                 345                 350         


Trp Phe Val Ala Ala Val Ile Ser Pro Val Ile Ile Met Val Thr Gly 
        355                 360                 365             


Val Leu Phe Phe Gly Leu Ile Val Phe Asp Gln Gln Ile Leu Ser Leu 
    370                 375                 380                 


Phe Asp Gly Ala Ile Leu Met Ser Pro Leu Ala Leu Ala Val Ser Ile 
385                 390                 395                 400 


Gly Gly Ile Gln Asn Ile Leu Ala Lys Gly Thr Lys Tyr Ser Ile Trp 
                405                 410                 415     


Asp Thr Ser Arg Glu Met Leu Tyr Ile Pro Leu Asp Asp Glu Leu Lys 
            420                 425                 430         


Thr Lys Gly Lys Ala Ala Val Asp Val Ile Ser Ala Lys Val Gly Lys 
        435                 440                 445             


Ser Ser Ser Gly Leu Val Gln Ser Ile Ile Phe Thr Leu Val Pro Asn 
    450                 455                 460                 


Ala Thr Phe Thr Ser Ile Ser Pro Ile Leu Met Val Val Phe Thr Phe 
465                 470                 475                 480 


Val Cys Phe Ala Trp Ile Tyr Ala Val Arg Lys Ile Tyr Phe Glu Tyr 
                485                 490                 495     


Gln Lys Ile Ala 
            500 


<210>  25
<211>  1503
<212>  DNA
<213>  Rickettsia prowazekii

<400>  25
atgctaagta cctcaccgtc acgatcgttt aaaaacaaat ttagagcagc attttggcct       60

gtgcataatt atgaacttgg gaaatttatt ccgatcagcg ccttaatgtt ttgtatttta      120

tttaatcaaa atattttgcg aatcttaaag gatagtattt taatctctga gattagtgca      180

gaaatagcag gatttgctaa agtttactgc gttacacctg tagctgcttt gtttgttatt      240

atttatgcta aaatgatcaa tcatttgaca tttgaaaaaa tcttttatta tttaagtgca      300

ttttttataa gctgttttat tttatttgcc tttgtgattt atcctaatat tcatattttt      360

catgtacatc ctgatacact atcagactgg atgaacaaat atcctcattt taagtggtat      420

atctcattag taggtaattg gggttatata gtatattata gtcttgccga gctttggcct      480

aatatttttt acgtattatt attttggcag tttactaatg aacttactac taccgaagaa      540

gcaaaaagat tttatactct cttttcgcta ttcggtaatt cttccttaat attagtcggc      600

tttttaatga tgaatttatc atcggaagat actattatta agaaatttat aagtatttca      660

gatagtaaaa tcactttagt tcaagtatca acgacgatta tagcaattgt tgcaatcatt      720

tgttgtttgt tagttaggtt tattagcaag tacattttta ctaatccatt attttatcat      780

aaaacaaaaa gcagtagatc aactgcacaa cggatgggac taattaaaag ctttaaatat      840

attgtgaaat caaaatattt atggctactt ttaatttgtt ctgcagcttt cggatttgct      900

ataaacttag tcgaagcagt atggaaagca aaaattaagg aattatatcc gactgtaaat      960

acctacgctg aattcaatag tctgtatata ctttggacag gcgttgcgat aattgttatg     1020

acaattatcg gtaataacgt catgcgtatg cataattggt ttgtagccgc agttatttcc     1080

ccagtgataa taatggtgac aggtgttttg ttctttggac taattgtatt tgatcaacaa     1140

attttatcat tatttgatgg cgcgatttta atgtcacctc ttgcacttgc tgtttctatt     1200

ggcggtattc agaatatttt agccaaaggc actaaatatt ctatatggga tacttcaaga     1260

gaaatgttat atataccact tgatgatgaa cttaaaacaa agggtaaagc agcagttgat     1320

gttataagtg caaaagttgg aaaatcctct agtggtcttg tacaatccat tatttttact     1380

ttagtgccaa atgcgacctt tacctcaatc tcgccgattt taatggtagt atttacgttc     1440

gtatgctttg cttggattta tgcagtaaga aaaatatatt ttgaatatca aaaaatagcc     1500

tga                                                                   1503


<210>  26
<211>  194
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic polynucleotide


<220>
<221>  misc_feature
<222>  (69)..(69)
<223>  dNaM or its biotinylated analog dMMO2^SSBIO

<400>  26
gcaggcatgc aagcttggcg taatcatggt catagctgtt tcctgtgtga aattgttatc       60

cgctcacant tccacacaac atacgagccg gaagcataaa gtgtaaagcc tggggtgcct      120

aatgagtgag ctaactcaca ttaattgcgt tgcgctcact gcccgctttc cagtcgggaa      180

acctgtcgtg ccag                                                        194


<210>  27
<211>  304
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic polynucleotide


<220>
<221>  misc_feature
<222>  (179)..(179)
<223>  dNaM

<400>  27
gctgcaaggc gattaagttg ggtaacgcca gggttttccc agtcacgacg ttgtaaaacg       60

acggccagtg aattcgagct cggtacccgg ggatcctcta gagtcgacct gcaggcatgc      120

aagcttggcg taatcatggt catagctgtt tcctgtgtga aattgttatc cgctcacant      180

tccacacaac atacgagccg gaagcataaa gtgtaaagcc tggggtgcct aatgagtgag      240

ctaactcaca ttaattgcgt tgcgctcact gcccgctttc cagtcgggaa acctgtcgtg      300

ccag                                                                   304


<210>  28
<211>  5151
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic polynucleotide; sequence of the pACS plasmid

<400>  28
ggggaattgt gagcggataa caattcccct gtagaaataa ttttgtttaa ctttaataag       60

gagatatacc atgagaccat ttccgacgat tgccttgatt tcggtttttc tttcggcggc      120

gactcgcatt tcggcaactt cctctcatca agcaagtgca cttcctctca aaaagggaac      180

gcatgtcccg gactctccga agttgtcaaa gctatatatc atggccaaaa ccaagagtgt      240

atcctcgtcc ttcgaccccc ctcggggagg cagtactgtt gcaccaacta caccgttggc      300

aaccggcggt gcgctccgca aagtgcgaca agccgtcttt cccatctacg gaaaccaaga      360

agtcaccaaa tttctgctca tcggatccat taaattcttt ataatcttgg cactcacgct      420

cacgcgtgat accaaggaca cgttgattgt cacgcaatgt ggtgccgaag cgattgcctt      480

tctcaaaata tacggggtgc tacccgcagc gaccgcattt atcgcgctct attccaaaat      540

gtccaacgcc atgggcaaaa aaatgctatt ttattccact tgcattcctt tctttacctt      600

tttcgggctg tttgatgttt tcatttaccc gaacgcggag cgactgcacc ctagtttgga      660

agccgtgcag gcaattctcc cgggcggtgc cgcatctggc ggcatggcgg ttctggccaa      720

gattgcgaca cactggacat cggccttatt ttacgtcatg gcggaaatat attcttccgt      780

atcggtgggg ctattgtttt ggcagtttgc gaacgacgtc gtcaacgtgg atcaggccaa      840

gcgcttttat ccattatttg ctcaaatgag tggcctcgct ccagttttag cgggccagta      900

tgtggtacgg tttgccagca aagcggtcaa ctttgaggca tccatgcatc gactcacggc      960

ggccgtaaca tttgctggta ttatgatttg catcttttac caactcagtt cgtcatatgt     1020

ggagcgaacg gaatcagcaa agccagcggc agataacgag cagtctatca aaccgaaaaa     1080

gaagaaaccc aaaatgtcca tggttgaatc ggggaaattt ctcgcgtcaa gtcagtacct     1140

gcgtctaatt gccatgctgg tgctgggata cggcctcagt attaacttta ccgaaatcat     1200

gtggaaaagc ttggtgaaga aacaatatcc agacccgcta gattatcaac gatttatggg     1260

taacttctcg tcagcggttg gtttgagcac atgcattgtt attttcttcg gtgtgcacgt     1320

gatccgtttg ttggggtgga aagtcggagc gttggctaca cctgggatca tggccattct     1380

agcgttaccc ttttttgctt gcattttgtt gggtttggat agtccagcac gattggagat     1440

cgccgtaatc tttggaacaa ttcagagttt gctgagcaaa acctccaagt atgccctttt     1500

cgaccctacc acacaaatgg cttatattcc tctggacgac gaatcaaagg tcaaaggaaa     1560

agcggcaatt gatgttttgg gatcgcggat tggcaagagt ggaggctcac tgatccagca     1620

gggcttggtc tttgtttttg gaaatatcat taatgccgca cctgtagtag gggttgtcta     1680

ctacagtgtc cttgttgcgt ggatgagcgc agctggccga ctaagtgggc tttttcaagc     1740

acaaacagaa atggataagg ccgacaaaat ggaggcaaag accaacaaag aaaagtagtt     1800

aacctaggct gctgccaccg ctgagcaata actagcataa ccccttgggg cctctaaacg     1860

ggtcttgagg ggttttttgc tgaaacctca ggcatttgag aagcacacgg tcacactgct     1920

tccggtagtc aataaaccgg taaaccagca atagacataa gcggctattt aacgaccctg     1980

ccctgaaccg acgaccgggt catcgtggcc ggatcttgcg gcccctcggc ttgaacgaat     2040

tgttagacat tatttgccga ctaccttggt gatctcgcct ttcacgtagt ggacaaattc     2100

ttccaactga tctgcgcgcg aggccaagcg atcttcttct tgtccaagat aagcctgtct     2160

agcttcaagt atgacgggct gatactgggc cggcaggcgc tccattgccc agtcggcagc     2220

gacatccttc ggcgcgattt tgccggttac tgcgctgtac caaatgcggg acaacgtaag     2280

cactacattt cgctcatcgc cagcccagtc gggcggcgag ttccatagcg ttaaggtttc     2340

atttagcgcc tcaaatagat cctgttcagg aaccggatca aagagttcct ccgccgctgg     2400

acctaccaag gcaacgctat gttctcttgc ttttgtcagc aagatagcca gatcaatgtc     2460

gatcgtggct ggctcgaaga tacctgcaag aatgtcattg cgctgccatt ctccaaattg     2520

cagttcgcgc ttagctggat aacgccacgg aatgatgtcg tcgtgcacaa caatggtgac     2580

ttctacagcg cggagaatct cgctctctcc aggggaagcc gaagtttcca aaaggtcgtt     2640

gatcaaagct cgccgcgttg tttcatcaag ccttacggtc accgtaacca gcaaatcaat     2700

atcactgtgt ggcttcaggc cgccatccac tgcggagccg tacaaatgta cggccagcaa     2760

cgtcggttcg agatggcgct cgatgacgcc aactacctct gatagttgag tcgatacttc     2820

ggcgatcacc gcttccctca tactcttcct ttttcaatat tattgaagca tttatcaggg     2880

ttattgtctc atgagcggat acatatttga atgtatttag aaaaataaac aaatagctag     2940

ctcactcggt cgctacgctc cgggcgtgag actgcggcgg gcgctgcgga cacatacaaa     3000

gttacccaca gattccgtgg ataagcaggg gactaacatg tgaggcaaaa cagcagggcc     3060

gcgccggtgg cgtttttcca taggctccgc cctcctgcca gagttcacat aaacagacgc     3120

ttttccggtg catctgtggg agccgtgagg ctcaaccatg aatctgacag tacgggcgaa     3180

acccgacagg acttaaagat ccccaccgtt tccggcgggt cgctccctct tgcgctctcc     3240

tgttccgacc ctgccgttta ccggatacct gttccgcctt tctcccttac gggaagtgtg     3300

gcgctttctc atagctcaca cactggtatc tcggctcggt gtaggtcgtt cgctccaagc     3360

tgggctgtaa gcaagaactc cccgttcagc ccgactgctg cgccttatcc ggtaactgtt     3420

cacttgagtc caacccggaa aagcacggta aaacgccact ggcagcagcc attggtaact     3480

gggagttcgc agaggatttg tttagctaaa cacgcggttg ctcttgaagt gtgcgccaaa     3540

gtccggctac actggaagga cagatttggt tgctgtgctc tgcgaaagcc agttaccacg     3600

gttaagcagt tccccaactg acttaacctt cgatcaaacc acctccccag gtggtttttt     3660

cgtttacagg gcaaaagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct     3720

tttctactga accgctctag atttcagtgc aatttatctc ttcaaatgta gcacctgaag     3780

tcagccccat acgatataag ttgtaattct catgttagtc atgccccgcg cccaccggaa     3840

ggagctgact gggttgaagg ctctcaaggg catcggtcga gatcccggtg cctaatgagt     3900

gagctaactt acattaattg cgttgcgctc actgcccgct ttccagtcgg gaaacctgtc     3960

gtgccagctg cattaatgaa tcggccaacg cgcggggaga ggcggtttgc gtattgggcg     4020

ccagggtggt ttttcttttc accagtgaga cgggcaacag ctgattgccc ttcaccgcct     4080

ggccctgaga gagttgcagc aagcggtcca cgctggtttg ccccagcagg cgaaaatcct     4140

gtttgatggt ggttaacggc gggatataac atgagctgtc ttcggtatcg tcgtatccca     4200

ctaccgagat gtccgcacca acgcgcagcc cggactcggt aatggcgcgc attgcgccca     4260

gcgccatctg atcgttggca accagcatcg cagtgggaac gatgccctca ttcagcattt     4320

gcatggtttg ttgaaaaccg gacatggcac tccagtcgcc ttcccgttcc gctatcggct     4380

gaatttgatt gcgagtgaga tatttatgcc agccagccag acgcagacgc gccgagacag     4440

aacttaatgg gcccgctaac agcgcgattt gctggtgacc caatgcgacc agatgctcca     4500

cgcccagtcg cgtaccgtct tcatgggaga aaataatact gttgatgggt gtctggtcag     4560

agacatcaag aaataacgcc ggaacattag tgcaggcagc ttccacagca atggcatcct     4620

ggtcatccag cggatagtta atgatcagcc cactgacgcg ttgcgcgaga agattgtgca     4680

ccgccgcttt acaggcttcg acgccgcttc gttctaccat cgacaccacc acgctggcac     4740

ccagttgatc ggcgcgagat ttaatcgccg cgacaatttg cgacggcgcg tgcagggcca     4800

gactggaggt ggcaacgcca atcagcaacg actgtttgcc cgccagttgt tgtgccacgc     4860

ggttgggaat gtaattcagc tccgccatcg ccgcttccac tttttcccgc gttttcgcag     4920

aaacgtggct ggcctggttc accacgcggg aaacggtctg ataagagaca ccggcatact     4980

ctgcgacatc gtataacgtt actggtttca cattcaccac cctgaattga ctctcttccg     5040

ggcgctatca tgccataccg cgaaaggttt tgcgccattc gatggtgtcc gggatctcga     5100

cgctctccct tatgcgactc ctgcattagg aaattaatac gactcactat a              5151


<210>  29
<211>  45
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic polynucleotide

<400>  29
ggtatatctc cttattaaag ttaaacaaaa ttatttctac agggg                       45


<210>  30
<211>  43
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic polynucleotide

<400>  30
gtttaacttt aataaggaga tataccatga gaccatttcc gac                         43


<210>  31
<211>  40
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic polynucleotide

<400>  31
gcagcagcct aggttaacta cttttctttg ttggtctttg                             40


<210>  32
<211>  48
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic polynucleotide

<400>  32
gtttaacttt aataaggaga tataccatga aaaaatcttg tacaatcc                    48


<210>  33
<211>  37
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic polynucleotide

<400>  33
gcagcagcct aggttaacta cttctggtgc tcttttg                                37


<210>  34
<211>  26
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic polynucleotide

<400>  34
ctatgaccat gattacgcca agcttg                                            26


<210>  35
<211>  75
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic polynucleotide


<220>
<221>  misc_feature
<222>  (34)..(34)
<223>  dNaM

<400>  35
ctgtttcctg tgtgaaattg ttatccgctc acanttccac acaacatacg agccggaagc       60

ataaagtgta aagcc                                                        75


<210>  36
<211>  75
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic polynucleotide

<400>  36
ctgtttcctg tgtgaaattg ttatccgctc acatttccac acaacatacg agccggaagc       60

ataaagtgta aagcc                                                        75


<210>  37
<211>  54
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic polynucleotide

<400>  37
caagcttggc gtaatcatgg tcatagctgt ttcctgtgtg aaattgttat ccgc             54


<210>  38
<211>  45
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic polynucleotide

<400>  38
gctcactcat taggcacccc aggctttaca ctttatgctt ccggc                       45


<210>  39
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic polynucleotide

<400>  39
gcaggcatgc aagcttggcg taatcatgg                                         29


<210>  40
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic polynucleotide

<400>  40
gctgcaaggc gattaagttg ggtaacgcc                                         29


<210>  41
<211>  25
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic polynucleotide

<400>  41
ctggcacgac aggtttcccg actgg                                             25


<210>  42
<211>  9
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic polypeptide

<400>  42

Asp Tyr Lys Asp Asp Asp Asp Lys Gly 
1               5                   


<210>  43
<211>  14
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic polypeptide

<400>  43

Gly Lys Pro Ile Pro Asn Pro Leu Leu Gly Leu Asp Ser Thr 
1               5                   10                  


<210>  44
<211>  10
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic polypeptide

<400>  44

Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu 
1               5                   10  


<210>  45
<211>  11
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic polypeptide

<400>  45

Gln Pro Glu Leu Ala Pro Glu Asp Pro Glu Asp 
1               5                   10      


<210>  46
<211>  9
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic polypeptide

<400>  46

Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
1               5                   


<210>  47
<211>  11
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic polypeptide

<400>  47

Tyr Thr Asp Ile Glu Met Asn Arg Leu Gly Lys 
1               5                   10      


<210>  48
<211>  6
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic polypeptide

<400>  48

Cys Cys Pro Gly Cys Cys 
1               5       


<210>  49
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic polynucleotide

<400>  49
ttaacctagg ctgctgccac cg                                                22


<210>  50
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic polynucleotide

<400>  50
tggggtgcct aatgagtgag c                                                 21


