                               SEQUENCE LISTING

<110> CALIFORNIA INSTITUTE OF TECHNOLOGY
 
<120> A METHOD FOR ENANTIOSELECTIVE CARBENE C-H INSERTION USING AN 
      IRON-CONTAINING PROTEIN CATALYST

<130> 086544-1123836 021910PC

<140> PCT/US2019/015027
<141> 2019-01-24

<150> 62/734,059
<151> 2018-09-20

<150> 62/693,547
<151> 2018-07-03

<150> 62/621,749
<151> 2018-01-25

<160> 18    

<170> PatentIn version 3.5

<210> 1
<211> 664
<212> PRT
<213> Bacillus megaterium

<400> 1
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn 
1               5                   10                  15      


Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile 
            20                  25                  30          


Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val 
        35                  40                  45              


Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu 
    50                  55                  60                  


Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp 
65                  70                  75                  80  


Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn Trp 
                85                  90                  95      


Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met 
            100                 105                 110         


Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln 
        115                 120                 125             


Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp 
    130                 135                 140                 


Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr 
145                 150                 155                 160 


Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser 
                165                 170                 175     


Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn 
            180                 185                 190         


Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp 
        195                 200                 205             


Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys 
    210                 215                 220                 


Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly 
225                 230                 235                 240 


Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr 
                245                 250                 255     


Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu 
            260                 265                 270         


Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln 
        275                 280                 285             


Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser 
    290                 295                 300                 


Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu 
305                 310                 315                 320 


Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys 
                325                 330                 335     


Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu 
            340                 345                 350         


Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly 
        355                 360                 365             


Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala 
    370                 375                 380                 


Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys 
385                 390                 395                 400 


Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met 
                405                 410                 415     


Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp 
            420                 425                 430         


Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala 
        435                 440                 445             


Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu 
    450                 455                 460                 


Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr 
465                 470                 475                 480 


Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr 
                485                 490                 495     


Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln 
            500                 505                 510         


Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala 
        515                 520                 525             


Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala 
    530                 535                 540                 


Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys 
545                 550                 555                 560 


Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr 
                565                 570                 575     


Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys 
            580                 585                 590         


Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp 
        595                 600                 605             


Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val 
    610                 615                 620                 


Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser 
625                 630                 635                 640 


Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala 
                645                 650                 655     


Lys Met His Gly Ala Phe Ser Thr 
            660                 


<210> 2
<211> 1048
<212> PRT
<213> Bacillus megaterium

<400> 2
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn 
1               5                   10                  15      


Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile 
            20                  25                  30          


Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val 
        35                  40                  45              


Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu 
    50                  55                  60                  


Ser Arg Phe Asp Lys Asn Leu Ser Gln Ala Leu Lys Phe Val Arg Asp 
65                  70                  75                  80  


Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn Trp 
                85                  90                  95      


Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met 
            100                 105                 110         


Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln 
        115                 120                 125             


Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Pro Glu Asp 
    130                 135                 140                 


Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr 
145                 150                 155                 160 


Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Thr Ser 
                165                 170                 175     


Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gln Arg Ala Asn 
            180                 185                 190         


Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp 
        195                 200                 205             


Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys 
    210                 215                 220                 


Ala Ser Gly Glu Gln Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly 
225                 230                 235                 240 


Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn Ile Arg Tyr 
                245                 250                 255     


Gln Ile Ile Thr Phe Leu Ile Ala Gly His Glu Thr Thr Ser Gly Leu 
            260                 265                 270         


Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln 
        275                 280                 285             


Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser 
    290                 295                 300                 


Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu 
305                 310                 315                 320 


Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys 
                325                 330                 335     


Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu 
            340                 345                 350         


Leu Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Ile Trp Gly 
        355                 360                 365             


Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala 
    370                 375                 380                 


Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Cys 
385                 390                 395                 400 


Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met 
                405                 410                 415     


Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp 
            420                 425                 430         


Ile Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala 
        435                 440                 445             


Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu 
    450                 455                 460                 


Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr 
465                 470                 475                 480 


Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr 
                485                 490                 495     


Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln 
            500                 505                 510         


Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala 
        515                 520                 525             


Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala 
    530                 535                 540                 


Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys 
545                 550                 555                 560 


Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr 
                565                 570                 575     


Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys 
            580                 585                 590         


Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp 
        595                 600                 605             


Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val 
    610                 615                 620                 


Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser 
625                 630                 635                 640 


Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala 
                645                 650                 655     


Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu 
            660                 665                 670         


Gln Gln Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu Ile Glu Leu 
        675                 680                 685             


Pro Lys Glu Ala Ser Tyr Gln Glu Gly Asp His Leu Gly Val Ile Pro 
    690                 695                 700                 


Arg Asn Tyr Glu Gly Ile Val Asn Arg Val Thr Ala Arg Phe Gly Leu 
705                 710                 715                 720 


Asp Ala Ser Gln Gln Ile Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala 
                725                 730                 735     


His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gln Tyr 
            740                 745                 750         


Val Glu Leu Gln Asp Pro Val Thr Arg Thr Gln Leu Arg Ala Met Ala 
        755                 760                 765             


Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu 
    770                 775                 780                 


Glu Lys Gln Ala Tyr Lys Glu Gln Val Leu Ala Lys Arg Leu Thr Met 
785                 790                 795                 800 


Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu 
                805                 810                 815     


Phe Ile Ala Leu Leu Pro Ser Ile Arg Pro Arg Tyr Tyr Ser Ile Ser 
            820                 825                 830         


Ser Ser Pro Arg Val Asp Glu Lys Gln Ala Ser Ile Thr Val Ser Val 
        835                 840                 845             


Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly Ile Ala 
    850                 855                 860                 


Ser Asn Tyr Leu Ala Glu Leu Gln Glu Gly Asp Thr Ile Thr Cys Phe 
865                 870                 875                 880 


Ile Ser Thr Pro Gln Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr 
                885                 890                 895     


Pro Leu Ile Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly 
            900                 905                 910         


Phe Val Gln Ala Arg Lys Gln Leu Lys Glu Gln Gly Gln Ser Leu Gly 
        915                 920                 925             


Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu 
    930                 935                 940                 


Tyr Gln Glu Glu Leu Glu Asn Ala Gln Ser Glu Gly Ile Ile Thr Leu 
945                 950                 955                 960 


His Thr Ala Phe Ser Arg Met Pro Asn Gln Pro Lys Thr Tyr Val Gln 
                965                 970                 975     


His Val Met Glu Gln Asp Gly Lys Lys Leu Ile Glu Leu Leu Asp Gln 
            980                 985                 990         


Gly Ala His Phe Tyr Ile Cys Gly  Asp Gly Ser Gln Met  Ala Pro Ala 
        995                 1000                 1005             


Val Glu  Ala Thr Leu Met Lys  Ser Tyr Ala Asp Val  His Gln Val 
    1010                 1015                 1020             


Ser Glu  Ala Asp Ala Arg Leu  Trp Leu Gln Gln Leu  Glu Glu Lys 
    1025                 1030                 1035             


Gly Arg  Tyr Ala Lys Asp Val  Trp Ala Gly 
    1040                 1045             


<210> 3
<211> 664
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      polypeptide

<400> 3
Thr Ile Lys Glu Met Pro Gln Pro Lys Thr Phe Gly Glu Leu Lys Asn 
1               5                   10                  15      


Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gln Ala Leu Met Lys Ile 
            20                  25                  30          


Ala Asp Glu Leu Gly Glu Ile Phe Lys Phe Glu Ala Pro Gly Arg Val 
        35                  40                  45              


Thr Arg Tyr Leu Ser Ser Gln Arg Leu Ile Lys Glu Ala Cys Asp Glu 
    50                  55                  60                  


Ser Arg Phe Asp Lys Glu Leu Ser Gln Pro Leu Lys Phe Leu Arg Asp 
65                  70                  75                  80  


Phe Leu Gly Asp Gly Leu Ala Thr Ser Trp Thr His Glu Lys Asn Trp 
                85                  90                  95      


Lys Lys Ala His Asn Ile Leu Leu Pro Ser Phe Ser Gln Gln Ala Met 
            100                 105                 110         


Lys Gly Tyr His Ala Met Met Val Asp Ile Ala Val Gln Leu Val Gln 
        115                 120                 125             


Lys Trp Glu Arg Leu Asn Ala Asp Glu His Ile Glu Val Ser Glu Asp 
    130                 135                 140                 


Met Thr Arg Leu Thr Leu Asp Thr Ile Gly Leu Cys Gly Phe Asn Tyr 
145                 150                 155                 160 


Arg Phe Asn Ser Phe Tyr Arg Asp Gln Pro His Pro Phe Ile Ile Ser 
                165                 170                 175     


Leu Val Arg Ala Leu Asp Glu Val Met Asn Lys Leu Gln Arg Ala Asn 
            180                 185                 190         


Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gln Phe Gln Glu Asp 
        195                 200                 205             


Ile Lys Val Met Asn Asp Leu Val Asp Lys Ile Ile Ala Asp Arg Lys 
    210                 215                 220                 


Ala Arg Gly Glu Gln Ser Asp Asp Leu Leu Thr Gln Met Leu Asn Gly 
225                 230                 235                 240 


Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Gly Asn Ile Arg Tyr 
                245                 250                 255     


Gln Ile Ile Thr Phe Leu Tyr Ala Gly Val Glu Gly Thr Ser Gly Leu 
            260                 265                 270         


Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gln 
        275                 280                 285             


Lys Val Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser 
    290                 295                 300                 


Tyr Lys Gln Val Lys Gln Leu Lys Tyr Val Gly Met Val Leu Asn Glu 
305                 310                 315                 320 


Ala Leu Arg Leu Trp Pro Thr Val Pro Tyr Phe Ser Leu Tyr Ala Lys 
                325                 330                 335     


Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu 
            340                 345                 350         


Val Met Val Leu Ile Pro Gln Leu His Arg Asp Lys Thr Val Trp Gly 
        355                 360                 365             


Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala 
    370                 375                 380                 


Ile Pro Gln His Ala Phe Lys Pro Phe Gly Asn Gly Gln Arg Ala Ser 
385                 390                 395                 400 


Ile Gly Gln Gln Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met 
                405                 410                 415     


Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp 
            420                 425                 430         


Ile Lys Glu Leu Leu Thr Leu Lys Pro Lys Gly Phe Val Val Lys Ala 
        435                 440                 445             


Lys Ser Lys Lys Ile Pro Leu Gly Gly Ile Pro Ser Pro Ser Thr Glu 
    450                 455                 460                 


Gln Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr 
465                 470                 475                 480 


Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr 
                485                 490                 495     


Ala Arg Asp Leu Ala Asp Ile Ala Met Ser Lys Gly Phe Ala Pro Gln 
            500                 505                 510         


Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala 
        515                 520                 525             


Val Leu Ile Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala 
    530                 535                 540                 


Lys Gln Phe Val Asp Trp Leu Asp Gln Ala Ser Ala Asp Glu Val Lys 
545                 550                 555                 560 


Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr 
                565                 570                 575     


Thr Tyr Gln Lys Val Pro Ala Phe Ile Asp Glu Thr Leu Ala Ala Lys 
            580                 585                 590         


Gly Ala Glu Asn Ile Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp 
        595                 600                 605             


Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val 
    610                 615                 620                 


Ala Ala Tyr Phe Asn Leu Asp Ile Glu Asn Ser Glu Asp Asn Lys Ser 
625                 630                 635                 640 


Thr Leu Ser Leu Gln Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala 
                645                 650                 655     


Lys Met His Gly Ala Phe Ser Thr 
            660                 


<210> 4
<211> 145
<212> PRT
<213> Rhodothermus marinus

<400> 4
Met Ala Pro Thr Leu Ser Glu Gln Thr Arg Gln Leu Val Arg Ala Ser 
1               5                   10                  15      


Val Pro Ala Leu Gln Lys His Ser Val Ala Ile Ser Ala Thr Met Tyr 
            20                  25                  30          


Arg Leu Leu Phe Glu Arg Tyr Pro Glu Thr Arg Ser Leu Phe Glu Leu 
        35                  40                  45              


Pro Glu Arg Gln Ile His Lys Leu Ala Ser Ala Leu Leu Ala Tyr Ala 
    50                  55                  60                  


Arg Ser Ile Asp Asn Pro Ser Ala Leu Gln Ala Ala Ile Arg Arg Met 
65                  70                  75                  80  


Val Leu Ser His Ala Arg Ala Gly Val Gln Ala Val His Tyr Pro Leu 
                85                  90                  95      


Val Trp Glu Cys Leu Arg Asp Ala Ile Lys Glu Val Leu Gly Pro Asp 
            100                 105                 110         


Ala Thr Glu Thr Leu Leu Gln Ala Trp Lys Glu Ala Tyr Asp Phe Leu 
        115                 120                 125             


Ala His Leu Leu Ser Thr Lys Glu Ala Gln Val Tyr Ala Val Leu Ala 
    130                 135                 140                 


Glu 
145 


<210> 5
<211> 133
<212> PRT
<213> Methylacidiphilum infernorum

<400> 5
Met Ile Asp Gln Lys Glu Lys Glu Leu Ile Lys Glu Ser Trp Lys Arg 
1               5                   10                  15      


Ile Glu Pro Asn Lys Asn Glu Ile Gly Leu Leu Phe Tyr Ala Asn Leu 
            20                  25                  30          


Phe Lys Glu Glu Pro Thr Val Ser Val Leu Phe Gln Asn Pro Ile Ser 
        35                  40                  45              


Ser Gln Ser Arg Lys Leu Met Gln Val Leu Gly Ile Leu Val Gln Gly 
    50                  55                  60                  


Ile Asp Asn Leu Glu Gly Leu Ile Pro Thr Leu Gln Asp Leu Gly Arg 
65                  70                  75                  80  


Arg His Lys Gln Tyr Gly Val Val Asp Ser His Tyr Pro Leu Val Gly 
                85                  90                  95      


Asp Cys Leu Leu Lys Ser Ile Gln Glu Tyr Leu Gly Gln Gly Phe Thr 
            100                 105                 110         


Glu Glu Ala Lys Ala Ala Trp Thr Lys Val Tyr Gly Ile Ala Ala Gln 
        115                 120                 125             


Val Met Thr Ala Glu 
    130             


<210> 6
<211> 140
<212> PRT
<213> Campylobacter jejuni

<400> 6
Met Thr Lys Glu Gln Ile Gln Ile Ile Lys Asp Cys Val Pro Ile Leu 
1               5                   10                  15      


Gln Lys Asn Gly Glu Asp Leu Thr Asn Glu Phe Tyr Lys Ile Met Phe 
            20                  25                  30          


Asn Asp Tyr Pro Glu Val Lys Pro Met Phe Asn Met Glu Lys Gln Ile 
        35                  40                  45              


Ser Gly Glu Gln Pro Lys Ala Leu Ala Met Ala Ile Leu Met Ala Ala 
    50                  55                  60                  


Lys Asn Ile Glu Asn Leu Glu Asn Met Arg Ser Phe Val Asp Lys Val 
65                  70                  75                  80  


Ala Ile Thr His Val Asn Leu Gly Val Lys Glu Glu His Tyr Pro Ile 
                85                  90                  95      


Val Gly Ala Cys Leu Leu Lys Ala Ile Lys Asn Leu Leu Asn Pro Asp 
            100                 105                 110         


Glu Ala Thr Leu Lys Ala Trp Glu Val Ala Tyr Gly Lys Ile Ala Lys 
        115                 120                 125             


Phe Tyr Ile Asp Ile Glu Lys Lys Leu Tyr Asp Lys 
    130                 135                 140 


<210> 7
<211> 146
<212> PRT
<213> Vitreoscilla stercoraria

<400> 7
Met Leu Asp Gln Gln Thr Ile Asn Ile Ile Lys Ala Thr Val Pro Val 
1               5                   10                  15      


Leu Lys Glu His Gly Val Thr Ile Thr Thr Thr Phe Tyr Lys Asn Leu 
            20                  25                  30          


Phe Ala Lys His Pro Glu Val Arg Pro Leu Phe Asp Met Gly Arg Gln 
        35                  40                  45              


Glu Ser Leu Glu Gln Pro Lys Ala Leu Ala Met Thr Val Leu Ala Ala 
    50                  55                  60                  


Ala Gln Asn Ile Glu Asn Leu Pro Ala Ile Leu Pro Ala Val Lys Lys 
65                  70                  75                  80  


Ile Ala Val Lys His Cys Gln Ala Gly Val Ala Ala Ala His Tyr Pro 
                85                  90                  95      


Ile Val Gly Gln Glu Leu Leu Gly Ala Ile Lys Glu Val Leu Gly Asp 
            100                 105                 110         


Ala Ala Thr Asp Asp Ile Leu Asp Ala Trp Gly Lys Ala Tyr Gly Val 
        115                 120                 125             


Ile Ala Asp Val Phe Ile Gln Val Glu Ala Asp Leu Tyr Ala Gln Ala 
    130                 135                 140                 


Val Glu 
145     


<210> 8
<211> 151
<212> PRT
<213> Mus musculus

<400> 8
Met Glu Arg Pro Glu Ser Glu Leu Ile Arg Gln Ser Trp Arg Val Val 
1               5                   10                  15      


Ser Arg Ser Pro Leu Glu His Gly Thr Val Leu Phe Ala Arg Leu Phe 
            20                  25                  30          


Ala Leu Glu Pro Ser Leu Leu Pro Leu Phe Gln Tyr Asn Gly Arg Gln 
        35                  40                  45              


Phe Ser Ser Pro Glu Asp Cys Leu Ser Ser Pro Glu Phe Leu Asp His 
    50                  55                  60                  


Ile Arg Lys Val Met Leu Val Ile Asp Ala Ala Val Thr Asn Val Glu 
65                  70                  75                  80  


Asp Leu Ser Ser Leu Glu Glu Tyr Leu Thr Ser Leu Gly Arg Lys His 
                85                  90                  95      


Arg Ala Val Gly Val Arg Leu Ser Ser Phe Ser Thr Val Gly Glu Ser 
            100                 105                 110         


Leu Leu Tyr Met Leu Glu Lys Cys Leu Gly Pro Asp Phe Thr Pro Ala 
        115                 120                 125             


Thr Arg Thr Ala Trp Ser Arg Leu Tyr Gly Ala Val Val Gln Ala Met 
    130                 135                 140                 


Ser Arg Gly Trp Asp Gly Glu 
145                 150     


<210> 9
<211> 151
<212> PRT
<213> Homo sapiens

<400> 9
Met Glu Arg Pro Glu Pro Glu Leu Ile Arg Gln Ser Trp Arg Ala Val 
1               5                   10                  15      


Ser Arg Ser Pro Leu Glu His Gly Thr Val Leu Phe Ala Arg Leu Phe 
            20                  25                  30          


Ala Leu Glu Pro Asp Leu Leu Pro Leu Phe Gln Tyr Asn Cys Arg Gln 
        35                  40                  45              


Phe Ser Ser Pro Glu Asp Cys Leu Ser Ser Pro Glu Phe Leu Asp His 
    50                  55                  60                  


Ile Arg Lys Val Met Leu Val Ile Asp Ala Ala Val Thr Asn Val Glu 
65                  70                  75                  80  


Asp Leu Ser Ser Leu Glu Glu Tyr Leu Ala Ser Leu Gly Arg Lys His 
                85                  90                  95      


Arg Ala Val Gly Val Lys Leu Ser Ser Phe Ser Thr Val Gly Glu Ser 
            100                 105                 110         


Leu Leu Tyr Met Leu Glu Lys Cys Leu Gly Pro Ala Phe Thr Pro Ala 
        115                 120                 125             


Thr Arg Ala Ala Trp Ser Gln Leu Tyr Gly Ala Val Val Gln Ala Met 
    130                 135                 140                 


Ser Arg Gly Trp Asp Gly Glu 
145                 150     


<210> 10
<211> 154
<212> PRT
<213> Physeter catodon

<400> 10
Met Val Leu Ser Glu Gly Glu Trp Gln Leu Val Leu His Val Trp Ala 
1               5                   10                  15      


Lys Val Glu Ala Asp Val Ala Gly His Gly Gln Asp Ile Leu Ile Arg 
            20                  25                  30          


Leu Phe Lys Ser His Pro Glu Thr Leu Glu Lys Phe Asp Arg Phe Lys 
        35                  40                  45              


His Leu Lys Thr Glu Ala Glu Met Lys Ala Ser Glu Asp Leu Lys Lys 
    50                  55                  60                  


His Gly Val Thr Val Leu Thr Ala Leu Gly Ala Ile Leu Lys Lys Lys 
65                  70                  75                  80  


Gly His His Glu Ala Glu Leu Lys Pro Leu Ala Gln Ser His Ala Thr 
                85                  90                  95      


Lys His Lys Ile Pro Ile Lys Tyr Leu Glu Phe Ile Ser Glu Ala Ile 
            100                 105                 110         


Ile His Val Leu His Ser Arg His Pro Gly Asp Phe Gly Ala Asp Ala 
        115                 120                 125             


Gln Gly Ala Met Asn Lys Ala Leu Glu Leu Phe Arg Lys Asp Ile Ala 
    130                 135                 140                 


Ala Lys Tyr Lys Glu Leu Gly Tyr Gln Gly 
145                 150                 


<210> 11
<211> 190
<212> PRT
<213> Homo sapiens

<400> 11
Met Glu Lys Val Pro Gly Glu Met Glu Ile Glu Arg Arg Glu Arg Ser 
1               5                   10                  15      


Glu Glu Leu Ser Glu Ala Glu Arg Lys Ala Val Gln Ala Met Trp Ala 
            20                  25                  30          


Arg Leu Tyr Ala Asn Cys Glu Asp Val Gly Val Ala Ile Leu Val Arg 
        35                  40                  45              


Phe Phe Val Asn Phe Pro Ser Ala Lys Gln Tyr Phe Ser Gln Phe Lys 
    50                  55                  60                  


His Met Glu Asp Pro Leu Glu Met Glu Arg Ser Pro Gln Leu Arg Lys 
65                  70                  75                  80  


His Ala Cys Arg Val Met Gly Ala Leu Asn Thr Val Val Glu Asn Leu 
                85                  90                  95      


His Asp Pro Asp Lys Val Ser Ser Val Leu Ala Leu Val Gly Lys Ala 
            100                 105                 110         


His Ala Leu Lys His Lys Val Glu Pro Val Tyr Phe Lys Ile Leu Ser 
        115                 120                 125             


Gly Val Ile Leu Glu Val Val Ala Glu Glu Phe Ala Ser Asp Phe Pro 
    130                 135                 140                 


Pro Glu Thr Gln Arg Ala Trp Ala Lys Leu Arg Gly Leu Ile Tyr Ser 
145                 150                 155                 160 


His Val Thr Ala Ala Tyr Lys Glu Val Gly Trp Val Gln Gln Val Pro 
                165                 170                 175     


Asn Ala Thr Thr Pro Pro Ala Thr Leu Pro Ser Ser Gly Pro 
            180                 185                 190 


<210> 12
<211> 338
<212> PRT
<213> Ascaris suum

<400> 12
Met Arg Ser Leu Leu Leu Leu Ser Ile Val Phe Phe Val Val Thr Val 
1               5                   10                  15      


Ser Ala Asn Lys Thr Arg Glu Leu Cys Met Lys Ser Leu Glu His Ala 
            20                  25                  30          


Lys Val Asp Thr Ser Asn Glu Ala Arg Gln Asp Gly Ile Asp Leu Tyr 
        35                  40                  45              


Lys His Met Phe Glu Asn Tyr Pro Pro Leu Arg Lys Tyr Phe Lys Asn 
    50                  55                  60                  


Arg Glu Glu Tyr Thr Ala Glu Asp Val Gln Asn Asp Pro Phe Phe Ala 
65                  70                  75                  80  


Lys Gln Gly Gln Lys Ile Leu Leu Ala Cys His Val Leu Cys Ala Thr 
                85                  90                  95      


Tyr Asp Asp Arg Glu Thr Phe Asn Ala Tyr Thr Arg Glu Leu Leu Asp 
            100                 105                 110         


Arg His Ala Arg Asp His Val His Met Pro Pro Glu Val Trp Thr Asp 
        115                 120                 125             


Phe Trp Lys Leu Phe Glu Glu Tyr Leu Gly Lys Lys Thr Thr Leu Asp 
    130                 135                 140                 


Glu Pro Thr Lys Gln Ala Trp His Glu Ile Gly Arg Glu Phe Ala Lys 
145                 150                 155                 160 


Glu Ile Asn Lys His Gly Arg His Ala Val Arg His Gln Cys Met Arg 
                165                 170                 175     


Ser Leu Gln His Ile Asp Ile Gly His Ser Glu Thr Ala Lys Gln Asn 
            180                 185                 190         


Gly Ile Asp Leu Tyr Lys His Met Phe Glu Asn Tyr Pro Ser Met Arg 
        195                 200                 205             


Glu Ala Phe Lys Asp Arg Glu Asn Tyr Thr Ala Glu Asp Val Gln Lys 
    210                 215                 220                 


Asp Pro Phe Phe Val Lys Gln Gly Gln Arg Ile Leu Leu Ala Cys His 
225                 230                 235                 240 


Leu Leu Cys Ala Ser Tyr Asp Asp Glu Glu Thr Phe His Met Tyr Val 
                245                 250                 255     


His Glu Leu Met Glu Arg His Glu Arg Leu Gly Val Gln Leu Pro Asp 
            260                 265                 270         


Gln His Trp Thr Asp Phe Trp Lys Leu Phe Glu Glu Phe Leu Glu Lys 
        275                 280                 285             


Lys Ser His Leu Cys Glu His Thr Lys His Ala Trp Ala Val Ile Gly 
    290                 295                 300                 


Lys Glu Phe Ala Tyr Glu Ala Thr Arg His Gly Lys Glu His His Glu 
305                 310                 315                 320 


His Lys Glu Glu His Lys Glu Glu His Lys Glu Glu His Lys Glu Glu 
                325                 330                 335     


Gln His 
        


<210> 13
<211> 132
<212> PRT
<213> Bacillus subtilis

<400> 13
Met Gly Gln Ser Phe Asn Ala Pro Tyr Glu Ala Ile Gly Glu Glu Leu 
1               5                   10                  15      


Leu Ser Gln Leu Val Asp Thr Phe Tyr Glu Arg Val Ala Ser His Pro 
            20                  25                  30          


Leu Leu Lys Pro Ile Phe Pro Ser Asp Leu Thr Glu Thr Ala Arg Lys 
        35                  40                  45              


Gln Lys Gln Phe Leu Thr Gln Tyr Leu Gly Gly Pro Pro Leu Tyr Thr 
    50                  55                  60                  


Glu Glu His Gly His Pro Met Leu Arg Ala Arg His Leu Pro Phe Pro 
65                  70                  75                  80  


Ile Thr Asn Glu Arg Ala Asp Ala Trp Leu Ser Cys Met Lys Asp Ala 
                85                  90                  95      


Met Asp His Val Gly Leu Glu Gly Glu Ile Arg Glu Phe Leu Phe Gly 
            100                 105                 110         


Arg Leu Glu Leu Thr Ala Arg His Met Val Asn Gln Thr Glu Ala Glu 
        115                 120                 125             


Asp Arg Ser Ser 
    130         


<210> 14
<211> 195
<212> PRT
<213> Methanosarcina acetivorans

<400> 14
Met Ser Val Glu Lys Ile Pro Gly Tyr Thr Tyr Gly Glu Thr Glu Asn 
1               5                   10                  15      


Arg Ala Pro Phe Asn Leu Glu Asp Leu Lys Leu Leu Lys Glu Ala Val 
            20                  25                  30          


Met Phe Thr Ala Glu Asp Glu Glu Tyr Ile Gln Lys Ala Gly Glu Val 
        35                  40                  45              


Leu Glu Asp Gln Val Glu Glu Ile Leu Asp Thr Trp Tyr Gly Phe Val 
    50                  55                  60                  


Gly Ser His Pro His Leu Leu Tyr Tyr Phe Thr Ser Pro Asp Gly Thr 
65                  70                  75                  80  


Pro Asn Glu Lys Tyr Leu Ala Ala Val Arg Lys Arg Phe Ser Arg Trp 
                85                  90                  95      


Ile Leu Asp Thr Cys Asn Arg Ser Tyr Asp Gln Ala Trp Leu Asp Tyr 
            100                 105                 110         


Gln Tyr Glu Ile Gly Leu Arg His His Arg Thr Lys Lys Asn Gln Thr 
        115                 120                 125             


Asp Asn Val Glu Ser Val Pro Asn Ile Gly Tyr Arg Tyr Leu Val Ala 
    130                 135                 140                 


Phe Ile Tyr Pro Ile Thr Ala Thr Met Lys Pro Phe Leu Ala Arg Lys 
145                 150                 155                 160 


Gly His Thr Pro Glu Glu Val Glu Lys Met Tyr Gln Ala Trp Phe Lys 
                165                 170                 175     


Ala Thr Thr Leu Gln Val Ala Leu Trp Ser Tyr Pro Tyr Val Lys Tyr 
            180                 185                 190         


Gly Asp Phe 
        195 


<210> 15
<211> 195
<212> PRT
<213> Aeropyrum pernix

<400> 15
Met Thr Pro Ser Asp Ile Pro Gly Tyr Asp Tyr Gly Arg Val Glu Lys 
1               5                   10                  15      


Ser Pro Ile Thr Asp Leu Glu Phe Asp Leu Leu Lys Lys Thr Val Met 
            20                  25                  30          


Leu Gly Glu Lys Asp Val Met Tyr Leu Lys Lys Ala Cys Asp Val Leu 
        35                  40                  45              


Lys Asp Gln Val Asp Glu Ile Leu Asp Leu Trp Tyr Gly Trp Val Ala 
    50                  55                  60                  


Ser Asn Glu His Leu Ile Tyr Tyr Phe Ser Asn Pro Asp Thr Gly Glu 
65                  70                  75                  80  


Pro Ile Lys Glu Tyr Leu Glu Arg Val Arg Ala Arg Phe Gly Ala Trp 
                85                  90                  95      


Ile Leu Asp Thr Thr Cys Arg Asp Tyr Asn Arg Glu Trp Leu Asp Tyr 
            100                 105                 110         


Gln Tyr Glu Val Gly Leu Arg His His Arg Ser Lys Lys Gly Val Thr 
        115                 120                 125             


Asp Gly Val Arg Thr Val Pro His Ile Pro Leu Arg Tyr Leu Ile Ala 
    130                 135                 140                 


Phe Ile Tyr Pro Ile Thr Ala Thr Ile Lys Pro Phe Leu Ala Lys Lys 
145                 150                 155                 160 


Gly Gly Ser Pro Glu Asp Ile Glu Gly Met Tyr Asn Ala Trp Phe Lys 
                165                 170                 175     


Ser Val Val Leu Gln Val Ala Ile Trp Ser His Pro Tyr Thr Lys Glu 
            180                 185                 190         


Asn Asp Trp 
        195 


<210> 16
<211> 192
<212> PRT
<213> Pyrobaculum ferrireducens

<400> 16
Met Arg Glu Ile Pro Gly Tyr Glu Phe Gly Lys Val Pro Asp Ala Pro 
1               5                   10                  15      


Ile Ser Asp Glu Glu Phe Glu Leu Leu Lys Lys Ser Val Met Trp Thr 
            20                  25                  30          


Glu Glu Asp Glu Lys Tyr Arg Lys Leu Ala Cys Glu Val Leu Lys Gly 
        35                  40                  45              


Gln Val Glu Gln Ile Leu Asp Leu Trp Tyr Gly Trp Val Gly Ser Asn 
    50                  55                  60                  


Pro His Leu Val Tyr Tyr Phe Gly Asp Arg Ser Gly Arg Pro Ile Pro 
65                  70                  75                  80  


Gln Tyr Leu Glu Ala Val Arg Lys Arg Phe Gly Gln Trp Ile Leu Asp 
                85                  90                  95      


Thr Val Cys Arg Ser Tyr Asp Arg Gln Trp Leu Asn Tyr Val Tyr Glu 
            100                 105                 110         


Ile Gly Leu Arg His His Arg Thr Lys Lys Gly Lys Thr Asp Gly Val 
        115                 120                 125             


Glu Thr Val Glu His Ile Pro Leu Arg Tyr Met Val Ala Phe Ile Ala 
    130                 135                 140                 


Pro Ile Gly Leu Thr Ile Lys Pro Phe Leu Glu Lys Gly Gly His Pro 
145                 150                 155                 160 


Pro Asp Val Val Glu Lys Met Trp Ala Ala Trp Ile Lys Ser Val Val 
                165                 170                 175     


Leu Gln Val Ala Ile Trp Ser His Pro Tyr Ala Lys Pro Gly Glu Trp 
            180                 185                 190         


<210> 17
<211> 400
<212> PRT
<213> Cupriavidus necator

<400> 17
Met Leu Thr Gln Lys Thr Lys Asp Ile Val Lys Ala Thr Ala Pro Val 
1               5                   10                  15      


Leu Ala Glu His Gly Tyr Asp Ile Ile Lys Cys Phe Tyr Gln Arg Met 
            20                  25                  30          


Phe Glu Ala His Pro Glu Leu Lys Asn Val Phe Asn Met Ala His Gln 
        35                  40                  45              


Glu Gln Gly Gln Gln Gln Gln Ala Leu Ala Arg Ala Val Tyr Ala Tyr 
    50                  55                  60                  


Ala Glu Asn Ile Glu Asp Pro Asn Ser Leu Met Ala Val Leu Lys Asn 
65                  70                  75                  80  


Ile Ala Asn Lys His Ala Ser Leu Gly Val Lys Pro Glu Gln Tyr Pro 
                85                  90                  95      


Ile Val Gly Glu His Leu Leu Ala Ala Ile Lys Glu Val Leu Gly Asn 
            100                 105                 110         


Ala Ala Thr Asp Asp Ile Ile Ser Ala Trp Ala Gln Ala Tyr Gly Asn 
        115                 120                 125             


Leu Ala Asp Val Leu Met Gly Met Glu Ser Glu Leu Tyr Glu Arg Ser 
    130                 135                 140                 


Ala Glu Gln Pro Gly Gly Trp Lys Gly Trp Arg Thr Phe Val Ile Arg 
145                 150                 155                 160 


Glu Lys Arg Pro Glu Ser Asp Val Ile Thr Ser Phe Ile Leu Glu Pro 
                165                 170                 175     


Ala Asp Gly Gly Pro Val Val Asn Phe Glu Pro Gly Gln Tyr Thr Ser 
            180                 185                 190         


Val Ala Ile Asp Val Pro Ala Leu Gly Leu Gln Gln Ile Arg Gln Tyr 
        195                 200                 205             


Ser Leu Ser Asp Met Pro Asn Gly Arg Ser Tyr Arg Ile Ser Val Lys 
    210                 215                 220                 


Arg Glu Gly Gly Gly Pro Gln Pro Pro Gly Tyr Val Ser Asn Leu Leu 
225                 230                 235                 240 


His Asp His Val Asn Val Gly Asp Gln Val Lys Leu Ala Ala Pro Tyr 
                245                 250                 255     


Gly Ser Phe His Ile Asp Val Asp Ala Lys Thr Pro Ile Val Leu Ile 
            260                 265                 270         


Ser Gly Gly Val Gly Leu Thr Pro Met Val Ser Met Leu Lys Val Ala 
        275                 280                 285             


Leu Gln Ala Pro Pro Arg Gln Val Val Phe Val His Gly Ala Arg Asn 
    290                 295                 300                 


Ser Ala Val His Ala Met Arg Asp Arg Leu Arg Glu Ala Ala Lys Thr 
305                 310                 315                 320 


Tyr Glu Asn Leu Asp Leu Phe Val Phe Tyr Asp Gln Pro Leu Pro Glu 
                325                 330                 335     


Asp Val Gln Gly Arg Asp Tyr Asp Tyr Pro Gly Leu Val Asp Val Lys 
            340                 345                 350         


Gln Ile Glu Lys Ser Ile Leu Leu Pro Asp Ala Asp Tyr Tyr Ile Cys 
        355                 360                 365             


Gly Pro Ile Pro Phe Met Arg Met Gln His Asp Ala Leu Lys Asn Leu 
    370                 375                 380                 


Gly Ile His Glu Ala Arg Ile His Tyr Glu Val Phe Gly Pro Asp Leu 
385                 390                 395                 400 


<210> 18
<211> 6
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Synthetic
      6xHis tag

<400> 18
His His His His His His 
1               5       


