                         SEQUENCE LISTING

<110>  California Institute of Technology
 
<120>  METHODS AND ENZYME CATALYSTS FOR THE SYNTHESIS OF NON-CANONICAL 
       AMINO ACIDS

<130>  1082068

<140>  PCT/US2018/030951
<141>  2018-05-03

<150>  US 62/500,698
<151>  2017-05-03

<160>  37    

<170>  PatentIn version 3.5

<210>  1
<211>  388
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  1

Met Trp Phe Gly Glu Phe Gly Gly Gln Tyr Val Pro Glu Thr Leu Val 
1               5                   10                  15      


Gly Pro Leu Lys Glu Leu Glu Lys Ala Tyr Lys Arg Phe Lys Asp Asp 
            20                  25                  30          


Glu Glu Phe Asn Arg Gln Leu Asn Tyr Tyr Leu Lys Thr Trp Ala Gly 
        35                  40                  45              


Arg Pro Thr Pro Leu Tyr Tyr Ala Lys Arg Leu Thr Glu Lys Ile Gly 
    50                  55                  60                  


Gly Ala Lys Val Tyr Leu Lys Arg Glu Asp Leu Val His Gly Gly Ala 
65                  70                  75                  80  


His Lys Thr Asn Asn Ala Ile Gly Gln Ala Leu Leu Ala Lys Leu Met 
                85                  90                  95      


Gly Lys Thr Arg Leu Ile Ala Glu Thr Gly Ala Gly Gln His Gly Val 
            100                 105                 110         


Ala Thr Ala Met Ala Gly Ala Leu Leu Gly Met Lys Val Asp Ile Tyr 
        115                 120                 125             


Met Gly Ala Glu Asp Val Glu Arg Gln Lys Met Asn Val Phe Arg Met 
    130                 135                 140                 


Lys Leu Leu Gly Ala Asn Val Ile Pro Val Asn Ser Gly Ser Arg Thr 
145                 150                 155                 160 


Leu Lys Asp Ala Ile Asn Glu Ala Leu Arg Asp Trp Val Ala Thr Phe 
                165                 170                 175     


Glu Tyr Thr His Tyr Leu Ile Gly Ser Val Val Gly Pro His Pro Tyr 
            180                 185                 190         


Pro Thr Ile Val Arg Asp Phe Gln Ser Val Ile Gly Arg Glu Ala Lys 
        195                 200                 205             


Ala Gln Ile Leu Glu Ala Glu Gly Gln Leu Pro Asp Val Ile Val Ala 
    210                 215                 220                 


Cys Val Gly Gly Gly Ser Asn Ala Met Gly Ile Phe Tyr Pro Phe Val 
225                 230                 235                 240 


Asn Asp Lys Lys Val Lys Leu Val Gly Val Glu Ala Gly Gly Lys Gly 
                245                 250                 255     


Leu Glu Ser Gly Lys His Ser Ala Ser Leu Asn Ala Gly Gln Val Gly 
            260                 265                 270         


Val Ser His Gly Met Leu Ser Tyr Phe Leu Gln Asp Glu Glu Gly Gln 
        275                 280                 285             


Ile Lys Pro Ser His Ser Ile Ala Pro Gly Leu Asp Tyr Pro Gly Val 
    290                 295                 300                 


Gly Pro Glu His Ala Tyr Leu Lys Lys Ile Gln Arg Ala Glu Tyr Val 
305                 310                 315                 320 


Ala Val Thr Asp Glu Glu Ala Leu Lys Ala Phe His Glu Leu Ser Arg 
                325                 330                 335     


Thr Glu Gly Ile Ile Pro Ala Leu Glu Ser Ala His Ala Val Ala Tyr 
            340                 345                 350         


Ala Met Lys Leu Ala Lys Glu Met Ser Arg Asp Glu Ile Ile Ile Val 
        355                 360                 365             


Asn Leu Ser Gly Arg Gly Asp Lys Asp Leu Asp Ile Val Leu Lys Ala 
    370                 375                 380                 


Ser Gly Asn Val 
385             


<210>  2
<211>  388
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  2

Met Trp Phe Gly Glu Phe Gly Gly Gln Tyr Val Pro Glu Thr Leu Val 
1               5                   10                  15      


Gly Pro Leu Lys Glu Leu Glu Lys Ala Tyr Lys Arg Phe Lys Asp Asp 
            20                  25                  30          


Glu Glu Phe Asn Arg Gln Leu Asn Tyr Tyr Leu Lys Thr Trp Ala Gly 
        35                  40                  45              


Arg Pro Thr Pro Leu Tyr Tyr Ala Lys Arg Leu Thr Glu Lys Ile Gly 
    50                  55                  60                  


Gly Ala Lys Val Tyr Leu Lys Arg Glu Asp Leu Val His Gly Gly Ala 
65                  70                  75                  80  


His Lys Thr Asn Asn Ala Ile Gly Gln Ala Pro Leu Ala Lys Leu Met 
                85                  90                  95      


Gly Lys Thr Arg Leu Ile Ala Glu Thr Gly Ala Gly Gln His Gly Val 
            100                 105                 110         


Ala Thr Ala Met Ala Gly Ala Leu Leu Gly Met Lys Val Asp Ile Tyr 
        115                 120                 125             


Met Gly Ala Glu Asp Val Glu Arg Gln Lys Met Asn Val Phe Arg Met 
    130                 135                 140                 


Lys Leu Leu Gly Ala Asn Val Ile Pro Val Asn Ser Gly Ser Arg Thr 
145                 150                 155                 160 


Ala Lys Asp Ala Ile Asn Glu Ala Leu Arg Asp Trp Val Ala Thr Phe 
                165                 170                 175     


Glu Tyr Thr His Tyr Leu Ile Gly Ser Val Val Gly Pro His Pro Tyr 
            180                 185                 190         


Pro Thr Ile Val Arg Asp Phe Gln Ser Val Ile Gly Arg Glu Ala Lys 
        195                 200                 205             


Ala Gln Ile Leu Glu Ala Glu Gly Gln Leu Pro Asp Val Ile Val Ala 
    210                 215                 220                 


Cys Val Gly Gly Gly Ser Asn Ala Met Gly Ile Phe Tyr Pro Phe Val 
225                 230                 235                 240 


Asn Asp Lys Lys Val Lys Leu Val Gly Val Glu Ala Gly Gly Lys Gly 
                245                 250                 255     


Leu Glu Ser Gly Lys His Ser Ala Ser Leu Asn Ala Gly Gln Val Gly 
            260                 265                 270         


Val Ser His Gly Met Leu Ser Tyr Phe Leu Gln Asp Glu Glu Gly Gln 
        275                 280                 285             


Ile Lys Pro Ser His Ser Ile Ala Pro Gly Leu Asp Tyr Pro Gly Val 
    290                 295                 300                 


Gly Pro Glu His Ala Tyr Leu Lys Lys Ile Gln Arg Ala Glu Tyr Val 
305                 310                 315                 320 


Ala Val Thr Asp Glu Glu Ala Leu Lys Ala Phe His Glu Leu Ser Arg 
                325                 330                 335     


Thr Glu Gly Ile Ile Pro Ala Leu Glu Ser Ala His Ala Val Ala Tyr 
            340                 345                 350         


Ala Met Lys Leu Ala Lys Glu Met Ser Arg Asp Glu Ile Ile Ile Val 
        355                 360                 365             


Asn Leu Ser Gly Arg Gly Asp Lys Asp Leu Asp Ile Val Leu Lys Ala 
    370                 375                 380                 


Ser Gly Asn Val 
385             


<210>  3
<211>  388
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  3

Met Trp Phe Gly Glu Phe Gly Gly Gln Tyr Val Pro Glu Thr Leu Val 
1               5                   10                  15      


Gly Pro Leu Lys Glu Leu Glu Lys Ala Tyr Lys Arg Phe Lys Asp Asp 
            20                  25                  30          


Glu Glu Phe Asn Arg Gln Leu Asn Tyr Tyr Leu Lys Thr Trp Ala Gly 
        35                  40                  45              


Arg Pro Thr Pro Leu Tyr Tyr Ala Lys Arg Leu Thr Glu Lys Ile Gly 
    50                  55                  60                  


Gly Ala Lys Val Tyr Leu Lys Arg Glu Asp Leu Val His Gly Gly Ala 
65                  70                  75                  80  


His Lys Thr Asn Asn Ala Ile Gly Gln Ala Pro Leu Ala Lys Leu Met 
                85                  90                  95      


Gly Lys Thr Arg Leu Ile Ala Glu Thr Gly Ala Gly Gln His Gly Val 
            100                 105                 110         


Ala Thr Ala Met Ala Gly Ala Leu Leu Gly Met Lys Val Asp Ile Tyr 
        115                 120                 125             


Met Gly Ala Glu Asp Val Glu Arg Gln Lys Met Asn Val Phe Arg Met 
    130                 135                 140                 


Lys Leu Leu Gly Ala Asn Val Ile Pro Val Asn Ser Gly Ser Arg Thr 
145                 150                 155                 160 


Ala Lys Asp Ala Ile Asn Glu Ala Leu Arg Asp Trp Glu Ala Thr Phe 
                165                 170                 175     


Glu Tyr Thr His Tyr Leu Ile Gly Ser Val Val Gly Pro His Pro Tyr 
            180                 185                 190         


Pro Thr Ile Val Arg Asp Phe Gln Ser Val Ile Gly Arg Glu Ala Lys 
        195                 200                 205             


Ala Gln Ile Leu Glu Ala Glu Gly Gln Leu Pro Asp Val Ile Val Ala 
    210                 215                 220                 


Cys Val Gly Gly Gly Ser Asn Ala Met Gly Ile Phe Tyr Pro Phe Val 
225                 230                 235                 240 


Asn Asp Lys Lys Val Lys Leu Val Gly Val Glu Ala Gly Gly Lys Gly 
                245                 250                 255     


Leu Glu Ser Gly Lys His Ser Ala Ser Leu Asn Ala Gly Gln Val Gly 
            260                 265                 270         


Val Ser His Gly Met Leu Ser Tyr Phe Leu Gln Asp Glu Glu Gly Gln 
        275                 280                 285             


Ile Lys Pro Ser His Ser Ile Ala Pro Gly Leu Asp Tyr Pro Gly Val 
    290                 295                 300                 


Gly Pro Glu His Ala Tyr Leu Lys Lys Ile Gln Arg Ala Glu Tyr Val 
305                 310                 315                 320 


Ala Val Thr Asp Glu Glu Ala Leu Lys Ala Phe His Glu Leu Ser Arg 
                325                 330                 335     


Thr Glu Gly Ile Ile Pro Ala Leu Glu Ser Ala His Ala Val Ala Tyr 
            340                 345                 350         


Ala Met Lys Leu Ala Lys Glu Met Ser Arg Asp Glu Ile Ile Ile Val 
        355                 360                 365             


Asn Leu Ser Gly Arg Gly Asp Lys Asp Leu Asp Ile Val Leu Lys Ala 
    370                 375                 380                 


Ser Gly Asn Val 
385             


<210>  4
<211>  388
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  4

Met Trp Phe Gly Glu Phe Gly Gly Gln Tyr Val Pro Glu Thr Leu Val 
1               5                   10                  15      


Gly Pro Leu Lys Glu Leu Glu Lys Ala Tyr Lys Arg Phe Lys Asp Asp 
            20                  25                  30          


Glu Glu Phe Asn Arg Gln Leu Asn Tyr Tyr Leu Lys Thr Trp Ala Gly 
        35                  40                  45              


Arg Pro Thr Pro Leu Tyr Tyr Ala Lys Arg Leu Thr Glu Lys Ile Gly 
    50                  55                  60                  


Gly Ala Lys Ile Tyr Leu Lys Arg Glu Asp Leu Val His Gly Gly Ala 
65                  70                  75                  80  


His Lys Thr Asn Asn Ala Ile Gly Gln Ala Pro Leu Ala Lys Leu Met 
                85                  90                  95      


Gly Lys Thr Arg Leu Ile Ala Glu Thr Gly Ala Gly Gln His Gly Val 
            100                 105                 110         


Ala Thr Ala Met Ala Gly Ala Leu Leu Gly Met Lys Val Asp Ile Tyr 
        115                 120                 125             


Met Gly Ala Glu Asp Val Glu Arg Gln Lys Met Asn Val Phe Arg Met 
    130                 135                 140                 


Lys Leu Leu Gly Ala Asn Val Ile Pro Val Asn Ser Gly Ser Arg Thr 
145                 150                 155                 160 


Ala Lys Asp Ala Ile Asn Glu Ala Leu Arg Asp Trp Glu Ala Thr Phe 
                165                 170                 175     


Glu Tyr Thr His Tyr Leu Ile Gly Ser Val Val Gly Pro His Pro Tyr 
            180                 185                 190         


Pro Thr Ile Val Arg Asp Phe Gln Ser Val Ile Gly Arg Glu Ala Lys 
        195                 200                 205             


Ala Gln Ile Leu Glu Ala Glu Gly Gln Leu Pro Asp Val Ile Val Ala 
    210                 215                 220                 


Cys Val Gly Gly Gly Ser Asn Ala Met Gly Ile Phe Tyr Pro Phe Val 
225                 230                 235                 240 


Asn Asp Lys Lys Val Lys Leu Val Gly Val Glu Ala Gly Gly Lys Gly 
                245                 250                 255     


Leu Glu Ser Gly Lys His Ser Ala Ser Leu Asn Ala Gly Gln Val Gly 
            260                 265                 270         


Val Leu His Gly Met Leu Ser Tyr Phe Leu Gln Asp Glu Glu Gly Gln 
        275                 280                 285             


Ile Lys Pro Ser His Ser Ile Ala Pro Gly Leu Asp Tyr Pro Gly Val 
    290                 295                 300                 


Gly Pro Glu His Ala Tyr Leu Lys Lys Ile Gln Arg Ala Glu Tyr Val 
305                 310                 315                 320 


Thr Val Thr Asp Glu Glu Ala Leu Lys Ala Phe His Glu Leu Ser Arg 
                325                 330                 335     


Thr Glu Gly Ile Ile Pro Ala Leu Glu Ser Ala His Ala Val Ala Tyr 
            340                 345                 350         


Ala Met Lys Leu Ala Lys Glu Met Ser Arg Asp Glu Ile Ile Ile Val 
        355                 360                 365             


Asn Leu Ser Gly Arg Gly Asp Lys Asp Leu Asp Ile Val Leu Lys Ala 
    370                 375                 380                 


Ser Gly Asn Val 
385             


<210>  5
<211>  388
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  5

Met Trp Phe Gly Glu Phe Gly Gly Gln Tyr Val Pro Glu Thr Leu Val 
1               5                   10                  15      


Gly Pro Leu Lys Glu Leu Glu Lys Ala Tyr Lys Arg Phe Lys Asp Asp 
            20                  25                  30          


Glu Glu Phe Asn Arg Gln Leu Asn Tyr Tyr Leu Lys Thr Trp Ala Gly 
        35                  40                  45              


Arg Pro Thr Pro Leu Tyr Tyr Ala Lys Arg Leu Thr Glu Lys Ile Gly 
    50                  55                  60                  


Gly Ala Lys Ile Tyr Leu Lys Arg Glu Asp Leu Val His Gly Gly Ala 
65                  70                  75                  80  


His Lys Thr Asn Asn Ala Ile Gly Gln Ala Leu Leu Ala Lys Leu Met 
                85                  90                  95      


Gly Lys Thr Arg Leu Ile Ala Glu Thr Gly Ala Gly Gln His Gly Val 
            100                 105                 110         


Ala Thr Ala Met Ala Gly Ala Leu Leu Gly Met Lys Val Asp Ile Tyr 
        115                 120                 125             


Met Gly Ala Glu Asp Val Glu Arg Gln Lys Leu Asn Val Phe Arg Met 
    130                 135                 140                 


Lys Leu Leu Gly Ala Asn Val Ile Pro Val Asn Ser Gly Ser Arg Thr 
145                 150                 155                 160 


Ala Lys Asp Ala Ile Asp Glu Ala Leu Arg Asp Trp Glu Ala Thr Phe 
                165                 170                 175     


Glu Tyr Thr His Tyr Leu Ile Gly Ser Val Val Gly Pro His Pro Tyr 
            180                 185                 190         


Pro Thr Ile Val Arg Asp Phe Gln Ser Val Ile Gly Arg Glu Ala Lys 
        195                 200                 205             


Ala Gln Ile Leu Glu Ala Glu Gly Gln Leu Pro Asp Val Ile Val Ala 
    210                 215                 220                 


Cys Val Gly Gly Gly Ser Asn Ala Met Gly Ile Phe Tyr Pro Phe Val 
225                 230                 235                 240 


Asn Asp Lys Lys Val Lys Leu Val Gly Val Glu Ala Gly Gly Lys Gly 
                245                 250                 255     


Leu Glu Ser Gly Lys His Ser Ala Ser Leu Asn Ala Gly Gln Val Gly 
            260                 265                 270         


Val Leu His Gly Met Leu Ser Tyr Phe Leu Gln Asp Glu Glu Gly Gln 
        275                 280                 285             


Ile Lys Pro Ser His Ser Ile Ala Pro Gly Leu Asp Tyr Pro Gly Val 
    290                 295                 300                 


Gly Pro Glu His Ala Tyr Leu Lys Lys Ile Gln Arg Ala Glu Tyr Val 
305                 310                 315                 320 


Thr Val Thr Asp Glu Glu Ala Leu Lys Ala Phe His Glu Leu Asn Arg 
                325                 330                 335     


Thr Glu Gly Ile Ile Pro Ala Leu Glu Ser Ala His Ala Val Ala Tyr 
            340                 345                 350         


Ala Met Lys Leu Ala Lys Glu Met Ser Arg Asp Glu Ile Ile Ile Val 
        355                 360                 365             


Asn Leu Ser Gly Arg Gly Asp Lys Asp Leu Asp Ile Val Leu Lys Ala 
    370                 375                 380                 


Ser Gly Asn Val 
385             


<210>  6
<211>  343
<212>  PRT
<213>  Thermotoga maritima

<400>  6

Met Ile Asp Leu Arg Ser Asp Thr Val Thr Lys Pro Thr Glu Glu Met 
1               5                   10                  15      


Arg Lys Ala Met Ala Gln Ala Glu Val Gly Asp Asp Val Tyr Gly Glu 
            20                  25                  30          


Asp Pro Thr Ile Asn Glu Leu Glu Arg Leu Ala Ala Glu Thr Phe Gly 
        35                  40                  45              


Lys Glu Ala Ala Leu Phe Val Pro Ser Gly Thr Met Gly Asn Gln Val 
    50                  55                  60                  


Ser Ile Met Ala His Thr Gln Arg Gly Asp Glu Val Ile Leu Glu Ala 
65                  70                  75                  80  


Asp Ser His Ile Phe Trp Tyr Glu Val Gly Ala Met Ala Val Leu Ser 
                85                  90                  95      


Gly Val Met Pro His Pro Val Pro Gly Lys Asn Gly Ala Met Asp Pro 
            100                 105                 110         


Asp Asp Val Arg Lys Ala Ile Arg Pro Arg Asn Ile His Phe Pro Arg 
        115                 120                 125             


Thr Ser Leu Ile Ala Ile Glu Asn Thr His Asn Arg Ser Gly Gly Arg 
    130                 135                 140                 


Val Val Pro Leu Glu Asn Ile Lys Glu Ile Cys Thr Ile Ala Lys Glu 
145                 150                 155                 160 


His Gly Ile Asn Val His Ile Asp Gly Ala Arg Ile Phe Asn Ala Ser 
                165                 170                 175     


Ile Ala Ser Gly Val Pro Val Lys Glu Tyr Ala Gly Tyr Ala Asp Ser 
            180                 185                 190         


Val Met Phe Cys Leu Ser Lys Gly Leu Cys Ala Pro Val Gly Ser Val 
        195                 200                 205             


Val Val Gly Asp Arg Asp Phe Ile Glu Arg Ala Arg Lys Ala Arg Lys 
    210                 215                 220                 


Met Leu Gly Gly Gly Met Arg Gln Ala Gly Val Leu Ala Ala Ala Gly 
225                 230                 235                 240 


Ile Ile Ala Leu Thr Lys Met Val Asp Arg Leu Lys Glu Asp His Glu 
                245                 250                 255     


Asn Ala Arg Phe Leu Ala Leu Lys Leu Lys Glu Ile Gly Tyr Ser Val 
            260                 265                 270         


Asn Pro Glu Asp Val Lys Thr Asn Met Val Ile Leu Arg Thr Asp Asn 
        275                 280                 285             


Leu Lys Val Asn Ala His Gly Phe Ile Glu Ala Leu Arg Asn Ser Gly 
    290                 295                 300                 


Val Leu Ala Asn Ala Val Ser Asp Thr Glu Ile Arg Leu Val Thr His 
305                 310                 315                 320 


Lys Asp Val Ser Arg Asn Asp Ile Glu Glu Ala Leu Asn Ile Phe Glu 
                325                 330                 335     


Lys Leu Phe Arg Lys Phe Ser 
            340             


<210>  7
<211>  389
<212>  PRT
<213>  Thermotoga maritima

<400>  7

Met Lys Gly Tyr Phe Gly Pro Tyr Gly Gly Gln Tyr Val Pro Glu Ile 
1               5                   10                  15      


Leu Met Pro Ala Leu Glu Glu Leu Glu Ala Ala Tyr Glu Glu Ile Met 
            20                  25                  30          


Lys Asp Glu Ser Phe Trp Lys Glu Phe Asn Asp Leu Leu Arg Asp Tyr 
        35                  40                  45              


Ala Gly Arg Pro Thr Pro Leu Tyr Phe Ala Arg Arg Leu Ser Glu Lys 
    50                  55                  60                  


Tyr Gly Ala Arg Ile Tyr Leu Lys Arg Glu Asp Leu Leu His Thr Gly 
65                  70                  75                  80  


Ala His Lys Ile Asn Asn Ala Ile Gly Gln Val Leu Leu Ala Lys Lys 
                85                  90                  95      


Met Gly Lys Thr Arg Ile Ile Ala Glu Thr Gly Ala Gly Gln His Gly 
            100                 105                 110         


Val Ala Thr Ala Thr Ala Ala Ala Leu Phe Gly Met Glu Cys Val Ile 
        115                 120                 125             


Tyr Met Gly Glu Glu Asp Thr Ile Arg Gln Lys Pro Asn Val Glu Arg 
    130                 135                 140                 


Met Lys Leu Leu Gly Ala Lys Val Val Pro Val Lys Ser Gly Ser Arg 
145                 150                 155                 160 


Thr Leu Lys Asp Ala Ile Asn Glu Ala Leu Arg Asp Trp Ile Thr Asn 
                165                 170                 175     


Leu Gln Thr Thr Tyr Tyr Val Ile Gly Ser Val Val Gly Pro His Pro 
            180                 185                 190         


Tyr Pro Ile Ile Val Arg Asn Phe Gln Lys Val Ile Gly Glu Glu Thr 
        195                 200                 205             


Lys Lys Gln Ile Leu Glu Lys Glu Gly Arg Leu Pro Asp Tyr Ile Val 
    210                 215                 220                 


Ala Cys Val Gly Gly Gly Ser Asn Ala Ala Gly Ile Phe Tyr Pro Phe 
225                 230                 235                 240 


Ile Asp Ser Gly Val Lys Leu Ile Gly Val Glu Ala Gly Gly Glu Gly 
                245                 250                 255     


Leu Glu Thr Gly Lys His Ala Ala Ser Leu Leu Lys Gly Lys Ile Gly 
            260                 265                 270         


Tyr Leu His Gly Ser Lys Thr Phe Val Leu Gln Asp Asp Trp Gly Gln 
        275                 280                 285             


Val Gln Val Thr His Ser Val Ser Ala Gly Leu Asp Tyr Ser Gly Val 
    290                 295                 300                 


Gly Pro Glu His Ala Tyr Trp Arg Glu Thr Gly Lys Val Leu Tyr Asp 
305                 310                 315                 320 


Ala Val Thr Asp Glu Glu Ala Leu Asp Ala Phe Ile Glu Leu Ser Arg 
                325                 330                 335     


Leu Glu Gly Ile Ile Pro Ala Leu Glu Ser Ser His Ala Leu Ala Tyr 
            340                 345                 350         


Leu Lys Lys Ile Asn Ile Lys Gly Lys Val Val Val Val Asn Leu Ser 
        355                 360                 365             


Gly Arg Gly Asp Lys Asp Leu Glu Ser Val Leu Asn His Pro Tyr Val 
    370                 375                 380                 


Arg Glu Arg Ile Arg 
385                 


<210>  8
<211>  397
<212>  PRT
<213>  Archaeoglobus fulgidus

<400>  8

Met Arg Cys Trp Leu Glu Asn Leu Ser Gly Gly Arg Lys Met Lys Phe 
1               5                   10                  15      


Gly Glu Phe Gly Gly Arg Phe Val Pro Glu Val Leu Ile Pro Pro Leu 
            20                  25                  30          


Glu Glu Leu Glu Lys Ala Tyr Asp Arg Phe Lys Asp Asp Glu Glu Phe 
        35                  40                  45              


Lys Ala Arg Leu Glu Tyr Tyr Leu Lys Ser Tyr Ala Gly Arg Pro Thr 
    50                  55                  60                  


Pro Leu Tyr Phe Ala Glu Asn Leu Ser Arg Glu Leu Gly Val Lys Ile 
65                  70                  75                  80  


Tyr Leu Lys Arg Glu Asp Leu Leu His Gly Gly Ala His Lys Ile Asn 
                85                  90                  95      


Asn Thr Ile Gly Gln Ala Leu Leu Ala Lys Phe Met Gly Lys Lys Arg 
            100                 105                 110         


Val Ile Ala Glu Thr Gly Ala Gly Gln His Gly Val Ala Thr Ala Met 
        115                 120                 125             


Ala Ala Ala Leu Leu Gly Leu Glu Ala Glu Ile Tyr Met Gly Ala Glu 
    130                 135                 140                 


Asp Tyr Glu Arg Gln Lys Met Asn Val Phe Arg Met Glu Leu Leu Gly 
145                 150                 155                 160 


Ala Lys Val Thr Ala Val Glu Ser Gly Ser Arg Thr Leu Lys Asp Ala 
                165                 170                 175     


Ile Asn Glu Ala Leu Arg Asp Trp Val Glu Ser Phe Glu His Thr His 
            180                 185                 190         


Tyr Leu Ile Gly Ser Val Val Gly Pro His Pro Phe Pro Thr Ile Val 
        195                 200                 205             


Arg Asp Phe Gln Ala Val Ile Gly Lys Glu Ala Arg Arg Gln Ile Ile 
    210                 215                 220                 


Glu Ala Glu Gly Gly Met Pro Asp Ala Ile Ile Ala Cys Val Gly Gly 
225                 230                 235                 240 


Gly Ser Asn Ala Met Gly Ile Phe His Pro Phe Leu Asn Asp Asp Val 
                245                 250                 255     


Arg Leu Ile Gly Val Glu Ala Gly Gly Glu Gly Ile Glu Ser Gly Arg 
            260                 265                 270         


His Ser Ala Ser Leu Thr Ala Gly Ser Lys Gly Val Leu His Gly Met 
        275                 280                 285             


Leu Ser Tyr Phe Leu Gln Asp Glu Glu Gly Met Met Leu Asp Thr His 
    290                 295                 300                 


Ser Val Ser Ala Gly Leu Asp Tyr Pro Gly Val Gly Pro Glu His Ala 
305                 310                 315                 320 


Tyr Leu Lys Glu Thr Gly Arg Cys Glu Tyr Val Thr Val Asn Asp Glu 
                325                 330                 335     


Glu Ala Leu Arg Ala Phe Lys Thr Leu Ser Lys Leu Glu Gly Ile Ile 
            340                 345                 350         


Pro Ala Leu Glu Ser Ala His Ala Ile Ala Tyr Ala Met Lys Met Ala 
        355                 360                 365             


Glu Glu Met Gln Arg Asp Asp Val Leu Val Val Asn Leu Ser Gly Arg 
    370                 375                 380                 


Gly Asp Lys Asp Met Asp Ile Val Arg Arg Arg Leu Ala 
385                 390                 395         


<210>  9
<211>  397
<212>  PRT
<213>  Escherichia coli

<400>  9

Met Thr Thr Leu Leu Asn Pro Tyr Phe Gly Glu Phe Gly Gly Met Tyr 
1               5                   10                  15      


Val Pro Gln Ile Leu Met Pro Ala Leu Arg Gln Leu Glu Glu Ala Phe 
            20                  25                  30          


Val Ser Ala Gln Lys Asp Pro Glu Phe Gln Ala Gln Phe Asn Asp Leu 
        35                  40                  45              


Leu Lys Asn Tyr Ala Gly Arg Pro Thr Ala Leu Thr Lys Cys Gln Asn 
    50                  55                  60                  


Ile Thr Ala Gly Thr Asn Thr Thr Leu Tyr Leu Lys Arg Glu Asp Leu 
65                  70                  75                  80  


Leu His Gly Gly Ala His Lys Thr Asn Gln Val Leu Gly Gln Ala Leu 
                85                  90                  95      


Leu Ala Lys Arg Met Gly Lys Thr Glu Ile Ile Ala Glu Thr Gly Ala 
            100                 105                 110         


Gly Gln His Gly Val Ala Ser Ala Leu Ala Ser Ala Leu Leu Gly Leu 
        115                 120                 125             


Lys Cys Arg Ile Tyr Met Gly Ala Lys Asp Val Glu Arg Gln Ser Pro 
    130                 135                 140                 


Asn Val Phe Arg Met Arg Leu Met Gly Ala Glu Val Ile Pro Val His 
145                 150                 155                 160 


Ser Gly Ser Ala Thr Leu Lys Asp Ala Cys Asn Glu Ala Leu Arg Asp 
                165                 170                 175     


Trp Ser Gly Ser Tyr Glu Thr Ala His Tyr Met Leu Gly Thr Ala Ala 
            180                 185                 190         


Gly Pro His Pro Tyr Pro Thr Ile Val Arg Glu Phe Gln Arg Met Ile 
        195                 200                 205             


Gly Glu Glu Thr Lys Ala Gln Ile Leu Glu Arg Glu Gly Arg Leu Pro 
    210                 215                 220                 


Asp Ala Val Ile Ala Cys Val Gly Gly Gly Ser Asn Ala Ile Gly Met 
225                 230                 235                 240 


Phe Ala Asp Phe Ile Asn Glu Thr Asn Val Gly Leu Ile Gly Val Glu 
                245                 250                 255     


Pro Gly Gly His Gly Ile Glu Thr Gly Glu His Gly Ala Pro Leu Lys 
            260                 265                 270         


His Gly Arg Val Gly Ile Tyr Phe Gly Met Lys Ala Pro Met Met Gln 
        275                 280                 285             


Thr Glu Asp Gly Gln Ile Glu Glu Ser Tyr Ser Ile Ser Ala Gly Leu 
    290                 295                 300                 


Asp Phe Pro Ser Val Gly Pro Gln His Ala Tyr Leu Asn Ser Thr Gly 
305                 310                 315                 320 


Arg Ala Asp Tyr Val Ser Ile Thr Asp Asp Glu Ala Leu Glu Ala Phe 
                325                 330                 335     


Lys Thr Leu Cys Leu His Glu Gly Ile Ile Pro Ala Leu Glu Ser Ser 
            340                 345                 350         


His Ala Leu Ala His Ala Leu Lys Met Met Arg Glu Asn Pro Asp Lys 
        355                 360                 365             


Glu Gln Leu Leu Val Val Asn Leu Ser Gly Arg Gly Asp Lys Asp Ile 
    370                 375                 380                 


Phe Thr Val His Asp Ile Leu Lys Ala Arg Gly Glu Ile 
385                 390                 395         


<210>  10
<211>  41
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  10
gaaataattt tgtttaactt taagaaggag atatacatat g                           41


<210>  11
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  11
gccggatctc agtggtggtg gtggtggtgc tcgag                                  35


<210>  12
<211>  41
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  12
catatgtata tctccttctt aaagttaaac aaaattattt c                           41


<210>  13
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  13
ctcgagcacc accaccacca ccactgagat ccggc                                  35


<210>  14
<211>  41
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  14
gaaataattt tgtttaactt taagaaggag atatacatat g                           41


<210>  15
<211>  26
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  15
ccagaaacgc tgrtagracc cctgaa                                            26


<210>  16
<211>  33
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  16
ggctctgcgt gattgggwag ctacttttga ata                                    33


<210>  17
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  17
gtgctgaata cgtgrcagta accgatgaag aa                                     32


<210>  18
<211>  41
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  18
gaaataattt tgtttaactt taagaaggag atatacatat g                           41


<210>  19
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  19
cggtggtgct aaartatacc tgaaacgtg                                         29


<210>  20
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  20
ggtcaggttg gtgtgtykca tggcatgctg tc                                     32


<210>  21
<211>  33
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  21
gatattgtcc tgaaagyatc tggcaacgtg ctc                                    33


<210>  22
<211>  26
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  22
ttcaggggty ctaycagcgt ttctgg                                            26


<210>  23
<211>  33
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  23
tattcaaaag tagctwccca atcacgcaga gcc                                    33


<210>  24
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  24
ttcttcatcg gttactgyca cgtattcagc ac                                     32


<210>  25
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  25
gccggatctc agtggtggtg gtggtggtgc tcgag                                  35


<210>  26
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  26
cacgtttcag gtatayttta gcaccaccg                                         29


<210>  27
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  27
gacagcatgc catgmracac accaacctga cc                                     32


<210>  28
<211>  33
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  28
gagcacgttg ccagatrctt tcaggacaat atc                                    33


<210>  29
<211>  35
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  29
gccggatctc agtggtggtg gtggtggtgc tcgag                                  35


<210>  30
<211>  33
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  30
ccggttctcg caccgggaaa gacgcaatca acg                                    33


<210>  31
<211>  50
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct


<220>
<221>  misc_feature
<222>  (32)..(32)
<223>  n =  A, C, G, or T

<220>
<221>  misc_feature
<222>  (33)..(33)
<223>  n = A, G, or T

<220>
<221>  misc_feature
<222>  (34)..(34)
<223>  n = T

<400>  31
cgtaattcca gttaactccg gttctcgcac cnnnaaagac gcaatcaacg                  50


<210>  32
<211>  23
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  32
ttctcgcacc gtgaaagacg caa                                               23


<210>  33
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  33
ggccaagagc gtgccctttc tgcgttagtt gc                                     32


<210>  34
<211>  36
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  34
ggtgcgagaa ccggagttaa ctggaattac gtttgc                                 36


<210>  35
<211>  25
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  35
ccggagttaa ctggaattac gtttg                                             25


<210>  36
<211>  50
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct


<220>
<221>  misc_feature
<222>  (32)..(32)
<223>  n = A, C, or G

<220>
<221>  misc_feature
<222>  (33)..(33)
<223>  n = A, C, or T

<220>
<221>  misc_feature
<222>  (34)..(34)
<223>  n = G

<400>  36
cgtaattcca gttaactccg gttctcgcac cnnnaaagac gcaatcaacg                  50


<210>  37
<211>  50
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct


<220>
<221>  misc_feature
<222>  (32)..(32)
<223>  n = T

<220>
<221>  misc_feature
<222>  (33)..(33)
<223>  n = G

<220>
<221>  misc_feature
<222>  (34)..(34)
<223>  n = G

<400>  37
cgtaattcca gttaactccg gttctcgcac cnnnaaagac gcaatcaacg                  50


