                         SEQUENCE LISTING

<110>  Westfaelische Wilhelms-Universitaet Muenster
       Yeda Research & Development Co. Ltd
 
<120>  Cysteine-free inteins

<130>  P78495

<160>  292   

<170>  PatentIn version 3.5

<210>  1
<211>  120
<212>  PRT
<213>  T4-like bacteriophage of Aeromonas salmonicida

<400>  1

Ser Val Val Gly Asp Thr Ile Ile Asp Val Ser Gly Lys Lys Met Thr 
1               5                   10                  15      


Ile Ala Glu Phe Tyr Asp Ser Thr Pro Asp Val Phe Met Arg Arg Asn 
            20                  25                  30          


Asp Glu Ala Arg Asp Trp Val Lys Arg Val Gly Gly Lys Thr Ser Leu 
        35                  40                  45              


Ser Val Asn Thr Tyr Ser Gly Glu Val Glu Arg Lys Asn Ile Asn Tyr 
    50                  55                  60                  


Ile Met Lys His Thr Val Lys Lys Arg Met Phe Lys Ile Lys Ala Gly 
65                  70                  75                  80  


Gly Lys Glu Val Ile Val Thr Ala Asp His Ser Val Met Val Lys Arg 
                85                  90                  95      


Asp Gly Lys Ile Ile Asp Val Lys Pro Thr Glu Met Lys Gln Thr Asp 
            100                 105                 110         


Arg Val Val Lys Trp Met Leu Thr 
        115                 120 


<210>  2
<211>  39
<212>  PRT
<213>  T4-like bacteriophage of Aeromonas salmonicida

<400>  2

Met Ile Glu Phe Ile Glu Phe Glu Ile Glu Asp Leu Gly Val Met Glu 
1               5                   10                  15      


Ile Asp Val Tyr Asp Ile Glu Val Asp Gly Asn His Asn Phe Phe Gly 
            20                  25                  30          


Asn Asp Ile Leu Val His Asn 
        35                  


<210>  3
<211>  26
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN (CLN) from T4-like bacteriophage of Aeromonas 
       salmonicida

<400>  3

Ser Val Val Gly Asp Thr Ile Ile Asp Val Ser Gly Lys Lys Met Thr 
1               5                   10                  15      


Ile Ala Glu Phe Tyr Asp Ser Thr Pro Asp 
            20                  25      


<210>  4
<211>  129
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC (CLC) from T4-like bacteriophage of Aeromonas 
       salmonicida

<400>  4

Glu Ala Arg Asp Trp Val Lys Arg Val Gly Gly Lys Thr Ser Leu Ser 
1               5                   10                  15      


Val Asn Thr Tyr Ser Gly Glu Val Glu Arg Lys Asn Ile Asn Tyr Ile 
            20                  25                  30          


Met Lys His Thr Val Lys Lys Arg Met Phe Lys Ile Lys Ala Gly Gly 
        35                  40                  45              


Lys Glu Val Ile Val Thr Ala Asp His Ser Val Met Val Lys Arg Asp 
    50                  55                  60                  


Gly Lys Ile Ile Asp Val Lys Pro Thr Glu Met Lys Gln Thr Asp Arg 
65                  70                  75                  80  


Val Val Lys Trp Met Leu Thr Gly Ser His Met Ile Glu Phe Ile Glu 
                85                  90                  95      


Phe Glu Ile Glu Asp Leu Gly Val Met Glu Ile Asp Val Tyr Asp Ile 
            100                 105                 110         


Glu Val Asp Gly Asn His Asn Phe Phe Gly Asn Asp Ile Leu Val His 
        115                 120                 125             


Asn 
    


<210>  5
<211>  120
<212>  PRT
<213>  Aeromonas phage Aes123

<400>  5

Ser Val Val Gly Asp Thr Ile Ile Asp Val Ser Gly Lys Lys Met Thr 
1               5                   10                  15      


Ile Ala Glu Phe Tyr Asp Ser Thr Pro Asp Val Phe Met Arg Arg Asn 
            20                  25                  30          


Asp Glu Ala Arg Asp Trp Val Lys Arg Val Gly Gly Lys Thr Ser Leu 
        35                  40                  45              


Ser Val Asn Thr Tyr Ser Gly Glu Val Glu Arg Lys Asn Ile Asn Tyr 
    50                  55                  60                  


Ile Met Lys His Thr Val Lys Lys Arg Met Phe Lys Ile Lys Ala Gly 
65                  70                  75                  80  


Gly Lys Glu Val Ile Val Thr Ala Asp His Ser Val Met Val Lys Arg 
                85                  90                  95      


Asp Gly Lys Ile Ile Asp Val Lys Pro Thr Glu Met Lys Gln Thr Asp 
            100                 105                 110         


Arg Val Val Lys Trp Met Leu Thr 
        115                 120 


<210>  6
<211>  39
<212>  PRT
<213>  Aeromonas phage Aes123

<400>  6

Met Ile Glu Phe Ile Glu Phe Glu Ile Glu Asp Leu Gly Val Met Glu 
1               5                   10                  15      


Ile Asp Val Tyr Asp Ile Glu Val Asp Gly Asn His Asn Phe Phe Gly 
            20                  25                  30          


Asn Asp Ile Leu Val His Asn 
        35                  


<210>  7
<211>  120
<212>  PRT
<213>  Aeromonas phage Aes144

<400>  7

Ser Val Val Gly Asp Thr Ile Ile Asp Val Ser Gly Lys Lys Met Thr 
1               5                   10                  15      


Ile Ala Glu Phe Tyr Asp Ser Thr Pro Asp Val Phe Met Arg Arg Asn 
            20                  25                  30          


Asp Glu Ala Arg Asp Trp Val Lys Arg Val Gly Gly Lys Thr Ser Leu 
        35                  40                  45              


Ser Val Asn Thr Tyr Ser Gly Glu Val Glu Arg Lys Asn Ile Asn Tyr 
    50                  55                  60                  


Ile Met Lys His Thr Val Lys Lys Arg Met Phe Lys Ile Lys Ala Gly 
65                  70                  75                  80  


Gly Lys Glu Val Ile Val Thr Ala Asp His Ser Val Met Val Lys Arg 
                85                  90                  95      


Asp Gly Lys Ile Ile Asp Val Lys Pro Thr Glu Met Lys Gln Thr Asp 
            100                 105                 110         


Arg Val Val Lys Trp Met Leu Thr 
        115                 120 


<210>  8
<211>  39
<212>  PRT
<213>  Aeromonas phage Aes144

<400>  8

Met Ile Glu Phe Ile Glu Phe Glu Ile Glu Asp Leu Gly Val Met Glu 
1               5                   10                  15      


Ile Asp Val Tyr Asp Ile Glu Val Asp Gly Asn His Asn Phe Phe Gly 
            20                  25                  30          


Asn Asp Ile Leu Val His Asn 
        35                  


<210>  9
<211>  120
<212>  PRT
<213>  Aeromonas phage Aes508

<400>  9

Ser Val Val Gly Asp Thr Ile Ile Asp Val Ser Gly Lys Lys Met Thr 
1               5                   10                  15      


Ile Ala Glu Phe Tyr Asp Ser Thr Pro Asp Val Phe Met Arg Arg Asn 
            20                  25                  30          


Asp Glu Ala Arg Asp Trp Val Lys Arg Val Gly Gly Lys Thr Ser Leu 
        35                  40                  45              


Ser Val Asn Thr Tyr Ser Gly Glu Val Glu Arg Lys Asn Ile Asn Tyr 
    50                  55                  60                  


Ile Met Lys His Thr Val Lys Lys Arg Met Phe Lys Ile Lys Ala Gly 
65                  70                  75                  80  


Gly Lys Glu Val Ile Val Thr Ala Asp His Ser Val Met Val Lys Arg 
                85                  90                  95      


Asp Gly Lys Ile Ile Asp Val Lys Pro Thr Glu Met Lys Gln Thr Asp 
            100                 105                 110         


Arg Val Val Lys Trp Met Leu Thr 
        115                 120 


<210>  10
<211>  39
<212>  PRT
<213>  Aeromonas phage Aes508

<400>  10

Met Ile Glu Phe Ile Glu Phe Glu Ile Glu Asp Leu Gly Val Met Glu 
1               5                   10                  15      


Ile Asp Val Tyr Asp Ile Glu Val Asp Gly Asn His Asn Phe Phe Gly 
            20                  25                  30          


Asn Asp Ile Leu Val His Asn 
        35                  


<210>  11
<211>  120
<212>  PRT
<213>  Aeromonas phage Aes509

<400>  11

Ser Val Val Gly Asp Thr Ile Ile Asp Val Ser Gly Lys Lys Met Thr 
1               5                   10                  15      


Ile Ala Glu Phe Tyr Asp Ser Thr Pro Asp Val Phe Met Arg Arg Asn 
            20                  25                  30          


Asp Glu Ala Arg Asp Trp Val Lys Arg Val Gly Gly Lys Thr Ser Leu 
        35                  40                  45              


Ser Val Asn Thr Tyr Ser Gly Glu Val Glu Arg Lys Asn Ile Asn Tyr 
    50                  55                  60                  


Ile Met Lys His Thr Val Lys Lys Arg Met Phe Lys Ile Lys Ala Gly 
65                  70                  75                  80  


Gly Lys Glu Val Ile Val Thr Ala Asp His Ser Val Met Val Lys Arg 
                85                  90                  95      


Asp Gly Lys Ile Ile Asp Val Lys Pro Thr Glu Met Lys Gln Thr Asp 
            100                 105                 110         


Arg Val Val Lys Trp Met Leu Thr 
        115                 120 


<210>  12
<211>  39
<212>  PRT
<213>  Aeromonas phage Aes509

<400>  12

Met Ile Glu Phe Ile Glu Phe Glu Ile Glu Asp Leu Gly Val Met Glu 
1               5                   10                  15      


Ile Asp Val Tyr Asp Ile Glu Val Asp Gly Asn His Asn Phe Phe Gly 
            20                  25                  30          


Asn Asp Ile Leu Val His Asn 
        35                  


<210>  13
<211>  120
<212>  PRT
<213>  Aeromonas phage Aes512

<400>  13

Ser Val Val Gly Asp Thr Ile Ile Asp Val Ser Gly Lys Lys Met Thr 
1               5                   10                  15      


Ile Ala Glu Phe Tyr Asp Ser Thr Pro Asp Val Phe Met Arg Arg Asn 
            20                  25                  30          


Asp Glu Ala Arg Asp Trp Val Lys Arg Val Gly Gly Lys Thr Ser Leu 
        35                  40                  45              


Ser Val Asn Thr Tyr Ser Gly Glu Val Glu Arg Lys Asn Ile Asn Tyr 
    50                  55                  60                  


Ile Met Lys His Thr Val Lys Lys Arg Met Phe Lys Ile Lys Ala Gly 
65                  70                  75                  80  


Gly Lys Glu Val Ile Val Thr Ala Asp His Ser Val Met Val Lys Arg 
                85                  90                  95      


Asp Gly Lys Ile Ile Asp Val Lys Pro Thr Glu Met Lys Gln Thr Asp 
            100                 105                 110         


Arg Val Val Lys Trp Met Leu Thr 
        115                 120 


<210>  14
<211>  39
<212>  PRT
<213>  Aeromonas phage Aes512

<400>  14

Met Ile Glu Phe Ile Glu Phe Glu Ile Glu Asp Leu Gly Val Met Glu 
1               5                   10                  15      


Ile Asp Val Tyr Asp Ile Glu Val Asp Gly Asn His Asn Phe Phe Gly 
            20                  25                  30          


Asn Asp Ile Leu Val His Asn 
        35                  


<210>  15
<211>  141
<212>  PRT
<213>  Aeromonas phage Aes516

<400>  15

Ser Val Val Gly Asp Thr Ile Ile Asp Val Ser Gly Lys Lys Met Thr 
1               5                   10                  15      


Ile Ala Glu Phe Tyr Asp Ser Thr Pro Asp Val Phe Met Arg Arg Asn 
            20                  25                  30          


Asp Glu Ala Arg Asp Trp Val Lys Arg Val Gly Gly Lys Thr Ser Leu 
        35                  40                  45              


Ser Val Asn Thr Tyr Ser Gly Glu Val Glu Arg Lys Asn Ile Asn Tyr 
    50                  55                  60                  


Ile Met Lys His Thr Val Lys Lys Arg Met Phe Lys Ile Lys Ala Gly 
65                  70                  75                  80  


Gly Lys Glu Val Ile Val Thr Ala Asp His Ser Val Met Val Lys Arg 
                85                  90                  95      


Asp Gly Lys Ile Ile Asp Val Lys Pro Thr Glu Met Lys Gln Thr Asp 
            100                 105                 110         


Arg Val Val Val Trp Asp Asn Glu Tyr Leu Ala Asn Lys Asp Gly Thr 
        115                 120                 125             


Ile Asn Arg Ile Val Glu Glu Ile Tyr Asp Arg Val Tyr 
    130                 135                 140     


<210>  16
<211>  39
<212>  PRT
<213>  Aeromonas phage Aes516

<400>  16

Met Ile Glu Phe Ile Glu Phe Glu Ile Glu Asp Leu Gly Val Met Glu 
1               5                   10                  15      


Ile Asp Val Tyr Asp Ile Glu Val Asp Gly Asn His Asn Phe Phe Gly 
            20                  25                  30          


Asn Asp Ile Leu Val His Asn 
        35                  


<210>  17
<211>  120
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from Aeromonas phage Aes517


<220>
<221>  misc_feature
<222>  (74)..(74)
<223>  Xaa can be any naturally occurring amino acid

<400>  17

Ser Val Val Gly Asp Thr Ile Ile Asp Val Ser Gly Lys Lys Met Thr 
1               5                   10                  15      


Ile Ala Glu Phe Tyr Asp Ser Thr Pro Asp Val Phe Met Arg Arg Asn 
            20                  25                  30          


Asp Glu Ala Arg Asp Trp Val Lys Arg Val Gly Gly Lys Thr Ser Leu 
        35                  40                  45              


Ser Val Asn Thr Tyr Ser Gly Glu Val Glu Arg Lys Asn Ile Asn Tyr 
    50                  55                  60                  


Ile Met Lys His Thr Val Lys Lys Arg Xaa Phe Lys Ile Lys Ala Gly 
65                  70                  75                  80  


Gly Lys Glu Val Ile Val Thr Ala Asp His Ser Val Met Val Lys Arg 
                85                  90                  95      


Asp Gly Lys Ile Ile Asp Val Lys Pro Thr Glu Met Lys Gln Thr Asp 
            100                 105                 110         


Arg Val Val Lys Trp Met Leu Thr 
        115                 120 


<210>  18
<211>  39
<212>  PRT
<213>  Aeromonas phage Aes517

<400>  18

Met Ile Glu Phe Ile Glu Phe Glu Ile Glu Asp Leu Gly Val Met Glu 
1               5                   10                  15      


Ile Asp Val Tyr Asp Ile Glu Val Asp Gly Asn His Asn Phe Phe Gly 
            20                  25                  30          


Asn Asp Ile Leu Val His Asn 
        35                  


<210>  19
<211>  126
<212>  PRT
<213>  Unknown

<220>
<223>  IntN from Russia Kulunda-steppe soda lake Tanatar-5 brine 
       environmental genomics

<400>  19

Ser Val Ala His Asp Ser Leu Ile Arg Ile Ser Arg Asp Asn Gly Thr 
1               5                   10                  15      


Val Gln Asn Thr Thr Ile Glu Glu Leu Phe Leu Gln Gly Asn Glu Tyr 
            20                  25                  30          


Trp Glu Ser Asn Gly Lys Glu Tyr Ser Leu Asn Ser Asp Ile Lys Ile 
        35                  40                  45              


Ala His Thr Gly Ser Ser Gly Val Leu Asn Phe Val Asn Tyr Asn Tyr 
    50                  55                  60                  


Val Tyr Arg His Lys Val Lys Asn Lys Ala Arg Tyr Arg Val Thr Thr 
65                  70                  75                  80  


Ser Asn Gly Lys Ser Val Val Val Thr Asp Asp His Ser Val Met Ile 
                85                  90                  95      


Met Gln Asp Gly Arg Leu Ile Glu Lys Lys Pro Ser Glu Ile Lys Gln 
            100                 105                 110         


Gly Asp Leu Val Ile Thr Ile Val Asp Ser Asp Thr Ser Thr 
        115                 120                 125     


<210>  20
<211>  43
<212>  PRT
<213>  Unknown

<220>
<223>  IntC from Russia Kulunda-steppe soda lake Tanatar-5 brine 
       environmental genomics

<400>  20

Met Asp Ala Lys Val Ser Glu Val Val Gly Val Glu Arg Leu Asp Asp 
1               5                   10                  15      


Phe Asp Asp Glu Tyr Val Tyr Asp Ile Gly Val Ala Asn Asp Asp Pro 
            20                  25                  30          


Tyr Phe Phe Ala Asn Asp Ile Leu Val His Asn 
        35                  40              


<210>  21
<211>  123
<212>  PRT
<213>  Rattus norvegicus

<400>  21

Ser Gln Ser Ala Leu Thr Ile Asn Tyr Leu Asp Gln Glu Lys Met Thr 
1               5                   10                  15      


Val Glu Asp Met Phe Asn Lys Leu Lys Tyr Glu Asn Asn Asp Val Val 
            20                  25                  30          


Leu Arg Val Ser Asn Gly Ser Glu Val Val Pro Val Lys Asn His Thr 
        35                  40                  45              


Thr Lys Thr Phe Ile Met Lys Glu Gly Val Ile Asp Arg Pro Leu Lys 
    50                  55                  60                  


Tyr Ile Met Arg His Lys Val Thr Lys Ser Lys Trp Arg Leu Arg Thr 
65                  70                  75                  80  


Glu Ser Gly Lys Glu Ile Ile Val Thr Gly Asp His Ser Leu Met Val 
                85                  90                  95      


Leu Arg Asp Asn Glu Leu Ile Ser Leu Lys Pro Lys Asp Val Asn Pro 
            100                 105                 110         


Lys Thr Asp Lys Ile Ile Thr Ile Lys Asp Val 
        115                 120             


<210>  22
<211>  42
<212>  PRT
<213>  Rattus norvegicus

<400>  22

Met Asn Tyr Asn Ile Glu Asn Ile Ala Val Ile Glu Gln Ile Glu Asp 
1               5                   10                  15      


Phe Gln Asp Glu Tyr Val Tyr Asp Leu Glu Val Glu Asp Thr His Thr 
            20                  25                  30          


Phe Phe Gly Asn Asp Ile Leu Ile His Asn 
        35                  40          


<210>  23
<211>  123
<212>  PRT
<213>  Rattus norvegicus

<400>  23

Ser Gln Ser Ala Leu Thr Ile Asn Tyr Leu Asp Gln Glu Lys Met Thr 
1               5                   10                  15      


Val Glu Asp Met Phe Asn Lys Leu Lys Tyr Glu Asn Asn Asp Val Val 
            20                  25                  30          


Leu Arg Val Ser Asn Gly Ser Glu Val Val Pro Val Lys Asn His Thr 
        35                  40                  45              


Thr Lys Thr Phe Ile Met Lys Glu Gly Val Ile Asp Arg Pro Leu Lys 
    50                  55                  60                  


Tyr Ile Met Arg His Lys Val Thr Lys Ser Lys Trp Arg Leu Arg Thr 
65                  70                  75                  80  


Glu Ser Gly Lys Glu Ile Ile Val Thr Gly Asp His Ser Leu Met Val 
                85                  90                  95      


Leu Arg Asp Asn Glu Leu Ile Ser Leu Lys Pro Lys Asp Val Asn Pro 
            100                 105                 110         


Lys Thr Asp Lys Ile Ile Thr Ile Lys Asp Val 
        115                 120             


<210>  24
<211>  42
<212>  PRT
<213>  Rattus norvegicus

<400>  24

Met Asn Tyr Asn Ile Glu Asn Ile Ala Val Ile Glu Gln Ile Glu Asp 
1               5                   10                  15      


Phe Gln Asp Glu Tyr Val Tyr Asp Leu Glu Val Glu Asp Thr His Thr 
            20                  25                  30          


Phe Phe Gly Asn Asp Ile Leu Ile His Asn 
        35                  40          


<210>  25
<211>  122
<212>  PRT
<213>  Prevotella megaphage Lak-B4

<400>  25

Ser Gln Thr Ala Asp Thr Gln Val Val Ile Asp Asn Lys Val Phe Ser 
1               5                   10                  15      


Met Glu Gly Phe Phe Thr Lys Ala Lys Tyr Glu Asn Asp Asp Val Val 
            20                  25                  30          


Ile Lys Leu Gln Asn Gly Ser Glu Val Ile Pro Val His Asn His Asp 
        35                  40                  45              


Thr Leu Ser Tyr Lys Asp His Asp Tyr Lys Thr Ile Thr Arg Pro Ile 
    50                  55                  60                  


Asn Tyr Ile Met Arg His Lys Val Thr Lys Asp Lys Phe Arg Leu Lys 
65                  70                  75                  80  


Thr Lys Ser Gly Lys Glu Leu Ile Val Thr Gly Asp His Ser Ile Met 
                85                  90                  95      


Val Ile Arg Asn Asn Glu Leu Ile Ser Val Pro Ala Arg Glu Ile Lys 
            100                 105                 110         


Lys Ser Asp Lys Ile Ile Thr Leu Asp Arg 
        115                 120         


<210>  26
<211>  44
<212>  PRT
<213>  Prevotella megaphage Lak-B4

<400>  26

Met Asn Tyr Asn Ile Gln Val Glu Asp Ile Asp Val Ile Glu Gln Leu 
1               5                   10                  15      


Glu Asp Phe Asn Asp Glu Tyr Val Tyr Asp Ile Glu Val Asp Asp Thr 
            20                  25                  30          


His Thr Phe Phe Ala Asn Glu Ile Leu Ala His Asn 
        35                  40                  


<210>  27
<211>  131
<212>  PRT
<213>  Prevotella megaphage Lak-C1

<400>  27

Ser Gln Leu Gly Ser Thr Gln Phe Arg Val Asp Asn Asn Ile Thr Thr 
1               5                   10                  15      


Met Glu Asp Phe Phe Ile Lys Ala Lys Tyr Glu Asn Asn Asp Val Val 
            20                  25                  30          


Ile Lys Leu Thr Asn Gly Ser Glu Ile Ile Pro Val His Asn His Met 
        35                  40                  45              


Thr Leu Ser Tyr Lys Asp His Asp Tyr Lys Thr Ser Glu Arg Pro Ile 
    50                  55                  60                  


Asn Tyr Ile Met Arg His Lys Val Thr Lys Asp Lys Phe Arg Leu Lys 
65                  70                  75                  80  


Thr Lys Ser Gly Lys Glu Leu Ile Val Thr Gly Asp His Ser Ile Met 
                85                  90                  95      


Val Ile Arg Asp Asn Glu Leu Ile Ser Leu Pro Ala Arg Glu Ile Lys 
            100                 105                 110         


Lys Ser Asp Lys Ile Ile Thr Ile Ala Gly Asp Asn Phe Thr Lys Ala 
        115                 120                 125             


Gly Asp Lys 
    130     


<210>  28
<211>  44
<212>  PRT
<213>  Prevotella megaphage Lak-C1

<400>  28

Met Asn Tyr Asp Ile Gln Ile Glu Asp Ile Asp Val Ile Glu Gln Leu 
1               5                   10                  15      


Glu Asp Phe Asn Asp Glu Tyr Val Tyr Asp Ile Glu Val Asp Asp Thr 
            20                  25                  30          


His Thr Phe Phe Ala Asn Asp Ile Leu Ala His Asn 
        35                  40                  


<210>  29
<211>  122
<212>  PRT
<213>  Prevotella megaphage Lak-B1

<400>  29

Ser Gln Thr Ala Asp Thr Gln Val Val Ile Asp Asn Lys Val Phe Ser 
1               5                   10                  15      


Met Glu Gly Phe Phe Thr Lys Ala Lys Tyr Glu Asn Asp Asp Val Val 
            20                  25                  30          


Ile Lys Leu Gln Asn Gly Ser Glu Val Ile Pro Val His Asn His Asp 
        35                  40                  45              


Thr Leu Ser Tyr Lys Asp His Asp Tyr Lys Thr Ile Thr Arg Pro Ile 
    50                  55                  60                  


Asn Tyr Ile Met Arg His Lys Val Thr Lys Asp Lys Phe Arg Leu Lys 
65                  70                  75                  80  


Thr Lys Ser Gly Lys Glu Leu Ile Val Thr Gly Asp His Ser Ile Met 
                85                  90                  95      


Val Ile Arg Asn Asn Glu Leu Ile Ser Val Pro Ala Arg Glu Ile Lys 
            100                 105                 110         


Lys Ser Asp Lys Ile Ile Thr Leu Asp Arg 
        115                 120         


<210>  30
<211>  44
<212>  PRT
<213>  Prevotella megaphage Lak-B1

<400>  30

Met Asn Tyr Asn Ile Gln Val Glu Asp Ile Asp Val Ile Glu Gln Leu 
1               5                   10                  15      


Glu Asp Phe Asn Asp Glu Tyr Val Tyr Asp Ile Glu Val Asp Asp Thr 
            20                  25                  30          


His Thr Phe Phe Ala Asn Glu Ile Leu Ala His Asn 
        35                  40                  


<210>  31
<211>  124
<212>  PRT
<213>  Prevotella megaphage Lak-A2

<400>  31

Ser Thr Ser Trp Lys Ser Ser Ile Tyr Val Asp Ser Val Lys Leu Lys 
1               5                   10                  15      


Val Gln Asp Ala Phe Asn Lys Phe Lys Tyr Glu Asn Asn Asp Thr Val 
            20                  25                  30          


Leu Lys Leu Asn Asn Gly Gln Glu Ile Val Pro Val His Asn His Asn 
        35                  40                  45              


Ile Leu Ser Tyr Val Asp His Asp Ala Glu Ala Thr Tyr Arg Pro Ile 
    50                  55                  60                  


Lys Tyr Ile Met Arg His Lys Val Tyr Asn Lys Ser Arg Phe Arg Ile 
65                  70                  75                  80  


Lys Ser Lys Ser Gly Lys Glu Leu Glu Val Thr Gly Asp His Ser Met 
                85                  90                  95      


Met Ile Ile Arg Asn Asn Glu Leu Ile Thr Val Lys Ala Lys Asp Ile 
            100                 105                 110         


Leu Lys Thr Asp Lys Ile Ile Thr Ile Ala Gly Asp 
        115                 120                 


<210>  32
<211>  44
<212>  PRT
<213>  Prevotella megaphage Lak-A2

<400>  32

Met Asn Tyr Asp Ile Gln Ile Glu Asp Ile Asp Val Ile Glu Gln Leu 
1               5                   10                  15      


Glu Asp Phe Asn Asp Glu Tyr Val Tyr Asp Ile Glu Val Asp Asp Thr 
            20                  25                  30          


His Thr Phe Phe Ala Asn Asp Ile Leu Ala His Asn 
        35                  40                  


<210>  33
<211>  122
<212>  PRT
<213>  Prevotella megaphage Lak-A1

<400>  33

Ser Gln Ile Gly Ser Thr Gln Phe Tyr Val Asp Asn Asn Ile Thr Thr 
1               5                   10                  15      


Met Glu Asp Phe Phe Thr Lys Ala Lys Tyr Glu Asn Asn Asp Val Val 
            20                  25                  30          


Ile Lys Leu Gln Asn Gly Ser Glu Val Val Pro Val His Asn His Asn 
        35                  40                  45              


Thr Leu Thr Tyr Ile Asp His Asp Phe Lys Thr Ser Glu Arg Pro Ile 
    50                  55                  60                  


Asn Tyr Ile Met Arg His Lys Val Thr Lys Asp Lys Phe Arg Leu Lys 
65                  70                  75                  80  


Thr Lys Ser Gly Lys Glu Leu Ile Val Thr Gly Asp His Ser Ile Met 
                85                  90                  95      


Val Ile Arg Asn Asn Glu Leu Ile Ser Leu Pro Ala Arg Glu Ile Lys 
            100                 105                 110         


Asn Thr Asp Lys Ile Ile Thr Leu Asp Lys 
        115                 120         


<210>  34
<211>  44
<212>  PRT
<213>  Prevotella megaphage Lak-A1

<400>  34

Met Asn Tyr Asp Ile Gln Ile Glu Asp Ile Asp Val Ile Glu Gln Leu 
1               5                   10                  15      


Glu Asp Phe Asn Asp Glu Tyr Val Tyr Asp Ile Glu Val Asp Asp Thr 
            20                  25                  30          


His Thr Phe Phe Ala Asn Asp Ile Leu Ala His Asn 
        35                  40                  


<210>  35
<211>  117
<212>  PRT
<213>  Pseudomonas phage vB_PaeM_PA5oct

<400>  35

Ser Val Asp Gly Ser Thr Ile Leu Asn Thr Ser Leu Gly Lys Ile Thr 
1               5                   10                  15      


Ile Glu Glu Leu Phe Asn Val Ser Asp Lys His Val Val His Ala Glu 
            20                  25                  30          


Lys Glu Phe Ala Ser Asn Glu Asp Val Met Val Met Ser Trp Asp Asn 
        35                  40                  45              


Ser Ala Lys Gln Pro Tyr Met Gly His Ile Asn Tyr Val Tyr Arg His 
    50                  55                  60                  


Glu Val Glu Lys Glu Leu Phe Glu Ile Glu Asp Asn Asn Gly Asn Lys 
65                  70                  75                  80  


Val Ile Val Thr Glu Asp His Ser Ile Met Val Ile Arg Asn Ala Glu 
                85                  90                  95      


Leu Leu Glu Val Lys Pro Ala Glu Leu Thr Asp Ser Asp Ile Ile Leu 
            100                 105                 110         


Ser Ile Val Tyr Glu 
        115         


<210>  36
<211>  85
<212>  PRT
<213>  Pseudomonas phage vB_PaeM_PA5oct

<400>  36

Met His Asn Leu Thr Lys His Leu Leu Gly Val Phe Ser Ala Lys Thr 
1               5                   10                  15      


Glu Asp Glu Glu Tyr Lys Ser Ala Lys Arg Ala Leu Glu Glu Leu Asn 
            20                  25                  30          


Lys Asn Ile Lys Glu Arg Asp Pro Asn Lys Phe Asn Val Ser Leu Gly 
        35                  40                  45              


Lys Val Ser Lys Val Thr Asn Leu Gly Lys Lys Lys Gln Tyr Val Tyr 
    50                  55                  60                  


Asp Ile Gly Met Lys Asn Pro Asp Asn Pro Tyr Phe Phe Gly Asn Asn 
65                  70                  75                  80  


Ile Leu Val His Asn 
                85  


<210>  37
<211>  131
<212>  PRT
<213>  Unknown

<220>
<223>  IntN from megaphage from human gut metagenome Denmark fecal 
       sample

<400>  37

Ser Gln Leu Gly Ser Thr Gln Phe Arg Val Asp Asn Asn Ile Thr Thr 
1               5                   10                  15      


Met Glu Asp Phe Phe Ile Lys Ala Lys Tyr Glu Asn Asn Asp Val Val 
            20                  25                  30          


Ile Lys Leu Thr Asn Gly Ser Glu Ile Ile Pro Val His Asn His Met 
        35                  40                  45              


Thr Leu Ser Tyr Lys Asp His Asp Tyr Lys Thr Ser Glu Met Pro Ile 
    50                  55                  60                  


Asn Tyr Ile Met Arg His Lys Val Thr Lys Asp Lys Phe Arg Leu Lys 
65                  70                  75                  80  


Thr Lys Ser Gly Lys Glu Leu Ile Val Thr Gly Asp His Ser Ile Met 
                85                  90                  95      


Val Ile Arg Asp Asn Glu Leu Ile Ser Leu Pro Ala Arg Glu Ile Lys 
            100                 105                 110         


Lys Ser Asp Lys Ile Ile Thr Ile Ala Gly Asp Asn Phe Thr Lys Ala 
        115                 120                 125             


Glu Asp Lys 
    130     


<210>  38
<211>  44
<212>  PRT
<213>  Unknown

<220>
<223>  IntC from megaphage from human gut metagenome Denmark fecal 
       sample

<400>  38

Met Asn Tyr Asp Ile Gln Ile Glu Asp Ile Asp Val Ile Glu Gln Leu 
1               5                   10                  15      


Glu Asp Phe Asn Asp Glu Tyr Val Tyr Asp Ile Glu Val Asp Asp Thr 
            20                  25                  30          


His Thr Phe Phe Ala Asn Asp Ile Leu Ala His Asn 
        35                  40                  


<210>  39
<211>  124
<212>  PRT
<213>  Unknown

<220>
<223>  IntN from megaphage from human gut metagenome Denmark fecal 
       sample

<400>  39

Ser Thr Ser Trp Lys Ser Ser Ile Tyr Val Asp Ser Val Lys Leu Lys 
1               5                   10                  15      


Val Gln Ala Ala Phe Asn Lys Phe Lys Tyr Glu Asn Asn Asp Thr Val 
            20                  25                  30          


Leu Lys Leu Asn Asn Gly Gln Glu Ile Val Pro Val His Asn His Asn 
        35                  40                  45              


Ile Leu Ser Tyr Val Asp His Asp Ala Glu Ala Thr Tyr Arg Pro Ile 
    50                  55                  60                  


Lys Tyr Ile Met Arg His Lys Val Tyr Asn Lys Ser Arg Phe Arg Ile 
65                  70                  75                  80  


Lys Thr Lys Ser Gly Lys Glu Leu Glu Val Thr Gly Asp His Ser Met 
                85                  90                  95      


Met Ile Ile Arg Asn Asn Glu Leu Ile Thr Val Lys Ala Lys Asp Ile 
            100                 105                 110         


Leu Lys Thr Asp Lys Ile Ile Thr Ile Thr Gly Asp 
        115                 120                 


<210>  40
<211>  44
<212>  PRT
<213>  Unknown

<220>
<223>  IntC from megaphage from human gut metagenome Denmark fecal 
       sample

<400>  40

Met Asn Tyr Asp Ile Gln Ile Glu Asp Ile Asp Val Ile Glu Gln Leu 
1               5                   10                  15      


Glu Asp Phe Asn Asp Glu Tyr Val Tyr Asp Ile Glu Val Asp Asp Thr 
            20                  25                  30          


His Thr Phe Phe Ala Asn Asp Ile Leu Ala His Asn 
        35                  40                  


<210>  41
<211>  131
<212>  PRT
<213>  Unknown

<220>
<223>  IntN from megaphage from human gut metagenome Tanzanian Hadza 
       hunter-gatherer fecal sample

<400>  41

Ser Gln Leu Gly Ser Thr Gln Phe Arg Val Asp Asn Asp Ile Thr Thr 
1               5                   10                  15      


Met Glu Asp Phe Phe Val Lys Ala Lys Tyr Glu Asn Asn Asp Val Val 
            20                  25                  30          


Ile Lys Leu Gln Asn Gly Ser Glu Val Val Pro Val His Asn His Asn 
        35                  40                  45              


Thr Leu Thr Tyr Lys Asp His Asp Tyr Lys Thr Ile Thr Arg Pro Ile 
    50                  55                  60                  


Asn Tyr Ile Met Arg His Lys Val Ser Lys Asp Lys Phe Arg Leu Lys 
65                  70                  75                  80  


Thr Lys Ser Gly Lys Glu Leu Ile Val Thr Gly Asp His Ser Ile Met 
                85                  90                  95      


Val Ile Arg Asn Asn Glu Leu Val Ser Val Ala Ala Arg Glu Ile Lys 
            100                 105                 110         


Lys Thr Asp Lys Ile Ile Thr Ile Ala Gly Asp Asn Phe Asn Lys Ala 
        115                 120                 125             


Gly Asp Lys 
    130     


<210>  42
<211>  44
<212>  PRT
<213>  Unknown

<220>
<223>  IntC from megaphage from human gut metagenome Tanzanian Hadza 
       hunter-gatherer fecal sample

<400>  42

Met Asn Tyr Asp Ile Gln Ile Glu Asp Ile Asp Val Ile Glu Gln Leu 
1               5                   10                  15      


Glu Asp Phe Asn Asp Glu Tyr Val Tyr Asp Ile Glu Val Asp Asp Thr 
            20                  25                  30          


His Thr Phe Phe Ala Asn Asp Ile Leu Ala His Asn 
        35                  40                  


<210>  43
<211>  122
<212>  PRT
<213>  Unknown

<220>
<223>  IntN from megaphage from human gut metagenome Denmark fecal 
       sample

<400>  43

Ser Gln Thr Ala Asp Thr Gln Leu Phe Ile Asp Asn Lys Glu Ile Ser 
1               5                   10                  15      


Met Glu Glu Phe Phe Arg Thr Ala Lys Tyr Glu Asn Asn Asp Val Val 
            20                  25                  30          


Ile Lys Leu Gln Asn Gly Ser Glu Val Val Pro Val His Asn His Asn 
        35                  40                  45              


Thr Leu Ser Tyr Lys Asp His Asp Tyr Lys Thr Ile Glu Arg Pro Ile 
    50                  55                  60                  


Asn Tyr Ile Met Arg His Lys Val Thr Lys Asp Lys Phe Arg Leu Lys 
65                  70                  75                  80  


Thr Lys Ser Gly Lys Glu Leu Ile Val Thr Gly Asp His Ser Ile Met 
                85                  90                  95      


Val Ile Arg Asn Asn Glu Leu Val Ser Leu Pro Ala Arg Glu Ile Lys 
            100                 105                 110         


Asn Thr Asp Lys Ile Ile Thr Leu Asp Lys 
        115                 120         


<210>  44
<211>  44
<212>  PRT
<213>  Unknown

<220>
<223>  IntC from megaphage from human gut metagenome Denmark fecal 
       sample

<400>  44

Met Asn Tyr Asp Ile Gln Ile Glu Asp Ile Asp Val Ile Glu Gln Leu 
1               5                   10                  15      


Glu Asp Phe Asn Asp Glu Tyr Val Tyr Asp Ile Glu Val Asp Asp Thr 
            20                  25                  30          


His Thr Phe Phe Ala Asn Asp Ile Leu Ala His Asn 
        35                  40                  


<210>  45
<211>  131
<212>  PRT
<213>  Unknown

<220>
<223>  IntN from megaphage from human gut metagenome Bangladeshi 
       cholera-succession

<400>  45

Ser Gln Leu Gly Ser Thr Gln Phe Arg Val Asp Asn Asn Ile Thr Thr 
1               5                   10                  15      


Met Glu Asp Phe Phe Ile Lys Ala Lys Tyr Glu Asn Asn Asp Val Val 
            20                  25                  30          


Ile Lys Leu Thr Asn Gly Ser Glu Ile Ile Pro Val His Asn His Met 
        35                  40                  45              


Thr Leu Ser Tyr Lys Asp His Asp Tyr Lys Thr Ser Glu Arg Pro Ile 
    50                  55                  60                  


Asn Tyr Ile Met Arg His Lys Val Thr Lys Asp Lys Phe Arg Leu Lys 
65                  70                  75                  80  


Thr Lys Ser Gly Lys Glu Leu Ile Val Thr Gly Asp His Ser Ile Met 
                85                  90                  95      


Val Ile Arg Asp Asn Glu Leu Ile Ser Leu Pro Ala Arg Glu Ile Lys 
            100                 105                 110         


Lys Ser Asp Lys Ile Ile Thr Ile Ala Gly Asp Asn Phe Thr Lys Ala 
        115                 120                 125             


Gly Asp Lys 
    130     


<210>  46
<211>  44
<212>  PRT
<213>  Unknown

<220>
<223>  IntC from megaphage from human gut metagenome Bangladeshi 
       cholera-succession

<400>  46

Met Asn Tyr Asp Ile Gln Ile Glu Asp Ile Asp Val Ile Glu Gln Leu 
1               5                   10                  15      


Glu Asp Phe Asn Asp Glu Tyr Val Tyr Asp Ile Glu Val Asp Asp Thr 
            20                  25                  30          


His Thr Phe Phe Ala Asn Asp Ile Leu Ala His Asn 
        35                  40                  


<210>  47
<211>  156
<212>  PRT
<213>  Unknown

<220>
<223>  Contiguous intein from sheep gut metagenome

<400>  47

Ser Ile Arg Gly Asp Ser Leu Leu Ser Ile Asn Asn Lys Thr Ile Ser 
1               5                   10                  15      


Ile Ser Asn Phe Phe Asn Tyr Ser Glu Gly Ser Ile Lys Thr Asn Gly 
            20                  25                  30          


Asp Glu Lys Tyr Ile Lys His Leu Ser Arg Asp Tyr Phe Thr Thr Thr 
        35                  40                  45              


Tyr Asp Asp Gly Asn Ile Ile Glu Thr Lys Val Asn Tyr Val Met Lys 
    50                  55                  60                  


His Lys Thr Lys Lys Arg Met Tyr Lys Leu Lys Val Val Asn Lys Glu 
65                  70                  75                  80  


Ile Val Val Thr Glu Asp His Ser Ile Met Ile Glu Arg Asp Asn Asn 
                85                  90                  95      


Leu Ile Glu Gly Ser Val Lys Asp Leu His Ser Asn Asp Lys Ile Ile 
            100                 105                 110         


Val Tyr Asn Lys Asn Ala Val Ile Lys Ser Glu Asn Trp Gln Val Glu 
        115                 120                 125             


Asp Leu Gly Ile Ile Glu Asp Tyr Val Tyr Asp Ile Glu Thr Glu Asn 
    130                 135                 140                 


His Met Phe Phe Gly Asn Asp Ile Leu Val His Asn 
145                 150                 155     


<210>  48
<211>  115
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from Campylobacter phage CP21


<220>
<221>  misc_feature
<222>  (43)..(43)
<223>  Xaa can be any naturally occurring amino acid

<400>  48

Ser Val Val Gly Asp Ser Ile Ile Lys Val Asn Gly Lys Asn Ile Lys 
1               5                   10                  15      


Ile Glu Asp Phe Tyr Asp Ser Ile Lys Val Asp Pro Ile Val Thr Lys 
            20                  25                  30          


Ser Gly Asn Asn Val Lys Leu Val Asp Asn Xaa Phe Thr Glu Ser Val 
        35                  40                  45              


Asn Lys Asn Leu Gln Ile Glu Thr Lys Lys Ile Asn Tyr Ile Met Lys 
    50                  55                  60                  


His Lys Val Lys Lys Glu Phe Phe Lys Ile Lys Val Asn Asn Lys Glu 
65                  70                  75                  80  


Val Val Val Thr Glu Asp His Ser Ile Met Val Leu Arg Asn Ser Glu 
                85                  90                  95      


Leu Ile Glu Val Lys Pro Arg Asp Ile Lys Asn Gly Asp Leu Ile Ile 
            100                 105                 110         


Leu Asn Asp 
        115 


<210>  49
<211>  39
<212>  PRT
<213>  Campylobacter phage CP21

<400>  49

Met Ile Val Thr Glu Asn Phe Gln Val Glu Ser Leu Gly Ile Gln Glu 
1               5                   10                  15      


Leu Asp Val Tyr Asp Ile Glu Val Asp Ser Asn His Asn Phe Phe Ala 
            20                  25                  30          


Asn Asp Ile Leu Val His Asn 
        35                  


<210>  50
<211>  345
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Natronorubrum bangense JCM 10635


<220>
<221>  misc_feature
<222>  (120)..(120)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (211)..(211)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (300)..(300)
<223>  Xaa can be any naturally occurring amino acid

<400>  50

Ser Val Pro Met Asn Glu Pro Ile Leu Ile Arg Asp Glu Asn Gly Ser 
1               5                   10                  15      


Ile Asp Ile Val Glu Ile Gln Glu Leu Asp Gly Arg Asp Gly Asp Val 
            20                  25                  30          


Glu Val Trp Thr Glu Lys Gly Phe Thr Arg Val Lys Arg Val Ile Arg 
        35                  40                  45              


Lys Pro Asn Arg Lys Lys Leu Tyr Thr Ile Arg Thr Lys Lys Gly Val 
    50                  55                  60                  


Val His Ala Thr Glu Asp His Ser Leu Val Arg Ala Asp Gly Ser Glu 
65                  70                  75                  80  


Val Glu Pro Gly Glu Leu Gln Glu Gly Glu Ser Leu Leu His Arg Asn 
                85                  90                  95      


Val Ser Asp Ala Ser Thr Asp Val Gln Thr Asp Leu Ser Leu Asp Arg 
            100                 105                 110         


Ala Trp Leu Tyr Gly Phe Phe Xaa Gly Asp Gly Ser Ser Gly Asp Tyr 
        115                 120                 125             


Ala Tyr Asp His Pro Lys Asn Thr Asp Trp Asp Thr Arg Lys Thr Ser 
    130                 135                 140                 


Trp Ser Leu Asn Asn Asn Asn Arg Glu Leu Leu Gln Arg Ala Ala Val 
145                 150                 155                 160 


Ala Leu Ser Lys Glu Phe Gly Val Asn Ser Arg Ile Asn Glu Thr Leu 
                165                 170                 175     


Glu Ser Ser Gly Thr Tyr Lys Leu Gln Pro Ser Asn Asn Gly Lys Arg 
            180                 185                 190         


Gly Ala Gly Ser Asn Gly Met Leu Pro Ser Leu Val Lys His Phe Asn 
        195                 200                 205             


Glu Thr Xaa Tyr Thr Pro Ser Arg Gln Lys Arg Val Pro Gln Asp Val 
    210                 215                 220                 


Leu Asn Gly Asp Thr Gln Ala Ile Gln Ala Phe Leu Asp Gly Tyr Met 
225                 230                 235                 240 


Ala Ala Asp Gly His Val Gly Ser Arg Tyr Ser Lys Arg Phe His Glu 
                245                 250                 255     


Ala Asp Thr Arg His Gln Pro Leu Ala Ser Gly Leu Val Phe Leu Leu 
            260                 265                 270         


Gln Arg Ile Gly Tyr Thr Phe Asn Ile Asn Val Arg Gln Val Glu Arg 
        275                 280                 285             


Asp Gly Gly Val Thr Glu Tyr Tyr Lys Leu Arg Xaa Gln Thr Ser His 
    290                 295                 300                 


Arg Gly Asp Pro Asn Glu Val Lys Lys Ile Val Asp Tyr Glu Tyr Asp 
305                 310                 315                 320 


Gly Glu Tyr Val Tyr Asp Leu Glu Thr Glu Asn His His Phe His Ala 
                325                 330                 335     


Gly Ala Gly Asn Ile Ile Val His Asn 
            340                 345 


<210>  51
<211>  138
<212>  PRT
<213>  Unknown

<220>
<223>  IntN from Russia Kulunda-steppe soda lake Tanatar-5 brine 
       environmental genomics

<400>  51

Ser Val Val Gly Asn Ser Val Ile Ser Val Asn Gly Lys Lys Ile Asn 
1               5                   10                  15      


Ile Glu Asp Tyr Tyr Asp Arg Ile Asp Asn Asn Phe Ile Lys Asn Asp 
            20                  25                  30          


Gln Phe Asn Asp Asp Tyr Val Lys Val Val Asp Asn Gly Asp Thr Thr 
        35                  40                  45              


Gln Ser Ile Asn Lys Asp Gly Lys Leu Glu Asn Lys Pro Ile Asn Tyr 
    50                  55                  60                  


Ile Met Lys His Arg Val Arg Lys Glu Met Phe Arg Ile Asp Asp Ser 
65                  70                  75                  80  


Ser Gly Asn Ser Val Ile Val Thr Glu Asp His Ser Val Ile Val Arg 
                85                  90                  95      


Asp Lys Lys Thr Lys Glu Ile Leu Asp Val Lys Pro Lys Glu Leu Asn 
            100                 105                 110         


Pro Lys Lys His Glu Ile Ile Asn Ile Ile Ala Asn Asp Thr Asp Ser 
        115                 120                 125             


Gly Gly Ile Tyr Gly Val Asp Asn Arg Lys 
    130                 135             


<210>  52
<211>  42
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC from Russia Kulunda-steppe soda lake Tanatar-5 
       brine environmental genomics


<220>
<221>  misc_feature
<222>  (39)..(39)
<223>  Xaa can be any naturally occurring amino acid

<400>  52

Met Ser Lys Ile Lys Phe Asp Asp Glu Phe Ser Val Lys Ser Leu Gly 
1               5                   10                  15      


Ile Val Asp Asn Tyr Val Tyr Asp Ile Glu Val Glu Asp Asn His Asn 
            20                  25                  30          


Phe Phe Ala Asn Asn Ile Xaa Val His Asn 
        35                  40          


<210>  53
<211>  120
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from oil-polluted sediment collected from 3 
       locations (0.5 km, 0.7 km and 0.9 km) around wellhead MC252 after
       the Deepwater Horizon oil spill


<220>
<221>  misc_feature
<222>  (57)..(57)
<223>  Xaa can be any naturally occurring amino acid

<400>  53

Ser Val Ala Phe Asn Ser Ile Ile Glu Ile Asp Gly Ile Lys Asp Thr 
1               5                   10                  15      


Ile Glu Ser Trp Phe Asn Lys Leu Ala Glu Glu His Gly Arg His Val 
            20                  25                  30          


Asp Gly Glu Lys Glu Phe Thr Lys Ile Ser Ser Leu Asp Leu His Thr 
        35                  40                  45              


Pro Thr Tyr Asp Val Ala Tyr Asp Xaa Met Val Asp Lys Pro Leu Met 
    50                  55                  60                  


Thr Ile Tyr Arg His Lys Ile Glu Lys Lys Met Tyr Thr Val Thr Ser 
65                  70                  75                  80  


Val Asp Gly His Ser Val Thr Thr Thr Ala Asp His Ser Leu Met Val 
                85                  90                  95      


Met Arg Gly Gly Asn Ile Ile Glu Ile Ile Pro Thr Asp Ile Leu Ser 
            100                 105                 110         


Gly Asp Gln Leu Val Ile Phe Glu 
        115                 120 


<210>  54
<211>  46
<212>  PRT
<213>  Unknown

<220>
<223>  IntC from oil-polluted sediment collected from 3 locations (0.5 
       km, 0.7 km and 0.9 km) around wellhead MC252 after the Deepwater 
       Horizon oil spill

<400>  54

Met His Glu Arg Lys Tyr Lys Leu Val Asp Ile Ala Lys Val Glu Glu 
1               5                   10                  15      


Val Lys Tyr Thr Asp Glu Tyr Val Tyr Asp Val Val Met Phe Glu Pro 
            20                  25                  30          


Glu Ser Pro Tyr Phe Val Ala Asn Asp Ile Leu Val His Asn 
        35                  40                  45      


<210>  55
<211>  116
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from oil-polluted sediment collected from 3 
       locations (0.5 km, 0.7 km and 0.9 km) around wellhead MC252 after
       the Deepwater Horizon oil spill


<220>
<221>  misc_feature
<222>  (46)..(46)
<223>  Xaa can be any naturally occurring amino acid

<400>  55

Ser Val Val Gly Asp Thr Ser Ile Tyr Ile Asp Gly Asn Pro Ile Lys 
1               5                   10                  15      


Ile Glu Asp Leu Tyr Asp Lys Thr Gln Gly Glu Leu Ile Glu Arg Gly 
            20                  25                  30          


Asp Tyr Asp Phe Ile Lys Lys Pro Asn Gln Asn Ile Leu Xaa Arg Ala 
        35                  40                  45              


Phe Asp Thr Lys Lys Gln Arg Val Val Glu Gln Lys Val Asn Tyr Ile 
    50                  55                  60                  


Met Lys His Lys Val Lys Lys Lys Met Phe Lys Ile Lys Tyr Lys Gly 
65                  70                  75                  80  


Lys Glu Val Lys Val Thr Gln Asp His Ser Val Ile Ile Gln Arg Glu 
                85                  90                  95      


Asp Leu Phe Ile Ser Val Thr Pro Glu Gln Ile Lys Lys Gly Asp Lys 
            100                 105                 110         


Ile Ile Ile Ile 
        115     


<210>  56
<211>  42
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC from oil-polluted sediment collected from 3 
       locations (0.5 km, 0.7 km and 0.9 km) around wellhead MC252 after
       the Deepwater Horizon oil spill


<220>
<221>  misc_feature
<222>  (14)..(15)
<223>  Xaa can be any naturally occurring amino acid

<400>  56

Met Thr Ser Phe Glu Leu Ile Glu Asp Phe Glu Val Glu Xaa Xaa Gly 
1               5                   10                  15      


Glu Glu Glu Ile Trp Val Tyr Asp Ile Glu Val Glu Glu His His Asn 
            20                  25                  30          


Phe Phe Ala Asn Asp Ile Leu Val His Asn 
        35                  40          


<210>  57
<211>  156
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Anaerobic digester Denmark WWTP 
       Viborg metagenome


<220>
<221>  misc_feature
<222>  (111)..(111)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (124)..(124)
<223>  Xaa can be any naturally occurring amino acid

<400>  57

Ser Val His Ser Asp Ser Ile Val His Thr Asp Lys Gly Ser Ile Lys 
1               5                   10                  15      


Ile Glu Asp Trp Tyr Asn Lys Asn Lys Thr Asn Gly Gly Thr Thr Leu 
            20                  25                  30          


Glu Gly His Glu Ser Val Leu Thr Asn Asp Lys Ile Leu Asn Trp Asp 
        35                  40                  45              


Asn Glu Leu Tyr Phe Ala Pro Val Lys Arg Ile Ile Arg His Lys Val 
    50                  55                  60                  


Thr Lys Pro Lys Trp Lys Ile Lys Thr Lys Ser Gly Lys Glu Ile Ile 
65                  70                  75                  80  


Ile Thr Asn Asp His Ser Leu Ile Val Phe Arg Asn Asn Glu Lys Leu 
                85                  90                  95      


Glu Ile Lys Pro Gln Gly Val Ile Tyr Gln Asp Lys Val Leu Xaa Leu 
            100                 105                 110         


Gly Thr Lys Phe Tyr Phe Asp Glu Ile Glu Ser Xaa Glu Glu Met Gly 
        115                 120                 125             


Thr Phe Asp Asn Glu Tyr Val Tyr Asp Val Glu Val Asp Asp Asp Thr 
    130                 135                 140                 


His Thr Phe Ile Ala Asn Asp Ile Leu Val His Asn 
145                 150                 155     


<210>  58
<211>  121
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from Tara Oceans expedition station TARA_022 Ionian
       sea (N=39.8386, E=17.4155) on 2009-11-16T08:16, depth of 3-7 m, 
       size-fractionated (<-0.22 micrometres)


<220>
<221>  misc_feature
<222>  (29)..(29)
<223>  Xaa can be any naturally occurring amino acid

<400>  58

Ser Val Asp Gly Lys Ser Thr Val Arg Ala Asn Gly Lys Asn Val Ser 
1               5                   10                  15      


Ile Glu Asn Leu Tyr Ser Glu Leu Glu Ser Asp Gly Xaa Asn Thr Ile 
            20                  25                  30          


Ile Thr Asp Phe Thr Gly Arg Gln Phe Val Phe Pro Lys Asp Val Lys 
        35                  40                  45              


Leu Pro Tyr Tyr Asn Glu Ser Asn Lys Lys Ile Glu Asn Gly Asn Val 
    50                  55                  60                  


Glu Tyr Ile Glu Lys His Arg Val Lys Lys Lys Met Phe Lys Ile Arg 
65                  70                  75                  80  


Ser Ser Asn Gly Lys Ser Val Ile Ile Thr Glu Asp His Ser Ile Met 
                85                  90                  95      


Val Met Arg Asn Lys Lys Leu Ile Lys Ile Thr Pro Asp Lys Leu Lys 
            100                 105                 110         


Lys Ser Asp Lys Leu Val Thr Leu Leu 
        115                 120     


<210>  59
<211>  42
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC from Tara Oceans expedition station TARA_022 Ionian
       sea (N=39.8386, E=17.4155) on 2009-11-16T08:16, depth of 3-7 m, 
       size-fractionated (<-0.22 micrometres)


<220>
<221>  misc_feature
<222>  (13)..(13)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (29)..(29)
<223>  Xaa can be any naturally occurring amino acid

<400>  59

Met Lys Tyr Lys Ile Glu Asp Ile Glu Glu Ile Glu Xaa Leu Gly Glu 
1               5                   10                  15      


Val Asp Gln Asp Val Tyr Asp Ile Gly Met Lys Asp Xaa Pro His Thr 
            20                  25                  30          


Phe Phe Ala Asn Asp Ile Leu Val His Asn 
        35                  40          


<210>  60
<211>  168
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from C-1 sludge sample from hydrolytic
       chamber (first chamber) of Anaerobic Baffled Reactor in India


<220>
<221>  misc_feature
<222>  (2)..(2)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (129)..(129)
<223>  Xaa can be any naturally occurring amino acid

<400>  60

Ser Xaa Gln Asn Asp Thr Leu Ile Tyr Val Asp Asn Val Lys Met Ser 
1               5                   10                  15      


Ile Glu Asp Val Tyr Ile Met Met Lys Glu Glu Asn Phe Asp Ser Gly 
            20                  25                  30          


Ile Val Thr Ser Asn Gly Ser Tyr Ile Ile Pro Asn Arg Phe Gly His 
        35                  40                  45              


Thr Ile Lys Thr Tyr Asp Glu Lys Thr Lys Lys Phe Ile Tyr Lys Pro 
    50                  55                  60                  


Ile Lys Tyr Ile Met Arg His Lys Val Ser Lys Ala Lys Tyr Lys Ile 
65                  70                  75                  80  


Thr Thr Ser Ser Gly Lys Ser Val Ile Val Thr Gly Asp His Ser Ile 
                85                  90                  95      


Met Ile Leu Arg Asn Asn Lys Leu Gln Ser Ile Lys Ala Lys Asp Ile 
            100                 105                 110         


Asn Ile Lys Thr Asp Lys Thr Ile Ser Val Ile Asp Asn Gln Leu Asp 
        115                 120                 125             


Xaa Ile Ile Glu Asp Ile Ala Ser Val Glu Gln Leu Glu Asp Phe Asn 
    130                 135                 140                 


Asp Glu Tyr Val Tyr Asp Val Glu Val Glu Asp Thr His Thr Phe Val 
145                 150                 155                 160 


Gly Asn Asp Ile Leu Leu His Asn 
                165             


<210>  61
<211>  116
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from Anaerobic digester Denmark WWTP Viborg 
       metagenome


<220>
<221>  misc_feature
<222>  (47)..(47)
<223>  Xaa can be any naturally occurring amino acid

<400>  61

Ser Val Asp Gly Ser Thr Leu Ile Arg Thr Asn Ile Gly Ile Leu Pro 
1               5                   10                  15      


Ile Lys Ile Leu Phe Asp Lys Phe Ser Lys Lys Ser Lys Ile Lys Thr 
            20                  25                  30          


Glu Ser Tyr Gly His Glu Met Ile Asp Val Leu Asn Leu Glu Xaa Leu 
        35                  40                  45              


Thr Leu Lys Asp Asn Lys Ile Lys Met Gly Lys Ile Lys Arg Leu Ile 
    50                  55                  60                  


Arg His Lys Thr Asn Lys Lys Met Tyr Lys Ile Arg Val Asn Gly Lys 
65                  70                  75                  80  


Glu Ile Ile Thr Thr Glu Asp His Gly Ile Met Val Gln Arg Asp Gly 
                85                  90                  95      


Asn Leu Ile Arg Ile Ser Pro Lys Glu Ile Lys Lys Gly Asp Leu Met 
            100                 105                 110         


Val Asn Val Ile 
        115     


<210>  62
<211>  43
<212>  PRT
<213>  Unknown

<220>
<223>  IntC from Anaerobic digester Denmark WWTP Viborg metagenome

<400>  62

Met Lys Tyr Glu Leu Ser Pro Ile Glu Ser Ile Glu Val Val Asn Asp 
1               5                   10                  15      


Arg Phe Asp Tyr Val Tyr Asp Ile Glu Met Asp Asp Thr Thr Asp His 
            20                  25                  30          


Val Phe Phe Gly Asn Asp Ile Leu Val His Asn 
        35                  40              


<210>  63
<211>  128
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from Switchgrass-associated bovine rumen microbial 
       communities from Urbana, Illinois, USA


<220>
<221>  misc_feature
<222>  (24)..(24)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (37)..(37)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (54)..(54)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (101)..(101)
<223>  Xaa can be any naturally occurring amino acid

<400>  63

Ser Val Ser Ala Asp Ser Asn Ile Lys Ile Ser Tyr Ser Asp Thr Gly 
1               5                   10                  15      


Ser Lys Lys Asp Ile Lys Val Xaa Asp Leu Phe Thr Glu Leu Lys Tyr 
            20                  25                  30          


Asn Asn Asn Asp Xaa Val Ile Ile Thr Gly Asn Gly Ser Glu Val Val 
        35                  40                  45              


Pro Val Lys Asp Ile Xaa Ala Gln Thr Tyr Ser Thr Glu Lys Asp Lys 
    50                  55                  60                  


Val Ile Phe Arg Pro Val Asn Tyr Ile Met Arg His Lys Val Lys Lys 
65                  70                  75                  80  


Ser Arg Phe Arg Ile Thr Thr Glu Ser Gly Lys Gln Ile Ile Val Thr 
                85                  90                  95      


Gly Asp His Ser Xaa Met Val Val Arg Asp Asn Glu Leu Ile Ser Val 
            100                 105                 110         


Lys Ala Lys Asp Ile Lys Met Ser Asp Lys Ile Ile Thr Val Asp Asn 
        115                 120                 125             


<210>  64
<211>  49
<212>  PRT
<213>  Unknown

<220>
<223>  IntC from Switchgrass-associated bovine rumen microbial 
       communities from Urbana, Illinois, USA

<400>  64

Met Val Met Asp Phe Lys Glu Thr Asn Tyr Lys Ile Glu Ser Ile Ala 
1               5                   10                  15      


Ala Ile Glu Gln Ile Glu Asp Phe Asp Glu Glu Tyr Val Tyr Asp Leu 
            20                  25                  30          


Glu Ile Asp Asp Thr His Met Phe Phe Ala Asn Asp Ile Leu Val His 
        35                  40                  45              


Asn 
    


<210>  65
<211>  129
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from Rattus norvegicus (Denmark: Riget) gut 
       metagenome


<220>
<221>  misc_feature
<222>  (99)..(99)
<223>  Xaa can be any naturally occurring amino acid

<400>  65

Ser Val Ala Gly Asp Thr Lys Val Asp Ile Ser Ser Ala Asp Ile Lys 
1               5                   10                  15      


Lys Arg Ile Asp Ile Ala Asp Leu Phe Thr Lys Ala Lys Tyr Leu Asn 
            20                  25                  30          


Asp Asp His Val Leu Ser Val Ser Asn Gly Ser Glu Val Ile Pro Gly 
        35                  40                  45              


Asn Gly Ile Leu Ile Arg Ala Tyr Asp Lys Glu Leu Asp Met Ala Val 
    50                  55                  60                  


Tyr Lys Pro Met Lys Tyr Val Met Arg His Lys Val Ser Lys Ala Arg 
65                  70                  75                  80  


Phe Arg Ile Lys Thr Glu Ser Gly Lys Glu Val Ile Val Thr Gly Asp 
                85                  90                  95      


His Ser Xaa Ile Val Leu Arg Asn Gly Glu Leu Ile Asp Ile Lys Ala 
            100                 105                 110         


Lys Asp Ile Asn Lys Glu Thr Asp Lys Ile Ile Thr Ile Asn Ser Lys 
        115                 120                 125             


Lys 
    


<210>  66
<211>  47
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC from Rattus norvegicus (Denmark: Riget) gut 
       metagenome


<220>
<221>  misc_feature
<222>  (35)..(35)
<223>  Xaa can be any naturally occurring amino acid

<400>  66

Met Asp Phe Lys Glu Lys Asn Tyr Lys Ile Glu Ser Ile Ala Glu Ile 
1               5                   10                  15      


Glu Gln Leu Asp Asp Phe Glu Asp Glu Tyr Val Tyr Asp Val Glu Val 
            20                  25                  30          


Asp Asp Xaa His Asn Phe Phe Ala Asn Asp Val Leu Val His Asn 
        35                  40                  45          


<210>  67
<211>  115
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from Campylobacter coli strain NC_C3306


<220>
<221>  misc_feature
<222>  (43)..(43)
<223>  Xaa can be any naturally occurring amino acid

<400>  67

Ser Val Val Gly Asp Ser Ile Ile Lys Val Asn Gly Glu Asn Ile Lys 
1               5                   10                  15      


Ile Glu Asp Phe Tyr Asp Ser Ile Lys Val Asp Pro Ile Val Thr Lys 
            20                  25                  30          


Ser Gly Asn Asn Val Lys Leu Val Asp Asn Xaa Phe Thr Glu Ser Val 
        35                  40                  45              


Asn Lys Asn Leu Gln Ile Glu Thr Lys Lys Ile Asn Tyr Ile Met Lys 
    50                  55                  60                  


His Arg Val Lys Lys Glu Phe Phe Lys Ile Lys Val Asn Asn Lys Glu 
65                  70                  75                  80  


Val Ile Val Thr Glu Asp His Ser Ile Met Ile Leu Arg Asn Ser Glu 
                85                  90                  95      


Leu Ile Glu Val Lys Pro Arg Asp Ile Lys Thr Gly Asp Ser Ile Ile 
            100                 105                 110         


Leu Asn Val 
        115 


<210>  68
<211>  39
<212>  PRT
<213>  Campylobacter coli strain NC_C3306

<400>  68

Met Ile Val Thr Glu Asn Phe Gln Val Glu Ser Leu Gly Ile Gln Glu 
1               5                   10                  15      


Leu Asp Val Tyr Asp Ile Glu Val Asp Ser Asn His Asn Phe Phe Ala 
            20                  25                  30          


Asn Asp Ile Leu Val His Asn 
        35                  


<210>  69
<211>  153
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Campylobacter jejuni SO-54


<220>
<221>  misc_feature
<222>  (43)..(43)
<223>  Xaa can be any naturally occurring amino acid

<400>  69

Ser Val Val Gly Asp Ser Ile Ile Lys Val Asn Gly Lys Asn Ile Lys 
1               5                   10                  15      


Ile Glu Asp Phe Tyr Asp Ser Ile Lys Val Asp Pro Ile Val Thr Lys 
            20                  25                  30          


Ser Gly Asn Asn Val Lys Leu Val Asp Asn Xaa Phe Thr Glu Ser Val 
        35                  40                  45              


Asn Lys Asn Leu Gln Ile Glu Thr Lys Lys Ile Asn Tyr Ile Met Lys 
    50                  55                  60                  


His Lys Val Lys Lys Glu Phe Phe Lys Ile Lys Val Asn Asn Lys Glu 
65                  70                  75                  80  


Val Val Val Thr Glu Asp His Ser Ile Met Val Leu Arg Asn Ser Glu 
                85                  90                  95      


Leu Ile Glu Val Lys Pro Arg Asp Ile Lys Ile Asp Asp Ile Leu Ile 
            100                 105                 110         


Leu Ile Asp Ser Lys Ser Ser Asp Phe Gln Val Glu Ser Leu Gly Ile 
        115                 120                 125             


Gln Glu Leu Asp Val Tyr Asp Ile Glu Val Asp Ser Asn His Asn Phe 
    130                 135                 140                 


Phe Ala Asn Asp Ile Leu Val His Asn 
145                 150             


<210>  70
<211>  115
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from Campylobacter jejuni CFSAN038801


<220>
<221>  misc_feature
<222>  (43)..(43)
<223>  Xaa can be any naturally occurring amino acid

<400>  70

Ser Val Val Gly Asp Ser Ile Ile Lys Val Asn Gly Glu Asn Ile Lys 
1               5                   10                  15      


Ile Glu Asp Phe Tyr Asp Ser Ile Lys Val Asp Pro Ile Val Thr Lys 
            20                  25                  30          


Ser Gly Asn Asn Val Lys Leu Val Asp Asn Xaa Phe Thr Glu Ser Val 
        35                  40                  45              


Asn Lys Asn Leu Gln Ile Glu Thr Lys Lys Ile Asn Tyr Ile Met Lys 
    50                  55                  60                  


His Arg Val Lys Lys Glu Phe Phe Lys Ile Lys Val Asn Asn Lys Glu 
65                  70                  75                  80  


Val Ile Val Thr Glu Asp His Ser Ile Met Val Leu Arg Asn Ser Glu 
                85                  90                  95      


Leu Ile Glu Val Lys Pro Arg Asp Ile Lys Thr Gly Asp Ser Ile Ile 
            100                 105                 110         


Leu Asn Asp 
        115 


<210>  71
<211>  39
<212>  PRT
<213>  Campylobacter jejuni CFSAN038801

<400>  71

Met Ile Val Thr Glu Asn Phe Gln Val Glu Ser Leu Gly Ile Gln Glu 
1               5                   10                  15      


Leu Asp Val Tyr Asp Ile Glu Val Asp Ser Asn His Asn Phe Phe Ala 
            20                  25                  30          


Asn Asp Ile Leu Val His Asn 
        35                  


<210>  72
<211>  153
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Campylobacter coli CFSAN038801


<220>
<221>  misc_feature
<222>  (43)..(43)
<223>  Xaa can be any naturally occurring amino acid

<400>  72

Ser Val Val Gly Asp Ser Ile Ile Lys Val Asn Gly Glu Asn Ile Lys 
1               5                   10                  15      


Ile Glu Asp Phe Tyr Asp Ser Ile Lys Val Asp Pro Ile Val Thr Lys 
            20                  25                  30          


Ser Gly Asn Asn Val Lys Leu Val Asp Asn Xaa Phe Thr Glu Ser Val 
        35                  40                  45              


Asn Lys Asn Leu Gln Ile Glu Thr Lys Lys Ile Asn Tyr Ile Met Lys 
    50                  55                  60                  


His Arg Val Lys Lys Glu Phe Phe Lys Ile Lys Val Asn Asn Lys Glu 
65                  70                  75                  80  


Val Ile Val Thr Glu Asp His Ser Ile Met Val Leu Arg Asn Ser Glu 
                85                  90                  95      


Leu Ile Glu Val Lys Pro Arg Asp Ile Lys Ile Asp Asp Ile Leu Ile 
            100                 105                 110         


Leu Ile Asp Ser Lys Ser Ser Asp Phe Gln Val Glu Ser Leu Gly Ile 
        115                 120                 125             


Gln Glu Leu Asp Val Tyr Asp Ile Glu Val Asp Ser Asn His Asn Phe 
    130                 135                 140                 


Phe Ala Asn Asp Ile Leu Val His Asn 
145                 150             


<210>  73
<211>  162
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Rhizobiales bacterium NORP22


<220>
<221>  misc_feature
<222>  (49)..(49)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (76)..(76)
<223>  Xaa can be any naturally occurring amino acid

<400>  73

Ser Val Ala Gly Asp Ser Leu Ile Tyr Val Asn Asp Lys Lys Ile Lys 
1               5                   10                  15      


Ile Glu Asp Tyr Tyr Asn Ser Leu Glu Asn Asn Phe Ile Leu Asn Asp 
            20                  25                  30          


Thr Phe Asn Glu Asn Tyr Val Lys Glu Val Ser Gly Asp Thr Thr Lys 
        35                  40                  45              


Xaa Tyr Ile Glu Gly Ile Lys Asn Lys Lys Ile Asn Tyr Ile Met Lys 
    50                  55                  60                  


His Lys Val Ser Lys Lys Met Tyr Arg Ile Lys Xaa Asn Gly Asn Phe 
65                  70                  75                  80  


Val Asp Val Thr Glu Asp His Ser Val Ile Val Lys Asn Lys Lys Thr 
                85                  90                  95      


Lys Lys Ile Ser Ser Ile Lys Pro Lys Asn Leu Asn Pro Asn Leu His 
            100                 105                 110         


Ser Ile Ile Asn Ile Lys Thr Lys Lys Ile Asp Glu Leu Ile Thr Asp 
        115                 120                 125             


Asp Phe Glu Val Glu Tyr Leu Gly Ile Ile Glu Asn Tyr Val Tyr Asp 
    130                 135                 140                 


Ile Glu Val Glu Glu Ala His Asn Phe Phe Ala Asn Asp Ile Leu Val 
145                 150                 155                 160 


His Asn 
        


<210>  74
<211>  119
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from human gut metagenome Denmark fecal sample


<220>
<221>  misc_feature
<222>  (24)..(24)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (117)..(117)
<223>  Xaa can be any naturally occurring amino acid

<400>  74

Ser Leu Val Gly Ser Ser Ile Ile Ile Val Asn Gly Lys Lys Ile Lys 
1               5                   10                  15      


Ile Glu Asp Tyr Tyr Asn Gln Xaa Asn Gly Ile Leu Ile Lys Asn Asp 
            20                  25                  30          


Ile Asn Asn Gln Asn Phe Ile Lys Glu Ile Asp Asn Asp Asp Lys Gly 
        35                  40                  45              


Leu Ser Tyr Asp Ile Asn Asn Gln Gln Ile Val Asn Asn Lys Ile Lys 
    50                  55                  60                  


Tyr Ile Lys Lys His Lys Val Lys Lys Glu Phe Phe Lys Ile Ser Tyr 
65                  70                  75                  80  


Lys Asp Lys Glu Val Ile Ile Thr Glu Asp His Ser Val Met Ile Glu 
                85                  90                  95      


Arg Asn Asp Lys Ile Ile Glu Ile Lys Pro Arg Glu Ile Lys Gln Gly 
            100                 105                 110         


Asp Lys Ile Ile Xaa Ile Gln 
        115                 


<210>  75
<211>  41
<212>  PRT
<213>  Unknown

<220>
<223>  IntC from human gut metagenome Denmark fecal sample

<400>  75

Met Gln Ile Gln Lys Thr Thr Asp Phe Lys Ile Glu Ser Leu Gly Ile 
1               5                   10                  15      


Gln Glu Gln Tyr Val Tyr Asp Ile Glu Ile Glu Asp Thr His Asn Phe 
            20                  25                  30          


Phe Ala Asn Thr Ile Leu Val His Asn 
        35                  40      


<210>  76
<211>  119
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from human gut metagenome Peru National Reserve 
       Matses (Amazon) fecal sample


<220>
<221>  misc_feature
<222>  (31)..(31)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (77)..(77)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (114)..(114)
<223>  Xaa can be any naturally occurring amino acid

<400>  76

Ser Val Val Gly Asp Thr Ile Ile Asn Val Asn Gly Lys Pro Ile Thr 
1               5                   10                  15      


Ile Ala Asp Tyr Tyr Asn Ser Ile Val Pro Asn Tyr Ile Lys Xaa Asp 
            20                  25                  30          


Asp Ile Asn Lys Asn Tyr Ile Lys Arg Val Asp Asn Gly Asp Val Ala 
        35                  40                  45              


Leu Ala Tyr Asp Asn Gly Ile Val Gln Asn Lys Ile Lys His Val Met 
    50                  55                  60                  


Lys His Thr Val Lys Lys Arg Met Tyr Lys Ile Lys Xaa Asn Gly Lys 
65                  70                  75                  80  


Glu Val Ile Met Thr Glu Asp His Ser Ile Ile Val Asn Arg Asn Gly 
                85                  90                  95      


Lys Asn Ile Ser Val Ser Pro Lys Asp Ile Leu Lys Gly Asp Arg Leu 
            100                 105                 110         


Ile Xaa Leu Asn Asn Tyr Lys 
        115                 


<210>  77
<211>  46
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC from human gut metagenome Peru National Reserve 
       Matses (Amazon) fecal sample


<220>
<221>  misc_feature
<222>  (2)..(2)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (8)..(8)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (34)..(34)
<223>  Xaa can be any naturally occurring amino acid

<400>  77

Met Xaa Lys Ile Tyr Thr Glu Xaa Asn Val Val Asp Asp Phe Glu Val 
1               5                   10                  15      


Glu Asp Leu Gly Val Gln Glu Leu Asp Val Tyr Asp Ile Glu Val Glu 
            20                  25                  30          


Glu Xaa His Asn Phe Phe Ala Asn Asp Ile Leu Val His Asn 
        35                  40                  45      


<210>  78
<211>  119
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from human gut metagenome Spanish infant fecal 
       sample


<220>
<221>  misc_feature
<222>  (31)..(31)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (77)..(77)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (114)..(114)
<223>  Xaa can be any naturally occurring amino acid

<400>  78

Ser Val Val Gly Asp Thr Ile Ile Asn Val Asn Gly Lys Pro Ile Thr 
1               5                   10                  15      


Ile Ala Asp Tyr Tyr Asn Ser Ile Ala Pro Asn Tyr Ile Lys Xaa Asp 
            20                  25                  30          


Asp Ile Asn Lys Asn Tyr Ile Lys Arg Val Asp Asn Gly Asp Val Ala 
        35                  40                  45              


Leu Ala Tyr Asp Asn Gly Ile Val Gln Asn Lys Ile Lys His Val Met 
    50                  55                  60                  


Lys His Thr Val Lys Lys Arg Met Phe Lys Ile Lys Xaa Asn Gly Lys 
65                  70                  75                  80  


Glu Val Ile Met Thr Glu Asp His Ser Ile Ile Val Asn Arg Asn Gly 
                85                  90                  95      


Lys Asn Ile Ser Val Ser Pro Lys Asp Ile Leu Lys Gly Asp Arg Leu 
            100                 105                 110         


Ile Xaa Leu Asn Asn Tyr Lys 
        115                 


<210>  79
<211>  46
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC from human gut metagenome Spanish infant fecal 
       sample


<220>
<221>  misc_feature
<222>  (2)..(2)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (8)..(8)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (34)..(34)
<223>  Xaa can be any naturally occurring amino acid

<400>  79

Met Xaa Lys Ile Tyr Thr Glu Xaa Asn Val Val Asp Asp Phe Glu Ile 
1               5                   10                  15      


Glu Asp Leu Gly Val Gln Glu Leu Asp Val Tyr Asp Ile Glu Val Glu 
            20                  25                  30          


Glu Xaa His Asn Phe Phe Ala Asn Gly Ile Leu Val His Asn 
        35                  40                  45      


<210>  80
<211>  122
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from megaphage from human gut metagenome Denmark 
       fecal sample


<220>
<221>  misc_feature
<222>  (9)..(9)
<223>  Xaa can be any naturally occurring amino acid

<400>  80

Ser Gln Ile Gly Ser Thr Gln Phe Xaa Val Asp Asn Asn Ile Thr Thr 
1               5                   10                  15      


Met Glu Asp Phe Phe Thr Lys Ala Lys Tyr Glu Asn Asn Asp Val Val 
            20                  25                  30          


Ile Lys Leu Gln Asn Gly Ser Glu Val Val Pro Val His Asn His Asn 
        35                  40                  45              


Thr Leu Ser Tyr Lys Asp His Asp Tyr Lys Thr Ile Glu Arg Pro Ile 
    50                  55                  60                  


Asn Tyr Ile Met Arg His Lys Val Thr Lys Asp Lys Phe Arg Leu Lys 
65                  70                  75                  80  


Thr Lys Ser Gly Lys Glu Leu Ile Val Thr Gly Asp His Ser Ile Met 
                85                  90                  95      


Val Ile Arg Asn Asn Glu Leu Val Ser Leu Pro Ala Arg Glu Ile Lys 
            100                 105                 110         


Lys Ser Asp Lys Ile Ile Thr Leu Asp Lys 
        115                 120         


<210>  81
<211>  44
<212>  PRT
<213>  Unknown

<220>
<223>  IntC from megaphage from human gut metagenome Denmark fecal 
       sample

<400>  81

Met Asn Tyr Asn Ile Gln Val Glu Asn Ile Asp Val Ile Glu Gln Leu 
1               5                   10                  15      


Glu Asp Phe Asn Asp Glu Tyr Val Tyr Asp Ile Glu Val Asp Asp Thr 
            20                  25                  30          


His Thr Phe Phe Ala Asn Glu Ile Leu Ala His Asn 
        35                  40                  


<210>  82
<211>  264
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from human gut metagenome Denmark fecal sample


<220>
<221>  misc_feature
<222>  (46)..(46)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (156)..(156)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (207)..(207)
<223>  Xaa can be any naturally occurring amino acid

<400>  82

Ser Ile Val Ala Ser Thr Met Ile Tyr Gly Asp Thr Phe Ala Gly Thr 
1               5                   10                  15      


Ile Glu Glu Leu Phe Lys Lys Ser Ala Glu Gly Arg Val Leu Lys Gln 
            20                  25                  30          


Thr Leu Asn Gly Thr Leu Met Thr Glu Ser Asp Val Lys Xaa Leu Asn 
        35                  40                  45              


Trp Ser Glu Ser Lys Gly Leu His Tyr Ala Pro Ile Val Tyr Ile Ser 
    50                  55                  60                  


Lys His Leu Val Lys Lys Asn Met Trp Lys Leu Arg Gly Lys Ser Gly 
65                  70                  75                  80  


Lys Glu Val Ile Val Thr Glu Asp His Ser Leu Ile Val Phe Arg Asp 
                85                  90                  95      


Gly Lys Lys Leu Glu Val Lys Pro Lys Glu Ile Leu Pro Thr Asp Lys 
            100                 105                 110         


Ile Leu Ile Val Phe Pro Leu Lys Gln Arg Ile Ala Leu Ile Ile Lys 
        115                 120                 125             


Ala Arg Glu Arg Gly Tyr Ser Leu Ser Ser Gln Glu Val Phe Asp Lys 
    130                 135                 140                 


Met Arg Pro Ile Val Glu Glu Met Gly Tyr Thr Xaa Lys Tyr Ala Thr 
145                 150                 155                 160 


His Gly Gly Glu His Val Val Ile Thr Leu Asp Lys Ser Tyr Phe Leu 
                165                 170                 175     


Asp Phe Tyr Ile Pro Glu Leu Lys Ile Gly Ile Glu Tyr Asn Gly Gly 
            180                 185                 190         


Met Phe His Gly Asp Pro Arg Leu Tyr Lys Asp Asp Glu Tyr Xaa Asn 
        195                 200                 205             


Pro Trp Asn Ile Asn Glu Thr Ala Lys Asp Met Arg Glu Arg Asp Gln 
    210                 215                 220                 


Gln Arg Tyr Asp Phe Leu Leu Asn Asn Tyr Gly Ile Lys Thr Tyr Ile 
225                 230                 235                 240 


Ile Trp Glu Leu Asp Tyr Asn Gln Gly Leu Asp Val Glu Ser Phe Ile 
                245                 250                 255     


Arg Arg Ile Leu Asn Glu Ser Lys 
            260                 


<210>  83
<211>  47
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC from human gut metagenome Denmark fecal sample


<220>
<221>  misc_feature
<222>  (18)..(18)
<223>  Xaa can be any naturally occurring amino acid

<400>  83

Met Arg Val Ser Glu Phe Glu Tyr Gln Phe Asp Glu Ile Glu Ser Ile 
1               5                   10                  15      


Glu Xaa Leu Gly Glu Thr Asn Glu Tyr Val Tyr Asp Ile Glu Val Ala 
            20                  25                  30          


Asp Glu Ser His Thr Phe Ile Ala Asn Asp Ile Leu Val His Asn 
        35                  40                  45          


<210>  84
<211>  170
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from human gut metagenome Denmark 
       Roux-en-Y gastric bypass surgery of morbidly obese patient


<220>
<221>  misc_feature
<222>  (85)..(85)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (109)..(109)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (138)..(138)
<223>  Xaa can be any naturally occurring amino acid

<400>  84

Ser Ile Ser Tyr Asp Ser Ile Ile Lys Thr Ser Arg His Pro Asn Gly 
1               5                   10                  15      


Ile Thr Ile Ser Glu Trp Tyr Lys Glu Asn Glu Asn Asn Ile Gly Glu 
            20                  25                  30          


Arg Thr Leu Ala Gly His Glu Ser Val His Thr Asp Asp Lys Ala Leu 
        35                  40                  45              


Asn Phe Asp Asp Asp Lys Leu Thr Phe Thr Asn Val Lys Arg Ile Ile 
    50                  55                  60                  


Arg His Lys Val Ser Lys Pro Lys Trp Lys Leu Arg Thr Ser Ser Gly 
65                  70                  75                  80  


Lys Glu Ile Val Xaa Thr Asp Asn His Ser Leu Ile Val Phe Arg Asn 
                85                  90                  95      


Gly Thr Lys Lys Lys Val Lys Pro Ser Glu Ile Thr Xaa Glu Asp Lys 
            100                 105                 110         


Val Leu Thr Val Asn Leu Ser Thr Ser Ile Glu Glu Asn Ile Arg Tyr 
        115                 120                 125             


Val Gln Ser Phe Asp Asn Val Glu Ile Xaa Val Asn Ile Gly Glu Tyr 
    130                 135                 140                 


Thr Asp Glu Tyr Val Tyr Asp Ile Glu Met Asp Asp Asp Ser His Thr 
145                 150                 155                 160 


Phe Ile Ala Asn Asp Ile Leu Val His Asn 
                165                 170 


<210>  85
<211>  12
<212>  PRT
<213>  Unknown

<220>
<223>  IntN from human gut metagenome Bangladeshi cholera-succession

<400>  85

Ser Ala Ala Phe Ser Thr Lys Ile Met Ile Lys Arg 
1               5                   10          


<210>  86
<211>  154
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC from human gut metagenome Bangladeshi 
       cholera-succession


<220>
<221>  misc_feature
<222>  (68)..(68)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (82)..(82)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (136)..(136)
<223>  Xaa can be any naturally occurring amino acid

<400>  86

Met Lys Gln Asp Ile Glu Ile Gly Arg Leu Phe Asp Glu Leu Leu Glu 
1               5                   10                  15      


Ser Gly Leu Glu Leu Lys Thr Arg Asn Gly Tyr Glu Tyr Met Glu Pro 
            20                  25                  30          


Lys Gly Ile Lys Ile Leu Thr Gly Gly Asn Arg Tyr Val Trp Leu Val 
        35                  40                  45              


Ala Ile Ser Arg His Arg Thr Pro Lys His Leu Val Arg Ile Ser Ile 
    50                  55                  60                  


Thr Thr Ser Xaa Ser Gly Lys Tyr Asp Val Thr Val Thr Thr Asp His 
65                  70                  75                  80  


Val Xaa Met Val Met Asn Arg Asp His Phe Phe Asp Asn Val Asn Ala 
                85                  90                  95      


Lys Asn Leu Glu Ile Gly Asp Tyr Val Gln Val Tyr Asp Ala Gly Tyr 
            100                 105                 110         


Gly Lys Glu Val Leu Gly Ala Ile Ser Lys Ile Glu Asp Leu Gly Pro 
        115                 120                 125             


Thr Asp Asp Tyr Val Tyr Asp Xaa Glu Val Asp Asp Asn Gly His Thr 
    130                 135                 140                 


Phe Tyr Gly Asn Ser Val Leu Leu His Asn 
145                 150                 


<210>  87
<211>  14
<212>  PRT
<213>  Unknown

<220>
<223>  IntN from pig gut metagenome

<400>  87

Ser Ala Ile Tyr Ser Thr Lys Leu Arg Ile Arg Ile Lys Asp 
1               5                   10                  


<210>  88
<211>  172
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC from pig gut metagenome


<220>
<221>  misc_feature
<222>  (84)..(84)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (101)..(101)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (149)..(149)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (162)..(162)
<223>  Xaa can be any naturally occurring amino acid

<400>  88

Met Glu Lys Gln Glu Glu Ile Gly Ala Leu Tyr Asp Lys Leu Val Ala 
1               5                   10                  15      


Glu Gly Arg Lys Ile Val Asn Asp Gly Lys Tyr Glu Leu Val Asp Ala 
            20                  25                  30          


Lys Gly Ile Glu Val Leu Ser Tyr Gly Gly Lys Thr Asp Asn Gly Arg 
        35                  40                  45              


Tyr Ala Pro Ile Lys Tyr Val Ser Arg His Lys Thr Ser Lys Asn Leu 
    50                  55                  60                  


Val Glu Ile Thr Val Thr Ile Gly Ser Asp Ala Val Asp Gly Gly Gly 
65                  70                  75                  80  


Leu Ala Ser Xaa Gln Lys Lys Arg Leu Glu Lys Lys Val Val Val Thr 
                85                  90                  95      


Thr Asp His Ile Xaa Met Lys Tyr Asp Arg Asp Arg Val Phe Gln Asn 
            100                 105                 110         


Val Asp Ala Lys Asn Leu Thr Pro Gly Asp Tyr Val Ser Val Tyr Asp 
        115                 120                 125             


Ser Gly Tyr Gly Met Glu Thr Tyr Gly Ile Val Ser Ala Ile Lys Asn 
    130                 135                 140                 


Leu Gly Pro Thr Xaa Glu Tyr Val Tyr Asp Leu Glu Val Asp Asp Thr 
145                 150                 155                 160 


His Xaa Phe Tyr Ala Asn Asp Val Leu Ile His Asn 
                165                 170         


<210>  89
<211>  116
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from bovine gut rumen metagenome


<220>
<221>  misc_feature
<222>  (9)..(9)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (42)..(42)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (84)..(84)
<223>  Xaa can be any naturally occurring amino acid

<400>  89

Ser Val Val Asp Ser Lys Ile Arg Xaa Met Ser Gly Glu Lys Met Leu 
1               5                   10                  15      


Ser Asp Ile Phe Asn Glu Ser Lys Asn Gln Tyr Lys Thr Pro Phe Ile 
            20                  25                  30          


Thr Ser His Gly Ala Glu Leu Tyr Pro Xaa Asp Glu Arg Val Leu Asn 
        35                  40                  45              


Tyr Asp Asp Thr Gly Leu His Tyr Thr Arg Ile Lys Tyr Ile Ser Arg 
    50                  55                  60                  


His Lys Val Asn Lys Pro Met Trp Arg Leu Arg Thr Lys Ser Gly Lys 
65                  70                  75                  80  


Glu Ile Leu Xaa Thr Glu Asp His Ser Leu Val Val Tyr Arg Asp Gly 
                85                  90                  95      


Lys Lys Lys Ser Ile Lys Pro Asp Lys Ile Leu Pro Thr Asp Lys Ile 
            100                 105                 110         


Val Thr Ile Lys 
        115     


<210>  90
<211>  43
<212>  PRT
<213>  Unknown

<220>
<223>  IntC from bovine gut rumen metagenome

<400>  90

Met Glu Ile Ile Phe Asp Asp Ile Glu Thr Ile Glu Cys Val Ser Asp 
1               5                   10                  15      


Gly Trp Asp Asp Tyr Val Tyr Asp Ile Glu Val Glu Asp Asp Ser His 
            20                  25                  30          


Thr Phe Ile Ala Asn Asp Ile Leu Val His Asn 
        35                  40              


<210>  91
<211>  223
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from pig gut metagenome


<220>
<221>  misc_feature
<222>  (2)..(2)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (93)..(93)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (191)..(191)
<223>  Xaa can be any naturally occurring amino acid

<400>  91

Ser Xaa Glu Lys Ser Thr Phe Ile Asn Ile Ser Asp Asn Val Ile Ser 
1               5                   10                  15      


Ile Glu Thr Asn Ser Gly Ser Lys Val Tyr Asn Lys Asn Asp Ile Val 
            20                  25                  30          


Lys Val Asn Arg Asn Gly Ile Glu Met Glu Ile Tyr Ala Ser Asp Ile 
        35                  40                  45              


Asn Glu Asn Asp Leu Ile Trp Glu Glu Ser Ile Lys Lys Ile Glu Lys 
    50                  55                  60                  


Thr Asn Lys Ile Thr Ile Glu Gln Leu Tyr Asn Leu Gly Thr Gln Asp 
65                  70                  75                  80  


Met Gly Ser Thr Leu Ala Gly His Glu Ser Val Asn Xaa Asp His Asp 
                85                  90                  95      


Val Leu Asn Ile Lys Gln Asn Asp Leu Ser Tyr Ser Gly Asn Gly Val 
            100                 105                 110         


Asp Phe Thr Ser Tyr Tyr Ala Pro Val Lys Arg Val Ile Arg His Lys 
        115                 120                 125             


Val Thr Lys Ala Lys Trp Ser Leu Val Asp Ser Asp Asn Asn Glu Val 
    130                 135                 140                 


Ile Val Thr Asn Asp His Ser Leu Met Val Leu Arg Asp Gly Asn Leu 
145                 150                 155                 160 


Gln Lys Ile Lys Pro Tyr Asp Val Asp Val Glu Asn Asp Phe Leu Val 
                165                 170                 175     


Thr Ile Ser Asp Gly Asn Ile Glu Ile Arg Asp Ile Val Arg Xaa Glu 
            180                 185                 190         


Gln Ile Gly Asn Phe Glu Asp Glu Tyr Val Tyr Asp Ile Glu Met Asn 
        195                 200                 205             


Asp Asp Thr His Thr Phe Val Ala Asn Asn Ile Leu Val His Asn 
    210                 215                 220             


<210>  92
<211>  141
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (4)..(4)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (59)..(59)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (101)..(101)
<223>  Xaa can be any naturally occurring amino acid

<400>  92

Ser Val Ser Xaa Asp Thr Ile Ile Tyr Trp Gly Phe Ala Gly Ser Ala 
1               5                   10                  15      


Glu Gln Ser Thr Thr Ile Glu Glu Met Phe Asn Ile Ile Lys Glu Gln 
            20                  25                  30          


Asn Met Asp Thr Val Leu Asn Leu Glu Asn Gly Ser Glu Val Val Pro 
        35                  40                  45              


Thr Ser Glu Leu Thr Ala Ile Arg Thr Tyr Xaa Pro Ile Leu Asn His 
    50                  55                  60                  


Val Thr Gln Lys Pro Val Lys Tyr Val Met Arg His Lys Val Lys Lys 
65                  70                  75                  80  


Ala Arg Tyr Arg Ile Thr Thr Ser Ser Gly Lys Gln Val Ile Val Thr 
                85                  90                  95      


Gly Asp His Ser Xaa Met Val Leu Arg Asp Gly Lys Leu Thr Ala Val 
            100                 105                 110         


Lys Ala Asn Glu Ile Asn Pro Asn Thr Asp Lys Ile Ile Ser Asp Lys 
        115                 120                 125             


His Tyr Arg Pro Lys Ile Asn Glu Glu Lys Asp Glu Gly 
    130                 135                 140     


<210>  93
<211>  75
<212>  PRT
<213>  Unknown

<220>
<223>  IntC from sheep gut metagenome

<400>  93

Met Glu Ala Gln Lys Gln Asn Asn Asn Asn Glu Tyr Ile Asp Leu Asp 
1               5                   10                  15      


Ile Asp Glu Phe Asn Pro Val Val Gly Asp Val Ala Asp Thr Asp Gly 
            20                  25                  30          


Ser Asp Thr Tyr Glu Val Glu Asp Ile Val Ser Ile Glu Gln Leu Asp 
        35                  40                  45              


Asp Phe Asp Asp Glu Tyr Val Tyr Asp Val Glu Val Glu Asp Thr His 
    50                  55                  60                  


Thr Phe Phe Ala Asn Asp Ile Leu Val His Asn 
65                  70                  75  


<210>  94
<211>  168
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (6)..(6)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (47)..(47)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (96)..(96)
<223>  Xaa can be any naturally occurring amino acid

<400>  94

Ser Thr Phe Ser Lys Xaa Leu Leu Leu Ile Asn Arg Asn Asn Lys Asn 
1               5                   10                  15      


Val Gln Ile Thr Ile Glu Asp Leu Phe Asn Glu Ser Leu Lys Gln Asn 
            20                  25                  30          


Gly Ile Lys Asp Ile Thr Asn Asn Gly His Glu Ile Val Lys Xaa Asn 
        35                  40                  45              


Asp Asp Val Leu Asn Trp Thr Glu Lys Asn Gly Leu Asn Phe Val Pro 
    50                  55                  60                  


Ile Lys Trp Ile Met Arg His Lys Val Ser Lys Pro Met Phe Lys Ile 
65                  70                  75                  80  


Thr Thr Lys Ser Gly Lys Thr Ile Thr Val Thr Glu Asp His Ser Xaa 
                85                  90                  95      


Val Ile Phe Arg Asn Gly Glu Gln Ile Val Ile Lys Ala Lys Asp Ile 
            100                 105                 110         


Asn Lys Glu Thr Asp Lys Ile Leu Ser Val Ile Asn Glu Asn Glu Thr 
        115                 120                 125             


Tyr Gln Phe Glu Glu Ile Glu Asn Ile Glu Gln Val Glu Tyr Lys Asp 
    130                 135                 140                 


Asp Tyr Val Tyr Asp Ile Glu Val Asp Asp Asn Ser His Thr Phe Ile 
145                 150                 155                 160 


Gly Asn Asp Ile Leu Val His Asn 
                165             


<210>  95
<211>  166
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (2)..(2)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (109)..(109)
<223>  Xaa can be any naturally occurring amino acid

<400>  95

Ser Xaa Ala Gly Asn Ser Ile Ile Glu Leu Asn Gly Arg Lys Met Ser 
1               5                   10                  15      


Ile Glu Asn Ala Phe Ser Tyr Leu Lys Glu Glu Asn Asp His Ile Val 
            20                  25                  30          


Leu Arg Thr Ser Asn Gly Ser Glu Val Val Pro Val Glu Asn Thr Thr 
        35                  40                  45              


Thr Lys Thr Tyr Asp Ser Leu Ser Lys Lys Ile Val Asp Arg Asp Val 
    50                  55                  60                  


Lys Tyr Ile Met Arg His Lys Val Ser Lys Pro Lys Trp Lys Leu Thr 
65                  70                  75                  80  


Thr Ser Ser Gly Lys Met Ile Glu Val Thr Gly Asp His Ser Leu Met 
                85                  90                  95      


Val Met Arg Asp Gly Glu Leu Leu Ser Val Lys Ala Xaa Glu Val Asp 
            100                 105                 110         


Pro Lys Lys Asp Lys Ile Val Thr Tyr Ile Asn Thr Gln Glu Tyr Ile 
        115                 120                 125             


Ile Glu Asp Ile Glu Ser Ile Glu Gln Val Glu Asp Phe Asn Asp Glu 
    130                 135                 140                 


Tyr Val Tyr Asp Ile Glu Val Asp Asp Thr His Thr Phe Phe Ala Asn 
145                 150                 155                 160 


Asp Ile Leu Val His Asn 
                165     


<210>  96
<211>  166
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (2)..(2)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (109)..(109)
<223>  Xaa can be any naturally occurring amino acid

<400>  96

Ser Xaa Ala Gly Asn Ser Ile Ile Glu Leu Asn Gly Arg Lys Met Ser 
1               5                   10                  15      


Ile Glu Asn Ala Phe Ser Tyr Leu Lys Glu Glu Asn Asp His Ile Val 
            20                  25                  30          


Leu Arg Thr Ser Asn Gly Ser Glu Val Val Pro Val Glu Asn Thr Thr 
        35                  40                  45              


Thr Lys Thr Tyr Asp Ser Leu Ser Lys Lys Ile Val Asp Arg Asp Val 
    50                  55                  60                  


Lys Tyr Ile Met Arg His Lys Val Ser Lys Pro Lys Trp Lys Leu Thr 
65                  70                  75                  80  


Thr Ser Ser Gly Lys Met Ile Glu Val Thr Gly Asp His Ser Leu Met 
                85                  90                  95      


Val Met Arg Asp Gly Glu Leu Leu Ser Val Lys Ala Xaa Glu Val Asp 
            100                 105                 110         


Pro Lys Lys Asp Lys Ile Val Thr Tyr Ile Asn Thr Gln Glu Tyr Ile 
        115                 120                 125             


Ile Glu Asp Ile Glu Ser Ile Glu Gln Val Glu Asp Phe Asn Asp Glu 
    130                 135                 140                 


Tyr Val Tyr Asp Ile Glu Val Asp Asp Thr His Thr Phe Phe Ala Asn 
145                 150                 155                 160 


Asp Ile Leu Val His Asn 
                165     


<210>  97
<211>  129
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (99)..(99)
<223>  Xaa can be any naturally occurring amino acid

<400>  97

Ser Val Ala Gly Asp Thr Lys Val Asp Ile Ser Ser Ala Asp Ile Lys 
1               5                   10                  15      


Lys Arg Ile Asp Ile Ser Glu Leu Phe Thr Lys Ala Lys Tyr Leu Asn 
            20                  25                  30          


Asp Asp His Val Leu Ser Val Ser Asn Gly Ser Glu Val Ile Pro Gly 
        35                  40                  45              


Asn Gly Ile Leu Ile Arg Ala Tyr Asp Lys Asp Leu Asp Met Ala Val 
    50                  55                  60                  


Tyr Lys Pro Met Lys Tyr Val Met Arg His Lys Val Ser Lys Ala Arg 
65                  70                  75                  80  


Phe Arg Ile Lys Thr Glu Ser Gly Lys Glu Val Ile Val Thr Gly Asp 
                85                  90                  95      


His Ser Xaa Ile Val Leu Arg Asp Gly Glu Leu Ile Asp Ile Lys Ala 
            100                 105                 110         


Lys Asp Ile Asn Lys Glu Thr Asp Lys Ile Ile Thr Ile Asn Ser Lys 
        115                 120                 125             


Lys 
    


<210>  98
<211>  47
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (35)..(35)
<223>  Xaa can be any naturally occurring amino acid

<400>  98

Met Asp Phe Lys Glu Lys Asn Tyr Lys Ile Glu Ser Ile Ala Glu Ile 
1               5                   10                  15      


Glu Gln Leu Asp Asp Phe Glu Asp Glu Tyr Val Tyr Asp Val Glu Val 
            20                  25                  30          


Asp Asp Xaa His Asn Phe Phe Ala Asn Asp Val Leu Val His Asn 
        35                  40                  45          


<210>  99
<211>  166
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (85)..(85)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (122)..(122)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (134)..(134)
<223>  Xaa can be any naturally occurring amino acid

<400>  99

Ser Val Ser Gly Asp Thr Val Val Val Thr Lys Asn His Pro Asn Gly 
1               5                   10                  15      


Ile Ser Ile Glu Arg Leu Tyr Gly Glu Asn Ala Glu Tyr Ala Val Glu 
            20                  25                  30          


Phe Gly Asp Asn His Glu Ser Val Leu Thr Gly Asp Met Val Leu Asn 
        35                  40                  45              


Tyr Tyr Asp Asn Glu Leu Tyr Val Ala Pro Val Ser Arg Ile Ile Arg 
    50                  55                  60                  


His Lys Val Thr Lys Asp Lys Tyr Leu Leu Lys Ala Phe Asn Ser Gly 
65                  70                  75                  80  


Asn Ala Val Glu Xaa Thr Ser Asp His Ser Leu Val Val Tyr Arg Gln 
                85                  90                  95      


Asn Gly Asp Glu Gly Lys Tyr Val Ser Ala Val Val Lys Pro Tyr Glu 
            100                 105                 110         


Ile Arg Asn Gly Asp Leu Val Val Thr Xaa Lys Asn Asp Ser Tyr Thr 
        115                 120                 125             


Phe Glu Glu Ala Thr Xaa Glu Lys Ile Gly Glu Tyr Asn Asp Glu Tyr 
    130                 135                 140                 


Val Tyr Asp Ile Glu Met Asp Asp Leu Thr Ser Thr Phe Val Ala Asn 
145                 150                 155                 160 


Gly Ile Leu Val His Asn 
                165     


<210>  100
<211>  14
<212>  PRT
<213>  Unknown

<220>
<223>  IntN from sheep gut metagenome

<400>  100

Ser Ser Val Phe Asp Thr Tyr Ile Asp Val Ile Glu Glu Asp 
1               5                   10                  


<210>  101
<211>  159
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (58)..(58)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (72)..(72)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (87)..(87)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (114)..(114)
<223>  Xaa can be any naturally occurring amino acid

<400>  101

Met Lys Leu Tyr Leu Glu Arg Leu Ser Met Thr Ser Val Lys Ile Gly 
1               5                   10                  15      


Glu Leu Tyr Asn Lys Tyr Leu Ala Lys Gly Tyr Glu Ile Val Asn Thr 
            20                  25                  30          


Pro Thr Gly His Glu Leu Ile Tyr Pro Lys Ser Leu Lys Val Arg Ser 
        35                  40                  45              


Ile Gly Asn Glu Phe Lys Lys Val Lys Xaa Leu Ser Arg His Arg Thr 
    50                  55                  60                  


Ser Lys Pro Leu Val Arg Ile Xaa Phe Gln Lys Ser Glu Pro Leu Val 
65                  70                  75                  80  


Val Thr Thr Asp His Val Xaa Met Ala Tyr Asn Asp Asp Arg Met Leu 
                85                  90                  95      


Glu Asn Ile Ala Ser Lys Asp Leu Arg Val Gly Met Met Val Asp His 
            100                 105                 110         


Tyr Xaa Arg Thr Ser Asp Lys Glu Val Ile Asp Val Ile Thr Asn Ile 
        115                 120                 125             


Glu Pro Leu Gly Thr Thr Asp Asp Tyr Val Tyr Asp Leu Glu Val Glu 
    130                 135                 140                 


Asp Glu Ser His Val Phe Tyr Ala Asn Asp Thr Leu Ile His Asn 
145                 150                 155                 


<210>  102
<211>  124
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (63)..(63)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (98)..(98)
<223>  Xaa can be any naturally occurring amino acid

<400>  102

Ser Ile Asp Gly Asn Ser Val Ile Asp Ile Asn Asp Glu Lys Ile Ala 
1               5                   10                  15      


Ile Lys Asp Ala Phe Ala Ala Ile Lys Tyr Met Asn Ile Asp Thr Val 
            20                  25                  30          


Ile Met Leu Pro Asn Gly Thr Gln Val Val Pro Ala Pro Ser Asp Ile 
        35                  40                  45              


Thr Leu Thr Thr Lys Thr Tyr Asp Ala Ser Thr Asp Thr Val Xaa Asp 
    50                  55                  60                  


Lys Gln Ile Arg Tyr Ile Met Arg His Lys Val Lys Lys Ser Lys Tyr 
65                  70                  75                  80  


Lys Ile Thr Thr Glu Ser Gly Lys Glu Val Ile Val Thr Gly Asp His 
                85                  90                  95      


Ser Xaa Met Val Val Arg Asp Gly Ile Leu Ile Ser Val Thr Ala Gln 
            100                 105                 110         


Glu Ile Asn Pro Glu Thr Asp Lys Ile Ile Thr Ile 
        115                 120                 


<210>  103
<211>  51
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC from sheep gut metagenome

<400>  103

Met Leu Lys Ile Met Asp Phe Ser Gln Thr Asn Tyr Lys Val Glu Asn 
1               5                   10                  15      


Ile Lys Ser Ile Glu Val Leu Pro Asp Phe Asp Asp Glu Tyr Val Tyr 
            20                  25                  30          


Asp Ile Glu Val Gly Glu Thr His Met Phe Phe Ala Asn Glu Ile Leu 
        35                  40                  45              


Val His Asn 
    50      


<210>  104
<211>  171
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (139)..(139)
<223>  Xaa can be any naturally occurring amino acid

<400>  104

Ser Val Ser Lys Asp Thr Ile Ile Arg Thr Arg Leu His Pro Asp Gly 
1               5                   10                  15      


Ile Met Ile Glu Asp Phe Tyr Asp Glu Asn Ser Ser Asn Lys Gly Glu 
            20                  25                  30          


Asp Thr Arg Ala Gly His Glu Ser Val His Thr Ser Asp Gln Val Leu 
        35                  40                  45              


Asn Phe Asn Gly Ser Ile Leu Thr Asn Thr Ser Asn Leu Tyr Tyr Gly 
    50                  55                  60                  


Asn Val Lys Arg Ile Ile Arg His Lys Val Ser Lys Pro Lys Trp Thr 
65                  70                  75                  80  


Ile Thr Thr Arg Phe Gly His Ser Val Ser Val Thr Asn Asp His Ser 
                85                  90                  95      


Leu Met Val Leu Arg Asp Asp Lys Leu Tyr Lys Val Lys Pro Ser Glu 
            100                 105                 110         


Val Lys Pro Gly Asp Glu Ala Met Ser Val Glu Val Met Tyr Gly Asp 
        115                 120                 125             


Ile Val Glu Gly Ala Asp Lys Ile Thr Ser Xaa Val His Thr Gly Glu 
    130                 135                 140                 


Tyr Asn Asp Glu Tyr Val Tyr Asp Ile Glu Met Asp Asp Asp Asn His 
145                 150                 155                 160 


Thr Phe Phe Ala Asn Asp Ile Leu Val His Asn 
                165                 170     


<210>  105
<211>  166
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (2)..(2)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (109)..(109)
<223>  Xaa can be any naturally occurring amino acid

<400>  105

Ser Xaa Ala Gly Asn Ser Ile Ile Glu Leu Asn Gly Arg Lys Met Ser 
1               5                   10                  15      


Ile Glu Asn Ala Phe Ser Tyr Leu Lys Glu Glu Asn Asp His Ile Val 
            20                  25                  30          


Leu Arg Thr Ser Asn Gly Ser Glu Val Val Pro Val Glu Asn Thr Thr 
        35                  40                  45              


Thr Lys Thr Tyr Asp Ser Leu Ser Lys Lys Ile Val Asp Arg Asp Val 
    50                  55                  60                  


Lys Tyr Ile Met Arg His Lys Val Ser Lys Pro Lys Trp Lys Leu Thr 
65                  70                  75                  80  


Thr Ser Ser Gly Lys Met Ile Glu Val Thr Gly Asp His Ser Leu Met 
                85                  90                  95      


Val Met Arg Asp Gly Glu Leu Leu Ser Val Lys Ala Xaa Glu Val Asp 
            100                 105                 110         


Pro Lys Lys Asp Lys Ile Val Thr Tyr Ile Asn Thr Gln Glu Tyr Ile 
        115                 120                 125             


Ile Glu Asp Ile Glu Ser Ile Glu Gln Val Glu Asp Phe Asn Asp Glu 
    130                 135                 140                 


Tyr Val Tyr Asp Ile Glu Val Asp Asp Thr His Thr Phe Phe Ala Asn 
145                 150                 155                 160 


Asp Ile Leu Val His Asn 
                165     


<210>  106
<211>  124
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (63)..(63)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (98)..(98)
<223>  Xaa can be any naturally occurring amino acid

<400>  106

Ser Ile Asp Gly Asn Ser Val Ile Asp Ile Asn Asp Asn Arg Ile Ala 
1               5                   10                  15      


Ile Arg Asp Ala Phe Ala Ala Ile Lys Gln Met Asn Gln Asp Thr Val 
            20                  25                  30          


Ile Val Leu Pro Asn Gly Thr Gln Val Val Pro Ala Pro Ser Asp Val 
        35                  40                  45              


Glu Leu Thr Thr Lys Thr Tyr Asp Ala Asn Thr Asp Thr Ile Xaa Asp 
    50                  55                  60                  


Met Pro Ile Lys Tyr Ile Met Arg His Lys Val Arg Lys Ala Lys Tyr 
65                  70                  75                  80  


Lys Ile Thr Thr Glu Ser Gly Lys Gln Val Ile Val Thr Gly Asp His 
                85                  90                  95      


Ser Xaa Met Val Ile Arg Asp Gly Val Leu Ile Ser Val Thr Ala Gln 
            100                 105                 110         


Glu Ile Asn Pro Glu Thr Asp Lys Ile Ile Thr Ile 
        115                 120                 


<210>  107
<211>  47
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (4)..(4)
<223>  Xaa can be any naturally occurring amino acid

<400>  107

Met Glu Phe Xaa Gln Asn Asn Tyr Lys Val Glu Asn Ile Lys Ser Ile 
1               5                   10                  15      


Glu Val Leu Pro Asp Phe Glu Asp Glu Asp Val Tyr Asp Leu Glu Val 
            20                  25                  30          


Gly Gly Thr His Met Phe Phe Ala Asn Asp Ile Leu Val His Asn 
        35                  40                  45          


<210>  108
<211>  128
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (48)..(48)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (52)..(52)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (97)..(97)
<223>  Xaa can be any naturally occurring amino acid

<400>  108

Ser Leu Ala Lys Lys Ser Leu Leu Leu Ile Lys Asp Thr Lys Asn Ile 
1               5                   10                  15      


Lys Asn Lys Ile Thr Ile Glu Asp Leu Phe Asn Gln Ser Leu Asp Lys 
            20                  25                  30          


Asn Gly Leu Ser Asp Ile Thr Gln Asn Asn Gln Glu Ile Val Lys Xaa 
        35                  40                  45              


Asp Gln Gln Xaa Leu Asn Trp Thr Lys Glu Asn Gly Leu Gln Tyr Val 
    50                  55                  60                  


Pro Ile Lys Tyr Ile Met Arg His Lys Val Ser Lys Glu Gln Phe Lys 
65                  70                  75                  80  


Ile Lys Thr Lys Ser Gly Lys Glu Ile Ile Val Thr Gly Asp His Ser 
                85                  90                  95      


Xaa Ile Val Phe Arg Asn Gly Lys Gln Leu Thr Ile Lys Ala Arg Asp 
            100                 105                 110         


Ile Asn Lys Ser Thr Asp Lys Ile Leu Ser Ile Ile Asn Asn Glu Glu 
        115                 120                 125             


<210>  109
<211>  44
<212>  PRT
<213>  Unknown

<220>
<223>  IntC from sheep gut metagenome

<400>  109

Met Leu Glu Tyr Gln Phe Glu Glu Ile Glu Ser Ile Glu Gln Leu Asp 
1               5                   10                  15      


Asn Phe Asn Asp Glu Tyr Val Tyr Asp Ile Glu Val Asp Asp Asn Ser 
            20                  25                  30          


His Thr Phe Ile Ala Asn Asp Ile Leu Val His Asn 
        35                  40                  


<210>  110
<211>  169
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (4)..(4)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (48)..(48)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (97)..(97)
<223>  Xaa can be any naturally occurring amino acid

<400>  110

Ser Thr Phe Xaa Lys Ser Leu Leu Leu Ile Lys Asp Asn Lys Asn Ile 
1               5                   10                  15      


Glu Asn Lys Val Thr Ile Glu Ser Leu Phe Asn Lys Ser Leu Met Asp 
            20                  25                  30          


Asn Gly Leu Thr Asp Leu Thr Gln Asn Asn Gln Glu Ile Val Lys Xaa 
        35                  40                  45              


Asp Tyr Ser Thr Leu Asn Trp Thr Glu Glu Lys Gly Leu Glu Tyr Val 
    50                  55                  60                  


Pro Ile Lys Tyr Ile Met Arg His Lys Val Ser Lys Gln Gln Phe Lys 
65                  70                  75                  80  


Ile Lys Thr Lys Ser Gly Lys Glu Ile Ile Val Thr Gly Asp His Ser 
                85                  90                  95      


Xaa Ile Val Phe Arg Asn Gly Glu Glu Thr Val Val Lys Ala Lys Asp 
            100                 105                 110         


Ile Asn Lys Asp Thr Asp Lys Ile Leu Ser Ile Ile Thr Phe Asn Lys 
        115                 120                 125             


Tyr Gln Ile Glu Asn Ile Lys Ser Ile Glu Gln Leu Glu Asp Phe Asp 
    130                 135                 140                 


Asn Glu Tyr Val Tyr Asp Ile Glu Val Asp Asp Asp Ser His Thr Phe 
145                 150                 155                 160 


Ile Ala Asn Asp Ile Leu Val His Asn 
                165                 


<210>  111
<211>  15
<212>  PRT
<213>  Unknown

<220>
<223>  IntN from sheep gut metagenome

<400>  111

Ser Val His Gly Lys Thr His Val Phe Ile Arg Ser Ile Lys Asn 
1               5                   10                  15  


<210>  112
<211>  168
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (96)..(96)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (150)..(150)
<223>  Xaa can be any naturally occurring amino acid

<400>  112

Met Gln Glu Ala Lys Ile Asp Ile Lys Ser Leu Tyr Asp Ser Leu Ala 
1               5                   10                  15      


Lys Lys Tyr Asp Val Gln His Lys Asn Ser Tyr Glu Val Ile Tyr Pro 
            20                  25                  30          


Lys Gly Tyr Glu Ile Lys Val Leu Gly Asn Lys Tyr Val Lys Leu Val 
        35                  40                  45              


Ala Met Ser Arg His Lys Thr Gln Lys His Leu Val Lys Ile Val Val 
    50                  55                  60                  


Lys Ser Glu Lys Thr Ile Asp Ser Leu Asp Pro Ile Arg Gln Lys Ser 
65                  70                  75                  80  


Leu Leu Lys Lys Gln Asp Glu Val Val Val Thr Thr Asp His Ile Xaa 
                85                  90                  95      


Met Val Tyr Asn Asp Asp His Phe Phe Glu Asn Val Asn Ala Lys Asn 
            100                 105                 110         


Leu Lys Val Gly Asn Tyr Val Ser Val Tyr Asp Glu Ala Ser Asp Lys 
        115                 120                 125             


Glu Val Ile Gly Glu Ile Ala Ser Ile Glu Asp Leu Gly Met Thr Asp 
    130                 135                 140                 


Asp Tyr Val Tyr Asp Xaa Glu Val Asp Asp Asp Ser His Ala Phe Tyr 
145                 150                 155                 160 


Ala Ser Asn Ile Leu Val His Asn 
                165             


<210>  113
<211>  171
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (2)..(2)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (111)..(111)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (139)..(139)
<223>  Xaa can be any naturally occurring amino acid

<400>  113

Ser Xaa Asn Phe Asp Thr Leu Ile Arg Thr Lys Asn Tyr Pro Asp Gly 
1               5                   10                  15      


Ile Thr Ile Glu Asp Phe Tyr Asn Ile Asn Ser Glu Asn Lys Gly Asp 
            20                  25                  30          


Thr Thr Leu Val Gly His Glu Ser Val Tyr Thr Thr Asp Lys Val Leu 
        35                  40                  45              


Asn Phe Lys Gly Asn Gln Leu Thr Asn Thr Ser Gln Leu Tyr Tyr Gly 
    50                  55                  60                  


Asp Val Lys Arg Ile Ile Arg His Lys Val Ser Lys Pro Lys Trp Val 
65                  70                  75                  80  


Ile Thr Thr Arg Phe Gly His Ser Val Thr Val Thr Asn Asp His Ser 
                85                  90                  95      


Met Met Val Leu Arg Asp Asp Lys Leu Tyr Met Val Lys Pro Xaa Glu 
            100                 105                 110         


Ile His Pro Gly Asp Asp Val Leu Ser Leu Glu Val Met Tyr Gly Asp 
        115                 120                 125             


Phe Val Glu Gly Ser Asp Lys Val Thr Ser Xaa Leu His Val Gly Glu 
    130                 135                 140                 


Tyr Glu Asp Glu Tyr Val Tyr Asp Ile Glu Met Gly Asp Asp Thr His 
145                 150                 155                 160 


Thr Phe Phe Ala Asn Asp Ile Leu Val His Asn 
                165                 170     


<210>  114
<211>  74
<212>  PRT
<213>  Unknown

<220>
<223>  IntN from sheep gut metagenome

<400>  114

Ser Val Ser Lys Asp Thr Ile Ile Arg Thr Lys Lys His Val Asp Gly 
1               5                   10                  15      


Ile Thr Ile Glu Asp Phe Tyr Glu Glu Asn Ser Glu Lys Asp Val Phe 
            20                  25                  30          


Glu Ile Lys Thr Glu Ser Gly Lys Ile Ile Arg Tyr Asn Lys Asn Asp 
        35                  40                  45              


Lys Val Lys Val Lys Arg Ser Asp Lys Ile Ile Tyr Val Tyr Pro Asp 
    50                  55                  60                  


Asp Ile Leu Tyr Glu Asp Glu Ile Phe Thr 
65                  70                  


<210>  115
<211>  159
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (99)..(99)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (127)..(127)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (129)..(129)
<223>  Xaa can be any naturally occurring amino acid

<400>  115

Met Ser Gln Phe Glu Lys Ile Val Ser Ile Arg Lys Ile Arg Ser Ser 
1               5                   10                  15      


Asn Lys Gly Glu Ser Thr Leu Ala Gly His Glu Ser Val Tyr Thr Asp 
            20                  25                  30          


Asp Lys Val Leu Asn Phe Lys Gly Ser Ile Leu Thr Asn Ile Ser Gln 
        35                  40                  45              


Leu Tyr Tyr Gly Ser Val Lys Arg Ile Ile Arg His Lys Val Ser Lys 
    50                  55                  60                  


Pro Lys Trp Thr Ile Thr Thr Arg Phe Gly His Ser Val Thr Val Thr 
65                  70                  75                  80  


Asn Asp His Ser Met Met Val Leu Arg Asp Asp Lys Leu Tyr Lys Val 
                85                  90                  95      


Lys Pro Xaa Glu Ile His Pro Asp Asp Tyr Val Leu Ser Leu Glu Ala 
            100                 105                 110         


Met Tyr Gly Asp Ile Val Glu Gly Ser Asp Lys Val Val Ser Xaa Leu 
        115                 120                 125             


Xaa Thr Gly Glu Tyr Asp Asn Glu Tyr Val Tyr Asp Ile Glu Met Asp 
    130                 135                 140                 


Asp Asp Thr His Thr Phe Phe Ala Asn Asp Ile Leu Val His Asn 
145                 150                 155                 


<210>  116
<211>  170
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (48)..(48)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (98)..(99)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (104)..(105)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (108)..(108)
<223>  Xaa can be any naturally occurring amino acid

<400>  116

Ser Leu Ala Lys Asn Ser Leu Ile Leu Ile Glu Asp Asn Lys Asn Thr 
1               5                   10                  15      


Lys Asp Lys Ile Tyr Ile Glu Ser Leu Phe Asn Lys Ala Leu Arg Asp 
            20                  25                  30          


Asn Gly Leu Glu Asp Ile Ser Gln Asn Asn Gln Glu Ile Val Lys Xaa 
        35                  40                  45              


Asp Asp Tyr Asn Val Leu Asn Trp Thr Lys Glu Asn Gly Leu Gln Tyr 
    50                  55                  60                  


Val Pro Ile Lys Tyr Ile Met Arg His Lys Val Ser Lys Pro Lys Phe 
65                  70                  75                  80  


Lys Ile Lys Thr Lys Ser Gly Lys Glu Ile Ile Val Thr Gly Asp His 
                85                  90                  95      


Ser Xaa Xaa Val Phe Arg Asn Xaa Xaa Gln Leu Xaa Ile Lys Ala Lys 
            100                 105                 110         


Asp Ile Asn Lys Asp Thr Asp Lys Ile Leu Ser Ile Ile Tyr Asp Met 
        115                 120                 125             


Lys Tyr Ile Ile Glu Glu Ile Glu Ser Val Glu Gln Leu Asp Asn Phe 
    130                 135                 140                 


Asn Asp Glu Tyr Val Tyr Asp Ile Glu Val Asp Asp Asn Ser His Thr 
145                 150                 155                 160 


Phe Ile Ala Asn Asp Ile Leu Val His Asn 
                165                 170 


<210>  117
<211>  120
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from marine sediment metagenome LCGC14


<220>
<221>  misc_feature
<222>  (24)..(24)
<223>  Xaa can be any naturally occurring amino acid

<400>  117

Ser Val Asp Ala Asp Thr Ile Ile Lys Thr Asn Tyr Gly Glu Met Thr 
1               5                   10                  15      


Ile Glu Asn Leu Phe Lys Ser Xaa Ser Ile Lys Gly Pro Ser Trp Ala 
            20                  25                  30          


Ile Asp Asp Gln Glu Phe Thr Ile Tyr Asp Gln Ile Gln Ile Leu Thr 
        35                  40                  45              


Tyr Asp Pro Lys Thr Asn Glu Glu Ile Tyr Arg Pro Phe Glu Tyr Val 
    50                  55                  60                  


Tyr Arg His Lys Val Ser Lys Pro Arg Trp Lys Ile Ile Asp Glu Asn 
65                  70                  75                  80  


Gly Asn Glu Ile Ile Leu Thr Asn Asp His Ser Val Met Ile Glu Arg 
                85                  90                  95      


Asp Gly Lys Leu Ile Glu Ala Lys Pro Ser Glu Ile Asn Pro Asp Thr 
            100                 105                 110         


Asp Ile Leu Ile Thr Ile Gly Glu 
        115                 120 


<210>  118
<211>  42
<212>  PRT
<213>  Unknown

<220>
<223>  IntC from Marine sediment metagenome LCGC14

<400>  118

Met Val Glu Lys Leu Lys Ile Gln Lys Ile Glu Lys Leu Glu Asp Phe 
1               5                   10                  15      


Asp Asn Glu Tyr Val Tyr Asp Ile Ser Val Asp Lys Glu Thr Pro Tyr 
            20                  25                  30          


Phe Phe Gly Asn Asn Ile Leu Val His Asn 
        35                  40          


<210>  119
<211>  469
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Stanford USA drinking water 
       system tap filter metagenome genome assembly


<220>
<221>  misc_feature
<222>  (106)..(106)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (111)..(111)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (121)..(121)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (240)..(240)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (284)..(284)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (290)..(290)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (304)..(304)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (415)..(415)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (423)..(423)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (444)..(444)
<223>  Xaa can be any naturally occurring amino acid

<400>  119

Ser Phe Ser Ser Ile Thr Pro Leu Leu Ile His Asp Asp His Gly Gln 
1               5                   10                  15      


Ile Asp Ile Val Gln Ala Lys Asp Leu Val Lys Pro Asn Asp Pro Ala 
            20                  25                  30          


Trp Met Ser Lys Glu Asp Phe Trp Lys Asn Tyr Asn Pro Asp Pro Val 
        35                  40                  45              


Ser Thr Thr Lys Ala Ser Ser Thr Ser Ile Ser Thr Thr Lys Thr Gly 
    50                  55                  60                  


Arg Val Ser Lys Ala Thr Ser Ser Asn Ser Thr Thr Lys Pro Ser Gly 
65                  70                  75                  80  


Thr Ser Asn Thr Ile Ala Ile Glu Pro Thr Ser Thr Arg Gly Ser Arg 
                85                  90                  95      


Ser Arg Thr Ser Arg Ser Ser Asn Ser Xaa Asp Ile Ala Ser Xaa Ser 
            100                 105                 110         


Asn Asp Asn Thr Ile Pro Gln Ser Xaa Ser Ser Glu Thr Pro Asp Thr 
        115                 120                 125             


Ser Gln Phe Ser Gly Ser Thr Gln Gln Tyr Met Asp Ile Arg Asp Lys 
    130                 135                 140                 


Asn Tyr Asn Ile Trp Ser Glu Lys Gly Trp Thr Gln Ile Lys Tyr Ile 
145                 150                 155                 160 


Met Arg His Lys Thr Gly Lys Gln Met Tyr Arg Ile Asn Thr His Thr 
                165                 170                 175     


Gly Val Ile Asp Val Thr Glu Asp His Ser Leu Leu Asp Ile Lys Gly 
            180                 185                 190         


Asp Pro Val Thr Pro Asn Glu Val Lys Ile Gly Ser Glu Leu Leu His 
        195                 200                 205             


His Asp Leu Pro Asn Val Val Asn Glu His Gln Asn Val Ile Ile Asp 
    210                 215                 220                 


Ala Asp Thr Ala Trp Leu Tyr Gly Phe Phe Tyr Ala Glu Gly Thr Xaa 
225                 230                 235                 240 


Gly Thr Tyr Gln Tyr Lys Lys Gly Pro Arg Ser Ser Trp Ser Ile Ser 
                245                 250                 255     


Asn Gln Asp Leu Ala Pro Met Asn Lys Ala Leu Glu Ile Leu Gln Arg 
            260                 265                 270         


Ile Glu Pro Asn Tyr Lys Phe Lys Ile Asp Asn Xaa Met Lys Ser Ser 
        275                 280                 285             


Ala Xaa His Lys Leu Ser Ala Arg Asn Gly Ser Thr Ser Lys Asn Xaa 
    290                 295                 300                 


Pro Arg Glu Tyr His Val Ser Ser Leu Val Glu Lys Tyr His Asn Met 
305                 310                 315                 320 


Phe His Ala Asp Ser Gly Gln Gln Arg His Asn Ile Lys Ser Asn Ser 
                325                 330                 335     


Tyr Thr Ala Leu Tyr Tyr Lys Lys Val Pro Lys Glu Ile Leu Asn Ala 
            340                 345                 350         


Ser Asn Asp Ile Lys Lys Ser Phe Tyr Asp Gly Tyr Tyr Val Gly Asp 
        355                 360                 365             


Gly Leu Lys Ala Thr Thr Ala Asn Glu Val Phe Asp Asn Lys Gly Gln 
    370                 375                 380                 


Ile Gly Ala Ala Gly Leu Tyr Tyr Ile Ala Ser Ala Leu Gly Tyr Asp 
385                 390                 395                 400 


Val Ser Leu Tyr Leu Arg Glu Asp Lys Pro Gln Ile Phe Arg Xaa Thr 
                405                 410                 415     


Leu Ser Lys Asp Thr Lys Xaa Val Lys Ser Val Gln Arg Lys Asn Arg 
            420                 425                 430         


Asn Ala Ile Lys Lys Ile Ile Pro Leu Gly Thr Xaa Asp Asp Tyr Val 
        435                 440                 445             


Tyr Asp Ile Glu Thr Glu Asn His His Phe Ala Ala Gly Ile Gly Arg 
    450                 455                 460                 


Met Val Val His Asn 
465                 


<210>  120
<211>  118
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Stanford USA drinking water 
       system tap filter metagenome genome assembly


<220>
<221>  misc_feature
<222>  (42)..(42)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (74)..(74)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (82)..(82)
<223>  Xaa can be any naturally occurring amino acid

<400>  120

Ser Val Asp Phe Gln Thr Leu Met Lys Thr Glu Lys Asp Glu Ile Thr 
1               5                   10                  15      


Ile Glu Glu Leu Tyr Asn Arg Asn Ile Ile Asn Gly Ser Ala Gly Ile 
            20                  25                  30          


Thr Leu Asn Gly His Glu Ser Val Lys Xaa Thr Asp Ile Ile Leu Asn 
        35                  40                  45              


Tyr Ser Lys Ser Lys Gly Leu Tyr Phe Asn Asn Val Arg Arg Ile Ile 
    50                  55                  60                  


Arg His Lys Val Ser Lys Glu Lys Trp Xaa Leu Lys Thr Thr Asn Gly 
65                  70                  75                  80  


Lys Xaa Ile Tyr Ile Thr Asn Asp His Ser Leu Ile Val Phe Arg Asp 
                85                  90                  95      


Gly Glu Lys Leu Glu Ile Lys Pro Lys Asp Val Leu Lys Ile Asp Lys 
            100                 105                 110         


Val Leu Thr Ile Lys Lys 
        115             


<210>  121
<211>  52
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC from Lake Huron low oxygen high sulfur sink hole 
       purple microbial mat bin unclassified.02


<220>
<221>  misc_feature
<222>  (14)..(14)
<223>  Xaa can be any naturally occurring amino acid

<400>  121

Met Met Glu Phe Glu Tyr Glu Leu Val Asp Ile Glu Ser Xaa Glu Leu 
1               5                   10                  15      


Val Gly Ile Phe Asp Asp Glu Tyr Val Tyr Asp Ile Glu Val Glu Glu 
            20                  25                  30          


Asp Glu Asn Asp Val Glu Asp Thr His Thr Phe Phe Gly Asn Asp Ile 
        35                  40                  45              


Leu Val His Asn 
    50          


<210>  122
<211>  125
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from Rattus norvegicus (Malaysia) gut metagenome


<220>
<221>  misc_feature
<222>  (45)..(46)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (96)..(96)
<223>  Xaa can be any naturally occurring amino acid

<400>  122

Ser Ser Ser Ser Asn Ser Asn Ile Tyr Ile Lys Arg Gly Asn Gln Val 
1               5                   10                  15      


Leu Lys Arg Thr Phe Ile Glu Leu Trp Arg Asp Thr Val Glu Glu His 
            20                  25                  30          


Met Pro Phe Gly Leu Gly Thr His Gly Gln Glu Leu Xaa Xaa Ser Asp 
        35                  40                  45              


Asp Tyr Val Leu Asn Tyr Asp Glu Lys Arg Gly Leu His Trp Val Pro 
    50                  55                  60                  


Ile Lys Tyr Ile Met Lys His Gly Thr Lys Lys Arg Lys Phe Arg Val 
65                  70                  75                  80  


Lys Thr Lys Ser Gly Lys Glu Ile Ile Val Thr Glu Asp His Ser Xaa 
                85                  90                  95      


Val Val Ile Arg Asn Gly Glu Lys Ile Ala Val Lys Ala Ser Glu Ile 
            100                 105                 110         


Asn Lys Asp Thr Asp Lys Ile Val Ser Ile Ser Thr Lys 
        115                 120                 125 


<210>  123
<211>  49
<212>  PRT
<213>  Rattus norvegicus

<400>  123

Met Ile Glu Glu Gly Phe Asp Tyr Ile Ile Glu Asp Ile Glu Ser Val 
1               5                   10                  15      


Glu Asp Ile Gly Phe Phe Asp Glu Asp Glu Pro Val Phe Asp Ile Glu 
            20                  25                  30          


Val Glu Asp Asp Thr His Thr Phe Ile Gly Asn Asp Val Leu Val His 
        35                  40                  45              


Asn 
    


<210>  124
<211>  125
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from Rattus norvegicus (Denmark: Riget) gut 
       metagenome


<220>
<221>  misc_feature
<222>  (45)..(46)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (96)..(96)
<223>  Xaa can be any naturally occurring amino acid

<400>  124

Ser Ser Ser Ser Asn Ser Asn Leu Tyr Ile Lys Arg Gly Asn Gln Val 
1               5                   10                  15      


Leu Lys Arg Thr Phe Ile Glu Leu Trp Arg Asp Thr Val Glu Glu His 
            20                  25                  30          


Met Pro Phe Gly Leu Gly Thr His Gly Gln Glu Leu Xaa Xaa Ser Asp 
        35                  40                  45              


Asp Tyr Val Leu Asn Tyr Asp Glu Lys Arg Gly Leu His Trp Val Pro 
    50                  55                  60                  


Ile Lys Tyr Ile Met Lys His Gly Thr Lys Lys Arg Lys Phe Arg Val 
65                  70                  75                  80  


Lys Thr Lys Ser Gly Lys Glu Ile Ile Val Thr Glu Asp His Ser Xaa 
                85                  90                  95      


Val Val Ile Arg Asn Gly Glu Lys Ile Ala Val Lys Ala Ser Glu Ile 
            100                 105                 110         


Asn Lys Asp Thr Asp Lys Ile Val Ser Ile Ser Thr Lys 
        115                 120                 125 


<210>  125
<211>  49
<212>  PRT
<213>  Rattus norvegicus

<400>  125

Met Ile Glu Glu Gly Phe Asp Tyr Ile Ile Glu Glu Ile Glu Ser Val 
1               5                   10                  15      


Glu Asp Ile Gly Phe Phe Asp Glu Asp Glu Pro Val Phe Asp Ile Glu 
            20                  25                  30          


Val Glu Asp Asp Thr His Thr Phe Ile Gly Asn Asp Val Leu Val His 
        35                  40                  45              


Asn 
    


<210>  126
<211>  198
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (24)..(24)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (52)..(52)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (178)..(178)
<223>  Xaa can be any naturally occurring amino acid

<400>  126

Ser Phe Thr Glu Asp Thr Pro Val Phe Val Arg Asn Val Lys Asn Gly 
1               5                   10                  15      


Ala Ile Asp Ile Lys Pro Ile Xaa Glu Leu Ile Asp Glu Asn Lys Ile 
            20                  25                  30          


Glu Thr Asp Ala Leu Gly Arg Glu Tyr Asp Tyr Ser Pro Lys Pro Tyr 
        35                  40                  45              


Gln Val Leu Xaa Arg Ser Gly Trp Val Thr Pro Ser Tyr Ile Tyr Arg 
    50                  55                  60                  


His Lys Thr Asp Lys Asp Ile Tyr Glu Ile Thr Asp Gly Asp Met Lys 
65                  70                  75                  80  


Ile Glu Val Thr Glu Asp His Ser Leu Phe Asn Asp Lys Lys Glu Lys 
                85                  90                  95      


Ile Lys Pro Ser Glu Val Asn Asn Glu Thr His Leu Glu Tyr Phe Asn 
            100                 105                 110         


Asp Tyr Glu Val Phe Lys Lys Ala Lys Trp Leu Pro Ala Asp Thr Arg 
        115                 120                 125             


Asn Pro His Phe Tyr Ala Lys Ala Leu Ala Asn Gly Lys Ile Asp Arg 
    130                 135                 140                 


Val Pro Ser Trp Phe Leu Asn Arg Pro Thr Lys Glu Gly Arg Glu Phe 
145                 150                 155                 160 


Tyr Glu Val Phe Ile Lys Asn Tyr Arg Asp Asp Ile Gln Tyr Ser Lys 
                165                 170                 175     


Thr Xaa Leu Ala Gly Leu Tyr Phe Leu Lys Met Ile Ser Glu Val Ser 
            180                 185                 190         


Thr Tyr Ser Gly Ile Lys 
        195             


<210>  127
<211>  34
<212>  PRT
<213>  Unknown

<220>
<223>  IntC from sheep gut metagenome

<400>  127

Met Leu Glu Lys Thr Asn Lys Gly Lys Thr Ser Ala Tyr Val Tyr Asp 
1               5                   10                  15      


Ile Ser Leu Asp Gly Thr Val Val Asn Ala Leu Gly Met Asn Val Asn 
            20                  25                  30          


Ser Asn 
        


<210>  128
<211>  195
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (52)..(52)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (179)..(179)
<223>  Xaa can be any naturally occurring amino acid

<400>  128

Ser Phe Thr Pro Asp Thr Pro Met Phe Ile Lys Tyr Lys Asp Ser Gly 
1               5                   10                  15      


Leu Ile Asp Ile Lys Pro Ile Glu Glu Leu Ile Asn Glu Lys Glu Ile 
            20                  25                  30          


Lys Ile Asp Ala Leu Gly Arg Glu Tyr Asp Tyr Ser Lys Lys Asp Tyr 
        35                  40                  45              


Tyr Val Leu Xaa Arg Ser Gly Trp Val Glu Pro Ser Tyr Ile Tyr Arg 
    50                  55                  60                  


His Lys Thr Glu Lys Asp Ile Tyr Glu Ile Thr Asp Gly Glu Met Lys 
65                  70                  75                  80  


Val Glu Val Thr Glu Asp His Ser Leu Phe Asn Ser Lys Gln Glu Lys 
                85                  90                  95      


Ile Lys Pro Ser Glu Ile Thr Asn Lys Thr Glu Leu Glu Tyr Tyr Thr 
            100                 105                 110         


Glu Glu Val Arg Pro Asp Gly Tyr Thr Arg Tyr Phe Ala Thr Gln Ala 
        115                 120                 125             


Pro Gln Ala Lys Gln Met Ala Met Asp Leu Ala Asn Gly Lys Ile Asp 
    130                 135                 140                 


Arg Val Pro Met Lys Val Leu Asn Tyr Gln Glu Val Tyr Gln Lys Ile 
145                 150                 155                 160 


Phe Tyr Asp Thr Phe Ile Glu Asn Tyr Lys Asn Asp Ile Lys Tyr Ser 
                165                 170                 175     


Lys Thr Xaa Leu Ala Gly Leu Gln Tyr Ile Arg Lys Met Leu Met Glu 
            180                 185                 190         


Lys Lys Phe 
        195 


<210>  129
<211>  33
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (9)..(9)
<223>  Xaa can be any naturally occurring amino acid

<400>  129

Met Lys Ile Leu Asn Lys Gly Lys Xaa Leu Asp Tyr Val Tyr Asp Ile 
1               5                   10                  15      


Ser Leu Asp Gly Thr Val Val Asn Ala Leu Gly Met Asn Ile Leu Ser 
            20                  25                  30          


Asn 
    


<210>  130
<211>  188
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from cow rumen metagenome


<220>
<221>  misc_feature
<222>  (50)..(50)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (153)..(153)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (185)..(185)
<223>  Xaa can be any naturally occurring amino acid

<400>  130

Ser Val Thr Glu Asp Thr Pro Leu Phe Val Arg Lys Asn Gly Met Ile 
1               5                   10                  15      


Asp Ile Lys Pro Ile Gly Glu Leu Phe Gly Ser Glu Asn Ile Glu Ser 
            20                  25                  30          


Tyr Pro Asp Gly Arg Glu Tyr Asp Arg Asn Glu Lys Asp Tyr Glu Val 
        35                  40                  45              


Leu Xaa Arg Ser Gly Trp Val Arg Pro Ser Tyr Ile Tyr Arg His Asn 
    50                  55                  60                  


Thr Asp Lys Asp Val Tyr Lys Val Thr Gly Asp Gly Ile Glu Val Asp 
65                  70                  75                  80  


Ala Thr Arg Asp His Ser Leu Phe Asp Ala Asp Arg Asn Glu Ile Lys 
                85                  90                  95      


Pro Thr Glu Ile Ser Arg Gly Thr Lys Leu Glu Thr Tyr Ser Gly Asp 
            100                 105                 110         


Ser Leu Phe Asp Phe Asn Thr Ile Glu Leu Ser Asp Phe Ala Ile Asp 
        115                 120                 125             


Val Ala Ala Met Leu Val Met Thr Gly Lys Thr Asp Arg Ile Pro Asp 
    130                 135                 140                 


Glu Ile Leu Asn Ala Thr Leu Glu Xaa Lys Ala Lys Phe Ile Asp Leu 
145                 150                 155                 160 


Leu Lys Asn Ala Glu Val Thr Thr Asp Arg Tyr Pro Lys Thr Leu Val 
                165                 170                 175     


Ala Gly Tyr Gln Tyr Ile Lys Asn Xaa Val Leu Leu 
            180                 185             


<210>  131
<211>  39
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC from cow rumen metagenome


<220>
<221>  misc_feature
<222>  (24)..(24)
<223>  Xaa can be any naturally occurring amino acid

<400>  131

Met Gln Asn Asn Ile Ile Met Gln Ala Ile Lys Leu Lys Ser Glu Lys 
1               5                   10                  15      


Arg Thr Val Tyr Asp Ile Ser Xaa Asp Gly Thr Phe Val Asn Ala Leu 
            20                  25                  30          


Gly Met Asn Val Leu His Asn 
        35                  


<210>  132
<211>  188
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from Prevotella species CAG 1092 phage contig 445


<220>
<221>  misc_feature
<222>  (52)..(52)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (83)..(83)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (85)..(85)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (120)..(120)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (152)..(152)
<223>  Xaa can be any naturally occurring amino acid

<400>  132

Ser Phe Thr Gly Asp Thr Pro Val Phe Ile Lys Tyr Asp Asn Thr Asn 
1               5                   10                  15      


Leu Ile Asp Ile Lys Pro Ile Ser Glu Leu Ile Asp Ile Asp Asn Ile 
            20                  25                  30          


Asp Lys Asp Val Leu Gly Arg Glu Tyr Asp Thr Ser Glu Lys Asn Tyr 
        35                  40                  45              


Ser Ile Leu Xaa Arg Ser Gly Trp Tyr Lys Pro Ser Tyr Ile Tyr Arg 
    50                  55                  60                  


His Lys Thr Val Lys Asn Ile Tyr Arg Val Glu Asp Asn Thr Pro Ser 
65                  70                  75                  80  


Ala Gly Xaa Ile Xaa Asp Ile Thr Glu Asp His Ser Leu Phe Asn Asp 
                85                  90                  95      


Glu Arg Glu Lys Ile Lys Pro Ser Glu Ile Gly Gln Asn Thr Lys Leu 
            100                 105                 110         


Glu Tyr Lys Ser Lys Ile Phe Xaa Arg Arg Thr His Thr Ile Ser Asp 
        115                 120                 125             


Asp Lys Phe Asn Lys Leu Leu Asp Phe Thr Val Lys Phe Pro Ile Lys 
    130                 135                 140                 


Ile Pro Ile Glu Ile Leu Asn Xaa Glu Ile Asn Thr Arg Lys Arg Phe 
145                 150                 155                 160 


Ala Tyr Glu Leu His Lys Arg Leu Lys Asp Ser Ile Asp Ile Gly His 
                165                 170                 175     


Tyr Ser Lys Val Phe Val Ala Gly Phe Lys Phe Leu 
            180                 185             


<210>  133
<211>  34
<212>  PRT
<213>  Prevotella species CAG 1092 phage

<400>  133

Met Ile Gln Val Glu Asn Ile Gly Lys Thr Asp Asp Tyr Val Tyr Asp 
1               5                   10                  15      


Ile Ser Leu Asp Gly Thr Val Val Asn Ala Leu Gly Met Asn Val Leu 
            20                  25                  30          


Ser Asn 
        


<210>  134
<211>  199
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from Lake Huron low oxygen high sulfur sink hole 
       purple microbial mat bin unclassified.01


<220>
<221>  misc_feature
<222>  (52)..(52)
<223>  Xaa can be any naturally occurring amino acid

<400>  134

Ser Phe Leu Tyr Asp Thr Pro Ile Tyr Ile Lys Tyr Lys Asn Ser Asp 
1               5                   10                  15      


Ile Ile Asp Ile Lys Ser Ile Gly Gly Val Phe Asn Ser Asn Glu Ser 
            20                  25                  30          


Asp Ile Asp Glu Leu Gly Arg Glu Tyr Asp Leu Ser Glu Lys Pro Tyr 
        35                  40                  45              


Gln Val Leu Xaa Arg Gly Gly Trp Ile Asp Val Asn Tyr Val Tyr Arg 
    50                  55                  60                  


His Lys Thr Asp Lys Gln Ile His Arg Ile Ser Phe Asn Asp Gly Tyr 
65                  70                  75                  80  


Val Asp Val Thr Ala Asp His Ser Val Phe Asn Glu Asn Lys Glu Lys 
                85                  90                  95      


Leu His Ser Lys Asp Val Ile Pro Asn Glu Thr Lys Leu Glu Met Ala 
            100                 105                 110         


Asn Leu Asp Tyr Ser Asn Phe Ile Ile Lys Gln Gln Asn Leu Asn Val 
        115                 120                 125             


Glu Thr Ile Glu Lys Met Ala Ser Val Leu Ala Leu Ser Val Asp Ile 
    130                 135                 140                 


Asn Lys Lys Val Pro Ala Glu Ile Ile Asn Thr Asn Lys Glu Asn Gln 
145                 150                 155                 160 


Ile Ile Phe Leu Asn Lys Phe Met Ser Val Thr Lys Leu Asn Lys Val 
                165                 170                 175     


Ser Gln Asp Asn Glu Asn Lys Val Leu Lys Ala Gly Ile Leu Phe Leu 
            180                 185                 190         


Val Asn Arg Thr Lys Asn Asn 
        195                 


<210>  135
<211>  40
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC from Lake Huron low oxygen high sulfur sink hole 
       purple microbial mat bin unclassified.01


<220>
<221>  misc_feature
<222>  (24)..(24)
<223>  Xaa can be any naturally occurring amino acid

<400>  135

Met Asn Asn Leu Val Leu Gly Asn Glu Ile Ile Asp Asn Tyr Asp Pro 
1               5                   10                  15      


Glu Met Tyr Val Tyr Asp Ile Xaa Leu Asp Gly Thr Leu Ile Asn Ala 
            20                  25                  30          


Leu Gly Asn Asn Val Val Thr Gln 
        35                  40  


<210>  136
<211>  186
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (50)..(50)
<223>  Xaa can be any naturally occurring amino acid

<400>  136

Ser Phe Thr Ser Asp Thr Pro Val Phe Ile Lys Tyr Asp Lys Ser Gly 
1               5                   10                  15      


Leu Ile Asp Ile Lys Ala Ile Ser Glu Leu Ile Gly Glu Thr Glu Val 
            20                  25                  30          


Asp Gly Leu Gly Arg Glu Tyr Asp Tyr Ser Glu Lys Asp Tyr Thr Val 
        35                  40                  45              


Leu Xaa Arg Ser Gly Trp Val Lys Pro Lys Tyr Ile Tyr Arg His Lys 
    50                  55                  60                  


Thr Asp Lys Asp Ile Tyr Arg Val Glu Asp Asn Gly Ala Met Ile Asp 
65                  70                  75                  80  


Val Thr Glu Asp His Ser Leu Phe Asn Asp Lys Gln Glu Lys Ile Lys 
                85                  90                  95      


Pro Ser Glu Ile Asn Asp Glu Thr Lys Leu Glu Tyr Tyr Lys Asp Glu 
            100                 105                 110         


Ile Thr Pro Glu Gly Thr Met Lys Trp Leu Asn Glu Thr Arg Ala Arg 
        115                 120                 125             


Arg Leu Ala Lys Trp Ile Lys Asp Gly Thr Leu Thr Glu Val Pro Leu 
    130                 135                 140                 


Pro Ile Leu Asn Ser Met Asn Arg Gly His Ile Asn Ala Phe Leu Asp 
145                 150                 155                 160 


Glu Leu Gly Asn Trp Asp Tyr Ser Lys Ser Ser Lys Val Leu Gln Ala 
                165                 170                 175     


Gly Ile Met Tyr Val Lys Asn Lys Arg Arg 
            180                 185     


<210>  137
<211>  34
<212>  PRT
<213>  Unknown

<220>
<223>  IntC from sheep gut metagenome

<400>  137

Met Ala Gln Ala Thr Lys Ile Asn Lys Thr Asp Asp Tyr Val Tyr Asp 
1               5                   10                  15      


Ile Ser Leu Asp Gly Thr Val Val Asn Ala Leu Gly Leu Asn Val Ile 
            20                  25                  30          


Ser Asn 
        


<210>  138
<211>  185
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (52)..(52)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (155)..(155)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (171)..(171)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (184)..(184)
<223>  Xaa can be any naturally occurring amino acid

<400>  138

Ser Phe Thr Glu Asp Thr Pro Leu Phe Ile Lys Tyr Asn Asn Ser Gly 
1               5                   10                  15      


Leu Ile Asp Ile Lys Pro Ile Ser Glu Leu Val Asn Glu Asp Lys Ile 
            20                  25                  30          


Glu Phe Asp Gly Leu Gly Arg Glu Tyr Asp Tyr Ser Lys Lys Asp Phe 
        35                  40                  45              


Lys Val Leu Xaa Arg Ser Gly Trp Val Glu Pro Ser Tyr Ile Tyr Arg 
    50                  55                  60                  


His Lys Thr Thr Lys Pro Ile Tyr Arg Ile Ser Asp Asp Glu Val Lys 
65                  70                  75                  80  


Met Ser Ile Asp Val Thr Glu Asp His Ser Leu Phe Asn Glu Lys Gln 
                85                  90                  95      


Glu Lys Ile Lys Pro Ser Glu Ile Asn Ser Asp Thr Lys Leu Glu Tyr 
            100                 105                 110         


Tyr Asn Asn Asn Ile Ser Asn Asn Gln Ile Val Ile Asp Glu Ile Arg 
        115                 120                 125             


Ile Asp Lys Tyr Ser Lys Leu Leu Ser Asn Gly Arg Leu Asn Ala Ile 
    130                 135                 140                 


Pro Ile Asp Leu Leu Asn Ala Thr Val Glu Xaa Lys Lys Glu Phe Leu 
145                 150                 155                 160 


Ser Lys Met Asp Leu Ser Lys Val Ser Glu Xaa Lys Thr Leu Ile Ala 
                165                 170                 175     


Gly Ile Leu Tyr Leu Arg Ser Xaa Ile 
            180                 185 


<210>  139
<211>  36
<212>  PRT
<213>  Unknown

<220>
<223>  IntC from sheep gut metagenome

<400>  139

Met Leu Asn Ser Lys Lys Ile Lys Gly Lys Glu Glu Val Gly Val Val 
1               5                   10                  15      


Tyr Asp Ile Ser Leu Asp Gly Thr Val Val Asn Ala Leu Gly Met Asn 
            20                  25                  30          


Val Ile Ser Asn 
        35      


<210>  140
<211>  195
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (49)..(49)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (174)..(174)
<223>  Xaa can be any naturally occurring amino acid

<400>  140

Ser Phe Thr Gly Asp Thr Pro Leu Phe Ile Lys Tyr Asn Asp Gly Lys 
1               5                   10                  15      


Ile Asp Ile Lys Pro Ile Glu Glu Leu Ile Gly Glu Thr Glu Thr Asp 
            20                  25                  30          


Ala Leu Gly Arg Glu Tyr Asp Tyr Ser Lys Lys Pro Tyr Lys Val Leu 
        35                  40                  45              


Xaa Arg Ser Gly Trp Val Arg Pro Ser Tyr Ile Tyr Arg His Lys Thr 
    50                  55                  60                  


Asn Lys Pro Leu Tyr Thr Val Ser Glu Gly Asn Met Ser Val Thr Val 
65                  70                  75                  80  


Thr Glu Asp His Ser Leu Phe Asn Asp Lys Gln Lys Lys Ile Lys Pro 
                85                  90                  95      


Ser Glu Ile Thr Glu Gly Thr Arg Leu Glu Tyr Tyr Thr Asp Lys Val 
            100                 105                 110         


Glu Thr Ser Thr Lys Gly Phe Ile His Leu Asn Glu Gln Arg Val Lys 
        115                 120                 125             


Ile Met Ala Lys Thr Leu Lys Asn Gly Val Ile Asn Arg Val Pro Ile 
    130                 135                 140                 


Gln Leu Phe Asn Thr Asn Asn Ile Asp Ala Val Lys Thr Phe Leu Asn 
145                 150                 155                 160 


Glu Leu Glu Gly Trp Asp Tyr Ser Asn Thr Ser Lys Thr Xaa Arg Ala 
                165                 170                 175     


Gly Ile Gln Phe Leu Lys Lys Lys Ile Asp Gly Phe Asn Phe His Asp 
            180                 185                 190         


Val Tyr Lys 
        195 


<210>  141
<211>  38
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (12)..(12)
<223>  Xaa can be any naturally occurring amino acid

<400>  141

Met Asn Asn Ile Asn Ile Lys Lys Met Glu Asp Xaa Gln Asp Ile Gly 
1               5                   10                  15      


Val Val Tyr Asp Ile Ser Leu Asp Gly Thr Val Val Asn Ala Leu Gly 
            20                  25                  30          


Met Asn Val Ile Ser Asn 
        35              


<210>  142
<211>  193
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (23)..(23)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (49)..(49)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (167)..(167)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (173)..(173)
<223>  Xaa can be any naturally occurring amino acid

<400>  142

Ser Phe Thr Ala Asp Thr Pro Leu Phe Ile Lys Tyr Asp Asp Gly Lys 
1               5                   10                  15      


Ile Asp Ile Lys Pro Ile Xaa Glu Leu Ile Gly Glu Thr Glu Thr Asp 
            20                  25                  30          


Lys Leu Gly Arg Glu Tyr Asp Tyr Ser Pro Lys Pro Tyr Arg Val Leu 
        35                  40                  45              


Xaa Arg Ser Gly Trp Met Arg Pro Ser Tyr Ile Tyr Arg His Lys Thr 
    50                  55                  60                  


Asn Lys Pro Leu Tyr Glu Val Ser Glu Gly Asn Met Ser Ile Thr Val 
65                  70                  75                  80  


Thr Glu Asp His Ser Leu Phe Asn Asp Lys Gln His Lys Ile Lys Pro 
                85                  90                  95      


Ser Glu Ile Thr Glu Asn Thr Lys Leu Glu Tyr Tyr Lys Lys Ser Ile 
            100                 105                 110         


Glu Thr Asp Thr Lys Tyr Arg Trp Leu Thr Glu Asp Arg Ala Arg Arg 
        115                 120                 125             


Met Ser Lys Met Val Leu Asp Gly Thr Ile Asp Arg Val Pro Met Ala 
    130                 135                 140                 


Ile Leu Asn Thr Glu Asn Leu Lys Ile Val Met Ala Phe Leu Lys Glu 
145                 150                 155                 160 


Trp Glu Tyr Met Pro Leu Xaa Thr Phe Ser Lys Thr Xaa Gln Ala Gly 
                165                 170                 175     


Ile Asn Phe Leu Lys Met Lys Leu Tyr Gly Gln Asn Arg Asn Ile His 
            180                 185                 190         


Lys 
    


<210>  143
<211>  33
<212>  PRT
<213>  Unknown

<220>
<223>  IntC from sheep gut metagenome

<400>  143

Met Lys Ile Lys Asn Ile Gly Ser Thr Ser Asp Tyr Val Tyr Asp Ile 
1               5                   10                  15      


Gln Leu Asp Gly Thr Val Val Asn Ala Leu Gly Met Asn Ile Ile Ser 
            20                  25                  30          


Asn 
    


<210>  144
<211>  121
<212>  PRT
<213>  Unknown

<220>
<223>  IntN from Antarctica Vida lake environmental sampling 
       brine-hole-2

<400>  144

Ser Val Thr Gly Asp Thr Leu Ile Thr Leu Ser Asp Lys Lys Ile Ser 
1               5                   10                  15      


Ile Glu Asp Leu Phe Asn Arg Phe Asp Tyr Arg Val Ile Gln Asp Gln 
            20                  25                  30          


Gly Lys Glu Tyr Ala Ile Pro Arg Thr Glu Leu Glu Lys Glu Asn Asn 
        35                  40                  45              


Ala Val Leu Gly Tyr Asn Ala Phe Glu Asp Glu Ala Val Leu Gly Glu 
    50                  55                  60                  


Ile Ala Tyr Val Met Arg His Lys Thr Lys Lys Lys Leu Tyr Glu Ile 
65                  70                  75                  80  


Glu Thr Asp Asp Gly Lys Lys Val Thr Val Thr Glu Asp His Ser Leu 
                85                  90                  95      


Ile Val Asp Arg His Gly Ile Thr Thr Glu Val Thr Pro Lys Asn Leu 
            100                 105                 110         


Glu Glu Asp Asp Leu Ile Ile Thr Ile 
        115                 120     


<210>  145
<211>  42
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC from Antarctica Vida lake environmental sampling 
       brine-hole-2


<220>
<221>  misc_feature
<222>  (13)..(13)
<223>  Xaa can be any naturally occurring amino acid

<400>  145

Met Asn Thr Ile Arg Thr Lys Val Lys Arg Val Thr Xaa Leu Gly Glu 
1               5                   10                  15      


Val Asp Asp Tyr Val Tyr Asp Val Ser Met Val Arg Gln Asp Pro Phe 
            20                  25                  30          


Phe Phe Ala Asn Asp Ile Leu Val His Asn 
        35                  40          


<210>  146
<211>  187
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN (partial) from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (49)..(49)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (153)..(153)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (173)..(173)
<223>  Xaa can be any naturally occurring amino acid

<400>  146

Ser Phe Thr Gly Asp Thr Pro Leu Phe Ile Lys Tyr Asp Asn Gly Ile 
1               5                   10                  15      


Ile Asp Ile Val Pro Ile Ser Asp Leu Ile Gly Asp Thr Glu Met Asp 
            20                  25                  30          


Asp Leu Gly Arg Glu Tyr Asp Val Ser Asp Lys Pro Tyr Lys Val Leu 
        35                  40                  45              


Xaa Arg Ser Gly Trp Val Lys Pro Glu Tyr Ile Tyr Arg His Lys Thr 
    50                  55                  60                  


Lys Lys Pro Leu Tyr Glu Val Ser Asp Gly Asp Met Thr Val Met Val 
65                  70                  75                  80  


Thr Glu Asp His Ser Leu Phe Asn Asn Lys Gln Glu Lys Ile Lys Pro 
                85                  90                  95      


Ser Glu Ile Asn Glu Lys Thr Lys Leu Glu Tyr Tyr Gly His Lys Ile 
            100                 105                 110         


Glu Ser Ser Tyr Glu Tyr Gly Trp Leu Asn Glu Lys Arg Ala Leu Arg 
        115                 120                 125             


Met Ala Lys Trp Leu Lys Asp Gly Thr Leu Asn Gln Val Pro Thr Pro 
    130                 135                 140                 


Leu Leu Asn Thr Lys Lys Ile Asn Xaa Ile Lys Val Phe Leu Gly Glu 
145                 150                 155                 160 


Leu Gln Gly Phe Asp Phe Asn Asn Ala Thr Lys Thr Xaa Arg Ala Gly 
                165                 170                 175     


Ile Leu Phe Leu Lys Glu Lys Leu Lys Asn Lys 
            180                 185         


<210>  147
<211>  124
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN (partial) from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (16)..(16)
<223>  Xaa can be any naturally occurring amino acid

<400>  147

Ser Val Thr Glu Asp Ser Leu Ile Arg Thr Gly Ser Gly Asn Ile Xaa 
1               5                   10                  15      


Val Lys Asp Ile Trp Asp Lys Tyr Ser Lys Glu Ile Pro Glu Val Phe 
            20                  25                  30          


Glu Tyr Ser Asn Asn Phe Lys Thr Ala Tyr Leu Lys Met Asn Asp Lys 
        35                  40                  45              


Glu Tyr Ile Leu Asn Pro Val Met Thr Ile Leu Gly Ser Asp Asp Arg 
    50                  55                  60                  


Leu Tyr Ile Pro Arg Tyr Ile Met Lys His Lys Thr Lys Lys Lys Leu 
65                  70                  75                  80  


Tyr Lys Ile Thr Thr Glu Ser Gly Lys Gln Val Thr Val Thr Glu Asp 
                85                  90                  95      


His Ser Leu Ile Val Val Arg Asp Asn Val Lys Thr Val Ile Lys Pro 
            100                 105                 110         


Thr Glu Leu Leu Asp Thr Asp Glu Val Ile Val Ile 
        115                 120                 


<210>  148
<211>  43
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC (partial) from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (33)..(33)
<223>  Xaa can be any naturally occurring amino acid

<400>  148

Met Ile Asn Met Tyr Leu Glu His Ile Lys Ser Val Glu Val Leu Glu 
1               5                   10                  15      


Pro Thr Glu Asp Leu Asp Val Tyr Asp Leu Glu Leu Pro Val Asp His 
            20                  25                  30          


Xaa Phe Phe Ala Asn Asp Ile Leu Val His Asn 
        35                  40              


<210>  149
<211>  53
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC (partial) from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (5)..(5)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (43)..(43)
<223>  Xaa can be any naturally occurring amino acid

<400>  149

Met Pro Val Ile Xaa Leu Lys Glu Ile Ile Lys Leu Tyr Lys Val Ser 
1               5                   10                  15      


Asn Ile Lys Ser Ile Glu Val Ile Thr Pro Glu Glu Pro Ile Asp Val 
            20                  25                  30          


Tyr Asp Ile Glu Met Pro Asp Asp Asp His Xaa Phe Phe Ala Asn Asp 
        35                  40                  45              


Ile Leu Val His Asn 
    50              


<210>  150
<211>  47
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC (partial) from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (37)..(37)
<223>  Xaa can be any naturally occurring amino acid

<400>  150

Met Gly Gly Glu Trp Glu Met Lys Ile Glu Ser Val Lys Ser Ile Glu 
1               5                   10                  15      


Val Val Thr Pro Glu Glu Pro Ile Asp Val Tyr Asp Ile Glu Met Pro 
            20                  25                  30          


Asp Asp Asp His Xaa Phe Phe Ala Asn Asp Ile Leu Val His Asn 
        35                  40                  45          


<210>  151
<211>  50
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC (partial) from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (40)..(40)
<223>  Xaa can be any naturally occurring amino acid

<400>  151

Met Leu Lys Tyr Ile Leu Glu Val Val Ser Met Asn Leu Ser Lys Ile 
1               5                   10                  15      


Lys Ser Val Glu Val Ile Thr Pro Glu Glu Pro Thr Glu Val Tyr Asp 
            20                  25                  30          


Ile Glu Leu Pro Val Asp His Xaa Phe Tyr Ala Asn Asp Ile Leu Val 
        35                  40                  45              


His Asn 
    50  


<210>  152
<211>  42
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC from Antarctica Vida lake environmental sampling 
       brine-hole-2


<220>
<221>  misc_feature
<222>  (13)..(13)
<223>  Xaa can be any naturally occurring amino acid

<400>  152

Met Asp Thr Asn Arg Ala Lys Val Lys Ser Val Lys Xaa Leu Gly Glu 
1               5                   10                  15      


Val Asp Asp Tyr Val Tyr Asp Val Ser Met Lys Asp Gln Asp Pro Phe 
            20                  25                  30          


Phe Phe Ala Asn Gly Ile Leu Val His Asn 
        35                  40          


<210>  153
<211>  379
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Nanosalina species J07AB43


<220>
<221>  misc_feature
<222>  (151)..(151)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (154)..(154)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (187)..(187)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (213)..(213)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (277)..(277)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (379)..(379)
<223>  Xaa can be any naturally occurring amino acid

<400>  153

Ser Leu Asn Tyr Ser Arg Gln Val Val Val Lys Asn Pro Asn Asn Glu 
1               5                   10                  15      


Ile Gln Phe Met Glu Val Gly Lys Phe Val Glu Asp Ile Asp Val Pro 
            20                  25                  30          


Glu Asn Tyr Glu Thr Leu Ala Trp Asp Glu Glu Lys Asp Arg Ser Val 
        35                  40                  45              


Phe Lys Pro Val Lys Arg Ala Ile Met His Lys Tyr Asn Gly Asn Leu 
    50                  55                  60                  


Leu Arg Phe Asp Thr Ser Arg Gly Arg Thr Glu Val Thr Pro Gln His 
65                  70                  75                  80  


Ser Val Tyr Arg Tyr Glu Asp Gly Glu Ile Lys Leu Ala Asp Ala Glu 
                85                  90                  95      


Asp Leu Glu Glu Gly Asp Arg Leu Val Ser Leu Ser Glu Leu Pro Glu 
            100                 105                 110         


Thr Glu Lys Lys Tyr Ser Glu Gly Asp Thr Ile Asp Leu Ala Asp Leu 
        115                 120                 125             


Glu Tyr Glu Asn Ser Asp Leu Met Ala Tyr Arg Asp Lys Lys Lys Phe 
    130                 135                 140                 


Pro Ala Glu Lys Gly Glu Xaa Pro Tyr Xaa Gly Glu Val Tyr Tyr Leu 
145                 150                 155                 160 


Ser Ser His Val His Arg Asp His Gln Asp Arg Arg Ile Ala Leu Gly 
                165                 170                 175     


Glu Ala Ser Gln Asp Tyr Ser Tyr Ile Gly Xaa Lys Asn Ala Lys Ala 
            180                 185                 190         


Gly Lys Ile Pro Arg Phe Trp Lys Leu Thr Ser Glu Leu Ala Trp Ile 
        195                 200                 205             


Leu Gly Phe Tyr Xaa Gly Asp Gly Ser Ala Ser Leu Gly Asp Lys Gln 
    210                 215                 220                 


Met Val Ser Phe Gly Gly Gln Asn Lys Glu Asn Ile Arg Arg Val Lys 
225                 230                 235                 240 


Arg Phe Phe Asp Gln Ile Leu Asp Glu Glu Leu Ser Ile Ile Glu Asp 
                245                 250                 255     


Val Asp Ser Arg Thr Gly Gly Lys Met Tyr Tyr Tyr Arg Ile Gln Arg 
            260                 265                 270         


Ile Pro Val Val Xaa Leu Phe Val Asn Gly Leu Gly Ala Gly Ser Gly 
        275                 280                 285             


Ser Asp Gly Lys Lys Val Pro Ser Met Ile Ile Asn Gly Asp Lys Gly 
    290                 295                 300                 


Leu Arg Lys Ala Phe Val Glu Gly Tyr Phe Asp Ala Asp Gly Ser Arg 
305                 310                 315                 320 


Asp Lys Asp Tyr Asp Asp Arg Tyr Asp Ser Glu Asn Met Arg Phe Ser 
                325                 330                 335     


Thr Lys Ser Ser Tyr Leu Ala Asn Gln Val Gln Tyr Ile Leu Lys Gln 
            340                 345                 350         


Leu Asn Leu Gly Glu Asn Arg Tyr Val Glu Ile Ser Ala Met Ser Leu 
        355                 360                 365             


Ser Ser Ile Val Lys Ile Asn Leu Arg Ser Xaa 
    370                 375                 


<210>  154
<211>  351
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Acanthamoeba castellanii 
       mamavirus Hal-V


<220>
<221>  misc_feature
<222>  (88)..(88)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (110)..(110)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (147)..(147)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (173)..(173)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (259)..(259)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (262)..(262)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (272)..(272)
<223>  Xaa can be any naturally occurring amino acid

<400>  154

Ser Val Thr Gly Asp Thr Pro Ile Ile Thr Arg His Gln Asn Gly Asp 
1               5                   10                  15      


Ile Asn Ile Thr Thr Ile Glu Glu Leu Gly Ser Lys Trp Lys Pro Tyr 
            20                  25                  30          


Glu Ile Phe Lys Ala His Glu Lys Asn Ser Asn Arg Lys Phe Lys Gln 
        35                  40                  45              


Gln Ser Gln Tyr Pro Thr Asp Ser Glu Val Trp Thr Ala Lys Gly Trp 
    50                  55                  60                  


Ala Lys Ile Lys Arg Val Ile Arg His Lys Thr Val Lys Lys Ile Tyr 
65                  70                  75                  80  


Arg Val Leu Thr His Thr Gly Xaa Ile Asp Val Thr Glu Asp His Ser 
                85                  90                  95      


Leu Leu Asp Pro Asn Gln Asn Ile Ile Lys Pro Ile Asn Xaa Gln Ile 
            100                 105                 110         


Gly Thr Glu Leu Leu His Gly Phe Pro Glu Ser Asn Asn Val Tyr Asp 
        115                 120                 125             


Asn Ile Ser Glu Gln Glu Ala Tyr Val Trp Gly Phe Phe Met Gly Asp 
    130                 135                 140                 


Gly Ser Xaa Gly Ser Tyr Gln Thr Lys Asn Gly Ile Lys Tyr Ser Trp 
145                 150                 155                 160 


Ala Leu Asn Asn Gln Asp Leu Asp Val Leu Asn Lys Xaa Lys Lys Tyr 
                165                 170                 175     


Leu Glu Glu Thr Glu Asn Ile Gln Phe Lys Ile Leu Asp Thr Met Lys 
            180                 185                 190         


Ser Ser Ser Val Tyr Lys Leu Val Pro Ile Arg Lys Ile Lys Tyr Met 
        195                 200                 205             


Val Asn Lys Tyr Arg Lys Ile Phe Tyr Asp Asn Lys Lys Tyr Lys Leu 
    210                 215                 220                 


Val Pro Lys Glu Ile Leu Asn Ser Thr Lys Asp Ile Lys Asn Ser Phe 
225                 230                 235                 240 


Leu Glu Gly Tyr Tyr Ala Ala Asp Gly Ser Arg Lys Glu Thr Glu Asn 
                245                 250                 255     


Met Gly Xaa Arg Arg Xaa Asp Ile Lys Gly Lys Ile Ser Ala Gln Xaa 
            260                 265                 270         


Leu Phe Tyr Leu Leu Lys Ser Leu Gly Tyr Asn Val Ser Ile Asn Ile 
        275                 280                 285             


Arg Ser Asp Lys Asn Gln Ile Tyr Arg Leu Thr Phe Ser Asn Lys Lys 
    290                 295                 300                 


Gln Arg Lys Asn Pro Ile Ala Ile Lys Lys Ile Gln Leu Met Asn Glu 
305                 310                 315                 320 


Thr Ser Asn Asp His Asp Gly Asp Tyr Val Tyr Asp Leu Glu Thr Glu 
                325                 330                 335     


Ser Gly Ser Phe His Ala Gly Val Gly Glu Met Ile Val Lys Asn 
            340                 345                 350     


<210>  155
<211>  351
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Acanthamoeba polyphaga 
       lentillevirus


<220>
<221>  misc_feature
<222>  (88)..(88)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (110)..(110)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (147)..(147)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (173)..(173)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (259)..(259)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (262)..(262)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (272)..(272)
<223>  Xaa can be any naturally occurring amino acid

<400>  155

Ser Val Thr Gly Asp Thr Pro Ile Ile Thr Arg His Gln Asn Gly Asp 
1               5                   10                  15      


Ile Asn Ile Thr Thr Ile Glu Glu Leu Gly Ser Lys Trp Lys Pro Tyr 
            20                  25                  30          


Glu Ile Phe Lys Ala His Glu Lys Asn Ser Asn Arg Lys Phe Lys Gln 
        35                  40                  45              


Gln Ser Gln Tyr Pro Thr Asp Ser Glu Val Trp Thr Ala Lys Gly Trp 
    50                  55                  60                  


Ala Lys Ile Lys Arg Val Ile Arg His Lys Thr Val Lys Lys Ile Tyr 
65                  70                  75                  80  


Arg Val Leu Thr His Thr Gly Xaa Ile Asp Val Thr Glu Asp His Ser 
                85                  90                  95      


Leu Leu Asp Pro Asn Gln Asn Ile Ile Lys Pro Ile Asn Xaa Gln Ile 
            100                 105                 110         


Gly Thr Glu Leu Leu His Gly Phe Pro Glu Ser Asn Asn Val Tyr Asp 
        115                 120                 125             


Asn Ile Ser Glu Gln Glu Ala Tyr Val Trp Gly Phe Phe Met Gly Asp 
    130                 135                 140                 


Gly Ser Xaa Gly Ser Tyr Gln Thr Lys Asn Gly Ile Lys Tyr Ser Trp 
145                 150                 155                 160 


Ala Leu Asn Asn Gln Asp Leu Asp Val Leu Asn Lys Xaa Lys Lys Tyr 
                165                 170                 175     


Leu Glu Glu Thr Glu Asn Ile Gln Phe Lys Ile Leu Asp Thr Met Lys 
            180                 185                 190         


Ser Ser Ser Val Tyr Lys Leu Val Pro Ile Arg Lys Ile Lys Tyr Met 
        195                 200                 205             


Val Asn Lys Tyr Arg Lys Ile Phe Tyr Asp Asn Lys Lys Tyr Lys Leu 
    210                 215                 220                 


Val Pro Lys Glu Ile Leu Asn Ser Thr Lys Asp Ile Lys Asn Ser Phe 
225                 230                 235                 240 


Leu Glu Gly Tyr Tyr Ala Ala Asp Gly Ser Arg Lys Glu Thr Glu Asn 
                245                 250                 255     


Met Gly Xaa Arg Arg Xaa Asp Ile Lys Gly Lys Ile Ser Ala Gln Xaa 
            260                 265                 270         


Leu Phe Tyr Leu Leu Lys Ser Leu Gly Tyr Asn Val Ser Ile Asn Ile 
        275                 280                 285             


Arg Ser Asp Lys Asn Gln Ile Tyr Arg Leu Thr Phe Ser Asn Lys Lys 
    290                 295                 300                 


Gln Arg Lys Asn Pro Ile Ala Ile Lys Lys Ile Gln Leu Met Asn Glu 
305                 310                 315                 320 


Thr Ser Asn Asp His Asp Gly Asp Tyr Val Tyr Asp Leu Glu Thr Glu 
                325                 330                 335     


Ser Gly Ser Phe His Ala Gly Val Gly Glu Met Ile Val Lys Asn 
            340                 345                 350     


<210>  156
<211>  351
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Mimivirus Rowbotham-Bradford


<220>
<221>  misc_feature
<222>  (88)..(88)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (110)..(110)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (147)..(147)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (173)..(173)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (259)..(259)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (262)..(262)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (272)..(272)
<223>  Xaa can be any naturally occurring amino acid

<400>  156

Ser Val Thr Gly Asp Thr Pro Ile Ile Thr Arg His Gln Asn Gly Asp 
1               5                   10                  15      


Ile Asn Ile Thr Thr Ile Glu Glu Leu Gly Ser Lys Trp Lys Pro Tyr 
            20                  25                  30          


Glu Ile Phe Lys Ala His Glu Lys Asn Ser Asn Arg Lys Phe Lys Gln 
        35                  40                  45              


Gln Ser Gln Tyr Pro Thr Asp Ser Glu Val Trp Thr Ala Lys Gly Trp 
    50                  55                  60                  


Ala Lys Ile Lys Arg Val Ile Arg His Lys Thr Val Lys Lys Ile Tyr 
65                  70                  75                  80  


Arg Val Leu Thr His Thr Gly Xaa Ile Asp Val Thr Glu Asp His Ser 
                85                  90                  95      


Leu Leu Asp Pro Asn Gln Asn Ile Ile Lys Pro Ile Asn Xaa Gln Ile 
            100                 105                 110         


Gly Thr Glu Leu Leu His Gly Phe Pro Glu Ser Asn Asn Val Tyr Asp 
        115                 120                 125             


Asn Ile Ser Glu Gln Glu Ala Tyr Val Trp Gly Phe Phe Met Gly Asp 
    130                 135                 140                 


Gly Ser Xaa Gly Ser Tyr Gln Thr Lys Asn Gly Ile Lys Tyr Ser Trp 
145                 150                 155                 160 


Ala Leu Asn Asn Gln Asp Leu Asp Val Leu Asn Lys Xaa Lys Lys Tyr 
                165                 170                 175     


Leu Glu Glu Thr Glu Asn Ile Gln Phe Lys Ile Leu Asp Thr Met Lys 
            180                 185                 190         


Ser Ser Ser Val Tyr Lys Leu Val Pro Ile Arg Lys Ile Lys Tyr Met 
        195                 200                 205             


Val Asn Lys Tyr Arg Lys Ile Phe Tyr Asp Asn Lys Lys Tyr Lys Leu 
    210                 215                 220                 


Val Pro Lys Glu Ile Leu Asn Ser Thr Lys Asp Ile Lys Asn Ser Phe 
225                 230                 235                 240 


Leu Glu Gly Tyr Tyr Ala Ala Asp Gly Ser Arg Lys Glu Thr Glu Asn 
                245                 250                 255     


Met Gly Xaa Arg Arg Xaa Asp Ile Lys Gly Lys Ile Ser Ala Gln Xaa 
            260                 265                 270         


Leu Phe Tyr Leu Leu Lys Ser Leu Gly Tyr Asn Val Ser Ile Asn Ile 
        275                 280                 285             


Arg Ser Asp Lys Asn Gln Ile Tyr Arg Leu Thr Phe Ser Asn Lys Lys 
    290                 295                 300                 


Gln Arg Lys Asn Pro Ile Ala Ile Lys Lys Ile Gln Leu Met Asn Glu 
305                 310                 315                 320 


Thr Ser Asn Asp His Asp Gly Asp Tyr Val Tyr Asp Leu Glu Thr Glu 
                325                 330                 335     


Ser Gly Ser Phe His Ala Gly Val Gly Glu Met Ile Val Lys Asn 
            340                 345                 350     


<210>  157
<211>  282
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Marseilleviridae Montpellier MTP3


<220>
<221>  misc_feature
<222>  (86)..(86)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (108)..(108)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (145)..(145)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (161)..(161)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (181)..(181)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (217)..(217)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (234)..(234)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (244)..(244)
<223>  Xaa can be any naturally occurring amino acid

<400>  157

Ser Val Thr Gly Asp Thr Pro Ile Met Ile Lys Asp Lys Asn Asn Asn 
1               5                   10                  15      


Ile Asn Ile Val Thr Ile Lys Glu Leu Gly Glu Lys Trp Lys Pro Tyr 
            20                  25                  30          


Asp Ile Phe Lys Ser His Glu Ile Asn Ser Asn Arg Lys Tyr Lys Gln 
        35                  40                  45              


Gln Ala Asp Phe Asn Gly Glu Val Trp Thr Ser Asn Gly Trp Ala Lys 
    50                  55                  60                  


Ile Lys Arg Val Ile Arg His Lys Thr Val Lys Lys Leu Tyr Arg Val 
65                  70                  75                  80  


Leu Thr Asn Thr Gly Xaa Ile Asp Val Thr Glu Asp His Ser Leu Leu 
                85                  90                  95      


Asp Thr Asn Lys Asn Ile Ile Lys Pro Ile Asp Xaa Lys Ile Gly Thr 
            100                 105                 110         


Glu Leu Leu His Gly Phe Pro Glu Ile Asn Asn Asn His Asn Lys Leu 
        115                 120                 125             


Ser Leu Glu Ile Tyr Lys Glu Leu Asp Ile Thr Ser Glu Leu Phe Asp 
    130                 135                 140                 


Xaa Met Ile Glu Ser Asn Lys Lys Trp Asn Asn Glu Lys Met Lys Ala 
145                 150                 155                 160 


Xaa Phe Ile Gly Ser Glu Tyr Arg Lys Gln Asn Lys Asn Ile Ser Asn 
                165                 170                 175     


Glu Ile Leu Asn Xaa Ser Lys Lys Ile Lys Lys Tyr Phe Leu Leu Gly 
            180                 185                 190         


Tyr Leu Gly Asn Asp Lys Glu Tyr Ile Thr Asn Asn Lys Ile Asn Ala 
        195                 200                 205             


Gln Met Val Tyr Tyr Leu Met Lys Xaa Leu Gly Tyr Asn Ile Val Ile 
    210                 215                 220                 


Asp Leu Ile Glu Ser Ser Tyr Lys Leu Xaa Ile Val Asp Asn Ile Asn 
225                 230                 235                 240 


Lys Pro Tyr Xaa Ile Asn Lys Ile Ile Gln Leu Gln Asp Thr Ser Ile 
                245                 250                 255     


Asn Gly Glu Tyr Val Tyr Asp Leu Glu Thr Glu Ser Gly Thr Phe His 
            260                 265                 270         


Ala Gly Ile Gly Glu Leu Ile Val Lys Asn 
        275                 280         


<210>  158
<211>  282
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Megavirus chiliensis


<220>
<221>  misc_feature
<222>  (86)..(86)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (108)..(108)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (145)..(145)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (181)..(181)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (213)..(213)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (217)..(217)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (234)..(234)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (244)..(244)
<223>  Xaa can be any naturally occurring amino acid

<400>  158

Ser Val Thr Gly Asp Thr Pro Ile Met Ile Lys Asp Lys Asn Asn Asn 
1               5                   10                  15      


Ile Asn Ile Val Thr Ile Lys Glu Leu Gly Glu Lys Trp Lys Pro Tyr 
            20                  25                  30          


Asp Ile Phe Lys Ser His Glu Ile Asn Ser Asn Arg Lys Tyr Lys Gln 
        35                  40                  45              


Gln Ala Asp Phe Asn Gly Glu Val Trp Thr Ser Asn Gly Trp Ala Lys 
    50                  55                  60                  


Ile Lys Arg Val Ile Arg His Lys Thr Val Lys Lys Leu Tyr Arg Val 
65                  70                  75                  80  


Leu Thr Asn Thr Gly Xaa Ile Asp Val Thr Glu Asp His Ser Leu Leu 
                85                  90                  95      


Asp Thr Asn Lys Asn Ile Ile Lys Pro Ile Asp Xaa Lys Ile Gly Thr 
            100                 105                 110         


Glu Leu Leu His Gly Phe Pro Glu Ile Asn Asn Asn His Asn Lys Leu 
        115                 120                 125             


Ser Leu Glu Ile Tyr Lys Glu Leu Asp Ile Thr Ser Glu Leu Phe Asp 
    130                 135                 140                 


Xaa Met Ile Glu Ser Asn Lys Lys Trp Asn Asn Glu Lys Met Lys Ala 
145                 150                 155                 160 


Tyr Phe Ile Gly Ser Glu Tyr Arg Lys Gln Asn Lys Asn Ile Ser Asn 
                165                 170                 175     


Glu Ile Leu Asn Xaa Ser Lys Lys Ile Lys Lys Tyr Phe Leu Leu Gly 
            180                 185                 190         


Tyr Leu Gly Asn Asp Lys Glu Tyr Ile Thr Asn Asn Lys Ile Asn Ala 
        195                 200                 205             


Gln Ile Ile Tyr Xaa Leu Met Lys Xaa Leu Glu Tyr Asn Ile Val Ile 
    210                 215                 220                 


Asp Leu Ile Glu Ser Ser Tyr Lys Leu Xaa Ile Ile Asp Asn Ile Asn 
225                 230                 235                 240 


Glu Pro Tyr Xaa Ile Asn Lys Ile Ile Gln Leu Gln Asp Thr Ser Ile 
                245                 250                 255     


Asn Gly Glu Tyr Val Tyr Asp Leu Glu Thr Glu Ser Gly Thr Phe His 
            260                 265                 270         


Ala Gly Ile Gly Glu Leu Ile Val Lys Asn 
        275                 280         


<210>  159
<211>  282
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Megavirus courdo11


<220>
<221>  misc_feature
<222>  (86)..(86)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (108)..(108)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (145)..(145)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (161)..(161)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (181)..(181)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (217)..(217)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (234)..(234)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (244)..(244)
<223>  Xaa can be any naturally occurring amino acid

<400>  159

Ser Val Thr Gly Asp Thr Pro Ile Met Ile Lys Asp Lys Asn Asn Asn 
1               5                   10                  15      


Ile Asn Ile Val Thr Ile Lys Glu Leu Gly Glu Lys Trp Lys Pro Tyr 
            20                  25                  30          


Asp Ile Phe Lys Ser His Glu Ile Asn Ser Asn Arg Lys Tyr Lys Gln 
        35                  40                  45              


Gln Ala Asp Phe Asn Gly Glu Val Trp Thr Ser Asn Gly Trp Ala Lys 
    50                  55                  60                  


Ile Lys Arg Val Ile Arg His Lys Thr Val Lys Lys Leu Tyr Arg Val 
65                  70                  75                  80  


Leu Thr Asn Thr Gly Xaa Ile Asp Val Thr Glu Asp His Ser Leu Leu 
                85                  90                  95      


Asp Thr Asn Lys Asn Ile Ile Lys Pro Ile Asp Xaa Lys Ile Gly Thr 
            100                 105                 110         


Glu Leu Leu His Gly Phe Pro Glu Ile Asn Asn Asn His Asn Lys Leu 
        115                 120                 125             


Ser Leu Glu Ile Tyr Lys Glu Leu Asp Ile Thr Ser Glu Leu Phe Asp 
    130                 135                 140                 


Xaa Met Ile Glu Ser Asn Lys Lys Trp Asn Asn Glu Lys Met Lys Ala 
145                 150                 155                 160 


Xaa Phe Ile Gly Ser Glu Tyr Arg Lys Gln Asn Lys Asn Ile Ser Asn 
                165                 170                 175     


Glu Ile Leu Asn Xaa Ser Lys Lys Ile Lys Lys Tyr Phe Leu Leu Gly 
            180                 185                 190         


Tyr Leu Gly Asn Asp Lys Glu Tyr Ile Thr Asn Asn Lys Ile Asn Ala 
        195                 200                 205             


Gln Met Val Tyr Tyr Leu Met Lys Xaa Leu Gly Tyr Asn Ile Val Ile 
    210                 215                 220                 


Asp Leu Ile Glu Ser Ser Tyr Lys Leu Xaa Ile Val Asp Asn Ile Asn 
225                 230                 235                 240 


Lys Pro Tyr Xaa Ile Asn Lys Ile Ile Gln Leu Gln Asp Thr Ser Ile 
                245                 250                 255     


Asn Gly Glu Tyr Val Tyr Asp Leu Glu Thr Glu Ser Gly Thr Phe His 
            260                 265                 270         


Ala Gly Ile Gly Glu Leu Ile Val Lys Asn 
        275                 280         


<210>  160
<211>  282
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Megavirus courdo7


<220>
<221>  misc_feature
<222>  (86)..(86)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (108)..(108)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (145)..(145)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (161)..(161)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (181)..(181)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (217)..(217)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (234)..(234)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (244)..(244)
<223>  Xaa can be any naturally occurring amino acid

<400>  160

Ser Val Thr Gly Asp Thr Pro Ile Met Ile Lys Asp Lys Asn Asn Asn 
1               5                   10                  15      


Ile Asn Ile Val Thr Ile Lys Glu Leu Gly Glu Lys Trp Lys Pro Tyr 
            20                  25                  30          


Asp Ile Phe Lys Ser His Glu Ile Asn Ser Asn Arg Lys Tyr Lys Gln 
        35                  40                  45              


Gln Ala Asp Phe Asn Gly Glu Val Trp Thr Ser Asn Gly Trp Ala Lys 
    50                  55                  60                  


Ile Lys Arg Val Ile Arg His Lys Thr Val Lys Lys Leu Tyr Arg Val 
65                  70                  75                  80  


Leu Thr Asn Thr Gly Xaa Ile Asp Val Thr Glu Asp His Ser Leu Leu 
                85                  90                  95      


Asp Thr Asn Lys Asn Ile Ile Lys Pro Ile Asp Xaa Lys Ile Gly Thr 
            100                 105                 110         


Glu Leu Leu His Gly Phe Pro Glu Ile Asn Asn Asn His Asn Lys Leu 
        115                 120                 125             


Ser Leu Glu Ile Tyr Lys Glu Leu Asp Ile Thr Ser Glu Leu Phe Asp 
    130                 135                 140                 


Xaa Met Ile Glu Ser Asn Lys Lys Trp Asn Asn Glu Lys Met Lys Ala 
145                 150                 155                 160 


Xaa Phe Ile Gly Ser Glu Tyr Arg Lys Gln Asn Lys Asn Ile Ser Asn 
                165                 170                 175     


Glu Ile Leu Asn Xaa Ser Lys Lys Ile Lys Lys Tyr Phe Leu Leu Gly 
            180                 185                 190         


Tyr Leu Gly Asn Asp Lys Glu Tyr Ile Thr Asn Asn Lys Ile Asn Ala 
        195                 200                 205             


Gln Met Val Tyr Tyr Leu Met Lys Xaa Leu Gly Tyr Asn Ile Val Ile 
    210                 215                 220                 


Asp Leu Ile Glu Ser Ser Tyr Lys Leu Xaa Ile Val Asp Asn Ile Asn 
225                 230                 235                 240 


Lys Pro Tyr Xaa Ile Asn Lys Ile Ile Gln Leu Gln Asp Thr Ser Ile 
                245                 250                 255     


Asn Gly Glu Tyr Val Tyr Asp Leu Glu Thr Glu Ser Gly Thr Phe His 
            260                 265                 270         


Ala Gly Ile Gly Glu Leu Ile Val Lys Asn 
        275                 280         


<210>  161
<211>  282
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Megavirus terra1 TE1


<220>
<221>  misc_feature
<222>  (86)..(86)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (108)..(108)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (145)..(145)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (161)..(161)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (181)..(181)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (217)..(217)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (234)..(234)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (244)..(244)
<223>  Xaa can be any naturally occurring amino acid

<400>  161

Ser Val Thr Gly Asp Thr Pro Ile Met Ile Lys Asp Lys Asn Asn Asn 
1               5                   10                  15      


Ile Asn Ile Val Thr Ile Lys Glu Leu Gly Glu Lys Trp Lys Pro Tyr 
            20                  25                  30          


Asp Ile Phe Lys Ser His Glu Ile Asn Ser Asn Arg Lys Tyr Lys Gln 
        35                  40                  45              


Gln Ala Asp Phe Asn Gly Glu Val Trp Thr Ser Asn Gly Trp Ala Lys 
    50                  55                  60                  


Ile Lys Arg Val Ile Arg His Lys Thr Val Lys Lys Leu Tyr Arg Val 
65                  70                  75                  80  


Leu Thr Asn Thr Gly Xaa Ile Asp Val Thr Glu Asp His Ser Leu Leu 
                85                  90                  95      


Asp Thr Asn Lys Asn Ile Ile Lys Pro Ile Asp Xaa Lys Ile Gly Thr 
            100                 105                 110         


Glu Leu Leu His Gly Phe Pro Glu Ile Asn Asn Asn His Asn Lys Leu 
        115                 120                 125             


Ser Leu Glu Ile Tyr Lys Glu Leu Asp Ile Thr Ser Glu Leu Phe Asp 
    130                 135                 140                 


Xaa Met Ile Glu Ser Asn Lys Lys Trp Asn Asn Glu Lys Met Lys Ala 
145                 150                 155                 160 


Xaa Phe Ile Gly Ser Glu Tyr Arg Lys Gln Asn Lys Asn Ile Ser Asn 
                165                 170                 175     


Glu Ile Leu Asn Xaa Ser Lys Lys Ile Lys Lys Tyr Phe Leu Leu Gly 
            180                 185                 190         


Tyr Leu Gly Asn Asp Lys Glu Tyr Ile Thr Asn Asn Lys Ile Asn Ala 
        195                 200                 205             


Gln Met Val Tyr Tyr Leu Met Lys Xaa Leu Gly Tyr Asn Ile Val Ile 
    210                 215                 220                 


Asp Leu Ile Glu Ser Ser Tyr Lys Leu Xaa Ile Val Asp Asn Ile Asn 
225                 230                 235                 240 


Lys Pro Tyr Xaa Ile Asn Lys Ile Ile Gln Leu Gln Asp Thr Ser Ile 
                245                 250                 255     


Asn Gly Glu Tyr Val Tyr Asp Leu Glu Thr Glu Ser Gly Thr Phe His 
            260                 265                 270         


Ala Gly Ile Gly Glu Leu Ile Val Lys Asn 
        275                 280         


<210>  162
<211>  265
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Moumouvirus Monve


<220>
<221>  misc_feature
<222>  (107)..(107)
<223>  Xaa can be any naturally occurring amino acid

<400>  162

Ser Val Thr Gly Asp Thr Pro Ile Ile Val Lys Leu Pro Asn Ser Asn 
1               5                   10                  15      


Asp Val Glu Ile Lys Thr Ile Gln Glu Leu Thr Thr Phe Trp Tyr Glu 
            20                  25                  30          


Tyr Asp Ala Phe Lys Ala Gly Asp Ser Asn Arg Lys Asp Lys Gln Gln 
        35                  40                  45              


Ala Ile Leu Asp Tyr Glu Val Trp Thr Asp Lys Gly Trp Ala Lys Ile 
    50                  55                  60                  


Lys Arg Val Ile Arg His Gln Thr Lys Lys Ser Ile Tyr Arg Val Lys 
65                  70                  75                  80  


Thr Asn Asn Gly Val Val Asp Val Thr Glu Asp His Ser Leu Leu Asn 
                85                  90                  95      


Thr Asp Lys Glu Ile Ile Lys Pro Leu Asp Xaa Asn Pro Asn Thr Lys 
            100                 105                 110         


Leu Leu His Gly Phe Met Glu Thr Asn Asn Ile Tyr Gln Asn Ile Thr 
        115                 120                 125             


Pro Ser Gln Ala Tyr Leu Leu Gly Leu Asn Phe Gly Lys Val Asp Val 
    130                 135                 140                 


Tyr Trp Asp Ile Ile Asn Ala Gln Ser Ile Trp Asp Val Ile Asn Ile 
145                 150                 155                 160 


Ile Thr Asn Ala Thr Thr Lys Ile Lys Gln Glu Phe Ile Lys Gly Trp 
                165                 170                 175     


Lys Lys Gln Gly Phe Tyr Asn Ile Glu Asn Lys Val Glu Ala Gln Phe 
            180                 185                 190         


Leu Tyr Tyr Val Leu Lys Ser Leu Gly His Asn Val Asn Ile His Met 
        195                 200                 205             


Pro Ile Thr Lys Glu Asp Ile Tyr Arg Leu Ser Tyr Gly Lys Asn Ile 
    210                 215                 220                 


Ser Asn Ile Asn Lys Thr Thr Ile Gln Tyr Leu Arg Glu Thr Tyr Asp 
225                 230                 235                 240 


Gly Glu Tyr Val Tyr Asp Leu Glu Thr Glu Ser Gly Thr Phe His Ala 
                245                 250                 255     


Gly Ile Gly Glu Met Ile Val Lys Asn 
            260                 265 


<210>  163
<211>  323
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Ostreococcus tauri virus 1


<220>
<221>  misc_feature
<222>  (48)..(48)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (97)..(97)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (135)..(135)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (161)..(161)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (208)..(208)
<223>  Xaa can be any naturally occurring amino acid

<400>  163

Ser Val Thr Pro Asp Thr Pro Leu Leu Ile Arg Glu Asn Gly Glu Val 
1               5                   10                  15      


Lys Thr Thr Arg Ile Asp Ser Leu Val Asp Leu Tyr Glu Val Arg Asp 
            20                  25                  30          


Asp Gly Lys Glu Ile Ala Glu Ile Asp Ala Glu Val Trp Thr Glu Xaa 
        35                  40                  45              


Gly Phe Thr Pro Ile Lys Gln Ile Val Arg His Lys Thr Thr Lys Asn 
    50                  55                  60                  


Ile His Arg Val Leu Thr His Thr Gly Ile Val Asp Val Thr Glu Asp 
65                  70                  75                  80  


His Ser Leu Leu Leu Lys Asn Lys Glu Met Ile Lys Pro Ser Glu Val 
                85                  90                  95      


Xaa Leu Gly Thr Glu Leu Leu His Gly Asn Ser Leu Glu Ala Phe Gly 
            100                 105                 110         


Glu Thr His Thr Asp Val Thr Pro Glu Glu Ala Lys Val Met Gly Phe 
        115                 120                 125             


Phe Phe Gly Asp Gly Ser Xaa Gly His Tyr Asp Gly Lys Tyr Thr Trp 
    130                 135                 140                 


Ala Leu Asn Asn Ala Asp Met Thr Phe Leu Glu Glu Met Ser Glu Leu 
145                 150                 155                 160 


Xaa Pro Phe Glu Thr Arg Val Tyr Asp Thr Ile Gln Ser Ser Gly Val 
                165                 170                 175     


Tyr Lys Leu Asn Ala Val Gly Asp Val Lys Ser Ile Ser Val Arg Tyr 
            180                 185                 190         


Arg Ser Leu Phe Tyr Asn Glu His Lys Glu Lys Val Val Pro Pro Xaa 
        195                 200                 205             


Ile Leu Gly Ala Pro Leu His Ile Val Gln Ser Phe Trp Asp Gly Tyr 
    210                 215                 220                 


Tyr Met Ala Asp Gly Asp Lys Asp Val His Gly Tyr Thr Arg Met Asp 
225                 230                 235                 240 


Ile Lys Gly Lys Glu Gly Ser Met Gly Met Tyr Ile Leu Gly Arg Arg 
                245                 250                 255     


Leu Gly Tyr Asn Val Ser Met Asn Thr Arg Thr Asp Lys Pro Asp Ile 
            260                 265                 270         


Phe Arg Gln Thr Trp Thr Thr Ser Ser Gln Arg Lys Asn Pro Ile Ala 
        275                 280                 285             


Ile Lys Lys Leu Glu Leu Leu Gly Glu Thr Glu Gly Tyr Val Tyr Asp 
    290                 295                 300                 


Leu Thr Thr Gly Ser His His Phe His Val Gly Pro Gly Asp Leu Val 
305                 310                 315                 320 


Val His Asn 
            


<210>  164
<211>  317
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from unknown phycodnavirus KBvp-11


<220>
<221>  misc_feature
<222>  (19)..(19)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (107)..(107)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (135)..(135)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (161)..(161)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (208)..(208)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (281)..(281)
<223>  Xaa can be any naturally occurring amino acid

<400>  164

Ser Val Thr Pro Asp Thr Pro Leu Leu Ile Arg Gln Asp Gly Ile Val 
1               5                   10                  15      


Lys Thr Xaa Arg Ile Asp Ser Leu Val Asn Ala Tyr Glu Val Arg Asp 
            20                  25                  30          


Asp Gly Lys Glu Val Ala Thr Ile Asp Ala Glu Val Trp Thr Glu Lys 
        35                  40                  45              


Gly Phe Thr Pro Ile His Gln Ile Val Arg His Lys Thr Thr Lys Arg 
    50                  55                  60                  


Ile His Arg Val Leu Thr His Thr Gly Val Val Asp Val Thr Glu Asp 
65                  70                  75                  80  


His Ser Leu Leu Leu Glu Asp Ala Lys Met Ile Thr Pro Lys Glu Val 
                85                  90                  95      


Gln Leu Gly Thr Lys Leu Leu His Gly Ser Xaa Val Asn Ala Ile Ile 
            100                 105                 110         


Asp Gly Thr Ser Arg Val Ser Val Asn Glu Ala Lys Val Met Gly Phe 
        115                 120                 125             


Phe Phe Gly Asp Gly Ser Xaa Gly Ala Tyr Asn Gly Lys Tyr Thr Trp 
    130                 135                 140                 


Thr Leu Asn Asn Ala Asn Ile Gln Tyr Leu Asp Lys Met Ala Ser Leu 
145                 150                 155                 160 


Xaa Pro Phe Glu Thr Arg Ile Tyr Ala Thr Met Glu Ser Ser Gly Val 
                165                 170                 175     


Tyr Lys Leu Asn Ala Ile Gly Asp Val Lys Thr Ile Ser Leu Arg Tyr 
            180                 185                 190         


Arg Ser Leu Phe Tyr Asn Ala Ala Lys Glu Lys Val Ile Pro Pro Xaa 
        195                 200                 205             


Ile Leu Asn Ala Pro Glu Glu Val Val Lys Ala Phe Val Glu Gly Tyr 
    210                 215                 220                 


Tyr Met Ala Asp Gly Asp Thr Arg Met Asp Ile Lys Gly Lys Glu Gly 
225                 230                 235                 240 


Ser Met Gly Met Phe Ile Leu Gly Lys Arg Leu Gly Tyr Asn Val Ser 
                245                 250                 255     


Ile Asn Thr Arg Ser Asp Lys Pro Asp Ile Tyr Arg Gln Thr Trp Thr 
            260                 265                 270         


Thr Tyr Ser Gln Arg Lys Glu Pro Xaa Ala Ile Lys Lys Leu Glu Phe 
        275                 280                 285             


Leu Glu Glu Thr Asp Gly Tyr Val Tyr Asp Leu Thr Thr Glu Ser His 
    290                 295                 300                 


His Phe His Val Gly Pro Gly Glu Leu Val Val His Asn 
305                 310                 315         


<210>  165
<211>  359
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Antarctica Vida lake 
       environmental sampling brine-hole-2


<220>
<221>  misc_feature
<222>  (81)..(81)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (191)..(191)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (207)..(207)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (333)..(333)
<223>  Xaa can be any naturally occurring amino acid

<400>  165

Ser Val Thr His Asp Thr Pro Ile Tyr Ile Lys Trp Lys Asp Ser Asn 
1               5                   10                  15      


Lys Leu Asp Ile Leu Pro Ile Ser Asp Ile Phe Asn Glu Asp Ser Glu 
            20                  25                  30          


Val Leu Asp Glu Glu Ser Leu Arg Asp Leu Glu Ile Lys Pro Tyr Glu 
        35                  40                  45              


Val Leu Thr Val Asn Gly Trp Lys Glu Ile Asn Tyr Val Tyr Arg His 
    50                  55                  60                  


Glu Thr Asn Lys Lys Ile His Arg Ile Ser Thr Lys Asp Lys Leu Val 
65                  70                  75                  80  


Xaa Val Thr Glu Asp His Ser Leu Phe Gln Asn Gly Lys Gln Ile Lys 
                85                  90                  95      


Pro Lys Asn Leu Lys Arg Gly Asp Val Leu Asp Val Arg Glu Leu Pro 
            100                 105                 110         


Thr Phe Glu Leu Asp Asp Asp Asn Asn Leu Asn Glu Asp Leu Phe Tyr 
        115                 120                 125             


Leu Tyr Gly Tyr Phe Leu Gly Asp Gly Ser Ala Thr Tyr Gly Asn Arg 
    130                 135                 140                 


Lys Gln Tyr Tyr Lys Ser Lys Lys Thr Ser Lys Thr Asn Ile Asn Lys 
145                 150                 155                 160 


Gly Lys Arg Ser Val Phe Lys Ile Ser Ser Ser Asn Tyr Asp Lys Leu 
                165                 170                 175     


Ile Arg Leu Gln Lys Ile Ile Lys Asp Asn Phe Asp Val Asn Xaa Lys 
            180                 185                 190         


Ile Lys Asp His Arg Glu Ser Ser Asn Val Tyr Asn Leu Ile Xaa Tyr 
        195                 200                 205             


Val Lys Lys Met Ser Val Lys Phe Ser Glu Asp Phe Tyr Thr Ser Tyr 
    210                 215                 220                 


Lys Glu Lys Lys Ile Pro Asn Tyr Val Leu Asn Thr Asn Lys Lys Asn 
225                 230                 235                 240 


Lys Leu Ala Phe Leu Glu Gly Val Phe Ser Ser Asp Gly Tyr Gly Asp 
                245                 250                 255     


Thr Leu Glu Glu Val Ser Asp Ile Gly Met Lys Ser Gln Val Ala Met 
            260                 265                 270         


Ser Gly Ile Ser Tyr Ile Met Glu Thr Leu Ser Ile Asp Arg Glu Ile 
        275                 280                 285             


Lys Val Arg Lys Asp Lys Glu Asn Phe Ile Ser Leu Lys Leu Lys Asn 
    290                 295                 300                 


Arg Asn Arg Ser Asn Ser Lys Phe Ala Asn Lys Ile Lys Met Lys Ser 
305                 310                 315                 320 


Asp Glu Val Trp Leu Asn Glu Val Ile Thr Asn Lys Xaa Glu Lys Asn 
                325                 330                 335     


Tyr Val Tyr Asp Ile Ser Thr Glu Asp Gly Thr Phe Ile Gly Gly Ile 
            340                 345                 350         


Gly Gly Val Asp Leu Lys Asn 
        355                 


<210>  166
<211>  296
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from botany bay environmental sample 
       BBAY15 library


<220>
<221>  misc_feature
<222>  (32)..(32)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (43)..(43)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (144)..(144)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (149)..(149)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (185)..(185)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (209)..(209)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (212)..(212)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (223)..(223)
<223>  Xaa can be any naturally occurring amino acid

<400>  166

Ser Val Thr Ala Glu Thr Pro Val Leu Val Lys Ser Pro Asp Gly Val 
1               5                   10                  15      


Val Gln Tyr Ile Gln Ile Ser Glu Leu Gly Asn Lys Trp Asp Lys Xaa 
            20                  25                  30          


Val Glu Asp Gly Lys Glu Glu Lys Glu Phe Xaa Ser Leu Glu Gly Lys 
        35                  40                  45              


Gly Trp Glu Ala Trp Ser Asp Asp Gly Trp Thr Pro Ile Lys Asn Val 
    50                  55                  60                  


Ile Arg His Glu Leu Ala Pro His Lys Lys Val Phe Arg Val Leu Thr 
65                  70                  75                  80  


His Thr Gly Tyr Val Glu Val Thr Asp Asp His Ser Leu Leu Asn Asn 
                85                  90                  95      


Glu Lys Gly Ile Ile Lys Thr Asn Glu Ile Lys Asn Gly Gln Leu Leu 
            100                 105                 110         


Leu Gln Asn Glu Tyr Ile Glu Leu Asp Ser Lys Ile Asn Thr Ile Ser 
        115                 120                 125             


Glu Asp Glu Ala Arg Ile Met Gly Phe Phe Phe Gly Asp Gly Ser Xaa 
    130                 135                 140                 


Gly Thr Tyr Asn Xaa Lys Ser Gly Lys Lys Ser Ser Trp Ala Leu Asn 
145                 150                 155                 160 


Asn Asn Asn Leu Ile Arg Leu Lys Lys Tyr Gln Asn Leu Phe Tyr Lys 
                165                 170                 175     


Asn Lys Asn Lys Val Ile Pro Asp Xaa Ile Phe Lys Ser Asn Lys Asn 
            180                 185                 190         


Val Arg Lys Ala Phe Phe Glu Gly Leu Tyr Asp Ala Asp Gly Asp Lys 
        195                 200                 205             


Xaa Gly Tyr Xaa Thr Arg Ile Asp Gln Lys Ser Met Ile Ser Xaa Ala 
    210                 215                 220                 


Tyr Ile Tyr Lys Leu Gly Lys Ser Leu Asp Tyr Asn Val Ser Ile Asn 
225                 230                 235                 240 


Val His Ser Lys Lys Lys Asn Ile Phe Arg Leu Thr Phe Thr Thr Lys 
                245                 250                 255     


Lys Gln Arg Lys Ile Val Asn Ala Val Lys Lys Leu Met Lys Ser Asn 
            260                 265                 270         


Asn Gly Tyr Val Tyr Asp Leu Thr Thr Glu Ser Ser Leu Ser Ser Arg 
        275                 280                 285             


Ile Gly Ser Ile Ile Val His Asn 
    290                 295     


<210>  167
<211>  371
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Chrysochromulina ericina virus 
       isolate 01


<220>
<221>  misc_feature
<222>  (20)..(20)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (42)..(42)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (58)..(58)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (112)..(112)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (142)..(142)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (147)..(147)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (172)..(172)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (196)..(196)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (271)..(271)
<223>  Xaa can be any naturally occurring amino acid

<400>  167

Ser Val Ala Ser Tyr Thr Pro Ile Tyr Val Arg Tyr Asn Lys Ser Ile 
1               5                   10                  15      


Ile Asp Ile Xaa Ser Val Glu Glu Leu Ala Glu Lys Tyr Gly Asn Gly 
            20                  25                  30          


Trp His Leu Glu Ser Pro Lys Glu Tyr Xaa Glu Leu Asn Asn Ile Glu 
        35                  40                  45              


Ser Trp Thr Glu Asn Gly Trp Thr Glu Xaa His Arg Val Ile Arg His 
    50                  55                  60                  


Arg Leu Ala Pro Tyr Lys Lys Met Val Arg Ile Leu Thr His Thr Gly 
65                  70                  75                  80  


Leu Val Asp Val Thr Asp Asp His Ser Leu Val Lys Asn Thr Gly Glu 
                85                  90                  95      


Glu Ile Ser Pro Lys Asp Val Ser Ile Gly Thr Lys Leu Leu His Xaa 
            100                 105                 110         


Thr Met Ser Glu Asn Glu Ser Asn Ile Glu Ser Asp Ile Ser Ile Asp 
        115                 120                 125             


Glu Ala Arg Ile Met Gly Phe Phe Phe Gly Asp Gly Ser Xaa Gly Ile 
    130                 135                 140                 


Tyr Asp Xaa Pro Ser Gly His Lys Ala Ser Trp Ala Leu Asn Asn Ser 
145                 150                 155                 160 


Asn Lys Glu Leu Ile Glu Lys Tyr Tyr Asn Leu Xaa Lys Ser Val Tyr 
                165                 170                 175     


Pro Glu Phe Glu Trp Lys Val Tyr Asp Thr Leu Asn Ser Ser Gly Val 
            180                 185                 190         


Tyr Lys Ile Xaa Phe Asn Lys Lys Ser Gly Ser Lys Ser Lys Ile Gln 
        195                 200                 205             


Phe Ile Glu Lys Tyr Arg Ser Met Leu Tyr Asn Lys Lys Ser Lys Ile 
    210                 215                 220                 


Ile Pro Ser Glu Ile Ile Asn Gly Ser Ile Glu Leu Arg Lys Ser Phe 
225                 230                 235                 240 


Trp Glu Gly Leu Tyr Asp Ala Asp Gly Asp Lys Asp Lys Asn Gly Tyr 
                245                 250                 255     


Thr Arg Ile Asp Gln Lys Ser Gln Ile Ser Ala Ala Tyr Ile Xaa Trp 
            260                 265                 270         


Leu Ala Asn Ser Ile Gly Tyr Lys Thr Ser Leu Asn Ile Arg Asp Asp 
        275                 280                 285             


Lys Thr Asp Ile Tyr Arg Ile Thr Ala Thr Lys Asn Lys Gln Arg Arg 
    290                 295                 300                 


Asp Gly Asp Lys Ile Lys Lys Ile Val Asn Ile Gln Asn Ser Ala Asn 
305                 310                 315                 320 


Ile Gln Asn Ser Ala Asn Ile Gln Asn Ser Val Asn Ile Gln Asn Ser 
                325                 330                 335     


Val Asn Ile Gln Asn Ser Lys Asp Asn Gln Asp Tyr Val Tyr Asp Leu 
            340                 345                 350         


Thr Thr Glu Asn His His His Phe Ala Ala Gly Ile Gly Asn Met Ile 
        355                 360                 365             


Val His Asn 
    370     


<210>  168
<211>  348
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Antarctica Ace lake environmental
       sampling


<220>
<221>  misc_feature
<222>  (36)..(36)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (47)..(47)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (149)..(149)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (154)..(154)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (179)..(179)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (277)..(277)
<223>  Xaa can be any naturally occurring amino acid

<400>  168

Ser Val Ala Lys Tyr Thr Pro Val Tyr Val Lys Val Asp Gly Lys Leu 
1               5                   10                  15      


Gln Ile Val Glu Met Glu Lys Leu Ala Glu Gln Tyr Gly Gly Asn Gln 
            20                  25                  30          


Trp Thr Thr Xaa Leu Glu Glu Gly Lys Gln Glu Lys Glu Phe Xaa Glu 
        35                  40                  45              


Leu Tyr Gly Val Glu Thr Trp Thr Asp Lys Gly Trp Thr Lys Leu His 
    50                  55                  60                  


Arg Ile Ile Arg His Gln Leu Ala Ser His Lys Lys Met Ile Arg Ile 
65                  70                  75                  80  


Leu Thr His Thr Gly Met Val Asp Val Thr Asp Asp His Ser Leu Val 
                85                  90                  95      


Leu Glu Asp Gly Asn Glu Ile Ser Pro Lys Glu Val Asp Ile Gly Thr 
            100                 105                 110         


Lys Leu Leu His Lys Thr Leu Glu Tyr Glu Ser Pro Gln Phe Glu Val 
        115                 120                 125             


Glu Asn Asn Ile Ser Ala Asp Met Ala Lys Ile Tyr Gly Phe Phe Phe 
    130                 135                 140                 


Gly Asp Gly Ser Xaa Gly Ile Tyr Asp Xaa Pro Ser Gly Lys Lys Ala 
145                 150                 155                 160 


Ser Trp Ala Leu Asn Asn Ala Asn Glu Glu Leu Leu Asp Lys Tyr Ile 
                165                 170                 175     


Lys Leu Xaa Arg Lys Ser Tyr Pro Glu Phe Asp Trp Gln Ile Tyr Asp 
            180                 185                 190         


Thr Ile Glu Ser Ser Gly Val Tyr Lys Leu Thr Phe Asn Gly Asn Val 
        195                 200                 205             


Tyr Gly Asn Lys Ser Lys Phe Ile Glu Thr Tyr Arg Lys His Met Tyr 
    210                 215                 220                 


Ser Gly Lys Ser Lys Ile Ile Pro Asp Phe Ile Leu Asn Gly Thr His 
225                 230                 235                 240 


Glu Ile Arg Glu Ala Phe Trp Glu Gly Leu Tyr Asp Ala Asp Gly Asp 
                245                 250                 255     


Lys Asp Lys Asn Gly Tyr Val Arg Ile Asp Gln Lys Asn Gln Ile Ser 
            260                 265                 270         


Ala Ala His Ile Xaa Trp Leu Ala Asn Ser Ile Gly Tyr Lys Thr Ser 
        275                 280                 285             


Ile Asn Thr Arg Thr Asp Lys Gln Asn Ile Tyr Arg Ile Thr Ala Thr 
    290                 295                 300                 


Lys Gly Ala Gln Arg Lys Glu Gly Asn Ala Ile Lys Lys Leu Met Glu 
305                 310                 315                 320 


Leu Asp Tyr His Asp Tyr Val Tyr Asp Leu Thr Thr Glu Asn His His 
                325                 330                 335     


Phe Ala Ala Gly Ile Gly Asn Met Ile Val His Asn 
            340                 345             


<210>  169
<211>  290
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein (partial) from Antarctica Ace lake 
       environmental sampling


<220>
<221>  misc_feature
<222>  (36)..(36)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (47)..(47)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (129)..(129)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (150)..(150)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (155)..(155)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (180)..(180)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (183)..(183)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (223)..(223)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (280)..(280)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (290)..(290)
<223>  Xaa can be any naturally occurring amino acid

<400>  169

Ser Val Ala Lys Tyr Thr Pro Val Tyr Val Arg Ala Asn Gly Gln Leu 
1               5                   10                  15      


Gln Ile Val Glu Met Glu Gln Leu Ala Glu Lys Tyr Gly Gly Asn Asn 
            20                  25                  30          


Trp Ser Lys Xaa Ile Glu Glu Gly Lys Gln Glu Lys Glu Phe Xaa Glu 
        35                  40                  45              


Leu Thr Asn Ile Glu Thr Trp Thr Asn Lys Gly Trp Thr Lys Leu Tyr 
    50                  55                  60                  


Arg Ile Ile Arg His Lys Leu Ala Ser His Lys Lys Met Val Arg Ile 
65                  70                  75                  80  


Leu Thr His Thr Gly Met Val Asp Val Thr Asp Asp His Ser Leu Leu 
                85                  90                  95      


Leu Glu Asp Gly Ser Glu Val Ser Pro Lys Asn Val Glu Ile Gly Thr 
            100                 105                 110         


Lys Leu Leu His Lys Thr Leu Lys His Asp Ala Leu Lys Pro Asn Ala 
        115                 120                 125             


Xaa Leu Asp Thr Val Ser Val Asp Met Ala Lys Ile Tyr Gly Phe Phe 
    130                 135                 140                 


Phe Gly Asp Gly Ser Xaa Gly Ile Tyr Asn Xaa Pro Ser Gly Lys Lys 
145                 150                 155                 160 


Ala Thr Trp Ala Leu Asn Asn Ser Asn Val Glu Leu Leu Asp Lys Tyr 
                165                 170                 175     


Ile Asn Leu Xaa Lys Lys Xaa Tyr Pro Glu Phe Thr Trp Gln Thr Tyr 
            180                 185                 190         


Asn Thr Ile Glu Ser Ser Gly Leu Tyr Lys Ile Thr Phe Asn Ser Asn 
        195                 200                 205             


Glu Tyr Gly Thr Lys Ser Arg Phe Ile Glu Thr Tyr Arg Asn Xaa Met 
    210                 215                 220                 


Tyr Thr Gly Asn Ser Lys Ile Ile Pro Asp Phe Ile Leu Asn Gly Ser 
225                 230                 235                 240 


Asn Glu Ile Arg Glu Ala Phe Trp Glu Gly Met Tyr Asp Ala Ala Ala 
                245                 250                 255     


Asp Gly Asp Lys Asp Glu Asn Gly Tyr Val Arg Ile Asp Gln Lys Asn 
            260                 265                 270         


Gln Ile Ser Ala Ala His Ile Xaa Trp Leu Ala Asn Ser Met Asp Ile 
        275                 280                 285             


Arg Xaa 
    290 


<210>  170
<211>  328
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Bathycoccus species RCC1105 virus
       BpV2


<220>
<221>  misc_feature
<222>  (19)..(19)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (107)..(107)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (114)..(114)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (136)..(136)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (141)..(141)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (166)..(166)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (213)..(213)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (299)..(299)
<223>  Xaa can be any naturally occurring amino acid

<400>  170

Ser Val Thr Pro Asp Thr Pro Leu Leu Ile Arg Gln Asn Gly Thr Val 
1               5                   10                  15      


His Thr Xaa Arg Ile Asp Ser Leu Val Asn Glu Tyr Thr Leu Arg Asp 
            20                  25                  30          


Asp Gly Lys Glu Ile Gly Tyr Ile Asn Ala Glu Val Trp Thr Glu Asn 
        35                  40                  45              


Gly Phe Thr Ser Ile Gln Gln Ile Val Arg His Lys Thr Asn Lys Asn 
    50                  55                  60                  


Ile His Arg Val Val Thr His Thr Gly Ile Val Asp Val Thr Glu Asp 
65                  70                  75                  80  


His Ser Leu Leu Leu Glu Asn Lys Glu Ile Ala Lys Pro Thr Gln Val 
                85                  90                  95      


Gly Val Gly Thr Ala Leu Leu His Gly Asn Xaa Val Glu Ser Ile Asp 
            100                 105                 110         


Thr Xaa Thr Asp Thr Ser Ile Thr Lys Glu Glu Ala Lys Val Met Gly 
        115                 120                 125             


Phe Phe Phe Gly Asp Gly Ser Xaa Gly Thr Tyr Gln Xaa Arg Ser Gly 
    130                 135                 140                 


Val Lys Ser Thr Trp Ala Leu Asn Asn Ser Lys Leu Glu Tyr Leu Glu 
145                 150                 155                 160 


Glu Met Gln Lys Leu Xaa Pro Phe Glu Thr Lys Ile Tyr Asp Thr Ile 
                165                 170                 175     


Lys Ser Ser Gly Val Tyr Lys Leu Asn Ala Lys Gly Leu Val Val Asp 
            180                 185                 190         


Ile Val Asn Lys Tyr Arg Asn Leu Phe Tyr Asn Ser His Lys Glu Lys 
        195                 200                 205             


Val Val Pro Ser Xaa Ile Leu Asn Ala Pro Leu Glu Ile Ile Lys Ser 
    210                 215                 220                 


Phe Val Asp Gly Tyr Tyr Met Ala Asp Gly Asp Lys Asp Lys Asn Gly 
225                 230                 235                 240 


Tyr Thr Arg Met Asp Val Lys Gly Lys Glu Gly Ser Met Gly Met Tyr 
                245                 250                 255     


Met Leu Gly Arg Lys Leu Gly Tyr Asn Val Ser Ile Asn Thr Arg Thr 
            260                 265                 270         


Asp Lys Val Asn Val Phe Arg Gln Thr Trp Thr Lys Ser Leu Gln Arg 
        275                 280                 285             


Lys Ser Pro Thr Lys Ile Lys Lys Leu Glu Xaa Leu Gly Glu Thr Asp 
    290                 295                 300                 


Gly Tyr Val Tyr Asp Leu Thr Thr Lys Ser His His Phe His Val Gly 
305                 310                 315                 320 


Pro Gly Asp Leu Val Val His Asn 
                325             


<210>  171
<211>  323
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from unknown phycodnavirus KBvp-16


<220>
<221>  misc_feature
<222>  (19)..(19)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (94)..(94)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (107)..(107)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (135)..(135)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (161)..(161)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (189)..(189)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (208)..(208)
<223>  Xaa can be any naturally occurring amino acid

<400>  171

Ser Val Thr Pro Asp Thr Pro Leu Leu Leu Arg Ile Lys Gly Glu Val 
1               5                   10                  15      


Lys Thr Xaa Arg Ile Asp Ser Leu Val Glu Ser Tyr Glu Glu Arg Asp 
            20                  25                  30          


Asp Gly Lys Glu Val Ala Glu Ile Asp Ala Glu Val Trp Thr Glu Lys 
        35                  40                  45              


Gly Phe Thr Pro Ile Gln Gln Ile Val Arg His Lys Thr Thr Lys Asn 
    50                  55                  60                  


Ile His Arg Val Leu Thr His Thr Gly Val Val Asp Val Thr Glu Asp 
65                  70                  75                  80  


His Ser Leu Leu Leu Glu Asn Lys Gln Met Ile Lys Pro Xaa Glu Val 
                85                  90                  95      


Ser Leu Gly Thr Asn Leu Leu His Gly Asp Xaa Val Tyr Gly Leu Asn 
            100                 105                 110         


Trp Asn Asp Thr Thr Val Ser Val Asn Glu Ala Lys Val Met Gly Phe 
        115                 120                 125             


Phe Phe Gly Asp Gly Ser Xaa Gly His Tyr Gly Asp Lys Tyr Thr Trp 
    130                 135                 140                 


Ala Leu Asn Asn Ser Asn Val Asp Tyr Leu Ile Glu Met Gln Asn Leu 
145                 150                 155                 160 


Xaa Pro Phe Glu Thr Ser Ile Tyr Asp Thr Ile Glu Ser Ser Gly Val 
                165                 170                 175     


Tyr Lys Leu Asn Ala Lys Gly Asp Val Lys Asn Ile Xaa Glu Arg Tyr 
            180                 185                 190         


Arg Ser Met Phe Tyr Asn Ala His Lys Glu Lys Ile Val Pro Ser Xaa 
        195                 200                 205             


Ile Leu Asn Ala Pro Ile Glu Val Val Glu Ser Phe Trp Glu Gly Tyr 
    210                 215                 220                 


Tyr Met Ala Asp Gly Asp Lys Asp Val His Gly Tyr Thr Arg Met Asp 
225                 230                 235                 240 


Ile Lys Gly Lys Glu Gly Ser Met Gly Met Phe Ile Leu Gly Lys Arg 
                245                 250                 255     


Leu Asn Tyr Asn Val Ser Leu Asn Thr Arg Lys Asp Lys Pro Asp Val 
            260                 265                 270         


Phe Arg Gln Thr Trp Thr Lys Ser Thr Gln Arg Lys Ser Pro Asn Ala 
        275                 280                 285             


Ile Lys Lys Leu Glu Leu Val Gly Glu Thr Glu Gly Tyr Val Tyr Asp 
    290                 295                 300                 


Leu Thr Thr Glu Ser His His Phe His Ile Gly Pro Gly Asp Leu Val 
305                 310                 315                 320 


Val His Asn 
            


<210>  172
<211>  437
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Haloferax alexandrinus JCM 10717


<220>
<221>  misc_feature
<222>  (395)..(395)
<223>  Xaa can be any naturally occurring amino acid

<400>  172

Ser Val Thr Gly Asp Arg Pro Val Val Val Arg Asp Pro Gly Gly Thr 
1               5                   10                  15      


Val Arg Ile Leu Pro Ile Glu Asp Leu Phe Ala Arg Gly Thr Thr Glu 
            20                  25                  30          


Ser Glu Val Leu Ile Ala Ala Asp Gly Asp Val Val Ala Ser Ala Thr 
        35                  40                  45              


Pro Gly Lys Thr Arg Arg Ala Leu Asp Gly Trp Asp Ala Leu Ser Val 
    50                  55                  60                  


Asn Glu Ala Gly Glu Ala Glu Trp Gln Pro Ile Ala Gln Ala Ile Arg 
65                  70                  75                  80  


His Lys Thr Asp Lys Pro Val Val Asn Leu Gln His Lys Phe Gly Glu 
                85                  90                  95      


Ser Thr Thr Thr Arg Asp His Ser Tyr Val Val Pro Gly Glu His Gly 
            100                 105                 110         


Leu Thr Thr Val Ser Pro Asp Asp Val Ala Glu Pro Tyr Arg Val Ser 
        115                 120                 125             


Gly Val Pro Asp Val Glu Pro Val Glu Gln Val Asp Val Tyr Glu Val 
    130                 135                 140                 


Leu Arg Gly Tyr Glu Arg Glu Tyr Glu Asp Gly Arg Ser Val Gly Ser 
145                 150                 155                 160 


Asp Asn Ser Ile Thr Lys Arg Lys Gln Ile His Ala Asp Asp Glu Tyr 
                165                 170                 175     


Val Trp Phe Gly His Glu His His Arg Asp Val Asp Ser Thr Val Lys 
            180                 185                 190         


Val Lys Arg Phe Val Asp Ile Asp Ser Glu Asp Gly Ala Ala Leu Ile 
        195                 200                 205             


Arg Leu Leu Gly Ala Tyr Val Ser Glu Gly Ser Ala Ser Thr Gly Glu 
    210                 215                 220                 


Thr Ala Thr Ser Lys Phe Gly Ala Ser Ile Ala Glu Ser Asp Arg Glu 
225                 230                 235                 240 


Trp Leu Ala Gln Leu Gln Arg Asp Tyr Ser Arg Leu Phe Glu Asn Thr 
                245                 250                 255     


Thr Ala Asp Ile Ile Thr Ser Asp Arg Arg Ala Glu Arg Thr Val Glu 
            260                 265                 270         


Tyr Gln Thr Asp Thr Gly Gly Ala Ser Val Thr Tyr Asn Asp Glu Thr 
        275                 280                 285             


Leu Lys Leu Gln Met Met Asn Glu Leu Ala Ala Val Phe Phe Arg Glu 
    290                 295                 300                 


Phe Ala Gly Gln Thr Ser Arg Gly Lys Arg Ile Pro Ser Phe Val Phe 
305                 310                 315                 320 


His Leu Pro Glu Glu Lys Gln Asp Leu Phe Leu Thr Leu Leu Val Glu 
                325                 330                 335     


Gly Asp Gly Ser Arg Glu Phe Pro Arg Tyr Thr Glu Ala Tyr Ala Gln 
            340                 345                 350         


Arg Asn Phe Asp Phe Glu Thr Thr Ser Arg Glu Leu Ala Ala Gly Leu 
        355                 360                 365             


Ser Met Leu Leu Thr Gln Arg Gly Gln Lys His Ser Leu Lys Tyr Arg 
    370                 375                 380                 


Asp Ser Lys Asp Ser Tyr Thr Ile Arg Thr Xaa Ser Thr Tyr Arg Glu 
385                 390                 395                 400 


Gly Arg Asp Pro Val Leu Thr Glu Val Asp His Asp Gly Tyr Val Tyr 
                405                 410                 415     


Asp Leu Ser Val Glu Glu Asn Glu Asn Phe Val Asp Gly Val Gly Gly 
            420                 425                 430         


Ile Val Leu His Asn 
        435         


<210>  173
<211>  437
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Haloferax prahovense DSM 18310


<220>
<221>  misc_feature
<222>  (395)..(395)
<223>  Xaa can be any naturally occurring amino acid

<400>  173

Ser Val Thr Gly Asp Arg Pro Val Val Val Arg Asp Pro Gly Gly Thr 
1               5                   10                  15      


Val Arg Ile Leu Pro Ile Glu Asp Leu Phe Ala Arg Gly Thr Thr Glu 
            20                  25                  30          


Ser Glu Val Leu Ile Ala Ala Asp Gly Asp Val Val Ala Ser Ala Thr 
        35                  40                  45              


Pro Gly Lys Thr Arg Arg Ala Leu Asp Gly Trp Asp Ala Leu Ser Val 
    50                  55                  60                  


Asn Glu Asp Gly Glu Ala Glu Trp Gln Pro Ile Ala Gln Ala Ile Arg 
65                  70                  75                  80  


His Asn Thr Asp Lys Pro Val Val Asn Leu Gln His Lys Phe Gly Glu 
                85                  90                  95      


Ser Thr Thr Thr Arg Asp His Ser Tyr Val Val Pro Gly Glu Asp Gly 
            100                 105                 110         


Leu Thr Thr Val Ser Pro Asp Asp Val Ala Glu Pro Tyr Arg Val Ser 
        115                 120                 125             


Gly Val Pro Asp Val Glu Pro Val Glu Gln Val Asp Val Tyr Glu Val 
    130                 135                 140                 


Leu Arg Gly Tyr Glu Arg Glu Tyr Glu Asp Gly Arg Ser Val Gly Ser 
145                 150                 155                 160 


Asp Asn Ser Ile Thr Lys Arg Lys Gln Ile His Ala Asp Asp Glu Tyr 
                165                 170                 175     


Val Trp Phe Gly His Glu His His Arg Asp Val Asp Ser Thr Val Lys 
            180                 185                 190         


Val Lys Arg Phe Val Asp Ile Asp Ser Glu Asp Gly Ala Ala Leu Ile 
        195                 200                 205             


Arg Leu Leu Gly Ala Tyr Val Pro Glu Gly Ser Ala Ser Thr Gly Glu 
    210                 215                 220                 


Thr Ala Thr Ser Glu Phe Gly Ala Ser Ile Ala Glu Ser Asp Arg Glu 
225                 230                 235                 240 


Trp Leu Ala Gln Leu Gln Arg Asp Tyr Ser Arg Leu Phe Glu Asn Thr 
                245                 250                 255     


Thr Ala Gly Ile Ile Thr Ser Asp Arg Arg Ala Glu Arg Thr Val Glu 
            260                 265                 270         


Tyr Gln Thr Asp Thr Gly Gly Ala Ser Val Thr Tyr Asn Asp Glu Thr 
        275                 280                 285             


Leu Lys Leu Gln Met Met Asn Glu Leu Ala Ala Val Phe Phe Arg Glu 
    290                 295                 300                 


Phe Ala Gly Gln Thr Ser Arg Gly Lys Arg Ile Pro Ser Phe Val Phe 
305                 310                 315                 320 


His Leu Pro Glu Glu Lys Gln Asp Leu Phe Leu Thr Leu Leu Val Glu 
                325                 330                 335     


Gly Asp Gly Ser Arg Glu Phe Pro Arg Tyr Thr Glu Ala Tyr Ala Gln 
            340                 345                 350         


Arg Asn Phe Asp Phe Glu Thr Thr Ser Arg Glu Leu Ala Ala Gly Leu 
        355                 360                 365             


Ser Met Leu Leu Thr Gln Arg Gly Gln Lys His Ser Leu Lys Tyr Arg 
    370                 375                 380                 


Asp Ser Lys Asp Ser Tyr Thr Ile Arg Thr Xaa Ser Thr Tyr Arg Glu 
385                 390                 395                 400 


Gly Arg Asp Pro Val Leu Thr Glu Ala Asp His Asp Gly Tyr Val Tyr 
                405                 410                 415     


Asp Leu Ser Val Glu Glu Asn Glu Asn Phe Val Asp Gly Val Gly Gly 
            420                 425                 430         


Ile Val Leu His Asn 
        435         


<210>  174
<211>  437
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Natronomonas moolapensis 8.8.11


<220>
<221>  misc_feature
<222>  (204)..(204)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (208)..(208)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (228)..(228)
<223>  Xaa can be any naturally occurring amino acid

<400>  174

Ser Val Ser Gly Asp Arg Pro Val Val Val Arg Asp Pro Asp Gly Thr 
1               5                   10                  15      


Ile Arg Ile Val Pro Ile Glu Ala Leu Phe Asp Arg Ala Glu Lys Arg 
            20                  25                  30          


Pro Asp Asp Arg Val Phe Val Ala Ala Asp Gly Asp Thr Gln Ile Asp 
        35                  40                  45              


Asp Gly Ser Arg Lys Glu Phe Ala Asp Leu Asp Gly Trp Asp Ala Leu 
    50                  55                  60                  


Ser Leu Ser Glu Glu Gly Thr Ala Gly Trp Arg Pro Ile Glu Arg Val 
65                  70                  75                  80  


Ile Arg His Arg Thr Asp Arg Pro Val Val Asn Leu Gln His Lys Phe 
                85                  90                  95      


Gly Glu Ser Thr Thr Thr Arg Asn His Ser Tyr Ile Val Asp Asp Asn 
            100                 105                 110         


Ser Lys Tyr Val Glu Ala Thr Pro Glu Glu Val Asp Glu Pro Leu Arg 
        115                 120                 125             


Ile Pro Asp Leu Pro Ala Glu Thr Arg Asn Asp Ala Ile Asp Val Tyr 
    130                 135                 140                 


Glu Val Leu Ser Gly Tyr Glu Arg Glu Tyr Glu Asp Gly Arg Gly Thr 
145                 150                 155                 160 


Gly Gly Thr Thr Ile Lys Thr Lys Arg Val His Ala Asn Asp Glu Tyr 
                165                 170                 175     


Val Trp Phe Gly His Asp His Tyr Gly Glu Leu Asp Ser Thr Val Lys 
            180                 185                 190         


Val Lys Arg Tyr Ile Arg Pro Gly Thr Gln Glu Xaa Val Ser Leu Xaa 
        195                 200                 205             


Arg Leu Leu Ala Ala Tyr Val Thr Glu Gly Ser Ala Thr Thr Lys Glu 
    210                 215                 220                 


Thr Ser Asp Xaa Arg Tyr Gly Ala Ser Ile Ala Glu Ser Arg Arg Arg 
225                 230                 235                 240 


Trp Leu Val Gly Leu Arg Asp Asp Tyr Tyr Arg Leu Phe Glu Asn Thr 
                245                 250                 255     


Thr Ala Ser Val Ile Asp Asn Asp Ser Ser Asp Glu Arg Thr Ile Glu 
            260                 265                 270         


Tyr Arg Thr Asp His Gly Asp Thr Ala Val Ser Tyr Asp Asp Gly Thr 
        275                 280                 285             


Lys Lys Leu Gln Met Met Asn Glu Leu Ala Ala Val Phe Phe Arg Glu 
    290                 295                 300                 


Phe Ala Gly Gln Thr Ser Arg Glu Lys Arg Leu Pro Ser Phe Val Tyr 
305                 310                 315                 320 


His Leu Gln Asp Asp Glu Gln Asp Leu Phe Leu Glu Thr Leu Ile Glu 
                325                 330                 335     


Gly Asp Gly Ser Arg Glu Phe Pro Arg Tyr Ser Asp Ala Tyr Ala Glu 
            340                 345                 350         


Arg Asn Phe Asp Phe Glu Thr Ile Ser Arg Glu Leu Ala Ala Gly Leu 
        355                 360                 365             


Ser Met Leu Leu Ile Gln Arg Gly Lys Lys His Ser Leu Lys Tyr Arg 
    370                 375                 380                 


Asp Ala Lys Asp Ser Tyr Thr Ile Arg Thr Val Asp Ser Tyr Arg Arg 
385                 390                 395                 400 


Gly Arg Asp Pro Val Leu Arg Glu Val Asp His Asp Gly Tyr Val Tyr 
                405                 410                 415     


Asp Leu Ser Val Ala Glu Asn Asp Asn Phe Val Asp Gly Val Gly Gly 
            420                 425                 430         


Val Val Leu His Asn 
        435         


<210>  175
<211>  437
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Haloferax volcanii DS2 ATCC29605


<220>
<221>  misc_feature
<222>  (395)..(395)
<223>  Xaa can be any naturally occurring amino acid

<400>  175

Ser Val Thr Gly Asp Arg Pro Val Val Val Arg Asp Pro Gly Gly Thr 
1               5                   10                  15      


Val Arg Ile Leu Pro Ile Glu Asp Leu Phe Ala Arg Gly Thr Thr Glu 
            20                  25                  30          


Ser Glu Val Leu Ile Ala Ala Asp Gly Asp Val Val Ala Ser Ala Thr 
        35                  40                  45              


Pro Gly Lys Thr Arg Arg Ala Leu Asp Gly Trp Asp Ala Leu Ser Val 
    50                  55                  60                  


Asn Glu Asp Gly Glu Ala Glu Trp Gln Pro Ile Ala Gln Ala Ile Arg 
65                  70                  75                  80  


His Asn Thr Asp Lys Pro Val Val Asn Leu Gln His Lys Phe Gly Glu 
                85                  90                  95      


Ser Thr Thr Thr Arg Asp His Ser Tyr Val Val Pro Gly Glu Asp Gly 
            100                 105                 110         


Leu Thr Thr Val Ser Pro Asp Asp Val Ala Glu Pro Tyr Arg Val Ser 
        115                 120                 125             


Gly Val Pro Asp Val Glu Pro Val Glu Gln Val Asp Val Tyr Glu Val 
    130                 135                 140                 


Leu Arg Gly Tyr Glu Arg Glu Tyr Glu Asp Gly Arg Ser Val Gly Ser 
145                 150                 155                 160 


Asp Asn Ser Ile Thr Lys Arg Lys Gln Ile His Ala Asp Asp Glu Tyr 
                165                 170                 175     


Val Trp Phe Gly His Glu His His Arg Asp Val Asp Ser Thr Val Lys 
            180                 185                 190         


Val Lys Arg Phe Val Asp Ile Asp Ser Glu Asp Gly Ala Ala Leu Ile 
        195                 200                 205             


Arg Leu Leu Gly Ala Tyr Val Pro Glu Gly Ser Ala Ser Thr Gly Glu 
    210                 215                 220                 


Thr Ala Thr Ser Lys Phe Gly Ala Ser Leu Ala Glu Ser Asp Arg Glu 
225                 230                 235                 240 


Trp Leu Ala Gln Leu Gln Arg Asp Tyr Ser Arg Leu Phe Glu Asn Thr 
                245                 250                 255     


Thr Ala Gly Ile Ile Thr Ser Asp Arg Arg Ala Glu Arg Thr Val Glu 
            260                 265                 270         


Tyr Gln Thr Asp Thr Gly Gly Ala Ser Val Thr Tyr Asn Asp Glu Thr 
        275                 280                 285             


Leu Lys Leu Gln Met Met Asn Glu Leu Ala Ala Val Phe Phe Arg Glu 
    290                 295                 300                 


Phe Ala Gly Gln Thr Ser Arg Gly Lys Arg Ile Pro Ser Phe Val Phe 
305                 310                 315                 320 


His Leu Pro Glu Glu Lys Gln Asp Leu Phe Leu Thr Leu Leu Val Glu 
                325                 330                 335     


Gly Asp Gly Ser Arg Glu Phe Pro Arg Tyr Thr Glu Ala Tyr Ala Gln 
            340                 345                 350         


Arg Asn Phe Asp Phe Glu Thr Thr Ser Arg Glu Leu Ala Ala Gly Leu 
        355                 360                 365             


Ser Met Leu Leu Thr Gln Arg Gly Gln Lys His Ser Leu Lys Tyr Arg 
    370                 375                 380                 


Asp Ser Lys Asp Ser Tyr Thr Ile Arg Thr Xaa Ser Thr Tyr Arg Glu 
385                 390                 395                 400 


Gly Arg Asp Pro Val Leu Thr Glu Ala Asp His Asp Gly Tyr Val Tyr 
                405                 410                 415     


Asp Leu Ser Val Glu Glu Asn Glu Asn Phe Val Asp Gly Val Gly Gly 
            420                 425                 430         


Ile Val Leu His Asn 
        435         


<210>  176
<211>  431
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Haloquadratum walsbyi DSM 16790


<220>
<221>  misc_feature
<222>  (169)..(169)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (233)..(233)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (247)..(247)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (279)..(279)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (369)..(369)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (396)..(396)
<223>  Xaa can be any naturally occurring amino acid

<400>  176

Ser Val Thr Gly Asp Arg Pro Val Val Val Arg Asp Pro Ser Asp Tyr 
1               5                   10                  15      


Ile Gln Ile Val Pro Ile Lys Leu Leu Phe Glu Gln Ala Thr Ala Pro 
            20                  25                  30          


Glu Gln Asn Met Arg Leu Thr Ala Asp Gly Ala Pro Ser Val Asn Ser 
        35                  40                  45              


Glu Leu Pro Lys Glu Arg Arg His Leu Asp Gln Trp Glu Ala Leu Ser 
    50                  55                  60                  


Leu Ser Asp Thr Gly Glu Thr Glu Trp Gln Pro Ile Asn Gln Ile Ile 
65                  70                  75                  80  


Arg His Gln Thr Asp Lys Glu Ile Leu Thr Leu Gln His Glu Tyr Gly 
                85                  90                  95      


Glu Ser Thr Thr Thr Arg Asp His Ser Tyr Ile Thr Ala Asp Asp Gly 
            100                 105                 110         


Glu Tyr Val Glu Thr Ser Pro Glu Asn Val Asp Glu Pro Leu Pro Ile 
        115                 120                 125             


Pro Asn Ile Ala Ser Val Lys Thr Ile Glu Thr Ile Asp Ile Tyr Gln 
    130                 135                 140                 


Thr Leu Thr Thr Asp Thr Gln Ala Gln Ile Gly Asn Asp Thr Glu Pro 
145                 150                 155                 160 


Asp Lys Trp Leu Pro Ser Ala Asp Xaa Ile His Ala Asn Asp Glu Tyr 
                165                 170                 175     


Val Trp Ile Gly Thr Thr Asp Lys Gln Gln Asp Arg Asp Asp Ser Thr 
            180                 185                 190         


Pro Ala Ile Pro Arg Tyr Ile Asp Leu Thr Ser Asp Thr Gly His Ala 
        195                 200                 205             


Leu Ile Arg Phe Leu Ala Val Tyr Leu Ser Asp Trp Ser Lys Ser Thr 
    210                 215                 220                 


Ile Thr Thr Thr Glu Arg Gly Gln Xaa Leu His Ile Thr Gly Pro Gln 
225                 230                 235                 240 


Glu Ser Ala Leu Lys Thr Xaa Ala Ala Asp Ala Asp Gln Leu Phe Thr 
                245                 250                 255     


His Ile Thr Pro Ser Ile Ala Val Asp Ala Glu Ser Asn Thr Asn Thr 
            260                 265                 270         


Val Asp Ser Gly Phe Arg Xaa His Ile Pro Thr Thr Leu Ala Thr Thr 
        275                 280                 285             


Leu Ile Ser Ala Phe Ala Gly His Pro Ala His Thr Lys Gln Ile Pro 
    290                 295                 300                 


Ser Ile Val Tyr His Leu Pro Ala Ala Glu Gln Ser Leu Phe Ile Arg 
305                 310                 315                 320 


His Leu Ile Gln Ala Glu Ser Thr Pro Glu Ser Asp Gly Val Ser Gly 
                325                 330                 335     


Arg Pro Gln Lys Ser Asp Lys Pro Ile Leu Leu Glu Asn Glu Phe Ile 
            340                 345                 350         


Thr Thr Asn Arg Glu Leu Ala Ala Gly Val Ser Met Leu Leu Thr Gln 
        355                 360                 365             


Xaa Gly Gln Ser Tyr Thr Ile Ser Lys Gln Asp Thr Lys Gly Ala Tyr 
    370                 375                 380                 


Thr Ile His Ile Asn Asn Ser Ser Ser Ser Gly Xaa Thr Pro Thr Leu 
385                 390                 395                 400 


Thr Glu Thr Thr His Ser Gly Tyr Val Tyr Asp Leu Ser Val Ala Thr 
                405                 410                 415     


Asn Gln Asn Phe Val Asp Gly Leu Gly Gly Leu Val Leu His Asn 
            420                 425                 430     


<210>  177
<211>  431
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Haloquadratum walsbyi DSM 16854


<220>
<221>  misc_feature
<222>  (169)..(169)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (233)..(233)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (247)..(247)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (279)..(279)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (369)..(369)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (396)..(396)
<223>  Xaa can be any naturally occurring amino acid

<400>  177

Ser Val Thr Gly Asp Arg Pro Val Val Val Arg Asp Pro Ser Asp Tyr 
1               5                   10                  15      


Ile Gln Ile Val Pro Ile Lys Leu Leu Phe Glu Gln Ala Thr Ala Pro 
            20                  25                  30          


Glu Gln Asn Ile Arg Leu Thr Ala Asp Gly Ala Pro Ser Gly Asn Ser 
        35                  40                  45              


Glu Phe Pro Lys Glu Arg Arg His Leu Asp Gln Trp Glu Ala Leu Ser 
    50                  55                  60                  


Leu Ser Asp Thr Gly Glu Thr Glu Trp Gln Pro Ile Asp Gln Ile Ile 
65                  70                  75                  80  


Arg His Gln Thr Asp Lys Glu Ile Leu Thr Leu Gln His Glu Tyr Gly 
                85                  90                  95      


Glu Ser Thr Thr Thr Arg Asp His Ser Tyr Ile Thr Ala Asp Asp Gly 
            100                 105                 110         


Gly Tyr Val Glu Thr Ser Pro Glu Asn Val Asp Glu Pro Leu Pro Ile 
        115                 120                 125             


Pro Asn Ile Ala Pro Val Lys Thr Val Glu Thr Ile Asp Ile Tyr Gln 
    130                 135                 140                 


Thr Leu Thr Thr Asp Thr Gln Ala Gln Ile Gly Ser Asp Thr Glu Pro 
145                 150                 155                 160 


Asp Lys Trp Leu Pro Ser Ala Asp Xaa Ile His Ala Asn Asp Glu Tyr 
                165                 170                 175     


Val Trp Ile Asp Thr Thr Asp Lys Gln Gln His Arg Asp Asp Ser Ile 
            180                 185                 190         


Pro Thr Ile Pro Arg Tyr Ile Asp Leu Ser Ser Asp Thr Gly His Ala 
        195                 200                 205             


Leu Ile Arg Phe Leu Ala Val Tyr Leu Ser Asp Trp Ser Glu Ser Thr 
    210                 215                 220                 


Ile Thr Thr Thr Glu Arg Gly Arg Xaa Leu His Ile Thr Gly Pro Gln 
225                 230                 235                 240 


Glu Ser Ala Leu Lys Thr Xaa Ala Ala Asp Ala Asp Gln Leu Phe Thr 
                245                 250                 255     


His Ile Thr Pro Ser Ile Thr Val Asp Ala Glu Ser Asn Thr Asn Thr 
            260                 265                 270         


Val Asp Ser Gly Phe Thr Xaa His Ile Pro Thr Thr Leu Ala Thr Ala 
        275                 280                 285             


Leu Ile Ser Ala Phe Ala Gly His Pro Ala His Thr Lys Gln Ile Pro 
    290                 295                 300                 


Ser Ile Val Tyr His Leu Pro Ala Ala Glu Gln Ser Leu Phe Ile Arg 
305                 310                 315                 320 


His Leu Ile Gln Ala Glu Ser Thr Pro Glu Ser Asp Gly Val Ser Gly 
                325                 330                 335     


Arg Pro Gln Lys Ser Asn Lys Pro Ile Leu Leu Glu Asn Glu Phe Ile 
            340                 345                 350         


Thr Thr Asn Arg Glu Leu Ala Ala Gly Val Ser Met Leu Leu Thr Gln 
        355                 360                 365             


Xaa Gly Gln Ser Tyr Thr Ile Ser Lys Gln Asp Thr Lys Gly Ala Tyr 
    370                 375                 380                 


Thr Ile His Ile Asn Asn Ser Ser Pro Ser Gly Xaa Thr Pro Thr Leu 
385                 390                 395                 400 


Thr Glu Thr Thr His Ser Gly Tyr Val Tyr Asp Leu Ser Val Ala Thr 
                405                 410                 415     


Asn Gln Asn Phe Val Asp Gly Leu Gly Gly Leu Val Leu His Asn 
            420                 425                 430     


<210>  178
<211>  439
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Halorubrum californiensis DSM 
       19288


<220>
<221>  misc_feature
<222>  (353)..(353)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (397)..(397)
<223>  Xaa can be any naturally occurring amino acid

<400>  178

Ser Val Thr Gly Gly Arg Pro Val Val Val Arg Asp Pro Glu Gly Ile 
1               5                   10                  15      


Val Arg Ile Leu Pro Ile Ala Asp Leu Phe Glu Arg Ser Asp Ala Thr 
            20                  25                  30          


Ala Ser Glu Asp Leu Ile Val Thr Ala Asp Gly Gly Pro Val Ala Ser 
        35                  40                  45              


Val Ser Ile Asp Lys Glu Arg Arg Arg Ala Gly Gly Trp Glu Ala Leu 
    50                  55                  60                  


Ser Val Thr Glu Asp Gly Glu Pro Glu Trp Gln Pro Ile Glu Gln Val 
65                  70                  75                  80  


Ile Arg His Glu Thr Asp Lys Ser Val Val Asn Leu Gln His Lys Phe 
                85                  90                  95      


Gly Glu Ser Thr Thr Thr Arg Asp His Ser Tyr Val Val Glu Glu Asp 
            100                 105                 110         


Gly Gln Leu Val Glu Thr Lys Pro Glu Asp Val Glu Glu Pro Leu Arg 
        115                 120                 125             


Val Pro Gly Leu Pro Glu Val Glu Thr Val Glu Lys Ile Asp Val Tyr 
    130                 135                 140                 


Asp Val Leu Glu Gly Tyr Thr Arg Glu Tyr Glu Asp Gly Arg Ser Val 
145                 150                 155                 160 


Gly Ser Glu Asn Ala Glu Thr Lys Val Lys Arg Val His Ala Asn Asp 
                165                 170                 175     


Glu Trp Val Trp Phe Gly His Lys His His Asn Ala Ile Glu Arg Ser 
            180                 185                 190         


Ile Lys Val Gln Arg Tyr Val Asp Leu Asp Ser Glu Asp Gly Arg Ala 
        195                 200                 205             


Leu Val Arg Leu Leu Ala Ala Tyr Val Thr Glu Gly Ser Ala Ser Thr 
    210                 215                 220                 


Ile Glu Thr Thr Asp Ser Arg Phe Gly Ala Ser Ile Ala Glu Ser Arg 
225                 230                 235                 240 


Thr Glu Trp Leu Glu Gly Leu Arg Glu Asp Tyr Gln Arg Leu Phe Asp 
                245                 250                 255     


Gly Ala Thr Ala Ser Val Ile Ala Ser Asp Ala Ser Asp Arg Arg Thr 
            260                 265                 270         


Val Glu Tyr Gly Thr Glu Asp Gly Asp Gln Ser Val Thr Tyr Asp Asp 
        275                 280                 285             


Gly Thr His Lys Leu Gln Met Met Asn Glu Leu Ser Ala Val Phe Phe 
    290                 295                 300                 


Arg Glu Phe Ala Gly Gln Thr Ser Arg Gly Lys Arg Ile Pro Gly Ser 
305                 310                 315                 320 


Val Phe Asn Leu Pro Glu Asp Leu Gln Gln Leu Phe Val Asp Val Leu 
                325                 330                 335     


Val Glu Gly Asp Gly Ser Arg Glu Phe Pro Arg Tyr Ser Thr Glu Tyr 
            340                 345                 350         


Xaa Glu Arg Asn Phe Asp Phe Glu Thr Thr Ser Arg Glu Leu Ala Ala 
        355                 360                 365             


Gly Leu Ser Thr Leu Leu Thr Gln Arg Gly Glu Lys His Ser Leu Lys 
    370                 375                 380                 


Tyr Arg Glu Ser Lys Gly Ser Tyr Thr Ile Arg Thr Xaa Asp Tyr Tyr 
385                 390                 395                 400 


Arg Ser Gly Arg Asp Pro Val Val Glu Glu Val Asp His Asp Gly Tyr 
                405                 410                 415     


Val Tyr Asp Leu Ser Val Ala Glu Asn Glu Asn Phe Val Asp Gly Val 
            420                 425                 430         


Gly Gly Val Val Leu His Asn 
        435                 


<210>  179
<211>  347
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Cafeteria roenbergensis virus 
       BV-PW1


<220>
<221>  misc_feature
<222>  (40)..(40)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (136)..(136)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (145)..(145)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (152)..(152)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (158)..(158)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (176)..(176)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (231)..(231)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (263)..(263)
<223>  Xaa can be any naturally occurring amino acid

<400>  179

Ser Val Ile Gly Asp Thr Pro Leu Leu Leu Lys Asn Lys Phe Thr Asn 
1               5                   10                  15      


Glu Ile Leu Ile Asn Lys Ile Lys Asp Leu Ser Ser Asn Trp Ser Asn 
            20                  25                  30          


Tyr His Asn Gly Lys Glu Ser Xaa Glu Ile Asp Thr Tyr Gln Thr Trp 
        35                  40                  45              


Thr Glu Thr Gly Trp Thr Asp Ile Lys Arg Val Ile Arg His Lys Leu 
    50                  55                  60                  


Glu Ser Asn Lys Lys Leu Leu Lys Ile Gln Thr His Asn Gly Glu Val 
65                  70                  75                  80  


Ile Val Thr Asp Glu His Ser Leu Leu Asn Lys Asn Gly Lys Thr Ile 
                85                  90                  95      


Asn Ala Lys Asn Val Lys Val Gly Asp Asn Ile Leu His Ser Phe Pro 
            100                 105                 110         


Ser Tyr Ile Asn Asn Ile Asp Asn Thr Asn Ser Ile Asn Tyr His Asn 
        115                 120                 125             


Lys Phe Tyr Asn Lys Lys Met Xaa Asn Glu Leu Ala Tyr Ile Leu Gly 
    130                 135                 140                 


Xaa Phe Met Lys Tyr Gly Leu Xaa Asp Ser Ser Lys Lys Xaa Phe Thr 
145                 150                 155                 160 


Ile Asn Asn Lys Asp Ile Asn Leu Ile Glu Ser Leu Lys Lys Met Xaa 
                165                 170                 175     


Glu Asn Ile Phe Asp Glu Phe Lys Trp Lys Ile Ser Ser Ser Ser His 
            180                 185                 190         


Leu Ser Asp Asn Ile Tyr Lys Leu Val Pro Phe Gln Asn Glu Ile Lys 
        195                 200                 205             


Leu Ile Asp Phe Ile Lys Tyr Phe Thr Asn Lys Met Tyr Asn Asn Gly 
    210                 215                 220                 


Glu Lys Lys Val Pro Gln Xaa Ile Leu Asn Ser Ser Lys Glu Tyr Ile 
225                 230                 235                 240 


Lys Ile Phe Leu Ile Gly Leu Tyr Pro Glu Tyr Lys Leu Glu Asn Asn 
                245                 250                 255     


Gln Gln Phe Ile Tyr Thr Xaa Lys Asn Asn Glu Phe Ser Leu Gly Ile 
            260                 265                 270         


Tyr Tyr Leu Ile Lys Lys Leu Gly Tyr His Val Lys Leu Asn Ser Asn 
        275                 280                 285             


Asp Ser Ser Asp Ser Ser Asp Ser Ile Tyr Thr Phe Glu Ile Ser His 
    290                 295                 300                 


Lys Leu Glu Asn Asn Asn Asn Val Ile Thr Lys Ile Thr Glu Trp Glu 
305                 310                 315                 320 


His Lys Glu Thr Tyr Val Tyr Asp Leu Thr Thr Glu Asn His His Phe 
                325                 330                 335     


His Ala Gly Val Gly Ser Ile Ile Val His Asn 
            340                 345         


<210>  180
<211>  257
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from unknown phycodnavirus KBvp-2


<220>
<221>  misc_feature
<222>  (164)..(164)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (168)..(168)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (211)..(211)
<223>  Xaa can be any naturally occurring amino acid

<400>  180

Ser Val Lys Gly Asp Thr Pro Leu Leu Leu Lys Thr Glu His Gly Val 
1               5                   10                  15      


Phe Phe Gln Ser Ile Asp Glu Leu Phe Lys Ile Ser Lys Ser Ile Glu 
            20                  25                  30          


Thr Gly Leu Arg Leu Ala Lys Glu Tyr Ala Asn Ile Glu Asn His Asn 
        35                  40                  45              


Ile Tyr Val Trp Ser Asp Val Gly Phe Thr Lys Ile Arg Arg Val Met 
    50                  55                  60                  


Arg His Tyr Thr Thr Lys Gly Met Phe Arg Val Thr Thr Lys Thr Gly 
65                  70                  75                  80  


Tyr Val Asp Val Thr Glu Asp His Ser Leu Leu Leu Glu Asn Gly Phe 
                85                  90                  95      


Glu Val Arg Pro Ser Asp Thr Thr Val Gly Thr Arg Leu Leu His Lys 
            100                 105                 110         


Lys Pro Thr Ile Asn Lys Tyr Ile Gln Lys Ser Phe Ser Ser Glu His 
        115                 120                 125             


Leu Ser Glu Ala Lys Met Met Gly Glu Ser Phe Leu Asp Glu Glu Thr 
    130                 135                 140                 


Ile Pro Ser Phe Val Leu Asn Ser Pro Val Asn Val Leu Arg Lys Tyr 
145                 150                 155                 160 


Phe Glu Gly Xaa Ile Lys Ala Xaa Gly Val Ile Lys Asn Asp Thr Asn 
                165                 170                 175     


Ile Glu Phe His Phe Ser Lys Ser Lys Gln Gly Val Ala Glu Phe Val 
            180                 185                 190         


Phe Val Ala Gln Gln Leu Gly Tyr His Val Phe Ile Lys Pro Tyr Gly 
        195                 200                 205             


Val His Xaa Ser Leu Asp Pro Gln Asp Leu Lys Ile Gln Glu Ile Thr 
    210                 215                 220                 


Ala Ile Glu Tyr Met Gly Lys Thr Lys Asp Tyr Val Tyr Asp Leu Glu 
225                 230                 235                 240 


Thr Asp Asn His His Phe His Val Gly Pro Gly Asn Leu Ile Val His 
                245                 250                 255     


Asn 
    


<210>  181
<211>  232
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Heterosigma akashiwo virus HaV01


<220>
<221>  misc_feature
<222>  (16)..(16)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (109)..(109)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (142)..(142)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (171)..(171)
<223>  Xaa can be any naturally occurring amino acid

<400>  181

Ser Val Thr Lys Glu Thr Pro Leu Met Leu Arg Thr Met Glu Thr Xaa 
1               5                   10                  15      


Gly Asn His Lys His Glu Val Ile Ser Ile Glu Asn Val Phe Thr Asp 
            20                  25                  30          


Asn Met Arg Ser Ile Asp Met Tyr Ser Ile Ile Gly Glu Lys Glu His 
        35                  40                  45              


Val Met Leu Ser Arg Asn Glu Glu Ile Trp Thr Gly Glu Asn Trp Ser 
    50                  55                  60                  


Arg Ile Ile Arg Val Ile Arg His Lys Thr Gln Lys Lys Ile Tyr Gly 
65                  70                  75                  80  


Val Leu Thr Glu Asn Gly Tyr Val Glu Val Thr Glu Asp His Ser Leu 
                85                  90                  95      


Ile Ser Ser Asp Tyr Glu Leu Leu Lys Pro Lys Asn Xaa Ile Val Lys 
            100                 105                 110         


Glu Thr Gln Leu Leu Gln Ser Phe Pro Asp Ile Val Glu Asn Ser Thr 
        115                 120                 125             


Ile Glu Asn Asn Met Ile Asp Ile Pro Lys Gly Gln Pro Xaa Arg Leu 
    130                 135                 140                 


Thr Val Phe Gly Gln Val Ser Ala Met Ile Ile Tyr Thr Tyr Leu Lys 
145                 150                 155                 160 


Arg Lys Asn Tyr Ser Ile Thr Leu Asn Val Xaa Asn Val Asn Ser Asn 
                165                 170                 175     


Lys Phe Tyr Ile Ser Phe Met Glu Arg Pro Arg Phe Lys Asn Thr Lys 
            180                 185                 190         


Lys Asn Ile Ile Lys Lys Ile Phe Phe Ile Arg Asn Thr Asp Asn Glu 
        195                 200                 205             


Glu Tyr Val Tyr Asp Val Glu Thr Glu Asp Gly Ile Phe His Ala Gly 
    210                 215                 220                 


Ile Gly Glu Ile Ile Val Lys Asn 
225                 230         


<210>  182
<211>  386
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Archaeon GW2011_AR16


<220>
<221>  misc_feature
<222>  (204)..(204)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (239)..(239)
<223>  Xaa can be any naturally occurring amino acid

<400>  182

Ser Val Gly Lys Asp Thr Glu Ile Val Met Asn Glu Asn Gly Thr Ile 
1               5                   10                  15      


Arg Phe Val Lys Ile Ser Glu Leu Phe Glu Arg Thr Gln Lys Arg Thr 
            20                  25                  30          


Ser Asp Gly Lys Glu Tyr Phe Phe Pro Pro Ser Arg Leu Val Leu Thr 
        35                  40                  45              


Leu Asp Ala Gln Gly Lys Ser Val Phe Lys Lys Val Lys Tyr Val Met 
    50                  55                  60                  


Lys His Arg Val Gln Lys Lys Met Tyr Arg Ile Phe Phe Thr Asn Asp 
65                  70                  75                  80  


His Tyr Ile Asp Val Thr Glu Asp His Ser Leu Ile Gly Tyr Val Asn 
                85                  90                  95      


Lys Gln Lys Asn Asn Gln Leu Ala Asp Leu Asp Arg Leu Ile Glu Val 
            100                 105                 110         


Lys Pro Thr Asp Ile Gly Lys Arg Val Arg Thr Ile Ile Thr Ile Lys 
        115                 120                 125             


Asn Ile Pro Arg Ser Ser Ile Lys Thr Arg Asn Tyr His Arg Glu Leu 
    130                 135                 140                 


Tyr Glu Phe Met Gly Leu Phe Ile Gly Asp Gly Ser Phe Asp Arg Gln 
145                 150                 155                 160 


Lys Lys Gln Asn Tyr Tyr Leu His Leu Ala Gly Gly Leu Asp Ser Trp 
                165                 170                 175     


Glu Ile Ile Thr Lys Val Leu Val Pro Leu Lys Glu Lys Glu Tyr Ile 
            180                 185                 190         


Lys Asn Tyr Trp Leu Lys Lys Lys Gly Asp Ile Xaa Ile Asn Gly Leu 
        195                 200                 205             


Arg Leu Val Arg Leu Phe Asn Asp Glu Phe Arg Lys Glu Ser Lys Lys 
    210                 215                 220                 


Ser Ile Pro Ala Phe Leu Leu Arg Glu Lys Gln Glu Ala Ile Xaa Ser 
225                 230                 235                 240 


Phe Leu Arg Gly Leu Phe Ser Ala Asp Gly Ser Val Leu Phe Arg Asn 
                245                 250                 255     


Lys Lys Pro Ile Ile Lys Phe Thr Asn Thr Asn Thr Glu Ile Ile Lys 
            260                 265                 270         


Met Thr Ser Arg Leu Leu His Leu Val Gly Ile Ser His Ser Thr Phe 
        275                 280                 285             


Ser Glu Thr Arg Lys Asn Arg Tyr Lys Gly Lys Glu Ser Glu Thr Ile 
    290                 295                 300                 


Ser Lys His Ile Tyr Ile Lys Asp Ala Leu Ser Phe Arg Glu Lys Val 
305                 310                 315                 320 


Gly Phe Val Ile Asn Arg Lys Gln Glu Arg Leu Ser Leu Val Ser Lys 
                325                 330                 335     


Asn Ser Thr His Arg Arg Thr Ile Lys Asn Tyr Asp Phe Asp Leu Ser 
            340                 345                 350         


Lys Val Ile Lys Ile Glu Pro Ile Glu Tyr Arg Gly Asp Val Tyr Asp 
        355                 360                 365             


Leu Glu Ile Glu Asp Thr His Arg Phe Phe Ala Asn Asn Val Leu Val 
    370                 375                 380                 


His Asn 
385     


<210>  183
<211>  184
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Methanomethylicus oleusabulum 
       isolate V3


<220>
<221>  misc_feature
<222>  (38)..(38)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (67)..(67)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (84)..(84)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (93)..(93)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (143)..(143)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (160)..(160)
<223>  Xaa can be any naturally occurring amino acid

<400>  183

Ser Val Ala Gly Asp Ser Ile Val Ala Val Asp Asp Gly Lys Gly Arg 
1               5                   10                  15      


Ser Glu Val Arg Ile Asp Gly Leu Phe Lys Gly Thr Ser Ala Lys Ala 
            20                  25                  30          


Gly Glu Lys Glu Tyr Xaa Arg Pro Arg Ser Leu Arg Thr Ile Ser Leu 
        35                  40                  45              


Gly Pro Asp Gly Arg Val Thr Trp Ser Arg Ile Asn Ala Ile Met Arg 
    50                  55                  60                  


His Arg Xaa Gly Lys Arg Met Phe Arg Val Trp Leu Ser Asp Ser Trp 
65                  70                  75                  80  


His Val Asp Xaa Thr Glu Asp His Ser Leu Ile Gly Xaa Leu Val Asp 
                85                  90                  95      


Gly Asp Gly Lys Val Asp Gly Glu Leu Pro Ala Ala Gly Leu Arg Leu 
            100                 105                 110         


Val Asn Leu Lys Pro Thr Glu Ile Gly Lys Val Ala Asn Arg Leu Val 
        115                 120                 125             


Ala Leu Asp Ala Gln Pro His Ser Asn Gly Gly Ser Ala Gly Xaa Gly 
    130                 135                 140                 


Ala Gly Ile Arg Thr Val Thr Pro Val Arg Val Glu Glu Ile Pro Xaa 
145                 150                 155                 160 


Asp Gly Tyr Val Tyr Asp Met Glu Val Asp Gly Thr His Arg Phe Phe 
                165                 170                 175     


Ala Asn Arg Val Leu Val His Asn 
            180                 


<210>  184
<211>  184
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Verstraetearchaeota species 
       UBA156


<220>
<221>  misc_feature
<222>  (38)..(38)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (67)..(67)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (84)..(84)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (93)..(93)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (143)..(143)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (160)..(160)
<223>  Xaa can be any naturally occurring amino acid

<400>  184

Ser Val Ala Gly Asp Ser Ile Val Ala Val Asp Asp Gly Lys Gly Arg 
1               5                   10                  15      


Ser Glu Val Arg Ile Asp Gly Leu Phe Lys Gly Thr Ser Ala Lys Ala 
            20                  25                  30          


Gly Glu Lys Glu Tyr Xaa Arg Pro Arg Ser Leu Arg Thr Ile Ser Leu 
        35                  40                  45              


Gly Pro Asp Gly Arg Val Thr Trp Ser Arg Ile Asn Ala Ile Met Arg 
    50                  55                  60                  


His Arg Xaa Gly Lys Arg Met Phe Arg Val Trp Leu Ser Asp Ser Trp 
65                  70                  75                  80  


His Val Asp Xaa Thr Glu Asp His Ser Leu Ile Gly Xaa Leu Val Asp 
                85                  90                  95      


Gly Asp Gly Lys Val Asp Gly Glu Leu Pro Ala Ala Gly Leu Arg Leu 
            100                 105                 110         


Val Asn Leu Lys Pro Thr Glu Ile Gly Lys Val Ala Asn Arg Leu Val 
        115                 120                 125             


Ala Leu Asp Ala Gln Pro His Ser Asn Gly Gly Ser Ala Gly Xaa Gly 
    130                 135                 140                 


Ala Gly Ile Arg Thr Val Thr Pro Val Arg Val Glu Glu Ile Pro Xaa 
145                 150                 155                 160 


Asp Gly Tyr Val Tyr Asp Met Glu Val Asp Gly Thr His Arg Phe Phe 
                165                 170                 175     


Ala Asn Arg Val Leu Val His Asn 
            180                 


<210>  185
<211>  179
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Verstraetearchaeota species UBA76


<220>
<221>  misc_feature
<222>  (38)..(38)
<223>  Xaa can be any naturally occurring amino acid

<400>  185

Ser Val Ala Gly Ser Ser Val Val Ser Val Asp Ala Gly Gly Lys Lys 
1               5                   10                  15      


Ser Asp Val Pro Val Glu Ser Leu Phe Gly Arg Pro Asp Gln Ser Val 
            20                  25                  30          


Gly Gly Lys Glu Tyr Xaa Tyr Pro Ala Ser Leu Arg Ala Leu Ser Leu 
        35                  40                  45              


Asp Pro Gly Gly Arg Ala Val Tyr Ser Arg Val Asn Ala Ile Met Arg 
    50                  55                  60                  


His Arg Ser Gly Lys Lys Met Phe Arg Val Arg Leu Ala Asp Ser Trp 
65                  70                  75                  80  


His Ile Asp Val Thr Glu Asp His Ser Leu Ile Gly Tyr Arg Asp Gly 
                85                  90                  95      


Gly Gly Ser Gly Ser Ser Gly Ile Asp Gly His Leu Val Asp Val Arg 
            100                 105                 110         


Pro Ala Glu Ile Gly Lys Ala Val Lys Arg Leu Val Val Leu Lys Lys 
        115                 120                 125             


Gly Pro Leu Val Ala Gly Arg Arg Ala Pro Ser Ala Asp Phe Asp Thr 
    130                 135                 140                 


Ala Ala Pro Leu Arg Ile Glu Glu Val Pro Tyr Asp Gly Tyr Val Tyr 
145                 150                 155                 160 


Asp Leu Glu Ile Glu Gly Thr Arg Arg Phe Phe Ala Asn Gly Val Leu 
                165                 170                 175     


Val His Asn 
            


<210>  186
<211>  179
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Methanomethylicus mesodigestum 
       isolate V2


<220>
<221>  misc_feature
<222>  (38)..(38)
<223>  Xaa can be any naturally occurring amino acid

<400>  186

Ser Val Ala Gly Ser Ser Val Val Ser Val Asp Ala Gly Gly Lys Lys 
1               5                   10                  15      


Ser Asp Val Pro Val Glu Ser Leu Phe Gly Arg Pro Asp Gln Ser Val 
            20                  25                  30          


Gly Gly Lys Glu Tyr Xaa Tyr Pro Ala Ser Leu Arg Ala Leu Ser Leu 
        35                  40                  45              


Asp Pro Gly Gly Arg Ala Val Tyr Ser Arg Val Asn Ala Ile Met Arg 
    50                  55                  60                  


His Arg Ser Gly Lys Lys Met Phe Arg Val Arg Leu Ala Asp Ser Trp 
65                  70                  75                  80  


His Ile Asp Val Thr Glu Asp His Ser Leu Ile Gly Tyr Arg Asp Gly 
                85                  90                  95      


Gly Gly Ser Gly Ser Ser Gly Ile Asp Gly His Leu Val Asp Val Arg 
            100                 105                 110         


Pro Ala Glu Ile Gly Lys Ala Val Lys Arg Leu Val Val Leu Lys Lys 
        115                 120                 125             


Gly Pro Leu Val Ala Gly Arg Arg Ala Pro Ser Ala Asp Phe Asp Thr 
    130                 135                 140                 


Ala Ala Pro Leu Arg Ile Glu Glu Val Pro Tyr Asp Gly Tyr Val Tyr 
145                 150                 155                 160 


Asp Leu Glu Ile Glu Gly Thr Arg Arg Phe Phe Ala Asn Gly Val Leu 
                165                 170                 175     


Val His Asn 
            


<210>  187
<211>  351
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Mimivirus terra2


<220>
<221>  misc_feature
<222>  (88)..(88)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (110)..(110)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (147)..(147)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (173)..(173)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (259)..(259)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (262)..(262)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (272)..(272)
<223>  Xaa can be any naturally occurring amino acid

<400>  187

Ser Val Thr Gly Asp Thr Pro Ile Ile Thr Arg His Gln Asn Gly Asp 
1               5                   10                  15      


Ile Asn Ile Thr Thr Ile Glu Glu Leu Gly Ser Lys Trp Lys Pro Tyr 
            20                  25                  30          


Glu Ile Phe Lys Ala His Glu Lys Asn Ser Asn Arg Lys Phe Lys Gln 
        35                  40                  45              


Gln Ser Gln Tyr Pro Thr Asp Ser Glu Val Trp Thr Ala Lys Gly Trp 
    50                  55                  60                  


Ala Lys Ile Lys Arg Val Ile Arg His Lys Thr Val Lys Lys Ile Tyr 
65                  70                  75                  80  


Arg Val Leu Thr His Thr Gly Xaa Ile Asp Val Thr Glu Asp His Ser 
                85                  90                  95      


Leu Leu Asp Pro Asn Gln Asn Ile Ile Lys Pro Ile Asn Xaa Gln Ile 
            100                 105                 110         


Gly Thr Glu Leu Leu His Gly Phe Pro Glu Ser Asn Asn Val Tyr Asp 
        115                 120                 125             


Asn Ile Ser Glu Gln Glu Ala Tyr Val Trp Gly Phe Phe Met Gly Asp 
    130                 135                 140                 


Gly Ser Xaa Gly Ser Tyr Gln Thr Lys Asn Gly Ile Lys Tyr Ser Trp 
145                 150                 155                 160 


Ala Leu Asn Asn Gln Asp Leu Asp Val Leu Asn Lys Xaa Lys Lys Tyr 
                165                 170                 175     


Leu Glu Glu Thr Glu Asn Ile Gln Phe Lys Ile Leu Asp Thr Met Lys 
            180                 185                 190         


Ser Ser Ser Val Tyr Lys Leu Val Pro Ile Gly Lys Ile Lys Tyr Met 
        195                 200                 205             


Val Asn Lys Tyr Arg Lys Ile Phe Tyr Asp Asn Lys Lys Tyr Lys Leu 
    210                 215                 220                 


Val Pro Lys Glu Ile Leu Asn Ser Thr Lys Asp Ile Lys Asn Ser Phe 
225                 230                 235                 240 


Leu Glu Gly Tyr Tyr Ala Ala Asp Gly Ser Arg Lys Glu Thr Glu Asn 
                245                 250                 255     


Met Gly Xaa Arg Arg Xaa Asp Val Lys Gly Lys Ile Ser Ala Gln Xaa 
            260                 265                 270         


Leu Phe Tyr Leu Leu Lys Ser Leu Gly Tyr Asn Val Ser Ile Asn Ile 
        275                 280                 285             


Arg Ser Asp Lys Asn Gln Ile Tyr Arg Leu Thr Phe Ser Asn Lys Lys 
    290                 295                 300                 


Gln Arg Lys Asn Pro Ile Ala Ile Lys Lys Ile Gln Leu Met Asn Glu 
305                 310                 315                 320 


Thr Ser Asn Asp His Asp Gly Asp Tyr Val Tyr Asp Leu Glu Thr Glu 
                325                 330                 335     


Ser Gly Ser Phe His Ala Gly Val Gly Glu Met Ile Val Lys Asn 
            340                 345                 350     


<210>  188
<211>  86
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein (partial) from activated-carbon 
       dual-media filters Ann Arbor (MI, USA) drinking water treatment 
       plant metagenome


<220>
<221>  misc_feature
<222>  (20)..(20)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (25)..(25)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (60)..(60)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (76)..(76)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (86)..(86)
<223>  Xaa can be any naturally occurring amino acid

<400>  188

Ser Val Ala Ala Asp Thr Pro Ile Leu Val Lys Arg Asn Asp Gln Ile 
1               5                   10                  15      


Glu Trp Ile Xaa Ile Arg Asp Leu Xaa Gln His Glu Gln Asp Pro Asp 
            20                  25                  30          


Lys Lys Thr Glu Leu Asn Thr Glu His Phe Asn Tyr Glu Val Trp Ser 
        35                  40                  45              


Asp Ile Gly Trp Thr Lys Ile Lys Arg Leu Ile Xaa His Lys Thr Thr 
    50                  55                  60                  


Lys Gln Met Tyr Arg Val Leu Thr His Thr Gly Xaa Val Asp Val Thr 
65                  70                  75                  80  


Glu Asp His Ser Leu Xaa 
                85      


<210>  189
<211>  282
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Megavirus LBA111


<220>
<221>  misc_feature
<222>  (86)..(86)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (108)..(108)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (145)..(145)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (181)..(181)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (213)..(213)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (217)..(217)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (234)..(234)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (244)..(244)
<223>  Xaa can be any naturally occurring amino acid

<400>  189

Ser Val Thr Gly Asp Thr Pro Ile Met Ile Lys Asp Lys Asn Asn Asn 
1               5                   10                  15      


Ile Asn Ile Val Thr Ile Lys Glu Leu Gly Glu Lys Trp Lys Pro Tyr 
            20                  25                  30          


Asp Ile Phe Lys Ser His Glu Ile Asn Ser Asn Arg Lys Tyr Lys Gln 
        35                  40                  45              


Gln Ala Asp Phe Asn Gly Glu Val Trp Thr Ser Asn Gly Trp Ala Lys 
    50                  55                  60                  


Ile Lys Arg Val Ile Arg His Lys Thr Val Lys Lys Leu Tyr Arg Val 
65                  70                  75                  80  


Leu Thr Asn Thr Gly Xaa Ile Asp Val Thr Glu Asp His Ser Leu Leu 
                85                  90                  95      


Asp Thr Asn Lys Asn Ile Ile Lys Pro Ile Asp Xaa Lys Ile Gly Thr 
            100                 105                 110         


Glu Leu Leu His Gly Phe Pro Glu Ile Asn Asn Asn His Asn Lys Leu 
        115                 120                 125             


Ser Leu Glu Ile Tyr Lys Glu Leu Glu Ile Thr Arg Met Leu Phe Asp 
    130                 135                 140                 


Xaa Met Ile Glu Ser Asn Lys Lys Trp Asn Asn Glu Lys Met Lys Ala 
145                 150                 155                 160 


Tyr Phe Ile Gly Ser Glu Tyr Arg Lys Gln Asn Lys Asn Ile Ser Asn 
                165                 170                 175     


Glu Ile Leu Asn Xaa Ser Lys Lys Ile Lys Lys Tyr Phe Leu Leu Gly 
            180                 185                 190         


Tyr Leu Gly Asn Asp Lys Glu Tyr Ile Thr Asn Asn Lys Ile Asn Ala 
        195                 200                 205             


Gln Ile Ile Tyr Xaa Leu Met Lys Xaa Leu Glu Tyr Asn Ile Val Ile 
    210                 215                 220                 


Asp Leu Ile Glu Ser Ser Tyr Lys Leu Xaa Ile Ile Asp Asn Ile Asn 
225                 230                 235                 240 


Glu Pro Tyr Xaa Ile Asn Lys Ile Ile Gln Leu Gln Asp Thr Ser Ile 
                245                 250                 255     


Asn Gly Glu Tyr Val Tyr Asp Leu Glu Thr Glu Ser Gly Thr Phe His 
            260                 265                 270         


Ala Gly Ile Gly Glu Leu Ile Val Lys Asn 
        275                 280         


<210>  190
<211>  348
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Picocystis salinarum CCMP1897


<220>
<221>  misc_feature
<222>  (17)..(17)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (73)..(73)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (148)..(148)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (153)..(153)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (217)..(217)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (251)..(251)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (260)..(260)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (322)..(322)
<223>  Xaa can be any naturally occurring amino acid

<400>  190

Ser Val Met Pro Tyr Thr Pro Val Leu Val Arg Asn Lys Arg Thr Gln 
1               5                   10                  15      


Xaa Ile Ser Ala Val Ala Ile Lys His Leu Ala Gln Asp Trp Met Pro 
            20                  25                  30          


Tyr Glu Ala Phe His Glu Gly Asp Pro Arg Arg Phe Glu Lys Glu Gln 
        35                  40                  45              


Gly Asn Ala Ala His Met Gln Ala Trp Thr Asp Gln Gly Trp Ala Asp 
    50                  55                  60                  


Val Leu Arg Val Val Arg His Lys Xaa Ser Lys Lys Ile Tyr Arg Val 
65                  70                  75                  80  


Val Thr His Thr Gly Leu Val Asp Val Thr Glu Asp His Ser Leu Leu 
                85                  90                  95      


Leu Pro Asp Arg Arg Lys Val Lys Pro His Gln Leu Glu Val Gly Thr 
            100                 105                 110         


Ala Leu Leu His Ser Phe Pro Gly Leu Glu Leu Trp Pro Asp Arg Leu 
        115                 120                 125             


Glu Ala Gly Thr Pro Glu Gln Ala Tyr Met Tyr Gly Val Phe Val Gly 
    130                 135                 140                 


Asn Gly Ser Xaa Ala Lys Tyr Asp Xaa Pro Ser Gly Arg Lys Tyr Tyr 
145                 150                 155                 160 


Trp Ala Ile Lys Asn Ser Asp Ile Thr Leu Ile Asn Lys Trp Lys Thr 
                165                 170                 175     


Val Leu Glu Met Ile His Lys Arg Pro Phe Lys Ile Val Asp Thr Leu 
            180                 185                 190         


Lys His Ser Gly Asp Tyr Lys Leu Val Pro Thr Asp Ser Ser Lys Asp 
        195                 200                 205             


Leu Val Ile Leu Tyr Trp Gln Ser Xaa Tyr Asp Asn Thr Gly Ala Lys 
    210                 215                 220                 


Val Val Pro Tyr Glu Ile Leu Asn Gly Gln Ile Asp His Val Asp Ala 
225                 230                 235                 240 


Phe Ile Glu Gly Leu Ser Val Ala Asp Gly Xaa Arg Arg Asp Leu Asp 
                245                 250                 255     


Thr Thr Gly Xaa Arg Arg Ile Asp Thr Lys Ser Gln Ile Ser Ala Gln 
            260                 265                 270         


His Tyr Tyr Val Leu Leu Lys Arg Leu Gly Tyr Arg Val Ser Ile Asn 
        275                 280                 285             


Ala Arg Asp Asp Lys Thr Asn Met Phe Arg Leu Thr Trp Thr Met Gly 
    290                 295                 300                 


Arg Gln Arg Arg Glu Thr Thr Ile Ile Lys Lys Gln His Met Leu His 
305                 310                 315                 320 


Glu Xaa Tyr Asp Gly Phe Val Tyr Asp Leu Glu Thr Ser Gln Gly Val 
                325                 330                 335     


Phe Gln Ala Gly Val Gly Glu Leu Ile Val Lys Asn 
            340                 345             


<210>  191
<211>  144
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified possibly N-part of a contiguous intein (partial) from 
       bovine gut rumen metagenome


<220>
<221>  misc_feature
<222>  (49)..(49)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (144)..(144)
<223>  Xaa can be any naturally occurring amino acid

<400>  191

Ser Phe Thr Gly Asp Thr Pro Leu Phe Ile Lys Tyr Asp Asp Gly Asn 
1               5                   10                  15      


Ile Asp Ile Lys Pro Ile Glu Glu Leu Ile Gly Glu Thr Glu Thr Asp 
            20                  25                  30          


Ala Leu Gly Arg Glu Tyr Asp Tyr Ser Glu Lys Pro Tyr Ser Val Leu 
        35                  40                  45              


Xaa Arg Ser Gly Trp Val Lys Pro Lys Tyr Ile Tyr Arg His Lys Thr 
    50                  55                  60                  


Asn Lys Gln Leu Tyr Thr Val Ser Glu Gly Asp Met Ser Ile Thr Val 
65                  70                  75                  80  


Thr Glu Asp His Ser Leu Phe Asn Asn Glu Lys Lys Lys Ile Lys Pro 
                85                  90                  95      


Ser Gln Ile Asn Ser Thr Thr Lys Leu Glu Tyr Tyr Thr Lys Asp Ile 
            100                 105                 110         


Lys Thr Ser Ser Asp Phe Lys Trp Leu Thr Lys Gln Arg Ala Lys Thr 
        115                 120                 125             


Met Ala Lys Met Ile Ile Asp Gly Thr Ile Asp Arg Val Ser Ile Xaa 
    130                 135                 140                 


<210>  192
<211>  355
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified continuous intein from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (23)..(23)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (47)..(47)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (82)..(82)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (142)..(142)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (249)..(250)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (261)..(261)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (288)..(288)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (325)..(325)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (353)..(353)
<223>  Xaa can be any naturally occurring amino acid

<400>  192

Ser Val Thr Tyr Asp Thr Pro Ile Leu Val Arg Gly Glu Asp Lys Arg 
1               5                   10                  15      


Ile Asp Ile Val Pro Ile Xaa Asp Ile Phe Asn Asn Asn Glu Ala Ile 
            20                  25                  30          


Glu Phe Gly Glu Glu Gln Tyr Arg Asp Phe Ser His Lys Asn Xaa Glu 
        35                  40                  45              


Val Leu Thr Arg Asn Gly Trp Lys Pro Ile Glu Tyr Val Tyr Lys His 
    50                  55                  60                  


Lys Thr Asn Lys Thr Leu Lys Arg Val Glu Thr Lys Asn Gly Leu Ile 
65                  70                  75                  80  


Asp Xaa Thr Glu Asp His Ser Leu Phe Asp Asn Lys Arg Asn Glu Val 
                85                  90                  95      


Lys Pro Ser Thr Leu Asn Arg Gly Asp Lys Ile Glu Ile Tyr Thr Lys 
            100                 105                 110         


Asp Ile Asp Tyr Tyr Ala Ser Ser Thr Val Thr Asp Arg Glu Ala Trp 
        115                 120                 125             


Leu Phe Gly Phe Phe Met Ala Asp Gly Ser Ser Val Tyr Xaa Asp Arg 
    130                 135                 140                 


Thr Gln Lys Tyr Tyr Ser Lys Arg Lys Gly Glu Trp Val Ile His Asn 
145                 150                 155                 160 


Gly Lys Arg Ala Asn Trp Lys Ile Ser Asn Lys Ser Ile Asp Arg Leu 
                165                 170                 175     


Asn Lys Ala Lys Glu Ile Leu Glu Asp Ser Phe Phe Leu Lys Ala Ser 
            180                 185                 190         


Ile Lys Asp His Arg Thr Ser Ser Asn Val Tyr Asp Leu Val Val Glu 
        195                 200                 205             


Asn Ala Glu Asn Ala Lys Phe Phe Ser Asn Asn Phe Tyr Thr Ser Tyr 
    210                 215                 220                 


Arg Tyr Lys Lys Val Pro Glu Phe Ile Leu Asn Ala Lys Lys Glu Val 
225                 230                 235                 240 


Lys Lys Ala Phe Leu Asp Gly Phe Xaa Xaa Gly Asp Gly Gln Asn Asp 
                245                 250                 255     


Thr Ile Asp Glu Xaa Ile Glu Phe Gly Gln Lys Ser Lys Val Ala Met 
            260                 265                 270         


Ala Gly Leu Tyr Leu Leu Met Lys Glu Leu Gly Tyr Asn Phe Arg Xaa 
        275                 280                 285             


His Asn Arg Asn Asp Lys Gln Glu Phe Ile Ser Phe Arg Leu Arg Asn 
    290                 295                 300                 


His Arg Gly Asn Leu Leu Asn Glu Asn Tyr Ser Glu Arg Lys Glu Asp 
305                 310                 315                 320 


Glu Val Trp Asn Xaa Gly Asn Ile Thr Ser Lys Ser Glu Tyr Val Tyr 
                325                 330                 335     


Asp Ile Ser Ala Asp Gly Thr Phe Val Asn Ala Leu Gly Met Ile Val 
            340                 345                 350         


Xaa His Asn 
        355 


<210>  193
<211>  348
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from bovine gut rumen metagenome


<220>
<221>  misc_feature
<222>  (23)..(23)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (82)..(82)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (142)..(142)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (181)..(181)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (242)..(243)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (254)..(254)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (281)..(281)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (318)..(318)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (346)..(346)
<223>  Xaa can be any naturally occurring amino acid

<400>  193

Ser Val Thr Tyr Asp Thr Pro Ile Leu Val Arg Gly Glu Asp Lys Arg 
1               5                   10                  15      


Ile Asn Ile Ile Pro Ile Xaa Asp Ile Phe Asn Asn Asn Glu Ala Ile 
            20                  25                  30          


Glu Phe Gly Glu Glu Gln Tyr Arg Asp Phe Ser Arg Lys Asn Tyr Glu 
        35                  40                  45              


Val Leu Thr Arg Asn Gly Trp Lys Pro Ile Glu Tyr Val Tyr Lys His 
    50                  55                  60                  


Lys Thr Thr Lys Gln Leu Lys Arg Val Glu Thr Lys Asn Gly Val Ile 
65                  70                  75                  80  


Asp Xaa Thr Glu Asp His Ser Leu Phe Asp Asn Asn Gly Asn Glu Val 
                85                  90                  95      


Lys Pro Ser Thr Leu Val Arg Gly Asp Lys Ile Glu Ile Tyr Asn Asn 
            100                 105                 110         


Asp Ile Asp Tyr Phe Ala Ser Ser Thr Val Thr Asp Arg Glu Ala Trp 
        115                 120                 125             


Leu Phe Gly Phe Phe Met Ala Asp Gly Ser Ser Val Tyr Xaa Asp Arg 
    130                 135                 140                 


Thr Gln Lys Tyr Phe Ser Lys Arg Met Gly Lys Arg Ala Asn Trp Lys 
145                 150                 155                 160 


Ile Ser Asn Lys Ser Leu Asp Arg Leu Asn Lys Ala Lys Glu Ile Met 
                165                 170                 175     


Glu Asn Ser Phe Xaa Leu Lys Ala Ser Ile Lys Asp His Arg Ala Ser 
            180                 185                 190         


Ser Asn Val Tyr Asn Leu Val Val Glu Asn Ala Glu Asn Ala Lys Phe 
        195                 200                 205             


Phe Ser Asn Asn Phe Tyr Thr Ser Tyr Arg Tyr Lys Lys Val Pro Glu 
    210                 215                 220                 


Phe Ile Leu Asn Ala Ser Lys Glu Val Lys Lys Ala Phe Leu Asp Gly 
225                 230                 235                 240 


Phe Xaa Xaa Gly Asp Gly Gln Asn Asp Thr Ile Asp Glu Xaa Ile Glu 
                245                 250                 255     


Phe Gly Gln Lys Ser Lys Val Ala Met Ala Gly Leu Tyr Phe Ile Met 
            260                 265                 270         


Lys Glu Leu Gly Tyr Asn Phe Arg Xaa His Asn Arg Asn Asp Lys Pro 
        275                 280                 285             


Glu Phe Ile Ser Phe Arg Leu Arg Asn His His Gly Asn Leu Leu Asn 
    290                 295                 300                 


Glu Asn Tyr Ser Glu Arg Lys Glu Asp Glu Ala Trp Leu Xaa Asn Asp 
305                 310                 315                 320 


Ile Thr Ser Lys Ser Glu Tyr Val Tyr Asp Ile Ser Ala Asp Gly Thr 
                325                 330                 335     


Phe Val Asn Ala Leu Gly Met Ile Val Xaa His Asn 
            340                 345             


<210>  194
<211>  355
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (23)..(23)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (82)..(82)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (142)..(142)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (249)..(250)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (261)..(261)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (288)..(288)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (325)..(325)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (353)..(353)
<223>  Xaa can be any naturally occurring amino acid

<400>  194

Ser Val Thr Tyr Asp Thr Pro Ile Ile Val Arg Gly Glu Asp Lys Arg 
1               5                   10                  15      


Ile Asp Ile Ile Pro Ile Xaa Asp Ile Phe Asn Asn Asp Glu Ala Val 
            20                  25                  30          


Glu Phe Asp Asn Glu Gln Tyr Arg Asp Phe Ser Arg Lys Asn Tyr Asp 
        35                  40                  45              


Val Leu Thr Arg Asp Gly Trp Lys Pro Ile Glu Tyr Ile Tyr Lys His 
    50                  55                  60                  


Lys Thr Asn Lys Gln Leu Lys Arg Val Glu Thr Lys Asn Gly Leu Val 
65                  70                  75                  80  


Asp Xaa Thr Glu Asp His Ser Leu Phe Asp Asn Asn Gly Asn Glu Val 
                85                  90                  95      


Lys Pro Ser Thr Leu Thr Arg Gly Asn Lys Ile Glu Ile Tyr Thr Lys 
            100                 105                 110         


Asp Ile Asp Tyr Phe Ala Ser Ser Thr Val Thr Asp Arg Glu Ala Trp 
        115                 120                 125             


Leu Phe Gly Phe Phe Met Ala Asp Gly Ser Ser Val Tyr Xaa Asp Arg 
    130                 135                 140                 


Thr Gln Lys Tyr Tyr Ser Lys Arg Lys Gly Glu Trp Val Ile His Asn 
145                 150                 155                 160 


Gly Lys Arg Ala Asn Trp Lys Ile Ser Asn Lys Ser Leu Asp Arg Leu 
                165                 170                 175     


Asn Lys Ala Lys Glu Ile Leu Glu Asp Ser Phe Phe Leu Lys Ala Ser 
            180                 185                 190         


Ile Lys Asp His Arg Ala Ser Ser Asn Val Tyr Asn Leu Val Val Glu 
        195                 200                 205             


Asn Thr Asp Asn Ala Lys Phe Phe Ser Asn Asn Phe Tyr Thr Ser Tyr 
    210                 215                 220                 


Arg Tyr Lys Lys Val Pro Glu Phe Ile Leu Asn Ala Arg Lys Glu Val 
225                 230                 235                 240 


Lys Lys Ala Phe Leu Asp Gly Phe Xaa Xaa Gly Asp Gly Gln Asn Asp 
                245                 250                 255     


Thr Ile Asp Glu Xaa Ile Glu Phe Gly Gln Lys Ser Lys Val Ala Met 
            260                 265                 270         


Ala Gly Leu Tyr Phe Leu Met Lys Glu Leu Gly Tyr Asn Phe Arg Xaa 
        275                 280                 285             


His Asn Arg Asn Asp Lys Gln Glu Phe Ile Ser Phe Arg Leu Arg Asn 
    290                 295                 300                 


His Arg Gly Ser Leu Leu Asn Glu Asn Tyr Ser Glu Lys Lys Glu Asp 
305                 310                 315                 320 


Glu Val Trp Asn Xaa Glu Asp Ile Thr Ser Lys Ser Glu Tyr Val Tyr 
                325                 330                 335     


Asp Ile Ser Ala Asp Gly Thr Phe Val Asn Ala Leu Gly Met Ile Val 
            340                 345                 350         


Xaa His Asn 
        355 


<210>  195
<211>  356
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Lake Huron low oxygen high sulfur
       sink hole purple microbial mat bin unclassified.01


<220>
<221>  misc_feature
<222>  (23)..(23)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (80)..(81)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (209)..(209)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (216)..(216)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (267)..(267)
<223>  Xaa can be any naturally occurring amino acid

<400>  195

Ser Val Glu Tyr Asp Thr Pro Ile Tyr Leu Lys Asp Lys Tyr Asp Asn 
1               5                   10                  15      


Leu Asn Ile Leu Pro Ile Xaa Asp Leu Phe Asp Asp Asn Ser Asn Phe 
            20                  25                  30          


Leu Ser Pro Asp Gly Leu Arg Asp Phe Ser Glu Lys Asp Phe Met Val 
        35                  40                  45              


Leu Thr Lys Asn Gly Trp Lys Asn Ile Asn Tyr Val Tyr Lys His Glu 
    50                  55                  60                  


Thr Asn Lys Pro Ile His Lys Ile Val Thr Lys Asp Arg Leu Val Xaa 
65                  70                  75                  80  


Xaa Thr Ser Asp His Ser Val Phe Gln Asn Gly Glu Gln Ile Lys Pro 
                85                  90                  95      


Thr Glu Leu Lys Arg Gly Asp Lys Ile Asp Ile Ile Asp Ile Pro Ile 
            100                 105                 110         


Leu Lys Ser Leu Asn Val Ile Thr Pro Asn Gln Ala Lys Leu Ile Gly 
        115                 120                 125             


Phe Phe Ile Gly Asp Gly Ser Ser Ser Tyr Lys Lys Lys Pro Tyr Lys 
    130                 135                 140                 


Tyr Asn Ser Val Lys Asn Gly Glu Lys Thr Tyr Gln Val Met Ser Gly 
145                 150                 155                 160 


Asn Phe Ser Leu Asn Asn Ser Arg Ile Glu Leu Leu Glu Glu Phe Lys 
                165                 170                 175     


Leu Ile Met Lys Asn Glu Tyr Asn Val Asp Thr Gln Ile Asn Asn Thr 
            180                 185                 190         


Met Lys Ser Ser Ser Val Tyr Lys Leu Gln Thr Ser Asn Ala Glu Ile 
        195                 200                 205             


Xaa Lys Trp Phe Ser Lys Asn Xaa Tyr Thr Ser Tyr Arg Gln Lys Met 
    210                 215                 220                 


Ile Pro Tyr Glu Ile Leu Asn Gly Ser Lys Glu Ile Met Lys Ser Phe 
225                 230                 235                 240 


Met Asp Gly Phe Tyr Leu Ala Asp Gly Trp Gly Asp Asn Phe Asp Gln 
                245                 250                 255     


Pro Leu Asp Ile Thr Gln Lys Ser Lys Val Xaa Val Ala Gly Leu Thr 
            260                 265                 270         


His Ile Leu Lys Thr Leu Asp Val Asn Tyr Arg Ile Leu Ile Arg Thr 
        275                 280                 285             


Asp Lys Pro Asn Ile Gln Ser Leu Thr Leu Gly Ser Phe Asn Asn Lys 
    290                 295                 300                 


Ile Lys Tyr His Pro Leu Asn Asp Glu Lys Ser Lys Arg Lys Thr Asn 
305                 310                 315                 320 


Glu Val Trp Asn Asn Val Ile Tyr Glu Asn Lys Gln Gln Tyr Val Tyr 
                325                 330                 335     


Asp Ile Ser Thr Glu Asp Gly Thr Phe Val Gly Gly Ile Gly Gly Val 
            340                 345                 350         


Leu Leu Lys Asn 
        355     


<210>  196
<211>  285
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Bedford Basin environmental 
       sampling


<220>
<221>  misc_feature
<222>  (78)..(78)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (114)..(114)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (136)..(136)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (170)..(170)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (196)..(196)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (203)..(203)
<223>  Xaa can be any naturally occurring amino acid

<400>  196

Ser Val Thr Gly Asp Thr Pro Leu Leu Ile Arg Met Arg Asp Gly Ser 
1               5                   10                  15      


Ile His Thr Lys Arg Ile Asp Glu Leu Thr Asn Glu Tyr His Ala Ser 
            20                  25                  30          


Asp Gly Gly Lys Glu Ser Phe Pro Gly Asn Tyr Glu Val Trp Thr Glu 
        35                  40                  45              


Lys Gly Phe Thr Pro Val Glu Arg Val Ile Arg His Lys Thr Met Lys 
    50                  55                  60                  


Lys Met Tyr Arg Val Leu Thr His Thr Gly Val Val Asp Xaa Thr Glu 
65                  70                  75                  80  


Asp His Ser Leu Leu Asp Gln Ala Ala Thr Met Val Lys Pro Thr Asp 
                85                  90                  95      


Ile Thr Ile Gly Thr Lys Leu Leu His Gly Asn Thr Leu Asp Ala Phe 
            100                 105                 110         


Asp Xaa Ile Asp Met Thr Val Ser Ile Asn Glu Ala Lys Val Met Gly 
        115                 120                 125             


Phe Phe Phe Gly Asp Gly Ser Xaa Gly Gln Tyr Gly Asn Lys Phe Thr 
    130                 135                 140                 


Trp Ala Leu Asn Asn Ser Asn Pro Asp Tyr Arg Ser Leu Phe Tyr Asn 
145                 150                 155                 160 


Asp Ala Lys Glu Lys Ile Val Pro Ser Xaa Ile Leu Asn Ala Pro Ile 
                165                 170                 175     


Glu Ile Val Gln Ser Phe Ile Asn Gly Tyr Tyr Met Ala Asp Gly Asp 
            180                 185                 190         


Lys Asp Ala Xaa Gly Tyr Thr Arg Met Asp Xaa Lys Ser Lys Gln Gly 
        195                 200                 205             


Thr Met Gly Leu Gln Leu Leu Gly Arg Arg Leu Gly Tyr Ser Val Ser 
    210                 215                 220                 


Leu Asn Thr Arg Ser Asp Lys Leu Asn Val Phe Arg Gln Thr Trp Thr 
225                 230                 235                 240 


Lys Ser Thr Gln Arg Lys Glu Pro Asn Ala Ile Lys Lys Val Leu Tyr 
                245                 250                 255     


Leu Gly Glu Thr Glu Gln Tyr Val Tyr Asp Leu Thr Thr Glu Ser His 
            260                 265                 270         


His Phe His Val Gly Pro Gly Glu Leu Ile Val His Asn 
        275                 280                 285 


<210>  197
<211>  370
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Chrysochromulina ericina virus 
       01B


<220>
<221>  misc_feature
<222>  (20)..(20)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (42)..(42)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (58)..(58)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (112)..(112)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (142)..(142)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (147)..(147)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (172)..(172)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (196)..(196)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (271)..(271)
<223>  Xaa can be any naturally occurring amino acid

<400>  197

Ser Val Ala Ser Tyr Thr Pro Ile Tyr Val Arg Tyr Asn Lys Ser Ile 
1               5                   10                  15      


Ile Asp Ile Xaa Ser Val Glu Glu Leu Ala Glu Lys Tyr Gly Asn Gly 
            20                  25                  30          


Trp His Leu Glu Ser Pro Lys Glu Tyr Xaa Glu Leu Asn Asn Ile Glu 
        35                  40                  45              


Ser Trp Thr Glu Asn Gly Trp Thr Glu Xaa His Arg Val Ile Arg His 
    50                  55                  60                  


Arg Leu Ala Pro Tyr Lys Lys Met Val Arg Ile Leu Thr His Thr Gly 
65                  70                  75                  80  


Leu Val Asp Val Thr Asp Asp His Ser Leu Val Lys Asn Thr Gly Glu 
                85                  90                  95      


Glu Ile Ser Pro Lys Asp Val Ser Ile Gly Thr Lys Leu Leu His Xaa 
            100                 105                 110         


Thr Met Ser Glu Asn Glu Ser Asn Ile Glu Ser Asp Ile Ser Ile Asp 
        115                 120                 125             


Glu Ala Arg Ile Met Gly Phe Phe Phe Gly Asp Gly Ser Xaa Gly Ile 
    130                 135                 140                 


Tyr Asp Xaa Pro Ser Gly His Lys Ala Ser Trp Ala Leu Asn Asn Ser 
145                 150                 155                 160 


Asn Lys Glu Leu Ile Glu Lys Tyr Tyr Asn Leu Xaa Lys Ser Val Tyr 
                165                 170                 175     


Pro Glu Phe Glu Trp Lys Val Tyr Asp Thr Leu Asn Ser Ser Gly Val 
            180                 185                 190         


Tyr Lys Ile Xaa Phe Asn Lys Lys Ser Gly Ser Lys Ser Lys Ile Gln 
        195                 200                 205             


Phe Ile Glu Lys Tyr Arg Ser Met Leu Tyr Asn Lys Lys Ser Lys Ile 
    210                 215                 220                 


Ile Pro Ser Glu Ile Ile Asn Gly Ser Ile Glu Leu Arg Lys Ser Phe 
225                 230                 235                 240 


Trp Glu Gly Leu Tyr Asp Ala Asp Gly Asp Lys Asp Lys Asn Gly Tyr 
                245                 250                 255     


Thr Arg Ile Asp Gln Lys Ser Gln Ile Ser Ala Ala Tyr Ile Xaa Trp 
            260                 265                 270         


Leu Ala Asn Ser Ile Gly Tyr Lys Thr Ser Leu Asn Ile Arg Asp Asp 
        275                 280                 285             


Lys Thr Asp Ile Tyr Arg Ile Thr Ala Thr Lys Asn Lys Gln Arg Arg 
    290                 295                 300                 


Asp Gly Asp Lys Ile Lys Lys Ile Val Asn Ile Gln Asn Ser Ala Asn 
305                 310                 315                 320 


Ile Gln Asn Ser Ala Asn Ile Gln Asn Ser Val Asn Ile Gln Asn Ser 
                325                 330                 335     


Val Asn Ile Gln Asn Ser Lys Asp Asn Gln Asp Tyr Val Tyr Asp Leu 
            340                 345                 350         


Thr Thr Glu Asn His His Phe Ala Ala Gly Ile Gly Asn Met Ile Val 
        355                 360                 365             


His Asn 
    370 


<210>  198
<211>  110
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein (partial) from Hyphochytrium 
       catenoides ATCC 18719


<220>
<221>  misc_feature
<222>  (36)..(36)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (63)..(63)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (110)..(110)
<223>  Xaa can be any naturally occurring amino acid

<400>  198

Ser Val Ala Ser Tyr Thr Pro Ile Met Val Arg Leu Ser Lys Arg Asn 
1               5                   10                  15      


Arg Val Leu Ile Thr Ile Glu Glu Leu Ser Arg Leu Ser Gly Lys Arg 
            20                  25                  30          


Trp Thr Ser Xaa Gly Asp Pro Gly Arg Asp Asn Lys Glu Phe Ile Asp 
        35                  40                  45              


Leu Asn Asp Val Glu Thr Trp Ser Asp Lys Gly Trp Thr Pro Xaa His 
    50                  55                  60                  


Arg Ile Ile Arg His Gln Leu Ala Pro Asn Lys Lys Met Ile Arg Val 
65                  70                  75                  80  


Val Thr Arg Ser Ser Val Val Asp Val Thr Asp Asp His Ser Leu Leu 
                85                  90                  95      


Arg Pro Asp Gly Thr Met Val Ser Pro Lys Asp Leu Arg Xaa 
            100                 105                 110 


<210>  199
<211>  169
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein (partial) from Hyphochytrium 
       catenoides ATCC 18719


<220>
<221>  misc_feature
<222>  (1)..(1)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (7)..(7)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (14)..(14)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (39)..(39)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (123)..(123)
<223>  Xaa can be any naturally occurring amino acid

<400>  199

Xaa Gly Glu His Leu Leu Xaa His Ala Leu Ala Pro Arg Xaa Ser Phe 
1               5                   10                  15      


Asn Asp Thr Phe Pro Val Thr Ser Glu Glu Ala Arg Gln Met Gly Tyr 
            20                  25                  30          


Phe Phe Gly Asn Gly Thr Xaa Gly Gly Thr Trp Asp Leu Ile Pro Gln 
        35                  40                  45              


Asp Val Leu Asn Gly Ser Arg Ala Val Arg Gln Ala Phe Trp Asp Ala 
    50                  55                  60                  


Met His Gly Ile Asp Thr Gly Asp Gly His Asp His Gly Arg Asn Met 
65                  70                  75                  80  


Leu Gln Ile Asp Gln Lys Ser Gln Leu Ser Ile Ala His Ile Asn Leu 
                85                  90                  95      


Leu Ala Gln Ser Leu Gly Tyr Thr Thr Ser Val Thr Ile Leu Thr Thr 
            100                 105                 110         


Ser Thr Glu Ser Thr Val Tyr Arg Leu Ile Xaa Thr Ala Ala Thr Asn 
        115                 120                 125             


Glu Arg Ser Ile Asn Ser Glu Ile Val Glu Ile Tyr Glu Ile Pro Tyr 
    130                 135                 140                 


Asp Gly Tyr Val Tyr Asp Leu Thr Thr Glu Asn His His Phe Ala Ala 
145                 150                 155                 160 


Gly Ala Gly Asn Ile Ile Val His Asn 
                165                 


<210>  200
<211>  269
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein (partial) from Paramoeba atlantica 
       CCAP 1560/9 (621/1)


<220>
<221>  misc_feature
<222>  (1)..(1)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (69)..(69)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (72)..(72)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (74)..(74)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (171)..(171)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (208)..(208)
<223>  Xaa can be any naturally occurring amino acid

<400>  200

Xaa Ile Tyr Arg Val Thr Ser His Thr Gly Val Val Lys Val Thr Glu 
1               5                   10                  15      


Asp His Ser Leu Leu Tyr His Asn Gly Thr Glu Val Arg Pro Lys Asp 
            20                  25                  30          


Val Phe Ala Gly Glu Asn Leu Leu Thr Ser Thr Leu Pro Ser Ile Asp 
        35                  40                  45              


Gly Thr Val Asp His Ser Asp Ile Ser Trp Val Trp Gly Leu Phe Tyr 
    50                  55                  60                  


Gly Asp Gly Ser Xaa Gly Tyr Xaa Asp Xaa Leu Thr Gly Lys Lys Tyr 
65                  70                  75                  80  


Thr Trp Ala Ile Asn Asn Gln Asn Gly Glu Tyr Leu Ser Lys Ala Lys 
                85                  90                  95      


Glu Ile Leu Gln Gly Tyr Tyr Ser Asp Tyr Gly Phe Lys Ile Leu Glu 
            100                 105                 110         


Thr Met Lys Lys Ser Ser Ser Val Tyr Lys Leu Val Pro Thr Gly Lys 
        115                 120                 125             


Val Ser Glu Ile Val Glu Glu Trp Arg Ser Leu Phe Tyr Asp Pro Glu 
    130                 135                 140                 


Thr Arg His Lys Met Val Pro Asn Val Leu Trp Ser Thr Thr Leu Lys 
145                 150                 155                 160 


Thr Arg Glu Glu Phe Phe Glu Gly Tyr Tyr Xaa Ala Gly Gly Glu Lys 
                165                 170                 175     


Gly Glu Asn Gly Pro Thr Arg Phe Asp Asn Lys Gly Asp Ile Gly Ser 
            180                 185                 190         


Ala Gly Leu Phe Tyr Leu Gly Ile Ser Leu Gly Tyr Lys Ala Ser Xaa 
        195                 200                 205             


Asn Thr Arg Lys Asp Asn Leu Asp Ile Thr Arg Ile Thr Leu Thr Lys 
    210                 215                 220                 


Ser Tyr Gln Arg Lys Lys Pro Gly Gly Ile Ile Lys Lys Ile Glu Asp 
225                 230                 235                 240 


Phe Gly Gln Thr Gly Asp Tyr Val Tyr Asp Leu Glu Thr Glu Asn His 
                245                 250                 255     


His Phe Ser Ala Gly Ile Gly Glu Leu Val Val His Asn 
            260                 265                 


<210>  201
<211>  437
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Haloferax lucentense DSM 14919


<220>
<221>  misc_feature
<222>  (395)..(395)
<223>  Xaa can be any naturally occurring amino acid

<400>  201

Ser Val Thr Gly Asp Arg Pro Val Val Val Arg Asp Pro Gly Gly Thr 
1               5                   10                  15      


Val Arg Ile Leu Pro Ile Glu Asp Leu Phe Ala Arg Gly Thr Thr Glu 
            20                  25                  30          


Ser Glu Val Leu Ile Ala Ala Asp Gly Asp Val Val Ala Ser Ala Thr 
        35                  40                  45              


Pro Gly Lys Thr Arg Arg Ala Leu Asp Gly Trp Asp Ala Leu Ser Val 
    50                  55                  60                  


Asn Glu Ala Gly Glu Ala Glu Trp Gln Pro Ile Ala Gln Ala Ile Arg 
65                  70                  75                  80  


His Lys Thr Asp Lys Pro Val Val Asn Leu Gln His Lys Phe Gly Glu 
                85                  90                  95      


Ser Thr Thr Thr Arg Asp His Ser Tyr Val Val Pro Gly Glu Asp Gly 
            100                 105                 110         


Leu Thr Thr Val Ser Pro Asp Asp Val Ala Glu Pro Tyr Arg Val Ser 
        115                 120                 125             


Gly Val Pro Asp Val Glu Pro Val Glu Gln Val Asp Val Tyr Glu Val 
    130                 135                 140                 


Leu Arg Gly Tyr Glu Arg Glu Tyr Glu Asp Gly Arg Ser Val Gly Ser 
145                 150                 155                 160 


Asp Asn Ser Ile Thr Lys Arg Lys Gln Ile His Ala Asp Asp Glu Tyr 
                165                 170                 175     


Val Trp Phe Gly His Glu His His Arg Asp Val Asp Ser Thr Val Lys 
            180                 185                 190         


Val Lys Arg Phe Val Asp Ile Asp Ser Glu Asp Gly Ala Ala Leu Ile 
        195                 200                 205             


Arg Leu Leu Gly Ala Tyr Val Ser Glu Gly Ser Ala Ser Thr Gly Glu 
    210                 215                 220                 


Thr Ala Thr Ser Lys Phe Gly Ala Ser Ile Ala Glu Ser Asp Arg Glu 
225                 230                 235                 240 


Trp Leu Ala Gln Leu Gln Arg Asp Tyr Ser Arg Leu Phe Glu Asn Thr 
                245                 250                 255     


Thr Ala Gly Ile Ile Thr Ser Asp Arg Arg Ala Glu Arg Thr Val Glu 
            260                 265                 270         


Tyr Gln Thr Asp Thr Gly Gly Ala Ser Val Thr Tyr Asn Asp Glu Thr 
        275                 280                 285             


Leu Lys Leu Gln Met Met Asn Glu Leu Ala Ala Val Phe Phe Arg Glu 
    290                 295                 300                 


Phe Ala Gly Gln Thr Ser Arg Gly Lys Arg Ile Pro Ser Phe Val Phe 
305                 310                 315                 320 


His Leu Pro Glu Glu Lys Gln Asp Leu Phe Leu Thr Leu Leu Val Glu 
                325                 330                 335     


Gly Asp Gly Ser Arg Glu Phe Pro Arg Tyr Thr Glu Ala Tyr Ala Gln 
            340                 345                 350         


Arg Asn Phe Asp Phe Glu Thr Thr Ser Arg Glu Leu Ala Ala Gly Leu 
        355                 360                 365             


Ser Met Leu Leu Thr Gln Arg Gly Gln Lys His Ser Leu Lys Tyr Arg 
    370                 375                 380                 


Asp Ser Lys Asp Ser Tyr Thr Ile Arg Thr Xaa Ser Thr Tyr Arg Glu 
385                 390                 395                 400 


Gly Arg Asp Pro Val Leu Thr Glu Val Asp His Asp Gly Tyr Val Tyr 
                405                 410                 415     


Asp Leu Ser Val Glu Glu Asn Glu Asn Phe Val Asp Gly Val Gly Gly 
            420                 425                 430         


Ile Val Leu His Asn 
        435         


<210>  202
<211>  437
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Haloferax species Q22


<220>
<221>  misc_feature
<222>  (395)..(395)
<223>  Xaa can be any naturally occurring amino acid

<400>  202

Ser Val Thr Gly Asp Arg Pro Val Val Val Arg Asp Pro Gly Gly Thr 
1               5                   10                  15      


Val Arg Ile Leu Pro Ile Glu Asp Leu Phe Ala Arg Gly Thr Thr Glu 
            20                  25                  30          


Ser Glu Val Leu Ile Ala Ala Asp Gly Asp Val Val Ala Ser Ala Thr 
        35                  40                  45              


Pro Gly Lys Thr Arg Arg Ala Leu Asp Gly Trp Asp Ala Leu Ser Val 
    50                  55                  60                  


Asn Glu Asp Gly Glu Ala Glu Trp Gln Pro Ile Ala Gln Ala Ile Arg 
65                  70                  75                  80  


His Lys Thr Asp Lys Pro Val Val Asn Leu Gln His Lys Phe Gly Glu 
                85                  90                  95      


Ser Thr Thr Thr Arg Asp His Ser Tyr Val Val Pro Gly Glu Asp Gly 
            100                 105                 110         


Leu Thr Thr Val Ser Pro Asp Asp Val Ala Glu Pro Tyr Arg Val Ser 
        115                 120                 125             


Gly Val Pro Asp Val Glu Pro Val Glu Gln Val Asp Val Tyr Glu Val 
    130                 135                 140                 


Leu Arg Gly Tyr Glu Arg Glu Tyr Glu Asp Gly Arg Ser Val Gly Ser 
145                 150                 155                 160 


Asp Asn Ser Ile Thr Lys Arg Lys Gln Ile His Ala Asp Asp Glu Tyr 
                165                 170                 175     


Val Trp Phe Gly His Glu His His Arg Asp Val Asp Ser Thr Val Lys 
            180                 185                 190         


Val Lys Arg Phe Val Asp Ile Asp Ser Glu Asp Gly Ala Ala Leu Ile 
        195                 200                 205             


Arg Leu Leu Gly Ala Tyr Val Pro Glu Gly Ser Ala Ser Thr Gly Glu 
    210                 215                 220                 


Thr Val Thr Ser Lys Phe Gly Ala Ser Ile Ala Glu Ser Asp Arg Glu 
225                 230                 235                 240 


Trp Leu Ala Gln Leu Gln Arg Asp Tyr Ser Arg Leu Phe Glu Asn Thr 
                245                 250                 255     


Thr Ala Gly Ile Ile Thr Ser Asp Arg Arg Ala Glu Arg Thr Val Glu 
            260                 265                 270         


Tyr Gln Thr Asp Thr Gly Gly Ala Ser Val Thr Tyr Asn Asp Glu Thr 
        275                 280                 285             


Leu Lys Leu Gln Met Met Asn Glu Leu Ala Ala Val Phe Phe Arg Glu 
    290                 295                 300                 


Phe Ala Gly Gln Thr Ser Arg Gly Lys Arg Ile Pro Ser Phe Val Phe 
305                 310                 315                 320 


His Leu Pro Glu Glu Lys Gln Asp Leu Phe Leu Thr Leu Leu Val Glu 
                325                 330                 335     


Gly Asp Gly Ser Arg Glu Phe Pro Arg Tyr Thr Glu Ala Tyr Ala Gln 
            340                 345                 350         


Arg Asn Phe Asp Phe Glu Thr Thr Ser Arg Glu Leu Ala Ala Gly Leu 
        355                 360                 365             


Ser Met Leu Leu Thr Gln Arg Gly Gln Lys His Ser Leu Lys Tyr Arg 
    370                 375                 380                 


Asp Ser Lys Asp Ser Tyr Thr Ile Arg Thr Xaa Ser Ser Tyr Arg Glu 
385                 390                 395                 400 


Gly Arg Asp Pro Val Leu Thr Glu Val Asp His Asp Gly Tyr Val Tyr 
                405                 410                 415     


Asp Leu Ser Val Glu Glu Asn Glu Asn Phe Val Asp Gly Val Gly Gly 
            420                 425                 430         


Ile Val Leu His Asn 
        435         


<210>  203
<211>  437
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Haloferax massiliensis ArcH


<220>
<221>  misc_feature
<222>  (395)..(395)
<223>  Xaa can be any naturally occurring amino acid

<400>  203

Ser Val Thr Gly Asp Arg Pro Val Val Ala Arg Asp Pro Gly Gly Thr 
1               5                   10                  15      


Val Arg Ile Leu Pro Ile Glu Asp Leu Phe Ala Arg Gly Thr Thr Glu 
            20                  25                  30          


Ser Glu Val Leu Ile Ala Ala Asp Gly Asp Val Val Ala Ser Ala Thr 
        35                  40                  45              


Pro Gly Lys Thr Arg Arg Ala Leu Asp Gly Trp Asp Ala Leu Ser Val 
    50                  55                  60                  


Asn Glu Asp Gly Glu Ala Glu Trp Gln Pro Ile Ala Gln Ala Ile Arg 
65                  70                  75                  80  


His Lys Thr Asp Lys Pro Val Val Asn Leu Gln His Lys Phe Gly Glu 
                85                  90                  95      


Ser Thr Thr Thr Arg Asp His Ser Tyr Val Val Pro Gly Glu Asp Gly 
            100                 105                 110         


Leu Thr Thr Val Ser Pro Asp Asp Val Ala Glu Pro Tyr Arg Val Ser 
        115                 120                 125             


Gly Val Pro Asp Val Glu Pro Val Glu Arg Val Asp Val Tyr Glu Val 
    130                 135                 140                 


Leu Arg Gly Tyr Glu Arg Glu Tyr Glu Asp Gly Arg Ser Val Gly Ser 
145                 150                 155                 160 


Asp Asn Ser Ile Thr Lys Arg Lys Gln Ile His Ala Asp Asp Glu Tyr 
                165                 170                 175     


Val Trp Phe Gly His Glu His His Arg Asp Val Asp Ser Thr Val Lys 
            180                 185                 190         


Val Lys Arg Phe Val Asp Ile Asp Ser Glu Asp Gly Ala Ala Leu Ile 
        195                 200                 205             


Arg Leu Leu Gly Ala Tyr Val Pro Glu Gly Ser Ala Ser Thr Gly Glu 
    210                 215                 220                 


Thr Ala Thr Ser Lys Phe Gly Ala Ser Ile Ala Glu Ser Asp Arg Glu 
225                 230                 235                 240 


Trp Leu Ala Gln Leu Gln Arg Asp Tyr Ser Arg Leu Phe Glu Asn Thr 
                245                 250                 255     


Thr Ala Gly Ile Ile Thr Ser Asp Arg Arg Ala Glu Arg Thr Ala Glu 
            260                 265                 270         


Tyr Gln Thr Asp Thr Gly Gly Ala Ser Val Thr Tyr Asn Asp Glu Thr 
        275                 280                 285             


Leu Lys Leu Gln Met Met Asn Glu Leu Ala Ala Val Phe Phe Arg Glu 
    290                 295                 300                 


Phe Ala Gly Gln Thr Ser Arg Gly Lys Arg Ile Pro Ser Phe Val Phe 
305                 310                 315                 320 


His Leu Pro Glu Glu Lys Gln Asp Leu Phe Leu Thr Leu Leu Val Glu 
                325                 330                 335     


Gly Asp Gly Ser Arg Glu Phe Pro Arg Tyr Thr Glu Ala Tyr Ala Gln 
            340                 345                 350         


Arg Asn Phe Asp Phe Glu Thr Thr Ser Arg Glu Leu Ala Ala Gly Leu 
        355                 360                 365             


Ser Met Leu Leu Thr Gln Arg Gly Gln Lys His Ser Leu Lys Tyr Arg 
    370                 375                 380                 


Asn Ser Lys Asp Ser Tyr Thr Ile Arg Thr Xaa Gly Thr Tyr Arg Lys 
385                 390                 395                 400 


Gly Arg Asp Pro Val Leu Thr Glu Val Asp His Asp Gly Tyr Val Tyr 
                405                 410                 415     


Asp Leu Ser Val Glu Glu Asn Glu Asn Phe Val Asp Gly Val Gly Gly 
            420                 425                 430         


Ile Val Leu His Asn 
        435         


<210>  204
<211>  437
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Haloferax gibbonsii ATCC 33959


<220>
<221>  misc_feature
<222>  (395)..(395)
<223>  Xaa can be any naturally occurring amino acid

<400>  204

Ser Val Thr Gly Asp Arg Pro Val Val Val Arg Asp Pro Gly Gly Thr 
1               5                   10                  15      


Val Arg Ile Leu Pro Ile Glu Asp Leu Phe Ala Arg Gly Thr Thr Glu 
            20                  25                  30          


Ser Glu Val Leu Ile Ala Ala Asp Gly Asp Val Val Ala Ser Ala Thr 
        35                  40                  45              


Pro Gly Lys Thr Arg Arg Ala Leu Asp Gly Trp Asp Ala Leu Ser Val 
    50                  55                  60                  


Asn Glu Asp Gly Glu Ala Glu Trp Gln Pro Ile Ala Gln Ala Ile Arg 
65                  70                  75                  80  


His Asn Thr Asp Lys Pro Val Val Asn Leu Gln His Lys Phe Gly Glu 
                85                  90                  95      


Ser Thr Thr Thr Arg Asp His Ser Tyr Val Val Pro Gly Glu Asp Gly 
            100                 105                 110         


Leu Thr Thr Val Ser Pro Asp Asp Val Ala Glu Pro Tyr Arg Val Ser 
        115                 120                 125             


Gly Val Pro Asp Val Glu Pro Val Glu Gln Val Asp Val Tyr Glu Val 
    130                 135                 140                 


Leu Arg Gly Tyr Glu Arg Glu Tyr Glu Asp Gly Arg Ser Val Gly Ser 
145                 150                 155                 160 


Asp Asn Ser Ile Thr Lys Arg Lys Gln Ile His Ala Asp Asp Glu Tyr 
                165                 170                 175     


Val Trp Phe Gly His Glu His His Arg Asp Val Asp Ser Thr Val Lys 
            180                 185                 190         


Val Lys Arg Phe Val Asp Ile Asp Ser Glu Asp Gly Ala Ala Leu Ile 
        195                 200                 205             


Arg Leu Leu Gly Ala Tyr Val Ser Glu Gly Ser Ala Ser Thr Gly Glu 
    210                 215                 220                 


Thr Ala Thr Ser Lys Phe Gly Ala Ser Ile Ala Glu Ser Asp Arg Glu 
225                 230                 235                 240 


Trp Leu Ala Gln Leu Gln Arg Asp Tyr Ser Arg Leu Phe Glu Asn Thr 
                245                 250                 255     


Thr Ala Gly Ile Ile Thr Ser Asp Arg Arg Ala Glu Arg Thr Val Glu 
            260                 265                 270         


Tyr Gln Thr Asp Thr Gly Gly Ala Ser Val Thr Tyr Asn Asp Glu Thr 
        275                 280                 285             


Leu Lys Leu Gln Met Met Asn Glu Leu Ala Ala Val Phe Phe Arg Glu 
    290                 295                 300                 


Phe Ala Gly Gln Thr Ser Arg Gly Lys Arg Ile Pro Ser Phe Val Phe 
305                 310                 315                 320 


His Leu Pro Glu Glu Lys Gln Asp Leu Phe Leu Thr Leu Leu Val Glu 
                325                 330                 335     


Gly Asp Gly Ser Arg Glu Phe Pro Arg Tyr Thr Glu Ala Tyr Ala Gln 
            340                 345                 350         


Arg Asn Phe Asp Phe Glu Thr Thr Ser Arg Glu Leu Ala Ala Gly Leu 
        355                 360                 365             


Ser Met Leu Leu Thr Gln Arg Gly Gln Lys His Ser Leu Lys Tyr Arg 
    370                 375                 380                 


Asp Ser Lys Asp Ser Tyr Thr Ile Arg Thr Xaa Ser Ser Tyr Arg Glu 
385                 390                 395                 400 


Gly Arg Asp Pro Val Leu Thr Glu Val Asp His Asp Gly Tyr Val Tyr 
                405                 410                 415     


Asp Leu Ser Val Glu Glu Asn Glu Asn Phe Val Asp Gly Val Gly Gly 
            420                 425                 430         


Ile Val Leu His Asn 
        435         


<210>  205
<211>  437
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Haloferax gibbonsii ARA6


<220>
<221>  misc_feature
<222>  (395)..(395)
<223>  Xaa can be any naturally occurring amino acid

<400>  205

Ser Val Thr Gly Asp Arg Pro Val Val Val Arg Asp Pro Gly Gly Thr 
1               5                   10                  15      


Val Arg Ile Leu Pro Ile Glu Asp Leu Phe Ala Arg Gly Thr Thr Glu 
            20                  25                  30          


Ser Glu Val Leu Ile Ala Ala Asp Gly Asp Val Val Ala Ser Ala Thr 
        35                  40                  45              


Pro Gly Lys Thr Arg Arg Ala Leu Asp Gly Trp Glu Ala Leu Ser Val 
    50                  55                  60                  


Asn Glu Asp Gly Glu Ala Glu Trp Gln Pro Ile Ala Gln Ala Ile Arg 
65                  70                  75                  80  


His Lys Thr Asp Lys Pro Val Val Asn Leu Gln His Lys Phe Gly Glu 
                85                  90                  95      


Ser Thr Thr Thr Arg Asp His Ser Tyr Val Val Pro Gly Glu Gly Gly 
            100                 105                 110         


Leu Thr Thr Val Ser Pro Asp Asp Val Ala Glu Pro Tyr Arg Val Ser 
        115                 120                 125             


Gly Val Pro Asp Val Glu Pro Val Glu Gln Val Asp Val Tyr Glu Val 
    130                 135                 140                 


Leu Arg Gly Tyr Glu Arg Glu Tyr Glu Asp Gly Arg Ser Val Gly Ser 
145                 150                 155                 160 


Asp Asn Ser Ile Thr Lys Arg Lys Gln Ile His Ala Asp Asp Glu Tyr 
                165                 170                 175     


Val Trp Phe Gly His Glu His His Arg Asp Val Asp Ser Thr Val Lys 
            180                 185                 190         


Val Lys Arg Phe Val Asp Ile Asp Ser Glu Asp Gly Ala Ala Leu Ile 
        195                 200                 205             


Arg Leu Leu Gly Ala Tyr Val Pro Glu Gly Ser Ala Ser Thr Gly Glu 
    210                 215                 220                 


Thr Ala Ala Ser Lys Phe Gly Ala Ser Ile Ala Glu Ser Asp Arg Glu 
225                 230                 235                 240 


Trp Leu Ala Gln Leu Gln Arg Asp Tyr Ser Arg Leu Phe Glu Asn Thr 
                245                 250                 255     


Thr Ala Gly Ile Ile Thr Ser Asp Arg Arg Ala Glu Arg Thr Val Glu 
            260                 265                 270         


Tyr Gln Thr Asp Thr Gly Gly Ala Ser Val Thr Tyr Asn Asp Glu Thr 
        275                 280                 285             


Leu Lys Leu Gln Met Met Asn Glu Leu Ala Ala Val Phe Phe Arg Glu 
    290                 295                 300                 


Phe Ala Gly Gln Thr Ser Arg Gly Lys Arg Ile Pro Ser Phe Val Phe 
305                 310                 315                 320 


His Leu Pro Glu Glu Lys Gln Asp Leu Phe Leu Thr Leu Leu Val Glu 
                325                 330                 335     


Gly Asp Gly Ser Arg Glu Phe Pro Arg Tyr Thr Glu Ala Tyr Ala Gln 
            340                 345                 350         


Arg Asn Phe Asp Phe Glu Thr Thr Ser Arg Glu Leu Ala Ala Gly Leu 
        355                 360                 365             


Ser Met Leu Leu Thr Gln Arg Gly Gln Lys His Ser Leu Lys Tyr Arg 
    370                 375                 380                 


Asp Ser Lys Gly Ser Tyr Thr Ile Arg Thr Xaa Ser Ser Tyr Arg Glu 
385                 390                 395                 400 


Gly Arg Asp Pro Val Leu Thr Glu Val Asp His Asp Gly Tyr Val Tyr 
                405                 410                 415     


Asp Leu Ser Val Glu Glu Asn Glu Asn Phe Val Asp Gly Val Gly Gly 
            420                 425                 430         


Ile Val Leu His Asn 
        435         


<210>  206
<211>  434
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein Halarchaeum acidiphilum MH1-52-1


<220>
<221>  misc_feature
<222>  (61)..(61)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (392)..(392)
<223>  Xaa can be any naturally occurring amino acid

<400>  206

Ser Val Thr Gly Asp Arg Pro Val Val Val Arg Asp Pro Glu Gly Arg 
1               5                   10                  15      


Val Arg Ile Thr Pro Ile Ala Glu Leu Phe Glu Arg Ala Ala Arg Ser 
            20                  25                  30          


Glu Asn Val Leu Val Thr Ala Asp Gly Gly Pro Val Thr Ser Ala Ser 
        35                  40                  45              


Val Gly Lys Asp Arg Arg Thr Leu Asp Gly Trp Asp Xaa Leu Ser Leu 
    50                  55                  60                  


Asn Asp Asp Gly Glu Thr Glu Trp Lys Pro Ile Glu Gln Ala Ile Arg 
65                  70                  75                  80  


His Glu Thr Asp Glu Pro Val Val Lys Leu Gln His Glu Phe Gly Glu 
                85                  90                  95      


Ser Thr Thr Thr Arg Asp His Ser Tyr Val Val Asp Gly Ala Asp Gly 
            100                 105                 110         


Leu Glu Glu Ala Val Pro Ala Asp Val Asp Glu Pro Leu Arg Val Pro 
        115                 120                 125             


Asp Met Pro Asp Ala Gly Thr Val Thr Glu Ile Asp Val Tyr Glu Val 
    130                 135                 140                 


Leu Arg Gly Tyr Glu Arg Glu Tyr Glu Asp Gly Arg Gly Thr Gly Gly 
145                 150                 155                 160 


Ser Thr Val Lys Thr Lys Arg Val Tyr Ala Asp Asp Glu Ser Val Trp 
                165                 170                 175     


Phe Gly His Glu His Tyr Gly Asp Leu Asp Ser Thr Val Thr Val Gln 
            180                 185                 190         


Arg His Ile Asp Leu Ala Ser Glu Asp Gly Ala Ala Leu Val Arg Leu 
        195                 200                 205             


Leu Gly Ala Tyr Val Pro Glu Gly Ser Ala Ser Thr Val Glu Thr Ala 
    210                 215                 220                 


Asp Gly Lys Phe Gly Ala Ser Ile Ala Glu Ser Arg Arg Glu Trp Ile 
225                 230                 235                 240 


Glu Gln Leu Glu Asp Asp Tyr His Arg Leu Phe Glu Asn Ala Glu Ala 
                245                 250                 255     


Ser Ile Ile Ala Ser Asp Ser Arg Asp Glu Arg Ala Leu Glu Tyr Glu 
            260                 265                 270         


Thr Glu Ser Gly Ala Glu Ser Ala Thr Tyr Asp Asp Arg Thr Leu Lys 
        275                 280                 285             


Leu Gln Met Met Asn Glu Leu Ser Ala Val Phe Phe Arg Glu Phe Ala 
    290                 295                 300                 


Gly Gln Thr Ser His Arg Thr Arg Ile Pro Ser Phe Val Tyr His Leu 
305                 310                 315                 320 


Asp Asp Asp Leu Gln Ala Leu Phe Leu Asp Val Leu Val Glu Gly Asp 
                325                 330                 335     


Gly Ser Arg Glu Phe Pro Tyr Ser Glu Gly Tyr Ala Ala Arg Asn Phe 
            340                 345                 350         


Asp Phe Glu Thr Thr Ser Arg Glu Leu Ala Ala Gly Leu Ser Met Leu 
        355                 360                 365             


Leu Thr Gln Arg Gly Lys Lys His Ser Leu Lys Tyr Arg Asp Gly Lys 
    370                 375                 380                 


Gly Ser Tyr Thr Val Arg Thr Xaa Asp Ser Tyr Arg Gly Gly Arg Asp 
385                 390                 395                 400 


Pro Val Leu Thr Thr Val Glu His Asp Gly Tyr Val Tyr Asp Leu Ser 
                405                 410                 415     


Val Ala Asp Asn Glu Asn Phe Val Asp Ala Leu Gly Gly Ile Val Leu 
            420                 425                 430         


His Asn 
        


<210>  207
<211>  420
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Micrarchaeota species CG1_02_55_2


<220>
<221>  misc_feature
<222>  (7)..(7)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (13)..(13)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (36)..(36)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (105)..(105)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (190)..(190)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (241)..(241)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (271)..(271)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (288)..(288)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (325)..(325)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (374)..(374)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (378)..(378)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (413)..(413)
<223>  Xaa can be any naturally occurring amino acid

<400>  207

Ser Ile Thr Asp Glu Arg Xaa Ile Ala Tyr Leu Asp Xaa Asn Gly Ile 
1               5                   10                  15      


Leu Arg Leu Ala Pro Ile Ala Glu Ile Phe Asp Arg Tyr Gly Lys Thr 
            20                  25                  30          


Lys Thr Ala Xaa Gly Asp Lys Glu Val Ile Tyr Ala Pro Gly Ile Arg 
        35                  40                  45              


Ala Leu Ser Val Asp Pro Lys Thr Met Glu Pro Ile Trp Arg Pro Val 
    50                  55                  60                  


Thr Glu Ile Ile Arg His Arg Asn Thr Lys Arg Val Tyr Arg Val Arg 
65                  70                  75                  80  


Gln Lys Thr Gly Glu Thr Arg Val Thr Glu Asp His Ser Ile Met Ile 
                85                  90                  95      


Asp Lys Arg Gly Phe Leu Glu Glu Xaa Lys Pro Ala Asp Ile Gly Lys 
            100                 105                 110         


Lys Arg Leu Ala Tyr Leu Arg Lys Ile Pro Ala Val Lys Glu Ile Thr 
        115                 120                 125             


Glu Ile Asn Leu Thr Asp Trp Leu Gly Glu Tyr Glu Asn Lys Val Arg 
    130                 135                 140                 


Tyr Lys Gly Arg Ile Lys Thr Arg Ala Ile Lys Val Ala Asp Asp Gly 
145                 150                 155                 160 


Ser Leu Thr Phe Ser Trp Thr Ala Gln Lys Arg Gln Ile Lys Val Gln 
                165                 170                 175     


Arg Arg Tyr Pro Val Glu Ser Pro Arg Phe Glu Ala Leu Xaa Arg Leu 
            180                 185                 190         


Leu Gly Ala Tyr Ile Ala Glu Gly Ser Ala Ser Thr Pro Glu Thr Thr 
        195                 200                 205             


Gly Thr Arg Met Gly Ala Ser Ile Ala Ser Gly Asn Arg Glu Trp Leu 
    210                 215                 220                 


Glu Ser Leu Lys Met Asp Tyr Glu Ser Leu Phe Thr Asn Ala Arg Ala 
225                 230                 235                 240 


Xaa Val Ile Arg Ser Asn Ile Lys Thr Arg His Leu Asp Tyr Val Asn 
                245                 250                 255     


Leu Asp Gly Met Ala His Ser Thr Val Tyr Asp Asp Ala Thr Xaa Lys 
            260                 265                 270         


Leu Gln Met Met Asn Ala Val Ser Ala Met Val Phe Lys Gln Leu Xaa 
        275                 280                 285             


Gly Gln Lys Ser Tyr Gly Lys His Leu Pro Glu Phe Ile Tyr His Val 
    290                 295                 300                 


Pro Arg Lys Tyr Lys Leu Leu Met Leu Glu Lys Met Val Glu Gly Asp 
305                 310                 315                 320 


Gly Ser Arg Ala Xaa Gly Pro Arg Tyr Thr Arg Glu Tyr Arg Asp Arg 
                325                 330                 335     


Asn Phe Lys Tyr His Thr Ser Ser Leu Arg Leu Ala Ser Gly Leu Ser 
            340                 345                 350         


Leu Leu Leu Asn Gln Leu Gly Ile Asn His Ser Ile Arg Tyr Tyr Ser 
        355                 360                 365             


Lys Arg Lys Ser Tyr Xaa Val Thr Thr Xaa Ser Ile Thr Asn Asp Arg 
    370                 375                 380                 


Leu Gly Thr Asp Val Val Glu Glu Pro Tyr Gln Gly Phe Val Tyr Asp 
385                 390                 395                 400 


Leu Ser Val Glu Gly Ser Arg Met Phe Thr Asp Ala Xaa Gly Ser Val 
                405                 410                 415     


Val Leu His Asn 
            420 


<210>  208
<211>  419
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Micrarchaeota archaeon UBA95


<220>
<221>  misc_feature
<222>  (36)..(36)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (49)..(49)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (156)..(156)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (189)..(189)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (285)..(285)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (364)..(364)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (412)..(412)
<223>  Xaa can be any naturally occurring amino acid

<400>  208

Ser Val Thr Ser Glu Arg Phe Leu Val Leu Leu Asp Asp Lys Glu Leu 
1               5                   10                  15      


Val His Val Lys Asn Val Glu Glu Leu Phe Glu Glu Asn Ala Lys His 
            20                  25                  30          


Leu Ile Glu Xaa Gly Glu Lys Gln Val Ile Pro Leu Thr Gly Trp Arg 
        35                  40                  45              


Xaa Leu Ser Val Asn Pro Ala Ser Lys Lys Thr Glu Trp Lys Lys Val 
    50                  55                  60                  


Thr Glu Leu Ile Arg His Lys Thr Asn Lys Arg Val Tyr Arg Val Asn 
65                  70                  75                  80  


Gln Lys Phe Gly Glu Thr Arg Val Thr Glu Asp His Ser Leu Met Ala 
                85                  90                  95      


Asp Thr Pro Asn Gly Leu Val Glu Val Lys Pro Val Asn Ala Lys Lys 
            100                 105                 110         


His Arg Leu Ala Gln Ala Glu Val Leu Lys Ala Lys Gly Gly Val Glu 
        115                 120                 125             


Lys Ile Asp Val Tyr Glu Val Leu Lys Asp Tyr Ser Glu Lys Thr Val 
    130                 135                 140                 


Tyr Lys Gly Phe Gly Lys Ile Lys Thr Ile Lys Xaa Asn Ser Glu Arg 
145                 150                 155                 160 


Val Trp Phe Gly Trp Thr Asn Gln Lys Asn Pro Val Lys Val Lys Arg 
                165                 170                 175     


Phe Ile Gly Ile Glu Thr Lys Glu Phe Glu Ser Leu Xaa Arg Leu Leu 
            180                 185                 190         


Gly Ala Tyr Ala Ala Glu Gly Ser Ser Ser Thr Ile Glu Thr Thr Arg 
        195                 200                 205             


Ser Arg Tyr Gly Ala Ser Ile Ala Gly Lys Arg Lys Trp Leu Glu Gly 
    210                 215                 220                 


Leu Gln Lys Asp Tyr Leu Ala Leu Phe Thr Ala Lys Ala Gly Val Ile 
225                 230                 235                 240 


Pro Ser Gln Lys Lys Thr Arg His Leu Thr Tyr Arg Thr Gln Lys Gly 
                245                 250                 255     


Val Lys Lys Thr Val Val Tyr Lys Asp Asp Thr His Lys Leu Gln Met 
            260                 265                 270         


Met Asn Ser Leu Ser Ala Val Phe Phe Lys Met Phe Xaa Gly Gln Lys 
        275                 280                 285             


Ser Ala Gly Lys Lys Leu Pro Asp Phe Ile Tyr Asn Val Pro Lys Lys 
    290                 295                 300                 


Tyr Gln Leu Ile Phe Leu Lys Lys Leu Leu Glu Gly Asp Gly Ser Arg 
305                 310                 315                 320 


Ser Val Asn Glu Arg Leu Gly Tyr Ser Ala Glu Tyr Lys Lys Lys Asn 
                325                 330                 335     


Phe Lys Tyr Thr Thr Ile Ser Ala Gly Leu Ala Ser Gly Leu Ser Val 
            340                 345                 350         


Leu Leu Arg Gln Leu Glu Leu Asn His Ser Ile Xaa Tyr Arg Pro Ser 
        355                 360                 365             


Lys Lys Ala Tyr Thr Leu Ser Thr Ser Gly Lys Tyr Asn Lys Arg Ile 
    370                 375                 380                 


Gln Thr Lys Val Ala Arg Glu Glu Tyr Ser Gly Trp Val Tyr Asp Leu 
385                 390                 395                 400 


Ser Val Glu Asp Asn His Ala Phe Thr Asp Ala Xaa Gly Gln Ile Val 
                405                 410                 415     


Leu His Asn 
            


<210>  209
<211>  437
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Haloarchaeon species J07HX64


<220>
<221>  misc_feature
<222>  (171)..(171)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (395)..(395)
<223>  Xaa can be any naturally occurring amino acid

<400>  209

Ser Val Thr Gly Glu Arg Pro Val Val Val Arg Asp Pro Asp Gly Ile 
1               5                   10                  15      


Val Arg Ile Tyr Pro Ile Glu Arg Leu Tyr Gln Arg Ala Thr Gln Ser 
            20                  25                  30          


Pro Ala Glu Asp Thr Val Ile Thr Val Asp Gly Met Pro Ile Thr Gly 
        35                  40                  45              


Ile Glu Ser Ser Lys Glu Tyr Ala Thr Leu Glu Gly Trp Asp Ala Leu 
    50                  55                  60                  


Ser Val Asp Asp Asp Gly Gln Ser Glu Trp Ser Pro Ile Glu Gln Val 
65                  70                  75                  80  


Val Arg His Glu Thr Asp Lys Pro Val Ile Lys Leu Gln His Lys Phe 
                85                  90                  95      


Gly Glu Ser Val Thr Thr Arg Asp His Ser Tyr Val Val Glu Ser Gly 
            100                 105                 110         


Gly Glu Leu Val Glu Ala Pro Pro Asn Glu Val Glu Ser Pro Leu Arg 
        115                 120                 125             


Ile Pro Asp Met Pro Glu Thr Asp Glu Ile Glu Thr Ile Asp Val Tyr 
    130                 135                 140                 


Glu Val Leu Asn Gly Tyr Thr Arg Arg Tyr Glu Asp Gly Arg Gly Ser 
145                 150                 155                 160 


Gly Gly Val Thr Thr Lys Thr Lys Arg Val Xaa Ala Asp Asp Glu Ala 
                165                 170                 175     


Val Trp Phe Gly His Glu His His Arg Gly Leu Asp Lys Thr Val Glu 
            180                 185                 190         


Val Gln Arg Tyr Ile Asp Leu Asp Ser Glu Asp Gly Glu Ala Leu Leu 
        195                 200                 205             


Arg Leu Leu Gly Ala Tyr Val Pro Glu Gly Ser Ala Ser Thr Val Glu 
    210                 215                 220                 


Thr Thr Asp Ser Arg Phe Gly Ala Ser Ile Ala Glu Ser Arg Arg Thr 
225                 230                 235                 240 


Trp Leu Glu Gln Leu Gln Asp Asp Tyr His Arg Leu Phe Glu Asn Thr 
                245                 250                 255     


Ser Val Ser Ile Val Ala Ser Asp Thr Arg Asp Glu Arg Thr Val Glu 
            260                 265                 270         


His Pro Gly Asp Asp Ser Glu Thr Ala Leu Thr Tyr Asp Asp Gln Thr 
        275                 280                 285             


Leu Lys Leu Gln Met Met Asn Glu Leu Ala Ala Val Phe Phe Arg Glu 
    290                 295                 300                 


Phe Ala Gly Gln Arg Ser Arg Gly Lys Lys Ile Pro Ser Phe Val Phe 
305                 310                 315                 320 


His Leu Pro Ala Glu Lys Gln Glu Leu Phe Ile Gln Met Leu Val Glu 
                325                 330                 335     


Gly Asp Gly Ser Arg Glu Phe Pro Arg Tyr Ser Ala Glu Tyr Ala Glu 
            340                 345                 350         


Arg Asn Phe Asp Phe Glu Thr Thr Ser Arg Glu Leu Ala Gly Gly Leu 
        355                 360                 365             


Ser Val Leu Leu Thr Gln Arg Gly Thr Lys His Ser Leu Lys Tyr Arg 
    370                 375                 380                 


Glu Glu Lys Glu Ser Tyr Thr Val Arg Thr Xaa Asp Tyr Tyr Asp Ser 
385                 390                 395                 400 


Gly Arg Glu Pro Val Leu Thr Glu Val Ala His Asp Gly Tyr Val Tyr 
                405                 410                 415     


Asp Leu Ser Val Arg Glu Asn Glu Asn Phe Val Asp Ala Val Gly Gly 
            420                 425                 430         


Ile Val Leu His Asn 
        435         


<210>  210
<211>  403
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein (partial) from halophilic archaeon 
       J07HX5


<220>
<221>  misc_feature
<222>  (315)..(315)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (403)..(403)
<223>  Xaa can be any naturally occurring amino acid

<400>  210

Ser Val Thr Gly Asp Arg Pro Ile Val Ala Arg Thr Pro Asp Gly Leu 
1               5                   10                  15      


Ile Arg Ile Val Pro Ile Glu Glu Leu Phe Glu Arg Ala Ser Pro Ala 
            20                  25                  30          


Pro Ser Asp Arg Ala Leu Val Thr Thr Asp Gly Gly Pro Ala Ala Thr 
        35                  40                  45              


Ala Gly Ser Ala Lys Glu Tyr Arg Ser Leu Asp Gly Trp Asp Ala Leu 
    50                  55                  60                  


Ser Val Asn Asn Arg Gly Met Thr Glu Trp Gln Pro Ile Glu Gln Val 
65                  70                  75                  80  


Leu Arg His Glu Thr Glu Lys Glu Val Val Arg Leu Gln His Glu Arg 
                85                  90                  95      


Gly Glu Ser Val Thr Thr Arg Asp His Ser Tyr Val Ile Glu Glu Asn 
            100                 105                 110         


Gly Glu Leu Ile Glu Ala Pro Pro Glu Asp Val Gly Ser Pro Leu Gln 
        115                 120                 125             


Ile Pro Asp Val Pro Ala Thr Gln Glu Val Gly Lys Ile Asp Val Tyr 
    130                 135                 140                 


Glu Leu Leu His Gly His Glu Arg Glu Ser Met Asp Arg Gln Gly Thr 
145                 150                 155                 160 


Asp Ser Thr Thr Ile Val Arg Arg Ile His Ala Asn Asp Asp Arg Val 
                165                 170                 175     


Trp Phe Gly His Glu His Ala Ser Asn Arg His Glu Gln Thr Val Ser 
            180                 185                 190         


Val Gln Arg Tyr Ile Asp Leu Asp Ser Glu Ser Gly Asn Ala Leu Val 
        195                 200                 205             


Arg Leu Leu Gly Ala Tyr Ser Ser Asn Glu Ser Ala Ser Thr Val Glu 
    210                 215                 220                 


Thr Thr Asp Asn Arg Pro Gly Val Ser Ile Ser Ala Ser Asn Arg Arg 
225                 230                 235                 240 


Arg Leu Glu Gln Leu Gln Ala Asp Tyr His Arg Leu Phe Glu Asn Thr 
                245                 250                 255     


Thr Gly Arg Ile Val Ala Arg Glu Thr Gly Ser Glu Arg Thr Val Ser 
            260                 265                 270         


Asp Ala Ser Ser Glu Gly Glu Ala Ala Thr Ala Pro Ala Val Ala Ser 
        275                 280                 285             


Val Asp Asp Thr Leu Lys Leu Gln Met Met Asp Glu Leu Ala Ala Val 
    290                 295                 300                 


Phe Phe Arg Glu Phe Ala Gly Gln Thr Pro Xaa Glu Thr Arg Ile Pro 
305                 310                 315                 320 


Ser Phe Ile Tyr His Leu Pro Glu Glu Lys Gln Asp Leu Phe Leu Arg 
                325                 330                 335     


Met His Leu Gly Asp Asp Gly Ala Arg Thr Phe Pro Gln Ala Ala Glu 
            340                 345                 350         


Glu Ser Ala Glu Gln Asp Ile Gln Phe Glu Thr Thr Ser Arg Glu Leu 
        355                 360                 365             


Ala Gly Gly Leu Ser Leu Leu Leu Thr Gln Arg Gly Lys Lys His Ser 
    370                 375                 380                 


Phe Asn Tyr Arg Asn Gln Arg Asn Ser Tyr Thr Ile Gln Thr Gly Glu 
385                 390                 395                 400 


Tyr Asp Xaa 
            


<210>  211
<211>  437
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Halorubrum species J07HR59


<220>
<221>  misc_feature
<222>  (298)..(298)
<223>  Xaa can be any naturally occurring amino acid

<400>  211

Ser Val Thr Gly Glu Arg Pro Leu Val Val Arg Asp Gly Asp Gly Arg 
1               5                   10                  15      


Val Arg Ile Leu Pro Ala Ala Glu Leu Phe Glu Arg Ala Glu Ala Asn 
            20                  25                  30          


Asp His Ile Ala Ile Ala Ala Asp Gly Gly Pro Ala Gly Ser His Gly 
        35                  40                  45              


Leu Gly Lys Gly Arg Ala Ser Leu Pro Gly Trp Glu Ala Leu Ser Leu 
    50                  55                  60                  


Ala Ser Asp Gly Thr Ala Glu Trp Gln Pro Ile Glu Glu Val Ile Arg 
65                  70                  75                  80  


His Asp Thr Asp Gly Thr Val Val His Leu Gln His Gln Leu Gly Glu 
                85                  90                  95      


Ser Thr Thr Thr Arg Asp His Ser Tyr Val Val Glu Glu Asp Gly Glu 
            100                 105                 110         


Tyr Ile Glu Arg Ala Pro Glu Asp Val Thr Asp Pro Leu Arg Ile Pro 
        115                 120                 125             


Asp Val Pro Ser Val Asp Thr Val Asp Ser Ile Asp Val His Glu Ile 
    130                 135                 140                 


Leu Asp Gly Tyr Thr Gly Glu His Ala Asp Arg Thr Thr Pro Arg Gly 
145                 150                 155                 160 


Glu Thr Gly Ser Gln Lys Arg Pro Arg Val His Thr Asp Gly Glu Ser 
                165                 170                 175     


Val Trp Val Gly His Ser Arg Glu Gly Glu Ser Glu Asn Thr Thr Lys 
            180                 185                 190         


Val Gln Arg Glu Ile Asp Leu Thr Gly Pro Thr Gly Arg Ser Leu Val 
        195                 200                 205             


Arg Leu Leu Ala Ala Tyr Ile Arg Asp Gly Ser Thr Thr Thr Ala Glu 
    210                 215                 220                 


Thr Ala Asp Ser Thr Val Gly Val Ser Ile Ser Glu Ser Arg Pro Glu 
225                 230                 235                 240 


Trp Leu Glu Thr Ile Ala Thr Asp Ser Glu Arg Leu Phe Glu Asn Ala 
                245                 250                 255     


Gln Val Ser Val Asn Ala Gly Gly Thr Asp Asp Glu Gln Thr Leu Val 
            260                 265                 270         


Ser Glu Thr Arg Thr Asp Glu Thr Val Ser Ser Asn Gly Gly Gly Thr 
        275                 280                 285             


Gly Glu Leu Arg Met Met Asp Glu Leu Xaa Ala Val Phe Phe Thr Glu 
    290                 295                 300                 


Phe Val Gly His Thr Ala Pro Glu Lys His Ile Pro Ser Phe Val Tyr 
305                 310                 315                 320 


His Leu Asn Asp Glu Leu Gln Arg Val Phe Thr Glu Thr Leu Val Ala 
                325                 330                 335     


Val Asp Asp Leu Ala Glu Leu Ala Ser His Thr Glu Ala His Ala Gln 
            340                 345                 350         


Arg Gln Met Asn Phe Glu Ala Thr Ser Arg Glu Leu Ala Ala Gly Val 
        355                 360                 365             


Ser Met Leu Leu Thr Gln Gln Asp Arg Thr His Ser Leu His Tyr Arg 
    370                 375                 380                 


Asp Glu Thr Asn Arg Tyr Thr Ile Thr Pro Asp Glu Ser Asp Gln Ser 
385                 390                 395                 400 


Ala Arg Asp Pro Val Val Thr Glu Gln Glu His Asp Gly Tyr Val Tyr 
                405                 410                 415     


Asp Leu Ser Val Ala Glu Asn Glu Asn Phe Val Asp Ala Val Gly Gly 
            420                 425                 430         


Ile Val Leu His Asn 
        435         


<210>  212
<211>  437
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Archaeon species A07HR60


<220>
<221>  misc_feature
<222>  (298)..(298)
<223>  Xaa can be any naturally occurring amino acid

<400>  212

Ser Val Thr Gly Glu Arg Pro Leu Val Val Arg Asp Gly Asp Gly Arg 
1               5                   10                  15      


Val Arg Ile Leu Pro Ala Ala Glu Leu Phe Glu Arg Ala Glu Ala Asn 
            20                  25                  30          


Asp His Ile Ala Ile Ala Ala Asp Gly Gly Pro Ala Gly Ser His Gly 
        35                  40                  45              


Leu Gly Lys Gly Arg Ala Ser Leu Pro Gly Trp Glu Ala Leu Ser Leu 
    50                  55                  60                  


Ala Ser Asp Gly Thr Ala Glu Trp Gln Pro Ile Glu Glu Val Ile Arg 
65                  70                  75                  80  


His Asp Thr Asp Gly Thr Val Val His Leu Gln His Gln Leu Gly Glu 
                85                  90                  95      


Ser Thr Thr Thr Arg Asp His Ser Tyr Val Val Glu Glu Asp Gly Glu 
            100                 105                 110         


Tyr Leu Glu Arg Ala Pro Glu Asp Val Thr Asp Pro Leu Arg Ile Pro 
        115                 120                 125             


Asp Val Pro Ser Val Asp Thr Val Asp Ser Ile Asp Val His Glu Ile 
    130                 135                 140                 


Leu Asp Gly Tyr Thr Gly Glu His Ala Asp Arg Thr Thr Pro Arg Gly 
145                 150                 155                 160 


Glu Thr Gly Ser Gln Lys Arg Pro Arg Val His Thr Asp Gly Glu Ser 
                165                 170                 175     


Val Trp Val Gly His Ser Arg Glu Gly Glu Ser Glu Asn Thr Thr Lys 
            180                 185                 190         


Val Gln Arg Glu Ile Asp Leu Thr Gly Pro Thr Gly Arg Ser Leu Val 
        195                 200                 205             


Arg Leu Leu Ala Ala Tyr Ile Arg Asp Gly Ser Thr Thr Thr Ala Glu 
    210                 215                 220                 


Thr Ala Asp Ser Thr Val Gly Val Ser Ile Ser Glu Ser Arg Pro Glu 
225                 230                 235                 240 


Trp Leu Glu Ala Ile Ala Thr Asp Ser Glu Arg Leu Phe Glu Asn Ala 
                245                 250                 255     


Gln Val Ser Val Asn Ala Gly Gly Thr Asn Asp Glu Gln Thr Leu Val 
            260                 265                 270         


Ser Glu Thr Arg Thr Asp Glu Thr Val Ser Ser Asn Gly Gly Gly Thr 
        275                 280                 285             


Gly Glu Leu Arg Met Met Asp Glu Leu Xaa Ala Val Phe Phe Thr Glu 
    290                 295                 300                 


Phe Val Gly His Thr Ala Pro Glu Lys His Ile Pro Ser Phe Val Tyr 
305                 310                 315                 320 


His Leu Asn Asp Glu Leu Gln Arg Val Phe Thr Glu Thr Leu Val Ala 
                325                 330                 335     


Val Asp Asp Leu Ala Glu Leu Ala Ser His Thr Glu Ala His Ala Gln 
            340                 345                 350         


Arg Gln Met Asn Phe Glu Ala Thr Ser Arg Glu Leu Ala Ala Gly Val 
        355                 360                 365             


Ser Met Leu Leu Thr Gln Gln Asp Arg Thr His Ser Leu His Tyr Arg 
    370                 375                 380                 


Asp Glu Thr Asn Arg Tyr Thr Ile Thr Pro Asp Glu Ser Asp Gln Ser 
385                 390                 395                 400 


Ala Arg Asp Pro Val Val Thr Glu Gln Glu His Asp Gly Tyr Val Tyr 
                405                 410                 415     


Asp Leu Ser Val Ala Glu Asn Glu Asn Phe Val Asp Ala Val Gly Gly 
            420                 425                 430         


Ile Val Leu His Asn 
        435         


<210>  213
<211>  439
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Halonotius species J07HN6


<220>
<221>  misc_feature
<222>  (397)..(397)
<223>  Xaa can be any naturally occurring amino acid

<400>  213

Ser Val Thr Gly Asp Arg Pro Val Val Val Arg Asp Pro Asp Gly Ile 
1               5                   10                  15      


Ile Arg Val Leu Pro Ile Glu Asn Leu Phe Glu Arg Ala Thr Ser Ser 
            20                  25                  30          


Thr Ser Asp Thr Val Val Ile Thr Ala Asp Gly Gly Ala Val Gly Ser 
        35                  40                  45              


Ala Ser Ala Gly Lys Asp Arg Arg Arg Leu Asp Gly Trp Glu Ala Leu 
    50                  55                  60                  


Ser Leu Ala Ala Asp Gly Glu Pro Glu Trp Gln Pro Ile Gln Gln Ala 
65                  70                  75                  80  


Ile Arg His Asp Thr Asp Lys Pro Val Val Asn Leu Gln His Lys Phe 
                85                  90                  95      


Gly Glu Ser Thr Thr Thr Arg Asp His Ser Tyr Val Val Gly Asp Asp 
            100                 105                 110         


Gly Lys Phe Ala Glu Ala Thr Pro Asp Asp Val Asp Glu Pro Leu Arg 
        115                 120                 125             


Ile Pro Gly Val Pro Ala Val Asp Thr Ile Glu Arg Ile Asp Val Tyr 
    130                 135                 140                 


Glu Val Leu Asp Gly Tyr Thr Arg Glu Tyr Glu Asp Gly Arg Ser Val 
145                 150                 155                 160 


Gly Ser Ala Asn Ala Thr Ser Lys Thr Lys Arg Val His Ala Asn Asp 
                165                 170                 175     


Glu Trp Val Trp Phe Gly His Asp His His Asn Glu Leu Ser Lys Pro 
            180                 185                 190         


Val Lys Val Gln Arg Tyr Ile Asp Ile Asp Ser Glu Asp Gly Ala Ala 
        195                 200                 205             


Leu Leu Arg Leu Leu Ala Ala Tyr Ile Thr Glu Gly Ser Ala Ser Thr 
    210                 215                 220                 


Ile Glu Thr Thr Glu Ser Arg Phe Gly Ala Ser Ile Ser Glu Ser Arg 
225                 230                 235                 240 


Glu Glu Trp Leu Asp Gly Leu Gln Ser Asp Tyr Tyr Arg Leu Phe Glu 
                245                 250                 255     


Asn Thr Thr Ala Ser Val Ile Ala Ser Asp Ser Ser Gly Asp Arg Thr 
            260                 265                 270         


Val Glu Tyr Asp Thr Ser Asp Gly Ala Gln Ser Val Thr Tyr Asp Asp 
        275                 280                 285             


Thr Thr His Lys Leu Gln Leu Met Asn Glu Leu Ser Ala Val Phe Phe 
    290                 295                 300                 


Arg Glu Phe Ala Gly Gln Thr Ser Arg Gly Lys Arg Ile Pro Gly Leu 
305                 310                 315                 320 


Val Phe Asn Leu Pro Ala Asp Ala Gln Asp Leu Phe Leu Asp Thr Leu 
                325                 330                 335     


Ile Glu Gly Asp Gly Ser Arg Glu Phe Pro Arg Tyr Thr Asp Ala Tyr 
            340                 345                 350         


Ser Glu Arg His Phe Asp Phe Glu Thr Thr Ser Arg Glu Leu Ala Ala 
        355                 360                 365             


Gly Leu Ser Met Leu Leu Thr Gln Arg Glu Gln Lys His Ser Leu Lys 
    370                 375                 380                 


Tyr Arg Asp Thr Lys Asn Ser Tyr Thr Ile Arg Thr Xaa Asp Ser Tyr 
385                 390                 395                 400 


Arg Ser Gly Arg Asp Pro Val Leu Thr Glu Val Asp His Asp Gly Tyr 
                405                 410                 415     


Val Tyr Asp Leu Ser Val Ala Asp Asn Asp Asn Phe Val Asp Ala Val 
            420                 425                 430         


Gly Gly Val Val Leu His Asn 
        435                 


<210>  214
<211>  282
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein (partial) from Halonotius species 
       J07HN4


<220>
<221>  misc_feature
<222>  (237)..(237)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (282)..(282)
<223>  Xaa can be any naturally occurring amino acid

<400>  214

Ser Val Thr Gly Asp Arg Pro Ile Val Val Arg Asp Pro Asp Gly Ile 
1               5                   10                  15      


Val Arg Val Leu Pro Ile Glu Asn Leu Phe Glu Arg Ala Thr Ser Ser 
            20                  25                  30          


Thr Gly Asp Thr Val Val Ile Thr Ala Asp Gly Gly Ala Val Gly Ser 
        35                  40                  45              


Thr Thr Thr Gly Lys Asp Arg Arg Gln Leu Ala Asp Trp Glu Ala Leu 
    50                  55                  60                  


Ser Leu Ser Ala Asp Gly Thr Pro Glu Trp Gln Pro Ile Gln Gln Ala 
65                  70                  75                  80  


Ile Arg His Glu Val Asp Lys Pro Val Ile Asn Leu Gln His Lys Phe 
                85                  90                  95      


Gly Glu Ser Thr Thr Thr Arg Asp His Ser Tyr Val Val Gly Asp Asp 
            100                 105                 110         


Gly Glu Leu Val Glu Ala Thr Pro Asp Asp Val Asp Glu Pro Leu Arg 
        115                 120                 125             


Ile Pro Gly Met Pro Ala Val Asp Thr Val Glu Thr Ile Asp Val Tyr 
    130                 135                 140                 


Glu Ile Leu Asp Gly Tyr Thr Arg Glu Tyr Glu Asp Gly Arg Ser Val 
145                 150                 155                 160 


Gly Ser Glu Asp Ala Ala Thr Lys Thr Lys Arg Val His Ala Asn Asn 
                165                 170                 175     


Glu Trp Val Trp Phe Gly His Asp His His Asn Glu Leu Ser Lys Pro 
            180                 185                 190         


Val Lys Val Gln Arg Tyr Ile Asp Ile Asp Ser Glu Asp Gly Ala Ala 
        195                 200                 205             


Leu Leu Arg Leu Leu Ala Ala Tyr Ile Thr Glu Gly Ser Ala Ser Thr 
    210                 215                 220                 


Ile Glu Thr Thr Glu Ser Arg Phe Gly Ala Ser Ile Xaa Glu Ser Arg 
225                 230                 235                 240 


Glu Glu Trp Leu His Gly Leu Gln Ser Asn Tyr Tyr Arg Leu Phe Glu 
                245                 250                 255     


Asn Thr Thr Ala Ser Val Ile Ala Ser Asp Ser Ser Gly Glu Arg Thr 
            260                 265                 270         


Val Glu Tyr Glu Thr Asn Asn Gly Met Xaa 
        275                 280         


<210>  215
<211>  435
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Haloquadratum walsbyi J07HQW1


<220>
<221>  misc_feature
<222>  (250)..(250)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (282)..(282)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (302)..(302)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (324)..(324)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (400)..(400)
<223>  Xaa can be any naturally occurring amino acid

<400>  215

Ser Val Thr Gly Asp Arg Pro Val Val Val Arg Asp Pro Arg Gly Tyr 
1               5                   10                  15      


Leu Arg Ile Ile Pro Ile Gly Thr Leu Phe Glu Gln Ala Thr Val Pro 
            20                  25                  30          


Glu Gln Asn Ala Arg Leu Thr Ala Asp Gly Ala Pro Thr Gly Ser Ser 
        35                  40                  45              


Gly Phe Pro Lys Glu Arg Arg His Leu Ser Gln Trp Glu Ala Leu Ser 
    50                  55                  60                  


Leu Thr Asp Ala Gly Glu Thr Glu Trp Gln Pro Ile Lys Gln Ile Ile 
65                  70                  75                  80  


Arg His Gln Thr Glu Lys Glu Val Ile Thr Leu Gln His Glu His Gly 
                85                  90                  95      


Glu Ser Thr Thr Thr Arg Asp His Ser Tyr Ile Thr Asp Asp Asn Gly 
            100                 105                 110         


Glu Tyr Val Glu Thr Pro Pro Glu Asp Val Asp Glu Pro Leu Pro Ile 
        115                 120                 125             


Pro Glu Ile Ala Pro Ile Lys Thr Ile Glu Thr Ile Asp Ile Tyr Gln 
    130                 135                 140                 


Thr Leu Thr Thr Ala Ala His Thr His Thr Gly Ser Asp Ile Glu Thr 
145                 150                 155                 160 


Ser Glu Arg Leu Pro Ser Thr Asp His Ile His Ala Thr Asp Glu Tyr 
                165                 170                 175     


Val Trp Ile Asp Thr Thr Pro Glu Gly Gln Asp Lys His Asp Ser Ile 
            180                 185                 190         


Pro Ala Ile Pro Arg Tyr Ile Asp Leu Thr Ser Asp Thr Gly His Ala 
        195                 200                 205             


Leu Ile Arg Phe Leu Gly Val Tyr Leu Ser Asp Trp Ser Glu Ser Thr 
    210                 215                 220                 


Val Thr Ile Ser Glu Gln Glu Gln Gln Ala Gln Gln Leu Asp Ile Thr 
225                 230                 235                 240 


Gly Pro His Glu Ser Ala Leu Arg Thr Xaa Thr Ala Asp Ala Asp Gln 
                245                 250                 255     


Leu Phe Ile Asn Val Thr Pro Thr Val Thr Ala Asn Ala Glu Asn Asn 
            260                 265                 270         


Ala Asn Ala Val Asp Gly Glu Tyr Ser Xaa His Ile Pro Thr Thr Leu 
        275                 280                 285             


Ala Thr Thr Leu Val Ser Ala Leu Ala Gly His Pro Ala Xaa Val Lys 
    290                 295                 300                 


Gln Val Pro Ser Val Ile Tyr His Leu Pro Asp Ala Glu Gln Ser Arg 
305                 310                 315                 320 


Phe Val Gln Xaa Leu Ile Gly Ala Glu Ser Lys Leu Thr Ser Gly Thr 
                325                 330                 335     


Leu Leu Asn Asp Ser Gln Asn Ile Asn Asp Pro Ile Ser Ser Thr Gly 
            340                 345                 350         


Glu Phe Thr Ala Asn Arg Glu Leu Ala Ala Gly Val Ser Met Leu Leu 
        355                 360                 365             


Thr Gln Arg Glu Gln Ser Tyr Ser Ile Val Gln Gln Asp Ala Glu Asp 
    370                 375                 380                 


Thr Tyr Thr Ile Arg Ile Asn Asp Thr Glu Ser Ser Ser Ser Glu Xaa 
385                 390                 395                 400 


His Pro Thr Leu Thr Glu Thr Ala His Ser Gly Tyr Val Tyr Asp Leu 
                405                 410                 415     


Ser Val Glu Ala Asn Gln Asn Phe Val Asp Gly Leu Gly Gly Leu Val 
            420                 425                 430         


Leu His Asn 
        435 


<210>  216
<211>  425
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Haloquadratum walsbyi J07HQW2


<220>
<221>  misc_feature
<222>  (169)..(169)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (176)..(176)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (214)..(214)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (245)..(245)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (277)..(277)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (331)..(331)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (363)..(363)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (390)..(390)
<223>  Xaa can be any naturally occurring amino acid

<400>  216

Ser Val Thr Gly Asp Arg Pro Val Val Val Arg Asp Pro Thr Gly Arg 
1               5                   10                  15      


Ile Gln Ile Val Pro Ile Glu Thr Leu Phe Glu Leu Ala Thr Ala Pro 
            20                  25                  30          


Lys Arg Asn Thr Arg Ile Thr Ala Asp Gly Ala Pro Thr Gly Ser Ser 
        35                  40                  45              


Gly Leu Pro Lys Glu Arg Arg His Leu Asp Gln Trp Glu Ala Leu Ser 
    50                  55                  60                  


Leu Ser Asp Thr Gly Glu Thr Glu Trp Gln Ser Ile Lys Gln Ile Ile 
65                  70                  75                  80  


Arg His Gln Thr Asp Lys Glu Ile Leu Thr Leu Gln His Glu Asp Gly 
                85                  90                  95      


Glu Ser Thr Thr Thr Arg Asp His Ser Tyr Ile Thr Ala Asp Asp Gly 
            100                 105                 110         


Glu Tyr Val Glu Thr Ser Pro Lys Asn Val Asp Glu Pro Leu Ser Ile 
        115                 120                 125             


Pro Glu Ile Ala Pro Val Lys Thr Ile Glu Thr Ile Asp Ile Tyr Gln 
    130                 135                 140                 


Ile Leu Thr Ala Asn Thr Gln Thr His Ala Gly Ser Asn Ile Asp Pro 
145                 150                 155                 160 


Gly Glu Trp Leu Pro Ser Thr Asp Xaa Ile His Ala Asn Asp Glu Xaa 
                165                 170                 175     


Val Trp Ile Asn Pro Thr Gly Glu Glu Arg Glu Asp Ser Thr Pro Thr 
            180                 185                 190         


Ile Gln Arg His Ile Asp Leu Thr Ser Asp Ala Gly His Ala Leu Ile 
        195                 200                 205             


Arg Phe Leu Ala Val Xaa Leu Ser Ser Arg Ser Lys Ser Thr Val Arg 
    210                 215                 220                 


Thr Ile Glu Ser Lys Gln Tyr Leu Gln Ile Ile Gly Pro His Lys Ser 
225                 230                 235                 240 


Glu Ile Lys Thr Xaa Lys Val Ala Ile Asp Gln Leu Phe Thr Asn Val 
                245                 250                 255     


Thr Thr Arg Ile Ala Gly Asp Ala Glu Asn Asn Thr Asn Thr Gly Asp 
            260                 265                 270         


Ser Thr Phe Arg Xaa His Ile Ser Thr Thr Leu Ala Ala Thr Val Met 
        275                 280                 285             


Thr Ala Phe Val Gly His Pro Ala Glu Asn Asn Gln Leu Pro Ser Val 
    290                 295                 300                 


Val Tyr His Leu Pro Asp Ala Glu Gln Ser Tyr Phe Ile Gln Gln Leu 
305                 310                 315                 320 


Ile Arg Pro Lys Ser Asn Glu Leu Leu Ser Xaa Pro Gln Asn Ile Asp 
                325                 330                 335     


Asp Ser Ile Ser Ile Glu Ser Asp Phe Thr Thr Ser Ser Arg Glu Leu 
            340                 345                 350         


Ala Ala Gly Val Ser Met Leu Leu Thr Gln Xaa Gly Gln Ser Tyr Ser 
        355                 360                 365             


Ile Leu Gln Arg Gly Ser Glu Asp Val Tyr Thr Ile Gln Val Gly Asp 
    370                 375                 380                 


Pro Pro Ser Ser Glu Xaa Glu Pro Met Leu Ile Glu Thr Ala Asp Ser 
385                 390                 395                 400 


Gly Tyr Val Tyr Asp Leu Ser Val Glu Ala Asn Gln Asn Phe Val Asp 
                405                 410                 415     


Gly Leu Gly Gly Leu Val Leu His Asn 
            420                 425 


<210>  217
<211>  439
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Halopenitus species DYS4


<220>
<221>  misc_feature
<222>  (353)..(353)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (397)..(397)
<223>  Xaa can be any naturally occurring amino acid

<400>  217

Ser Val Thr Gly Asp Arg Pro Val Val Val Arg Asp Pro Asp Gly Val 
1               5                   10                  15      


Val Arg Val Val Pro Ile Glu Arg Leu Phe Glu Arg Ala Glu Met Glu 
            20                  25                  30          


Arg Gly Glu Glu Leu Leu Val Thr Ala Asp Gly Gly Pro Val Ala Ser 
        35                  40                  45              


Val Ala Ala Gly Lys Glu Arg Arg Asn Leu Ser Glu Trp Glu Ala Leu 
    50                  55                  60                  


Ser Val Ser Glu Gly Gly Val Pro Glu Trp Gln Pro Ile Glu Glu Ile 
65                  70                  75                  80  


Val Arg His Glu Thr Asp Lys Asp Ile Val Asn Leu Gln His Lys Phe 
                85                  90                  95      


Gly Glu Ser Thr Thr Thr Thr Asp His Ser Tyr Val Val Glu Asp Gly 
            100                 105                 110         


Asp Asp Leu Val Glu Thr Lys Pro Glu Asp Val Thr Ala Pro Leu Arg 
        115                 120                 125             


Ile Pro Gly Leu Pro Glu Val Asp Thr Val Asp Arg Ile Asp Val Tyr 
    130                 135                 140                 


Glu Val Leu Asp Gly Tyr Thr Arg Ser Tyr Glu Asp Gly Arg Ser Val 
145                 150                 155                 160 


Gly Ser Asp Asn Ala Glu Thr Lys Ile Asn Arg Val His Ala Asn Glu 
                165                 170                 175     


Glu Tyr Val Trp Phe Gly His Glu His Gln Ala Asp Gln Ser Arg Thr 
            180                 185                 190         


Val Lys Val Lys Arg His Ile Asp Leu Glu Gly Ala Asp Gly Glu Ala 
        195                 200                 205             


Leu Val Arg Leu Leu Ala Ala Tyr Val Thr Glu Gly Ser Ala Ser Thr 
    210                 215                 220                 


Ile Glu Thr Thr Asp Ser Arg Phe Gly Ala Ser Ile Ala Glu Ala Arg 
225                 230                 235                 240 


Thr Asp Trp Leu Asp Gly Leu Lys Glu Asp Tyr Asp Arg Leu Phe Glu 
                245                 250                 255     


Gly Val Thr Ala Thr Val Ile Ala Ser Asp Thr Ser Ala Glu Arg Thr 
            260                 265                 270         


Val Glu Tyr Glu Thr Asp Glu Gly Pro Ser Ser Thr Thr Tyr Asn Asp 
        275                 280                 285             


Gly Thr His Lys Leu Gln Met Met Asn Glu Leu Thr Ala Val Phe Phe 
    290                 295                 300                 


Arg Glu Phe Ala Gly Gln Thr Ser Arg Gly Lys Arg Ile Pro Gly Phe 
305                 310                 315                 320 


Val Phe Asn Leu Asp Glu Asp Leu Gln Asn Leu Phe Leu Asp Val Leu 
                325                 330                 335     


Ile Glu Gly Asp Gly Ser Arg Glu Phe Pro Arg Tyr Ser Glu Glu Tyr 
            340                 345                 350         


Xaa Glu Arg Asn Phe Asp Phe Glu Thr Thr Ser Arg Glu Leu Ala Ala 
        355                 360                 365             


Gly Leu Ser Thr Leu Leu Thr Gln Arg Gly Lys Lys His Ser Leu Lys 
    370                 375                 380                 


Tyr Arg Asp Ser Lys Gly Ser Tyr Thr Val Arg Thr Xaa Glu Phe Tyr 
385                 390                 395                 400 


Arg Gly Gly Arg Asp Pro Val Ile Lys Asp Val Asp His Asp Gly Tyr 
                405                 410                 415     


Val Tyr Asp Leu Ser Val Ala Glu Asn Glu Asn Phe Val Asp Gly Val 
            420                 425                 430         


Gly Gly Ile Val Leu His Asn 
        435                 


<210>  218
<211>  402
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Pandoravirus salinus


<220>
<221>  misc_feature
<222>  (52)..(52)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (89)..(89)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (127)..(127)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (189)..(189)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (197)..(197)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (201)..(201)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (234)..(234)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (305)..(305)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (309)..(309)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (327)..(327)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (336)..(336)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (349)..(349)
<223>  Xaa can be any naturally occurring amino acid

<400>  218

Ser Val Gly Ala Asp Thr Pro Leu Leu Leu Arg Phe Asp Gly Lys His 
1               5                   10                  15      


Val Asp Tyr Val Arg Ala Asp Gln Val Asp Gly Ile Leu Ala Arg Gly 
            20                  25                  30          


Asp Thr Ser Ala Ala Ala Ser Leu Trp Ser Ala Tyr Gln Gly Asp Lys 
        35                  40                  45              


Glu Ala Phe Xaa Leu Ala Arg Pro Val Glu Val Trp Thr Glu Arg Gly 
    50                  55                  60                  


Trp Thr Ala Val Asn Arg Val Ile Arg His Arg Ala Gly Lys Lys Met 
65                  70                  75                  80  


Phe Arg Val Leu Thr His Thr Gly Xaa Val Asp Val Thr Glu Asp His 
                85                  90                  95      


Ser Leu Leu Asp Pro Asn Ala Glu Lys Val Lys Pro Thr Glu Val Ser 
            100                 105                 110         


Val Gly Ser Ala Leu Leu His Ala Asp Leu Pro Thr His Glu Xaa Ala 
        115                 120                 125             


Thr Ala Ser Leu Lys Ser Arg Val Pro Met Ser Thr Gln Asp Ile Thr 
    130                 135                 140                 


Asp Asp Asp Gly Asp Asp Asp Ser Ala Lys Val Ser Ala Ala Ser Ser 
145                 150                 155                 160 


Thr Ser Ala Asp Gly Ser Val Leu Val Ser Ala Ala Asp Lys Ala His 
                165                 170                 175     


Ala Trp Ala Met Gly Met Phe Phe Ala Glu Gly Ser Xaa Asn Glu His 
            180                 185                 190         


Pro Arg Asp Asn Xaa Thr Gln Tyr Xaa Trp Arg Ile Ala Asn Lys Asp 
        195                 200                 205             


Met Ala Leu Val Arg Met Ala Phe Asp Gly Leu Glu Gly Arg Tyr Pro 
    210                 215                 220                 


Gly Val Thr Phe Ser Ile Ser Gly Pro Xaa Thr Asp Gly Met Ala Tyr 
225                 230                 235                 240 


Val Val Ala Asn Gly Pro Gly Lys Met Gly Leu Val Ala Asn Tyr Arg 
                245                 250                 255     


Thr Ala Phe Tyr Asp Pro Val Tyr Ala Leu Lys Arg Val Pro Thr Glu 
            260                 265                 270         


Ile Leu Asn Ala His Val Glu Ile Lys Arg Ala Phe Ile Arg Gly Tyr 
        275                 280                 285             


Phe Ala Gly Asp Gly Asn Lys Lys Leu Tyr Gly Ala Asp Gly Thr Tyr 
    290                 295                 300                 


Xaa Gly Ser Arg Xaa Gly Gly Lys Gly Lys Ile Gly Met Ala Gly Ile 
305                 310                 315                 320 


Tyr Tyr Leu Leu Ser Ala Xaa Gly Tyr Leu Ala Ser Val Asp Thr Xaa 
                325                 330                 335     


Gly Gly Pro Glu Arg Asp Thr Tyr Arg Ile Asn Phe Xaa Asp Ala Ala 
            340                 345                 350         


Thr Ala Arg Arg Thr Gln Arg Lys Pro Ala Asp Thr Val Lys Lys Ile 
        355                 360                 365             


Ile Pro Leu Asp Ser Ala Ala Phe Asp Gly Ala Tyr Val Tyr Asp Leu 
    370                 375                 380                 


Glu Thr Ala Asn His His Phe Ala Ala Gly Ile Gly Arg Leu Val Val 
385                 390                 395                 400 


His Asn 
        


<210>  219
<211>  395
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Pandoravirus inopinatum KlaHel


<220>
<221>  misc_feature
<222>  (52)..(52)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (89)..(89)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (127)..(127)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (189)..(189)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (201)..(201)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (298)..(298)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (302)..(302)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (320)..(320)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (329)..(329)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (342)..(342)
<223>  Xaa can be any naturally occurring amino acid

<400>  219

Ser Val Gly Ala Asp Thr Pro Leu Leu Leu Arg Phe Asp Gly Lys His 
1               5                   10                  15      


Ile Asp Tyr Val Arg Ala Asp Gln Val Asp Asp Val Leu Ala Arg Gly 
            20                  25                  30          


Asp Ala Ser Ala Ala Ala Arg Leu Trp Ser Ala Tyr Gln Gly Asp Lys 
        35                  40                  45              


Glu Ala Phe Xaa Pro Ala Arg Pro Val Glu Val Trp Thr Glu Arg Gly 
    50                  55                  60                  


Trp Thr Ala Val Asn Arg Val Ile Arg His Arg Ala Gly Lys Lys Met 
65                  70                  75                  80  


Phe Arg Val Leu Thr His Thr Gly Xaa Val Asp Val Thr Glu Asp His 
                85                  90                  95      


Ser Leu Leu Asp Pro Asn Ala Glu Lys Val Lys Pro Thr Glu Val Thr 
            100                 105                 110         


Val Gly Ser Ala Leu Leu His Ala Asp Leu Pro Ala Tyr Glu Xaa Ile 
        115                 120                 125             


Thr Ala Ser Leu Lys Ser His Val Pro Ala Leu Thr Gln Asp Lys Thr 
    130                 135                 140                 


Asp Asp Asn Gly Asp Asp Asp Gly Thr Thr Arg Val Ser Thr Thr Leu 
145                 150                 155                 160 


Ser Ala Thr Ala Asp Ser Ile Leu Val Thr Ala Ala Asp Lys Ala His 
                165                 170                 175     


Ala Trp Ala Met Gly Met Phe Phe Ala Glu Gly Ser Xaa Asn Gly Tyr 
            180                 185                 190         


Pro Arg Gly Asn Leu Gln Pro Tyr Xaa Trp Arg Ile Ala Asn Lys Asp 
        195                 200                 205             


Met Val Leu Met Arg Met Ala Leu Asp Gly Leu Glu Gly Arg Tyr Pro 
    210                 215                 220                 


Asp Val Thr Phe Ser Ile Asn Gly Pro Tyr Ala Asp Gly Met Ala Tyr 
225                 230                 235                 240 


Val Val Ala Asn Gly Thr Gly Lys Met Gly Leu Val Ala Asn Tyr Arg 
                245                 250                 255     


Ala Ala Phe Tyr Asp Pro Val His Ala Leu Lys Arg Val Pro Thr Glu 
            260                 265                 270         


Ile Leu Asn Ala His Val Glu Ile Lys Arg Ala Phe Ile Arg Gly Tyr 
        275                 280                 285             


Phe Ala Gly Asp Gly Asn Lys Thr Asp Xaa Gly Ser Arg Xaa Asp Gly 
    290                 295                 300                 


Arg Gly Lys Ile Gly Met Ala Gly Ile Tyr Tyr Leu Leu Ser Ala Xaa 
305                 310                 315                 320 


Gly Tyr Leu Ala Ser Ile Asn Thr Xaa Gly Gly Pro Glu Arg Asp Thr 
                325                 330                 335     


Tyr Arg Ile Asn Phe Xaa Asp Ala Ala Thr Ala Lys Arg Thr Gln Arg 
            340                 345                 350         


Lys Pro Arg Asp Ala Ile Lys Lys Ile Ile Pro Leu Asp Ala Glu Ala 
        355                 360                 365             


Phe Asp Gly Ala Tyr Val Tyr Asp Leu Glu Thr Ala Asn His His Phe 
    370                 375                 380                 


Ala Ala Gly Val Gly Arg Leu Val Val His Asn 
385                 390                 395 


<210>  220
<211>  384
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Pandoravirus macleodensis 
       macleodensis


<220>
<221>  misc_feature
<222>  (52)..(52)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (89)..(89)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (127)..(127)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (177)..(177)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (189)..(189)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (286)..(286)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (290)..(290)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (308)..(308)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (317)..(317)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (330)..(330)
<223>  Xaa can be any naturally occurring amino acid

<400>  220

Ser Val Gly Ala Asp Thr Pro Leu Leu Leu Arg Phe Asp Asn Lys Tyr 
1               5                   10                  15      


Ile Asp Tyr Val Arg Ala Asp Gln Val Asp Glu Ser Leu Ala Gly Gly 
            20                  25                  30          


Asp Ala Ser Ala Ala Ala Arg Met Trp Ser Ala Tyr Gln Gly Asp Lys 
        35                  40                  45              


Glu Ala Phe Xaa Pro Pro Arg Ser Ile Glu Val Trp Thr Glu Arg Gly 
    50                  55                  60                  


Trp Thr Ala Val Asn Arg Val Ile Arg His Lys Ala Gly Lys Lys Met 
65                  70                  75                  80  


Tyr Arg Val Leu Thr His Thr Gly Xaa Val Asp Val Thr Glu Asp His 
                85                  90                  95      


Ser Leu Leu Asp Arg His Ala Arg Lys Ile Lys Pro Thr Asp Val Ala 
            100                 105                 110         


Val Gly Ser Ala Leu Leu His Thr Asp Leu Pro Pro His Glu Xaa Pro 
        115                 120                 125             


Ser Pro Ser Leu Gln Arg His Thr Asn Ala Ser Ala Asp Pro Val Thr 
    130                 135                 140                 


Gly Asp Ala Lys Leu Leu Ala Ser Ser Thr Ala Gln Asn Met Thr Asp 
145                 150                 155                 160 


Asp Thr Ala His Ala Trp Ala Leu Gly Met Phe Phe Ala Glu Gly Ser 
                165                 170                 175     


Xaa Asn Glu Tyr Val Arg Arg Asp Arg Asp Gln Tyr Xaa Trp Arg Ile 
            180                 185                 190         


Ala Asn Lys Asp Met Val Leu Leu Arg Met Ala Leu Asp Gly Leu Asp 
        195                 200                 205             


Gly Arg Tyr Pro Gly Ile Thr Phe Ser Ile Asn Gly Pro Tyr Lys Asp 
    210                 215                 220                 


Gly Met Ala Tyr Val Val Ala Asn Gly Pro Ser Lys Met Gly Leu Val 
225                 230                 235                 240 


Ala Asn Tyr Arg Ala Ala Phe Tyr Asp Pro Ala Tyr Ala Leu Lys Arg 
                245                 250                 255     


Val Pro Thr Glu Ile Leu Asn Ala Asn Thr Lys Ile Lys Arg Ala Phe 
            260                 265                 270         


Ile Arg Gly Tyr Phe Ala Gly Asp Gly Asn Lys Thr Asp Xaa Gly Ser 
        275                 280                 285             


Arg Xaa Asp Gly Arg Gly Lys Ile Gly Met Ala Gly Ile Tyr Tyr Leu 
    290                 295                 300                 


Leu Ser Thr Xaa Gly Tyr Ser Val Ser Ile Asn Thr Xaa Gly Gly Pro 
305                 310                 315                 320 


Glu Arg Asp Thr Tyr Arg Val Asn Phe Xaa Asp Ala Ser Ala Pro Lys 
                325                 330                 335     


Arg Thr Gln Arg Lys Ala Pro Asp Thr Ile Lys Lys Ile Ile Pro Leu 
            340                 345                 350         


Asp Asp Lys Ser Phe Gly Val Asn Ala Tyr Val Tyr Asp Leu Glu Thr 
        355                 360                 365             


Ala Asn His His Phe Ala Ala Gly Ile Gly Arg Leu Val Val His Asn 
    370                 375                 380                 


<210>  221
<211>  389
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Pandoravirus neocaledonia 
       neocaledonia


<220>
<221>  misc_feature
<222>  (52)..(52)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (89)..(89)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (127)..(127)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (134)..(134)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (181)..(181)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (193)..(193)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (290)..(290)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (294)..(294)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (312)..(312)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (321)..(321)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (334)..(334)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (358)..(358)
<223>  Xaa can be any naturally occurring amino acid

<400>  221

Ser Val Gly Ala Asp Thr Pro Leu Leu Leu Arg Phe Asp Asn Lys Tyr 
1               5                   10                  15      


Ile Asp Tyr Val Arg Ala Asp Gln Val Asp Glu Ser Leu Ala Gly Gly 
            20                  25                  30          


Asp Ala Ser Ala Ala Ala Arg Met Trp Gly Ala Tyr Gln Gly Asp Lys 
        35                  40                  45              


Glu Ala Phe Xaa Pro Pro Arg Ser Ile Glu Val Trp Thr Glu Arg Gly 
    50                  55                  60                  


Trp Thr Ala Val Asn Arg Val Ile Arg His Lys Ala Gly Lys Lys Met 
65                  70                  75                  80  


Tyr Arg Val Leu Thr His Thr Gly Xaa Val Asp Val Thr Glu Asp His 
                85                  90                  95      


Ser Leu Leu Asp Arg His Ala Asn Lys Ile Lys Pro Thr Asp Val Ala 
            100                 105                 110         


Val Gly Ser Ala Leu Leu His Ala Asp Leu Pro Pro Tyr Glu Xaa Pro 
        115                 120                 125             


Ser Pro Ile Phe Glu Xaa Arg Ala Asp Ala Ser Ala Lys Ser Ala Thr 
    130                 135                 140                 


Gly Ser Ala Asn Thr Val Val Ala Ser Ser Val Ala Asp Asn Gly Ala 
145                 150                 155                 160 


Ala Val Ala Asp Asp Ala Ala His Ala Trp Ala Leu Gly Met Phe Phe 
                165                 170                 175     


Ala Glu Gly Ser Xaa Asn Glu Tyr Val Arg His Asp Arg Asp Gln Tyr 
            180                 185                 190         


Xaa Trp Arg Ile Ala Asn Lys Asp Met Ala Leu Leu Arg Ile Ala Leu 
        195                 200                 205             


Ala Gly Leu Gly Gly Arg Tyr Pro Asp Ile Ser Phe Ser Ile Val Gly 
    210                 215                 220                 


Pro Tyr Lys Asp Gly Met Ala Tyr Val Val Ala Asn Gly Pro Gly Lys 
225                 230                 235                 240 


Met Gly Leu Val Ala Asn Tyr Arg Ala Ala Phe Tyr Asp Pro Val His 
                245                 250                 255     


Ala Leu Lys Arg Val Pro Thr Glu Ile Leu Asn Ala Asn Ala Lys Ile 
            260                 265                 270         


Lys Arg Ala Phe Ile Arg Gly Tyr Phe Ala Gly Asp Gly Asn Lys Thr 
        275                 280                 285             


Asp Xaa Gly Ser Arg Xaa Asp Gly Arg Gly Lys Ile Gly Met Ala Gly 
    290                 295                 300                 


Ile Tyr Tyr Leu Leu Ser Thr Xaa Gly Tyr Ser Val Ser Ile Asn Thr 
305                 310                 315                 320 


Xaa Gly Gly Pro Glu Arg Asp Thr Tyr Arg Val Asn Phe Xaa Asp Ala 
                325                 330                 335     


Ala Thr Pro Lys Arg Thr Gln Arg Lys Ala Pro Asp Thr Ile Lys Lys 
            340                 345                 350         


Ile Ile Pro Leu Asp Xaa Asp Lys Ser Leu Gly Gly Asn Val Tyr Val 
        355                 360                 365             


Tyr Asp Leu Glu Thr Ala Asn His His Phe Ala Ala Gly Ile Gly Arg 
    370                 375                 380                 


Leu Val Val His Asn 
385                 


<210>  222
<211>  389
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Pandoravirus pampulha strain 8.5


<220>
<221>  misc_feature
<222>  (52)..(52)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (89)..(89)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (127)..(127)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (183)..(183)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (188)..(188)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (195)..(195)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (292)..(292)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (296)..(296)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (314)..(314)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (323)..(323)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (336)..(336)
<223>  Xaa can be any naturally occurring amino acid

<400>  222

Ser Val Gly Ala Asp Thr Pro Leu Leu Leu Arg Phe Glu Gly Lys Tyr 
1               5                   10                  15      


Val Asp Tyr Val Arg Ala Asp Gln Val Ala Glu Thr Leu Ala Ala Gly 
            20                  25                  30          


Asp Gly Val Ala Ala Ala Gln Leu Trp Ser Ala Tyr Gln Gly Asp Lys 
        35                  40                  45              


Glu Ala Phe Xaa Pro Arg Arg Pro Ile Glu Val Trp Thr Glu Arg Gly 
    50                  55                  60                  


Trp Thr Val Val Asn Arg Val Ile Arg His Arg Ala Gly Lys Lys Met 
65                  70                  75                  80  


Phe Arg Val Leu Thr His Thr Gly Xaa Val Asp Val Thr Glu Asp His 
                85                  90                  95      


Ser Leu Leu Asp Pro Asn Ala Glu Lys Val Lys Pro Thr Glu Ile Ser 
            100                 105                 110         


Ile Gly Ser Ala Leu Leu His Ala Asp Leu Pro Ala His Glu Xaa Ala 
        115                 120                 125             


Ala Val Ser Leu Lys Ser Arg Ala Ile Arg Pro Ala Asp Thr Thr Asn 
    130                 135                 140                 


Pro Ala Ile Leu Ala Ala Pro Thr Ala Val Asp Ser Asp Gly Asp Val 
145                 150                 155                 160 


Ala Ile Asp Asp Ala Asn Asp Thr Ala His Ala Trp Ala Met Gly Met 
                165                 170                 175     


Phe Phe Ala Glu Gly Ser Xaa Ser Glu Tyr Leu Xaa Gly Asn Ala Gln 
            180                 185                 190         


Gln Tyr Xaa Trp Arg Ile Ala Asn Lys Asp Met Val Leu Val Arg Met 
        195                 200                 205             


Thr Leu Asp Gly Leu Lys Gly Arg Tyr Pro Asp Val Asp Phe Ser Ile 
    210                 215                 220                 


Ile Gly Pro Tyr Ala Asp Gly Met Ala Tyr Val Val Ala Asn Gly Pro 
225                 230                 235                 240 


Ala Lys Met Gly Leu Val Ala Asn Tyr Arg Ala Ala Phe Tyr Asp Pro 
                245                 250                 255     


Val His Ala Leu Lys Arg Val Pro Thr Glu Ile Leu Asn Ala Asn Val 
            260                 265                 270         


Glu Ile Lys Arg Ala Phe Ile Arg Gly Tyr Phe Ala Gly Asp Gly Asn 
        275                 280                 285             


Lys Thr Asp Xaa Gly Ser Arg Xaa Asp Gly Arg Gly Lys Ile Gly Met 
    290                 295                 300                 


Ala Gly Ile Tyr Tyr Leu Leu Ser Ala Xaa Gly Tyr Leu Gly Ser Ile 
305                 310                 315                 320 


Asn Thr Xaa Gly Gly Pro Glu Arg Asp Thr Tyr Arg Ile Asn Phe Xaa 
                325                 330                 335     


Asp Ala Ala Ala Pro Lys Arg Thr Gln Arg Lys Pro Pro Asp Asn Ile 
            340                 345                 350         


Lys Lys Ile Ile Pro Leu Asp Ala Lys Ala Phe Asp Gly Ala Tyr Val 
        355                 360                 365             


Tyr Asp Leu Glu Thr Ala Asn His His Phe Ala Ala Gly Ile Gly Arg 
    370                 375                 380                 


Leu Val Val His Asn 
385                 


<210>  223
<211>  387
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Pandoravirus braziliensis strain 
       SL2


<220>
<221>  misc_feature
<222>  (52)..(52)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (89)..(89)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (127)..(127)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (180)..(180)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (192)..(192)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (279)..(279)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (289)..(289)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (293)..(293)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (311)..(311)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (320)..(320)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (333)..(333)
<223>  Xaa can be any naturally occurring amino acid

<400>  223

Ser Val Gly Ala Asp Thr Pro Leu Leu Leu Arg Phe Asp Asn Lys Tyr 
1               5                   10                  15      


Ile Asp Tyr Val Arg Ala Asp Gln Val Asp Glu Ser Leu Ser Gly Gly 
            20                  25                  30          


Asp Ala Ala Ala Ala Ala Arg Met Trp Ser Ala Tyr Gln Gly Asp Lys 
        35                  40                  45              


Glu Ala Phe Xaa Pro Pro Arg Ser Ile Glu Val Trp Thr Glu Arg Gly 
    50                  55                  60                  


Trp Thr Ala Val Asn Arg Val Ile Arg His Arg Ala Gly Lys Arg Met 
65                  70                  75                  80  


Tyr Arg Val Leu Thr His Thr Gly Xaa Val Asp Val Thr Lys Asp His 
                85                  90                  95      


Ser Leu Leu Asp Pro Gln Ala Asn Lys Ile Lys Pro Thr Asp Val Ala 
            100                 105                 110         


Val Gly Ser Ala Leu Leu His Ala Asp Leu Pro Pro Tyr Glu Xaa Pro 
        115                 120                 125             


Ser Ser Ser Leu Gln Arg Arg Ala Ala Ala Ser Asp Asp Pro Val Ile 
    130                 135                 140                 


Asp Ser Ala Lys Ser Leu Ala Ser Ser Ala Gly Glu Ser Thr Ala Val 
145                 150                 155                 160 


Ser Thr Asn Asp Thr Ala His Ala Trp Ala Leu Gly Met Phe Phe Ala 
                165                 170                 175     


Glu Gly Ser Xaa Asn Glu Tyr Val Arg Gly Asp Arg Asp Gln Tyr Xaa 
            180                 185                 190         


Trp Arg Ile Ala Asp Lys Asp Met Val Leu Leu Arg Thr Ala Leu Asp 
        195                 200                 205             


Gly Leu Asp Gly Arg Tyr Pro Gly Val Thr Phe Ser Ile Asn Gly Pro 
    210                 215                 220                 


Tyr Lys Asp Gly Met Ala Phe Val Val Ala Asn Gly Pro Gly Lys Met 
225                 230                 235                 240 


Gly Leu Val Ala Asn Tyr Arg Ala Ala Phe Tyr Asp Pro Val His Ala 
                245                 250                 255     


Leu Lys Arg Val Pro Thr Glu Ile Leu Asn Ala Asn Thr Glu Ile Lys 
            260                 265                 270         


Arg Ala Phe Ile Arg Gly Xaa Phe Ala Gly Asp Gly Asn Lys Thr Asp 
        275                 280                 285             


Xaa Gly Ser Arg Xaa Asp Gly Leu Gly Lys Ile Gly Met Ala Gly Ile 
    290                 295                 300                 


Tyr Tyr Leu Leu Ser Thr Xaa Gly Tyr Ser Ala Ser Val Asn Thr Xaa 
305                 310                 315                 320 


Gly Gly Pro Glu Arg Asp Thr Tyr Arg Ile Asn Phe Xaa Asp Ala Ala 
                325                 330                 335     


Thr Pro Lys Arg Thr Gln Arg Lys Ala Pro Asn Ile Ile Lys Lys Ile 
            340                 345                 350         


Ile Pro Leu Asp Gly Glu Ala Phe Gly Gly Asn Ala Tyr Val Tyr Asp 
        355                 360                 365             


Leu Glu Thr Ala Asn His His Phe Ala Ala Gly Ile Gly Arg Leu Ile 
    370                 375                 380                 


Val His Asn 
385         


<210>  224
<211>  308
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Labyrinthulid quahog parasite QPX
       NY07348D


<220>
<221>  misc_feature
<222>  (17)..(17)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (65)..(65)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (71)..(71)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (87)..(87)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (100)..(100)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (104)..(104)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (120)..(120)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (137)..(137)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (164)..(164)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (220)..(220)
<223>  Xaa can be any naturally occurring amino acid

<400>  224

Ser Val Val Gly Ser Thr Gln Leu Leu Leu Lys Val Asn Gly Met Leu 
1               5                   10                  15      


Xaa Val Ser Arg Ile Ser Glu Met Val Tyr Leu Leu His Ser Ser Arg 
            20                  25                  30          


Trp Thr Thr Trp Arg His Glu Lys Gln Ala Val Gln Ile Asp Pro Arg 
        35                  40                  45              


Lys Asp Glu Ile Phe Thr Trp Thr Asp Thr Gly Trp Ser Arg Val Arg 
    50                  55                  60                  


Xaa Ile Ile Arg His Arg Xaa Thr Lys Ser Leu Phe Lys Ile Lys Thr 
65                  70                  75                  80  


Thr His Gly Asp Val Thr Xaa Thr Asn Asp His Ser Leu Leu Gln Pro 
                85                  90                  95      


Asp Gly Ser Xaa Ile Ser Pro Xaa Ala Leu Ser Ile Gly Lys Thr Arg 
            100                 105                 110         


Leu Leu Ser Ser Phe Pro Asn Xaa Arg Asp Phe Pro Gln Glu Asp Ser 
        115                 120                 125             


Pro Glu Leu Leu Val Pro Gly Pro Xaa Val Pro Gln Ser Phe Val Leu 
    130                 135                 140                 


Asp Ala Met Ile Ala Glu Leu Ala Gly Leu Tyr Ile Ser Gln Gly Arg 
145                 150                 155                 160 


Asp Asp Leu Xaa Val Val Val Pro Asn Pro Val Arg Arg Gln Ala Phe 
                165                 170                 175     


Leu His Thr Leu Gln His Arg Leu Gly His Leu Asn Trp Ala Ile Val 
            180                 185                 190         


Asn Asp Leu Lys Ile Val Pro Val Asp Leu Gly Asp Ala Lys Ala Leu 
        195                 200                 205             


Lys Glu Thr Arg Arg Phe Trp Asn His Asp Val Xaa Asn Asp Gly Leu 
    210                 215                 220                 


Gly Val Pro Asn Trp Val Leu Asn Ser Asn Arg Arg Ile Arg Val Ala 
225                 230                 235                 240 


Phe Ser Lys Gly Phe Gly Ala Gly Asp Leu Glu Leu Gly Gly Phe Thr 
                245                 250                 255     


Leu Glu Ala Ser Leu Thr Leu Asn His Asp Arg Lys Leu Glu Glu Ala 
            260                 265                 270         


Leu Val Leu Asp Ile Leu Pro Val Glu Asn His Gly Glu Phe Val Tyr 
        275                 280                 285             


Asp Leu Thr Thr Glu Asn His His Phe Gln Ala Gly Ser Gly Ser Leu 
    290                 295                 300                 


Ile Val His Asn 
305             


<210>  225
<211>  308
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Labyrinthulid quahog parasite QPX
       NY0313808BC1


<220>
<221>  misc_feature
<222>  (17)..(17)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (65)..(65)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (71)..(71)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (87)..(87)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (100)..(100)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (104)..(104)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (120)..(120)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (137)..(137)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (164)..(164)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (220)..(220)
<223>  Xaa can be any naturally occurring amino acid

<400>  225

Ser Val Val Gly Ser Thr Gln Leu Leu Leu Lys Val Asn Gly Met Leu 
1               5                   10                  15      


Xaa Val Ser Arg Ile Ser Glu Met Val Tyr Leu Leu His Ser Ser Arg 
            20                  25                  30          


Trp Thr Thr Trp Arg His Glu Lys Gln Ala Val Gln Ile Asp Pro Arg 
        35                  40                  45              


Lys Asp Glu Ile Phe Thr Trp Thr Asp Thr Gly Trp Ser Arg Val Arg 
    50                  55                  60                  


Xaa Ile Ile Arg His Arg Xaa Thr Lys Ser Leu Phe Lys Ile Lys Thr 
65                  70                  75                  80  


Thr His Gly Asp Val Thr Xaa Thr Asn Asp His Ser Leu Leu Gln Pro 
                85                  90                  95      


Asp Gly Ser Xaa Ile Ser Pro Xaa Ala Leu Ser Ile Gly Lys Thr Arg 
            100                 105                 110         


Leu Leu Ser Ser Phe Pro Asn Xaa Arg Asp Phe Pro Gln Glu Asp Ser 
        115                 120                 125             


Pro Glu Leu Leu Val Pro Gly Pro Xaa Val Pro Gln Ser Phe Val Leu 
    130                 135                 140                 


Asp Ala Met Ile Ala Glu Leu Ala Gly Leu Tyr Ile Ser Gln Gly Arg 
145                 150                 155                 160 


Asp Asp Leu Xaa Val Val Val Pro Asn Pro Val Arg Arg Gln Ala Phe 
                165                 170                 175     


Leu His Thr Leu Gln His Arg Leu Gly His Leu Asn Trp Ala Ile Val 
            180                 185                 190         


Asn Asp Leu Lys Ile Val Pro Val Asp Leu Gly Asp Ala Lys Ala Leu 
        195                 200                 205             


Lys Glu Thr Arg Arg Phe Trp Asn His Asp Val Xaa Asn Asp Gly Leu 
    210                 215                 220                 


Gly Val Pro Asn Trp Val Leu Asn Ser Asn Arg Arg Ile Arg Val Ala 
225                 230                 235                 240 


Phe Ser Lys Gly Phe Gly Ala Gly Asp Leu Glu Leu Gly Gly Phe Thr 
                245                 250                 255     


Leu Glu Ala Ser Leu Thr Leu Asn His Asp Arg Lys Leu Glu Glu Ala 
            260                 265                 270         


Leu Val Leu Asp Ile Leu Pro Val Glu Asn His Gly Glu Phe Val Tyr 
        275                 280                 285             


Asp Leu Thr Thr Glu Asn His His Phe Gln Ala Gly Ser Gly Ser Leu 
    290                 295                 300                 


Ile Val His Asn 
305             


<210>  226
<211>  229
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Russia Kulunda-steppe soda lake 
       Tanatar-5 brine environmental genomics


<220>
<221>  misc_feature
<222>  (82)..(82)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (117)..(117)
<223>  Xaa can be any naturally occurring amino acid

<400>  226

Ser Val Leu Lys Asp Thr Pro Ile Ile Val Tyr Asp Ile Ile Glu Lys 
1               5                   10                  15      


Glu Ile Lys Ile Lys Thr Ile Glu Glu Leu Gly Gly Lys Glu Pro Arg 
            20                  25                  30          


Ile Trp His Ser Tyr Lys Asn Phe Lys Val Leu Asp Ser Tyr Leu Ser 
        35                  40                  45              


Asn Arg Arg Ser Lys Lys Gln Ser Phe Ile Ser Asn Glu Arg Tyr Leu 
    50                  55                  60                  


Val Trp Thr Ala Ser Gly Trp Ser Ser Ile Asn Arg Val Ile Lys His 
65                  70                  75                  80  


Lys Xaa Asn Lys Lys Ile Tyr Arg Ile Gln Thr Ser His Ser Ile Val 
                85                  90                  95      


Asp Val Thr Glu Asp His Ser Leu Ile Asn Glu Lys Gly Asn Lys Ile 
            100                 105                 110         


Lys Pro Ser Glu Xaa Thr Ile Gly Thr Arg Leu Leu Tyr His Pro Leu 
        115                 120                 125             


Leu Met Asn Leu Ile Asp Lys Glu Glu Tyr Arg Ile Lys Tyr Lys Lys 
    130                 135                 140                 


Val Ser Asp Lys Arg His Phe Thr Ser Lys Lys Thr Leu Ser Lys Tyr 
145                 150                 155                 160 


Ile Leu Lys Ser Lys Tyr Pro Met Asn Val Asn Val Ser Arg Asp Thr 
                165                 170                 175     


Gln Gly Asn Glu Ile Tyr Ser Ala Ser Val His Ile Gly Leu Tyr Asp 
            180                 185                 190         


Asn Arg Ile Thr Lys Ile Glu Leu Leu His Asp Lys Val Val Glu Tyr 
        195                 200                 205             


Val Tyr Asp Ile Glu Thr Gln Asp Gly Thr Phe Asn Val Gly Phe Pro 
    210                 215                 220                 


Leu Ile Val Lys Asn 
225                 


<210>  227
<211>  229
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Russia Kulunda-steppe soda lake 
       Tanatar-5 brine environmental genomics


<220>
<221>  misc_feature
<222>  (82)..(82)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (117)..(117)
<223>  Xaa can be any naturally occurring amino acid

<400>  227

Ser Val Leu Lys Asp Thr Pro Ile Ile Val Tyr Asp Val Ile Lys Arg 
1               5                   10                  15      


Lys Ile Asn Ile Lys Thr Ile Glu Glu Leu Gly Gly Ile Asp Glu Tyr 
            20                  25                  30          


Val Trp His Ser Tyr Asn Asn Phe Lys Val Phe Asp Ser Tyr Leu Ser 
        35                  40                  45              


Asn Arg Arg Phe Lys Lys Gln Ser Phe Ile Ser Asn Glu Arg Tyr Leu 
    50                  55                  60                  


Val Trp Thr Ser Ser Gly Trp Ser Ser Ile His Arg Ile Ile Lys His 
65                  70                  75                  80  


Lys Xaa Asn Lys Lys Ile Tyr Arg Ile Gln Thr Asn His Ser Ile Val 
                85                  90                  95      


Asp Val Thr Glu Asp His Ser Leu Ile Asp Glu Thr Gly Lys Lys Ile 
            100                 105                 110         


Lys Pro Ser Gln Xaa Ser Ile Gly Thr Arg Leu Leu Tyr Asn Pro Leu 
        115                 120                 125             


Leu Phe Gly Leu Glu Asn Lys Asp Asn Tyr Glu Met Lys Tyr Arg Lys 
    130                 135                 140                 


Val Ser Asp Ile Lys Tyr Tyr Glu Thr Lys Lys Glu Leu Ser Lys Tyr 
145                 150                 155                 160 


Val Leu Gln Ser Lys Tyr Pro Met Lys Ile Glu Ser Ser Arg Asp Ser 
                165                 170                 175     


Gln Gly Asn Glu Ile Tyr Ser Ala Ser Val His Ile Gly Leu Tyr Asp 
            180                 185                 190         


Asn Arg Ile Lys Lys Ile Glu Leu Leu His Asn Arg Val Gly Asp Tyr 
        195                 200                 205             


Val Tyr Asp Ile Glu Thr Thr Asp Gly Thr Phe Asn Val Gly Phe Pro 
    210                 215                 220                 


Leu Ile Val Lys Asn 
225                 


<210>  228
<211>  228
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Russia Kulunda-steppe soda lake 
       Tanatar-5 brine environmental genomics


<220>
<221>  misc_feature
<222>  (19)..(19)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (82)..(82)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (117)..(117)
<223>  Xaa can be any naturally occurring amino acid

<400>  228

Ser Val Leu Gly Asp Thr Pro Ile Ile Val Tyr Asp Leu Ile Glu His 
1               5                   10                  15      


Lys Val Xaa Val Lys Thr Ile Lys Glu Leu Gly Asp Gly Arg Ile His 
            20                  25                  30          


Gln Trp Thr Ser Tyr Lys Asn Phe Lys Val Phe Asp Ser Tyr Leu Ser 
        35                  40                  45              


Asn Arg Arg Phe Lys Lys Gln Ser Tyr Asn Ala Asn Glu Arg Tyr Leu 
    50                  55                  60                  


Val Trp Ser Ala Ser Gly Trp Ser Ser Ile Arg Arg Ile Ile Lys His 
65                  70                  75                  80  


Lys Xaa Asn Lys Lys Ile Tyr Arg Val Gln Thr Pro His Ser Ile Val 
                85                  90                  95      


Asp Val Thr Glu Asp His Ser Leu Leu Asp Tyr Lys Gly Arg Lys Leu 
            100                 105                 110         


Lys Pro Ser Glu Xaa Val Val Gly Thr Lys Leu Leu Tyr Asn Pro Ile 
        115                 120                 125             


Leu Phe Gly Leu Pro Asp Lys Glu Phe Tyr Gln Leu Lys Tyr His Lys 
    130                 135                 140                 


Asp Thr Gln Glu Arg Met Ser Phe Lys Thr Lys Lys Glu Leu Ala Thr 
145                 150                 155                 160 


Phe Ile Leu Arg Ser Lys Tyr Pro Met Lys Val Glu Tyr Arg Lys Gly 
                165                 170                 175     


Asp Glu Lys Pro Tyr Ser Ala Thr Thr His Ile Gly Leu Tyr Asp Asn 
            180                 185                 190         


Arg Ile Thr His Met Glu Val Leu His Ser Arg Ile Met Glu Tyr Val 
        195                 200                 205             


Tyr Asp Ile Glu Thr Ser Asp Gly Thr Phe Asn Val Gly Phe Pro Leu 
    210                 215                 220                 


Ile Val Lys Asn 
225             


<210>  229
<211>  149
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein (partial) from Gujarat India activated
       biomass of a wastewater treatment plant treating hydrocarbon 
       contaminated wastewater at high TDS (total dissolved solids)


<220>
<221>  misc_feature
<222>  (1)..(1)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (15)..(15)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (29)..(29)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (103)..(103)
<223>  Xaa can be any naturally occurring amino acid

<400>  229

Xaa Ser Ile Gly Asp Tyr Thr Ile Glu Asp Leu Tyr Asn Gln Xaa Thr 
1               5                   10                  15      


Asp Phe Tyr Ser Lys Gly Ser Lys Glu Tyr Gly Lys Xaa Ser Leu Leu 
            20                  25                  30          


Val Leu Gly Tyr Asp His Ser Ser Gln Thr Ala Asp Phe Lys Ser Val 
        35                  40                  45              


Asn Tyr Val Met Arg His Lys Thr Thr Lys Asp Ile Tyr Lys Leu Thr 
    50                  55                  60                  


Leu Gln Asp Gly Lys Thr Val Lys Thr Thr His Asp His Ser Ala Met 
65                  70                  75                  80  


Val Ile Arg Asp Gly Thr Leu Ile Glu Val Lys Pro Phe Glu Ile Asn 
                85                  90                  95      


Asn Thr Asp Met Phe Ile Xaa Ile Asp Asp Thr Asn Lys Val Tyr Asn 
            100                 105                 110         


Thr Ala Ile Asp Lys Val Glu Leu Leu Gly Thr Phe Asp Asp Tyr Val 
        115                 120                 125             


Tyr Asp Val Ser Ile Asn Asp Asn Ser His Tyr Phe Phe Gly Asn Asp 
    130                 135                 140                 


Ile Leu Leu His Asn 
145                 


<210>  230
<211>  164
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (25)..(25)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (88)..(88)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (96)..(96)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (161)..(161)
<223>  Xaa can be any naturally occurring amino acid

<400>  230

Ser Val Val Gly Ser Ser Glu Val Met Thr Asp Ser Gly Lys Met Thr 
1               5                   10                  15      


Ile Glu Asp Met Phe Asp Glu Ile Xaa Asp Ser Asp Ser Asp Ser Lys 
            20                  25                  30          


Ile Arg Val Arg Ser Asn Arg Lys Val Tyr Val Leu Ala Asp Gly Val 
        35                  40                  45              


Pro Glu Leu Arg Glu Ile Asn Tyr Ile Met Arg His Lys Thr Glu Lys 
    50                  55                  60                  


Lys Ile Tyr Arg Ile Glu Ser Glu Asn Gly Lys Ile Val His Ala Thr 
65                  70                  75                  80  


Ser Asp His Ser Ile Ile Val Xaa Arg Asp Gly Ile Ile Ser Arg Xaa 
                85                  90                  95      


Lys Pro Glu Glu Leu Thr Ser Ser Asp Glu Leu Leu Thr Ile Asn Asp 
            100                 105                 110         


Tyr Pro Asp Leu Gly Phe Ile Pro Ser Lys Val Lys Ser Val Thr Val 
        115                 120                 125             


Ser Glu Tyr Asn Trp Glu Asp Asn Tyr Val Tyr Asp Ile Ser Val Lys 
    130                 135                 140                 


Val Glu His Glu Asp Asp Leu Glu His Thr Phe Phe Ala Asn Gly Ile 
145                 150                 155                 160 


Xaa Val His Asn 
                


<210>  231
<211>  159
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from human gut metagenome Denmark 
       Roux-en-Y gastric bypass surgery of morbidly obese patient


<220>
<221>  misc_feature
<222>  (26)..(26)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (38)..(38)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (45)..(45)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (52)..(52)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (95)..(95)
<223>  Xaa can be any naturally occurring amino acid

<400>  231

Ser Ile Thr Gly Asp Ser Ala Ile Arg Met Glu Asp Gly Ser Tyr Glu 
1               5                   10                  15      


Thr Ile Glu Asn Leu Phe Asn Ala Tyr Xaa Asp Thr Asp Ser Asp Asp 
            20                  25                  30          


Lys Ile Arg Val Ser Xaa Asp Lys Arg Val Tyr Gly Xaa Ser Ser Asp 
        35                  40                  45              


Phe Asn Pro Xaa Thr Tyr Pro Ile Lys Tyr Ile Met Arg His Lys Thr 
    50                  55                  60                  


Asn Lys Thr Ile Trp Arg Val Thr Ser Thr Gln Lys Thr Ile Asn Val 
65                  70                  75                  80  


Thr Glu Asp His Ser Ile Val Ile Val Arg Asn Asn Ile Met Xaa Asn 
                85                  90                  95      


Ile Lys Pro Asn Asp Ile Asp Ile Asp Thr Asp Lys Leu Ile Ile Tyr 
            100                 105                 110         


Asp Asn Asp Lys Ile Val Leu Ala Asp Ile Gln Ser Asn Thr Ile Thr 
        115                 120                 125             


Asn Leu Ser Asp Glu Tyr Val Tyr Asp Ile Glu Ile Asp Ser Asp Asp 
    130                 135                 140                 


Ala Glu Lys His Val Phe Phe Ala Asn Asp Ile Leu Val His Asn 
145                 150                 155                 


<210>  232
<211>  166
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (29)..(29)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (78)..(78)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (137)..(137)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (146)..(146)
<223>  Xaa can be any naturally occurring amino acid

<400>  232

Ser Val Ala Gly Asp Ser Arg Ile Leu Leu Lys Tyr Gly Asp Asn Tyr 
1               5                   10                  15      


Phe Thr Lys Glu Ile Gln Glu Leu Phe Asn Glu Phe Xaa Asp Ser Asp 
            20                  25                  30          


Ser Gly Asp Lys Ile Arg Val Lys Val Gly Ser Lys Tyr Leu Val Gly 
        35                  40                  45              


Thr Tyr Asp Pro Glu Thr Asp Lys Pro Ile Tyr Arg Gly Ile Asp Tyr 
    50                  55                  60                  


Val Met Arg His Lys Thr Asn Lys Arg Arg Phe Arg Ile Xaa Leu Asp 
65                  70                  75                  80  


Gly Gly Asn Ser Val Val Val Thr Glu Asp His Ser Ile Met Val Leu 
                85                  90                  95      


Lys Asp Asp Asn Leu Val Glu Ala Ala Val Lys Asp Leu His Ile Gly 
            100                 105                 110         


Asp Lys Leu Ile Val Tyr Ala Asn Arg Glu Ser Val Val Gln Lys Asp 
        115                 120                 125             


Ile His Ser Ile Ile Tyr Val Gly Xaa Asp Asp Glu Tyr Val Tyr Asp 
    130                 135                 140                 


Leu Xaa Val Phe Ala Asp Lys Asp Ile Gln His Thr Phe Phe Ala Asp 
145                 150                 155                 160 


Asn Ile Leu Val His Asn 
                165     


<210>  233
<211>  253
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from uncultured linear-genome virus ML
       43 kmer contig 2588 from Midtre Lovenbreen (Svalbard) glacier 
       cryoconite holes sediment


<220>
<221>  misc_feature
<222>  (88)..(88)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (142)..(142)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (171)..(171)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (197)..(197)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (212)..(212)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (239)..(239)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (243)..(243)
<223>  Xaa can be any naturally occurring amino acid

<400>  233

Ser Val Leu Gly Ser Thr Val Leu Val Leu Arg Asp Pro Lys Thr Asn 
1               5                   10                  15      


Trp Ile Ser Ile Lys Thr Ile Glu Gln Leu Ser Ile Gly Thr Asn Ser 
            20                  25                  30          


Tyr Ser Tyr Asp Asn Phe Lys Leu Asp Gly Thr Leu Arg Ser His Lys 
        35                  40                  45              


Ser Tyr Thr Leu Thr Asn Tyr Gln Ile Trp Thr Asp Val Gly Trp Ser 
    50                  55                  60                  


Asn Ile Lys Arg Val Ile Lys His Lys Thr Asn Lys Lys Ile Phe Arg 
65                  70                  75                  80  


Val Val Asn Asn Asn Gly Trp Xaa Asp Val Thr Glu Asp His Ser Leu 
                85                  90                  95      


Leu Asp Ser Gln Leu Asn Pro Ile Lys Pro Glu Gln Ile Asn Gly Asp 
            100                 105                 110         


Thr Val Leu Ala Asn Thr Phe Ile Asn Glu Phe Asn Lys Asp Lys Thr 
        115                 120                 125             


Asn Ile Ser Ser Ser Leu Ala Tyr Lys Met Gly Tyr Lys Xaa Ile Leu 
    130                 135                 140                 


Asp Pro Val Ile Asn Gly Ser Ile Ile Asn Ser Pro Lys Asn Ile Arg 
145                 150                 155                 160 


Ala Ser Phe Tyr Lys Gly Tyr Leu Lys Asn Xaa Asn Ile Lys Gln Val 
                165                 170                 175     


Lys Ser Gln Val Trp Trp Gln Gly Ile Tyr Tyr Ile Gln Lys Ser Leu 
            180                 185                 190         


Gly Phe Asn Asn Xaa Ile Ser Lys Ser Asp Ser Pro Phe Val Asn Thr 
        195                 200                 205             


Asn Leu Lys Xaa Ser Asn Asp Glu Thr Ser Ile Gln Glu Leu Glu Met 
    210                 215                 220                 


Ser Ser Ile Asp Gly Asp Phe Val Tyr Asp Leu Glu Thr Glu Xaa Gly 
225                 230                 235                 240 


Arg Phe Xaa Ala Gly Val Gly Ser Leu Leu Leu Lys Asn 
                245                 250             


<210>  234
<211>  253
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from uncultured linear-genome virus AB
       43 kmer contig 3071 from Austre Broggerbreen (Svalbard) glacier 
       cryoconite holes sediment


<220>
<221>  misc_feature
<222>  (88)..(88)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (142)..(142)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (171)..(171)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (197)..(197)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (212)..(212)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (239)..(239)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (243)..(243)
<223>  Xaa can be any naturally occurring amino acid

<400>  234

Ser Val Leu Gly Ser Thr Val Leu Val Leu Arg Asp Pro Lys Thr Asn 
1               5                   10                  15      


Trp Ile Ser Ile Lys Thr Ile Glu Gln Leu Ser Ile Gly Thr Asn Ser 
            20                  25                  30          


Tyr Ser Tyr Asp Asn Phe Lys Leu Asp Gly Thr Leu Arg Ser His Lys 
        35                  40                  45              


Ser Tyr Thr Leu Thr Asn Tyr Gln Ile Trp Thr Asp Val Gly Trp Ser 
    50                  55                  60                  


Asn Ile Lys Arg Val Ile Lys His Lys Thr Asn Lys Lys Ile Phe Arg 
65                  70                  75                  80  


Val Val Asn Asn Asn Gly Trp Xaa Asp Val Thr Glu Asp His Ser Leu 
                85                  90                  95      


Leu Asp Ser Gln Leu Asn Pro Ile Lys Pro Glu Gln Ile Asn Gly Asp 
            100                 105                 110         


Thr Val Leu Ala Asn Thr Phe Ile Asn Glu Phe Asn Lys Asp Lys Thr 
        115                 120                 125             


Asn Ile Ser Ser Ser Leu Ala Tyr Lys Met Gly Tyr Lys Xaa Ile Leu 
    130                 135                 140                 


Asp Pro Val Ile Asn Gly Ser Ile Ile Asn Ser Pro Lys Asn Ile Arg 
145                 150                 155                 160 


Ala Ser Phe Tyr Lys Gly Tyr Leu Lys Asn Xaa Asn Ile Lys Gln Val 
                165                 170                 175     


Lys Ser Gln Val Trp Trp Gln Gly Ile Tyr Tyr Ile Gln Lys Ser Leu 
            180                 185                 190         


Gly Phe Asn Asn Xaa Ile Ser Lys Ser Asp Ser Pro Phe Val Asn Thr 
        195                 200                 205             


Asn Leu Lys Xaa Ser Asn Asp Glu Thr Ser Ile Gln Glu Leu Glu Met 
    210                 215                 220                 


Ser Ser Ile Asp Gly Asp Phe Val Tyr Asp Leu Glu Thr Glu Xaa Gly 
225                 230                 235                 240 


Arg Phe Xaa Ala Gly Val Gly Ser Leu Leu Leu Lys Asn 
                245                 250             


<210>  235
<211>  505
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Klosneuvirus-KNV1


<220>
<221>  misc_feature
<222>  (26)..(26)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (163)..(163)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (177)..(177)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (393)..(393)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (432)..(432)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (437)..(437)
<223>  Xaa can be any naturally occurring amino acid

<400>  235

Ser Ile Leu Gly Asn Glu Val Leu Thr Leu Leu Asn Asp Lys Asn Asp 
1               5                   10                  15      


Ile Glu Phe His Arg Ile Asp Lys Leu Xaa Asp Lys Trp Glu Ser Tyr 
            20                  25                  30          


Asp Asn Phe Lys Thr Asp Glu Tyr Asn Glu Tyr Phe Thr Asn Ile Leu 
        35                  40                  45              


Asn Lys Leu Phe Lys Asp Lys Thr Asp Ser Asn Asn Ser Ile Leu Met 
    50                  55                  60                  


Asn Glu Thr Asp Tyr Ile Asn Gly Lys Ile Met Gly Ser Ile Tyr Thr 
65                  70                  75                  80  


Asn Asn Ile Arg Lys Asn His Glu Val Ser Ile Gly Ala Leu Gly Lys 
                85                  90                  95      


Ile Asp Gly Lys Arg Thr Lys Lys Gln Phe Tyr Phe Ser Met Tyr Gly 
            100                 105                 110         


Asn Asp Thr Lys Ala Leu His Glu Ala Leu Lys Tyr Arg Leu Lys Leu 
        115                 120                 125             


Asn Glu Glu Leu Asp Leu Ile Lys Asn Lys Tyr Arg Tyr Met Ile Asp 
    130                 135                 140                 


Leu Asp Ser Asn Arg Tyr Ile Glu Val Gln Val Asn Asn Asn Lys Ile 
145                 150                 155                 160 


Met Leu Xaa Asp Ile Asp Asp Ile Asp Ile Ile Glu Lys Tyr Thr Trp 
                165                 170                 175     


Xaa Ile Asn Glu Lys Asn Tyr Val Val Ser Asn Asn Val Asn Ser Glu 
            180                 185                 190         


Ser Trp Gln Tyr His Lys Leu Val Leu Asn Lys Ile Leu Asn Lys Leu 
        195                 200                 205             


Pro Asn Asn Ile Arg Asn Gln Ile Lys Asp Leu Thr Val Asp His Met 
    210                 215                 220                 


Asn Arg Asn Thr Leu Asp Asn Arg Lys Gln Asn Leu Arg Leu Val Asn 
225                 230                 235                 240 


Ser Lys Glu Gln Gln Trp Asn Gln Asp Ile Phe Lys Thr Asn Thr Ser 
                245                 250                 255     


Gly Thr Arg Gly Val Tyr Tyr Arg Thr Asn Arg Lys Ala Trp Ile Ala 
            260                 265                 270         


Asn Trp Leu Asn Leu Asp Gly Lys Arg Glu Ser Lys Tyr Phe Lys Asn 
        275                 280                 285             


Lys Gln Asp Ala Ile Asn Glu Arg Lys Thr Gln Glu Gln Ile Met Glu 
    290                 295                 300                 


Thr Tyr Phe Asn Asp Glu Arg Lys Lys Leu Ile Asp Lys Leu Lys Gln 
305                 310                 315                 320 


Leu Leu Ile Asn Asp Gln Lys Asp Arg Tyr Asp Lys Glu Gln Ser Ser 
                325                 330                 335     


Ala Asn Tyr Lys Val Trp Thr Asp Lys Gly Trp Ser Asn Ile Asn Arg 
            340                 345                 350         


Val Ile Arg His Lys Thr Tyr Lys Arg Ile Phe Arg Ile Val Thr Lys 
        355                 360                 365             


Thr Ser Ile Ile Asp Val Thr Glu Asp His Ser Leu Ile Asp Lys Asn 
    370                 375                 380                 


Gly Asn Tyr Ile Thr Pro Lys Thr Xaa Thr Ile Gly Thr Glu Leu Met 
385                 390                 395                 400 


Tyr Gly Ile Asn Asn Phe Asp Thr Ile Glu Phe Val Asp Ile Lys Ser 
                405                 410                 415     


Asn Lys Tyr Asp Ile Leu Ala Phe Ser Ser Asp Asp Lys Val Glu Xaa 
            420                 425                 430         


Met Arg Tyr Tyr Xaa Tyr Asn Lys Lys Leu Gly Tyr Asn Ile Leu Ile 
        435                 440                 445             


Asp Val Asn Asn Asp Lys Phe Val Leu His Arg Thr Asn Asp Val Ile 
    450                 455                 460                 


Glu Asn Pro Asn Lys Ile Ile Asn Ile Gln Gln Leu Asn Asp Val Val 
465                 470                 475                 480 


Asp Asp Tyr Val Tyr Asp Leu Glu Thr Glu Val Gly His Phe His Ala 
                485                 490                 495     


Gly Ile Gly Glu Ile Ile Val Lys Asn 
            500                 505 


<210>  236
<211>  355
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Catovirus CTV1932-1286


<220>
<221>  misc_feature
<222>  (104)..(104)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (114)..(114)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (157)..(157)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (229)..(229)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (246)..(246)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (281)..(281)
<223>  Xaa can be any naturally occurring amino acid

<400>  236

Ser Val Thr Gly Glu Thr Pro Ile Leu Leu Lys Asn Pro Thr Thr Asn 
1               5                   10                  15      


Lys Ile Val Ile Lys Arg Ile Asp Glu Leu Gly Thr Glu Trp Lys Asn 
            20                  25                  30          


Tyr Asn Asn Tyr Lys Ser Ser Asp Ser Asn Arg Tyr Phe Arg Glu Leu 
        35                  40                  45              


Leu Thr Ile Leu Phe Lys Asn Lys Asn Val Glu Lys Lys Arg Glu Asn 
    50                  55                  60                  


Phe Val Pro Val Ser Gln Trp Met Tyr Ile Lys Asn Asn Lys Tyr Arg 
65                  70                  75                  80  


Tyr Ile Asn Asp Ala Tyr Ser Asn Asn Thr Tyr Leu Gln Val Asn Leu 
                85                  90                  95      


Asn Asn Asn Lys Asn Met Ile Xaa Asp Val Glu Asp Ile Asp Ile Ile 
            100                 105                 110         


Asn Xaa Glu Leu Lys Asn Ile Ser Lys Met Ile Leu Asn Lys Ile Ile 
        115                 120                 125             


Lys Leu Phe Pro Lys Gln Ile Gln Ala Lys Leu Gln Lys Tyr Glu Val 
    130                 135                 140                 


Asn Tyr Ile Asn Glu Asn Ser Leu Asp Asn Arg Lys Xaa Asn Leu Ser 
145                 150                 155                 160 


Ile Asp Val Asp Asn Gln Thr Ile Asp Lys Ile Ile Glu Asp Ile Glu 
                165                 170                 175     


Asn Asp Gln Lys Asp Arg Tyr Ser Lys Glu Gln Ser Thr Thr Asn Tyr 
            180                 185                 190         


Leu Ala Tyr Thr Asp Lys Gly Trp Ala Lys Ile Asn Lys Ile Ile Arg 
        195                 200                 205             


His Lys Thr Thr Lys Lys Ile Tyr Arg Ile Leu Thr Thr Asn Ser Ile 
    210                 215                 220                 


Val Glu Val Thr Xaa Asp His Ser Leu Ile Asp Glu Asn Gly Asn Tyr 
225                 230                 235                 240 


Ile Met Pro Lys Asp Xaa Val Ile Gly Lys Gly Leu Met Gln Ser Tyr 
                245                 250                 255     


Pro His Thr Asn Lys Leu Lys Tyr Asn Asp Lys Tyr Asp Lys Ser Ile 
            260                 265                 270         


Phe Thr Asn Lys Asn Tyr Lys Glu Xaa Met Lys Tyr Tyr His Tyr Asn 
        275                 280                 285             


Lys Ser His Gly Phe Asn Ile Thr Ile Asp Glu Asn Asn Gly Ile Ile 
    290                 295                 300                 


Thr Leu Lys Arg Thr Lys Asp Asn Ile Asn Asn Ile Asn Lys Ile Val 
305                 310                 315                 320 


Lys Ile Phe Asp Leu Gly Asp Ile Ser Thr Asp Arg Tyr Val Tyr Asp 
                325                 330                 335     


Leu Glu Thr Glu His Gly Arg Phe His Ala Gly Val Gly Glu Leu Ile 
            340                 345                 350         


Val Thr Asn 
        355 


<210>  237
<211>  444
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from coal oil point, Santa-Barbara 
       California, crude oil metagenome 3 and 7


<220>
<221>  misc_feature
<222>  (28)..(28)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (48)..(48)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (57)..(57)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (104)..(104)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (171)..(171)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (252)..(252)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (278)..(278)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (341)..(341)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (353)..(353)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (373)..(373)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (435)..(435)
<223>  Xaa can be any naturally occurring amino acid

<400>  237

Ser Val Asp Ala Gly Thr Asp Ile Ile Ile Arg Glu Gly Lys Leu Val 
1               5                   10                  15      


Lys Ile Val Asn Ile Lys Asp Leu Phe Glu Ser Xaa Asp Ser Glu Ile 
            20                  25                  30          


Tyr Arg Lys Gly Asp His Glu Tyr Lys Ala Pro Glu Lys Ile Glu Xaa 
        35                  40                  45              


Leu Ser Val Ser Lys Glu Gly Ile Xaa Glu Trp Lys Lys Ala Asn Ser 
    50                  55                  60                  


Ile Lys Arg His Pro Tyr Glu Gly Lys Ile Leu Asn Ile Gln Thr Arg 
65                  70                  75                  80  


Arg Gly Ser Ile Ser Val Thr Lys Asn His Ser Leu Tyr Ala Leu Ser 
                85                  90                  95      


Ser Gly Leu Lys Glu Ile Leu Xaa Arg Asp Leu Asn Lys Asn Thr Pro 
            100                 105                 110         


Leu Ala His Ile Ser Lys Tyr Ser Gln Ala Glu Lys Lys Leu Lys Ile 
        115                 120                 125             


Asn Ala Leu Glu Pro Leu Arg Lys Phe Ser Asn Glu Leu Asn Ile Trp 
    130                 135                 140                 


Leu Ser Val Pro Ile Asn Asp Leu Thr Lys Arg Leu Leu Lys Tyr His 
145                 150                 155                 160 


Gln Pro Arg Asn Asn Phe Lys Gly Gly Arg Xaa Lys Lys Glu Phe Ile 
                165                 170                 175     


Arg Ile Pro Ile Pro Thr Ala Ile Glu Leu Tyr Asn Lys Lys Val Ile 
            180                 185                 190         


Glu Asp Lys Asp Ile Gln Asn Ala Phe Ile Ser Ser Tyr Gly Gly Lys 
        195                 200                 205             


Gly Lys Ile Pro Val Ile Tyr Glu Leu Asn Lys Asp Phe Ala Arg Ile 
    210                 215                 220                 


Leu Gly Ala Tyr Ala Ala Glu Gly Ser Leu His Ile Arg Lys Arg Lys 
225                 230                 235                 240 


Gly Ile Ser Lys Glu Gly Ala Tyr Ile Phe Ala Xaa Gly His Asp Ile 
                245                 250                 255     


Gln Ser Leu Glu Glu Leu Lys Lys Ile Leu Ala Arg Ile Phe Arg Arg 
            260                 265                 270         


Asn Phe Asn Val Thr Xaa Ser Gly Val Asp Lys Asn Gly Arg Asn Phe 
        275                 280                 285             


Arg Ile Lys Ser Asn Ser Ala Val Ala Tyr Leu Phe Lys Phe Val Leu 
    290                 295                 300                 


Asp Val Gly Gln Gly Ser Gln Gly Lys Glu Val Ser Pro Tyr Ile Leu 
305                 310                 315                 320 


Ser Ser Ser Lys Ser Ile Gln Arg Ala Phe Phe Asp Glu Tyr Thr Lys 
                325                 330                 335     


Gly Glu Gly Tyr Xaa Asp Lys Arg Arg Arg Val Asn Pro Leu Leu Glu 
            340                 345                 350         


Xaa Thr Thr Lys Ser Lys Lys Leu Ala Glu Gly Leu Ser Leu Met Ala 
        355                 360                 365             


Ile Asn Leu Asn Xaa Gly Leu Pro Ser Ile Arg Phe Arg Lys Glu Asn 
    370                 375                 380                 


Ser Ser Tyr Gln Leu Arg Phe Val Gln Tyr Asp Leu Asn Ser Val Lys 
385                 390                 395                 400 


Tyr Arg Asp Leu Ser Gly Leu Leu Pro Lys Glu Ile Lys Glu Val Lys 
                405                 410                 415     


Pro Thr Asp Gly Tyr Val Tyr Asp Val Gly Val Glu Gly Asn Asn Asn 
            420                 425                 430         


Phe Val Xaa Ala Lys Gly Leu Ile Leu Ala His Asn 
        435                 440                 


<210>  238
<211>  186
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein (partial) from Euryarchaeon species 
       RBG_13_31_8


<220>
<221>  misc_feature
<222>  (48)..(48)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (57)..(57)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (104)..(104)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (109)..(109)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (186)..(186)
<223>  Xaa can be any naturally occurring amino acid

<400>  238

Ser Val Asp Ser Glu Thr Asp Ile Ile Ile Lys Gly Asn Asn Leu Ile 
1               5                   10                  15      


Lys Ile Thr Asn Ile Lys Asp Leu Phe Glu Lys Gln Ser Ser His Ile 
            20                  25                  30          


Tyr Lys Lys Glu Asn His Glu Tyr Lys Lys Leu Lys Asp Ile Glu Xaa 
        35                  40                  45              


Leu Ser Val Asn Asn Asn Gly Ile Xaa Glu Trp Lys Lys Ala Asn Phe 
    50                  55                  60                  


Ile Lys Arg His Pro Tyr Lys Asn Asp Ile Ile Lys Val Lys Thr Gln 
65                  70                  75                  80  


Arg Gly Met Ile Ser Val Thr Lys Asn His Ser Leu Tyr Thr Ile Ser 
                85                  90                  95      


Lys Asn Ile Lys Glu Ile Tyr Xaa Lys Gln Leu Asn Xaa Lys Asp Thr 
            100                 105                 110         


Asn Ile Val His Ile Ser Asn Phe Asn Glu Lys Gly Lys Lys Ile Ile 
        115                 120                 125             


Ile Asn Ala Leu Asn Pro Leu Val Glu Phe Asn Asn Glu Leu Asn Ile 
    130                 135                 140                 


Val Phe Asn Ile Pro Leu Lys Asn Ser Thr Lys Asn Leu Leu Lys Tyr 
145                 150                 155                 160 


Tyr Gln His Arg Lys Asn Tyr Lys Gly Lys Val Thr Leu Asn Lys Lys 
                165                 170                 175     


Tyr Ile Arg Ile Lys Leu Ile Asp Ala Xaa 
            180                 185     


<210>  239
<211>  393
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from coal oil point, Santa-Barbara 
       California, crude oil metagenome 7


<220>
<221>  misc_feature
<222>  (7)..(7)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (28)..(29)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (82)..(82)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (171)..(171)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (177)..(177)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (267)..(267)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (321)..(321)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (386)..(386)
<223>  Xaa can be any naturally occurring amino acid

<400>  239

Ser Ile Val Gly Ala Arg Xaa Ile Ala Ile Lys Asp Asp Asn Leu Ile 
1               5                   10                  15      


Asn Ile Ile Pro Ile Glu Glu Leu Trp Lys Lys Xaa Xaa Glu Pro Met 
            20                  25                  30          


Glu Lys Ile Asn Gly Lys Glu Ile Lys Thr Pro Gln Ser Val Phe Thr 
        35                  40                  45              


Leu Ser Lys Asn Gly Glu Trp Gln Lys Ile Lys Lys Ile Ile Arg His 
    50                  55                  60                  


Lys Thr Asn Lys Ile Ile Phe Arg Ile Asn Gln Lys Asn Gly Glu Thr 
65                  70                  75                  80  


Ile Xaa Thr Glu Asp His Ser Leu Ile Thr Glu Asn Tyr Lys Pro Ile 
                85                  90                  95      


Lys Pro Lys Glu Leu Gly Asp Lys Lys Ile Leu Phe Leu Lys Lys Val 
            100                 105                 110         


Pro His Glu Pro Thr Ile Phe Asp Arg Lys Ile Asp Leu Tyr Pro Leu 
        115                 120                 125             


Val Ser His Tyr Ser Phe Glu Thr Glu Tyr Lys Glu Arg Lys Lys Gln 
    130                 135                 140                 


Asn Ala Trp Lys Ala Asp Asn Lys Phe Leu Trp Phe Gly Trp Thr Ser 
145                 150                 155                 160 


Arg Glu Asn Gln His Lys Ile Asn Arg Phe Xaa Asn Leu Ser Asp Leu 
                165                 170                 175     


Xaa Lys Leu Leu Gly Ile Tyr Ile Ala Asp Gly His Ser Ser Leu His 
            180                 185                 190         


Lys Gly Lys Tyr Gly Ile Lys Ala Thr Ala Gly Ile Ser Ser Lys Asp 
        195                 200                 205             


Thr Leu Phe Leu Asn Glu Leu Lys Glu Ile Met Lys Arg Ile Asp Tyr 
    210                 215                 220                 


Asn His Gln Ile Ser Ile Leu Arg Val Ser Lys Gly Glu Arg Val Ile 
225                 230                 235                 240 


Gln Gly Tyr Lys Tyr Glu Asp Arg Thr Tyr Arg Leu Gln Thr Asn Ser 
                245                 250                 255     


Thr Thr Trp Thr Ala Phe Phe Ser Ser Leu Xaa Gly Ser Gly Ser Glu 
            260                 265                 270         


Asn Lys His Leu Pro Gln Phe Ile Phe Asn Val Glu Lys Lys Tyr Gln 
        275                 280                 285             


Glu Leu Leu Tyr Lys Tyr Tyr Leu Leu Gly Asp Gly Ser Val Glu Gln 
    290                 295                 300                 


Gly Lys Asn Thr Phe Thr Ser Lys Ser Leu Gln Leu Val Ser Gly Leu 
305                 310                 315                 320 


Xaa Phe Leu Leu Lys Ser Trp Asn Ile Asp Thr Ser Ile Tyr Tyr His 
                325                 330                 335     


Glu Lys Arg Asp Val Tyr Arg Val Arg Glu Arg Glu Arg Ala Val Asp 
            340                 345                 350         


Ser Met His Pro Ile Lys Thr Lys Leu Ile Glu Leu Pro Lys Val Glu 
        355                 360                 365             


Arg Tyr Val Tyr Asp Leu Ser Val Glu Asn Thr Glu Met Phe Val Asp 
    370                 375                 380                 


Ala Xaa Gly Met Leu Leu Leu His Asn 
385                 390             


<210>  240
<211>  359
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Chlamydomonas sphaeroides 
       NIES-2242


<220>
<221>  misc_feature
<222>  (19)..(19)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (156)..(156)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (182)..(182)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (222)..(224)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (239)..(239)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (260)..(260)
<223>  Xaa can be any naturally occurring amino acid

<400>  240

Ser Val Thr Gly Tyr Thr Pro Val Leu Val Arg Ile Gln His Lys Val 
1               5                   10                  15      


Arg Tyr Xaa Thr Ile Glu Lys Leu Pro Gly Leu Leu Gly Gly Gly Glu 
            20                  25                  30          


Trp Val Pro Val Val Pro Ala Glu Pro Ala Ala His His Glu Lys Pro 
        35                  40                  45              


Lys Glu Ala Leu Asp Leu Thr His His His Val Glu Thr Trp Thr Glu 
    50                  55                  60                  


Ser Gly Trp Thr Arg Leu Arg Arg Ile Ile Arg His Ala Leu Pro Pro 
65                  70                  75                  80  


Gly Lys Arg Ile Leu Arg Ile Thr Thr Pro Thr Gly Val Val Asp Ala 
                85                  90                  95      


Thr Asp Asp His Ser Leu Leu Ala Pro Asn Gly Asp Pro Ile Thr Pro 
            100                 105                 110         


Arg Asn Leu His Ile Gly Thr Pro Leu Met His Ala Pro Leu Pro Leu 
        115                 120                 125             


Asp Glu Trp Arg Arg Met Ala Gln Thr Ala Ala Arg Thr Thr Thr Pro 
    130                 135                 140                 


Glu Arg Ala Arg Ile Leu Gly Ile Phe Val Ala Xaa Arg Ala Met Ala 
145                 150                 155                 160 


Phe Ser Gln Arg Gly Phe Ile Trp Thr Ala Ser Lys Ser Ile Gly Ser 
                165                 170                 175     


Tyr Phe Met Asp Leu Xaa Asn Lys Glu Phe Gly Gly Phe Thr Trp Thr 
            180                 185                 190         


Met Thr Thr Ser Pro Glu Asp Gly Ile Thr Val Val Lys Pro Glu Gly 
        195                 200                 205             


Val Pro Gly His Ile Leu Tyr Ser Thr Leu Leu Gln Thr Xaa Xaa Xaa 
    210                 215                 220                 


Gly Gly Ser Ala Asp Glu Pro Met Val Pro Asp Asp Val Leu Xaa Ser 
225                 230                 235                 240 


Asn Ser Ile His Val His Met Ala Phe Ile Glu Gly Phe Arg Met Gly 
                245                 250                 255     


Asn Pro Met Xaa Leu Ile Phe Thr His Leu Leu Gly Ala Thr Leu Phe 
            260                 265                 270         


Val Leu Leu Gln Phe Met Ile Glu Ala His Gly Asp Ser Gly Ala Thr 
        275                 280                 285             


Asp Val Arg Val Ala Gly Ser Met Pro Asn Ile Asn Leu Val Thr Ser 
    290                 295                 300                 


Pro Ser Ser Thr Pro Ser Thr Glu Asp Ser Thr Ser Val Met Ser Met 
305                 310                 315                 320 


Val Asp Ile Thr Asp Ala Leu Arg Ser Lys Gly Asp Pro Thr Leu Leu 
                325                 330                 335     


Met Val Tyr Asp Leu Thr Thr Asp Asn His His Phe Ala Ala Gly Val 
            340                 345                 350         


Gly Gln Met Val Val His Asn 
        355                 


<210>  241
<211>  337
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Marine sediment metagenome LCGC14


<220>
<221>  misc_feature
<222>  (143)..(143)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (335)..(335)
<223>  Xaa can be any naturally occurring amino acid

<400>  241

Ser Ile Met Gly Thr Glu Ala Val Val Val Arg Phe Ala Asp Asp Glu 
1               5                   10                  15      


Pro Pro Met Ile Leu Ser Ile Gln Glu Leu Trp Asp Met Ala His Glu 
            20                  25                  30          


Lys Gly Tyr Gln Ile Val Tyr Arg Ala Asp Gly Lys Glu Gln Leu Val 
        35                  40                  45              


Val Gln Gly Leu Arg Val Trp Asp Gly Lys Gly Trp Asn Arg Val Tyr 
    50                  55                  60                  


Lys Leu Ile Arg His Phe Thr Ile Lys Thr Ile Tyr Arg Thr Ala Thr 
65                  70                  75                  80  


Gly Lys Gly Ala Val Asp Thr Thr Glu Asp His Ser Leu Val Asp Asp 
                85                  90                  95      


Glu Gly His Glu Phe Lys Pro Glu His Ile Ser Ser Arg Gly Lys Glu 
            100                 105                 110         


Pro Val Ser Ser Gln Phe Ile Leu Pro Ser Lys Arg Leu Ser Tyr Thr 
        115                 120                 125             


Asp Asp Ile Ser Ala Thr Leu Leu Leu Phe Leu Phe Ser Met Xaa Gly 
    130                 135                 140                 


Thr Ala Ser Gly Thr Tyr Thr Gly Tyr Pro Ser Gln Arg Thr Val Ala 
145                 150                 155                 160 


Leu His Phe Ser Pro Lys Ile Lys Asp Ile Ala Gln Lys Leu Trp Trp 
                165                 170                 175     


Glu Ala Lys Pro Ala Ala Ala Ala Phe Gly Ala Lys His Lys Leu Tyr 
            180                 185                 190         


Ser Thr Lys Lys Glu Lys Leu Val Ile Lys Phe Ser Asn His Thr Gln 
        195                 200                 205             


Leu Trp Ala Phe Val Arg Arg Asn Leu Tyr Ile Lys Asn Gln Lys Ile 
    210                 215                 220                 


Lys Pro Leu Pro Ser Val Leu Ser Leu Pro Glu His Tyr Leu Glu Thr 
225                 230                 235                 240 


Ile Tyr Gly Met Leu Lys Asp Tyr Tyr Tyr Asn Phe Asp Glu Lys Thr 
                245                 250                 255     


Lys Gln Asp Arg Trp Thr Ile Lys Ser Ser Ser Glu Met Val Leu Phe 
            260                 265                 270         


Thr Ala Leu Ala Asn Arg Phe Lys Glu Glu Met Thr Val Val Pro Gln 
        275                 280                 285             


Pro Lys Gly Lys Ile Tyr Gln Ile Lys Arg Thr Ser Asp Met Lys Arg 
    290                 295                 300                 


Arg Ala Thr Val Ser Ile Thr Arg Ile Leu Pro Asp Tyr Val Tyr Asp 
305                 310                 315                 320 


Met Glu Thr Glu Asp Gly Thr Phe Met Ala Gly Asn Ile Leu Xaa His 
                325                 330                 335     


Asn 
    


<210>  242
<211>  309
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Russia Kulunda-steppe soda lake 
       Tanatar-5 brine environmental genomics


<220>
<221>  misc_feature
<222>  (37)..(37)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (48)..(48)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (123)..(123)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (173)..(173)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (212)..(212)
<223>  Xaa can be any naturally occurring amino acid

<400>  242

Ser Val Thr Gly Tyr Thr Pro Val Tyr Val Lys Asp Gln Asn Gly Ser 
1               5                   10                  15      


Ile Met Leu Val Thr Ile Glu Ser Leu Ala Glu Lys Tyr Gly Asp Ser 
            20                  25                  30          


Thr Trp Ile Pro Xaa Val Glu Asn Gly Arg Gln Thr Lys Glu Ala Xaa 
        35                  40                  45              


Glu Leu Val Gly Leu Glu Thr Trp Thr Glu Lys Gly Trp Thr Pro Leu 
    50                  55                  60                  


Lys Arg Val Ile Arg His Ile Leu Ala Pro His Lys Lys Ile Ile Arg 
65                  70                  75                  80  


Ile Val Thr His Thr Gly Ile Val Asp Val Thr Asp Asp His Ser Leu 
                85                  90                  95      


Leu Asp Ala Glu Gly Asn Pro Val Thr Ser Lys Asp Ile Ser Met Gly 
            100                 105                 110         


Thr Pro Leu Leu His His Val Leu Pro Pro Xaa Glu Glu Glu Glu Glu 
        115                 120                 125             


Glu Glu Gly Lys Glu Glu Glu Lys Glu Ala Glu Val Met Gly Phe Ser 
    130                 135                 140                 


Tyr Gly Glu Gly Glu His Asp Glu Ile Pro Arg His Ile Phe Asp Ala 
145                 150                 155                 160 


Pro Leu His Ile Lys Gln Ala Phe Trp Lys Gly Leu Xaa Asp Ala Lys 
                165                 170                 175     


Arg Gly Asp Glu Lys Asp Val Glu Gly Asn Thr Thr Phe Asp His Arg 
            180                 185                 190         


Ser Met Leu Gly Ala Ser His Ile Ala Arg Leu Ala Thr Tyr Leu Gly 
        195                 200                 205             


Tyr Ser Tyr Xaa Leu Ser Ile Asp Ser Met Asp Ser Met Asp Ser Met 
    210                 215                 220                 


Asp Ser Met Lys Gln Asp Lys Phe Arg Ser Ala Gly Ala Lys Phe Arg 
225                 230                 235                 240 


Ser Ala Gly Thr Lys Phe Arg Ile Thr Leu Lys Ser Thr Thr Ser His 
                245                 250                 255     


Glu Thr Arg Gly His Thr Asp His Thr Asp His Thr Asp Asn Ser Asp 
            260                 265                 270         


Arg Thr Thr Ile Ser Lys Met Tyr Glu Ile Pro Tyr Glu Gly Tyr Val 
        275                 280                 285             


Tyr Asp Leu Thr Thr Glu Asn His His Phe Ala Gly Gly Val Gly Gln 
    290                 295                 300                 


Leu Ile Leu His Asn 
305                 


<210>  243
<211>  303
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Chlamydomonas asymmetrica 
       NIES-2207


<220>
<221>  misc_feature
<222>  (5)..(5)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (39)..(39)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (57)..(57)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (95)..(95)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (141)..(141)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (156)..(156)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (262)..(262)
<223>  Xaa can be any naturally occurring amino acid

<400>  243

Ser Val Ala Ala Xaa Thr Pro Val Ile Val Arg Ser Ser Ala Asp Ala 
1               5                   10                  15      


Asp Asp Trp Asp Ile Leu Pro Ile Glu Leu Val Pro Tyr Ala Tyr Gly 
            20                  25                  30          


Asp Ala His Trp Val Arg Xaa His Asp Glu Ser Gly Asp Asp Asn Ser 
        35                  40                  45              


Ser Asp Val Asp Ala Lys Glu Ala Xaa Glu Leu Tyr Asp Arg Val Glu 
    50                  55                  60                  


Ala Trp Thr Glu Asp Gly Trp Thr Pro Leu His Arg Val Ile Arg His 
65                  70                  75                  80  


Val Leu Pro Gln His Thr Ser Leu Thr Arg Val Val Thr Pro Xaa Gly 
                85                  90                  95      


Leu Val Asp Val Thr Asn Asp His Ser Leu Leu Thr Ala Pro Pro Glu 
            100                 105                 110         


Pro Val Pro Val Ser Pro Arg Asp Val Val Ala Gly Ser Thr Arg Leu 
        115                 120                 125             


Leu His His Ala Ala Tyr Pro Pro Ala Pro Thr Ser Xaa Ser Ser Ile 
    130                 135                 140                 


Asn Glu Asp Trp Ala Gln Arg Ala Ala Val Glu Xaa Val Ile Thr Ala 
145                 150                 155                 160 


Thr Phe Asn Gly Ser Ala Ala Leu Pro Trp Gln Leu Lys Gln Val Ala 
                165                 170                 175     


Met Ala Pro Gly Arg Ile Val Arg Glu Phe Trp Lys Ala Met Leu Gly 
            180                 185                 190         


Thr Met Lys Ala Leu Arg Ala Ser Pro Gly Ala Asp Ile Leu Ser Leu 
        195                 200                 205             


Arg Met Ser Gln Ala Ser Ala Ala Trp Leu Leu Val Val Ala Ala Arg 
    210                 215                 220                 


Ile Gln Glu Arg Ala Thr Val His Asp Thr Thr Ser His Asp Thr Trp 
225                 230                 235                 240 


Met Gly Asp Asp Val Val Val Val Ser Phe Ser Ala Thr Pro Ser Ser 
                245                 250                 255     


Ser Leu Gln Ser His Xaa Asp Asp His Val Val Arg Thr Val Arg Ala 
            260                 265                 270         


Leu Pro Thr Ser Ser Ile Pro Arg Tyr Val Tyr Asp Leu Thr Thr Asp 
        275                 280                 285             


Asn His His Phe Ala Ala Gly Pro Gly Arg Met Val Val His Asn 
    290                 295                 300             


<210>  244
<211>  210
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Gulf of Maine environmental 
       sampling


<220>
<221>  misc_feature
<222>  (28)..(28)
<223>  Xaa can be any naturally occurring amino acid

<400>  244

Ser Val Met Gly Tyr Thr Pro Ile Val Leu Lys Asn Asp Lys Asp Ile 
1               5                   10                  15      


Ile Asn Ile Ile Ser Phe Asp Asp Leu Glu Tyr Xaa Lys Trp Leu Ser 
            20                  25                  30          


Tyr Asn Asn Leu Asp Lys Asn Gly Ser Asn Lys Glu Gln Leu Phe Asn 
        35                  40                  45              


Pro Gly Tyr Phe Ile Trp Thr His Asn Gly Trp Ser Pro Ile Ile Arg 
    50                  55                  60                  


Phe Ile Arg His Lys Thr Ile Lys Gln Ile Tyr Arg Ile Ile Thr Asn 
65                  70                  75                  80  


Ser Gly Ile Ile Asp Val Thr Glu Asp His Ser Leu Leu Asp Leu Asn 
                85                  90                  95      


Val Lys Glu Ile Lys Pro Asn Gln Leu Lys Ile Asn Asp Asn Leu Leu 
            100                 105                 110         


His Ser Lys Ile Lys Thr Lys Lys Ile Asn Lys Glu Ser Ser Lys Phe 
        115                 120                 125             


Phe Asn Ile Asn Asn Lys Leu Ile Tyr Lys Asp Leu Gln Asp Phe Val 
    130                 135                 140                 


Leu Arg Ser Lys Asn Ile Lys Ile Glu Lys Ile Asn Asn Asn Asp Tyr 
145                 150                 155                 160 


Met Ile Val Asn Val Asp Tyr Lys Tyr Arg Asp Glu Asn Lys Ile Ile 
                165                 170                 175     


Glu Ile Ile Lys Leu His Glu Lys Tyr Glu Gly Tyr Val Tyr Asp Ile 
            180                 185                 190         


Glu Thr Lys Glu Gly Val Phe Asn Ala Gly Ile Gly Asn Leu Val Val 
        195                 200                 205             


Lys Asn 
    210 


<210>  245
<211>  211
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein (partial) from Block Island 
       environmental sampling


<220>
<221>  misc_feature
<222>  (105)..(105)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (112)..(112)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (171)..(171)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (211)..(211)
<223>  Xaa can be any naturally occurring amino acid

<400>  245

Ser Val Met Pro Tyr Thr Pro Leu Thr Ile Leu Arg Asp Asp Gly Val 
1               5                   10                  15      


Val Glu Val Thr Thr Phe Asn Asp Phe Asn Asp Leu Glu Trp Thr Ser 
            20                  25                  30          


Tyr Ser Glu Phe Asn Lys Val Gly Thr His Lys Glu Gln Ile Phe Asn 
        35                  40                  45              


Pro Gly Phe Lys Val Trp Thr Asn Asn Gly Trp Ser Ser Val Val Arg 
    50                  55                  60                  


Leu Ile Arg His Lys Thr Val Lys Lys Ile Tyr Arg Val Leu Thr Asn 
65                  70                  75                  80  


Ser Gly Leu Val Asp Val Thr Glu Asp His Arg Leu Leu Asp Lys Asp 
                85                  90                  95      


Leu Asn Ile Ile Lys Pro Ser Gln Xaa Glu Lys Gly Gln Glu Leu Xaa 
            100                 105                 110         


His Ser Lys Ile Lys Ile Gly Lys His Ile Leu Lys Tyr Arg Lys Gln 
        115                 120                 125             


Asp Glu Ile Tyr Ile Glu Lys Tyr Gly Lys Ile Tyr Leu Asn Asp Asp 
    130                 135                 140                 


Gln Gln Asn Glu Ala Gln Tyr Ile Tyr Ile Leu Ser Gln His Phe Asn 
145                 150                 155                 160 


Asp Asp Asn Ile Thr Ile Ser Ile Glu Asn Xaa Lys Ile Val Leu Ser 
                165                 170                 175     


Tyr Glu Glu Arg Glu Ile Met Lys Glu Asp Lys Ser Lys Thr Arg Ile 
            180                 185                 190         


Gln Ser Ile Tyr Val Leu Tyr Glu Tyr Tyr Gly Asp Thr Thr Pro Thr 
        195                 200                 205             


Pro Tyr Xaa 
    210     


<210>  246
<211>  184
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Aureococcus anophagefferens virus
       BtV-01


<220>
<221>  misc_feature
<222>  (36)..(36)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (47)..(47)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (52)..(52)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (85)..(86)
<223>  Xaa can be any naturally occurring amino acid

<400>  246

Ser Val Thr Gly Tyr Thr Pro Ile Thr Ile Lys Tyr Lys Glu Gln Ile 
1               5                   10                  15      


Phe Ile Glu Lys Ile Glu Asn Val Ala Lys Ile Phe Gly Glu Asp Lys 
            20                  25                  30          


Trp Gln Lys Xaa Ile Asp Pro Gly Lys Gln Glu Lys Glu Ala Xaa Glu 
        35                  40                  45              


Leu Asn Glu Xaa Phe Thr Trp Thr Ser Lys Gly Trp Thr Lys Leu His 
    50                  55                  60                  


Arg Val Ile Arg His Ile Leu Val Lys Glu Lys Lys Ile Ile Arg Val 
65                  70                  75                  80  


Leu Thr His Thr Xaa Xaa Val Asp Val Thr Asp Asp His Ser Leu Ile 
                85                  90                  95      


Leu Lys Ser Gly Glu Glu Ile Ser Pro Lys Asp Leu Lys Ile Gly Asp 
            100                 105                 110         


Glu Leu Leu Gln Arg Asp Ile Glu Phe Asp Glu Ile Ile Glu Tyr Ser 
        115                 120                 125             


Asn Asp Glu Tyr Val Lys Lys Gln Ile Lys Tyr Leu Lys Glu Asn Met 
    130                 135                 140                 


Lys Ser Glu Asn Glu Ser Ile Leu Ser Leu Asn Glu Ile Asp Tyr Lys 
145                 150                 155                 160 


Gly Tyr Val Tyr Asp Leu Thr Thr Asp Asn His Glu Phe Gln Ala Gly 
                165                 170                 175     


Ile Gly Asn Ile Ile Val His Asn 
            180                 


<210>  247
<211>  225
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Browns Bank environmental 
       sampling


<220>
<221>  misc_feature
<222>  (36)..(36)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (88)..(88)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (110)..(110)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (121)..(121)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (183)..(183)
<223>  Xaa can be any naturally occurring amino acid

<400>  247

Ser Val Ala Asn Tyr Thr Pro Ile Tyr Val Lys Ile Asp Gly Lys Phe 
1               5                   10                  15      


Glu Ile Ile Gln Ile Asp Glu Leu Gly Lys Lys Tyr Gly Asn Asp Asn 
            20                  25                  30          


Trp Lys Gln Xaa Val Glu Pro Gly Lys Gln Thr Lys Glu Tyr Ile Glu 
        35                  40                  45              


Leu Thr Asp Lys Asn Ile Tyr Thr Trp Thr Glu Asn Ser Trp Thr Gln 
    50                  55                  60                  


Leu Lys Thr Ile Ile Arg His Lys Leu Ala Ser Glu Lys Lys Met Met 
65                  70                  75                  80  


Arg Ile Leu Thr His Thr Gly Xaa Val Asp Val Thr Asp Asp His Ser 
                85                  90                  95      


Leu Ile Arg Asn Asp Gly Val Glu Ile Ser Pro Lys Asp Xaa Glu Ile 
            100                 105                 110         


Gly Thr Glu Leu Leu His His Pro Xaa Leu Ser Asp Asn Ile Arg Asn 
        115                 120                 125             


Asp Ala Ser Asn Ser Thr Phe Val Gln Asp Glu Phe Asp Leu Ser Ile 
    130                 135                 140                 


Pro Ser Asn His Ile Leu Met Ala Lys Tyr Ile His His Lys Tyr Asn 
145                 150                 155                 160 


Ala Thr Asp Thr Lys Tyr Lys Leu Val Ser His Asp Ser Asn Asp Ser 
                165                 170                 175     


Phe Met Leu Ser Ser Glu Xaa Ser Glu Thr Thr Gly His Lys Ile Lys 
            180                 185                 190         


Lys Ile Met Thr Leu Asp Asp Tyr Asp Asp Tyr Val Tyr Asp Leu Thr 
        195                 200                 205             


Thr Asp Asn His His Phe Ala Ala Gly Ile Gly Asn Met Ile Val His 
    210                 215                 220                 


Asn 
225 


<210>  248
<211>  163
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from Salicola phage SCTP-2


<220>
<221>  misc_feature
<222>  (119)..(119)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (125)..(125)
<223>  Xaa can be any naturally occurring amino acid

<400>  248

Ser Val Thr His Asp Thr Ile Ile Ser Thr Asn Asn Gly Glu Tyr Thr 
1               5                   10                  15      


Ile Glu Lys Leu Phe Val Asn Ser Ser Arg Phe Trp Glu Asp Ser Glu 
            20                  25                  30          


Thr Gly Lys Glu Tyr Ser Tyr Asp Pro Ser Ile Lys Val Lys Thr Tyr 
        35                  40                  45              


Asp Pro Gln Asn Gly Thr Ser Tyr Tyr Gly Glu Ile Asn Tyr Ile Tyr 
    50                  55                  60                  


Arg His Lys Thr Ser Lys Ser Lys Trp Leu Ile Arg Asp Ser Asn Asn 
65                  70                  75                  80  


Asn Glu Val Ile Val Thr Glu Asp His Ser Ile Met Ile Glu Asn Asp 
                85                  90                  95      


Asp Gly Met Ile Lys Ile Ser Pro Lys Asp Ile Val Lys Gly Glu Asp 
            100                 105                 110         


Val Leu Ile Thr Val Lys Xaa Gly Glu Val Leu Lys Xaa Ser Ile Glu 
        115                 120                 125             


Asp Ile Val Trp Leu Gly Tyr Phe Asp Asn Glu Tyr Val Tyr Asp Val 
    130                 135                 140                 


Gly Met Lys Asn Ala Glu His Asn Trp Phe Phe Gly Asn Asn Ile Leu 
145                 150                 155                 160 


Leu Lys Asn 
            


<210>  249
<211>  162
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (46)..(46)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (83)..(83)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (121)..(121)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (129)..(129)
<223>  Xaa can be any naturally occurring amino acid

<400>  249

Ser Val Val Gly Asp Ser Trp Ile Thr Leu Ser Asp Asp Ser Phe Met 
1               5                   10                  15      


Met Ile Ser Asp Leu Phe Asn Glu Gly Thr Asp Thr Glu Ile Asp Gln 
            20                  25                  30          


Phe Gly Lys Glu Arg Val Lys Ser Asn Arg Ser Ile Tyr Xaa Val Asp 
        35                  40                  45              


Leu Thr Thr Gly Lys Ile Asp Lys Lys Pro Ile Lys Tyr Val Met Arg 
    50                  55                  60                  


His Lys Val Asn Lys Arg Leu Tyr Lys Val Phe Asn Asp Tyr Thr Tyr 
65                  70                  75                  80  


Val Val Xaa Thr Glu Asp His Ser Leu Leu Lys Tyr Tyr Asp Thr Asn 
                85                  90                  95      


Phe Lys Glu Val Lys Pro Asp Asp Ile Asp Lys Asp Thr Phe Leu Phe 
            100                 105                 110         


Leu Lys Gly Glu Asn Asp Leu Leu Xaa Val Ser Gly Phe Ser Val Glu 
        115                 120                 125             


Xaa Val Ser Asn Arg Asn Val Ala Tyr Val Tyr Asp Ile Glu Val Asp 
    130                 135                 140                 


Ser Asp Thr Asp Asn Tyr His Asn Phe Phe Ala Asn Gly Ile Leu Val 
145                 150                 155                 160 


His Asn 
        


<210>  250
<211>  164
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified contiguous intein from sheep gut metagenome


<220>
<221>  misc_feature
<222>  (46)..(46)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (53)..(53)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (83)..(83)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  misc_feature
<222>  (129)..(129)
<223>  Xaa can be any naturally occurring amino acid

<400>  250

Ser Val Ile Gly Ser Ser Met Ile Ser Leu Ser Asp Asn Ser Leu Lys 
1               5                   10                  15      


Arg Ile Ser Thr Leu Phe Asn Glu Gly Asp Asn Ile Glu Thr Asp Val 
            20                  25                  30          


Phe Gly Lys Glu Arg Val Ser Ser Asp Lys Ser Val Tyr Xaa Leu Asn 
        35                  40                  45              


Thr Val Thr Gly Xaa Val Glu Ile Lys Lys Ile Lys Tyr Val Met Arg 
    50                  55                  60                  


His Lys Leu Thr Lys Arg Leu Tyr Lys Val Ser Asn Gly Asp Tyr Glu 
65                  70                  75                  80  


Ile Ile Xaa Thr Glu Asp His Ser Ile Met Lys Leu Asp Asp Lys Ser 
                85                  90                  95      


Asn Lys Ile Ile Glu Val Arg Pro Ile Glu Ile Thr Lys Asn Asp Asn 
            100                 105                 110         


Leu Ile Val Arg Tyr Thr Asp Arg Thr Ile Tyr Val Ser Gly Phe Glu 
        115                 120                 125             


Xaa Thr Pro Ile Lys Asp Lys Arg Ile Lys Tyr Val Tyr Asp Ile Glu 
    130                 135                 140                 


Val Asp Ser Asp Thr Asp Glu Tyr His Asn Phe Phe Ala Asn Gly Leu 
145                 150                 155                 160 


Leu Val His Asn 
                


<210>  251
<211>  524
<212>  PRT
<213>  Artificial sequence

<220>
<223>  MBP-AesN-H6 (1)

<400>  251

Met Lys Thr Glu Glu Gly Lys Leu Val Ile Trp Ile Asn Gly Asp Lys 
1               5                   10                  15      


Gly Tyr Asn Gly Leu Ala Glu Val Gly Lys Lys Phe Glu Lys Asp Thr 
            20                  25                  30          


Gly Ile Lys Val Thr Val Glu His Pro Asp Lys Leu Glu Glu Lys Phe 
        35                  40                  45              


Pro Gln Val Ala Ala Thr Gly Asp Gly Pro Asp Ile Ile Phe Trp Ala 
    50                  55                  60                  


His Asp Arg Phe Gly Gly Tyr Ala Gln Ser Gly Leu Leu Ala Glu Ile 
65                  70                  75                  80  


Thr Pro Asp Lys Ala Phe Gln Asp Lys Leu Tyr Pro Phe Thr Trp Asp 
                85                  90                  95      


Ala Val Arg Tyr Asn Gly Lys Leu Ile Ala Tyr Pro Ile Ala Val Glu 
            100                 105                 110         


Ala Leu Ser Leu Ile Tyr Asn Lys Asp Leu Leu Pro Asn Pro Pro Lys 
        115                 120                 125             


Thr Trp Glu Glu Ile Pro Ala Leu Asp Lys Glu Leu Lys Ala Lys Gly 
    130                 135                 140                 


Lys Ser Ala Leu Met Phe Asn Leu Gln Glu Pro Tyr Phe Thr Trp Pro 
145                 150                 155                 160 


Leu Ile Ala Ala Asp Gly Gly Tyr Ala Phe Lys Tyr Glu Asn Gly Lys 
                165                 170                 175     


Tyr Asp Ile Lys Asp Val Gly Val Asp Asn Ala Gly Ala Lys Ala Gly 
            180                 185                 190         


Leu Thr Phe Leu Val Asp Leu Ile Lys Asn Lys His Met Asn Ala Asp 
        195                 200                 205             


Thr Asp Tyr Ser Ile Ala Glu Ala Ala Phe Asn Lys Gly Glu Thr Ala 
    210                 215                 220                 


Met Thr Ile Asn Gly Pro Trp Ala Trp Ser Asn Ile Asp Thr Ser Lys 
225                 230                 235                 240 


Val Asn Tyr Gly Val Thr Val Leu Pro Thr Phe Lys Gly Gln Pro Ser 
                245                 250                 255     


Lys Pro Phe Val Gly Val Leu Ser Ala Gly Ile Asn Ala Ala Ser Pro 
            260                 265                 270         


Asn Lys Glu Leu Ala Lys Glu Phe Leu Glu Asn Tyr Leu Leu Thr Asp 
        275                 280                 285             


Glu Gly Leu Glu Ala Val Asn Lys Asp Lys Pro Leu Gly Ala Val Ala 
    290                 295                 300                 


Leu Lys Ser Tyr Glu Glu Glu Leu Ala Lys Asp Pro Arg Ile Ala Ala 
305                 310                 315                 320 


Thr Met Glu Asn Ala Gln Lys Gly Glu Ile Met Pro Asn Ile Pro Gln 
                325                 330                 335     


Met Ser Ala Phe Trp Tyr Ala Val Arg Thr Ala Val Ile Asn Ala Ala 
            340                 345                 350         


Ser Gly Arg Gln Thr Val Asp Glu Ala Leu Lys Asp Ala Gln Thr Asn 
        355                 360                 365             


Ser Ser Ser Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn Leu Gly Ile 
    370                 375                 380                 


Glu Gly Arg Ile Ser Glu Phe Tyr Ile Asp Thr Asp Ser Val Val Gly 
385                 390                 395                 400 


Asp Thr Ile Ile Asp Val Ser Gly Lys Lys Met Thr Ile Ala Glu Phe 
                405                 410                 415     


Tyr Asp Ser Thr Pro Asp Val Phe Met Arg Arg Asn Asp Glu Ala Arg 
            420                 425                 430         


Asp Trp Val Lys Arg Val Gly Gly Lys Thr Ser Leu Ser Val Asn Thr 
        435                 440                 445             


Tyr Ser Gly Glu Val Glu Arg Lys Asn Ile Asn Tyr Ile Met Lys His 
    450                 455                 460                 


Thr Val Lys Lys Arg Met Phe Lys Ile Lys Ala Gly Gly Lys Glu Val 
465                 470                 475                 480 


Ile Val Thr Ala Asp His Ser Val Met Val Lys Arg Asp Gly Lys Ile 
                485                 490                 495     


Ile Asp Val Lys Pro Thr Glu Met Lys Gln Thr Asp Arg Val Val Lys 
            500                 505                 510         


Trp Met Leu Thr Gly Ser His His His His His His 
        515                 520                 


<210>  252
<211>  132
<212>  PRT
<213>  Artificial sequence

<220>
<223>  SBP-AesC-SBP (2)

<400>  252

Met Asp Glu Lys Thr Thr Gly Trp Arg Gly Gly His Val Val Glu Gly 
1               5                   10                  15      


Leu Ala Gly Glu Leu Glu Gln Leu Arg Ala Arg Leu Glu His His Pro 
            20                  25                  30          


Gln Gly Gln Arg Glu Pro Gly Ala Ser Gly Gly Gly Gly Ser Ser Ser 
        35                  40                  45              


Met Ile Glu Phe Ile Glu Phe Glu Ile Glu Asp Leu Gly Val Met Glu 
    50                  55                  60                  


Ile Asp Val Tyr Asp Ile Glu Val Asp Gly Asn His Asn Phe Phe Gly 
65                  70                  75                  80  


Asn Asp Ile Leu Val His Asn Ser Val Tyr Leu Asn Gly Thr Met Asp 
                85                  90                  95      


Glu Lys Thr Thr Gly Trp Arg Gly Gly His Val Val Glu Gly Leu Ala 
            100                 105                 110         


Gly Glu Leu Glu Gln Leu Arg Ala Arg Leu Glu His His Pro Gln Gly 
        115                 120                 125             


Gln Arg Glu Pro 
    130         


<210>  253
<211>  430
<212>  PRT
<213>  Artificial sequence

<220>
<223>  MBP-CLN-H6 (3)

<400>  253

Met Lys Thr Glu Glu Gly Lys Leu Val Ile Trp Ile Asn Gly Asp Lys 
1               5                   10                  15      


Gly Tyr Asn Gly Leu Ala Glu Val Gly Lys Lys Phe Glu Lys Asp Thr 
            20                  25                  30          


Gly Ile Lys Val Thr Val Glu His Pro Asp Lys Leu Glu Glu Lys Phe 
        35                  40                  45              


Pro Gln Val Ala Ala Thr Gly Asp Gly Pro Asp Ile Ile Phe Trp Ala 
    50                  55                  60                  


His Asp Arg Phe Gly Gly Tyr Ala Gln Ser Gly Leu Leu Ala Glu Ile 
65                  70                  75                  80  


Thr Pro Asp Lys Ala Phe Gln Asp Lys Leu Tyr Pro Phe Thr Trp Asp 
                85                  90                  95      


Ala Val Arg Tyr Asn Gly Lys Leu Ile Ala Tyr Pro Ile Ala Val Glu 
            100                 105                 110         


Ala Leu Ser Leu Ile Tyr Asn Lys Asp Leu Leu Pro Asn Pro Pro Lys 
        115                 120                 125             


Thr Trp Glu Glu Ile Pro Ala Leu Asp Lys Glu Leu Lys Ala Lys Gly 
    130                 135                 140                 


Lys Ser Ala Leu Met Phe Asn Leu Gln Glu Pro Tyr Phe Thr Trp Pro 
145                 150                 155                 160 


Leu Ile Ala Ala Asp Gly Gly Tyr Ala Phe Lys Tyr Glu Asn Gly Lys 
                165                 170                 175     


Tyr Asp Ile Lys Asp Val Gly Val Asp Asn Ala Gly Ala Lys Ala Gly 
            180                 185                 190         


Leu Thr Phe Leu Val Asp Leu Ile Lys Asn Lys His Met Asn Ala Asp 
        195                 200                 205             


Thr Asp Tyr Ser Ile Ala Glu Ala Ala Phe Asn Lys Gly Glu Thr Ala 
    210                 215                 220                 


Met Thr Ile Asn Gly Pro Trp Ala Trp Ser Asn Ile Asp Thr Ser Lys 
225                 230                 235                 240 


Val Asn Tyr Gly Val Thr Val Leu Pro Thr Phe Lys Gly Gln Pro Ser 
                245                 250                 255     


Lys Pro Phe Val Gly Val Leu Ser Ala Gly Ile Asn Ala Ala Ser Pro 
            260                 265                 270         


Asn Lys Glu Leu Ala Lys Glu Phe Leu Glu Asn Tyr Leu Leu Thr Asp 
        275                 280                 285             


Glu Gly Leu Glu Ala Val Asn Lys Asp Lys Pro Leu Gly Ala Val Ala 
    290                 295                 300                 


Leu Lys Ser Tyr Glu Glu Glu Leu Ala Lys Asp Pro Arg Ile Ala Ala 
305                 310                 315                 320 


Thr Met Glu Asn Ala Gln Lys Gly Glu Ile Met Pro Asn Ile Pro Gln 
                325                 330                 335     


Met Ser Ala Phe Trp Tyr Ala Val Arg Thr Ala Val Ile Asn Ala Ala 
            340                 345                 350         


Ser Gly Arg Gln Thr Val Asp Glu Ala Leu Lys Asp Ala Gln Thr Asn 
        355                 360                 365             


Ser Ser Ser Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn Leu Gly Ile 
    370                 375                 380                 


Glu Gly Arg Ile Ser Glu Phe Tyr Ile Asp Thr Asp Ser Val Val Gly 
385                 390                 395                 400 


Asp Thr Ile Ile Asp Val Ser Gly Lys Lys Met Thr Ile Ala Glu Phe 
                405                 410                 415     


Tyr Asp Ser Thr Pro Asp Gly Ser His His His His His His 
            420                 425                 430 


<210>  254
<211>  305
<212>  PRT
<213>  Artificial sequence

<220>
<223>  SBP-CLC-Trx-H6 (4)

<400>  254

Met Asp Glu Lys Thr Thr Gly Trp Arg Gly Gly His Val Val Glu Gly 
1               5                   10                  15      


Leu Ala Gly Glu Leu Glu Gln Leu Arg Ala Arg Leu Glu His His Pro 
            20                  25                  30          


Gln Gly Gln Arg Glu Pro Gly Ala Ser Gly Gly Gly Gly Ser Ser Ser 
        35                  40                  45              


Glu Ala Arg Asp Trp Val Lys Arg Val Gly Gly Lys Thr Ser Leu Ser 
    50                  55                  60                  


Val Asn Thr Tyr Ser Gly Glu Val Glu Arg Lys Asn Ile Asn Tyr Ile 
65                  70                  75                  80  


Met Lys His Thr Val Lys Lys Arg Met Phe Lys Ile Lys Ala Gly Gly 
                85                  90                  95      


Lys Glu Val Ile Val Thr Ala Asp His Ser Val Met Val Lys Arg Asp 
            100                 105                 110         


Gly Lys Ile Ile Asp Val Lys Pro Thr Glu Met Lys Gln Thr Asp Arg 
        115                 120                 125             


Val Val Lys Trp Met Leu Thr Gly Ser His Met Ile Glu Phe Ile Glu 
    130                 135                 140                 


Phe Glu Ile Glu Asp Leu Gly Val Met Glu Ile Asp Val Tyr Asp Ile 
145                 150                 155                 160 


Glu Val Asp Gly Asn His Asn Phe Phe Gly Asn Asp Ile Leu Val His 
                165                 170                 175     


Asn Ser Val Tyr Leu Asn Gly Thr Gly Ser Asp Lys Ile Ile His Leu 
            180                 185                 190         


Thr Asp Asp Ser Phe Asp Thr Asp Val Leu Lys Ala Asp Gly Ala Ile 
        195                 200                 205             


Leu Val Asp Phe Trp Ala His Trp Cys Gly Pro Cys Lys Met Ile Ala 
    210                 215                 220                 


Pro Ile Leu Asp Glu Ile Ala Asp Glu Tyr Gln Gly Lys Leu Thr Val 
225                 230                 235                 240 


Ala Lys Leu Asn Ile Asp His Asn Pro Gly Thr Ala Pro Lys Tyr Gly 
                245                 250                 255     


Ile Arg Gly Ile Pro Thr Leu Leu Leu Phe Lys Asn Gly Glu Val Ala 
            260                 265                 270         


Ala Thr Lys Val Gly Ala Leu Ser Lys Gly Gln Leu Lys Glu Phe Leu 
        275                 280                 285             


Asp Ala Asn Leu Ala Gly Ser Glu Phe Arg Ser His His His His His 
    290                 295                 300                 


His 
305 


<210>  255
<211>  430
<212>  PRT
<213>  Artificial sequence

<220>
<223>  MBP-CLN(S1C)-H6 (3a)

<400>  255

Met Lys Thr Glu Glu Gly Lys Leu Val Ile Trp Ile Asn Gly Asp Lys 
1               5                   10                  15      


Gly Tyr Asn Gly Leu Ala Glu Val Gly Lys Lys Phe Glu Lys Asp Thr 
            20                  25                  30          


Gly Ile Lys Val Thr Val Glu His Pro Asp Lys Leu Glu Glu Lys Phe 
        35                  40                  45              


Pro Gln Val Ala Ala Thr Gly Asp Gly Pro Asp Ile Ile Phe Trp Ala 
    50                  55                  60                  


His Asp Arg Phe Gly Gly Tyr Ala Gln Ser Gly Leu Leu Ala Glu Ile 
65                  70                  75                  80  


Thr Pro Asp Lys Ala Phe Gln Asp Lys Leu Tyr Pro Phe Thr Trp Asp 
                85                  90                  95      


Ala Val Arg Tyr Asn Gly Lys Leu Ile Ala Tyr Pro Ile Ala Val Glu 
            100                 105                 110         


Ala Leu Ser Leu Ile Tyr Asn Lys Asp Leu Leu Pro Asn Pro Pro Lys 
        115                 120                 125             


Thr Trp Glu Glu Ile Pro Ala Leu Asp Lys Glu Leu Lys Ala Lys Gly 
    130                 135                 140                 


Lys Ser Ala Leu Met Phe Asn Leu Gln Glu Pro Tyr Phe Thr Trp Pro 
145                 150                 155                 160 


Leu Ile Ala Ala Asp Gly Gly Tyr Ala Phe Lys Tyr Glu Asn Gly Lys 
                165                 170                 175     


Tyr Asp Ile Lys Asp Val Gly Val Asp Asn Ala Gly Ala Lys Ala Gly 
            180                 185                 190         


Leu Thr Phe Leu Val Asp Leu Ile Lys Asn Lys His Met Asn Ala Asp 
        195                 200                 205             


Thr Asp Tyr Ser Ile Ala Glu Ala Ala Phe Asn Lys Gly Glu Thr Ala 
    210                 215                 220                 


Met Thr Ile Asn Gly Pro Trp Ala Trp Ser Asn Ile Asp Thr Ser Lys 
225                 230                 235                 240 


Val Asn Tyr Gly Val Thr Val Leu Pro Thr Phe Lys Gly Gln Pro Ser 
                245                 250                 255     


Lys Pro Phe Val Gly Val Leu Ser Ala Gly Ile Asn Ala Ala Ser Pro 
            260                 265                 270         


Asn Lys Glu Leu Ala Lys Glu Phe Leu Glu Asn Tyr Leu Leu Thr Asp 
        275                 280                 285             


Glu Gly Leu Glu Ala Val Asn Lys Asp Lys Pro Leu Gly Ala Val Ala 
    290                 295                 300                 


Leu Lys Ser Tyr Glu Glu Glu Leu Ala Lys Asp Pro Arg Ile Ala Ala 
305                 310                 315                 320 


Thr Met Glu Asn Ala Gln Lys Gly Glu Ile Met Pro Asn Ile Pro Gln 
                325                 330                 335     


Met Ser Ala Phe Trp Tyr Ala Val Arg Thr Ala Val Ile Asn Ala Ala 
            340                 345                 350         


Ser Gly Arg Gln Thr Val Asp Glu Ala Leu Lys Asp Ala Gln Thr Asn 
        355                 360                 365             


Ser Ser Ser Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn Leu Gly Ile 
    370                 375                 380                 


Glu Gly Arg Ile Ser Glu Phe Tyr Ile Asp Thr Asp Cys Val Val Gly 
385                 390                 395                 400 


Asp Thr Ile Ile Asp Val Ser Gly Lys Lys Met Thr Ile Ala Glu Phe 
                405                 410                 415     


Tyr Asp Ser Thr Pro Asp Gly Ser His His His His His His 
            420                 425                 430 


<210>  256
<211>  305
<212>  PRT
<213>  Artificial sequence

<220>
<223>  SBP-CLC(S+1C)-Trx-H6 (4a)

<400>  256

Met Asp Glu Lys Thr Thr Gly Trp Arg Gly Gly His Val Val Glu Gly 
1               5                   10                  15      


Leu Ala Gly Glu Leu Glu Gln Leu Arg Ala Arg Leu Glu His His Pro 
            20                  25                  30          


Gln Gly Gln Arg Glu Pro Gly Ala Ser Gly Gly Gly Gly Ser Ser Ser 
        35                  40                  45              


Glu Ala Arg Asp Trp Val Lys Arg Val Gly Gly Lys Thr Ser Leu Ser 
    50                  55                  60                  


Val Asn Thr Tyr Ser Gly Glu Val Glu Arg Lys Asn Ile Asn Tyr Ile 
65                  70                  75                  80  


Met Lys His Thr Val Lys Lys Arg Met Phe Lys Ile Lys Ala Gly Gly 
                85                  90                  95      


Lys Glu Val Ile Val Thr Ala Asp His Ser Val Met Val Lys Arg Asp 
            100                 105                 110         


Gly Lys Ile Ile Asp Val Lys Pro Thr Glu Met Lys Gln Thr Asp Arg 
        115                 120                 125             


Val Val Lys Trp Met Leu Thr Gly Ser His Met Ile Glu Phe Ile Glu 
    130                 135                 140                 


Phe Glu Ile Glu Asp Leu Gly Val Met Glu Ile Asp Val Tyr Asp Ile 
145                 150                 155                 160 


Glu Val Asp Gly Asn His Asn Phe Phe Gly Asn Asp Ile Leu Val His 
                165                 170                 175     


Asn Cys Val Tyr Leu Asn Gly Thr Gly Ser Asp Lys Ile Ile His Leu 
            180                 185                 190         


Thr Asp Asp Ser Phe Asp Thr Asp Val Leu Lys Ala Asp Gly Ala Ile 
        195                 200                 205             


Leu Val Asp Phe Trp Ala His Trp Cys Gly Pro Cys Lys Met Ile Ala 
    210                 215                 220                 


Pro Ile Leu Asp Glu Ile Ala Asp Glu Tyr Gln Gly Lys Leu Thr Val 
225                 230                 235                 240 


Ala Lys Leu Asn Ile Asp His Asn Pro Gly Thr Ala Pro Lys Tyr Gly 
                245                 250                 255     


Ile Arg Gly Ile Pro Thr Leu Leu Leu Phe Lys Asn Gly Glu Val Ala 
            260                 265                 270         


Ala Thr Lys Val Gly Ala Leu Ser Lys Gly Gln Leu Lys Glu Phe Leu 
        275                 280                 285             


Asp Ala Asn Leu Ala Gly Ser Glu Phe Arg Ser His His His His His 
    290                 295                 300                 


His 
305 


<210>  257
<211>  294
<212>  PRT
<213>  Artificial sequence

<220>
<223>  eGFP-CLN(S1A)-H6 (5)

<400>  257

Met Ala Ser Trp Ser His Pro Gln Phe Glu Lys Ala Ser Gly Thr Val 
1               5                   10                  15      


Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu 
            20                  25                  30          


Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly 
        35                  40                  45              


Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr 
    50                  55                  60                  


Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr 
65                  70                  75                  80  


Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln His 
                85                  90                  95      


Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr 
            100                 105                 110         


Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys 
        115                 120                 125             


Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp 
    130                 135                 140                 


Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr 
145                 150                 155                 160 


Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly Ile 
                165                 170                 175     


Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln 
            180                 185                 190         


Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val 
        195                 200                 205             


Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys 
    210                 215                 220                 


Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr 
225                 230                 235                 240 


Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys Gly Ser Tyr 
                245                 250                 255     


Ile Asp Thr Asp Ala Val Val Gly Asp Thr Ile Ile Asp Val Ser Gly 
            260                 265                 270         


Lys Lys Met Thr Ile Ala Glu Phe Tyr Asp Ser Thr Pro Asp Gly Ser 
        275                 280                 285             


His His His His His His 
    290                 


<210>  258
<211>  305
<212>  PRT
<213>  Artificial sequence

<220>
<223>  SBP-CLC(N159'A)-Trx-H6 (6)

<400>  258

Met Asp Glu Lys Thr Thr Gly Trp Arg Gly Gly His Val Val Glu Gly 
1               5                   10                  15      


Leu Ala Gly Glu Leu Glu Gln Leu Arg Ala Arg Leu Glu His His Pro 
            20                  25                  30          


Gln Gly Gln Arg Glu Pro Gly Ala Ser Gly Gly Gly Gly Ser Ser Ser 
        35                  40                  45              


Glu Ala Arg Asp Trp Val Lys Arg Val Gly Gly Lys Thr Ser Leu Ser 
    50                  55                  60                  


Val Asn Thr Tyr Ser Gly Glu Val Glu Arg Lys Asn Ile Asn Tyr Ile 
65                  70                  75                  80  


Met Lys His Thr Val Lys Lys Arg Met Phe Lys Ile Lys Ala Gly Gly 
                85                  90                  95      


Lys Glu Val Ile Val Thr Ala Asp His Ser Val Met Val Lys Arg Asp 
            100                 105                 110         


Gly Lys Ile Ile Asp Val Lys Pro Thr Glu Met Lys Gln Thr Asp Arg 
        115                 120                 125             


Val Val Lys Trp Met Leu Thr Gly Ser His Met Ile Glu Phe Ile Glu 
    130                 135                 140                 


Phe Glu Ile Glu Asp Leu Gly Val Met Glu Ile Asp Val Tyr Asp Ile 
145                 150                 155                 160 


Glu Val Asp Gly Asn His Asn Phe Phe Gly Asn Asp Ile Leu Val His 
                165                 170                 175     


Ala Ser Val Tyr Leu Asn Gly Thr Gly Ser Asp Lys Ile Ile His Leu 
            180                 185                 190         


Thr Asp Asp Ser Phe Asp Thr Asp Val Leu Lys Ala Asp Gly Ala Ile 
        195                 200                 205             


Leu Val Asp Phe Trp Ala His Trp Cys Gly Pro Cys Lys Met Ile Ala 
    210                 215                 220                 


Pro Ile Leu Asp Glu Ile Ala Asp Glu Tyr Gln Gly Lys Leu Thr Val 
225                 230                 235                 240 


Ala Lys Leu Asn Ile Asp His Asn Pro Gly Thr Ala Pro Lys Tyr Gly 
                245                 250                 255     


Ile Arg Gly Ile Pro Thr Leu Leu Leu Phe Lys Asn Gly Glu Val Ala 
            260                 265                 270         


Ala Thr Lys Val Gly Ala Leu Ser Lys Gly Gln Leu Lys Glu Phe Leu 
        275                 280                 285             


Asp Ala Asn Leu Ala Gly Ser Glu Phe Arg Ser His His His His His 
    290                 295                 300                 


His 
305 


<210>  259
<211>  87
<212>  PRT
<213>  Artificial sequence

<220>
<223>  CysTag-CLN-SBP (7)

<400>  259

Met Gly Cys Asp Thr Asp Ser Val Val Gly Asp Thr Ile Ile Asp Val 
1               5                   10                  15      


Ser Gly Lys Lys Met Thr Ile Ala Glu Phe Tyr Asp Ser Thr Pro Asp 
            20                  25                  30          


Ser Gly Gly Ser Pro Arg Lys Val Ile Lys Met Glu Ser Glu Glu Arg 
        35                  40                  45              


Ser Met Asp Glu Lys Thr Thr Gly Trp Arg Gly Gly His Val Val Glu 
    50                  55                  60                  


Gly Leu Ala Gly Glu Leu Glu Gln Leu Arg Ala Arg Leu Glu His His 
65                  70                  75                  80  


Pro Gln Gly Gln Arg Glu Pro 
                85          


<210>  260
<211>  310
<212>  PRT
<213>  Artificial sequence

<220>
<223>  SBP-CLC-VHHGFP-H6 (8)

<400>  260

Met Asp Glu Lys Thr Thr Gly Trp Arg Gly Gly His Val Val Glu Gly 
1               5                   10                  15      


Leu Ala Gly Glu Leu Glu Gln Leu Arg Ala Arg Leu Glu His His Pro 
            20                  25                  30          


Gln Gly Gln Arg Glu Pro Gly Ala Ser Gly Gly Gly Gly Ser Ser Ser 
        35                  40                  45              


Glu Ala Arg Asp Trp Val Lys Arg Val Gly Gly Lys Thr Ser Leu Ser 
    50                  55                  60                  


Val Asn Thr Tyr Ser Gly Glu Val Glu Arg Lys Asn Ile Asn Tyr Ile 
65                  70                  75                  80  


Met Lys His Thr Val Lys Lys Arg Met Phe Lys Ile Lys Ala Gly Gly 
                85                  90                  95      


Lys Glu Val Ile Val Thr Ala Asp His Ser Val Met Val Lys Arg Asp 
            100                 105                 110         


Gly Lys Ile Ile Asp Val Lys Pro Thr Glu Met Lys Gln Thr Asp Arg 
        115                 120                 125             


Val Val Lys Trp Met Leu Thr Gly Ser His Met Ile Glu Phe Ile Glu 
    130                 135                 140                 


Phe Glu Ile Glu Asp Leu Gly Val Met Glu Ile Asp Val Tyr Asp Ile 
145                 150                 155                 160 


Glu Val Asp Gly Asn His Asn Phe Phe Gly Asn Asp Ile Leu Val His 
                165                 170                 175     


Asn Ser Val Tyr Leu Asn Gly Met Ala Gln Val Gln Leu Val Glu Ser 
            180                 185                 190         


Gly Gly Ala Leu Val Gln Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala 
        195                 200                 205             


Ala Ser Gly Phe Pro Val Asn Arg Tyr Ser Met Arg Trp Tyr Arg Gln 
    210                 215                 220                 


Ala Pro Gly Lys Glu Arg Glu Trp Val Ala Gly Met Ser Ser Ala Gly 
225                 230                 235                 240 


Asp Arg Ser Ser Tyr Glu Asp Ser Val Lys Gly Arg Phe Thr Ile Ser 
                245                 250                 255     


Arg Asp Asp Ala Arg Asn Thr Val Tyr Leu Gln Met Asn Ser Leu Lys 
            260                 265                 270         


Pro Glu Asp Thr Ala Val Tyr Tyr Cys Asn Val Asn Val Gly Phe Glu 
        275                 280                 285             


Tyr Trp Gly Gln Gly Thr Gln Val Thr Val Ser Ser Pro Asp Arg Ser 
    290                 295                 300                 


His His His His His His 
305                 310 


<210>  261
<211>  19
<212>  PRT
<213>  Artificial sequence

<220>
<223>  First amino acid sequence from Fluorescein-CLN (9)

<400>  261

Tyr Ile Asp Thr Asp Ser Val Val Gly Asp Thr Ile Ile Asp Val Ser 
1               5                   10                  15      


Gly Lys Lys 
            


<210>  262
<211>  11
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Second amino acid sequence from Fluorescein-CLN (9)

<400>  262

Thr Ile Ala Glu Phe Tyr Asp Ser Thr Pro Asp 
1               5                   10      


<210>  263
<211>  197
<212>  PRT
<213>  Artificial sequence

<220>
<223>  SBP-CLC-CysTag-H6 (11)

<400>  263

Met Asp Glu Lys Thr Thr Gly Trp Arg Gly Gly His Val Val Glu Gly 
1               5                   10                  15      


Leu Ala Gly Glu Leu Glu Gln Leu Arg Ala Arg Leu Glu His His Pro 
            20                  25                  30          


Gln Gly Gln Arg Glu Pro Gly Ala Ser Gly Gly Gly Gly Ser Ser Ser 
        35                  40                  45              


Glu Ala Arg Asp Trp Val Lys Arg Val Gly Gly Lys Thr Ser Leu Ser 
    50                  55                  60                  


Val Asn Thr Tyr Ser Gly Glu Val Glu Arg Lys Asn Ile Asn Tyr Ile 
65                  70                  75                  80  


Met Lys His Thr Val Lys Lys Arg Met Phe Lys Ile Lys Ala Gly Gly 
                85                  90                  95      


Lys Glu Val Ile Val Thr Ala Asp His Ser Val Met Val Lys Arg Asp 
            100                 105                 110         


Gly Lys Ile Ile Asp Val Lys Pro Thr Glu Met Lys Gln Thr Asp Arg 
        115                 120                 125             


Val Val Lys Trp Met Leu Thr Gly Ser His Met Ile Glu Phe Ile Glu 
    130                 135                 140                 


Phe Glu Ile Glu Asp Leu Gly Val Met Glu Ile Asp Val Tyr Asp Ile 
145                 150                 155                 160 


Glu Val Asp Gly Asn His Asn Phe Phe Gly Asn Asp Ile Leu Val His 
                165                 170                 175     


Asn Ser Val Tyr Ala Ser Pro Ala Ala Pro Ala Pro Ala Ser Cys His 
            180                 185                 190         


His His His His His 
        195         


<210>  264
<211>  310
<212>  PRT
<213>  Artificial sequence

<220>
<223>  VHHEGFR-CLN-SBP (10)

<400>  264

Met Asp Glu Lys Thr Thr Gly Trp Arg Gly Gly His Val Val Glu Gly 
1               5                   10                  15      


Leu Ala Gly Glu Leu Glu Gln Leu Arg Ala Arg Leu Glu His His Pro 
            20                  25                  30          


Gln Gly Gln Arg Glu Pro Gly Ala Ser Gly Gly Gly Gly Ser Ser Ser 
        35                  40                  45              


Glu Ala Arg Asp Trp Val Lys Arg Val Gly Gly Lys Thr Ser Leu Ser 
    50                  55                  60                  


Val Asn Thr Tyr Ser Gly Glu Val Glu Arg Lys Asn Ile Asn Tyr Ile 
65                  70                  75                  80  


Met Lys His Thr Val Lys Lys Arg Met Phe Lys Ile Lys Ala Gly Gly 
                85                  90                  95      


Lys Glu Val Ile Val Thr Ala Asp His Ser Val Met Val Lys Arg Asp 
            100                 105                 110         


Gly Lys Ile Ile Asp Val Lys Pro Thr Glu Met Lys Gln Thr Asp Arg 
        115                 120                 125             


Val Val Lys Trp Met Leu Thr Gly Ser His Met Ile Glu Phe Ile Glu 
    130                 135                 140                 


Phe Glu Ile Glu Asp Leu Gly Val Met Glu Ile Asp Val Tyr Asp Ile 
145                 150                 155                 160 


Glu Val Asp Gly Asn His Asn Phe Phe Gly Asn Asp Ile Leu Val His 
                165                 170                 175     


Asn Ser Val Tyr Leu Asn Gly Met Ala Gln Val Gln Leu Val Glu Ser 
            180                 185                 190         


Gly Gly Ala Leu Val Gln Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala 
        195                 200                 205             


Ala Ser Gly Phe Pro Val Asn Arg Tyr Ser Met Arg Trp Tyr Arg Gln 
    210                 215                 220                 


Ala Pro Gly Lys Glu Arg Glu Trp Val Ala Gly Met Ser Ser Ala Gly 
225                 230                 235                 240 


Asp Arg Ser Ser Tyr Glu Asp Ser Val Lys Gly Arg Phe Thr Ile Ser 
                245                 250                 255     


Arg Asp Asp Ala Arg Asn Thr Val Tyr Leu Gln Met Asn Ser Leu Lys 
            260                 265                 270         


Pro Glu Asp Thr Ala Val Tyr Tyr Cys Asn Val Asn Val Gly Phe Glu 
        275                 280                 285             


Tyr Trp Gly Gln Gly Thr Gln Val Thr Val Ser Ser Pro Asp Arg Ser 
    290                 295                 300                 


His His His His His His 
305                 310 


<210>  265
<211>  603
<212>  PRT
<213>  Artificial sequence

<220>
<223>  HA-CLC-Trx-TMD-mCherry (13)

<400>  265

Met Glu Thr Asp Thr Leu Leu Leu Trp Val Leu Leu Leu Trp Val Pro 
1               5                   10                  15      


Gly Ser Thr Gly Asp Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Gly Ala 
            20                  25                  30          


Gln Ser Gly Gly Gly Gly Ser Ser Ser Glu Ala Arg Asp Trp Val Lys 
        35                  40                  45              


Arg Val Gly Gly Lys Thr Ser Leu Ser Val Asn Thr Tyr Ser Gly Glu 
    50                  55                  60                  


Val Glu Arg Lys Asn Ile Asn Tyr Ile Met Lys His Thr Val Lys Lys 
65                  70                  75                  80  


Arg Met Phe Lys Ile Lys Ala Gly Gly Lys Glu Val Ile Val Thr Ala 
                85                  90                  95      


Asp His Ser Val Met Val Lys Arg Asp Gly Lys Ile Ile Asp Val Lys 
            100                 105                 110         


Pro Thr Glu Met Lys Gln Thr Asp Arg Val Val Lys Trp Met Leu Thr 
        115                 120                 125             


Gly Ser His Met Ile Glu Phe Ile Glu Phe Glu Ile Glu Asp Leu Gly 
    130                 135                 140                 


Val Met Glu Ile Asp Val Tyr Asp Ile Glu Val Asp Gly Asn His Asn 
145                 150                 155                 160 


Phe Phe Gly Asn Asp Ile Leu Val His Asn Ser Val Tyr Leu Asn Ser 
                165                 170                 175     


Gly Gly Ser Gly Thr Gly Ser Asp Lys Ile Ile His Leu Thr Asp Asp 
            180                 185                 190         


Ser Phe Asp Thr Asp Val Leu Lys Ala Asp Gly Ala Ile Leu Val Asp 
        195                 200                 205             


Phe Trp Ala His Trp Cys Gly Pro Cys Lys Met Ile Ala Pro Ile Leu 
    210                 215                 220                 


Asp Glu Ile Ala Asp Glu Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu 
225                 230                 235                 240 


Asn Ile Asp His Asn Pro Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly 
                245                 250                 255     


Ile Pro Thr Leu Leu Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys 
            260                 265                 270         


Val Gly Ala Leu Ser Lys Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn 
        275                 280                 285             


Leu Ala Gly Ser Glu Phe Arg Ser His His His His His His Tyr Val 
    290                 295                 300                 


Asp Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Asn Ala Val Gly Gln 
305                 310                 315                 320 


Asp Thr Gln Glu Val Ile Val Val Pro His Ser Leu Pro Phe Lys Val 
                325                 330                 335     


Val Val Ile Ser Ala Ile Leu Ala Leu Val Val Leu Thr Ile Ile Ser 
            340                 345                 350         


Leu Ile Ile Leu Ile Met Leu Trp Gln Lys Lys Pro Arg Ala Ser Met 
        355                 360                 365             


Val Ser Lys Gly Glu Glu Asp Asn Met Ala Ile Ile Lys Glu Phe Met 
    370                 375                 380                 


Arg Phe Lys Val His Met Glu Gly Ser Val Asn Gly His Glu Phe Glu 
385                 390                 395                 400 


Ile Glu Gly Glu Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr Ala 
                405                 410                 415     


Lys Leu Lys Val Thr Lys Gly Gly Pro Leu Pro Phe Ala Trp Asp Ile 
            420                 425                 430         


Leu Ser Pro Gln Phe Met Tyr Gly Ser Lys Ala Tyr Val Lys His Pro 
        435                 440                 445             


Ala Asp Ile Pro Asp Tyr Leu Lys Leu Ser Phe Pro Glu Gly Phe Lys 
    450                 455                 460                 


Trp Glu Arg Val Met Asn Phe Glu Asp Gly Gly Val Val Thr Val Thr 
465                 470                 475                 480 


Gln Asp Ser Ser Leu Gln Asp Gly Glu Phe Ile Tyr Lys Val Lys Leu 
                485                 490                 495     


Arg Gly Thr Asn Phe Pro Ser Asp Gly Pro Val Met Gln Lys Lys Thr 
            500                 505                 510         


Met Gly Trp Glu Ala Ser Ser Glu Arg Met Tyr Pro Glu Asp Gly Ala 
        515                 520                 525             


Leu Lys Gly Glu Ile Lys Gln Arg Leu Lys Leu Lys Asp Gly Gly His 
    530                 535                 540                 


Tyr Asp Ala Glu Val Lys Thr Thr Tyr Lys Ala Lys Lys Pro Val Gln 
545                 550                 555                 560 


Leu Pro Gly Ala Tyr Asn Val Asn Ile Lys Leu Asp Ile Thr Ser His 
                565                 570                 575     


Asn Glu Asp Tyr Thr Ile Val Glu Gln Tyr Glu Arg Ala Glu Gly Arg 
            580                 585                 590         


His Ser Thr Gly Gly Met Asp Glu Leu Tyr Lys 
        595                 600             


<210>  266
<211>  699
<212>  PRT
<213>  Artificial sequence

<220>
<223>  MBP-Fc-CLN-SBP (12)

<400>  266

Met Lys Thr Glu Glu Gly Lys Leu Val Ile Trp Ile Asn Gly Asp Lys 
1               5                   10                  15      


Gly Tyr Asn Gly Leu Ala Glu Val Gly Lys Lys Phe Glu Lys Asp Thr 
            20                  25                  30          


Gly Ile Lys Val Thr Val Glu His Pro Asp Lys Leu Glu Glu Lys Phe 
        35                  40                  45              


Pro Gln Val Ala Ala Thr Gly Asp Gly Pro Asp Ile Ile Phe Trp Ala 
    50                  55                  60                  


His Asp Arg Phe Gly Gly Tyr Ala Gln Ser Gly Leu Leu Ala Glu Ile 
65                  70                  75                  80  


Thr Pro Asp Lys Ala Phe Gln Asp Lys Leu Tyr Pro Phe Thr Trp Asp 
                85                  90                  95      


Ala Val Arg Tyr Asn Gly Lys Leu Ile Ala Tyr Pro Ile Ala Val Glu 
            100                 105                 110         


Ala Leu Ser Leu Ile Tyr Asn Lys Asp Leu Leu Pro Asn Pro Pro Lys 
        115                 120                 125             


Thr Trp Glu Glu Ile Pro Ala Leu Asp Lys Glu Leu Lys Ala Lys Gly 
    130                 135                 140                 


Lys Ser Ala Leu Met Phe Asn Leu Gln Glu Pro Tyr Phe Thr Trp Pro 
145                 150                 155                 160 


Leu Ile Ala Ala Asp Gly Gly Tyr Ala Phe Lys Tyr Glu Asn Gly Lys 
                165                 170                 175     


Tyr Asp Ile Lys Asp Val Gly Val Asp Asn Ala Gly Ala Lys Ala Gly 
            180                 185                 190         


Leu Thr Phe Leu Val Asp Leu Ile Lys Asn Lys His Met Asn Ala Asp 
        195                 200                 205             


Thr Asp Tyr Ser Ile Ala Glu Ala Ala Phe Asn Lys Gly Glu Thr Ala 
    210                 215                 220                 


Met Thr Ile Asn Gly Pro Trp Ala Trp Ser Asn Ile Asp Thr Ser Lys 
225                 230                 235                 240 


Val Asn Tyr Gly Val Thr Val Leu Pro Thr Phe Lys Gly Gln Pro Ser 
                245                 250                 255     


Lys Pro Phe Val Gly Val Leu Ser Ala Gly Ile Asn Ala Ala Ser Pro 
            260                 265                 270         


Asn Lys Glu Leu Ala Lys Glu Phe Leu Glu Asn Tyr Leu Leu Thr Asp 
        275                 280                 285             


Glu Gly Leu Glu Ala Val Asn Lys Asp Lys Pro Leu Gly Ala Val Ala 
    290                 295                 300                 


Leu Lys Ser Tyr Glu Glu Glu Leu Ala Lys Asp Pro Arg Ile Ala Ala 
305                 310                 315                 320 


Thr Met Glu Asn Ala Gln Lys Gly Glu Ile Met Pro Asn Ile Pro Gln 
                325                 330                 335     


Met Ser Ala Phe Trp Tyr Ala Val Arg Thr Ala Val Ile Asn Ala Ala 
            340                 345                 350         


Ser Gly Arg Gln Thr Val Asp Glu Ala Leu Lys Asp Ala Gln Thr Asn 
        355                 360                 365             


Ser Ser Ser Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn Leu Gly Ile 
    370                 375                 380                 


Glu Gly Arg Ile Ser Glu Phe Asp Lys Thr His Thr Cys Pro Pro Cys 
385                 390                 395                 400 


Pro Ala Pro Glu Ala Glu Gly Ala Pro Ser Val Phe Leu Phe Pro Pro 
                405                 410                 415     


Lys Pro Lys Asp Thr Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys 
            420                 425                 430         


Val Val Val Asp Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp 
        435                 440                 445             


Tyr Val Asp Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu 
    450                 455                 460                 


Glu Gln Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu 
465                 470                 475                 480 


His Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn 
                485                 490                 495     


Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys Gly 
            500                 505                 510         


Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg Glu Glu 
        515                 520                 525             


Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys Gly Phe Tyr 
    530                 535                 540                 


Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly Gln Pro Glu Asn 
545                 550                 555                 560 


Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser Asp Gly Ser Phe Phe 
                565                 570                 575     


Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser Arg Trp Gln Gln Gly Asn 
            580                 585                 590         


Val Phe Ser Cys Ser Val Met His Glu Ala Leu His Asn His Tyr Thr 
        595                 600                 605             


Gln Lys Ser Leu Ser Leu Ser Pro Gly Lys Gly Ser Glu Phe Tyr Ile 
    610                 615                 620                 


Asp Thr Asp Ser Val Val Gly Asp Thr Ile Ile Asp Val Ser Gly Lys 
625                 630                 635                 640 


Lys Met Thr Ile Ala Glu Phe Tyr Asp Ser Thr Pro Asp Gly Ser Gly 
                645                 650                 655     


Ser Gly Ser Ser Arg Met Asp Glu Lys Thr Thr Gly Trp Arg Gly Gly 
            660                 665                 670         


His Val Val Glu Gly Leu Ala Gly Glu Leu Glu Gln Leu Arg Ala Arg 
        675                 680                 685             


Leu Glu His His Pro Gln Gly Gln Arg Glu Pro 
    690                 695                 


<210>  267
<211>  727
<212>  PRT
<213>  Artificial sequence

<220>
<223>  HA-VHHGFP-CLC-Trx-TMD-mCherry (15)

<400>  267

Met Glu Thr Asp Thr Leu Leu Leu Trp Val Leu Leu Leu Trp Val Pro 
1               5                   10                  15      


Gly Ser Thr Gly Asp Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Gly Ala 
            20                  25                  30          


His Ser Arg Met Ala Gln Val Gln Leu Val Glu Ser Gly Gly Ala Leu 
        35                  40                  45              


Val Gln Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe 
    50                  55                  60                  


Pro Val Asn Arg Tyr Ser Met Arg Trp Tyr Arg Gln Ala Pro Gly Lys 
65                  70                  75                  80  


Glu Arg Glu Trp Val Ala Gly Met Ser Ser Ala Gly Asp Arg Ser Ser 
                85                  90                  95      


Tyr Glu Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asp Ala 
            100                 105                 110         


Arg Asn Thr Val Tyr Leu Gln Met Asn Ser Leu Lys Pro Glu Asp Thr 
        115                 120                 125             


Ala Val Tyr Tyr Cys Asn Val Asn Val Gly Phe Glu Tyr Trp Gly Gln 
    130                 135                 140                 


Gly Thr Gln Val Thr Val Ser Ser Pro Asp Trp Ala Gln Ser Gly Gly 
145                 150                 155                 160 


Gly Gly Ser Ser Ser Glu Ala Arg Asp Trp Val Lys Arg Val Gly Gly 
                165                 170                 175     


Lys Thr Ser Leu Ser Val Asn Thr Tyr Ser Gly Glu Val Glu Arg Lys 
            180                 185                 190         


Asn Ile Asn Tyr Ile Met Lys His Thr Val Lys Lys Arg Met Phe Lys 
        195                 200                 205             


Ile Lys Ala Gly Gly Lys Glu Val Ile Val Thr Ala Asp His Ser Val 
    210                 215                 220                 


Met Val Lys Arg Asp Gly Lys Ile Ile Asp Val Lys Pro Thr Glu Met 
225                 230                 235                 240 


Lys Gln Thr Asp Arg Val Val Lys Trp Met Leu Thr Gly Ser His Met 
                245                 250                 255     


Ile Glu Phe Ile Glu Phe Glu Ile Glu Asp Leu Gly Val Met Glu Ile 
            260                 265                 270         


Asp Val Tyr Asp Ile Glu Val Asp Gly Asn His Asn Phe Phe Gly Asn 
        275                 280                 285             


Asp Ile Leu Val His Asn Ser Val Tyr Leu Asn Ser Gly Gly Ser Gly 
    290                 295                 300                 


Thr Gly Ser Asp Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr 
305                 310                 315                 320 


Asp Val Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala His 
                325                 330                 335     


Trp Cys Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala 
            340                 345                 350         


Asp Glu Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp His 
        355                 360                 365             


Asn Pro Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu 
    370                 375                 380                 


Leu Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu 
385                 390                 395                 400 


Ser Lys Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser 
                405                 410                 415     


Glu Phe Arg Ser His His His His His His Tyr Val Asp Glu Gln Lys 
            420                 425                 430         


Leu Ile Ser Glu Glu Asp Leu Asn Ala Val Gly Gln Asp Thr Gln Glu 
        435                 440                 445             


Val Ile Val Val Pro His Ser Leu Pro Phe Lys Val Val Val Ile Ser 
    450                 455                 460                 


Ala Ile Leu Ala Leu Val Val Leu Thr Ile Ile Ser Leu Ile Ile Leu 
465                 470                 475                 480 


Ile Met Leu Trp Gln Lys Lys Pro Arg Ala Ser Met Val Ser Lys Gly 
                485                 490                 495     


Glu Glu Asp Asn Met Ala Ile Ile Lys Glu Phe Met Arg Phe Lys Val 
            500                 505                 510         


His Met Glu Gly Ser Val Asn Gly His Glu Phe Glu Ile Glu Gly Glu 
        515                 520                 525             


Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr Ala Lys Leu Lys Val 
    530                 535                 540                 


Thr Lys Gly Gly Pro Leu Pro Phe Ala Trp Asp Ile Leu Ser Pro Gln 
545                 550                 555                 560 


Phe Met Tyr Gly Ser Lys Ala Tyr Val Lys His Pro Ala Asp Ile Pro 
                565                 570                 575     


Asp Tyr Leu Lys Leu Ser Phe Pro Glu Gly Phe Lys Trp Glu Arg Val 
            580                 585                 590         


Met Asn Phe Glu Asp Gly Gly Val Val Thr Val Thr Gln Asp Ser Ser 
        595                 600                 605             


Leu Gln Asp Gly Glu Phe Ile Tyr Lys Val Lys Leu Arg Gly Thr Asn 
    610                 615                 620                 


Phe Pro Ser Asp Gly Pro Val Met Gln Lys Lys Thr Met Gly Trp Glu 
625                 630                 635                 640 


Ala Ser Ser Glu Arg Met Tyr Pro Glu Asp Gly Ala Leu Lys Gly Glu 
                645                 650                 655     


Ile Lys Gln Arg Leu Lys Leu Lys Asp Gly Gly His Tyr Asp Ala Glu 
            660                 665                 670         


Val Lys Thr Thr Tyr Lys Ala Lys Lys Pro Val Gln Leu Pro Gly Ala 
        675                 680                 685             


Tyr Asn Val Asn Ile Lys Leu Asp Ile Thr Ser His Asn Glu Asp Tyr 
    690                 695                 700                 


Thr Ile Val Glu Gln Tyr Glu Arg Ala Glu Gly Arg His Ser Thr Gly 
705                 710                 715                 720 


Gly Met Asp Glu Leu Tyr Lys 
                725         


<210>  268
<211>  294
<212>  PRT
<213>  Artificial sequence

<220>
<223>  eGFP-CLN-H6 (14)

<400>  268

Met Ala Ser Trp Ser His Pro Gln Phe Glu Lys Ala Ser Gly Thr Val 
1               5                   10                  15      


Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile Leu Val Glu 
            20                  25                  30          


Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly Glu Gly 
        35                  40                  45              


Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe Ile Cys Thr 
    50                  55                  60                  


Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu Thr 
65                  70                  75                  80  


Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gln His 
                85                  90                  95      


Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln Glu Arg Thr 
            100                 105                 110         


Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val Lys 
        115                 120                 125             


Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys Gly Ile Asp 
    130                 135                 140                 


Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu Tyr Asn Tyr 
145                 150                 155                 160 


Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys Asn Gly Ile 
                165                 170                 175     


Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly Ser Val Gln 
            180                 185                 190         


Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp Gly Pro Val 
        195                 200                 205             


Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala Leu Ser Lys 
    210                 215                 220                 


Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe Val Thr 
225                 230                 235                 240 


Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys Gly Ser Tyr 
                245                 250                 255     


Ile Asp Thr Asp Ser Val Val Gly Asp Thr Ile Ile Asp Val Ser Gly 
            260                 265                 270         


Lys Lys Met Thr Ile Ala Glu Phe Tyr Asp Ser Thr Pro Asp Gly Ser 
        275                 280                 285             


His His His His His His 
    290                 


<210>  269
<211>  288
<212>  PRT
<213>  Artificial sequence

<220>
<223>  CysTag-H8-CLN-eGFP(C49S) (16)

<400>  269

Met Gly Cys His His His His His His His His Tyr Ile Asp Thr Asp 
1               5                   10                  15      


Ser Val Val Gly Asp Thr Ile Ile Asp Val Ser Gly Lys Lys Met Thr 
            20                  25                  30          


Ile Ala Glu Phe Tyr Asp Ser Thr Pro Asp Ser Gly Gly Ser Gly Gly 
        35                  40                  45              


Ser Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro Ile 
    50                  55                  60                  


Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser 
65                  70                  75                  80  


Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe 
                85                  90                  95      


Ile Ser Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr 
            100                 105                 110         


Thr Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His Met 
        115                 120                 125             


Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gln 
    130                 135                 140                 


Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala 
145                 150                 155                 160 


Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu Lys 
                165                 170                 175     


Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu Glu 
            180                 185                 190         


Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln Lys 
        195                 200                 205             


Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp Gly 
    210                 215                 220                 


Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly Asp 
225                 230                 235                 240 


Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser Ala 
                245                 250                 255     


Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu 
            260                 265                 270         


Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr Lys 
        275                 280                 285             


<210>  270
<211>  727
<212>  PRT
<213>  Artificial sequence

<220>
<223>  HA-VHHGFP-CLC(N159'A)-Trx-TMD-mCherry (15a)

<400>  270

Met Glu Thr Asp Thr Leu Leu Leu Trp Val Leu Leu Leu Trp Val Pro 
1               5                   10                  15      


Gly Ser Thr Gly Asp Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Gly Ala 
            20                  25                  30          


His Ser Arg Met Ala Gln Val Gln Leu Val Glu Ser Gly Gly Ala Leu 
        35                  40                  45              


Val Gln Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe 
    50                  55                  60                  


Pro Val Asn Arg Tyr Ser Met Arg Trp Tyr Arg Gln Ala Pro Gly Lys 
65                  70                  75                  80  


Glu Arg Glu Trp Val Ala Gly Met Ser Ser Ala Gly Asp Arg Ser Ser 
                85                  90                  95      


Tyr Glu Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asp Ala 
            100                 105                 110         


Arg Asn Thr Val Tyr Leu Gln Met Asn Ser Leu Lys Pro Glu Asp Thr 
        115                 120                 125             


Ala Val Tyr Tyr Cys Asn Val Asn Val Gly Phe Glu Tyr Trp Gly Gln 
    130                 135                 140                 


Gly Thr Gln Val Thr Val Ser Ser Pro Asp Trp Ala Gln Ser Gly Gly 
145                 150                 155                 160 


Gly Gly Ser Ser Ser Glu Ala Arg Asp Trp Val Lys Arg Val Gly Gly 
                165                 170                 175     


Lys Thr Ser Leu Ser Val Asn Thr Tyr Ser Gly Glu Val Glu Arg Lys 
            180                 185                 190         


Asn Ile Asn Tyr Ile Met Lys His Thr Val Lys Lys Arg Met Phe Lys 
        195                 200                 205             


Ile Lys Ala Gly Gly Lys Glu Val Ile Val Thr Ala Asp His Ser Val 
    210                 215                 220                 


Met Val Lys Arg Asp Gly Lys Ile Ile Asp Val Lys Pro Thr Glu Met 
225                 230                 235                 240 


Lys Gln Thr Asp Arg Val Val Lys Trp Met Leu Thr Gly Ser His Met 
                245                 250                 255     


Ile Glu Phe Ile Glu Phe Glu Ile Glu Asp Leu Gly Val Met Glu Ile 
            260                 265                 270         


Asp Val Tyr Asp Ile Glu Val Asp Gly Asn His Asn Phe Phe Gly Asn 
        275                 280                 285             


Asp Ile Leu Val His Asn Ser Val Tyr Leu Asn Ser Gly Gly Ser Gly 
    290                 295                 300                 


Thr Gly Ser Asp Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr 
305                 310                 315                 320 


Asp Val Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala His 
                325                 330                 335     


Trp Cys Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala 
            340                 345                 350         


Asp Glu Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp His 
        355                 360                 365             


Asn Pro Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu 
    370                 375                 380                 


Leu Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu 
385                 390                 395                 400 


Ser Lys Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser 
                405                 410                 415     


Glu Phe Arg Ser His His His His His His Tyr Val Asp Glu Gln Lys 
            420                 425                 430         


Leu Ile Ser Glu Glu Asp Leu Asn Ala Val Gly Gln Asp Thr Gln Glu 
        435                 440                 445             


Val Ile Val Val Pro His Ser Leu Pro Phe Lys Val Val Val Ile Ser 
    450                 455                 460                 


Ala Ile Leu Ala Leu Val Val Leu Thr Ile Ile Ser Leu Ile Ile Leu 
465                 470                 475                 480 


Ile Met Leu Trp Gln Lys Lys Pro Arg Ala Ser Met Val Ser Lys Gly 
                485                 490                 495     


Glu Glu Asp Asn Met Ala Ile Ile Lys Glu Phe Met Arg Phe Lys Val 
            500                 505                 510         


His Met Glu Gly Ser Val Asn Gly His Glu Phe Glu Ile Glu Gly Glu 
        515                 520                 525             


Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr Ala Lys Leu Lys Val 
    530                 535                 540                 


Thr Lys Gly Gly Pro Leu Pro Phe Ala Trp Asp Ile Leu Ser Pro Gln 
545                 550                 555                 560 


Phe Met Tyr Gly Ser Lys Ala Tyr Val Lys His Pro Ala Asp Ile Pro 
                565                 570                 575     


Asp Tyr Leu Lys Leu Ser Phe Pro Glu Gly Phe Lys Trp Glu Arg Val 
            580                 585                 590         


Met Asn Phe Glu Asp Gly Gly Val Val Thr Val Thr Gln Asp Ser Ser 
        595                 600                 605             


Leu Gln Asp Gly Glu Phe Ile Tyr Lys Val Lys Leu Arg Gly Thr Asn 
    610                 615                 620                 


Phe Pro Ser Asp Gly Pro Val Met Gln Lys Lys Thr Met Gly Trp Glu 
625                 630                 635                 640 


Ala Ser Ser Glu Arg Met Tyr Pro Glu Asp Gly Ala Leu Lys Gly Glu 
                645                 650                 655     


Ile Lys Gln Arg Leu Lys Leu Lys Asp Gly Gly His Tyr Asp Ala Glu 
            660                 665                 670         


Val Lys Thr Thr Tyr Lys Ala Lys Lys Pro Val Gln Leu Pro Gly Ala 
        675                 680                 685             


Tyr Asn Val Asn Ile Lys Leu Asp Ile Thr Ser His Asn Glu Asp Tyr 
    690                 695                 700                 


Thr Ile Val Glu Gln Tyr Glu Arg Ala Glu Gly Arg His Ser Thr Gly 
705                 710                 715                 720 


Gly Met Asp Glu Leu Tyr Lys 
                725         


<210>  271
<211>  1076
<212>  PRT
<213>  Artificial sequence

<220>
<223>  HA-VHHGFP-CLC-IFNAR1-mCherry (17)

<400>  271

Met Glu Thr Asp Thr Leu Leu Leu Trp Val Leu Leu Leu Trp Val Pro 
1               5                   10                  15      


Gly Ser Thr Gly Asp Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Gly Ala 
            20                  25                  30          


His Ser Arg Met Ala Gln Val Gln Leu Val Glu Ser Gly Gly Ala Leu 
        35                  40                  45              


Val Gln Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe 
    50                  55                  60                  


Pro Val Asn Arg Tyr Ser Met Arg Trp Tyr Arg Gln Ala Pro Gly Lys 
65                  70                  75                  80  


Glu Arg Glu Trp Val Ala Gly Met Ser Ser Ala Gly Asp Arg Ser Ser 
                85                  90                  95      


Tyr Glu Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asp Ala 
            100                 105                 110         


Arg Asn Thr Val Tyr Leu Gln Met Asn Ser Leu Lys Pro Glu Asp Thr 
        115                 120                 125             


Ala Val Tyr Tyr Cys Asn Val Asn Val Gly Phe Glu Tyr Trp Gly Gln 
    130                 135                 140                 


Gly Thr Gln Val Thr Val Ser Ser Pro Asp Trp Ala Gln Ser Gly Gly 
145                 150                 155                 160 


Gly Gly Ser Ser Ser Glu Ala Arg Asp Trp Val Lys Arg Val Gly Gly 
                165                 170                 175     


Lys Thr Ser Leu Ser Val Asn Thr Tyr Ser Gly Glu Val Glu Arg Lys 
            180                 185                 190         


Asn Ile Asn Tyr Ile Met Lys His Thr Val Lys Lys Arg Met Phe Lys 
        195                 200                 205             


Ile Lys Ala Gly Gly Lys Glu Val Ile Val Thr Ala Asp His Ser Val 
    210                 215                 220                 


Met Val Lys Arg Asp Gly Lys Ile Ile Asp Val Lys Pro Thr Glu Met 
225                 230                 235                 240 


Lys Gln Thr Asp Arg Val Val Lys Trp Met Leu Thr Gly Ser His Met 
                245                 250                 255     


Ile Glu Phe Ile Glu Phe Glu Ile Glu Asp Leu Gly Val Met Glu Ile 
            260                 265                 270         


Asp Val Tyr Asp Ile Glu Val Asp Gly Asn His Asn Phe Phe Gly Asn 
        275                 280                 285             


Asp Ile Leu Val His Asn Ser Val Tyr Leu Asn Ser Gln Pro Ala Arg 
    290                 295                 300                 


Ser Lys Asn Leu Lys Ser Pro Gln Lys Val Glu Val Asp Ile Ile Asp 
305                 310                 315                 320 


Asp Asn Phe Ile Leu Arg Trp Asn Arg Ser Asp Glu Ser Val Gly Asn 
                325                 330                 335     


Val Thr Phe Ser Phe Asp Tyr Gln Lys Thr Gly Met Asp Asn Trp Ile 
            340                 345                 350         


Lys Leu Ser Gly Cys Gln Asn Ile Thr Ser Thr Lys Cys Asn Phe Ser 
        355                 360                 365             


Ser Leu Lys Leu Asn Val Tyr Glu Glu Ile Lys Leu Arg Ile Arg Ala 
    370                 375                 380                 


Glu Lys Glu Asn Thr Ser Ser Trp Tyr Glu Val Asp Ser Phe Thr Pro 
385                 390                 395                 400 


Phe Arg Lys Ala Gln Ile Gly Pro Pro Glu Val His Leu Glu Ala Glu 
                405                 410                 415     


Asp Lys Ala Ile Val Ile His Ile Ser Pro Gly Thr Lys Asp Ser Val 
            420                 425                 430         


Met Trp Ala Leu Asp Gly Leu Ser Phe Thr Tyr Ser Leu Leu Ile Trp 
        435                 440                 445             


Lys Asn Ser Ser Gly Val Glu Glu Arg Ile Glu Asn Ile Tyr Ser Arg 
    450                 455                 460                 


His Lys Ile Tyr Lys Leu Ser Pro Glu Thr Thr Tyr Cys Leu Lys Val 
465                 470                 475                 480 


Lys Ala Ala Leu Leu Thr Ser Trp Lys Ile Gly Val Tyr Ser Pro Val 
                485                 490                 495     


His Cys Ile Lys Thr Thr Val Glu Asn Glu Leu Pro Pro Pro Glu Asn 
            500                 505                 510         


Ile Glu Val Ser Val Gln Asn Gln Asn Tyr Val Leu Lys Trp Asp Tyr 
        515                 520                 525             


Thr Tyr Ala Asn Met Thr Phe Gln Val Gln Trp Leu His Ala Phe Leu 
    530                 535                 540                 


Lys Arg Asn Pro Gly Asn His Leu Tyr Lys Trp Lys Gln Ile Pro Asp 
545                 550                 555                 560 


Cys Glu Asn Val Lys Thr Thr Gln Cys Val Phe Pro Gln Asn Val Phe 
                565                 570                 575     


Gln Lys Gly Ile Tyr Leu Leu Arg Val Gln Ala Ser Asp Gly Asn Asn 
            580                 585                 590         


Thr Ser Phe Trp Ser Glu Glu Ile Lys Phe Asp Thr Glu Ile Gln Ala 
        595                 600                 605             


Phe Leu Leu Pro Pro Val Phe Asn Ile Arg Ser Leu Ser Asp Ser Phe 
    610                 615                 620                 


His Ile Tyr Ile Gly Ala Pro Lys Gln Ser Gly Asn Thr Pro Val Ile 
625                 630                 635                 640 


Gln Asp Tyr Pro Leu Ile Tyr Glu Ile Ile Phe Trp Glu Asn Thr Ser 
                645                 650                 655     


Asn Ala Glu Arg Lys Ile Ile Glu Lys Lys Thr Asp Val Thr Val Pro 
            660                 665                 670         


Asn Leu Lys Pro Leu Thr Val Tyr Cys Val Lys Ala Arg Ala His Thr 
        675                 680                 685             


Met Asp Glu Lys Leu Asn Lys Ser Ser Val Phe Ser Asp Ala Val Cys 
    690                 695                 700                 


Glu Lys Thr Lys Pro Gly Asn Thr Ser Lys Ile Trp Leu Ile Val Gly 
705                 710                 715                 720 


Ile Cys Ile Ala Leu Phe Ala Leu Pro Phe Val Ile Tyr Ala Ala Lys 
                725                 730                 735     


Val Phe Leu Arg Cys Ile Asn Tyr Val Phe Phe Pro Ser Leu Lys Pro 
            740                 745                 750         


Ser Ser Ser Ile Asp Glu Tyr Phe Ser Glu Gln Pro Leu Lys Asn Leu 
        755                 760                 765             


Leu Leu Ser Thr Ser Glu Glu Gln Ile Glu Lys Cys Phe Ile Ile Glu 
    770                 775                 780                 


Asn Ile Ser Thr Ile Ala Thr Val Glu Glu Thr Asn Gln Thr Asp Glu 
785                 790                 795                 800 


Asp His Lys Lys Tyr Ser Ser Gln Thr Ser Gln Asp Ser Gly Asn Tyr 
                805                 810                 815     


Ser Asn Glu Asp Glu Ser Glu Ser Lys Thr Ser Glu Glu Leu Gln Gln 
            820                 825                 830         


Asp Phe Val Leu Ala Glu Ala Ser Met Val Ser Lys Gly Glu Glu Asp 
        835                 840                 845             


Asn Met Ala Ile Ile Lys Glu Phe Met Arg Phe Lys Val His Met Glu 
    850                 855                 860                 


Gly Ser Val Asn Gly His Glu Phe Glu Ile Glu Gly Glu Gly Glu Gly 
865                 870                 875                 880 


Arg Pro Tyr Glu Gly Thr Gln Thr Ala Lys Leu Lys Val Thr Lys Gly 
                885                 890                 895     


Gly Pro Leu Pro Phe Ala Trp Asp Ile Leu Ser Pro Gln Phe Met Tyr 
            900                 905                 910         


Gly Ser Lys Ala Tyr Val Lys His Pro Ala Asp Ile Pro Asp Tyr Leu 
        915                 920                 925             


Lys Leu Ser Phe Pro Glu Gly Phe Lys Trp Glu Arg Val Met Asn Phe 
    930                 935                 940                 


Glu Asp Gly Gly Val Val Thr Val Thr Gln Asp Ser Ser Leu Gln Asp 
945                 950                 955                 960 


Gly Glu Phe Ile Tyr Lys Val Lys Leu Arg Gly Thr Asn Phe Pro Ser 
                965                 970                 975     


Asp Gly Pro Val Met Gln Lys Lys Thr Met Gly Trp Glu Ala Ser Ser 
            980                 985                 990         


Glu Arg Met Tyr Pro Glu Asp Gly  Ala Leu Lys Gly Glu  Ile Lys Gln 
        995                 1000                 1005             


Arg Leu  Lys Leu Lys Asp Gly  Gly His Tyr Asp Ala  Glu Val Lys 
    1010                 1015                 1020             


Thr Thr  Tyr Lys Ala Lys Lys  Pro Val Gln Leu Pro  Gly Ala Tyr 
    1025                 1030                 1035             


Asn Val  Asn Ile Lys Leu Asp  Ile Thr Ser His Asn  Glu Asp Tyr 
    1040                 1045                 1050             


Thr Ile  Val Glu Gln Tyr Glu  Arg Ala Glu Gly Arg  His Ser Thr 
    1055                 1060                 1065             


Gly Gly  Met Asp Glu Leu Tyr  Lys 
    1070                 1075     


<210>  272
<211>  1076
<212>  PRT
<213>  Artificial sequence

<220>
<223>  HA-VHHGFP-CLC(N159'A)-IFNAR1-mCherry (17a)

<400>  272

Met Glu Thr Asp Thr Leu Leu Leu Trp Val Leu Leu Leu Trp Val Pro 
1               5                   10                  15      


Gly Ser Thr Gly Asp Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Gly Ala 
            20                  25                  30          


His Ser Arg Met Ala Gln Val Gln Leu Val Glu Ser Gly Gly Ala Leu 
        35                  40                  45              


Val Gln Pro Gly Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe 
    50                  55                  60                  


Pro Val Asn Arg Tyr Ser Met Arg Trp Tyr Arg Gln Ala Pro Gly Lys 
65                  70                  75                  80  


Glu Arg Glu Trp Val Ala Gly Met Ser Ser Ala Gly Asp Arg Ser Ser 
                85                  90                  95      


Tyr Glu Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asp Ala 
            100                 105                 110         


Arg Asn Thr Val Tyr Leu Gln Met Asn Ser Leu Lys Pro Glu Asp Thr 
        115                 120                 125             


Ala Val Tyr Tyr Cys Asn Val Asn Val Gly Phe Glu Tyr Trp Gly Gln 
    130                 135                 140                 


Gly Thr Gln Val Thr Val Ser Ser Pro Asp Trp Ala Gln Ser Gly Gly 
145                 150                 155                 160 


Gly Gly Ser Ser Ser Glu Ala Arg Asp Trp Val Lys Arg Val Gly Gly 
                165                 170                 175     


Lys Thr Ser Leu Ser Val Asn Thr Tyr Ser Gly Glu Val Glu Arg Lys 
            180                 185                 190         


Asn Ile Asn Tyr Ile Met Lys His Thr Val Lys Lys Arg Met Phe Lys 
        195                 200                 205             


Ile Lys Ala Gly Gly Lys Glu Val Ile Val Thr Ala Asp His Ser Val 
    210                 215                 220                 


Met Val Lys Arg Asp Gly Lys Ile Ile Asp Val Lys Pro Thr Glu Met 
225                 230                 235                 240 


Lys Gln Thr Asp Arg Val Val Lys Trp Met Leu Thr Gly Ser His Met 
                245                 250                 255     


Ile Glu Phe Ile Glu Phe Glu Ile Glu Asp Leu Gly Val Met Glu Ile 
            260                 265                 270         


Asp Val Tyr Asp Ile Glu Val Asp Gly Asn His Asn Phe Phe Gly Asn 
        275                 280                 285             


Asp Ile Leu Val His Ala Ser Val Tyr Leu Asn Ser Gln Pro Ala Arg 
    290                 295                 300                 


Ser Lys Asn Leu Lys Ser Pro Gln Lys Val Glu Val Asp Ile Ile Asp 
305                 310                 315                 320 


Asp Asn Phe Ile Leu Arg Trp Asn Arg Ser Asp Glu Ser Val Gly Asn 
                325                 330                 335     


Val Thr Phe Ser Phe Asp Tyr Gln Lys Thr Gly Met Asp Asn Trp Ile 
            340                 345                 350         


Lys Leu Ser Gly Cys Gln Asn Ile Thr Ser Thr Lys Cys Asn Phe Ser 
        355                 360                 365             


Ser Leu Lys Leu Asn Val Tyr Glu Glu Ile Lys Leu Arg Ile Arg Ala 
    370                 375                 380                 


Glu Lys Glu Asn Thr Ser Ser Trp Tyr Glu Val Asp Ser Phe Thr Pro 
385                 390                 395                 400 


Phe Arg Lys Ala Gln Ile Gly Pro Pro Glu Val His Leu Glu Ala Glu 
                405                 410                 415     


Asp Lys Ala Ile Val Ile His Ile Ser Pro Gly Thr Lys Asp Ser Val 
            420                 425                 430         


Met Trp Ala Leu Asp Gly Leu Ser Phe Thr Tyr Ser Leu Leu Ile Trp 
        435                 440                 445             


Lys Asn Ser Ser Gly Val Glu Glu Arg Ile Glu Asn Ile Tyr Ser Arg 
    450                 455                 460                 


His Lys Ile Tyr Lys Leu Ser Pro Glu Thr Thr Tyr Cys Leu Lys Val 
465                 470                 475                 480 


Lys Ala Ala Leu Leu Thr Ser Trp Lys Ile Gly Val Tyr Ser Pro Val 
                485                 490                 495     


His Cys Ile Lys Thr Thr Val Glu Asn Glu Leu Pro Pro Pro Glu Asn 
            500                 505                 510         


Ile Glu Val Ser Val Gln Asn Gln Asn Tyr Val Leu Lys Trp Asp Tyr 
        515                 520                 525             


Thr Tyr Ala Asn Met Thr Phe Gln Val Gln Trp Leu His Ala Phe Leu 
    530                 535                 540                 


Lys Arg Asn Pro Gly Asn His Leu Tyr Lys Trp Lys Gln Ile Pro Asp 
545                 550                 555                 560 


Cys Glu Asn Val Lys Thr Thr Gln Cys Val Phe Pro Gln Asn Val Phe 
                565                 570                 575     


Gln Lys Gly Ile Tyr Leu Leu Arg Val Gln Ala Ser Asp Gly Asn Asn 
            580                 585                 590         


Thr Ser Phe Trp Ser Glu Glu Ile Lys Phe Asp Thr Glu Ile Gln Ala 
        595                 600                 605             


Phe Leu Leu Pro Pro Val Phe Asn Ile Arg Ser Leu Ser Asp Ser Phe 
    610                 615                 620                 


His Ile Tyr Ile Gly Ala Pro Lys Gln Ser Gly Asn Thr Pro Val Ile 
625                 630                 635                 640 


Gln Asp Tyr Pro Leu Ile Tyr Glu Ile Ile Phe Trp Glu Asn Thr Ser 
                645                 650                 655     


Asn Ala Glu Arg Lys Ile Ile Glu Lys Lys Thr Asp Val Thr Val Pro 
            660                 665                 670         


Asn Leu Lys Pro Leu Thr Val Tyr Cys Val Lys Ala Arg Ala His Thr 
        675                 680                 685             


Met Asp Glu Lys Leu Asn Lys Ser Ser Val Phe Ser Asp Ala Val Cys 
    690                 695                 700                 


Glu Lys Thr Lys Pro Gly Asn Thr Ser Lys Ile Trp Leu Ile Val Gly 
705                 710                 715                 720 


Ile Cys Ile Ala Leu Phe Ala Leu Pro Phe Val Ile Tyr Ala Ala Lys 
                725                 730                 735     


Val Phe Leu Arg Cys Ile Asn Tyr Val Phe Phe Pro Ser Leu Lys Pro 
            740                 745                 750         


Ser Ser Ser Ile Asp Glu Tyr Phe Ser Glu Gln Pro Leu Lys Asn Leu 
        755                 760                 765             


Leu Leu Ser Thr Ser Glu Glu Gln Ile Glu Lys Cys Phe Ile Ile Glu 
    770                 775                 780                 


Asn Ile Ser Thr Ile Ala Thr Val Glu Glu Thr Asn Gln Thr Asp Glu 
785                 790                 795                 800 


Asp His Lys Lys Tyr Ser Ser Gln Thr Ser Gln Asp Ser Gly Asn Tyr 
                805                 810                 815     


Ser Asn Glu Asp Glu Ser Glu Ser Lys Thr Ser Glu Glu Leu Gln Gln 
            820                 825                 830         


Asp Phe Val Leu Ala Glu Ala Ser Met Val Ser Lys Gly Glu Glu Asp 
        835                 840                 845             


Asn Met Ala Ile Ile Lys Glu Phe Met Arg Phe Lys Val His Met Glu 
    850                 855                 860                 


Gly Ser Val Asn Gly His Glu Phe Glu Ile Glu Gly Glu Gly Glu Gly 
865                 870                 875                 880 


Arg Pro Tyr Glu Gly Thr Gln Thr Ala Lys Leu Lys Val Thr Lys Gly 
                885                 890                 895     


Gly Pro Leu Pro Phe Ala Trp Asp Ile Leu Ser Pro Gln Phe Met Tyr 
            900                 905                 910         


Gly Ser Lys Ala Tyr Val Lys His Pro Ala Asp Ile Pro Asp Tyr Leu 
        915                 920                 925             


Lys Leu Ser Phe Pro Glu Gly Phe Lys Trp Glu Arg Val Met Asn Phe 
    930                 935                 940                 


Glu Asp Gly Gly Val Val Thr Val Thr Gln Asp Ser Ser Leu Gln Asp 
945                 950                 955                 960 


Gly Glu Phe Ile Tyr Lys Val Lys Leu Arg Gly Thr Asn Phe Pro Ser 
                965                 970                 975     


Asp Gly Pro Val Met Gln Lys Lys Thr Met Gly Trp Glu Ala Ser Ser 
            980                 985                 990         


Glu Arg Met Tyr Pro Glu Asp Gly  Ala Leu Lys Gly Glu  Ile Lys Gln 
        995                 1000                 1005             


Arg Leu  Lys Leu Lys Asp Gly  Gly His Tyr Asp Ala  Glu Val Lys 
    1010                 1015                 1020             


Thr Thr  Tyr Lys Ala Lys Lys  Pro Val Gln Leu Pro  Gly Ala Tyr 
    1025                 1030                 1035             


Asn Val  Asn Ile Lys Leu Asp  Ile Thr Ser His Asn  Glu Asp Tyr 
    1040                 1045                 1050             


Thr Ile  Val Glu Gln Tyr Glu  Arg Ala Glu Gly Arg  His Ser Thr 
    1055                 1060                 1065             


Gly Gly  Met Asp Glu Leu Tyr  Lys 
    1070                 1075     


<210>  273
<211>  700
<212>  PRT
<213>  Artificial sequence

<220>
<223>  HA-eGFP-Trx-TMD-mCherry

<400>  273

Met Glu Thr Asp Thr Leu Leu Leu Trp Val Leu Leu Leu Trp Val Pro 
1               5                   10                  15      


Gly Ser Thr Gly Asp Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Gly Ala 
            20                  25                  30          


His Ser Arg Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro 
        35                  40                  45              


Ile Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val 
    50                  55                  60                  


Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys 
65                  70                  75                  80  


Phe Ile Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val 
                85                  90                  95      


Thr Thr Leu Thr Tyr Gly Val Gln Cys Phe Ser Arg Tyr Pro Asp His 
            100                 105                 110         


Met Lys Gln His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val 
        115                 120                 125             


Gln Glu Arg Thr Ile Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg 
    130                 135                 140                 


Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg Ile Glu Leu 
145                 150                 155                 160 


Lys Gly Ile Asp Phe Lys Glu Asp Gly Asn Ile Leu Gly His Lys Leu 
                165                 170                 175     


Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr Ile Met Ala Asp Lys Gln 
            180                 185                 190         


Lys Asn Gly Ile Lys Val Asn Phe Lys Ile Arg His Asn Ile Glu Asp 
        195                 200                 205             


Gly Ser Val Gln Leu Ala Asp His Tyr Gln Gln Asn Thr Pro Ile Gly 
    210                 215                 220                 


Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gln Ser 
225                 230                 235                 240 


Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu 
                245                 250                 255     


Glu Phe Val Thr Ala Ala Gly Ile Thr Leu Gly Met Asp Glu Leu Tyr 
            260                 265                 270         


Lys Ser Ser Ser Gly Thr Gly Ser Asp Lys Ile Ile His Leu Thr Asp 
        275                 280                 285             


Asp Ser Phe Asp Thr Asp Val Leu Lys Ala Asp Gly Ala Ile Leu Val 
    290                 295                 300                 


Asp Phe Trp Ala His Trp Cys Gly Pro Cys Lys Met Ile Ala Pro Ile 
305                 310                 315                 320 


Leu Asp Glu Ile Ala Asp Glu Tyr Gln Gly Lys Leu Thr Val Ala Lys 
                325                 330                 335     


Leu Asn Ile Asp His Asn Pro Gly Thr Ala Pro Lys Tyr Gly Ile Arg 
            340                 345                 350         


Gly Ile Pro Thr Leu Leu Leu Phe Lys Asn Gly Glu Val Ala Ala Thr 
        355                 360                 365             


Lys Val Gly Ala Leu Ser Lys Gly Gln Leu Lys Glu Phe Leu Asp Ala 
    370                 375                 380                 


Asn Leu Ala Gly Ser Glu Phe Arg Ser His His His His His His Tyr 
385                 390                 395                 400 


Val Asp Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu Asn Ala Val Gly 
                405                 410                 415     


Gln Asp Thr Gln Glu Val Ile Val Val Pro His Ser Leu Pro Phe Lys 
            420                 425                 430         


Val Val Val Ile Ser Ala Ile Leu Ala Leu Val Val Leu Thr Ile Ile 
        435                 440                 445             


Ser Leu Ile Ile Leu Ile Met Leu Trp Gln Lys Lys Pro Arg Ala Ser 
    450                 455                 460                 


Met Val Ser Lys Gly Glu Glu Asp Asn Met Ala Ile Ile Lys Glu Phe 
465                 470                 475                 480 


Met Arg Phe Lys Val His Met Glu Gly Ser Val Asn Gly His Glu Phe 
                485                 490                 495     


Glu Ile Glu Gly Glu Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr 
            500                 505                 510         


Ala Lys Leu Lys Val Thr Lys Gly Gly Pro Leu Pro Phe Ala Trp Asp 
        515                 520                 525             


Ile Leu Ser Pro Gln Phe Met Tyr Gly Ser Lys Ala Tyr Val Lys His 
    530                 535                 540                 


Pro Ala Asp Ile Pro Asp Tyr Leu Lys Leu Ser Phe Pro Glu Gly Phe 
545                 550                 555                 560 


Lys Trp Glu Arg Val Met Asn Phe Glu Asp Gly Gly Val Val Thr Val 
                565                 570                 575     


Thr Gln Asp Ser Ser Leu Gln Asp Gly Glu Phe Ile Tyr Lys Val Lys 
            580                 585                 590         


Leu Arg Gly Thr Asn Phe Pro Ser Asp Gly Pro Val Met Gln Lys Lys 
        595                 600                 605             


Thr Met Gly Trp Glu Ala Ser Ser Glu Arg Met Tyr Pro Glu Asp Gly 
    610                 615                 620                 


Ala Leu Lys Gly Glu Ile Lys Gln Arg Leu Lys Leu Lys Asp Gly Gly 
625                 630                 635                 640 


His Tyr Asp Ala Glu Val Lys Thr Thr Tyr Lys Ala Lys Lys Pro Val 
                645                 650                 655     


Gln Leu Pro Gly Ala Tyr Asn Val Asn Ile Lys Leu Asp Ile Thr Ser 
            660                 665                 670         


His Asn Glu Asp Tyr Thr Ile Val Glu Gln Tyr Glu Arg Ala Glu Gly 
        675                 680                 685             


Arg His Ser Thr Gly Gly Met Asp Glu Leu Tyr Lys 
    690                 695                 700 


<210>  274
<211>  935
<212>  PRT
<213>  Artificial sequence

<220>
<223>  HA-EGFR-mCherry

<400>  274

Met Glu Thr Asp Thr Leu Leu Leu Trp Val Leu Leu Leu Trp Val Pro 
1               5                   10                  15      


Gly Ser Thr Gly Asp Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Gly Ala 
            20                  25                  30          


His Ser Arg Glu Leu Glu Glu Lys Lys Val Cys Gln Gly Thr Ser Asn 
        35                  40                  45              


Lys Leu Thr Gln Leu Gly Thr Phe Glu Asp His Phe Leu Ser Leu Gln 
    50                  55                  60                  


Arg Met Phe Asn Asn Cys Glu Val Val Leu Gly Asn Leu Glu Ile Thr 
65                  70                  75                  80  


Tyr Val Gln Arg Asn Tyr Asp Leu Ser Phe Leu Lys Thr Ile Gln Glu 
                85                  90                  95      


Val Ala Gly Tyr Val Leu Ile Ala Leu Asn Thr Val Glu Arg Ile Pro 
            100                 105                 110         


Leu Glu Asn Leu Gln Ile Ile Arg Gly Asn Met Tyr Tyr Glu Asn Ser 
        115                 120                 125             


Tyr Ala Leu Ala Val Leu Ser Asn Tyr Asp Ala Asn Lys Thr Gly Leu 
    130                 135                 140                 


Lys Glu Leu Pro Met Arg Asn Leu Gln Glu Ile Leu His Gly Ala Val 
145                 150                 155                 160 


Arg Phe Ser Asn Asn Pro Ala Leu Cys Asn Val Glu Ser Ile Gln Trp 
                165                 170                 175     


Arg Asp Ile Val Ser Ser Asp Phe Leu Ser Asn Met Ser Met Asp Phe 
            180                 185                 190         


Gln Asn His Leu Gly Ser Cys Gln Lys Cys Asp Pro Ser Cys Pro Asn 
        195                 200                 205             


Gly Ser Cys Trp Gly Ala Gly Glu Glu Asn Cys Gln Lys Leu Thr Lys 
    210                 215                 220                 


Ile Ile Cys Ala Gln Gln Cys Ser Gly Arg Cys Arg Gly Lys Ser Pro 
225                 230                 235                 240 


Ser Asp Cys Cys His Asn Gln Cys Ala Ala Gly Cys Thr Gly Pro Arg 
                245                 250                 255     


Glu Ser Asp Cys Leu Val Cys Arg Lys Phe Arg Asp Glu Ala Thr Cys 
            260                 265                 270         


Lys Asp Thr Cys Pro Pro Leu Met Leu Tyr Asn Pro Thr Thr Tyr Gln 
        275                 280                 285             


Met Asp Val Asn Pro Glu Gly Lys Tyr Ser Phe Gly Ala Thr Cys Val 
    290                 295                 300                 


Lys Lys Cys Pro Arg Asn Tyr Val Val Thr Asp His Gly Ser Cys Val 
305                 310                 315                 320 


Arg Ala Cys Gly Ala Asp Ser Tyr Glu Met Glu Glu Asp Gly Val Arg 
                325                 330                 335     


Lys Cys Lys Lys Cys Glu Gly Pro Cys Arg Lys Val Cys Asn Gly Ile 
            340                 345                 350         


Gly Ile Gly Glu Phe Lys Asp Ser Leu Ser Ile Asn Ala Thr Asn Ile 
        355                 360                 365             


Lys His Phe Lys Asn Cys Thr Ser Ile Ser Gly Asp Leu His Ile Leu 
    370                 375                 380                 


Pro Val Ala Phe Arg Gly Asp Ser Phe Thr His Thr Pro Pro Leu Asp 
385                 390                 395                 400 


Pro Gln Glu Leu Asp Ile Leu Lys Thr Val Lys Glu Ile Thr Gly Phe 
                405                 410                 415     


Leu Leu Ile Gln Ala Trp Pro Glu Asn Arg Thr Asp Leu His Ala Phe 
            420                 425                 430         


Glu Asn Leu Glu Ile Ile Arg Gly Arg Thr Lys Gln His Gly Gln Phe 
        435                 440                 445             


Ser Leu Ala Val Val Ser Leu Asn Ile Thr Ser Leu Gly Leu Arg Ser 
    450                 455                 460                 


Leu Lys Glu Ile Ser Asp Gly Asp Val Ile Ile Ser Gly Asn Lys Asn 
465                 470                 475                 480 


Leu Cys Tyr Ala Asn Thr Ile Asn Trp Lys Lys Leu Phe Gly Thr Ser 
                485                 490                 495     


Gly Gln Lys Thr Lys Ile Ile Ser Asn Arg Gly Glu Asn Ser Cys Lys 
            500                 505                 510         


Ala Thr Gly Gln Val Cys His Ala Leu Cys Ser Pro Glu Gly Cys Trp 
        515                 520                 525             


Gly Pro Glu Pro Arg Asp Cys Val Ser Cys Arg Asn Val Ser Arg Gly 
    530                 535                 540                 


Arg Glu Cys Val Asp Lys Cys Asn Leu Leu Glu Gly Glu Pro Arg Glu 
545                 550                 555                 560 


Phe Val Glu Asn Ser Glu Cys Ile Gln Cys His Pro Glu Cys Leu Pro 
                565                 570                 575     


Gln Ala Met Asn Ile Thr Cys Thr Gly Arg Gly Pro Asp Asn Cys Ile 
            580                 585                 590         


Gln Cys Ala His Tyr Ile Asp Gly Pro His Cys Val Lys Thr Cys Pro 
        595                 600                 605             


Ala Gly Val Met Gly Glu Asn Asn Thr Leu Val Trp Lys Tyr Ala Asp 
    610                 615                 620                 


Ala Gly His Val Cys His Leu Cys His Pro Asn Cys Thr Tyr Gly Cys 
625                 630                 635                 640 


Thr Gly Pro Gly Leu Glu Gly Cys Pro Thr Asn Gly Pro Lys Ile Pro 
                645                 650                 655     


Ser Ile Ala Thr Gly Met Val Gly Ala Leu Leu Leu Leu Leu Val Val 
            660                 665                 670         


Ala Leu Gly Ile Gly Leu Phe Met Arg Arg Arg His Ile Val Arg Lys 
        675                 680                 685             


Arg Thr Leu Arg Arg Gly Ser Gly Ser Ala Ser Met Val Ser Lys Gly 
    690                 695                 700                 


Glu Glu Asp Asn Met Ala Ile Ile Lys Glu Phe Met Arg Phe Lys Val 
705                 710                 715                 720 


His Met Glu Gly Ser Val Asn Gly His Glu Phe Glu Ile Glu Gly Glu 
                725                 730                 735     


Gly Glu Gly Arg Pro Tyr Glu Gly Thr Gln Thr Ala Lys Leu Lys Val 
            740                 745                 750         


Thr Lys Gly Gly Pro Leu Pro Phe Ala Trp Asp Ile Leu Ser Pro Gln 
        755                 760                 765             


Phe Met Tyr Gly Ser Lys Ala Tyr Val Lys His Pro Ala Asp Ile Pro 
    770                 775                 780                 


Asp Tyr Leu Lys Leu Ser Phe Pro Glu Gly Phe Lys Trp Glu Arg Val 
785                 790                 795                 800 


Met Asn Phe Glu Asp Gly Gly Val Val Thr Val Thr Gln Asp Ser Ser 
                805                 810                 815     


Leu Gln Asp Gly Glu Phe Ile Tyr Lys Val Lys Leu Arg Gly Thr Asn 
            820                 825                 830         


Phe Pro Ser Asp Gly Pro Val Met Gln Lys Lys Thr Met Gly Trp Glu 
        835                 840                 845             


Ala Ser Ser Glu Arg Met Tyr Pro Glu Asp Gly Ala Leu Lys Gly Glu 
    850                 855                 860                 


Ile Lys Gln Arg Leu Lys Leu Lys Asp Gly Gly His Tyr Asp Ala Glu 
865                 870                 875                 880 


Val Lys Thr Thr Tyr Lys Ala Lys Lys Pro Val Gln Leu Pro Gly Ala 
                885                 890                 895     


Tyr Asn Val Asn Ile Lys Leu Asp Ile Thr Ser His Asn Glu Asp Tyr 
            900                 905                 910         


Thr Ile Val Glu Gln Tyr Glu Arg Ala Glu Gly Arg His Ser Thr Gly 
        915                 920                 925             


Gly Met Asp Glu Leu Tyr Lys 
    930                 935 


<210>  275
<211>  259
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Amino acid sequence of CL-intein construct IntC-Trx-H6

<400>  275

Met Gly Glu Ala Arg Asp Trp Val Lys Arg Val Gly Gly Lys Thr Ser 
1               5                   10                  15      


Leu Ser Val Asn Thr Tyr Ser Gly Glu Val Glu Arg Lys Asn Ile Asn 
            20                  25                  30          


Tyr Ile Met Lys His Thr Val Lys Lys Arg Met Phe Lys Ile Lys Ala 
        35                  40                  45              


Gly Gly Lys Glu Val Ile Val Thr Ala Asp His Ser Val Met Val Lys 
    50                  55                  60                  


Arg Asp Gly Lys Ile Ile Asp Val Lys Pro Thr Glu Met Lys Gln Thr 
65                  70                  75                  80  


Asp Arg Val Val Lys Trp Met Leu Thr Gly Ser His Met Ile Glu Phe 
                85                  90                  95      


Ile Glu Phe Glu Ile Glu Asp Leu Gly Val Met Glu Ile Asp Val Tyr 
            100                 105                 110         


Asp Ile Glu Val Asp Gly Asn His Asn Phe Phe Gly Asn Asp Ile Leu 
        115                 120                 125             


Val His Asn Ser Val Tyr Leu Asn Gly Thr Gly Ser Asp Lys Ile Ile 
    130                 135                 140                 


His Leu Thr Asp Asp Ser Phe Asp Thr Asp Val Leu Lys Ala Asp Gly 
145                 150                 155                 160 


Ala Ile Leu Val Asp Phe Trp Ala His Trp Cys Gly Pro Cys Lys Met 
                165                 170                 175     


Ile Ala Pro Ile Leu Asp Glu Ile Ala Asp Glu Tyr Gln Gly Lys Leu 
            180                 185                 190         


Thr Val Ala Lys Leu Asn Ile Asp His Asn Pro Gly Thr Ala Pro Lys 
        195                 200                 205             


Tyr Gly Ile Arg Gly Ile Pro Thr Leu Leu Leu Phe Lys Asn Gly Glu 
    210                 215                 220                 


Val Ala Ala Thr Lys Val Gly Ala Leu Ser Lys Gly Gln Leu Lys Glu 
225                 230                 235                 240 


Phe Leu Asp Ala Asn Leu Ala Gly Ser Glu Phe Arg Ser His His His 
                245                 250                 255     


His His His 
            


<210>  276
<211>  222
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Amino acid sequence of CL-intein construct SBP-IntC-SBP

<400>  276

Met Asp Glu Lys Thr Thr Gly Trp Arg Gly Gly His Val Val Glu Gly 
1               5                   10                  15      


Leu Ala Gly Glu Leu Glu Gln Leu Arg Ala Arg Leu Glu His His Pro 
            20                  25                  30          


Gln Gly Gln Arg Glu Pro Gly Ala Ser Gly Gly Gly Gly Ser Ser Ser 
        35                  40                  45              


Glu Ala Arg Asp Trp Val Lys Arg Val Gly Gly Lys Thr Ser Leu Ser 
    50                  55                  60                  


Val Asn Thr Tyr Ser Gly Glu Val Glu Arg Lys Asn Ile Asn Tyr Ile 
65                  70                  75                  80  


Met Lys His Thr Val Lys Lys Arg Met Phe Lys Ile Lys Ala Gly Gly 
                85                  90                  95      


Lys Glu Val Ile Val Thr Ala Asp His Ser Val Met Val Lys Arg Asp 
            100                 105                 110         


Gly Lys Ile Ile Asp Val Lys Pro Thr Glu Met Lys Gln Thr Asp Arg 
        115                 120                 125             


Val Val Lys Trp Met Leu Thr Gly Ser His Met Ile Glu Phe Ile Glu 
    130                 135                 140                 


Phe Glu Ile Glu Asp Leu Gly Val Met Glu Ile Asp Val Tyr Asp Ile 
145                 150                 155                 160 


Glu Val Asp Gly Asn His Asn Phe Phe Gly Asn Asp Ile Leu Val His 
                165                 170                 175     


Asn Ser Val Tyr Leu Asn Gly Thr Met Asp Glu Lys Thr Thr Gly Trp 
            180                 185                 190         


Arg Gly Gly His Val Val Glu Gly Leu Ala Gly Glu Leu Glu Gln Leu 
        195                 200                 205             


Arg Ala Arg Leu Glu His His Pro Gln Gly Gln Arg Glu Pro 
    210                 215                 220         


<210>  277
<211>  419
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Amino acid sequence of PolB-16_OarG-intein construct MBP-IntN-H6

<400>  277

Met Lys Thr Glu Glu Gly Lys Leu Val Ile Trp Ile Asn Gly Asp Lys 
1               5                   10                  15      


Gly Tyr Asn Gly Leu Ala Glu Val Gly Lys Lys Phe Glu Lys Asp Thr 
            20                  25                  30          


Gly Ile Lys Val Thr Val Glu His Pro Asp Lys Leu Glu Glu Lys Phe 
        35                  40                  45              


Pro Gln Val Ala Ala Thr Gly Asp Gly Pro Asp Ile Ile Phe Trp Ala 
    50                  55                  60                  


His Asp Arg Phe Gly Gly Tyr Ala Gln Ser Gly Leu Leu Ala Glu Ile 
65                  70                  75                  80  


Thr Pro Asp Lys Ala Phe Gln Asp Lys Leu Tyr Pro Phe Thr Trp Asp 
                85                  90                  95      


Ala Val Arg Tyr Asn Gly Lys Leu Ile Ala Tyr Pro Ile Ala Val Glu 
            100                 105                 110         


Ala Leu Ser Leu Ile Tyr Asn Lys Asp Leu Leu Pro Asn Pro Pro Lys 
        115                 120                 125             


Thr Trp Glu Glu Ile Pro Ala Leu Asp Lys Glu Leu Lys Ala Lys Gly 
    130                 135                 140                 


Lys Ser Ala Leu Met Phe Asn Leu Gln Glu Pro Tyr Phe Thr Trp Pro 
145                 150                 155                 160 


Leu Ile Ala Ala Asp Gly Gly Tyr Ala Phe Lys Tyr Glu Asn Gly Lys 
                165                 170                 175     


Tyr Asp Ile Lys Asp Val Gly Val Asp Asn Ala Gly Ala Lys Ala Gly 
            180                 185                 190         


Leu Thr Phe Leu Val Asp Leu Ile Lys Asn Lys His Met Asn Ala Asp 
        195                 200                 205             


Thr Asp Tyr Ser Ile Ala Glu Ala Ala Phe Asn Lys Gly Glu Thr Ala 
    210                 215                 220                 


Met Thr Ile Asn Gly Pro Trp Ala Trp Ser Asn Ile Asp Thr Ser Lys 
225                 230                 235                 240 


Val Asn Tyr Gly Val Thr Val Leu Pro Thr Phe Lys Gly Gln Pro Ser 
                245                 250                 255     


Lys Pro Phe Val Gly Val Leu Ser Ala Gly Ile Asn Ala Ala Ser Pro 
            260                 265                 270         


Asn Lys Glu Leu Ala Lys Glu Phe Leu Glu Asn Tyr Leu Leu Thr Asp 
        275                 280                 285             


Glu Gly Leu Glu Ala Val Asn Lys Asp Lys Pro Leu Gly Ala Val Ala 
    290                 295                 300                 


Leu Lys Ser Tyr Glu Glu Glu Leu Ala Lys Asp Pro Arg Ile Ala Ala 
305                 310                 315                 320 


Thr Met Glu Asn Ala Gln Lys Gly Glu Ile Met Pro Asn Ile Pro Gln 
                325                 330                 335     


Met Ser Ala Phe Trp Tyr Ala Val Arg Thr Ala Val Ile Asn Ala Ala 
            340                 345                 350         


Ser Gly Arg Gln Thr Val Asp Glu Ala Leu Lys Asp Ala Gln Thr Asn 
        355                 360                 365             


Ser Ser Ser Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn Leu Gly Ile 
    370                 375                 380                 


Glu Gly Arg Ile Ser Glu Phe Ser Gly Asp Thr Asp Ser Val His Gly 
385                 390                 395                 400 


Lys Thr His Val Phe Ile Arg Ser Ile Lys Asn Gly Ser His His His 
                405                 410                 415     


His His His 
            


<210>  278
<211>  296
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Amino acid sequence of PolB-16_OarG-intein construct IntC-Trx-H6

<400>  278

Met Gln Glu Ala Lys Ile Asp Ile Lys Ser Leu Tyr Asp Ser Leu Ala 
1               5                   10                  15      


Lys Lys Tyr Asp Val Gln His Lys Asn Ser Tyr Glu Val Ile Tyr Pro 
            20                  25                  30          


Lys Gly Tyr Glu Ile Lys Val Leu Gly Asn Lys Tyr Val Lys Leu Val 
        35                  40                  45              


Ala Met Ser Arg His Lys Thr Gln Lys His Leu Val Lys Ile Val Val 
    50                  55                  60                  


Lys Ser Glu Lys Thr Ile Asp Ser Leu Asp Pro Ile Arg Gln Lys Ser 
65                  70                  75                  80  


Leu Leu Lys Lys Gln Asp Glu Val Val Val Thr Thr Asp His Ile Cys 
                85                  90                  95      


Met Val Tyr Asn Asp Asp His Phe Phe Glu Asn Val Asn Ala Lys Asn 
            100                 105                 110         


Leu Lys Val Gly Asn Tyr Val Ser Val Tyr Asp Glu Ala Ser Asp Lys 
        115                 120                 125             


Glu Val Ile Gly Glu Ile Ala Ser Ile Glu Asp Leu Gly Met Thr Asp 
    130                 135                 140                 


Asp Tyr Val Tyr Asp Cys Glu Val Asp Asp Asp Ser His Ala Phe Tyr 
145                 150                 155                 160 


Ala Ser Asn Ile Leu Val His Asn Ser Gln Phe Cys Asn Gly Thr Gly 
                165                 170                 175     


Ser Asp Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr Asp Val 
            180                 185                 190         


Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala His Trp Cys 
        195                 200                 205             


Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala Asp Glu 
    210                 215                 220                 


Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp His Asn Pro 
225                 230                 235                 240 


Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu Leu Leu 
                245                 250                 255     


Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser Lys 
            260                 265                 270         


Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser Glu Phe 
        275                 280                 285             


Arg Ser His His His His His His 
    290                 295     


<210>  279
<211>  296
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Amino acid sequence of PolB-16_OarG-intein construct IntC(C96A, 
       C150A)-Trx-H6

<400>  279

Met Gln Glu Ala Lys Ile Asp Ile Lys Ser Leu Tyr Asp Ser Leu Ala 
1               5                   10                  15      


Lys Lys Tyr Asp Val Gln His Lys Asn Ser Tyr Glu Val Ile Tyr Pro 
            20                  25                  30          


Lys Gly Tyr Glu Ile Lys Val Leu Gly Asn Lys Tyr Val Lys Leu Val 
        35                  40                  45              


Ala Met Ser Arg His Lys Thr Gln Lys His Leu Val Lys Ile Val Val 
    50                  55                  60                  


Lys Ser Glu Lys Thr Ile Asp Ser Leu Asp Pro Ile Arg Gln Lys Ser 
65                  70                  75                  80  


Leu Leu Lys Lys Gln Asp Glu Val Val Val Thr Thr Asp His Ile Ala 
                85                  90                  95      


Met Val Tyr Asn Asp Asp His Phe Phe Glu Asn Val Asn Ala Lys Asn 
            100                 105                 110         


Leu Lys Val Gly Asn Tyr Val Ser Val Tyr Asp Glu Ala Ser Asp Lys 
        115                 120                 125             


Glu Val Ile Gly Glu Ile Ala Ser Ile Glu Asp Leu Gly Met Thr Asp 
    130                 135                 140                 


Asp Tyr Val Tyr Asp Ala Glu Val Asp Asp Asp Ser His Ala Phe Tyr 
145                 150                 155                 160 


Ala Ser Asn Ile Leu Val His Asn Ser Gln Phe Cys Asn Gly Thr Gly 
                165                 170                 175     


Ser Asp Lys Ile Ile His Leu Thr Asp Asp Ser Phe Asp Thr Asp Val 
            180                 185                 190         


Leu Lys Ala Asp Gly Ala Ile Leu Val Asp Phe Trp Ala His Trp Cys 
        195                 200                 205             


Gly Pro Cys Lys Met Ile Ala Pro Ile Leu Asp Glu Ile Ala Asp Glu 
    210                 215                 220                 


Tyr Gln Gly Lys Leu Thr Val Ala Lys Leu Asn Ile Asp His Asn Pro 
225                 230                 235                 240 


Gly Thr Ala Pro Lys Tyr Gly Ile Arg Gly Ile Pro Thr Leu Leu Leu 
                245                 250                 255     


Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser Lys 
            260                 265                 270         


Gly Gln Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser Glu Phe 
        275                 280                 285             


Arg Ser His His His His His His 
    290                 295     


<210>  280
<211>  6
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Cys tag

<400>  280

Met Gly Cys Asp Thr Asp 
1               5       


<210>  281
<211>  14
<212>  PRT
<213>  Artificial sequence

<220>
<223>  Cys tag

<400>  281

Ser Val Tyr Ala Ser Pro Ala Ala Pro Ala Pro Ala Ser Cys 
1               5                   10                  


<210>  282
<211>  7
<212>  PRT
<213>  Artificial sequence

<220>
<223>  linker sequence

<400>  282

Met Gly Ser Gly Gly Ser Gly 
1               5           


<210>  283
<211>  5
<212>  PRT
<213>  Artificial sequence

<220>
<223>  flanking amino acid sequence

<400>  283

Tyr Ile Asp Thr Asp 
1               5   


<210>  284
<211>  5
<212>  PRT
<213>  Artificial sequence

<220>
<223>  flanking amino acid sequence

<400>  284

Ser Val Tyr Leu Asn 
1               5   


<210>  285
<211>  4
<212>  PRT
<213>  Artificial sequence

<220>
<223>  flanking amino acid sequence

<400>  285

Ile Asp Thr Asp 
1               


<210>  286
<211>  4
<212>  PRT
<213>  Artificial sequence

<220>
<223>  flanking amino acid sequence

<400>  286

Ser Val Tyr Leu 
1               


<210>  287
<211>  125
<212>  PRT
<213>  T4-like bacteriophage of Aeromonas salmonicida

<400>  287

Tyr Ile Asp Thr Asp Ser Val Val Gly Asp Thr Ile Ile Asp Val Ser 
1               5                   10                  15      


Gly Lys Lys Met Thr Ile Ala Glu Phe Tyr Asp Ser Thr Pro Asp Val 
            20                  25                  30          


Phe Met Arg Arg Asn Asp Glu Ala Arg Asp Trp Val Lys Arg Val Gly 
        35                  40                  45              


Gly Lys Thr Ser Leu Ser Val Asn Thr Tyr Ser Gly Glu Val Glu Arg 
    50                  55                  60                  


Lys Asn Ile Asn Tyr Ile Met Lys His Thr Val Lys Lys Arg Met Phe 
65                  70                  75                  80  


Lys Ile Lys Ala Gly Gly Lys Glu Val Ile Val Thr Ala Asp His Ser 
                85                  90                  95      


Val Met Val Lys Arg Asp Gly Lys Ile Ile Asp Val Lys Pro Thr Glu 
            100                 105                 110         


Met Lys Gln Thr Asp Arg Val Val Lys Trp Met Leu Thr 
        115                 120                 125 


<210>  288
<211>  44
<212>  PRT
<213>  T4-like bacteriophage of Aeromonas salmonicida

<400>  288

Met Ile Glu Phe Ile Glu Phe Glu Ile Glu Asp Leu Gly Val Met Glu 
1               5                   10                  15      


Ile Asp Val Tyr Asp Ile Glu Val Asp Gly Asn His Asn Phe Phe Gly 
            20                  25                  30          


Asn Asp Ile Leu Val His Asn Ser Val Tyr Leu Asn 
        35                  40                  


<210>  289
<211>  160
<212>  PRT
<213>  T4-like bacteriophage of Aeromonas salmonicida

<400>  289

Ser Val Val Gly Asp Thr Ile Ile Asp Val Ser Gly Lys Lys Met Thr 
1               5                   10                  15      


Ile Ala Glu Phe Tyr Asp Ser Thr Pro Asp Val Phe Met Arg Arg Asn 
            20                  25                  30          


Asp Glu Ala Arg Asp Trp Val Lys Arg Val Gly Gly Lys Thr Ser Leu 
        35                  40                  45              


Ser Val Asn Thr Tyr Ser Gly Glu Val Glu Arg Lys Asn Ile Asn Tyr 
    50                  55                  60                  


Ile Met Lys His Thr Val Lys Lys Arg Met Phe Lys Ile Lys Ala Gly 
65                  70                  75                  80  


Gly Lys Glu Val Ile Val Thr Ala Asp His Ser Val Met Val Lys Arg 
                85                  90                  95      


Asp Gly Lys Ile Ile Asp Val Lys Pro Thr Glu Met Lys Gln Thr Asp 
            100                 105                 110         


Arg Val Val Lys Trp Met Leu Thr Met Ile Glu Phe Ile Glu Phe Glu 
        115                 120                 125             


Ile Glu Asp Leu Gly Val Met Glu Ile Asp Val Tyr Asp Ile Glu Val 
    130                 135                 140                 


Asp Gly Asn His Asn Phe Phe Gly Asn Asp Ile Leu Val His Asn Ser 
145                 150                 155                 160 


<210>  290
<211>  155
<212>  PRT
<213>  Unknown

<220>
<223>  Ssp DnaB

<400>  290

Cys Ile Ser Gly Asp Ser Leu Ile Ser Leu Ala Ser Thr Gly Lys Arg 
1               5                   10                  15      


Val Ser Ile Lys Asp Leu Leu Asp Glu Lys Asp Phe Glu Ile Trp Ala 
            20                  25                  30          


Ile Asn Glu Gln Thr Met Lys Leu Glu Ser Ala Lys Val Ser Arg Val 
        35                  40                  45              


Phe Met Thr Gly Lys Lys Leu Val Tyr Ile Leu Lys Thr Arg Leu Gly 
    50                  55                  60                  


Arg Thr Ile Lys Ala Thr Ala Asn His Arg Phe Leu Thr Ile Asp Gly 
65                  70                  75                  80  


Trp Lys Arg Leu Asp Glu Leu Ser Leu Lys Glu His Ile Ala Leu Pro 
                85                  90                  95      


Arg Lys Leu Glu Ser Ser Ser Leu Gln Leu Ser Pro Glu Ile Glu Lys 
            100                 105                 110         


Leu Ser Gln Ser Asp Ile Ser Trp Asp Ser Ile Val Ser Ile Thr Glu 
        115                 120                 125             


Thr Gly Val Glu Glu Val Phe Asp Leu Thr Val Pro Gly Pro His Asn 
    130                 135                 140                 


Phe Val Ala Asn Asp Ile Ile Val His Asn Ser 
145                 150                 155 


<210>  291
<211>  31
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntN (CLN) from T4-like bacteriophage of Aeromonas 

       salmonicida with five residues of extein sequence

<400>  291

Tyr Ile Asp Thr Asp Ser Val Val Gly Asp Thr Ile Ile Asp Val Ser 
1               5                   10                  15      


Gly Lys Lys Met Thr Ile Ala Glu Phe Tyr Asp Ser Thr Pro Asp 
            20                  25                  30      


<210>  292
<211>  134
<212>  PRT
<213>  Artificial sequence

<220>
<223>  modified IntC (CLC) from T4-like bacteriophage of Aeromonas 

       salmonicida with five residues of extein sequence

<400>  292

Glu Ala Arg Asp Trp Val Lys Arg Val Gly Gly Lys Thr Ser Leu Ser 
1               5                   10                  15      


Val Asn Thr Tyr Ser Gly Glu Val Glu Arg Lys Asn Ile Asn Tyr Ile 
            20                  25                  30          


Met Lys His Thr Val Lys Lys Arg Met Phe Lys Ile Lys Ala Gly Gly 
        35                  40                  45              


Lys Glu Val Ile Val Thr Ala Asp His Ser Val Met Val Lys Arg Asp 
    50                  55                  60                  


Gly Lys Ile Ile Asp Val Lys Pro Thr Glu Met Lys Gln Thr Asp Arg 
65                  70                  75                  80  


Val Val Lys Trp Met Leu Thr Gly Ser His Met Ile Glu Phe Ile Glu 
                85                  90                  95      


Phe Glu Ile Glu Asp Leu Gly Val Met Glu Ile Asp Val Tyr Asp Ile 
            100                 105                 110         


Glu Val Asp Gly Asn His Asn Phe Phe Gly Asn Asp Ile Leu Val His 
        115                 120                 125             


Asn Ser Val Tyr Leu Asn 
    130                 


