               SEQUENCE LISTING

<110> SNIPR Technologies Ltd

<120> Vectors and Methods

<130> 157039.00028

<150> GB1710126.2
<151> 2017-06-25

<160> 74

<170> BiSSAP 1.3.6

<210> 1
<211> 30
<212> DNA
<213> Artificial Sequence


<220> 
<223> UP-IGLB

<400> 1
ttgttctcct tcatatgctc cgacatttct                                     30


<210> 2
<211> 30
<212> DNA
<213> Artificial Sequence


<220> 
<223> DOWN-IGLB

<400> 2
cttcgggaat gattgttatc aatgacgata                                     30


<210> 3
<211> 314
<212> PRT
<213> Artificial Sequence


<220> 
<223> Escherichia coli K-12 MG1655 LeuO

<400> 3
Met Pro Glu Val Gln Thr Asp His Pro Glu Thr Ala Glu Leu Ser Lys 
1               5                   10                  15      
Pro Gln Leu Arg Met Val Asp Leu Asn Leu Leu Thr Val Phe Asp Ala 
            20                  25                  30          
Val Met Gln Glu Gln Asn Ile Thr Arg Ala Ala His Val Leu Gly Met 
        35                  40                  45              
Ser Gln Pro Ala Val Ser Asn Ala Val Ala Arg Leu Lys Val Met Phe 
    50                  55                  60                  
Asn Asp Glu Leu Phe Val Arg Tyr Gly Arg Gly Ile Gln Pro Thr Ala 
65                  70                  75                  80  
Arg Ala Phe Gln Leu Phe Gly Ser Val Arg Gln Ala Leu Gln Leu Val 
                85                  90                  95      
Gln Asn Glu Leu Pro Gly Ser Gly Phe Glu Pro Ala Ser Ser Glu Arg 
            100                 105                 110         
Val Phe His Leu Cys Val Cys Ser Pro Leu Asp Ser Ile Leu Thr Ser 
        115                 120                 125             
Gln Ile Tyr Asn His Ile Glu Gln Ile Ala Pro Asn Ile His Val Met 
    130                 135                 140                 
Phe Lys Ser Ser Leu Asn Gln Asn Thr Glu His Gln Leu Arg Tyr Gln 
145                 150                 155                 160 
Glu Thr Glu Phe Val Ile Ser Tyr Glu Asp Phe His Arg Pro Glu Phe 
                165                 170                 175     
Thr Ser Val Pro Leu Phe Lys Asp Glu Met Val Leu Val Ala Ser Lys 
            180                 185                 190         
Asn His Pro Thr Ile Lys Gly Pro Leu Leu Lys His Asp Val Tyr Asn 
        195                 200                 205             
Glu Gln His Ala Ala Val Ser Leu Asp Arg Phe Ala Ser Phe Ser Gln 
    210                 215                 220                 
Pro Trp Tyr Asp Thr Val Asp Lys Gln Ala Ser Ile Ala Tyr Gln Gly 
225                 230                 235                 240 
Met Ala Met Met Ser Val Leu Ser Val Val Ser Gln Thr His Leu Val 
                245                 250                 255     
Ala Ile Ala Pro Arg Trp Leu Ala Glu Glu Phe Ala Glu Ser Leu Glu 
            260                 265                 270         
Leu Gln Val Leu Pro Leu Pro Leu Lys Gln Asn Ser Arg Thr Cys Tyr 
        275                 280                 285             
Leu Ser Trp His Glu Ala Ala Gly Arg Asp Lys Gly His Gln Trp Met 
    290                 295                 300                 
Glu Glu Gln Leu Val Ser Ile Cys Lys Arg 
305                 310                 

<210> 4
<211> 945
<212> DNA
<213> Artificial Sequence


<220> 
<223> Escherichia coli K-12 MG1655 LeuO

<400> 4
atgccagagg tacaaacaga tcatccagag acggcggagt taagcaaacc acagctacgc      60

atggtcgatc tcaacttatt aaccgttttc gatgccgtga tgcaggagca aaacattact     120

cgtgccgctc atgttctggg aatgtcgcaa cctgcggtca gtaacgctgt tgcacgcctg     180

aaggtgatgt ttaatgacga gctttttgtt cgttatggcc gtggtattca accgactgct     240

cgcgcatttc aactttttgg ttcagttcgt caggcattgc aactagtaca aaatgaattg     300

cctggttcag gttttgaacc cgcgagcagt gaacgtgtat ttcatctttg tgtttgcagc     360

ccgttagaca gcattctgac ctcgcagatt tataatcaca ttgagcagat tgcgccaaat     420

atacatgtta tgttcaagtc ttcattaaat cagaacactg aacatcagct gcgttatcag     480

gaaacggagt ttgtgattag ttatgaagac ttccatcgtc ctgaatttac cagcgtacca     540

ttatttaaag atgaaatggt gctggtagcc agcaaaaatc atccaacaat taagggcccg     600

ttactgaaac atgatgttta taacgaacaa catgcggcgg tttcgctcga tcgtttcgcg     660

tcatttagtc aaccttggta tgacacggta gataagcaag ccagtatcgc gtatcagggc     720

atggcaatga tgagcgtact tagcgtggtg tcgcaaacgc atttggtcgc tattgcgccg     780

cgttggctgg ctgaagagtt cgctgaatcc ttagaattac aggtattacc gctgccgtta     840

aaacaaaaca gcagaacctg ttatctctcc tggcatgaag ctgccgggcg cgataaaggc     900

catcagtgga tggaagagca attagtctca atttgcaaac gctaa                     945


<210> 5
<211> 320
<212> PRT
<213> Artificial Sequence


<220> 
<223> Escherichia coli O157 H7 EDL933 (EHEC) LeuO

<400> 5
Met Thr Val Glu Leu Ser Met Pro Glu Val Gln Thr Asp His Pro Glu 
1               5                   10                  15      
Thr Ala Glu Phe Ser Lys Pro Gln Leu Arg Met Val Asp Leu Asn Leu 
            20                  25                  30          
Leu Thr Val Phe Asp Ala Val Met Gln Glu Gln Asn Ile Thr Arg Ala 
        35                  40                  45              
Ala His Val Leu Gly Met Ser Gln Pro Ala Val Ser Asn Ala Val Ala 
    50                  55                  60                  
Arg Leu Lys Val Met Phe Asn Asp Glu Leu Phe Val Arg Tyr Gly Arg 
65                  70                  75                  80  
Gly Ile Gln Pro Thr Ala Arg Ala Phe Gln Leu Phe Gly Ser Val Arg 
                85                  90                  95      
Gln Ala Leu Gln Leu Val Gln Asn Glu Leu Pro Gly Ser Gly Phe Glu 
            100                 105                 110         
Pro Ala Ser Ser Glu Arg Val Phe His Leu Cys Val Cys Ser Pro Leu 
        115                 120                 125             
Asp Ser Ile Leu Thr Ser Gln Ile Tyr Asn His Ile Glu Gln Ile Ala 
    130                 135                 140                 
Pro Asn Ile His Val Met Phe Lys Ser Ser Leu Asn Gln Asn Thr Glu 
145                 150                 155                 160 
His Gln Leu Arg Tyr Gln Glu Thr Glu Phe Val Ile Ser Tyr Glu Asp 
                165                 170                 175     
Phe His Arg Pro Glu Phe Thr Ser Val Pro Leu Phe Lys Asp Glu Met 
            180                 185                 190         
Val Leu Val Ala Ser Lys Asn His Pro Thr Ile Lys Gly Pro Leu Leu 
        195                 200                 205             
Lys His Asp Val Tyr Asn Glu Gln His Ala Ala Val Ser Leu Asp Arg 
    210                 215                 220                 
Phe Ala Ser Phe Ser Gln Pro Trp Tyr Asp Thr Val Asp Lys Gln Ala 
225                 230                 235                 240 
Ser Ile Ala Tyr Gln Gly Met Ala Met Met Ser Val Leu Ser Val Val 
                245                 250                 255     
Ser Gln Thr His Leu Val Ala Ile Ala Pro Arg Trp Leu Ala Glu Glu 
            260                 265                 270         
Phe Ala Glu Ser Leu Glu Leu Gln Val Leu Pro Leu Pro Leu Lys Leu 
        275                 280                 285             
Asn Ser Arg Thr Cys Tyr Leu Ser Trp His Glu Ala Ala Gly Arg Asp 
    290                 295                 300                 
Lys Gly His Gln Trp Met Glu Glu Gln Leu Val Ser Ile Cys Lys Arg 
305                 310                 315                 320 



<210> 6
<211> 963
<212> DNA
<213> Artificial Sequence


<220> 
<223> Escherichia coli O157 H7 EDL933 (EHEC) LeuO

<400> 6
gtgacagtgg agttaagtat gccagaggta caaacagatc atccagagac ggcggagttc      60

agcaagccac agctacgcat ggtcgatctc aacttattaa ccgttttcga tgccgtgatg     120

caggagcaaa acattacccg tgctgctcat gttctgggaa tgtcgcaacc tgcggtcagt     180

aacgctgttg cacgcctgaa ggtgatgttt aatgacgagc tttttgttcg ttatggccgt     240

ggtattcaac cgactgctcg cgcatttcaa ctttttggtt cagttcgtca ggcattgcaa     300

ctagtacaaa atgaattgcc tggttcaggt tttgaacccg cgagcagtga acgtgtattt     360

catctttgtg tttgcagccc gttagacagt attctgacct cgcagattta taatcacatt     420

gagcagattg cgccaaatat acatgttatg ttcaagtctt cattaaatca gaacactgaa     480

catcagctgc gttatcagga aacggagttt gtgattagtt atgaagactt ccatcgtcct     540

gaatttacca gcgtgccatt atttaaagat gaaatggtgc tggtagccag caaaaatcac     600

ccaacaatta agggcccgtt actgaaacat gatgtttata acgaacaaca tgcggcggtt     660

tcgctcgatc gtttcgcgtc atttagtcaa ccttggtatg acacggtaga taagcaagcc     720

agtatcgcgt atcagggcat ggcaatgatg agcgtactta gcgtggtgtc gcaaacgcat     780

ttggtcgcta ttgcgccgcg ttggctggct gaagagttcg ctgaatcctt agaattacag     840

gtattaccgc tgccgttaaa actaaatagc agaacctgtt atctctcctg gcatgaagct     900

gccgggcgtg ataaaggcca tcagtggatg gaagagcaat tagtctcaat ttgcaaacgc     960

taa                                                                   963


<210> 7
<211> 314
<212> PRT
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Typhi CT18: STY0134
      LeuO

<400> 7
Met Pro Glu Val Lys Thr Glu Lys Pro His Leu Leu Asp Met Gly Lys 
1               5                   10                  15      
Pro Gln Leu Arg Met Val Asp Leu Asn Leu Leu Thr Val Phe Asp Ala 
            20                  25                  30          
Val Met Gln Glu Gln Asn Ile Thr Arg Ala Ala His Thr Leu Gly Met 
        35                  40                  45              
Ser Gln Pro Ala Val Ser Asn Ala Val Ala Arg Leu Lys Val Met Phe 
    50                  55                  60                  
Asn Asp Glu Leu Phe Val Arg Tyr Gly Arg Gly Ile Gln Pro Thr Ala 
65                  70                  75                  80  
Arg Ala Phe Gln Leu Phe Gly Ser Val Arg Gln Ala Leu Gln Leu Val 
                85                  90                  95      
Gln Asn Glu Leu Pro Gly Ser Gly Phe Glu Pro Thr Ser Ser Glu Arg 
            100                 105                 110         
Val Phe Asn Leu Cys Val Cys Ser Pro Leu Asp Asn Ile Leu Thr Ser 
        115                 120                 125             
Gln Ile Tyr Asn Arg Val Glu Lys Ile Ala Pro Asn Ile His Val Val 
    130                 135                 140                 
Phe Lys Ala Ser Leu Asn Gln Asn Thr Glu His Gln Leu Arg Tyr Gln 
145                 150                 155                 160 
Glu Thr Glu Phe Val Ile Ser Tyr Glu Glu Phe Arg Arg Pro Glu Phe 
                165                 170                 175     
Thr Ser Val Pro Leu Phe Lys Asp Glu Met Val Leu Val Ala Ser Arg 
            180                 185                 190         
Lys His Pro Arg Ile Ser Gly Pro Leu Leu Glu Gly Asp Val Tyr Asn 
        195                 200                 205             
Glu Gln His Ala Val Val Ser Leu Asp Arg Tyr Ala Ser Phe Ser Arg 
    210                 215                 220                 
Pro Trp Tyr Asp Thr Pro Asp Lys Gln Ser Ser Val Ala Tyr Gln Gly 
225                 230                 235                 240 
Met Ala Leu Ile Ser Val Leu Asn Val Val Ser Gln Thr His Leu Val 
                245                 250                 255     
Ala Ile Ala Pro Cys Trp Leu Ala Glu Glu Phe Ala Glu Ser Leu Glu 
            260                 265                 270         
Leu Gln Ile Leu Pro Leu Pro Leu Lys Leu Asn Ser Arg Thr Cys Tyr 
        275                 280                 285             
Leu Ser Trp His Glu Ala Ala Gly Arg Asp Lys Gly His Gln Trp Met 
    290                 295                 300                 
Glu Asp Leu Leu Val Ser Val Cys Lys Arg 
305                 310                 

<210> 8
<211> 945
<212> DNA
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Typhi CT18: STY0134
      LeuO

<400> 8
atgccagagg tcaaaaccga aaagccgcat cttttagata tgggcaaacc acagcttcgc      60

atggttgatt tgaacctatt gaccgtgttc gatgcggtaa tgcaagagca gaatattacg     120

cgcgccgccc acacgctggg aatgtcgcag cctgcggtca gtaacgccgt agcgcgtctg     180

aaggttatgt ttaatgacga actttttgtt cgatatggac gaggaattca gccgactgcc     240

cgtgcatttc agttatttgg ttcagtccgt caggcgttgc aattggtgca aaatgaattg     300

ccgggatcgg ggtttgagcc gaccagcagc gaacgtgtat tcaatctttg cgtgtgcagt     360

ccgctggata atatcctgac gtcacagatt tataatcgtg tagaaaaaat tgcgccaaat     420

attcatgtcg tttttaaagc gtcgttgaat cagaatactg agcatcagtt acgctatcag     480

gaaaccgagt tcgttattag ttatgaagaa ttccgtcgtc ctgagtttac cagcgtaccg     540

ctatttaaag atgaaatggt tttagtcgcc agccgaaaac acccgcgtat tagcggcccg     600

ctactggaag gcgatgttta taatgaacaa catgcggttg tttctctcga tcgttatgcg     660

tcatttagtc ggccgtggta tgacacgccg gataaacagt cgagcgtggc ttatcagggc     720

atggcgctta tcagcgttct gaacgtggtt tcgcagacgc atttggtcgc tattgccccg     780

tgctggctgg cggaagagtt tgcggagtcg ctggagctgc aaatactgcc gttgccttta     840

aaactgaata gccggacatg ctacctttcc tggcatgaag cggctgggcg tgataaaggg     900

catcaatgga tggaagattt attagtctct gtttgtaagc gataa                     945


<210> 9
<211> 314
<212> PRT
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Typhimurium LT2:
      STM0115 LeuO

<400> 9
Met Pro Glu Val Lys Thr Glu Lys Pro His Leu Leu Asp Met Gly Lys 
1               5                   10                  15      
Pro Gln Leu Arg Met Val Asp Leu Asn Leu Leu Thr Val Phe Asp Ala 
            20                  25                  30          
Val Met Gln Glu Gln Asn Ile Thr Arg Ala Ala His Thr Leu Gly Met 
        35                  40                  45              
Ser Gln Pro Ala Val Ser Asn Ala Val Ala Arg Leu Lys Val Met Phe 
    50                  55                  60                  
Asn Asp Glu Leu Phe Val Arg Tyr Gly Arg Gly Ile Gln Pro Thr Ala 
65                  70                  75                  80  
Arg Ala Phe Gln Leu Phe Gly Ser Val Arg Gln Ala Leu Gln Leu Val 
                85                  90                  95      
Gln Asn Glu Leu Pro Gly Ser Gly Phe Glu Pro Thr Ser Ser Glu Arg 
            100                 105                 110         
Val Phe Asn Leu Cys Val Cys Ser Pro Leu Asp Asn Ile Leu Thr Ser 
        115                 120                 125             
Gln Ile Tyr Asn Arg Val Glu Lys Ile Ala Pro Asn Ile His Val Val 
    130                 135                 140                 
Phe Lys Ala Ser Leu Asn Gln Asn Thr Glu His Gln Leu Arg Tyr Gln 
145                 150                 155                 160 
Glu Thr Glu Phe Val Ile Ser Tyr Glu Glu Phe Arg Arg Pro Glu Phe 
                165                 170                 175     
Thr Ser Val Pro Leu Phe Lys Asp Glu Met Val Leu Val Ala Ser Arg 
            180                 185                 190         
Lys His Pro Arg Ile Ser Gly Pro Leu Leu Glu Gly Asp Val Tyr Asn 
        195                 200                 205             
Glu Gln His Ala Val Val Ser Leu Asp Arg Tyr Ala Ser Phe Ser Gln 
    210                 215                 220                 
Pro Trp Tyr Asp Thr Pro Asp Lys Gln Ser Ser Val Ala Tyr Gln Gly 
225                 230                 235                 240 
Met Ala Leu Ile Ser Val Leu Asn Val Val Ser Gln Thr His Leu Val 
                245                 250                 255     
Ala Ile Ala Pro Arg Trp Leu Ala Glu Glu Phe Ala Glu Ser Leu Asp 
            260                 265                 270         
Leu Gln Ile Leu Pro Leu Pro Leu Lys Leu Asn Ser Arg Thr Cys Tyr 
        275                 280                 285             
Leu Ser Trp His Glu Ala Ala Gly Arg Asp Lys Gly His Gln Trp Met 
    290                 295                 300                 
Glu Asp Leu Leu Val Ser Val Cys Lys Arg 
305                 310                 

<210> 10
<211> 945
<212> DNA
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Typhimurium LT2:
      STM0115 LeuO

<400> 10
atgccagagg tcaaaaccga aaagccgcat cttttagata tgggcaaacc acagcttcgc      60

atggttgatt tgaacctatt gaccgtgttc gatgcggtaa tgcaagagca gaatattacg     120

cgcgccgccc acacgctggg aatgtcgcag cctgcggtca gtaacgccgt agcgcgtctg     180

aaggttatgt ttaatgacga actttttgtt cgatatggac gaggaattca gccgactgcc     240

cgtgcatttc agttatttgg ttcagtccgt caggcgttgc aattggtgca aaatgaattg     300

ccgggatcgg ggtttgagcc gaccagcagc gaacgtgtat tcaatctttg cgtgtgcagt     360

ccgctggata atatcctgac gtcacagatt tataatcgtg tagaaaaaat tgcgccaaat     420

attcatgtcg tttttaaagc gtcgttgaat cagaatactg agcatcagtt acgctatcag     480

gaaaccgagt tcgttattag ttatgaagaa ttccgtcgtc ctgagtttac cagcgtaccg     540

ctatttaaag atgaaatggt tttagtcgcc agccgaaaac acccgcgtat tagcggcccg     600

ctactggaag gcgatgttta taatgaacaa catgcggttg tttccctcga tcgttatgcg     660

tcatttagtc agccgtggta tgacacgccg gataaacagt cgagcgtggc ttatcagggc     720

atggcgctta tcagcgttct gaacgtggtt tcgcagacgc atttggtcgc tattgccccg     780

cgctggctgg cggaagagtt tgcggaatcg ctggatctgc aaatattgcc gttgccttta     840

aaactgaata gccggacatg ctacctttcc tggcatgaag cggctgggcg tgataaaggg     900

catcaatgga tggaagattt attagtctct gtttgtaagc gataa                     945


<210> 11
<211> 314
<212> PRT
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Paratyphi A ATCC9150:
      SPA0117 LeuO

<400> 11
Met Pro Glu Val Lys Thr Glu Lys Pro His Leu Leu Asp Met Gly Lys 
1               5                   10                  15      
Pro Gln Leu Arg Met Val Asp Leu Asn Leu Leu Thr Val Phe Asp Ala 
            20                  25                  30          
Val Met Gln Glu Gln Asn Ile Thr Arg Ala Ala His Thr Leu Gly Met 
        35                  40                  45              
Ser Gln Pro Ala Val Ser Asn Ala Val Ala Arg Leu Lys Val Met Phe 
    50                  55                  60                  
Asn Asp Glu Leu Phe Val Arg Tyr Gly Arg Gly Ile Gln Pro Thr Ala 
65                  70                  75                  80  
Arg Ala Phe Gln Leu Phe Gly Ser Val Arg Gln Ala Leu Gln Leu Val 
                85                  90                  95      
Gln Asn Glu Leu Pro Gly Ser Gly Phe Glu Pro Thr Ser Ser Glu Arg 
            100                 105                 110         
Val Phe Asn Leu Cys Val Cys Ser Pro Leu Asp Asn Ile Leu Thr Ser 
        115                 120                 125             
Gln Ile Tyr Asn Arg Val Glu Lys Ile Ala Pro Asn Ile His Val Val 
    130                 135                 140                 
Phe Lys Ala Ser Leu Asn Gln Asn Thr Glu His Gln Leu Arg Tyr Gln 
145                 150                 155                 160 
Glu Thr Glu Phe Val Ile Ser Tyr Glu Glu Phe Arg Arg Pro Glu Phe 
                165                 170                 175     
Thr Ser Val Pro Leu Phe Lys Asp Glu Met Val Leu Val Ala Ser Arg 
            180                 185                 190         
Lys His Pro Arg Ile Ser Gly Pro Leu Leu Glu Gly Asp Val Tyr Asn 
        195                 200                 205             
Glu Gln His Ala Val Val Ser Leu Asp Arg Tyr Ala Ser Phe Ser Gln 
    210                 215                 220                 
Pro Trp Tyr Asp Thr Pro Asp Lys Gln Ser Ser Val Ala Tyr Gln Gly 
225                 230                 235                 240 
Met Ala Leu Ile Ser Val Leu Asn Val Val Ser Gln Thr His Leu Val 
                245                 250                 255     
Ala Ile Ala Pro Arg Trp Leu Ala Glu Glu Phe Ala Glu Ser Leu Glu 
            260                 265                 270         
Leu Gln Ile Leu Pro Leu Pro Leu Lys Leu Asn Ser Arg Thr Cys Tyr 
        275                 280                 285             
Leu Ser Trp His Glu Ala Ala Gly Arg Asp Lys Gly His Gln Trp Met 
    290                 295                 300                 
Glu Asp Leu Leu Val Ser Val Cys Lys Arg 
305                 310                 

<210> 12
<211> 945
<212> DNA
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Paratyphi A ATCC9150:
      SPA0117 LeuO

<400> 12
atgccagagg tcaaaaccga aaagccgcat cttttagata tgggcaaacc acagcttcgc      60

atggttgatt tgaacctatt gaccgtgttc gatgcggtaa tgcaagagca gaatattacg     120

cgcgccgccc acacgctggg aatgtcgcag cctgcggtca gtaacgccgt agcgcgtctg     180

aaggttatgt ttaatgacga actttttgtt cgatatggac gaggaattca gccgactgcc     240

cgtgcatttc agttatttgg ttcagtccgt caggcgttgc aattggtgca aaatgaattg     300

ccgggatcag ggtttgagcc gaccagcagc gaacgtgtat tcaatctttg cgtgtgcagt     360

ccgctggata atatcctgac gtcacagatt tataatcgtg tagaaaaaat tgcgccaaat     420

attcatgtcg tttttaaagc gtcgttgaat cagaatactg agcatcagtt acgctatcag     480

gaaaccgagt tcgttattag ttatgaagaa ttccgtcgtc ctgagtttac cagcgtaccg     540

ctatttaaag atgaaatggt tttagtcgcc agccgaaaac acccgcgtat tagcggcccg     600

ctactggaag gcgatgttta taatgaacaa catgcggttg tttctctcga tcgttatgcg     660

tcatttagtc agccgtggta tgacacgccg gataaacagt cgagcgtggc ttatcagggc     720

atggcgctta tcagcgttct gaacgtggtt tcgcagacgc atttggtcgc tattgccccg     780

cgctggctgg cggaagagtt tgcggagtcg ctggagctgc aaatactgcc gttgccttta     840

aaactgaata gccggacatg ctacctttcc tggcatgaag cggctgggcg tgataaaggg     900

catcaatgga tggaagattt attagtttct gtttgtaagc gataa                     945


<210> 13
<211> 314
<212> PRT
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Enteritidis
      OLF-SE1-1019-1: IY59_00600 LeuO

<400> 13
Met Pro Glu Val Lys Thr Glu Lys Pro His Leu Leu Asp Met Gly Lys 
1               5                   10                  15      
Pro Gln Leu Arg Met Val Asp Leu Asn Leu Leu Thr Val Phe Asp Ala 
            20                  25                  30          
Val Met Gln Glu Gln Asn Ile Thr Arg Ala Ala His Thr Leu Gly Met 
        35                  40                  45              
Ser Gln Pro Ala Val Ser Asn Ala Val Ala Arg Leu Lys Val Met Phe 
    50                  55                  60                  
Asn Asp Glu Leu Phe Val Arg Tyr Gly Arg Gly Ile Gln Pro Thr Ala 
65                  70                  75                  80  
Arg Ala Phe Gln Leu Phe Gly Ser Val Arg Gln Ala Leu Gln Leu Val 
                85                  90                  95      
Gln Asn Glu Leu Pro Gly Ser Gly Phe Glu Pro Thr Ser Ser Glu Arg 
            100                 105                 110         
Val Phe Asn Leu Cys Val Cys Ser Pro Leu Asp Asn Ile Leu Thr Ser 
        115                 120                 125             
Gln Ile Tyr Asn Arg Val Glu Lys Ile Ala Pro Asn Ile His Val Val 
    130                 135                 140                 
Phe Lys Ala Ser Leu Asn Gln Asn Thr Glu His Gln Leu Arg Tyr Gln 
145                 150                 155                 160 
Glu Thr Glu Phe Val Ile Ser Tyr Glu Glu Phe Arg Arg Pro Glu Phe 
                165                 170                 175     
Thr Ser Val Pro Leu Phe Lys Asp Glu Met Val Leu Val Ala Ser Arg 
            180                 185                 190         
Lys His Pro Arg Ile Ser Gly Pro Leu Leu Glu Gly Asp Val Tyr Asn 
        195                 200                 205             
Glu Gln His Ala Val Val Ser Leu Asp Arg Tyr Ala Ser Phe Ser Gln 
    210                 215                 220                 
Pro Trp Tyr Asp Thr Pro Asp Lys Gln Ser Ser Val Ala Tyr Gln Gly 
225                 230                 235                 240 
Met Ala Leu Ile Ser Val Leu Asn Val Val Ser Gln Thr His Leu Val 
                245                 250                 255     
Ala Ile Ala Pro Arg Trp Leu Ala Glu Glu Phe Ala Glu Ser Leu Asp 
            260                 265                 270         
Leu Gln Ile Leu Pro Leu Pro Leu Lys Leu Asn Ser Arg Thr Cys Tyr 
        275                 280                 285             
Leu Ser Trp His Glu Ala Ala Gly Arg Asp Lys Gly His Gln Trp Met 
    290                 295                 300                 
Glu Asp Leu Leu Val Ser Val Cys Lys Arg 
305                 310                 

<210> 14
<211> 945
<212> DNA
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Enteritidis
      OLF-SE1-1019-1: IY59_00600 LeuO

<400> 14
atgccagagg tcaaaaccga aaagccgcat cttttagata tgggcaaacc acagcttcgc      60

atggttgatt tgaacctatt gaccgtgttc gatgcggtaa tgcaagagca gaatattacg     120

cgcgccgccc acacgctggg aatgtcgcag cctgcggtca gtaacgccgt agcgcgtctg     180

aaggttatgt ttaatgacga actttttgtt cgatatggac gaggaattca gccgactgcc     240

cgtgcatttc agttatttgg ttcagtccgt caggcgttac aattggtgca aaatgaattg     300

ccgggatcgg ggtttgagcc gaccagcagc gaacgtgtat tcaatctttg cgtgtgcagt     360

ccgctggata atatcctgac gtcacagatt tataatcgtg tagaaaaaat tgcgccaaat     420

attcatgtcg tttttaaagc gtcgttgaat cagaatactg agcatcagtt acgctatcag     480

gaaaccgagt tcgttattag ttatgaagaa ttccgtcgtc ctgagtttac cagcgtaccg     540

ctatttaaag atgaaatggt tttagtcgcc agccgaaaac acccgcgtat tagcggcccg     600

ctactggaag gcgatgttta taatgaacaa catgcggttg tttctctcga tcgttatgcg     660

tcatttagtc agccgtggta tgacacgccg gataaacagt cgagcgtggc ttatcagggc     720

atggcgctta tcagcgttct gaacgtggtt tcgcagacgc atttggtcgc tattgccccg     780

cgctggctgg cggaagagtt tgcggaatcg ctggatctgc aaatattgcc gttgccttta     840

aaactgaata gccggacatg ctacctttcc tggcatgaag cggctgggcg tgataaaggg     900

caccaatgga tggaagattt attagtttct gtttgtaagc gataa                     945


<210> 15
<211> 373
<212> PRT
<213> Artificial Sequence


<220> 
<223> Shigella flexneri 301 (serotype 2a): SF0071

<400> 15
Met Thr His Ser Thr Ala Met Asp Ser Val Phe Ile Arg Thr Arg Ile 
1               5                   10                  15      
Phe Met Phe Ser Glu Phe Tyr Ser Phe Cys Phe Phe Leu Phe Tyr Met 
            20                  25                  30          
His Asp Lys Ser Tyr Ser Ser Gly Leu Phe Leu Cys Ile Pro Ile Arg 
        35                  40                  45              
Glu Arg Glu Leu Ser Val Thr Val Glu Leu Ser Met Pro Glu Val Gln 
    50                  55                  60                  
Thr Asp His Ser Glu Thr Ala Glu Leu Ser Lys Pro Gln Leu Arg Met 
65                  70                  75                  80  
Val Asp Leu Asn Leu Leu Thr Val Phe Asp Ala Val Met Gln Glu Gln 
                85                  90                  95      
Asn Ile Thr Arg Ala Ala His Val Leu Gly Met Ser Gln Pro Ala Val 
            100                 105                 110         
Ser Asn Ala Val Ala Arg Leu Lys Val Met Phe Asn Asp Glu Leu Phe 
        115                 120                 125             
Val Arg Tyr Gly Arg Gly Ile Gln Pro Thr Ala Arg Ala Phe Gln Leu 
    130                 135                 140                 
Phe Gly Ser Val Arg Gln Ala Leu Gln Leu Val Gln Asn Glu Leu Pro 
145                 150                 155                 160 
Gly Ser Gly Phe Glu Pro Ala Ser Ser Glu Arg Val Phe His Leu Cys 
                165                 170                 175     
Val Cys Ser Pro Leu Asp Ser Ile Leu Thr Ser Gln Ile Tyr Asn His 
            180                 185                 190         
Ile Glu Gln Ile Ala Pro Asn Ile His Val Met Phe Lys Ser Ser Leu 
        195                 200                 205             
Asn Gln Asn Thr Glu His Gln Leu Arg Tyr Gln Glu Thr Glu Phe Val 
    210                 215                 220                 
Ile Ser Tyr Glu Asp Phe His Arg Pro Glu Phe Thr Ser Val Pro Leu 
225                 230                 235                 240 
Phe Lys Asp Glu Met Val Leu Val Ala Ser Lys Asn His Pro Thr Ile 
                245                 250                 255     
Lys Gly Pro Leu Leu Lys His Asp Val Tyr Asn Glu Gln His Ala Ala 
            260                 265                 270         
Val Ser Leu Asp Arg Phe Ala Ser Phe Ser Gln Pro Trp Tyr Asp Thr 
        275                 280                 285             
Val Asp Lys Gln Ala Ser Ile Ala Tyr Gln Gly Met Ala Met Met Ser 
    290                 295                 300                 
Val Leu Ser Val Val Ser Gln Thr His Leu Val Ala Ile Ala Pro Arg 
305                 310                 315                 320 
Trp Leu Ala Glu Glu Phe Ala Glu Ser Leu Glu Leu Gln Val Leu Pro 
                325                 330                 335     
Leu Pro Leu Lys Gln Asn Ser Arg Thr Cys Tyr Leu Ser Trp His Glu 
            340                 345                 350         
Ala Ala Gly Arg Asp Lys Gly His Gln Trp Met Glu Glu Gln Leu Val 
        355                 360                 365             
Ser Ile Cys Lys Arg 
    370             

<210> 16
<211> 1122
<212> DNA
<213> Artificial Sequence


<220> 
<223> Shigella flexneri 301 (serotype 2a): SF0071

<400> 16
atgactcatt ccacggcaat ggattctgtt tttatcagaa cccgtatctt tatgttttcc      60

gaattttact cattttgctt tttcttattt tatatgcatg ataaatcata ttcttcagga     120

ttatttctct gcattccaat aagggaaagg gagttaagtg tgacagtgga gttaagtatg     180

ccagaggtac aaacagatca ttcagagacg gcggagttaa gcaagccaca gctacgcatg     240

gtcgatctca acttattaac cgttttcgat gccgtgatgc aggagcaaaa cattacccgt     300

gccgctcatg ttctgggtat gtcgcaacct gcggtcagta acgctgttgc acgcctgaag     360

gtgatgttta atgacgagct ttttgttcgt tatggccgtg gtattcaacc gactgctcgc     420

gcatttcaac tttttggttc agttcgccag gcattgcaac tagtacaaaa tgaattgcct     480

ggttcaggtt ttgaacccgc gagcagtgaa cgtgtatttc atctttgtgt ttgcagcccg     540

ttagacagca ttctgacctc gcagatttat aatcacattg agcagattgc gccaaatata     600

catgttatgt tcaagtcttc attaaatcag aacactgaac atcagctgcg ttatcaggaa     660

acggagtttg tgattagtta tgaagacttc catcgtcctg aatttaccag cgtgccatta     720

tttaaagatg aaatggtgct ggtagccagc aaaaatcatc caacaattaa aggcccgtta     780

ctgaaacatg atgtttataa cgaacaacat gcggcggttt cgctcgatcg tttcgcgtca     840

tttagtcaac cttggtatga cacggtagat aagcaagcca gtatcgcgta tcagggcatg     900

gcaatgatga gcgtacttag cgtggtgtcg caaacgcatt tggtcgctat tgcgccgcgt     960

tggctggctg aagagttcgc tgaatcctta gaattacagg tattaccgct gccgttaaaa    1020

caaaacagca gaacctgtta tctctcttgg catgaagctg ccgggcgcga taaaggccat    1080

cagtggatgg aagaacaatt agtctcaatt tgcaaacgct aa                       1122


<210> 17
<211> 137
<212> PRT
<213> Artificial Sequence


<220> 
<223> Escherichia coli O157 H7 EDL933 (EHEC): Z2013 H-NS

<400> 17
Met Ser Glu Ala Leu Lys Ile Leu Asn Asn Ile Arg Thr Leu Arg Ala 
1               5                   10                  15      
Gln Ala Arg Glu Cys Thr Leu Glu Thr Leu Glu Glu Met Leu Glu Lys 
            20                  25                  30          
Leu Glu Val Val Val Asn Glu Arg Arg Glu Glu Glu Ser Ala Ala Ala 
        35                  40                  45              
Ala Glu Val Glu Glu Arg Thr Arg Lys Leu Gln Gln Tyr Arg Glu Met 
    50                  55                  60                  
Leu Ile Ala Asp Gly Ile Asp Pro Asn Glu Leu Leu Asn Ser Leu Ala 
65                  70                  75                  80  
Ala Val Lys Ser Gly Thr Lys Ala Lys Arg Ala Gln Arg Pro Ala Lys 
                85                  90                  95      
Tyr Ser Tyr Val Asp Glu Asn Gly Glu Thr Lys Thr Trp Thr Gly Gln 
            100                 105                 110         
Gly Arg Thr Pro Ala Val Ile Lys Lys Ala Met Asp Glu Gln Gly Lys 
        115                 120                 125             
Ser Leu Asp Asp Phe Leu Ile Lys Gln 
    130                 135         

<210> 18
<211> 414
<212> DNA
<213> Artificial Sequence


<220> 
<223> Escherichia coli O157 H7 EDL933 (EHEC): Z2013 H-NS

<400> 18
atgagcgaag cacttaaaat tctgaacaac atccgtactc ttcgtgcgca ggcaagagaa      60

tgtacacttg aaacgctgga agaaatgctg gaaaaattag aagttgttgt taacgaacgt     120

cgcgaagaag aaagcgcggc tgctgctgaa gttgaagagc gcactcgtaa actgcagcaa     180

tatcgcgaaa tgctgatcgc tgacggtatt gacccgaacg agctgctgaa tagccttgcc     240

gccgttaaat ctggcaccaa agctaaacgt gctcagcgtc cggcaaaata tagctacgtt     300

gacgaaaacg gcgaaactaa aacctggact ggccagggcc gtactccagc tgtaatcaaa     360

aaagcaatgg atgagcaagg taaatccctc gacgatttcc tgatcaagca ataa           414


<210> 19
<211> 137
<212> PRT
<213> Artificial Sequence


<220> 
<223> Escherichia coli O127 H6 E2348/69 (EPEC): E2348C_1364 H-NS

<400> 19
Met Ser Glu Ala Leu Lys Ile Leu Asn Asn Ile Arg Thr Leu Arg Ala 
1               5                   10                  15      
Gln Ala Arg Glu Cys Thr Leu Glu Thr Leu Glu Glu Met Leu Glu Lys 
            20                  25                  30          
Leu Glu Val Val Val Asn Glu Arg Arg Glu Glu Glu Ser Ala Ala Ala 
        35                  40                  45              
Ala Glu Val Glu Glu Arg Thr Arg Lys Leu Gln Gln Tyr Arg Glu Met 
    50                  55                  60                  
Leu Ile Ala Asp Gly Ile Asp Pro Asn Glu Leu Leu Asn Ser Leu Ala 
65                  70                  75                  80  
Ala Val Lys Ser Gly Thr Lys Ala Lys Arg Ala Gln Arg Pro Ala Lys 
                85                  90                  95      
Tyr Ser Tyr Val Asp Glu Asn Gly Glu Thr Lys Thr Trp Thr Gly Gln 
            100                 105                 110         
Gly Arg Thr Pro Ala Val Ile Lys Lys Ala Met Asp Glu Gln Gly Lys 
        115                 120                 125             
Ser Leu Asp Asp Phe Leu Ile Lys Gln 
    130                 135         

<210> 20
<211> 414
<212> DNA
<213> Artificial Sequence


<220> 
<223> Escherichia coli O127 H6 E2348/69 (EPEC): E2348C_1364 H-NS

<400> 20
atgagcgaag cacttaaaat tctgaacaac atccgtactc ttcgtgcgca ggcaagagaa      60

tgtacacttg aaacgctgga agaaatgctg gaaaaattag aagttgttgt taacgaacgt     120

cgcgaagaag aaagcgcggc tgctgctgaa gttgaagagc gcactcgtaa actgcagcaa     180

tatcgcgaaa tgctgatcgc tgacggtatt gacccgaacg aactgctgaa tagccttgct     240

gccgttaaat ctggcaccaa agctaagcgt gctcagcgtc cggcaaaata tagctacgtt     300

gacgaaaacg gcgaaactaa aacctggact ggccagggcc gtactccagc tgtaatcaaa     360

aaagcaatgg atgagcaagg taaatccctc gacgatttcc tgatcaagca ataa           414


<210> 21
<211> 137
<212> PRT
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Typhi CT18: STY1299
      H-NS

<400> 21
Met Ser Glu Ala Leu Lys Ile Leu Asn Asn Ile Arg Thr Leu Arg Ala 
1               5                   10                  15      
Gln Ala Arg Glu Cys Thr Leu Glu Thr Leu Glu Glu Met Leu Glu Lys 
            20                  25                  30          
Leu Glu Val Val Val Asn Glu Arg Arg Glu Glu Glu Ser Ala Ala Ala 
        35                  40                  45              
Ala Glu Val Glu Glu Arg Thr Arg Lys Leu Gln Gln Tyr Arg Glu Met 
    50                  55                  60                  
Leu Ile Ala Asp Gly Ile Asp Pro Asn Glu Leu Leu Asn Ser Met Ala 
65                  70                  75                  80  
Ala Ala Lys Ser Gly Thr Lys Ala Lys Arg Ala Ala Arg Pro Ala Lys 
                85                  90                  95      
Tyr Ser Tyr Val Asp Glu Asn Gly Glu Thr Lys Thr Trp Thr Gly Gln 
            100                 105                 110         
Gly Arg Thr Pro Ala Val Ile Lys Lys Ala Met Glu Glu Gln Gly Lys 
        115                 120                 125             
Gln Leu Glu Asp Phe Leu Ile Lys Glu 
    130                 135         

<210> 22
<211> 414
<212> DNA
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Typhi CT18: STY1299
      H-NS

<400> 22
atgagcgaag cacttaaaat tctgaacaac atccgtactc ttcgtgcgca ggcaagagaa      60

tgtactctgg aaacgcttga agaaatgctg gaaaaattag aagttgtcgt taatgagcgt     120

cgtgaagaag aaagcgctgc tgctgctgaa gtggaagaac gcactcgtaa actgcaacag     180

tatcgtgaaa tgttaattgc cgacggcatt gacccgaatg aactgctgaa tagcatggct     240

gccgctaaat ccggtaccaa agctaaacgc gcagctcgtc cggctaaata tagctatgtt     300

gacgaaaacg gtgaaactaa aacctggact ggccagggtc gtacaccggc tgtaatcaaa     360

aaagcaatgg aagaacaagg taagcaactg gaagatttcc tgatcaagga ataa           414


<210> 23
<211> 137
<212> PRT
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Typhimurium LT2:
      STM1751 H-NS

<400> 23
Met Ser Glu Ala Leu Lys Ile Leu Asn Asn Ile Arg Thr Leu Arg Ala 
1               5                   10                  15      
Gln Ala Arg Glu Cys Thr Leu Glu Thr Leu Glu Glu Met Leu Glu Lys 
            20                  25                  30          
Leu Glu Val Val Val Asn Glu Arg Arg Glu Glu Glu Ser Ala Ala Ala 
        35                  40                  45              
Ala Glu Val Glu Glu Arg Thr Arg Lys Leu Gln Gln Tyr Arg Glu Met 
    50                  55                  60                  
Leu Ile Ala Asp Gly Ile Asp Pro Asn Glu Leu Leu Asn Ser Met Ala 
65                  70                  75                  80  
Ala Ala Lys Ser Gly Thr Lys Ala Lys Arg Ala Ala Arg Pro Ala Lys 
                85                  90                  95      
Tyr Ser Tyr Val Asp Glu Asn Gly Glu Thr Lys Thr Trp Thr Gly Gln 
            100                 105                 110         
Gly Arg Thr Pro Ala Val Ile Lys Lys Ala Met Glu Glu Gln Gly Lys 
        115                 120                 125             
Gln Leu Glu Asp Phe Leu Ile Lys Glu 
    130                 135         

<210> 24
<211> 414
<212> DNA
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Typhimurium LT2:
      STM1751 H-NS

<400> 24
atgagcgaag cacttaaaat tctgaacaac atccgtactc ttcgtgcgca ggcaagagaa      60

tgtactctgg aaacgcttga agaaatgctg gaaaaattag aagttgtcgt taatgagcgt     120

cgtgaagaag aaagcgctgc tgctgctgaa gtggaagaac gcactcgtaa actgcaacag     180

tatcgtgaaa tgttaattgc cgacggcatt gacccgaatg aactgctgaa tagcatggct     240

gccgctaaat ccggtaccaa agctaaacgc gcagctcgtc cggctaaata tagctatgtt     300

gacgaaaacg gtgaaactaa aacctggact ggccagggtc gtacaccggc tgtaatcaaa     360

aaagcaatgg aagaacaagg taagcaactg gaagatttcc tgatcaagga ataa           414


<210> 25
<211> 137
<212> PRT
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Enteritidis
      EC20090193: AU37_06605 H-NS

<400> 25
Met Ser Glu Ala Leu Lys Ile Leu Asn Asn Ile Arg Thr Leu Arg Ala 
1               5                   10                  15      
Gln Ala Arg Glu Cys Thr Leu Glu Thr Leu Glu Glu Met Leu Glu Lys 
            20                  25                  30          
Leu Glu Val Val Val Asn Glu Arg Arg Glu Glu Glu Ser Ala Ala Ala 
        35                  40                  45              
Ala Glu Val Glu Glu Arg Thr Arg Lys Leu Gln Gln Tyr Arg Glu Met 
    50                  55                  60                  
Leu Ile Ala Asp Gly Ile Asp Pro Asn Glu Leu Leu Asn Ser Met Ala 
65                  70                  75                  80  
Ala Ala Lys Ser Gly Thr Lys Ala Lys Arg Ala Ala Arg Pro Ala Lys 
                85                  90                  95      
Tyr Ser Tyr Val Asp Glu Asn Gly Glu Thr Lys Thr Trp Thr Gly Gln 
            100                 105                 110         
Gly Arg Thr Pro Ala Val Ile Lys Lys Ala Met Glu Glu Gln Gly Lys 
        115                 120                 125             
Gln Leu Glu Asp Phe Leu Ile Lys Glu 
    130                 135         

<210> 26
<211> 414
<212> DNA
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Enteritidis
      EC20090193: AU37_06605 H-NS

<400> 26
atgagcgaag cacttaaaat tctgaacaac atccgtactc ttcgtgcgca ggcaagagaa      60

tgtactctgg aaacgcttga agaaatgctg gaaaaattag aagttgtcgt taatgagcgt     120

cgtgaagaag aaagcgctgc tgctgctgaa gtggaagaac gcactcgtaa actgcaacag     180

tatcgtgaaa tgttaattgc cgacggcatt gacccgaatg aactgctgaa tagcatggct     240

gccgctaaat ccggtaccaa agctaaacgc gcagctcgtc cggctaaata tagctatgtt     300

gacgaaaacg gtgaaactaa aacctggact ggccagggtc gtacaccggc tgtaatcaaa     360

aaagcaatgg aagaacaagg taagcaactg gaagatttcc tgatcaagga ataa           414


<210> 27
<211> 137
<212> PRT
<213> Artificial Sequence


<220> 
<223> Shigella flexneri 2457T (serotype 2a): S1323 H-NS

<400> 27
Met Ser Glu Ala Leu Lys Ile Leu Asn Asn Ile Arg Thr Leu Arg Ala 
1               5                   10                  15      
Gln Ala Arg Glu Cys Thr Leu Glu Thr Leu Glu Glu Met Leu Glu Lys 
            20                  25                  30          
Leu Glu Val Val Val Asn Glu Arg Arg Glu Glu Glu Ser Ala Ala Ala 
        35                  40                  45              
Ala Glu Val Glu Glu Arg Thr Arg Lys Leu Gln Gln Tyr Arg Glu Met 
    50                  55                  60                  
Leu Ile Ala Asp Gly Ile Asp Pro Asn Glu Leu Leu Asn Ser Leu Ala 
65                  70                  75                  80  
Ala Val Lys Ser Gly Thr Lys Ala Lys Arg Ala Gln Arg Pro Ala Lys 
                85                  90                  95      
Tyr Ser Tyr Val Asp Glu Asn Gly Glu Thr Lys Thr Trp Thr Gly Gln 
            100                 105                 110         
Gly Arg Thr Pro Ala Val Ile Lys Lys Ala Met Asp Glu Gln Gly Lys 
        115                 120                 125             
Ser Leu Asp Asp Phe Leu Ile Lys Gln 
    130                 135         

<210> 28
<211> 414
<212> DNA
<213> Artificial Sequence


<220> 
<223> Shigella flexneri 2457T (serotype 2a): S1323 H-NS

<400> 28
atgagcgaag cacttaaaat tctgaacaac atccgtactc ttcgtgcgca ggcaagagaa      60

tgtacacttg aaacgctgga agaaatgctg gaaaaattag aagttgttgt taacgaacgt     120

cgcgaagaag aaagcgcggc tgctgctgaa gttgaagagc gcactcgtaa gctgcagcaa     180

tatcgcgaaa tgctgatcgc tgacggtatt gacccgaacg aactgctgaa tagccttgct     240

gccgttaaat ctggcaccaa agctaaacgt gctcagcgtc cggcaaaata tagctacgtt     300

gacgaaaacg gcgaaactaa aacctggact ggccaaggcc gtactccagc tgtaatcaaa     360

aaagcaatgg atgagcaagg taaatccctc gacgatttcc tgatcaagca ataa           414


<210> 29
<211> 134
<212> PRT
<213> Artificial Sequence


<220> 
<223> Escherichia coli K-12 MG1655: b2669 StpA

<400> 29
Met Ser Val Met Leu Gln Ser Leu Asn Asn Ile Arg Thr Leu Arg Ala 
1               5                   10                  15      
Met Ala Arg Glu Phe Ser Ile Asp Val Leu Glu Glu Met Leu Glu Lys 
            20                  25                  30          
Phe Arg Val Val Thr Lys Glu Arg Arg Glu Glu Glu Glu Gln Gln Gln 
        35                  40                  45              
Arg Glu Leu Ala Glu Arg Gln Glu Lys Ile Ser Thr Trp Leu Glu Leu 
    50                  55                  60                  
Met Lys Ala Asp Gly Ile Asn Pro Glu Glu Leu Leu Gly Asn Ser Ser 
65                  70                  75                  80  
Ala Ala Ala Pro Arg Ala Gly Lys Lys Arg Gln Pro Arg Pro Ala Lys 
                85                  90                  95      
Tyr Lys Phe Thr Asp Val Asn Gly Glu Thr Lys Thr Trp Thr Gly Gln 
            100                 105                 110         
Gly Arg Thr Pro Lys Pro Ile Ala Gln Ala Leu Ala Glu Gly Lys Ser 
        115                 120                 125             
Leu Asp Asp Phe Leu Ile 
    130                 

<210> 30
<211> 405
<212> DNA
<213> Artificial Sequence


<220> 
<223> Escherichia coli K-12 MG1655: b2669 StpA

<400> 30
atgtccgtaa tgttacaaag tttaaataac attcgcaccc tccgtgcgat ggctcgcgaa      60

ttctccattg acgttcttga agaaatgctc gaaaaattca gggttgtcac taaagaaaga     120

cgtgaagaag aagaacagca gcagcgtgaa ctggcagagc gccaggaaaa aattagcacc     180

tggctggagc tgatgaaagc tgacggaatt aacccggaag agttattggg taatagctct     240

gctgctgcac cacgcgctgg taaaaaacgc cagccgcgtc cggcgaaata taaattcacc     300

gatgttaacg gtgaaactaa aacctggacc ggtcagggcc gtacaccgaa gccaattgct     360

caggcgctgg cagaaggtaa atctctcgac gatttcctga tctaa                     405


<210> 31
<211> 133
<212> PRT
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Typhimurium LT2:
      STM2799 StpA

<400> 31
Met Asn Leu Met Leu Gln Asn Leu Asn Asn Ile Arg Thr Leu Arg Ala 
1               5                   10                  15      
Met Ala Arg Glu Phe Ser Ile Asp Val Leu Glu Glu Met Leu Glu Lys 
            20                  25                  30          
Phe Arg Val Val Thr Lys Glu Arg Arg Glu Glu Glu Glu Leu Gln Gln 
        35                  40                  45              
Arg Gln Leu Ala Glu Lys Gln Glu Lys Ile Asn Ala Phe Leu Glu Leu 
    50                  55                  60                  
Met Lys Ala Asp Gly Ile Asn Pro Glu Glu Leu Phe Ala Met Asp Ser 
65                  70                  75                  80  
Ala Met Pro Arg Ser Ala Lys Lys Arg Gln Pro Arg Pro Ala Lys Tyr 
                85                  90                  95      
Arg Phe Thr Asp Phe Asn Gly Glu Glu Lys Thr Trp Thr Gly Gln Gly 
            100                 105                 110         
Arg Thr Pro Lys Pro Ile Ala Gln Ala Leu Ala Ala Gly Lys Ser Leu 
        115                 120                 125             
Asp Asp Phe Leu Ile 
    130             

<210> 32
<211> 402
<212> DNA
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Typhimurium LT2:
      STM2799 StpA

<400> 32
atgaatttga tgttacagaa cttaaataat atccgcacgc tgcgcgctat ggctcgcgaa      60

ttctccattg acgttcttga agaaatgctc gaaaaattca gggttgtcac taaagaaaga     120

cgcgaagaag aagaattgca gcaacgccag cttgccgaga agcaggagaa aattaatgcc     180

tttctggagc tgatgaaagc agacggtatt aacccggaag agttatttgc catggattca     240

gcaatgccgc gttctgctaa aaagcgccag ccgcgtccgg caaaatatcg ttttactgat     300

ttcaatggcg aagaaaaaac ctggaccgga caaggtcgta cgcctaaacc gattgcccag     360

gcgctggcgg cggggaaatc tctggatgat ttcttaatct aa                        402


<210> 33
<211> 133
<212> PRT
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Typhimurium UK-1:
      STMUK_2788 StpA

<400> 33
Met Asn Leu Met Leu Gln Asn Leu Asn Asn Ile Arg Thr Leu Arg Ala 
1               5                   10                  15      
Met Ala Arg Glu Phe Ser Ile Asp Val Leu Glu Glu Met Leu Glu Lys 
            20                  25                  30          
Phe Arg Val Val Thr Lys Glu Arg Arg Glu Glu Glu Glu Leu Gln Gln 
        35                  40                  45              
Arg Gln Leu Ala Glu Lys Gln Glu Lys Ile Asn Ala Phe Leu Glu Leu 
    50                  55                  60                  
Met Lys Ala Asp Gly Ile Asn Pro Glu Glu Leu Phe Ala Met Asp Ser 
65                  70                  75                  80  
Ala Met Pro Arg Ser Ala Lys Lys Arg Gln Pro Arg Pro Ala Lys Tyr 
                85                  90                  95      
Arg Phe Thr Asp Phe Asn Gly Glu Glu Lys Thr Trp Thr Gly Gln Gly 
            100                 105                 110         
Arg Thr Pro Lys Pro Ile Ala Gln Ala Leu Ala Ala Gly Lys Ser Leu 
        115                 120                 125             
Asp Asp Phe Leu Ile 
    130             

<210> 34
<211> 402
<212> DNA
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Typhimurium UK-1:
      STMUK_2788 StpA

<400> 34
atgaatttga tgttacagaa cttaaataat atccgcacgc tgcgcgctat ggctcgcgaa      60

ttctccattg acgttcttga agaaatgctc gaaaaattca gggttgtcac taaagaaaga     120

cgcgaagaag aagaattgca gcaacgccag cttgccgaga agcaggagaa aattaatgcc     180

tttctggagc tgatgaaagc agacggtatt aacccggaag agttatttgc catggattca     240

gcaatgccgc gttctgctaa aaagcgccag ccgcgtccgg caaaatatcg ttttactgat     300

ttcaatggcg aagaaaaaac ctggaccgga caaggtcgta cgcctaaacc gattgcccag     360

gcgctggcgg cggggaaatc tctggatgat ttcttaatct aa                        402


<210> 35
<211> 134
<212> PRT
<213> Artificial Sequence


<220> 
<223> Shigella flexneri 2457T (serotype 2a): S2883 StpA

<400> 35
Met Ser Val Met Leu Gln Ser Leu Asn Asn Ile Arg Thr Leu Arg Ala 
1               5                   10                  15      
Met Ala Arg Glu Phe Ser Ile Asp Val Leu Glu Glu Met Leu Glu Lys 
            20                  25                  30          
Phe Arg Val Val Thr Lys Glu Arg Arg Glu Glu Glu Glu Gln Gln Gln 
        35                  40                  45              
Arg Glu Leu Ala Glu Arg Gln Glu Lys Ile Ser Thr Trp Leu Glu Leu 
    50                  55                  60                  
Met Lys Ala Asp Gly Ile Asn Pro Glu Glu Leu Leu Gly Asn Ser Ser 
65                  70                  75                  80  
Ala Ala Ala Pro Arg Ala Gly Lys Lys Arg Gln Pro Arg Pro Ala Lys 
                85                  90                  95      
Tyr Lys Phe Thr Asp Val Asn Gly Glu Thr Lys Thr Trp Thr Gly Gln 
            100                 105                 110         
Gly Arg Thr Pro Lys Pro Ile Ala Gln Ala Leu Ala Glu Gly Lys Ser 
        115                 120                 125             
Leu Asp Asp Phe Leu Ile 
    130                 

<210> 36
<211> 405
<212> DNA
<213> Artificial Sequence


<220> 
<223> Shigella flexneri 2457T (serotype 2a): S2883 StpA

<400> 36
atgtccgtaa tgttacaaag tttaaataac attcgcaccc tccgtgcgat ggctcgcgaa      60

ttctccattg acgttcttga agaaatgctc gaaaaattca gggttgtcac taaagaaaga     120

cgtgaagaag aagaacagca gcagcgtgaa ctggctgagc gtcaggaaaa aattagcacc     180

tggctggagc tgatgaaagc tgacggaatt aacccggaag agttattggg taatagctct     240

gctgctgcac cacgtgctgg taaaaaacgc cagccgcgtc cggcgaaata taaattcact     300

gatgttaacg gtgaaactaa aacctggacc ggtcagggcc gtacaccgaa gccaattgct     360

caggcgctgg cagaaggtaa atctctcgac gatttcctga tctaa                     405


<210> 37
<211> 164
<212> PRT
<213> Artificial Sequence


<220> 
<223> Escherichia coli K-12 MG1655: b0889 LRP

<400> 37
Met Val Asp Ser Lys Lys Arg Pro Gly Lys Asp Leu Asp Arg Ile Asp 
1               5                   10                  15      
Arg Asn Ile Leu Asn Glu Leu Gln Lys Asp Gly Arg Ile Ser Asn Val 
            20                  25                  30          
Glu Leu Ser Lys Arg Val Gly Leu Ser Pro Thr Pro Cys Leu Glu Arg 
        35                  40                  45              
Val Arg Arg Leu Glu Arg Gln Gly Phe Ile Gln Gly Tyr Thr Ala Leu 
    50                  55                  60                  
Leu Asn Pro His Tyr Leu Asp Ala Ser Leu Leu Val Phe Val Glu Ile 
65                  70                  75                  80  
Thr Leu Asn Arg Gly Ala Pro Asp Val Phe Glu Gln Phe Asn Thr Ala 
                85                  90                  95      
Val Gln Lys Leu Glu Glu Ile Gln Glu Cys His Leu Val Ser Gly Asp 
            100                 105                 110         
Phe Asp Tyr Leu Leu Lys Thr Arg Val Pro Asp Met Ser Ala Tyr Arg 
        115                 120                 125             
Lys Leu Leu Gly Glu Thr Leu Leu Arg Leu Pro Gly Val Asn Asp Thr 
    130                 135                 140                 
Arg Thr Tyr Val Val Met Glu Glu Val Lys Gln Ser Asn Arg Leu Val 
145                 150                 155                 160 
Ile Lys Thr Arg 
                

<210> 38
<211> 495
<212> DNA
<213> Artificial Sequence


<220> 
<223> Escherichia coli K-12 MG1655: b0889 LRP

<400> 38
atggtagata gcaagaagcg ccctggcaaa gatctcgacc gtatcgatcg taacattctt      60

aatgagttgc aaaaggatgg gcgtatttct aacgtcgagc tttctaaacg tgtgggactt     120

tccccaacgc cgtgccttga gcgtgtgcgt cggctggaaa gacaagggtt tattcagggc     180

tatacggcgc tgcttaaccc ccattatctg gatgcatcac ttctggtatt cgttgagatt     240

actctgaatc gtggcgcacc ggatgtgttt gaacaattca ataccgctgt acaaaaactt     300

gaagaaattc aggagtgtca tttagtatcc ggtgatttcg actacctgtt gaaaacacgc     360

gtgccggata tgtcagccta ccgtaagttg ctgggggaaa ccctgctgcg tctgcctggc     420

gtcaatgaca cacggacata cgttgttatg gaagaagtca agcagagtaa tcgtctggtt     480

attaagacgc gctaa                                                      495


<210> 39
<211> 164
<212> PRT
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Typhimurium DT104:
      DT104_09341 LRP

<400> 39
Met Val Asp Ser Lys Lys Arg Pro Gly Lys Asp Leu Asp Arg Ile Asp 
1               5                   10                  15      
Arg Asn Ile Leu Asn Glu Leu Gln Lys Asp Gly Arg Ile Ser Asn Val 
            20                  25                  30          
Glu Leu Ser Lys Arg Val Gly Leu Ser Pro Thr Pro Cys Leu Glu Arg 
        35                  40                  45              
Val Arg Arg Leu Glu Arg Gln Gly Phe Ile Gln Gly Tyr Thr Ala Leu 
    50                  55                  60                  
Leu Asn Pro His Tyr Leu Asp Ala Ser Leu Leu Val Phe Val Glu Ile 
65                  70                  75                  80  
Thr Leu Asn Arg Gly Ala Pro Asp Val Phe Glu Gln Phe Asn Ala Ala 
                85                  90                  95      
Val Gln Lys Leu Glu Glu Ile Gln Glu Cys His Leu Val Ser Gly Asp 
            100                 105                 110         
Phe Asp Tyr Leu Leu Lys Thr Arg Val Pro Asp Met Ser Ala Tyr Arg 
        115                 120                 125             
Lys Leu Leu Gly Glu Thr Leu Leu Arg Leu Pro Gly Val Asn Asp Thr 
    130                 135                 140                 
Arg Thr Tyr Val Val Met Glu Glu Val Lys Gln Ser Asn Arg Leu Val 
145                 150                 155                 160 
Ile Lys Thr Arg 
                

<210> 40
<211> 495
<212> DNA
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Typhimurium DT104:
      DT104_09341 LRP

<400> 40
atggtagata gcaagaagcg ccctggcaaa gatctcgacc gtatcgatcg taacattctt      60

aatgaactgc aaaaggatgg gcgtatttcc aacgtcgagc tttctaaacg agtaggactt     120

tcgccgacac cttgccttga gcgtgtgcgt cggctggagc gacaggggtt tatccagggc     180

tatacggcgc tgttgaaccc gcattatctg gatgcgtcac ttctggtatt cgttgagatt     240

accttaaatc gcggcgcgcc ggatgtgttt gaacagttta atgccgccgt gcaaaagctt     300

gaagagattc aggagtgtca tttggtttcc ggcgatttcg actacctgtt gaaaacccgt     360

gtaccggata tgtcagcgta tcgaaaacta ttgggagaga cgttgctgcg cttgccaggt     420

gtgaacgaca cccgaactta cgtagtgatg gaagaggtaa aacagagtaa tcgtctggtt     480

attaagacac gctaa                                                      495


<210> 41
<211> 164
<212> PRT
<213> Artificial Sequence


<220> 
<223> Shigella flexneri 2457T (serotype 2a): S0889 LRP

<400> 41
Met Val Asp Ser Lys Lys Arg Pro Gly Lys Asp Leu Asp Arg Ile Asp 
1               5                   10                  15      
Arg Asn Ile Leu Asn Glu Leu Gln Lys Asp Gly Arg Ile Ser Asn Val 
            20                  25                  30          
Glu Leu Ser Lys Arg Val Gly Leu Ser Pro Thr Pro Cys Leu Glu Arg 
        35                  40                  45              
Val Arg Arg Leu Glu Arg Gln Gly Phe Ile Gln Gly Tyr Thr Ala Leu 
    50                  55                  60                  
Leu Asn Pro His Tyr Leu Asp Ala Ser Leu Leu Val Phe Val Glu Ile 
65                  70                  75                  80  
Thr Leu Asn Arg Gly Ala Pro Asp Val Phe Glu Gln Phe Asn Thr Ala 
                85                  90                  95      
Val Gln Lys Leu Glu Glu Ile Gln Glu Cys His Leu Val Ser Gly Asp 
            100                 105                 110         
Phe Asp Tyr Leu Leu Lys Thr Arg Val Pro Asp Met Ser Ala Tyr Arg 
        115                 120                 125             
Lys Leu Leu Gly Glu Thr Leu Leu Arg Leu Pro Gly Val Asn Asp Thr 
    130                 135                 140                 
Arg Thr Tyr Val Val Met Glu Glu Val Lys Gln Ser Asn Arg Leu Val 
145                 150                 155                 160 
Ile Lys Thr Arg 
                

<210> 42
<211> 495
<212> DNA
<213> Artificial Sequence


<220> 
<223> Shigella flexneri 2457T (serotype 2a): S0889 LRP

<400> 42
atggtagata gcaagaagcg ccctggcaaa gatctcgacc gtatcgatcg taacattctt      60

aatgagttgc aaaaggatgg gcgtatttct aacgtcgagc tttctaaacg tgtgggactt     120

tccccaacgc cgtgccttga gcgtgtgcgt cggctggaaa gacaagggtt tattcagggc     180

tatacggcgc tgcttaaccc ccattatctg gatgcatcac ttctggtatt cgttgagatt     240

actctgaatc gtggcgcacc ggatgtgttt gaacaattca ataccgctgt acaaaaactt     300

gaagaaattc aggagtgtca tttagtatct ggtgatttcg actacctgtt gaaaacacgc     360

gtgccggata tgtcagctta ccgtaagttg ctgggggaaa ccctgctgcg tctgcctggc     420

gtcaatgaca cacggacata cgttgttatg gaagaagtca agcagagtaa tcgtctggtt     480

attaagacgc gctaa                                                      495


<210> 43
<211> 210
<212> PRT
<213> Artificial Sequence


<220> 
<223> Escherichia coli K-12 W3110: JW5702 CRP

<400> 43
Met Val Leu Gly Lys Pro Gln Thr Asp Pro Thr Leu Glu Trp Phe Leu 
1               5                   10                  15      
Ser His Cys His Ile His Lys Tyr Pro Ser Lys Ser Lys Leu Ile His 
            20                  25                  30          
Gln Gly Glu Lys Ala Glu Thr Leu Tyr Tyr Ile Val Lys Gly Ser Val 
        35                  40                  45              
Ala Val Leu Ile Lys Asp Glu Glu Gly Lys Glu Met Ile Leu Ser Tyr 
    50                  55                  60                  
Leu Asn Gln Gly Asp Phe Ile Gly Glu Leu Gly Leu Phe Glu Glu Gly 
65                  70                  75                  80  
Gln Glu Arg Ser Ala Trp Val Arg Ala Lys Thr Ala Cys Glu Val Ala 
                85                  90                  95      
Glu Ile Ser Tyr Lys Lys Phe Arg Gln Leu Ile Gln Val Asn Pro Asp 
            100                 105                 110         
Ile Leu Met Arg Leu Ser Ala Gln Met Ala Arg Arg Leu Gln Val Thr 
        115                 120                 125             
Ser Glu Lys Val Gly Asn Leu Ala Phe Leu Asp Val Thr Gly Arg Ile 
    130                 135                 140                 
Ala Gln Thr Leu Leu Asn Leu Ala Lys Gln Pro Asp Ala Met Thr His 
145                 150                 155                 160 
Pro Asp Gly Met Gln Ile Lys Ile Thr Arg Gln Glu Ile Gly Gln Ile 
                165                 170                 175     
Val Gly Cys Ser Arg Glu Thr Val Gly Arg Ile Leu Lys Met Leu Glu 
            180                 185                 190         
Asp Gln Asn Leu Ile Ser Ala His Gly Lys Thr Ile Val Val Tyr Gly 
        195                 200                 205             
Thr Arg 
    210 

<210> 44
<211> 633
<212> DNA
<213> Artificial Sequence


<220> 
<223> Escherichia coli K-12 W3110: JW5702 CRP

<400> 44
atggtgcttg gcaaaccgca aacagacccg actctcgaat ggttcttgtc tcattgccac      60

attcataagt acccatccaa gagcaagctt attcaccagg gtgaaaaagc ggaaacgctg     120

tactacatcg ttaaaggctc tgtggcagtg ctgatcaaag acgaagaggg taaagaaatg     180

atcctctcct atctgaatca gggtgatttt attggcgaac tgggcctgtt tgaagagggc     240

caggaacgta gcgcatgggt acgtgcgaaa accgcctgtg aagtggctga aatttcgtac     300

aaaaaatttc gccaattgat tcaggtaaac ccggacattc tgatgcgttt gtctgcacag     360

atggcgcgtc gtctgcaagt cacttcagag aaagtgggca acctggcgtt cctcgacgtg     420

acgggccgca ttgcacagac tctgctgaat ctggcaaaac aaccagacgc tatgactcac     480

ccggacggta tgcaaatcaa aattacccgt caggaaattg gtcagattgt cggctgttct     540

cgtgaaaccg tgggacgcat tctgaagatg ctggaagatc agaacctgat ctccgcacac     600

ggtaaaacca tcgtcgttta cggcactcgt taa                                  633


<210> 45
<211> 210
<212> PRT
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Typhimurium DT104:
      DT104_34511 CRP

<400> 45
Met Val Leu Gly Lys Pro Gln Thr Asp Pro Thr Leu Glu Trp Phe Leu 
1               5                   10                  15      
Ser His Cys His Ile His Lys Tyr Pro Ser Lys Ser Thr Leu Ile His 
            20                  25                  30          
Gln Gly Glu Lys Ala Glu Thr Leu Tyr Tyr Ile Val Lys Gly Ser Val 
        35                  40                  45              
Ala Val Leu Ile Lys Asp Glu Glu Gly Lys Glu Met Ile Leu Ser Tyr 
    50                  55                  60                  
Leu Asn Gln Gly Asp Phe Ile Gly Glu Leu Gly Leu Phe Glu Glu Gly 
65                  70                  75                  80  
Gln Glu Arg Ser Ala Trp Val Arg Ala Lys Thr Ala Cys Glu Val Ala 
                85                  90                  95      
Glu Ile Ser Tyr Lys Lys Phe Arg Gln Leu Ile Gln Val Asn Pro Asp 
            100                 105                 110         
Ile Leu Met Arg Leu Ser Ser Gln Met Ala Arg Arg Leu Gln Val Thr 
        115                 120                 125             
Ser Glu Lys Val Gly Asn Leu Ala Phe Leu Asp Val Thr Gly Arg Ile 
    130                 135                 140                 
Ala Gln Thr Leu Leu Asn Leu Ala Lys Gln Pro Asp Ala Met Thr His 
145                 150                 155                 160 
Pro Asp Gly Met Gln Ile Lys Ile Thr Arg Gln Glu Ile Gly Gln Ile 
                165                 170                 175     
Val Gly Cys Ser Arg Glu Thr Val Gly Arg Ile Leu Lys Met Leu Glu 
            180                 185                 190         
Asp Gln Asn Leu Ile Ser Ala His Gly Lys Thr Ile Val Val Tyr Gly 
        195                 200                 205             
Thr Arg 
    210 

<210> 46
<211> 633
<212> DNA
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Typhimurium DT104:
      DT104_34511 CRP

<400> 46
atggtgcttg gcaaaccgca aacagacccg actcttgaat ggttcttgtc tcattgccac      60

attcataagt acccgtcaaa gagcacgctg attcaccagg gtgaaaaagc agaaacgctg     120

tactacatcg ttaaaggctc cgtggcagtg ctgatcaaag atgaagaagg gaaagaaatg     180

atcctttctt atctgaatca gggtgatttt attggtgaac tgggcctgtt tgaagaaggc     240

caggaacgca gcgcctgggt acgtgcgaaa accgcatgtg aggtcgctga aatttcctac     300

aaaaaatttc gccaattaat ccaggtcaac ccggatattc tgatgcgcct ctcttcccag     360

atggctcgtc gcttacaagt cacctctgaa aaagtaggta acctcgcctt ccttgacgtc     420

accgggcgta tcgctcagac gctgctgaat ctggcgaaac agcccgatgc catgacgcac     480

ccggatggga tgcagatcaa aatcactcgt caggaaatcg gccagatcgt cggctgctcc     540

cgcgaaaccg ttggtcgtat tttgaaaatg ctggaagatc aaaacctgat ctccgcgcat     600

ggcaagacca tcgtcgtcta cggcacccgt taa                                  633


<210> 47
<211> 210
<212> PRT
<213> Artificial Sequence


<220> 
<223> Shigella flexneri 2002017 (serotype Fxv): SFxv_3687 CRP

<400> 47
Met Val Leu Gly Lys Pro Gln Thr Asp Pro Thr Leu Glu Trp Phe Leu 
1               5                   10                  15      
Ser His Cys His Ile His Lys Tyr Pro Ser Lys Ser Thr Leu Ile His 
            20                  25                  30          
Gln Gly Glu Lys Ala Glu Thr Leu Tyr Tyr Ile Val Lys Gly Ser Val 
        35                  40                  45              
Ala Val Leu Ile Lys Asp Glu Glu Gly Lys Glu Met Ile Leu Ser Tyr 
    50                  55                  60                  
Leu Asn Gln Gly Asp Phe Ile Gly Glu Leu Gly Leu Phe Glu Glu Gly 
65                  70                  75                  80  
Gln Glu Arg Ser Ala Trp Val Arg Ala Lys Thr Ala Cys Glu Val Ala 
                85                  90                  95      
Glu Ile Ser Tyr Lys Lys Phe Arg Gln Leu Ile Gln Val Asn Pro Asp 
            100                 105                 110         
Ile Leu Met Arg Leu Ser Ala Gln Met Ala Arg Arg Leu Gln Val Thr 
        115                 120                 125             
Ser Glu Lys Val Gly Asn Leu Ala Phe Leu Asp Val Thr Gly Arg Ile 
    130                 135                 140                 
Ala Gln Thr Leu Leu Asn Leu Ala Lys Gln Pro Asp Ala Met Thr His 
145                 150                 155                 160 
Pro Asp Gly Met Gln Ile Lys Ile Thr Arg Gln Glu Ile Gly Gln Ile 
                165                 170                 175     
Val Gly Cys Ser Arg Glu Thr Val Gly Arg Ile Leu Lys Met Leu Glu 
            180                 185                 190         
Asp Gln Asn Leu Ile Ser Ala His Gly Lys Thr Ile Val Val Tyr Gly 
        195                 200                 205             
Thr Arg 
    210 

<210> 48
<211> 633
<212> DNA
<213> Artificial Sequence


<220> 
<223> Shigella flexneri 2002017 (serotype Fxv): SFxv_3687 CRP

<400> 48
atggtgcttg gcaaaccgca aacagacccg actctcgaat ggttcttgtc tcattgccac      60

attcataagt acccatccaa gagcacgctt attcaccagg gtgaaaaagc ggaaacgctg     120

tactacatcg ttaaaggctc tgtggcagtg ctgatcaaag acgaagaggg taaagaaatg     180

atcctctcct atctgaatca gggtgatttt attggcgaac tgggcctgtt tgaagagggc     240

caggaacgta gcgcatgggt acgtgcgaaa accgcctgtg aagtggctga aatttcgtac     300

aaaaaatttc gccaattgat tcaggtaaac ccggacattc tgatgcgtct gtctgcacag     360

atggcgcgtc gtctgcaagt cacttcagag aaagtgggca acctggcgtt cctcgacgtg     420

acgggccgca ttgcacagac tctgctgaac ctggcaaaac aaccagatgc tatgactcac     480

ccggacggta tgcaaatcaa aattacccgt caggaaatcg gtcagattgt cggctgttct     540

cgtgaaaccg tgggacgcat tctgaagatg ctggaagatc agaacctgat ctccgcacac     600

ggtaaaacca tcgtcgttta cggcactcgt taa                                  633


<210> 49
<211> 29
<212> DNA
<213> Artificial Sequence


<220> 
<223> K12 Repeat

<400> 49
cggtttatcc ccgctggcgc ggggaactc                                      29


<210> 50
<211> 28
<212> DNA
<213> Artificial Sequence


<220> 
<223> Repeat

<400> 50
ggtttatccc cgctggcgcg gggaacac                                       28


<210> 51
<211> 27
<212> DNA
<213> Artificial Sequence


<220> 
<223> Repeat

<400> 51
cggtttatcc ccgctggcgc ggggaac                                        27


<210> 52
<211> 29
<212> DNA
<213> Artificial Sequence


<220> 
<223> O157:H7 Repeat

<400> 52
cggtttatcc ccgctggcgc ggggaacac                                      29


<210> 53
<211> 29
<212> DNA
<213> Artificial Sequence


<220> 
<223> UK-1 Repeat

<400> 53
cggtttatcc ccgctggcgc ggggaacac                                      29


<210> 54
<211> 30
<212> DNA
<213> Artificial Sequence


<220> 
<223> E coli K12 CRISPR I leader sequence

<400> 54
ctaaaagtat acatttgttc ttaaagcatt                                     30


<210> 55
<211> 32
<212> DNA
<213> Artificial Sequence


<220> 
<223> E coli K12 CRISPR II leader sequence

<400> 55
tctaaacata acctattatt aattaatgat tt                                   32


<210> 56
<211> 887
<212> PRT
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Typhimurium 14028S
      Cas3

<400> 56
Met Ser Ile Tyr His Tyr Trp Gly Lys Ser Arg Arg Gly Glu Thr Asp 
1               5                   10                  15      
Gly Gly Asp Asp Tyr His Leu Leu Cys Trp His Ser Leu Asp Val Ala 
            20                  25                  30          
Ala Val Gly Tyr Trp Met Val Ile Asn Asn Ile Tyr Phe Ile Asp His 
        35                  40                  45              
Tyr Leu Lys Lys Leu Gly Ile Gln Asp Lys Glu Gln Ala Ala Gln Phe 
    50                  55                  60                  
Phe Ala Trp Ile Leu Cys Trp His Asp Ile Gly Lys Phe Ala His Ser 
65                  70                  75                  80  
Phe Gln Gln Leu Tyr Arg His Glu Ala Leu Asn Ile Phe Asn Glu Pro 
                85                  90                  95      
Thr Arg His Tyr Glu Lys Ile Ala His Thr Thr Leu Gly Tyr Met Leu 
            100                 105                 110         
Trp Asn Ser Trp Leu Ser Glu Cys Pro Glu Leu Phe Pro Pro Ser Ser 
        115                 120                 125             
Leu Ser Val Arg Lys Ser Lys Arg Val Met Ala Leu Trp Met Pro Val 
    130                 135                 140                 
Thr Thr Gly His His Gly Arg Pro Pro Glu Ala Ile Gln Glu Leu Asp 
145                 150                 155                 160 
His Phe Arg Gln Gln Asp Lys Asp Ala Ala Arg Asp Phe Leu Leu Arg 
                165                 170                 175     
Ile Lys Ala Leu Phe Pro Leu Ile Thr Leu Pro Glu Ala Trp Asp Glu 
            180                 185                 190         
Asp Glu Gly Ile Asp Gln Phe Gln Gln Leu Ser Trp Phe Ile Ser Ala 
        195                 200                 205             
Ala Val Val Leu Ala Asp Trp Thr Gly Ser Ala Ser Arg Tyr Phe Pro 
    210                 215                 220                 
Arg Thr Ala Glu Lys Met Pro Val Asp Thr Tyr Trp Gln Gln Ala Leu 
225                 230                 235                 240 
Ala Lys Ala Gln Thr Ala Ile Thr Leu Phe Pro Ser Ala Ala Asn Val 
                245                 250                 255     
Ser Ala Phe Thr Gly Ile Glu Thr Leu Phe Pro Phe Ile Gln His Pro 
            260                 265                 270         
Thr Pro Leu Gln Gln Lys Ala Leu Glu Leu Asp Ile Asn Val Asp Gly 
        275                 280                 285             
Ala Gln Leu Phe Ile Leu Glu Asp Val Thr Gly Ala Gly Lys Thr Glu 
    290                 295                 300                 
Ala Ala Leu Ile Leu Ala His Arg Leu Met Ala Ala Gly Lys Ala Gln 
305                 310                 315                 320 
Gly Leu Tyr Phe Gly Leu Pro Thr Met Ala Thr Ala Asn Ala Met Phe 
                325                 330                 335     
Glu Arg Met Ala Asn Thr Trp Leu Ala Leu Tyr Gln Pro Asp Ser Arg 
            340                 345                 350         
Pro Ser Leu Ile Leu Ala His Ser Ala Arg Arg Leu Met Asp Arg Phe 
        355                 360                 365             
Asn Gln Ser Ile Trp Ser Val Thr Leu Ser Gly Thr Glu Glu Pro Asp 
    370                 375                 380                 
Glu Ala Gln Pro Tyr Ser Gln Gly Cys Ala Ala Trp Phe Ala Asp Ser 
385                 390                 395                 400 
Asn Lys Lys Ala Leu Leu Ala Glu Val Gly Val Gly Thr Leu Asp Gln 
                405                 410                 415     
Ala Met Met Ala Val Met Pro Phe Lys His Asn Asn Leu Arg Leu Leu 
            420                 425                 430         
Gly Leu Ser Asn Lys Ile Leu Leu Ala Asp Glu Ile His Ala Cys Asp 
        435                 440                 445             
Ala Trp Met Ser Arg Ile Leu Glu Gly Leu Ile Glu Arg Gln Ala Ser 
    450                 455                 460                 
Asn Gly Asn Ala Thr Ile Leu Leu Ser Ala Thr Leu Ser Gln Gln Gln 
465                 470                 475                 480 
Arg Asp Lys Leu Val Ala Ala Phe Ser Arg Gly Val Arg Arg Ser Val 
                485                 490                 495     
Gln Ala Pro Leu Leu Gly His Asp Asp Tyr Pro Trp Leu Thr Gln Val 
            500                 505                 510         
Thr Gln Thr Glu Leu Ile Ser Gln Arg Val Asp Thr Arg Lys Glu Val 
        515                 520                 525             
Glu Arg Cys Val Asp Ile Gly Trp Leu His Ser Glu Glu Ala Cys Leu 
    530                 535                 540                 
Glu Arg Ile Gly Glu Ala Val Glu Lys Gly Asn Cys Ile Ala Trp Ile 
545                 550                 555                 560 
Arg Asn Ser Val Asp Asp Ala Ile Arg Ile Tyr Arg Gln Leu Gln Leu 
                565                 570                 575     
Ser Lys Val Val Val Thr Glu Asn Leu Leu Leu Phe His Ser Arg Phe 
            580                 585                 590         
Ala Phe Tyr Asp Arg Gln Arg Ile Glu Ser Gln Thr Leu Asn Leu Phe 
        595                 600                 605             
Gly Lys Gln Ser Gly Ala Gln Arg Ala Gly Lys Val Ile Ile Ala Thr 
    610                 615                 620                 
Gln Val Ile Glu Gln Ser Leu Asp Ile Asp Cys Asp Glu Met Ile Ser 
625                 630                 635                 640 
Asp Leu Ala Pro Val Asp Leu Leu Ile Gln Arg Ala Gly Arg Leu Gln 
                645                 650                 655     
Arg His Ile Arg Asp Arg Asn Gly Leu Val Lys Lys Ser Gly Gln Asp 
            660                 665                 670         
Glu Arg Glu Thr Pro Val Leu Arg Ile Leu Ala Pro Glu Trp Asp Asp 
        675                 680                 685             
Ala Pro Arg Glu Asn Trp Leu Ser Ser Ala Met Arg Asn Ser Ala Tyr 
    690                 695                 700                 
Val Tyr Pro Asp His Gly Arg Met Trp Leu Thr Gln Arg Ile Leu Arg 
705                 710                 715                 720 
Glu Gln Gly Thr Ile Arg Met Pro Gln Ser Ala Arg Leu Leu Ile Glu 
                725                 730                 735     
Ser Val Tyr Gly Glu Asp Val Asn Met Pro Val Gly Phe Ala Lys Thr 
            740                 745                 750         
Glu Gln Leu Gln Glu Gly Lys Phe Tyr Cys Asp Arg Ala Phe Ala Gly 
        755                 760                 765             
Gln Met Leu Leu Asn Phe Ala Pro Gly Tyr Cys Ala Glu Ile Ser Asp 
    770                 775                 780                 
Ser Leu Pro Glu Lys Met Ser Thr Arg Leu Ala Glu Glu Ser Val Thr 
785                 790                 795                 800 
Leu Trp Leu Ala Lys Ile Val Asp Ser Val Val Thr Pro Tyr Ala Ser 
                805                 810                 815     
Gly Glu His Ala Trp Glu Met Ser Val Leu Arg Val Arg Gln Ser Trp 
            820                 825                 830         
Trp Asn Lys His Lys Asp Glu Phe Glu Lys Leu Asp Gly Glu Pro Leu 
        835                 840                 845             
Arg Lys Trp Cys Ala Gln Gln His Gln Asp Lys Asp Phe Ala Thr Val 
    850                 855                 860                 
Ile Val Val Thr Asp Phe Ala Ala Cys Gly Tyr Ser Ala Asn Glu Gly 
865                 870                 875                 880 
Leu Ile Gly Met Met Gly Glu 
                885         

<210> 57
<211> 2664
<212> DNA
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Typhimurium 14028S
      Cas3 nucleotide sequence

<400> 57
gtgtcgatat atcactattg gggaaagtct cgacgaggag aaactgacgg cggtgatgat      60

taccatttgc tttgctggca ttctttagat gttgcggctg tgggttactg gatggtgata     120

aataatattt attttattga ccactatcta aaaaaattag gcatccagga taaggagcag     180

gcggcgcaat tttttgcctg gattttatgt tggcatgata ttggaaagtt tgctcattcc     240

ttccagcaac tataccgtca tgaggcttta aatatcttta atgagcctac acggcattat     300

gaaaaaatcg cgcataccac gctgggatac atgttgtgga actcctggct aagtgaatgc     360

cctgaattgt ttcctccttc ttcgctttca gttcgtaaaa gtaagcgcgt tatggcgctt     420

tggatgccag tcactacagg tcatcatgga cgccctccag aggcaatcca ggagctggac     480

cattttcgcc agcaggataa agacgcggca agagattttc ttctgagaat aaaagcgctc     540

tttcctttaa ttactttgcc tgaagcctgg gatgaagatg agggtatcga ccaatttcag     600

caactttcct ggtttatttc cgctgcggtt gtactggctg actggactgg ttctgccagc     660

cgttattttc cgcgtactgc ggaaaaaatg cctgttgata cctactggca gcaagctctc     720

gctaaagcac aaactgccat cacgctattt ccctcagcgg cgaatgtgtc tgcctttacg     780

ggcatagaaa cgcttttccc ttttattcag catcccacac cgttacaaca aaaggcgctt     840

gagctggata tcaacgtgga tggcgcccaa ctctttattc ttgaagatgt caccggggcc     900

ggaaaaacag aggcggcgct catattagct catcgactga tggcggcagg taaagcgcag     960

ggactctatt ttggactgcc gacaatggcg acagccaacg cgatgtttga acgtatggcg    1020

aacacctggc tggcgctgta tcagccggac tcccgtccca gcctgattct ggcgcatagc    1080

gcgcgtcgct taatggatcg tttcaatcag tcaatatggt cggtcactct ttctggtacg    1140

gaagaacccg atgaagcgca gccttatagt cagggatgcg ccgcctggtt tgccgacagc    1200

aataaaaaag cgttgttggc ggaggttggc gtaggcacgt tggatcaggc gatgatggcg    1260

gtaatgccat ttaaacataa caacctgcgg ttactgggtc ttagcaacaa gatcttactg    1320

gctgatgaga tccatgcctg tgatgcctgg atgtcccgaa tacttgaagg tttgatcgaa    1380

cggcaggcca gtaatggcaa cgccactatt ctgttatctg cgacgctatc gcagcagcag    1440

cgagataagc tggtggcggc attttcccgt ggggtgaggc gtagtgtgca ggcgccgttg    1500

ctaggccatg acgattatcc ctggctgact caggtcacac aaacagagct gatttctcag    1560

cgggttgata cacgcaaaga ggttgagcgt tgcgtagata ttggctggct acatagtgaa    1620

gaggcgtgtc ttgaacgtat aggtgaagca gtggaaaaag gaaactgtat cgcctggata    1680

cgtaactccg ttgatgatgc gattcgtatc tatcgccagc ttcaactgag taaggtcgtc    1740

gtcacggaaa accttttact cttccatagt cgctttgctt tttacgatcg tcagcggatt    1800

gagtcacaga cgctgaatct ctttggcaaa cagagcggcg cgcaacgtgc cggtaaggtc    1860

attatcgcca cgcaggtcat cgaacaaagt ctggatattg actgcgatga gatgatctct    1920

gatttagcgc cggtggattt attaattcag cgggccggtc gactacagcg tcatattcgc    1980

gatcgtaacg gtctggtgaa aaagagtggg caggatgagc gagagacgcc agtgctgcgc    2040

attcttgctc cggagtggga tgacgcgccg cgagagaact ggttatccag cgccatgcgt    2100

aacagcgcct atgtctatcc cgatcatggg cgcatgtggc tgacacagcg catattacgt    2160

gagcagggga cgattcggat gccgcaatct gcccgattgt tgattgagtc ggtctacggc    2220

gaggatgtca acatgccggt tggatttgca aaaaccgagc aattgcagga aggcaaattt    2280

tattgcgacc gggcatttgc cggccagatg ctgcttaact ttgcgccggg ctactgtgct    2340

gaaattagcg attctttacc ggagaaaatg tcaacgcggc tggcggaaga gtctgtcacg    2400

ctgtggctgg cgaaaatcgt ggatagcgtc gtaacccctt atgccagcgg tgaacacgcc    2460

tgggagatga gcgtgctgcg agtacgtcag agctggtgga ataaacataa agacgagttt    2520

gaaaaattag acggcgaacc cttgcgtaag tggtgtgcgc aacagcatca ggataaggat    2580

tttgccacgg tgattgtggt gacggacttt gccgcttgtg gttattcggc gaatgaggga    2640

ttgattggca tgatggggga ataa                                           2664


<210> 58
<211> 899
<212> PRT
<213> Artificial Sequence


<220> 
<223> Escherichia coli Cas3

<400> 58
Met Arg Lys Tyr Pro Leu Ser Leu Leu Lys Asp Lys Asn Ile Val Thr 
1               5                   10                  15      
Phe Phe Asp Phe Trp Gly Lys Thr Arg Arg Gly Glu Lys Glu Gly Gly 
            20                  25                  30          
Asp Gly Tyr His Leu Leu Cys Trp His Ser Leu Asp Val Ala Ala Met 
        35                  40                  45              
Gly Tyr Leu Met Val Lys Arg Asn Cys Phe Gly Leu Ala Asp Tyr Phe 
    50                  55                  60                  
Arg Gln Leu Gly Ile Ser Asp Lys Glu Gln Ala Ala Gln Phe Phe Ala 
65                  70                  75                  80  
Trp Leu Leu Cys Trp His Asp Ile Gly Lys Phe Ala Arg Ser Phe Gln 
                85                  90                  95      
Gln Leu Tyr Leu Ala Pro Glu Leu Lys Ile Pro Glu Gly Ser Arg Lys 
            100                 105                 110         
Asn Tyr Glu Lys Ile Ser His Ser Thr Leu Gly Tyr Trp Leu Trp Asn 
        115                 120                 125             
Tyr Tyr Leu Ser Glu Cys Glu Glu Leu Leu Pro Ser Ser Ser Leu Ser 
    130                 135                 140                 
Ser Arg Lys Leu Thr Arg Val Ile Glu Met Trp Met Ser Ile Thr Thr 
145                 150                 155                 160 
Gly His His Gly Arg Pro Pro Asp Arg Ile Asp Glu Leu Asp Asn Phe 
                165                 170                 175     
Leu Pro Glu Asp Lys Ala Ala Ala Arg Asp Phe Leu Leu Glu Ile Lys 
            180                 185                 190         
Ala Leu Phe Pro Leu Ile Glu Ile Pro Thr Phe Trp Asp Asp Asp Glu 
        195                 200                 205             
Gly Val Glu Leu Leu Lys Gln Leu Ser Trp Tyr Ile Ser Ala Thr Val 
    210                 215                 220                 
Val Leu Ala Asp Trp Thr Gly Ser Ser Thr Arg Phe Phe Pro Arg Val 
225                 230                 235                 240 
Ala His Pro Met Asp Ile Lys Asp Tyr Trp Gln Lys Thr Leu Val Gln 
                245                 250                 255     
Ala Gln Asn Ala Leu Thr Val Phe Pro Pro Lys Ala Glu Thr Ala Pro 
            260                 265                 270         
Phe Thr Gly Ile Asn Thr Leu Phe Pro Phe Ile Glu His Pro Thr Pro 
        275                 280                 285             
Leu Gln Gln Lys Val Leu Asp Leu Asp Ile Ser Gln Pro Gly Pro Gln 
    290                 295                 300                 
Leu Phe Ile Leu Glu Asp Val Thr Gly Ala Gly Lys Thr Glu Ala Ala 
305                 310                 315                 320 
Leu Ile Leu Ala His Arg Leu Met Ala Ala Arg Lys Ala Gln Gly Leu 
                325                 330                 335     
Phe Phe Gly Leu Pro Thr Met Ala Thr Ala Asn Ala Met Tyr Asp Arg 
            340                 345                 350         
Leu Val Lys Thr Trp Leu Ala Phe Tyr Ser Pro Glu Ser Arg Pro Ser 
        355                 360                 365             
Leu Val Leu Ala His Ser Ala Arg Thr Leu Met Asp Arg Phe Asn Glu 
    370                 375                 380                 
Ser Leu Trp Ser Gly Asp Leu Val Gly Ser Glu Glu Pro Asp Glu Gln 
385                 390                 395                 400 
Thr Phe Ser Gln Gly Cys Ala Ala Trp Phe Ala Asn Ser Asn Lys Lys 
                405                 410                 415     
Ala Leu Leu Ala Glu Ile Gly Val Gly Thr Leu Asp Gln Ala Met Met 
            420                 425                 430         
Ala Val Met Pro Phe Lys His Asn Asn Leu Arg Leu Leu Gly Leu Ser 
        435                 440                 445             
Asn Lys Ile Leu Leu Ala Asp Glu Ile His Ala Cys Asp Ala Tyr Met 
    450                 455                 460                 
Ser Cys Ile Leu Glu Gly Leu Ile Glu Arg Gln Ala Arg Gly Gly Asn 
465                 470                 475                 480 
Ser Val Ile Leu Leu Ser Ala Thr Leu Ser Gln Gln Gln Arg Asp Lys 
                485                 490                 495     
Leu Val Ala Ala Phe Ala Arg Gly Thr Glu Gly Gln Gln Glu Ala Pro 
            500                 505                 510         
Phe Leu Glu Lys Asp Asp Tyr Pro Trp Leu Thr His Val Thr Lys Ser 
        515                 520                 525             
Asp Val Asn Ser His Arg Val Ala Thr Arg Lys Asp Val Glu Arg Ser 
    530                 535                 540                 
Val Ser Val Gly Trp Leu His Ser Glu Gln Glu Ser Ile Ala Arg Ile 
545                 550                 555                 560 
Glu Ser Ala Val Ser Gln Gly Lys Cys Ile Ala Trp Ile Arg Asn Ser 
                565                 570                 575     
Val Asp Asp Ala Ile Lys Val His Arg Gln Leu Leu Ala Arg Gly Val 
            580                 585                 590         
Ile Pro Ala Ser Ser Leu Ser Leu Phe His Ser Arg Phe Ala Phe Ser 
        595                 600                 605             
Asp Arg Gln Arg Ile Glu Met Glu Thr Leu Ala Arg Phe Gly Lys Glu 
    610                 615                 620                 
Asp Gly Ser Gln Arg Ala Gly Lys Val Leu Ile Cys Thr Gln Val Leu 
625                 630                 635                 640 
Glu Gln Ser Val Asp Cys Asp Leu Asp Glu Met Ile Ser Asp Leu Ala 
                645                 650                 655     
Pro Val Asp Leu Leu Ile Gln Arg Ala Gly Arg Leu Gln Arg His Ile 
            660                 665                 670         
Arg Asp Ile Asn Gly Gln Leu Lys Arg Asp Gly Lys Asp Glu Arg Ser 
        675                 680                 685             
Pro Pro Glu Leu Leu Ile Leu Ala Pro Val Trp Asp Asp Ala Pro Gly 
    690                 695                 700                 
Asp Glu Trp Phe Gly Ser Ala Met Arg Asn Ser Ala Tyr Val Tyr Pro 
705                 710                 715                 720 
Asp His Gly Arg Ile Trp Leu Thr Gln Arg Val Leu Arg Glu Gln Gly 
                725                 730                 735     
Ala Ile Gln Met Pro His Ala Ala Arg Leu Leu Ile Glu Ser Val Tyr 
            740                 745                 750         
Gly Glu Asp Val Val Met Pro Glu Gly Phe Ala Arg Ser Glu Gln Glu 
        755                 760                 765             
Gln Val Gly Lys Tyr Tyr Cys Asp Arg Ala Met Ala Lys Lys Phe Val 
    770                 775                 780                 
Leu Asn Phe Lys Pro Gly Tyr Ala Ala Asn Ile Asn Asp Tyr Leu Pro 
785                 790                 795                 800 
Glu Lys Leu Ser Thr Arg Leu Ala Glu Glu Ser Val Ser Leu Trp Leu 
                805                 810                 815     
Ala Thr Cys Ile Ala Gly Val Val Lys Pro Tyr Ala Thr Gly Ala His 
            820                 825                 830         
Ala Trp Glu Met Ser Val Val Arg Val Arg Arg Ser Trp Trp Lys Lys 
        835                 840                 845             
His Arg Asp Glu Phe Ser Leu Leu Glu Gly Glu Ala Phe Arg Gln Trp 
    850                 855                 860                 
Cys Ile Glu Gln Arg Gln Asp Pro Glu Met Ala Asn Val Ile Leu Val 
865                 870                 875                 880 
Thr Asp Asp Glu Ser Cys Gly Tyr Ser Ala Arg Glu Gly Leu Ile Gly 
                885                 890                 895     
Lys Val Asp 
            

<210> 59
<211> 2700
<212> DNA
<213> Artificial Sequence


<220> 
<223> Escherichia coli Cas3 Strain O157:H7 EDL933 (EHEC)

<400> 59
atgcgtaaat atcctttaag tttactgaag gataaaaata ttgtgacttt ctttgatttc      60

tggggaaaaa cccgacgtgg cgagaaagag ggtggcgacg gctatcacct tctttgctgg     120

cattcgctgg atgtggccgc aatgggctat ttaatggtta aaagaaattg cttcgggctg     180

gctgattact ttcgtcaatt agggatttct gacaaggaac aggcggctca atttttcgct     240

tggttgctgt gctggcacga tattggaaaa tttgcccgct cttttcagca actttacctg     300

gcccctgaac tcaagattcc ggaaggttcc agaaagaatt acgaaaagat ctctcattca     360

acgctgggtt actggctgtg gaattattat ttaagtgaat gtgaggagtt gcttccttca     420

tcttcactct cttctcgtaa acttacacgt gtaatagaga tgtggatgtc cataactacc     480

gggcatcatg gtcgaccacc tgaccgtatt gatgagctgg ataattttct gcctgaagac     540

aaagctgccg cgcgagattt tctccttgaa atcaaggcac tgtttccgct catagagatt     600

cccacattct gggatgatga cgagggcgtt gaacttttaa aacaactttc ctggtatatc     660

tctgcaacag tcgtactcgc agactggacg ggttcgtcaa cgcgattttt tccacgcgtc     720

gcacacccaa tggatattaa agattactgg cagaaaactt tagttcaggc tcaaaacgcc     780

ttaaccgtct ttcctccaaa agcagaaacc gcacctttca ccggaattaa tacgctgttt     840

ccttttattg agcacccgac accattacag caaaaggtac tggatctgga tatcagccag     900

ccagggccac agttatttat tctggaagac gtgactggcg caggtaaaac agaagcggcg     960

cttatcctgg cgcacaggtt gatggctgcg aggaaagcac agggtttgtt ttttggcctg    1020

ccaacaatgg caacggccaa tgccatgtac gatcggctgg tcaaaacctg gcttgctttc    1080

tattcgccag agtcccgccc cagcttggtg ctggcacaca gtgcccgcac attaatggac    1140

cgcttcaatg aatcactctg gtccggtgat ttagtcgggt cagaagaacc ggatgaacaa    1200

acattcagtc agggatgtgc ggcctggttt gccaacagta acaagaaggc gctactggct    1260

gaaattggcg tcggcacgct ggatcaggcg atgatggcag tgatgccgtt taaacataat    1320

aatctgcggc ttctggggtt gagtaacaaa atcctgctgg ctgatgagat ccatgcctgt    1380

gatgcttaca tgtcgtgcat tcttgaaggg ctgatcgagc ggcaggcgcg tggcggaaac    1440

agcgtcattt tgctttctgc tacgttatcc caacagcagc gcgacaaact cgtcgccgcc    1500

tttgcgcgtg gcacagaggg ccagcaagaa gctccgttcc ttgaaaagga tgattacccc    1560

tggctgacgc atgtcacgaa atccgatgtg aactcacacc gggtagcgac gcgcaaagac    1620

gttgagcgta gcgtcagcgt gggttggctt catagtgaac aagagagtat tgcgcgtatc    1680

gaatcggcgg taagtcaggg aaaatgcatc gcctggatcc ggaattctgt cgatgacgct    1740

attaaggttc atcgtcagct gcttgcccgc ggcgtcattc ccgcttccag cctttcactc    1800

tttcatagcc gctttgcttt tagcgatcgc cagcgaattg aaatggagac gctggcacgc    1860

tttggtaaag aagacggttc acagcgtgcc ggaaaagtcc tcatttgtac tcaggtctta    1920

gagcagagcg ttgattgtga cctggacgaa atgatctccg acctggcccc tgttgatttg    1980

ctgattcagc gagcggggcg attacagcgg catatccgcg atattaatgg tcagttaaag    2040

cgtgacggaa aagacgagcg ttcccctcct gaattgctga ttctggcccc cgtctgggac    2100

gacgctcctg gtgacgaatg gttcggcagt gccatgcgta acagtgcata tgtctatccc    2160

gatcatggac gaatctggct gacgcagcgt gtactgcgtg agcaaggcgc tattcaaatg    2220

ccacacgcag cccgccttct tattgaatca gtctacggtg aggacgtggt aatgccggaa    2280

ggatttgccc gcagcgagca ggagcaagtg ggcaaatatt actgcgatcg cgcaatggct    2340

aaaaagtttg tcctgaactt caagcctggc tatgccgcca atatcaacga ttaccttccg    2400

gaaaagctgt cgacacgtct ggctgaggaa tctgtttccc tgtggctggc tacctgtatt    2460

gccggtgtgg tgaagcctta tgccaccggt gctcacgcat gggaaatgag cgttgtcaga    2520

gtgcgtcgaa gctggtggaa aaaacatcgg gatgagtttt ctttactgga aggggaagcg    2580

ttcaggcagt ggtgcattga acagcggcaa gatccggaaa tggcaaacgt gattttagtc    2640

actgatgacg aaagttgcgg gtattcggcc agggagggat tgattggcaa ggttgattga    2700


<210> 60
<211> 888
<212> PRT
<213> Artificial Sequence


<220> 
<223> Escherichia coli Cas3 Strain K12

<400> 60
Met Glu Pro Phe Lys Tyr Ile Cys His Tyr Trp Gly Lys Ser Ser Lys 
1               5                   10                  15      
Ser Leu Thr Lys Gly Asn Asp Ile His Leu Leu Ile Tyr His Cys Leu 
            20                  25                  30          
Asp Val Ala Ala Val Ala Asp Cys Trp Trp Asp Gln Ser Val Val Leu 
        35                  40                  45              
Gln Asn Thr Phe Cys Arg Asn Glu Met Leu Ser Lys Gln Arg Val Lys 
    50                  55                  60                  
Ala Trp Leu Leu Phe Phe Ile Ala Leu His Asp Ile Gly Lys Phe Asp 
65                  70                  75                  80  
Ile Arg Phe Gln Tyr Lys Ser Ala Glu Ser Trp Leu Lys Leu Asn Pro 
                85                  90                  95      
Ala Thr Pro Ser Leu Asn Gly Pro Ser Thr Gln Met Cys Arg Lys Phe 
            100                 105                 110         
Asn His Gly Ala Ala Gly Leu Tyr Trp Phe Asn Gln Asp Ser Leu Ser 
        115                 120                 125             
Glu Gln Ser Leu Gly Asp Phe Phe Ser Phe Phe Asp Ala Ala Pro His 
    130                 135                 140                 
Pro Tyr Glu Ser Trp Phe Pro Trp Val Glu Ala Val Thr Gly His His 
145                 150                 155                 160 
Gly Phe Ile Leu His Ser Gln Asp Gln Asp Lys Ser Arg Trp Glu Met 
                165                 170                 175     
Pro Ala Ser Leu Ala Ser Tyr Ala Ala Gln Asp Lys Gln Ala Arg Glu 
            180                 185                 190         
Glu Trp Ile Ser Val Leu Glu Ala Leu Phe Leu Thr Pro Ala Gly Leu 
        195                 200                 205             
Ser Ile Asn Asp Ile Pro Pro Asp Cys Ser Ser Leu Leu Ala Gly Phe 
    210                 215                 220                 
Cys Ser Leu Ala Asp Trp Leu Gly Ser Trp Thr Thr Thr Asn Thr Phe 
225                 230                 235                 240 
Leu Phe Asn Glu Asp Ala Pro Ser Asp Ile Asn Ala Leu Arg Thr Tyr 
                245                 250                 255     
Phe Gln Asp Arg Gln Gln Asp Ala Ser Arg Val Leu Glu Leu Ser Gly 
            260                 265                 270         
Leu Val Ser Asn Lys Arg Cys Tyr Glu Gly Val His Ala Leu Leu Asp 
        275                 280                 285             
Asn Gly Tyr Gln Pro Arg Gln Leu Gln Val Leu Val Asp Ala Leu Pro 
    290                 295                 300                 
Val Ala Pro Gly Leu Thr Val Ile Glu Ala Pro Thr Gly Ser Gly Lys 
305                 310                 315                 320 
Thr Glu Thr Ala Leu Ala Tyr Ala Trp Lys Leu Ile Asp Gln Gln Ile 
                325                 330                 335     
Ala Asp Ser Val Ile Phe Ala Leu Pro Thr Gln Ala Thr Ala Asn Ala 
            340                 345                 350         
Met Leu Thr Arg Met Glu Ala Ser Ala Ser His Leu Phe Ser Ser Pro 
        355                 360                 365             
Asn Leu Ile Leu Ala His Gly Asn Ser Arg Phe Asn His Leu Phe Gln 
    370                 375                 380                 
Ser Ile Lys Ser Arg Ala Ile Thr Glu Gln Gly Gln Glu Glu Ala Trp 
385                 390                 395                 400 
Val Gln Cys Cys Gln Trp Leu Ser Gln Ser Asn Lys Lys Val Phe Leu 
                405                 410                 415     
Gly Gln Ile Gly Val Cys Thr Ile Asp Gln Val Leu Ile Ser Val Leu 
            420                 425                 430         
Pro Val Lys His Arg Phe Ile Arg Gly Leu Gly Ile Gly Arg Ser Val 
        435                 440                 445             
Leu Ile Val Asp Glu Val His Ala Tyr Asp Thr Tyr Met Asn Gly Leu 
    450                 455                 460                 
Leu Glu Ala Val Leu Lys Ala Gln Ala Asp Val Gly Gly Ser Val Ile 
465                 470                 475                 480 
Leu Leu Ser Ala Thr Leu Pro Met Lys Gln Lys Gln Lys Leu Leu Asp 
                485                 490                 495     
Thr Tyr Gly Leu His Thr Asp Pro Val Glu Asn Asn Ser Ala Tyr Pro 
            500                 505                 510         
Leu Ile Asn Trp Arg Gly Val Asn Gly Ala Gln Arg Phe Asp Leu Leu 
        515                 520                 525             
Ala His Pro Glu Gln Leu Pro Pro Arg Phe Ser Ile Gln Pro Glu Pro 
    530                 535                 540                 
Ile Cys Leu Ala Asp Met Leu Pro Asp Leu Thr Met Leu Glu Arg Met 
545                 550                 555                 560 
Ile Ala Ala Ala Asn Ala Gly Ala Gln Val Cys Leu Ile Cys Asn Leu 
                565                 570                 575     
Val Asp Val Ala Gln Val Cys Tyr Gln Arg Leu Lys Glu Leu Asn Asn 
            580                 585                 590         
Thr Gln Val Asp Ile Asp Leu Phe His Ala Arg Phe Thr Leu Asn Asp 
        595                 600                 605             
Arg Arg Glu Lys Glu Asn Arg Val Ile Ser Asn Phe Gly Lys Asn Gly 
    610                 615                 620                 
Lys Arg Asn Val Gly Arg Ile Leu Val Ala Thr Gln Val Val Glu Gln 
625                 630                 635                 640 
Ser Leu Asp Val Asp Phe Asp Trp Leu Ile Thr Gln His Cys Pro Ala 
                645                 650                 655     
Asp Leu Leu Phe Gln Arg Leu Gly Arg Leu His Arg His His Arg Lys 
            660                 665                 670         
Tyr Arg Pro Ala Gly Phe Glu Ile Pro Val Ala Thr Ile Leu Leu Pro 
        675                 680                 685             
Asp Gly Glu Gly Tyr Gly Arg His Glu His Ile Tyr Ser Asn Val Arg 
    690                 695                 700                 
Val Met Trp Arg Thr Gln Gln His Ile Glu Glu Leu Asn Gly Ala Ser 
705                 710                 715                 720 
Leu Phe Phe Pro Asp Ala Tyr Arg Gln Trp Leu Asp Ser Ile Tyr Asp 
                725                 730                 735     
Asp Ala Glu Met Asp Glu Pro Glu Trp Val Gly Asn Gly Met Asp Lys 
            740                 745                 750         
Phe Glu Ser Ala Glu Cys Glu Lys Arg Phe Lys Ala Arg Lys Val Leu 
        755                 760                 765             
Gln Trp Ala Glu Glu Tyr Ser Leu Gln Asp Asn Asp Glu Thr Ile Leu 
    770                 775                 780                 
Ala Val Thr Arg Asp Gly Glu Met Ser Leu Pro Leu Leu Pro Tyr Val 
785                 790                 795                 800 
Gln Thr Ser Ser Gly Lys Gln Leu Leu Asp Gly Gln Val Tyr Glu Asp 
                805                 810                 815     
Leu Ser His Glu Gln Gln Tyr Glu Ala Leu Ala Leu Asn Arg Val Asn 
            820                 825                 830         
Val Pro Phe Thr Trp Lys Arg Ser Phe Ser Glu Val Val Asp Glu Asp 
        835                 840                 845             
Gly Leu Leu Trp Leu Glu Gly Lys Gln Asn Leu Asp Gly Trp Val Trp 
    850                 855                 860                 
Gln Gly Asn Ser Ile Val Ile Thr Tyr Thr Gly Asp Glu Gly Met Thr 
865                 870                 875                 880 
Arg Val Ile Pro Ala Asn Pro Lys 
                885             

<210> 61
<211> 2667
<212> DNA
<213> Artificial Sequence


<220> 
<223> Escherichia coli Cas3 Nucleotide sequence

<400> 61
atggaacctt ttaaatatat atgccattac tggggaaaat cctcaaaaag cttgacgaaa      60

ggaaatgata ttcatctgtt aatttatcat tgccttgatg ttgctgctgt tgcagattgc     120

tggtgggatc aatcagtcgt actgcaaaat actttttgcc gaaatgaaat gctatcaaaa     180

cagagggtga aggcctggct gttatttttc attgctcttc atgatattgg aaagtttgat     240

atacgattcc aatataaatc agcagaaagt tggctgaaat taaatcctgc aacgccatca     300

cttaatggtc catcaacaca aatgtgccgt aaatttaatc atggtgcagc cggtctgtat     360

tggtttaacc aggattcact ttcagagcaa tctctcgggg attttttcag tttttttgat     420

gccgctcctc atccttatga gtcctggttt ccatgggtag aggccgttac aggacatcat     480

ggttttatat tacattccca ggatcaagat aagtcgcgtt gggaaatgcc agcttctctg     540

gcatcttatg ctgcgcaaga taaacaggct cgtgaggagt ggatatctgt actggaagca     600

ttatttttaa cgccagcggg gttatctata aacgatatac cacctgattg ttcatcactg     660

ttagcaggtt tttgctcgct tgctgactgg ttaggctcct ggactacaac gaataccttt     720

ctgtttaatg aggatgcgcc ttccgacata aatgctctga gaacgtattt ccaggaccga     780

cagcaggatg cgagccgggt attggagttg agtggacttg tatcaaataa gcgatgttat     840

gaaggtgttc atgcactact ggacaatggc tatcaaccca gacaattaca ggtgttagtt     900

gatgctcttc cagtagctcc cgggctgacg gtaatagagg cacctacagg ctccggtaaa     960

acggaaacag cgctggccta tgcttggaaa cttattgatc aacaaattgc ggatagtgtt    1020

atttttgccc tcccaacaca agctaccgcg aatgctatgc ttacgagaat ggaagcgagc    1080

gcgagccact tattttcatc cccaaatctt attcttgctc atggcaattc acggtttaac    1140

cacctctttc aatcaataaa atcacgcgcg attactgaac aggggcaaga agaagcgtgg    1200

gttcagtgtt gtcagtggtt gtcacaaagc aataagaaag tgtttcttgg gcaaatcggc    1260

gtttgcacga ttgatcaggt gttgatatcg gtattgccag ttaaacaccg ctttatccgt    1320

ggtttgggaa ttggtcgaag tgttttaatt gttgatgaag ttcatgctta cgacacctat    1380

atgaacggct tgctggaggc agtgctcaag gctcaggctg atgtgggagg gagtgttatt    1440

cttctttccg caaccctacc aatgaaacaa aaacagaaac ttctggatac ttatggtctg    1500

catacagatc cagtggaaaa taactccgca tatccactca ttaactggcg aggtgtgaat    1560

ggtgcgcaac gttttgatct gctagctcat ccagaacaac tcccgccccg cttttcgatt    1620

cagccagaac ctatttgttt agctgacatg ttacctgacc ttacgatgtt agagcgaatg    1680

atcgcagcgg caaacgcggg tgcacaggtc tgtcttattt gcaatttggt tgacgttgca    1740

caagtatgct accaacggct aaaggagcta aataacacgc aagtagatat agatttgttt    1800

catgcgcgct ttacgctgaa cgatcgtcgt gaaaaagaga atcgagttat tagcaatttc    1860

ggcaaaaatg ggaagcgaaa tgttggacgg atacttgtcg caacccaggt cgtggaacaa    1920

tcactcgacg ttgattttga ttggttaatt actcagcatt gtcctgcaga tttgcttttc    1980

caacgattgg gccgtttaca tcgccatcat cgcaaatatc gtcccgctgg ttttgagatt    2040

cctgttgcca ccattttgct gcctgatggc gagggttacg gacgacatga gcatatttat    2100

agcaacgtta gagtcatgtg gcggacgcag caacatattg aggagcttaa tggagcatcc    2160

ttatttttcc ctgatgctta ccggcaatgg ctggatagca tttacgatga tgcggaaatg    2220

gatgagccag aatgggtcgg caatggcatg gataaatttg aaagcgccga gtgtgaaaaa    2280

aggttcaagg ctcgcaaggt cctgcagtgg gctgaagaat atagcttgca ggataacgat    2340

gaaaccattc ttgcggtaac gagggatggg gaaatgagcc tgccattatt gccttatgta    2400

caaacgtctt caggtaaaca actgctcgat ggccaggtct acgaggacct aagtcatgaa    2460

cagcagtatg aggcgcttgc acttaatcgc gtcaatgtac ccttcacctg gaaacgtagt    2520

ttttctgaag tagtagatga agatgggtta ctttggctgg aagggaaaca gaatctggat    2580

ggatgggtct ggcagggtaa cagtattgtt attacctata caggggatga agggatgacc    2640

agagtcatcc ctgcaaatcc caaataa                                        2667


<210> 62
<211> 926
<212> PRT
<213> Artificial Sequence


<220> 
<223> Streptococcus thermophilus Cas3

<400> 62
Met Lys His Ile Asn Asp Tyr Phe Trp Ala Lys Lys Thr Glu Glu Asn 
1               5                   10                  15      
Ser Arg Leu Leu Trp Leu Pro Leu Thr Gln His Leu Glu Asp Thr Lys 
            20                  25                  30          
Asn Ile Ala Gly Leu Leu Trp Glu His Trp Leu Ser Glu Gly Gln Lys 
        35                  40                  45              
Val Leu Ile Glu Asn Ser Ile Asn Val Lys Ser Asn Ile Glu Asn Gln 
    50                  55                  60                  
Gly Lys Arg Leu Ala Gln Phe Leu Gly Ala Val His Asp Ile Gly Lys 
65                  70                  75                  80  
Ala Thr Pro Ala Phe Gln Thr Gln Lys Gly Tyr Ala Asn Ser Val Asp 
                85                  90                  95      
Leu Asp Ile Gln Leu Leu Glu Lys Leu Glu Arg Ala Gly Phe Ser Gly 
            100                 105                 110         
Ile Ser Ser Leu Gln Leu Ala Ser Pro Lys Lys Ser His His Ser Ile 
        115                 120                 125             
Ala Gly Gln Tyr Leu Leu Ser His Tyr Gly Val Asp Glu Asp Ile Ala 
    130                 135                 140                 
Thr Ile Ile Gly Gly His His Gly Arg Pro Val Asp Asp Leu Asp Gly 
145                 150                 155                 160 
Leu Asn Ser Gln Lys Ser Tyr Pro Ser Asn Tyr Tyr Gln Asp Glu Lys 
                165                 170                 175     
Lys Asp Ser Leu Val Tyr Gln Lys Trp Lys Ser Asn Gln Glu Ala Phe 
            180                 185                 190         
Leu Asn Trp Ala Leu Thr Glu Thr Gly Phe Asn Ser Val Ser Gln Leu 
        195                 200                 205             
Pro Lys Ile Lys Gln Pro Ala Gln Val Ile Leu Ser Gly Leu Leu Ile 
    210                 215                 220                 
Met Ser Asp Trp Ile Ala Ser Asn Glu His Phe Phe Pro Leu Leu Ser 
225                 230                 235                 240 
Leu Asp Glu Thr Asp Val Lys Asn Lys Ser Gln Arg Ile Glu Thr Gly 
                245                 250                 255     
Phe Lys Lys Trp Lys Lys Ser Asn Leu Trp Gln Pro Glu Thr Phe Val 
            260                 265                 270         
Asp Leu Val Thr Leu Tyr Gln Glu Arg Phe Gly Phe Ser Pro Arg Asn 
        275                 280                 285             
Phe Gln Leu Ile Leu Ser Gln Thr Ile Glu Lys Thr Thr Asn Pro Gly 
    290                 295                 300                 
Ile Val Ile Leu Glu Ala Pro Met Gly Ile Gly Lys Thr Glu Ala Ala 
305                 310                 315                 320 
Leu Ala Val Ser Glu Gln Leu Ser Ser Lys Lys Gly Cys Ser Gly Leu 
                325                 330                 335     
Phe Phe Gly Leu Pro Thr Gln Ala Thr Ser Asn Gly Ile Phe Lys Arg 
            340                 345                 350         
Ile Glu Gln Trp Thr Glu Asn Ile Lys Gly Asn Asn Ser Asp His Phe 
        355                 360                 365             
Ser Ile Gln Leu Val His Gly Lys Ala Ala Leu Asn Thr Asp Phe Ile 
    370                 375                 380                 
Glu Leu Leu Lys Gly Asn Thr Ile Asn Met Asp Asp Ser Glu Asn Gly 
385                 390                 395                 400 
Ser Ile Phe Val Asn Glu Trp Phe Ser Gly Arg Lys Thr Ser Ala Leu 
                405                 410                 415     
Asp Asp Phe Val Val Gly Thr Val Asp Gln Phe Leu Met Val Ala Leu 
            420                 425                 430         
Lys Gln Lys His Leu Ala Leu Arg His Leu Gly Phe Ser Lys Lys Val 
        435                 440                 445             
Ile Val Ile Asp Glu Val His Ala Tyr Asp Ala Tyr Met Ser Gln Tyr 
    450                 455                 460                 
Leu Leu Glu Ala Ile Arg Trp Met Gly Ala Tyr Gly Val Pro Val Ile 
465                 470                 475                 480 
Ile Leu Ser Ala Thr Leu Pro Ala Gln Gln Arg Glu Lys Leu Ile Lys 
                485                 490                 495     
Ser Tyr Met Ala Gly Met Gly Val Lys Trp Arg Asp Ile Glu Asn Ile 
            500                 505                 510         
Asp Gln Ile Lys Ile Asp Ala Tyr Pro Leu Ile Thr Tyr Asn Asp Gly 
        515                 520                 525             
Pro Asp Ile His Gln Val Lys Met Phe Glu Lys Gln Glu Gln Lys Asn 
    530                 535                 540                 
Ile Tyr Ile His Arg Leu Pro Glu Glu Gln Leu Phe Asp Ile Val Lys 
545                 550                 555                 560 
Glu Gly Leu Asp Asn Gly Gly Val Val Gly Ile Ile Val Asn Thr Val 
                565                 570                 575     
Arg Lys Ser Gln Glu Leu Ala Arg Asn Phe Ser Asp Ile Phe Gly Asp 
            580                 585                 590         
Asp Met Val Asp Leu Leu His Ser Asn Phe Ile Ala Thr Glu Arg Ile 
        595                 600                 605             
Arg Lys Glu Lys Asp Leu Leu Gln Glu Ile Gly Lys Lys Ala Ile Arg 
    610                 615                 620                 
Pro Pro Lys Lys Ile Ile Ile Gly Thr Gln Val Leu Glu Gln Ser Leu 
625                 630                 635                 640 
Asp Ile Asp Phe Asp Val Leu Ile Ser Asp Leu Ala Pro Met Asp Leu 
                645                 650                 655     
Leu Ile Gln Arg Ile Gly Arg Leu His Arg His Lys Ile Lys Arg Pro 
            660                 665                 670         
Gln Lys His Glu Val Ala Arg Phe Tyr Val Leu Gly Thr Phe Glu Glu 
        675                 680                 685             
Phe Asp Phe Asp Glu Gly Thr Arg Leu Val Tyr Gly Asp Tyr Leu Leu 
    690                 695                 700                 
Ala Arg Thr Gln Tyr Phe Leu Pro Asp Lys Ile Arg Leu Pro Asp Asp 
705                 710                 715                 720 
Ile Ser Pro Leu Val Gln Lys Val Tyr Asn Ser Asp Leu Thr Ile Thr 
                725                 730                 735     
Phe Pro Lys Pro Glu Leu His Lys Lys Tyr Leu Asp Ala Lys Ile Glu 
            740                 745                 750         
His Asp Asp Lys Ile Lys Asn Lys Glu Thr Lys Ala Lys Ser Tyr Arg 
        755                 760                 765             
Ile Ala Asn Pro Val Leu Lys Lys Ser Arg Val Arg Thr Asn Ser Leu 
    770                 775                 780                 
Ile Gly Trp Leu Lys Asn Leu His Pro Asn Asp Ser Glu Glu Lys Ala 
785                 790                 795                 800 
Tyr Ala Gln Val Arg Asp Ile Glu Asp Thr Val Glu Val Ile Ala Leu 
                805                 810                 815     
Lys Lys Ile Ser Asp Gly Tyr Gly Leu Phe Ile Glu Asn Lys Asp Ile 
            820                 825                 830         
Ser Gln Asn Ile Thr Asp Pro Ile Ile Ala Lys Lys Val Ala Gln Asn 
        835                 840                 845             
Thr Leu Arg Leu Pro Met Ser Leu Ser Lys Ala Tyr Asn Ile Asp Gln 
    850                 855                 860                 
Thr Ile Asn Glu Leu Glu Arg Tyr Asn Asn Ser His Leu Ser Gln Trp 
865                 870                 875                 880 
Gln Asn Ser Ser Trp Leu Lys Gly Ser Leu Gly Ile Ile Phe Asp Lys 
                885                 890                 895     
Asn Asn Glu Phe Ile Leu Asn Gly Phe Lys Leu Leu Tyr Asp Glu Lys 
            900                 905                 910         
Tyr Gly Val Thr Ile Glu Arg Leu Asp Lys Asn Glu Ser Val 
        915                 920                 925     

<210> 63
<211> 2781
<212> DNA
<213> Artificial Sequence


<220> 
<223> Streptococcus thermophilus Cas3

<400> 63
atgaaacata ttaatgatta tttttgggct aagaaaacag aggaaaatag tagacttctt      60

tggttaccat taactcaaca cttagaagac acgaaaaata ttgcaggcct cttatgggaa     120

cattggttaa gtgaaggaca aaaggtatta attgaaaatt ctattaatgt taaatcaaat     180

attgaaaacc aagggaaaag attggcacaa ttcctaggag ctgttcatga tatcggtaaa     240

gcaacaccag cttttcagac gcaaaaaggt tatgcaaatt cagtagattt ggatattcaa     300

ttgttagaaa aattggaacg cgcaggtttt tctggcatta gttctctcca actagcctcc     360

cccaaaaaga gtcatcatag cattgcaggt caatatttgt tatcccatta tggcgtggac     420

gaagatattg caacaattat tggtggacac catggacgac cagttgatga tttagacggt     480

ttaaattctc aaaaaagcta tccctccaat tattaccagg atgaaaagaa agatagtctc     540

gtttatcaga aatggaagtc aaatcaagaa gcttttttaa actgggcttt aacagaaaca     600

gggtttaatt ctgtgtctca gcttccaaaa atcaaacagc ctgctcaagt tattctatca     660

ggtttactca taatgtctga ctggattgct agtaatgagc atttttttcc tttgttaagt     720

ttggatgaaa ctgatgtgaa aaacaagagt caacgtattg aaactgggtt taaaaagtgg     780

aaaaaatcta acttgtggca acctgaaact ttcgttgacc ttgttactct ttatcaggaa     840

agatttggat ttagtccacg aaattttcag ctgatactct cacaaacaat cgaaaagacg     900

actaatcctg ggatagtgat actggaagcg ccaatgggaa tcgggaaaac agaggcggct     960

ctagcggtat cagagcagtt atctagtaaa aaaggatgta gtggattgtt ttttggattg    1020

cccacacaag caacctccaa tggaattttt aagaggattg aacagtggac agagaatata    1080

aagggtaaca attctgatca tttttccatt cagctggttc atggaaaagc agccttaaat    1140

acggatttta ttgagttact taaaggaaat acaattaata tggacgactc ggaaaacggc    1200

agtatttttg tcaatgagtg gttttctggg agaaaaactt cagcattaga tgattttgta    1260

gttgggacgg tcgaccaatt tttaatggtg gctttaaaac aaaaacattt ggccttacgt    1320

catttaggat ttagtaaaaa agttatcgtt attgatgaag tccacgctta tgatgcttat    1380

atgagccaat atttgttgga agctatcaga tggatgggag cttatggtgt tcctgtaatt    1440

attttatcag caactttacc tgcccaacaa agagaaaaac tcataaaaag ctatatggct    1500

ggaatgggag tgaaatggcg agatattgaa aatatagatc agataaaaat agacgcatac    1560

cctttaatca cttataatga cgggcctgac attcatcaag ttaaaatgtt cgaaaagcaa    1620

gaacaaaaaa atatctacat tcatcgttta ccagaagaac agttatttga tattgtaaaa    1680

gaaggtcttg acaatggtgg agtagttggg ataattgtca atacggtgag aaaatctcaa    1740

gaattggcaa gaaatttttc agatattttt ggagatgata tggtagattt gcttcattct    1800

aatttcatag caactgaaag aatccgaaaa gaaaaggatt tattgcaaga aattgggaaa    1860

aaagcaatac gtccaccaaa gaaaatcatt attggtacac aggtgcttga acagtcgtta    1920

gatattgatt ttgatgtact gataagcgac ttagcgccta tggatttact cattcaacgt    1980

atcggacgac tacatcgtca caaaatcaaa aggccccaaa agcacgaagt agcaagattt    2040

tatgttttag gaacatttga agagtttgat tttgatgaag gaacgcgttt ggtttatggg    2100

gactacctat tagctagaac tcagtacttt ttaccagata aaatacgact tcctgatgat    2160

atttcaccgc tagtccaaaa ggtttataat tcagacctaa caattacgtt tccaaagcca    2220

gaacttcata aaaaatattt ggatgctaaa atagaacatg atgataagat taaaaataaa    2280

gaaacaaagg caaagtcata ccgtattgct aatcctgtct taaaaaaatc gagagttcga    2340

actaacagtt tgattggttg gttaaagaac ctccatccaa atgatagtga agaaaaagca    2400

tatgctcaag ttcgagatat tgaagataca gttgaagtga ttgcattaaa aaaaatatct    2460

gatgggtatg gtttgttcat agaaaataaa gatatatctc agaacattac tgatcctata    2520

attgcaaaaa aggtagcaca aaatacttta cgacttccga tgagtttatc caaagcctat    2580

aatattgatc aaacgattaa tgagcttgaa agatataaca atagccactt aagtcaatgg    2640

caaaactcat catggttaaa gggatctctt gggattattt ttgataaaaa caatgagttt    2700

atactgaatg gatttaaact attatatgat gaaaaatatg gtgttaccat agaaaggttg    2760

gataagaatg agtcggttta a                                              2781


<210> 64
<211> 887
<212> PRT
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Typhimurium LT2 Cas 3

<400> 64
Met Ser Ile Tyr His Tyr Trp Gly Lys Ser Arg Arg Gly Glu Thr Asp 
1               5                   10                  15      
Gly Gly Asp Asp Tyr His Leu Leu Cys Trp His Ser Leu Asp Val Ala 
            20                  25                  30          
Ala Val Gly Tyr Trp Met Val Ile Asn Asn Ile Tyr Phe Ile Asp His 
        35                  40                  45              
Tyr Leu Lys Lys Leu Gly Ile Gln Asp Lys Glu Gln Ala Ala Gln Phe 
    50                  55                  60                  
Phe Ala Trp Ile Leu Cys Trp His Asp Ile Gly Lys Phe Ala His Ser 
65                  70                  75                  80  
Phe Gln Gln Leu Tyr Arg His Glu Ala Leu Asn Ile Phe Asn Glu Pro 
                85                  90                  95      
Thr Arg His Tyr Glu Lys Ile Ala His Thr Thr Leu Gly Tyr Met Leu 
            100                 105                 110         
Trp Asn Ser Trp Leu Ser Glu Cys Pro Glu Leu Phe Pro Pro Ser Ser 
        115                 120                 125             
Leu Ser Val Arg Lys Ser Lys Arg Val Met Ala Leu Trp Met Pro Val 
    130                 135                 140                 
Thr Thr Gly His His Gly Arg Pro Pro Glu Ala Ile Gln Glu Leu Asp 
145                 150                 155                 160 
His Phe Arg Gln Gln Asp Lys Asp Ala Ala Arg Asp Phe Leu Leu Arg 
                165                 170                 175     
Ile Lys Ala Leu Phe Pro Leu Ile Thr Leu Pro Glu Ala Trp Asp Glu 
            180                 185                 190         
Asp Glu Gly Ile Asp Gln Phe Gln Gln Leu Ser Trp Phe Ile Ser Ala 
        195                 200                 205             
Ala Val Val Leu Ala Asp Trp Thr Gly Ser Ala Ser Arg Tyr Phe Pro 
    210                 215                 220                 
Arg Thr Ala Glu Lys Met Pro Val Asp Thr Tyr Trp Gln Gln Ala Leu 
225                 230                 235                 240 
Ala Lys Ala Gln Thr Ala Ile Thr Leu Phe Pro Ser Ala Ala Asn Val 
                245                 250                 255     
Ser Ala Phe Thr Gly Ile Glu Thr Leu Phe Pro Phe Ile Gln His Pro 
            260                 265                 270         
Thr Pro Leu Gln Gln Lys Ala Leu Glu Leu Asp Ile Asn Val Asp Gly 
        275                 280                 285             
Ala Gln Leu Phe Ile Leu Glu Asp Val Thr Gly Ala Gly Lys Thr Glu 
    290                 295                 300                 
Ala Ala Leu Ile Leu Ala His Arg Leu Met Ala Ala Gly Lys Ala Gln 
305                 310                 315                 320 
Gly Leu Tyr Phe Gly Leu Pro Thr Met Ala Thr Ala Asn Ala Met Phe 
                325                 330                 335     
Glu Arg Met Ala Asn Thr Trp Leu Ala Leu Tyr Gln Pro Asp Ser Arg 
            340                 345                 350         
Pro Ser Leu Ile Leu Ala His Ser Ala Arg Arg Leu Met Asp Arg Phe 
        355                 360                 365             
Asn Gln Ser Ile Trp Ser Val Thr Leu Ser Gly Thr Glu Glu Pro Asp 
    370                 375                 380                 
Glu Ala Gln Pro Tyr Ser Gln Gly Cys Ala Ala Trp Phe Ala Asp Ser 
385                 390                 395                 400 
Asn Lys Lys Ala Leu Leu Ala Glu Val Gly Val Gly Thr Leu Asp Gln 
                405                 410                 415     
Ala Met Met Ala Val Met Pro Phe Lys His Asn Asn Leu Arg Leu Leu 
            420                 425                 430         
Gly Leu Ser Asn Lys Ile Leu Leu Ala Asp Glu Ile His Ala Cys Asp 
        435                 440                 445             
Ala Trp Met Ser Arg Ile Leu Glu Gly Leu Ile Glu Arg Gln Ala Ser 
    450                 455                 460                 
Asn Gly Asn Ala Thr Ile Leu Leu Ser Ala Thr Leu Ser Gln Gln Gln 
465                 470                 475                 480 
Arg Asp Lys Leu Val Ala Ala Phe Ser Arg Gly Val Arg Arg Ser Val 
                485                 490                 495     
Gln Ala Pro Leu Leu Gly His Asp Asp Tyr Pro Trp Leu Thr Gln Val 
            500                 505                 510         
Thr Gln Thr Glu Leu Ile Ser Gln Arg Val Asp Thr Arg Lys Glu Val 
        515                 520                 525             
Glu Arg Cys Val Asp Ile Gly Trp Leu His Ser Glu Glu Ala Cys Leu 
    530                 535                 540                 
Glu Arg Ile Gly Glu Ala Val Glu Lys Gly Asn Cys Ile Ala Trp Ile 
545                 550                 555                 560 
Arg Asn Ser Val Asp Asp Ala Ile Arg Ile Tyr Arg Gln Leu Gln Leu 
                565                 570                 575     
Ser Lys Val Val Val Thr Glu Asn Leu Leu Leu Phe His Ser Arg Phe 
            580                 585                 590         
Ala Phe Tyr Asp Arg Gln Arg Ile Glu Ser Gln Thr Leu Asn Leu Phe 
        595                 600                 605             
Gly Lys Gln Ser Gly Ala Gln Arg Ala Gly Lys Val Ile Ile Ala Thr 
    610                 615                 620                 
Gln Val Ile Glu Gln Ser Leu Asp Ile Asp Cys Asp Glu Met Ile Ser 
625                 630                 635                 640 
Asp Leu Ala Pro Val Asp Leu Leu Ile Gln Arg Ala Gly Arg Leu Gln 
                645                 650                 655     
Arg His Ile Arg Asp Arg Asn Gly Leu Val Lys Lys Ser Gly Gln Asp 
            660                 665                 670         
Glu Arg Glu Thr Pro Val Leu Arg Ile Leu Ala Pro Glu Trp Asp Asp 
        675                 680                 685             
Ala Pro Arg Glu Asn Trp Leu Ser Ser Ala Met Arg Asn Ser Ala Tyr 
    690                 695                 700                 
Val Tyr Pro Asp His Gly Arg Met Trp Leu Thr Gln Arg Ile Leu Arg 
705                 710                 715                 720 
Glu Gln Gly Thr Ile Arg Met Pro Gln Ser Ala Arg Leu Leu Ile Glu 
                725                 730                 735     
Ser Val Tyr Gly Glu Asp Val Asn Met Pro Val Gly Phe Ala Lys Thr 
            740                 745                 750         
Glu Gln Leu Gln Glu Gly Lys Phe Tyr Cys Asp Arg Ala Phe Ala Gly 
        755                 760                 765             
Gln Met Leu Leu Asn Phe Ala Pro Gly Tyr Cys Ala Glu Ile Ser Asp 
    770                 775                 780                 
Ser Leu Pro Glu Lys Met Ser Thr Arg Leu Ala Glu Glu Ser Val Thr 
785                 790                 795                 800 
Leu Trp Leu Ala Lys Ile Val Asp Ser Val Val Thr Pro Tyr Ala Ser 
                805                 810                 815     
Gly Glu His Ala Trp Glu Met Ser Val Leu Arg Val Arg Gln Ser Trp 
            820                 825                 830         
Trp Asn Lys His Lys Asp Glu Phe Glu Lys Leu Asp Gly Glu Pro Leu 
        835                 840                 845             
Arg Lys Trp Cys Ala Gln Gln His Gln Asp Lys Asp Phe Ala Thr Val 
    850                 855                 860                 
Ile Val Val Thr Asp Phe Ala Ala Cys Gly Tyr Ser Ala Asn Glu Gly 
865                 870                 875                 880 
Leu Ile Gly Met Met Gly Glu 
                885         

<210> 65
<211> 2664
<212> DNA
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Typhimurium LT2 Cas 3
      nucleotide sequence

<400> 65
gtgtcgatat atcactattg gggaaagtct cgacgaggag aaactgacgg cggtgatgat      60

taccatttgc tttgctggca ttctttagat gttgcggctg tgggttactg gatggtgata     120

aataatattt attttattga ccactatcta aaaaaattag gcatccagga taaggagcag     180

gcggcgcaat tttttgcctg gattttatgt tggcatgata ttggaaagtt tgctcattcc     240

ttccagcaac tataccgtca tgaggcttta aatatcttta atgagcctac acggcattat     300

gaaaaaatcg cgcataccac gctgggatac atgttgtgga actcctggct aagtgaatgc     360

cctgaattgt ttcctccttc ttcgctttca gttcgtaaaa gtaagcgcgt tatggcgctt     420

tggatgccag tcactacagg tcatcatgga cgccctccag aggcaatcca ggagctggac     480

cattttcgcc agcaggataa agacgcggca agagattttc ttctgagaat aaaagcgctc     540

tttcctttaa ttactttgcc tgaagcctgg gatgaagatg agggtatcga ccaatttcag     600

caactttcct ggtttatttc cgctgcggtt gtactggctg actggactgg ttctgccagc     660

cgttattttc cgcgtactgc ggaaaaaatg cctgttgata cctactggca gcaagctctc     720

gctaaagcac aaactgccat cacgctattt ccctcagcgg cgaatgtgtc tgcctttacg     780

ggcatagaaa cgcttttccc ttttattcag catcccacac cgttacaaca aaaggcgctt     840

gagctggata tcaacgtgga tggcgcccaa ctctttattc ttgaagatgt caccggggcc     900

ggaaaaacag aggcggcgct catattagct catcgactga tggcggcagg taaagcgcag     960

ggactctatt ttggactgcc gacaatggcg acagccaacg cgatgtttga acgtatggcg    1020

aacacctggc tggcgctgta tcagccggac tcccgtccca gcctgattct ggcgcatagc    1080

gcgcgtcgct taatggatcg tttcaatcag tcaatatggt cggtcactct ttctggtacg    1140

gaagaacccg atgaagcgca gccttatagt cagggatgcg ccgcctggtt tgccgacagc    1200

aataaaaaag cgttgttggc ggaggttggc gtaggcacgt tggatcaggc gatgatggcg    1260

gtaatgccat ttaaacataa caacctgcgg ttactgggtc ttagcaacaa gatcttactg    1320

gctgatgaga tccatgcctg tgatgcctgg atgtcccgaa tacttgaagg tttgatcgaa    1380

cggcaggcca gtaatggcaa cgccactatt ctgttatctg cgacgctatc gcagcagcag    1440

cgagataagc tggtggcggc attttcccgt ggggtgaggc gtagtgtgca ggcgccgttg    1500

ctaggccatg acgattatcc ctggctgact caggtcacac aaacagagct gatttctcag    1560

cgggttgata cacgcaaaga ggttgagcgt tgcgtagata ttggctggct acatagtgaa    1620

gaggcgtgtc ttgaacgtat aggtgaagca gtggaaaaag gaaactgtat cgcctggata    1680

cgtaactccg ttgatgatgc gattcgtatc tatcgccagc ttcaactgag taaggtcgtc    1740

gtcacggaaa accttttact cttccatagt cgctttgctt tttacgatcg tcagcggatt    1800

gagtcacaga cgctgaatct ctttggcaaa cagagcggcg cgcaacgtgc cggtaaggtc    1860

attatcgcca cgcaggtcat cgaacaaagt ctggatattg actgcgatga gatgatctct    1920

gatttagcgc cggtggattt attaattcag cgggccggtc gactacagcg tcatattcgc    1980

gatcgtaacg gtctggtgaa aaagagtggg caggatgagc gagagacgcc agtgctgcgc    2040

attcttgctc cggagtggga tgacgcgccg cgagagaact ggttatccag cgccatgcgt    2100

aacagcgcct atgtctatcc cgatcatggg cgcatgtggc tgacacagcg catattacgt    2160

gagcagggga cgattcggat gccgcaatct gcccgattgt tgattgagtc ggtctacggc    2220

gaggatgtca acatgccggt tggatttgca aaaaccgagc aattgcagga aggcaaattt    2280

tattgcgacc gggcatttgc cggccagatg ctgcttaact ttgcgccggg ctactgtgct    2340

gaaattagcg attctttacc ggagaaaatg tcaacgcggc tggcggaaga gtctgtcacg    2400

ctgtggctgg cgaaaatcgt ggatagcgtc gtaacccctt atgccagcgg tgaacacgcc    2460

tgggagatga gcgtgctgcg agtacgtcag agctggtgga ataaacataa agacgagttt    2520

gaaaaattag acggcgaacc cttgcgtaag tggtgtgcgc aacagcatca ggataaggat    2580

tttgccacgg tgattgtggt gacggacttt gccgcttgtg gttattcggc gaatgaggga    2640

ttgattggca tgatggggga ataa                                           2664


<210> 66
<211> 502
<212> PRT
<213> Artificial Sequence


<220> 
<223> Escherichia coli K-12 MG1655: b2760 CasA

<400> 66
Met Asn Leu Leu Ile Asp Asn Trp Ile Pro Val Arg Pro Arg Asn Gly 
1               5                   10                  15      
Gly Lys Val Gln Ile Ile Asn Leu Gln Ser Leu Tyr Cys Ser Arg Asp 
            20                  25                  30          
Gln Trp Arg Leu Ser Leu Pro Arg Asp Asp Met Glu Leu Ala Ala Leu 
        35                  40                  45              
Ala Leu Leu Val Cys Ile Gly Gln Ile Ile Ala Pro Ala Lys Asp Asp 
    50                  55                  60                  
Val Glu Phe Arg His Arg Ile Met Asn Pro Leu Thr Glu Asp Glu Phe 
65                  70                  75                  80  
Gln Gln Leu Ile Ala Pro Trp Ile Asp Met Phe Tyr Leu Asn His Ala 
                85                  90                  95      
Glu His Pro Phe Met Gln Thr Lys Gly Val Lys Ala Asn Asp Val Thr 
            100                 105                 110         
Pro Met Glu Lys Leu Leu Ala Gly Val Ser Gly Ala Thr Asn Cys Ala 
        115                 120                 125             
Phe Val Asn Gln Pro Gly Gln Gly Glu Ala Leu Cys Gly Gly Cys Thr 
    130                 135                 140                 
Ala Ile Ala Leu Phe Asn Gln Ala Asn Gln Ala Pro Gly Phe Gly Gly 
145                 150                 155                 160 
Gly Phe Lys Ser Gly Leu Arg Gly Gly Thr Pro Val Thr Thr Phe Val 
                165                 170                 175     
Arg Gly Ile Asp Leu Arg Ser Thr Val Leu Leu Asn Val Leu Thr Leu 
            180                 185                 190         
Pro Arg Leu Gln Lys Gln Phe Pro Asn Glu Ser His Thr Glu Asn Gln 
        195                 200                 205             
Pro Thr Trp Ile Lys Pro Ile Lys Ser Asn Glu Ser Ile Pro Ala Ser 
    210                 215                 220                 
Ser Ile Gly Phe Val Arg Gly Leu Phe Trp Gln Pro Ala His Ile Glu 
225                 230                 235                 240 
Leu Cys Asp Pro Ile Gly Ile Gly Lys Cys Ser Cys Cys Gly Gln Glu 
                245                 250                 255     
Ser Asn Leu Arg Tyr Thr Gly Phe Leu Lys Glu Lys Phe Thr Phe Thr 
            260                 265                 270         
Val Asn Gly Leu Trp Pro His Pro His Ser Pro Cys Leu Val Thr Val 
        275                 280                 285             
Lys Lys Gly Glu Val Glu Glu Lys Phe Leu Ala Phe Thr Thr Ser Ala 
    290                 295                 300                 
Pro Ser Trp Thr Gln Ile Ser Arg Val Val Val Asp Lys Ile Ile Gln 
305                 310                 315                 320 
Asn Glu Asn Gly Asn Arg Val Ala Ala Val Val Asn Gln Phe Arg Asn 
                325                 330                 335     
Ile Ala Pro Gln Ser Pro Leu Glu Leu Ile Met Gly Gly Tyr Arg Asn 
            340                 345                 350         
Asn Gln Ala Ser Ile Leu Glu Arg Arg His Asp Val Leu Met Phe Asn 
        355                 360                 365             
Gln Gly Trp Gln Gln Tyr Gly Asn Val Ile Asn Glu Ile Val Thr Val 
    370                 375                 380                 
Gly Leu Gly Tyr Lys Thr Ala Leu Arg Lys Ala Leu Tyr Thr Phe Ala 
385                 390                 395                 400 
Glu Gly Phe Lys Asn Lys Asp Phe Lys Gly Ala Gly Val Ser Val His 
                405                 410                 415     
Glu Thr Ala Glu Arg His Phe Tyr Arg Gln Ser Glu Leu Leu Ile Pro 
            420                 425                 430         
Asp Val Leu Ala Asn Val Asn Phe Ser Gln Ala Asp Glu Val Ile Ala 
        435                 440                 445             
Asp Leu Arg Asp Lys Leu His Gln Leu Cys Glu Met Leu Phe Asn Gln 
    450                 455                 460                 
Ser Val Ala Pro Tyr Ala His His Pro Lys Leu Ile Ser Thr Leu Ala 
465                 470                 475                 480 
Leu Ala Arg Ala Thr Leu Tyr Lys His Leu Arg Glu Leu Lys Pro Gln 
                485                 490                 495     
Gly Gly Pro Ser Asn Gly 
            500         

<210> 67
<211> 1509
<212> DNA
<213> Artificial Sequence


<220> 
<223> Escherichia coli K-12 MG1655: b2760 CasA

<400> 67
atgaatttgc ttattgataa ctggatccct gtacgcccgc gaaacggggg gaaagtccaa      60

atcataaatc tgcaatcgct atactgcagt agagatcagt ggcgattaag tttgccccgt     120

gacgatatgg aactggccgc tttagcactg ctggtttgca ttgggcaaat tatcgccccg     180

gcaaaagatg acgttgaatt tcgacatcgc ataatgaatc cgctcactga agatgagttt     240

caacaactca tcgcgccgtg gatagatatg ttctacctta atcacgcaga acatcccttt     300

atgcagacca aaggtgtcaa agcaaatgat gtgactccaa tggaaaaact gttggctggg     360

gtaagcggcg cgacgaattg tgcatttgtc aatcaaccgg ggcagggtga agcattatgt     420

ggtggatgca ctgcgattgc gttattcaac caggcgaatc aggcaccagg ttttggtggt     480

ggttttaaaa gcggtttacg tggaggaaca cctgtaacaa cgttcgtacg tgggatcgat     540

cttcgttcaa cggtgttact caatgtcctc acattacctc gtcttcaaaa acaatttcct     600

aatgaatcac atacggaaaa ccaacctacc tggattaaac ctatcaagtc caatgagtct     660

atacctgctt cgtcaattgg gtttgtccgt ggtctattct ggcaaccagc gcatattgaa     720

ttatgcgatc ccattgggat tggtaaatgt tcttgctgtg gacaggaaag caatttgcgt     780

tataccggtt ttcttaagga aaaatttacc tttacagtta atgggctatg gccccatccg     840

cattcccctt gtctggtaac agtcaagaaa ggggaggttg aggaaaaatt tcttgctttc     900

accacctccg caccatcatg gacacaaatc agccgagttg tggtagataa gattattcaa     960

aatgaaaatg gaaatcgcgt ggcggcggtt gtgaatcaat tcagaaatat tgcgccgcaa    1020

agtcctcttg aattgattat ggggggatat cgtaataatc aagcatctat tcttgaacgg    1080

cgtcatgatg tgttgatgtt taatcagggg tggcaacaat acggcaatgt gataaacgaa    1140

atagtgactg ttggtttggg atataaaaca gccttacgca aggcgttata tacctttgca    1200

gaagggttta aaaataaaga cttcaaaggg gccggagtct ctgttcatga gactgcagaa    1260

aggcatttct atcgacagag tgaattatta attcccgatg tactggcgaa tgttaatttt    1320

tcccaggctg atgaggtaat agctgattta cgagacaaac ttcatcaatt gtgtgaaatg    1380

ctatttaatc aatctgtagc tccctatgca catcatccta aattaataag cacattagcg    1440

cttgcccgcg ccacgctata caaacattta cgggagttaa aaccgcaagg agggccatca    1500

aatggctga                                                            1509


<210> 68
<211> 520
<212> PRT
<213> Artificial Sequence


<220> 
<223> Escherichia coli O157 H7 EC4115 (EHEC): ECH74115_4013 Cse1

<400> 68
Met Asn Ser Phe Ser Leu Leu Thr Thr Pro Trp Leu Pro Val Arg Phe 
1               5                   10                  15      
Lys Asp Gly Thr Thr Gly Lys Leu Ala Pro Val Asp Leu Ala Asp Glu 
            20                  25                  30          
Asn Val Val Asp Ile Ala Ala Pro Arg Ala Asp Leu Gln Gly Ala Ala 
        35                  40                  45              
Trp Gln Phe Leu Leu Gly Leu Leu Gln Ser Ser Phe Ala Pro Lys Asp 
    50                  55                  60                  
Tyr Arg Arg Trp Asp Asp Ile Trp Glu Asp Gly Leu Glu Ala Glu Lys 
65                  70                  75                  80  
Leu Arg Glu Ala Leu Leu Ser Leu Glu His Pro Phe Gln Phe Gly Pro 
                85                  90                  95      
Asp Ser Pro Ser Phe Met Gln Asp Phe Glu Val Leu Met Gly Asp Lys 
            100                 105                 110         
Val Gln Val Ala Ser Leu Leu Pro Glu Ile Pro Gly Ala Gln Thr Thr 
        115                 120                 125             
Lys Phe Asn Lys Asp His Phe Ile Lys Arg Gly Val Thr Glu His Val 
    130                 135                 140                 
Cys Ser His Cys Ser Ala Leu Ala Leu Phe Ser Leu Gln Leu Asn Ala 
145                 150                 155                 160 
Pro Ser Gly Gly Lys Gly Tyr Arg Thr Gly Leu Arg Gly Gly Gly Pro 
                165                 170                 175     
Met Thr Thr Leu Ile Glu Leu Gln Glu Tyr Gln Gly Asn Gln Gln Ala 
            180                 185                 190         
Pro Leu Trp Arg Lys Leu Trp Leu Asn Val Met Pro Gln Asp Glu Ala 
        195                 200                 205             
Asp Leu Pro Leu Pro Lys Lys Phe Asp Asp Leu Val Phe Pro Trp Leu 
    210                 215                 220                 
Gly Pro Thr Arg Thr Ser Glu Leu Ala Gly Ala Val Val Thr Asp Asp 
225                 230                 235                 240 
Gln Val Asn Lys Leu Gln Ala Tyr Trp Gly Met Pro Arg Arg Ile Arg 
                245                 250                 255     
Ile Asp Phe Asn Thr Thr Thr Val Gly Asn Cys Asp Ile Cys Gly Glu 
            260                 265                 270         
Gln Ser Asp Ala Leu Leu Ser Leu Met Thr Thr Lys Asn Tyr Gly Ala 
        275                 280                 285             
Asn Tyr Ala Met Trp Gln His Pro Leu Thr Pro Tyr Arg Val Pro Leu 
    290                 295                 300                 
Lys Glu Gly Gly Glu Phe Tyr Ser Val Lys Pro Gln Pro Gly Gly Leu 
305                 310                 315                 320 
Ile Trp Arg Asp Trp Leu Gly Leu Ile Glu Thr Gly Lys Ser Glu Asn 
                325                 330                 335     
Asn Thr Glu Leu Pro Ala Leu Val Val Lys Leu Phe Asn Ala Ser Ser 
            340                 345                 350         
Leu Lys Gln Ala Lys Val Gly Leu Trp Gly Phe Gly Tyr Asp Phe Asp 
        355                 360                 365             
Asn Met Lys Ala Arg Cys Trp Tyr Glu His His Phe Pro Leu Leu Leu 
    370                 375                 380                 
Asn Lys Lys Glu Gly Gln Ile Pro Lys Leu Arg Leu Ala Ala Gln Thr 
385                 390                 395                 400 
Ala Ser Arg Ile Leu Ser Leu Leu Arg Ser Ala Leu Lys Glu Ala Trp 
                405                 410                 415     
Phe Ser Asp Pro Lys Gly Ala Arg Gly Asp Phe Ser Phe Val Asp Ile 
            420                 425                 430         
Asp Phe Trp Asn Lys Thr Gln His Arg Phe Leu Arg Leu Val Arg Gln 
        435                 440                 445             
Ile Glu Glu Gly Gln Asp Ala Asp Glu Leu Leu Gly Lys Trp Gln Lys 
    450                 455                 460                 
Glu Ile Trp Leu Phe Ala Arg Gln Asp Phe Asp Glu Arg Val Phe Thr 
465                 470                 475                 480 
Asn Pro Tyr Glu Pro Val Asp Leu Glu Arg Val Met Thr Ala Arg Lys 
                485                 490                 495     
Lys Tyr Phe Thr Thr Ser Ala Glu Lys Gln Ser Ala Lys Ala Ala Arg 
            500                 505                 510         
Glu Lys Lys Gln Glu Ala Ala Glu 
        515                 520 

<210> 69
<211> 1563
<212> DNA
<213> Artificial Sequence


<220> 
<223> Escherichia coli O157 H7 EC4115 (EHEC): ECH74115_4013 Cse1

<400> 69
atgaactcgt tttcacttct gacaaccccg tggttgcccg ttcgttttaa agacggaaca      60

acaggcaagc tggcgccagt cgatctggcg gatgaaaatg ttgtcgatat cgctgcgccg     120

cgggcagatc tccagggggc ggcatggcag tttttgctgg ggttactaca aagcagtttc     180

gcgccaaaag attatcgtcg ttgggatgat atctgggaag acgggctgga agctgaaaag     240

ctacgggaag cattgctgtc attagaacac cctttccagt ttggcccaga ttcaccttca     300

tttatgcagg atttcgaggt gctcatgggc gataaagttc aggtcgcttc gctactgcct     360

gagattcccg gcgctcaaac aacgaagttt aataaagacc actttattaa gcgtggcgtg     420

actgaacacg tatgctctca ttgttctgcg ttagctctgt tctccctaca gttaaatgcg     480

ccgtcaggtg gcaaaggcta tcgcaccggt ttacgcggcg gtgggccgat gacgactctg     540

attgaattgc aggagtatca gggcaatcaa caagccccct tgtggcgcaa actgtggctc     600

aacgtgatgc cgcaggatga agccgactta ccgctaccca aaaaatttga cgatctggtt     660

ttcccctggc ttggcccgac gcgtaccagc gaactggccg gtgcggtggt aaccgatgat     720

caggtcaata aactccaggc gtactgggga atgccgcggc gtattcgtat tgattttaat     780

accacgacag tcggcaactg cgatatttgc ggtgagcaga gtgacgcgct tctgagtttg     840

atgactacca aaaattacgg tgcgaattat gccatgtggc agcatccctt aacgccttac     900

cgtgtaccac ttaaagaggg cggtgagttt tactccgtta aaccacaacc gggcggttta     960

atctggcgcg actggttagg ccttatcgaa acgggtaagt cagaaaacaa tacggaactt    1020

cccgcgctgg tggtgaaact ctttaatgcc agcagtctga aacaggcaaa agtgggcctg    1080

tggggatttg gttatgattt cgacaacatg aaagcgcgct gttggtacga acaccatttc    1140

ccgctgctgc tcaataaaaa agaaggccag ataccgaagc tgcggctggc tgcgcaaacg    1200

gcttcacgga ttctgagtct gttacggagt gcattgaaag aagcatggtt ctccgatcca    1260

aaaggtgcaa ggggtgattt cagttttgtg gatatcgact tctggaacaa aactcagcat    1320

cgcttcctga ggttagtgcg ccaaattgaa gaaggtcagg atgcggatga attactcggc    1380

aaatggcaaa aggaaatttg gttattcgca cgtcaggatt ttgacgagcg tgtattcacc    1440

aatccttatg agcccgttga tttggaacgc gtcatgaccg cgcgcaagaa atattttaca    1500

acatcggcgg agaagcaaag tgctaaagcc gccagggaga aaaagcagga ggctgctgaa    1560

tga                                                                  1563


<210> 70
<211> 518
<212> PRT
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Typhimurium var. 5-
      CFSAN001921: CFSAN001921_02360 CasA

<400> 70
Met Asp Asn Phe Ser Leu Leu Thr Thr Pro Trp Leu Pro Val Arg Phe 
1               5                   10                  15      
Lys Asp Gly Ser Thr Gly Lys Leu Ala Pro Val Asp Leu Ala Asp Glu 
            20                  25                  30          
Asn Val Val Asp Ile Ala Ala Thr Arg Ala Asp Leu Gln Gly Ala Ala 
        35                  40                  45              
Trp Gln Phe Leu Leu Gly Leu Leu Gln Cys Ser Ile Ala Pro Lys Arg 
    50                  55                  60                  
Tyr Lys Asn Trp Glu Asp Ile Trp Phe Asp Gly Leu His Ala Asp Val 
65                  70                  75                  80  
Leu His Lys Ala Leu Ala Pro Leu Glu His Ala Phe Gln Phe Gly Ala 
                85                  90                  95      
Glu Thr Pro Ser Phe Met Gln Asp Phe Glu Pro Leu Ser Gly Glu Lys 
            100                 105                 110         
Val Ser Ile Ala Ser Leu Leu Pro Glu Ile Pro Gly Ala Gln Thr Thr 
        115                 120                 125             
Lys Phe Asn Lys Asp His Phe Val Lys Arg Gly Val Thr Glu Arg Phe 
    130                 135                 140                 
Cys Pro His Cys Ala Ala Leu Ala Leu Phe Ser Leu Gln Leu Asn Ala 
145                 150                 155                 160 
Pro Ala Gly Gly Lys Gly Tyr Arg Thr Gly Leu Arg Gly Gly Gly Pro 
                165                 170                 175     
Leu Thr Thr Leu Val Glu Leu Gln Glu Tyr Gln Gly Glu Arg Gln Thr 
            180                 185                 190         
Pro Leu Trp Arg Lys Leu Trp Leu Asn Val Met Pro Gln Asp Thr Ala 
        195                 200                 205             
Asp Leu Pro Leu Pro Asp Gln Cys Asp Ala Thr Val Phe Pro Trp Leu 
    210                 215                 220                 
Ala Ala Thr Arg Thr Ser Glu Gln Ala Asn Ala Val Thr Thr Pro Glu 
225                 230                 235                 240 
Gln Val Asn Lys Leu Gln Ala Tyr Trp Gly Met Pro Arg Arg Ile Arg 
                245                 250                 255     
Leu Asp Phe Ala Thr Leu Gln Ser Gly Cys Cys Asp Ile Cys Gly Ala 
            260                 265                 270         
Glu Ser Asp Glu Leu Leu Gly Phe Met Thr Val Lys Asn Tyr Gly Val 
        275                 280                 285             
Asn Tyr Asp Gly Trp Arg His Pro Leu Thr Pro Tyr Arg Ala Pro Val 
    290                 295                 300                 
Lys Asp Gln Asn Ala Phe Phe Ser Val Lys Pro Gln Pro Gly Gly Leu 
305                 310                 315                 320 
Ile Trp Arg Asp Trp Leu Gly Leu Ser Gln Asn Asn Gln Thr Glu Ala 
                325                 330                 335     
Asn Tyr Glu Ser Pro Ala Gln Val Val Lys Val Phe Asn Ala Arg Ser 
            340                 345                 350         
Leu Thr Asp Val Lys Ala Gly Ile Trp Gly Phe Gly Ala Asp Phe Asp 
        355                 360                 365             
Asn Met Lys Ile Arg Cys Trp Tyr Glu His His Phe Pro Leu Leu Met 
    370                 375                 380                 
Thr Glu Gly Leu Ile Pro Asp Leu Arg Lys Ala Val Gln Thr Ala Ala 
385                 390                 395                 400 
Arg Leu Leu Ser Leu Leu Arg Ser Ala Leu Lys Glu Ala Trp Phe Ala 
                405                 410                 415     
Asp Ala Lys Gly Ala Arg Gly Asp Phe Ser Phe Ile Asp Ile Asp Phe 
            420                 425                 430         
Trp Asn Leu Thr Gln Gly Arg Phe Leu Asn Leu Ile His Asp Leu Glu 
        435                 440                 445             
Asn Gly His Lys Pro Asp Glu Arg Leu Asn Lys Trp Gln Arg Glu Leu 
    450                 455                 460                 
Trp Leu Phe Thr Arg His Tyr Phe Asp Asp His Val Phe Thr Asn Pro 
465                 470                 475                 480 
Tyr Glu Ser Ser Asp Leu Glu Arg Ile Met Thr Ala Arg Lys Lys Tyr 
                485                 490                 495     
Phe Thr Thr Ser Ala Glu Lys Gln Ser Ala Lys Ala Ala Lys Ala Lys 
            500                 505                 510         
Lys Gln Glu Ala Ala Glu 
        515             

<210> 71
<211> 1557
<212> DNA
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Typhimurium var. 5-
      CFSAN001921: CFSAN001921_02360 CasA

<400> 71
atggacaatt tttcactttt aacaacgccc tggctccccg tccgtttcaa agacggttcc      60

acgggcaagc tggcccccgt cgatctggcg gatgaaaacg tggtggacat cgccgcaacg     120

cgagcagatt tacagggagc ggcttggcag tttctgttgg gattgctgca atgcagtatc     180

gcgccgaaaa gatacaaaaa ttgggaggat atctggtttg atggattgca tgccgatgtg     240

ctccataagg cattagcacc gttagaacac gcttttcagt ttggcgcgga aacgccgtct     300

tttatgcagg attttgaacc gttaagcggc gaaaaagtct ctattgcctc attgttgccg     360

gaaatacctg gcgcgcaaac cacgaagttc aataaagatc attttgtcaa acgcggcgta     420

acggaacgtt tttgtccgca ctgcgcggcg ctggcgctgt tctcgttgca gcttaacgcg     480

cctgcgggcg gcaaaggcta tcgtaccggg ctgcgcggcg gcgggccact gaccacgctg     540

gttgaattgc aggaatatca gggcgagcgg caaacgccgc tctggcgcaa gctgtggctc     600

aacgtgatgc cgcaggatac tgcggatctg cctttaccag accagtgtga tgcgaccgtt     660

ttcccgtggc ttgccgcgac gcggaccagc gagcaggcga atgccgttac cacgccggag     720

caggtcaata aactccaggc gtactggggg atgccgcgtc gtatccgcct ggattttgcc     780

accttacagt caggttgctg cgatatttgc ggcgctgaaa gcgatgagct tcttggcttt     840

atgaccgtca agaactacgg cgttaactac gatggctggc ggcacccgct gacgccttat     900

cgcgccccgg taaaagatca aaacgccttc ttttccgtta aaccgcagcc cggcggcctt     960

atctggcgcg actggctggg attaagtcag aacaaccaga cggaagcgaa ttacgaatct    1020

cccgcgcagg tagtcaaggt gtttaacgcc cgctcgctga ctgacgttaa agcggggatc    1080

tggggctttg gcgcggattt cgacaatatg aaaatccgct gctggtatga gcatcacttc    1140

ccgttgctga tgacggaagg tctgatccct gatttacgta aggccgtgca aactgcggcc    1200

cgcctgttga gcctgcttcg cagcgcgctc aaagaggcct ggtttgccga tgcgaagggt    1260

gctcgcggtg atttcagttt tatcgacatt gatttctgga acctgacgca gggacgtttt    1320

ctcaacctga ttcacgatct ggaaaacggc cacaagccgg acgaaaggct gaataaatgg    1380

caaagagaac tttggctgtt tacccgtcat tacttcgatg atcacgtctt taccaacccc    1440

tacgagagca gcgatctgga acgcatcatg accgcgcgca agaaatattt tacgacatcg    1500

gcggaaaaac aaagtgcaaa agccgccaaa gcaaagaaac aggaggctgc tgaatga       1557


<210> 72
<211> 518
<212> PRT
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Enteritidis
      EC20090193: AU37_14140 CasA

<400> 72
Met Asp Asn Phe Ser Leu Leu Thr Thr Pro Trp Leu Pro Val Arg Phe 
1               5                   10                  15      
Lys Asp Gly Ser Thr Gly Lys Leu Ala Pro Val Asp Leu Ala Asp Glu 
            20                  25                  30          
Asn Val Val Asp Ile Ala Ala Thr Arg Ala Asp Leu Gln Gly Ala Ala 
        35                  40                  45              
Trp Gln Phe Leu Leu Gly Leu Leu Gln Cys Ser Ile Ala Pro Lys Arg 
    50                  55                  60                  
Tyr Lys Asn Trp Glu Asp Ile Trp Phe Asp Gly Leu His Ala Asp Val 
65                  70                  75                  80  
Leu His Lys Ala Leu Ala Pro Leu Glu His Ala Phe Gln Phe Gly Ala 
                85                  90                  95      
Glu Ser Pro Ser Phe Met Gln Asp Phe Glu Pro Leu Ser Gly Glu Lys 
            100                 105                 110         
Val Ser Ile Ala Ser Leu Leu Pro Glu Ile Pro Gly Ala Gln Thr Thr 
        115                 120                 125             
Lys Phe Asn Lys Asp His Phe Val Lys Arg Gly Val Thr Glu Arg Phe 
    130                 135                 140                 
Cys Pro His Cys Ala Ala Leu Ala Leu Phe Ser Leu Gln Leu Asn Ala 
145                 150                 155                 160 
Pro Ala Gly Gly Lys Gly Tyr Arg Thr Gly Leu Arg Gly Gly Gly Pro 
                165                 170                 175     
Leu Thr Thr Leu Val Glu Leu Gln Glu Tyr Gln Gly Glu Arg Gln Thr 
            180                 185                 190         
Pro Ile Trp Arg Lys Leu Trp Leu Asn Val Met Pro Gln Asp Thr Ala 
        195                 200                 205             
Asp Leu Pro Leu Pro Asp Gln Cys Asp Ala Thr Val Phe Pro Trp Leu 
    210                 215                 220                 
Ala Ala Thr Arg Thr Ser Glu Gln Ala Asn Ala Val Thr Thr Pro Glu 
225                 230                 235                 240 
Gln Val Asn Lys Leu Gln Ala Tyr Trp Gly Met Pro Arg Arg Ile Arg 
                245                 250                 255     
Leu Asp Phe Ala Thr Leu Gln Ser Gly Cys Cys Asp Ile Cys Gly Ala 
            260                 265                 270         
Glu Ser Asp Glu Leu Leu Gly Phe Met Thr Val Lys Asn Tyr Gly Val 
        275                 280                 285             
Asn Tyr Asp Gly Trp Arg His Pro Leu Thr Pro Tyr Arg Ala Pro Val 
    290                 295                 300                 
Lys Asp Gln Asn Ala Phe Phe Ser Val Lys Pro Gln Pro Gly Gly Leu 
305                 310                 315                 320 
Ile Trp Arg Asp Trp Leu Gly Leu Ser Gln Asn Asn Gln Thr Glu Ala 
                325                 330                 335     
Asn Tyr Glu Ser Pro Ala Gln Val Val Lys Val Phe Asn Ala Arg Ser 
            340                 345                 350         
Leu Thr Asp Val Lys Ala Gly Ile Arg Gly Phe Gly Ala Asp Phe Asp 
        355                 360                 365             
Asn Met Lys Ile Arg Cys Trp Tyr Glu His His Phe Pro Leu Leu Met 
    370                 375                 380                 
Thr Glu Gly Leu Ile Pro Asp Leu Arg Lys Ala Val Gln Thr Ala Ala 
385                 390                 395                 400 
Arg Leu Leu Ser Leu Leu Arg Ser Ala Leu Lys Glu Ala Trp Phe Thr 
                405                 410                 415     
Asn Ala Lys Asp Ala Arg Gly Asp Phe Ser Phe Ile Asp Ile Asp Phe 
            420                 425                 430         
Trp Asn Leu Thr Gln Gly Arg Phe Leu Asn Leu Ile His Asp Leu Glu 
        435                 440                 445             
Asn Gly His Lys Pro Asp Glu Arg Leu Asn Lys Trp Gln Arg Glu Leu 
    450                 455                 460                 
Trp Leu Phe Thr Arg Cys Tyr Phe Asp Asp His Val Phe Thr Asn Pro 
465                 470                 475                 480 
Tyr Glu Ser Ser Asp Leu Glu Arg Ile Met Lys Ala Arg Lys Lys Tyr 
                485                 490                 495     
Phe Thr Ser Ser Ala Glu Lys Gln Ser Ala Lys Ala Ala Lys Ala Lys 
            500                 505                 510         
Lys Gln Glu Ala Ala Glu 
        515             

<210> 73
<211> 1557
<212> DNA
<213> Artificial Sequence


<220> 
<223> Salmonella enterica subsp. enterica serovar Enteritidis
      EC20090193: AU37_14140 CasA

<400> 73
atggacaatt tttcactttt aacaacgccc tggctccccg tccgtttcaa agacggttcc      60

acgggcaagc tggcccccgt cgatctggcg gatgaaaacg tggtggacat cgccgcaacg     120

cgagcagatt tacagggagc ggcctggcag tttctgttgg gattgctgca atgcagtatc     180

gcgccgaaaa gatacaaaaa ttgggaggat atctggtttg atggattgca tgccgatgtg     240

ctccataagg cattagcacc gttagaacac gcttttcagt ttggcgcgga atccccctcg     300

tttatgcagg attttgaacc gttaagcggc gaaaaagtct ctattgcctc attgttgccg     360

gaaatacctg gcgcgcaaac cacgaagttc aataaagatc attttgtcaa acgcggcgta     420

acggaacgtt tttgtccgca ctgcgcggcg ctggcgctgt tctcgttgca gcttaacgcg     480

cctgcgggcg gcaaaggcta tcgtaccggg ctgcgcggcg gcgggccact gaccacgctg     540

gttgaattgc aggaatatca gggcgagcgg caaacgccga tctggcgcaa gctgtggctc     600

aacgtgatgc cgcaggatac tgcggatctg cctttaccag accagtgtga tgcgaccgtt     660

ttcccgtggc ttgccgcgac gcggaccagc gagcaggcga atgccgttac cacgccggag     720

caggtcaata aactccaggc gtactggggg atgccgcgtc gtatccgcct ggattttgcc     780

accttacagt caggttgctg cgatatttgc ggcgctgaaa gcgatgagct tcttggcttt     840

atgaccgtca agaactacgg cgttaactac gatggctggc ggcacccgct gacgccttat     900

cgcgccccgg taaaagatca aaacgccttc ttttccgtta aaccgcagcc cggcggcctt     960

atctggcgcg actggctggg attaagtcag aacaaccaga cggaagcgaa ttacgaatct    1020

cccgcgcagg tagtcaaggt gtttaacgcc cgctcgctga ctgacgttaa agcggggatc    1080

cggggctttg gcgcggattt cgacaatatg aaaatccgct gctggtatga gcatcacttc    1140

ccgttgctga tgacggaagg tctgatccct gatttacgta aggccgtgca aactgcggcc    1200

cgcctgttga gcctgcttcg cagtgcgcta aaagaagcgt ggttcaccaa tgcgaaggat    1260

gcgcggggtg atttcagttt tatcgacatt gatttctgga acctgacgca ggggcgcttt    1320

ctcaatctga tccacgatct ggaaaacgga cacaagccgg acgaaaggct gaataaatgg    1380

caaagagaac tttggctgtt tacccgttgt tacttcgatg atcacgtctt taccaacccc    1440

tacgagagca gcgatctgga gcgcatcatg aaggcgcgca aaaaatattt tacttcatcg    1500

gcggaaaagc aaagcgcaaa agccgccaaa gcaaagaaac aggaggctgc tgaatga       1557


<210> 74
<211> 28
<212> DNA
<213> Artificial Sequence


<220> 
<223> S thermophilus CRISPR4 repeat

<400> 74
gtttttcccg cacacgcggg ggtgatcc                                       28


