                         SEQUENCE LISTING

<110>  PIONEER HI-BRED INTERNATIONAL, INC.
 
<120>  NOVEL CRISPR-CAS SYSTEMS FOR GENOME EDITING

<130>  8207-WO-PCT

<150>  62980750
<151>  2020-02-24

<150>  63030964
<151>  2020-05-28

<160>  155   

<170>  PatentIn version 3.5

<210>  1
<211>  337
<212>  PRT
<213>  Armatimonadetes bacterium

<400>  1

Met Val Asp Gly Tyr Gly Ser Gln Val Leu Lys His Ser Glu Arg Leu 
1               5                   10                  15      


Val Leu Arg Ile Pro Ser Ala Leu Asp Lys Asn Ile Lys Arg Glu Val 
            20                  25                  30          


Pro Val Leu His Leu Asp His Leu Leu Val Gly Thr Lys Gly Val Leu 
        35                  40                  45              


Val Ser Ser Asp Ala Leu Ala Leu Cys Cys Glu Arg Gly Ile Pro Val 
    50                  55                  60                  


Thr Val Val Asp Trp Arg Gly Arg Pro Val Gly Arg Phe Gly Ser Pro 
65                  70                  75                  80  


Ala Leu His Gly Ser Ala His Ile Arg Arg Ala Gln Leu Glu Ala Phe 
                85                  90                  95      


Asp Ala Asn Leu Gly Ala Glu Phe Ala Arg Glu Val Val Cys Gly Lys 
            100                 105                 110         


Leu Leu Asn Gln Ala Asp Asn Leu Arg Tyr Phe Gly Lys Asn Arg Lys 
        115                 120                 125             


Thr Arg Asp Pro Ala Gln His Glu Leu Leu Glu Thr Ser Ala Asp Asp 
    130                 135                 140                 


Ile Asn Glu Ile Ser Lys Arg Ala Ser Cys Ile Ser Glu Lys Cys Ala 
145                 150                 155                 160 


Asn Thr Ala Arg Leu Pro Leu Met Thr Leu Glu Ala Glu Gly Ala Arg 
                165                 170                 175     


Ile Tyr Trp Ser Ala Leu Ser Arg Leu Tyr Gly Thr Arg Ser Gly Phe 
            180                 185                 190         


Ala Arg Arg Glu Gln Arg Gly Thr Arg Asp Pro Val Asn Ala Ala Leu 
        195                 200                 205             


Asn Tyr Ala Tyr Gly Val Leu Asn Gly Glu Val Trp Asn Ala Val Val 
    210                 215                 220                 


Leu Ala Gly Leu Glu Pro Tyr Ala Gly Leu Leu His Val Asp Arg Pro 
225                 230                 235                 240 


Gly Arg Leu Ser Phe Val Leu Asp Leu Met Glu Glu Phe Arg Pro Ile 
                245                 250                 255     


Ile Ala Asp Arg Leu Val Phe Gly Leu Ala Ala Lys Gly Trp Lys Ile 
            260                 265                 270         


Gly Gln Glu Glu Asn Gly Trp Leu Asp Phe Ala Thr Lys Gln Arg Leu 
        275                 280                 285             


Leu Lys Ala Ile Ser Glu Arg Trp Asp Ala Arg Val Asn Tyr Gln Gly 
    290                 295                 300                 


Arg Lys Ile Arg Leu Arg Ser Val Leu Gln Leu Gln Ala Arg Asp Ala 
305                 310                 315                 320 


Ala Arg His Phe Leu Gly Arg Ala Gln Tyr Arg Ala Phe Arg Gln Arg 
                325                 330                 335     


Trp 
    


<210>  2
<211>  350
<212>  PRT
<213>  Armatimonadetes bacterium

<400>  2

Met Arg Tyr Glu Ile Val Asp Gly Tyr Gly Cys Gln Val Leu Lys His 
1               5                   10                  15      


Ser Glu Arg Leu Val Leu Arg Tyr Pro Leu Ala Pro Pro Gly Glu Arg 
            20                  25                  30          


Gly Pro Gly Gly Glu Gly Lys Gln Pro Lys Arg Glu Val Pro Val Leu 
        35                  40                  45              


His Leu Asp His Leu Leu Ile Gly Thr Lys Gly Val Thr Val Ser Thr 
    50                  55                  60                  


Asp Ala Leu Ala Leu Cys Cys Glu Arg Gly Ile Pro Val Thr Val Val 
65                  70                  75                  80  


Asp Trp Arg Gly Arg Pro Val Gly Arg Phe Gly Ser Pro Ala Leu His 
                85                  90                  95      


Gly Thr Ala Gln Val Arg Arg Ala Gln Ile Ala Ala Phe Ala Thr Glu 
            100                 105                 110         


Ile Gly Ala Thr Phe Ala Arg Glu Val Val Ser Gly Lys Leu Leu Asn 
        115                 120                 125             


Gln Ala Asp Asn Leu Arg Tyr Ile Gly Lys Asn Arg Lys Thr Arg Ala 
    130                 135                 140                 


Pro Asn Glu His Glu Ala Leu Thr Arg Thr Ala Asp Thr Leu Gln Arg 
145                 150                 155                 160 


Leu Ala Lys Lys Ala Ala Thr Val Lys Gly Lys Asn Ala Asp Asp Val 
                165                 170                 175     


Arg Leu Pro Leu Met Thr Val Glu Ala Glu Gly Ala Arg Ala Tyr Trp 
            180                 185                 190         


Ser Val Leu Ser Glu Val Tyr Gly Glu Arg Ser Gly Phe Ala Lys Arg 
        195                 200                 205             


Glu Gln Arg Gly Thr Arg Asp Pro Val Asn Ala Ala Leu Asn Tyr Ala 
    210                 215                 220                 


Tyr Gly Val Leu Asn Gly Glu Val Trp Asn Ala Thr Ile Leu Ala Gly 
225                 230                 235                 240 


Leu Glu Pro Tyr Ala Gly Phe Leu His Val Asp Arg Pro Gly Arg Leu 
                245                 250                 255     


Ser Phe Val Leu Asp Leu Met Glu Glu Phe Arg Pro Val Val Ala Asp 
            260                 265                 270         


Arg Val Val Phe Gly Leu Val Ala Lys Gly Trp Lys Ile Gly Gln Glu 
        275                 280                 285             


Glu Asn Gly Trp Leu Asp Met Pro Thr Lys Arg Arg Leu Ile Gln Ala 
    290                 295                 300                 


Ile Gly Glu Arg Trp Gly Ala Arg Val Leu His Gln Ala Arg Lys Leu 
305                 310                 315                 320 


Gln Leu Arg Ser Val Leu Gln Leu Gln Ala Arg Asp Ala Ala Arg His 
                325                 330                 335     


Phe Gln Gly Lys Ala Glu Tyr Ile Ala Phe Arg Leu Arg Trp 
            340                 345                 350 


<210>  3
<211>  343
<212>  PRT
<213>  Armatimonadetes bacterium

<400>  3

Met Val Lys Tyr Glu Tyr Val Glu Thr Phe Gly Ser Ala Val Gln Lys 
1               5                   10                  15      


His Ser Glu Arg Leu Val Val Ser Glu Pro Ser Gly Glu Gly Gly Gln 
            20                  25                  30          


Arg Thr Lys Arg Gln Val Pro Ala Leu His Leu Asp His Leu Leu Ile 
        35                  40                  45              


Gly Ser Arg Gly Val Ser Ile Ser Ser Asp Ala Leu Glu Leu Cys Cys 
    50                  55                  60                  


Glu Arg Gly Ile Pro Val Thr Ile Val Asp Arg Arg Gly Lys Pro Val 
65                  70                  75                  80  


Gly Lys Phe Thr Ala Pro Ala Ile His Gly Thr Ser Arg Thr Arg Arg 
                85                  90                  95      


Ala Gln Ile Arg Ala Tyr Glu Asn Gly Leu Gly Val Thr Phe Ala Arg 
            100                 105                 110         


Ser Val Val Ile Gly Lys Ala Ala Asn Gln Ala Ile Asn Leu Lys Tyr 
        115                 120                 125             


Phe Ala Lys Asn Arg Arg Glu Arg Ser Pro Asp Gln Tyr Glu Thr Leu 
    130                 135                 140                 


Arg Lys Ser Ala Glu Ala Ile Asp Arg Val Ala Arg Arg Ala Lys Lys 
145                 150                 155                 160 


Ile Ser Ala Asn Cys Ile Asp Glu Val Arg Gln Pro Leu Met Val Leu 
                165                 170                 175     


Glu Ala Glu Ala Ser Arg Ile Tyr Trp Ser Ser Leu Ser Ala Leu Tyr 
            180                 185                 190         


Gly Ser Ser Ser Gly Phe Val His Arg Glu Gln Arg Gly Thr Lys Asn 
        195                 200                 205             


Pro Val Asn Ala Ala Leu Asn Tyr Ala Tyr Gly Val Leu Thr Gly Glu 
    210                 215                 220                 


Val Trp Thr Ala Cys Leu Leu Ala Gly Leu Glu Pro Tyr Ala Gly Phe 
225                 230                 235                 240 


Leu His Ala Asp Arg Pro Gly Arg Leu Ser Phe Val Leu Asp Leu Ile 
                245                 250                 255     


Glu Glu Phe Arg Pro Val Val Ala Asp Arg Val Val Phe Ala Leu Ala 
            260                 265                 270         


Ala Lys Gly Trp Arg Ile Glu Gln Glu Glu Asn Gly Trp Leu Ser Leu 
        275                 280                 285             


Ala Ser Lys Asn Lys Leu Leu Ala Ser Leu Ala Glu Arg Leu Asp Ser 
    290                 295                 300                 


Pro Glu Pro Asp Arg Gly Arg Arg Arg Lys Leu Arg Asn Val Ile Gln 
305                 310                 315                 320 


Arg Gln Ala Tyr Ala Ala Ala Gln His Phe Leu Gly Asn Glu Thr Tyr 
                325                 330                 335     


Val Pro Tyr Lys Gln Arg Trp 
            340             


<210>  4
<211>  92
<212>  PRT
<213>  Armatimonadetes bacterium

<400>  4

Met Lys Trp Leu Val Cys Tyr Asp Ile Glu Lys Asp Ser Val Arg Gln 
1               5                   10                  15      


Lys Ile Ala Asp Phe Cys Leu Asp Lys Gly Leu Glu Arg Val Gln Tyr 
            20                  25                  30          


Ser Val Phe Leu Gly Asp Met Asn Gln Thr Leu Ala Phe Asp Leu Ala 
        35                  40                  45              


Ala Gln Ile Arg Arg Arg Met Gly Asp His Pro Gly Gln Val Arg Phe 
    50                  55                  60                  


Ile Pro Ile Cys Asp Arg Asp Trp Lys Lys Thr Phe Arg Ile Gln Ile 
65                  70                  75                  80  


Gly Asn Tyr Met Gly Val Lys Pro Ser Asn Gly Lys 
                85                  90          


<210>  5
<211>  92
<212>  PRT
<213>  Armatimonadetes bacterium

<400>  5

Met Lys Trp Leu Val Cys Tyr Asp Ile Glu Lys Asp Ser Val Arg Asn 
1               5                   10                  15      


Lys Val Ala Asp Phe Cys Leu Asp Lys Gly Leu Glu Arg Val Gln Tyr 
            20                  25                  30          


Ser Val Phe Leu Gly Ser Met Thr Arg Thr Leu Ala Lys Glu Leu Gly 
        35                  40                  45              


Ala Gln Ile Arg Arg Lys Met Gly Lys Asn Pro Gly Gln Val Arg Phe 
    50                  55                  60                  


Val Pro Ile Cys Asp Lys Asp Trp Lys Thr Ser Phe Arg Val Gln Val 
65                  70                  75                  80  


Gly Asp His Met Gly Glu Lys Ala Ser Asp Gly Lys 
                85                  90          


<210>  6
<211>  82
<212>  PRT
<213>  Armatimonadetes bacterium

<400>  6

Met Thr Trp Leu Val Val Tyr Asp Ile Glu Asp Asp Arg Val Arg Thr 
1               5                   10                  15      


Lys Val Ala Asp Tyr Cys Leu Asp Lys Gly Leu Glu Arg Ile Gln Tyr 
            20                  25                  30          


Ser Cys Phe Leu Gly Glu Met Ser Arg Thr Leu Ala Arg Glu Leu Ala 
        35                  40                  45              


Ser Lys Cys Lys Arg Lys Leu Gly Asp Lys Pro Gly Lys Ile Arg Leu 
    50                  55                  60                  


Val Pro Val Cys Glu Lys Asp Leu Ala Ser Gln Val Arg Ile Glu Asn 
65                  70                  75                  80  


Val Pro 
        


<210>  7
<211>  177
<212>  PRT
<213>  Armatimonadetes bacterium

<400>  7

Met Lys Ala Gly Ile Asp Ala Glu Ala Glu Arg Gln Arg Leu Glu Thr 
1               5                   10                  15      


Arg Arg Gly Phe Ser Gln Tyr Gly Ile Thr Ala Val Asp Lys Arg Phe 
            20                  25                  30          


Gln Phe Ser Val Arg Ser Asp Ser Leu Gly Leu Ala Gly Arg Ile Asp 
        35                  40                  45              


Cys Leu Ile Glu Thr Thr Asp Val Thr Phe Glu Glu Ala Gln Ala Gly 
    50                  55                  60                  


Leu Arg Pro Asn Glu Trp Asn Trp Asp Asp Pro Leu Phe Val Pro Val 
65                  70                  75                  80  


Glu Tyr Lys Thr Thr Phe Arg Val Gln Gln Lys His Asn Val Met Gln 
                85                  90                  95      


Leu Ala Ala Tyr Ala Arg Met Leu Glu Ser Leu Thr Gly Thr Ala Val 
            100                 105                 110         


Pro Phe Gly Phe Ile Val Met Leu Pro Glu Glu Glu Val Leu Lys Ile 
        115                 120                 125             


Glu Ile Ser Ser Glu Ile Lys Arg Ser Leu Asp Phe Leu Ile Glu Glu 
    130                 135                 140                 


Val Gln Arg Gly Leu Val Ser Asp Glu Leu Pro Arg Pro Thr Pro His 
145                 150                 155                 160 


Ser Gly Lys Cys Gln Asn Cys Glu Phe Arg Arg Phe Cys Asn Asp Val 
                165                 170                 175     


Trp 
    


<210>  8
<211>  221
<212>  PRT
<213>  Armatimonadetes bacterium

<400>  8

Met Ala Ser Ser Ala His Ala Tyr Gln Pro Arg Asp Thr Val Ser Val 
1               5                   10                  15      


Ser Glu Leu Arg Gln Trp Met Tyr Cys Pro Arg Val Val Trp Tyr Gly 
            20                  25                  30          


Arg Ser Met Gly Asp Tyr Arg Pro Thr Thr Gly Ala Met Lys Val Gly 
        35                  40                  45              


Ile Glu Ala Glu Ala Glu Arg Gln Arg Leu Glu Glu Arg Arg Ser Phe 
    50                  55                  60                  


Ala Gln Tyr Gly Leu Glu Ala Cys Asn Lys Arg Phe Gln Val Pro Val 
65                  70                  75                  80  


Ala Ser Glu Ala Leu Gly Leu Ser Gly Arg Ile Asp Cys Leu Ile Glu 
                85                  90                  95      


Leu Thr Pro Val Ser Leu Glu Asp Ala Gln Val Gly Val Arg Pro Leu 
            100                 105                 110         


Asn Trp Lys Val Gly Asp Pro Met Phe Val Pro Val Glu Tyr Lys Trp 
        115                 120                 125             


Thr Ser Arg Ala Asp Gln Arg Gln Asn Thr Ile Gln Leu Ala Ala Tyr 
    130                 135                 140                 


Gly Met Ile Leu Glu Ser Leu Thr Gly Thr Pro Val Pro Leu Gly Phe 
145                 150                 155                 160 


Ile Ala Leu Leu Pro Glu Glu Glu Val Val Arg Val Glu Leu Val Pro 
                165                 170                 175     


Arg Val Arg Arg Ala Val Lys Cys Thr Leu Asp Glu Ala Arg Glu Gly 
            180                 185                 190         


Leu Ser Ala Pro Glu Leu Pro Trp Pro Thr Leu His Arg Gly Lys Cys 
        195                 200                 205             


Gln Asp Cys Glu Phe Arg Arg Phe Cys Asn Asp Val Trp 
    210                 215                 220     


<210>  9
<211>  213
<212>  PRT
<213>  Armatimonadetes bacterium

<400>  9

Met Glu Val Ser Pro Ser Asp Gly Phe Val Ser Val Ser Glu Val Arg 
1               5                   10                  15      


Gln Trp Ser Tyr Cys Pro Arg Val Val Trp His Asn Arg Trp Leu Gly 
            20                  25                  30          


Glu Arg Arg Pro Gln Thr Ser Arg Met Glu Glu Gly Arg Ala Asp Gln 
        35                  40                  45              


Ala Glu Arg Glu Arg Lys Glu Lys Arg Arg Thr Phe Ala Glu Tyr Arg 
    50                  55                  60                  


Leu Pro Ala Gln Ser Arg Arg Phe Asn Val Tyr Leu Arg Ser Glu Arg 
65                  70                  75                  80  


Leu Gly Val Ser Gly Val Val Asp Ala Val Leu Glu Leu Thr Asn Arg 
                85                  90                  95      


Ser Ile Asp Glu Val Asp Ser Gln Gly Leu Asp Pro Glu Arg Pro Tyr 
            100                 105                 110         


Phe Ala Pro Val Glu Tyr Lys Ser Thr Gln Glu Arg Val Gly Arg His 
        115                 120                 125             


His Leu Leu Gln Leu Ala Gly Tyr Ala Ala Leu Leu Ser Asp Ile Thr 
    130                 135                 140                 


Gly Thr Ser Val Pro Phe Gly Tyr Phe Val Ser Leu Pro Asn Gly Arg 
145                 150                 155                 160 


Ala Ser Arg Val Glu Leu Ser Glu Lys Ala Arg Glu Glu Phe Leu Ser 
                165                 170                 175     


Cys Val Gln Gly Ile Arg Asn Met Val Val Glu Cys Arg Met Pro Glu 
            180                 185                 190         


Pro Thr Pro Ser Arg Ala Lys Cys Arg Asp Cys Glu Phe Arg Arg Phe 
        195                 200                 205             


Cys Asn Asp Val Trp 
    210             


<210>  10
<211>  2565
<212>  DNA
<213>  Armatimonadetes bacterium

<400>  10
atgcccaaaa agacatcgac ggtcgctctg tcacccagag atattcgctt gcgcgaactt       60

ggagagaagc gacttcaaag gttgcgacag cgcgaagaga agattcgtcg tcacctggag      120

tcggagcgcg ggcggcgtga ctttcaatcg ctgcactttc ttcttcataa aattgaagtt      180

gaacgaaacg atctgtaccg aaatctttac cagaacgaag ggcacgagtc gtatgtgcca      240

aagccaggta agacgaaaca taggaaagaa ctttctttgc catccacaga gttaccgtct      300

ccacctgatg agaagaaagg gccccggccc aaaaagagtc gctatgtgat tccccagccc      360

gtacctggaa tcaatctccc acgattgatc aatagattcg gtaaatcgga tcaaaagtcc      420

gaatcggatc aagaaggcag attttggact tcagcgcctt tcatcgaagt tgagctgcct      480

atgcttaatg cccatcgagt cataaaggcg ttgatgcgat tcgttcagaa agacgagcgt      540

tcggttgtcc gaacatgggc tgtgaccaag tttggcagca ttgaggccgc cagagaagtc      600

ctgctagcag gagctttgct gcaaagagag ccggaaatca tgagaggctt cctccagaat      660

attgacccct gggggagttt gagcgatgag gaactcattc gcgatgaaaa agcgtggcgg      720

acggtgaagc tcctagccca aaagaattgg gtggatcaaa tcgcgaagtc gatcaaggac      780

tcggcgccta agggcgtaga taaagacact ttggatcgtc gcctgcggag tggcttaaaa      840

gcattccatt ctgcggcaaa ttcaggaaaa cacacgaatc cccagtttcc ctatttgaca      900

tcggagaaac cgtccgcgaa ctttgaatca gttgtcgact ctgtgcttga gttcctcgat      960

ctggaggaca aggatcgata cacgattgcg aaggttgacg acaagaaacg ccaccgagtg     1020

acggctctgc aaaaggagct aggccaagcc aaaccacgtg taaggttgga gcaggaacgt     1080

agcaggtggg ctggccactc gtatctccaa gggaccatta ccaggaaaag gcaggcttcc     1140

ctcgtttggg atggtcaccg aacggagaac ggtttggctc tcgccatccc attagatggc     1200

atgccgaaaa ttgacgtgca gcgatatatg tatcaagatg gcacctccct tctctcggat     1260

cggcaaatta cttccaagac caagtccgag ggtaaggact gtgccttgat gcctctacga     1320

tttaagcatg cctttcttcg atggtatacg aaacacgtcg aaaatcacgt ggccgaggcc     1380

cctttggaac ggcgatgcat tcataacaca acgcagtttg tcatcgtcga cccagaagga     1440

aagcatcctc ggctgttcat ccgacctgtc ttcaaattct atgactctaa taagacaata     1500

cagaacagta acgccccctg gtgcaaaccg cagtgtcgat accttatcgg cattgatcgg     1560

ggcatcaact acgtgctacg agcggttgtt gtagatactg aagagaaagc cgtaatcgac     1620

gacatccctc taccgggtcg aaagcgggag tggcgagcca ttcggcaaga gatcgcgtac     1680

tttcagcgca tgcgggacct ctccaagagt gcccaagaaa ggaaccgcta tgtggttgcg     1740

cttgccaaag ctcgtaggaa agatcgcagc ttgggcaaga cagagacggt tgaggctgtt     1800

gcaaaacttg ttcagaattg cagcgagaga tttggggaag gaaactactg cttcgtgttg     1860

gagaaccttg agttaggtgc tcttaactta aaaagaaata accgagttaa acatctcgct     1920

tccatggaag aggcattgat ctatcagatg cggaaaagag gctacttcta caactcccga     1980

tcgaaccggg tggatggtgt tcgttgggaa gccgcacgct atacaagtca agtttctccg     2040

tttggctggt gggccaagcg cgatgaagtg gagaaggcca agaaacaaga taaaagcatg     2100

gcgattggcc gcaagattgg cgagggatat gaaggtccgc aggacgatga aatagaaagt     2160

cattcgacta tctatcggca gggcagatgg atgaaactca gaaatgaaga aggaaaggcc     2220

tacggaagaa gtcggtttgt ggttcagccg gaagacttgg accctgcaca acccagaagg     2280

ttcagttggg gaagtgaact tttctgggat ccctatcaaa aggaatttaa aggaaagtcc     2340

ttctctcaag gcgttgtgtt ggatgctgat tttgtgggag ccctaaacat tgcccttcgg     2400

ccactagtca acgacggcaa aggcaaaggt ttcaccaccg cgatgatggc ggaagcacat     2460

gtcaagttga acccgacctt tgagatccgt tgcaagatcc cggtttacga atttatcgct     2520

gagaacgaca attctcgtgc cgcgctgaga aggattgtga tatag                     2565


<210>  11
<211>  2604
<212>  DNA
<213>  Armatimonadetes bacterium

<400>  11
atgggtaaga atcgatcctc gtcctcggat ttgagcccgc tcgaacggtc cttgcggaag       60

gtcggtgaga atcgccttga gcggctgcgg gtgcgagagg agaagattag gaagcacata      120

gaacagcacc cccgcggtaa gaacgatcat caggctctcc acttcttatt gcaccaaatc      180

gaggtcgagc gtaacgacct gtaccgaaac ctcaaagacc ccgagtacgt gcccaaacca      240

gcgaaacagc ggcgcgaaag acggcagatc aacgtcgcca aacccccgac tcgaccaaag      300

aaggaaaagg ggcctcaacc agagtcgacg aagtacgtga tccgtccacc agtccctggg      360

aaaaaccttc ctgcctttgc tagcaagtac gaggcgcgag acacgcggga cgattcctac      420

caggacggtc gctcatggac ctccgcacca tatgttgaag tcgaacttcc catccttggt      480

gcagacaaag tcatccagaa actgatgaag ttcgtgcaga aggacgagcg gtcgatcgtg      540

cgcgactggg cgacaaagac gtatagctcg atcgaagccg caagagaagc actccttgtc      600

ggggcacaag tctcggaaga cgtttcggtc tggcgcggac tcctcgcaga aacgaagaac      660

gcacagaact tcgccgccct ctccgacgat cagatcgaag cagcgatgtc gaaggaggcg      720

aagggcgcgg acttgcgtcc gaggcgcgcc gcactgctgg tcgcacagcg ccactgggtg      780

gatcagaccg tcaaagcaat caaggagtcc gcaccgtccg gcgtcgacaa ggacactctc      840

gatcgccgtc tgcgcgcagg tctgaggggg tttcatactg cggccaactc aggcaagcac      900

acgaacccgc agttcccata cctcaccgca gagaagccgg tagtcccgat ggagtctgtt      960

gttcagagcg tattggcctt tctcgacgat ccagacgatc aaaggtacac gaaggacaaa     1020

gaagacgaca agaagcgcca ccgcgtcact gtcttgcaga aggagctcgg aaaggcgagg     1080

ccacgaaaac ggttagaact ccaaacgccg aaatgggccg gcaggcccac ggtaaaagga     1140

accatcagca aacggcgcga cgcagcgctc gtctgggaca caagcaaaga agcgaacggg     1200

ctttgtctcg cgctcccaat cgggggcatg ccgaagatag acgtcgagca gttcatctac     1260

caggatggga cgtcgctcct gtccgattgc cagatcgcat cgaaaacgac caagaagggc     1320

gcggcttgcg cagtcttgcc gctcaagccc aagcatgact tcctgcgctg gttcaccaag     1380

cacgtcgaga accacaatcc cgacgctcca ctggaacgca ggtgcctcca caacacgacc     1440

cagttcgtca tagtcgaccc agaagggccg cgcccacgtc tcttcgtccg gcccgtcttc     1500

aagttctacg accccggcaa gacggtgccg aacacgcatg aaacttggaa aaagcccgac     1560

tgccgctacc tggttggaat cgaccgaggc atcaattacg ttctgcgagc cgtcgtcgtc     1620

gatactgaag agaagaaggt tatcgccgat atcggcttgc cgggcaggaa gcacgaatgg     1680

aggatgatcc gtgacgagat cgcctaccac caacagatgc gtgatcttgc ccgcaacact     1740

ggcaaacacg cgagcgtcgt ggccaagcac gtccgcgccc tcgcgctcgc gcgcaagaag     1800

gaccgcgcgc tcggcaagtt cgcaacagtc gaagccgtcg cagaacttgt caagaagtgt     1860

gaacaggact atggtagcgg caactactgt ttcgtgctcg aagacctcga catgggggcg     1920

atgaatctca agcgaaacaa cagagtcaaa cacatggcgg tcatggagga ggccctcgtc     1980

aatcaaatgc gcaagcaggg ctatgcctat gacgggcgtc gcggtcgggt ggacggcgtg     2040

aggcacgagg gcgcttggta cacgagccag gtctcgccct ttggctggtg ggccaagcgc     2100

gacgaagtcg aggaggcgtg gaagagggac aagactcgcc ccatcgggcg caaggtcggc     2160

aactggtacg agatgcccga gccaggccaa gacggagacc ggcccgacac gtatcggaag     2220

ggctactggt cgaaaccgaa gaacgcggag ggcaagccgt atgggcgcaa ccgcttcagc     2280

gtcgagcctg gcgacgagaa gccggacgct gagcggcgct tctgctgggg cagcgagctg     2340

ttctgggatc cgaacgtgaa gtccttcaag ggcaaggagt ttcccgaggg cgtcgtgctg     2400

gacgccgact tcgtaggagc cctcaacatc gctctccgcc cgttggtcaa cgacggccag     2460

ggtaaaggct tcaaggccga ggacatggcg agggagcaca cgatactaaa cccgcagttc     2520

aagatcgcct gccagatacc agtttacgag ttcgtcgaag aggacggcga caagtgggca     2580

gctctgcgcc ggatcatgct atag                                            2604


<210>  12
<211>  2604
<212>  DNA
<213>  Armatimonadetes bacterium

<400>  12
atggcgaagg caaccaaaga agtcaagtcg aagcgcgtgg aagcgttgcg gcaggtggcg       60

tatcaacggc tggaacgcct cgagcggaag gctcagaaga tcggagcgca tctgcgcaag      120

ccgggaaaag ccgctgacct ccaatcactc cattatcttc ttcacaaggt cgaagtcgaa      180

tatcacgata tcgcaaggaa cctggagaag gacccgactt ggacaccaaa accgaaaatg      240

cgacgagaga agcgtgccat cgtgccggag tccggcccgg ctgcgcccct cccgaccacg      300

gcaaagggtg agccgggtag accggcaaac cgtcatattc cgccaccagt gccgctcgat      360

tcagcaagga tccccgaaga ccaacagtcg atgggccaag gaagcggggg gaggagttgg      420

tgttctgcgc ctttcgttga ggtgaagtta ccgccgactc aatggtcgaa tgtccgggag      480

aagcttctga aattccgaat tgaggacgac gccgacatcg tcaggcggtg ggccgaggcc      540

aagttcggaa gcatcgagac ggcgcgcgat ggattacgtg cgagcgcaga gatcggaacg      600

agcccggatg tctggcgttc cttcatcagc cgcgcgatct cgaacggcaa gaaggacttt      660

gagccacttc tctcgttgga cgatgacgaa ttgaccgcgg atgcaacagc cgagcgcgtt      720

gtgcgtcggt ggcatcagat tgactgggtg ggccgaatgc tcgactccat cctggaaacc      780

gtcccgtcgg gggtctcgaa agacacgttt cgaagcaggg tcgaatcgcg tctcaagacg      840

tttcactcgt ctgtgaacag cttcgagctc aagaagagga aggacggtac ggtcgagcgc      900

aagcggaagc acaccaaccc gcagtttccg tacttgtcac cgagcgcagt gagcatcgat      960

cctgatgttg tgactatgga ggcggtcgaa ctgctccaga tgcagcccga ggaacgcttt     1020

gcaaaggacc cgaacgatgc gaatggcaga atgaggctga gggttttgca ggcggaactc     1080

ggcaaagcac gacgcgaggc tctgggtcgg cggggcgaga aggccccgcc gtggagtggc     1140

cgcaaggtct ttcgcggaac cacgaccagg aagagggaag cgtgcctggt ttgggacaaa     1200

gaggcacaag cggatggact ttacttcgcg ctcgtgatgt cgggcggacc aaagatcgac     1260

gacaaacggt ttgtctacat ggacggtcag ccgctacaaa gcgattggca actgcacaac     1320

ggagtggccg gtaaggcaaa gtcatgcagg gcgatgcctc tcattttgaa gcatgacttc     1380

ctgcggtggt accaccgcca cattaagaac cacgacgtca atgctcccct cgaaaagcgg     1440

tgcgttcaca cgacgaccca gttcgttttc gtggagccgg acgaaaagaa gggccttcag     1500

ccccggctgt tcatcagacc cgtattcaag ttctacgatc cggtctatga agtgccggat     1560

agccactcga ttgacaagaa gccggactgc cgatatttga tcggaattga ccgaggcgtt     1620

aactacccct atcgtgccgc agtatacgat tgcgagacaa actccataat cgccgacaag     1680

ttcgtggacg gacgaaaggc agattgggag cggatacgaa atgaactcgc ataccaccag     1740

cggcgacgtg acctcctgcg caactcgcgt gcctcttccg ccgcaataca gcgagagatt     1800

cgagccattg cacggattcg caagagggag cgtgggctga acaaagtcga gacggtcgag     1860

agcatcgcgc ggctcgtcga ctgggcggaa gagaatctcg ggaagtgcaa ttactgcttc     1920

gttctcgaag acctttcttc aaacttgaat ctggggcgaa acaacagggt caagcacatt     1980

gccgcgatca aggaggcgct gatcaaccag atgcgcaagc gcggatatcg tttcaaaaag     2040

agcgggaaag ttgacggcgt gcgagaggag tccgcgtggt acacgagtgc cgttgcgcca     2100

tccggttggt gggcgaagaa ggaagaagtg gacggggcct ggaaagcgga caagacgcgg     2160

ccattggcga gaaagatcgg cagttactat tgctgcgaag aaatcgacgg actccatttg     2220

cgcggcgtgc tgaaggggct cggaagggcg aagcgactcg ttcttcaaag cgacgaccca     2280

tccgcgccga ctcgcagacg agggtttgga tcagagttgt tctgggaccc ctattgcacc     2340

gaactctgcg gccacgcttt cccgcaaggc gtcgtactgg acgcagactt catcggcgcc     2400

ttcaatattg cgctgcgacc gctggtgagg gaggaacttg ggaagaaggc gaaggccgtg     2460

gacctggccg acaggcacca gacgctcaat ccgacggttg ccctccgatg cggcgtaacg     2520

gcgtacgagt tcgtcgaagt cgggggcgat ccccggggcg gtctccgaaa aatcttgctc     2580

aatcccgcag aggccgtgat ataa                                            2604


<210>  13
<211>  854
<212>  PRT
<213>  Armatimonadetes bacterium

<400>  13

Met Pro Lys Lys Thr Ser Thr Val Ala Leu Ser Pro Arg Asp Ile Arg 
1               5                   10                  15      


Leu Arg Glu Leu Gly Glu Lys Arg Leu Gln Arg Leu Arg Gln Arg Glu 
            20                  25                  30          


Glu Lys Ile Arg Arg His Leu Glu Ser Glu Arg Gly Arg Arg Asp Phe 
        35                  40                  45              


Gln Ser Leu His Phe Leu Leu His Lys Ile Glu Val Glu Arg Asn Asp 
    50                  55                  60                  


Leu Tyr Arg Asn Leu Tyr Gln Asn Glu Gly His Glu Ser Tyr Val Pro 
65                  70                  75                  80  


Lys Pro Gly Lys Thr Lys His Arg Lys Glu Leu Ser Leu Pro Ser Thr 
                85                  90                  95      


Glu Leu Pro Ser Pro Pro Asp Glu Lys Lys Gly Pro Arg Pro Lys Lys 
            100                 105                 110         


Ser Arg Tyr Val Ile Pro Gln Pro Val Pro Gly Ile Asn Leu Pro Arg 
        115                 120                 125             


Leu Ile Asn Arg Phe Gly Lys Ser Asp Gln Lys Ser Glu Ser Asp Gln 
    130                 135                 140                 


Glu Gly Arg Phe Trp Thr Ser Ala Pro Phe Ile Glu Val Glu Leu Pro 
145                 150                 155                 160 


Met Leu Asn Ala His Arg Val Ile Lys Ala Leu Met Arg Phe Val Gln 
                165                 170                 175     


Lys Asp Glu Arg Ser Val Val Arg Thr Trp Ala Val Thr Lys Phe Gly 
            180                 185                 190         


Ser Ile Glu Ala Ala Arg Glu Val Leu Leu Ala Gly Ala Leu Leu Gln 
        195                 200                 205             


Arg Glu Pro Glu Ile Met Arg Gly Phe Leu Gln Asn Ile Asp Pro Trp 
    210                 215                 220                 


Gly Ser Leu Ser Asp Glu Glu Leu Ile Arg Asp Glu Lys Ala Trp Arg 
225                 230                 235                 240 


Thr Val Lys Leu Leu Ala Gln Lys Asn Trp Val Asp Gln Ile Ala Lys 
                245                 250                 255     


Ser Ile Lys Asp Ser Ala Pro Lys Gly Val Asp Lys Asp Thr Leu Asp 
            260                 265                 270         


Arg Arg Leu Arg Ser Gly Leu Lys Ala Phe His Ser Ala Ala Asn Ser 
        275                 280                 285             


Gly Lys His Thr Asn Pro Gln Phe Pro Tyr Leu Thr Ser Glu Lys Pro 
    290                 295                 300                 


Ser Ala Asn Phe Glu Ser Val Val Asp Ser Val Leu Glu Phe Leu Asp 
305                 310                 315                 320 


Leu Glu Asp Lys Asp Arg Tyr Thr Ile Ala Lys Val Asp Asp Lys Lys 
                325                 330                 335     


Arg His Arg Val Thr Ala Leu Gln Lys Glu Leu Gly Gln Ala Lys Pro 
            340                 345                 350         


Arg Val Arg Leu Glu Gln Glu Arg Ser Arg Trp Ala Gly His Ser Tyr 
        355                 360                 365             


Leu Gln Gly Thr Ile Thr Arg Lys Arg Gln Ala Ser Leu Val Trp Asp 
    370                 375                 380                 


Gly His Arg Thr Glu Asn Gly Leu Ala Leu Ala Ile Pro Leu Asp Gly 
385                 390                 395                 400 


Met Pro Lys Ile Asp Val Gln Arg Tyr Met Tyr Gln Asp Gly Thr Ser 
                405                 410                 415     


Leu Leu Ser Asp Arg Gln Ile Thr Ser Lys Thr Lys Ser Glu Gly Lys 
            420                 425                 430         


Asp Cys Ala Leu Met Pro Leu Arg Phe Lys His Ala Phe Leu Arg Trp 
        435                 440                 445             


Tyr Thr Lys His Val Glu Asn His Val Ala Glu Ala Pro Leu Glu Arg 
    450                 455                 460                 


Arg Cys Ile His Asn Thr Thr Gln Phe Val Ile Val Asp Pro Glu Gly 
465                 470                 475                 480 


Lys His Pro Arg Leu Phe Ile Arg Pro Val Phe Lys Phe Tyr Asp Ser 
                485                 490                 495     


Asn Lys Thr Ile Gln Asn Ser Asn Ala Pro Trp Cys Lys Pro Gln Cys 
            500                 505                 510         


Arg Tyr Leu Ile Gly Ile Asp Arg Gly Ile Asn Tyr Val Leu Arg Ala 
        515                 520                 525             


Val Val Val Asp Thr Glu Glu Lys Ala Val Ile Asp Asp Ile Pro Leu 
    530                 535                 540                 


Pro Gly Arg Lys Arg Glu Trp Arg Ala Ile Arg Gln Glu Ile Ala Tyr 
545                 550                 555                 560 


Phe Gln Arg Met Arg Asp Leu Ser Lys Ser Ala Gln Glu Arg Asn Arg 
                565                 570                 575     


Tyr Val Val Ala Leu Ala Lys Ala Arg Arg Lys Asp Arg Ser Leu Gly 
            580                 585                 590         


Lys Thr Glu Thr Val Glu Ala Val Ala Lys Leu Val Gln Asn Cys Ser 
        595                 600                 605             


Glu Arg Phe Gly Glu Gly Asn Tyr Cys Phe Val Leu Glu Asn Leu Glu 
    610                 615                 620                 


Leu Gly Ala Leu Asn Leu Lys Arg Asn Asn Arg Val Lys His Leu Ala 
625                 630                 635                 640 


Ser Met Glu Glu Ala Leu Ile Tyr Gln Met Arg Lys Arg Gly Tyr Phe 
                645                 650                 655     


Tyr Asn Ser Arg Ser Asn Arg Val Asp Gly Val Arg Trp Glu Ala Ala 
            660                 665                 670         


Arg Tyr Thr Ser Gln Val Ser Pro Phe Gly Trp Trp Ala Lys Arg Asp 
        675                 680                 685             


Glu Val Glu Lys Ala Lys Lys Gln Asp Lys Ser Met Ala Ile Gly Arg 
    690                 695                 700                 


Lys Ile Gly Glu Gly Tyr Glu Gly Pro Gln Asp Asp Glu Ile Glu Ser 
705                 710                 715                 720 


His Ser Thr Ile Tyr Arg Gln Gly Arg Trp Met Lys Leu Arg Asn Glu 
                725                 730                 735     


Glu Gly Lys Ala Tyr Gly Arg Ser Arg Phe Val Val Gln Pro Glu Asp 
            740                 745                 750         


Leu Asp Pro Ala Gln Pro Arg Arg Phe Ser Trp Gly Ser Glu Leu Phe 
        755                 760                 765             


Trp Asp Pro Tyr Gln Lys Glu Phe Lys Gly Lys Ser Phe Ser Gln Gly 
    770                 775                 780                 


Val Val Leu Asp Ala Asp Phe Val Gly Ala Leu Asn Ile Ala Leu Arg 
785                 790                 795                 800 


Pro Leu Val Asn Asp Gly Lys Gly Lys Gly Phe Thr Thr Ala Met Met 
                805                 810                 815     


Ala Glu Ala His Val Lys Leu Asn Pro Thr Phe Glu Ile Arg Cys Lys 
            820                 825                 830         


Ile Pro Val Tyr Glu Phe Ile Ala Glu Asn Asp Asn Ser Arg Ala Ala 
        835                 840                 845             


Leu Arg Arg Ile Val Ile 
    850                 


<210>  14
<211>  867
<212>  PRT
<213>  Armatimonadetes bacterium

<400>  14

Met Gly Lys Asn Arg Ser Ser Ser Ser Asp Leu Ser Pro Leu Glu Arg 
1               5                   10                  15      


Ser Leu Arg Lys Val Gly Glu Asn Arg Leu Glu Arg Leu Arg Val Arg 
            20                  25                  30          


Glu Glu Lys Ile Arg Lys His Ile Glu Gln His Pro Arg Gly Lys Asn 
        35                  40                  45              


Asp His Gln Ala Leu His Phe Leu Leu His Gln Ile Glu Val Glu Arg 
    50                  55                  60                  


Asn Asp Leu Tyr Arg Asn Leu Lys Asp Pro Glu Tyr Val Pro Lys Pro 
65                  70                  75                  80  


Ala Lys Gln Arg Arg Glu Arg Arg Gln Ile Asn Val Ala Lys Pro Pro 
                85                  90                  95      


Thr Arg Pro Lys Lys Glu Lys Gly Pro Gln Pro Glu Ser Thr Lys Tyr 
            100                 105                 110         


Val Ile Arg Pro Pro Val Pro Gly Lys Asn Leu Pro Ala Phe Ala Ser 
        115                 120                 125             


Lys Tyr Glu Ala Arg Asp Thr Arg Asp Asp Ser Tyr Gln Asp Gly Arg 
    130                 135                 140                 


Ser Trp Thr Ser Ala Pro Tyr Val Glu Val Glu Leu Pro Ile Leu Gly 
145                 150                 155                 160 


Ala Asp Lys Val Ile Gln Lys Leu Met Lys Phe Val Gln Lys Asp Glu 
                165                 170                 175     


Arg Ser Ile Val Arg Asp Trp Ala Thr Lys Thr Tyr Ser Ser Ile Glu 
            180                 185                 190         


Ala Ala Arg Glu Ala Leu Leu Val Gly Ala Gln Val Ser Glu Asp Val 
        195                 200                 205             


Ser Val Trp Arg Gly Leu Leu Ala Glu Thr Lys Asn Ala Gln Asn Phe 
    210                 215                 220                 


Ala Ala Leu Ser Asp Asp Gln Ile Glu Ala Ala Met Ser Lys Glu Ala 
225                 230                 235                 240 


Lys Gly Ala Asp Leu Arg Pro Arg Arg Ala Ala Leu Leu Val Ala Gln 
                245                 250                 255     


Arg His Trp Val Asp Gln Thr Val Lys Ala Ile Lys Glu Ser Ala Pro 
            260                 265                 270         


Ser Gly Val Asp Lys Asp Thr Leu Asp Arg Arg Leu Arg Ala Gly Leu 
        275                 280                 285             


Arg Gly Phe His Thr Ala Ala Asn Ser Gly Lys His Thr Asn Pro Gln 
    290                 295                 300                 


Phe Pro Tyr Leu Thr Ala Glu Lys Pro Val Val Pro Met Glu Ser Val 
305                 310                 315                 320 


Val Gln Ser Val Leu Ala Phe Leu Asp Asp Pro Asp Asp Gln Arg Tyr 
                325                 330                 335     


Thr Lys Asp Lys Glu Asp Asp Lys Lys Arg His Arg Val Thr Val Leu 
            340                 345                 350         


Gln Lys Glu Leu Gly Lys Ala Arg Pro Arg Lys Arg Leu Glu Leu Gln 
        355                 360                 365             


Thr Pro Lys Trp Ala Gly Arg Pro Thr Val Lys Gly Thr Ile Ser Lys 
    370                 375                 380                 


Arg Arg Asp Ala Ala Leu Val Trp Asp Thr Ser Lys Glu Ala Asn Gly 
385                 390                 395                 400 


Leu Cys Leu Ala Leu Pro Ile Gly Gly Met Pro Lys Ile Asp Val Glu 
                405                 410                 415     


Gln Phe Ile Tyr Gln Asp Gly Thr Ser Leu Leu Ser Asp Cys Gln Ile 
            420                 425                 430         


Ala Ser Lys Thr Thr Lys Lys Gly Ala Ala Cys Ala Val Leu Pro Leu 
        435                 440                 445             


Lys Pro Lys His Asp Phe Leu Arg Trp Phe Thr Lys His Val Glu Asn 
    450                 455                 460                 


His Asn Pro Asp Ala Pro Leu Glu Arg Arg Cys Leu His Asn Thr Thr 
465                 470                 475                 480 


Gln Phe Val Ile Val Asp Pro Glu Gly Pro Arg Pro Arg Leu Phe Val 
                485                 490                 495     


Arg Pro Val Phe Lys Phe Tyr Asp Pro Gly Lys Thr Val Pro Asn Thr 
            500                 505                 510         


His Glu Thr Trp Lys Lys Pro Asp Cys Arg Tyr Leu Val Gly Ile Asp 
        515                 520                 525             


Arg Gly Ile Asn Tyr Val Leu Arg Ala Val Val Val Asp Thr Glu Glu 
    530                 535                 540                 


Lys Lys Val Ile Ala Asp Ile Gly Leu Pro Gly Arg Lys His Glu Trp 
545                 550                 555                 560 


Arg Met Ile Arg Asp Glu Ile Ala Tyr His Gln Gln Met Arg Asp Leu 
                565                 570                 575     


Ala Arg Asn Thr Gly Lys His Ala Ser Val Val Ala Lys His Val Arg 
            580                 585                 590         


Ala Leu Ala Leu Ala Arg Lys Lys Asp Arg Ala Leu Gly Lys Phe Ala 
        595                 600                 605             


Thr Val Glu Ala Val Ala Glu Leu Val Lys Lys Cys Glu Gln Asp Tyr 
    610                 615                 620                 


Gly Ser Gly Asn Tyr Cys Phe Val Leu Glu Asp Leu Asp Met Gly Ala 
625                 630                 635                 640 


Met Asn Leu Lys Arg Asn Asn Arg Val Lys His Met Ala Val Met Glu 
                645                 650                 655     


Glu Ala Leu Val Asn Gln Met Arg Lys Gln Gly Tyr Ala Tyr Asp Gly 
            660                 665                 670         


Arg Arg Gly Arg Val Asp Gly Val Arg His Glu Gly Ala Trp Tyr Thr 
        675                 680                 685             


Ser Gln Val Ser Pro Phe Gly Trp Trp Ala Lys Arg Asp Glu Val Glu 
    690                 695                 700                 


Glu Ala Trp Lys Arg Asp Lys Thr Arg Pro Ile Gly Arg Lys Val Gly 
705                 710                 715                 720 


Asn Trp Tyr Glu Met Pro Glu Pro Gly Gln Asp Gly Asp Arg Pro Asp 
                725                 730                 735     


Thr Tyr Arg Lys Gly Tyr Trp Ser Lys Pro Lys Asn Ala Glu Gly Lys 
            740                 745                 750         


Pro Tyr Gly Arg Asn Arg Phe Ser Val Glu Pro Gly Asp Glu Lys Pro 
        755                 760                 765             


Asp Ala Glu Arg Arg Phe Cys Trp Gly Ser Glu Leu Phe Trp Asp Pro 
    770                 775                 780                 


Asn Val Lys Ser Phe Lys Gly Lys Glu Phe Pro Glu Gly Val Val Leu 
785                 790                 795                 800 


Asp Ala Asp Phe Val Gly Ala Leu Asn Ile Ala Leu Arg Pro Leu Val 
                805                 810                 815     


Asn Asp Gly Gln Gly Lys Gly Phe Lys Ala Glu Asp Met Ala Arg Glu 
            820                 825                 830         


His Thr Ile Leu Asn Pro Gln Phe Lys Ile Ala Cys Gln Ile Pro Val 
        835                 840                 845             


Tyr Glu Phe Val Glu Glu Asp Gly Asp Lys Trp Ala Ala Leu Arg Arg 
    850                 855                 860                 


Ile Met Leu 
865         


<210>  15
<211>  867
<212>  PRT
<213>  Armatimonadetes bacterium

<400>  15

Met Ala Lys Ala Thr Lys Glu Val Lys Ser Lys Arg Val Glu Ala Leu 
1               5                   10                  15      


Arg Gln Val Ala Tyr Gln Arg Leu Glu Arg Leu Glu Arg Lys Ala Gln 
            20                  25                  30          


Lys Ile Gly Ala His Leu Arg Lys Pro Gly Lys Ala Ala Asp Leu Gln 
        35                  40                  45              


Ser Leu His Tyr Leu Leu His Lys Val Glu Val Glu Tyr His Asp Ile 
    50                  55                  60                  


Ala Arg Asn Leu Glu Lys Asp Pro Thr Trp Thr Pro Lys Pro Lys Met 
65                  70                  75                  80  


Arg Arg Glu Lys Arg Ala Ile Val Pro Glu Ser Gly Pro Ala Ala Pro 
                85                  90                  95      


Leu Pro Thr Thr Ala Lys Gly Glu Pro Gly Arg Pro Ala Asn Arg His 
            100                 105                 110         


Ile Pro Pro Pro Val Pro Leu Asp Ser Ala Arg Ile Pro Glu Asp Gln 
        115                 120                 125             


Gln Ser Met Gly Gln Gly Ser Gly Gly Arg Ser Trp Cys Ser Ala Pro 
    130                 135                 140                 


Phe Val Glu Val Lys Leu Pro Pro Thr Gln Trp Ser Asn Val Arg Glu 
145                 150                 155                 160 


Lys Leu Leu Lys Phe Arg Ile Glu Asp Asp Ala Asp Ile Val Arg Arg 
                165                 170                 175     


Trp Ala Glu Ala Lys Phe Gly Ser Ile Glu Thr Ala Arg Asp Gly Leu 
            180                 185                 190         


Arg Ala Ser Ala Glu Ile Gly Thr Ser Pro Asp Val Trp Arg Ser Phe 
        195                 200                 205             


Ile Ser Arg Ala Ile Ser Asn Gly Lys Lys Asp Phe Glu Pro Leu Leu 
    210                 215                 220                 


Ser Leu Asp Asp Asp Glu Leu Thr Ala Asp Ala Thr Ala Glu Arg Val 
225                 230                 235                 240 


Val Arg Arg Trp His Gln Ile Asp Trp Val Gly Arg Met Leu Asp Ser 
                245                 250                 255     


Ile Leu Glu Thr Val Pro Ser Gly Val Ser Lys Asp Thr Phe Arg Ser 
            260                 265                 270         


Arg Val Glu Ser Arg Leu Lys Thr Phe His Ser Ser Val Asn Ser Phe 
        275                 280                 285             


Glu Leu Lys Lys Arg Lys Asp Gly Thr Val Glu Arg Lys Arg Lys His 
    290                 295                 300                 


Thr Asn Pro Gln Phe Pro Tyr Leu Ser Pro Ser Ala Val Ser Ile Asp 
305                 310                 315                 320 


Pro Asp Val Val Thr Met Glu Ala Val Glu Leu Leu Gln Met Gln Pro 
                325                 330                 335     


Glu Glu Arg Phe Ala Lys Asp Pro Asn Asp Ala Asn Gly Arg Met Arg 
            340                 345                 350         


Leu Arg Val Leu Gln Ala Glu Leu Gly Lys Ala Arg Arg Glu Ala Leu 
        355                 360                 365             


Gly Arg Arg Gly Glu Lys Ala Pro Pro Trp Ser Gly Arg Lys Val Phe 
    370                 375                 380                 


Arg Gly Thr Thr Thr Arg Lys Arg Glu Ala Cys Leu Val Trp Asp Lys 
385                 390                 395                 400 


Glu Ala Gln Ala Asp Gly Leu Tyr Phe Ala Leu Val Met Ser Gly Gly 
                405                 410                 415     


Pro Lys Ile Asp Asp Lys Arg Phe Val Tyr Met Asp Gly Gln Pro Leu 
            420                 425                 430         


Gln Ser Asp Trp Gln Leu His Asn Gly Val Ala Gly Lys Ala Lys Ser 
        435                 440                 445             


Cys Arg Ala Met Pro Leu Ile Leu Lys His Asp Phe Leu Arg Trp Tyr 
    450                 455                 460                 


His Arg His Ile Lys Asn His Asp Val Asn Ala Pro Leu Glu Lys Arg 
465                 470                 475                 480 


Cys Val His Thr Thr Thr Gln Phe Val Phe Val Glu Pro Asp Glu Lys 
                485                 490                 495     


Lys Gly Leu Gln Pro Arg Leu Phe Ile Arg Pro Val Phe Lys Phe Tyr 
            500                 505                 510         


Asp Pro Val Tyr Glu Val Pro Asp Ser His Ser Ile Asp Lys Lys Pro 
        515                 520                 525             


Asp Cys Arg Tyr Leu Ile Gly Ile Asp Arg Gly Val Asn Tyr Pro Tyr 
    530                 535                 540                 


Arg Ala Ala Val Tyr Asp Cys Glu Thr Asn Ser Ile Ile Ala Asp Lys 
545                 550                 555                 560 


Phe Val Asp Gly Arg Lys Ala Asp Trp Glu Arg Ile Arg Asn Glu Leu 
                565                 570                 575     


Ala Tyr His Gln Arg Arg Arg Asp Leu Leu Arg Asn Ser Arg Ala Ser 
            580                 585                 590         


Ser Ala Ala Ile Gln Arg Glu Ile Arg Ala Ile Ala Arg Ile Arg Lys 
        595                 600                 605             


Arg Glu Arg Gly Leu Asn Lys Val Glu Thr Val Glu Ser Ile Ala Arg 
    610                 615                 620                 


Leu Val Asp Trp Ala Glu Glu Asn Leu Gly Lys Cys Asn Tyr Cys Phe 
625                 630                 635                 640 


Val Leu Glu Asp Leu Ser Ser Asn Leu Asn Leu Gly Arg Asn Asn Arg 
                645                 650                 655     


Val Lys His Ile Ala Ala Ile Lys Glu Ala Leu Ile Asn Gln Met Arg 
            660                 665                 670         


Lys Arg Gly Tyr Arg Phe Lys Lys Ser Gly Lys Val Asp Gly Val Arg 
        675                 680                 685             


Glu Glu Ser Ala Trp Tyr Thr Ser Ala Val Ala Pro Ser Gly Trp Trp 
    690                 695                 700                 


Ala Lys Lys Glu Glu Val Asp Gly Ala Trp Lys Ala Asp Lys Thr Arg 
705                 710                 715                 720 


Pro Leu Ala Arg Lys Ile Gly Ser Tyr Tyr Cys Cys Glu Glu Ile Asp 
                725                 730                 735     


Gly Leu His Leu Arg Gly Val Leu Lys Gly Leu Gly Arg Ala Lys Arg 
            740                 745                 750         


Leu Val Leu Gln Ser Asp Asp Pro Ser Ala Pro Thr Arg Arg Arg Gly 
        755                 760                 765             


Phe Gly Ser Glu Leu Phe Trp Asp Pro Tyr Cys Thr Glu Leu Cys Gly 
    770                 775                 780                 


His Ala Phe Pro Gln Gly Val Val Leu Asp Ala Asp Phe Ile Gly Ala 
785                 790                 795                 800 


Phe Asn Ile Ala Leu Arg Pro Leu Val Arg Glu Glu Leu Gly Lys Lys 
                805                 810                 815     


Ala Lys Ala Val Asp Leu Ala Asp Arg His Gln Thr Leu Asn Pro Thr 
            820                 825                 830         


Val Ala Leu Arg Cys Gly Val Thr Ala Tyr Glu Phe Val Glu Val Gly 
        835                 840                 845             


Gly Asp Pro Arg Gly Gly Leu Arg Lys Ile Leu Leu Asn Pro Ala Glu 
    850                 855                 860                 


Ala Val Ile 
865         


<210>  16
<211>  8727
<212>  DNA
<213>  Armatimonadetes bacterium

<400>  16
cgcatctggg gctaaccctt tttccgtcac cgctcacccg gatcgcggac cagcgccaca       60

cgatacgcgc cgcttcccgt aatcgccatt aggaccgtca gcagcccctt ccccggaacg      120

tttcccgcct caaaccgcac cgtctccccc gccgcaatct cgaaggtctc cgtaagcgag      180

taaagccgat tcgtcacctt gatgcggtgc tctcccggct ctaacggagc ttcaaacgac      240

tcgccaaacg ccaacgtctc ctccggccaa tcgtccaccc gaaggtagag atcgcgaatc      300

ccaatgtccg tcgacatcgt ccgatccacc accaacaacc ccccatcccc ctcaccacca      360

tcaaccatcc ccccattctt tcaccatcgg cgcccagact gagcctttcg gcaatagtat      420

cgactgctca tcgatccaat ttttcagtgc ctttgatccg gcattgacac cgttggccct      480

tctgtgatag tacattctcc atgcccaaaa agacatcgac ggtcgctctg tcacccagag      540

atattcgctt gcgcgaactt ggagagaagc gacttcaaag gttgcgacag cgcgaagaga      600

agattcgtcg tcacctggag tcggagcgcg ggcggcgtga ctttcaatcg ctgcactttc      660

ttcttcataa aattgaagtt gaacgaaacg atctgtaccg aaatctttac cagaacgaag      720

ggcacgagtc gtatgtgcca aagccaggta agacgaaaca taggaaagaa ctttctttgc      780

catccacaga gttaccgtct ccacctgatg agaagaaagg gccccggccc aaaaagagtc      840

gctatgtgat tccccagccc gtacctggaa tcaatctccc acgattgatc aatagattcg      900

gtaaatcgga tcaaaagtcc gaatcggatc aagaaggcag attttggact tcagcgcctt      960

tcatcgaagt tgagctgcct atgcttaatg cccatcgagt cataaaggcg ttgatgcgat     1020

tcgttcagaa agacgagcgt tcggttgtcc gaacatgggc tgtgaccaag tttggcagca     1080

ttgaggccgc cagagaagtc ctgctagcag gagctttgct gcaaagagag ccggaaatca     1140

tgagaggctt cctccagaat attgacccct gggggagttt gagcgatgag gaactcattc     1200

gcgatgaaaa agcgtggcgg acggtgaagc tcctagccca aaagaattgg gtggatcaaa     1260

tcgcgaagtc gatcaaggac tcggcgccta agggcgtaga taaagacact ttggatcgtc     1320

gcctgcggag tggcttaaaa gcattccatt ctgcggcaaa ttcaggaaaa cacacgaatc     1380

cccagtttcc ctatttgaca tcggagaaac cgtccgcgaa ctttgaatca gttgtcgact     1440

ctgtgcttga gttcctcgat ctggaggaca aggatcgata cacgattgcg aaggttgacg     1500

acaagaaacg ccaccgagtg acggctctgc aaaaggagct aggccaagcc aaaccacgtg     1560

taaggttgga gcaggaacgt agcaggtggg ctggccactc gtatctccaa gggaccatta     1620

ccaggaaaag gcaggcttcc ctcgtttggg atggtcaccg aacggagaac ggtttggctc     1680

tcgccatccc attagatggc atgccgaaaa ttgacgtgca gcgatatatg tatcaagatg     1740

gcacctccct tctctcggat cggcaaatta cttccaagac caagtccgag ggtaaggact     1800

gtgccttgat gcctctacga tttaagcatg cctttcttcg atggtatacg aaacacgtcg     1860

aaaatcacgt ggccgaggcc cctttggaac ggcgatgcat tcataacaca acgcagtttg     1920

tcatcgtcga cccagaagga aagcatcctc ggctgttcat ccgacctgtc ttcaaattct     1980

atgactctaa taagacaata cagaacagta acgccccctg gtgcaaaccg cagtgtcgat     2040

accttatcgg cattgatcgg ggcatcaact acgtgctacg agcggttgtt gtagatactg     2100

aagagaaagc cgtaatcgac gacatccctc taccgggtcg aaagcgggag tggcgagcca     2160

ttcggcaaga gatcgcgtac tttcagcgca tgcgggacct ctccaagagt gcccaagaaa     2220

ggaaccgcta tgtggttgcg cttgccaaag ctcgtaggaa agatcgcagc ttgggcaaga     2280

cagagacggt tgaggctgtt gcaaaacttg ttcagaattg cagcgagaga tttggggaag     2340

gaaactactg cttcgtgttg gagaaccttg agttaggtgc tcttaactta aaaagaaata     2400

accgagttaa acatctcgct tccatggaag aggcattgat ctatcagatg cggaaaagag     2460

gctacttcta caactcccga tcgaaccggg tggatggtgt tcgttgggaa gccgcacgct     2520

atacaagtca agtttctccg tttggctggt gggccaagcg cgatgaagtg gagaaggcca     2580

agaaacaaga taaaagcatg gcgattggcc gcaagattgg cgagggatat gaaggtccgc     2640

aggacgatga aatagaaagt cattcgacta tctatcggca gggcagatgg atgaaactca     2700

gaaatgaaga aggaaaggcc tacggaagaa gtcggtttgt ggttcagccg gaagacttgg     2760

accctgcaca acccagaagg ttcagttggg gaagtgaact tttctgggat ccctatcaaa     2820

aggaatttaa aggaaagtcc ttctctcaag gcgttgtgtt ggatgctgat tttgtgggag     2880

ccctaaacat tgcccttcgg ccactagtca acgacggcaa aggcaaaggt ttcaccaccg     2940

cgatgatggc ggaagcacat gtcaagttga acccgacctt tgagatccgt tgcaagatcc     3000

cggtttacga atttatcgct gagaacgaca attctcgtgc cgcgctgaga aggattgtga     3060

tatagtttct aagttccatc tcgatgcgga acggatacta cgctgtagtc tatacgacac     3120

gagtgatagc cctgcggggt tcgcccctaa gtccgtatga caagcctctt tcaggcggtg     3180

gacttctaag agtgctggtg ggtggaatcc ctaagccacc cacctccttc acaatctggc     3240

agaatcacga gttccattaa gaaaaggtga cgttaaccta atttgtgcgt tacgaaattg     3300

ttgatggtta tggtagccag gtcctcaagc atagcgagcg tctcgtactt cgcatccctt     3360

ctgccctaga taagaatatt aagcgggaag tgcctgtcct ccatctggac catctgcttg     3420

tggggacaaa gggagtgctc gtatcgagcg atgcccttgc cttatgctgc gagcgaggta     3480

ttcccgttac ggttgtcgat tggagggggc gtcccgtagg aaggttcggt agccccgcgc     3540

ttcacggctc cgcccatatt cgccgagctc agctcgaggc ctttgacgcc aatttaggcg     3600

ctgagttcgc tcgggaagtc gtctgcggga agctcttgaa tcaagctgac aatcttcgat     3660

attttggcaa aaatcgaaag actcgcgacc ccgcgcaaca cgaactgctt gagacctcgg     3720

cagatgatat caatgagatt tcaaaaagag cttcctgtat ctctgaaaag tgcgcgaata     3780

cagcacggtt gccgttgatg accttagaag ccgaaggcgc tcgtatctat tggtccgccc     3840

tctcgcgcct ctatgggaca agatccggat tcgctcggcg ggagcaaaga ggcacgagag     3900

accccgtaaa cgccgcgctc aactacgcct atggagtact caatggcgag gtgtggaacg     3960

ccgtggtcct ggctgggcta gagccatacg ccggattgtt gcacgttgac cggcccggaa     4020

gactttcgtt tgtcttggac ctgatggaag agtttcgccc gattatcgcg gatcgcctcg     4080

tctttggtct tgcggctaag ggttggaaaa tcggtcaaga ggagaacgga tggctggatt     4140

ttgccaccaa gcagagactg cttaaagcga tttccgagcg gtgggatgca cgcgttaact     4200

atcaaggacg aaaaattcgc ctgagaagtg tgcttcaact tcaagccaga gatgccgctc     4260

gccacttcct cggtcgcgcc caatatcgtg cttttcgaca aaggtggtaa tgagatgaag     4320

tggctcgttt gttacgatat cgaaaaggac agcgttcggc agaagattgc cgacttttgt     4380

ctggataaag gactcgaacg tgtccagtac agcgtcttcc tgggcgatat gaatcaaacg     4440

ttggcgtttg atctcgccgc tcagattcgt cgtcgaatgg gtgatcatcc aggtcaagtc     4500

cgatttattc ctatctgcga ccgtgattgg aaaaagacct ttcggattca gataggcaac     4560

tacatgggag ttaagcctag caatggtaaa tgaaactcat ccctacgcgc aggcggatat     4620

tgttagtgtc agtgaattaa ggcagtggag ctattgtccc cgggtcgttt ggtatggccg     4680

ttcaatgggc gattaccgcc caacaagcgg agctatgaaa gcaggcattg atgcagaggc     4740

agagcgtcaa aggttggaaa ctcgccgagg tttttcgcag tatgggatca ccgccgttga     4800

caaaagattc cagttttcag ttcgttcgga ttcgcttggg cttgccggaa gaatagattg     4860

cctgatcgag acaaccgatg taacttttga ggaagcccaa gcgggcctac gccccaacga     4920

atggaattgg gatgatcctc tctttgtgcc cgtggagtac aagacaacgt tccgggttca     4980

acaaaagcac aacgtcatgc agcttgcggc ttatgcccga atgctggaga gtctaacagg     5040

aactgccgtg ccgttcggat tcattgtgat gctgcccgaa gaagaggttc ttaagattga     5100

gataagttcc gagatcaagc ggtcactcga cttcttaatc gaagaagttc aaagaggctt     5160

ggtgtcggat gaacttccca ggcccacccc gcactcgggc aagtgtcaaa actgcgagtt     5220

tcgtcgattt tgcaacgacg tttggtgaga ccttcgaaac cttgttgatg gccgattgca     5280

tcggcagatg tgcgaatctg aactacatcg ttctgcgaat tgagcctcgt gagcgctccc     5340

aggtagatca gttcgcctga tgttcgagct ttcctaacgt caaattcttt gattctatgc     5400

tgtatcctaa tatcgaacca actgttggct tcgcacatca aggcaccttg ataggttaat     5460

tgtggatttc gaaaattcgc ccttctactt gttagggaca agttgggcga gagtttgtat     5520

gacgcctcag ttcaaatgag aacggtcgca tgaggcgaga agcactctta gggaatgaaa     5580

gcgctcttgg cgtgccatgc ccccgagttg tgggcggtca gtcgcatgag gcgagaagca     5640

ctcttaggga atgaaaggcc attcctcctc aggattcaga agctcgggga attgtcgcat     5700

gaggcgagaa gcactcttag ggaatgaaag ttcaagaagg catcagacga gtacttcgcc     5760

gcccgtcgca tgaggcgaga agcactctta gggaatgaaa gaagattcgc gctattacta     5820

cgcggttctg gtgctgacgt cgcatgaggc gagaagcact cttaggaaat gaaagttcca     5880

tcgtccatcg gtgcccgagg ggtccaaact tcgtcgcatg aggcgagaag cactcttagg     5940

gaatgaaagc gagatcggtt cgagatcgtc cttgggcttt gcccgtcgca tgaggcgaga     6000

agcactctta gggaatgaaa gaatcaatga tccccggaca ctctttagga gtgtccatgt     6060

cgcatgaggc gagaagcact cttagggaat gaaagtggaa gatctgaacg tcgcgctgct     6120

taaatggcag tgtcgcatga ggcgagaagc actcttaggg aatgaaagct cttgagcttt     6180

ggccgcttcc aggtcagtat cagtcgcatg aggcgagaag cactcttagg gaatgaaaga     6240

tgatcaaatg cactcgcaag gattgtgaca actggatcgt cgcatgaggc gagaagcact     6300

cttagggaat gaaagaacga gcaaccggca gtagagcttc ttgagactga tgtcgcatga     6360

ggcgagaagc actcttaggg aatgaaaggt gagaatgcca tgacgatgga cgcgcggcgg     6420

tcgtcgcatg aggcgagaag cactcttagg gaatgaaaga aatcgagttg tcaaggatta     6480

cttgacaatt gagaaagtcg catgaggcga gaagcactct tagggaatga aagtgtttcg     6540

cagattcagg caggacttgc gacatcttca gcattgtcgc atgaggcgag aagcactctt     6600

agggaatgaa agagcagccg ccgacccgtt gcccctccgg actccagcgt cgcatgaggc     6660

gagaagcact cttagggaat gaaagcccga agaggcggta aagagcccca tctctcgcaa     6720

ggtcgcatga ggcgagaagc actcttaggg aatgaaaggt ggtcaccgat ggtggagtca     6780

agaaaacagt cggtagtcgc atgaggcgag aagcactctt agggaatgaa agaagagagc     6840

gatgcggcct gtccaaagta gctccatccg tcgcttgagg cgagaagcac tcttagggaa     6900

tgaaagacct acgcttgctt ggtcgccagt tgtttctagt cgcatgaggc gagaagcact     6960

cttagggaat gaaagcgcga agtcgtcttc gtggtagcgg tggaacatga cgtcgcatga     7020

ggcgagaagc actcttaggg aatgaaagtt gtaccagtcc gggcgtgtgc cgatctcctg     7080

ggttggtcgc atgaggcgag aagcactctt agggaatgaa agaacttttc gtccagtcgt     7140

cacgtcgatg atatcgaaaa gtcgcatgag gcgagaagca ctcttaggga atgaaaggga     7200

ggaagcgctc ccgctccggg tccttgagcg aggtcgcatg aggcgagaag cactcttagg     7260

gaatgaaaga ttgtgccgcc aggtgcatcc cctggactgc ttatggcgtc gcatgaggcg     7320

agaagcactc ttagggaatg aaagcctcac gcaaaacgtt tatgcccccg gtctcgcttc     7380

ctgtcgcatg aggcgagaag cactcttagg gaatgaaagg ctcgatcaag gccatggcca     7440

aggaactcgg tgtccgtcgc atgaggcgag aagcactctt agggaatgaa agcgcacaac     7500

tcgggctaga atcaagcggg gatgccgagt cgcatgaggc gagaagcact cttagggaat     7560

gaaagaactt gcgacctgtg gacgtgcctg caaaaagtct gtgtcgcatg aggcgagaag     7620

cactcttagg gaatgaaagt tggacctcag tgtctaagtt gttggacgca agcgtagcat     7680

gaggcgagaa gcactcttag ggaatgaaag ttatgacgca gcgagggctc acctttgatc     7740

gctacgtcgc atgaggcgag aagcactctt agggaatgaa agaagcaaaa cgaagatcgt     7800

tgccctcgtc gactctggtc gtcgcatgag gcgagaagca ctcttaggga atgaaagtgg     7860

agaggctggc tacatcgttg ggaacaacct gggtcgcatg aggcgagaag cactcttagg     7920

gaatgaaaga acgtctgtaa gtggggctga tcttgcggtg ttttatgtcg catgaggcga     7980

gaagcactct tagggaatga aagctagtag ttcgtcgatg ttcatctcaa ctcccaggcc     8040

gtcgcatgag gcgagaagca ctcttaggga atgaaaggca gaacatttcg aactcggcgg     8100

cgacttacat cctggtcgca tgaggcgaga agcactctta gggaatgaaa gttacagata     8160

gggcaaacaa atttccagtc atagaagtcg gtcgcatgag gcgagaagca ctcttagggg     8220

cacctgatct tagcaactgg tcgcaacaac ctcgcacggc gaggttgttg ttgatttggg     8280

ggtaggcata acgcgtaatc gcgtagcccg ggccgagtcg tcgtgccaag gcgcaattgc     8340

aaagctaagc taggccaagc aaagcaaagc ggagcaaacg agcgggcccg ccccgccaga     8400

cccctagtct gctaaaattc caagtaatga gcttcagcgt ccgactatcc accgacctga     8460

tcgcggtaga ggccggcgcc accgtgcccc tctccatcga agtctcgaac aaagggaccg     8520

agacggatcg ctatgagctc caaatcgagg gcattgatgc cgaatggacg gccgtgccgg     8580

aggccgtctt cgtcgtcgat tcaggcgaac ttcacacgga aaaggtcttc tttaaggtcc     8640

cccgcaccag cgagagcctg gccgggaatt acccctttgt cgtcaaggtg cgctcgctca     8700

actcaggcga cgcccgtacc gctcaag                                         8727


<210>  17
<211>  7476
<212>  DNA
<213>  Armatimonadetes bacterium

<400>  17
tcgggagctg aagatcgccc gcgatccgga ctgcccggtg tgcggcgact cgccgacgat       60

ccgccagctc atcgactacc cctcgttctg cggcttgccc ggctacggct ccgtcccctc      120

tcacttcgac cacgagatcg aacctgccga actgaggtcc ctgctcgatt cggggggcgg      180

attcgtcctg ctcgacgttc gagagcccca tgagttcgag atcaaccgga tcgaaggctc      240

tcggttgctg cctctgggcg atctgcccgc gagggttcat gaactggatt cggccgacga      300

aatcgtggtt tattgcctga tgggctcgcg cagcgccgac gcatgcaggt tcctccgcgc      360

ggcgggtttt cagaaggtgc gcaacctccg cggcggcatc cgctcgtgga tcgatgaggt      420

cgagccgggg atggtgaagt attgagtccc gcgttcagtg cgggttgcaa ctcccaattg      480

gttgggggta acaagcaagg atgggtaaga atcgatcctc gtcctcggat ttgagcccgc      540

tcgaacggtc cttgcggaag gtcggtgaga atcgccttga gcggctgcgg gtgcgagagg      600

agaagattag gaagcacata gaacagcacc cccgcggtaa gaacgatcat caggctctcc      660

acttcttatt gcaccaaatc gaggtcgagc gtaacgacct gtaccgaaac ctcaaagacc      720

ccgagtacgt gcccaaacca gcgaaacagc ggcgcgaaag acggcagatc aacgtcgcca      780

aacccccgac tcgaccaaag aaggaaaagg ggcctcaacc agagtcgacg aagtacgtga      840

tccgtccacc agtccctggg aaaaaccttc ctgcctttgc tagcaagtac gaggcgcgag      900

acacgcggga cgattcctac caggacggtc gctcatggac ctccgcacca tatgttgaag      960

tcgaacttcc catccttggt gcagacaaag tcatccagaa actgatgaag ttcgtgcaga     1020

aggacgagcg gtcgatcgtg cgcgactggg cgacaaagac gtatagctcg atcgaagccg     1080

caagagaagc actccttgtc ggggcacaag tctcggaaga cgtttcggtc tggcgcggac     1140

tcctcgcaga aacgaagaac gcacagaact tcgccgccct ctccgacgat cagatcgaag     1200

cagcgatgtc gaaggaggcg aagggcgcgg acttgcgtcc gaggcgcgcc gcactgctgg     1260

tcgcacagcg ccactgggtg gatcagaccg tcaaagcaat caaggagtcc gcaccgtccg     1320

gcgtcgacaa ggacactctc gatcgccgtc tgcgcgcagg tctgaggggg tttcatactg     1380

cggccaactc aggcaagcac acgaacccgc agttcccata cctcaccgca gagaagccgg     1440

tagtcccgat ggagtctgtt gttcagagcg tattggcctt tctcgacgat ccagacgatc     1500

aaaggtacac gaaggacaaa gaagacgaca agaagcgcca ccgcgtcact gtcttgcaga     1560

aggagctcgg aaaggcgagg ccacgaaaac ggttagaact ccaaacgccg aaatgggccg     1620

gcaggcccac ggtaaaagga accatcagca aacggcgcga cgcagcgctc gtctgggaca     1680

caagcaaaga agcgaacggg ctttgtctcg cgctcccaat cgggggcatg ccgaagatag     1740

acgtcgagca gttcatctac caggatggga cgtcgctcct gtccgattgc cagatcgcat     1800

cgaaaacgac caagaagggc gcggcttgcg cagtcttgcc gctcaagccc aagcatgact     1860

tcctgcgctg gttcaccaag cacgtcgaga accacaatcc cgacgctcca ctggaacgca     1920

ggtgcctcca caacacgacc cagttcgtca tagtcgaccc agaagggccg cgcccacgtc     1980

tcttcgtccg gcccgtcttc aagttctacg accccggcaa gacggtgccg aacacgcatg     2040

aaacttggaa aaagcccgac tgccgctacc tggttggaat cgaccgaggc atcaattacg     2100

ttctgcgagc cgtcgtcgtc gatactgaag agaagaaggt tatcgccgat atcggcttgc     2160

cgggcaggaa gcacgaatgg aggatgatcc gtgacgagat cgcctaccac caacagatgc     2220

gtgatcttgc ccgcaacact ggcaaacacg cgagcgtcgt ggccaagcac gtccgcgccc     2280

tcgcgctcgc gcgcaagaag gaccgcgcgc tcggcaagtt cgcaacagtc gaagccgtcg     2340

cagaacttgt caagaagtgt gaacaggact atggtagcgg caactactgt ttcgtgctcg     2400

aagacctcga catgggggcg atgaatctca agcgaaacaa cagagtcaaa cacatggcgg     2460

tcatggagga ggccctcgtc aatcaaatgc gcaagcaggg ctatgcctat gacgggcgtc     2520

gcggtcgggt ggacggcgtg aggcacgagg gcgcttggta cacgagccag gtctcgccct     2580

ttggctggtg ggccaagcgc gacgaagtcg aggaggcgtg gaagagggac aagactcgcc     2640

ccatcgggcg caaggtcggc aactggtacg agatgcccga gccaggccaa gacggagacc     2700

ggcccgacac gtatcggaag ggctactggt cgaaaccgaa gaacgcggag ggcaagccgt     2760

atgggcgcaa ccgcttcagc gtcgagcctg gcgacgagaa gccggacgct gagcggcgct     2820

tctgctgggg cagcgagctg ttctgggatc cgaacgtgaa gtccttcaag ggcaaggagt     2880

ttcccgaggg cgtcgtgctg gacgccgact tcgtaggagc cctcaacatc gctctccgcc     2940

cgttggtcaa cgacggccag ggtaaaggct tcaaggccga ggacatggcg agggagcaca     3000

cgatactaaa cccgcagttc aagatcgcct gccagatacc agtttacgag ttcgtcgaag     3060

aggacggcga caagtgggca gctctgcgcc ggatcatgct atagttaggc gttccgtctc     3120

gactatgccg taccactaga ccgagcctac acggcacgcg gtcatagcgt taaccaaggc     3180

gtggtgacaa gcctctttca ggcgtcggac acttaagagc gttaggcggg cggtccctaa     3240

gccgcccgcc cccttatttg cacgttttcc ccgaaccccg taactctgcc agccacaaac     3300

acccagaagc gcgtctacac tccctacggg ggtagacaga catgcgctat gagatcgtag     3360

acggttacgg gtgccaagtg ctcaagcaca gcgagcgcct cgttcttcga taccccctcg     3420

cccctccggg ggagaggggg ccagggggtg aggggaagca acccaagcgc gaggtgcccg     3480

tcttgcacct tgaccacttg ctcatcggca cgaagggcgt gacggtctcg accgacgccc     3540

tcgctctgtg ctgtgagcgc gggattcccg tcacggtcgt ggactggcgc gggcggccgg     3600

tgggtaggtt cggaagccct gctctccacg gcacagcaca ggtgcgccgc gcgcagatcg     3660

ctgctttcgc caccgagata ggagcaacct tcgcccgaga ggtcgtcagc ggcaagttgc     3720

tcaaccaggc cgacaacctc cgatacatcg gcaagaacag aaagacacgg gcgcccaacg     3780

agcacgaggc cctgacaagg acagctgaca cgctccaacg tctggcaaag aaggcagcga     3840

cggtcaaggg caagaacgcg gacgacgtgc ggctcccgct catgacggtc gaggccgagg     3900

gcgcgcgcgc ctactggtcc gttctcagcg aggtttacgg cgagaggtcg ggcttcgcaa     3960

agcgagagca acgtggcaca cgcgacccgg tcaacgccgc gctcaactac gcctacggag     4020

tcctgaacgg cgaggtctgg aatgccacga tcctcgctgg cctcgagccg tacgcagggt     4080

tcctgcatgt ggatcggccg ggacgcctga gcttcgtgct cgatctgatg gaagagttcc     4140

gccccgtggt cgcggaccgc gtcgtgttcg gactggtggc gaagggctgg aagatcggcc     4200

aggaggagaa cggctggctg gacatgccga caaaaaggag actgattcag gcgatcggcg     4260

agaggtgggg agcgcgcgtc ctgcatcaag cccggaaact gcaattgcga tctgtcctcc     4320

agcttcaggc acgcgacgcc gcaaggcact tccagggcaa ggcggagtac atcgcgttcc     4380

gcctgaggtg gtaggccatg aagtggctcg tgtgctacga catcgagaag gatagtgtcc     4440

gaaacaaggt ggcagacttc tgcctggaca aggggctcga gcgggttcaa tacagcgtct     4500

tccttgggtc gatgacaaga acgctcgcca aagaacttgg cgcacagatc agacggaaga     4560

tgggcaagaa ccccggccag gtgcggttcg tgccgatctg cgacaaagac tggaagacgt     4620

cgttccgcgt ccaggtcggc gaccacatgg gagagaaggc gagcgatggc aagtagcgcc     4680

cacgcctatc agccgagaga cacggtaagc gtaagcgaac tccgccagtg gatgtactgc     4740

ccgcgcgttg tttggtacgg acgctcgatg ggcgactacc gtcccacgac aggcgctatg     4800

aaagtcggga tcgaggcgga ggcggagcgc caaaggctgg aggagcgccg gtcgttcgcg     4860

cagtacgggc tggaggcatg caacaagcga ttccaagtgc ctgtagcgtc ggaggcgttg     4920

gggctgtcgg ggcgcattga ctgcctgatc gaacttacgc ccgtttcgct ggaggacgct     4980

caggtcgggg tgaggccgtt gaactggaag gtgggcgatc cgatgttcgt gccagtcgag     5040

tacaaatgga cgtccagggc tgaccagagg cagaacacaa ttcagctggc cgcttatggg     5100

atgattctgg aaagcttgac ggggacgccc gtgccgctcg gcttcatcgc gctcctgcct     5160

gaggaagagg ttgtccgtgt cgaactcgtc cctagagtta ggcgcgcagt caaatgcacc     5220

ctagacgagg ctcgcgaagg cctctccgcc ccggaactcc cctggccaac gctccaccgc     5280

ggaaagtgcc aagactgcga gttccgacgg ttttgcaatg acgtctggta gatggggttc     5340

gaaaccctcg caaccgctcg gatttcgggg cgatgtgcga atcctaacgc ccccgacact     5400

caagatgatg gaggccttcg acccacaatc aagcccgact cagacagcga agagccaatg     5460

gtgtgcgcaa attcgctctc accgggtaca ttcattcgca caacggcgga ctttgtaagc     5520

taaatagcgg gtttcgaaac gacgcggccg aacctgttag ggcagggttg cagcaaatgg     5580

acggcgtgct tgagtccggc catgcatggt cgcaggggat caagaacgct cttagggaat     5640

gaaagacggg cagtatcgga gggcgcggga gaggtgccag ccttcgtcgc aggggatcaa     5700

gaacgctctt agggaatgaa agcgttgcag gcaatgatct ctgcaactgg ccgtcgaatg     5760

tcgcagggga tcaagaacgc tcttagggaa tgaaagggag tgggtgcgac ttaaagcgtt     5820

tgacggtggc tgtcgcaggg gatcaagaac gctcttaggg aatgaaagaa acaaagacga     5880

attgcctgga cgcaggcaaa atacgtcgca ggggatcaag aacgctctta gggaatgaaa     5940

gattgcgaca gcctcgcctc attccaggcc gacaggctgg tcgcagggga tcaagaacgc     6000

tcttagggaa tgaaagatcg ttcctgcacg agagcggagt gagtcgccgg ggggtcgcag     6060

gggatcaaga acgctcttag ggaatgaaag gattacctaa ccacccccag cagggcttcc     6120

ggccctggtc gcaggggatc aagaacgctc ttagggaatg aaagggagag acgggcaggg     6180

agaatgaaca atgaaagcga tccgtcgcag gggatcaaga acgctcttag ggaatgaaag     6240

ctcgcccaac cctcggacag gttgggtggc ggggtcgcag gggatcaaga acgctcttag     6300

ggaatgaaag ttgaacccgc caaagcagtt cagccacgcc gccaggtcgc aggggatcaa     6360

gaacgctctt agggaatgaa agtattccac gcagcacatc ggagaacttg cgtttcggtg     6420

tcgcagggga tcaagaacgc tcttagggaa tgaaagtgcg tagtgcgagc ggatctatcc     6480

gaggttgacc ggagtcgcag gggatcaaga acgctcttag ggaatgaaag gtatgctcca     6540

agaccttcgt cgtcgggttc caggccatgt cgcaggggat caagaacgct cttagggaat     6600

gaaagtgtaa ttctgataca ggcgcgacct cacgagcacc cgtcgcaggg gatcaagaac     6660

gctcttaggg aatgaaagca ttctgcgcgg cctatacatc gaatacctgc gaacgtcgca     6720

ggggatcaag aacgctctta gggaatgaaa gattgccttc agccgcttag ccattcgctc     6780

ggcatagcaa gtcgcagggg atcaagaacg ctcttaggga atgaaagttt gccggccgag     6840

acgtgccacc gcctagtcag gcgatgtcgc aggggatcaa gaacgctctt agggaatgaa     6900

agaggtttcg ggcctggaat acccggcacc cggtcggatg tcgcagggga tcaagaacgc     6960

tcttagggaa tgagaattga ggtaaacaga gttgtggcgg acttcatttg cgtcaggagg     7020

cgacgatgag gcgagagaga acggttctgt acttgttggc gggcggattt cttctgaccg     7080

cgatcgaggt gcgatatctg caccgggagg tcttagggga gcactggcaa gcttgggttc     7140

ctgtggctta tggggcgcta ggcacggccg tagccatcgt tgcggccgct tcggccaagg     7200

cgcgaaggct ggcttcgggc gtgttcgcgc tcggcgtggt ggcgggcctg ctcggtttct     7260

accttcacac cgagggcaac cccgcggagg tcacgaagat gttggggccg gcactggtgg     7320

cgcaggccga cgaagaagac gagcaccggt cttccggcga gagccatgag agttcggagg     7380

aaccgccctc gttcgcgccg ctcggccttt cgggccttgc cgcaataggg ttcgtgtgta     7440

cctcgaagct gttcgcggcc gaaaaacgtt gaacct                               7476


<210>  18
<211>  5575
<212>  DNA
<213>  Armatimonadetes bacterium

<400>  18
acgaggcagc gcgtctattt cttcagcggt ctgctcgttt tgatcggcgt tctgtgcatg       60

aggtggaacg tagtcatcgg cggacaactg ttctcaaaat cgctgcaggg gttgacgacg      120

taccggctgg aactcgtcgg aggagaggga gcgttggtgg cgatcggcgt cctattcctc      180

ccgctcgcga tcctgttcgt cctcgcgaag atcctcccgc cctggaaggc ccacgaagaa      240

gcccgcgcgt agggacagat ccgctcgccg gctggcgcag tctagcggct agcccttgac      300

gggccggtcg ggcgaaccgg ctcaagcgca gcgggcgccc ctgccaaccc gcgcttgagg      360

ggcagtcccg ccgtctcggc ggtggcatgc gcgctgacac cccagctgag cgcaccaacc      420

cagggcagga gactgggaag tcatggagga gccgccggga gggccggaga aggagaagtc      480

ggatcttgaa ggaacgagcg atggcgaagg caaccaaaga agtcaagtcg aagcgcgtgg      540

aagcgttgcg gcaggtggcg tatcaacggc tggaacgcct cgagcggaag gctcagaaga      600

tcggagcgca tctgcgcaag ccgggaaaag ccgctgacct ccaatcactc cattatcttc      660

ttcacaaggt cgaagtcgaa tatcacgata tcgcaaggaa cctggagaag gacccgactt      720

ggacaccaaa accgaaaatg cgacgagaga agcgtgccat cgtgccggag tccggcccgg      780

ctgcgcccct cccgaccacg gcaaagggtg agccgggtag accggcaaac cgtcatattc      840

cgccaccagt gccgctcgat tcagcaagga tccccgaaga ccaacagtcg atgggccaag      900

gaagcggggg gaggagttgg tgttctgcgc ctttcgttga ggtgaagtta ccgccgactc      960

aatggtcgaa tgtccgggag aagcttctga aattccgaat tgaggacgac gccgacatcg     1020

tcaggcggtg ggccgaggcc aagttcggaa gcatcgagac ggcgcgcgat ggattacgtg     1080

cgagcgcaga gatcggaacg agcccggatg tctggcgttc cttcatcagc cgcgcgatct     1140

cgaacggcaa gaaggacttt gagccacttc tctcgttgga cgatgacgaa ttgaccgcgg     1200

atgcaacagc cgagcgcgtt gtgcgtcggt ggcatcagat tgactgggtg ggccgaatgc     1260

tcgactccat cctggaaacc gtcccgtcgg gggtctcgaa agacacgttt cgaagcaggg     1320

tcgaatcgcg tctcaagacg tttcactcgt ctgtgaacag cttcgagctc aagaagagga     1380

aggacggtac ggtcgagcgc aagcggaagc acaccaaccc gcagtttccg tacttgtcac     1440

cgagcgcagt gagcatcgat cctgatgttg tgactatgga ggcggtcgaa ctgctccaga     1500

tgcagcccga ggaacgcttt gcaaaggacc cgaacgatgc gaatggcaga atgaggctga     1560

gggttttgca ggcggaactc ggcaaagcac gacgcgaggc tctgggtcgg cggggcgaga     1620

aggccccgcc gtggagtggc cgcaaggtct ttcgcggaac cacgaccagg aagagggaag     1680

cgtgcctggt ttgggacaaa gaggcacaag cggatggact ttacttcgcg ctcgtgatgt     1740

cgggcggacc aaagatcgac gacaaacggt ttgtctacat ggacggtcag ccgctacaaa     1800

gcgattggca actgcacaac ggagtggccg gtaaggcaaa gtcatgcagg gcgatgcctc     1860

tcattttgaa gcatgacttc ctgcggtggt accaccgcca cattaagaac cacgacgtca     1920

atgctcccct cgaaaagcgg tgcgttcaca cgacgaccca gttcgttttc gtggagccgg     1980

acgaaaagaa gggccttcag ccccggctgt tcatcagacc cgtattcaag ttctacgatc     2040

cggtctatga agtgccggat agccactcga ttgacaagaa gccggactgc cgatatttga     2100

tcggaattga ccgaggcgtt aactacccct atcgtgccgc agtatacgat tgcgagacaa     2160

actccataat cgccgacaag ttcgtggacg gacgaaaggc agattgggag cggatacgaa     2220

atgaactcgc ataccaccag cggcgacgtg acctcctgcg caactcgcgt gcctcttccg     2280

ccgcaataca gcgagagatt cgagccattg cacggattcg caagagggag cgtgggctga     2340

acaaagtcga gacggtcgag agcatcgcgc ggctcgtcga ctgggcggaa gagaatctcg     2400

ggaagtgcaa ttactgcttc gttctcgaag acctttcttc aaacttgaat ctggggcgaa     2460

acaacagggt caagcacatt gccgcgatca aggaggcgct gatcaaccag atgcgcaagc     2520

gcggatatcg tttcaaaaag agcgggaaag ttgacggcgt gcgagaggag tccgcgtggt     2580

acacgagtgc cgttgcgcca tccggttggt gggcgaagaa ggaagaagtg gacggggcct     2640

ggaaagcgga caagacgcgg ccattggcga gaaagatcgg cagttactat tgctgcgaag     2700

aaatcgacgg actccatttg cgcggcgtgc tgaaggggct cggaagggcg aagcgactcg     2760

ttcttcaaag cgacgaccca tccgcgccga ctcgcagacg agggtttgga tcagagttgt     2820

tctgggaccc ctattgcacc gaactctgcg gccacgcttt cccgcaaggc gtcgtactgg     2880

acgcagactt catcggcgcc ttcaatattg cgctgcgacc gctggtgagg gaggaacttg     2940

ggaagaaggc gaaggccgtg gacctggccg acaggcacca gacgctcaat ccgacggttg     3000

ccctccgatg cggcgtaacg gcgtacgagt tcgtcgaagt cgggggcgat ccccggggcg     3060

gtctccgaaa aatcttgctc aatcccgcag aggccgtgat ataatttgaa tgtgctctgc     3120

cgaagacgcc gcacggagcc tgggccggaa tcgtagatcg aacgcggcat cgaagccctg     3180

cagcccttcg gggccaaggc ggcgcagcaa gcctctttca ggcggcagag tcctttagag     3240

tgtaacgagg gcccccagga acgggggccc agccatctcc agggaaggga cagaggaggt     3300

ggatagtgaa gtacgaatac gtcgagacat tcgggtcggc ggttcaaaag cactccgagc     3360

gacttgtcgt gtcggagcct tccggcgaag gagggcagag aacaaagagg caggtgcccg     3420

ctctacacct ggaccacctg ctgatcggct cacgcggcgt cagcatttcg tcggacgctc     3480

tcgaactctg ctgcgaacga ggcattcccg tcacaatcgt ggatcgccgc ggcaagccgg     3540

tggggaagtt caccgccccg gcaattcacg gcaccagccg gacgcgccga gcgcagatca     3600

gagcgtacga gaacggcctc ggggttactt tcgctcgctc ggtcgtcatt ggaaaggcgg     3660

cgaatcaggc aataaacctg aaatacttcg caaagaatcg acgagagcga agcccggatc     3720

agtacgagac gctcagaaaa tccgcagagg cgatcgaccg cgttgctcgt agagcgaaga     3780

agatctcggc gaactgcatt gatgaagttc gacaaccgct catggtcctt gaggccgagg     3840

cttcgcgcat ctactggagt tcgttgtcag ctctttacgg cagcagctct ggctttgtgc     3900

accgcgagca aaggggtacc aagaatccag tcaatgctgc gctgaactac gcctacggtg     3960

tactgacagg cgaagtttgg acagcgtgcc tcctggctgg gcttgaaccg tacgcaggat     4020

tcctacacgc ggaccgacca gggaggctca gtttcgtgtt ggaccttatc gaggagtttc     4080

ggccagtggt cgcggatagg gtcgtattcg cactcgcggc gaaggggtgg aggattgaac     4140

aagaggagaa tggatggctc tcgctcgcgt cgaaaaacaa gctcctcgcg agtttggccg     4200

agaggttgga ttctcccgag cctgaccgcg ggaggaggcg caaactgcgc aacgtaattc     4260

agcggcaggc atacgcagcg gcacagcatt tcttaggaaa tgaaacctac gtgccatata     4320

agcagaggtg gtagcagaat gacctggctc gttgtgtatg acattgagga tgacagagtt     4380

cgaacgaagg tcgcagacta ttgcctggac aagggtctgg agcggatcca atacagttgc     4440

tttcttggcg agatgtcgcg aacattggct cgcgagctgg catcaaagtg caagcggaag     4500

ctcggggaca agcccgggaa gattcggctt gttcccgttt gtgaaaagga ccttgcaagc     4560

caggttcgaa tcgagaatgt gccttgatca tggaagtctc gccgagtgat ggattcgtct     4620

ccgtatccga ggtcagacag tggtcatatt gtccgcgtgt cgtctggcac aaccgctggc     4680

taggggaacg cagaccccag acgtctcgaa tggaagaagg gagggccgac caggcggaac     4740

gggagcggaa ggagaagagg cgcacgttcg ccgaataccg gttgcctgcc caatcgcgaa     4800

gattcaacgt gtacttgagg tcggagcggc tcggtgtttc tggtgtcgtg gacgccgtgc     4860

tggaacttac gaatcgctcc atcgacgaag tggactccca agggctcgat ccggaacgcc     4920

cttatttcgc gccagttgag tataagagca cgcaagagag ggtcggccgc catcatcttc     4980

tgcagctcgc agggtatgcg gcgctgctct ccgatattac gggaacgagc gtaccgttcg     5040

gatacttcgt ttcgcttccc aacgggcgag caagcagggt cgaactgagc gagaaggcga     5100

gggaagagtt cctttcgtgc gtgcaaggga tacgtaacat ggtagtcgag tgccgaatgc     5160

cggagcccac gccttcgcgg gcgaagtgcc gagactgtga gttccgccgc ttctgcaatg     5220

acgtgtggtg agcccgcaac cctgtcaacc gacggattcc cggcccaatg tgcgaaagta     5280

gcgcgggcgc cgttgctcga tcctccatgc cgggagtgcg aagaggtgcc tccattttgg     5340

ggcagtagga ggcttatcgg cctcctccgc cgtccgcagc aggccggagc cgccgaactg     5400

ttcgcacatt ggcgcagata cactggcaga tatggggttg cgaatctcgc ggacgaatcg     5460

ggtgtttgaa ccaaaaatgc ggcgataatg tacggaagcc cgagtgcgga gccccagcct     5520

ttgaggctgg cccctacggg cgcaggacaa aatgcacact ctaaaggaat gaaag          5575


<210>  19
<211>  37
<212>  DNA
<213>  Armatimonadetes bacterium

<400>  19
gtcgcatgag gcgagaagca ctcttaggga atgaaag                                37


<210>  20
<211>  37
<212>  DNA
<213>  Armatimonadetes bacterium

<400>  20
gtcgcagggg atcaagaacg ctcttaggga atgaaag                                37


<210>  21
<211>  37
<212>  DNA
<213>  Armatimonadetes bacterium

<400>  21
ggcgcaggac aaaatgcaca ctctaaagga atgaaag                                37


<210>  22
<211>  74
<212>  RNA
<213>  Armatimonadetes bacterium


<220>
<221>  misc_feature
<222>  (38)..(74)
<223>  n is a, c, g, or u

<400>  22
gucgcaugag gcgagaagca cucuuaggga augaaagnnn nnnnnnnnnn nnnnnnnnnn       60

nnnnnnnnnn nnnn                                                         74


<210>  23
<211>  74
<212>  RNA
<213>  Armatimonadetes bacterium


<220>
<221>  misc_feature
<222>  (38)..(74)
<223>  n is a, c, g, or u

<400>  23
gucgcagggg aucaagaacg cucuuaggga augaaagnnn nnnnnnnnnn nnnnnnnnnn       60

nnnnnnnnnn nnnn                                                         74


<210>  24
<211>  74
<212>  RNA
<213>  Armatimonadetes bacterium


<220>
<221>  misc_feature
<222>  (38)..(74)
<223>  n is a, c, g, or u

<400>  24
ggcgcaggac aaaaugcaca cucuaaagga augaaagnnn nnnnnnnnnn nnnnnnnnnn       60

nnnnnnnnnn nnnn                                                         74


<210>  25
<211>  164
<212>  RNA
<213>  Armatimonadetes bacterium

<400>  25
uuucuaaguu ccaucucgau gcggaacgga uacuacgcug uagucuauac gacacgagug       60

auagcccugc gggguucgcc ccuaaguccg uaugacaagc cucuuucagg cgguggacuu      120

cuaagagugc uggugggugg aaucccuaag ccacccaccu ccuu                       164


<210>  26
<211>  131
<212>  RNA
<213>  Armatimonadetes bacterium

<400>  26
uuucuaaguu ccaucucgau gcggaacgga uacuacgcug uagucuauac gacacgagug       60

auagcccugc gggguucgcc ccuaaguccg uaugacaagc cucuuucagg cgguggacuu      120

cuaagagugc u                                                           131


<210>  27
<211>  151
<212>  RNA
<213>  Armatimonadetes bacterium

<400>  27
uuaggcguuc cgucucgacu augccguacc acuagaccga gccuacacgg cacgcgguca       60

uagcguuaac caaggcgugg ugacaagccu cuuucaggcg ucggacacuu aagagcguua      120

ggcgggcggu cccuaagccg cccgcccccu u                                     151


<210>  28
<211>  119
<212>  RNA
<213>  Armatimonadetes bacterium

<400>  28
uuaggcguuc cgucucgacu augccguacc acuagaccga gccuacacgg cacgcgguca       60

uagcguuaac caaggcgugg ugacaagccu cuuucaggcg ucggacacuu aagagcguu       119


<210>  29
<211>  178
<212>  RNA
<213>  Armatimonadetes bacterium

<400>  29
uuugaaugug cucugccgaa gacgccgcac ggagccuggg ccggaaucgu agaucgaacg       60

cggcaucgaa gcccugcagc ccuucggggc caaggcggcg cagcaagccu cuuucaggcg      120

gcagaguccu uuagagugua acgagggccc ccaggaacgg gggcccagcc aucuccag        178


<210>  30
<211>  139
<212>  RNA
<213>  Armatimonadetes bacterium

<400>  30
uuugaaugug cucugccgaa gacgccgcac ggagccuggg ccggaaucgu agaucgaacg       60

cggcaucgaa gcccugcagc ccuucggggc caaggcggcg cagcaagccu cuuucaggcg      120

gcagaguccu uuagagugu                                                   139


<210>  31
<211>  242
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta 1 sgRNA with terminator-like hairpin


<220>
<221>  misc_feature
<222>  (206)..(242)
<223>  n is a, c, g, or u

<400>  31
uuucuaaguu ccaucucgau gcggaacgga uacuacgcug uagucuauac gacacgagug       60

auagcccugc gggguucgcc ccuaaguccg uaugacaagc cucuuucagg cgguggacuu      120

cuaagagugc uggugggugg aaucccuaag ccacccaccu ccuugaaagu cgcaugaggc      180

gagaagcacu cuuagggaau gaaagnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn      240

nn                                                                     242


<210>  32
<211>  193
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta 1 sgRNA without terminator-like hairpin


<220>
<221>  misc_feature
<222>  (157)..(193)
<223>  n is a, c, g, or u

<400>  32
uuucuaaguu ccaucucgau gcggaacgga uacuacgcug uagucuauac gacacgagug       60

auagcccugc gggguucgcc ccuaaguccg uaugacaagc cucuuucagg cgguggacuu      120

cuaagagugc ugaaaagcac ucuuagggaa ugaaagnnnn nnnnnnnnnn nnnnnnnnnn      180

nnnnnnnnnn nnn                                                         193


<210>  33
<211>  229
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta 2 sgRNA with terminator-like hairpin


<220>
<221>  misc_feature
<222>  (193)..(229)
<223>  n is a, c, g, or u

<400>  33
uuaggcguuc cgucucgacu augccguacc acuagaccga gccuacacgg cacgcgguca       60

uagcguuaac caaggcgugg ugacaagccu cuuucaggcg ucggacacuu aagagcguua      120

ggcgggcggu cccuaagccg cccgcccccu ugaaagucgc aggggaucaa gaacgcucuu      180

agggaaugaa agnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnn                  229


<210>  34
<211>  181
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta 2 sgRNA without terminator-like hairpin


<220>
<221>  misc_feature
<222>  (145)..(181)
<223>  n is a, c, g, or u

<400>  34
uuaggcguuc cgucucgacu augccguacc acuagaccga gccuacacgg cacgcgguca       60

uagcguuaac caaggcgugg ugacaagccu cuuucaggcg ucggacacuu aagagcguug      120

aaaaacgcuc uuagggaaug aaagnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn      180

n                                                                      181


<210>  35
<211>  256
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta 3 sgRNA with terminator-like hairpin


<220>
<221>  misc_feature
<222>  (220)..(256)
<223>  n is a, c, g, or u

<400>  35
uuugaaugug cucugccgaa gacgccgcac ggagccuggg ccggaaucgu agaucgaacg       60

cggcaucgaa gcccugcagc ccuucggggc caaggcggcg cagcaagccu cuuucaggcg      120

gcagaguccu uuagagugua acgagggccc ccaggaacgg gggcccagcc aucuccagga      180

aaggcgcagg acaaaaugca cacucuaaag gaaugaaagn nnnnnnnnnn nnnnnnnnnn      240

nnnnnnnnnn nnnnnn                                                      256


<210>  36
<211>  200
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta 3 sgRNA without terminator-like hairpin


<220>
<221>  misc_feature
<222>  (164)..(200)
<223>  n is a, c, g, or u

<400>  36
uuugaaugug cucugccgaa gacgccgcac ggagccuggg ccggaaucgu agaucgaacg       60

cggcaucgaa gcccugcagc ccuucggggc caaggcggcg cagcaagccu cuuucaggcg      120

gcagaguccu uuagagugug aaaacacucu aaaggaauga aagnnnnnnn nnnnnnnnnn      180

nnnnnnnnnn nnnnnnnnnn                                                  200


<210>  37
<211>  37
<212>  DNA
<213>  Artificial

<220>
<223>  T2 PAM library target sequence

<400>  37
agttgaccca acgtcgccgg cgtgcacaat ctagatg                                37


<210>  38
<211>  6
<212>  PRT
<213>  Artificial

<220>
<223>  6X histidine tag

<400>  38

His His His His His His 
1               5       


<210>  39
<211>  10
<212>  PRT
<213>  Artificial

<220>
<223>  10X histidine tag

<400>  39

His His His His His His His His His His 
1               5                   10  


<210>  40
<211>  366
<212>  PRT
<213>  Escherichia coli

<400>  40

Lys Ile Glu Glu Gly Lys Leu Val Ile Trp Ile Asn Gly Asp Lys Gly 
1               5                   10                  15      


Tyr Asn Gly Leu Ala Glu Val Gly Lys Lys Phe Glu Lys Asp Thr Gly 
            20                  25                  30          


Ile Lys Val Thr Val Glu His Pro Asp Lys Leu Glu Glu Lys Phe Pro 
        35                  40                  45              


Gln Val Ala Ala Thr Gly Asp Gly Pro Asp Ile Ile Phe Trp Ala His 
    50                  55                  60                  


Asp Arg Phe Gly Gly Tyr Ala Gln Ser Gly Leu Leu Ala Glu Ile Thr 
65                  70                  75                  80  


Pro Asp Lys Ala Phe Gln Asp Lys Leu Tyr Pro Phe Thr Trp Asp Ala 
                85                  90                  95      


Val Arg Tyr Asn Gly Lys Leu Ile Ala Tyr Pro Ile Ala Val Glu Ala 
            100                 105                 110         


Leu Ser Leu Ile Tyr Asn Lys Asp Leu Leu Pro Asn Pro Pro Lys Thr 
        115                 120                 125             


Trp Glu Glu Ile Pro Ala Leu Asp Lys Glu Leu Lys Ala Lys Gly Lys 
    130                 135                 140                 


Ser Ala Leu Met Phe Asn Leu Gln Glu Pro Tyr Phe Thr Trp Pro Leu 
145                 150                 155                 160 


Ile Ala Ala Asp Gly Gly Tyr Ala Phe Lys Tyr Glu Asn Gly Lys Tyr 
                165                 170                 175     


Asp Ile Lys Asp Val Gly Val Asp Asn Ala Gly Ala Lys Ala Gly Leu 
            180                 185                 190         


Thr Phe Leu Val Asp Leu Ile Lys Asn Lys His Met Asn Ala Asp Thr 
        195                 200                 205             


Asp Tyr Ser Ile Ala Glu Ala Ala Phe Asn Lys Gly Glu Thr Ala Met 
    210                 215                 220                 


Thr Ile Asn Gly Pro Trp Ala Trp Ser Asn Ile Asp Thr Ser Lys Val 
225                 230                 235                 240 


Asn Tyr Gly Val Thr Val Leu Pro Thr Phe Lys Gly Gln Pro Ser Lys 
                245                 250                 255     


Pro Phe Val Gly Val Leu Ser Ala Gly Ile Asn Ala Ala Ser Pro Asn 
            260                 265                 270         


Lys Glu Leu Ala Lys Glu Phe Leu Glu Asn Tyr Leu Leu Thr Asp Glu 
        275                 280                 285             


Gly Leu Glu Ala Val Asn Lys Asp Lys Pro Leu Gly Ala Val Ala Leu 
    290                 295                 300                 


Lys Ser Tyr Glu Glu Glu Leu Val Lys Asp Pro Arg Ile Ala Ala Thr 
305                 310                 315                 320 


Met Glu Asn Ala Gln Lys Gly Glu Ile Met Pro Asn Ile Pro Gln Met 
                325                 330                 335     


Ser Ala Phe Trp Tyr Ala Val Arg Thr Ala Val Ile Asn Ala Ala Ser 
            340                 345                 350         


Gly Arg Gln Thr Val Asp Glu Ala Leu Lys Asp Ala Gln Thr 
        355                 360                 365     


<210>  41
<211>  96
<212>  PRT
<213>  Brachypodium distachyon

<400>  41

Ser Ala Ala Gly Gly Glu Glu Asp Lys Lys Pro Ala Gly Gly Glu Gly 
1               5                   10                  15      


Gly Gly Ala His Ile Asn Leu Lys Val Lys Gly Gln Asp Gly Asn Glu 
            20                  25                  30          


Val Phe Phe Arg Ile Lys Arg Ser Thr Gln Leu Lys Lys Leu Met Asn 
        35                  40                  45              


Ala Tyr Cys Asp Arg Gln Ser Val Asp Met Thr Ala Ile Ala Phe Leu 
    50                  55                  60                  


Phe Asp Gly Arg Arg Leu Arg Ala Glu Gln Thr Pro Asp Glu Leu Glu 
65                  70                  75                  80  


Met Glu Asp Gly Asp Glu Ile Asp Ala Met Leu His Gln Thr Gly Gly 
                85                  90                  95      


<210>  42
<211>  31
<212>  DNA
<213>  Artificial

<220>
<223>  A1 oligonucleotide

<400>  42
cggcattcct gctgaaccgc tcttccgatc t                                      31


<210>  43
<211>  30
<212>  DNA
<213>  Artificial

<220>
<223>  A2 oligonucleotide

<400>  43
gatcggaaga gcggttcagc aggaatgccg                                        30


<210>  44
<211>  22
<212>  DNA
<213>  Artificial

<220>
<223>  R0 oligonucleotide

<400>  44
gccagggttt tcccagtcac ga                                                22


<210>  45
<211>  28
<212>  DNA
<213>  Artificial

<220>
<223>  C0 oligonucleotide

<400>  45
gaaattctaa acgctaaaga ggaagagg                                          28


<210>  46
<211>  56
<212>  DNA
<213>  Artificial

<220>
<223>  F1 oligonucleotide

<400>  46
ctacactctt tccctacacg acgctcttcc gatctaaggc ggcattcctg ctgaac           56


<210>  47
<211>  49
<212>  DNA
<213>  Artificial

<220>
<223>  R1 oligonucleotide

<400>  47
caagcagaag acggcatacg agctcttccg atctcggcga cgttgggtc                   49


<210>  48
<211>  35
<212>  DNA
<213>  Artificial

<220>
<223>  Bridge amplification portion of F1 oligonucleotide

<400>  48
ctacactctt tccctacacg acgctcttcc gatct                                  35


<210>  49
<211>  34
<212>  DNA
<213>  Artificial

<220>
<223>  Bridge amplification portion of R1 oligonucleotide

<400>  49
caagcagaag acggcatacg agctcttccg atct                                   34


<210>  50
<211>  43
<212>  DNA
<213>  Artificial

<220>
<223>  F2 oligonucleotide

<400>  50
aatgatacgg cgaccaccga gatctacact ctttccctac acg                         43


<210>  51
<211>  18
<212>  DNA
<213>  Artificial

<220>
<223>  R2 oligonucleotide

<400>  51
caagcagaag acggcata                                                     18


<210>  52
<211>  60
<212>  DNA
<213>  Artificial

<220>
<223>  C1 oligonucleotide

<400>  52
ctacactctt tccctacacg acgctcttcc gatctggaat aaacgctaaa gaggaagagg       60


<210>  53
<211>  36
<212>  DNA
<213>  Artificial

<220>
<223>  Sequence resulting from cleavage and adapter ligation at position
       23 of the target

<400>  53
gctcttccga tctacgccgg cgacgttggg tcaact                                 36


<210>  54
<211>  13
<212>  DNA
<213>  Artificial

<220>
<223>  Adapter portion of SEQ ID NO. 53

<400>  54
gctcttccga tct                                                          13


<210>  55
<211>  23
<212>  DNA
<213>  Artificial

<220>
<223>  Target portion of SEQ ID NO. 53

<400>  55
acgccggcga cgttgggtca act                                               23


<210>  56
<211>  10
<212>  DNA
<213>  Artificial

<220>
<223>  Sequence 5' of PAM

<400>  56
tgtcctcttc                                                              10


<210>  57
<211>  860
<212>  PRT
<213>  Armatimonadetes bacterium

<400>  57

Met Gly Lys Asn Arg Ser Ser Ser Ser Asp Leu Ser Gln Leu Glu Arg 
1               5                   10                  15      


Ser Leu Arg Lys Val Gly Glu Asn Arg Leu Glu Arg Leu Arg Val Arg 
            20                  25                  30          


Gly Gln Lys Ile Arg Lys His Leu Glu Gln His Pro Arg Gly Lys Asn 
        35                  40                  45              


Asp His Gln Ala Leu His Phe Leu Leu His Gln Ile Glu Val Glu Arg 
    50                  55                  60                  


Asn Asp Leu Tyr Arg Asn Leu Lys Asp Pro Glu Tyr Val Pro Lys Pro 
65                  70                  75                  80  


Ala Lys Arg Arg Arg Glu Arg Arg Gln Ile Asn Val Ala Gln Pro Pro 
                85                  90                  95      


Thr Arg Pro Thr Lys Ser Val Gly Pro Lys Pro Ala Pro Thr Thr Tyr 
            100                 105                 110         


Val Ile Pro Arg Pro Glu Pro Gly Arg Asp Leu Pro Ala Phe Ala Ser 
        115                 120                 125             


Arg Tyr Lys Ala Ser Asp Ser Arg Gly Glu Asp Asp Gln Asp Gly Arg 
    130                 135                 140                 


Ser Trp Thr Ala Ala Pro Phe Val Glu Val Glu Leu Pro Ile Gln Ile 
145                 150                 155                 160 


Ala Gly Lys Ile Leu Glu Lys Leu Arg Lys Tyr Val Gln Lys Asp Glu 
                165                 170                 175     


Arg Glu Ile Val Arg Glu Trp Ala Val Lys Thr Tyr Gly Ser Ile Glu 
            180                 185                 190         


Ala Ala Arg Glu Pro Leu Leu Ile Gly Ala Gln Val Ser Glu Asp Val 
        195                 200                 205             


Ser Val Trp Arg Gly Leu Leu Ala Glu Thr Lys Asn Ala Gln Asp Phe 
    210                 215                 220                 


Ala Ala Leu Ser Asp Asp Gln Ile Glu Ala Ala Met Ser Lys Glu Ala 
225                 230                 235                 240 


Lys Gly Ser Asp Leu Arg Pro Arg Arg Ala Ala Leu Leu Val Ala Gln 
                245                 250                 255     


Arg His Trp Val Asp Gln Thr Val Lys Ala Ile Lys Glu Ser Ala Pro 
            260                 265                 270         


Lys Gly Val Asp Lys Asp Thr Leu Asp Arg Arg Leu Arg Ala Gly Leu 
        275                 280                 285             


Arg Gly Phe His Thr Ala Ala Asn Ser Gly Lys His Thr Asn Pro Gln 
    290                 295                 300                 


Phe Pro Tyr Leu Thr Pro Lys Glu Ala Lys Val Pro Leu Glu Ser Val 
305                 310                 315                 320 


Val Asn Gln Val Leu Glu Phe Leu Asp Asp Ala Asp Asp Gln Arg Tyr 
                325                 330                 335     


Val Gln Val His Arg Val Ser His Leu Gln Lys Glu Leu Gly Lys Ala 
            340                 345                 350         


Arg Pro Arg Lys Arg Leu Glu Leu Gln Arg Pro Lys Trp Ala Gly Arg 
        355                 360                 365             


Pro Thr Val Gln Gly Thr Ile Ser Lys Arg Arg Asp Ala Ala Leu Val 
    370                 375                 380                 


Trp Asp Thr Ser Lys Lys Glu Asn Gly Leu Cys Leu Ala Leu Pro Leu 
385                 390                 395                 400 


Gly Gly Leu Gln Lys Ile Asp Val Glu Arg Phe Ile Tyr Gln Asp Gly 
                405                 410                 415     


Thr Ser Leu Leu Ser Asp Cys Gln Ile Ala Ser Lys Thr Ser Lys Lys 
            420                 425                 430         


Gly Ala Ala Cys Ala Leu Met Pro Leu Lys Pro Lys His Asp Phe Leu 
        435                 440                 445             


Arg Trp Tyr Thr Lys His Val Glu Asn His Asn Ala Asp Ala Pro Leu 
    450                 455                 460                 


Glu Arg Arg Cys Leu His Asn Thr Thr Gln Phe Val Ile Val Asp Pro 
465                 470                 475                 480 


Glu Gly Gln Arg Pro Arg Leu Phe Ile Arg Pro Val Phe Lys Phe Tyr 
                485                 490                 495     


Asp Pro Gly Lys Ala Val Pro Asn Thr His Glu Thr Trp Lys Lys Pro 
            500                 505                 510         


Asp Cys Arg Tyr Leu Val Gly Ile Asp Arg Gly Ile Asn Tyr Val Leu 
        515                 520                 525             


Arg Ala Val Val Val Asp Ile Glu Lys Lys Glu Val Ile Ala Asp Ile 
    530                 535                 540                 


His Leu Gln Gly Asp Lys His Lys Trp Arg Met Ile Arg Asp Glu Ile 
545                 550                 555                 560 


Ala Tyr His Gln Gln Met Arg Asp Leu Ala Ser Asn Thr Gly Lys His 
                565                 570                 575     


Pro Ser Val Val Ala Arg His Val Arg Ala Leu Ala Leu Ala Arg Lys 
            580                 585                 590         


Lys Asp Arg Ala Leu Gly Arg Phe Thr Thr Val Lys Ala Val Ala Asp 
        595                 600                 605             


Ile Val Met Gln Cys Glu Asn Asp Tyr Gly Ser Gly Asn Tyr Cys Phe 
    610                 615                 620                 


Val Leu Glu Asp Leu Asp Met Gly Lys Met Asn Leu Lys Arg Asn Asn 
625                 630                 635                 640 


Arg Val Lys His Met Ala Val Met Lys Glu Ala Leu Val Asn Gln Met 
                645                 650                 655     


Arg Lys Arg Gly Tyr Ala Tyr Asp Gly Arg Arg Gly Arg Ala Asp Gly 
            660                 665                 670         


Val Arg Tyr Glu Gly Ala Trp Tyr Thr Ser Gln Val Ser Pro Phe Gly 
        675                 680                 685             


Trp Trp Ala Lys Arg Glu Glu Val Glu Glu Ala Trp Lys Lys Asp Thr 
    690                 695                 700                 


Ser Arg Pro Ile Gly Arg Lys Val Gly Asn Trp Tyr Glu Met Pro Asp 
705                 710                 715                 720 


Pro Asn Glu Glu Gly Lys Arg Ser Asp Val Tyr Arg Lys Gly Cys Trp 
                725                 730                 735     


Lys Lys Pro Gln Asn Ala Ser Gly Lys Pro Tyr Gly Arg Asn Arg Phe 
            740                 745                 750         


Cys Val Glu Pro Gly Asp Glu Lys Pro Asp Ala Gln Arg Arg Phe Ser 
        755                 760                 765             


Trp Gly Ser Glu Leu Phe Trp Asp Pro Asn Val Lys Ser Phe Lys Gly 
    770                 775                 780                 


Lys Glu Phe Pro Glu Gly Val Val Leu Asp Ala Asp Phe Val Gly Ala 
785                 790                 795                 800 


Leu Asn Ile Ala Leu Arg Pro Leu Val Asn Asp Gly Gln Gly Arg Gly 
                805                 810                 815     


Phe Thr Ala Asp Lys Met Ala Glu Ala His Thr Arg Leu Asn Pro Gln 
            820                 825                 830         


Phe Glu Ile Val Cys Lys Ile Pro Val Tyr Glu Phe Ile Glu Glu His 
        835                 840                 845             


Gly Asp Lys Arg Ala Lys Leu Arg Arg Ile Val Leu 
    850                 855                 860 


<210>  58
<211>  21
<212>  PRT
<213>  Artificial

<220>
<223>  14X histidine tag

<400>  58

His His His His Ser Gly His His His Thr Gly His His His His Ser 
1               5                   10                  15      


Gly Ser His His His 
            20      


<210>  59
<211>  24
<212>  RNA
<213>  Artificial

<220>
<223>  24 nt spacer sequence (targeting T2)

<400>  59
aguugaccca acgucgccgg cgug                                              24


<210>  60
<211>  217
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta 2 sgRNA with terminator-like hairpin targeting T2 
       sequence

<400>  60
guuaggcguu ccgucucgac uaugccguac cacuagaccg agccuacacg gcacgcgguc       60

auagcguuaa ccaaggcgug gugacaagcc ucuuucaggc gucggacacu uaagagcguu      120

aggcgggcgg ucccuaagcc gcccgccccc uugaaagucg caggggauca agaacgcucu      180

uagggaauga aagaguugac ccaacgucgc cggcgug                               217


<210>  61
<211>  169
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta 2 sgRNA without terminator-like hairpin targeting T2 
       sequence

<400>  61
guuaggcguu ccgucucgac uaugccguac cacuagaccg agccuacacg gcacgcgguc       60

auagcguuaa ccaaggcgug gugacaagcc ucuuucaggc gucggacacu uaagagcguu      120

gaaaaacgcu cuuagggaau gaaagaguug acccaacguc gccggcgug                  169


<210>  62
<211>  244
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta 3 sgRNA with terminator-like hairpin targeting T2 
       sequence

<400>  62
guuugaaugu gcucugccga agacgccgca cggagccugg gccggaaucg uagaucgaac       60

gcggcaucga agcccugcag cccuucgggg ccaaggcggc gcagcaagcc ucuuucaggc      120

ggcagagucc uuuagagugu aacgagggcc cccaggaacg ggggcccagc caucuccagg      180

aaaggcgcag gacaaaaugc acacucuaaa ggaaugaaag aguugaccca acgucgccgg      240

cgug                                                                   244


<210>  63
<211>  188
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta 3 sgRNA without terminator-like hairpin targeting T2 
       sequence

<400>  63
guuugaaugu gcucugccga agacgccgca cggagccugg gccggaaucg uagaucgaac       60

gcggcaucga agcccugcag cccuucgggg ccaaggcggc gcagcaagcc ucuuucaggc      120

ggcagagucc uuuagagugu gaaaacacuc uaaaggaaug aaagaguuga cccaacgucg      180

ccggcgug                                                               188


<210>  64
<211>  24
<212>  DNA
<213>  Artificial

<220>
<223>  T1 target sequence

<400>  64
tgtcctcttc ctctttagcg ttta                                              24


<210>  65
<211>  169
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta 2 sgRNA without terminator-like hairpin targeting T1 
       sequence

<400>  65
guuaggcguu ccgucucgac uaugccguac cacuagaccg agccuacacg gcacgcgguc       60

auagcguuaa ccaaggcgug gugacaagcc ucuuucaggc gucggacacu uaagagcguu      120

gaaaaacgcu cuuagggaau gaaagugucc ucuuccucuu uagcguuua                  169


<210>  66
<211>  165
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta 2 sgRNA without terminator-like hairpin targeting 20 nt 
       of T2 sequence

<400>  66
guuaggcguu ccgucucgac uaugccguac cacuagaccg agccuacacg gcacgcgguc       60

auagcguuaa ccaaggcgug gugacaagcc ucuuucaggc gucggacacu uaagagcguu      120

gaaaaacgcu cuuagggaau gaaagaguug acccaacguc gccgg                      165


<210>  67
<211>  45
<212>  DNA
<213>  Artificial

<220>
<223>  ssDNA oligonucleotide activator

<400>  67
agcttcatct agattgtgca cgccggcgac gttgggtcaa ctggg                       45


<210>  68
<211>  45
<212>  DNA
<213>  Artificial

<220>
<223>  Complementary oligonucleotide used to generate dsDNA activator 
       and used as ssDNA non-specific activator

<400>  68
aattcccagt tgacccaacg tcgccggcgt gcacaatcta gatga                       45


<210>  69
<211>  45
<212>  DNA
<213>  Artificial

<220>
<223>  Oligonucleotide 1 used to generate dsDNA non-specific activator

<400>  69
agctttgtcc tcttcctctt tagcgtttag aatttgtcga cgggg                       45


<210>  70
<211>  45
<212>  DNA
<213>  Artificial

<220>
<223>  Oligonucleotide 2 used to generated dsDNA non-specific activator

<400>  70
aattccccgt cgacaaattc taaacgctaa agaggaagag gacaa                       45


<210>  71
<211>  7
<212>  PRT
<213>  Simian virus 40

<400>  71

Pro Lys Lys Lys Arg Lys Val 
1               5           


<210>  72
<211>  896
<212>  DNA
<213>  Zea mays

<400>  72
gtgcagcgtg acccggtcgt gcccctctct agagataatg agcattgcat gtctaagtta       60

taaaaaatta ccacatattt tttttgtcac acttgtttga agtgcagttt atctatcttt      120

atacatatat ttaaacttta ctctacgaat aatataatct atagtactac aataatatca      180

gtgttttaga gaatcatata aatgaacagt tagacatggt ctaaaggaca attgagtatt      240

ttgacaacag gactctacag ttttatcttt ttagtgtgca tgtgttctcc tttttttttg      300

caaatagctt cacctatata atacttcatc cattttatta gtacatccat ttagggttta      360

gggttaatgg tttttataga ctaatttttt tagtacatct attttattct attttagcct      420

ctaaattaag aaaactaaaa ctctatttta gtttttttat ttaataattt agatataaaa      480

tagaataaaa taaagtgact aaaaattaaa caaataccct ttaagaaatt aaaaaaacta      540

aggaaacatt tttcttgttt cgagtagata atgccagcct gttaaacgcc gtcgacgagt      600

ctaacggaca ccaaccagcg aaccagcagc gtcgcgtcgg gccaagcgaa gcagacggca      660

cggcatctct gtcgctgcct ctggacccct ctcgagagtt ccgctccacc gttggacttg      720

ctccgctgtc ggcatccaga aattgcgtgg cggagcggca gacgtgagcc ggcacggcag      780

gcggcctcct cctcctctca cggcaccggc agctacgggg gattcctttc ccaccgctcc      840

ttcgctttcc cttcctcgcc cgccgtaata aatagacacc ccctccacac cctctt          896


<210>  73
<211>  82
<212>  DNA
<213>  Zea mays

<400>  73
tccccaacct cgtgttgttc ggagcgcaca cacacacaac cagatctccc ccaaatccac       60

ccgtcggcac ctccgcttca ag                                                82


<210>  74
<211>  1013
<212>  DNA
<213>  Zea mays

<400>  74
gtacgccgct cgtcctcccc cccccccctc tctaccttct ctagatcggc gttccggtcc       60

atgcatggtt agggcccggt agttctactt ctgttcatgt ttgtgttaga tccgtgtttg      120

tgttagatcc gtgctgctag cgttcgtaca cggatgcgac ctgtacgtca gacacgttct      180

gattgctaac ttgccagtgt ttctctttgg ggaatcctgg gatggctcta gccgttccgc      240

agacgggatc gatttcatga ttttttttgt ttcgttgcat agggtttggt ttgccctttt      300

cctttatttc aatatatgcc gtgcacttgt ttgtcgggtc atcttttcat gctttttttt      360

gtcttggttg tgatgatgtg gtctggttgg gcggtcgttc tagatcggag tagaattctg      420

tttcaaacta cctggtggat ttattaattt tggatctgta tgtgtgtgcc atacatattc      480

atagttacga attgaagatg atggatggaa atatcgatct aggataggta tacatgttga      540

tgcgggtttt actgatgcat atacagagat gctttttgtt cgcttggttg tgatgatgtg      600

gtgtggttgg gcggtcgttc attcgttcta gatcggagta gaatactgtt tcaaactacc      660

tggtgtattt attaattttg gaactgtatg tgtgtgtcat acatcttcat agttacgagt      720

ttaagatgga tggaaatatc gatctaggat aggtatacat gttgatgtgg gttttactga      780

tgcatataca tgatggcata tgcagcatct attcatatgc tctaaccttg agtacctatc      840

tattataata aacaagtatg ttttataatt attttgatct tgatatactt ggatgatggc      900

atatgcagca gctatatgtg gattttttta gccctgcctt catacgctat ttatttgctt      960

ggtactgttt cttttgtcga tgctcaccct gttgtttggt gttacttctg cag            1013


<210>  75
<211>  21
<212>  DNA
<213>  Artificial

<220>
<223>  Sequence encoding SV40 NLS

<400>  75
ccaaagaaga agcgcaaggt c                                                 21


<210>  76
<211>  96
<212>  DNA
<213>  Artificial

<220>
<223>  Exon 1 of maize optimized Cas-beta 1 nuclease

<400>  76
atgcctaaaa agacctcgac ggtcgccctc agccccaggg acatcaggct cagggagctg       60

ggcgagaaga ggctgcagag gctgaggcag agggag                                 96


<210>  77
<211>  2466
<212>  DNA
<213>  Artificial

<220>
<223>  Exon 2 of maize optimized Cas-beta 1 nuclease

<400>  77
gagaagatcc gccgccacct ggagagcgag aggggccgcc gcgacttcca gagcctccac       60

tttctgctcc ataaaattga ggtcgagcgc aacgacctct acaggaacct ctaccagaac      120

gagggccacg agagctacgt ccccaagccc ggcaagacca agcacaggaa ggagctcagc      180

ctcccgagca ccgagctccc cagccccccc gacgagaaga agggaccccg cccgaagaag      240

agcaggtacg tcatcccaca gcccgtcccc ggaatcaatc tccccaggct catcaacagg      300

ttcggcaaga gcgaccagaa gagcgagagc gaccaggagg gacgcttctg gacgagcgcc      360

cccttcatcg aggtcgagct cccaatgctc aacgcccacc gcgtcataaa ggccctcatg      420

cgcttcgtcc agaaggacga gcgcagcgtc gtccgcacct gggccgtcac caagttcgga      480

tcgatagagg cggccaggga ggtcctcctc gccggcgccc tcctgcagag ggagcccgag      540

attatgcgcg gcttcctcca gaatatcgac ccctggggat cgctgtcgga cgaggagctg      600

atccgcgatg aaaaagcctg gaggaccgtc aagctcctcg cccagaagaa ctgggtcgac      660

cagatagcca agtcgataaa ggactctgcg cccaagggcg tcgacaagga caccctcgac      720

aggcgcctcc gcagcggcct caaagcattc cactctgcgg ccaacagcgg caagcacacc      780

aacccccagt tcccatacct cacctccgag aagcccagcg ccaacttcga gtccgtcgtc      840

gacagcgtcc tggagttcct cgacctggag gacaaggacc gctacaccat agcgaaggtc      900

gacgacaaga agaggcacag ggtcaccgcc ctccagaagg agctcggcca ggccaagccc      960

cgcgtccgcc tggagcagga gaggagccgc tgggccggcc acagctacct ccagggcacc     1020

atcacccgca agaggcaggc cagcctcgtc tgggacggcc acaggaccga gaacggactc     1080

gccctcgcca taccgctaga cggcatgccc aagatcgacg tccagaggta catgtaccag     1140

gacggaacca gcctcctcag cgaccgccag atcaccagca agaccaagag cgagggcaag     1200

gactgcgccc tcatgcccct caggttcaag catgccttcc tcaggtggta caccaagcac     1260

gtcgagaacc acgtcgccga ggcccccctg gagcgccgct gcatccacaa caccacccag     1320

ttcgtgatag tggacccaga gggaaagcat cccaggctct ttatcaggcc cgtgttcaag     1380

ttctacgaca gcaacaagac aatacagaac agcaacgccc cgtggtgcaa gccccagtgc     1440

aggtacctca taggcatcga ccgcggcatc aactacgtcc tccgcgccgt cgtcgtcgac     1500

accgaggaga aggccgtcat cgacgacatc cccctccccg gccgcaagcg cgagtggagg     1560

gccatcaggc aggagatcgc ctacttccag aggatgaggg acctcagcaa gagcgcccag     1620

gagcgcaaca ggtacgtcgt cgccctcgcc aaggccagga ggaaggacag gagcctggga     1680

aagaccgaaa ccgtggaggc cgtcgccaag ctcgtccaga actgcagcga gaggttcggc     1740

gagggcaact actgcttcgt cctggagaac ctggagctcg gcgccctcaa cctcaagagg     1800

aacaacaggg tcaagcacct cgccagcatg gaggaggccc tcatctacca gatgaggaag     1860

aggggctact tctacaacag caggagcaac cgcgtcgacg gcgtccgctg ggaggccgcc     1920

aggtacacca gccaggtcag cccgttcgga tggtgggcga agagggacga ggtcgagaag     1980

gccaagaagc aggataaaag catggccatc ggccgcaaga tcggcgaggg ctacgagggc     2040

cctcaggacg atgaaataga gtcccactct acgatctaca ggcagggcag gtggatgaaa     2100

ctcaggaacg aggaggggaa ggcgtacgga aggagcaggt tcgtcgtcca gcccgaggac     2160

ctcgacccag cccagccccg gaggttcagc tggggcagcg agctgttctg ggacccatac     2220

cagaaggagt tcaagggcaa gagcttcagc cagggcgtgg tcctggacgc cgactttgtc     2280

ggcgcgctga acatagcgct gaggcccctc gtcaacgacg gcaagggcaa gggcttcacc     2340

accgccatga tggccgaggc ccacgtcaag ctcaacccaa cgttcgagat caggtgcaag     2400

atccccgtct acgagttcat cgccgagaac gacaacagca gggccgccct caggcgcatc     2460

gtcatc                                                                2466


<210>  78
<211>  189
<212>  DNA
<213>  Solanum tuberosum

<400>  78
gtaagtttct gcttctacct ttgatatata tataataatt atcattaatt agtagtaata       60

taatatttca aatatttttt tcaaaataaa agaatgtagt atatagcaat tgcttttctg      120

tagtttataa gtgtgtatat tttaatttat aacttttcta atatatgacc aaaacatggt      180

gatgtgcag                                                              189


<210>  79
<211>  317
<212>  DNA
<213>  Solanum tuberosum

<400>  79
agacttgtcc atcttctgga ttggccaact taattaatgt atgaaataaa aggatgcaca       60

catagtgaca tgctaatcac tataatgtgg gcatcaaagt tgtgtgttat gtgtaattac      120

tagttatctg aataaaagag aaagagatca tccatatttc ttatcctaaa tgaatgtcac      180

gtgtctttat aattctttga tgaaccagat gcatttcatt aaccaaatcc atatacatat      240

aaatattaat catatataat taatatcaat tgggttagca aaacaaatct agtctaggtg      300

tgttttgcga atgcggc                                                     317


<210>  80
<211>  1001
<212>  DNA
<213>  Zea mays

<400>  80
tgagagtaca atgatgaacc tagattaatc aatgccaaag tctgaaaaat gcaccctcag       60

tctatgatcc agaaaatcaa gattgcttga ggccctgttc ggttgttccg gattagagcc      120

ccggattaat tcctagccgg attacttctc taatttatat agattttgat gagctggaat      180

gaatcctggc ttattccggt acaaccgaac aggccctgaa ggataccagt aatcgctgag      240

ctaaattggc atgctgtcag agtgtcagta ttgcagcaag gtagtgagat aaccggcatc      300

atggtgccag tttgatggca ccattagggt tagagatggt ggccatgggc gcatgtcctg      360

gccaactttg tatgatatat ggcagggtga ataggaaagt aaaattgtat tgtaaaaagg      420

gatttcttct gtttgttagc gcatgtacaa ggaatgcaag ttttgagcga gggggcatca      480

aagatctggc tgtgtttcca gctgtttttg ttagccccat cgaatccttg acataatgat      540

cccgcttaaa taagcaacct cgcttgtata gttccttgtg ctctaacaca cgatgatgat      600

aagtcgtaaa atagtggtgt ccaaagaatt tccaggccca gttgtaaaag ctaaaatgct      660

attcgaattt ctactagcag taagtcgtgt ttagaaatta tttttttata tacctttttt      720

ccttctatgt acagtaggac acagtgtcag cgccgcgttg acggagaata tttgcaaaaa      780

agtaaaagag aaagtcatag cggcgtatgt gccaaaaact tcgtcacaga gagggccata      840

agaaacatgg cccacggccc aatacgaagc accgcgacga agcccaaaca gcagtccgta      900

ggtggagcaa agcgctgggt aatacgcaaa cgttttgtcc caccttgact aatcacaaga      960

gtggagcgta ccttataaac cgagccgcaa gcaccgaatt g                         1001


<210>  81
<211>  156
<212>  DNA
<213>  Artificial

<220>
<223>  Sequence encoding Cas-beta Affinity Domain of sgRNA

<400>  81
tttctaagtt ccatctcgat gcggaacgga tactacgctg tagtctatac gacacgagtg       60

atagccctgc ggggttcgcc cctaagtccg tatgacaagc ctctttcagg cggtggactt      120

ctaagagtgc tgaaaagcac tcttagggaa tgaaag                                156


<210>  82
<211>  24
<212>  DNA
<213>  Zea mays

<400>  82
tcctagcgca gagcatcaag ctcg                                              24


<210>  83
<211>  68
<212>  DNA
<213>  Hepatitis delta virus

<400>  83
ggccggcatg gtcccagcct cctcgctggc gccggctggg caacatgctt cggcatggcg       60

aatgggac                                                                68


<210>  84
<211>  8
<212>  DNA
<213>  Zea mays

<400>  84
tttttttt                                                                 8


<210>  85
<211>  92
<212>  PRT
<213>  Armatimonadetes bacterium

<400>  85

Met Lys Trp Leu Val Cys Tyr Asp Ile Glu Lys Asp Ser Val Arg Asn 
1               5                   10                  15      


Lys Val Ala Asp Phe Cys Leu Asp Lys Gly Leu Glu Arg Val Gln Tyr 
            20                  25                  30          


Ser Val Phe Leu Gly Ser Met Thr Arg Thr Leu Ala Lys Glu Leu Gly 
        35                  40                  45              


Ala Gln Ile Arg Lys Arg Met Gly Lys Asn Pro Gly Gln Val Arg Phe 
    50                  55                  60                  


Val Pro Ile Cys Glu Lys Asp Trp Arg Ser Ser Phe Arg Val Gln Val 
65                  70                  75                  80  


Gly Asp His Met Gly Glu Lys Ser Ser Asp Asp Lys 
                85                  90          


<210>  86
<211>  221
<212>  PRT
<213>  Armatimonadetes bacterium

<400>  86

Met Ile Ser Arg Phe His Ala Tyr Gln Pro Leu Asp Met Val Asn Val 
1               5                   10                  15      


Ser Asp Leu Arg Gln Trp Val Tyr Cys Pro Arg Val Val Trp Tyr Gly 
            20                  25                  30          


Arg Ser Leu Gly Asp Tyr Arg Pro Arg Thr Gly Ala Met Lys Val Gly 
        35                  40                  45              


Ile Glu Ala Glu Ala Glu Arg Gln Arg Leu Glu Glu Arg Arg Thr Phe 
    50                  55                  60                  


Ala Gln Tyr Gly Leu Gly Ala Cys Thr Lys Arg Phe Gln Val Pro Val 
65                  70                  75                  80  


Val Ser Glu Ala Leu Gly Leu Ser Gly Arg Ile Asp Cys Leu Phe Glu 
                85                  90                  95      


Leu Thr Pro Val Ala Leu Glu Asp Ala Gln Val Gly Val Arg Pro Leu 
            100                 105                 110         


Asn Trp Lys Glu Gly Asp Pro Met Phe Ala Pro Val Glu Tyr Lys Trp 
        115                 120                 125             


Thr Ser Arg Ala Asp Gln Arg Arg Asn Thr Ile Gln Leu Ala Ala Tyr 
    130                 135                 140                 


Gly Met Ile Leu Glu Ser Leu Thr Gly Thr Pro Val Pro Leu Gly Phe 
145                 150                 155                 160 


Ile Ala Leu Leu Pro Glu Glu Glu Val Val Arg Val Glu Leu Gly Pro 
                165                 170                 175     


Arg Val Arg Arg Ala Val Lys Arg Thr Leu Asp Glu Ala Arg Glu Gly 
            180                 185                 190         


Leu Ser Ala Ser Glu Leu Pro Trp Pro Thr Leu His Arg Gly Lys Cys 
        195                 200                 205             


Gln Asp Cys Glu Phe Arg Arg Phe Cys Asn Asp Val Trp 
    210                 215                 220     


<210>  87
<211>  2583
<212>  DNA
<213>  Armatimonadetes bacterium

<400>  87
atgggtaaga atcggtcctc gtcctcggat ttgagccagc tcgaacgatc cttacggaaa       60

gtcggtgaga atcgccttga gcggctgcgg gtgcgtgggc agaagattag gaagcacctt      120

gaacagcacc cccgaggtaa gaacgatcat caggccctcc actttctgct ccaccagatc      180

gaggtcgaac ggaatgacct gtaccgaaac ctcaaagacc ccgaatacgt gcccaagcca      240

gcgaaacggc ggcgagaaag acggcagatc aacgtcgccc aaccgccgac ccgacccacc      300

aagagtgtgg ggccgaaacc agcgccgacg acatacgtga tcccgcgccc cgagccaggc      360

cgtgacctac cagcattcgc gagcaggtac aaggcaagtg actcgagagg cgaggacgac      420

caagacggtc ggtcatggac tgccgcgccc tttgtcgaag tcgagctgcc gatacaaatt      480

gccggcaaga tcctcgagaa actccgtaag tacgtgcaaa aggacgaacg ggagatcgtt      540

cgcgagtggg ctgtcaagac ctatggctcg atcgaagccg caagagaacc acttcttatc      600

ggggcacaag tctcggaaga cgtctcggtc tggcgcggac tcctcgcaga aacgaaaaac      660

gcacaggact tcgccgccct ctccgacgat cagatcgaag cagcgatgtc gaaggaggcg      720

aaggggtcag acctgcgtcc gaggcgcgcc gcactgctag tcgcacagcg ccactgggtg      780

gatcagaccg tcaaggcaat caaggagtcg gccccgaaag gcgtcgataa ggacacactc      840

gatcgccgtt tgcgcgctgg cctaaggggg tttcatacag cagctaattc gggtaagcac      900

acgaacccac agttcccata cttgacgccg aaagaggcaa aggtgccgtt agaatcggtc      960

gtcaatcagg tcttagagtt cctcgacgac gcggacgacc agcgctacgt ccaggtccac     1020

agagtcagtc atctccagaa ggaactcggg aaggcgaggc cgcgcaagcg actggagctt     1080

cagaggccaa agtgggcggg taggcctaca gtgcaaggaa cgatcagcaa acggcgcgac     1140

gccgcactcg tgtgggacac gagcaagaag gaaaacggcc tctgcctcgc gctcccgctc     1200

gggggtttgc agaagataga tgttgagcgg ttcatctacc aagacggcac gtcactattg     1260

tcggactgcc agatcgcgtc gaagacctcc aagaaaggtg cggcgtgcgc gctcatgccg     1320

ctcaagccca agcacgactt cctgcgttgg tacaccaaac acgtcgagaa ccacaacgca     1380

gacgcgccgc tcgagcgccg ctgtctgcac aacacgaccc agttcgtgat cgtggatcca     1440

gaggggcagc gcccgcgtct cttcatccgc cccgtcttca agttctacga ccccggcaag     1500

gcagtgccga acacgcacga aacttggaag aagccggact gccgctacct ggtagggatc     1560

gaccgaggta tcaactacgt tctgcgcgct gtcgttgtgg acatcgaaaa gaaggaagtc     1620

atcgctgaca tccacctaca aggcgacaag cacaaatgga ggatgatccg cgacgagatc     1680

gcctaccacc aacagatgcg tgatcttgcc agcaacacag gcaaacaccc gagcgtcgtg     1740

gcgaggcacg tccgcgcact cgccctcgcc cgcaagaagg atcgcgcgct cggcaggttt     1800

acgacggtca aggctgtcgc agatatcgtc atgcaatgcg aaaacgacta cggaagcggt     1860

aactactgct tcgtgctcga agacctcgac atgggcaaga tgaatctcaa gcgcaacaac     1920

cgcgtgaagc acatggccgt catgaaggaa gcgcttgtca atcaaatgcg caagcgcggc     1980

tatgcctacg acggtcgccg cggccgggcg gacggcgtca ggtacgaggg cgcatggtac     2040

acgagccaag tgtccccctt cggctggtgg gccaagcgtg aagaggtgga ggaggcgtgg     2100

aagaaggaca cgtcgcgccc gatcggtcgc aaggtcggca actggtacga gatgccagat     2160

ccgaacgaag aaggaaagcg gtcagacgtg tatcggaagg gctgctggaa gaaaccgcag     2220

aacgcaagcg gaaagccata cgggcggaac cgcttctgtg tggaacctgg cgacgagaag     2280

ccggacgctc agcggcgctt ctcctggggg agcgagctgt tctgggaccc gaacgtgaag     2340

tccttcaagg gcaaagagtt tcccgagggg gtcgtgctgg acgccgactt cgtaggagcg     2400

ctcaacatcg cccttcgccc actcgtcaac gacggtcagg gcaggggctt cacggcagac     2460

aagatggccg aagcgcatac gagactcaac ccgcagttcg agatcgtttg caaaatcccc     2520

gtttatgagt tcatcgaaga gcacggtgac aagagggcaa aactcaggcg gatcgtgcta     2580

tag                                                                   2583


<210>  88
<211>  5309
<212>  DNA
<213>  Armatimonadetes bacterium


<220>
<221>  misc_feature
<222>  (2994)..(3091)
<223>  n is a, c, g, or t

<400>  88
atgggtaaga atcggtcctc gtcctcggat ttgagccagc tcgaacgatc cttacggaaa       60

gtcggtgaga atcgccttga gcggctgcgg gtgcgtgggc agaagattag gaagcacctt      120

gaacagcacc cccgaggtaa gaacgatcat caggccctcc actttctgct ccaccagatc      180

gaggtcgaac ggaatgacct gtaccgaaac ctcaaagacc ccgaatacgt gcccaagcca      240

gcgaaacggc ggcgagaaag acggcagatc aacgtcgccc aaccgccgac ccgacccacc      300

aagagtgtgg ggccgaaacc agcgccgacg acatacgtga tcccgcgccc cgagccaggc      360

cgtgacctac cagcattcgc gagcaggtac aaggcaagtg actcgagagg cgaggacgac      420

caagacggtc ggtcatggac tgccgcgccc tttgtcgaag tcgagctgcc gatacaaatt      480

gccggcaaga tcctcgagaa actccgtaag tacgtgcaaa aggacgaacg ggagatcgtt      540

cgcgagtggg ctgtcaagac ctatggctcg atcgaagccg caagagaacc acttcttatc      600

ggggcacaag tctcggaaga cgtctcggtc tggcgcggac tcctcgcaga aacgaaaaac      660

gcacaggact tcgccgccct ctccgacgat cagatcgaag cagcgatgtc gaaggaggcg      720

aaggggtcag acctgcgtcc gaggcgcgcc gcactgctag tcgcacagcg ccactgggtg      780

gatcagaccg tcaaggcaat caaggagtcg gccccgaaag gcgtcgataa ggacacactc      840

gatcgccgtt tgcgcgctgg cctaaggggg tttcatacag cagctaattc gggtaagcac      900

acgaacccac agttcccata cttgacgccg aaagaggcaa aggtgccgtt agaatcggtc      960

gtcaatcagg tcttagagtt cctcgacgac gcggacgacc agcgctacgt ccaggtccac     1020

agagtcagtc atctccagaa ggaactcggg aaggcgaggc cgcgcaagcg actggagctt     1080

cagaggccaa agtgggcggg taggcctaca gtgcaaggaa cgatcagcaa acggcgcgac     1140

gccgcactcg tgtgggacac gagcaagaag gaaaacggcc tctgcctcgc gctcccgctc     1200

gggggtttgc agaagataga tgttgagcgg ttcatctacc aagacggcac gtcactattg     1260

tcggactgcc agatcgcgtc gaagacctcc aagaaaggtg cggcgtgcgc gctcatgccg     1320

ctcaagccca agcacgactt cctgcgttgg tacaccaaac acgtcgagaa ccacaacgca     1380

gacgcgccgc tcgagcgccg ctgtctgcac aacacgaccc agttcgtgat cgtggatcca     1440

gaggggcagc gcccgcgtct cttcatccgc cccgtcttca agttctacga ccccggcaag     1500

gcagtgccga acacgcacga aacttggaag aagccggact gccgctacct ggtagggatc     1560

gaccgaggta tcaactacgt tctgcgcgct gtcgttgtgg acatcgaaaa gaaggaagtc     1620

atcgctgaca tccacctaca aggcgacaag cacaaatgga ggatgatccg cgacgagatc     1680

gcctaccacc aacagatgcg tgatcttgcc agcaacacag gcaaacaccc gagcgtcgtg     1740

gcgaggcacg tccgcgcact cgccctcgcc cgcaagaagg atcgcgcgct cggcaggttt     1800

acgacggtca aggctgtcgc agatatcgtc atgcaatgcg aaaacgacta cggaagcggt     1860

aactactgct tcgtgctcga agacctcgac atgggcaaga tgaatctcaa gcgcaacaac     1920

cgcgtgaagc acatggccgt catgaaggaa gcgcttgtca atcaaatgcg caagcgcggc     1980

tatgcctacg acggtcgccg cggccgggcg gacggcgtca ggtacgaggg cgcatggtac     2040

acgagccaag tgtccccctt cggctggtgg gccaagcgtg aagaggtgga ggaggcgtgg     2100

aagaaggaca cgtcgcgccc gatcggtcgc aaggtcggca actggtacga gatgccagat     2160

ccgaacgaag aaggaaagcg gtcagacgtg tatcggaagg gctgctggaa gaaaccgcag     2220

aacgcaagcg gaaagccata cgggcggaac cgcttctgtg tggaacctgg cgacgagaag     2280

ccggacgctc agcggcgctt ctcctggggg agcgagctgt tctgggaccc gaacgtgaag     2340

tccttcaagg gcaaagagtt tcccgagggg gtcgtgctgg acgccgactt cgtaggagcg     2400

ctcaacatcg cccttcgccc actcgtcaac gacggtcagg gcaggggctt cacggcagac     2460

aagatggccg aagcgcatac gagactcaac ccgcagttcg agatcgtttg caaaatcccc     2520

gtttatgagt tcatcgaaga gcacggtgac aagagggcaa aactcaggcg gatcgtgcta     2580

tagtaggccg ttctgactcg atgcgggacg gatactacac taagcctaaa cggcacgagc     2640

gatagccctg cggggattcc ccaaagcccg tacgacaagc ctctttcagg cgtcggacac     2700

ttaagagcgt taggcgggcg gtccctaagc cgctcgcccc cttatcccca cggtttccaa     2760

gaaccccgta actctgccag tcacaaacac ccagaagcgc gtctacactt acttgtgagt     2820

aaggagtaga tagacatgcg ctatgagatc gtagatggct acgggtgcca ggtgcttaag     2880

cacagcgagc gcctgatcct caagtggtcc gccagagacg accagcaacc caagcgcgag     2940

gtgccgattc tgcacctcga ccacctgctc gtcggctcca agggcgtgac ggtnnnnnnn     3000

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn     3060

nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ntcacggtcg tggactggcg tggtcgtcca     3120

gttgggaggt tcggaagccc cgccctccac ggaacggcac aggtgcgccg agcacagata     3180

gccgccttcg acacaaaacg aggagcctct ttcgccagcg agatcgtcgg cggcaagctc     3240

ctcaaccagg cagacaatct caactacatc ggcaagaaca gaaagacgcg cgctccagaa     3300

gtttacgaag agttgacaag gacagccgac acgctccaac gtctggcaaa gaaggcagtg     3360

gcggtcaaag gcaagaatgc ggacgaaatt cggatgctcc tcatgacggt cgaggcagag     3420

ggcgcccgcg cttactggtc ggtgctcagc gaagtctacg gcaaagggtc ggggttcgcc     3480

aagcgcgaac agcgcggcac ccgcgacccg gtcaacgccg tgctcaacta cgcctatggc     3540

gtactcaatg gcgaggtctg gaacgccgtc gtgctggccg gactggagcc ctacgcaggg     3600

ttcctgcacg tcgatcggcc gggacgcctg agcttcgtgc tcgatctgat ggaggagttc     3660

cgccccgtcg tcgccgaccg cgttgtgttt ggcctcgtcg ccaagggatg gaagatcggt     3720

caggaggaga acggttggct cgatctatcg acgaagaagc ggctcgtccg ggcgatcggc     3780

gacaggtggg ggacgcgcgt cgagcaccaa ggtcgcaagt tgcaactacg atctgtgctc     3840

cagctgcagg cacgggacgc cgcacggcac ttccaggaca aggcggagta cgacgcgttc     3900

cgccagaggt ggtaaggcat gaagtggctc gtgtgctacg acatcgagaa ggatagtgtc     3960

cgaaacaagg tggcagactt ctgccttgac aaggggctcg aacgggttca atacagcgtc     4020

ttccttgggt cgatgacaag aacgctcgcc aaagaacttg gagcacagat caggaagcga     4080

atgggcaaga atccagggca agtgcgcttt gtgccgattt gtgagaagga ctggcgttcg     4140

tcgttccgcg tccaagtcgg tgaccacatg ggagaaaagt catccgatga taagtaggtt     4200

ccacgcctac caaccacttg acatggtgaa cgtgagcgat ctccgccagt gggtgtactg     4260

tccgcgcgtc gtctggtacg ggcgctcatt gggcgactac cgcccgagga ccggggcgat     4320

gaaggtgggg atagaggcgg aggcggagcg gcagaggctg gaggagcgaa ggacgttcgc     4380

tcaatacggt ctgggggcgt gcaccaagcg gttccaggtt cccgttgtgt cggaggcgtt     4440

ggggctgtcg gggcgaatcg actgcctgtt cgaacttacg cccgttgcgc tggaggacgc     4500

tcaggtcggg gtgaggccgc tgaactggaa ggagggcgat ccgatgttcg cgccagttga     4560

gtacaaatgg acgtccagag ctgaccagag gcggaacaca attcagctgg ccgcttatgg     4620

gatgattctg gaaagcttga cggggacgcc cgtgccgctc ggcttcatcg cgctcctgcc     4680

tgaggaagag gttgtccgtg tcgaactcgg ccccagagtt aggcgcgcag tcaaacgcac     4740

cctagacgag gctcgcgaag gcctttccgc ctcggaactc ccctggccaa cgctccaccg     4800

cggaaagtgc caagactgcg agttccgacg gttttgcaat gacgtctggt agatggggtt     4860

cgaaaccctc gcagctgctc ggatttcggg gcgatgtgcg aatcctaacg cccccgacac     4920

tcaagatgat ggaggccttc gacccacaat caagcccgac tcagacagcg aagagccaat     4980

ggtgtgcgca aattcgctct caccgggtac attcattcgc acaacggcgg actttgtaag     5040

ctaaatagcg ggtttcgaaa cgacgcggcc gaacctgtta gggcagggtt gcagcaaatg     5100

gacggcgtgt ctgagtccgg tcatgcatgg tcgcagggga tcaagaacgc tcttagggaa     5160

tgaaaggacc gacggccttc cgaacggaga ggttctcgcg tcgtcgcagg ggatcaagaa     5220

cgctcttagg gaatgaaaga tggctctcga cgaaggccta cggctggacc gtcgagatgc     5280

aggtcaaggc ggcccgcgcc ggcctgcgc                                       5309


<210>  89
<211>  160
<212>  RNA
<213>  Armatimonadetes bacterium

<400>  89
uaggccguuc ugacucgaug cgggacggau acuacacuaa gccuaaacgg cacgagcgau       60

agcccugcgg ggauucccca aagcccguac gacaagccuc uuucaggcgu cggacacuua      120

agagcguuag gcgggcgguc ccuaagccgc ucgcccccuu                            160


<210>  90
<211>  128
<212>  RNA
<213>  Armatimonadetes bacterium

<400>  90
uaggccguuc ugacucgaug cgggacggau acuacacuaa gccuaaacgg cacgagcgau       60

agcccugcgg ggauucccca aagcccguac gacaagccuc uuucaggcgu cggacacuua      120

agagcguu                                                               128


<210>  91
<211>  231
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta 4 sgRNA with terminator-like hairpin


<220>
<221>  misc_feature
<222>  (202)..(231)
<223>  n is a, c, g, or u

<400>  91
uaggccguuc ugacucgaug cgggacggau acuacacuaa gccuaaacgg cacgagcgau       60

agcccugcgg ggauucccca aagcccguac gacaagccuc uuucaggcgu cggacacuua      120

agagcguuag gcgggcgguc ccuaagccgc ucgcccccuu gaaagucgca ggggaucaag      180

aacgcucuua gggaaugaaa gnnnnnnnnn nnnnnnnnnn nnnnnnnnnn n               231


<210>  92
<211>  190
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta 4 sgRNA without terminator-like hairpin


<220>
<221>  misc_feature
<222>  (154)..(190)
<223>  n is a, c, g, or u

<400>  92
uaggccguuc ugacucgaug cgggacggau acuacacuaa gccuaaacgg cacgagcgau       60

agcccugcgg ggauucccca aagcccguac gacaagccuc uuucaggcgu cggacacuua      120

agagcguuga aaaacgcucu uagggaauga aagnnnnnnn nnnnnnnnnn nnnnnnnnnn      180

nnnnnnnnnn                                                             190


<210>  93
<211>  27
<212>  DNA
<213>  Homo sapiens

<400>  93
cccacagttc gattaccttt cccactc                                           27


<210>  94
<211>  27
<212>  DNA
<213>  Homo sapiens

<400>  94
cctttcccac tcactgcttt ctcctct                                           27


<210>  95
<211>  27
<212>  DNA
<213>  Homo sapiens

<400>  95
cccgccttca gaagagggtg cattttc                                           27


<210>  96
<211>  27
<212>  DNA
<213>  Homo sapiens

<400>  96
cctgaaaatg caccctcttc tgaaggc                                           27


<210>  97
<211>  169
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta2 sgRNA targeting WTAP site 1

<400>  97
guuaggcguu ccgucucgac uaugccguac cacuagaccg agccuacacg gcacgcgguc       60

auagcguuaa ccaaggcgug gugacaagcc ucuuucaggc gucggacacu uaagagcguu      120

gaaaaacgcu cuuagggaau gaaagacagu ucgauuaccu uucccacuc                  169


<210>  98
<211>  169
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta2 sgRNA targeting WTAP site 2

<400>  98
guuaggcguu ccgucucgac uaugccguac cacuagaccg agccuacacg gcacgcgguc       60

auagcguuaa ccaaggcgug gugacaagcc ucuuucaggc gucggacacu uaagagcguu      120

gaaaaacgcu cuuagggaau gaaaguuccc acucacugcu uucuccucu                  169


<210>  99
<211>  169
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta2 sgRNA targeting RUNX1 site 1

<400>  99
guuaggcguu ccgucucgac uaugccguac cacuagaccg agccuacacg gcacgcgguc       60

auagcguuaa ccaaggcgug gugacaagcc ucuuucaggc gucggacacu uaagagcguu      120

gaaaaacgcu cuuagggaau gaaaggccuu cagaagaggg ugcauuuuc                  169


<210>  100
<211>  169
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta2 sgRNA targeting RUNX1 site 2

<400>  100
guuaggcguu ccgucucgac uaugccguac cacuagaccg agccuacacg gcacgcgguc       60

auagcguuaa ccaaggcgug gugacaagcc ucuuucaggc gucggacacu uaagagcguu      120

gaaaaacgcu cuuagggaau gaaaggaaaa ugcacccucu ucugaaggc                  169


<210>  101
<211>  188
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta3 sgRNA targeting WTAP site 1

<400>  101
guuugaaugu gcucugccga agacgccgca cggagccugg gccggaaucg uagaucgaac       60

gcggcaucga agcccugcag cccuucgggg ccaaggcggc gcagcaagcc ucuuucaggc      120

ggcagagucc uuuagagugu gaaaacacuc uaaaggaaug aaagacaguu cgauuaccuu      180

ucccacuc                                                               188


<210>  102
<211>  188
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta3 sgRNA targeting WTAP site 2

<400>  102
guuugaaugu gcucugccga agacgccgca cggagccugg gccggaaucg uagaucgaac       60

gcggcaucga agcccugcag cccuucgggg ccaaggcggc gcagcaagcc ucuuucaggc      120

ggcagagucc uuuagagugu gaaaacacuc uaaaggaaug aaaguuccca cucacugcuu      180

ucuccucu                                                               188


<210>  103
<211>  188
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta3 sgRNA targeting RUNX1 site 1

<400>  103
guuugaaugu gcucugccga agacgccgca cggagccugg gccggaaucg uagaucgaac       60

gcggcaucga agcccugcag cccuucgggg ccaaggcggc gcagcaagcc ucuuucaggc      120

ggcagagucc uuuagagugu gaaaacacuc uaaaggaaug aaaggccuuc agaagagggu      180

gcauuuuc                                                               188


<210>  104
<211>  188
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta3 sgRNA targeting RUNX1 site 2

<400>  104
guuugaaugu gcucugccga agacgccgca cggagccugg gccggaaucg uagaucgaac       60

gcggcaucga agcccugcag cccuucgggg ccaaggcggc gcagcaagcc ucuuucaggc      120

ggcagagucc uuuagagugu gaaaacacuc uaaaggaaug aaaggaaaau gcacccucuu      180

cugaaggc                                                               188


<210>  105
<211>  73
<212>  DNA
<213>  Artificial

<220>
<223>  5' FAM labeled RUNX1 site 1 non-target strand

<400>  105
accagcagga ctacagcttc cccgccttca gaagagggtg cattttcagc ctttttgtgg       60

gtgtacgttt tgg                                                          73


<210>  106
<211>  73
<212>  DNA
<213>  Artificial

<220>
<223>  5' ROX labeled RUNX1 site 1 target strand

<400>  106
ccaaaacgta cacccacaaa aaggctgaaa atgcaccctc ttctgaaggc ggggaagctg       60

tagtcctgct ggt                                                          73


<210>  107
<211>  71
<212>  DNA
<213>  Artificial

<220>
<223>  5' FAM labeled WTAP site 1 non-target strand

<400>  107
accagcagga ctacagcttc cccacagttc gattaccttt cccactccct ttttgtgggt       60

gtacgttttg g                                                            71


<210>  108
<211>  71
<212>  DNA
<213>  Artificial

<220>
<223>  5' ROX labeled WTAP site 1 target strand

<400>  108
ccaaaacgta cacccacaaa aagggagtgg gaaaggtaat cgaactgtgg ggaagctgta       60

gtcctgctgg t                                                            71


<210>  109
<211>  27
<212>  DNA
<213>  Homo sapiens

<400>  109
cccactcact gctttctcct cttgagg                                           27


<210>  110
<211>  27
<212>  DNA
<213>  Homo sapiens

<400>  110
cccgccacgt tcagaatggc ttggact                                           27


<210>  111
<211>  27
<212>  DNA
<213>  Homo sapiens

<400>  111
ccctcaagag gagaaagcag tgagtgg                                           27


<210>  112
<211>  27
<212>  DNA
<213>  Homo sapiens

<400>  112
cccacgggca gtgaaaactc tctcaca                                           27


<210>  113
<211>  27
<212>  DNA
<213>  Homo sapiens

<400>  113
cccgtgggag agtctacact ttcatac                                           27


<210>  114
<211>  27
<212>  DNA
<213>  Homo sapiens

<400>  114
ccctcttctg aaggcggggg actcaat                                           27


<210>  115
<211>  27
<212>  DNA
<213>  Homo sapiens

<400>  115
ccccgccttc agaagagggt gcatttt                                           27


<210>  116
<211>  27
<212>  DNA
<213>  Homo sapiens

<400>  116
cccctctagc cctacatctc tctttct                                           27


<210>  117
<211>  27
<212>  DNA
<213>  Homo sapiens

<400>  117
ccctctagcc ctacatctct ctttctt                                           27


<210>  118
<211>  27
<212>  DNA
<213>  Homo sapiens

<400>  118
ccctacatct ctctttcttc tcccctc                                           27


<210>  119
<211>  31
<212>  DNA
<213>  Artificial

<220>
<223>  Non-target strand (NTS)

<400>  119
ccccgtcgac aaattctaaa cgctaaagag g                                      31


<210>  120
<211>  11
<212>  DNA
<213>  Artificial

<220>
<223>  Sanger sequence from the reverse direction (Cas-beta2)

<400>  120
cctctttagc g                                                            11


<210>  121
<211>  31
<212>  DNA
<213>  Artificial

<220>
<223>  Target strand (TS)

<400>  121
cctctttagc gtttagaatt tgtcgacggg g                                      31


<210>  122
<211>  27
<212>  DNA
<213>  Artificial

<220>
<223>  Sanger sequence from the forward drection (Cas-beta2)

<400>  122
ccccgtcgac aaattctaaa cgctaaa                                           27


<210>  123
<211>  13
<212>  DNA
<213>  Artificial

<220>
<223>  Sanger sequence from the reverse direction (Cas-beta3)

<400>  123
cctctttagc gtt                                                          13


<210>  124
<211>  26
<212>  DNA
<213>  Artificial

<220>
<223>  Sanger sequence from the forward drection (Cas-beta3)

<400>  124
ccccgtcgac aaattctaaa cgctaa                                            26


<210>  125
<211>  56
<212>  DNA
<213>  Homo sapiens

<400>  125
tcattgagtc ccccgccttc agaagagggt gcattttcag gaggaagcga tggctt           56


<210>  126
<211>  50
<212>  DNA
<213>  Homo sapiens

<400>  126
tcattgagtc ccccgccttc agaagagggt gcaggaggaa gcgatggctt                  50


<210>  127
<211>  51
<212>  DNA
<213>  Homo sapiens

<400>  127
tcattgagtc ccccgccttc agaagagggt ttcaggagga agcgatggct t                51


<210>  128
<211>  51
<212>  DNA
<213>  Homo sapiens

<400>  128
tcattgagtc ccccgccttc agaagagggt gctaggagga agcgatggct t                51


<210>  129
<211>  50
<212>  DNA
<213>  Homo sapiens

<400>  129
tcattgagtc ccccgccttc agaagagttt tcaggaggaa gcgatggctt                  50


<210>  130
<211>  51
<212>  DNA
<213>  Homo sapiens

<400>  130
tcattgagtc ccccgccttc agaagaggtt ttcaggagga agcgatggct t                51


<210>  131
<211>  40
<212>  DNA
<213>  Homo sapiens

<400>  131
tcattgagtc ccccgccttc agaagaggaa gcgatggctt                             40


<210>  132
<211>  95
<212>  DNA
<213>  Homo sapiens

<400>  132
gcggggtatg aaagtgtaga ctctcccacg ggcagtgaaa actctctcac acaccaatca       60

aatgacacag actccagtca tgaccctcaa gagca                                  95


<210>  133
<211>  8
<212>  DNA
<213>  Homo sapiens

<400>  133
gcggggca                                                                 8


<210>  134
<211>  37
<212>  DNA
<213>  Homo sapiens

<400>  134
gcggggcaca ctaccgtcgg aatgaccctc aagagca                                37


<210>  135
<211>  8
<212>  DNA
<213>  Homo sapiens

<400>  135
gcggagca                                                                 8


<210>  136
<211>  32
<212>  DNA
<213>  Homo sapiens

<400>  136
gcggggcaca ctaccgttga ccctcaagag ca                                     32


<210>  137
<211>  36
<212>  DNA
<213>  Homo sapiens

<400>  137
gcggggcaca ctaccgtcgg atgaccctca agagca                                 36


<210>  138
<211>  99
<212>  DNA
<213>  Homo sapiens

<400>  138
tgctcttgag ggtcatgact ggagtctgtg tcatttgatt ggtgtgtgag agagttttca       60

ctgcccgtgg gagagtctac actttcatac cccgcactg                              99


<210>  139
<211>  12
<212>  DNA
<213>  Homo sapiens

<400>  139
tgccccgcac tg                                                           12


<210>  140
<211>  12
<212>  DNA
<213>  Homo sapiens

<400>  140
tgctccgcac tg                                                           12


<210>  141
<211>  17
<212>  DNA
<213>  Homo sapiens

<400>  141
tgctcttgag ggcactg                                                      17


<210>  142
<211>  16
<212>  DNA
<213>  Homo sapiens

<400>  142
tgctcttgag gcactg                                                       16


<210>  143
<211>  165
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta2 sgRNA targeted RUNX1 site 1 (20 nt spacer)

<400>  143
guuaggcguu ccgucucgac uaugccguac cacuagaccg agccuacacg gcacgcgguc       60

auagcguuaa ccaaggcgug gugacaagcc ucuuucaggc gucggacacu uaagagcguu      120

gaaaaacgcu cuuagggaau gaaaggccuu cagaagaggg ugcau                      165


<210>  144
<211>  165
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta2 sgRNA targeted WTAP site 6 (20 nt spacer)

<400>  144
guuaggcguu ccgucucgac uaugccguac cacuagaccg agccuacacg gcacgcgguc       60

auagcguuaa ccaaggcgug gugacaagcc ucuuucaggc gucggacacu uaagagcguu      120

gaaaaacgcu cuuagggaau gaaagacggg cagugaaaac ucucu                      165


<210>  145
<211>  165
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta2 sgRNA targeted WTAP site 7 (20 nt spacer)

<400>  145
guuaggcguu ccgucucgac uaugccguac cacuagaccg agccuacacg gcacgcgguc       60

auagcguuaa ccaaggcgug gugacaagcc ucuuucaggc gucggacacu uaagagcguu      120

gaaaaacgcu cuuagggaau gaaagguggg agagucuaca cuuuc                      165


<210>  146
<211>  184
<212>  RNA
<213>  Artificial

<220>
<223>  Cas-beta3 sgRNA targeted WTAP site 7 (20 nt spacer)

<400>  146
guuugaaugu gcucugccga agacgccgca cggagccugg gccggaaucg uagaucgaac       60

gcggcaucga agcccugcag cccuucgggg ccaaggcggc gcagcaagcc ucuuucaggc      120

ggcagagucc uuuagagugu gaaaacacuc uaaaggaaug aaagguggga gagucuacac      180

uuuc                                                                   184


<210>  147
<211>  62
<212>  DNA
<213>  Zea mays

<400>  147
aggcagatcc tcggccccgc ctggaacgcc gtgaacttga acggcgacgc gttgcggaac       60

gc                                                                      62


<210>  148
<211>  60
<212>  DNA
<213>  Zea mays

<400>  148
aggcagatcc tcggccccgc ctggaacgcc gtgaacttac ggcgacgcgt tgcggaacgc       60


<210>  149
<211>  62
<212>  DNA
<213>  Zea mays

<400>  149
gggtcagagg ccctcctagc gcagagcatc aagctcgtgg acgagttcac ctacagcgtg       60

at                                                                      62


<210>  150
<211>  37
<212>  DNA
<213>  Zea mays

<400>  150
gggtcagagg ccctcctagc gtcacctaca gcgtgat                                37


<210>  151
<211>  43
<212>  DNA
<213>  Artificial

<220>
<223>  Cas-beta2 native spacer 1 target

<400>  151
cccacgggca gtatcggagg gcgcgggaga ggtgccagcc ttc                         43


<210>  152
<211>  40
<212>  DNA
<213>  Artificial

<220>
<223>  Cas-beta2 native spacer 2 target

<400>  152
ccccgttgca ggcaatgatc tctgcaactg gccgtcgaat                             40


<210>  153
<211>  38
<212>  DNA
<213>  Artificial

<220>
<223>  Cas-beta2 native spacer 3 target

<400>  153
cccggagtgg gtgcgactta aagcgtttga cggtggct                               38


<210>  154
<211>  867
<212>  PRT
<213>  Artificial

<220>
<223>  D528A cleavage inactive Cas-beta2

<400>  154

Met Gly Lys Asn Arg Ser Ser Ser Ser Asp Leu Ser Pro Leu Glu Arg 
1               5                   10                  15      


Ser Leu Arg Lys Val Gly Glu Asn Arg Leu Glu Arg Leu Arg Val Arg 
            20                  25                  30          


Glu Glu Lys Ile Arg Lys His Ile Glu Gln His Pro Arg Gly Lys Asn 
        35                  40                  45              


Asp His Gln Ala Leu His Phe Leu Leu His Gln Ile Glu Val Glu Arg 
    50                  55                  60                  


Asn Asp Leu Tyr Arg Asn Leu Lys Asp Pro Glu Tyr Val Pro Lys Pro 
65                  70                  75                  80  


Ala Lys Gln Arg Arg Glu Arg Arg Gln Ile Asn Val Ala Lys Pro Pro 
                85                  90                  95      


Thr Arg Pro Lys Lys Glu Lys Gly Pro Gln Pro Glu Ser Thr Lys Tyr 
            100                 105                 110         


Val Ile Arg Pro Pro Val Pro Gly Lys Asn Leu Pro Ala Phe Ala Ser 
        115                 120                 125             


Lys Tyr Glu Ala Arg Asp Thr Arg Asp Asp Ser Tyr Gln Asp Gly Arg 
    130                 135                 140                 


Ser Trp Thr Ser Ala Pro Tyr Val Glu Val Glu Leu Pro Ile Leu Gly 
145                 150                 155                 160 


Ala Asp Lys Val Ile Gln Lys Leu Met Lys Phe Val Gln Lys Asp Glu 
                165                 170                 175     


Arg Ser Ile Val Arg Asp Trp Ala Thr Lys Thr Tyr Ser Ser Ile Glu 
            180                 185                 190         


Ala Ala Arg Glu Ala Leu Leu Val Gly Ala Gln Val Ser Glu Asp Val 
        195                 200                 205             


Ser Val Trp Arg Gly Leu Leu Ala Glu Thr Lys Asn Ala Gln Asn Phe 
    210                 215                 220                 


Ala Ala Leu Ser Asp Asp Gln Ile Glu Ala Ala Met Ser Lys Glu Ala 
225                 230                 235                 240 


Lys Gly Ala Asp Leu Arg Pro Arg Arg Ala Ala Leu Leu Val Ala Gln 
                245                 250                 255     


Arg His Trp Val Asp Gln Thr Val Lys Ala Ile Lys Glu Ser Ala Pro 
            260                 265                 270         


Ser Gly Val Asp Lys Asp Thr Leu Asp Arg Arg Leu Arg Ala Gly Leu 
        275                 280                 285             


Arg Gly Phe His Thr Ala Ala Asn Ser Gly Lys His Thr Asn Pro Gln 
    290                 295                 300                 


Phe Pro Tyr Leu Thr Ala Glu Lys Pro Val Val Pro Met Glu Ser Val 
305                 310                 315                 320 


Val Gln Ser Val Leu Ala Phe Leu Asp Asp Pro Asp Asp Gln Arg Tyr 
                325                 330                 335     


Thr Lys Asp Lys Glu Asp Asp Lys Lys Arg His Arg Val Thr Val Leu 
            340                 345                 350         


Gln Lys Glu Leu Gly Lys Ala Arg Pro Arg Lys Arg Leu Glu Leu Gln 
        355                 360                 365             


Thr Pro Lys Trp Ala Gly Arg Pro Thr Val Lys Gly Thr Ile Ser Lys 
    370                 375                 380                 


Arg Arg Asp Ala Ala Leu Val Trp Asp Thr Ser Lys Glu Ala Asn Gly 
385                 390                 395                 400 


Leu Cys Leu Ala Leu Pro Ile Gly Gly Met Pro Lys Ile Asp Val Glu 
                405                 410                 415     


Gln Phe Ile Tyr Gln Asp Gly Thr Ser Leu Leu Ser Asp Cys Gln Ile 
            420                 425                 430         


Ala Ser Lys Thr Thr Lys Lys Gly Ala Ala Cys Ala Val Leu Pro Leu 
        435                 440                 445             


Lys Pro Lys His Asp Phe Leu Arg Trp Phe Thr Lys His Val Glu Asn 
    450                 455                 460                 


His Asn Pro Asp Ala Pro Leu Glu Arg Arg Cys Leu His Asn Thr Thr 
465                 470                 475                 480 


Gln Phe Val Ile Val Asp Pro Glu Gly Pro Arg Pro Arg Leu Phe Val 
                485                 490                 495     


Arg Pro Val Phe Lys Phe Tyr Asp Pro Gly Lys Thr Val Pro Asn Thr 
            500                 505                 510         


His Glu Thr Trp Lys Lys Pro Asp Cys Arg Tyr Leu Val Gly Ile Ala 
        515                 520                 525             


Arg Gly Ile Asn Tyr Val Leu Arg Ala Val Val Val Asp Thr Glu Glu 
    530                 535                 540                 


Lys Lys Val Ile Ala Asp Ile Gly Leu Pro Gly Arg Lys His Glu Trp 
545                 550                 555                 560 


Arg Met Ile Arg Asp Glu Ile Ala Tyr His Gln Gln Met Arg Asp Leu 
                565                 570                 575     


Ala Arg Asn Thr Gly Lys His Ala Ser Val Val Ala Lys His Val Arg 
            580                 585                 590         


Ala Leu Ala Leu Ala Arg Lys Lys Asp Arg Ala Leu Gly Lys Phe Ala 
        595                 600                 605             


Thr Val Glu Ala Val Ala Glu Leu Val Lys Lys Cys Glu Gln Asp Tyr 
    610                 615                 620                 


Gly Ser Gly Asn Tyr Cys Phe Val Leu Glu Asp Leu Asp Met Gly Ala 
625                 630                 635                 640 


Met Asn Leu Lys Arg Asn Asn Arg Val Lys His Met Ala Val Met Glu 
                645                 650                 655     


Glu Ala Leu Val Asn Gln Met Arg Lys Gln Gly Tyr Ala Tyr Asp Gly 
            660                 665                 670         


Arg Arg Gly Arg Val Asp Gly Val Arg His Glu Gly Ala Trp Tyr Thr 
        675                 680                 685             


Ser Gln Val Ser Pro Phe Gly Trp Trp Ala Lys Arg Asp Glu Val Glu 
    690                 695                 700                 


Glu Ala Trp Lys Arg Asp Lys Thr Arg Pro Ile Gly Arg Lys Val Gly 
705                 710                 715                 720 


Asn Trp Tyr Glu Met Pro Glu Pro Gly Gln Asp Gly Asp Arg Pro Asp 
                725                 730                 735     


Thr Tyr Arg Lys Gly Tyr Trp Ser Lys Pro Lys Asn Ala Glu Gly Lys 
            740                 745                 750         


Pro Tyr Gly Arg Asn Arg Phe Ser Val Glu Pro Gly Asp Glu Lys Pro 
        755                 760                 765             


Asp Ala Glu Arg Arg Phe Cys Trp Gly Ser Glu Leu Phe Trp Asp Pro 
    770                 775                 780                 


Asn Val Lys Ser Phe Lys Gly Lys Glu Phe Pro Glu Gly Val Val Leu 
785                 790                 795                 800 


Asp Ala Asp Phe Val Gly Ala Leu Asn Ile Ala Leu Arg Pro Leu Val 
                805                 810                 815     


Asn Asp Gly Gln Gly Lys Gly Phe Lys Ala Glu Asp Met Ala Arg Glu 
            820                 825                 830         


His Thr Ile Leu Asn Pro Gln Phe Lys Ile Ala Cys Gln Ile Pro Val 
        835                 840                 845             


Tyr Glu Phe Val Glu Glu Asp Gly Asp Lys Trp Ala Ala Leu Arg Arg 
    850                 855                 860                 


Ile Met Leu 
865         


<210>  155
<211>  867
<212>  PRT
<213>  Artificial

<220>
<223>  E634A cleavage inactive Cas-beta2

<400>  155

Met Gly Lys Asn Arg Ser Ser Ser Ser Asp Leu Ser Pro Leu Glu Arg 
1               5                   10                  15      


Ser Leu Arg Lys Val Gly Glu Asn Arg Leu Glu Arg Leu Arg Val Arg 
            20                  25                  30          


Glu Glu Lys Ile Arg Lys His Ile Glu Gln His Pro Arg Gly Lys Asn 
        35                  40                  45              


Asp His Gln Ala Leu His Phe Leu Leu His Gln Ile Glu Val Glu Arg 
    50                  55                  60                  


Asn Asp Leu Tyr Arg Asn Leu Lys Asp Pro Glu Tyr Val Pro Lys Pro 
65                  70                  75                  80  


Ala Lys Gln Arg Arg Glu Arg Arg Gln Ile Asn Val Ala Lys Pro Pro 
                85                  90                  95      


Thr Arg Pro Lys Lys Glu Lys Gly Pro Gln Pro Glu Ser Thr Lys Tyr 
            100                 105                 110         


Val Ile Arg Pro Pro Val Pro Gly Lys Asn Leu Pro Ala Phe Ala Ser 
        115                 120                 125             


Lys Tyr Glu Ala Arg Asp Thr Arg Asp Asp Ser Tyr Gln Asp Gly Arg 
    130                 135                 140                 


Ser Trp Thr Ser Ala Pro Tyr Val Glu Val Glu Leu Pro Ile Leu Gly 
145                 150                 155                 160 


Ala Asp Lys Val Ile Gln Lys Leu Met Lys Phe Val Gln Lys Asp Glu 
                165                 170                 175     


Arg Ser Ile Val Arg Asp Trp Ala Thr Lys Thr Tyr Ser Ser Ile Glu 
            180                 185                 190         


Ala Ala Arg Glu Ala Leu Leu Val Gly Ala Gln Val Ser Glu Asp Val 
        195                 200                 205             


Ser Val Trp Arg Gly Leu Leu Ala Glu Thr Lys Asn Ala Gln Asn Phe 
    210                 215                 220                 


Ala Ala Leu Ser Asp Asp Gln Ile Glu Ala Ala Met Ser Lys Glu Ala 
225                 230                 235                 240 


Lys Gly Ala Asp Leu Arg Pro Arg Arg Ala Ala Leu Leu Val Ala Gln 
                245                 250                 255     


Arg His Trp Val Asp Gln Thr Val Lys Ala Ile Lys Glu Ser Ala Pro 
            260                 265                 270         


Ser Gly Val Asp Lys Asp Thr Leu Asp Arg Arg Leu Arg Ala Gly Leu 
        275                 280                 285             


Arg Gly Phe His Thr Ala Ala Asn Ser Gly Lys His Thr Asn Pro Gln 
    290                 295                 300                 


Phe Pro Tyr Leu Thr Ala Glu Lys Pro Val Val Pro Met Glu Ser Val 
305                 310                 315                 320 


Val Gln Ser Val Leu Ala Phe Leu Asp Asp Pro Asp Asp Gln Arg Tyr 
                325                 330                 335     


Thr Lys Asp Lys Glu Asp Asp Lys Lys Arg His Arg Val Thr Val Leu 
            340                 345                 350         


Gln Lys Glu Leu Gly Lys Ala Arg Pro Arg Lys Arg Leu Glu Leu Gln 
        355                 360                 365             


Thr Pro Lys Trp Ala Gly Arg Pro Thr Val Lys Gly Thr Ile Ser Lys 
    370                 375                 380                 


Arg Arg Asp Ala Ala Leu Val Trp Asp Thr Ser Lys Glu Ala Asn Gly 
385                 390                 395                 400 


Leu Cys Leu Ala Leu Pro Ile Gly Gly Met Pro Lys Ile Asp Val Glu 
                405                 410                 415     


Gln Phe Ile Tyr Gln Asp Gly Thr Ser Leu Leu Ser Asp Cys Gln Ile 
            420                 425                 430         


Ala Ser Lys Thr Thr Lys Lys Gly Ala Ala Cys Ala Val Leu Pro Leu 
        435                 440                 445             


Lys Pro Lys His Asp Phe Leu Arg Trp Phe Thr Lys His Val Glu Asn 
    450                 455                 460                 


His Asn Pro Asp Ala Pro Leu Glu Arg Arg Cys Leu His Asn Thr Thr 
465                 470                 475                 480 


Gln Phe Val Ile Val Asp Pro Glu Gly Pro Arg Pro Arg Leu Phe Val 
                485                 490                 495     


Arg Pro Val Phe Lys Phe Tyr Asp Pro Gly Lys Thr Val Pro Asn Thr 
            500                 505                 510         


His Glu Thr Trp Lys Lys Pro Asp Cys Arg Tyr Leu Val Gly Ile Asp 
        515                 520                 525             


Arg Gly Ile Asn Tyr Val Leu Arg Ala Val Val Val Asp Thr Glu Glu 
    530                 535                 540                 


Lys Lys Val Ile Ala Asp Ile Gly Leu Pro Gly Arg Lys His Glu Trp 
545                 550                 555                 560 


Arg Met Ile Arg Asp Glu Ile Ala Tyr His Gln Gln Met Arg Asp Leu 
                565                 570                 575     


Ala Arg Asn Thr Gly Lys His Ala Ser Val Val Ala Lys His Val Arg 
            580                 585                 590         


Ala Leu Ala Leu Ala Arg Lys Lys Asp Arg Ala Leu Gly Lys Phe Ala 
        595                 600                 605             


Thr Val Glu Ala Val Ala Glu Leu Val Lys Lys Cys Glu Gln Asp Tyr 
    610                 615                 620                 


Gly Ser Gly Asn Tyr Cys Phe Val Leu Ala Asp Leu Asp Met Gly Ala 
625                 630                 635                 640 


Met Asn Leu Lys Arg Asn Asn Arg Val Lys His Met Ala Val Met Glu 
                645                 650                 655     


Glu Ala Leu Val Asn Gln Met Arg Lys Gln Gly Tyr Ala Tyr Asp Gly 
            660                 665                 670         


Arg Arg Gly Arg Val Asp Gly Val Arg His Glu Gly Ala Trp Tyr Thr 
        675                 680                 685             


Ser Gln Val Ser Pro Phe Gly Trp Trp Ala Lys Arg Asp Glu Val Glu 
    690                 695                 700                 


Glu Ala Trp Lys Arg Asp Lys Thr Arg Pro Ile Gly Arg Lys Val Gly 
705                 710                 715                 720 


Asn Trp Tyr Glu Met Pro Glu Pro Gly Gln Asp Gly Asp Arg Pro Asp 
                725                 730                 735     


Thr Tyr Arg Lys Gly Tyr Trp Ser Lys Pro Lys Asn Ala Glu Gly Lys 
            740                 745                 750         


Pro Tyr Gly Arg Asn Arg Phe Ser Val Glu Pro Gly Asp Glu Lys Pro 
        755                 760                 765             


Asp Ala Glu Arg Arg Phe Cys Trp Gly Ser Glu Leu Phe Trp Asp Pro 
    770                 775                 780                 


Asn Val Lys Ser Phe Lys Gly Lys Glu Phe Pro Glu Gly Val Val Leu 
785                 790                 795                 800 


Asp Ala Asp Phe Val Gly Ala Leu Asn Ile Ala Leu Arg Pro Leu Val 
                805                 810                 815     


Asn Asp Gly Gln Gly Lys Gly Phe Lys Ala Glu Asp Met Ala Arg Glu 
            820                 825                 830         


His Thr Ile Leu Asn Pro Gln Phe Lys Ile Ala Cys Gln Ile Pro Val 
        835                 840                 845             


Tyr Glu Phe Val Glu Glu Asp Gly Asp Lys Trp Ala Ala Leu Arg Arg 
    850                 855                 860                 


Ile Met Leu 
865         


