                         SEQUENCE LISTING

<110>  Biocatalysts Limited and Biomega Group AS
 
<120>  Enzymatic method

<130>  69.67.133682/02

<150>  GB 1807246.2
<151>  2018-05-02

<160>  18    

<170>  PatentIn version 3.5

<210>  1
<211>  456
<212>  PRT
<213>  Methylophaga sp. strain SK1

<400>  1

Met Ala Thr Arg Ile Ala Ile Leu Gly Ala Gly Pro Ser Gly Met Ala 
1               5                   10                  15      


Gln Leu Arg Ala Phe Gln Ser Ala Gln Glu Lys Gly Ala Glu Ile Pro 
            20                  25                  30          


Glu Leu Val Cys Phe Glu Lys Gln Ala Asp Trp Gly Gly Gln Trp Asn 
        35                  40                  45              


Tyr Thr Trp Arg Thr Gly Leu Asp Glu Asn Gly Glu Pro Val His Ser 
    50                  55                  60                  


Ser Met Tyr Arg Tyr Leu Trp Ser Asn Gly Pro Lys Glu Cys Leu Glu 
65                  70                  75                  80  


Phe Ala Asp Tyr Thr Phe Asp Glu His Phe Gly Lys Pro Ile Ala Ser 
                85                  90                  95      


Tyr Pro Pro Arg Glu Val Leu Trp Asp Tyr Ile Lys Gly Arg Val Glu 
            100                 105                 110         


Lys Ala Gly Val Arg Lys Tyr Ile Arg Phe Asn Thr Ala Val Arg His 
        115                 120                 125             


Val Glu Phe Asn Glu Asp Ser Gln Thr Phe Thr Val Thr Val Gln Asp 
    130                 135                 140                 


His Thr Thr Asp Thr Ile Tyr Ser Glu Glu Phe Asp Tyr Val Val Cys 
145                 150                 155                 160 


Cys Thr Gly His Phe Ser Thr Pro Tyr Val Pro Glu Phe Glu Gly Phe 
                165                 170                 175     


Glu Lys Phe Gly Gly Arg Ile Leu His Ala His Asp Phe Arg Asp Ala 
            180                 185                 190         


Leu Glu Phe Lys Asp Lys Thr Val Leu Leu Val Gly Ser Ser Tyr Ser 
        195                 200                 205             


Ala Glu Asp Ile Gly Ser Gln Cys Tyr Lys Tyr Gly Ala Lys Lys Leu 
    210                 215                 220                 


Ile Ser Cys Tyr Arg Thr Ala Pro Met Gly Tyr Lys Trp Pro Glu Asn 
225                 230                 235                 240 


Trp Asp Glu Arg Pro Asn Leu Val Arg Val Asp Thr Glu Asn Ala Tyr 
                245                 250                 255     


Phe Ala Asp Gly Ser Ser Glu Lys Val Asp Ala Ile Ile Leu Cys Thr 
            260                 265                 270         


Gly Tyr Ile His His Phe Pro Phe Leu Asn Asp Asp Leu Arg Leu Val 
        275                 280                 285             


Thr Asn Asn Arg Leu Trp Pro Leu Asn Leu Tyr Lys Gly Val Val Trp 
    290                 295                 300                 


Glu Asp Asn Pro Lys Phe Phe Tyr Ile Gly Met Gln Asp Gln Trp Tyr 
305                 310                 315                 320 


Ser Phe Asn Met Phe Asp Ala Gln Ala Trp Tyr Ala Arg Asp Val Ile 
                325                 330                 335     


Met Gly Arg Leu Pro Leu Pro Ser Lys Glu Glu Met Lys Ala Asp Ser 
            340                 345                 350         


Met Ala Trp Arg Glu Lys Glu Leu Thr Leu Val Thr Ala Glu Glu Met 
        355                 360                 365             


Tyr Thr Tyr Gln Gly Asp Tyr Ile Gln Asn Leu Ile Asp Met Thr Asp 
    370                 375                 380                 


Tyr Pro Ser Phe Asp Ile Pro Ala Thr Asn Lys Thr Phe Leu Glu Trp 
385                 390                 395                 400 


Lys His His Lys Lys Glu Asn Ile Met Thr Phe Arg Asp His Ser Tyr 
                405                 410                 415     


Arg Ser Leu Met Thr Gly Thr Met Ala Pro Lys His His Thr Pro Trp 
            420                 425                 430         


Ile Asp Ala Leu Asp Asp Ser Leu Glu Ala Tyr Leu Ser Asp Lys Ser 
        435                 440                 445             


Glu Ile Pro Val Ala Lys Glu Ala 
    450                 455     


<210>  2
<211>  1371
<212>  DNA
<213>  Methylophaga sp. strain SK1

<400>  2
atggcaaccc gtattgcaat tttaggtgca ggtccgagcg gtatggcaca gctgcgtgca       60

tttcagagcg cacaagaaaa aggtgcagaa attccggaac tggtgtgttt tgaaaaacag      120

gcagattggg gtggtcagtg gaattatacc tggcgtaccg gtctggatga aaatggtgaa      180

ccggtgcata gcagcatgta tcgttatctg tggtcaaatg gtccgaaaga atgtctggaa      240

tttgccgatt acacctttga tgaacatttt ggtaaaccga ttgcaagcta tccgcctcgt      300

gaagttctgt gggattatat caaaggtcgt gttgaaaaag ccggtgtgcg caaatatatc      360

cgttttaata ccgcagttcg ccacgtggaa tttaatgaag atagccagac ctttaccgtt      420

accgttcagg atcataccac cgataccatt tatagcgaag agtttgatta tgttgtttgc      480

tgcaccggtc attttagcac cccgtatgtg ccggaatttg aaggctttga aaaatttggt      540

ggtcgtattc tgcatgccca tgattttcgt gatgcactgg aattcaaaga taaaaccgtt      600

ctgctggttg gtagcagcta tagtgccgaa gatattggta gccagtgtta caaatatggt      660

gccaaaaaac tgattagctg ctatcgtacc gcaccgatgg gttataaatg gcctgaaaat      720

tgggatgaac gtccgaatct ggttcgtgtt gataccgaaa atgcatattt tgcagatggc      780

agcagcgaaa aagttgatgc aattattctg tgcaccggct atattcatca tttcccgttt      840

ctgaatgatg atctgcgtct ggttaccaat aatcgtctgt ggcctctgaa tctgtataaa      900

ggtgttgttt gggaagataa cccgaaattc ttttatatcg gtatgcagga tcagtggtac      960

agctttaata tgtttgatgc acaggcatgg tatgcccgtg atgttattat gggtcgtctg     1020

ccgctgccga gcaaagaaga aatgaaagca gatagcatgg catggcgtga aaaagaactg     1080

accctggtta cagccgaaga aatgtatacc tatcagggcg attatatcca gaacctgatt     1140

gatatgaccg attatccgag ctttgatatt ccggcaacca ataaaacatt cctggaatgg     1200

aaacatcaca aaaaagaaaa catcatgacc ttccgcgatc atagctatcg tagcctgatg     1260

accggcacca tggcaccgaa acatcatacc ccgtggattg atgccctgga tgatagcctg     1320

gaagcatatc tgagcgataa aagcgaaatc ccggttgcaa aagaagccta a              1371


<210>  3
<211>  487
<212>  PRT
<213>  Pelagibacterales bacteria

<400>  3

Met Arg Lys Ala Ala Leu Thr His Val Gly Leu Gly Ala Pro Pro Tyr 
1               5                   10                  15      


Trp Val His Leu Leu Arg Ala Thr Ala Ala Asp Asn Pro Ser Leu Trp 
            20                  25                  30          


Arg Thr Leu Leu Gly Arg Trp Phe Gly Thr Lys Leu Leu Gly Ile Lys 
        35                  40                  45              


Ala Val Ile Ala Glu Ser Phe Glu Arg Ile His Arg Ser Asn Leu Ile 
    50                  55                  60                  


Gly Met Gly Val Leu Pro Leu Gln Phe Ile Ser Asn Gln Asn Arg Leu 
65                  70                  75                  80  


Ser Leu Asn Leu Asn Gly Ser Glu Lys Ile Ser Ile Lys Asn Ile Ile 
                85                  90                  95      


Asn Ile Phe Pro Asn Met Glu Phe Asp Cys Glu Ile Ile Ser Glu Asn 
            100                 105                 110         


Glu Thr Arg Leu Ile Lys Leu Leu Cys Leu Glu Phe Ala Asp Tyr Ser 
        115                 120                 125             


Phe Asp Glu His Phe Gly Lys Pro Ile Pro Ser Phe Pro Pro Arg Glu 
    130                 135                 140                 


Val Leu Tyr Asp Tyr Ile Val Gly Arg Val Ser Lys Gly Asn Leu Lys 
145                 150                 155                 160 


His Lys Ile Lys Phe Asn Thr Arg Val Thr Asn Thr Ser Phe Lys Asn 
                165                 170                 175     


Asp Lys Phe Glu Ile Ser Tyr Gln Asp Lys Ala Asn Asn Lys Ile Leu 
            180                 185                 190         


Thr Glu Asn Phe Asp Tyr Leu Val Ile Ser Thr Gly His Phe Ser Val 
        195                 200                 205             


Pro Phe Ile Pro Glu Tyr Glu Gly Met Asn Ser Phe Pro Gly Arg Ile 
    210                 215                 220                 


Met His Ser His Asp Phe Arg Asp Ala Glu Glu Phe Arg Asp Lys Asn 
225                 230                 235                 240 


Val Ile Val Leu Gly Ser Ser Tyr Ser Ala Glu Asp Val Ala Leu Gln 
                245                 250                 255     


Cys Asn Lys Tyr Gly Ala Lys Ser Val Thr Ile Gly Phe Arg His Asn 
            260                 265                 270         


Pro Met Gly Phe Arg Trp Pro Lys Gly Met Lys Glu Val His Tyr Leu 
        275                 280                 285             


Asp Lys Leu Asp Gly Lys Asn Ala Ile Phe Lys Asp Gly Thr Lys Gln 
    290                 295                 300                 


Glu Ala Asp Val Ile Ile Leu Cys Thr Gly Tyr Leu His His Phe Pro 
305                 310                 315                 320 


Phe Leu Glu Glu Ser Ile Lys Leu Lys Thr His Asn Arg Leu Tyr Pro 
                325                 330                 335     


Pro Lys Leu Tyr Lys Gly Ile Val Trp Gln Asp Asn His Lys Val Leu 
            340                 345                 350         


Tyr Leu Gly Met Gln Asp Gln Phe His Thr Phe Asn Met Phe Asp Cys 
        355                 360                 365             


Gln Ala Trp Tyr Ala Arg Asp Val Ile Met Gly Lys Ile Lys Met Pro 
    370                 375                 380                 


Asn Asn Asp Glu Ile Glu Lys Asp Ile Asn Lys Trp Val Gly Met Glu 
385                 390                 395                 400 


Glu Lys Leu Glu Asn Pro Asp Gln Met Ile Asp Phe Gln Thr Glu Tyr 
                405                 410                 415     


Thr Lys Glu Leu His Glu Met Ser Asp Tyr Pro Lys Ile Asp Phe Glu 
            420                 425                 430         


Leu Ile Arg Lys His Phe Lys Glu Trp Glu His His Lys Val Glu Asp 
        435                 440                 445             


Ile Leu Thr Tyr Arg Asn Lys Ser Phe Ser Ser Pro Val Thr Gly Ser 
    450                 455                 460                 


Ile Ala Pro Ile His His Thr Pro Trp Glu Lys Ala Met Asp Asp Ser 
465                 470                 475                 480 


Met Lys Thr Phe Leu Asn Lys 
                485         


<210>  4
<211>  1473
<212>  DNA
<213>  Pelagibacterales bacteria

<400>  4
catatgcgta aggcggcgct gacccatgtg ggtctgggtg ctccgccgta ctgggttcac       60

ctgctgcgtg cgaccgcggc ggacaacccg agcctgtggc gtaccctgct gggtcgttgg      120

tttggcacca aactgctggg tatcaaggcg gtgattgcgg agagcttcga acgtatccac      180

cgtagcaacc tgattggtat gggcgttctg ccgctgcagt ttatcagcaa ccaaaaccgt      240

ctgagcctga acctgaacgg cagcgagaag atcagcatca agaacatcat caacatcttc      300

ccgaacatgg agttcgactg cgaaatcatt agcgagaacg aaacccgtct gatcaaactg      360

ctgtgcctgg agtttgcgga ctacagcttt gatgaacact ttggcaagcc gatcccgagc      420

ttcccgccgc gtgaagtgct gtacgattat attgtgggtc gtgttagcaa aggcaacctg      480

aagcacaaaa tcaagttcaa cacccgtgtt accaacacca gctttaaaaa cgacaagttc      540

gagatcagct accaggataa agcgaacaac aagattctga ccgaaaactt tgattatctg      600

gtgatcagca ccggtcactt tagcgttccg ttcattccgg agtatgaggg tatgaacagc      660

ttcccgggcc gtattatgca cagccacgac tttcgtgatg cggaggaatt ccgtgacaag      720

aacgtgatcg ttctgggcag cagctacagc gcggaggatg tggcgctgca gtgcaacaaa      780

tatggtgcga agagcgttac cattggcttt cgtcacaacc cgatgggttt ccgttggccg      840

aaaggcatga aagaggtgca ctacctggac aaactggatg gcaagaacgc gatcttcaaa      900

gacggcacca agcaagaagc ggatgttatc attctgtgca ccggttatct gcaccacttc      960

ccgtttctgg aggaaagcat taaactgaag acccacaacc gtctgtaccc gccgaaactg     1020

tataagggta tcgtgtggca ggacaaccac aaagttctgt acctgggcat gcaggatcaa     1080

ttccacacct ttaacatgtt cgactgccaa gcgtggtatg cgcgtgatgt gatcatgggt     1140

aaaattaaga tgccgaacaa cgacgagatc gaaaaagata ttaacaagtg ggttggcatg     1200

gaggaaaagc tggagaaccc ggaccagatg atcgatttcc aaaccgaata caccaaagag     1260

ctgcacgaaa tgagcgacta tccgaagatc gattttgagc tgattcgtaa acacttcaag     1320

gagtgggaac accacaaagt ggaagacatc ctgacctacc gtaacaagag ctttagcagc     1380

ccggttaccg gcagcattgc gccgattcac cacaccccgt gggaaaaggc gatggacgat     1440

agcatgaaaa ccttcctgaa caagtaagga tcc                                  1473


<210>  5
<211>  444
<212>  PRT
<213>  alpha-proteobacterium HIMB59

<400>  5

Met Thr Lys Val Ala Leu Ile Gly Thr Gly Pro Cys Gly Leu Ser Phe 
1               5                   10                  15      


Leu Arg Ser Leu Tyr Gln Ala Lys Lys Lys Gly Glu Asp Ile Pro Glu 
            20                  25                  30          


Val Val Ala Phe Asp Lys Gln Ser Asp Trp Gly Gly Leu Trp Asn Tyr 
        35                  40                  45              


Ser Trp Arg Thr Gly Ser Asp Glu Phe Gly Asp Pro Ile Pro Asn Ser 
    50                  55                  60                  


Met Tyr Arg Tyr Leu Trp Ser Asn Gly Pro Lys Glu Cys Leu Glu Phe 
65                  70                  75                  80  


Ala Asp Tyr Ser Phe Asp Glu His Phe Asn Lys Pro Ile Pro Ser Phe 
                85                  90                  95      


Pro Pro Arg Glu Val Leu Lys Asp Tyr Ile Ile Gly Arg Ala Glu Lys 
            100                 105                 110         


Ser Glu Leu Lys Lys Asn Val Lys Phe Asn Thr Thr Val Thr Ser Val 
        115                 120                 125             


Thr Ser Asp Gly Asp Ala Phe Asn Val Ser Tyr Lys Asp Lys Val Glu 
    130                 135                 140                 


Asp Lys Ile Ser Thr Glu Ser Phe Asp Asn Val Val Val Ala Thr Gly 
145                 150                 155                 160 


His Phe Ser Val Pro Tyr Ile Pro Glu Tyr Lys Gly Met Ser Ser Phe 
                165                 170                 175     


Pro Gly Arg Ile Met His Ser His Asp Phe Arg Asp Ala Glu Glu Phe 
            180                 185                 190         


Arg Asp Lys Asn Val Val Val Leu Gly Ser Ser Tyr Ser Ala Glu Asp 
        195                 200                 205             


Val Ala Leu Gln Cys His Lys Tyr Gly Ala Lys Ser Val Thr Ile Gly 
    210                 215                 220                 


Tyr Arg Asn Asn Pro Met Gly Phe His Trp Pro Glu Gly Met Lys Glu 
225                 230                 235                 240 


Val His Tyr Met Asp Arg Ile Glu Gly Asn Lys Ala Ile Phe Lys Asp 
                245                 250                 255     


Gly Thr Val Gln Glu Met Asp Ala Leu Ile Leu Cys Thr Gly Tyr Leu 
            260                 265                 270         


His His Phe Pro Phe Leu Ser Glu Glu Leu Lys Leu Lys Thr His Asn 
        275                 280                 285             


Arg Leu Tyr Pro Pro Lys Leu Tyr Lys Gly Val Ala Trp Gln Asp Asn 
    290                 295                 300                 


Pro Asn Leu Phe Tyr Leu Gly Met Gln Asp Gln Phe His Thr Phe Asn 
305                 310                 315                 320 


Met Phe Asp Ala Gln Ala Trp Tyr Val Arg Asp Ile Ile Met Asn Lys 
                325                 330                 335     


Ile Thr Leu Pro Ser Asn Asp Glu Met Glu Lys Asp Ile Asn Asn Trp 
            340                 345                 350         


Val Ser Lys Glu Glu Ala Leu Glu Asp Ala His Gln Met Ile Asp Phe 
        355                 360                 365             


Gln Thr Asp Tyr Cys Val Asp Leu Cys Ser Ser Ile Asp Tyr Pro Lys 
    370                 375                 380                 


Ile Asp Phe Glu Leu Ile Arg Lys Asn Phe Tyr Glu Trp Glu Asp His 
385                 390                 395                 400 


Lys Glu Glu Asn Ile Leu Thr Tyr Arg Asp Lys Ser Phe Pro Ser Pro 
                405                 410                 415     


Val Thr Gly Thr Val Gly Pro Ser His His Thr Thr Trp Leu Glu Ala 
            420                 425                 430         


Met Asp Asp Ser Met Gln Thr Tyr Leu Lys Thr Lys 
        435                 440                 


<210>  6
<211>  1344
<212>  DNA
<213>  alpha-proteobacterium HIMB59

<400>  6
catatgacca aagtggcgct gattggtacc ggcccgtgcg gtctgagctt tctgcgtagc       60

ctgtaccagg cgaagaaaaa gggcgaggac atcccggaag tggttgcgtt cgacaagcaa      120

agcgattggg gtggcctgtg gaactatagc tggcgtaccg gtagcgacga gtttggcgat      180

ccgattccga acagcatgta ccgttatctg tggagcaacg gtccgaaaga gtgcctggaa      240

tttgcggact acagctttga tgagcacttc aacaaaccga tcccgagctt cccgccgcgt      300

gaagtgctga aggattatat cattggtcgt gcggagaaaa gcgaactgaa aaagaacgtt      360

aagtttaaca ccaccgtgac cagcgttacc agcgacggcg atgcgttcaa cgtgagctac      420

aaagacaagg ttgaggataa gatcagcacc gaaagctttg acaacgtggt tgtggcgacc      480

ggtcacttca gcgttccgta cattccggag tataagggta tgagcagctt cccgggccgt      540

atcatgcaca gccacgactt tcgtgatgcg gaggaattcc gtgataagaa cgttgtggtt      600

ctgggcagca gctacagcgc ggaagacgtg gcgctgcagt gccacaaata cggtgcgaag      660

agcgttacca ttggctatcg taacaacccg atgggttttc actggccgga gggcatgaaa      720

gaagtgcact acatggatcg tattgagggt aacaaagcga tcttcaagga tggcaccgtt      780

caagaaatgg acgcgctgat cctgtgcacc ggttatctgc accacttccc gtttctgagc      840

gaggaactga aactgaagac ccacaaccgt ctgtacccgc cgaaactgta taagggtgtg      900

gcgtggcagg ataacccgaa cctgttctac ctgggcatgc aggaccaatt ccacaccttt      960

aacatgttcg atgcgcaagc gtggtacgtg cgtgacatca ttatgaacaa gattaccctg     1020

ccgagcaacg atgagatgga aaaagacatc aacaactggg ttagcaagga agaggcgctg     1080

gaggacgcgc accagatgat tgactttcaa accgattact gcgttgacct gtgcagcagc     1140

atcgattatc cgaaaattga ctttgaactg atccgtaaga acttctacga gtgggaagat     1200

cacaaagagg aaaacatcct gacctatcgt gacaagagct ttccgagccc ggtgaccggt     1260

accgttggtc cgagccacca caccacctgg ctggaagcga tggacgatag catgcagacc     1320

tacctgaaaa ccaagtaagg atcc                                            1344


<210>  7
<211>  446
<212>  PRT
<213>  Candidatus pelagibacter ubique

<400>  7

Met Thr Lys Val Ala Ile Ile Gly Ala Gly Pro Cys Gly Leu Ser Ala 
1               5                   10                  15      


Leu Arg Ser Phe Glu Gln Ala Glu Lys Lys Gly Glu Lys Ile Pro Glu 
            20                  25                  30          


Ile Val Cys Phe Asp Lys Gln Glu Asp Trp Gly Gly Leu Trp Asn Tyr 
        35                  40                  45              


Ser Trp Arg Thr Gly Ser Asp Gln Tyr Gly Asp Pro Val Pro Asn Ser 
    50                  55                  60                  


Met Tyr Arg Tyr Leu Trp Ser Asn Gly Pro Lys Glu Cys Leu Glu Phe 
65                  70                  75                  80  


Ala Asp Tyr Ser Phe Asp Glu His Phe Gly Lys Pro Ile Pro Ser Phe 
                85                  90                  95      


Pro Pro Arg Glu Val Leu Tyr Asp Tyr Ile Leu Gly Arg Ala Lys Lys 
            100                 105                 110         


Gly Asn Leu Lys Ser Lys Ile Lys Phe Asn Thr Ser Val Ser Asn Val 
        115                 120                 125             


Ser Phe Asp Gly Lys Asn Phe Glu Val Thr Tyr Arg Asp Lys Lys Asn 
    130                 135                 140                 


Asp Lys Ile Ser Lys Asp Val Phe Asp Tyr Val Ile Val Ser Thr Gly 
145                 150                 155                 160 


His Phe Ser Val Pro Phe Ile Pro Glu Tyr Pro Gly Met Lys Ala Phe 
                165                 170                 175     


Pro Gly Arg Ile Met His Ser His Asp Phe Arg Asp Ala Glu Glu Phe 
            180                 185                 190         


Arg Asp Lys Asn Val Val Val Leu Gly Ser Ser Tyr Ser Ala Glu Asp 
        195                 200                 205             


Val Ala Leu Gln Cys His Lys Tyr Gly Ala Lys Ser Val Thr Ile Ala 
    210                 215                 220                 


Tyr Arg His Asn Pro Met Gly Phe Lys Trp Pro Asp Gly Met Lys Glu 
225                 230                 235                 240 


Val Phe His Leu Asp Arg Leu Glu Gly Asn Lys Ala Ile Phe Lys Asp 
                245                 250                 255     


Gly His Ile Gln Glu Thr Asp Ala Val Ile Leu Cys Thr Gly Tyr Leu 
            260                 265                 270         


His His Phe Pro Phe Ile Ser Glu Gly Leu Lys Leu Lys Thr Gly Asn 
        275                 280                 285             


Arg Leu Tyr Pro Pro Lys Leu Tyr Lys Gly Val Val Trp Gln Asn Asn 
    290                 295                 300                 


His Lys Leu Ile Tyr Leu Gly Met Gln Asp Gln Phe His Thr Phe Asn 
305                 310                 315                 320 


Met Phe Asp Cys Gln Ala Trp Phe Ala Arg Asp Val Ile Met Gly Lys 
                325                 330                 335     


Ile Lys Val Pro Ser Asp Ser Glu Ile Glu Lys Asp Ile Asn Lys Trp 
            340                 345                 350         


Val Ser Leu Glu Glu Lys Leu Glu Asn Ala Asp Gln Met Ile Asp Phe 
        355                 360                 365             


Gln Thr Glu Tyr Thr Lys Glu Leu His Asp Leu Ser Asp Tyr Pro Lys 
    370                 375                 380                 


Ile Asp Phe Glu Leu Ile Arg Lys Thr Phe Lys Glu Trp Glu His His 
385                 390                 395                 400 


Lys Val Glu Asn Ile Met Thr Tyr Arg Asn Lys Ser Phe Ala Ser Pro 
                405                 410                 415     


Val Thr Gly Ser Ile Gly Pro Lys His His Thr Asp Trp Glu Val Ala 
            420                 425                 430         


Met Asp Asp Ser Leu Lys Thr Phe Leu Asp Gln Pro Lys Lys 
        435                 440                 445     


<210>  8
<211>  1350
<212>  DNA
<213>  Candidatus pelagibacter ubique

<400>  8
catatgacca aagttgcgat cattggtgcg ggtccgtgcg gtctgagcgc gctgcgtagc       60

tttgagcagg cggaaaagaa aggcgagaaa atcccggaaa ttgtttgctt cgacaagcaa      120

gaggattggg gtggcctgtg gaactacagc tggcgtaccg gtagcgacca gtatggcgat      180

ccggtgccga acagcatgta ccgttatctg tggagcaacg gtccgaaaga gtgcctggaa      240

tttgcggact acagctttga tgagcacttt ggcaagccga tcccgagctt cccgccgcgt      300

gaagttctgt acgactatat tctgggtcgt gcgaagaaag gcaacctgaa gagcaaaatc      360

aagttcaaca ccagcgttag caacgtgagc tttgatggca agaacttcga ggtgacctac      420

cgtgacaaga aaaacgataa aatcagcaag gacgtttttg attatgttat tgtgagcacc      480

ggccacttta gcgtgccgtt cattccggaa tacccgggta tgaaagcgtt cccgggccgt      540

atcatgcaca gccacgactt tcgtgatgcg gaggaattcc gtgacaagaa cgtggttgtg      600

ctgggtagca gctatagcgc ggaagatgtt gcgctgcaat gccacaaata cggcgcgaag      660

agcgtgacca tcgcgtatcg tcacaacccg atgggtttta aatggccgga cggcatgaaa      720

gaggtgttcc acctggatcg tctggagggt aacaaagcga tcttcaagga cggccacatt      780

caggagaccg atgcggtgat cctgtgcacc ggttacctgc accacttccc gtttattagc      840

gaaggtctga aactgaagac cggcaaccgt ctgtacccgc cgaaactgta taagggtgtt      900

gtgtggcaaa acaaccacaa actgatttat ctgggcatgc aggaccaatt ccacaccttt      960

aacatgttcg actgccaggc gtggtttgcg cgtgatgtta tcatgggtaa aattaaggtg     1020

ccgagcgaca gcgagatcga aaaagatatt aacaagtggg ttagcctgga ggaaaagctg     1080

gagaacgcgg accagatgat cgatttccaa accgagtaca ccaaagaact gcacgacctg     1140

agcgattatc cgaagatcga ctttgagctg attcgtaaaa ccttcaagga gtgggaacac     1200

cacaaagtgg aaaacattat gacctaccgt aacaagagct ttgcgagccc ggttaccggt     1260

agcattggtc cgaaacacca caccgactgg gaagtggcga tggacgatag cctgaagacc     1320

ttcctggatc aaccgaagaa ataaggatcc                                      1350


<210>  9
<211>  446
<212>  PRT
<213>  Candidatus pelagibacter ubique

<400>  9

Met Thr Lys Val Ala Ile Ile Gly Ala Gly Pro Cys Gly Leu Ser Ala 
1               5                   10                  15      


Leu Arg Ser Phe Glu Gln Ala Glu Lys Lys Gly Glu Lys Ile Pro Glu 
            20                  25                  30          


Ile Val Cys Phe Asp Lys Gln Glu Asp Trp Gly Gly Leu Trp Asn Tyr 
        35                  40                  45              


Ser Trp Arg Thr Gly Ser Asp Gln Tyr Gly Asp Pro Val Pro Asn Ser 
    50                  55                  60                  


Met Tyr Arg Tyr Leu Trp Ser Asn Gly Pro Lys Glu Cys Leu Glu Phe 
65                  70                  75                  80  


Ala Asp Tyr Ser Phe Asp Glu His Phe Gly Lys Pro Ile Pro Ser Phe 
                85                  90                  95      


Pro Pro Arg Glu Val Leu Tyr Asp Tyr Ile Leu Gly Arg Ala Lys Lys 
            100                 105                 110         


Gly Asn Leu Lys Ser Lys Ile Lys Phe Asn Thr Ser Val Thr Asn Val 
        115                 120                 125             


Ser Tyr Glu Gly Asn Ser Phe Glu Val Thr Tyr Arg Asp Lys Lys Asn 
    130                 135                 140                 


Asp Lys Ile Ser Lys Asp Ile Phe Asp Tyr Val Ile Val Ser Thr Gly 
145                 150                 155                 160 


His Phe Ser Val Pro Phe Ile Pro Glu Tyr Pro Gly Met Lys Ala Phe 
                165                 170                 175     


Pro Gly Arg Ile Met His Ser His Asp Phe Arg Asp Ala Glu Glu Phe 
            180                 185                 190         


Arg Asp Lys Asn Val Val Val Leu Gly Ser Ser Tyr Ser Ala Glu Asp 
        195                 200                 205             


Val Ala Leu Gln Cys His Lys Tyr Gly Ala Lys Ser Val Thr Ile Gly 
    210                 215                 220                 


Tyr Arg His Asn Pro Met Gly Phe Lys Trp Pro Asp Gly Met Lys Glu 
225                 230                 235                 240 


Val Phe His Leu Asp Arg Leu Glu Gly Asn Lys Ala Ile Phe Lys Asp 
                245                 250                 255     


Gly His Ile Gln Glu Thr Asp Ala Val Ile Leu Cys Thr Gly Tyr Leu 
            260                 265                 270         


His His Phe Pro Phe Ile Ser Glu Gly Leu Lys Leu Lys Thr Gly Asn 
        275                 280                 285             


Arg Leu Tyr Pro Pro Lys Leu Tyr Lys Gly Val Val Trp Gln Asn Asn 
    290                 295                 300                 


His Lys Leu Ile Tyr Leu Gly Met Gln Asp Gln Phe His Thr Phe Asn 
305                 310                 315                 320 


Met Phe Asp Cys Gln Ala Trp Phe Ala Arg Asp Ile Ile Met Gly Lys 
                325                 330                 335     


Ile Lys Val Pro Ser Asp Thr Glu Ile Glu Lys Asp Ile Asn Lys Trp 
            340                 345                 350         


Val Ser Leu Glu Glu Lys Leu Glu Asn Ala Asp Gln Met Ile Asp Phe 
        355                 360                 365             


Gln Thr Glu Tyr Thr Lys Glu Leu His Asp Leu Ser Asn Tyr Pro Lys 
    370                 375                 380                 


Ile Asp Phe Glu Leu Ile Arg Lys Thr Phe Lys Glu Trp Glu His His 
385                 390                 395                 400 


Lys Val Glu Asn Ile Met Thr Tyr Arg Asn Lys Ser Phe Ala Ser Pro 
                405                 410                 415     


Val Thr Gly Ser Ile Gly Pro Lys His His Thr Asp Trp Glu Val Ala 
            420                 425                 430         


Met Asp Asp Ser Leu Lys Thr Phe Leu Asp Gln Pro Lys Lys 
        435                 440                 445     


<210>  10
<211>  1350
<212>  DNA
<213>  Candidatus pelagibacter ubique

<400>  10
catatgacca aagttgcgat cattggtgcg ggtccgtgcg gtctgagcgc gctgcgtagc       60

tttgagcagg cggaaaagaa aggcgagaaa atcccggaaa ttgtttgctt cgacaagcaa      120

gaggattggg gtggcctgtg gaactacagc tggcgtaccg gtagcgacca gtatggcgat      180

ccggtgccga acagcatgta ccgttatctg tggagcaacg gtccgaaaga gtgcctggaa      240

tttgcggact acagctttga tgagcacttt ggcaagccga tcccgagctt cccgccgcgt      300

gaagtgctgt acgactatat tctgggtcgt gcgaagaaag gcaacctgaa gagcaaaatc      360

aagtttaaca ccagcgttac caacgtgagc tacgagggta acagcttcga agttacctat      420

cgtgacaaga aaaacgataa gatcagcaag gacatcttcg attacgttat cgtgagcacc      480

ggccacttta gcgtgccgtt cattccggag tatccgggta tgaaagcgtt cccgggccgt      540

atcatgcaca gccacgactt tcgtgatgcg gaggaattcc gtgacaagaa cgtggttgtg      600

ctgggtagca gctatagcgc ggaagatgtt gcgctgcaat gccacaaata cggtgcgaag      660

agcgtgacca tcggctatcg tcacaacccg atgggtttta aatggccgga cggcatgaaa      720

gaggtgttcc acctggatcg tctggagggt aacaaagcga tcttcaagga cggccacatt      780

caggagaccg atgcggtgat cctgtgcacc ggttacctgc accacttccc gtttattagc      840

gaaggtctga aactgaagac cggcaaccgt ctgtacccgc cgaaactgta taagggtgtt      900

gtgtggcaaa acaaccacaa actgatttat ctgggcatgc aggaccaatt ccacaccttt      960

aacatgttcg actgccaggc gtggtttgcg cgtgatatca ttatgggtaa aatcaaggtt     1020

ccgagcgaca ccgagatcga aaaagatatt aacaagtggg ttagcctgga ggaaaagctg     1080

gagaacgcgg accagatgat tgatttccaa accgagtaca ccaaagaact gcacgacctg     1140

agcaactatc cgaagatcga ttttgagctg attcgtaaaa ccttcaagga gtgggaacac     1200

cacaaagttg aaaacattat gacctaccgt aacaagagct ttgcgagccc ggttaccggt     1260

agcattggtc cgaaacacca caccgactgg gaagtggcga tggacgatag cctgaagacc     1320

ttcctggatc aaccgaagaa ataaggatcc                                      1350


<210>  11
<211>  446
<212>  PRT
<213>  Candidatus pelagibacter ubique

<400>  11

Met Thr Lys Val Ala Ile Ile Gly Ala Gly Pro Cys Gly Leu Ser Ala 
1               5                   10                  15      


Leu Arg Ser Phe Glu Gln Ala Glu Lys Lys Gly Glu Lys Ile Pro Glu 
            20                  25                  30          


Ile Val Cys Phe Asp Lys Gln Glu Asp Trp Gly Gly Leu Trp Asn Tyr 
        35                  40                  45              


Ser Trp Arg Thr Gly Ser Asp Gln Tyr Gly Asp Pro Val Pro Asn Ser 
    50                  55                  60                  


Met Tyr Arg Tyr Leu Trp Ser Asn Gly Pro Lys Glu Cys Leu Glu Phe 
65                  70                  75                  80  


Ala Asp Tyr Ser Phe Asp Glu His Phe Gly Lys Pro Ile Pro Ser Phe 
                85                  90                  95      


Pro Pro Arg Glu Val Leu Tyr Asp Tyr Ile Leu Gly Arg Val Lys Lys 
            100                 105                 110         


Gly Asn Leu Lys Ser Lys Ile Lys Phe Asn Thr Thr Val Thr Asn Val 
        115                 120                 125             


Ser Tyr Asp Asn Glu Ser Phe Glu Ile Thr Tyr Arg Asp Lys Lys Asn 
    130                 135                 140                 


Asp Lys Ile Ser Lys Asp Ile Phe Asp Tyr Val Ile Val Ser Thr Gly 
145                 150                 155                 160 


His Phe Ser Val Pro Phe Ile Pro Glu Tyr Pro Gly Met Lys Ala Phe 
                165                 170                 175     


Pro Gly Arg Ile Met His Ser His Asp Phe Arg Asp Ala Glu Glu Phe 
            180                 185                 190         


Arg Asp Lys Asn Val Val Val Leu Gly Ser Ser Tyr Ser Ala Glu Asp 
        195                 200                 205             


Val Ala Leu Gln Cys His Lys Tyr Gly Ala Lys Ser Val Thr Ile Ala 
    210                 215                 220                 


Tyr Arg His Asn Pro Met Gly Phe Lys Trp Pro Asn Gly Met Lys Glu 
225                 230                 235                 240 


Val Phe His Leu Asp Arg Leu Glu Gly Asn Lys Ala Ile Phe Lys Asp 
                245                 250                 255     


Gly His Val Gln Glu Thr Asp Ala Val Ile Leu Cys Thr Gly Tyr Leu 
            260                 265                 270         


His His Phe Pro Phe Leu Ser Glu Asp Leu Lys Leu Lys Thr Gly Asn 
        275                 280                 285             


Arg Leu Tyr Pro Pro Lys Leu Tyr Lys Gly Val Val Trp Gln Asn Asn 
    290                 295                 300                 


His Lys Leu Met Tyr Leu Gly Met Gln Asp Gln Phe His Thr Phe Asn 
305                 310                 315                 320 


Met Phe Asp Cys Gln Ala Trp Phe Ala Arg Asp Val Ile Met Gly Lys 
                325                 330                 335     


Ile Lys Thr Leu Asn Asp Ser Glu Ile Glu Lys Asp Ile Asn Lys Trp 
            340                 345                 350         


Val Ser Leu Glu Glu Lys Leu Glu Asn Ala Asp Gln Met Ile Asp Phe 
        355                 360                 365             


Gln Thr Glu Tyr Thr Lys Glu Leu His Asp Leu Ser Asn Tyr Pro Lys 
    370                 375                 380                 


Ile Asp Phe Glu Leu Ile Arg Lys Thr Phe Lys Glu Trp Glu His His 
385                 390                 395                 400 


Lys Val Glu Asn Ile Met Thr Tyr Arg Asn Lys Ser Phe Ala Ser Pro 
                405                 410                 415     


Val Thr Gly Ser Ile Gly Pro Lys His His Thr Glu Trp Glu Val Ala 
            420                 425                 430         


Met Asp Asp Ser Leu Lys Thr Phe Leu Asp Gln Pro Lys Lys 
        435                 440                 445     


<210>  12
<211>  1350
<212>  DNA
<213>  Candidatus pelagibacter ubique

<400>  12
catatgacca aagttgcgat cattggtgcg ggtccgtgcg gtctgagcgc gctgcgtagc       60

tttgagcagg cggaaaagaa aggcgagaaa atcccggaaa ttgtttgctt cgacaagcaa      120

gaggattggg gtggcctgtg gaactacagc tggcgtaccg gtagcgacca gtatggcgat      180

ccggtgccga acagcatgta ccgttatctg tggagcaacg gtccgaaaga gtgcctggaa      240

tttgcggact acagctttga tgagcacttt ggcaagccga tcccgagctt cccgccgcgt      300

gaagttctgt acgactatat tctgggtcgt gtgaagaaag gcaacctgaa gagcaaaatc      360

aagtttaaca ccaccgttac caacgtgagc tacgataacg agagcttcga aattacctat      420

cgtgacaaga aaaacgataa gatcagcaag gacatcttcg attacgttat cgtgagcacc      480

ggtcacttta gcgttccgtt cattccggag tatccgggta tgaaagcgtt cccgggccgt      540

atcatgcaca gccacgactt tcgtgatgcg gaggaattcc gtgacaagaa cgtggttgtg      600

ctgggtagca gctatagcgc ggaagatgtt gcgctgcaat gccacaaata cggcgcgaag      660

agcgtgacca ttgcgtatcg tcacaacccg atgggtttta aatggccgaa cggcatgaaa      720

gaggtgttcc acctggaccg tctggagggt aacaaagcga ttttcaagga cggccacgtt      780

caggagaccg atgcggtgat cctgtgcacc ggttacctgc accacttccc gtttctgagc      840

gaagacctga aactgaagac cggtaaccgt ctgtacccgc cgaaactgta taagggcgtt      900

gtgtggcaaa acaaccacaa actgatgtat ctgggtatgc aggatcaatt ccacaccttt      960

aacatgttcg actgccaggc gtggtttgcg cgtgatgtta tcatgggcaa aattaagacc     1020

ctgaacgaca gcgagatcga aaaagatatt aacaagtggg ttagcctgga ggaaaagctg     1080

gaaaacgcgg accagatgat cgatttccaa accgagtaca ccaaagaact gcacgacctg     1140

agcaactatc cgaagatcga ttttgagctg attcgtaaaa ccttcaagga gtgggaacac     1200

cacaaagtgg aaaacattat gacctaccgt aacaagagct ttgcgagccc ggttaccggt     1260

agcattggtc cgaaacacca caccgagtgg gaagtggcga tggacgatag cctgaagacc     1320

ttcctggacc aaccgaagaa ataaggatcc                                      1350


<210>  13
<211>  462
<212>  PRT
<213>  Candidatus pelagibacter ubique

<400>  13

Met Leu Val Lys Tyr Ile Tyr Lys Phe Ile Leu Asp Arg Asp Asn Leu 
1               5                   10                  15      


Met Thr Lys Val Ala Ile Ile Gly Ala Gly Pro Cys Gly Leu Ser Met 
            20                  25                  30          


Leu Arg Ser Phe Glu Gln Ala Glu Lys Lys Gly Glu Lys Ile Pro Gln 
        35                  40                  45              


Ile Val Cys Phe Asp Lys Gln Glu Asp Trp Gly Gly Leu Trp Asn Tyr 
    50                  55                  60                  


Ser Trp Arg Thr Gly Ser Asp Gln Tyr Gly Asp Pro Val Pro Asn Ser 
65                  70                  75                  80  


Met Tyr Arg Tyr Leu Trp Ser Asn Gly Pro Lys Glu Cys Leu Glu Phe 
                85                  90                  95      


Ala Asp Tyr Ser Phe Asp Glu His Phe Gly Asn Pro Ile Pro Ser Phe 
            100                 105                 110         


Pro Pro Arg Glu Val Leu Tyr Asp Tyr Ile Leu Gly Arg Val Lys Lys 
        115                 120                 125             


Gly Asn Leu Lys Asp Lys Ile Lys Phe Asn Asn Ser Val Thr Asn Val 
    130                 135                 140                 


Thr Tyr Asn Gly Asn Asn Phe Glu Ile Thr Ser Leu Asp Lys Lys Asn 
145                 150                 155                 160 


Asp Lys Phe Ser Lys Asp Val Phe Asp Tyr Val Val Val Ala Ser Gly 
                165                 170                 175     


His Phe Ser Val Pro Phe Ile Pro Glu Tyr Pro Gly Met Lys Ser Phe 
            180                 185                 190         


Pro Gly Arg Ile Leu His Ser His Asp Phe Arg Asp Ala Glu Glu Phe 
        195                 200                 205             


Arg Gly Lys Asn Val Ile Val Leu Gly Ser Ser Tyr Ser Ala Glu Asp 
    210                 215                 220                 


Val Ala Leu Gln Cys His Lys Tyr Gly Ala Lys Ser Val Thr Ile Gly 
225                 230                 235                 240 


Tyr Arg His Asn Pro Met Gly Phe Lys Trp Pro Asn Gly Val Lys Glu 
                245                 250                 255     


Val Phe His Leu Asp Arg Leu Glu Asp Ser Lys Ala Ile Phe Lys Asp 
            260                 265                 270         


Gly His Glu Gln Glu Ala Asp Ala Val Ile Leu Ala Thr Gly Tyr Leu 
        275                 280                 285             


His His Phe Pro Phe Leu Ser Glu Asp Leu Gln Leu Lys Thr Arg Asn 
    290                 295                 300                 


Arg Leu Tyr Pro Pro Lys Leu Tyr Lys Gly Val Val Trp Gln Asn Asn 
305                 310                 315                 320 


His Lys Leu Leu Tyr Leu Gly Met Gln Asp Gln Phe His Thr Phe Asn 
                325                 330                 335     


Met Phe Asp Cys Gln Ala Trp Phe Ala Arg Asp Val Val Met Gly Lys 
            340                 345                 350         


Ile Lys Ile Pro Asn Asn Ser Glu Ile Glu Lys Asp Ile Asn Lys Trp 
        355                 360                 365             


Val Ser Met Glu Glu Lys Leu Glu Asn Pro Asp Gln Met Ile Asp Phe 
    370                 375                 380                 


Gln Thr Glu Tyr Thr Lys Glu Leu His Asp Leu Ser Asp Tyr Pro Lys 
385                 390                 395                 400 


Ile Asp Phe Glu Leu Ile Arg Lys Asn Phe Lys Glu Trp Glu His His 
                405                 410                 415     


Lys Val Glu Asn Ile Met Thr Tyr Arg Asn Lys Ser Phe Pro Ser Pro 
            420                 425                 430         


Val Thr Gly Ser Val Ala Pro Ile His His Thr Ala Trp Glu Ala Ala 
        435                 440                 445             


Met Asp Asp Ser Ser Lys Thr Phe Leu Asp Gln Ser Lys Asn 
    450                 455                 460         


<210>  14
<211>  1398
<212>  DNA
<213>  Candidatus pelagibacter ubique

<400>  14
catatgctgg ttaagtacat ctacaagttc atcctggacc gtgataacct gatgaccaaa       60

gtggcgatca ttggtgcggg tccgtgcggt ctgagcatgc tgcgtagctt tgagcaggcg      120

gaaaagaaag gcgagaaaat cccgcagatt gtttgcttcg acaagcaaga agattggggt      180

ggcctgtgga actacagctg gcgtaccggt agcgaccaat atggcgatcc ggtgccgaac      240

agcatgtacc gttatctgtg gagcaacggt ccgaaggagt gcctggaatt tgcggactac      300

agctttgatg agcactttgg taacccgatc ccgagcttcc cgccgcgtga agttctgtac      360

gactatattc tgggtcgtgt gaagaaaggc aacctgaagg ataaaatcaa gtttaacaac      420

agcgttacca acgtgaccta taacggtaac aacttcgaga ttaccagcct ggacaagaaa      480

aacgataaat ttagcaagga cgttttcgat tacgtggttg tggcgagcgg ccactttagc      540

gtgccgttca ttccggaata tccgggtatg aaaagctttc cgggccgtat cctgcacagc      600

cacgactttc gtgatgcgga ggaattccgt ggcaagaacg ttattgtgct gggcagcagc      660

tacagcgcgg aggacgttgc gctgcagtgc cacaaatacg gtgcgaagag cgtgaccatc      720

ggctatcgtc acaacccgat gggttttaaa tggccgaacg gcgttaaaga ggtgttccac      780

ctggaccgtc tggaagatag caaagcgatt ttcaaggacg gtcacgagca agaagcggat      840

gcggttatcc tggcgaccgg ctacctgcac cacttcccgt ttctgagcga agacctgcag      900

ctgaaaaccc gtaaccgtct gtacccgccg aaactgtata agggtgttgt gtggcaaaac      960

aaccacaagc tgctgtatct gggcatgcag gatcaattcc acacctttaa catgttcgac     1020

tgccaggcgt ggtttgcgcg tgatgttgtg atgggtaaaa tcaagattcc gaacaacagc     1080

gagatcgaaa aagacattaa caagtgggtt agcatggagg aaaaactgga gaacccggac     1140

cagatgatcg atttccaaac cgagtacacc aaagaactgc acgacctgag cgattatccg     1200

aagatcgatt ttgagctgat tcgtaaaaac ttcaaggagt gggaacacca caaagtggaa     1260

aacattatga cctaccgtaa caagagcttt ccgagcccgg ttaccggtag cgtggcgccg     1320

atccaccaca ccgcgtggga agcggcgatg gacgatagca gcaaaacctt cctggaccaa     1380

agcaagaact aaggatcc                                                   1398


<210>  15
<211>  446
<212>  PRT
<213>  Candidatus pelagibacter TMED239

<400>  15

Met Thr Lys Val Ala Ile Ile Gly Ala Gly Pro Cys Gly Leu Ser Met 
1               5                   10                  15      


Leu Arg Ser Phe Glu Gln Ala Glu Lys Ser Gly Glu Lys Ile Pro Glu 
            20                  25                  30          


Ile Thr Cys Phe Glu Lys Gln Asp Asp Trp Gly Gly Leu Trp Asn Tyr 
        35                  40                  45              


Ser Trp Arg Thr Gly Ser Asp Gln Tyr Gly Asp Pro Val Pro Asn Ser 
    50                  55                  60                  


Met Tyr Arg Tyr Leu Trp Ser Asn Gly Pro Lys Glu Cys Leu Glu Phe 
65                  70                  75                  80  


Ala Asp Tyr Ser Phe Asp Glu His Phe Gly Lys Pro Ile Pro Ser Phe 
                85                  90                  95      


Pro Pro Arg Glu Val Leu Tyr Asp Tyr Ile Thr Gly Arg Val Lys Lys 
            100                 105                 110         


Gly Asn Leu Lys Asn Lys Val Lys Phe Asn Thr Ser Val Leu Ser Val 
        115                 120                 125             


Ile Phe Asn Gly Asn Asp Phe Glu Ile Thr Ser Leu Asp Lys Lys Asn 
    130                 135                 140                 


Asp Lys Phe Ser Lys Asp Ile Phe Asp Tyr Val Val Val Ser Thr Gly 
145                 150                 155                 160 


His Phe Ser Val Pro Phe Ile Pro Glu Tyr Pro Gly Met Lys Ala Phe 
                165                 170                 175     


Pro Gly Arg Ile Met His Ser His Asp Phe Arg Asp Ala Glu Glu Phe 
            180                 185                 190         


Lys Gly Lys Asn Val Ile Val Leu Gly Ser Ser Tyr Ser Ala Glu Asp 
        195                 200                 205             


Val Ala Leu Gln Cys His Lys Tyr Gly Ala Lys Ser Val Thr Ile Gly 
    210                 215                 220                 


Tyr Arg Asn Asn Pro Met Gly Phe Lys Trp Pro Lys Gly Val Lys Glu 
225                 230                 235                 240 


Val His Tyr Leu Asp Arg Leu Glu Gly Asn Lys Ala Ile Phe Lys Asp 
                245                 250                 255     


Gly His Lys Gln Glu Ala Asp Ala Ile Ile Leu Cys Ser Gly Tyr Leu 
            260                 265                 270         


His Tyr Phe Pro Phe Leu Thr Glu Asp Leu Lys Leu Lys Thr Arg Asn 
        275                 280                 285             


Arg Leu Tyr Pro Pro Lys Leu Tyr Lys Gly Val Val Trp Gln Asp Asn 
    290                 295                 300                 


His Lys Leu Leu Tyr Leu Gly Met Gln Asp Gln Phe His Thr Phe Asn 
305                 310                 315                 320 


Met Phe Asp Cys Gln Ala Trp Tyr Ala Arg Asp Val Ile Met Gly Lys 
                325                 330                 335     


Ile Lys Met Pro Asn Ser Ser Glu Ile Glu Lys Asp Ile Asn Lys Trp 
            340                 345                 350         


Val Ala Met Glu Glu Lys Leu Glu Asn Pro Asp Gln Met Ile Asp Phe 
        355                 360                 365             


Gln Thr Glu Tyr Thr Lys Glu Leu His Glu Leu Ser Asp Tyr Pro Lys 
    370                 375                 380                 


Ile Asp Phe Glu Leu Ile Arg Lys His Phe Lys Glu Trp Glu His His 
385                 390                 395                 400 


Lys Val Glu Asp Ile Met Thr Tyr Arg Asn Lys Ser Phe Ser Ser Pro 
                405                 410                 415     


Val Thr Gly Ser Val Ala Pro Ile His His Thr Pro Trp Ala Ser Ala 
            420                 425                 430         


Met Asp Asp Ser Ser Lys Thr Phe Leu Asn Gln Ser Lys Lys 
        435                 440                 445     


<210>  16
<211>  1350
<212>  DNA
<213>  Candidatus pelagibacter TMED239

<400>  16
catatgacca aagtggcgat cattggtgcg ggtccgtgcg gtctgagcat gctgcgtagc       60

tttgagcagg cggaaaagag cggcgagaaa atcccggaaa ttacctgctt cgagaagcaa      120

gacgattggg gtggcctgtg gaactacagc tggcgtaccg gtagcgacca gtatggcgat      180

ccggttccga acagcatgta ccgttatctg tggagcaacg gtccgaaaga gtgcctggaa      240

tttgcggact acagctttga tgagcacttt ggcaagccga tcccgagctt cccgccgcgt      300

gaagttctgt acgactatat taccggtcgt gtgaagaaag gcaacctgaa gaacaaggtt      360

aagtttaaca ccagcgttct gagcgtgatc tttaacggta acgatttcga gattaccagc      420

ctggacaaga aaaacgataa atttagcaag gacatcttcg attacgtggt tgtgagcacc      480

ggccacttta gcgtgccgtt catcccggaa tatccgggta tgaaagcgtt cccgggccgt      540

attatgcaca gccacgactt tcgtgatgcg gaggaattca aaggcaagaa cgttatcgtg      600

ctgggcagca gctacagcgc ggaagacgtg gcgctgcaat gccacaagta cggtgcgaaa      660

agcgttacca ttggctatcg taacaacccg atgggtttta aatggccgaa gggcgttaaa      720

gaggtgcact atctggaccg tctggagggt aacaaggcga ttttcaaaga cggccacaag      780

caggaagcgg atgcgatcat tctgtgcagc ggttacctgc actatttccc gtttctgacc      840

gaagatctga aactgaagac ccgtaaccgt ctgtacccgc cgaagctgta taaaggtgtt      900

gtgtggcaag acaaccacaa actgctgtac ctgggcatgc aggatcaatt ccacaccttt      960

aacatgttcg actgccaggc gtggtatgcg cgtgatgtga tcatgggcaa aattaagatg     1020

ccgaacagca gcgagatcga aaaagacatt aacaagtggg ttgcgatgga ggaaaagctg     1080

gagaacccgg accagatgat cgatttccaa accgaataca ccaaagagct gcacgaactg     1140

agcgactatc cgaagatcga ttttgagctg attcgtaagc acttcaaaga gtgggaacac     1200

cacaaagttg aagatatcat gacctaccgt aacaaaagct ttagcagccc ggttaccggt     1260

agcgtggcgc cgattcatca taccccgtgg gcgagcgcga tggacgatag cagcaagacc     1320

ttcctgaacc aaagcaagaa ataaggatcc                                      1350


<210>  17
<211>  429
<212>  PRT
<213>  Soonwoa buanensis

<400>  17

Met Thr Asn Phe Lys Val Gly Ile Ile Gly Ala Gly Pro Ser Gly Leu 
1               5                   10                  15      


Ala Met Leu Arg Ala Phe Glu Ser Glu Gln Lys Lys Gly Asn Pro Ile 
            20                  25                  30          


Pro Glu Ile Lys Cys Tyr Glu Lys Gln Asp Asn Trp Gly Gly Met Trp 
        35                  40                  45              


Asn Tyr Thr Trp Arg Thr Gly Val Gly Lys Tyr Gly Glu Pro Leu His 
    50                  55                  60                  


Gly Ser Met Tyr Lys Tyr Leu Trp Ser Asn Gly Pro Lys Glu Cys Leu 
65                  70                  75                  80  


Glu Phe Ser Asp Tyr Ser Phe Thr Glu His Phe Gly Gln Ser Ile Ser 
                85                  90                  95      


Ser Tyr Pro Pro Arg Glu Val Leu Phe Asp Tyr Ile Gln Gly Arg Ile 
            100                 105                 110         


Lys Lys Ser Lys Ala Arg Asp Phe Ile Lys Phe Asn Thr Val Ala Arg 
        115                 120                 125             


Trp Val Asp Tyr Leu Glu Asp Lys Lys Gln Phe Arg Val Ile Phe Asp 
    130                 135                 140                 


Asp Leu Val Asn Asn Glu Thr Phe Glu Glu Tyr Phe Asp Tyr Leu Val 
145                 150                 155                 160 


Val Gly Thr Gly His Phe Ser Thr Pro Asn Leu Pro Tyr Phe Lys Gly 
                165                 170                 175     


Ile Glu Asn Phe Pro Gly Ala Val Met His Ala His Asp Phe Arg Gly 
            180                 185                 190         


Ala Asp Gln Phe Ile Asp Lys Asn Ile Leu Leu Ile Gly Ser Ser Tyr 
        195                 200                 205             


Ser Ala Glu Asp Ile Ala Val Gln Cys Tyr Lys His Gly Ala Lys Ser 
    210                 215                 220                 


Ile Thr Ile Ser Tyr Arg Ser Asn Pro Ile Asp Thr Lys Trp Pro Lys 
225                 230                 235                 240 


Glu Ile Lys Glu Lys Pro Leu Val Thr His Phe Glu Asn Asn Thr Ala 
                245                 250                 255     


Tyr Phe Lys Asp Gly Thr Thr Glu Asp Tyr Asp Ala Val Ile Phe Cys 
            260                 265                 270         


Thr Gly Tyr Gln His Lys Leu Pro Phe Leu Pro Asp Glu Leu Arg Leu 
        275                 280                 285             


Lys Thr Arg Asn Cys Leu Tyr Pro Asp Gln Leu Tyr Lys Gly Val Ile 
    290                 295                 300                 


Phe Asn Asn Asn Glu Arg Leu Ile Tyr Leu Gly Met Gln Asp Gln Tyr 
305                 310                 315                 320 


Tyr Thr Phe Asn Met Phe Asp Ala Gln Ala Trp Phe Ala Arg Asp Tyr 
                325                 330                 335     


Met Leu Gly Arg Ile Asn Leu Pro Glu Leu Lys Leu Arg Thr Asp Asp 
            340                 345                 350         


Ile Asn Leu Trp Lys Ala Gln Glu Leu Ala Thr Glu Thr Gly Glu Asp 
        355                 360                 365             


His Val Asp Phe Gln Thr Asp Tyr Ile Lys Asp Ile Leu Ser Gln Thr 
    370                 375                 380                 


Asp Tyr Pro Phe Leu Asn Leu Asp Lys Val Ala Glu Met Phe Lys Ser 
385                 390                 395                 400 


Trp Leu Lys Asp Lys Glu Glu Asn Ile Leu Asn Tyr Arg Asp Lys Ile 
                405                 410                 415     


Tyr Thr Ser Val Val Asp Gly Thr Glu Ala Asn Leu His 
            420                 425                 


<210>  18
<211>  1299
<212>  DNA
<213>  Soonwoa buanensis

<400>  18
catatgacca acttcaaagt gggtatcatt ggtgcgggtc cgagcggtct ggcgatgctg       60

cgtgcgtttg agagcgaaca gaagaaaggt aacccgatcc cggagatcaa gtgctacgaa      120

aagcaagata actggggtgg catgtggaac tacacctggc gtaccggtgt gggcaaatat      180

ggcgagccgc tgcacggcag catgtacaag tatctgtgga gcaacggtcc gaaagagtgc      240

ctggaattca gcgactacag cttcaccgag cactttggcc agagcatcag cagctacccg      300

ccgcgtgaag ttctgtttga ctatatccaa ggccgtatta agaaaagcaa ggcgcgtgat      360

ttcatcaaat ttaacaccgt ggcgcgttgg gttgactatc tggaggataa gaaacagttc      420

cgtgtgattt ttgacgatct ggttaacaac gaaaccttcg aggaatactt tgattatctg      480

gtggttggta ccggccactt cagcaccccg aacctgccgt acttcaaggg tatcgagaac      540

tttccgggtg cggtgatgca tgcgcatgac ttccgtggtg cggaccagtt tatcgataaa      600

aacatcctgc tgattggcag cagctatagc gcggaagata ttgcggttca atgctacaag      660

cacggtgcga aaagcatcac cattagctac cgtagcaacc cgatcgacac caagtggccg      720

aaagagatta aggaaaaacc gctggtgacc cacttcgaga acaacaccgc gtactttaag      780

gatggtacca ccgaagacta tgatgcggtt atcttctgca ccggctacca gcacaagctg      840

ccgtttctgc cggacgagct gcgtctgaaa acccgtaact gcctgtaccc ggatcaactg      900

tataaaggtg ttatcttcaa caacaacgaa cgtctgattt atctgggcat gcaggaccaa      960

tactatacct tcaacatgtt tgacgcgcag gcgtggtttg cgcgtgatta catgctgggt     1020

cgtatcaacc tgccggagct gaagctgcgt accgacgata ttaacctgtg gaaagcgcaa     1080

gaactggcga ccgagaccgg cgaagaccac gtggatttcc agaccgacta tatcaaggat     1140

attctgagcc aaaccgacta cccgttcctg aacctggata aggttgcgga gatgtttaaa     1200

agctggctga aggacaaaga ggaaaacatc ctgaactacc gtgacaaaat ttataccagc     1260

gtggttgatg gtaccgaagc gaacctgcac taaggatcc                            1299


