                         SEQUENCE LISTING

<110>  SYNTHETIC GENOMICS, INC.
       IMAM, Saheed
       MOELLERING, Eric R.
       PEACH, Luke
       LAMBERT, William F.
       KWOK, Kathleen
 
<120>  RECOMBANT ALGAE HAVING HIGH LIPID PRODUCTIVITY

<130>  SGI2290-1WO

<140>
<141>

<150>  63/110,301
<151>  2020-11-05

<160>  26    

<170>  PatentIn version 3.5

<210>  1
<211>  340
<212>  PRT
<213>  Oocystis sp.


<220>
<221>  misc_feature
<223>  WD40 repeat protein, 3EUKT2027500

<400>  1

Met Ala Glu Pro Asp Ala Asn Gly Ala Ser Asp Gly Lys Arg Ala Glu 
1               5                   10                  15      


Ile Tyr Thr Tyr Glu Phe Pro Asn Leu Val Tyr Ser Met Asn Trp Thr 
            20                  25                  30          


Ser Arg Arg Asp Lys Lys Phe Arg Leu Ala Val Gly Ser Phe Ile Glu 
        35                  40                  45              


Asp Tyr Asn Asn Val Val Asn Ile Ile Ser Leu Asp Glu Glu Gln Gly 
    50                  55                  60                  


Lys Phe Val Cys Asp Pro Ser Leu Thr Phe Lys His Pro Tyr Pro Pro 
65                  70                  75                  80  


Thr Lys Val Met Phe Val Pro Asp Arg Glu Gly Thr Arg Pro Asp Leu 
                85                  90                  95      


Leu Ala Thr Thr Gly Asp Tyr Leu Arg Val Trp Lys Ile Gly Glu Asp 
            100                 105                 110         


Gly Val Thr Leu Gln Lys Leu Leu Asn Asp Asn Lys Asn Ser Glu Phe 
        115                 120                 125             


Cys Ala Pro Leu Thr Ser Phe Asp Trp Asn Glu Thr Asp Pro Lys Arg 
    130                 135                 140                 


Leu Gly Thr Ser Ser Ile Asp Thr Thr Cys Thr Ile Trp Asp Ile Glu 
145                 150                 155                 160 


Lys Gly Val Val Asp Thr Gln Leu Ile Ala His Asp Lys Glu Val Tyr 
                165                 170                 175     


Asp Ile Ala Trp Gly Gly Val Gly Val Phe Ala Ser Val Ser Ala Asp 
            180                 185                 190         


Gly Ser Val Arg Val Phe Asp Leu Arg Asp Lys Glu His Ser Thr Ile 
        195                 200                 205             


Ile Tyr Glu Thr Pro Ser Pro Glu Thr Pro Leu Leu Arg Leu Gly Trp 
    210                 215                 220                 


Asn Lys Gln Asp Pro Arg Tyr Met Ala Thr Ile Val Met Asp Ser Asn 
225                 230                 235                 240 


Arg Val Val Val Leu Asp Ile Arg Val Pro Thr Val Pro Val Ala Glu 
                245                 250                 255     


Leu Gln Arg His Gln Ala Cys Ala Asn Ala Leu Ala Trp Ala Pro His 
            260                 265                 270         


Ser Ser Cys His Ile Cys Thr Ala Gly Asp Asp Ala Gln Ala Leu Ile 
        275                 280                 285             


Trp Asp Leu Ser Ala Val Ser Lys Glu Gly Asp Ser Gly Leu Asp Pro 
    290                 295                 300                 


Ile Leu Ala Tyr Asn Ala Gly Gln Glu Val Asn Gln Leu Gln Trp Ser 
305                 310                 315                 320 


Ser Thr Gln Pro Asp Trp Val Ala Val Cys Phe Gly Asn Lys Ala Gln 
                325                 330                 335     


Ile Leu Arg Val 
            340 


<210>  2
<211>  335
<212>  PRT
<213>  Coccomyxa subellipsoidea


<220>
<221>  misc_feature
<223>  WD40 repeat protein

<400>  2

Met Asp Gly Arg Leu Asn Asp Arg Arg Ala Glu Ile Tyr Thr Tyr Asp 
1               5                   10                  15      


Ser Glu Asn Ile Val Tyr Gly Leu Ser Trp Ser Asn Arg Arg Asp Lys 
            20                  25                  30          


Lys Phe Arg Leu Ala Val Gly Ser Phe Ile Glu Glu Tyr Asp Asn Tyr 
        35                  40                  45              


Val Glu Ile Ile Thr Leu Asp Asp Ala Thr Cys Lys Phe Thr Ser Asp 
    50                  55                  60                  


Ala Gln Leu Ala Phe Gln His Pro Tyr Pro Pro Thr Lys Ile Met Phe 
65                  70                  75                  80  


Met Pro Asp Lys Glu Gly Ala Gln Pro Asp Leu Leu Ala Thr Thr Gly 
                85                  90                  95      


Asp Tyr Leu Arg Ile Trp Gln Leu Lys Glu Asp Gly Thr Gln Leu Val 
            100                 105                 110         


Lys Leu Leu Asn Asn Asn Lys Asn Ser Glu Phe Cys Ala Pro Leu Thr 
        115                 120                 125             


Ser Phe Asp Trp Asn Glu Thr Asp Leu Asn Arg Leu Gly Thr Ser Ser 
    130                 135                 140                 


Ile Asp Thr Thr Cys Thr Ile Trp Asp Ile Glu Lys Gly Val Val Asp 
145                 150                 155                 160 


Thr Gln Leu Ile Ala His Asp Lys Glu Val Tyr Asp Ile Ala Trp Gly 
                165                 170                 175     


Gly Val Gly Val Phe Ala Ser Val Ser Ala Asp Gly Ser Val Arg Val 
            180                 185                 190         


Phe Asp Leu Arg Asp Lys Glu His Ser Thr Ile Ile Tyr Asp Ser Pro 
        195                 200                 205             


Gln Pro Asp Thr Pro Leu Leu Arg Leu Gly Trp Asn Lys Gln Asp Pro 
    210                 215                 220                 


Arg Tyr Met Ala Thr Val Leu Met Asp Ser Thr Lys Val Val Ile Leu 
225                 230                 235                 240 


Asp Ile Arg Tyr Pro Thr Leu Pro Val Ala Glu Leu Gln Arg His Gln 
                245                 250                 255     


Ala Pro Val Asn Ala Val Ala Trp Ala Pro His Ser Ser Cys His Ile 
            260                 265                 270         


Cys Ser Ala Gly Asp Asp Ala Gln Ala Leu Ile Trp Asp Leu Ser Ser 
        275                 280                 285             


Met Ser Arg Pro Met Asp Gln Thr Leu Asp Pro Ile Leu Ala Tyr Ser 
    290                 295                 300                 


Ala Gly Ala Glu Val Asn Gln Leu Gln Trp Ser Thr Thr Gln Pro Asp 
305                 310                 315                 320 


Trp Val Ala Ile Cys Phe Ala Asn Lys Thr Gln Ile Leu Arg Val 
                325                 330                 335 


<210>  3
<211>  359
<212>  PRT
<213>  Chlamydomonas reinhardtii


<220>
<221>  misc_feature
<223>  WD40 repeat protein

<400>  3

Met Ser Ala Ser Asp Lys Arg Glu Arg Gln Glu Val Tyr Thr Tyr Val 
1               5                   10                  15      


Ala Pro Asp Pro Val Tyr Ala Met Asn Trp Ser Val Arg Arg Asp Lys 
            20                  25                  30          


Arg Phe Arg Leu Gly Val Ala Ser Phe Arg Glu Asp Val Thr Asn Tyr 
        35                  40                  45              


Val Asp Ile Val Ser Leu Asp Asp Glu Ser Asp Glu Leu Arg Ala Asp 
    50                  55                  60                  


Pro Gly Leu Arg Phe Pro His Asp Tyr Pro Ala Thr Lys Leu Met Trp 
65                  70                  75                  80  


Met Pro Asp Arg Glu Gly Cys Arg Pro Asp Leu Leu Ala Thr Thr Gly 
                85                  90                  95      


Glu Ala Leu Arg Ile Trp Arg Val Cys Asp Gly Ser Glu Gly Glu Glu 
            100                 105                 110         


Ser Gly Ser Gly Pro Gly Gly Arg Gly Val Gln Leu Arg Ser Leu Leu 
        115                 120                 125             


Asn Asn Asn Lys Gln Ser Glu Phe Ser Ala Pro Leu Thr Ser Phe Asp 
    130                 135                 140                 


Trp Asn Glu Ala Asp Pro Lys Arg Leu Gly Thr Ser Ser Ile Asp Thr 
145                 150                 155                 160 


Thr Cys Thr Ile Trp Asp Ile Glu Lys Gly Glu Val Asp Thr Gln Leu 
                165                 170                 175     


Ile Ala His Asp Arg Glu Val Tyr Asp Ile Ala Trp Gly Gly Leu Gly 
            180                 185                 190         


Val Phe Ala Thr Val Ser Ala Asp Gly Ser Val Arg Val Phe Asp Leu 
        195                 200                 205             


Arg Asp Lys Glu His Ser Thr Ile Ile Tyr Glu Ser Pro Gln Pro Asp 
    210                 215                 220                 


Thr Pro Leu Leu Arg Leu Gly Trp Asn Arg Gln Asp Pro Arg Tyr Met 
225                 230                 235                 240 


Ala Thr Ile Leu Gln Asp Ser Pro Lys Val Val Ile Leu Asp Ile Arg 
                245                 250                 255     


Tyr Pro Thr Leu Pro Val Ala Glu Leu Cys Arg His Gln Ala Pro Val 
            260                 265                 270         


Asn Ala Leu Ala Trp Ala Pro His Ser Ala Gln His Ile Cys Thr Ala 
        275                 280                 285             


Gly Asp Asp Ser Gln Ala Leu Ile Trp Asp Val Ser Ala Val Gly Gly 
    290                 295                 300                 


Gly Asn Asn Ala Asn Ala Ala Ala Gly Gly Gly Ala Ser Asp Val Ser 
305                 310                 315                 320 


Leu Asp Pro Ile Leu Ala Tyr Gly Ala Ala Ser Glu Val Asn Gln Leu 
                325                 330                 335     


Gln Trp Ser Ser Ala Gln Pro Asp Trp Val Ala Ile Cys Phe Gly Asn 
            340                 345                 350         


Lys Thr Gln Ile Leu Arg Val 
        355                 


<210>  4
<211>  351
<212>  PRT
<213>  Volvox carteri


<220>
<221>  misc_feature
<223>  WD40 repeat protein

<400>  4

Met Ser Asn Ser Asp Lys Arg Ala Glu Ile Tyr Thr Tyr Val Ala Gln 
1               5                   10                  15      


Asp Pro Val Tyr Ala Met Asn Trp Ser Val Arg Arg Asp Arg Arg Phe 
            20                  25                  30          


Arg Leu Ala Val Gly Ser Phe Arg Glu Asp Val Thr Asn Tyr Val Glu 
        35                  40                  45              


Ile Ile Ser Leu Asp Asp Asp Ala Ala Glu Leu Arg Ser Asp Pro Ser 
    50                  55                  60                  


Leu Arg Phe His His Asp Tyr Pro Ala Thr Lys Leu Met Trp Leu Pro 
65                  70                  75                  80  


Asp Arg Glu Gly Cys Arg Pro Asp Leu Leu Ala Thr Thr Gly Glu Ala 
                85                  90                  95      


Leu Arg Ile Trp Arg Val Leu Asp Pro Asp Ser Val Ala Gly Asp Gly 
            100                 105                 110         


Glu Asp Leu Arg Ala Leu Leu Asn Asn Asn Lys Gln Ser Glu Phe Ser 
        115                 120                 125             


Ala Pro Leu Thr Ser Phe Asp Trp Asn Glu Ala Asp Pro Lys Arg Leu 
    130                 135                 140                 


Gly Thr Ser Ser Ile Asp Thr Thr Cys Thr Ile Trp Asp Ile Glu Lys 
145                 150                 155                 160 


Gly Glu Val Asp Thr Gln Leu Ile Ala His Asp Arg Glu Val Tyr Asp 
                165                 170                 175     


Ile Ala Trp Gly Gly Leu Gly Val Phe Ala Thr Val Ser Ala Asp Gly 
            180                 185                 190         


Ser Val Arg Val Phe Asp Leu Arg Asp Lys Glu His Ser Thr Ile Ile 
        195                 200                 205             


Tyr Glu Ser Pro Gln Pro Asp Thr Pro Leu Leu Arg Leu Gly Trp Asn 
    210                 215                 220                 


Arg Gln Asp Pro Arg Tyr Met Ala Thr Ile Leu Met Asp Ser Pro Lys 
225                 230                 235                 240 


Val Val Ile Leu Asp Ile Arg Tyr Pro Thr Leu Pro Val Ala Glu Leu 
                245                 250                 255     


His Arg His Gln Ala Pro Val Asn Ala Leu Ala Trp Ala Pro His Ser 
            260                 265                 270         


Ala Gln His Ile Cys Thr Ala Gly Asp Asp Ser Gln Ala Leu Ile Trp 
        275                 280                 285             


Asp Val Ser Ala Val Gly Ser Gly Gly Gly Gln Pro Gly Ala Leu Gly 
    290                 295                 300                 


Gly Gly Thr Ala Gly Asp Val Ser Leu Asp Pro Ile Leu Ala Tyr Gly 
305                 310                 315                 320 


Ala Gln Ser Glu Val Asn Gln Leu Gln Trp Ser Ser Ala Gln Pro Asp 
                325                 330                 335     


Trp Val Ala Ile Cys Phe Ala Asn Lys Thr Gln Ile Leu Arg Val 
            340                 345                 350     


<210>  5
<211>  341
<212>  PRT
<213>  Auxenochlorella protothecoides


<220>
<221>  misc_feature
<223>  WD40 repeat protein

<400>  5

Met Ser Gly Pro Ser Gly Asp Lys Arg Ala Glu Ile Tyr Thr His Glu 
1               5                   10                  15      


Ser Ala Asp Pro Ile Tyr Ala Leu Asn Trp Ser Val Arg Thr Asp Lys 
            20                  25                  30          


Pro Phe Arg Leu Ala Thr Gly Ser Tyr Val Glu Asp Gln Asn Asn His 
        35                  40                  45              


Ile Asp Ile Ile Val Leu Asp Glu Ala Arg Glu Gln Phe Gln Ala Asp 
    50                  55                  60                  


Pro Arg Leu Ser Phe Val His Pro Phe Pro Ala Thr Lys Leu Met Phe 
65                  70                  75                  80  


Leu Pro Val Lys Asp Pro Asn Gln Pro Asp Leu Leu Ala Thr Thr Ser 
                85                  90                  95      


Asp Phe Leu Arg Ile Trp Ser Ile Ser Glu Asp Gly Val Ala Leu Glu 
            100                 105                 110         


Lys Leu Leu Asn Asn Thr Lys Thr Ser Glu Tyr Cys Glu Pro Ile Thr 
        115                 120                 125             


Ser Phe Asp Trp Asn His Leu Glu Pro Arg Arg Leu Gly Thr Ala Ser 
    130                 135                 140                 


Leu Asp Ala Thr Cys Thr Val Trp Asp Ile Glu Arg Gly Cys Val Asp 
145                 150                 155                 160 


Thr Gln Leu Ile Ala His Asp Gly Glu Val Tyr Asp Leu Ala Trp Gly 
                165                 170                 175     


Gly Ala Thr Met Phe Ala Ser Val Ser Ala Asp Ala Ser Val Arg Val 
            180                 185                 190         


Phe Asp Leu Arg Asp Arg Asp His Ser Thr Ile Thr Tyr Glu Ser Arg 
        195                 200                 205             


Gly Gly Glu Ala Leu Val Arg Leu Gly Trp Asn Arg Ala Asp Pro Arg 
    210                 215                 220                 


Phe Met Ala Val Leu Ala Ala Gly Ser Ala Glu Val Val Val Leu Asp 
225                 230                 235                 240 


Val Arg Arg Pro Ala Ala Pro Leu Ala Arg Leu Ala Arg His Thr Ala 
                245                 250                 255     


Pro Ala Asn Val Leu Ala Trp Ala Pro His Ser Ala Cys His Leu Cys 
            260                 265                 270         


Ser Ala Gly Asp Asp Gly Ala Ala Leu Ile Trp Asp Val Gly Ala Leu 
        275                 280                 285             


Gly Gly Gly Gly Gly Pro Gly Gly Ala Ala Gln Asp Pro Gly Leu Asp 
    290                 295                 300                 


Pro Ile Leu Ala Tyr Asn Ala Gly Ala Glu Val Ala Ala Leu Gln Trp 
305                 310                 315                 320 


Ser Ala Ala Gln Pro Asp Trp Val Ala Ile Ala Phe Gly Asn Asn Ala 
                325                 330                 335     


Gln Val Leu Arg Val 
            340     


<210>  6
<211>  351
<212>  PRT
<213>  Chlorella sorokiniana


<220>
<221>  misc_feature
<223>  WD40 repeat protein

<400>  6

Met Gln Gln Gln Gly Glu Gly Arg Ala Glu Ile Tyr Thr Tyr Glu Ser 
1               5                   10                  15      


Pro His Leu Val Tyr Gly Ala Gly Trp Ser Val Arg Pro Asp Lys Pro 
            20                  25                  30          


Phe Arg Leu Ala Leu Gly Ser Phe Ile Glu Asp Tyr Ala Asn Arg Val 
        35                  40                  45              


Glu Ile Val Gln Leu Asp Glu Gly Arg Gly Val Ile Arg Ser Asn Pro 
    50                  55                  60                  


Ala Leu Gly Phe Gln His Pro Tyr Pro Pro Thr Lys Val Gly Phe Ile 
65                  70                  75                  80  


Pro Asp Lys Asp Gly Thr Arg Pro Asp Leu Leu Ala Thr Ser Gly Asp 
                85                  90                  95      


Phe Leu Arg Leu Trp Arg Ile His Asp Glu Pro Gly Ser Asn Gln His 
            100                 105                 110         


Val Arg Leu Glu Lys Leu Leu Asn Asn Asn Lys Gly Gly Glu Phe Ser 
        115                 120                 125             


Ala Pro Leu Thr Ser Phe Asp Trp Asn Glu Leu Asp Pro Arg Arg Ile 
    130                 135                 140                 


Gly Thr Ala Ser Ile Asp Thr Thr Cys Thr Val Trp Asp Val Glu Arg 
145                 150                 155                 160 


Gly Val Val Asp Thr Gln Leu Ile Ala His Asp Lys Glu Val Tyr Asp 
                165                 170                 175     


Ile Ala Trp Gly Gly Val Gly Ile Phe Ala Ser Val Ser Ala Asp Gly 
            180                 185                 190         


Ser Val Arg Val Phe Asp Leu Arg Asp Lys Glu His Ser Thr Ile Ile 
        195                 200                 205             


Tyr Glu Ser Pro Gln Pro Ser Thr Pro Leu Leu Arg Leu Ser Trp Asn 
    210                 215                 220                 


Lys Gln Asp Pro Arg Tyr Ile Ala Ala Phe Ala Met Asp Ser Ser Lys 
225                 230                 235                 240 


Val Leu Val Leu Asp Ile Arg Tyr Pro Thr Leu Pro Val Ala Gln Leu 
                245                 250                 255     


Gln Arg His Gln Ala Ser Val Asn Ala Val Cys Trp Ala Pro His Ser 
            260                 265                 270         


Ala Val His Leu Cys Ser Ala Gly Asp Asp Cys Gln Ala Leu Ile Trp 
        275                 280                 285             


Asp Leu Ala Leu Ser Gly Ala Met Gly Gly Gln Gln Gln Asp Gly Thr 
    290                 295                 300                 


Ala Ala Ala Ala Ala Ala Gly Gly Leu Asp Pro Ile Leu Ala Tyr Asn 
305                 310                 315                 320 


Ala Gly Thr Glu Ile Asn Gln Leu Gln Trp Ser Ala Ser Gln Pro Asp 
                325                 330                 335     


Trp Val Ala Ile Cys Phe Gly Asn Lys Ala Gln Ile Leu Arg Val 
            340                 345                 350     


<210>  7
<211>  355
<212>  PRT
<213>  Chlorella variabilis


<220>
<221>  misc_feature
<223>  WD40 repeat protein

<400>  7

Met Gln Asp Gln Gln Gln Gln Gly Glu Gly Arg Ala Glu Ile Tyr Thr 
1               5                   10                  15      


Tyr Ser Ser Ser Ala Ser Val Tyr Ala Cys Gly Phe Ser Ser Arg Pro 
            20                  25                  30          


Asp Lys Pro Phe Arg Leu Ala Val Gly Ser Phe Ile Asp Asp Tyr Ala 
        35                  40                  45              


Asn Lys Val Glu Ile Ile Gln Leu Asp Glu Ala Ala Gly Val Val Arg 
    50                  55                  60                  


Asn Asn Pro Ala Leu Thr Phe Gln His Pro Tyr Pro Pro Thr Lys Val 
65                  70                  75                  80  


Ala Phe Ile Pro Asp Lys Ser Gly Thr Arg Pro Asp Leu Leu Ala Thr 
                85                  90                  95      


Ser Gly Asp Phe Leu Arg Leu Trp Arg Val Ser Asp Glu Pro Gly Ala 
            100                 105                 110         


Gln Gln Gly Val Arg Leu Glu Lys Leu Leu Asn Asn Asn Lys Gly Gly 
        115                 120                 125             


Asp Phe Ala Ala Pro Leu Thr Ser Phe Asp Trp Asn Glu Leu Asp Pro 
    130                 135                 140                 


Arg Arg Val Gly Thr Ala Ser Ile Asp Thr Thr Cys Thr Val Trp Asp 
145                 150                 155                 160 


Val Glu Arg Gly Val Val Asp Thr Gln Leu Ile Ala His Asp Lys Glu 
                165                 170                 175     


Val Tyr Asp Ile Ala Trp Gly Gly Val Gly Ile Phe Ala Ser Val Ser 
            180                 185                 190         


Ala Asp Gly Ser Val Arg Val Phe Asp Leu Arg Asp Lys Glu His Ser 
        195                 200                 205             


Thr Ile Ile Tyr Glu Ser Pro Gln Pro Asp Thr Pro Leu Leu Arg Leu 
    210                 215                 220                 


Ser Trp Asn Lys Gln Asp Pro Arg Tyr Ile Ala Val Leu Ala Met Asp 
225                 230                 235                 240 


Ser Pro Arg Val Thr Val Leu Asp Ile Arg Tyr Pro Thr Leu Pro Val 
                245                 250                 255     


Ala Glu Leu Gln Arg His Gln Ala Gly Val Asn Ala Ile Cys Trp Ala 
            260                 265                 270         


Pro His Ser Ala Thr His Leu Cys Ser Ala Gly Asp Asp Ser Gln Ala 
        275                 280                 285             


Leu Ile Trp Asp Leu Gly Leu Leu Gly Thr Leu Gly Gln Gln Pro Glu 
    290                 295                 300                 


Gly Gly Pro Pro Gly Ala Ala Ala Ala Gly Gly Gly Leu Asp Pro Ile 
305                 310                 315                 320 


Leu Ala Tyr Asn Ala Gly Ala Glu Val Asn Gln Leu Gln Trp Ser Pro 
                325                 330                 335     


Ala Gln Pro Asp Trp Val Ala Ile Cys Phe Gly Asn Lys Thr Gln Leu 
            340                 345                 350         


Leu Arg Val 
        355 


<210>  8
<211>  334
<212>  PRT
<213>  Parachlorella sp.


<220>
<221>  misc_feature
<223>  WD40 repeat protein

<400>  8

Met Gln Arg Ala Glu Ile His Thr Tyr Glu Ser Pro Thr Leu Val Tyr 
1               5                   10                  15      


Ala Leu Asn Trp Ser Val Arg Pro Asp Lys Pro Phe Arg Leu Ala Ile 
            20                  25                  30          


Gly Ser Tyr Ile Glu Asp Tyr Asn Asn Arg Val Glu Ile Val Thr Leu 
        35                  40                  45              


Gly Glu Asp Gly Asn Gly Met Arg Pro Ser Pro Arg His Thr Phe Gln 
    50                  55                  60                  


His Pro Tyr Pro Pro Thr Lys Leu Gln Phe Val Pro Asp Pro Asp Gly 
65                  70                  75                  80  


Ser Arg Pro Asp Leu Leu Ala Ser Ser Gly Asp Phe Leu Arg Leu Trp 
                85                  90                  95      


Arg Ile Thr Glu Asp Gly Val Ser Leu Glu Lys Leu Leu Asn Asn Asn 
            100                 105                 110         


Lys Ala Ser Glu Phe Cys Ala Pro Leu Thr Ser Phe Asp Trp Asn Glu 
        115                 120                 125             


Asn Asp Pro Lys Arg Val Gly Thr Ala Ser Ile Asp Thr Thr Cys Thr 
    130                 135                 140                 


Val Trp Asp Ile Glu Lys Gly Val Val Asp Thr Gln Leu Ile Ala His 
145                 150                 155                 160 


Asp Lys Glu Val Tyr Asp Ile Ala Trp Gly Gly Val Gly Val Phe Ala 
                165                 170                 175     


Ser Val Ser Ala Asp Gly Ser Val Arg Val Phe Asp Leu Arg Asp Lys 
            180                 185                 190         


Glu His Ser Thr Ile Ile Tyr Glu Ser Pro Gln Pro Asp Thr Pro Leu 
        195                 200                 205             


Leu Arg Leu Ala Trp Asn Lys Gln Asp Pro Arg Tyr Met Ala Thr Thr 
    210                 215                 220                 


Ala Leu Asn Ser Ser Ala Ile Val Val Leu Asp Ile Arg Phe Pro Thr 
225                 230                 235                 240 


Val Pro Val Val Glu Leu Ser Lys His Gln Ala Ala Cys Asn Ala Val 
                245                 250                 255     


Ala Trp Ala Pro Gln Ser Ala Asn His Ile Cys Ser Ala Gly Asp Asp 
            260                 265                 270         


Cys Gln Ala Leu Ile Trp Asp Leu Ser Thr Leu Gly Glu Gly Gly Ala 
        275                 280                 285             


Gly Gln Ala Gly Ser Pro Pro Leu Asp Pro Ile Leu Ser Tyr Met Ala 
    290                 295                 300                 


Gly Ala Glu Val Asn Gln Leu Gln Trp Ser Ala Ser His Pro Asp Trp 
305                 310                 315                 320 


Val Ala Ile Cys Phe Gly Asn Lys Thr Gln Ile Leu Arg Val 
                325                 330                 


<210>  9
<211>  337
<212>  PRT
<213>  Picochlorum celeri


<220>
<221>  misc_feature
<223>  WD40 repeat protein

<400>  9

Met Glu His Pro Pro Ala Pro Asn Ile Leu Thr Tyr Asp Ser Ser Ser 
1               5                   10                  15      


Ile Val Phe Ala Leu Asp Trp Ser Ser Arg Gln Asp Lys Gly Val Arg 
            20                  25                  30          


Val Ala Val Gly Ser Phe Val Glu Gly Val Ser Asn Thr Val Glu Ile 
        35                  40                  45              


Leu Arg Val Thr Pro Ala Gly Leu Ile Val Asp Asp Lys Glu Thr Phe 
    50                  55                  60                  


Gly Ile Glu Tyr Pro Ala Thr Gln Val Gly Phe Ile Pro Asp Arg Phe 
65                  70                  75                  80  


Cys Asn Lys Pro Asp Leu Leu Ala Thr Ser Gly Asp Ala Val Arg Leu 
                85                  90                  95      


Trp Lys Ile Ser Asp Ala Gly Thr Thr Leu Glu Leu Val Leu Asn Asp 
            100                 105                 110         


Pro Lys Asn Thr Ser Lys Asn Phe Ser Ala Val Thr Cys Phe Asp Trp 
        115                 120                 125             


Ser Glu Ile Asn Val Lys Val Leu Ala Ala Gly Ser Ser Ala Gly Arg 
    130                 135                 140                 


Leu Leu Leu Trp Asp Thr Glu Ser Gly Arg Leu Gln Gly Thr Met Val 
145                 150                 155                 160 


Gly His Glu Asp Glu Ile Leu Asp Cys Gln Trp Ala Ala Asn Asp Val 
                165                 170                 175     


Ile Val Ser Ser Ser Gly Asp Gly Ser Ile Arg Met Tyr Asp Leu Arg 
            180                 185                 190         


Asp Lys Asp His Cys Thr Val Leu Tyr Glu Thr Pro Arg Arg Thr Pro 
        195                 200                 205             


Val Pro Arg Phe Cys Trp Asn Lys Leu Asp Pro Arg His Leu Ala Phe 
    210                 215                 220                 


Ser Ile Glu Lys Ser Arg Leu Val Ser Val Leu Asp Val Arg Phe Pro 
225                 230                 235                 240 


Thr Glu Pro Val Ile Leu Leu Asp Gly His Met Gly Asn Cys Thr Ala 
                245                 250                 255     


Leu Gly Trp Ser Pro His Arg Glu Glu Tyr Leu Cys Ser Val Gly Asp 
            260                 265                 270         


Asp Cys His Ala Leu Ile Trp Asp Val Gly Lys Val Asn Ser Glu Glu 
        275                 280                 285             


Asp Ser Lys Pro Asn Arg Glu Ala Val Asp Ala Ser Pro Ile Leu Ala 
    290                 295                 300                 


Tyr Asn Ala Gln Ala Glu Ile Asn Ala Met Ala Trp Asn Pro Ile Asp 
305                 310                 315                 320 


Pro Asp Trp Ile Ala Ile Cys Ala Arg Asn Arg Thr Gln Val Leu Arg 
                325                 330                 335     


Ile 
    


<210>  10
<211>  412
<212>  PRT
<213>  Tetraselmis sp.


<220>
<221>  misc_feature
<223>  WD40 repeat protein

<400>  10

Met Pro Thr Ser Asp Ala Thr Gln Ala His Glu His His His Thr Leu 
1               5                   10                  15      


Ala Ala Thr Pro Thr Gln Gln Ala Asn Asn Ala Ala Pro Leu Ala Asp 
            20                  25                  30          


Arg Phe Thr Leu Gly Leu Leu Ala Met Ala Ser Gly Pro Glu Asp Arg 
        35                  40                  45              


Gly Ala Gly Ala Ala Gly Ala Ala Pro His Gln Arg Gly Asp Ser Asn 
    50                  55                  60                  


Gly Lys Ala Val Thr Asp Lys Arg Gly Glu Ile Tyr Thr Tyr Glu Ala 
65                  70                  75                  80  


Pro Tyr Pro Val Tyr Gly Met Asn Trp Ser Val Leu Leu Gln Val Arg 
                85                  90                  95      


Glu Asp Met Lys Phe Arg Leu Ala Val Gly Ser Phe Val Glu Asp Val 
            100                 105                 110         


Glu Asn Ala Val Glu Leu Ile Arg Leu Asn Glu Glu Thr Gly Lys Phe 
        115                 120                 125             


Glu Ser Asn Pro Ala His Lys Phe Val His Pro Tyr Pro Pro Thr Lys 
    130                 135                 140                 


Ile Met Phe Ile Pro Asp Arg Asp Cys Ser Arg Pro Asp Leu Leu Ala 
145                 150                 155                 160 


Thr Thr Gly Asp Tyr Leu Arg Leu Trp Arg Val Glu Glu Asp Gly Val 
                165                 170                 175     


Thr Leu His Lys Leu Leu Thr Asn Asn Lys Asn Ser Glu Phe Cys Ala 
            180                 185                 190         


Pro Leu Thr Ser Phe Asp Trp Asn Glu Ala Asp Pro Arg Gln Leu Gly 
        195                 200                 205             


Thr Ser Ser Ile Asp Thr Thr Cys Thr Ile Trp Asp Ile Glu Arg Gly 
    210                 215                 220                 


Val Val Asp Thr Gln Leu Ile Ala His Asp Lys Glu Val Tyr Asp Ile 
225                 230                 235                 240 


Ala Trp Gly Gly Gln Gly Val Phe Ala Ser Val Ser Ala Asp Gly Ser 
                245                 250                 255     


Val Arg Val Phe Asp Leu Arg Asp Lys Asp His Ser Thr Ile Ile Tyr 
            260                 265                 270         


Glu Ser Gly Met Pro Glu Ile Pro Leu Leu Arg Leu Gly Trp Asn Lys 
        275                 280                 285             


Gln Asp Pro Arg Tyr Met Ala Thr Ile Leu Met Asp Ser Ser Lys Val 
    290                 295                 300                 


Val Val Leu Asp Ile Arg Tyr Pro Thr Met Pro Val Ala Glu Leu Glu 
305                 310                 315                 320 


Ala His His Lys Pro Val Asn Ala Leu Ala Trp Ala Pro Gln Ser Ser 
                325                 330                 335     


Ser His Ile Cys Thr Ala Gly Asp Asp Ala Gln Ala Leu Ile Trp Asn 
            340                 345                 350         


Leu Ala Pro Met Gly Thr Gln Gly Pro Met Gly Gly Ala Ala Pro Ala 
        355                 360                 365             


Val Leu Gly Ala Asp Leu Asp Pro Ile Leu Ala Tyr Asn Ala Gly Glu 
    370                 375                 380                 


Glu Ile Asn Gln Leu Gln Trp Ser Ser Thr Gln Ser Asp Trp Val Gly 
385                 390                 395                 400 


Ile Ser Phe Gly Asn Lys Ile Gln Ile Leu Arg Ile 
                405                 410         


<210>  11
<211>  332
<212>  PRT
<213>  Ostreococcus lucimarinus


<220>
<221>  misc_feature
<223>  WD40 repeat protein

<400>  11

Met Asn Ala Glu Lys Arg Ala Glu Ile Tyr Thr Tyr Glu Ala Pro Trp 
1               5                   10                  15      


Met Ile Tyr Ala Cys Asn Trp Ser Val Arg Gln Asp Lys Arg Phe Arg 
            20                  25                  30          


Leu Ala Leu Gly Ser Phe Val Glu Glu Tyr Ser Asn Lys Val Glu Ile 
        35                  40                  45              


Ile Thr Leu Asp Glu Glu Thr Gly Glu Phe Pro Lys Glu Ala Gln Cys 
    50                  55                  60                  


Ser Phe Thr His Pro Tyr Pro Cys Thr Lys Ile Leu Phe Ile Pro Asp 
65                  70                  75                  80  


Lys Glu Cys Thr Lys Glu Asp Leu Leu Ala Thr Thr Gly Asp Tyr Leu 
                85                  90                  95      


Arg Ile Trp Gln Val Gln Asp Asp Asn Thr Val Gln Met Lys Ser Leu 
            100                 105                 110         


Leu Asn Asn Asn Lys Ser Ser Glu Phe Cys Ala Pro Leu Thr Ser Phe 
        115                 120                 125             


Asp Trp Asn Glu Thr Lys Leu Gln Arg Val Gly Thr Ser Ser Ile Asp 
    130                 135                 140                 


Thr Thr Cys Thr Ile Trp Asp Ile Glu Arg Glu Cys Val Asp Thr Gln 
145                 150                 155                 160 


Leu Ile Ala His Asp Lys Glu Val Tyr Asp Ile Ala Trp Gly Gly Pro 
                165                 170                 175     


Glu Val Phe Ala Ser Val Ser Ala Asp Gly Ser Val Arg Val Phe Asp 
            180                 185                 190         


Leu Arg Asp Lys Asp His Ser Thr Ile Ile Tyr Glu Ser Gln Thr Pro 
        195                 200                 205             


Asp Thr Pro Leu Leu Arg Leu Gly Trp Asn Lys Gln Asp Pro Arg Tyr 
    210                 215                 220                 


Met Ala Thr Ile Cys Met Asp Ser Pro Val Ile Ile Leu Asp Ile Arg 
225                 230                 235                 240 


Phe Pro Thr Leu Pro Val Ala Glu Leu Gln Ser His Arg Ala Ser Val 
                245                 250                 255     


Asn Thr Leu Ala Trp Ala Pro His Ser Ser Ser His Met Cys Thr Ala 
            260                 265                 270         


Gly Asp Asp Ser Gln Ala Leu Ile Trp Asp Leu Ser Ser Met Asn Gln 
        275                 280                 285             


Pro Pro Glu Gly Gly Leu Asp Pro Ile Leu Ala Tyr Ser Ala Gly Ala 
    290                 295                 300                 


Glu Ile Asn Gln Leu Gln Trp Ser Ala Ser Gln Pro Asp Trp Ile Ser 
305                 310                 315                 320 


Ile Ala Phe Arg Asn Ser Leu Gln Ile Leu Arg Val 
                325                 330         


<210>  12
<211>  336
<212>  PRT
<213>  Micromonas commoda


<220>
<221>  misc_feature
<223>  WD40 repeat protein

<400>  12

Met Ala Ala Met Gly Ser Gly Gln Ser Gly Ala Glu Ile Tyr Thr Tyr 
1               5                   10                  15      


Glu Ala Pro Trp Leu Val Tyr Ala Met Asn Trp Ser Val Arg Gln Asp 
            20                  25                  30          


Lys Arg Phe Arg Leu Ala Leu Gly Ser Phe Val Glu Glu Tyr Ser Asn 
        35                  40                  45              


Lys Val Glu Ile Ile Thr Leu Asp Glu Gln Arg Arg Glu Phe Pro Ala 
    50                  55                  60                  


Glu Pro Thr His Arg Phe Asp His Pro Tyr Pro Cys Thr Lys Ile Met 
65                  70                  75                  80  


Phe Val Pro Asp Ala Glu Gly Thr Ser Glu Asp Leu Leu Ala Thr Ser 
                85                  90                  95      


Gly Asp Tyr Leu Arg Val Trp Arg Ile Gly Asp Asp Gly Val His Leu 
            100                 105                 110         


Arg Ser Leu Leu Asn Asn Asn Lys Asn Ser Asp Phe Cys Ala Pro Leu 
        115                 120                 125             


Thr Ser Phe Asp Trp Ser Thr Thr Asn Leu Ala Arg Val Gly Thr Ser 
    130                 135                 140                 


Ser Leu Asp Thr Thr Cys Thr Ile Trp Asp Leu Glu Lys Glu Thr Val 
145                 150                 155                 160 


Asp Ser Gln Leu Ile Ala His Asp Lys Glu Val Tyr Asp Ile Ala Trp 
                165                 170                 175     


Gly Gly Pro Glu Val Phe Ala Ser Val Ser Ala Asp Gly Ser Val Arg 
            180                 185                 190         


Val Phe Asp Leu Arg Asp Lys Asp His Ser Thr Ile Val Tyr Glu Ser 
        195                 200                 205             


Pro Thr Pro Asp Thr Pro Leu Leu Arg Leu Gly Trp Asn Lys Gln Asn 
    210                 215                 220                 


Pro Arg Tyr Met Ala Thr Met Glu Met Asp Ser Ala Lys Val Val Val 
225                 230                 235                 240 


Leu Asp Ile Arg Val Pro Ala Leu Pro Val Ala Glu Leu Lys Lys His 
                245                 250                 255     


Arg Ala Ala Val Asn Thr Leu Ala Trp Ala Pro His Ser Ser Arg Asn 
            260                 265                 270         


Ile Cys Thr Ala Gly Asp Asp Ala Gln Ala Leu Ile Trp Asp Leu Ser 
        275                 280                 285             


Ser Val Ala Gln Pro Gly Glu Asp Gly Met Asp Pro Met Leu Ala Tyr 
    290                 295                 300                 


Asn Ala Gly Ala Glu Ile Ser Gln Leu Gln Trp Ser Ala Thr Gln Thr 
305                 310                 315                 320 


Asp Trp Ile Ala Ile Ala Phe Gly Lys Asn Leu Gln Val Leu His Val 
                325                 330                 335     


<210>  13
<211>  346
<212>  PRT
<213>  Arabidopsis sp.


<220>
<221>  misc_feature
<223>  WD40 repeat protein

<400>  13

Met Gly Thr Ser Ser Asp Pro Ile Gln Asp Gly Ser Asp Glu Gln Gln 
1               5                   10                  15      


Lys Arg Ser Glu Ile Tyr Thr Tyr Glu Ala Pro Trp His Ile Tyr Ala 
            20                  25                  30          


Met Asn Trp Ser Val Arg Arg Asp Lys Lys Tyr Arg Leu Ala Ile Thr 
        35                  40                  45              


Ser Leu Leu Glu Gln Tyr Pro Asn Arg Val Glu Ile Val Gln Leu Asp 
    50                  55                  60                  


Glu Ser Asn Gly Glu Ile Arg Ser Asp Pro Asn Leu Ser Phe Glu His 
65                  70                  75                  80  


Pro Tyr Pro Pro Thr Lys Thr Ile Phe Ile Pro Asp Lys Glu Cys Gln 
                85                  90                  95      


Arg Pro Asp Leu Leu Ala Thr Ser Ser Asp Phe Leu Arg Leu Trp Arg 
            100                 105                 110         


Ile Ala Asp Asp His Ser Arg Val Glu Leu Lys Ser Cys Leu Asn Ser 
        115                 120                 125             


Asn Lys Asn Ser Glu Phe Cys Gly Pro Leu Thr Ser Phe Asp Trp Asn 
    130                 135                 140                 


Glu Ala Glu Pro Arg Arg Ile Gly Thr Ser Ser Thr Asp Thr Thr Cys 
145                 150                 155                 160 


Thr Ile Trp Asp Ile Glu Arg Glu Ala Val Asp Thr Gln Leu Ile Ala 
                165                 170                 175     


His Asp Lys Glu Val Phe Asp Ile Ala Trp Gly Gly Val Gly Val Phe 
            180                 185                 190         


Ala Ser Val Ser Ala Asp Gly Ser Val Arg Val Phe Asp Leu Arg Asp 
        195                 200                 205             


Lys Glu His Ser Thr Ile Ile Tyr Glu Ser Ser Glu Pro Asp Thr Pro 
    210                 215                 220                 


Leu Val Arg Leu Gly Trp Asn Lys Gln Asp Pro Arg Tyr Met Ala Thr 
225                 230                 235                 240 


Ile Ile Met Asp Ser Ala Lys Val Val Val Leu Asp Ile Arg Phe Pro 
                245                 250                 255     


Ala Leu Pro Val Val Glu Leu Gln Arg His Gln Ala Ser Val Asn Ala 
            260                 265                 270         


Ile Ala Trp Ala Pro His Ser Ser Cys His Ile Cys Thr Ala Gly Asp 
        275                 280                 285             


Asp Ser Gln Ala Leu Ile Trp Asp Ile Ser Ser Met Gly Gln His Val 
    290                 295                 300                 


Glu Gly Gly Leu Asp Pro Ile Leu Ala Tyr Thr Ala Gly Ala Glu Ile 
305                 310                 315                 320 


Glu Gln Leu Gln Trp Ser Ser Ser Gln Pro Asp Trp Val Ala Ile Ala 
                325                 330                 335     


Phe Ser Thr Lys Leu Gln Ile Leu Arg Val 
            340                 345     


<210>  14
<211>  12
<212>  DNA
<213>  Oocystis sp.


<220>
<221>  misc_feature
<223>  partial sequence of WD40 repeat protein

<400>  14
agactcgcac cg                                                           12


<210>  15
<211>  1643
<212>  DNA
<213>  Oocystis sp.


<220>
<221>  misc_feature
<223>  WD40 repeat protein, 3EUKG2027500, encodes SEQ ID NO: 1

<400>  15
atggccgagc cggacgcgaa cggcgcgtcg gatggcaagc gcgcggagat ttacacgtac       60

gagttcccca acctcgttta ctccatgaac tggacggtaa gcaagcacag ttgtagcgca      120

cacagccttg ggtgctgctg cgtccttgct gatggtcatg cttgcgtcgg ccacccgttt      180

tctcacctac cgaaactggc tgcgcacccc tttcagttat gcttgtaatc accgcctcat      240

tacctttgtg tgcagtctcg tcgggacaag aagtttcgac tggcagtggg cagcttcatc      300

gaggactata ataacgtcgt caatatcata tcctgtacgc gccttccccg cactacccag      360

cgtagcggct cagtctgtcg tgcggggttg gcacgacagt cgttgggttg aagactattt      420

gaactattgc tgattagtga cgcccatgct ctctcgtaaa gcggggcgga tggctccttt      480

cgcgcccact accatctggc cgcgacctgt ggctctgacg cctcgctggg tctcgacctg      540

ctgaccatac tccctcgtta ctggctgcaa cgcagtggac gaggaacagg gcaagtttgt      600

atgcgacccg tcactgacct tcaagcatcc gtacccaccg accaaggtga tgtttgtgcc      660

agaccgggaa ggcactcggc ccgacctgtt ggccaccacc ggcgactatc tgcgcgtgtg      720

gaagatcggg gaggatggcg tcacgctgca gaagctgctg aacgatgtaa ggcctagatt      780

aacgtccagc gctgtgggga aggaccgacg gcacgggccg gaaagggaca tgcatgccgt      840

accgggggtg atcacgggcg acggcccacc ggatcatcat cttcctcgct tccaccaacc      900

ctgcgcaacg tccttcgcaa cgtcttgaac aatattttgc atctttcacg ttcatccatc      960

ctcgtcatgg cgaaattaac ttgcagaaca agaacagcga gttttgcgcg ccgctcacat     1020

cgttcgactg gaacgagacc gaccccaagc gcctgggcac cagctctatc gataccacgt     1080

gcacgatctg ggacatcgag aagggcgtgg tggacacgca gctcatcgcg cacgacaagg     1140

aggtgtatga catcgcgtgg ggcggcgtcg gcgtcttcgc gtcggtgtcc gccgacggct     1200

cggtgcgagt cttcgacttg cgagacaagg agcatagcac gatcatctac gagacgccgt     1260

cgccagagac gccgctgctg cgcctggggt ggaacaaaca ggaccccagg tacatggcga     1320

cgatcgtgat ggactctaac cgcgtggttg tgctggacat ccgcgtgccc accgtgcctg     1380

tcgccgagct gcagcggcac caggcgtgcg caaacgcgct tgcctgggcg ccgcacagca     1440

gctgccacat ctgcacagcg ggcgacgacg cacaggcgct gatctgggac ctcagcgcgg     1500

tatccaagga gggcgactcg ggcctggacc ccatcctcgc gtacaatgca ggtcaagagg     1560

taaaccagct gcagtggtct tcgacgcagc ccgattgggt ggcagtctgc tttggcaaca     1620

aggcgcagat cctgcgagtg tga                                             1643


<210>  16
<211>  1023
<212>  DNA
<213>  Oocystis sp.


<220>
<221>  misc_feature
<223>  WD40 repeat protein, cds of 3EUKG2027500, encodes SEQ ID NO: 1

<400>  16
atggccgagc cggacgcgaa cggcgcgtcg gatggcaagc gcgcggagat ttacacgtac       60

gagttcccca acctcgttta ctccatgaac tggacgtctc gtcgggacaa gaagtttcga      120

ctggcagtgg gcagcttcat cgaggactat aataacgtcg tcaatatcat atccttggac      180

gaggaacagg gcaagtttgt atgcgacccg tcactgacct tcaagcatcc gtacccaccg      240

accaaggtga tgtttgtgcc agaccgggaa ggcactcggc ccgacctgtt ggccaccacc      300

ggcgactatc tgcgcgtgtg gaagatcggg gaggatggcg tcacgctgca gaagctgctg      360

aacgataaca agaacagcga gttttgcgcg ccgctcacat cgttcgactg gaacgagacc      420

gaccccaagc gcctgggcac cagctctatc gataccacgt gcacgatctg ggacatcgag      480

aagggcgtgg tggacacgca gctcatcgcg cacgacaagg aggtgtatga catcgcgtgg      540

ggcggcgtcg gcgtcttcgc gtcggtgtcc gccgacggct cggtgcgagt cttcgacttg      600

cgagacaagg agcatagcac gatcatctac gagacgccgt cgccagagac gccgctgctg      660

cgcctggggt ggaacaaaca ggaccccagg tacatggcga cgatcgtgat ggactctaac      720

cgcgtggttg tgctggacat ccgcgtgccc accgtgcctg tcgccgagct gcagcggcac      780

caggcgtgcg caaacgcgct tgcctgggcg ccgcacagca gctgccacat ctgcacagcg      840

ggcgacgacg cacaggcgct gatctgggac ctcagcgcgg tatccaagga gggcgactcg      900

ggcctggacc ccatcctcgc gtacaatgca ggtcaagagg taaaccagct gcagtggtct      960

tcgacgcagc ccgattgggt ggcagtctgc tttggcaaca aggcgcagat cctgcgagtg     1020

tga                                                                   1023


<210>  17
<211>  4908
<212>  DNA
<213>  Chlamydomonas reinhardtii


<220>
<221>  misc_feature
<223>  WD40 repeat protein

<400>  17
atgagcgcga gcgacaagcg cgagcggcag gaggtttata catatgtggc accggacccg       60

gtgtacgcca tgaactggag cgtgagttgc cctcgggcac ggctatgacg ggcgctcgct      120

gggctgctgg ggacggggct tgggaaggcg ggaggcccgg cgccgtctgc agaccctgcc      180

gctttgtttg ggtggatagg actgtgtcgc ggcaaacgtc cttgcggact gcactggctt      240

gaatgccatg agccaagggg cgctgtggag aggcagagca gagggcgggg atgccaagca      300

gcgcgtggcg tggcggccgc cgcccctgcc tacacctgac tgccctcggg gctgccgacg      360

attgctgact cgctgtacga caaattggcg tgccagaggg ccggggctgt gggatggcga      420

tggcagtaac attctgcaag catgcctctg ctgtgccacc acgtgctcac acacccacac      480

tcgcctgcac ccgctaactt gcccatgacc cacacctaca ccccgcacaa ccctgcacac      540

ccgcacccgc aaccatacct accacaggta cggcgggaca agcgcttccg actgggcgtt      600

gcttcattcc gggaggacgt caccaactat gttgacattg tgtcgcgtga gtccaggcgc      660

acacggggcg attgagtaac atggggattg cggacggcag ttacactgtt acgggggtat      720

gaagtggagc tgtagagtac agccggagat aagcacatcg ttcggcacag agggagtgtg      780

tcttggcctt gcccacccca cactgctcca cttccccgcc tccccgcctc cccgctcccc      840

gcctcccccc cgccccccgc ctgccccagt ggacgacgag tcggacgagc tgcgggcgga      900

ccccgggctg cgcttcccgc acgactaccc ggccaccaag ctcatgtgga tgccggaccg      960

cgagggctgt cggcccgacc tgttggcgac cacaggtgcg gcggggctgt ggcggtggtg     1020

gtggtggtgg tggcggcggt cgtggtggtg gtggtggcgg cggtcaggga tgggatgtgg     1080

gttgggggat tgttcgagat gtctggcgca ggtacaagtg gccgggctgg cagcagtgct     1140

gcacttggta acactgccgt tatgtgtctg acacaagccg cacacacgtc tgggctggta     1200

catgcacctg cttcggccgg tttacacgga ttacccgcca acacacacac acacacacac     1260

acacacacac gcacacacac acgtgcacac acaaccacca cacaggggag gcgctacgca     1320

tctggcgtgt gtgtgacggc tcggagggcg aggagagcgg cagcgggccg ggcggacgcg     1380

gcgttcagct gcggtcactg cttaacaacg tgagcggcgt gcggggcgcg cgtttgcgct     1440

gtgtacgctg tcatgttagt gttagggggc acatgggaag gcaaacgggc agggcagtgt     1500

gtgcgctgcg tgatctgtgt tgcggtggtg tgtgcgggcc gtgtgccgtc atgggggctg     1560

cgtgtcgcgt gcctgcatag gttgtggggt tgtgtgtatg tgtgtgtggg cgatcacaat     1620

gtggggttcg gacccgggtg tgaggagggt tgggcgggtt gcgcgaactc atcagagcgg     1680

cagctgcgct cacgctgcaa gcccatccaa atctacggtc acaactgttt gcacacccac     1740

gaacccttca aacaaaccgc ccaaacccgc atcaacgttc tcgcacctcg cagaacaagc     1800

agtcggagtt ctcggcgcca ctcacgtcct tcgactggaa cgaggccgac cccaagcgcc     1860

tgggcacctc ctccatcgac accacctgca ccatctggga catcgaggtg cggcgcggat     1920

ggcggcgtgc ggcgtgtggc gtgtggcgtg ggatggaggc tggtggtggt ggcggcgttt     1980

cgtccgccct tgcttaacgt ctcacctccc tggagtgacc tgcagtcgct caccgcggct     2040

gtgtgcgtcc tctgtgcctt acgcctccac cgcccccccc catcgcccag caagccatgc     2100

cagtaccccc ccccttagct agtcatgaag gtaaacctcc cccgccccac ccctcgtctt     2160

cggagccctg ccaagctcct cccccacctc ctcccccgcc ggcacctgct gtctacttcg     2220

ctacacttct ctgtaccaat ccttgcaacc cccgccggca acgacgcggt tgctgtgcca     2280

ctgtgcctgg ctacaacttc acttcttggt ttattatgaa tgtagtaacc catctcccca     2340

ccaccccctc ctctccccca cctgcagaaa ggcgaggtgg acacgcagct catcgcgcac     2400

gaccgggagg tgtacgacat agcctggggt gggctgggcg tgttcgccac cgtcagcgcg     2460

gacggcagcg tgcgcgtgtt cgacctgcgg tgagcggagg gggggcgaga gggcgggagg     2520

gagggaggga gggcatcggg atggggggcc gggcgggagg gaggcctttt catgttatga     2580

tcacagtttt ggagcgctgg cgggggatgg cgaggtggtt cacctaaaag caaccatgac     2640

acacgaagat gtagggcagg cgtgggtgcg cgcggggagg gggctgaaag gcgtgcgcct     2700

gcgtggtacg gggttcgtgt ctacggcgca tggggggtcg tggctgctgc caggcctgtc     2760

gggcgggtgg cacgtggcac gtggcgggtg gactgctggt caaactccga cattcccagt     2820

cccagcccat gtccgcacgg gtcccgagcc actcccatcc cgcccttcct cccgccccgc     2880

atcccccatc tcctcctccc cctcgccgca gcgacaagga gcactcgacc atcatctacg     2940

agagcccgca gcccgacacg ccgctgctgc gcctgggctg gaaccggcag gacccccgct     3000

acatggccac catactgcag gtgtgcggcg gctgggtata tgtatgtgtg tgtgtgtgta     3060

tatgtgtgtg tgtgtgtagc agctggtggc atttgcgctt ggcagccatt ggccgttggc     3120

aagacaggca ggaacaattc acgatttgga gagcgggacc gttgatgtga tcaggtggcg     3180

gtttgaagcg aacatcaggg tgtgtgggtg ggggggggga tcagcaacag ggctaacgcg     3240

gcgggcgcct cagtgcggca ctgacagctg cacgcggtgg cggcaggcga gcgcaaacgt     3300

ggagcgcaag cgtttgctgc gtcgccagtc gcgcgagtgc attgctgtca ctggcggcat     3360

ggtggtggtg atggcggtat gtgtcatgct ggctcagccc ctttccctct ctccctctcc     3420

accacatccg ccctttgcgc tctgtctttc gtggcccatc catctcctcg cctcgcctgc     3480

aggactcgcc caaggtggtc atcctggaca tccgctaccc caccctgccc gtggcggagc     3540

tgtgcaggca ccaggtgcga ggcggcggag cggttgtgtg cagggcggcg gcggtgggtg     3600

ggtgggttgg ttggtttcta gctgccaggg ttacggcagg ggaaggagca aagacagagc     3660

gatggcagtg cgggcgattg tcgattggag ccagtcgcgg ggtctcggag ggggcccagc     3720

acttgcagtt gaaaggcgct gggttttgcg aggtgaaacg gattgtgttg tgttattgga     3780

gttgcgggac ctttcatttg ctgtgctctg tgccgctgtt tcctcacagg cgccggtcaa     3840

cgccttggcc tgggcaccgc actcggccca acacatctgc accgccgggg acgactcaca     3900

ggtgtgcaca aggggtggcg gaatggggca ggcagcgtgt gtggttgtgt gcggtggttg     3960

ggggttgcaa ggattggtac agagaagtgt agcgaagtag acaacaggcg tggggtggat     4020

ttccagcgtg gcttgtactc gagctcctct ccgttggtgt gaccgccaag ccaagccggg     4080

ctgcttccga agctgttttc gtcagcccca tcaccggtgc ccggccacct ctgcaaagcc     4140

cgcgcctcgc gctcgcccca acgcaaccct tgcctcgcac cccgtctgcc cacccacgca     4200

caggcgctga tctgggatgt gtcggcggtg ggcggcggca acaatgccaa cgcggcggcg     4260

ggcggcggcg ccagtgacgt cagcttggac cccatactgg cgtacggagc ggccagcgag     4320

gtaagggggg tttctgggga ctgggtcttt ggggctgggg ttctggtgtt tgcgtgtgtg     4380

gggggggggt gtatgtgtgt gaaagaagaa aacgaaacga tgtgtgtgtg tgtgtgtgtg     4440

tgtcagagag caaggaaaca caggagagag tgtgtgtgca tgtgtgtgtg ttagaaataa     4500

cacgagatag gtgtgtgtat gtgtgtttgc aagcatgcac caaacccagc cgcgaaccca     4560

tcctgtcggt gaggtgcgaa gggtggaggg cgtgggttac agcggtgtag tttgcttcag     4620

ctggttgagt gcattggaaa ggcgtgcgcg tcagaagggc tcgcgcgacg agaagagggg     4680

tgtgttccgt gggcattggg ggtgctgtgg gtggtggcga agaggagggg cggcgccgaa     4740

gggcctgcgg tttgggctgg gttcgttgcc ggtgctgctc cggccttgtt gcgccccaac     4800

cggcatcccc catcacaatt gcaggtgaac cagctgcagt ggagctccgc gcagccggac     4860

tgggtggcaa tctgcttcgg caacaaaacg cagatcctaa gggtgtga                  4908


<210>  18
<211>  5770
<212>  DNA
<213>  Coccomyxa subellipsoidea


<220>
<221>  misc_feature
<223>  WD40 repeat protein

<400>  18
gcttgagcgc acaccagata agtgccaagt tcaccatcgc acgtagcaat ttggcagctg       60

ccaactgttg aacacccatt ggtgatcaag gttaggaaaa accacgaact ctgcaggagg      120

atcaaggatc aaatcttgaa gtgatgatag ttggatctca atggtagaat ggatgggaga      180

ctgaatgaca gacgggcgga gatatacacc tacgactctg agaacatcgt atatggcttg      240

agctggagtg taaggatcca ttcagtctat cttcctgaat ttaattgctg cattatgtat      300

tgttccactt gcccctgggt caagtcagtc catgcactct tacttgctac ccctctggag      360

acacagactg actggattgt tctccttgct gggctgcaga accgccgtga caagaagttt      420

cgcctggcag tgggcagctt catcgaggag tacgacaact acgtcgaaat catcacacgt      480

gcgtgccgtc ctagtctcac cttttgcacc ttcacctgca ccgaacctgc cctcccctgc      540

ttgtgaaact ctcctcaaac cacagttgag ccacatgcaa gaaacagagg aagatccgac      600

accaccggct gtttattgct caacttgatg ataattacga gatgaatttg catgagggta      660

ggactggggt gagaggggta gagcactcct tggatggccc aaagccatac cacttacaga      720

ccctcttcca ccaatgtgtt gcagacaggc ttagtgcgca aggagccaga aactgagatc      780

agatgatacc accaaactgt tgaagacagg gctttgggtg cgaaggtcca aagcagaggg      840

gaaaagatag attcggtgct gcagaaaggc ttgcataagc taagattcag acactctcac      900

caatctgttg cagacagtct atggacgcaa gggtgcaaaa attgagatcc atatattctt      960

taccagtctg ttgcagacag gctttggctg cagaggccca aagccgagga agaaagtttg     1020

gttcgggtgt gtagacaggc ttgcagaacc agaggtccac gtactctgac cagtctgtta     1080

cagacagatt cttggggtgc agaggcataa agcagacggg gaaagattga ctgggttgtc     1140

caaacaggct ttgtgcaggc aggctctatg cagacaggcc tttaagtgca gcttgagaag     1200

cagagaccca gacactctca ccaagctgct gcagacaggc atgggatgca agggtccaga     1260

gcagagctga gagggagaaa gcggttcggt tttacatgac gcctgcagac agtccgagac     1320

aggcttgcag aataatacga ttatctgttc gatagcacaa cgtgcaaatt catcacagat     1380

aagctctggg tgcaaagctt cagaagttga attccagatg ctctcaccca tgacaggcgt     1440

gggtgacaaa gctccagaag ctgagaggga aagagtggtt cggttgtaca ggacgcctgc     1500

agccagtctt gagacaggct ggcagaggct gacagggaga cattgtttgt ctgcgcaatc     1560

gatgacatga catgcgagtt cacatcggac gctcagatct agacaccctc accaatgcgt     1620

tgcagacagg gctctggatg caacgctgca gaagctgagg agggctggca ttgtctggcc     1680

catcagggct tttttgccgt cgtcaattcg ccgttagggg gagtttatga ttgcagcgag     1740

gtgtagattt tagggttcga ttttagggtt taaggtttaa aaccttattc ccccgtgacg     1800

gcgacgacga tgaaaaaagc catatgcctg ggctgagcag acgggcttgg gttgccaaac     1860

ttaaaaattc gagaatgaac caggggtgtg tggtgcgcgc agtcgatgac gcgacatgca     1920

agttcacgtc ggacgcgcag ctggcgttcc agcacccgta cccacccaca aagatcatgt     1980

tcatgccgga caaggagggg gcgcagccgg acctgctggc aaccactggc gactacctgc     2040

gcatctggca gctgaaggag gacggcacgc agctggtcaa gcttctcaac aatgtgcgcc     2100

tccttcccta gtaaaattgt ctgtttatta gtatcccacg ctgttgatat cccccgggga     2160

gattgcgcag cctcaaacaa gcttcaggta aaggtcgaca gctgaaggag gacggcactc     2220

agctgatcaa gctgctgaac aatgtgcgcc cgctctcaga aagacaaatg gagaaagaag     2280

tatgcagaca tcaggcagcg cgagcgctag aagatagaag aaagggctgg ctcacaagtg     2340

cacgtcatgc aacaacaata caatgatacc cagccctgac aattcttgtt attgtgaaca     2400

catgctgtca gattaccccc ctcctaccaa gggggggaag gtgcggctgc aaacaagctt     2460

caggtgacag atggctgctg aaggaggatg gcacgcagct ggtcaatctg ctcatggatt     2520

tgcgcctgct ccagcatagt gaccaattat tgtaatcttc acatgttgtg tgcaattgtg     2580

tattttctcc ttcgcgccga cagggcccgc cactgtaggc aggatacggc ttacagacgc     2640

ggtaggaacc tactagagat ttgtggtggc tgcagcttac ggtgcctgcc atgtagatga     2700

cggaacccat ggtgtggtgg tggtaatggg tttgggccac atgctgtgat gtgctgccct     2760

tctgtatcag agcagtgcat ctcacttcct ggttcaggct gacggattgg aaataactga     2820

gctcaacatc ggaactaagt ctttcaatcc agaaggccct tttcgtcgat cttcgcacgg     2880

aagtggaagt gcagccgccc caatatctgg ttcaagctgg actctttcag gcaggactgg     2940

caggactgag gcttgcttgc gtagttgagc ttgtgcagcc gccacccctg tgcgctggaa     3000

gttagcagtg ctgaattgtg tgtggcttat gtgtgaacta tctgcgcaga acaagaacag     3060

cgagttctgc gcaccgctca cctccttcga ctggaacgag accgacctga atcggctggg     3120

gacgtcgagc atcgacacca cgtgcaccat ctgggacatc gaggtgactc gctcacaccc     3180

ttgcaccatg atttcaaggg tcttgaacta cagtaagaca catattttct ggcttgcgta     3240

ggggccagag cacagtgtcc cctgagctga gaatcccctg tgtgactagc tttacatgga     3300

agcctaacca gatcgcctca tgcagtccta gtcacacagg ggactctttg ttgtatgatc     3360

ttattgaaag atcttttcta gtctcaaagg ggactcttta cctaacgtca aatcagcgcg     3420

ctctgagtgg catatcagat aagggagtgt gtctcaatgt atgtcaccaa ctggacccta     3480

aggccagtct gacctctagg tggttcctca catgatagac cggtactgag catgtcactg     3540

agctgagctg gcggtcgcgt gattgtctct gcagaaaggc gtggtggaca cacagctcat     3600

agcgcatgac aaagaggtct atgacatcgc ttggggcggc gtgggagtct ttgcgtcagt     3660

ctcggctgac ggctctgtgc gcgtatttga cctccggtgc gcccctgaac tggcaggtgt     3720

cgattgatga gctggcgctg gctcattggg cgatattgga agtgtcttag ggagttcatt     3780

tccttgcata gcagtcagga gagtacttgc atggacaatg gaactcgatg acaaaggcag     3840

ccctcccatg ttcgccccac accctactcc cacacctgaa gcaagaggtg aagtgggact     3900

tgtgagggca gaagcacatt gtactggtgt ttacagaaga ctacctctta cgggcctgag     3960

tgccagctgc tgggtctcac aaaagcgccg aaaaaattgc actcctttcc cttgctgcca     4020

aggtctttct tggcgactgc tagagtgacg aagaagtctg taagcaggtt tgaagctcct     4080

tggcactgtg attgtgatca gcggggggct tgtttgctga tcagcaagga gtgcattgct     4140

gcctgcagat atcatgcagg ctatgcatcc ccgccctcct atctggtgcc ttcatatttg     4200

caggtctacc tcaaactttg gggtacagtg catcacattg aggtgccgct tttgcaggga     4260

caaggagcac agcacgataa tctacgacag cccgcagccg gacacgccgc tgctgaggct     4320

ggggtggaac aagcaggacc cccgctacat ggccacagtc ctcatggact ccaccaaagt     4380

ggtcatcctg gacatcaggt gacacactgt gcttacgctt ctttgaataa ttgcactgta     4440

acgggcggcc ctcttctctg cagtttctga atttctgaat ccattacctc tctttaaacg     4500

ctgaatggag tgtactacgc tgggtttgcc cattatgtca acattaacaa tgaacactgt     4560

aattaggaat aataatatgg ctgccagcaa ggtggagggc ttgccaggca tcttcaggca     4620

ccgccctgtg cgcctcggaa aatagaattt caaacttgaa tttggcactt gtgccgcggc     4680

aggtaaccgt ctaggtccac accctcctgg gatgtcagac cctgcttgag cctcggggac     4740

tctatttcca cctttgtact tttcacccag agggcccaga tgagctgtaa gtggggcgac     4800

ccgtcgcatg agcttgaatt caaaagctgt gccgtggcag gtatccgacc ctgccggtgg     4860

cggagttgca gaggcaccag gcgccggtga acgcggtggc gtgggccccc cactcctcgt     4920

gccacatctg cagcgcgggg gacgatgctc aggcgctgat ctgggatctc tcttccatgt     4980

ctcgccccat ggaccagacc ctgggtgagc ccaccttgat ctagatcatg tgtatctcat     5040

ttgctacacg cctgtcctgc caaaagtcaa gattaattct tgcagacaag ctgcagtaaa     5100

aggctgccac aacctacccc tcgttggcct cgccaaatgc aagcaagacc caatcaaccg     5160

taggcacggt gctcgagctc agtgtttaca ccgccccctg tcagactggg cgtccatgtg     5220

gcaggaatgc tgcaaggagg gagctcgtcg ggttgatggt gtgtgggaag gagtgggcgg     5280

aagattgttg tgcttgaggg gcatacatgt tgtcaaggca tgatgtgctg tgctgcgcag     5340

atcccatcct ggcatacagc gctggggcgg aggtgaatca gctgcaatgg tccacgacgc     5400

agcctgactg ggtggccatc tgctttgcaa acaagaccca gatcttgagg gtgtgagggg     5460

attgcgaacc tgtgaggcgt ttgtctcagt ccaattaaat ctgtgaaggg gtgccagaat     5520

ggatgtacag gagcgtcgag tagaattcta tattcttggt atcgggttac gttcgggttg     5580

ttcctgttgg aacggttgag ctgtgtttta gttctgcctg tctggcacaa aaaaatgtct     5640

gttctggtcc ccgtggactg tgaatgagag aaccggctga gatctttgag aacctattga     5700

catgaaggtt gcagcacatt aagaaaagga ggagcgctct tggtaacctt gctgtaacgt     5760

tgctagatgc                                                            5770


<210>  19
<211>  5079
<212>  DNA
<213>  Volvox carteri


<220>
<221>  misc_feature
<223>  WD40 repeat protein

<400>  19
atgagcaaca gcgacaagcg tgcggagatt tatacctatg tggcccagga ccccgtctat       60

gcgatgaact ggagcgtaag tgtcggactt tggtgttttt tctccctctt cgcaactcga      120

gccactcata gtcactcgca cttaagggtc gtaacgtatc gcctgacggg atggtactgg      180

ggtcatgggg ctgacaacca ccgctagctc actttgcatt cccgggctgt gggggagggt      240

ggtgtgaggg cggggaggac ggcacaacgg gtggcagtgc gtagcccctg tccatgccat      300

agtttaaatg ctgcttggag cttgcgttga gctctcgaca cccccatgcc gccgccttag      360

atgtaaggga gcagtaggga cgtcggtgag cgggcccctc ttaaccccct ttccacaacg      420

aaatacacac gcacaccgtg tcggcccggc ccagtcactc cagcccccat ggcttttctc      480

tccccgccga caggtccgcc gtgaccggcg ctttcggttg gcggttggtt cgttccgcga      540

agatgtaacc aactacgtgg aaatcatcag ccgtgagtac ggcagctacg tcggtgatgg      600

ggtcccgtct ctagggtgcc acataaggtt gccccttttg ctaagtgtcg ttgttgatac      660

tgttgttgtt gtggtggtgg taatggtaat ggtaatggta atggtaatgg taatggtgat      720

gatgatggtg ggcgtgggtg tgctttgggt gttgcgcatg ggaagtgcgg cttccctccg      780

cccactctcc aacccactct ccccttcccc tcccccccca cacacacact cacacacacg      840

cagtggatga cgatgctgct gagctgcgct ccgacccctc cctgcgcttc caccacgact      900

accccgccac caagctcatg tggcttcccg acagagaggg ctgcagaccc gacctgctag      960

cgaccacggg ggaggcgctt cggatctgga gagttctgga tccggattca gttgcggggg     1020

acggggagga cgtggcggcg gcgggggcag ggggaggagg agcgacgggt gttgggcggg     1080

gtgtgcagct acgagctctg ctgaataacg tgagtttggt tttggggagg ggggccgccg     1140

tgcgtttggc tgcgtttggg gcgggggacc cctgagatag gttcgtatca ttagctaata     1200

tacttaatat cttgaatatt cctggatttt gcttttacgc tagcagaact tgctattgca     1260

cgtacaccat tattatccat taccgaacca ggaaattacc ataccgccaa gcaactggtc     1320

acagcccttc tgccccaccc ccgggtgccc ccgggtccgg ctcctgctcc ccccttcccc     1380

cccgcccacg gctcattgaa accggccacc cttcagcggc aactgatatc cgatattcct     1440

cagagcttat cactgcagat gaaccttcaa tttacttttt attcctatcg acacgggcgc     1500

agacttgttg ccctatcccc tgtggtaggg cccgttcttt gactttggca tcatcacgta     1560

actgaggtgg ccccaggtga ccgacttgac taacattgat gtgccccttc ctgtgctggt     1620

gctggtgctg gtgttggtgt tggtgttggt gttgttcaga acaagcagtc cgagttctcc     1680

gctccgctga cctccttcga ctggaacgaa gcggacccca agcggctggg aacctcctcc     1740

atcgatacca cctgcacgat ctgggacatc gaggtgtgtg gagcgtgagg gtgggggggc     1800

cctctggggg gctgtggggg ttggggggag gggaaggtgg gaggggaaac ccgctgatcc     1860

cggggtttgg gaccgttgga tttgcgcggg acatgggtat aggggtttcg ggggatcggg     1920

gtttgagctg gggagctgtg gcgtgttggg agagagaagg gagggaaggg tgtgctgagc     1980

ggtgggtttc gtggggctga tgtgtttttt tatgtgagat gtgtgccggt gttgtgggct     2040

tatgtttcct cccttttcga ccctctgcac ccctctggcc ttcttcctta cacatcacct     2100

agatcccgtt ccccctcctc tctcttcgcc tctccgcccc catacacaca accacattga     2160

tcgcgtaaag aaaggggagg tggacacgca gctgattgca catgaccggg aggtgtacga     2220

catcgcctgg ggaggactgg gggtatttgc aaccgtgtcg gcagacggct ccgtgcgcgt     2280

gttcgatttg cggtgggttt ggattgggtt tggattggct ttgtatgtat gtatgtatgt     2340

atgtatgtat gtatgtatgt atgtatgtat gtatgtatgt atgtatgtgt ctatgtatgt     2400

atgtatgtat gtatgtatgt gtctatgtgt gcatgtgtgc atgtgtctat gtgtctatgt     2460

gtctatgtga ggtttgggca gtgtggtgag aggcgccatg ggcgccgtgg gtttcaggac     2520

acggtgtctc cctgggaccg gacctctccc cgttgactgg ccgggactcg ttgaccatca     2580

gcctgatgtg gcgcggccct cgcctcccgc cctacttcaa attcaaattt cccgccctcc     2640

gctgtccggc tcctaccccc ttattgaacg tttcgacagt tgattttcct gagctggctc     2700

ctccttgtat cattagattt cgcttcaccc tttcgctctg ttccgctcta cgccccgcct     2760

ttgctgcccc tcctccccgc tccctagtga caaggagcac agcaccatca tctacgaaag     2820

cccccagccc gacacgccgc tgctgagact gggctggaac cggcaggacc ccaggtacat     2880

ggccaccata ctcatggact cacccaaggt gaggggaagg gagaggggga gggggcccta     2940

agggggagca tggagggcat gcaacgagca aaacttactg gaaattagca tccccagtgc     3000

ggcagcaaag ccgggaggag aggccggagg aaagcctgca gcagcagcag cagcgcgcgt     3060

cccaagtacc aaaacattcc atatattacg tatacgcttt ctgcacctgt atgggggtgt     3120

tgtgtagtgt tgtgatttgt gatcgtacgc gcgtatttgg taaccccccc tcctgctgtc     3180

cggcgcaacg ccccatcgtt gctttgctgc ctggacgtgt atattccctc tgtcccgtac     3240

gttcccgggt gccttgcacg taggaattcc ctcccccctc atttccccac ccccgtgtgt     3300

gtgtgtgcgt gtgaacccct taccccccac cccccacccc caggtggtaa ttctggatat     3360

ccggtacccc acgttgccgg tggcggagct gcaccgccac caggcacccg tcaacgcact     3420

ggcctgggcg ccccactcag cacaacacat atgtaccgcg ggggacgact cgcaggttgg     3480

gaggaggggg gcgggacccg tgggtcggtg cagggaagga tggcgccacg gatgggaaca     3540

cggaagggcc gatattattt gctgttgatg ctacatttgc aggtgacata tggcccgagt     3600

tggcgttttg ccggtgtgtg tgtgtgtgtg tgtgcgtgtg tgttatcccc gtatgtgctg     3660

ctgcacgctg tactgtactg ccccgggtgc tcgtcgaggc ggtcgatccc tcggcacgca     3720

tcctcgcggt ttcttgcata tgattcccag ttcccctccc cgtccccctc cccgtctcgc     3780

tcgcaggcgc tcatctggga cgtctcagct gtgggcagtg gcggtggtca gccgggggcg     3840

ttaggggggg gaaccgcggg ggatgtgtcc ctggatccca tcctggcgta cggcgcacag     3900

agcgaggtac ggcagagcag aggggatgaa actagaagtg ggtggtcagg aggatggagc     3960

taatgaacag actgtgcgga agggttctgg agttcggaat gaatcagctg gtactgggta     4020

gtagctggtt ccgatgtggg taccaggtgg tggtcggttc ggcaggtggg cgacagcggt     4080

gtcgtggggc gccgtggaac acatccagct atcttccccg gttaatgagg gcttgggtcg     4140

cattttcttt ggcggggttt catccacagt gattggattc tttggaggag tgcaggccag     4200

gtagcgcgct ctgcatgatg tgaatgaaca gtgtggtgca gcgcagctct gttgcaccac     4260

ccacagaaag ctgcgtcgag cctcggcgcg aatacacgtg tgggctacca ccattgccct     4320

gcattgcagt ccggggatgt gggacgctca tgcagagaaa aaagctgaac cgcctttctc     4380

cccaatcggc cgctggggga cgcatgcagc tgatgatgct gccgtcctgc tgccggcgtt     4440

gtctgtccca ccaacacgat ctgccccctt ccgtaaccct gtctgtctgt ctgtccgccc     4500

ttctatcctg atgttttggg ctgccttttc cttgcaggtg aaccagctgc agtggagctc     4560

cgcgcagccc gactgggtgg ccatatgctt cgccaacaag acccagatcc tcagagtgtg     4620

acccctgtgg gacctgcggc gacttggcta ggcttggata actcagcggc tgcgactttg     4680

cccatgtttc gcactcccgt agtgcagatg gcacgagtgt gggggcaggc aggatcaaca     4740

gcgtcgcctt gcctcaggcc acatgactcc gccatgtctg ggtgtgtagg tccgcgggcg     4800

ctgcataaaa aaagcaatcg tggcattgga gagcgcggat gcgtgccgaa aaatcttgcg     4860

ttttgcccga gggctgatgg gtcctgggtc gaggaggaag ggtgcggtta gagaggactt     4920

tcacactgaa tggctttctt tgaggagccc tcgacccctg tatgcaaatt tagcctcccc     4980

gtccacaagg aggcatgctg cggctttttg cggcaagctg tccaaaatgt ggcatgcgta     5040

ttccagagga gttttggatg taagcaggtg gaacctggg                            5079


<210>  20
<211>  3272
<212>  DNA
<213>  Chlorella variabilis


<220>
<221>  misc_feature
<223>  WD40 repeat protein

<400>  20
gtgaactcgg ccggttgcca tccaggagag agccaacccc tgagcccctg cacgggtgag       60

gaccgtcgga ggaaggcgca gaatgccggc gcggggcgcc agagcggggg agtgcgggcg      120

ccgaccgccg cccagcagca gcagcgccgg ccgcttagca gcagcgcctt gtggggggtc      180

gtgcgcttat ctcctcgcat tctctcttac aggctcccat gcaggaccag cagcagcagg      240

gggaggggcg ggcagagatc tacacctaca gcagctcggc cagcgtctac gcctgcggct      300

tctcggtgag cagccagcca atggcagcgg ccgccgtcag ctcttccagc ccagagtgca      360

gcacaggcag cgcaggcagc gcagcacgaa gtctcagagc atgtcggccg ctgcgctgca      420

gccgtgtgcc cggcagctta ccgccactct cgccgctccc gccgctgcgt gcctcccttg      480

cagtcccgcc ccgacaagcc cttccgcctg gctgtcggca gcttcatcga cgactatgca      540

aacaaagtgg agatcatcca gtgtgagtgt ctgtttttag gcaggcgctg cgctgcacgg      600

actgcggatg gcacggtgct gccgcggctg ctgtgattgc acgcggctgc tgctcgggtc      660

tggcgccgcc gctggggtgg gatcttgtct ggctctggct ctggatcttc atctggggcg      720

cagtcttggg catgtgggtg gtagcacgct gctcagacca ggccccacct gccgccacca      780

ccactgccta tgcgcgccac ccgcagtgga cgaggcggcc ggggtggtgc gcaacaaccc      840

tgcgctgacc ttccagcacc cctacccgcc caccaaggtc gccttcatcc cggataaggt      900

gaggcgccga ggccgggctg actgagccag gttgtgggct gcctgccggg acaaggcagc      960

agcctgcggg tgctgtcctg cccctgcccc tgacctgacc gtgctgcccc gtatgctggc     1020

tctgtgatcc ccgccttctg aacgccacca cctgccgccg ccgcctcctg ccccagagcg     1080

ggacccgccc cgacctgctg gccactagcg gcgacttcct gcgcctgtgg cgtgtgtcgg     1140

atgagcccgg ggcgcagcag ggcgtgcgcc tggagaagct gctgaacaat gtgagggcgg     1200

gcgggcgggc ggctgcgctg cagggggtgc tcagcgaggg tgggagcagg ggtaggggcg     1260

cctcggtgcg gcgagtgccg gcagggctgc tggggagcga cgtgccggcc agtgtacaga     1320

gcgcaccctc gccgccagcc ctcgacgcgg cgctcgcacc acctcacacc ctgccgtctc     1380

cctctctccc ccatgctccc atgctcccat gctccccttc ccgcccgcca cccgccgcag     1440

aacaagggcg gcgactttgc ggcgccgctg acgtcctttg actggaacga gctggaccct     1500

cgccgcgtgg gcaccgcctc gatcgacacg acgtgcacgg tgtgggacgt ggagcgcggg     1560

gtggtggaca cgcagctcat cgcgcacgac aaggaggtgg gtgggtgcgg cgcggggggt     1620

gtagtgtggc cacgggcgtg gcgtgcctgc ttgtgggtgg agcatggggt ggtggtcagg     1680

ccgcagttga ttgcgagcta caagggggtg ggtggggtgg tggcttctct aggcggtgct     1740

ggtcctgtgt tcagccatgc gcttgagcat atgtgcacac cgattgcatg gaggggtgtg     1800

ggcgtggtgg ggcatcaaca tcacacctgc ctgcccctgc ccgcccctca aagacctcgc     1860

tgccgcccgc ccgcctgccc gcctccccca cacacatcta tgccaggtgt acgacattgc     1920

ctggggcggc gtcgggatct ttgcctcggt ctccgccgac ggctcggtgc gcgtgtttga     1980

cctcaggtgg gaacagccgg ctcacccgcc agccgggctg cgtgcccgcc tgcccagccg     2040

cttgccctcc ggcgcagccc gtgcacactt ctgcgagctc cgccgccagg ccgcaaaccc     2100

acgcacgctc gtggccccgc cagaaaccca cgcacgcacg ctcacccgct tgcgcatgca     2160

tgcacttatt gttgtttgcc cacagggaca aggagcactc caccatcatt tatgagtccc     2220

cccagcccga cacccccctg ctgcgcctgt cctggaacaa gcaggacccg cgctacatcg     2280

caggtgcggc gcggccggcg cggggcctcg cggtaccatc gtgccccagc cgagctgcct     2340

tgcggctgct caggcccagg cacacaggca ctcaagcaca ccaccacaac caccacagcc     2400

accactttca cacccacgca cacacaagcc gttccttccc tgcgtgcagt gttagccatg     2460

gattcgccgc gggtgacggt actggacatc cggtacccca cgctccccgt ggccgagctg     2520

cagcggcacc aggcgggggt caacgccatc tgctgggccc cccacagcgc cacccacctg     2580

tgctccgcgg gcgacgacag ccaggcgctg atctgggacc tgggcctgct gggcacgctg     2640

gggcagcagc ccgagggcgg cccgccgggc gccgcggcgg cgggtggcgg cctggacccg     2700

atcctggcct acaacgcagg cgccgaggtc aaccagctgc agtggagccc cgcccagccg     2760

gactgggtgg caatctgctt cggcaacaag acccagctgc tgcgggtgtg aggcgcgtgc     2820

cgacagagca ccacaccgcc gcgctgctcg ccggctgcag cgttgctcgc cgctcccctc     2880

cagggcagcg cggcgcccgc gccgttcctg cttcccaagc tgccagcctt cttgcgtccc     2940

ttattcgcct gctccccctg tcttgcttcc gctcctgctg tactgccccc cgcccgcatc     3000

tgtatcaccc gggtgccttt tcttcactgc acacgtacca ccgcatctgt ggcaccctgc     3060

ccctccctaa tgcacggccc tctggcacgc tgccagcccc tctaatgcat gggccctgcc     3120

attcacacca atcgcatcaa ccgtacatct gtccccctgc actgctcttt gtcaccattc     3180

cacctgatgc tctctcctct cccaacccaa ctgatccccc cgtcgcatcc ataccgattt     3240

cgagagacac cttgcaatga aacacgcagc gc                                   3272


<210>  21
<211>  3032
<212>  DNA
<213>  Parachlorella sp.


<220>
<221>  misc_feature
<223>  WD40 repeat protein

<400>  21
atgcagcgcg cagaaatcca tacgtacgag agccccacgt tggtttatgc actcaactgg       60

agtgtaagtc acacaatatg ttgacaagat actgaagcgc gacatgtata ggctgtgtcc      120

tgaagaggca agctttcacc tgctgtaggt gcggcctgac aaacccttca gactagccat      180

cggaagctac atcgaggact acaacaaccg agtggaaatc gtcacacgta tgttcaccgc      240

ttcaggtctt gctgctattg cgttcgtgcc ttcaaagtca gactttcgaa gcaagtttgc      300

ccaccggggg tctctggtgc agtcggtgaa gatggaaatg gaatgcggcc tagcccacgg      360

cacacctttc agcatcccta tccacccacc aaacttcagt ttgtaccaga tcctgatggc      420

tcccggcctg atctgttagc cagctccggt gacttccttc gactctggcg catcacggag      480

gacggggttt ccctggaaaa gcttctcaac aacgtgagcg cgcctgctct gatagcgctg      540

tcctgttgta ccatggacgg ctagcgcaca gcggtagcgc gcagtgcaac gaagacgacc      600

ggggctgacg ctactctgaa tgcaaccacc ttgcctgctg tggtgcagaa caaagcaagc      660

gagttttgcg cgcccttgac cagctttgac tggaatgaga acgaccccaa gagggtgggc      720

actgccagca tcgacaccac ctgcactgta tgggacatag agaagggggt ggtggacacc      780

caggtgggtt gagtggagta gagtggagtg gcggagaacc tggaagagca ggccatgacc      840

agggcatgcg tgctgagcac tcctgccgcc gccgccgtcg cgtcaagtct tggccgttca      900

aagactcgag gtccttccgg tgtgctgctt ggagtcgccc gcgtcctgcc acatcgtgtg      960

cctgctgccg ctttggagcc atgctcccct tgccacgcgc tcacccttgg aaccgtgtgg     1020

cccctgcagc tgattgccca tgacaaggag gtgtacgaca tagcatgggg gggcgtgggg     1080

gtgtttgcct cggtgtcagc agatggctcg gtgcgggtgt ttgacttgag gtaggtgctc     1140

tgcctgcggc cttcgacctg gggctgagtg accgggtggg gccggcctgg aggatggagg     1200

gtggaatatg gtgcagtacg cgaggtacat gatcttggcg gtggctcatc ctggtgtggg     1260

tcgctggtag aagaagcggc gtggaacagg tgtaggtgtg gggcttggag cagtgaggtg     1320

caagccagtt gtagagtatg ccccagcagc ctccccaaag ggccctcagc ccgcagcacc     1380

ctgccagcag gagcttcccc agttgctgcg ccaggggcag cgcatgctcg ggcgcacagc     1440

tgtgtgtgtg cacgacagag ccagctcctg gtgtggagct ggcggtgggc acgtgtggga     1500

gggctctgtt ggggtcccat gcgttagaca gggcatggca ggggtctggg cgtgtgggca     1560

ggtcgtgctt ctgcctcatg cgccctccac ccaccctcac cccgctgctg cctcatacgc     1620

cccgcaccca ccccctcccc ccttcctgcc gctgcctcat gcgccctcca ccaacattcc     1680

ctgcagggat aaggagcact cgaccatcat ctatgagtcg ccgcagcccg acacccccct     1740

gctgcgcctg gcctggaaca agcaggaccc gcggtacatg gccaccactg ccctcaactc     1800

ctccgccatt gtggtgctgg acatccgatt ccccacggtg ccggtggtgg agctgtccaa     1860

gcaccaggtg cgatatggca gcgggggtgc agtcgctgcg gggggtgcct gcaatggggc     1920

agctgctatc tgccgccttg ctgtgagctg gcctggcgct gggagccagg agtgctcacc     1980

ctgctgcgag cgctgccacg ggttggctgc atcgcacagg ctgaggcccc cgggctgggc     2040

tggaagggcg ggtgcctcac cggctcaggc gcccacgagg atcgccgatt tcctgccctg     2100

gcctggtgat catgccgctg gcggcggtgc ggtggcgtgc gcagcatgcg gctgcgcact     2160

gcattggcca gccacgctgc tcgtgccatg gtgtgcatgc agcccgcacc agccaaagcg     2220

tgctcgcgct gcccgctgct gaacagacct ctcaacgcgc tccctccctc tgtggctagt     2280

gtgccccagc accaggtggc agctggagcg cgcccagcat ggtgcagcaa caggaggaaa     2340

gccgccgccc ctgggcctgc gggcaatttt agcgcgttgc gctgcgctgc attcctcgca     2400

acgtgcacgt caccgaagcg tgtatctccc cccttcccca ctgcctgcag gccgcctgca     2460

atgctgtggc ctgggccccc caaagtgcca accacatctg cagtgccggg gacgattgcc     2520

aggtggggcg cgctggaagg ggggaaggaa agggggaggg gggtcctgtg aactgggcag     2580

aatgctggcc tttctaataa accgccacca acaagctggg gctgtgcttg gggagggcgg     2640

gggcagtcgt cccccgcggc acgggggccg acacccttgc cttcccttgg cgagtattcg     2700

agcacgggac cccttcctcc cctcctgtgc atctcatgtg atgtactcgc cgagccctct     2760

gcccctgctg cgactgtcgc gcagctgcaa ggccacgcgc caagcgcatt gcggcgctgt     2820

ccctccgcgc ctctgtgccg tgcgtgcagg ctttgatctg ggacctgtcc actctggggg     2880

agggcggcgc gggccaggcg gggagccccc ccctggaccc catcctgtcc tacatggcgg     2940

gggcggaggt gaaccagctg cagtggtcgg cgtcccaccc cgactgggtg gccatctgct     3000

ttgggaacaa gacgcagatt ttgagggtgt ga                                   3032


<210>  22
<211>  1045
<212>  DNA
<213>  Picochlorum sp.


<220>
<221>  misc_feature
<223>  WD40 repeat protein

<400>  22
atgtcgactg attggttggt tgtaattgtg cagtctaggc aagataaggg tgtacgagta       60

gctgttggca gttttgtgga gggtgtctcg aacacagtcg agattcttcg cggtacggat      120

catctgtgag tctctgttct acttgggaaa tgtgggcata attttttgta tatgaggtgc      180

agtgactccc gcaggtttaa ttgtggatga caaggaaacg ttcggcatag agtatccggc      240

gacgcaggtg ggatttattc ctgataggtt ttgcaacaag ccagatttgt tggcaacctc      300

tggggatgct gttaggttgt ggaaaatttc agacgcaggg acgacgcttg aactggtgtt      360

gaatgatcca aagaatacct ctaaaaattt cagtgcggtg acctgctttg attggagtga      420

aatcaatgtg aaagtgttgg cggcagggtc gagtgcaggg cgattattgc tgtgggacac      480

cgagtcaggg aggctgcagg gcacaatggt gggacatgag gatgagattc ttgattgtca      540

gtgggcagct aatgatgtga ttgtttcttc ttcgggcgat ggatcaattc ggatgtatga      600

cctgcgggat aaagaccatt gcacggtatt gtatgagacc cccaggagga cccctgtgcc      660

gaggttttgt tggaacaagc tggatccgag gcatcttgca ttttctatag aaaagagtcg      720

gcttgttagt gttctcgatg ttcgctttcc gacagagccg gtgatcttgc tggacggtca      780

tatggggaac tgtacagcac ttggttggtc ccctcacaga gaggaatacc tctgctcggt      840

tggagatgat tgccatgcat tgatatggga cgtggggaag gtgaatagtg aggaggatag      900

taagccaaat cgagaggcgg tggacgcatc tcctatccta gcgtacaatg ctcaggcaga      960

gatcaatgcg atggcttgga atccaataga cccagattgg attgccattt gcgctagaaa     1020

cagaacacaa gtattgagaa tatga                                           1045


<210>  23
<211>  2857
<212>  DNA
<213>  Tetraselmis sp.


<220>
<221>  misc_feature
<223>  WD40 repeat protein

<400>  23
atgccgacga gcgacgcgac gcaggcacat gagcaccatc acgtgccgca cgcgcggccg       60

cgacccccgc caagaacgtt gtctgcatgt gccatgttga tgagcaagga gaaacggcgg      120

cgccacgcag acaagagctg gccgattttt tggccagtgt gtgtgtggct agctttcgcg      180

agctccacca agaaagcctc agcgtgcggg ccagctcgct ttgcataatg cttcggcaat      240

ttgattgttg actagccgct acagtacatg tacaacggag cgtcgcctac gggtatacgg      300

agggcttgcc gtaactgtgg atcgctgcct attaaacggc cgtcgtaaag atagacgctg      360

caagggtttc actggtgtca gtcctgtccc gggggcggga aaaccaaaac cctggtcgta      420

aatgagacgt ggcaaagttt caccggcgcc agtcgcgtat ccaggaaaaa aaaacccgcc      480

ccctcgcact cttcgtggcc acaggtagag gactgccgaa gtgatcaccc cacagcaaag      540

cccaccgtcg tcagcgctac ccgttacgtg cgggtactca gactctcgcc gcgacaccca      600

cgcaacaagc caacaacgcc gccccgcttg cggatcgctt cacgcttggg ctggtgcgga      660

accttgttca gggaccttgt tcccgccgac aaattattgc tgctcggctc tcctgcactc      720

cgacagcgtg gcaccaggct ggcctgttca ccgcccgggg ccggtgtggg ttgtggccca      780

cgcgagtggc cactcgggtt cctttcagcc tacgctgggc gtaaagccct taccagcatg      840

cttgtcctgc acgcgccccg tgctgccacg ggctgacgca ctgatcctgc cgtgttgcgc      900

ttgtggcggc aggctcggtg ctgtgccccg ggcgacgcgc gcctgggtgg gctagcttgc      960

gatggcgagc gggccggagg accggggtgc gggggcggcg ggggcggcgc cccaccaacg     1020

aggcgatagc aacggcaaag cggtgacaga caagcgcggg gagatataca cctacgaggc     1080

gccgtacccg gtatacggga tgaactggag tgtgcgtgcg ccggacatgg ccaagggggg     1140

ccaggggagg cccccccggg gggggggggg ggggaggagt tacttggtac aagcagactt     1200

tggccccgtg gctgaggggg tcgagtgttg caggtgcggg aggatatgaa gttccgcctc     1260

gcggtgggaa gctttgtaga ggacgtggag aatgcggtgg agctcatccg gcgtgagtgc     1320

ccgagcgcgt acgcaggcct gcctgctgtg tgcagggagc gcgcgggcat ccgcctgatg     1380

acgtgctgtg tgcacagtga acgaggaaac cggcaagttt gagagcaacc cggcgcacaa     1440

gtttgtgcac ccgtatccac ccaccaaaat aatgtttatc cccgaccgcg actgctcgcg     1500

ccctgacctg ctcgccacca cgggcgatta tctgcggctg tggcgcgtgg aggaggacgg     1560

cgtcacgctg cacaagctgc tgacaaatgt gagcgcggac aaacctttgt gcccgccccc     1620

cccccaccac caccagtcct ccttccctct aaggcccatt ctcaagagat accacggcca     1680

gctccagcat gacccccgcc cccctatttc gtcaacatgc acctcccccc tgcagaacaa     1740

aaacagtgag ttttgcgcgc cattaacatc ctttgactgg aacgaagccg acccgaggca     1800

actgggcacc tcatccatcg acacaacatg cacaatatgg gacatcgagg tgggccaggc     1860

agtccaagcc cccccccccc cccccccccc gcaaatgccc tccttgcgct gacatgtcaa     1920

aacgcctgcc tcgtggtgcg tccagagagg cgttgttgac acgcagctta tcgctcacga     1980

caaggaggtg tacgacattg cgtggggcgg gcagggcgtc tttgccagcg tgtccgcaga     2040

cggctctgtg cgagtgtttg acctccggtg cgccctgcgg tacctgcggc cgcgcgcgac     2100

cccacctggg cgccgtggct ggcttgcggg gggggggggg ggggggctga cgcgccggcg     2160

ggctgttcgg caccgcaggg acaaggacca ctcaaccatc atctacgaga gcgggatgcc     2220

cgagatcccg ctgctgcggc tgggctggaa caagcaggac ccgcgctaca tggccaccat     2280

cctcatggac tcctccaagg tggtcgtcct ggacatcagg tccgccgccc ttgcctgcca     2340

tcacgcaaca tatactgggg gtgtgtctgg cggcgctgac ctgtgatgct gcgccgccgt     2400

gcctgctgca ggtaccccac gatgcctgta gctgagctgg aggcgcatca caagcctgtg     2460

aatgccctgg cgtgggcccc gcagtcctcc tcgcacatct gcactgcggg ggacgacgca     2520

caggtgtggc gggatgcatg ctctgatgca tcacaggaga cagagcgaca gggggggggt     2580

gagcaggggg gggggggggg ggaagggggg atgcccgggt gaggcagatg gtggctgact     2640

gttgcatgct gccgcccagg cgctcatctg gaaccttgcg cccatgggca cccaggggcc     2700

catggggggt gctgcgcctg cagttctagg cgcggacctg gatcccatcc tagcgtacaa     2760

cgccggcgag gaaatcaatc agctgcagtg gtccagcacg cagtccgact gggtgggcat     2820

atcctttggc aacaagatcc agattttgag aatctag                              2857


<210>  24
<211>  1431
<212>  DNA
<213>  Ostreococcus sp.


<220>
<221>  misc_feature
<223>  WD40 repeat protein

<400>  24
cggtcgacgc gctcgacgcg gtcgacgcga gcgaggcgcg cgcttcgagc gacgcgaagg       60

cgtcggacgc gaacgcgaag gcgtcgagcg cgaacgcggg acccgacggg cggtgaaggc      120

gcgcgacgaa agacgaggaa ggcgcgcgcg aacgatgaac gcggagaaga gggcggaaat      180

atacacctat gaggcgccgt ggatgatcta cgcgtgcaat tggagcgtgc gtggcgagcg      240

aggcgatgga ttgggggcga gcgcgggaga attgaatcgc gaggggcgac ggaggagacg      300

cgacggagga gactcgggga cgcgcgcgaa cggtcgatcg gagattaaaa atggagacgc      360

gcgagtgaag acgcgaatgg cgtggactga cgacgtcgaa ttgaacgcga caggttcgac      420

aagataaacg cttccgcctc gccttgggtt cgttcgtgga ggagtatagc aacaaggttg      480

agatcatcac cttggacgag gaaaccgggg agtttccgaa ggaggcgcag tgttcgttca      540

cgcatccgta tccttgcacg aaaattttgt tcattccgga caaggagtgc acgaaggagg      600

atttgttagc gacgacgggg gactacttgc gaatctggca agtgcaggat gataacacgg      660

tgcagatgaa atctttactg aataataaca agagcagcga attttgcgca ccgctgacga      720

gctttgattg gaacgagacc aagcttcagc gagtggggac gtcgtcgatc gacacgacgt      780

gtacgatttg ggacatcgag cgcgagtgcg tggacacgca gctcatcgcg catgataagg      840

aggtgtacga catcgcgtgg ggtggtccag aggttttcgc tagcgtaagt gcggatggaa      900

gtgtgcgagt tttcgacttg agagacaagg atcacagtac gatcatttac gagagtcaaa      960

ctccagacac gccgctgctg cgtttggggt ggaacaagca ggatccgaga tacatggcca     1020

ccatttgcat ggatagtccg gtgatcattc tcgatattcg cttcccgacg ttgccggtcg     1080

cagaacttca gagtcacaga gcgagcgtga atacattggc gtgggcgcca cacagctcaa     1140

gccacatgtg cacggcgggc gacgacagtc aggcgttgat ttgggatttg tcgtccatga     1200

atcaaccacc cgaaggcggt ctcgacccta ttctcgctta ctctgctgga gcagaaatca     1260

atcagttaca gtggagcgcg tcgcaaccgg attggatctc gatagctttc cgaaacagcc     1320

tccagatcct ccgagtttag tcaacgcgct gtcaggtctg cgccgacgcc actgtatatt     1380

acccgaattt ccggatacgc gacacacgac acacgacacg cacgcacgta g              1431


<210>  25
<211>  1294
<212>  DNA
<213>  Micromonas commoda


<220>
<221>  misc_feature
<223>  WD40 repeat protein

<400>  25
atggcggcca tgggcagcgg ccagagcggc gccgagattt acacgtacga ggcgccatgg       60

ctcgtgtacg cgatgaattg gagtgtgagt gcccgtcgat gactctgctc cgtcccgccg      120

cgttcctccc cgcccgggcc gatccctcgc ctgcacccaa tctgacccgg caagatccgc      180

tgctgacgcg actttgagga cgcgcccggc agtcgaccga cgcgccccgc cccgcgacgt      240

gacccgctga cgcttcactc gatataaacc tccccctccc cgcgcgcgca ttcaacaggt      300

gaggcaggac aagaggttcc gcctcgcgct cgggtcgttc gtggaggagt acagcaacaa      360

ggttgagatc atcacgctgg acgagcagcg acgggagttt ccggcggagc cgacgcacag      420

gttcgaccac ccgtacccgt gcacgaagat catgttcgtc ccagacgccg agggaaccag      480

cgaggactta ctggccacga gcggcgacta tctgcgggtt tggcgcatag gcgacgacgg      540

cgtgcacctg cggagcctcc tgaacaacaa caagaacagc gacttttgcg cgccgctcac      600

gtcgttcgac tggagcacca ccaacctggc gagggtgggc accagcagtt tggacaccac      660

gtgcaccatc tgggacctgg agaaggagac ggttgactcg cagctcatcg cgcacgacaa      720

ggaggtgtac gacatcgcgt ggggcgggcc ggaggttttc gcgagcgtct ccgccgacgg      780

gagcgtcagg gtgttcgacc tgcgggataa ggaccacagc acgatcgtct acgagtcccc      840

gacgccggac acgccgctgc tgaggttggg ttggaacaag cagaacccga ggtacatggc      900

gacgatggag atggacagcg ccaaggttgt ggtgctggac attcgcgtgc ccgcgctgcc      960

ggtggcggag ctgaagaagc acagagccgc ggtgaacacg ctggcgtggg cgccgcacag     1020

ctcgaggaac atatgcaccg ccggggacga cgcgcaggcg ctcatttggg acctgtcgtc     1080

ggtggcgcag cccggggagg acgggatgga tccgatgctg gcgtacaacg cgggggcgga     1140

gatcagtcag ctgcagtgga gcgcgacgca aaccgactgg atagccatcg cattcggcaa     1200

aaacctgcag gtgcttcacg tgtgacgccc gcggggagaa cgtggcgatc gtagtcctag     1260

ttcggttttg aattcaacgt tcatttagca ctca                                 1294


<210>  26
<211>  1041
<212>  DNA
<213>  Arabidopsis sp.


<220>
<221>  misc_feature
<223>  WD40 repeat protein

<400>  26
atgggaacga gcagcgatcc gattcaagat ggttccgatg agcagcagaa gcgatcagag       60

atctatacat acgaagcgcc atggcacatc tacgcaatga attggagcgt tcgtcgcgat      120

aagaagtatc gtctcgccat cactagcctc ctcgagcaat acccgaaccg tgtcgagatt      180

gtgcagctcg atgaatccaa tggtgagatc cgttccgatc ctaacctctc ctttgagcat      240

ccttatccac caacgaagac cattttcata cctgacaagg aatgccaaag acctgatctt      300

ctcgctactt caagtgattt ccttcgttta tggagaatcg ctgatgatca ttcccgtgtt      360

gagctcaaat cttgtctcaa tagcaataag aacagtgagt tttgtggtcc tcttacttct      420

tttgattgga atgaagctga gccacgtcga attggaacat ctagtactga tacgacttgt      480

actatctggg acattgagcg tgaagctgtt gatactcagc ttattgctca tgataaggaa      540

gtttttgata ttgcttgggg tggtgttggt gtttttgcat ctgtttcagc tgatggctcc      600

gttagggtgt ttgatcttcg tgataaggaa cattcgacga ttatctatga gagctccgag      660

cctgatactc ctttagtgcg tcttggttgg aacaaacagg atcctaggta catggctact      720

attatcatgg acagtgctaa agttgtggtg cttgacattc gttttccggc tcttcctgtg      780

gttgagcttc aacgacatca agctagtgtc aatgccattg cttgggctcc tcatagctct      840

tgtcacattt gtactgctgg agatgattct caagctttga tttgggatat ttcatccatg      900

ggacagcatg ttgaaggtgg tcttgaccct attctagctt acactgctgg tgctgagatt      960

gagcagcttc agtggtcctc ttctcagcct gattgggtcg caattgcttt ctctactaag     1020

ctgcaaattc tcagggtttg a                                               1041


