                         SEQUENCE LISTING

<110>  E.I. duPont de Nemours and Company, Inc.
       Yadav, Narendra  S.
 
<120>  DELTA-6 DESATURASES AND THEIR USE IN MAKING POLYUNSATURATED FATTY
        ACIDS

<130>  CL4033

<150>  US 61/085,482
<151>  2008-08-01

<160>  53    

<170>  PatentIn version 3.5

<210>  1
<211>  1416
<212>  DNA
<213>  Porphyridium cruentum


<220>
<221>  CDS
<222>  (1)..(1416)
<223>  delta-6 desaturase

<400>  1
atg gcg ccg aat gtg gac tcc gga agc aag gac cgc ggc gtg agc gcg         48
Met Ala Pro Asn Val Asp Ser Gly Ser Lys Asp Arg Gly Val Ser Ala           
1               5                   10                  15                

gtc aaa gaa gta gtc tct ggc gcg acg gcc aac gcg ctg agt ccg gcc         96
Val Lys Glu Val Val Ser Gly Ala Thr Ala Asn Ala Leu Ser Pro Ala           
            20                  25                  30                    

gag cgc gtg gtg acc agg aag gag ctc gcg ggg cac gcc tca agg gag        144
Glu Arg Val Val Thr Arg Lys Glu Leu Ala Gly His Ala Ser Arg Glu           
        35                  40                  45                        

tcg gtg tgg att gcg gtg aac ggc cgt gtg tac gat gtg acc ggc ttt        192
Ser Val Trp Ile Ala Val Asn Gly Arg Val Tyr Asp Val Thr Gly Phe           
    50                  55                  60                            

gag aac gtt cac cct ggc ggc gag atc att ctg acc gcc gcc ggg cag        240
Glu Asn Val His Pro Gly Gly Glu Ile Ile Leu Thr Ala Ala Gly Gln           
65                  70                  75                  80            

gac gca acg gac gtg ttt gcc gcg ttt cac acg ccc gcc acg tgg aaa        288
Asp Ala Thr Asp Val Phe Ala Ala Phe His Thr Pro Ala Thr Trp Lys           
                85                  90                  95                

atg atg ccg cag ttc ctc gtg gga aac ctc gag gag gac gcg ctc tct        336
Met Met Pro Gln Phe Leu Val Gly Asn Leu Glu Glu Asp Ala Leu Ser           
            100                 105                 110                   

gcc aaa ccg tct aag cag ctt aat ggg cat tcg cca cac gag tac caa        384
Ala Lys Pro Ser Lys Gln Leu Asn Gly His Ser Pro His Glu Tyr Gln           
        115                 120                 125                       

gct gat atc cga aag atg cgt gcg gaa ctt gtc aag ctg cgc gcg ttc        432
Ala Asp Ile Arg Lys Met Arg Ala Glu Leu Val Lys Leu Arg Ala Phe           
    130                 135                 140                           

gac tcg aac aag ttc ttc tac ctg ttc aag ttc ctg tcc acg tct gcg        480
Asp Ser Asn Lys Phe Phe Tyr Leu Phe Lys Phe Leu Ser Thr Ser Ala           
145                 150                 155                 160           

att tgc gcc ctc tcg gtg gtc atg gcg ctc ggc atg aag gac tcg atg        528
Ile Cys Ala Leu Ser Val Val Met Ala Leu Gly Met Lys Asp Ser Met           
                165                 170                 175               

atc gtc acg gcg ctc gcc gcg ttc acc atg gca ctc ttc tgg cag cag        576
Ile Val Thr Ala Leu Ala Ala Phe Thr Met Ala Leu Phe Trp Gln Gln           
            180                 185                 190                   

tgc ggc tgg ctc gca cac gac ttt ctg cac cat cag gtg ttc aag aac        624
Cys Gly Trp Leu Ala His Asp Phe Leu His His Gln Val Phe Lys Asn           
        195                 200                 205                       

agg gtg ttc aac aac ctg gtc ggt ctt gtt gtt ggt aat gtc tat cag        672
Arg Val Phe Asn Asn Leu Val Gly Leu Val Val Gly Asn Val Tyr Gln           
    210                 215                 220                           

ggc ttt tcg gta tcc tgg tgg aag atg aag cac aac cac cac cac gcc        720
Gly Phe Ser Val Ser Trp Trp Lys Met Lys His Asn His His His Ala           
225                 230                 235                 240           

gct cca aac gtg acg tca acg gcc gct ggg cca gac cca gac atc gac        768
Ala Pro Asn Val Thr Ser Thr Ala Ala Gly Pro Asp Pro Asp Ile Asp           
                245                 250                 255               

act gtg ccc gtg ctc ttg tgg agc gag aaa ctc atc gag ggt gat agc        816
Thr Val Pro Val Leu Leu Trp Ser Glu Lys Leu Ile Glu Gly Asp Ser           
            260                 265                 270                   

aag gag atg gag gat ctg ccc atg ttc ctc atg aag aac cag aag atc        864
Lys Glu Met Glu Asp Leu Pro Met Phe Leu Met Lys Asn Gln Lys Ile           
        275                 280                 285                       

ttt tac tgg ccg gtt ctg tgc gtg gcg cgc atc agc tgg ctc ctg cag        912
Phe Tyr Trp Pro Val Leu Cys Val Ala Arg Ile Ser Trp Leu Leu Gln           
    290                 295                 300                           

agc ctt ctc ttc cag cgc gcg ccg gtc tgg aac ttt gtg ggc gga aac        960
Ser Leu Leu Phe Gln Arg Ala Pro Val Trp Asn Phe Val Gly Gly Asn           
305                 310                 315                 320           

agc tgg cgc gcg gtg gag atc gtc gcg ctt ctc atg cat cac ggc gcc       1008
Ser Trp Arg Ala Val Glu Ile Val Ala Leu Leu Met His His Gly Ala           
                325                 330                 335               

tac ttc tac ttg ctg tcc ttg ctc aag agc tgg gtc cat gtc gcg ctc       1056
Tyr Phe Tyr Leu Leu Ser Leu Leu Lys Ser Trp Val His Val Ala Leu           
            340                 345                 350                   

ttt ttg gtg gtg agc cag gcg atg ggt ggt gtg cta ctc ggc gtc gtg       1104
Phe Leu Val Val Ser Gln Ala Met Gly Gly Val Leu Leu Gly Val Val           
        355                 360                 365                       

ttc acc gtc ggg cac aac gcg atg aaa gtc ctc tcc gag gaa gaa atg       1152
Phe Thr Val Gly His Asn Ala Met Lys Val Leu Ser Glu Glu Glu Met           
    370                 375                 380                           

aag tca acc gac ttt gtc cag atg cag gtc ctg acg acg aga aat att       1200
Lys Ser Thr Asp Phe Val Gln Met Gln Val Leu Thr Thr Arg Asn Ile           
385                 390                 395                 400           

gag ccg acg gct ttc aat cgg tgg ttc agc ggt ggc ctc agc tac cag       1248
Glu Pro Thr Ala Phe Asn Arg Trp Phe Ser Gly Gly Leu Ser Tyr Gln           
                405                 410                 415               

att gag cac cac atc tgg cct cag ctg ccc cga cac agc tta ccc aag       1296
Ile Glu His His Ile Trp Pro Gln Leu Pro Arg His Ser Leu Pro Lys           
            420                 425                 430                   

gcg cgc gaa att ctc acc aag ttt tgc agc aag tat gat att ccg tac       1344
Ala Arg Glu Ile Leu Thr Lys Phe Cys Ser Lys Tyr Asp Ile Pro Tyr           
        435                 440                 445                       

gcc agt caa ggc ctc att gaa ggt aac atg gaa gtg tgg aaa atg ctc       1392
Ala Ser Gln Gly Leu Ile Glu Gly Asn Met Glu Val Trp Lys Met Leu           
    450                 455                 460                           

tcg aag ctt ggg gaa tcc cta tag                                       1416
Ser Lys Leu Gly Glu Ser Leu                                               
465                 470                                                   


<210>  2
<211>  471
<212>  PRT
<213>  Porphyridium cruentum

<400>  2

Met Ala Pro Asn Val Asp Ser Gly Ser Lys Asp Arg Gly Val Ser Ala 
1               5                   10                  15      


Val Lys Glu Val Val Ser Gly Ala Thr Ala Asn Ala Leu Ser Pro Ala 
            20                  25                  30          


Glu Arg Val Val Thr Arg Lys Glu Leu Ala Gly His Ala Ser Arg Glu 
        35                  40                  45              


Ser Val Trp Ile Ala Val Asn Gly Arg Val Tyr Asp Val Thr Gly Phe 
    50                  55                  60                  


Glu Asn Val His Pro Gly Gly Glu Ile Ile Leu Thr Ala Ala Gly Gln 
65                  70                  75                  80  


Asp Ala Thr Asp Val Phe Ala Ala Phe His Thr Pro Ala Thr Trp Lys 
                85                  90                  95      


Met Met Pro Gln Phe Leu Val Gly Asn Leu Glu Glu Asp Ala Leu Ser 
            100                 105                 110         


Ala Lys Pro Ser Lys Gln Leu Asn Gly His Ser Pro His Glu Tyr Gln 
        115                 120                 125             


Ala Asp Ile Arg Lys Met Arg Ala Glu Leu Val Lys Leu Arg Ala Phe 
    130                 135                 140                 


Asp Ser Asn Lys Phe Phe Tyr Leu Phe Lys Phe Leu Ser Thr Ser Ala 
145                 150                 155                 160 


Ile Cys Ala Leu Ser Val Val Met Ala Leu Gly Met Lys Asp Ser Met 
                165                 170                 175     


Ile Val Thr Ala Leu Ala Ala Phe Thr Met Ala Leu Phe Trp Gln Gln 
            180                 185                 190         


Cys Gly Trp Leu Ala His Asp Phe Leu His His Gln Val Phe Lys Asn 
        195                 200                 205             


Arg Val Phe Asn Asn Leu Val Gly Leu Val Val Gly Asn Val Tyr Gln 
    210                 215                 220                 


Gly Phe Ser Val Ser Trp Trp Lys Met Lys His Asn His His His Ala 
225                 230                 235                 240 


Ala Pro Asn Val Thr Ser Thr Ala Ala Gly Pro Asp Pro Asp Ile Asp 
                245                 250                 255     


Thr Val Pro Val Leu Leu Trp Ser Glu Lys Leu Ile Glu Gly Asp Ser 
            260                 265                 270         


Lys Glu Met Glu Asp Leu Pro Met Phe Leu Met Lys Asn Gln Lys Ile 
        275                 280                 285             


Phe Tyr Trp Pro Val Leu Cys Val Ala Arg Ile Ser Trp Leu Leu Gln 
    290                 295                 300                 


Ser Leu Leu Phe Gln Arg Ala Pro Val Trp Asn Phe Val Gly Gly Asn 
305                 310                 315                 320 


Ser Trp Arg Ala Val Glu Ile Val Ala Leu Leu Met His His Gly Ala 
                325                 330                 335     


Tyr Phe Tyr Leu Leu Ser Leu Leu Lys Ser Trp Val His Val Ala Leu 
            340                 345                 350         


Phe Leu Val Val Ser Gln Ala Met Gly Gly Val Leu Leu Gly Val Val 
        355                 360                 365             


Phe Thr Val Gly His Asn Ala Met Lys Val Leu Ser Glu Glu Glu Met 
    370                 375                 380                 


Lys Ser Thr Asp Phe Val Gln Met Gln Val Leu Thr Thr Arg Asn Ile 
385                 390                 395                 400 


Glu Pro Thr Ala Phe Asn Arg Trp Phe Ser Gly Gly Leu Ser Tyr Gln 
                405                 410                 415     


Ile Glu His His Ile Trp Pro Gln Leu Pro Arg His Ser Leu Pro Lys 
            420                 425                 430         


Ala Arg Glu Ile Leu Thr Lys Phe Cys Ser Lys Tyr Asp Ile Pro Tyr 
        435                 440                 445             


Ala Ser Gln Gly Leu Ile Glu Gly Asn Met Glu Val Trp Lys Met Leu 
    450                 455                 460                 


Ser Lys Leu Gly Glu Ser Leu 
465                 470     


<210>  3
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  His-rich motif


<220>
<221>  misc_feature
<222>  (2)..(4)
<223>  Xaa can be any naturally occurring amino acid

<400>  3

His Xaa Xaa Xaa His 
1               5   


<210>  4
<211>  6
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  His-rich motif


<220>
<221>  misc_feature
<222>  (2)..(5)
<223>  Xaa can be any naturally occurring amino acid

<400>  4

His Xaa Xaa Xaa Xaa His 
1               5       


<210>  5
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  His-rich motif


<220>
<221>  misc_feature
<222>  (2)..(3)
<223>  Xaa can be any naturally occurring amino acid

<400>  5

His Xaa Xaa His His 
1               5   


<210>  6
<211>  6
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  His-rich motif


<220>
<221>  misc_feature
<222>  (2)..(4)
<223>  Xaa can be any naturally occurring amino acid

<400>  6

His Xaa Xaa Xaa His His 
1               5       


<210>  7
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  His-rich motif


<220>
<221>  MISC_FEATURE
<222>  (1)..(1)
<223>  Xaa = His [H] or Gln [Q]

<220>
<221>  misc_feature
<222>  (2)..(3)
<223>  Xaa can be any naturally occurring amino acid

<400>  7

Xaa Xaa Xaa His His 
1               5   


<210>  8
<211>  6
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  His-rich motif


<220>
<221>  MISC_FEATURE
<222>  (1)..(1)
<223>  Xaa = His [H] or Gln [Q]

<220>
<221>  misc_feature
<222>  (2)..(4)
<223>  Xaa can be any naturally occurring amino acid

<400>  8

Xaa Xaa Xaa Xaa His His 
1               5       


<210>  9
<211>  39
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  SMART IV oligonucleotide

<400>  9
aagcagtggt atcaacgcag agtggccatt acggccggg                              39


<210>  10
<211>  59
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  CDSIII/3'PCR primer


<220>
<221>  misc_feature
<222>  (28)..(57)
<223>  thymidine (dT); see BD Biosciences Clontech's SMART cDNA 
       technology

<220>
<221>  misc_feature
<222>  (59)..(59)
<223>  n is a, c, g, or t

<400>  10
attctagagg ccgaggcggc cgacatgttt tttttttttt tttttttttt tttttttvn        59


<210>  11
<211>  23
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  5'-PCR primer

<400>  11
aagcagtggt atcaacgcag agt                                               23


<210>  12
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 523


<220>
<221>  misc_feature
<222>  (21)..(21)
<223>  n is a, c, g, or t

<400>  12
tggcagcaga tgggctggyt nagycayga                                         29


<210>  13
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 524


<220>
<221>  misc_feature
<222>  (21)..(21)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (24)..(24)
<223>  n is a, c, g, or t

<400>  13
tggcagcaga tgggctggyt ntcncayga                                         29


<210>  14
<211>  29
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 525


<220>
<221>  misc_feature
<222>  (21)..(21)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (24)..(24)
<223>  n is a, c, g, or t

<400>  14
tggcagcaga tgggctggyt ngcncayga                                         29


<210>  15
<211>  10
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  conserved amino acid sequence


<220>
<221>  MISC_FEATURE
<222>  (8)..(8)
<223>  Xaa = Ser (S) or Ala (A)

<400>  15

Trp Gln Gln Met Gly Trp Leu Xaa His Asp 
1               5                   10  


<210>  16
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 526


<220>
<221>  misc_feature
<222>  (24)..(24)
<223>  n is a, c, g, or t

<400>  16
ttatggcgcg gcatcgtcgg raanarrtgr tg                                     32


<210>  17
<211>  32
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 527


<220>
<221>  misc_feature
<222>  (24)..(24)
<223>  n is a, c, g, or t

<400>  17
ttatggcgcg gcagcgacgg ccanarrtgr tg                                     32


<210>  18
<211>  11
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  conserved amino acid sequence


<220>
<221>  MISC_FEATURE
<222>  (4)..(4)
<223>  Xaa = Trp (W) or Phe (F)

<220>
<221>  MISC_FEATURE
<222>  (6)..(6)
<223>  Xaa = Thr (T) or Ser (S)

<220>
<221>  MISC_FEATURE
<222>  (7)..(7)
<223>  Xaa = Met (M) or Leu (L)

<400>  18

His His Leu Xaa Pro Xaa Xaa Pro Arg His Asn 
1               5                   10      


<210>  19
<211>  30
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 528


<220>
<221>  misc_feature
<222>  (22)..(22)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (25)..(25)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (28)..(28)
<223>  n is a, c, g, or t

<400>  19
gtggtgctcg atctggtart tnarnccncc                                        30


<210>  20
<211>  30
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 529


<220>
<221>  misc_feature
<222>  (22)..(22)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (25)..(25)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (28)..(28)
<223>  n is a, c, g, or t

<400>  20
gtggtgctcg atctggtart gnarnccncc                                        30


<210>  21
<211>  10
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  conserved amino acid sequence


<220>
<221>  MISC_FEATURE
<222>  (4)..(4)
<223>  Xaa = Asn (N) or His (H)

<400>  21

Gly Gly Leu Xaa Tyr Gln Ile Glu His His 
1               5                   10  


<210>  22
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer T3

<400>  22
attaaccctc actaaaggga                                                   20


<210>  23
<211>  19
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer T7

<400>  23
ggaaacagct atgaccatg                                                    19


<210>  24
<211>  457
<212>  PRT
<213>  Mortierella alpina

<400>  24

Met Ala Ala Ala Pro Ser Val Arg Thr Phe Thr Arg Ala Glu Val Leu 
1               5                   10                  15      


Asn Ala Glu Ala Leu Asn Glu Gly Lys Lys Asp Ala Glu Ala Pro Phe 
            20                  25                  30          


Leu Met Ile Ile Asp Asn Lys Val Tyr Asp Val Arg Glu Phe Val Pro 
        35                  40                  45              


Asp His Pro Gly Gly Ser Val Ile Leu Thr His Val Gly Lys Asp Gly 
    50                  55                  60                  


Thr Asp Val Phe Asp Thr Phe His Pro Glu Ala Ala Trp Glu Thr Leu 
65                  70                  75                  80  


Ala Asn Phe Tyr Val Gly Asp Ile Asp Glu Ser Asp Arg Asp Ile Lys 
                85                  90                  95      


Asn Asp Asp Phe Ala Ala Glu Val Arg Lys Leu Arg Thr Leu Phe Gln 
            100                 105                 110         


Ser Leu Gly Tyr Tyr Asp Ser Ser Lys Ala Tyr Tyr Ala Phe Lys Val 
        115                 120                 125             


Ser Phe Asn Leu Cys Ile Trp Gly Leu Ser Thr Val Ile Val Ala Lys 
    130                 135                 140                 


Trp Gly Gln Thr Ser Thr Leu Ala Asn Val Leu Ser Ala Ala Leu Leu 
145                 150                 155                 160 


Gly Leu Phe Trp Gln Gln Cys Gly Trp Leu Ala His Asp Phe Leu His 
                165                 170                 175     


His Gln Val Phe Gln Asp Arg Phe Trp Gly Asp Leu Phe Gly Ala Phe 
            180                 185                 190         


Leu Gly Gly Val Cys Gln Gly Phe Ser Ser Ser Trp Trp Lys Asp Lys 
        195                 200                 205             


His Asn Thr His His Ala Ala Pro Asn Val His Gly Glu Asp Pro Asp 
    210                 215                 220                 


Ile Asp Thr His Pro Leu Leu Thr Trp Ser Glu His Ala Leu Glu Met 
225                 230                 235                 240 


Phe Ser Asp Val Pro Asp Glu Glu Leu Thr Arg Met Trp Ser Arg Phe 
                245                 250                 255     


Met Val Leu Asn Gln Thr Trp Phe Tyr Phe Pro Ile Leu Ser Phe Ala 
            260                 265                 270         


Arg Leu Ser Trp Cys Leu Gln Ser Ile Leu Phe Val Leu Pro Asn Gly 
        275                 280                 285             


Gln Ala His Lys Pro Ser Gly Ala Arg Val Pro Ile Ser Leu Val Glu 
    290                 295                 300                 


Gln Leu Ser Leu Ala Met His Trp Thr Trp Tyr Leu Ala Thr Met Phe 
305                 310                 315                 320 


Leu Phe Ile Lys Asp Pro Val Asn Met Leu Val Tyr Phe Leu Val Ser 
                325                 330                 335     


Gln Ala Val Cys Gly Asn Leu Leu Ala Ile Val Phe Ser Leu Asn His 
            340                 345                 350         


Asn Gly Met Pro Val Ile Ser Lys Glu Glu Ala Val Asp Met Asp Phe 
        355                 360                 365             


Phe Thr Lys Gln Ile Ile Thr Gly Arg Asp Val His Pro Gly Leu Phe 
    370                 375                 380                 


Ala Asn Trp Phe Thr Gly Gly Leu Asn Tyr Gln Ile Glu His His Leu 
385                 390                 395                 400 


Phe Pro Ser Met Pro Arg His Asn Phe Ser Lys Ile Gln Pro Ala Val 
                405                 410                 415     


Glu Thr Leu Cys Lys Lys Tyr Asn Val Arg Tyr His Thr Thr Gly Met 
            420                 425                 430         


Ile Glu Gly Thr Ala Glu Val Phe Ser Arg Leu Asn Glu Val Ser Lys 
        435                 440                 445             


Ala Ala Ser Lys Met Gly Lys Ala Gln 
    450                 455         


<210>  25
<211>  23
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 535

<400>  25
ctcctgcaga gccttctctt cca                                               23


<210>  26
<211>  23
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 536

<400>  26
cctacttcta cttgctgtcc ttg                                               23


<210>  27
<211>  22
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 533

<400>  27
atgcatgaga agcgcgacga tc                                                22


<210>  28
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 534

<400>  28
tggaagagaa ggctctgcag                                                   20


<210>  29
<211>  19
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 537

<400>  29
ctccttgcta tcaccctcg                                                    19


<210>  30
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer AUAP

<400>  30
ggccacgcgt cgactagtac                                                   20


<210>  31
<211>  36
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer AAP


<220>
<221>  misc_feature
<222>  (24)..(25)
<223>  n = deoxyinosine

<220>
<221>  misc_feature
<222>  (29)..(30)
<223>  n = deoxyinosine

<220>
<221>  misc_feature
<222>  (34)..(35)
<223>  n = deoxyinosine

<400>  31
ggccacgcgt cgactagtac gggnngggnn gggnng                                 36


<210>  32
<211>  38
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 539

<400>  32
aaactaaccc agctctccat ggcgccgaat gtggactc                               38


<210>  33
<211>  38
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 540

<400>  33
atccacactt gcggccctat agggattccc caagcttc                               38


<210>  34
<211>  8423
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pY91M

<400>  34
gtacgagccg gaagcataaa gtgtaaagcc tggggtgcct aatgagtgag ctaactcaca       60

ttaattgcgt tgcgctcact gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat      120

taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc      180

tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca      240

aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca      300

aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg      360

ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg      420

acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt      480

ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt      540

tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc      600

tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt      660

gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt      720

agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc      780

tacactagaa ggacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa      840

agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt      900

tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct      960

acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta     1020

tcaaaaagga tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa     1080

agtatatatg agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc     1140

tcagcgatct gtctatttcg ttcatccata gttgcctgac tccccgtcgt gtagataact     1200

acgatacggg agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc     1260

tcaccggctc cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt     1320

ggtcctgcaa ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta     1380

agtagttcgc cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg     1440

tcacgctcgt cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggcgagtt     1500

acatgatccc ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc     1560

agaagtaagt tggccgcagt gttatcactc atggttatgg cagcactgca taattctctt     1620

actgtcatgc catccgtaag atgcttttct gtgactggtg agtactcaac caagtcattc     1680

tgagaatagt gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg ggataatacc     1740

gcgccacata gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa     1800

ctctcaagga tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac     1860

tgatcttcag catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa     1920

aatgccgcaa aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt     1980

tttcaatatt attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa     2040

tgtatttaga aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct     2100

gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc     2160

gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc     2220

acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt     2280

agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg     2340

ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt     2400

ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta     2460

taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt     2520

aacgcgaatt ttaacaaaat attaacgctt acaatttcca ttcgccattc aggctgcgca     2580

actgttggga agggcgatcg gtgcgggcct cttcgctatt acgccagctg gcgaaagggg     2640

gatgtgctgc aaggcgatta agttgggtaa cgccagggtt ttcccagtca cgacgttgta     2700

aaacgacggc cagtgaattg taatacgact cactataggg cgaattgggt accgggcccc     2760

ccctcgaggt cgatggtgtc gataagcttg atatcgaatt catgtcacac aaaccgatct     2820

tcgcctcaag gaaacctaat tctacatccg agagactgcc gagatccagt ctacactgat     2880

taattttcgg gccaataatt taaaaaaatc gtgttatata atattatatg tattatatat     2940

atacatcatg atgatactga cagtcatgtc ccattgctaa atagacagac tccatctgcc     3000

gcctccaact gatgttctca atatttaagg ggtcatctcg cattgtttaa taataaacag     3060

actccatcta ccgcctccaa atgatgttct caaaatatat tgtatgaact tatttttatt     3120

acttagtatt attagacaac ttacttgctt tatgaaaaac acttcctatt taggaaacaa     3180

tttataatgg cagttcgttc atttaacaat ttatgtagaa taaatgttat aaatgcgtat     3240

gggaaatctt aaatatggat agcataaatg atatctgcat tgcctaattc gaaatcaaca     3300

gcaacgaaaa aaatcccttg tacaacataa atagtcatcg agaaatatca actatcaaag     3360

aacagctatt cacacgttac tattgagatt attattggac gagaatcaca cactcaactg     3420

tctttctctc ttctagaaat acaggtacaa gtatgtacta ttctcattgt tcatacttct     3480

agtcatttca tcccacatat tccttggatt tctctccaat gaatgacatt ctatcttgca     3540

aattcaacaa ttataataag atataccaaa gtagcggtat agtggcaatc aaaaagcttc     3600

tctggtgtgc ttctcgtatt tatttttatt ctaatgatcc attaaaggta tatatttatt     3660

tcttgttata taatcctttt gtttattaca tgggctggat acataaaggt attttgattt     3720

aattttttgc ttaaattcaa tcccccctcg ttcagtgtca actgtaatgg taggaaatta     3780

ccatactttt gaagaagcaa aaaaaatgaa agaaaaaaaa aatcgtattt ccaggttaga     3840

cgttccgcag aatctagaat gcggtatgcg gtacattgtt cttcgaacgt aaaagttgcg     3900

ctccctgaga tattgtacat ttttgctttt acaagtacaa gtacatcgta caactatgta     3960

ctactgttga tgcatccaca acagtttgtt ttgttttttt ttgttttttt tttttctaat     4020

gattcattac cgctatgtat acctacttgt acttgtagta agccgggtta ttggcgttca     4080

attaatcata gacttatgaa tctgcacggt gtgcgctgcg agttactttt agcttatgca     4140

tgctacttgg gtgtaatatt gggatctgtt cggaaatcaa cggatgctca atcgatttcg     4200

acagtaatta attaagtcat acacaagtca gctttcttcg agcctcatat aagtataagt     4260

agttcaacgt attagcactg tacccagcat ctccgtatcg agaaacacaa caacatgccc     4320

cattggacag atcatgcgga tacacaggtt gtgcagtatc atacatactc gatcagacag     4380

gtcgtctgac catcatacaa gctgaacaag cgctccatac ttgcacgctc tctatataca     4440

cagttaaatt acatatccat agtctaacct ctaacagtta atcttctggt aagcctccca     4500

gccagccttc tggtatcgct tggcctcctc aataggatct cggttctggc cgtacagacc     4560

tcggccgaca attatgatat ccgttccggt agacatgaca tcctcaacag ttcggtactg     4620

ctgtccgaga gcgtctccct tgtcgtcaag acccaccccg ggggtcagaa taagccagtc     4680

ctcagagtcg cccttaggtc ggttctgggc aatgaagcca accacaaact cggggtcgga     4740

tcgggcaagc tcaatggtct gcttggagta ctcgccagtg gccagagagc ccttgcaaga     4800

cagctcggcc agcatgagca gacctctggc cagcttctcg ttgggagagg ggactaggaa     4860

ctccttgtac tgggagttct cgtagtcaga gacgtcctcc ttcttctgtt cagagacagt     4920

ttcctcggca ccagctcgca ggccagcaat gattccggtt ccgggtacac cgtgggcgtt     4980

ggtgatatcg gaccactcgg cgattcggtg acaccggtac tggtgcttga cagtgttgcc     5040

aatatctgcg aactttctgt cctcgaacag gaagaaaccg tgcttaagag caagttcctt     5100

gagggggagc acagtgccgg cgtaggtgaa gtcgtcaatg atgtcgatat gggttttgat     5160

catgcacaca taaggtccga ccttatcggc aagctcaatg agctccttgg tggtggtaac     5220

atccagagaa gcacacaggt tggttttctt ggctgccacg agcttgagca ctcgagcggc     5280

aaaggcggac ttgtggacgt tagctcgagc ttcgtaggag ggcattttgg tggtgaagag     5340

gagactgaaa taaatttagt ctgcagaact ttttatcgga accttatctg gggcagtgaa     5400

gtatatgtta tggtaatagt tacgagttag ttgaacttat agatagactg gactatacgg     5460

ctatcggtcc aaattagaaa gaacgtcaat ggctctctgg gcgtcgcctt tgccgacaaa     5520

aatgtgatca tgatgaaagc cagcaatgac gttgcagctg atattgttgt cggccaaccg     5580

cgccgaaaac gcagctgtca gacccacagc ctccaacgaa gaatgtatcg tcaaagtgat     5640

ccaagcacac tcatagttgg agtcgtactc caaaggcggc aatgacgagt cagacagata     5700

ctcgtcgact caggcgacga cggaattcct gcagcccatc tgcagaattc aggagagacc     5760

gggttggcgg cgtatttgtg tcccaaaaaa cagccccaat tgccccggag aagacggcca     5820

ggccgcctag atgacaaatt caacaactca cagctgactt tctgccattg ccactagggg     5880

ggggcctttt tatatggcca agccaagctc tccacgtcgg ttgggctgca cccaacaata     5940

aatgggtagg gttgcaccaa caaagggatg ggatgggggg tagaagatac gaggataacg     6000

gggctcaatg gcacaaataa gaacgaatac tgccattaag actcgtgatc cagcgactga     6060

caccattgca tcatctaagg gcctcaaaac tacctcggaa ctgctgcgct gatctggaca     6120

ccacagaggt tccgagcact ttaggttgca ccaaatgtcc caccaggtgc aggcagaaaa     6180

cgctggaaca gcgtgtacag tttgtcttaa caaaaagtga gggcgctgag gtcgagcagg     6240

gtggtgtgac ttgttatagc ctttagagct gcgaaagcgc gtatggattt ggctcatcag     6300

gccagattga gggtctgtgg acacatgtca tgttagtgta cttcaatcgc cccctggata     6360

tagccccgac aataggccgt ggcctcattt ttttgccttc cgcacatttc cattgctcgg     6420

tacccacacc ttgcttctcc tgcacttgcc aaccttaata ctggtttaca ttgaccaaca     6480

tcttacaagc ggggggcttg tctagggtat atataaacag tggctctccc aatcggttgc     6540

cagtctcttt tttcctttct ttccccacag attcgaaatc taaactacac atcacacaat     6600

gcctgttact gacgtcctta agcgaaagtc cggtgtcatc gtcggcgacg atgtccgagc     6660

cgtgagtatc cacgacaaga tcagtgtcga gacgacgcgt tttgtgtaat gacacaatcc     6720

gaaagtcgct agcaacacac actctctaca caaactaacc cagctctcca tgggtggcgg     6780

aggacagcag acagaccgaa tcaccgacac caacggcaga ttcagcagct acacctggga     6840

ggaggtgcag aaacacacca aacatggaga tcagtgggtg gtggtggaga ggaaggttta     6900

taacgtcagc cagtgggtga agagacaccc cggaggactg aggatcctcg gacactatgc     6960

tggagaagac gccacggagg cgttcactgc gtttcatcca aaccttcagc tggtgaggaa     7020

atacctgaag ccgctgctaa tcggagagct ggaggcgtct gaacccagtc aggaccggca     7080

gaaaaacgct gctctcgtgg aggatttccg agccctgcgt gagcgtctgg aggctgaagg     7140

ctgttttaaa acgcagccgc tgtttttcgc tctgcatttg ggccacattc tgctcctgga     7200

ggccatcgct ttcatgatgg tgtggtattt cggcaccggt tggatcaaca cgctcatcgt     7260

cgctgttatt ctggctactg cacagtcaca agctggatgg ttgcagcatg acttcggtca     7320

tctgtccgtg tttaaaacct ctggaatgaa tcatttggtg cacaaatttg tcatcggaca     7380

cctgaaggga gcgtctgcgg gctggtggaa ccatcggcac ttccagcatc acgctaaacc     7440

caacatcttc aagaaggacc cggacgtcaa catgctgaac gcctttgtgg tgggaaacgt     7500

gcagcccgtg gagtatggcg ttaagaagat caagcatctg ccctacaacc atcagcacaa     7560

gtacttcttc ttcattggtc ctcccctgct catcccagtg tatttccagt tccaaatctt     7620

tcacaatatg atcagtcatg gcatgtgggt ggacctgctg tggtgtatca gctactacgt     7680

ccgatacttc ctttgttaca cgcagttcta cggcgtcttt tgggctatta tcctctttaa     7740

tttcgtcagg tttatggaga gccactggtt tgtttgggtc acacagatga gccacatccc     7800

catgaacatt gactatgaga aaaatcagga ctggctcagc atgcagctgg tcgcgacctg     7860

taacatcgag cagtctgcct tcaacgactg gttcagcgga cacctcaact tccagatcga     7920

gcatcatctc tttcccacag tgcctcggca caactactgg cgcgccgctc cacgggtgcg     7980

agcgttgtgt gagaaatacg gagtcaaata ccaagagaag accttgtacg gagcatttgc     8040

ggatatcatt aggtctttgg agaaatctgg cgagctctgg ctggatgcgt atctcaacaa     8100

ataagcggcc gcaagtgtgg atggggaagt gagtgcccgg ttctgtgtgc acaattggca     8160

atccaagatg gatggattca acacagggat atagcgagct acgtggtggt gcgaggatat     8220

agcaacggat atttatgttt gacacttgag aatgtacgat acaagcactg tccaagtaca     8280

atactaaaca tactgtacat actcatactc gtacccgggc aacggtttca cttgagtgca     8340

gtggctagtg ctcttactcg tacagtgtgc aatactgcgt atcatagtct ttgatgtata     8400

tcgtattcat tcatgttagt tgc                                             8423


<210>  35
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 373

<400>  35
cgcgttttgt gtaatgacac                                                   20


<210>  36
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer 507

<400>  36
acacagaacc gggcactcac                                                   20


<210>  37
<211>  477
<212>  PRT
<213>  Phaeodactylum tricornutum


<220>
<221>  MISC_FEATURE
<222>  (1)..(477)
<223>  GenBank Accession No. AAL92563 (gi_19879689)

<400>  37

Met Gly Lys Gly Gly Asp Ala Arg Ala Ser Lys Gly Ser Thr Ala Ala 
1               5                   10                  15      


Arg Lys Ile Ser Trp Gln Glu Val Lys Thr His Ala Ser Pro Glu Asp 
            20                  25                  30          


Ala Trp Ile Ile His Ser Asn Lys Val Tyr Asp Val Ser Asn Trp His 
        35                  40                  45              


Glu His Pro Gly Gly Ala Val Ile Phe Thr His Ala Gly Asp Asp Met 
    50                  55                  60                  


Thr Asp Ile Phe Ala Ala Phe His Ala Pro Gly Ser Gln Ser Leu Met 
65                  70                  75                  80  


Lys Lys Phe Tyr Ile Gly Glu Leu Leu Pro Glu Thr Thr Gly Lys Glu 
                85                  90                  95      


Pro Gln Gln Ile Ala Phe Glu Lys Gly Tyr Arg Asp Leu Arg Ser Lys 
            100                 105                 110         


Leu Ile Met Met Gly Met Phe Lys Ser Asn Lys Trp Phe Tyr Val Tyr 
        115                 120                 125             


Lys Cys Leu Ser Asn Met Ala Ile Trp Ala Ala Ala Cys Ala Leu Val 
    130                 135                 140                 


Phe Tyr Ser Asp Arg Phe Trp Val His Leu Ala Ser Ala Val Met Leu 
145                 150                 155                 160 


Gly Thr Phe Phe Gln Gln Ser Gly Trp Leu Ala His Asp Phe Leu His 
                165                 170                 175     


His Gln Val Phe Thr Lys Arg Lys His Gly Asp Leu Gly Gly Leu Phe 
            180                 185                 190         


Trp Gly Asn Leu Met Gln Gly Tyr Ser Val Gln Trp Trp Lys Asn Lys 
        195                 200                 205             


His Asn Gly His His Ala Val Pro Asn Leu His Cys Ser Ser Ala Val 
    210                 215                 220                 


Ala Gln Asp Gly Asp Pro Asp Ile Asp Thr Met Pro Leu Leu Ala Trp 
225                 230                 235                 240 


Ser Val Gln Gln Ala Gln Ser Tyr Arg Glu Leu Gln Ala Asp Gly Lys 
                245                 250                 255     


Asp Ser Gly Leu Val Lys Phe Met Ile Arg Asn Gln Ser Tyr Phe Tyr 
            260                 265                 270         


Phe Pro Ile Leu Leu Leu Ala Arg Leu Ser Trp Leu Asn Glu Ser Phe 
        275                 280                 285             


Lys Cys Ala Phe Gly Leu Gly Ala Ala Ser Glu Asn Ala Ala Leu Glu 
    290                 295                 300                 


Leu Lys Ala Lys Gly Leu Gln Tyr Pro Leu Leu Glu Lys Ala Gly Ile 
305                 310                 315                 320 


Leu Leu His Tyr Ala Trp Met Leu Thr Val Ser Ser Gly Phe Gly Arg 
                325                 330                 335     


Phe Ser Phe Ala Tyr Thr Ala Phe Tyr Phe Leu Thr Ala Thr Ala Ser 
            340                 345                 350         


Cys Gly Phe Leu Leu Ala Ile Val Phe Gly Leu Gly His Asn Gly Met 
        355                 360                 365             


Ala Thr Tyr Asn Ala Asp Ala Arg Pro Asp Phe Trp Lys Leu Gln Val 
    370                 375                 380                 


Thr Thr Thr Arg Asn Val Thr Gly Gly His Gly Phe Pro Gln Ala Phe 
385                 390                 395                 400 


Val Asp Trp Phe Cys Gly Gly Leu Gln Tyr Gln Val Asp His His Leu 
                405                 410                 415     


Phe Pro Ser Leu Pro Arg His Asn Leu Ala Lys Thr His Ala Leu Val 
            420                 425                 430         


Glu Ser Phe Cys Lys Glu Trp Gly Val Gln Tyr His Glu Ala Asp Leu 
        435                 440                 445             


Val Asp Gly Thr Met Glu Val Leu His His Leu Gly Ser Val Ala Gly 
    450                 455                 460                 


Glu Phe Val Val Asp Phe Val Arg Asp Gly Pro Ala Met 
465                 470                 475         


<210>  38
<211>  525
<212>  PRT
<213>  Physcomitrella patens


<220>
<221>  MISC_FEATURE
<222>  (1)..(525)
<223>  GenBank Accession No. CAA11033 (gi_3790209)

<400>  38

Met Val Phe Ala Gly Gly Gly Leu Gln Gln Gly Ser Leu Glu Glu Asn 
1               5                   10                  15      


Ile Asp Val Glu His Ile Ala Ser Met Ser Leu Phe Ser Asp Phe Phe 
            20                  25                  30          


Ser Tyr Val Ser Ser Thr Val Gly Ser Trp Ser Val His Ser Ile Gln 
        35                  40                  45              


Pro Leu Lys Arg Leu Thr Ser Lys Lys Arg Val Ser Glu Ser Ala Ala 
    50                  55                  60                  


Val Gln Cys Ile Ser Ala Glu Val Gln Arg Asn Ser Ser Thr Gln Gly 
65                  70                  75                  80  


Thr Ala Glu Ala Leu Ala Glu Ser Val Val Lys Pro Thr Arg Arg Arg 
                85                  90                  95      


Ser Ser Gln Trp Lys Lys Ser Thr His Pro Leu Ser Glu Val Ala Val 
            100                 105                 110         


His Asn Lys Pro Ser Asp Cys Trp Ile Val Val Lys Asn Lys Val Tyr 
        115                 120                 125             


Asp Val Ser Asn Phe Ala Asp Glu His Pro Gly Gly Ser Val Ile Ser 
    130                 135                 140                 


Thr Tyr Phe Gly Arg Asp Gly Thr Asp Val Phe Ser Ser Phe His Ala 
145                 150                 155                 160 


Ala Ser Thr Trp Lys Ile Leu Gln Asp Phe Tyr Ile Gly Asp Val Glu 
                165                 170                 175     


Arg Val Glu Pro Thr Pro Glu Leu Leu Lys Asp Phe Arg Glu Met Arg 
            180                 185                 190         


Ala Leu Phe Leu Arg Glu Gln Leu Phe Lys Ser Ser Lys Leu Tyr Tyr 
        195                 200                 205             


Val Met Lys Leu Leu Thr Asn Val Ala Ile Phe Ala Ala Ser Ile Ala 
    210                 215                 220                 


Ile Ile Cys Trp Ser Lys Thr Ile Ser Ala Val Leu Ala Ser Ala Cys 
225                 230                 235                 240 


Met Met Ala Leu Cys Phe Gln Gln Cys Gly Trp Leu Ser His Asp Phe 
                245                 250                 255     


Leu His Asn Gln Val Phe Glu Thr Arg Trp Leu Asn Glu Val Val Gly 
            260                 265                 270         


Tyr Val Ile Gly Asn Ala Val Leu Gly Phe Ser Thr Gly Trp Trp Lys 
        275                 280                 285             


Glu Lys His Asn Leu His His Ala Ala Pro Asn Glu Cys Asp Gln Thr 
    290                 295                 300                 


Tyr Gln Pro Ile Asp Glu Asp Ile Asp Thr Leu Pro Leu Ile Ala Trp 
305                 310                 315                 320 


Ser Lys Asp Ile Leu Ala Thr Val Glu Asn Lys Thr Phe Leu Arg Ile 
                325                 330                 335     


Leu Gln Tyr Gln His Leu Phe Phe Met Gly Leu Leu Phe Phe Ala Arg 
            340                 345                 350         


Gly Ser Trp Leu Phe Trp Ser Trp Arg Tyr Thr Ser Thr Ala Val Leu 
        355                 360                 365             


Ser Pro Val Asp Arg Leu Leu Glu Lys Gly Thr Val Leu Phe His Tyr 
    370                 375                 380                 


Phe Trp Phe Val Gly Thr Ala Cys Tyr Leu Leu Pro Gly Trp Lys Pro 
385                 390                 395                 400 


Leu Val Trp Met Ala Val Thr Glu Leu Met Ser Gly Met Leu Leu Gly 
                405                 410                 415     


Phe Val Phe Val Leu Ser His Asn Gly Met Glu Val Tyr Asn Ser Ser 
            420                 425                 430         


Lys Glu Phe Val Ser Ala Gln Ile Val Ser Thr Arg Asp Ile Lys Gly 
        435                 440                 445             


Asn Ile Phe Asn Asp Trp Phe Thr Gly Gly Leu Asn Arg Gln Ile Glu 
    450                 455                 460                 


His His Leu Phe Pro Thr Met Pro Arg His Asn Leu Asn Lys Ile Ala 
465                 470                 475                 480 


Pro Arg Val Glu Val Phe Cys Lys Lys His Gly Leu Val Tyr Glu Asp 
                485                 490                 495     


Val Ser Ile Ala Thr Gly Thr Cys Lys Val Leu Lys Ala Leu Lys Glu 
            500                 505                 510         


Val Ala Glu Ala Ala Ala Glu Gln His Ala Thr Thr Ser 
        515                 520                 525 


<210>  39
<211>  481
<212>  PRT
<213>  Marchantia polymorpha


<220>
<221>  MISC_FEATURE
<222>  (1)..(481)
<223>  GenBank Accession No. AAT85661 (gi_50882491)

<400>  39

Met Ala Ser Ser Thr Thr Thr Ala Val Lys Gln Ser Ser Gly Gly Leu 
1               5                   10                  15      


Trp Ser Lys Trp Gly Thr Gly Ser Asn Leu Ser Phe Val Ser Arg Lys 
            20                  25                  30          


Glu Gln Gln Gln Gln Gln Gln Gln Ser Ser Pro Glu Ala Ser Thr Pro 
        35                  40                  45              


Ala Ala Gln Gln Glu Lys Ser Ile Ser Arg Glu Ser Ile Pro Glu Gly 
    50                  55                  60                  


Phe Leu Thr Val Glu Glu Val Ser Lys His Asp Asn Pro Ser Asp Cys 
65                  70                  75                  80  


Trp Ile Val Ile Asn Asp Lys Val Tyr Asp Val Ser Ala Phe Gly Lys 
                85                  90                  95      


Thr His Pro Gly Gly Pro Val Ile Phe Thr Gln Ala Gly Arg Asp Ala 
            100                 105                 110         


Thr Asp Ser Phe Lys Val Phe His Ser Ala Lys Ala Trp Gln Phe Leu 
        115                 120                 125             


Gln Asp Leu Tyr Ile Gly Asp Leu Tyr Asn Ala Glu Pro Val Ser Glu 
    130                 135                 140                 


Leu Val Lys Asp Tyr Arg Asp Leu Arg Thr Ala Phe Met Arg Ser Gln 
145                 150                 155                 160 


Leu Phe Lys Ser Ser Lys Met Tyr Tyr Val Thr Lys Cys Val Thr Asn 
                165                 170                 175     


Phe Ala Ile Leu Ala Ala Ser Leu Ala Val Ile Ala Trp Ser Gln Thr 
            180                 185                 190         


Tyr Leu Ala Val Leu Cys Ser Ser Phe Leu Leu Ala Leu Phe Trp Gln 
        195                 200                 205             


Gln Cys Gly Trp Leu Ser His Asp Phe Leu His His Gln Val Thr Glu 
    210                 215                 220                 


Asn Arg Ser Leu Asn Thr Tyr Phe Gly Gly Leu Phe Trp Gly Asn Phe 
225                 230                 235                 240 


Ala Gln Gly Tyr Ser Val Gly Trp Trp Lys Thr Lys His Asn Val His 
                245                 250                 255     


His Ala Ala Thr Asn Glu Cys Asp Asp Lys Tyr Gln Pro Ile Asp Pro 
            260                 265                 270         


Asp Ile Asp Thr Val Pro Leu Leu Ala Trp Ser Lys Glu Ile Leu Ala 
        275                 280                 285             


Thr Val Asp Asp Gln Phe Phe Arg Ser Ile Ile Ser Val Gln His Leu 
    290                 295                 300                 


Leu Phe Phe Pro Leu Leu Phe Leu Ala Arg Phe Ser Trp Leu His Ser 
305                 310                 315                 320 


Ser Trp Ala His Ala Ser Asn Phe Glu Met Pro Arg Tyr Met Arg Trp 
                325                 330                 335     


Ala Glu Lys Ala Ser Leu Leu Gly His Tyr Gly Ala Ser Ile Gly Ala 
            340                 345                 350         


Ala Phe Tyr Ile Leu Pro Ile Pro Gln Ala Ile Cys Trp Leu Phe Leu 
        355                 360                 365             


Ser Gln Leu Phe Cys Gly Ala Leu Leu Ser Ile Val Phe Val Ile Ser 
    370                 375                 380                 


His Asn Gly Met Asp Val Tyr Asn Asp Pro Arg Asp Phe Val Thr Ala 
385                 390                 395                 400 


Gln Val Thr Ser Thr Arg Asn Ile Glu Gly Asn Phe Phe Asn Asp Trp 
                405                 410                 415     


Phe Thr Gly Gly Leu Asn Arg Gln Ile Glu His His Leu Phe Pro Ser 
            420                 425                 430         


Leu Pro Arg His Asn Leu Ala Lys Val Ala Pro His Val Lys Ala Leu 
        435                 440                 445             


Cys Ala Lys His Gly Leu His Tyr Glu Glu Leu Ser Leu Gly Thr Gly 
    450                 455                 460                 


Val Cys Arg Val Phe Asn Arg Leu Val Glu Val Ala Tyr Ala Ala Lys 
465                 470                 475                 480 


Val 
    


<210>  40
<211>  457
<212>  PRT
<213>  Mortierella alpina


<220>
<221>  MISC_FEATURE
<222>  (1)..(457)
<223>  GenBank Accession No. AAL73947 (gi_18483175)

<400>  40

Met Ala Ala Ala Pro Ser Val Arg Thr Phe Thr Arg Ala Glu Ile Leu 
1               5                   10                  15      


Asn Ala Glu Ala Leu Asn Glu Gly Lys Lys Asp Ala Glu Ala Pro Phe 
            20                  25                  30          


Leu Met Ile Ile Asp Asn Lys Val Tyr Asp Val Arg Glu Phe Val Pro 
        35                  40                  45              


Asp His Pro Gly Gly Ser Val Ile Leu Thr His Val Gly Lys Asp Gly 
    50                  55                  60                  


Thr Asp Val Phe Asp Thr Phe His Pro Glu Ala Ala Trp Glu Thr Leu 
65                  70                  75                  80  


Ala Asn Phe Tyr Val Gly Asp Ile Asp Glu Ser Asp Arg Ala Ile Lys 
                85                  90                  95      


Asn Asp Asp Phe Ala Ala Glu Val Arg Lys Leu Arg Thr Leu Phe Gln 
            100                 105                 110         


Ser Leu Gly Tyr Tyr Asp Ser Ser Lys Ala Tyr Tyr Ala Phe Lys Val 
        115                 120                 125             


Ser Phe Asn Leu Cys Ile Trp Gly Leu Ser Thr Phe Ile Val Ala Lys 
    130                 135                 140                 


Trp Gly Gln Thr Ser Thr Leu Ala Asn Glu Leu Ser Ala Ala Leu Leu 
145                 150                 155                 160 


Gly Leu Phe Trp Gln Gln Arg Gly Trp Leu Ala His Asp Phe Leu His 
                165                 170                 175     


His Gln Val Phe Gln Asp Arg Phe Trp Gly Asp Leu Phe Gly Ala Phe 
            180                 185                 190         


Leu Gly Gly Asp Cys Gln Gly Phe Ser Ser Ser Trp Trp Lys Asp Lys 
        195                 200                 205             


His Asn Thr His His Ala Ala Pro Asn Val His Gly Glu Asp Pro Asp 
    210                 215                 220                 


Ile Asp Thr His Pro Leu Leu Thr Trp Ser Glu His Ala Leu Glu Met 
225                 230                 235                 240 


Phe Ser Asp Val Pro Asp Glu Glu Leu Thr Arg Met Trp Ser Arg Phe 
                245                 250                 255     


Met Val Leu Asn Gln Thr Trp Phe Tyr Phe Pro Ile Leu Ser Phe Ala 
            260                 265                 270         


Arg Leu Ser Trp Cys Leu Gln Ser Ile Leu Phe Val Leu Pro Asn Gly 
        275                 280                 285             


Gln Ala His Lys Pro Ser Gly Ala Arg Val Pro Ile Ser Leu Val Glu 
    290                 295                 300                 


Gln Leu Ser Leu Ala Met His Trp Thr Trp Tyr Leu Ala Thr Met Phe 
305                 310                 315                 320 


Leu Phe Ile Lys Asp Pro Val Asn Met Met Val Tyr Phe Leu Val Ser 
                325                 330                 335     


Gln Ala Val Cys Gly Asn Leu Leu Ala Ile Val Phe Ser Leu Asn His 
            340                 345                 350         


Asn Gly Met Pro Val Ile Ser Lys Glu Glu Ala Val Asp Met Asp Phe 
        355                 360                 365             


Phe Thr Lys Gln Ile Ile Thr Gly Arg Asp Val His Pro Gly Leu Phe 
    370                 375                 380                 


Ala Asn Trp Phe Thr Gly Gly Leu Asn Tyr Gln Ile Glu His His Leu 
385                 390                 395                 400 


Phe Pro Ser Met Pro Arg His Asn Phe Ser Lys Ile Gln Pro Ala Val 
                405                 410                 415     


Glu Thr Leu Cys Lys Lys Tyr Gly Val Arg Tyr His Thr Thr Gly Met 
            420                 425                 430         


Ile Glu Gly Thr Ala Glu Val Phe Ser Arg Leu Asn Glu Val Ser Lys 
        435                 440                 445             


Ala Ala Ser Lys Met Gly Lys Ala Gln 
    450                 455         


<210>  41
<211>  419
<212>  PRT
<213>  Euglena gracilis


<220>
<221>  MISC_FEATURE
<222>  (1)..(419)
<223>  GenBank Accession No. AAD45877 (gi_5639724)

<400>  41

Met Lys Ser Lys Arg Gln Ala Leu Ser Pro Leu Gln Leu Met Glu Gln 
1               5                   10                  15      


Thr Tyr Asp Val Val Asn Phe His Pro Gly Gly Ala Glu Ile Ile Glu 
            20                  25                  30          


Asn Tyr Gln Gly Arg Asp Ala Thr Asp Ala Phe Met Val Met His Phe 
        35                  40                  45              


Gln Glu Ala Phe Asp Lys Leu Lys Arg Met Pro Lys Ile Asn Pro Ser 
    50                  55                  60                  


Phe Glu Leu Pro Pro Gln Ala Ala Val Asn Glu Ala Gln Glu Asp Phe 
65                  70                  75                  80  


Arg Lys Leu Arg Glu Glu Leu Ile Ala Thr Gly Met Phe Asp Ala Ser 
                85                  90                  95      


Pro Leu Trp Tyr Ser Tyr Lys Ile Ser Thr Thr Leu Gly Leu Gly Val 
            100                 105                 110         


Leu Gly Tyr Phe Leu Met Val Gln Tyr Gln Met Tyr Phe Ile Gly Ala 
        115                 120                 125             


Val Leu Leu Gly Met His Tyr Gln Gln Met Gly Trp Leu Ser His Asp 
    130                 135                 140                 


Ile Cys His His Gln Thr Phe Lys Asn Arg Asn Trp Asn Asn Leu Val 
145                 150                 155                 160 


Gly Leu Val Phe Gly Asn Gly Leu Gln Gly Phe Ser Val Thr Cys Trp 
                165                 170                 175     


Lys Asp Arg His Asn Ala His His Ser Ala Thr Asn Val Gln Gly His 
            180                 185                 190         


Asp Pro Asp Ile Asp Asn Leu Pro Pro Leu Ala Trp Ser Glu Asp Asp 
        195                 200                 205             


Val Thr Arg Ala Ser Pro Ile Ser Arg Lys Leu Ile Gln Phe Gln Gln 
    210                 215                 220                 


Tyr Tyr Phe Leu Val Ile Cys Ile Leu Leu Arg Phe Ile Trp Cys Phe 
225                 230                 235                 240 


Gln Cys Val Leu Thr Val Arg Ser Leu Lys Asp Arg Asp Asn Gln Phe 
                245                 250                 255     


Tyr Arg Ser Gln Tyr Lys Lys Glu Ala Ile Gly Leu Ala Leu His Trp 
            260                 265                 270         


Thr Leu Lys Ala Leu Phe His Leu Phe Phe Met Pro Ser Ile Leu Thr 
        275                 280                 285             


Ser Leu Leu Val Phe Phe Val Ser Glu Leu Val Gly Gly Phe Gly Ile 
    290                 295                 300                 


Ala Ile Val Val Phe Met Asn His Tyr Pro Leu Glu Lys Ile Gly Asp 
305                 310                 315                 320 


Pro Val Trp Asp Gly His Gly Phe Ser Val Gly Gln Ile His Glu Thr 
                325                 330                 335     


Met Asn Ile Arg Arg Gly Ile Ile Thr Asp Trp Phe Phe Gly Gly Leu 
            340                 345                 350         


Asn Tyr Gln Ile Glu His His Leu Trp Pro Thr Leu Pro Arg His Asn 
        355                 360                 365             


Leu Thr Ala Val Ser Tyr Gln Val Glu Gln Leu Cys Gln Lys His Asn 
    370                 375                 380                 


Leu Pro Tyr Arg Asn Pro Leu Pro His Glu Gly Leu Val Ile Leu Leu 
385                 390                 395                 400 


Arg Tyr Leu Ala Val Phe Ala Arg Met Ala Glu Lys Gln Pro Ala Gly 
                405                 410                 415     


Lys Ala Leu 
            


<210>  42
<211>  1416
<212>  DNA
<213>  Porphyridium cruentum


<220>
<221>  CDS
<222>  (1)..(1416)
<223>  variant of SEQ ID NO:1

<400>  42
atg gcg ccg aat gtg gac tcc gga agc aag gac cgc ggc gtg agc gcg         48
Met Ala Pro Asn Val Asp Ser Gly Ser Lys Asp Arg Gly Val Ser Ala           
1               5                   10                  15                

gtc aaa gaa gta gtc tct ggc gcg acg gcc aac gcg ctg agt ccg gcc         96
Val Lys Glu Val Val Ser Gly Ala Thr Ala Asn Ala Leu Ser Pro Ala           
            20                  25                  30                    

gag cgc gtg gtg acc agg aag gag ctc gcg ggg cac gcc tca agg gag        144
Glu Arg Val Val Thr Arg Lys Glu Leu Ala Gly His Ala Ser Arg Glu           
        35                  40                  45                        

tcg gtg tgg att gcg gtg aac ggc cgt gtg tac gat gtg acc ggc ttt        192
Ser Val Trp Ile Ala Val Asn Gly Arg Val Tyr Asp Val Thr Gly Phe           
    50                  55                  60                            

gag aac gtt cac cct ggc ggc gag atc att ctg acc gcc gcc ggg cag        240
Glu Asn Val His Pro Gly Gly Glu Ile Ile Leu Thr Ala Ala Gly Gln           
65                  70                  75                  80            

gac gca acg gac gtg ttt gcc gcg ttt cac acg ccc gcc acg tgg aaa        288
Asp Ala Thr Asp Val Phe Ala Ala Phe His Thr Pro Ala Thr Trp Lys           
                85                  90                  95                

atg atg ccg cag ttc ctc gtg gga aac ctc gag gag gac gcg ctc tct        336
Met Met Pro Gln Phe Leu Val Gly Asn Leu Glu Glu Asp Ala Leu Ser           
            100                 105                 110                   

gcc aaa ccg tct aag cag ctt aat ggg cat tcg cca cac gag tac caa        384
Ala Lys Pro Ser Lys Gln Leu Asn Gly His Ser Pro His Glu Tyr Gln           
        115                 120                 125                       

gct gat atc cga aag atg cgt gcg gaa ctt gtc aag ctg cgc gcg ttc        432
Ala Asp Ile Arg Lys Met Arg Ala Glu Leu Val Lys Leu Arg Ala Phe           
    130                 135                 140                           

gac tcg aac aag ttc ttc tac ctg ttc aag ttc ctg tcc acg tct gcg        480
Asp Ser Asn Lys Phe Phe Tyr Leu Phe Lys Phe Leu Ser Thr Ser Ala           
145                 150                 155                 160           

att tgc gcc ctc ttg gtg gtc atg gcg ctc ggc atg aag gac tcg atg        528
Ile Cys Ala Leu Leu Val Val Met Ala Leu Gly Met Lys Asp Ser Met           
                165                 170                 175               

atc gtc acg gcg ctc gcc gcg ttc acc atg gca ctc ttc tgg cag cag        576
Ile Val Thr Ala Leu Ala Ala Phe Thr Met Ala Leu Phe Trp Gln Gln           
            180                 185                 190                   

tgc ggc tgg ctc gct cac gac ttt ctg cac cat cag gtg ttc aag aac        624
Cys Gly Trp Leu Ala His Asp Phe Leu His His Gln Val Phe Lys Asn           
        195                 200                 205                       

agg gtg ttc aac aac ctg gtc ggt ctt gtt gtt ggt aat gtc tat cag        672
Arg Val Phe Asn Asn Leu Val Gly Leu Val Val Gly Asn Val Tyr Gln           
    210                 215                 220                           

ggc ttt tcg gta tcc tgg tgg aag atg aag cac aac cac cac cac gcc        720
Gly Phe Ser Val Ser Trp Trp Lys Met Lys His Asn His His His Ala           
225                 230                 235                 240           

gct cca aac gtg acg tca acg gcc gct ggg cca gac cca gac atc gac        768
Ala Pro Asn Val Thr Ser Thr Ala Ala Gly Pro Asp Pro Asp Ile Asp           
                245                 250                 255               

act gtg ccc gtg ctc tcg tgg agc gag aaa ctc atc gag ggt gat agc        816
Thr Val Pro Val Leu Ser Trp Ser Glu Lys Leu Ile Glu Gly Asp Ser           
            260                 265                 270                   

aag gag atg gag gat ctg ccc atg ttc ctc atg aag aac cag aag atc        864
Lys Glu Met Glu Asp Leu Pro Met Phe Leu Met Lys Asn Gln Lys Ile           
        275                 280                 285                       

ttt tac tgg ccg gtt ctg tgc gtg gcg cgc atc agc tgg ctc ctg cag        912
Phe Tyr Trp Pro Val Leu Cys Val Ala Arg Ile Ser Trp Leu Leu Gln           
    290                 295                 300                           

agc ctt ctc ttc cag cgc gcg ccg gtc tgg aac ttt gtg ggc gga aac        960
Ser Leu Leu Phe Gln Arg Ala Pro Val Trp Asn Phe Val Gly Gly Asn           
305                 310                 315                 320           

agc tgg cgc gcg gtg gag acc gtc gcg ctt ctc atg cat cac ggc gcc       1008
Ser Trp Arg Ala Val Glu Thr Val Ala Leu Leu Met His His Gly Ala           
                325                 330                 335               

tac ttc tac ttg ctg tcc ttg ctc aag agc tgg gtc cat gtc gtg ctc       1056
Tyr Phe Tyr Leu Leu Ser Leu Leu Lys Ser Trp Val His Val Val Leu           
            340                 345                 350                   

ttt ttg gtg gtg agc cag gcg atg ggt ggt gtg cta ctc ggc gtc gtg       1104
Phe Leu Val Val Ser Gln Ala Met Gly Gly Val Leu Leu Gly Val Val           
        355                 360                 365                       

ttc acc gtc ggg cgc aac gcg atg aaa gtc ctc tcc gag gaa gaa atg       1152
Phe Thr Val Gly Arg Asn Ala Met Lys Val Leu Ser Glu Glu Glu Met           
    370                 375                 380                           

aag tca acc gac ttt gtc cag atg cag gtc ctg acg acg aga aat att       1200
Lys Ser Thr Asp Phe Val Gln Met Gln Val Leu Thr Thr Arg Asn Ile           
385                 390                 395                 400           

gag ccg acg gct ttc aat cgg tgg ttc agc ggt ggc ctc agc tac cag       1248
Glu Pro Thr Ala Phe Asn Arg Trp Phe Ser Gly Gly Leu Ser Tyr Gln           
                405                 410                 415               

att gag cac cac atc tgg cct cag ctg ccc cga cac agc tta ccc aag       1296
Ile Glu His His Ile Trp Pro Gln Leu Pro Arg His Ser Leu Pro Lys           
            420                 425                 430                   

gcg cgc gaa att ctc acc aag ttt tgc agc aag tat gat att ccg tac       1344
Ala Arg Glu Ile Leu Thr Lys Phe Cys Ser Lys Tyr Asp Ile Pro Tyr           
        435                 440                 445                       

gcc agt caa ggc ctc att gaa ggt aac atg gaa gtg tgg aaa atg ctc       1392
Ala Ser Gln Gly Leu Ile Glu Gly Asn Met Glu Val Trp Lys Met Leu           
    450                 455                 460                           

tcg aag ctt ggg gaa tcc cta tag                                       1416
Ser Lys Leu Gly Glu Ser Leu                                               
465                 470                                                   


<210>  43
<211>  471
<212>  PRT
<213>  Porphyridium cruentum

<400>  43

Met Ala Pro Asn Val Asp Ser Gly Ser Lys Asp Arg Gly Val Ser Ala 
1               5                   10                  15      


Val Lys Glu Val Val Ser Gly Ala Thr Ala Asn Ala Leu Ser Pro Ala 
            20                  25                  30          


Glu Arg Val Val Thr Arg Lys Glu Leu Ala Gly His Ala Ser Arg Glu 
        35                  40                  45              


Ser Val Trp Ile Ala Val Asn Gly Arg Val Tyr Asp Val Thr Gly Phe 
    50                  55                  60                  


Glu Asn Val His Pro Gly Gly Glu Ile Ile Leu Thr Ala Ala Gly Gln 
65                  70                  75                  80  


Asp Ala Thr Asp Val Phe Ala Ala Phe His Thr Pro Ala Thr Trp Lys 
                85                  90                  95      


Met Met Pro Gln Phe Leu Val Gly Asn Leu Glu Glu Asp Ala Leu Ser 
            100                 105                 110         


Ala Lys Pro Ser Lys Gln Leu Asn Gly His Ser Pro His Glu Tyr Gln 
        115                 120                 125             


Ala Asp Ile Arg Lys Met Arg Ala Glu Leu Val Lys Leu Arg Ala Phe 
    130                 135                 140                 


Asp Ser Asn Lys Phe Phe Tyr Leu Phe Lys Phe Leu Ser Thr Ser Ala 
145                 150                 155                 160 


Ile Cys Ala Leu Leu Val Val Met Ala Leu Gly Met Lys Asp Ser Met 
                165                 170                 175     


Ile Val Thr Ala Leu Ala Ala Phe Thr Met Ala Leu Phe Trp Gln Gln 
            180                 185                 190         


Cys Gly Trp Leu Ala His Asp Phe Leu His His Gln Val Phe Lys Asn 
        195                 200                 205             


Arg Val Phe Asn Asn Leu Val Gly Leu Val Val Gly Asn Val Tyr Gln 
    210                 215                 220                 


Gly Phe Ser Val Ser Trp Trp Lys Met Lys His Asn His His His Ala 
225                 230                 235                 240 


Ala Pro Asn Val Thr Ser Thr Ala Ala Gly Pro Asp Pro Asp Ile Asp 
                245                 250                 255     


Thr Val Pro Val Leu Ser Trp Ser Glu Lys Leu Ile Glu Gly Asp Ser 
            260                 265                 270         


Lys Glu Met Glu Asp Leu Pro Met Phe Leu Met Lys Asn Gln Lys Ile 
        275                 280                 285             


Phe Tyr Trp Pro Val Leu Cys Val Ala Arg Ile Ser Trp Leu Leu Gln 
    290                 295                 300                 


Ser Leu Leu Phe Gln Arg Ala Pro Val Trp Asn Phe Val Gly Gly Asn 
305                 310                 315                 320 


Ser Trp Arg Ala Val Glu Thr Val Ala Leu Leu Met His His Gly Ala 
                325                 330                 335     


Tyr Phe Tyr Leu Leu Ser Leu Leu Lys Ser Trp Val His Val Val Leu 
            340                 345                 350         


Phe Leu Val Val Ser Gln Ala Met Gly Gly Val Leu Leu Gly Val Val 
        355                 360                 365             


Phe Thr Val Gly Arg Asn Ala Met Lys Val Leu Ser Glu Glu Glu Met 
    370                 375                 380                 


Lys Ser Thr Asp Phe Val Gln Met Gln Val Leu Thr Thr Arg Asn Ile 
385                 390                 395                 400 


Glu Pro Thr Ala Phe Asn Arg Trp Phe Ser Gly Gly Leu Ser Tyr Gln 
                405                 410                 415     


Ile Glu His His Ile Trp Pro Gln Leu Pro Arg His Ser Leu Pro Lys 
            420                 425                 430         


Ala Arg Glu Ile Leu Thr Lys Phe Cys Ser Lys Tyr Asp Ile Pro Tyr 
        435                 440                 445             


Ala Ser Gln Gly Leu Ile Glu Gly Asn Met Glu Val Trp Lys Met Leu 
    450                 455                 460                 


Ser Lys Leu Gly Glu Ser Leu 
465                 470     


<210>  44
<211>  8502
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pY109 #1

<400>  44
gtacgagccg gaagcataaa gtgtaaagcc tggggtgcct aatgagtgag ctaactcaca       60

ttaattgcgt tgcgctcact gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat      120

taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc      180

tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca      240

aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca      300

aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg      360

ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg      420

acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt      480

ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt      540

tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc      600

tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt      660

gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt      720

agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc      780

tacactagaa ggacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa      840

agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt      900

tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct      960

acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta     1020

tcaaaaagga tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa     1080

agtatatatg agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc     1140

tcagcgatct gtctatttcg ttcatccata gttgcctgac tccccgtcgt gtagataact     1200

acgatacggg agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc     1260

tcaccggctc cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt     1320

ggtcctgcaa ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta     1380

agtagttcgc cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg     1440

tcacgctcgt cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggcgagtt     1500

acatgatccc ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc     1560

agaagtaagt tggccgcagt gttatcactc atggttatgg cagcactgca taattctctt     1620

actgtcatgc catccgtaag atgcttttct gtgactggtg agtactcaac caagtcattc     1680

tgagaatagt gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg ggataatacc     1740

gcgccacata gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa     1800

ctctcaagga tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac     1860

tgatcttcag catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa     1920

aatgccgcaa aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt     1980

tttcaatatt attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa     2040

tgtatttaga aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct     2100

gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc     2160

gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc     2220

acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt     2280

agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg     2340

ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt     2400

ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta     2460

taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt     2520

aacgcgaatt ttaacaaaat attaacgctt acaatttcca ttcgccattc aggctgcgca     2580

actgttggga agggcgatcg gtgcgggcct cttcgctatt acgccagctg gcgaaagggg     2640

gatgtgctgc aaggcgatta agttgggtaa cgccagggtt ttcccagtca cgacgttgta     2700

aaacgacggc cagtgaattg taatacgact cactataggg cgaattgggt accgggcccc     2760

ccctcgaggt cgatggtgtc gataagcttg atatcgaatt catgtcacac aaaccgatct     2820

tcgcctcaag gaaacctaat tctacatccg agagactgcc gagatccagt ctacactgat     2880

taattttcgg gccaataatt taaaaaaatc gtgttatata atattatatg tattatatat     2940

atacatcatg atgatactga cagtcatgtc ccattgctaa atagacagac tccatctgcc     3000

gcctccaact gatgttctca atatttaagg ggtcatctcg cattgtttaa taataaacag     3060

actccatcta ccgcctccaa atgatgttct caaaatatat tgtatgaact tatttttatt     3120

acttagtatt attagacaac ttacttgctt tatgaaaaac acttcctatt taggaaacaa     3180

tttataatgg cagttcgttc atttaacaat ttatgtagaa taaatgttat aaatgcgtat     3240

gggaaatctt aaatatggat agcataaatg atatctgcat tgcctaattc gaaatcaaca     3300

gcaacgaaaa aaatcccttg tacaacataa atagtcatcg agaaatatca actatcaaag     3360

aacagctatt cacacgttac tattgagatt attattggac gagaatcaca cactcaactg     3420

tctttctctc ttctagaaat acaggtacaa gtatgtacta ttctcattgt tcatacttct     3480

agtcatttca tcccacatat tccttggatt tctctccaat gaatgacatt ctatcttgca     3540

aattcaacaa ttataataag atataccaaa gtagcggtat agtggcaatc aaaaagcttc     3600

tctggtgtgc ttctcgtatt tatttttatt ctaatgatcc attaaaggta tatatttatt     3660

tcttgttata taatcctttt gtttattaca tgggctggat acataaaggt attttgattt     3720

aattttttgc ttaaattcaa tcccccctcg ttcagtgtca actgtaatgg taggaaatta     3780

ccatactttt gaagaagcaa aaaaaatgaa agaaaaaaaa aatcgtattt ccaggttaga     3840

cgttccgcag aatctagaat gcggtatgcg gtacattgtt cttcgaacgt aaaagttgcg     3900

ctccctgaga tattgtacat ttttgctttt acaagtacaa gtacatcgta caactatgta     3960

ctactgttga tgcatccaca acagtttgtt ttgttttttt ttgttttttt tttttctaat     4020

gattcattac cgctatgtat acctacttgt acttgtagta agccgggtta ttggcgttca     4080

attaatcata gacttatgaa tctgcacggt gtgcgctgcg agttactttt agcttatgca     4140

tgctacttgg gtgtaatatt gggatctgtt cggaaatcaa cggatgctca atcgatttcg     4200

acagtaatta attaagtcat acacaagtca gctttcttcg agcctcatat aagtataagt     4260

agttcaacgt attagcactg tacccagcat ctccgtatcg agaaacacaa caacatgccc     4320

cattggacag atcatgcgga tacacaggtt gtgcagtatc atacatactc gatcagacag     4380

gtcgtctgac catcatacaa gctgaacaag cgctccatac ttgcacgctc tctatataca     4440

cagttaaatt acatatccat agtctaacct ctaacagtta atcttctggt aagcctccca     4500

gccagccttc tggtatcgct tggcctcctc aataggatct cggttctggc cgtacagacc     4560

tcggccgaca attatgatat ccgttccggt agacatgaca tcctcaacag ttcggtactg     4620

ctgtccgaga gcgtctccct tgtcgtcaag acccaccccg ggggtcagaa taagccagtc     4680

ctcagagtcg cccttaggtc ggttctgggc aatgaagcca accacaaact cggggtcgga     4740

tcgggcaagc tcaatggtct gcttggagta ctcgccagtg gccagagagc ccttgcaaga     4800

cagctcggcc agcatgagca gacctctggc cagcttctcg ttgggagagg ggactaggaa     4860

ctccttgtac tgggagttct cgtagtcaga gacgtcctcc ttcttctgtt cagagacagt     4920

ttcctcggca ccagctcgca ggccagcaat gattccggtt ccgggtacac cgtgggcgtt     4980

ggtgatatcg gaccactcgg cgattcggtg acaccggtac tggtgcttga cagtgttgcc     5040

aatatctgcg aactttctgt cctcgaacag gaagaaaccg tgcttaagag caagttcctt     5100

gagggggagc acagtgccgg cgtaggtgaa gtcgtcaatg atgtcgatat gggttttgat     5160

catgcacaca taaggtccga ccttatcggc aagctcaatg agctccttgg tggtggtaac     5220

atccagagaa gcacacaggt tggttttctt ggctgccacg agcttgagca ctcgagcggc     5280

aaaggcggac ttgtggacgt tagctcgagc ttcgtaggag ggcattttgg tggtgaagag     5340

gagactgaaa taaatttagt ctgcagaact ttttatcgga accttatctg gggcagtgaa     5400

gtatatgtta tggtaatagt tacgagttag ttgaacttat agatagactg gactatacgg     5460

ctatcggtcc aaattagaaa gaacgtcaat ggctctctgg gcgtcgcctt tgccgacaaa     5520

aatgtgatca tgatgaaagc cagcaatgac gttgcagctg atattgttgt cggccaaccg     5580

cgccgaaaac gcagctgtca gacccacagc ctccaacgaa gaatgtatcg tcaaagtgat     5640

ccaagcacac tcatagttgg agtcgtactc caaaggcggc aatgacgagt cagacagata     5700

ctcgtcgact caggcgacga cggaattcct gcagcccatc tgcagaattc aggagagacc     5760

gggttggcgg cgtatttgtg tcccaaaaaa cagccccaat tgccccggag aagacggcca     5820

ggccgcctag atgacaaatt caacaactca cagctgactt tctgccattg ccactagggg     5880

ggggcctttt tatatggcca agccaagctc tccacgtcgg ttgggctgca cccaacaata     5940

aatgggtagg gttgcaccaa caaagggatg ggatgggggg tagaagatac gaggataacg     6000

gggctcaatg gcacaaataa gaacgaatac tgccattaag actcgtgatc cagcgactga     6060

caccattgca tcatctaagg gcctcaaaac tacctcggaa ctgctgcgct gatctggaca     6120

ccacagaggt tccgagcact ttaggttgca ccaaatgtcc caccaggtgc aggcagaaaa     6180

cgctggaaca gcgtgtacag tttgtcttaa caaaaagtga gggcgctgag gtcgagcagg     6240

gtggtgtgac ttgttatagc ctttagagct gcgaaagcgc gtatggattt ggctcatcag     6300

gccagattga gggtctgtgg acacatgtca tgttagtgta cttcaatcgc cccctggata     6360

tagccccgac aataggccgt ggcctcattt ttttgccttc cgcacatttc cattgctcgg     6420

tacccacacc ttgcttctcc tgcacttgcc aaccttaata ctggtttaca ttgaccaaca     6480

tcttacaagc ggggggcttg tctagggtat atataaacag tggctctccc aatcggttgc     6540

cagtctcttt tttcctttct ttccccacag attcgaaatc taaactacac atcacacaat     6600

gcctgttact gacgtcctta agcgaaagtc cggtgtcatc gtcggcgacg atgtccgagc     6660

cgtgagtatc cacgacaaga tcagtgtcga gacgacgcgt tttgtgtaat gacacaatcc     6720

gaaagtcgct agcaacacac actctctaca caaactaacc cagctctcca tggcgccgaa     6780

tgtggactcc ggaagcaagg accgcggcgt gagcgcggtc aaagaagtag tctctggcgc     6840

gacggccaac gcgctgagtc cggccgagcg cgtggtgacc aggaaggagc tcgcggggca     6900

cgcctcaagg gagtcggtgt ggattgcggt gaacggccgt gtgtacgatg tgaccggctt     6960

tgagaacgtt caccctggcg gcgagatcat tctgaccgcc gccgggcagg acgcaacgga     7020

cgtgtttgcc gcgtttcaca cgcccgccac gtggaaaatg atgccgcagt tcctcgtggg     7080

aaacctcgag gaggacgcgc tctctgccaa accgtctaag cagcttaatg ggcattcgcc     7140

acacgagtac caagctgata tccgaaagat gcgtgcggaa cttgtcaagc tgcgcgcgtt     7200

cgactcgaac aagttcttct acctgttcaa gttcctgtcc acgtctgcga tttgcgccct     7260

ctcggtggtc atggcgctcg gcatgaagga ctcgatgatc gtcacggcgc tcgccgcgtt     7320

caccatggca ctcttctggc agcagtgcgg ctggctcgca cacgactttc tgcaccatca     7380

ggtgttcaag aacagggtgt tcaacaacct ggtcggtctt gttgttggta atgtctatca     7440

gggcttttcg gtatcctggt ggaagatgaa gcacaaccac caccacgccg ctccaaacgt     7500

gacgtcaacg gccgctgggc cagacccaga catcgacact gtgcccgtgc tcttgtggag     7560

cgagaaactc atcgagggtg atagcaagga gatggaggat ctgcccatgt tcctcatgaa     7620

gaaccagaag atcttttact ggccggttct gtgcgtggcg cgcatcagct ggctcctgca     7680

gagccttctc ttccagcgcg cgccggtctg gaactttgtg ggcggaaaca gctggcgcgc     7740

ggtggagatc gtcgcgcttc tcatgcatca cggcgcctac ttctacttgc tgtccttgct     7800

caagagctgg gtccatgtcg cgctcttttt ggtggtgagc caggcgatgg gtggtgtgct     7860

actcggcgtc gtgttcaccg tcgggcacaa cgcgatgaaa gtcctctccg aggaagaaat     7920

gaagtcaacc gactttgtcc agatgcaggt cctgacgacg agaaatattg agccgacggc     7980

tttcaatcgg tggttcagcg gtggcctcag ctaccagatt gagcaccaca tctggcctca     8040

gctgccccga cacagcttac ccaaggcgcg cgaaattctc accaagtttt gcagcaagta     8100

tgatattccg tacgccagtc aaggcctcat tgaaggtaac atggaagtgt ggaaaatgct     8160

ctcgaagctt ggggaatccc tatagggccg caagtgtgga tggggaagtg agtgcccggt     8220

tctgtgtgca caattggcaa tccaagatgg atggattcaa cacagggata tagcgagcta     8280

cgtggtggtg cgaggatata gcaacggata tttatgtttg acacttgaga atgtacgata     8340

caagcactgt ccaagtacaa tactaaacat actgtacata ctcatactcg tacccgggca     8400

acggtttcac ttgagtgcag tggctagtgc tcttactcgt acagtgtgca atactgcgta     8460

tcatagtctt tgatgtatat cgtattcatt catgttagtt gc                        8502


<210>  45
<211>  8502
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pY109 #2

<400>  45
gtacgagccg gaagcataaa gtgtaaagcc tggggtgcct aatgagtgag ctaactcaca       60

ttaattgcgt tgcgctcact gcccgctttc cagtcgggaa acctgtcgtg ccagctgcat      120

taatgaatcg gccaacgcgc ggggagaggc ggtttgcgta ttgggcgctc ttccgcttcc      180

tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca      240

aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca      300

aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg      360

ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg      420

acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt      480

ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt      540

tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc      600

tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt      660

gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt      720

agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtggcc taactacggc      780

tacactagaa ggacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa      840

agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt      900

tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct      960

acggggtctg acgctcagtg gaacgaaaac tcacgttaag ggattttggt catgagatta     1020

tcaaaaagga tcttcaccta gatcctttta aattaaaaat gaagttttaa atcaatctaa     1080

agtatatatg agtaaacttg gtctgacagt taccaatgct taatcagtga ggcacctatc     1140

tcagcgatct gtctatttcg ttcatccata gttgcctgac tccccgtcgt gtagataact     1200

acgatacggg agggcttacc atctggcccc agtgctgcaa tgataccgcg agacccacgc     1260

tcaccggctc cagatttatc agcaataaac cagccagccg gaagggccga gcgcagaagt     1320

ggtcctgcaa ctttatccgc ctccatccag tctattaatt gttgccggga agctagagta     1380

agtagttcgc cagttaatag tttgcgcaac gttgttgcca ttgctacagg catcgtggtg     1440

tcacgctcgt cgtttggtat ggcttcattc agctccggtt cccaacgatc aaggcgagtt     1500

acatgatccc ccatgttgtg caaaaaagcg gttagctcct tcggtcctcc gatcgttgtc     1560

agaagtaagt tggccgcagt gttatcactc atggttatgg cagcactgca taattctctt     1620

actgtcatgc catccgtaag atgcttttct gtgactggtg agtactcaac caagtcattc     1680

tgagaatagt gtatgcggcg accgagttgc tcttgcccgg cgtcaatacg ggataatacc     1740

gcgccacata gcagaacttt aaaagtgctc atcattggaa aacgttcttc ggggcgaaaa     1800

ctctcaagga tcttaccgct gttgagatcc agttcgatgt aacccactcg tgcacccaac     1860

tgatcttcag catcttttac tttcaccagc gtttctgggt gagcaaaaac aggaaggcaa     1920

aatgccgcaa aaaagggaat aagggcgaca cggaaatgtt gaatactcat actcttcctt     1980

tttcaatatt attgaagcat ttatcagggt tattgtctca tgagcggata catatttgaa     2040

tgtatttaga aaaataaaca aataggggtt ccgcgcacat ttccccgaaa agtgccacct     2100

gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc     2160

gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc     2220

acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt     2280

agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc acgtagtggg     2340

ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt     2400

ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc ttttgattta     2460

taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt     2520

aacgcgaatt ttaacaaaat attaacgctt acaatttcca ttcgccattc aggctgcgca     2580

actgttggga agggcgatcg gtgcgggcct cttcgctatt acgccagctg gcgaaagggg     2640

gatgtgctgc aaggcgatta agttgggtaa cgccagggtt ttcccagtca cgacgttgta     2700

aaacgacggc cagtgaattg taatacgact cactataggg cgaattgggt accgggcccc     2760

ccctcgaggt cgatggtgtc gataagcttg atatcgaatt catgtcacac aaaccgatct     2820

tcgcctcaag gaaacctaat tctacatccg agagactgcc gagatccagt ctacactgat     2880

taattttcgg gccaataatt taaaaaaatc gtgttatata atattatatg tattatatat     2940

atacatcatg atgatactga cagtcatgtc ccattgctaa atagacagac tccatctgcc     3000

gcctccaact gatgttctca atatttaagg ggtcatctcg cattgtttaa taataaacag     3060

actccatcta ccgcctccaa atgatgttct caaaatatat tgtatgaact tatttttatt     3120

acttagtatt attagacaac ttacttgctt tatgaaaaac acttcctatt taggaaacaa     3180

tttataatgg cagttcgttc atttaacaat ttatgtagaa taaatgttat aaatgcgtat     3240

gggaaatctt aaatatggat agcataaatg atatctgcat tgcctaattc gaaatcaaca     3300

gcaacgaaaa aaatcccttg tacaacataa atagtcatcg agaaatatca actatcaaag     3360

aacagctatt cacacgttac tattgagatt attattggac gagaatcaca cactcaactg     3420

tctttctctc ttctagaaat acaggtacaa gtatgtacta ttctcattgt tcatacttct     3480

agtcatttca tcccacatat tccttggatt tctctccaat gaatgacatt ctatcttgca     3540

aattcaacaa ttataataag atataccaaa gtagcggtat agtggcaatc aaaaagcttc     3600

tctggtgtgc ttctcgtatt tatttttatt ctaatgatcc attaaaggta tatatttatt     3660

tcttgttata taatcctttt gtttattaca tgggctggat acataaaggt attttgattt     3720

aattttttgc ttaaattcaa tcccccctcg ttcagtgtca actgtaatgg taggaaatta     3780

ccatactttt gaagaagcaa aaaaaatgaa agaaaaaaaa aatcgtattt ccaggttaga     3840

cgttccgcag aatctagaat gcggtatgcg gtacattgtt cttcgaacgt aaaagttgcg     3900

ctccctgaga tattgtacat ttttgctttt acaagtacaa gtacatcgta caactatgta     3960

ctactgttga tgcatccaca acagtttgtt ttgttttttt ttgttttttt tttttctaat     4020

gattcattac cgctatgtat acctacttgt acttgtagta agccgggtta ttggcgttca     4080

attaatcata gacttatgaa tctgcacggt gtgcgctgcg agttactttt agcttatgca     4140

tgctacttgg gtgtaatatt gggatctgtt cggaaatcaa cggatgctca atcgatttcg     4200

acagtaatta attaagtcat acacaagtca gctttcttcg agcctcatat aagtataagt     4260

agttcaacgt attagcactg tacccagcat ctccgtatcg agaaacacaa caacatgccc     4320

cattggacag atcatgcgga tacacaggtt gtgcagtatc atacatactc gatcagacag     4380

gtcgtctgac catcatacaa gctgaacaag cgctccatac ttgcacgctc tctatataca     4440

cagttaaatt acatatccat agtctaacct ctaacagtta atcttctggt aagcctccca     4500

gccagccttc tggtatcgct tggcctcctc aataggatct cggttctggc cgtacagacc     4560

tcggccgaca attatgatat ccgttccggt agacatgaca tcctcaacag ttcggtactg     4620

ctgtccgaga gcgtctccct tgtcgtcaag acccaccccg ggggtcagaa taagccagtc     4680

ctcagagtcg cccttaggtc ggttctgggc aatgaagcca accacaaact cggggtcgga     4740

tcgggcaagc tcaatggtct gcttggagta ctcgccagtg gccagagagc ccttgcaaga     4800

cagctcggcc agcatgagca gacctctggc cagcttctcg ttgggagagg ggactaggaa     4860

ctccttgtac tgggagttct cgtagtcaga gacgtcctcc ttcttctgtt cagagacagt     4920

ttcctcggca ccagctcgca ggccagcaat gattccggtt ccgggtacac cgtgggcgtt     4980

ggtgatatcg gaccactcgg cgattcggtg acaccggtac tggtgcttga cagtgttgcc     5040

aatatctgcg aactttctgt cctcgaacag gaagaaaccg tgcttaagag caagttcctt     5100

gagggggagc acagtgccgg cgtaggtgaa gtcgtcaatg atgtcgatat gggttttgat     5160

catgcacaca taaggtccga ccttatcggc aagctcaatg agctccttgg tggtggtaac     5220

atccagagaa gcacacaggt tggttttctt ggctgccacg agcttgagca ctcgagcggc     5280

aaaggcggac ttgtggacgt tagctcgagc ttcgtaggag ggcattttgg tggtgaagag     5340

gagactgaaa taaatttagt ctgcagaact ttttatcgga accttatctg gggcagtgaa     5400

gtatatgtta tggtaatagt tacgagttag ttgaacttat agatagactg gactatacgg     5460

ctatcggtcc aaattagaaa gaacgtcaat ggctctctgg gcgtcgcctt tgccgacaaa     5520

aatgtgatca tgatgaaagc cagcaatgac gttgcagctg atattgttgt cggccaaccg     5580

cgccgaaaac gcagctgtca gacccacagc ctccaacgaa gaatgtatcg tcaaagtgat     5640

ccaagcacac tcatagttgg agtcgtactc caaaggcggc aatgacgagt cagacagata     5700

ctcgtcgact caggcgacga cggaattcct gcagcccatc tgcagaattc aggagagacc     5760

gggttggcgg cgtatttgtg tcccaaaaaa cagccccaat tgccccggag aagacggcca     5820

ggccgcctag atgacaaatt caacaactca cagctgactt tctgccattg ccactagggg     5880

ggggcctttt tatatggcca agccaagctc tccacgtcgg ttgggctgca cccaacaata     5940

aatgggtagg gttgcaccaa caaagggatg ggatgggggg tagaagatac gaggataacg     6000

gggctcaatg gcacaaataa gaacgaatac tgccattaag actcgtgatc cagcgactga     6060

caccattgca tcatctaagg gcctcaaaac tacctcggaa ctgctgcgct gatctggaca     6120

ccacagaggt tccgagcact ttaggttgca ccaaatgtcc caccaggtgc aggcagaaaa     6180

cgctggaaca gcgtgtacag tttgtcttaa caaaaagtga gggcgctgag gtcgagcagg     6240

gtggtgtgac ttgttatagc ctttagagct gcgaaagcgc gtatggattt ggctcatcag     6300

gccagattga gggtctgtgg acacatgtca tgttagtgta cttcaatcgc cccctggata     6360

tagccccgac aataggccgt ggcctcattt ttttgccttc cgcacatttc cattgctcgg     6420

tacccacacc ttgcttctcc tgcacttgcc aaccttaata ctggtttaca ttgaccaaca     6480

tcttacaagc ggggggcttg tctagggtat atataaacag tggctctccc aatcggttgc     6540

cagtctcttt tttcctttct ttccccacag attcgaaatc taaactacac atcacacaat     6600

gcctgttact gacgtcctta agcgaaagtc cggtgtcatc gtcggcgacg atgtccgagc     6660

cgtgagtatc cacgacaaga tcagtgtcga gacgacgcgt tttgtgtaat gacacaatcc     6720

gaaagtcgct agcaacacac actctctaca caaactaacc cagctctcca tggcgccgaa     6780

tgtggactcc ggaagcaagg accgcggcgt gagcgcggtc aaagaagtag tctctggcgc     6840

gacggccaac gcgctgagtc cggccgagcg cgtggtgacc aggaaggagc tcgcggggca     6900

cgcctcaagg gagtcggtgt ggattgcggt gaacggccgt gtgtacgatg tgaccggctt     6960

tgagaacgtt caccctggcg gcgagatcat tctgaccgcc gccgggcagg acgcaacgga     7020

cgtgtttgcc gcgtttcaca cgcccgccac gtggaaaatg atgccgcagt tcctcgtggg     7080

aaacctcgag gaggacgcgc tctctgccaa accgtctaag cagcttaatg ggcattcgcc     7140

acacgagtac caagctgata tccgaaagat gcgtgcggaa cttgtcaagc tgcgcgcgtt     7200

cgactcgaac aagttcttct acctgttcaa gttcctgtcc acgtctgcga tttgcgccct     7260

cttggtggtc atggcgctcg gcatgaagga ctcgatgatc gtcacggcgc tcgccgcgtt     7320

caccatggca ctcttctggc agcagtgcgg ctggctcgct cacgactttc tgcaccatca     7380

ggtgttcaag aacagggtgt tcaacaacct ggtcggtctt gttgttggta atgtctatca     7440

gggcttttcg gtatcctggt ggaagatgaa gcacaaccac caccacgccg ctccaaacgt     7500

gacgtcaacg gccgctgggc cagacccaga catcgacact gtgcccgtgc tctcgtggag     7560

cgagaaactc atcgagggtg atagcaagga gatggaggat ctgcccatgt tcctcatgaa     7620

gaaccagaag atcttttact ggccggttct gtgcgtggcg cgcatcagct ggctcctgca     7680

gagccttctc ttccagcgcg cgccggtctg gaactttgtg ggcggaaaca gctggcgcgc     7740

ggtggagacc gtcgcgcttc tcatgcatca cggcgcctac ttctacttgc tgtccttgct     7800

caagagctgg gtccatgtcg tgctcttttt ggtggtgagc caggcgatgg gtggtgtgct     7860

actcggcgtc gtgttcaccg tcgggcgcaa cgcgatgaaa gtcctctccg aggaagaaat     7920

gaagtcaacc gactttgtcc agatgcaggt cctgacgacg agaaatattg agccgacggc     7980

tttcaatcgg tggttcagcg gtggcctcag ctaccagatt gagcaccaca tctggcctca     8040

gctgccccga cacagcttac ccaaggcgcg cgaaattctc accaagtttt gcagcaagta     8100

tgatattccg tacgccagtc aaggcctcat tgaaggtaac atggaagtgt ggaaaatgct     8160

ctcgaagctt ggggaatccc tatagggccg caagtgtgga tggggaagtg agtgcccggt     8220

tctgtgtgca caattggcaa tccaagatgg atggattcaa cacagggata tagcgagcta     8280

cgtggtggtg cgaggatata gcaacggata tttatgtttg acacttgaga atgtacgata     8340

caagcactgt ccaagtacaa tactaaacat actgtacata ctcatactcg tacccgggca     8400

acggtttcac ttgagtgcag tggctagtgc tcttactcgt acagtgtgca atactgcgta     8460

tcatagtctt tgatgtatat cgtattcatt catgttagtt gc                        8502


<210>  46
<211>  1426
<212>  DNA
<213>  Porphyridium cruentum


<220>
<221>  CDS
<222>  (3)..(1418)
<223>  synthetic delta-6 desaturase (codon-optimized for Yarrowia 
       lipolytica)

<400>  46
cc atg gct ccc aac gtc gac tcc gga tcg aag gac cga ggc gtg tct          47
   Met Ala Pro Asn Val Asp Ser Gly Ser Lys Asp Arg Gly Val Ser            
   1               5                   10                  15             

gcc gtc aag gag gtg gtc tcc ggt gct act gcc aac gct ctg tct cct         95
Ala Val Lys Glu Val Val Ser Gly Ala Thr Ala Asn Ala Leu Ser Pro           
                20                  25                  30                

gcc gag cga gtt gtc acc cga aag gag ctg gca gga cac gcc tct cga        143
Ala Glu Arg Val Val Thr Arg Lys Glu Leu Ala Gly His Ala Ser Arg           
            35                  40                  45                    

gaa tcc gtg tgg att gct gtc aac ggc aga gtt tac gat gtt acc gga        191
Glu Ser Val Trp Ile Ala Val Asn Gly Arg Val Tyr Asp Val Thr Gly           
        50                  55                  60                        

ttc gag aac gtg cat ccc ggt ggc gag atc att ctc act gcc gct gga        239
Phe Glu Asn Val His Pro Gly Gly Glu Ile Ile Leu Thr Ala Ala Gly           
    65                  70                  75                            

cag gac gcg acc gat gtc ttt gct gcc ttt cac aca cct gcc acc tgg        287
Gln Asp Ala Thr Asp Val Phe Ala Ala Phe His Thr Pro Ala Thr Trp           
80                  85                  90                  95            

aag atg atg cct cag ttc ctc gtg gga aac ctc gag gaa gac gct ctg        335
Lys Met Met Pro Gln Phe Leu Val Gly Asn Leu Glu Glu Asp Ala Leu           
                100                 105                 110               

tct gcc aag ccc tcc aag cag ctc aat ggt cat tct cca cac gag tac        383
Ser Ala Lys Pro Ser Lys Gln Leu Asn Gly His Ser Pro His Glu Tyr           
            115                 120                 125                   

cag gcc gac att cga aag atg cgt gcc gag ctt gtc aag ctg cga gct        431
Gln Ala Asp Ile Arg Lys Met Arg Ala Glu Leu Val Lys Leu Arg Ala           
        130                 135                 140                       

ttc gat tcc aac aag ttc ttt tac ctg ttc aag ttt ctc tca acc tct        479
Phe Asp Ser Asn Lys Phe Phe Tyr Leu Phe Lys Phe Leu Ser Thr Ser           
    145                 150                 155                           

gcc atc tgt gcg ctg tcg gtg gtc atg gct ctt ggc atg aag gac tcc        527
Ala Ile Cys Ala Leu Ser Val Val Met Ala Leu Gly Met Lys Asp Ser           
160                 165                 170                 175           

atg att gtc aca gcg ctg gct gcc ttt act atg gca ctc ttc tgg cag        575
Met Ile Val Thr Ala Leu Ala Ala Phe Thr Met Ala Leu Phe Trp Gln           
                180                 185                 190               

caa tgc gga tgg ctg gca cac gac ttt ctt cac cat cag gtc ttc aag        623
Gln Cys Gly Trp Leu Ala His Asp Phe Leu His His Gln Val Phe Lys           
            195                 200                 205                   

aac cga gtg ttc aac aat ctg gtc ggt ctc gtt gtc gga aac gtc tac        671
Asn Arg Val Phe Asn Asn Leu Val Gly Leu Val Val Gly Asn Val Tyr           
        210                 215                 220                       

cag ggc ttt tcg gtg tcc tgg tgg aag atg aaa cac aat cat cac cat        719
Gln Gly Phe Ser Val Ser Trp Trp Lys Met Lys His Asn His His His           
    225                 230                 235                           

gcc gct ccc aac gtt acg tct act gcc gct gga cca gac ccc gat atc        767
Ala Ala Pro Asn Val Thr Ser Thr Ala Ala Gly Pro Asp Pro Asp Ile           
240                 245                 250                 255           

gac acc gtt cct gtc ctc ttg tgg tcc gag aag ctt atc gaa ggc gat        815
Asp Thr Val Pro Val Leu Leu Trp Ser Glu Lys Leu Ile Glu Gly Asp           
                260                 265                 270               

tcc aag gag atg gaa gac ctt ccc atg ttc ctc atg aag aac cag aaa        863
Ser Lys Glu Met Glu Asp Leu Pro Met Phe Leu Met Lys Asn Gln Lys           
            275                 280                 285                   

atc ttc tac tgg cct gtt ctg tgt gtg gct cga atc agc tgg ctg ctt        911
Ile Phe Tyr Trp Pro Val Leu Cys Val Ala Arg Ile Ser Trp Leu Leu           
        290                 295                 300                       

cag tcc ctg ctc ttt cag cga gca ccc gtc tgg aac ttc gtt ggt ggc        959
Gln Ser Leu Leu Phe Gln Arg Ala Pro Val Trp Asn Phe Val Gly Gly           
    305                 310                 315                           

aac agc tgg cga gcc gtc gag atc gtt gct ctg ctc atg cac cac gga       1007
Asn Ser Trp Arg Ala Val Glu Ile Val Ala Leu Leu Met His His Gly           
320                 325                 330                 335           

gcc tac ttc tac ctt ctg tcc ttg ctc aag tct tgg gtc cac gtg gca       1055
Ala Tyr Phe Tyr Leu Leu Ser Leu Leu Lys Ser Trp Val His Val Ala           
                340                 345                 350               

ctg ttt ctt gtc gtg tcc cag gct atg ggt ggc gtt ctg ctc gga gtc       1103
Leu Phe Leu Val Val Ser Gln Ala Met Gly Gly Val Leu Leu Gly Val           
            355                 360                 365                   

gtg ttc acc gtt ggt cac aac gcc atg aag gtt ctg agc gag gaa gag       1151
Val Phe Thr Val Gly His Asn Ala Met Lys Val Leu Ser Glu Glu Glu           
        370                 375                 380                       

atg aag tct acc gac ttt gtc cag atg caa gtg ctt act acc cga aac       1199
Met Lys Ser Thr Asp Phe Val Gln Met Gln Val Leu Thr Thr Arg Asn           
    385                 390                 395                           

atc gaa ccc aca gcc ttc aac cga tgg ttc agc ggt ggc ctg tcc tat       1247
Ile Glu Pro Thr Ala Phe Asn Arg Trp Phe Ser Gly Gly Leu Ser Tyr           
400                 405                 410                 415           

cag atc gag cat cac att tgg cct cag ctt ccc aga cac tct ctt ccc       1295
Gln Ile Glu His His Ile Trp Pro Gln Leu Pro Arg His Ser Leu Pro           
                420                 425                 430               

aag gct cgg gag att ctt acc aag ttc tgc tcc aag tac gac att ccc       1343
Lys Ala Arg Glu Ile Leu Thr Lys Phe Cys Ser Lys Tyr Asp Ile Pro           
            435                 440                 445                   

tac gcc tct caa ggt ctc atc gaa ggc aac atg gag gtc tgg aaa atg       1391
Tyr Ala Ser Gln Gly Leu Ile Glu Gly Asn Met Glu Val Trp Lys Met           
        450                 455                 460                       

ctg tcg aaa ctt ggc gag tcc ctg taa gcggccgc                          1426
Leu Ser Lys Leu Gly Glu Ser Leu                                           
    465                 470                                               


<210>  47
<211>  471
<212>  PRT
<213>  Porphyridium cruentum

<400>  47

Met Ala Pro Asn Val Asp Ser Gly Ser Lys Asp Arg Gly Val Ser Ala 
1               5                   10                  15      


Val Lys Glu Val Val Ser Gly Ala Thr Ala Asn Ala Leu Ser Pro Ala 
            20                  25                  30          


Glu Arg Val Val Thr Arg Lys Glu Leu Ala Gly His Ala Ser Arg Glu 
        35                  40                  45              


Ser Val Trp Ile Ala Val Asn Gly Arg Val Tyr Asp Val Thr Gly Phe 
    50                  55                  60                  


Glu Asn Val His Pro Gly Gly Glu Ile Ile Leu Thr Ala Ala Gly Gln 
65                  70                  75                  80  


Asp Ala Thr Asp Val Phe Ala Ala Phe His Thr Pro Ala Thr Trp Lys 
                85                  90                  95      


Met Met Pro Gln Phe Leu Val Gly Asn Leu Glu Glu Asp Ala Leu Ser 
            100                 105                 110         


Ala Lys Pro Ser Lys Gln Leu Asn Gly His Ser Pro His Glu Tyr Gln 
        115                 120                 125             


Ala Asp Ile Arg Lys Met Arg Ala Glu Leu Val Lys Leu Arg Ala Phe 
    130                 135                 140                 


Asp Ser Asn Lys Phe Phe Tyr Leu Phe Lys Phe Leu Ser Thr Ser Ala 
145                 150                 155                 160 


Ile Cys Ala Leu Ser Val Val Met Ala Leu Gly Met Lys Asp Ser Met 
                165                 170                 175     


Ile Val Thr Ala Leu Ala Ala Phe Thr Met Ala Leu Phe Trp Gln Gln 
            180                 185                 190         


Cys Gly Trp Leu Ala His Asp Phe Leu His His Gln Val Phe Lys Asn 
        195                 200                 205             


Arg Val Phe Asn Asn Leu Val Gly Leu Val Val Gly Asn Val Tyr Gln 
    210                 215                 220                 


Gly Phe Ser Val Ser Trp Trp Lys Met Lys His Asn His His His Ala 
225                 230                 235                 240 


Ala Pro Asn Val Thr Ser Thr Ala Ala Gly Pro Asp Pro Asp Ile Asp 
                245                 250                 255     


Thr Val Pro Val Leu Leu Trp Ser Glu Lys Leu Ile Glu Gly Asp Ser 
            260                 265                 270         


Lys Glu Met Glu Asp Leu Pro Met Phe Leu Met Lys Asn Gln Lys Ile 
        275                 280                 285             


Phe Tyr Trp Pro Val Leu Cys Val Ala Arg Ile Ser Trp Leu Leu Gln 
    290                 295                 300                 


Ser Leu Leu Phe Gln Arg Ala Pro Val Trp Asn Phe Val Gly Gly Asn 
305                 310                 315                 320 


Ser Trp Arg Ala Val Glu Ile Val Ala Leu Leu Met His His Gly Ala 
                325                 330                 335     


Tyr Phe Tyr Leu Leu Ser Leu Leu Lys Ser Trp Val His Val Ala Leu 
            340                 345                 350         


Phe Leu Val Val Ser Gln Ala Met Gly Gly Val Leu Leu Gly Val Val 
        355                 360                 365             


Phe Thr Val Gly His Asn Ala Met Lys Val Leu Ser Glu Glu Glu Met 
    370                 375                 380                 


Lys Ser Thr Asp Phe Val Gln Met Gln Val Leu Thr Thr Arg Asn Ile 
385                 390                 395                 400 


Glu Pro Thr Ala Phe Asn Arg Trp Phe Ser Gly Gly Leu Ser Tyr Gln 
                405                 410                 415     


Ile Glu His His Ile Trp Pro Gln Leu Pro Arg His Ser Leu Pro Lys 
            420                 425                 430         


Ala Arg Glu Ile Leu Thr Lys Phe Cys Ser Lys Tyr Asp Ile Pro Tyr 
        435                 440                 445             


Ala Ser Gln Gly Leu Ile Glu Gly Asn Met Glu Val Trp Lys Met Leu 
    450                 455                 460                 


Ser Lys Leu Gly Glu Ser Leu 
465                 470     


<210>  48
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Porphyridium cruentum  delta-6 desaturase His-rich motif

<400>  48

His Asp Phe Leu His 
1               5   


<210>  49
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Porphyridium cruentum  delta-6 desaturase His-rich motif

<400>  49

His Asn His His His 
1               5   


<210>  50
<211>  5
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Porphyridium cruentum  delta-6 desaturase His-rich motif

<400>  50

Gln Ile Glu His His 
1               5   


<210>  51
<211>  693
<212>  DNA
<213>  Porphyridium cruentum

<400>  51
tggcagcaga tgggctggtt ragccatgac tttctgcacc atcaggtgtt caagaacagg       60

gtgttcaaca acctggtcgg tcttgttgtt ggtaatgtct atcagggctt ttcggtatcc      120

tggtggaaga tgaagcacaa ccaccaccac gccgctccaa acgtgacgtc aacggccgct      180

gggccagacc cagacatcga cactgtgccc gtgctcttgt ggagcgagaa actcatcgag      240

ggtgatagca aggagatgga ggatctgccc atgttcctca tgaagaacca gaagatcttt      300

tactggccgg ttctgtgcgt ggcgcgcatc agctggctcc tgcagagcct tctcttccag      360

cgcgcgccgg tctggaactt tgtgggcgga aacagctggc gcgcggtgga gatcgtcgcg      420

cttctcatgc atcacggcgc ctacttctac ttgctgtcct tgctcaagag ctgggtccat      480

gtcgcgctct ttttggtggt gagccaggcg atgggtggtg tgctactcgg cgtcgtgttc      540

accgtcgggc acaacgcgat gaaagtcctc tccgaggaag aaatgaagtc aaccgacttt      600

gtccagatgc aggtcctgac gacgagaaat attgagccga cggctttcaa tcggtggttc      660

agcggyggct tcagctacca gatygagcac cac                                   693


<210>  52
<211>  410
<212>  DNA
<213>  Porphyridium cruentum

<400>  52
cctacttcta cttgctgtcc ttgctcaaga gctgggtcca tgtcgcgctc tttttggtgg       60

tgagccaggc gatgggtggt gtgctactcg gcgtcgtgtt caccgtcggg cacaacgcga      120

tgaaagtcct ctccgaggaa gaaatgaagt caaccgactt tgtccagatg caggtcctga      180

cgacgagaaa tattgagccg acggctttca atcggtggtt cagcggtggc ctcagctacc      240

agattgagca ccacatctgg cctcagctgc cccgacacag cttacccaag gcgcgcgaaa      300

ttctcaccaa gttttgcagc aagtatgata ttccgtacgc cagtcaaggc ctcattgaag      360

gtaacatgga agtgtggaaa atgctctcga agcttgggga atccctatag                 410


<210>  53
<211>  822
<212>  DNA
<213>  Porphyridium cruentum

<400>  53
atggcgccga atgtggactc cggaagcaag gaccgcggcg tgagcgcggt caaagaagta       60

gtctctggcg cgacggccaa cgcgctgagt ccggccgagc gcgtggtgac caggaaggag      120

ctcgcggggc acgcctcaag ggagtcggtg tggattgcgg tgaacggccg tgtgtacgat      180

gtgaccggct ttgagaacgt tcaccctggc ggcgagatca ttctgaccgc cgccgggcag      240

gacgcaacgg acgtgtttgc cgcgtttcac acgcccgcca cgtggaaaat gatgccgcag      300

ttcctcgtgg gaaacctcga ggaggacgcg ctctctgcca aaccgtctaa gcagcttaat      360

gggcattcgc cacacgagta ccaagctgat atccgaaaga tgcgtgcgga acttgtcaag      420

ctgcgcgcgt tcgactcgaa caagttcttc tacctgttca agttcctgtc cacgtctgcg      480

atttgcgccc tctcggtggt catggcgctc ggcatgaagg actcgatgat cgtcacggcg      540

ctcgccgcgt tcaccatggc actcttctgg cagcagtgcg gctggctcgc acacgacttt      600

ctgcaccatc aggtgttcaa gaacagggtg ttcaacaacc tggtcggtct tgttgttggt      660

aatgtctatc agggcttttc ggtatcctgg tggaagatga agcacaacca ccaccacgcc      720

gctccaaacg tgacgtcaac ggccgctggg ccagacccag acatcgacac tgtgcccgtg      780

ctcttgtgga gcgagaaact catcgagggt gatagcaagg ag                         822


