                         SEQUENCE LISTING

<110>  E.I. du Pont de Nemours and Company
       Hong, Raymond
       Xue, Zhixiong
       Zhu, Quinn
 
<120>  INCREASED OIL CONTENT BY INCREASING YAP1 TRANSCRIPTION FACTOR 
       ACTIVITY IN OLEAGINOUS YEASTS

<130>  CL4911

<150>  61/428655
<151>  2010-12-30

<160>  47    

<170>  PatentIn version 3.5

<210>  1
<211>  1953
<212>  DNA
<213>  Saccharomyces cerevisiae


<220>
<221>  CDS
<222>  (1)..(1953)
<223>  GenBank Accession No. NM_001182362

<400>  1
atg agt gtg tct acc gcc aag agg tcg ctg gat gtc gtt tct ccg ggt         48
Met Ser Val Ser Thr Ala Lys Arg Ser Leu Asp Val Val Ser Pro Gly           
1               5                   10                  15                

tca tta gcg gag ttt gag ggt tca aaa tct cgt cac gat gaa ata gaa         96
Ser Leu Ala Glu Phe Glu Gly Ser Lys Ser Arg His Asp Glu Ile Glu           
            20                  25                  30                    

aat gaa cat aga cgt act ggt aca cgt gat ggc gag gat agc gag caa        144
Asn Glu His Arg Arg Thr Gly Thr Arg Asp Gly Glu Asp Ser Glu Gln           
        35                  40                  45                        

ccg aag aag aag ggt agc aaa act agc aaa aag caa gat ttg gat cct        192
Pro Lys Lys Lys Gly Ser Lys Thr Ser Lys Lys Gln Asp Leu Asp Pro           
    50                  55                  60                            

gaa act aag cag aag agg act gcc caa aat cgg gcc gct caa aga gct        240
Glu Thr Lys Gln Lys Arg Thr Ala Gln Asn Arg Ala Ala Gln Arg Ala           
65                  70                  75                  80            

ttt agg gaa cgt aag gag agg aag atg aag gaa ttg gag aag aag gta        288
Phe Arg Glu Arg Lys Glu Arg Lys Met Lys Glu Leu Glu Lys Lys Val           
                85                  90                  95                

caa agt tta gag agt att cag cag caa aat gaa gtg gaa gct act ttt        336
Gln Ser Leu Glu Ser Ile Gln Gln Gln Asn Glu Val Glu Ala Thr Phe           
            100                 105                 110                   

ttg agg gac cag tta atc act ctg gtg aat gag tta aaa aaa tat aga        384
Leu Arg Asp Gln Leu Ile Thr Leu Val Asn Glu Leu Lys Lys Tyr Arg           
        115                 120                 125                       

cca gag aca aga aat gac tca aaa gtg ctg gaa tat tta gca agg cga        432
Pro Glu Thr Arg Asn Asp Ser Lys Val Leu Glu Tyr Leu Ala Arg Arg           
    130                 135                 140                           

gat cct aat ttg cat ttt tca aaa aat aac gtt aac cac agc aat agc        480
Asp Pro Asn Leu His Phe Ser Lys Asn Asn Val Asn His Ser Asn Ser           
145                 150                 155                 160           

gag cca att gac aca ccc aat gat gac ata caa gaa aat gtt aaa caa        528
Glu Pro Ile Asp Thr Pro Asn Asp Asp Ile Gln Glu Asn Val Lys Gln           
                165                 170                 175               

aag atg aat ttc acg ttt caa tat ccg ctt gat aac gac aac gac aac        576
Lys Met Asn Phe Thr Phe Gln Tyr Pro Leu Asp Asn Asp Asn Asp Asn           
            180                 185                 190                   

gac aac agt aaa aat gtg ggg aaa caa tta cct tca cca aat gat cca        624
Asp Asn Ser Lys Asn Val Gly Lys Gln Leu Pro Ser Pro Asn Asp Pro           
        195                 200                 205                       

agt cat tcg gct cct atg cct ata aat cag aca caa aag aaa tta agt        672
Ser His Ser Ala Pro Met Pro Ile Asn Gln Thr Gln Lys Lys Leu Ser           
    210                 215                 220                           

gac gct aca gat tcc tcc agc gct act ttg gat tcc ctt tca aat agt        720
Asp Ala Thr Asp Ser Ser Ser Ala Thr Leu Asp Ser Leu Ser Asn Ser           
225                 230                 235                 240           

aac gat gtt ctt aat aac aca cca aac tcc tcc act tcg atg gat tgg        768
Asn Asp Val Leu Asn Asn Thr Pro Asn Ser Ser Thr Ser Met Asp Trp           
                245                 250                 255               

tta gat aat gta ata tat act aac agg ttt gtg tca ggt gat gat ggc        816
Leu Asp Asn Val Ile Tyr Thr Asn Arg Phe Val Ser Gly Asp Asp Gly           
            260                 265                 270                   

agc aat agt aaa act aag aat tta gac agt aat atg ttt tct aat gac        864
Ser Asn Ser Lys Thr Lys Asn Leu Asp Ser Asn Met Phe Ser Asn Asp           
        275                 280                 285                       

ttt aat ttt gaa aac caa ttt gat gaa caa gtt tcg gag ttt tgt tcg        912
Phe Asn Phe Glu Asn Gln Phe Asp Glu Gln Val Ser Glu Phe Cys Ser           
    290                 295                 300                           

aaa atg aac cag gta tgt gga aca agg caa tgt ccc att ccc aag aaa        960
Lys Met Asn Gln Val Cys Gly Thr Arg Gln Cys Pro Ile Pro Lys Lys           
305                 310                 315                 320           

ccc atc tcg gct ctt gat aaa gaa gtt ttc gcg tca tct tct ata cta       1008
Pro Ile Ser Ala Leu Asp Lys Glu Val Phe Ala Ser Ser Ser Ile Leu           
                325                 330                 335               

agt tca aat tct cct gct tta aca aat act tgg gaa tca cat tct aat       1056
Ser Ser Asn Ser Pro Ala Leu Thr Asn Thr Trp Glu Ser His Ser Asn           
            340                 345                 350                   

att aca gat aat act cct gct aat gtc att gct act gat gct act aaa       1104
Ile Thr Asp Asn Thr Pro Ala Asn Val Ile Ala Thr Asp Ala Thr Lys           
        355                 360                 365                       

tat gaa aat tcc ttc tcc ggt ttt ggc cga ctt ggt ttc gat atg agt       1152
Tyr Glu Asn Ser Phe Ser Gly Phe Gly Arg Leu Gly Phe Asp Met Ser           
    370                 375                 380                           

gcc aat cat tac gtc gtg aat gat aat agc act ggt agc act gat agc       1200
Ala Asn His Tyr Val Val Asn Asp Asn Ser Thr Gly Ser Thr Asp Ser           
385                 390                 395                 400           

act ggt agc act ggc aat aag aac aaa aag aac aat aat aat agc gat       1248
Thr Gly Ser Thr Gly Asn Lys Asn Lys Lys Asn Asn Asn Asn Ser Asp           
                405                 410                 415               

gat gta ctc cca ttc ata tcc gag tca ccg ttt gat atg aac caa gtt       1296
Asp Val Leu Pro Phe Ile Ser Glu Ser Pro Phe Asp Met Asn Gln Val           
            420                 425                 430                   

act aat ttt ttt agt ccg gga tct acc ggc atc ggc aat aat gct gcc       1344
Thr Asn Phe Phe Ser Pro Gly Ser Thr Gly Ile Gly Asn Asn Ala Ala           
        435                 440                 445                       

tct aac acc aat ccc agc cta ctg caa agc agc aaa gag gat ata cct       1392
Ser Asn Thr Asn Pro Ser Leu Leu Gln Ser Ser Lys Glu Asp Ile Pro           
    450                 455                 460                           

ttt atc aac gca aat ctg gct ttc cca gac gac aat tca act aat att       1440
Phe Ile Asn Ala Asn Leu Ala Phe Pro Asp Asp Asn Ser Thr Asn Ile           
465                 470                 475                 480           

caa tta caa cct ttc tct gaa tct caa tct caa aat aag ttt gac tac       1488
Gln Leu Gln Pro Phe Ser Glu Ser Gln Ser Gln Asn Lys Phe Asp Tyr           
                485                 490                 495               

gac atg ttt ttt aga gat tca tcg aag gaa ggt aac aat tta ttt gga       1536
Asp Met Phe Phe Arg Asp Ser Ser Lys Glu Gly Asn Asn Leu Phe Gly           
            500                 505                 510                   

gag ttt tta gag gat gac gat gat gac aaa aaa gcc gct aat atg tca       1584
Glu Phe Leu Glu Asp Asp Asp Asp Asp Lys Lys Ala Ala Asn Met Ser           
        515                 520                 525                       

gac gat gag tca agt tta atc aag aac cag tta att aac gaa gaa cca       1632
Asp Asp Glu Ser Ser Leu Ile Lys Asn Gln Leu Ile Asn Glu Glu Pro           
    530                 535                 540                           

gag ctt ccg aaa caa tat cta caa tcg gta cca gga aat gaa agc gaa       1680
Glu Leu Pro Lys Gln Tyr Leu Gln Ser Val Pro Gly Asn Glu Ser Glu           
545                 550                 555                 560           

atc tca caa aaa aat ggc agt agt tta cag aat gct gac aaa atc aat       1728
Ile Ser Gln Lys Asn Gly Ser Ser Leu Gln Asn Ala Asp Lys Ile Asn           
                565                 570                 575               

aat ggc aat gat aac gat aat gat aat gat gtc gtt cca tct aag gaa       1776
Asn Gly Asn Asp Asn Asp Asn Asp Asn Asp Val Val Pro Ser Lys Glu           
            580                 585                 590                   

ggc tct tta cta agg tgt tcg gaa att tgg gat aga ata aca aca cat       1824
Gly Ser Leu Leu Arg Cys Ser Glu Ile Trp Asp Arg Ile Thr Thr His           
        595                 600                 605                       

ccg aaa tac tca gat att gat gtc gat ggt tta tgt tcc gag cta atg       1872
Pro Lys Tyr Ser Asp Ile Asp Val Asp Gly Leu Cys Ser Glu Leu Met           
    610                 615                 620                           

gca aag gca aaa tgt tca gaa aga ggg gtt gtc atc aat gca gaa gac       1920
Ala Lys Ala Lys Cys Ser Glu Arg Gly Val Val Ile Asn Ala Glu Asp           
625                 630                 635                 640           

gtt caa tta gct ttg aat aag cat atg aac taa                           1953
Val Gln Leu Ala Leu Asn Lys His Met Asn                                   
                645                 650                                   


<210>  2
<211>  650
<212>  PRT
<213>  Saccharomyces cerevisiae

<400>  2

Met Ser Val Ser Thr Ala Lys Arg Ser Leu Asp Val Val Ser Pro Gly 
1               5                   10                  15      


Ser Leu Ala Glu Phe Glu Gly Ser Lys Ser Arg His Asp Glu Ile Glu 
            20                  25                  30          


Asn Glu His Arg Arg Thr Gly Thr Arg Asp Gly Glu Asp Ser Glu Gln 
        35                  40                  45              


Pro Lys Lys Lys Gly Ser Lys Thr Ser Lys Lys Gln Asp Leu Asp Pro 
    50                  55                  60                  


Glu Thr Lys Gln Lys Arg Thr Ala Gln Asn Arg Ala Ala Gln Arg Ala 
65                  70                  75                  80  


Phe Arg Glu Arg Lys Glu Arg Lys Met Lys Glu Leu Glu Lys Lys Val 
                85                  90                  95      


Gln Ser Leu Glu Ser Ile Gln Gln Gln Asn Glu Val Glu Ala Thr Phe 
            100                 105                 110         


Leu Arg Asp Gln Leu Ile Thr Leu Val Asn Glu Leu Lys Lys Tyr Arg 
        115                 120                 125             


Pro Glu Thr Arg Asn Asp Ser Lys Val Leu Glu Tyr Leu Ala Arg Arg 
    130                 135                 140                 


Asp Pro Asn Leu His Phe Ser Lys Asn Asn Val Asn His Ser Asn Ser 
145                 150                 155                 160 


Glu Pro Ile Asp Thr Pro Asn Asp Asp Ile Gln Glu Asn Val Lys Gln 
                165                 170                 175     


Lys Met Asn Phe Thr Phe Gln Tyr Pro Leu Asp Asn Asp Asn Asp Asn 
            180                 185                 190         


Asp Asn Ser Lys Asn Val Gly Lys Gln Leu Pro Ser Pro Asn Asp Pro 
        195                 200                 205             


Ser His Ser Ala Pro Met Pro Ile Asn Gln Thr Gln Lys Lys Leu Ser 
    210                 215                 220                 


Asp Ala Thr Asp Ser Ser Ser Ala Thr Leu Asp Ser Leu Ser Asn Ser 
225                 230                 235                 240 


Asn Asp Val Leu Asn Asn Thr Pro Asn Ser Ser Thr Ser Met Asp Trp 
                245                 250                 255     


Leu Asp Asn Val Ile Tyr Thr Asn Arg Phe Val Ser Gly Asp Asp Gly 
            260                 265                 270         


Ser Asn Ser Lys Thr Lys Asn Leu Asp Ser Asn Met Phe Ser Asn Asp 
        275                 280                 285             


Phe Asn Phe Glu Asn Gln Phe Asp Glu Gln Val Ser Glu Phe Cys Ser 
    290                 295                 300                 


Lys Met Asn Gln Val Cys Gly Thr Arg Gln Cys Pro Ile Pro Lys Lys 
305                 310                 315                 320 


Pro Ile Ser Ala Leu Asp Lys Glu Val Phe Ala Ser Ser Ser Ile Leu 
                325                 330                 335     


Ser Ser Asn Ser Pro Ala Leu Thr Asn Thr Trp Glu Ser His Ser Asn 
            340                 345                 350         


Ile Thr Asp Asn Thr Pro Ala Asn Val Ile Ala Thr Asp Ala Thr Lys 
        355                 360                 365             


Tyr Glu Asn Ser Phe Ser Gly Phe Gly Arg Leu Gly Phe Asp Met Ser 
    370                 375                 380                 


Ala Asn His Tyr Val Val Asn Asp Asn Ser Thr Gly Ser Thr Asp Ser 
385                 390                 395                 400 


Thr Gly Ser Thr Gly Asn Lys Asn Lys Lys Asn Asn Asn Asn Ser Asp 
                405                 410                 415     


Asp Val Leu Pro Phe Ile Ser Glu Ser Pro Phe Asp Met Asn Gln Val 
            420                 425                 430         


Thr Asn Phe Phe Ser Pro Gly Ser Thr Gly Ile Gly Asn Asn Ala Ala 
        435                 440                 445             


Ser Asn Thr Asn Pro Ser Leu Leu Gln Ser Ser Lys Glu Asp Ile Pro 
    450                 455                 460                 


Phe Ile Asn Ala Asn Leu Ala Phe Pro Asp Asp Asn Ser Thr Asn Ile 
465                 470                 475                 480 


Gln Leu Gln Pro Phe Ser Glu Ser Gln Ser Gln Asn Lys Phe Asp Tyr 
                485                 490                 495     


Asp Met Phe Phe Arg Asp Ser Ser Lys Glu Gly Asn Asn Leu Phe Gly 
            500                 505                 510         


Glu Phe Leu Glu Asp Asp Asp Asp Asp Lys Lys Ala Ala Asn Met Ser 
        515                 520                 525             


Asp Asp Glu Ser Ser Leu Ile Lys Asn Gln Leu Ile Asn Glu Glu Pro 
    530                 535                 540                 


Glu Leu Pro Lys Gln Tyr Leu Gln Ser Val Pro Gly Asn Glu Ser Glu 
545                 550                 555                 560 


Ile Ser Gln Lys Asn Gly Ser Ser Leu Gln Asn Ala Asp Lys Ile Asn 
                565                 570                 575     


Asn Gly Asn Asp Asn Asp Asn Asp Asn Asp Val Val Pro Ser Lys Glu 
            580                 585                 590         


Gly Ser Leu Leu Arg Cys Ser Glu Ile Trp Asp Arg Ile Thr Thr His 
        595                 600                 605             


Pro Lys Tyr Ser Asp Ile Asp Val Asp Gly Leu Cys Ser Glu Leu Met 
    610                 615                 620                 


Ala Lys Ala Lys Cys Ser Glu Arg Gly Val Val Ile Asn Ala Glu Asp 
625                 630                 635                 640 


Val Gln Leu Ala Leu Asn Lys His Met Asn 
                645                 650 


<210>  3
<211>  1605
<212>  DNA
<213>  Yarrowia lipolytica


<220>
<221>  CDS
<222>  (1)..(1605)
<223>  YALI0F03388; GenBank Accession No. XM_504945

<400>  3
atg tac tca gac tac aac att cct ggt gcc atg ccg gcg tcc atg gcc         48
Met Tyr Ser Asp Tyr Asn Ile Pro Gly Ala Met Pro Ala Ser Met Ala           
1               5                   10                  15                

atg cct ccg ttc aaa cag gag ttt gac tac gcc caa tac gac ctt aac         96
Met Pro Pro Phe Lys Gln Glu Phe Asp Tyr Ala Gln Tyr Asp Leu Asn           
            20                  25                  30                    

cag ccc ctg ccc ccg cag cag caa caa cag cct atc gac ctg acc cct        144
Gln Pro Leu Pro Pro Gln Gln Gln Gln Gln Pro Ile Asp Leu Thr Pro           
        35                  40                  45                        

gga ggg ccc ctc ccc gtc tcg gat tac tcg acg tcg tca tac acc ctg        192
Gly Gly Pro Leu Pro Val Ser Asp Tyr Ser Thr Ser Ser Tyr Thr Leu           
    50                  55                  60                            

gac aac gac tca cag aag cga aaa atg tcc ccg gga gag tcc acc agt        240
Asp Asn Asp Ser Gln Lys Arg Lys Met Ser Pro Gly Glu Ser Thr Ser           
65                  70                  75                  80            

gac gga ggc gcc gac gac gag tct cca gaa gga gat gac ggt gag gcc        288
Asp Gly Gly Ala Asp Asp Glu Ser Pro Glu Gly Asp Asp Gly Glu Ala           
                85                  90                  95                

gac ccc aag aag ccc cga aag ccc ggc cga aag ccc gaa acc acc atc        336
Asp Pro Lys Lys Pro Arg Lys Pro Gly Arg Lys Pro Glu Thr Thr Ile           
            100                 105                 110                   

ccc gcg tcc aaa cgc aag gct cag aac cgg gct gcc caa agg gcc ttc        384
Pro Ala Ser Lys Arg Lys Ala Gln Asn Arg Ala Ala Gln Arg Ala Phe           
        115                 120                 125                       

aga gag cga aag gaa aag cat ctg cgc gac ctg gaa acc aaa ata tct        432
Arg Glu Arg Lys Glu Lys His Leu Arg Asp Leu Glu Thr Lys Ile Ser           
    130                 135                 140                           

cag ctc gag ggc gag acg gca gcc aaa aac tcg gaa aac gag ttc ctg        480
Gln Leu Glu Gly Glu Thr Ala Ala Lys Asn Ser Glu Asn Glu Phe Leu           
145                 150                 155                 160           

cgc ttc cag gtc cag cgg ctt cag aac gag ctc aag ctt tac cgt gag        528
Arg Phe Gln Val Gln Arg Leu Gln Asn Glu Leu Lys Leu Tyr Arg Glu           
                165                 170                 175               

aag cct gcc ggc act tcg gga gcc tct gga gtc tct gga gcc gga gca        576
Lys Pro Ala Gly Thr Ser Gly Ala Ser Gly Val Ser Gly Ala Gly Ala           
            180                 185                 190                   

ccc gct tca aac gtg cat tcg gct ccc atc ccg gag atg tcg tcc aaa        624
Pro Ala Ser Asn Val His Ser Ala Pro Ile Pro Glu Met Ser Ser Lys           
        195                 200                 205                       

ccg ttc acg ttc gag ttc ccc tcg tac aac gtg ccc aag ccg acc gat        672
Pro Phe Thr Phe Glu Phe Pro Ser Tyr Asn Val Pro Lys Pro Thr Asp           
    210                 215                 220                           

gtg gag cga gag gca cgc gag caa ctg caa cga gag cag atc cga ggc        720
Val Glu Arg Glu Ala Arg Glu Gln Leu Gln Arg Glu Gln Ile Arg Gly           
225                 230                 235                 240           

tac ttg cag cgc aag ccc tca tct gtg gcc tcc gac acc act tct cct        768
Tyr Leu Gln Arg Lys Pro Ser Ser Val Ala Ser Asp Thr Thr Ser Pro           
                245                 250                 255               

gca tct caa acc tcg tgc aac cag tct ccc tgc acc aac ccc tcg gca        816
Ala Ser Gln Thr Ser Cys Asn Gln Ser Pro Cys Thr Asn Pro Ser Ala           
            260                 265                 270                   

tac act tcg ccc cag agc cag agt gga agt gtg agc cag cag aag ccc        864
Tyr Thr Ser Pro Gln Ser Gln Ser Gly Ser Val Ser Gln Gln Lys Pro           
        275                 280                 285                       

ctg ttg ggt gct acc atc gct gcc atg aac ggc aag ccc gac ccc cat        912
Leu Leu Gly Ala Thr Ile Ala Ala Met Asn Gly Lys Pro Asp Pro His           
    290                 295                 300                           

gct gtt gac ttt tgt gct gag ctc tcc aag gcc tgt gta aac aag gcc        960
Ala Val Asp Phe Cys Ala Glu Leu Ser Lys Ala Cys Val Asn Lys Ala           
305                 310                 315                 320           

gag ctg ctg cag cga tcc gcc aca gcc agt gca tct ccc aca acc tcc       1008
Glu Leu Leu Gln Arg Ser Ala Thr Ala Ser Ala Ser Pro Thr Thr Ser           
                325                 330                 335               

aac acg gtg gta ccg tcc gca gct gca ccg ggt agc act cag cag tcg       1056
Asn Thr Val Val Pro Ser Ala Ala Ala Pro Gly Ser Thr Gln Gln Ser           
            340                 345                 350                   

gca ggc cag ccc tct gta tcc act cct acc tcc tcc aca act gcc cct       1104
Ala Gly Gln Pro Ser Val Ser Thr Pro Thr Ser Ser Thr Thr Ala Pro           
        355                 360                 365                       

cct caa ttg tct gca tct gtc gct aca gcc ggc tct gat ctt ccc gga       1152
Pro Gln Leu Ser Ala Ser Val Ala Thr Ala Gly Ser Asp Leu Pro Gly           
    370                 375                 380                           

tcg gac ttc ctg ttt gac atg ccc ttc gac atg gac ttt atg tcg tac       1200
Ser Asp Phe Leu Phe Asp Met Pro Phe Asp Met Asp Phe Met Ser Tyr           
385                 390                 395                 400           

cga gac ccc gtt tcc gag acg gca cat ctg gac gac ttt tcg ctg ccc       1248
Arg Asp Pro Val Ser Glu Thr Ala His Leu Asp Asp Phe Ser Leu Pro           
                405                 410                 415               

gag ctc acg aca gaa aca tcc atg ttt gat cct ctg gac ccc cat tcc       1296
Glu Leu Thr Thr Glu Thr Ser Met Phe Asp Pro Leu Asp Pro His Ser           
            420                 425                 430                   

agc agc gac gtt att tct ggc aag cct ctg tct acc atg ggc gct aca       1344
Ser Ser Asp Val Ile Ser Gly Lys Pro Leu Ser Thr Met Gly Ala Thr           
        435                 440                 445                       

cac agt ggt gtc aac aac gga cag gga agt ggt gct ccc gaa gtc aag       1392
His Ser Gly Val Asn Asn Gly Gln Gly Ser Gly Ala Pro Glu Val Lys           
    450                 455                 460                           

aag gag gag gat gag gac ctg ctc atg ttc tcc aag ccc aag acg ctc       1440
Lys Glu Glu Asp Glu Asp Leu Leu Met Phe Ser Lys Pro Lys Thr Leu           
465                 470                 475                 480           

atg aac tgc acc gct gtg tgg gac cgt atc acg tcg cat ccc aag ttt       1488
Met Asn Cys Thr Ala Val Trp Asp Arg Ile Thr Ser His Pro Lys Phe           
                485                 490                 495               

ggc gat atc gac atc gag ggc ctg tgt tcg gag ctg cga aac aag gca       1536
Gly Asp Ile Asp Ile Glu Gly Leu Cys Ser Glu Leu Arg Asn Lys Ala           
            500                 505                 510                   

aag tgc agt gag agt ggc gtc gtg ttg acg gag ttg gac gtg gat ggt       1584
Lys Cys Ser Glu Ser Gly Val Val Leu Thr Glu Leu Asp Val Asp Gly           
        515                 520                 525                       

gtc ctg tca acg ttc cag taa                                           1605
Val Leu Ser Thr Phe Gln                                                   
    530                                                                   


<210>  4
<211>  534
<212>  PRT
<213>  Yarrowia lipolytica

<400>  4

Met Tyr Ser Asp Tyr Asn Ile Pro Gly Ala Met Pro Ala Ser Met Ala 
1               5                   10                  15      


Met Pro Pro Phe Lys Gln Glu Phe Asp Tyr Ala Gln Tyr Asp Leu Asn 
            20                  25                  30          


Gln Pro Leu Pro Pro Gln Gln Gln Gln Gln Pro Ile Asp Leu Thr Pro 
        35                  40                  45              


Gly Gly Pro Leu Pro Val Ser Asp Tyr Ser Thr Ser Ser Tyr Thr Leu 
    50                  55                  60                  


Asp Asn Asp Ser Gln Lys Arg Lys Met Ser Pro Gly Glu Ser Thr Ser 
65                  70                  75                  80  


Asp Gly Gly Ala Asp Asp Glu Ser Pro Glu Gly Asp Asp Gly Glu Ala 
                85                  90                  95      


Asp Pro Lys Lys Pro Arg Lys Pro Gly Arg Lys Pro Glu Thr Thr Ile 
            100                 105                 110         


Pro Ala Ser Lys Arg Lys Ala Gln Asn Arg Ala Ala Gln Arg Ala Phe 
        115                 120                 125             


Arg Glu Arg Lys Glu Lys His Leu Arg Asp Leu Glu Thr Lys Ile Ser 
    130                 135                 140                 


Gln Leu Glu Gly Glu Thr Ala Ala Lys Asn Ser Glu Asn Glu Phe Leu 
145                 150                 155                 160 


Arg Phe Gln Val Gln Arg Leu Gln Asn Glu Leu Lys Leu Tyr Arg Glu 
                165                 170                 175     


Lys Pro Ala Gly Thr Ser Gly Ala Ser Gly Val Ser Gly Ala Gly Ala 
            180                 185                 190         


Pro Ala Ser Asn Val His Ser Ala Pro Ile Pro Glu Met Ser Ser Lys 
        195                 200                 205             


Pro Phe Thr Phe Glu Phe Pro Ser Tyr Asn Val Pro Lys Pro Thr Asp 
    210                 215                 220                 


Val Glu Arg Glu Ala Arg Glu Gln Leu Gln Arg Glu Gln Ile Arg Gly 
225                 230                 235                 240 


Tyr Leu Gln Arg Lys Pro Ser Ser Val Ala Ser Asp Thr Thr Ser Pro 
                245                 250                 255     


Ala Ser Gln Thr Ser Cys Asn Gln Ser Pro Cys Thr Asn Pro Ser Ala 
            260                 265                 270         


Tyr Thr Ser Pro Gln Ser Gln Ser Gly Ser Val Ser Gln Gln Lys Pro 
        275                 280                 285             


Leu Leu Gly Ala Thr Ile Ala Ala Met Asn Gly Lys Pro Asp Pro His 
    290                 295                 300                 


Ala Val Asp Phe Cys Ala Glu Leu Ser Lys Ala Cys Val Asn Lys Ala 
305                 310                 315                 320 


Glu Leu Leu Gln Arg Ser Ala Thr Ala Ser Ala Ser Pro Thr Thr Ser 
                325                 330                 335     


Asn Thr Val Val Pro Ser Ala Ala Ala Pro Gly Ser Thr Gln Gln Ser 
            340                 345                 350         


Ala Gly Gln Pro Ser Val Ser Thr Pro Thr Ser Ser Thr Thr Ala Pro 
        355                 360                 365             


Pro Gln Leu Ser Ala Ser Val Ala Thr Ala Gly Ser Asp Leu Pro Gly 
    370                 375                 380                 


Ser Asp Phe Leu Phe Asp Met Pro Phe Asp Met Asp Phe Met Ser Tyr 
385                 390                 395                 400 


Arg Asp Pro Val Ser Glu Thr Ala His Leu Asp Asp Phe Ser Leu Pro 
                405                 410                 415     


Glu Leu Thr Thr Glu Thr Ser Met Phe Asp Pro Leu Asp Pro His Ser 
            420                 425                 430         


Ser Ser Asp Val Ile Ser Gly Lys Pro Leu Ser Thr Met Gly Ala Thr 
        435                 440                 445             


His Ser Gly Val Asn Asn Gly Gln Gly Ser Gly Ala Pro Glu Val Lys 
    450                 455                 460                 


Lys Glu Glu Asp Glu Asp Leu Leu Met Phe Ser Lys Pro Lys Thr Leu 
465                 470                 475                 480 


Met Asn Cys Thr Ala Val Trp Asp Arg Ile Thr Ser His Pro Lys Phe 
                485                 490                 495     


Gly Asp Ile Asp Ile Glu Gly Leu Cys Ser Glu Leu Arg Asn Lys Ala 
            500                 505                 510         


Lys Cys Ser Glu Ser Gly Val Val Leu Thr Glu Leu Asp Val Asp Gly 
        515                 520                 525             


Val Leu Ser Thr Phe Gln 
    530                 


<210>  5
<211>  7412
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pYRH60

<400>  5
cgacgtcggg cccaattcgc cctatagtga gtcgtattac aattcactgg ccgtcgtttt       60

acaacgtcgt gactgggaaa accctggcgt tacccaactt aatcgccttg cagcacatcc      120

ccctttcgcc agctggcgta atagcgaaga ggcccgcacc gatcgccctt cccaacagtt      180

gcgcagcctg aatggcgaat ggacgcgccc tgtagcggcg cattaagcgc ggcgggtgtg      240

gtggttacgc gcagcgtgac cgctacactt gccagcgccc tagcgcccgc tcctttcgct      300

ttcttccctt cctttctcgc cacgttcgcc ggctttcccc gtcaagctct aaatcggggg      360

ctccctttag ggttccgatt tagtgcttta cggcacctcg accccaaaaa acttgattag      420

ggtgatggtt cacgtagtgg gccatcgccc tgatagacgg tttttcgccc tttgacgttg      480

gagtccacgt tctttaatag tggactcttg ttccaaactg gaacaacact caaccctatc      540

tcggtctatt cttttgattt ataagggatt ttgccgattt cggcctattg gttaaaaaat      600

gagctgattt aacaaaaatt taacgcgaat tttaacaaaa tattaacgct tacaatttcc      660

tgatgcggta ttttctcctt acgcatctgt gcggtatttc acaccgcatc aggtggcact      720

tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg      780

tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt      840

atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct      900

gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca      960

cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc     1020

gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc     1080

cgtattgacg ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg     1140

gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta     1200

tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc     1260

ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt aactcgcctt     1320

gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga caccacgatg     1380

cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactctagct     1440

tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc     1500

tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct     1560

cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac     1620

acgacgggga gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc     1680

tcactgatta agcattggta actgtcagac caagtttact catatatact ttagattgat     1740

ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg     1800

accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt agaaaagatc     1860

aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa     1920

ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct ttttccgaag     1980

gtaactggct tcagcagagc gcagatacca aatactgttc ttctagtgta gccgtagtta     2040

ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct aatcctgtta     2100

ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagacgatag     2160

ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg     2220

gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga aagcgccacg     2280

cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg aacaggagag     2340

cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc     2400

cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag cctatggaaa     2460

aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt tgctcacatg     2520

ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt tgagtgagct     2580

gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga ggaagcggaa     2640

gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc cgattcatta atgcagctgg     2700

cgcgcccttg tttcgttagt gtagtgtgtg tgggaggaga gtgtgtgtgc gggagtgcaa     2760

gtggaggtga aatgttgtga aaggttgtga aatgatgtgt aaagggagga taggacgggc     2820

ggaaaagacc gcaagctgta tcattttgaa gctctcgggc ccgcgaagct gtttgcggca     2880

ttaatgtctc cattcgagct cttttggcgg actccaggtg tcgtttctct ccaactacaa     2940

gtactcatac agtagccgca gccgtaaaga cctcagccac tgactcaaca ccgcggttgc     3000

ttctggaacg gtttgaaagc taaaacatct ttaggtgtca gattttggga gggtttcaga     3060

tggtgcggat tgtgcaaagt ggcagaaaag agggcgcagg aggcggattt ttgcgctttt     3120

gaagacacat atgggttttc cgagccctcg aaaccatctc tggccgtttt ccccgtcaaa     3180

aacccccgca tttcacctcc atcgtcgctt ctgctgaagt caccaggtac tcccgcaaat     3240

aagcttcatt cgccactcaa accgtcctgc cttgagataa aagtgcaacg ttgtccacca     3300

acgaaccctg acaagccgct aatcactgta cgacgaactt gaacgaccca gtcgacgatt     3360

tcaacgtaca aagttcctcc gagagtgaca cagaccgacg aacgatcgca cacagaccga     3420

cagcgaccac tcagacagtc cagacatcag acatcagact gaacacaacc aacaagcatt     3480

gaacactgcc cttccaccaa gttcgacacg cagacacaga accgctccaa ccgacacaga     3540

accgctccaa ccgacacaga accactccaa ccgacacaga acctttccaa ccgacacaga     3600

accgttccaa ccgacgcact actgtttctt gtgtctacac gtacgttgat caagcttgtg     3660

agcggataac aatttcacac aggaaacagc tatgaccatg attacgccaa gctcgaaatt     3720

aaccctcact aaagggaaca aaagctggag ctccaccgcg gacacaatat ctggtcaaat     3780

ttcagtttcg ttacatttaa acggtaggtt agtgcttggt atatgagttg taggcatgac     3840

aatttggaaa ggggtggact ttgggaatat tgtgggattt caatacctta gtttgtacag     3900

ggtaattgtt acaaatgata caaagaactg tatttctttt catttgtttt aattggttgt     3960

atatcaagtc cgttagacga gctcagtgcc ttggcttttg gcactgtatt tcatttttag     4020

aggtacacta cattcagtga ggtatggtaa ggttgagggc ataatgaagg caccttgtac     4080

tgacagtcac agacctctca ccgagaattt tatgagatat actcgggttc attttaggct     4140

catcgatacg ctctcatcaa gaatacttct tgagaaccgt ggagaccggg gttcgattcc     4200

ccgtatcgga gtgtttattt tttgctcaac cataccctgg ggtgtgttct gtggagcatt     4260

ctcacttttg gtaaacgaca ttgcttcaag tgcagcggaa tcaaaaagta taaagtgggc     4320

agcgagtata cctgtacaga ctgtaggcga taactcaatc caattacccc ccacaacatg     4380

actggccaaa ctgatctcaa gactttattg aaatcagcaa caccgattct caatgaaggc     4440

acatacttct tctgcaacat tcacttgacg cctaaagttg gtgagaaatg gaccgacaag     4500

acatattctg ctatccacgg actgttgcct gtgtcggtgg ctacaatacg tgagtcagaa     4560

gggctgacgg tggtggttcc caaggaaaag gtcgacgagt atctgtctga ctcgtcattg     4620

ccgcctttgg agtacgactc caactatgag tgtgcttgga tcactttgac gatacattct     4680

tcgttggagg ctgtgggtct gacagctgcg ttttcggcgc ggttggccga caacaatatc     4740

agctgcaacg tcattgctgg ctttcatcat gatcacattt ttgtcggcaa aggcgacgcc     4800

cagagagcca ttgacgttct ttctaatttg gaccgatagc cgtatagtcc agtctatcta     4860

taagttcaac taactcgtaa ctattaccat aacatatact tcactgcccc agataaggtt     4920

ccgataaaaa gttctgcaga ctaaatttat ttcagtctcc tcttcaccac caaaatgccc     4980

tcctacgaag ctcgagctaa cgtccacaag tccgcctttg ccgctcgagt gctcaagctc     5040

gtggcagcca agaaaaccaa cctgtgtgct tctctggatg ttaccaccac caaggagctc     5100

attgagcttg ccgataaggt cggaccttat gtgtgcatga tcaaaaccca tatcgacatc     5160

attgacgact tcacctacgc cggcactgtg ctccccctca aggaacttgc tcttaagcac     5220

ggtttcttcc tgttcgagga cagaaagttc gcagatattg gcaacactgt caagcaccag     5280

taccggtgtc accgaatcgc cgagtggtcc gatatcacca acgcccacgg tgtacccgga     5340

accggaatca ttgctggcct gcgagctggt gccgaggaaa ctgtctctga acagaagaag     5400

gaggacgtct ctgactacga gaactcccag tacaaggagt tcctagtccc ctctcccaac     5460

gagaagctgg ccagaggtct gctcatgctg gccgagctgt cttgcaaggg ctctctggcc     5520

actggcgagt actccaagca gaccattgag cttgcccgat ccgaccccga gtttgtggtt     5580

ggcttcattg cccagaaccg acctaagggc gactctgagg actggcttat tctgaccccc     5640

ggggtgggtc ttgacgacaa gggagacgct ctcggacagc agtaccgaac tgttgaggat     5700

gtcatgtcta ccggaacgga tatcataatt gtcggccgag gtctgtacgg ccagaaccga     5760

gatcctattg aggaggccaa gcgataccag aaggctggct gggaggctta ccagaagatt     5820

aactgttaga ggttagacta tggatatgta atttaactgt gtatatagag agcgtgcaag     5880

tatggagcgc ttgttcagct tgtatgatgg tcagacgacc tgtctgatcg agtatgtatg     5940

atactgcaca acctgtgtat ccgcatgatc tgtccaatgg ggcatgttgt tgtgtttctc     6000

gatacggaga tgctgggtac agtgctaata cgttgaacta cttatactta tatgaggctc     6060

gaagaaagct gacttgtgta tgacttattc tcaactacat ccccagtcac aataccacca     6120

ctgcactacc actacaccaa aaccatgatc aaaccaccca tggacttcct ggaggcagaa     6180

gaacttgtta tggaaaagct caagagagag aattcaagat actatcaaga catgtgtcgc     6240

aacttaatta aagtagagag catcccaaac aagcagtcgc agtcgcactc atcgatatgc     6300

atatgtgcta cttaactgta cgagtactgt acagtacata cagtacctgt agtgattcac     6360

attcagtcat acagtgcagg agtacttccg cttgtctcac aggctttgtc catgtgccaa     6420

tgagtcagac agacacttgt gcatgaggca gagcacacac atggcttcgt tcaatctgct     6480

gataggtcga cattctggga tctgctcagg ttgttcagat gaccaccttc tttttcaccc     6540

cctctccctg taccaccagg accgtttccg agacccacgt gaccctcaaa ccgtcgctct     6600

tgactttccc caggctctcc acctttgccg gctcaaagct cggcgtctgt ttatccctgt     6660

atccaatttt gcccacgctg gcatagagca gaatctccac ctgtctctcc acgacgtttc     6720

ttgacttgcc aaacttgact gattcagagt agaccccctg ggaggaatgg gaagagtttg     6780

cggagttacc gaacagcgaa gagaaggtgc ctccatgggt ttccatctgc caaacgacga     6840

cacgtgtttc tccgtcgaaa tcgggcccag ggacgctatt agaccctatt cccgtgagtc     6900

cagcaaccat ttttccatcc ggagaaaaga ccaggcccca caccggagca gcatgattgg     6960

aattgaattc ctgggggtca agataggcat actctgagcc gattctcaag tcgtagacaa     7020

tcagagaaca ggtgtcggtg accttaatat ccaggtctct gttgagttca gacagagaag     7080

atcttcgtcg agacacattc ttttcaatca tcacagcagt ggcagtagga taaatagcca     7140

cagccagacg ttgactgggc ttatggaacg tgacaatgta cggaaatgtc tgtgtgattt     7200

gagacagtag agctgtgacc ttggactgca gagaaacgcc tctctggagg gtcgagtgac     7260

gcagcaagtc cggattcagc attttgcaag cagtgtgcat cacaaacggc acaaacatgt     7320

ccatggagga ggattttcgg gtgtggctga agaagctgga aagcacatcg atagctgtga     7380

ttcgcacaac taacggcttg tcgaggtgca tg                                   7412


<210>  6
<211>  7966
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pYPS161

<400>  6
aaatgtaacg aaactgaaat ttgaccagat attgtgtccg cggtggagct ccagcttttg       60

ttccctttag tgagggttaa tttcgagctt ggcgtaatca tggtcatagc tgtttcctgt      120

gtgaaattgt tatccgctca caagcttcca cacaacgtac gttctggttg gctcggatga      180

tttctgcggc cccagcgtaa ggcaggcgtt ccgtccggat cggtttgggt cggatcggct      240

ttttgattgt cgtattgtcg ctcatgttgg acctggtgtg tagttgtagt gtcagatcag      300

attcaccagc gaatgcatgt gaacttcccc acattttgag ccgaggcaga tttgggttgc      360

ttagtaagca gacgtggcgt tgcaagtaga tgtggcaaat ggggacgaag attccgaggg      420

gatatcatag ttccaagggg atgtcatcat ttgccagctt tcgccgccac ttttgacgag      480

tttttgtggg tcaaataagt ttagttgaac ttttcaaatt tcagttggca ttttgttaat      540

agaaagggtg ccggtgctgg ggggttcatt cctcgggttg cagatatcct atctgtctta      600

ggggtatctc tttcaatcga caagatgtag ttgggtaaca attatttatt aatattctct      660

ccatccagta cagtactaac atcttgacat ctcagcacaa gtgcatcttc ccaagtgttt      720

gttggagagg ttgttgggta ttacttagga aacagaacac agtacgtgga gatcttggat      780

acatcgtaca tggaggttat ccataaaaaa gaccctccag gactagttac aatgccgtta      840

gatgaggaaa tccacaaccc tgattcacta tgaacatatt atcttccccc aaacttgcga      900

tatatggccc ttgatgatag ccttgatttt acccttgatg gtacctccac gaccaaccga      960

tctgctgttt gaagagatat tttcaaattt gaagtgctca gatctactaa acatgagtcc     1020

agtaattctt tccgtctttc cgatttccga tattcccttt tttagcccga cttttcactg     1080

ctcccatgtc aaacgattag gacttgggag acaatcccac tgtcaaaatc accccgatat     1140

tctctgtaaa acaagtactt cttccacgtg atcttcaaat acctcttcca cgtgaccttc     1200

aaatacctct tcaagtacct cttccacgcg accttcaaag tcccttcaaa tacccttctc     1260

aattctcccc ttctcctcca tagtccttct ctctgactaa gcttgagaat acatgacgct     1320

aagacgaaaa cacactagag accctgagag cctgaacatg catccactct gcagttgcgc     1380

acgtgcctac agcaactatc gggtccagtg ctggatctga cactgcgtct ccctatgaag     1440

aaactgataa acagatctgc actcataaca atgatctgag cgatgaaaac gtgacctcca     1500

cagccacaag tcataatcgg cgcgccagct gcattaatga atcggccaac gcgcggggag     1560

aggcggtttg cgtattgggc gctcttccgc ttcctcgctc actgactcgc tgcgctcggt     1620

cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg gtaatacggt tatccacaga     1680

atcaggggat aacgcaggaa agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg     1740

taaaaaggcc gcgttgctgg cgtttttcca taggctccgc ccccctgacg agcatcacaa     1800

aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga ctataaagat accaggcgtt     1860

tccccctgga agctccctcg tgcgctctcc tgttccgacc ctgccgctta ccggatacct     1920

gtccgccttt ctcccttcgg gaagcgtggc gctttctcat agctcacgct gtaggtatct     1980

cagttcggtg taggtcgttc gctccaagct gggctgtgtg cacgaacccc ccgttcagcc     2040

cgaccgctgc gccttatccg gtaactatcg tcttgagtcc aacccggtaa gacacgactt     2100

atcgccactg gcagcagcca ctggtaacag gattagcaga gcgaggtatg taggcggtgc     2160

tacagagttc ttgaagtggt ggcctaacta cggctacact agaagaacag tatttggtat     2220

ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa     2280

acaaaccacc gctggtagcg gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa     2340

aaaaggatct caagaagatc ctttgatctt ttctacgggg tctgacgctc agtggaacga     2400

aaactcacgt taagggattt tggtcatgag attatcaaaa aggatcttca cctagatcct     2460

tttaaattaa aaatgaagtt ttaaatcaat ctaaagtata tatgagtaaa cttggtctga     2520

cagttaccaa tgcttaatca gtgaggcacc tatctcagcg atctgtctat ttcgttcatc     2580

catagttgcc tgactccccg tcgtgtagat aactacgata cgggagggct taccatctgg     2640

ccccagtgct gcaatgatac cgcgagaccc acgctcaccg gctccagatt tatcagcaat     2700

aaaccagcca gccggaaggg ccgagcgcag aagtggtcct gcaactttat ccgcctccat     2760

ccagtctatt aattgttgcc gggaagctag agtaagtagt tcgccagtta atagtttgcg     2820

caacgttgtt gccattgcta caggcatcgt ggtgtcacgc tcgtcgtttg gtatggcttc     2880

attcagctcc ggttcccaac gatcaaggcg agttacatga tcccccatgt tgtgcaaaaa     2940

agcggttagc tccttcggtc ctccgatcgt tgtcagaagt aagttggccg cagtgttatc     3000

actcatggtt atggcagcac tgcataattc tcttactgtc atgccatccg taagatgctt     3060

ttctgtgact ggtgagtact caaccaagtc attctgagaa tagtgtatgc ggcgaccgag     3120

ttgctcttgc ccggcgtcaa tacgggataa taccgcgcca catagcagaa ctttaaaagt     3180

gctcatcatt ggaaaacgtt cttcggggcg aaaactctca aggatcttac cgctgttgag     3240

atccagttcg atgtaaccca ctcgtgcacc caactgatct tcagcatctt ttactttcac     3300

cagcgtttct gggtgagcaa aaacaggaag gcaaaatgcc gcaaaaaagg gaataagggc     3360

gacacggaaa tgttgaatac tcatactctt cctttttcaa tattattgaa gcatttatca     3420

gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata aacaaatagg     3480

ggttccgcgc acatttcccc gaaaagtgcc acctgatgcg gtgtgaaata ccgcacagat     3540

gcgtaaggag aaaataccgc atcaggaaat tgtaagcgtt aatattttgt taaaattcgc     3600

gttaaatttt tgttaaatca gctcattttt taaccaatag gccgaaatcg gcaaaatccc     3660

ttataaatca aaagaataga ccgagatagg gttgagtgtt gttccagttt ggaacaagag     3720

tccactatta aagaacgtgg actccaacgt caaagggcga aaaaccgtct atcagggcga     3780

tggcccacta cgtgaaccat caccctaatc aagttttttg gggtcgaggt gccgtaaagc     3840

actaaatcgg aaccctaaag ggagcccccg atttagagct tgacggggaa agccggcgaa     3900

cgtggcgaga aaggaaggga agaaagcgaa aggagcgggc gctagggcgc tggcaagtgt     3960

agcggtcacg ctgcgcgtaa ccaccacacc cgccgcgctt aatgcgccgc tacagggcgc     4020

gtccattcgc cattcaggct gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg     4080

ctattacgcc agctggcgaa agggggatgt gctgcaaggc gattaagttg ggtaacgcca     4140

gggttttccc agtcacgacg ttgtaaaacg acggccagtg aattgtaata cgactcacta     4200

tagggcgaat tgggcccgac gtcgcatgca actattagtg aggcttcggg agtggttgtc     4260

tcggttgtct cattcagact cgttgtgttg tatctatatc tatataaaca ctcttgtccc     4320

tcaatcccac tgccatcttt tgctaaactt gccgccaata tgaaactcat ctccctcatc     4380

accgtcgcta ccaccgctct ggcggctgtc ggagacaagt acaagctgac ctataccaga     4440

tcagacgccc aatcggtcga atctctgccc gtcacctacc aagatgacct gatcaccgcc     4500

tccaccgacg gcgaacccat caccatcacc gagggcgagg gcaacacctt ctctgttaac     4560

gacatgccca tcgcctatct ggagctgcag gctttgttct ggaccggcga ctacggctac     4620

aagctccagg gctcggtctt tgacattgcc gccgatggaa cctttgagct gagagacggc     4680

cccaaggagt actactattg cactcctcac cctgagcgaa acgtcatcta cgtcatcaac     4740

agccccgact actccaagtg tcggttcaag cgtaccatca agttccacgc tgaaaagatc     4800

taagtggtaa tcgaccgact aaccattttt agctgacaaa cacttgctaa ctcctataac     4860

gaatgaatga ctaacttggc atattgttac caagtattac ttgggatata gttgagtgta     4920

accattgcta agaatccaaa ctggagcttc taaaggtctg ggagtcgccg tatgtgttca     4980

tatcgaaatc aaagaaatca taatcgcaac agaattcaaa atcaagcaga ttaatatcca     5040

ttattgtact cggatcgtga catatctgat atgatctcgg atatgatctc tgactgttta     5100

ctgggagatt tgttgaagat ttgttgaggt tatctgaaaa gtagacaata gagacaaaat     5160

gacgatatca agaactgaat cgggccgaaa tactcggtat cattcccttc agcagtaact     5220

gtattgctct atcaatgcga cgagatacct ccacaattaa tactgtatac gctctaccac     5280

tcatatctcc aatgctaaaa tatattcatg cccaggacct ctgtgcactg ctatgcagca     5340

cagtgttgtc gattgaattg gtcgtgtctg gtccctgatg ctctgtgtct cgctgactag     5400

tccttccatc cagacctcgt cattatctga taggcaacaa gttctgctct ctcacaccct     5460

gccgacacaa gggacactcg ggcttctctc tcacccattc ggaaatacag tccttaatta     5520

agttgcgaca catgtcttga tagtatcttg aattctctct cttgagcttt tccataacaa     5580

gttcttctgc ctccaggaag tccatgggtg gtttgatcat ggttttggtg tagtggtagt     5640

gcagtggtgg tattgtgact ggggatgtag ttgagaataa gtcatacaca agtcagcttt     5700

cttcgagcct catataagta taagtagttc aacgtattag cactgtaccc agcatctccg     5760

tatcgagaaa cacaacaaca tgccccattg gacagatcat gcggatacac aggttgtgca     5820

gtatcataca tactcgatca gacaggtcgt ctgaccatca tacaagctga acaagcgctc     5880

catacttgca cgctctctat atacacagtt aaattacata tccatagtct aacctctaac     5940

agttaatctt ctggtaagcc tcccagccag ccttctggta tcgcttggcc tcctcaatag     6000

gatctcggtt ctggccgtac agacctcggc cgacaattat gatatccgtt ccggtagaca     6060

tgacatcctc aacagttcgg tactgctgtc cgagagcgtc tcccttgtcg tcaagaccca     6120

ccccgggggt cagaataagc cagtcctcag agtcgccctt aggtcggttc tgggcaatga     6180

agccaaccac aaactcgggg tcggatcggg caagctcaat ggtctgcttg gagtactcgc     6240

cagtggccag agagcccttg caagacagct cggccagcat gagcagacct ctggccagct     6300

tctcgttggg agaggggact aggaactcct tgtactggga gttctcgtag tcagagacgt     6360

cctccttctt ctgttcagag acagtttcct cggcaccagc tcgcaggcca gcaatgattc     6420

cggttccggg tacaccgtgg gcgttggtga tatcggacca ctcggcgatt cggtgacacc     6480

ggtactggtg cttgacagtg ttgccaatat ctgcgaactt tctgtcctcg aacaggaaga     6540

aaccgtgctt aagagcaagt tccttgaggg ggagcacagt gccggcgtag gtgaagtcgt     6600

caatgatgtc gatatgggtt ttgatcatgc acacataagg tccgacctta tcggcaagct     6660

caatgagctc cttggtggtg gtaacatcca gagaagcaca caggttggtt ttcttggctg     6720

ccacgagctt gagcactcga gcggcaaagg cggacttgtg gacgttagct cgagcttcgt     6780

aggagggcat tttggtggtg aagaggagac tgaaataaat ttagtctgca gaacttttta     6840

tcggaacctt atctggggca gtgaagtata tgttatggta atagttacga gttagttgaa     6900

cttatagata gactggacta tacggctatc ggtccaaatt agaaagaacg tcaatggctc     6960

tctgggcgtc gcctttgccg acaaaaatgt gatcatgatg aaagccagca atgacgttgc     7020

agctgatatt gttgtcggcc aaccgcgccg aaaacgcagc tgtcagaccc acagcctcca     7080

acgaagaatg tatcgtcaaa gtgatccaag cacactcata gttggagtcg tactccaaag     7140

gcggcaatga cgagtcagac agatactcgt cgaccttttc cttgggaacc accaccgtca     7200

gcccttctga ctcacgtatt gtagccaccg acacaggcaa cagtccgtgg atagcagaat     7260

atgtcttgtc ggtccatttc tcaccaactt taggcgtcaa gtgaatgttg cagaagaagt     7320

atgtgccttc attgagaatc ggtgttgctg atttcaataa agtcttgaga tcagtttggc     7380

cagtcatgtt gtggggggta attggattga gttatcgcct acagtctgta caggtatact     7440

cgctgcccac tttatacttt ttgattccgc tgcacttgaa gcaatgtcgt ttaccaaaag     7500

tgagaatgct ccacagaaca caccccaggg tatggttgag caaaaaataa acactccgat     7560

acggggaatc gaaccccggt ctccacggtt ctcaagaagt attcttgatg agagcgtatc     7620

gatgagccta aaatgaaccc gagtatatct cataaaattc tcggtgagag gtctgtgact     7680

gtcagtacaa ggtgccttca ttatgccctc aaccttacca tacctcactg aatgtagtgt     7740

acctctaaaa atgaaataca gtgccaaaag ccaaggcact gagctcgtct aacggacttg     7800

atatacaacc aattaaaaca aatgaaaaga aatacagttc tttgtatcat ttgtaacaat     7860

taccctgtac aaactaaggt attgaaatcc cacaatattc ccaaagtcca cccctttcca     7920

aattgtcatg cctacaactc atataccaag cactaaccta ccgttt                    7966


<210>  7
<211>  940
<212>  DNA
<213>  Yarrowia lipolytica

<400>  7
cgcgcccttg tttcgttagt gtagtgtgtg tgggaggaga gtgtgtgtgc gggagtgcaa       60

gtggaggtga aatgttgtga aaggttgtga aatgatgtgt aaagggagga taggacgggc      120

ggaaaagacc gcaagctgta tcattttgaa gctctcgggc ccgcgaagct gtttgcggca      180

ttaatgtctc cattcgagct cttttggcgg actccaggtg tcgtttctct ccaactacaa      240

gtactcatac agtagccgca gccgtaaaga cctcagccac tgactcaaca ccgcggttgc      300

ttctggaacg gtttgaaagc taaaacatct ttaggtgtca gattttggga gggtttcaga      360

tggtgcggat tgtgcaaagt ggcagaaaag agggcgcagg aggcggattt ttgcgctttt      420

gaagacacat atgggttttc cgagccctcg aaaccatctc tggccgtttt ccccgtcaaa      480

aacccccgca tttcacctcc atcgtcgctt ctgctgaagt caccaggtac tcccgcaaat      540

aagcttcatt cgccactcaa accgtcctgc cttgagataa aagtgcaacg ttgtccacca      600

acgaaccctg acaagccgct aatcactgta cgacgaactt gaacgaccca gtcgacgatt      660

tcaacgtaca aagttcctcc gagagtgaca cagaccgacg aacgatcgca cacagaccga      720

cagcgaccac tcagacagtc cagacatcag acatcagact gaacacaacc aacaagcatt      780

gaacactgcc cttccaccaa gttcgacacg cagacacaga accgctccaa ccgacacaga      840

accgctccaa ccgacacaga accactccaa ccgacacaga acctttccaa ccgacacaga      900

accgttccaa ccgacgcact actgtttctt gtgtctacac                            940


<210>  8
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer Yl-EF-1214F

<400>  8
ccaagcccat gtgtgttgag                                                   20


<210>  9
<211>  18
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer Yl-EF-1270R

<400>  9
cggcgaatcg accaagag                                                     18


<210>  10
<211>  19
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer YAP1-346F

<400>  10
ccaaacgcaa ggctcagaa                                                    19


<210>  11
<211>  23
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer YAP1-409R

<400>  11
agatgctttt cctttcgctc tct                                               23


<210>  12
<211>  16
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer YL-EF-MGB-1235T

<400>  12
ccttcactga gtaccc                                                       16


<210>  13
<211>  18
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer YAP1-366T

<400>  13
cgggctgccc aaagggcc                                                     18


<210>  14
<211>  8043
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pYRH61

<400>  14
ctagtatgta ctcagactac aacattcctg gtgccatgcc ggcgtccatg gccatgcctc       60

cgttcaaaca ggagtttgac tacgcccaat acgaccttaa ccagcccctg cccccgcagc      120

agcaacaaca gcctatcgac ctgacccctg gagggcccct ccccgtctcg gattactcga      180

cgtcgtcata caccctggac aacgactcac agaagcgaaa aatgtccccg ggagagtcca      240

ccagtgacgg aggcgccgac gacgagtctc cagaaggaga tgacggtgag gccgacccca      300

agaagccccg aaagcccggc cgaaagcccg aaaccaccat ccccgcgtcc aaacgcaagg      360

ctcagaaccg ggctgcccaa agggccttca gagagcgaaa ggaaaagcat ctgcgcgacc      420

tggaaaccaa aatatctcag ctcgagggcg agacggcagc caaaaactcg gaaaacgagt      480

tcctgcgctt ccaggtccag cggcttcaga acgagctcaa gctttaccgt gagaagcctg      540

ccggcacttc gggagcctct ggagtctctg gagccggagc acccgcttca aacgtgcatt      600

cggctcccat cccggagatg tcgtccaaac cgttcacgtt cgagttcccc tcgtacaacg      660

tgcccaagcc gaccgatgtg gagcgagagg cacgcgagca actgcaacga gagcagatcc      720

gaggctactt gcagcgcaag ccctcatctg tggcctccga caccacttct cctgcatctc      780

aaacctcgtg caaccagtct ccctgcacca acccctcggc atacacttcg ccccagagcc      840

agagtggaag tgtgagccag cagaagcccc tgttgggtgc taccatcgct gccatgaacg      900

gcaagcccga cccccatgct gttgactttt gtgctgagct ctccaaggcc tgtgtaaaca      960

aggccgagct gctgcagcga tccgccacag ccagtgcatc tcccacaacc tccaacacgg     1020

tggtaccgtc cgcagctgca ccgggtagca ctcagcagtc ggcaggccag ccctctgtat     1080

ccactcctac ctcctccaca actgcccctc ctcaattgtc tgcatctgtc gctacagccg     1140

gctctgatct tcccggatcg gacttcctgt ttgacatgcc cttcgacatg gactttatgt     1200

cgtaccgaga ccccgtttcc gagacggcac atctggacga cttttcgctg cccgagctca     1260

cgacagaaac atccatgttt gatcctctgg acccccattc cagcagcgac gttatttctg     1320

gcaagcctct gtctaccatg ggcgctacac acagtggtgt caacaacgga cagggaagtg     1380

gtgctcccga agtcaagaag gaggaggatg aggacctgct catgttctcc aagcccaaga     1440

cgctcatgaa ctgcaccgct gtgtgggacc gtatcacgtc gcatcccaag tttggcgata     1500

tcgacatcga gggcctgtgt tcggagctgc gaaacaaggc aaagtgcagt gagagtggcg     1560

tcgtgttgac ggagttggac gtggatggtg tcctgtcaac gttccagtaa gcggccgcgt     1620

taattcaaat taattgatat agttttttaa tgagtattga atctgtttag aaataatgga     1680

atattatttt tatttattta tttatattat tggtcggctc ttttcttctg aaggtcaatg     1740

acaaaatgat atgaaggaaa taatgatttc taaaatttta caacgtaaga tatttttaca     1800

aaagcctagc tcatcttttg tcatgcacta ttttactcac gcttgaaatt aacggccagt     1860

ccactgcgga gtcatttcaa agtcatccta atcgatctat cgtttttgat agctcatttt     1920

ggagttcgcg attgtcttct gttattcaca actgttttaa tttttatttc attctggaac     1980

tcttcgagtt ctttgtaaag tctttcatag tagcttactt tatcctccaa catatttaac     2040

ttcatgtcaa tttcggctct taaattttcc acatcatcaa gttcaacatc atcttttaac     2100

ttgaatttat tctctagctc ttccaaccaa gcctcattgc tccttgattt actggtgaaa     2160

agtgatacac tttgcgcgca atccaggtca aaactttcct gcaaagaatt caccaatttc     2220

tcgacatcat agtacaattt gttttgttct cccatcacaa tttaatatac ctgatggatt     2280

cttatgaagc gctgggtaat ggacgtgtca ctctacttcg cctttttccc tactcctttt     2340

agtacggaag acaatgctaa taaataagag ggtaataata atattattaa tcggcaaaaa     2400

agattaaacg ccaagcgttt aattatcaga aagcaaacgt cgtaccaatc cttgaatgct     2460

tcccaattgt atattaagag tcatcacagc aacatattct tgttattaaa ttaattatta     2520

ttgatttttg atattgtata aaaaaaccaa atatgtataa aaaaagtgaa taaaaaatac     2580

caagtatgga gaaatatatt agaagtctat acgttaaacc accgcggtgg agctccaatt     2640

cgccctatag tgagtcgtat tacaattcac tggccgtcgt tttacaacgt cgtgactggg     2700

aaaaccctgg cgttacccaa cttaatcgcc ttgcagcaca tccccccttc gccagctggc     2760

gtaatagcga agaggcccgc accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg     2820

aatggcgcga cgcgccctgt agcggcgcat taagcgcggc gggtgtggtg gttacgcgca     2880

gcgtgaccgc tacacttgcc agcgccctag cgcccgctcc tttcgctttc ttcccttcct     2940

ttctcgccac gttcgccggc tttccccgtc aagctctaaa tcgggggctc cctttagggt     3000

tccgatttag tgctttacgg cacctcgacc ccaaaaaact tgattagggt gatggttcac     3060

gtagtgggcc atcgccctga tagacggttt ttcgcccttt gacgttggag tccacgttct     3120

ttaatagtgg actcttgttc caaactggaa caacactcaa ccctatctcg gtctattctt     3180

ttgatttata agggattttg ccgatttcgg cctattggtt aaaaaatgag ctgatttaac     3240

aaaaatttaa cgcgaatttt aacaaaatat taacgtttac aatttcctga tgcggtattt     3300

tctccttacg catctgtgcg gtatttcaca ccgcagggta ataactgata taattaaatt     3360

gaagctctaa tttgtgagtt tagtatacat gcatttactt ataatacagt tttttagttt     3420

tgctggccgc atcttctcaa atatgcttcc cagcctgctt ttctgtaacg ttcaccctct     3480

accttagcat cccttccctt tgcaaatagt cctcttccaa caataataat gtcagatcct     3540

gtagagacca catcatccac ggttctatac tgttgaccca atgcgtctcc cttgtcatct     3600

aaacccacac cgggtgtcat aatcaaccaa tcgtaacctt catctcttcc acccatgtct     3660

ctttgagcaa taaagccgat aacaaaatct ttgtcgctct tcgcaatgtc aacagtaccc     3720

ttagtatatt ctccagtaga tagggagccc ttgcatgaca attctgctaa catcaaaagg     3780

cctctaggtt cctttgttac ttcttctgcc gcctgcttca aaccgctaac aatacctggg     3840

cccaccacac cgtgtgcatt cgtaatgtct gcccattctg ctattctgta tacacccgca     3900

gagtactgca atttgactgt attaccaatg tcagcaaatt ttctgtcttc gaagagtaaa     3960

aaattgtact tggcggataa tgcctttagc ggcttaactg tgccctccat ggaaaaatca     4020

gtcaagatat ccacatgtgt ttttagtaaa caaattttgg gacctaatgc ttcaactaac     4080

tccagtaatt ccttggtggt acgaacatcc aatgaagcac acaagtttgt ttgcttttcg     4140

tgcatgatat taaatagctt ggcagcaaca ggactaggat gagtagcagc acgttcctta     4200

tatgtagctt tcgacatgat ttatcttcgt ttcctgcagg tttttgttct gtgcagttgg     4260

gttaagaata ctgggcaatt tcatgtttct tcaacactac atatgcgtat atataccaat     4320

ctaagtctgt gctccttcct tcgttcttcc ttctgttcgg agattaccga atcaaaaaaa     4380

tttcaaagaa accgaaatca aaaaaaagaa taaaaaaaaa atgatgaatt gaattgaaaa     4440

gcgtggtgca ctctcagtac aatctgctct gatgccgcat agttaagcca gccccgacac     4500

ccgccaacac ccgctgacgc gccctgacgg gcttgtctgc tcccggcatc cgcttacaga     4560

caagctgtga ccgtctccgg gagctgcatg tgtcagaggt tttcaccgtc atcaccgaaa     4620

cgcgcgagac gaaagggcct cgtgatacgc ctatttttat aggttaatgt catgataata     4680

atggtttctt aggacggatc gcttgcctgt aacttacacg cgcctcgtat cttttaatga     4740

tggaataatt tgggaattta ctctgtgttt atttattttt atgttttgta tttggatttt     4800

agaaagtaaa taaagaaggt agaagagtta cggaatgaag aaaaaaaaat aaacaaaggt     4860

ttaaaaaatt tcaacaaaaa gcgtacttta catatatatt tattagacaa gaaaagcaga     4920

ttaaatagat atacattcga ttaacgataa gtaaaatgta aaatcacagg attttcgtgt     4980

gtggtcttct acacagacaa gatgaaacaa ttcggcatta atacctgaga gcaggaagag     5040

caagataaaa ggtagtattt gttggcgatc cccctagagt cttttacatc ttcggaaaac     5100

aaaaactatt ttttctttaa tttctttttt tactttctat ttttaattta tatatttata     5160

ttaaaaaatt taaattataa ttatttttat agcacgtgat gaaaaggacc caggtggcac     5220

ttttcgggga aatgtgcgcg gaacccctat ttgtttattt ttctaaatac attcaaatat     5280

gtatccgctc atgagacaat aaccctgata aatgcttcaa taatattgaa aaaggaagag     5340

tatgagtatt caacatttcc gtgtcgccct tattcccttt tttgcggcat tttgccttcc     5400

tgtttttgct cacccagaaa cgctggtgaa agtaaaagat gctgaagatc agttgggtgc     5460

acgagtgggt tacatcgaac tggatctcaa cagcggtaag atccttgaga gttttcgccc     5520

cgaagaacgt tttccaatga tgagcacttt taaagttctg ctatgtggcg cggtattatc     5580

ccgtattgac gccgggcaag agcaactcgg tcgccgcata cactattctc agaatgactt     5640

ggttgagtac tcaccagtca cagaaaagca tcttacggat ggcatgacag taagagaatt     5700

atgcagtgct gccataacca tgagtgataa cactgcggcc aacttacttc tgacaacgat     5760

cggaggaccg aaggagctaa ccgctttttt tcacaacatg ggggatcatg taactcgcct     5820

tgatcgttgg gaaccggagc tgaatgaagc cataccaaac gacgagcgtg acaccacgat     5880

gcctgtagca atggcaacaa cgttgcgcaa actattaact ggcgaactac ttactctagc     5940

ttcccggcaa caattaatag actggatgga ggcggataaa gttgcaggac cacttctgcg     6000

ctcggccctt ccggctggct ggtttattgc tgataaatct ggagccggtg agcgtgggtc     6060

tcgcggtatc attgcagcac tggggccaga tggtaagccc tcccgtatcg tagttatcta     6120

cacgacgggc agtcaggcaa ctatggatga acgaaataga cagatcgctg agataggtgc     6180

ctcactgatt aagcattggt aactgtcaga ccaagtttac tcatatatac tttagattga     6240

tttaaaactt catttttaat ttaaaaggat ctaggtgaag atcctttttg ataatctcat     6300

gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg tagaaaagat     6360

caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc aaacaaaaaa     6420

accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc tttttccgaa     6480

ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt agccgtagtt     6540

aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc taatcctgtt     6600

accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact caagacgata     6660

gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac agcccagctt     6720

ggagcgaacg acctacaccg aactgagata cctacagcgt gagcattgag aaagcgccac     6780

gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg gaacaggaga     6840

gcgcacgagg gagcttccag gggggaacgc ctggtatctt tatagtcctg tcgggtttcg     6900

ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggccga gcctatggaa     6960

aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt ttgctcacat     7020

gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct ttgagtgagc     7080

tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg aggaagcgga     7140

agagcgccca atacgcaaac cgcctctccc cgcgcgttgg ccgattcatt aatgcagctg     7200

gcacgacagg tttcccgact ggaaagcggg cagtgagcgc aacgcaatta atgtgagtta     7260

cctcactcat taggcacccc aggctttaca ctttatgctt ccggctccta tgttgtgtgg     7320

aattgtgagc ggataacaat ttcacacagg aaacagctat gaccatgatt acgccaagct     7380

cggaattaac cctcactaaa gggaacaaaa gctgggtacc gggccccccc tcgaggtcga     7440

cgcctacttg gcttcacata cgttgcatac gtcgatatag ataataatga taatgacagc     7500

aggattatcg taatacgtaa tagttgaaaa tctcaaaaat gtgtgggtca ttacgtaaat     7560

aatgatagga atgggattct tctatttttc ctttttccat tctagcagcc gtcgggaaaa     7620

cgtggcatcc tctctttcgg gctcaattgg agtcacgctg ccgtgagcat cctctctttc     7680

catatctaac aactgagcac gtaaccaatg gaaaagcatg agcttagcgt tgctccaaaa     7740

aagtattgga tggttaatac catttgtctg ttctcttctg actttgactc ctcaaaaaaa     7800

aaaaatctac aatcaacaga tcgcttcaat tacgccctca caaaaacttt tttccttctt     7860

cttcgcccac gttaaatttt atccctcatg ttgtctaacg gatttctgca cttgatttat     7920

tataaaaaga caaagacata atacttctct atcaatttca gttattgttc ttccttgcgt     7980

tattcttctg ttcttctttt tcttttgtca tatataacca taaccaagta atacatattc     8040

aaa                                                                   8043


<210>  15
<211>  48
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer Yl.Yap1-F-SpeI

<400>  15
atattcaaac tagtatgtac tcagactaca acattcctgg tgccatgc                    48


<210>  16
<211>  44
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer Yap1-R

<400>  16
gatcaagcgg ccgcttactg gaacgttgac aggacaccat ccac                        44


<210>  17
<211>  601
<212>  DNA
<213>  Saccharomyces cerevisiae

<400>  17
gcctacttgg cttcacatac gttgcatacg tcgatataga taataatgat aatgacagca       60

ggattatcgt aatacgtaat agttgaaaat ctcaaaaatg tgtgggtcat tacgtaaata      120

atgataggaa tgggattctt ctatttttcc tttttccatt ctagcagccg tcgggaaaac      180

gtggcatcct ctctttcggg ctcaattgga gtcacgctgc cgtgagcatc ctctctttcc      240

atatctaaca actgagcacg taaccaatgg aaaagcatga gcttagcgtt gctccaaaaa      300

agtattggat ggttaatacc atttgtctgt tctcttctga ctttgactcc tcaaaaaaaa      360

aaaatctaca atcaacagat cgcttcaatt acgccctcac aaaaactttt ttccttcttc      420

ttcgcccacg ttaaatttta tccctcatgt tgtctaacgg atttctgcac ttgatttatt      480

ataaaaagac aaagacataa tacttctcta tcaatttcag ttattgttct tccttgcgtt      540

attcttctgt tcttcttttt cttttgtcat atataaccat aaccaagtaa tacatattca      600

a                                                                      601


<210>  18
<211>  1022
<212>  DNA
<213>  Saccharomyces cerevisiae

<400>  18
ggccgcgtta attcaaatta attgatatag ttttttaatg agtattgaat ctgtttagaa       60

ataatggaat attattttta tttatttatt tatattattg gtcggctctt ttcttctgaa      120

ggtcaatgac aaaatgatat gaaggaaata atgatttcta aaattttaca acgtaagata      180

tttttacaaa agcctagctc atcttttgtc atgcactatt ttactcacgc ttgaaattaa      240

cggccagtcc actgcggagt catttcaaag tcatcctaat cgatctatcg tttttgatag      300

ctcattttgg agttcgcgat tgtcttctgt tattcacaac tgttttaatt tttatttcat      360

tctggaactc ttcgagttct ttgtaaagtc tttcatagta gcttacttta tcctccaaca      420

tatttaactt catgtcaatt tcggctctta aattttccac atcatcaagt tcaacatcat      480

cttttaactt gaatttattc tctagctctt ccaaccaagc ctcattgctc cttgatttac      540

tggtgaaaag tgatacactt tgcgcgcaat ccaggtcaaa actttcctgc aaagaattca      600

ccaatttctc gacatcatag tacaatttgt tttgttctcc catcacaatt taatatacct      660

gatggattct tatgaagcgc tgggtaatgg acgtgtcact ctacttcgcc tttttcccta      720

ctccttttag tacggaagac aatgctaata aataagaggg taataataat attattaatc      780

ggcaaaaaag attaaacgcc aagcgtttaa ttatcagaaa gcaaacgtcg taccaatcct      840

tgaatgcttc ccaattgtat attaagagtc atcacagcaa catattcttg ttattaaatt      900

aattattatt gatttttgat attgtataaa aaaaccaaat atgtataaaa aaagtgaata      960

aaaaatacca agtatggaga aatatattag aagtctatac gttaaaccac cgcggtggag     1020

ct                                                                    1022


<210>  19
<211>  4887
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pRS316

<400>  19
tcgcgcgttt cggtgatgac ggtgaaaacc tctgacacat gcagctcccg gagacggtca       60

cagcttgtct gtaagcggat gccgggagca gacaagcccg tcagggcgcg tcagcgggtg      120

ttggcgggtg tcggggctgg cttaactatg cggcatcaga gcagattgta ctgagagtgc      180

accacgcttt tcaattcaat tcatcatttt ttttttattc ttttttttga tttcggtttc      240

tttgaaattt ttttgattcg gtaatctccg aacagaagga agaacgaagg aaggagcaca      300

gacttagatt ggtatatata cgcatatgta gtgttgaaga aacatgaaat tgcccagtat      360

tcttaaccca actgcacaga acaaaaacct gcaggaaacg aagataaatc atgtcgaaag      420

ctacatataa ggaacgtgct gctactcatc ctagtcctgt tgctgccaag ctatttaata      480

tcatgcacga aaagcaaaca aacttgtgtg cttcattgga tgttcgtacc accaaggaat      540

tactggagtt agttgaagca ttaggtccca aaatttgttt actaaaaaca catgtggata      600

tcttgactga tttttccatg gagggcacag ttaagccgct aaaggcatta tccgccaagt      660

acaatttttt actcttcgaa gacagaaaat ttgctgacat tggtaataca gtcaaattgc      720

agtactctgc gggtgtatac agaatagcag aatgggcaga cattacgaat gcacacggtg      780

tggtgggccc aggtattgtt agcggtttga agcaggcggc agaagaagta acaaaggaac      840

ctagaggcct tttgatgtta gcagaattgt catgcaaggg ctccctatct actggagaat      900

atactaaggg tactgttgac attgcgaaga gcgacaaaga ttttgttatc ggctttattg      960

ctcaaagaga catgggtgga agagatgaag gttacgattg gttgattatg acacccggtg     1020

tgggtttaga tgacaaggga gacgcattgg gtcaacagta tagaaccgtg gatgatgtgg     1080

tctctacagg atctgacatt attattgttg gaagaggact atttgcaaag ggaagggatg     1140

ctaaggtaga gggtgaacgt tacagaaaag caggctggga agcatatttg agaagatgcg     1200

gccagcaaaa ctaaaaaact gtattataag taaatgcatg tatactaaac tcacaaatta     1260

gagcttcaat ttaattatat cagttattac cctgcggtgt gaaataccgc acagatgcgt     1320

aaggagaaaa taccgcatca ggaaattgta aacgttaata ttttgttaaa attcgcgtta     1380

aatttttgtt aaatcagctc attttttaac caataggccg aaatcggcaa aatcccttat     1440

aaatcaaaag aatagaccga gatagggttg agtgttgttc cagtttggaa caagagtcca     1500

ctattaaaga acgtggactc caacgtcaaa gggcgaaaaa ccgtctatca gggcgatggc     1560

ccactacgtg aaccatcacc ctaatcaagt tttttggggt cgaggtgccg taaagcacta     1620

aatcggaacc ctaaagggag cccccgattt agagcttgac ggggaaagcc ggcgaacgtg     1680

gcgagaaagg aagggaagaa agcgaaagga gcgggcgcta gggcgctggc aagtgtagcg     1740

gtcacgctgc gcgtaaccac cacacccgcc gcgcttaatg cgccgctaca gggcgcgtcg     1800

cgccattcgc cattcaggct gcgcaactgt tgggaagggc gatcggtgcg ggcctcttcg     1860

ctattacgcc agctggcgaa ggggggatgt gctgcaaggc gattaagttg ggtaacgcca     1920

gggttttccc agtcacgacg ttgtaaaacg acggccagtg aattgtaata cgactcacta     1980

tagggcgaat tggagctcca ccgcggtggc ggccgctcta gaactagtgg atcccccggg     2040

ctgcaggaat tcgatatcaa gcttatcgat accgtcgacc tcgagggggg gcccggtacc     2100

cagcttttgt tccctttagt gagggttaat tccgagcttg gcgtaatcat ggtcatagct     2160

gtttcctgtg tgaaattgtt atccgctcac aattccacac aacataggag ccggaagcat     2220

aaagtgtaaa gcctggggtg cctaatgagt gaggtaactc acattaattg cgttgcgctc     2280

actgcccgct ttccagtcgg gaaacctgtc gtgccagctg cattaatgaa tcggccaacg     2340

cgcggggaga ggcggtttgc gtattgggcg ctcttccgct tcctcgctca ctgactcgct     2400

gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg taatacggtt     2460

atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc agcaaaaggc     2520

caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctcggcc cccctgacga     2580

gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac tataaagata     2640

ccaggcgttc ccccctggaa gctccctcgt gcgctctcct gttccgaccc tgccgcttac     2700

cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcaat gctcacgctg     2760

taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc acgaaccccc     2820

cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca acccggtaag     2880

acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag cgaggtatgt     2940

aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta gaaggacagt     3000

atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg gtagctcttg     3060

atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc agcagattac     3120

gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt ctgacgctca     3180

gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa ggatcttcac     3240

ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat atgagtaaac     3300

ttggtctgac agttaccaat gcttaatcag tgaggcacct atctcagcga tctgtctatt     3360

tcgttcatcc atagttgcct gactgcccgt cgtgtagata actacgatac gggagggctt     3420

accatctggc cccagtgctg caatgatacc gcgagaccca cgctcaccgg ctccagattt     3480

atcagcaata aaccagccag ccggaagggc cgagcgcaga agtggtcctg caactttatc     3540

cgcctccatc cagtctatta attgttgccg ggaagctaga gtaagtagtt cgccagttaa     3600

tagtttgcgc aacgttgttg ccattgctac aggcatcgtg gtgtcacgct cgtcgtttgg     3660

tatggcttca ttcagctccg gttcccaacg atcaaggcga gttacatgat cccccatgtt     3720

gtgaaaaaaa gcggttagct ccttcggtcc tccgatcgtt gtcagaagta agttggccgc     3780

agtgttatca ctcatggtta tggcagcact gcataattct cttactgtca tgccatccgt     3840

aagatgcttt tctgtgactg gtgagtactc aaccaagtca ttctgagaat agtgtatgcg     3900

gcgaccgagt tgctcttgcc cggcgtcaat acgggataat accgcgccac atagcagaac     3960

tttaaaagtg ctcatcattg gaaaacgttc ttcggggcga aaactctcaa ggatcttacc     4020

gctgttgaga tccagttcga tgtaacccac tcgtgcaccc aactgatctt cagcatcttt     4080

tactttcacc agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg caaaaaaggg     4140

aataagggcg acacggaaat gttgaatact catactcttc ctttttcaat attattgaag     4200

catttatcag ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa     4260

acaaataggg gttccgcgca catttccccg aaaagtgcca cctgggtcct tttcatcacg     4320

tgctataaaa ataattataa tttaaatttt ttaatataaa tatataaatt aaaaatagaa     4380

agtaaaaaaa gaaattaaag aaaaaatagt ttttgttttc cgaagatgta aaagactcta     4440

gggggatcgc caacaaatac taccttttat cttgctcttc ctgctctcag gtattaatgc     4500

cgaattgttt catcttgtct gtgtagaaga ccacacacga aaatcctgtg attttacatt     4560

ttacttatcg ttaatcgaat gtatatctat ttaatctgct tttcttgtct aataaatata     4620

tatgtaaagt acgctttttg ttgaaatttt ttaaaccttt gtttattttt ttttcttcat     4680

tccgtaactc ttctaccttc tttatttact ttctaaaatc caaatacaaa acataaaaat     4740

aaataaacac agagtaaatt cccaaattat tccatcatta aaagatacga ggcgcgtgta     4800

agttacaggc aagcgatccg tcctaagaaa ccattattat catgacatta acctataaaa     4860

ataggcgtat cacgaggccc tttcgtc                                         4887


<210>  20
<211>  8597
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pYRH43

<400>  20
ggccgcaagt gtggatgggg aagtgagtgc ccggttctgt gtgcacaatt ggcaatccaa       60

gatggatgga ttcaacacag ggatatagcg agctacgtgg tggtgcgagg atatagcaac      120

ggatatttat gtttgacact tgagaatgta cgatacaagc actgtccaag tacaatacta      180

aacatactgt acatactcat actcgtaccc gggcaacggt ttcacttgag tgcagtggct      240

agtgctctta ctcgtacagt gtgcaatact gcgtatcata gtctttgatg tatatcgtat      300

tcattcatgt tagttgcgta cgagccggaa gcataaagtg taaagcctgg ggtgcctaat      360

gagtgagcta actcacatta attgcgttgc gctcactgcc cgctttccag tcgggaaacc      420

tgtcgtgcca gctgcattaa tgaatcggcc aacgcgcggg gagaggcggt ttgcgtattg      480

ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag      540

cggtatcagc tcactcaaag gcggtaatac ggttatccac agaatcaggg gataacgcag      600

gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc      660

tggcgttttt ccataggctc cgcccccctg acgagcatca caaaaatcga cgctcaagtc      720

agaggtggcg aaacccgaca ggactataaa gataccaggc gtttccccct ggaagctccc      780

tcgtgcgctc tcctgttccg accctgccgc ttaccggata cctgtccgcc tttctccctt      840

cgggaagcgt ggcgctttct catagctcac gctgtaggta tctcagttcg gtgtaggtcg      900

ttcgctccaa gctgggctgt gtgcacgaac cccccgttca gcccgaccgc tgcgccttat      960

ccggtaacta tcgtcttgag tccaacccgg taagacacga cttatcgcca ctggcagcag     1020

ccactggtaa caggattagc agagcgaggt atgtaggcgg tgctacagag ttcttgaagt     1080

ggtggcctaa ctacggctac actagaagga cagtatttgg tatctgcgct ctgctgaagc     1140

cagttacctt cggaaaaaga gttggtagct cttgatccgg caaacaaacc accgctggta     1200

gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag     1260

atcctttgat cttttctacg gggtctgacg ctcagtggaa cgaaaactca cgttaaggga     1320

ttttggtcat gagattatca aaaaggatct tcacctagat ccttttaaat taaaaatgaa     1380

gttttaaatc aatctaaagt atatatgagt aaacttggtc tgacagttac caatgcttaa     1440

tcagtgaggc acctatctca gcgatctgtc tatttcgttc atccatagtt gcctgactcc     1500

ccgtcgtgta gataactacg atacgggagg gcttaccatc tggccccagt gctgcaatga     1560

taccgcgaga cccacgctca ccggctccag atttatcagc aataaaccag ccagccggaa     1620

gggccgagcg cagaagtggt cctgcaactt tatccgcctc catccagtct attaattgtt     1680

gccgggaagc tagagtaagt agttcgccag ttaatagttt gcgcaacgtt gttgccattg     1740

ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc ttcattcagc tccggttccc     1800

aacgatcaag gcgagttaca tgatccccca tgttgtgcaa aaaagcggtt agctccttcg     1860

gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt atcactcatg gttatggcag     1920

cactgcataa ttctcttact gtcatgccat ccgtaagatg cttttctgtg actggtgagt     1980

actcaaccaa gtcattctga gaatagtgta tgcggcgacc gagttgctct tgcccggcgt     2040

caatacggga taataccgcg ccacatagca gaactttaaa agtgctcatc attggaaaac     2100

gttcttcggg gcgaaaactc tcaaggatct taccgctgtt gagatccagt tcgatgtaac     2160

ccactcgtgc acccaactga tcttcagcat cttttacttt caccagcgtt tctgggtgag     2220

caaaaacagg aaggcaaaat gccgcaaaaa agggaataag ggcgacacgg aaatgttgaa     2280

tactcatact cttccttttt caatattatt gaagcattta tcagggttat tgtctcatga     2340

gcggatacat atttgaatgt atttagaaaa ataaacaaat aggggttccg cgcacatttc     2400

cccgaaaagt gccacctgac gcgccctgta gcggcgcatt aagcgcggcg ggtgtggtgg     2460

ttacgcgcag cgtgaccgct acacttgcca gcgccctagc gcccgctcct ttcgctttct     2520

tcccttcctt tctcgccacg ttcgccggct ttccccgtca agctctaaat cgggggctcc     2580

ctttagggtt ccgatttagt gctttacggc acctcgaccc caaaaaactt gattagggtg     2640

atggttcacg tagtgggcca tcgccctgat agacggtttt tcgccctttg acgttggagt     2700

ccacgttctt taatagtgga ctcttgttcc aaactggaac aacactcaac cctatctcgg     2760

tctattcttt tgatttataa gggattttgc cgatttcggc ctattggtta aaaaatgagc     2820

tgatttaaca aaaatttaac gcgaatttta acaaaatatt aacgcttaca atttccattc     2880

gccattcagg ctgcgcaact gttgggaagg gcgatcggtg cgggcctctt cgctattacg     2940

ccagctggcg aaagggggat gtgctgcaag gcgattaagt tgggtaacgc cagggttttc     3000

ccagtcacga cgttgtaaaa cgacggccag tgaattgtaa tacgactcac tatagggcga     3060

attgggtacc gggccccccc tcgaggtcga tggtgtcgat aagcttgata tcgaattcat     3120

gtcacacaaa ccgatcttcg cctcaaggaa acctaattct acatccgaga gactgccgag     3180

atccagtcta cactgattaa ttttcgggcc aataatttaa aaaaatcgtg ttatataata     3240

ttatatgtat tatatatata catcatgatg atactgacag tcatgtccca ttgctaaata     3300

gacagactcc atctgccgcc tccaactgat gttctcaata tttaaggggt catctcgcat     3360

tgtttaataa taaacagact ccatctaccg cctccaaatg atgttctcaa aatatattgt     3420

atgaacttat ttttattact tagtattatt agacaactta cttgctttat gaaaaacact     3480

tcctatttag gaaacaattt ataatggcag ttcgttcatt taacaattta tgtagaataa     3540

atgttataaa tgcgtatggg aaatcttaaa tatggatagc ataaatgata tctgcattgc     3600

ctaattcgaa atcaacagca acgaaaaaaa tcccttgtac aacataaata gtcatcgaga     3660

aatatcaact atcaaagaac agctattcac acgttactat tgagattatt attggacgag     3720

aatcacacac tcaactgtct ttctctcttc tagaaataca ggtacaagta tgtactattc     3780

tcattgttca tacttctagt catttcatcc cacatattcc ttggatttct ctccaatgaa     3840

tgacattcta tcttgcaaat tcaacaatta taataagata taccaaagta gcggtatagt     3900

ggcaatcaaa aagcttctct ggtgtgcttc tcgtatttat ttttattcta atgatccatt     3960

aaaggtatat atttatttct tgttatataa tccttttgtt tattacatgg gctggataca     4020

taaaggtatt ttgatttaat tttttgctta aattcaatcc cccctcgttc agtgtcaact     4080

gtaatggtag gaaattacca tacttttgaa gaagcaaaaa aaatgaaaga aaaaaaaaat     4140

cgtatttcca ggttagacgt tccgcagaat ctagaatgcg gtatgcggta cattgttctt     4200

cgaacgtaaa agttgcgctc cctgagatat tgtacatttt tgcttttaca agtacaagta     4260

catcgtacaa ctatgtacta ctgttgatgc atccacaaca gtttgttttg tttttttttg     4320

tttttttttt ttctaatgat tcattaccgc tatgtatacc tacttgtact tgtagtaagc     4380

cgggttattg gcgttcaatt aatcatagac ttatgaatct gcacggtgtg cgctgcgagt     4440

tacttttagc ttatgcatgc tacttgggtg taatattggg atctgttcgg aaatcaacgg     4500

atgctcaatc gatttcgaca gtaattaatt aagtcataca caagtcagct ttcttcgagc     4560

ctcatataag tataagtagt tcaacgtatt agcactgtac ccagcatctc cgtatcgaga     4620

aacacaacaa catgccccat tggacagatc atgcggatac acaggttgtg cagtatcata     4680

catactcgat cagacaggtc gtctgaccat catacaagct gaacaagcgc tccatacttg     4740

cacgctctct atatacacag ttaaattaca tatccatagt ctaacctcta acagttaatc     4800

ttctggtaag cctcccagcc agccttctgg tatcgcttgg cctcctcaat aggatctcgg     4860

ttctggccgt acagacctcg gccgacaatt atgatatccg ttccggtaga catgacatcc     4920

tcaacagttc ggtactgctg tccgagagcg tctcccttgt cgtcaagacc caccccgggg     4980

gtcagaataa gccagtcctc agagtcgccc ttaggtcggt tctgggcaat gaagccaacc     5040

acaaactcgg ggtcggatcg ggcaagctca atggtctgct tggagtactc gccagtggcc     5100

agagagccct tgcaagacag ctcggccagc atgagcagac ctctggccag cttctcgttg     5160

ggagagggga ctaggaactc cttgtactgg gagttctcgt agtcagagac gtcctccttc     5220

ttctgttcag agacagtttc ctcggcacca gctcgcaggc cagcaatgat tccggttccg     5280

ggtacaccgt gggcgttggt gatatcggac cactcggcga ttcggtgaca ccggtactgg     5340

tgcttgacag tgttgccaat atctgcgaac tttctgtcct cgaacaggaa gaaaccgtgc     5400

ttaagagcaa gttccttgag ggggagcaca gtgccggcgt aggtgaagtc gtcaatgatg     5460

tcgatatggg ttttgatcat gcacacataa ggtccgacct tatcggcaag ctcaatgagc     5520

tccttggtgg tggtaacatc cagagaagca cacaggttgg ttttcttggc tgccacgagc     5580

ttgagcactc gagcggcaaa ggcggacttg tggacgttag ctcgagcttc gtaggagggc     5640

attttggtgg tgaagaggag actgaaataa atttagtctg cagaactttt tatcggaacc     5700

ttatctgggg cagtgaagta tatgttatgg taatagttac gagttagttg aacttataga     5760

tagactggac tatacggcta tcggtccaaa ttagaaagaa cgtcaatggc tctctgggcg     5820

tcgcctttgc cgacaaaaat gtgatcatga tgaaagccag caatgacgtt gcagctgata     5880

ttgttgtcgg ccaaccgcgc cgaaaacgca gctgtcagac ccacagcctc caacgaagaa     5940

tgtatcgtca aagtgatcca agcacactca tagttggagt cgtactccaa aggcggcaat     6000

gacgagtcag acagatactc gtcgacgttt aaacagtgta cgcagatcta ctatagagga     6060

acatttaaat tgccccggag aagacggcca ggccgcctag atgacaaatt caacaactca     6120

cagctgactt tctgccattg ccactagggg ggggcctttt tatatggcca agccaagctc     6180

tccacgtcgg ttgggctgca cccaacaata aatgggtagg gttgcaccaa caaagggatg     6240

ggatgggggg tagaagatac gaggataacg gggctcaatg gcacaaataa gaacgaatac     6300

tgccattaag actcgtgatc cagcgactga caccattgca tcatctaagg gcctcaaaac     6360

tacctcggaa ctgctgcgct gatctggaca ccacagaggt tccgagcact ttaggttgca     6420

ccaaatgtcc caccaggtgc aggcagaaaa cgctggaaca gcgtgtacag tttgtcttaa     6480

caaaaagtga gggcgctgag gtcgagcagg gtggtgtgac ttgttatagc ctttagagct     6540

gcgaaagcgc gtatggattt ggctcatcag gccagattga gggtctgtgg acacatgtca     6600

tgttagtgta cttcaatcgc cccctggata tagccccgac aataggccgt ggcctcattt     6660

ttttgccttc cgcacatttc cattgctcga tacccacacc ttgcttctcc tgcacttgcc     6720

aaccttaata ctggtttaca ttgaccaaca tcttacaagc ggggggcttg tctagggtat     6780

atataaacag tggctctccc aatcggttgc cagtctcttt tttcctttct ttccccacag     6840

attcgaaatc taaactacac atcacagaat tccgagccgt gagtatccac gacaagatca     6900

gtgtcgagac gacgcgtttt gtgtaatgac acaatccgaa agtcgctagc aacacacact     6960

ctctacacaa actaacccag ctctggtacc atgtactcag actacaacat tcctggtgcc     7020

atgccggcgt ccatggccat gcctccgttc aaacaggagt ttgactacgc ccaatacgac     7080

cttaaccagc ccctgccccc gcagcagcaa caacagccta tcgacctgac ccctggaggg     7140

cccctccccg tctcggatta ctcgacgtcg tcatacaccc tggacaacga ctcacagaag     7200

cgaaaaatgt ccccgggaga gtccaccagt gacggaggcg ccgacgacga gtctccagaa     7260

ggagatgacg gtgaggccga ccccaagaag ccccgaaagc ccggccgaaa gcccgaaacc     7320

accatccccg cgtccaaacg caaggctcag aaccgggctg cccaaagggc cttcagagag     7380

cgaaaggaaa agcatctgcg cgacctggaa accaaaatat ctcagctcga gggcgagacg     7440

gcagccaaaa actcggaaaa cgagttcctg cgcttccagg tccagcggct tcagaacgag     7500

ctcaagcttt accgtgagaa gcctgccggc acttcgggag cctctggagt ctctggagcc     7560

ggagcacccg cttcaaacgt gcattcggct cccatcccgg agatgtcgtc caaaccgttc     7620

acgttcgagt tcccctcgta caacgtgccc aagccgaccg atgtggagcg agaggcacgc     7680

gagcaactgc aacgagagca gatccgaggc tacttgcagc gcaagccctc atctgtggcc     7740

tccgacacca cttctcctgc atctcaaacc tcgtgcaacc agtctccctg caccaacccc     7800

tcggcataca cttcgcccca gagccagagt ggaagtgtga gccagcagaa gcccctgttg     7860

ggtgctacca tcgctgccat gaacggcaag cccgaccccc atgctgttga cttttgtgct     7920

gagctctcca aggcctgtgt aaacaaggcc gagctgctgc agcgatccgc cacagccagt     7980

gcatctccca caacctccaa cacggtggta ccgtccgcag ctgcaccggg tagcactcag     8040

cagtcggcag gccagccctc tgtatccact cctacctcct ccacaactgc ccctcctcaa     8100

ttgtctgcat ctgtcgctac agccggctct gatcttcccg gatcggactt cctgtttgac     8160

atgcccttcg acatggactt tatgtcgtac cgagaccccg tttccgagac ggcacatctg     8220

gacgactttt cgctgcccga gctcacgaca gaaacatcca tgtttgatcc tctggacccc     8280

cattccagca gcgacgttat ttctggcaag cctctgtcta ccatgggcgc tacacacagt     8340

ggtgtcaaca acggacaggg aagtggtgct cccgaagtca agaaggagga ggatgaggac     8400

ctgctcatgt tctccaagcc caagacgctc atgaactgca ccgctgtgtg ggaccgtatc     8460

acgtcgcatc ccaagtttgg cgatatcgac atcgagggcc tgtgttcgga gctgcgaaac     8520

aaggcaaagt gcagtgagag tggcgtcgtg ttgacggagt tggacgtgga tggtgtcctg     8580

tcaacgttcc agtaagc                                                    8597


<210>  21
<211>  42
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer Yap1-F

<400>  21
gatcaaacat gtactcagac tacaacattc ctggtgccat gc                          42


<210>  22
<211>  20
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer ef-324F

<400>  22
cgactgtgcc atcctcatca                                                   20


<210>  23
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer ef-392R

<400>  23
tgaccgtcct tggagatacc a                                                 21


<210>  24
<211>  21
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Primer ef-345T

<400>  24
tgctggtggt gttggtgagt t                                                 21


<210>  25
<211>  492
<212>  DNA
<213>  Saccharomyces cerevisiae


<220>
<221>  CDS
<222>  (1)..(492)
<223>  GenBank Accession No. NM_001179559

<400>  25
atg tca gaa ttc tat aag cta gca cct gtt gac aag aaa ggc caa cca         48
Met Ser Glu Phe Tyr Lys Leu Ala Pro Val Asp Lys Lys Gly Gln Pro           
1               5                   10                  15                

ttc ccc ttc gac caa tta aag gga aaa gtg gtg ctt atc gtt aat gtt         96
Phe Pro Phe Asp Gln Leu Lys Gly Lys Val Val Leu Ile Val Asn Val           
            20                  25                  30                    

gcc tcc aaa tgt gga ttc act cct caa tac aaa gaa cta gag gcc ttg        144
Ala Ser Lys Cys Gly Phe Thr Pro Gln Tyr Lys Glu Leu Glu Ala Leu           
        35                  40                  45                        

tac aaa cgt tat aag gac gaa gga ttt acc atc atc ggg ttc cca tgc        192
Tyr Lys Arg Tyr Lys Asp Glu Gly Phe Thr Ile Ile Gly Phe Pro Cys           
    50                  55                  60                            

aac cag ttt ggc cac caa gaa cct ggc tct gat gaa gaa att gcc cag        240
Asn Gln Phe Gly His Gln Glu Pro Gly Ser Asp Glu Glu Ile Ala Gln           
65                  70                  75                  80            

ttc tgc caa ctg aac tat ggc gtg act ttc ccc att atg aaa aaa att        288
Phe Cys Gln Leu Asn Tyr Gly Val Thr Phe Pro Ile Met Lys Lys Ile           
                85                  90                  95                

gac gtt aat ggt ggc aat gag gac cct gtt tac aag ttt ttg aag agc        336
Asp Val Asn Gly Gly Asn Glu Asp Pro Val Tyr Lys Phe Leu Lys Ser           
            100                 105                 110                   

caa aaa tcc ggt atg ttg ggc ttg aga ggt atc aaa tgg aat ttt gaa        384
Gln Lys Ser Gly Met Leu Gly Leu Arg Gly Ile Lys Trp Asn Phe Glu           
        115                 120                 125                       

aaa ttc tta gtc gat aaa aag ggt aaa gtg tac gaa aga tac tct tca        432
Lys Phe Leu Val Asp Lys Lys Gly Lys Val Tyr Glu Arg Tyr Ser Ser           
    130                 135                 140                           

cta acc aaa cct tct tcg ttg tcc gaa acc atc gaa gaa ctt ttg aaa        480
Leu Thr Lys Pro Ser Ser Leu Ser Glu Thr Ile Glu Glu Leu Leu Lys           
145                 150                 155                 160           

gag gtg gaa tag                                                        492
Glu Val Glu                                                               
                                                                          


<210>  26
<211>  163
<212>  PRT
<213>  Saccharomyces cerevisiae

<400>  26

Met Ser Glu Phe Tyr Lys Leu Ala Pro Val Asp Lys Lys Gly Gln Pro 
1               5                   10                  15      


Phe Pro Phe Asp Gln Leu Lys Gly Lys Val Val Leu Ile Val Asn Val 
            20                  25                  30          


Ala Ser Lys Cys Gly Phe Thr Pro Gln Tyr Lys Glu Leu Glu Ala Leu 
        35                  40                  45              


Tyr Lys Arg Tyr Lys Asp Glu Gly Phe Thr Ile Ile Gly Phe Pro Cys 
    50                  55                  60                  


Asn Gln Phe Gly His Gln Glu Pro Gly Ser Asp Glu Glu Ile Ala Gln 
65                  70                  75                  80  


Phe Cys Gln Leu Asn Tyr Gly Val Thr Phe Pro Ile Met Lys Lys Ile 
                85                  90                  95      


Asp Val Asn Gly Gly Asn Glu Asp Pro Val Tyr Lys Phe Leu Lys Ser 
            100                 105                 110         


Gln Lys Ser Gly Met Leu Gly Leu Arg Gly Ile Lys Trp Asn Phe Glu 
        115                 120                 125             


Lys Phe Leu Val Asp Lys Lys Gly Lys Val Tyr Glu Arg Tyr Ser Ser 
    130                 135                 140                 


Leu Thr Lys Pro Ser Ser Leu Ser Glu Thr Ile Glu Glu Leu Leu Lys 
145                 150                 155                 160 


Glu Val Glu 
            


<210>  27
<211>  507
<212>  DNA
<213>  Yarrowia lipolytica


<220>
<221>  CDS
<222>  (1)..(507)
<223>  YALI0E02310; GenBank Accession No. XP_503454

<400>  27
atg tcc gcc gag aaa acc aat acc gct ttc tac aac ctc gct cca ctc         48
Met Ser Ala Glu Lys Thr Asn Thr Ala Phe Tyr Asn Leu Ala Pro Leu           
1               5                   10                  15                

gac aag aac gga gag cct ttc ccc ttc aag cag ctt gag ggc aag gtc         96
Asp Lys Asn Gly Glu Pro Phe Pro Phe Lys Gln Leu Glu Gly Lys Val           
            20                  25                  30                    

gtg ctc atc gtg aac gtc gcc tcc aag tgt ggc ttt act ccc caa tac        144
Val Leu Ile Val Asn Val Ala Ser Lys Cys Gly Phe Thr Pro Gln Tyr           
        35                  40                  45                        

aag ggc ctt gag gag gtc tac cag aag tac aag gat cag gga ttc acc        192
Lys Gly Leu Glu Glu Val Tyr Gln Lys Tyr Lys Asp Gln Gly Phe Thr           
    50                  55                  60                            

atc atc ggc ttc ccc tgc aac cag ttt ggt ggc caa gag cct ggt tcc        240
Ile Ile Gly Phe Pro Cys Asn Gln Phe Gly Gly Gln Glu Pro Gly Ser           
65                  70                  75                  80            

gct gac gag atc tcc tcc ttc tgt cag ctg aac tac ggc gtc act ttc        288
Ala Asp Glu Ile Ser Ser Phe Cys Gln Leu Asn Tyr Gly Val Thr Phe           
                85                  90                  95                

ccc gtt ctt cag aag atc aac gtc aac ggc aac gac gcc gac ccc gtc        336
Pro Val Leu Gln Lys Ile Asn Val Asn Gly Asn Asp Ala Asp Pro Val           
            100                 105                 110                   

tac gtc tac ctg aag gag cag aag gct ggt ctg ctg ggc ttc cga gga        384
Tyr Val Tyr Leu Lys Glu Gln Lys Ala Gly Leu Leu Gly Phe Arg Gly           
        115                 120                 125                       

atc aag tgg aac ttt gag aag ttc ctg gtt gat aag cac ggt aac gtc        432
Ile Lys Trp Asn Phe Glu Lys Phe Leu Val Asp Lys His Gly Asn Val           
    130                 135                 140                           

gtc gac cga tat gct tcc ctc aag acc ccc gcc ggc ctc gaa tcc acc        480
Val Asp Arg Tyr Ala Ser Leu Lys Thr Pro Ala Gly Leu Glu Ser Thr           
145                 150                 155                 160           

atc gag acc ctc ctc aaa aag ccc taa                                    507
Ile Glu Thr Leu Leu Lys Lys Pro                                           
                165                                                       


<210>  28
<211>  168
<212>  PRT
<213>  Yarrowia lipolytica

<400>  28

Met Ser Ala Glu Lys Thr Asn Thr Ala Phe Tyr Asn Leu Ala Pro Leu 
1               5                   10                  15      


Asp Lys Asn Gly Glu Pro Phe Pro Phe Lys Gln Leu Glu Gly Lys Val 
            20                  25                  30          


Val Leu Ile Val Asn Val Ala Ser Lys Cys Gly Phe Thr Pro Gln Tyr 
        35                  40                  45              


Lys Gly Leu Glu Glu Val Tyr Gln Lys Tyr Lys Asp Gln Gly Phe Thr 
    50                  55                  60                  


Ile Ile Gly Phe Pro Cys Asn Gln Phe Gly Gly Gln Glu Pro Gly Ser 
65                  70                  75                  80  


Ala Asp Glu Ile Ser Ser Phe Cys Gln Leu Asn Tyr Gly Val Thr Phe 
                85                  90                  95      


Pro Val Leu Gln Lys Ile Asn Val Asn Gly Asn Asp Ala Asp Pro Val 
            100                 105                 110         


Tyr Val Tyr Leu Lys Glu Gln Lys Ala Gly Leu Leu Gly Phe Arg Gly 
        115                 120                 125             


Ile Lys Trp Asn Phe Glu Lys Phe Leu Val Asp Lys His Gly Asn Val 
    130                 135                 140                 


Val Asp Arg Tyr Ala Ser Leu Lys Thr Pro Ala Gly Leu Glu Ser Thr 
145                 150                 155                 160 


Ile Glu Thr Leu Leu Lys Lys Pro 
                165             


<210>  29
<211>  7651
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Plasmid pYRH65

<400>  29
ggccgcaagt gtggatgggg aagtgagtgc ccggttctgt gtgcacaatt ggcaatccaa       60

gatggatgga ttcaacacag ggatatagcg agctacgtgg tggtgcgagg atatagcaac      120

ggatatttat gtttgacact tgagaatgta cgatacaagc actgtccaag tacaatacta      180

aacatactgt acatactcat actcgtaccc gggcaacggt ttcacttgag tgcagtggct      240

agtgctctta ctcgtacagt gtgcaatact gcgtatcata gtctttgatg tatatcgtat      300

tcattcatgt tagttgcgta cgttgattga ggtggagcca gatgggctat tgtttcatat      360

atagactggc agccacctct ttggcccagc atgtttgtat acctggaagg gaaaactaaa      420

gaagctggct agtttagttt gattattata gtagatgtcc taatcactag agattagaat      480

gtcttggcga tgattagtcg tcgtcccctg tatcatgtct agaccaactg tgtcatgaag      540

ttggtgctgg tgttttacct gtgtactaca agtaggtgtc ctagatctag tgtacagagc      600

cgtttagacc catgtggact tcaccattaa cgatggaaaa tgttcattat atgacagtat      660

attacaatgg acttgctcca tttcttcctt gcatcacatg ttctccacct ccatagttga      720

tcaacacatc atagtagcta aggctgctgc tctcccacta cagtccacca caagttaagt      780

agcaccgtca gtacagctaa aagtacacgt ctagtacgtt tcataactag tcaagtagcc      840

cctattacag atatcagcac tatcacgcac gagtttttct ctgtgctatc taatcaactt      900

gccaagtatt cggagaagat acactttctt ggcatcaggt atacgaggga gcctatcaga      960

tgaaaaaggg tatattggat ccattcatat ccacctacac gttgtcataa tctcctcatt     1020

cacgtgattc atttcgtgac actagtttct cactttcccc cccgcaccta tagtcaactt     1080

ggcggacacg ctacttgtag ctgacgttga tttatagacc caatcaaagc gggttatcgg     1140

tcaggtagca cttatcattc atcgttcata ctacgatgag caatctcggg catgtccgga     1200

aaagtgtcgg gcgcgccagc tgcattaatg aatcggccaa cgcgcgggga gaggcggttt     1260

gcgtattggg cgctcttccg cttcctcgct cactgactcg ctgcgctcgg tcgttcggct     1320

gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg ttatccacag aatcagggga     1380

taacgcagga aagaacatgt gagcaaaagg ccagcaaaag gccaggaacc gtaaaaaggc     1440

cgcgttgctg gcgtttttcc ataggctccg cccccctgac gagcatcaca aaaatcgacg     1500

ctcaagtcag aggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg     1560

aagctccctc gtgcgctctc ctgttccgac cctgccgctt accggatacc tgtccgcctt     1620

tctcccttcg ggaagcgtgg cgctttctca tagctcacgc tgtaggtatc tcagttcggt     1680

gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc cccgttcagc ccgaccgctg     1740

cgccttatcc ggtaactatc gtcttgagtc caacccggta agacacgact tatcgccact     1800

ggcagcagcc actggtaaca ggattagcag agcgaggtat gtaggcggtg ctacagagtt     1860

cttgaagtgg tggcctaact acggctacac tagaagaaca gtatttggta tctgcgctct     1920

gctgaagcca gttaccttcg gaaaaagagt tggtagctct tgatccggca aacaaaccac     1980

cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc     2040

tcaagaagat cctttgatct tttctacggg gtctgacgct cagtggaacg aaaactcacg     2100

ttaagggatt ttggtcatga gattatcaaa aaggatcttc acctagatcc ttttaaatta     2160

aaaatgaagt tttaaatcaa tctaaagtat atatgagtaa acttggtctg acagttacca     2220

atgcttaatc agtgaggcac ctatctcagc gatctgtcta tttcgttcat ccatagttgc     2280

ctgactcccc gtcgtgtaga taactacgat acgggagggc ttaccatctg gccccagtgc     2340

tgcaatgata ccgcgagacc cacgctcacc ggctccagat ttatcagcaa taaaccagcc     2400

agccggaagg gccgagcgca gaagtggtcc tgcaacttta tccgcctcca tccagtctat     2460

taattgttgc cgggaagcta gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt     2520

tgccattgct acaggcatcg tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc     2580

cggttcccaa cgatcaaggc gagttacatg atcccccatg ttgtgcaaaa aagcggttag     2640

ctccttcggt cctccgatcg ttgtcagaag taagttggcc gcagtgttat cactcatggt     2700

tatggcagca ctgcataatt ctcttactgt catgccatcc gtaagatgct tttctgtgac     2760

tggtgagtac tcaaccaagt cattctgaga atagtgtatg cggcgaccga gttgctcttg     2820

cccggcgtca atacgggata ataccgcgcc acatagcaga actttaaaag tgctcatcat     2880

tggaaaacgt tcttcggggc gaaaactctc aaggatctta ccgctgttga gatccagttc     2940

gatgtaaccc actcgtgcac ccaactgatc ttcagcatct tttactttca ccagcgtttc     3000

tgggtgagca aaaacaggaa ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa     3060

atgttgaata ctcatactct tcctttttca atattattga agcatttatc agggttattg     3120

tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg     3180

cacatttccc cgaaaagtgc cacctgatgc ggtgtgaaat accgcacaga tgcgtaagga     3240

gaaaataccg catcaggaaa ttgtaagcgt taatattttg ttaaaattcg cgttaaattt     3300

ttgttaaatc agctcatttt ttaaccaata ggccgaaatc ggcaaaatcc cttataaatc     3360

aaaagaatag accgagatag ggttgagtgt tgttccagtt tggaacaaga gtccactatt     3420

aaagaacgtg gactccaacg tcaaagggcg aaaaaccgtc tatcagggcg atggcccact     3480

acgtgaacca tcaccctaat caagtttttt ggggtcgagg tgccgtaaag cactaaatcg     3540

gaaccctaaa gggagccccc gatttagagc ttgacgggga aagccggcga acgtggcgag     3600

aaaggaaggg aagaaagcga aaggagcggg cgctagggcg ctggcaagtg tagcggtcac     3660

gctgcgcgta accaccacac ccgccgcgct taatgcgccg ctacagggcg cgtccattcg     3720

ccattcaggc tgcgcaactg ttgggaaggg cgatcggtgc gggcctcttc gctattacgc     3780

cagctggcga aagggggatg tgctgcaagg cgattaagtt gggtaacgcc agggttttcc     3840

cagtcacgac gttgtaaaac gacggccagt gaattgtaat acgactcact atagggcgaa     3900

ttgggcccga cgtcgcatgc attccgacag cagcgactgg gcaccatgat caagcgaaac     3960

accttccccc agctgccctg gcaaaccatc aagaacccta ctttcatcaa gtgcaagaac     4020

ggttctactc ttctcacctc cggtgtctac ggctggtgcc gaaagcctaa ctacaccgct     4080

gatttcatca tgtgcctcac ctgggctctc atgtgcggtg ttgcttctcc cctgccttac     4140

ttctacccgg tcttcttctt cctggtgctc atccaccgag cttaccgaga ctttgagcga     4200

ctggagcgaa agtacggtga ggactaccag gagttcaagc gacaggtccc ttggatcttc     4260

atcccttatg ttttctaaac gataagctta gtgagcgaat ggtgaggtta cttaattgag     4320

tggccagcct atgggattgt ataacagaca gtcaatatat tactgaaaag actgaacagc     4380

cagacggagt gaggttgtga gtgaatcgta gagggcggct attacagcaa gtctactcta     4440

cagtgtacta acacagcaga gaacaaatac aggtgtgcat tcggctatct gagaattagt     4500

tggagagctc gagaccctcg gcgataaact gctcctcggt tttgtgtcca tacttgtacg     4560

gaccattgta atggggcaag tcgttgagtt ctcgtcgtcc gacgttcaga gcacagaaac     4620

caatgtaatc aatgtagcag agatggttct gcaaaagatt gatttgtgcg agcaggttaa     4680

ttaagtcata cacaagtcag ctttcttcga gcctcatata agtataagta gttcaacgta     4740

ttagcactgt acccagcatc tccgtatcga gaaacacaac aacatgcccc attggacaga     4800

tcatgcggat acacaggttg tgcagtatca tacatactcg atcagacagg tcgtctgacc     4860

atcatacaag ctgaacaagc gctccatact tgcacgctct ctatatacac agttaaatta     4920

catatccata gtctaacctc taacagttaa tcttctggta agcctcccag ccagccttct     4980

ggtatcgctt ggcctcctca ataggatctc ggttctggcc gtacagacct cggccgacaa     5040

ttatgatatc cgttccggta gacatgacat cctcaacagt tcggtactgc tgtccgagag     5100

cgtctccctt gtcgtcaaga cccaccccgg gggtcagaat aagccagtcc tcagagtcgc     5160

ccttaggtcg gttctgggca atgaagccaa ccacaaactc ggggtcggat cgggcaagct     5220

caatggtctg cttggagtac tcgccagtgg ccagagagcc cttgcaagac agctcggcca     5280

gcatgagcag acctctggcc agcttctcgt tgggagaggg gactaggaac tccttgtact     5340

gggagttctc gtagtcagag acgtcctcct tcttctgttc agagacagtt tcctcggcac     5400

cagctcgcag gccagcaatg attccggttc cgggtacacc gtgggcgttg gtgatatcgg     5460

accactcggc gattcggtga caccggtact ggtgcttgac agtgttgcca atatctgcga     5520

actttctgtc ctcgaacagg aagaaaccgt gcttaagagc aagttccttg agggggagca     5580

cagtgccggc gtaggtgaag tcgtcaatga tgtcgatatg ggttttgatc atgcacacat     5640

aaggtccgac cttatcggca agctcaatga gctccttggt ggtggtaaca tccagagaag     5700

cacacaggtt ggttttcttg gctgccacga gcttgagcac tcgagcggca aaggcggact     5760

tgtggacgtt agctcgagct tcgtaggagg gcattttggt ggtgaagagg agactgaaat     5820

aaatttagtc tgcagaactt tttatcggaa ccttatctgg ggcagtgaag tatatgttat     5880

ggtaatagtt acgagttagt tgaacttata gatagactgg actatacggc tatcggtcca     5940

aattagaaag aacgtcaatg gctctctggg cgtcgccttt gccgacaaaa atgtgatcat     6000

gatgaaagcc agcaatgacg ttgcagctga tattgttgtc ggccaaccgc gccgaaaacg     6060

cagctgtcag acccacagcc tccaacgaag aatgtatcgt caaagtgatc caagcacact     6120

catagttgga gtcgtactcc aaaggcggca atgacgagtc agacagatac tcgtcgacgt     6180

ttaaacagtg tacgcagatc tactatagag gaacatttaa attgccccgg agaagacggc     6240

caggccgcct agatgacaaa ttcaacaact cacagctgac tttctgccat tgccactagg     6300

ggggggcctt tttatatggc caagccaagc tctccacgtc ggttgggctg cacccaacaa     6360

taaatgggta gggttgcacc aacaaaggga tgggatgggg ggtagaagat acgaggataa     6420

cggggctcaa tggcacaaat aagaacgaat actgccatta agactcgtga tccagcgact     6480

gacaccattg catcatctaa gggcctcaaa actacctcgg aactgctgcg ctgatctgga     6540

caccacagag gttccgagca ctttaggttg caccaaatgt cccaccaggt gcaggcagaa     6600

aacgctggaa cagcgtgtac agtttgtctt aacaaaaagt gagggcgctg aggtcgagca     6660

gggtggtgtg acttgttata gcctttagag ctgcgaaagc gcgtatggat ttggctcatc     6720

aggccagatt gagggtctgt ggacacatgt catgttagtg tacttcaatc gccccctgga     6780

tatagccccg acaataggcc gtggcctcat ttttttgcct tccgcacatt tccattgctc     6840

gatacccaca ccttgcttct cctgcacttg ccaaccttaa tactggttta cattgaccaa     6900

catcttacaa gcggggggct tgtctagggt atatataaac agtggctctc ccaatcggtt     6960

gccagtctct tttttccttt ctttccccac agattcgaaa tctaaactac acatcacaga     7020

attccgagcc gtgagtatcc acgacaagat cagtgtcgag acgacgcgtt ttgtgtaatg     7080

acacaatccg aaagtcgcta gcaacacaca ctctctacac aaactaaccc agctctggta     7140

ccatggccgc cgagaaaacc aataccgctt tctacaacct cgctccactc gacaagaacg     7200

gagagccttt ccccttcaag cagcttgagg gcaaggtcgt gctcatcgtg aacgtcgcct     7260

ccaagtgtgg ctttactccc caatacaagg gccttgagga ggtctaccag aagtacaagg     7320

atcagggatt caccatcatc ggcttcccct gcaaccagtt tggtggccaa gagcctggtt     7380

ccgctgacga gatctcctcc ttctgtcagc tgaactacgg cgtcactttc cccgttcttc     7440

agaagatcaa cgtcaacggc aacgacgccg accccgtcta cgtctacctg aaggagcaga     7500

aggctggtct gctgggcttc cgaggaatca agtggaactt tgagaagttc ctggttgata     7560

agcacggtaa cgtcgtcgac cgatatgctt ccctcaagac ccccgccggc ctcgaatcca     7620

ccatcgagac cctcctcaaa aagccctaag c                                    7651


<210>  30
<211>  44
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer GPX3-F

<400>  30
gatcaaccat ggccgccgag aaaaccaata ccgctttcta caac                        44


<210>  31
<211>  45
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer GPX3-R

<400>  31
gatcaagcgg ccgcttaggg ctttttgagg agggtctcga tggtg                       45


<210>  32
<211>  1164
<212>  DNA
<213>  Yarrowia lipolytica

<400>  32
taaagtagag agcatcccaa acaagcagtc gcagtcgcac tcatcgatat gcatatgtgc       60

tacttaactg tacgagtact gtacagtaca tacagtacct gtagtgattc acattcagtc      120

atacagtgca ggagtacttc cgcttgtctc acaggctttg tccatgtgcc aatgagtcag      180

acagacactt gtgcatgagg cagagcacac acatggcttc gttcaatctg ctgataggtc      240

gacattctgg gatctgctca ggttgttcag atgaccacct tctttttcac cccctctccc      300

tgtaccacca ggaccgtttc cgagacccac gtgaccctca aaccgtcgct cttgactttc      360

cccaggctct ccacctttgc cggctcaaag ctcggcgtct gtttatccct gtatccaatt      420

ttgcccacgc tggcatagag cagaatctcc acctgtctct ccacgacgtt tcttgacttg      480

ccaaacttga ctgattcaga gtagaccccc tgggaggaat gggaagagtt tgcggagtta      540

ccgaacagcg aagagaaggt gcctccatgg gtttccatct gccaaacgac gacacgtgtt      600

tctccgtcga aatcgggccc agggacgcta ttagacccta ttcccgtgag tccagcaacc      660

atttttccat ccggagaaaa gaccaggccc cacaccggag cagcatgatt ggaattgaat      720

tcctgggggt caagataggc atactctgag ccgattctca agtcgtagac aatcagagaa      780

caggtgtcgg tgaccttaat atccaggtct ctgttgagtt cagacagaga agatcttcgt      840

cgagacacat tcttttcaat catcacagca gtggcagtag gataaatagc cacagccaga      900

cgttgactgg gcttatggaa cgtgacaatg tacggaaatg tctgtgtgat ttgagacagt      960

agagctgtga ccttggactg cagagaaacg cctctctgga gggtcgagtg acgcagcaag     1020

tccggattca gcattttgca agcagtgtgc atcacaaacg gcacaaacat gtccatggag     1080

gaggattttc gggtgtggct gaagaagctg gaaagcacat cgatagctgt gattcgcaca     1140

actaacggct tgtcgaggtg catg                                            1164


<210>  33
<211>  591
<212>  DNA
<213>  Saccharomyces cerevisiae


<220>
<221>  CDS
<222>  (1)..(591)
<223>  GenBank Accession No. NM_001182386.1

<400>  33
atg gtc gct caa gtt caa aag caa gct cca act ttt aag aaa act gcc         48
Met Val Ala Gln Val Gln Lys Gln Ala Pro Thr Phe Lys Lys Thr Ala           
1               5                   10                  15                

gtc gtc gac ggt gtc ttt gac gaa gtc tcc ttg gac aaa tac aag ggt         96
Val Val Asp Gly Val Phe Asp Glu Val Ser Leu Asp Lys Tyr Lys Gly           
            20                  25                  30                    

aag tac gtt gtc cta gcc ttt att cca ttg gcc ttc act ttc gtc tgt        144
Lys Tyr Val Val Leu Ala Phe Ile Pro Leu Ala Phe Thr Phe Val Cys           
        35                  40                  45                        

cca acc gaa atc att gct ttc tca gaa gct gct aag aaa ttc gaa gaa        192
Pro Thr Glu Ile Ile Ala Phe Ser Glu Ala Ala Lys Lys Phe Glu Glu           
    50                  55                  60                            

caa ggc gct caa gtt ctt ttc gcc tcc act gac tcc gaa tac tcc ctt        240
Gln Gly Ala Gln Val Leu Phe Ala Ser Thr Asp Ser Glu Tyr Ser Leu           
65                  70                  75                  80            

ttg gca tgg acc aat atc cca aga aag gaa ggt ggt ttg ggc cca atc        288
Leu Ala Trp Thr Asn Ile Pro Arg Lys Glu Gly Gly Leu Gly Pro Ile           
                85                  90                  95                

aac att cca ttg ttg gct gac acc aac cac tct ttg tcc aga gac tat        336
Asn Ile Pro Leu Leu Ala Asp Thr Asn His Ser Leu Ser Arg Asp Tyr           
            100                 105                 110                   

ggt gtc ttg atc gaa gaa gaa ggt gtc gcc ttg aga ggt ttg ttc atc        384
Gly Val Leu Ile Glu Glu Glu Gly Val Ala Leu Arg Gly Leu Phe Ile           
        115                 120                 125                       

atc gac cca aag ggt gtc att aga cac atc acc att aac gat ttg cca        432
Ile Asp Pro Lys Gly Val Ile Arg His Ile Thr Ile Asn Asp Leu Pro           
    130                 135                 140                           

gtc ggt aga aac gtt gac gaa gcc ttg aga ttg gtt gaa gcc ttc caa        480
Val Gly Arg Asn Val Asp Glu Ala Leu Arg Leu Val Glu Ala Phe Gln           
145                 150                 155                 160           

tgg acc gac aag aac ggt act gtc ttg cca tgt aac tgg act cca ggt        528
Trp Thr Asp Lys Asn Gly Thr Val Leu Pro Cys Asn Trp Thr Pro Gly           
                165                 170                 175               

gct gct acc atc aag cca acc gtt gaa gac tcc aag gaa tac ttc gaa        576
Ala Ala Thr Ile Lys Pro Thr Val Glu Asp Ser Lys Glu Tyr Phe Glu           
            180                 185                 190                   

gct gcc aac aaa taa                                                    591
Ala Ala Asn Lys                                                           
        195                                                               


<210>  34
<211>  196
<212>  PRT
<213>  Saccharomyces cerevisiae

<400>  34

Met Val Ala Gln Val Gln Lys Gln Ala Pro Thr Phe Lys Lys Thr Ala 
1               5                   10                  15      


Val Val Asp Gly Val Phe Asp Glu Val Ser Leu Asp Lys Tyr Lys Gly 
            20                  25                  30          


Lys Tyr Val Val Leu Ala Phe Ile Pro Leu Ala Phe Thr Phe Val Cys 
        35                  40                  45              


Pro Thr Glu Ile Ile Ala Phe Ser Glu Ala Ala Lys Lys Phe Glu Glu 
    50                  55                  60                  


Gln Gly Ala Gln Val Leu Phe Ala Ser Thr Asp Ser Glu Tyr Ser Leu 
65                  70                  75                  80  


Leu Ala Trp Thr Asn Ile Pro Arg Lys Glu Gly Gly Leu Gly Pro Ile 
                85                  90                  95      


Asn Ile Pro Leu Leu Ala Asp Thr Asn His Ser Leu Ser Arg Asp Tyr 
            100                 105                 110         


Gly Val Leu Ile Glu Glu Glu Gly Val Ala Leu Arg Gly Leu Phe Ile 
        115                 120                 125             


Ile Asp Pro Lys Gly Val Ile Arg His Ile Thr Ile Asn Asp Leu Pro 
    130                 135                 140                 


Val Gly Arg Asn Val Asp Glu Ala Leu Arg Leu Val Glu Ala Phe Gln 
145                 150                 155                 160 


Trp Thr Asp Lys Asn Gly Thr Val Leu Pro Cys Asn Trp Thr Pro Gly 
                165                 170                 175     


Ala Ala Thr Ile Lys Pro Thr Val Glu Asp Ser Lys Glu Tyr Phe Glu 
            180                 185                 190         


Ala Ala Asn Lys 
        195     


<210>  35
<211>  591
<212>  DNA
<213>  Yarrowia lipolytica


<220>
<221>  CDS
<222>  (1)..(591)
<223>  YALI0B15125; GenBank Accession No. XM_500915

<400>  35
atg gtc gcc act gtt cag cat ccc gcc ccc gac ttc aag aag act gcc         48
Met Val Ala Thr Val Gln His Pro Ala Pro Asp Phe Lys Lys Thr Ala           
1               5                   10                  15                

gtc tct ggt ggt gtc ttc gag gag gtc tcc ctc gac cag ttc aag ggt         96
Val Ser Gly Gly Val Phe Glu Glu Val Ser Leu Asp Gln Phe Lys Gly           
            20                  25                  30                    

aag tgg gtt gtc ctc gcc ttc atc ccc ctg gct ttc acc ttc gtc tgc        144
Lys Trp Val Val Leu Ala Phe Ile Pro Leu Ala Phe Thr Phe Val Cys           
        35                  40                  45                        

ccc acc gag atc atc gct tac tcc gat gcc gtc tct cag ttc aag gag        192
Pro Thr Glu Ile Ile Ala Tyr Ser Asp Ala Val Ser Gln Phe Lys Glu           
    50                  55                  60                            

cga ggc gcc gag gtt ctc ttt gcc tcc acc gac tcc gag tac tct ctg        240
Arg Gly Ala Glu Val Leu Phe Ala Ser Thr Asp Ser Glu Tyr Ser Leu           
65                  70                  75                  80            

ctt gcc tgg acc aac gtt gcc cga aag gat ggt ggt ctt ggt ccc gtc        288
Leu Ala Trp Thr Asn Val Ala Arg Lys Asp Gly Gly Leu Gly Pro Val           
                85                  90                  95                

aac atc ccc ctg ctt gct gac acc aac cac acc ctg tcc aag gac tac        336
Asn Ile Pro Leu Leu Ala Asp Thr Asn His Thr Leu Ser Lys Asp Tyr           
            100                 105                 110                   

ggt gtt ctc atc ccc gag gcc ggt gtc gct ctc cga ggt atc ttc atc        384
Gly Val Leu Ile Pro Glu Ala Gly Val Ala Leu Arg Gly Ile Phe Ile           
        115                 120                 125                       

att gac ccc aag ggc gtt gtc cga cag atc acc atc aac gat ctc ccc        432
Ile Asp Pro Lys Gly Val Val Arg Gln Ile Thr Ile Asn Asp Leu Pro           
    130                 135                 140                           

gtt ggc cga tcc gtc gag gag acc ctc cga ctc atc gat gcc ttc cag        480
Val Gly Arg Ser Val Glu Glu Thr Leu Arg Leu Ile Asp Ala Phe Gln           
145                 150                 155                 160           

ttc act gag aag cac ggt gag gtc tgc ccc gcc aac tgg cag aag ggc        528
Phe Thr Glu Lys His Gly Glu Val Cys Pro Ala Asn Trp Gln Lys Gly           
                165                 170                 175               

tcc gat act atc aag gct gac cct gtc aac gcc aag gag tac ttc gag        576
Ser Asp Thr Ile Lys Ala Asp Pro Val Asn Ala Lys Glu Tyr Phe Glu           
            180                 185                 190                   

aag gcc aac aaa taa                                                    591
Lys Ala Asn Lys                                                           
        195                                                               


<210>  36
<211>  196
<212>  PRT
<213>  Yarrowia lipolytica

<400>  36

Met Val Ala Thr Val Gln His Pro Ala Pro Asp Phe Lys Lys Thr Ala 
1               5                   10                  15      


Val Ser Gly Gly Val Phe Glu Glu Val Ser Leu Asp Gln Phe Lys Gly 
            20                  25                  30          


Lys Trp Val Val Leu Ala Phe Ile Pro Leu Ala Phe Thr Phe Val Cys 
        35                  40                  45              


Pro Thr Glu Ile Ile Ala Tyr Ser Asp Ala Val Ser Gln Phe Lys Glu 
    50                  55                  60                  


Arg Gly Ala Glu Val Leu Phe Ala Ser Thr Asp Ser Glu Tyr Ser Leu 
65                  70                  75                  80  


Leu Ala Trp Thr Asn Val Ala Arg Lys Asp Gly Gly Leu Gly Pro Val 
                85                  90                  95      


Asn Ile Pro Leu Leu Ala Asp Thr Asn His Thr Leu Ser Lys Asp Tyr 
            100                 105                 110         


Gly Val Leu Ile Pro Glu Ala Gly Val Ala Leu Arg Gly Ile Phe Ile 
        115                 120                 125             


Ile Asp Pro Lys Gly Val Val Arg Gln Ile Thr Ile Asn Asp Leu Pro 
    130                 135                 140                 


Val Gly Arg Ser Val Glu Glu Thr Leu Arg Leu Ile Asp Ala Phe Gln 
145                 150                 155                 160 


Phe Thr Glu Lys His Gly Glu Val Cys Pro Ala Asn Trp Gln Lys Gly 
                165                 170                 175     


Ser Asp Thr Ile Lys Ala Asp Pro Val Asn Ala Lys Glu Tyr Phe Glu 
            180                 185                 190         


Lys Ala Asn Lys 
        195     


<210>  37
<211>  2025
<212>  DNA
<213>  Saccharomyces cerevisiae


<220>
<221>  CDS
<222>  (1)..(2025)
<223>  GenBank Accession No. NM_001178564.1

<400>  37
atg gaa cca att gat gac ata ctt ttt gag gtt act gat gcg ttc aaa         48
Met Glu Pro Ile Asp Asp Ile Leu Phe Glu Val Thr Asp Ala Phe Lys           
1               5                   10                  15                

act cag aag gag gat ctt ctt gag ttg gta aca ttg att gat ata tat         96
Thr Gln Lys Glu Asp Leu Leu Glu Leu Val Thr Leu Ile Asp Ile Tyr           
            20                  25                  30                    

ggc gag caa gtt aac caa gag ggg agc tat gaa gaa aag acg aga ttc        144
Gly Glu Gln Val Asn Gln Glu Gly Ser Tyr Glu Glu Lys Thr Arg Phe           
        35                  40                  45                        

att gaa act ttg aat aca ttg tta gag gat aat ccg agt act act ggt        192
Ile Glu Thr Leu Asn Thr Leu Leu Glu Asp Asn Pro Ser Thr Thr Gly           
    50                  55                  60                            

gaa atc ggt tgg gat ctg cct aag gga tta ttg aag ttc ttg tca aag        240
Glu Ile Gly Trp Asp Leu Pro Lys Gly Leu Leu Lys Phe Leu Ser Lys           
65                  70                  75                  80            

gat aat gtc gat gta aat gga aga cta ggt acg aat atg att gtc caa        288
Asp Asn Val Asp Val Asn Gly Arg Leu Gly Thr Asn Met Ile Val Gln           
                85                  90                  95                

ggt gta atg aag tgt ttc tat gcc atc tca atc caa ggc gag ccc aaa        336
Gly Val Met Lys Cys Phe Tyr Ala Ile Ser Ile Gln Gly Glu Pro Lys           
            100                 105                 110                   

aaa tgt tta att act ggg ttg gag ttg ctt tca tcc ctt tgt tca aaa        384
Lys Cys Leu Ile Thr Gly Leu Glu Leu Leu Ser Ser Leu Cys Ser Lys           
        115                 120                 125                       

gat ttt tcc aag agt gat caa cag aat aag gaa gac ttt gtt gat aaa        432
Asp Phe Ser Lys Ser Asp Gln Gln Asn Lys Glu Asp Phe Val Asp Lys           
    130                 135                 140                           

aag gcc aat acg tta cct cct gaa gga gta atc gaa aat tcc tct aat        480
Lys Ala Asn Thr Leu Pro Pro Glu Gly Val Ile Glu Asn Ser Ser Asn           
145                 150                 155                 160           

cga aaa gat ttt cca tcc tac ggt gaa agc aag agt tca aat gaa ttt        528
Arg Lys Asp Phe Pro Ser Tyr Gly Glu Ser Lys Ser Ser Asn Glu Phe           
                165                 170                 175               

ttc ttg aag ttg aaa tcc tac att tta ttt gaa ttc ata ggg gcg agt        576
Phe Leu Lys Leu Lys Ser Tyr Ile Leu Phe Glu Phe Ile Gly Ala Ser           
            180                 185                 190                   

ctg aaa agg att tct act ctg ttt cct tcg aaa tat ctg gga gct gct        624
Leu Lys Arg Ile Ser Thr Leu Phe Pro Ser Lys Tyr Leu Gly Ala Ala           
        195                 200                 205                       

gtg tca aca att gag aaa ttt gtg tat agt cat gcg gac act ttt gaa        672
Val Ser Thr Ile Glu Lys Phe Val Tyr Ser His Ala Asp Thr Phe Glu           
    210                 215                 220                           

gat gcc ctt ttc ctt ctt cgt agg gtg tac aca ttc tgc agg aac tat        720
Asp Ala Leu Phe Leu Leu Arg Arg Val Tyr Thr Phe Cys Arg Asn Tyr           
225                 230                 235                 240           

att ccc cct gat cca cca aaa gat ata caa ttg aac gaa gat ttt act        768
Ile Pro Pro Asp Pro Pro Lys Asp Ile Gln Leu Asn Glu Asp Phe Thr           
                245                 250                 255               

cga gag atg ttt gat aaa gtt gtg gag gaa gaa agt gaa tta cag gtt        816
Arg Glu Met Phe Asp Lys Val Val Glu Glu Glu Ser Glu Leu Gln Val           
            260                 265                 270                   

aga cta ttg cgt agg ctt tgt act ttt ggt att tcg aca ccc ata aaa        864
Arg Leu Leu Arg Arg Leu Cys Thr Phe Gly Ile Ser Thr Pro Ile Lys           
        275                 280                 285                       

act gtc acc acc aat gcc gac gtg aaa tac tat tgt gca cta aat caa        912
Thr Val Thr Thr Asn Ala Asp Val Lys Tyr Tyr Cys Ala Leu Asn Gln           
    290                 295                 300                           

cag aag ttt gaa tta tct gca tat tac acc gaa tat ctt gag cta ttt        960
Gln Lys Phe Glu Leu Ser Ala Tyr Tyr Thr Glu Tyr Leu Glu Leu Phe           
305                 310                 315                 320           

tgc agg tat tac caa atg gcg ttc tcg ctt gat gtt gat ata gag gga       1008
Cys Arg Tyr Tyr Gln Met Ala Phe Ser Leu Asp Val Asp Ile Glu Gly           
                325                 330                 335               

gaa ttt cag aat gtg ata aaa gaa tgt agg att att tat aag tct gta       1056
Glu Phe Gln Asn Val Ile Lys Glu Cys Arg Ile Ile Tyr Lys Ser Val           
            340                 345                 350                   

ccc cag gag att tcc gct gtt aat gat gaa gca aag ttg gtt ttg gaa       1104
Pro Gln Glu Ile Ser Ala Val Asn Asp Glu Ala Lys Leu Val Leu Glu           
        355                 360                 365                       

aga atg gta tat aaa ttg gct tat aca ttc gaa gta caa aag gcc gct       1152
Arg Met Val Tyr Lys Leu Ala Tyr Thr Phe Glu Val Gln Lys Ala Ala           
    370                 375                 380                           

aaa gaa aaa aat gtt ggt ttg gac tat aat ggt gta ata tta ttt tct       1200
Lys Glu Lys Asn Val Gly Leu Asp Tyr Asn Gly Val Ile Leu Phe Ser           
385                 390                 395                 400           

ggt atc cac tat ttg gaa acc aat caa cat tta gta aag gaa atg aat       1248
Gly Ile His Tyr Leu Glu Thr Asn Gln His Leu Val Lys Glu Met Asn           
                405                 410                 415               

ata acg gat gcc att tat ctc tac ttg aga ttt aca act cca tca tta       1296
Ile Thr Asp Ala Ile Tyr Leu Tyr Leu Arg Phe Thr Thr Pro Ser Leu           
            420                 425                 430                   

tat tct aaa gtt tac tat aat gta gca gtt gaa tca gtt agt cgc tac       1344
Tyr Ser Lys Val Tyr Tyr Asn Val Ala Val Glu Ser Val Ser Arg Tyr           
        435                 440                 445                       

tgg cta tgg tat gct att aca acc gag ccc ttg gag gat gta aaa aaa       1392
Trp Leu Trp Tyr Ala Ile Thr Thr Glu Pro Leu Glu Asp Val Lys Lys           
    450                 455                 460                           

gaa ttg aag aat ctt tca gtg ttt gtt aca aaa aca tta ttg cat gtt       1440
Glu Leu Lys Asn Leu Ser Val Phe Val Thr Lys Thr Leu Leu His Val           
465                 470                 475                 480           

cta ctt caa aag aac tgt att cag gtc aat cag cag tta aga atg ata       1488
Leu Leu Gln Lys Asn Cys Ile Gln Val Asn Gln Gln Leu Arg Met Ile           
                485                 490                 495               

act ttc act ctt ctc acc aga tta cta tgt tta ata cct gaa aaa gtt       1536
Thr Phe Thr Leu Leu Thr Arg Leu Leu Cys Leu Ile Pro Glu Lys Val           
            500                 505                 510                   

gca ttt gag ttt atc tta gat gtg ctt aag aca tct ccc ctt cca ttg       1584
Ala Phe Glu Phe Ile Leu Asp Val Leu Lys Thr Ser Pro Leu Pro Leu           
        515                 520                 525                       

gct aag acg tcc gta tta tgt gtt ttt aaa gac ctt tca agg cga cgc       1632
Ala Lys Thr Ser Val Leu Cys Val Phe Lys Asp Leu Ser Arg Arg Arg           
    530                 535                 540                           

atc tcc acc aag gat aac gat tct gag acg gat ttg att gtc gaa aaa       1680
Ile Ser Thr Lys Asp Asn Asp Ser Glu Thr Asp Leu Ile Val Glu Lys           
545                 550                 555                 560           

tta tcc aaa ctg aag gtc aat gat agt aac aaa gct cag caa agt aac       1728
Leu Ser Lys Leu Lys Val Asn Asp Ser Asn Lys Ala Gln Gln Ser Asn           
                565                 570                 575               

atc aga cat tat atc caa cta gat tct tcc aaa atg aaa gct gtt cat       1776
Ile Arg His Tyr Ile Gln Leu Asp Ser Ser Lys Met Lys Ala Val His           
            580                 585                 590                   

gac tgt tgt ctg cag act atc caa gat tca ttt acg gca gat gcc aag       1824
Asp Cys Cys Leu Gln Thr Ile Gln Asp Ser Phe Thr Ala Asp Ala Lys           
        595                 600                 605                       

aag agt gat ata tta tta ctg cta act tac ttg aat att ttc att gtg       1872
Lys Ser Asp Ile Leu Leu Leu Leu Thr Tyr Leu Asn Ile Phe Ile Val           
    610                 615                 620                           

cta aaa aaa aca tgg gat gaa gat cta ctg aag att gtt tgt tcg aag       1920
Leu Lys Lys Thr Trp Asp Glu Asp Leu Leu Lys Ile Val Cys Ser Lys           
625                 630                 635                 640           

att gat tct aat ttg aag tca gtc gaa cct gat aaa ctt ccg aag tat       1968
Ile Asp Ser Asn Leu Lys Ser Val Glu Pro Asp Lys Leu Pro Lys Tyr           
                645                 650                 655               

aag gaa att gtg gat aaa aac gaa tct cta aat gac tat ttt act ggt       2016
Lys Glu Ile Val Asp Lys Asn Glu Ser Leu Asn Asp Tyr Phe Thr Gly           
            660                 665                 670                   

ata aaa tga                                                           2025
Ile Lys                                                                   
                                                                          


<210>  38
<211>  674
<212>  PRT
<213>  Saccharomyces cerevisiae

<400>  38

Met Glu Pro Ile Asp Asp Ile Leu Phe Glu Val Thr Asp Ala Phe Lys 
1               5                   10                  15      


Thr Gln Lys Glu Asp Leu Leu Glu Leu Val Thr Leu Ile Asp Ile Tyr 
            20                  25                  30          


Gly Glu Gln Val Asn Gln Glu Gly Ser Tyr Glu Glu Lys Thr Arg Phe 
        35                  40                  45              


Ile Glu Thr Leu Asn Thr Leu Leu Glu Asp Asn Pro Ser Thr Thr Gly 
    50                  55                  60                  


Glu Ile Gly Trp Asp Leu Pro Lys Gly Leu Leu Lys Phe Leu Ser Lys 
65                  70                  75                  80  


Asp Asn Val Asp Val Asn Gly Arg Leu Gly Thr Asn Met Ile Val Gln 
                85                  90                  95      


Gly Val Met Lys Cys Phe Tyr Ala Ile Ser Ile Gln Gly Glu Pro Lys 
            100                 105                 110         


Lys Cys Leu Ile Thr Gly Leu Glu Leu Leu Ser Ser Leu Cys Ser Lys 
        115                 120                 125             


Asp Phe Ser Lys Ser Asp Gln Gln Asn Lys Glu Asp Phe Val Asp Lys 
    130                 135                 140                 


Lys Ala Asn Thr Leu Pro Pro Glu Gly Val Ile Glu Asn Ser Ser Asn 
145                 150                 155                 160 


Arg Lys Asp Phe Pro Ser Tyr Gly Glu Ser Lys Ser Ser Asn Glu Phe 
                165                 170                 175     


Phe Leu Lys Leu Lys Ser Tyr Ile Leu Phe Glu Phe Ile Gly Ala Ser 
            180                 185                 190         


Leu Lys Arg Ile Ser Thr Leu Phe Pro Ser Lys Tyr Leu Gly Ala Ala 
        195                 200                 205             


Val Ser Thr Ile Glu Lys Phe Val Tyr Ser His Ala Asp Thr Phe Glu 
    210                 215                 220                 


Asp Ala Leu Phe Leu Leu Arg Arg Val Tyr Thr Phe Cys Arg Asn Tyr 
225                 230                 235                 240 


Ile Pro Pro Asp Pro Pro Lys Asp Ile Gln Leu Asn Glu Asp Phe Thr 
                245                 250                 255     


Arg Glu Met Phe Asp Lys Val Val Glu Glu Glu Ser Glu Leu Gln Val 
            260                 265                 270         


Arg Leu Leu Arg Arg Leu Cys Thr Phe Gly Ile Ser Thr Pro Ile Lys 
        275                 280                 285             


Thr Val Thr Thr Asn Ala Asp Val Lys Tyr Tyr Cys Ala Leu Asn Gln 
    290                 295                 300                 


Gln Lys Phe Glu Leu Ser Ala Tyr Tyr Thr Glu Tyr Leu Glu Leu Phe 
305                 310                 315                 320 


Cys Arg Tyr Tyr Gln Met Ala Phe Ser Leu Asp Val Asp Ile Glu Gly 
                325                 330                 335     


Glu Phe Gln Asn Val Ile Lys Glu Cys Arg Ile Ile Tyr Lys Ser Val 
            340                 345                 350         


Pro Gln Glu Ile Ser Ala Val Asn Asp Glu Ala Lys Leu Val Leu Glu 
        355                 360                 365             


Arg Met Val Tyr Lys Leu Ala Tyr Thr Phe Glu Val Gln Lys Ala Ala 
    370                 375                 380                 


Lys Glu Lys Asn Val Gly Leu Asp Tyr Asn Gly Val Ile Leu Phe Ser 
385                 390                 395                 400 


Gly Ile His Tyr Leu Glu Thr Asn Gln His Leu Val Lys Glu Met Asn 
                405                 410                 415     


Ile Thr Asp Ala Ile Tyr Leu Tyr Leu Arg Phe Thr Thr Pro Ser Leu 
            420                 425                 430         


Tyr Ser Lys Val Tyr Tyr Asn Val Ala Val Glu Ser Val Ser Arg Tyr 
        435                 440                 445             


Trp Leu Trp Tyr Ala Ile Thr Thr Glu Pro Leu Glu Asp Val Lys Lys 
    450                 455                 460                 


Glu Leu Lys Asn Leu Ser Val Phe Val Thr Lys Thr Leu Leu His Val 
465                 470                 475                 480 


Leu Leu Gln Lys Asn Cys Ile Gln Val Asn Gln Gln Leu Arg Met Ile 
                485                 490                 495     


Thr Phe Thr Leu Leu Thr Arg Leu Leu Cys Leu Ile Pro Glu Lys Val 
            500                 505                 510         


Ala Phe Glu Phe Ile Leu Asp Val Leu Lys Thr Ser Pro Leu Pro Leu 
        515                 520                 525             


Ala Lys Thr Ser Val Leu Cys Val Phe Lys Asp Leu Ser Arg Arg Arg 
    530                 535                 540                 


Ile Ser Thr Lys Asp Asn Asp Ser Glu Thr Asp Leu Ile Val Glu Lys 
545                 550                 555                 560 


Leu Ser Lys Leu Lys Val Asn Asp Ser Asn Lys Ala Gln Gln Ser Asn 
                565                 570                 575     


Ile Arg His Tyr Ile Gln Leu Asp Ser Ser Lys Met Lys Ala Val His 
            580                 585                 590         


Asp Cys Cys Leu Gln Thr Ile Gln Asp Ser Phe Thr Ala Asp Ala Lys 
        595                 600                 605             


Lys Ser Asp Ile Leu Leu Leu Leu Thr Tyr Leu Asn Ile Phe Ile Val 
    610                 615                 620                 


Leu Lys Lys Thr Trp Asp Glu Asp Leu Leu Lys Ile Val Cys Ser Lys 
625                 630                 635                 640 


Ile Asp Ser Asn Leu Lys Ser Val Glu Pro Asp Lys Leu Pro Lys Tyr 
                645                 650                 655     


Lys Glu Ile Val Asp Lys Asn Glu Ser Leu Asn Asp Tyr Phe Thr Gly 
            660                 665                 670         


Ile Lys 
        


<210>  39
<211>  2025
<212>  DNA
<213>  Yarrowia lipolytica


<220>
<221>  CDS
<222>  (1)..(2025)
<223>  YALI0B03762; GenBank Accession No. XM_500469

<400>  39
atg caa cta acc gac gac cat aag aaa gac ctg gaa aag ctg ggc gag         48
Met Gln Leu Thr Asp Asp His Lys Lys Asp Leu Glu Lys Leu Gly Glu           
1               5                   10                  15                

gaa ttg aag ggc aag gag gag cac acg gtg gct ggg gag gat gag gaa         96
Glu Leu Lys Gly Lys Glu Glu His Thr Val Ala Gly Glu Asp Glu Glu           
            20                  25                  30                    

gat gtc aac cat ggc gcc gac gat tcc gaa gac gcc gaa gac gaa gac        144
Asp Val Asn His Gly Ala Asp Asp Ser Glu Asp Ala Glu Asp Glu Asp           
        35                  40                  45                        

gcc gaa gac gag aac gac tac acc gaa ctg gat gtg gac att gtg tgc        192
Ala Glu Asp Glu Asn Asp Tyr Thr Glu Leu Asp Val Asp Ile Val Cys           
    50                  55                  60                            

caa ttc atc aag gac gcc gcc aga gag gcc gag aag acg ggc gac tac        240
Gln Phe Ile Lys Asp Ala Ala Arg Glu Ala Glu Lys Thr Gly Asp Tyr           
65                  70                  75                  80            

att tcc tac gca acc gtc atc gac atc cac tgc tcg gat cca tcc aga        288
Ile Ser Tyr Ala Thr Val Ile Asp Ile His Cys Ser Asp Pro Ser Arg           
                85                  90                  95                

tac aag cac gta gac agg gtc aag atc ctc acg tct ctt ctg gag gtg        336
Tyr Lys His Val Asp Arg Val Lys Ile Leu Thr Ser Leu Leu Glu Val           
            100                 105                 110                   

ctg cgg acc aac ccc aag att tgc gag gaa att ggc tgg gat ctt cca        384
Leu Arg Thr Asn Pro Lys Ile Cys Glu Glu Ile Gly Trp Asp Leu Pro           
        115                 120                 125                       

gcg ctt ttg ctg ccc tac ttc aat gtc gag gac ttt gat ttc aac gac        432
Ala Leu Leu Leu Pro Tyr Phe Asn Val Glu Asp Phe Asp Phe Asn Asp           
    130                 135                 140                           

ggt ctc gag ggt cac ccg acc ttc tac cct ctg att atg ctg ttc tcg        480
Gly Leu Glu Gly His Pro Thr Phe Tyr Pro Leu Ile Met Leu Phe Ser           
145                 150                 155                 160           

acc ctg gca gag tac ggc aac ccc aag gag ctg ttt ctc aag acc gtc        528
Thr Leu Ala Glu Tyr Gly Asn Pro Lys Glu Leu Phe Leu Lys Thr Val           
                165                 170                 175               

gag acg ctc agt aca ctg acc tgt gac cgc gca ccc gaa aat gac aaa        576
Glu Thr Leu Ser Thr Leu Thr Cys Asp Arg Ala Pro Glu Asn Asp Lys           
            180                 185                 190                   

ctc aag ttc aaa cag gcc gaa tct cta cgg aaa ttc gag gtc tgc aag        624
Leu Lys Phe Lys Gln Ala Glu Ser Leu Arg Lys Phe Glu Val Cys Lys           
        195                 200                 205                       

ttc cac gtt ctc gag gaa ctc atg agc tcg tgc atg aag aaa atc aag        672
Phe His Val Leu Glu Glu Leu Met Ser Ser Cys Met Lys Lys Ile Lys           
    210                 215                 220                           

acc cag tac ccc tcc cgg ttc ttg gct tcc gct gct tcc gcc att ctc        720
Thr Gln Tyr Pro Ser Arg Phe Leu Ala Ser Ala Ala Ser Ala Ile Leu           
225                 230                 235                 240           

atg ttc tcc gct cga aat gcg gca ctt ttc aga cac ttc cct ctc att        768
Met Phe Ser Ala Arg Asn Ala Ala Leu Phe Arg His Phe Pro Leu Ile           
                245                 250                 255               

gtc ggc att ctg gct aga aga gtc tac gta ttt att cga gac tgg ggg        816
Val Gly Ile Leu Ala Arg Arg Val Tyr Val Phe Ile Arg Asp Trp Gly           
            260                 265                 270                   

atg gac gga gac gaa ccc atg gac atg tcg cct gac gaa caa gcc aag        864
Met Asp Gly Asp Glu Pro Met Asp Met Ser Pro Asp Glu Gln Ala Lys           
        275                 280                 285                       

agc gcc aag att cta cag tcc ctg tcc acg tac ttt ttt tac tcg tgg        912
Ser Ala Lys Ile Leu Gln Ser Leu Ser Thr Tyr Phe Phe Tyr Ser Trp           
    290                 295                 300                           

ttc cac cgg gtg gct gtc cga tgg agt agc aat ctc ttc cga gag atc        960
Phe His Arg Val Ala Val Arg Trp Ser Ser Asn Leu Phe Arg Glu Ile           
305                 310                 315                 320           

aaa cac tca atc cac gag ttg ccc aga gcc gaa aga gcc aag tac gac       1008
Lys His Ser Ile His Glu Leu Pro Arg Ala Glu Arg Ala Lys Tyr Asp           
                325                 330                 335               

aac ccg aaa tca aat gga tcg gcc gtt tac acc att tac aac cga tgg       1056
Asn Pro Lys Ser Asn Gly Ser Ala Val Tyr Thr Ile Tyr Asn Arg Trp           
            340                 345                 350                   

ggc act ctg gcg cta tct ctg gat ctt gat ccc agt caa tat ttc ctt       1104
Gly Thr Leu Ala Leu Ser Leu Asp Leu Asp Pro Ser Gln Tyr Phe Leu           
        355                 360                 365                       

cct ctg atc cag gag atc cag gag gac gtc cag gag gcc acc aag gga       1152
Pro Leu Ile Gln Glu Ile Gln Glu Asp Val Gln Glu Ala Thr Lys Gly           
    370                 375                 380                           

ggg ttg gac gat act ctt gcg gga ttc agc aag agt tca ctt tca gac       1200
Gly Leu Asp Asp Thr Leu Ala Gly Phe Ser Lys Ser Ser Leu Ser Asp           
385                 390                 395                 400           

gcc tcc ccc atc gca ttt gtt gac tac agc atg tac gat gac gcc tct       1248
Ala Ser Pro Ile Ala Phe Val Asp Tyr Ser Met Tyr Asp Asp Ala Ser           
                405                 410                 415               

gag att cct ctg tct cag gag ggt ctg ctt atg ctt gct acc cag tac       1296
Glu Ile Pro Leu Ser Gln Glu Gly Leu Leu Met Leu Ala Thr Gln Tyr           
            420                 425                 430                   

atg atg gag aac cgc gac cac agt ctc aat att tct cta gat cag ctg       1344
Met Met Glu Asn Arg Asp His Ser Leu Asn Ile Ser Leu Asp Gln Leu           
        435                 440                 445                       

gtt tct ctg aca cta tat ctt gtg cac aga tcc tcc cct aag gaa cct       1392
Val Ser Leu Thr Leu Tyr Leu Val His Arg Ser Ser Pro Lys Glu Pro           
    450                 455                 460                           

ctt cct ttt gcc att aca gac ttg ctc ctg ttc tgg gga tgg tgg act       1440
Leu Pro Phe Ala Ile Thr Asp Leu Leu Leu Phe Trp Gly Trp Trp Thr           
465                 470                 475                 480           

ctc aaa gac atg gag cgt ccc gag gtg cga caa ctt gat gaa gca ttt       1488
Leu Lys Asp Met Glu Arg Pro Glu Val Arg Gln Leu Asp Glu Ala Phe           
                485                 490                 495               

tac gtc aag tat ctg cag ttc ctg gtg ttt att tcg gca tct tct ccc       1536
Tyr Val Lys Tyr Leu Gln Phe Leu Val Phe Ile Ser Ala Ser Ser Pro           
            500                 505                 510                   

ttg ccc gaa atc aga aac att gcc tac aca ctc tgt ggg cgg ctg ttg       1584
Leu Pro Glu Ile Arg Asn Ile Ala Tyr Thr Leu Cys Gly Arg Leu Leu           
        515                 520                 525                       

tac ctg cag cac gag tct gtc tcg ttc gcc ttc atc gca gac act att       1632
Tyr Leu Gln His Glu Ser Val Ser Phe Ala Phe Ile Ala Asp Thr Ile           
    530                 535                 540                           

gcg gat tgt ccg ttt gag aat gcc cag gta gcc atg gta ggt att ctc       1680
Ala Asp Cys Pro Phe Glu Asn Ala Gln Val Ala Met Val Gly Ile Leu           
545                 550                 555                 560           

aag cgt ctg atg atc cct gac gag atc tcc gac cag ctc tca aaa ctc       1728
Lys Arg Leu Met Ile Pro Asp Glu Ile Ser Asp Gln Leu Ser Lys Leu           
                565                 570                 575               

aga att ccc gat gtg ccg acc cga gag gga gtc gaa cac cag aag gcc       1776
Arg Ile Pro Asp Val Pro Thr Arg Glu Gly Val Glu His Gln Lys Ala           
            580                 585                 590                   

tcc cag acc acc atc ccg aca act ccc gag cat gtg gat act atc aag       1824
Ser Gln Thr Thr Ile Pro Thr Thr Pro Glu His Val Asp Thr Ile Lys           
        595                 600                 605                       

agt ctt tgt aac gct gca ttg gaa cag gag aac acg cac ctg gtg atc       1872
Ser Leu Cys Asn Ala Ala Leu Glu Gln Glu Asn Thr His Leu Val Ile           
    610                 615                 620                           

acc tgg ctc aac ttc ctg tcc aca gtg aag ctg gac tgc ggt ttc gcg       1920
Thr Trp Leu Asn Phe Leu Ser Thr Val Lys Leu Asp Cys Gly Phe Ala           
625                 630                 635                 640           

ggt gac tat gct gag cgg gtg gag aag gtg att gac gag gtg gag gat       1968
Gly Asp Tyr Ala Glu Arg Val Glu Lys Val Ile Asp Glu Val Glu Asp           
                645                 650                 655               

gag aac gac cgg act ctg att aga ctg gct ctg gac gtg ttg gca aag       2016
Glu Asn Asp Arg Thr Leu Ile Arg Leu Ala Leu Asp Val Leu Ala Lys           
            660                 665                 670                   

acc gtc tag                                                           2025
Thr Val                                                                   
                                                                          


<210>  40
<211>  674
<212>  PRT
<213>  Yarrowia lipolytica

<400>  40

Met Gln Leu Thr Asp Asp His Lys Lys Asp Leu Glu Lys Leu Gly Glu 
1               5                   10                  15      


Glu Leu Lys Gly Lys Glu Glu His Thr Val Ala Gly Glu Asp Glu Glu 
            20                  25                  30          


Asp Val Asn His Gly Ala Asp Asp Ser Glu Asp Ala Glu Asp Glu Asp 
        35                  40                  45              


Ala Glu Asp Glu Asn Asp Tyr Thr Glu Leu Asp Val Asp Ile Val Cys 
    50                  55                  60                  


Gln Phe Ile Lys Asp Ala Ala Arg Glu Ala Glu Lys Thr Gly Asp Tyr 
65                  70                  75                  80  


Ile Ser Tyr Ala Thr Val Ile Asp Ile His Cys Ser Asp Pro Ser Arg 
                85                  90                  95      


Tyr Lys His Val Asp Arg Val Lys Ile Leu Thr Ser Leu Leu Glu Val 
            100                 105                 110         


Leu Arg Thr Asn Pro Lys Ile Cys Glu Glu Ile Gly Trp Asp Leu Pro 
        115                 120                 125             


Ala Leu Leu Leu Pro Tyr Phe Asn Val Glu Asp Phe Asp Phe Asn Asp 
    130                 135                 140                 


Gly Leu Glu Gly His Pro Thr Phe Tyr Pro Leu Ile Met Leu Phe Ser 
145                 150                 155                 160 


Thr Leu Ala Glu Tyr Gly Asn Pro Lys Glu Leu Phe Leu Lys Thr Val 
                165                 170                 175     


Glu Thr Leu Ser Thr Leu Thr Cys Asp Arg Ala Pro Glu Asn Asp Lys 
            180                 185                 190         


Leu Lys Phe Lys Gln Ala Glu Ser Leu Arg Lys Phe Glu Val Cys Lys 
        195                 200                 205             


Phe His Val Leu Glu Glu Leu Met Ser Ser Cys Met Lys Lys Ile Lys 
    210                 215                 220                 


Thr Gln Tyr Pro Ser Arg Phe Leu Ala Ser Ala Ala Ser Ala Ile Leu 
225                 230                 235                 240 


Met Phe Ser Ala Arg Asn Ala Ala Leu Phe Arg His Phe Pro Leu Ile 
                245                 250                 255     


Val Gly Ile Leu Ala Arg Arg Val Tyr Val Phe Ile Arg Asp Trp Gly 
            260                 265                 270         


Met Asp Gly Asp Glu Pro Met Asp Met Ser Pro Asp Glu Gln Ala Lys 
        275                 280                 285             


Ser Ala Lys Ile Leu Gln Ser Leu Ser Thr Tyr Phe Phe Tyr Ser Trp 
    290                 295                 300                 


Phe His Arg Val Ala Val Arg Trp Ser Ser Asn Leu Phe Arg Glu Ile 
305                 310                 315                 320 


Lys His Ser Ile His Glu Leu Pro Arg Ala Glu Arg Ala Lys Tyr Asp 
                325                 330                 335     


Asn Pro Lys Ser Asn Gly Ser Ala Val Tyr Thr Ile Tyr Asn Arg Trp 
            340                 345                 350         


Gly Thr Leu Ala Leu Ser Leu Asp Leu Asp Pro Ser Gln Tyr Phe Leu 
        355                 360                 365             


Pro Leu Ile Gln Glu Ile Gln Glu Asp Val Gln Glu Ala Thr Lys Gly 
    370                 375                 380                 


Gly Leu Asp Asp Thr Leu Ala Gly Phe Ser Lys Ser Ser Leu Ser Asp 
385                 390                 395                 400 


Ala Ser Pro Ile Ala Phe Val Asp Tyr Ser Met Tyr Asp Asp Ala Ser 
                405                 410                 415     


Glu Ile Pro Leu Ser Gln Glu Gly Leu Leu Met Leu Ala Thr Gln Tyr 
            420                 425                 430         


Met Met Glu Asn Arg Asp His Ser Leu Asn Ile Ser Leu Asp Gln Leu 
        435                 440                 445             


Val Ser Leu Thr Leu Tyr Leu Val His Arg Ser Ser Pro Lys Glu Pro 
    450                 455                 460                 


Leu Pro Phe Ala Ile Thr Asp Leu Leu Leu Phe Trp Gly Trp Trp Thr 
465                 470                 475                 480 


Leu Lys Asp Met Glu Arg Pro Glu Val Arg Gln Leu Asp Glu Ala Phe 
                485                 490                 495     


Tyr Val Lys Tyr Leu Gln Phe Leu Val Phe Ile Ser Ala Ser Ser Pro 
            500                 505                 510         


Leu Pro Glu Ile Arg Asn Ile Ala Tyr Thr Leu Cys Gly Arg Leu Leu 
        515                 520                 525             


Tyr Leu Gln His Glu Ser Val Ser Phe Ala Phe Ile Ala Asp Thr Ile 
    530                 535                 540                 


Ala Asp Cys Pro Phe Glu Asn Ala Gln Val Ala Met Val Gly Ile Leu 
545                 550                 555                 560 


Lys Arg Leu Met Ile Pro Asp Glu Ile Ser Asp Gln Leu Ser Lys Leu 
                565                 570                 575     


Arg Ile Pro Asp Val Pro Thr Arg Glu Gly Val Glu His Gln Lys Ala 
            580                 585                 590         


Ser Gln Thr Thr Ile Pro Thr Thr Pro Glu His Val Asp Thr Ile Lys 
        595                 600                 605             


Ser Leu Cys Asn Ala Ala Leu Glu Gln Glu Asn Thr His Leu Val Ile 
    610                 615                 620                 


Thr Trp Leu Asn Phe Leu Ser Thr Val Lys Leu Asp Cys Gly Phe Ala 
625                 630                 635                 640 


Gly Asp Tyr Ala Glu Arg Val Glu Lys Val Ile Asp Glu Val Glu Asp 
                645                 650                 655     


Glu Asn Asp Arg Thr Leu Ile Arg Leu Ala Leu Asp Val Leu Ala Lys 
            660                 665                 670         


Thr Val 
        


<210>  41
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HPGS motif

<400>  41

His Pro Gly Ser 
1               


<210>  42
<211>  4
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  HAGG motif

<400>  42

His Ala Gly Gly 
1               


<210>  43
<211>  655
<212>  PRT
<213>  Candida glabrata

<400>  43

Met Ser Asp Ala Phe Glu Glu Val Cys Asp Ala Leu Lys Ala Ser Phe 
1               5                   10                  15      


Thr Asp Asp Lys Glu Asp Ser Leu Thr Leu Val Thr Met Ile Asp Thr 
            20                  25                  30          


Leu Ser Glu Glu Val Asp Glu Gly Phe Glu Val Lys Glu Lys Glu Gln 
        35                  40                  45              


Phe Leu Glu Leu Leu Leu Asn Leu Leu Glu Ala Asp Thr Glu Leu Val 
    50                  55                  60                  


Ser Ala Val Gly Trp Asp Leu Pro Arg Thr Leu Leu Arg Phe Cys Asn 
65                  70                  75                  80  


Ala Lys Asn Ile Lys Asn Ser Asp Arg Leu Arg Lys Cys Lys Val Val 
                85                  90                  95      


Thr Ile Cys Met Ala Ile Phe Asn Leu Leu Ala Leu His Ala Lys Pro 
            100                 105                 110         


Gln Glu Cys Leu Val Thr Thr Leu Glu Leu Leu Ser Glu Leu Asn Phe 
        115                 120                 125             


Lys Asn Ile Val Glu Glu Cys His Gln Leu Ser Glu Asp Gly Ser Asp 
    130                 135                 140                 


Asn Asn Thr Ala Glu Glu Asp Asn Asp Ala Val Glu Asp Tyr Met Lys 
145                 150                 155                 160 


Asp Arg Asp Gln Pro Glu Ile Ile Phe Gly Val Lys Ser Tyr Ala Leu 
                165                 170                 175     


Phe Glu Leu Ala Gly Ser Leu Ile Arg Arg Val Ala Thr Leu His Pro 
            180                 185                 190         


Ser Lys Tyr Leu Glu Glu Ala Val Thr Ala Ile Arg Lys Tyr Val Thr 
        195                 200                 205             


Asn Asn Thr Glu Val Val Glu Asp Val Lys Phe Ile Leu Arg Arg Val 
    210                 215                 220                 


Phe Ala Phe Cys Arg Gly Tyr Ile Pro Pro Glu Pro Pro Arg Gln Leu 
225                 230                 235                 240 


Ile Val Asp Leu Lys Met Asn His Glu Glu Tyr Asp Glu Ile Met Asn 
                245                 250                 255     


Ser Glu Ile Glu Leu Gln Val Arg Leu Leu Arg Asn Leu Cys Thr Phe 
            260                 265                 270         


Ser Val Ala Tyr Cys Val Lys Phe Leu Asn Asp Lys Thr Glu Val Val 
        275                 280                 285             


Tyr Phe His Lys Leu Ile Asn Lys Asp Leu Gln Leu Pro Glu Phe Tyr 
    290                 295                 300                 


Arg Ser Val His Asp Ile Ile Ser Arg Tyr Tyr Gln Ile Ala Phe Ser 
305                 310                 315                 320 


Phe Asp Ile Asp Leu Asn Asp Glu Phe Asn Asp Ile Leu Arg Glu Thr 
                325                 330                 335     


Arg Gly Ile Tyr Glu Asp Val Ile Lys Arg Ile Asn Glu Thr Asn Asn 
            340                 345                 350         


Thr Asp Lys Asn Ala Lys Ser Asp Ile Leu Leu Lys Ala Gly Tyr Tyr 
        355                 360                 365             


Tyr Glu Val Gln Lys Thr Ala Arg Glu Lys Glu Ile Asn Pro Asp Thr 
    370                 375                 380                 


Lys Gly Ile Ile Leu Leu Ser Gly Phe Asn Tyr Ile Glu Asn Gly Asp 
385                 390                 395                 400 


His Leu Ile Asp Ile Asp Ile Ala Asp Ala Leu Tyr Leu Tyr Leu Arg 
                405                 410                 415     


Phe Ala Ser Glu Ser Leu Phe Ser Pro Thr Cys His Asn Val Thr Ile 
            420                 425                 430         


Glu Gly Val Ala Arg Tyr Trp Ile Trp Ala Ala Leu Thr Thr Thr Asp 
        435                 440                 445             


Asn Asn Ile Leu Lys Glu Lys Leu Ala Glu Leu Ser Pro Leu Val Leu 
    450                 455                 460                 


His Ser Val Leu Asn Leu Leu Leu Val Lys Asn Cys His Gln Val Asn 
465                 470                 475                 480 


Glu Glu Ile Arg Met Ile Thr Phe Thr Leu Ile Thr Arg Ile Leu Cys 
                485                 490                 495     


Leu Leu Pro Glu Asn Cys Ser Tyr Glu Phe Leu Met Asp Glu Leu Asp 
            500                 505                 510         


Asn Cys Ala Val Val Phe Gly Lys Ser Cys Val Leu Gly Ile Leu Arg 
        515                 520                 525             


Asp Leu Val Ile Lys Val Asp His Ser Val Ser Ser Asn Asn Thr Asp 
    530                 535                 540                 


Thr Glu Asp Leu Ser Glu Ser Met Ala Gln Leu Lys Ile Asn Asn Glu 
545                 550                 555                 560 


Lys Arg Ala Lys Lys Thr Phe Ile Thr Leu Asp Pro Lys Arg Ala Gly 
                565                 570                 575     


Glu Ile Glu Asp Leu Ala Ile Lys Thr Leu Lys Glu Thr Lys Lys Ser 
            580                 585                 590         


Met Lys Lys Asp Tyr Ile Leu Leu Val Leu Asn Tyr Ile Lys Phe Phe 
        595                 600                 605             


Ser Thr Phe Ala His Lys Trp Asn Lys Ser Lys Leu Asn Glu Phe Thr 
    610                 615                 620                 


Thr Leu Val Ala Thr Asn Phe Ser Asp Ser Lys Met Leu Pro Glu Ile 
625                 630                 635                 640 


Asn Ala Ile Ile Asp Ala Asn Glu Lys Leu Arg Ser Leu Thr Glu 
                645                 650                 655 


<210>  44
<211>  702
<212>  PRT
<213>  Kluyveromyces lactis

<400>  44

Met Pro Leu Glu Val Glu Arg Phe Lys Glu Ile Glu Glu Lys Leu Leu 
1               5                   10                  15      


Thr Ala Phe Val Glu Glu Lys Ser Asp Ile Ile Thr Leu Val Thr Ile 
            20                  25                  30          


Leu Asp Leu Tyr Ser Glu Glu Val Asn Phe Lys Gly Ser Leu Glu Gln 
        35                  40                  45              


Lys Tyr Glu Tyr Leu Ser Glu Val Leu Ser Leu Leu Gln Gln Asn Lys 
    50                  55                  60                  


Asp Val Val Tyr Glu Ile Gly Trp Asp Leu Pro Lys Ile Leu Ile Lys 
65                  70                  75                  80  


Phe Ile His Trp Gly Asn Asn Asn His Leu Gly Ala Asp Arg Ser Lys 
                85                  90                  95      


Lys Phe Leu Thr Val Ile Met Lys Cys Phe Asn Glu Val Ala Leu Phe 
            100                 105                 110         


Gly Asn Pro Lys Glu Cys Phe Phe Ala Gly Cys Glu Leu Met Ser Ser 
        115                 120                 125             


Leu Arg Ile Asn Asp Glu Ser Leu Val Arg Phe Ile Val Glu Glu Glu 
    130                 135                 140                 


Pro Val Met Asp Pro Glu Asn Glu Asp Ser Gly Asp Glu Thr Tyr Thr 
145                 150                 155                 160 


Glu Asp Glu Gly Ser Ser Asp Lys Thr Glu Glu Glu Glu Glu Lys Asn 
                165                 170                 175     


Ala Val Lys Asp Ser Pro Thr Pro Lys Ser Ala Asn Glu Ser Ile Pro 
            180                 185                 190         


Asp Leu Lys Glu Gly Tyr Ala Phe Tyr Gly Arg Leu Pro Gln Glu Val 
        195                 200                 205             


Ile Thr Glu Leu Arg Phe Tyr Ser Ile Ile Glu Leu Met Gly Ser Thr 
    210                 215                 220                 


Leu Lys Arg Ile Val Thr Leu His Pro Ser Lys Phe Leu Ser Glu Ala 
225                 230                 235                 240 


Val Glu Ala Phe Ser Arg Phe Asn Leu Gln Asn Asn Glu Asp Val Asp 
                245                 250                 255     


Asp Cys Leu Phe Ile Leu Arg Arg Leu Tyr Ser Phe Ile Arg Gly Tyr 
            260                 265                 270         


Ile Pro Pro Ser Pro Pro Pro Asp Ala Asp Lys Gln Val Ser Ala Glu 
        275                 280                 285             


Glu Leu Glu Glu Ile Lys Val Ser Glu Glu Val Leu Gln Arg Lys Leu 
    290                 295                 300                 


Leu Cys Asn Ile Leu Thr Ser Ala Leu His Gln Leu Leu Lys Ala Arg 
305                 310                 315                 320 


Thr Cys Ile Ser Leu Leu Asn Tyr His Ser His Leu Gln Gly Ile Pro 
                325                 330                 335     


Thr Leu Ser Thr Ser Ser Glu Tyr Leu Gly Gln Leu Thr Asp Ile Leu 
            340                 345                 350         


Ser Arg Tyr Tyr Gln Leu Ala Thr Ser Phe Asp Ile Asp Val Ser Ala 
        355                 360                 365             


Glu Phe Lys Arg Leu Cys Val Asp Glu Ser Val Arg Ile Tyr Arg Ser 
    370                 375                 380                 


Leu Pro Lys Asp Ser Glu Ile Lys Ser Asp Glu Glu Leu Lys Glu Ile 
385                 390                 395                 400 


Thr Asn Phe Val Tyr Gln Leu Ala Tyr Thr Tyr Glu Val Glu Lys Ile 
                405                 410                 415     


Ala Asn Val Lys Glu Ile Leu Leu Asp Pro Ala Gly Ile Leu Ile Leu 
            420                 425                 430         


Arg Ser Phe Ser Asn Glu Asp Phe Leu Pro Pro Ser Asp Ala Lys Ile 
        435                 440                 445             


Thr Leu Gln Glu Ala Ile Tyr Met Tyr Leu Arg Phe Val Thr Pro Ser 
    450                 455                 460                 


Met Phe Ser Ala Leu Phe Glu Asn Arg Ser Ser His Asp Leu Ala Arg 
465                 470                 475                 480 


Thr Trp Ile Leu Phe Ala Leu Thr Asn Asn Ser Thr His Asp Leu Met 
                485                 490                 495     


Asp Ser Leu Lys Asp Leu Pro Ser Tyr Ile Ile Thr Val Tyr Leu Gln 
            500                 505                 510         


Thr Glu Leu Ile Arg Ala Cys Leu Gln Ile Asn Asp Asn Leu Arg Arg 
        515                 520                 525             


Thr Gln Phe Ser Ile Leu Thr Arg Ile Leu Cys Leu Leu Pro Glu Asp 
    530                 535                 540                 


Phe Ala Phe Asn Phe Ile Arg Asp Thr Leu Leu Ser Cys Pro Tyr Glu 
545                 550                 555                 560 


Gln Ala Lys Cys Cys Ala Leu Ala Ile Leu Lys Asp Met Met Gln His 
                565                 570                 575     


Glu Arg Lys Val Pro Gln Lys Ser Asp Glu Asp Asp Leu Ala Lys Asp 
            580                 585                 590         


Met Glu Lys Leu Lys Ile Lys Asn Ser Pro Pro Pro Leu Pro Ser Arg 
        595                 600                 605             


Ala Tyr Met Leu Leu Asn Asp Asp Arg Ile Ala Thr Leu His Ser Ile 
    610                 615                 620                 


Thr Leu Leu Ala Ile Asp Ser Cys Ala Ala Asp Pro Glu Ser Lys Lys 
625                 630                 635                 640 


Val Lys Thr Leu Leu Thr Tyr Leu Asn Phe Leu Asn Ala Phe Leu Thr 
                645                 650                 655     


Lys Trp Asp Ser Val Phe Leu Lys Glu Ile Cys Asp Ala Val Asn Asp 
            660                 665                 670         


Lys Leu Ile Lys Asn Glu Lys Val Gly Asp Lys Asp Glu Pro His Tyr 
        675                 680                 685             


Ser Leu Leu Val Ser Thr Val Ala Ser Ile Ser Ser Lys Leu 
    690                 695                 700         


<210>  45
<211>  673
<212>  PRT
<213>  Scheffersomyces stipitis

<400>  45

Met Ser Glu Ser Asp Val Ser Glu Asn Ser Glu Ser Thr Ile Glu Pro 
1               5                   10                  15      


Phe Val Phe Glu Arg Val Leu Glu Ser Leu Lys Thr Ala Ala Thr Glu 
            20                  25                  30          


Thr Leu Glu Ser Lys Asp Tyr Leu Ser Tyr Ser Thr Leu Leu Asp Ile 
        35                  40                  45              


Tyr Leu Gly Glu Pro Ala Lys Tyr Thr Tyr Asp Glu Arg Glu Glu Leu 
    50                  55                  60                  


Leu Ser Ala Leu Leu Ser Ile Leu Ser Ala Asn Pro Gly Leu Thr Tyr 
65                  70                  75                  80  


Glu Ile Gly Trp Asp Leu Pro Gly Leu Leu Ile Leu Tyr Val Asp Ser 
                85                  90                  95      


Asp Phe Asp Phe Thr Gly Gly Leu Arg Lys Ala Pro Cys Val Tyr Lys 
            100                 105                 110         


Ile Leu Lys Ile Phe Glu Val Leu Ala Ile Asn Gly Asn Pro Lys Glu 
        115                 120                 125             


Leu Phe Leu Lys Ser Cys Glu Leu Leu Thr Thr Ile Ser Ala Asp Asp 
    130                 135                 140                 


Ser Gln Val Thr Asp Asp Ser Ser Ile Lys Glu Lys Phe Phe Asp Val 
145                 150                 155                 160 


Lys Leu Tyr Cys Ile Phe Glu Leu Val Asp Ser Cys Phe Lys Arg Ile 
                165                 170                 175     


Lys Thr Tyr Tyr Pro Ser Arg Phe Leu Ala Met Thr Val Ala Ser Phe 
            180                 185                 190         


Ile Asn Leu Ala His Lys Asn Gly Asn Asp Ser Pro Asn Asn Ile Ser 
        195                 200                 205             


Phe Ile Met Lys Arg Ala Tyr Thr Phe Ala Arg Asn Tyr Ser Ser Pro 
    210                 215                 220                 


Pro Leu Pro Asp Ser Asp Gly Asp Lys Met Ser Pro Glu Asp Leu Ser 
225                 230                 235                 240 


Lys Ile Lys Glu Asp Glu Glu Tyr Leu Gln Arg Lys Leu Leu Thr Gly 
                245                 250                 255     


Phe Ile Ser Gln Leu Ile Gln Leu Met Ser Asn Asp Asn Leu Asn Gly 
            260                 265                 270         


Tyr Thr Leu Asp His Leu Ser Phe Leu Gln Val Pro His Arg Gly Gln 
        275                 280                 285             


Leu Lys Lys Tyr Phe Glu Tyr Ser Val Asn Leu Pro Val Met Asp Arg 
    290                 295                 300                 


Leu Ala Glu Leu Ala Leu Ser Tyr Asp Ile Asn Leu Thr Gln His Phe 
305                 310                 315                 320 


Lys Ser Met Val Ala Asp Ser His Thr Leu Leu Arg Ser Phe Asp Tyr 
                325                 330                 335     


Ser Ile Asp Arg Asp Glu Leu Ser Ala Gln Ile Phe Glu Lys Val Val 
            340                 345                 350         


Val Asp Tyr Gln Lys Thr Leu Ala Met Ser Ile Ile Asn Ser Asp Ala 
        355                 360                 365             


Lys Glu Ile Arg Asp Ser Pro Leu Gly Ile Phe Leu Leu Tyr Thr His 
    370                 375                 380                 


Ala Ile Ser Val Arg Arg Thr Phe Asp Leu Ile Lys Val Ser Phe Ser 
385                 390                 395                 400 


Asp Ala Val Val Leu Thr Leu Arg Val Leu Val Pro Glu Leu Val Gln 
                405                 410                 415     


Ser Thr Phe Val Phe Lys Gly Val Glu Asp Ala Thr Ile Phe Trp Thr 
            420                 425                 430         


Trp Tyr Ala Leu Tyr Gln Thr Ser Leu Asn Asn Lys Ser Val Glu Thr 
        435                 440                 445             


Glu Ile Ala Ala Ile Ser Pro Val Leu Leu Thr Ile Tyr Tyr Gln Val 
    450                 455                 460                 


Ile Phe Phe Val Val Ile Thr Asn Ser Asn Arg Pro Asn Phe Lys Tyr 
465                 470                 475                 480 


Ala Val Leu Thr Leu Leu Thr Arg Val Leu Ala Leu Ser Pro Glu Asp 
                485                 490                 495     


Leu Ser Tyr Asp Phe Val Lys Asp Ser Leu His Asn Cys Pro Tyr Glu 
            500                 505                 510         


Ser Glu Lys Pro Ile Met Ile Gly Val Leu Lys Glu Leu Leu Thr Lys 
        515                 520                 525             


Asp Lys Ser Ser Ser Thr Ser Asp Val Thr Glu Ala Leu Ala Asn Ser 
    530                 535                 540                 


Glu Asp Ser Lys Val Pro Leu Pro Pro Thr Leu Pro Pro Arg Ala Ser 
545                 550                 555                 560 


Ser Ala Ser Ser Arg Tyr Phe Thr Leu Thr Lys Ala Arg Leu Glu Asp 
                565                 570                 575     


Ile Leu Ala Leu Val Gln Glu Ala Val Asp Ser Ala Phe Val Thr His 
            580                 585                 590         


Glu Ser Thr Val Ala Ile Asp Pro Ser Lys Leu Ser Thr Leu Ser Ala 
        595                 600                 605             


Tyr Leu Asn Leu Leu Val Ile Ile Lys Lys Asp Pro Val Val Leu Gln 
    610                 615                 620                 


Asp Lys Lys Ala Leu Asp Lys Val Val Glu Ser Ala Glu Glu Asn Ile 
625                 630                 635                 640 


Ala Ala Val Lys Glu Lys His Lys Lys Tyr Pro Asn Ser Asn Lys Phe 
                645                 650                 655     


Glu Leu Asn Ala Ala Gly Ile Leu Glu Ile Thr Ile Asp Arg Ile Lys 
            660                 665                 670         


Ser 
    


<210>  46
<211>  664
<212>  PRT
<213>  Zygosaccharomyces rouxii

<400>  46

Met Glu Asn Ile Asp Thr Val Cys Glu Asn Leu Glu Lys Ala Phe Ala 
1               5                   10                  15      


Glu Gln Lys Asp Asp Ser Val Thr Leu Ala Thr Ile Ile Asp Met Tyr 
            20                  25                  30          


Val Val Gln Ile Asn Asp Glu Gly Ser Asn Lys Asp Lys Glu Gln Phe 
        35                  40                  45              


Leu Thr Lys Leu Leu Asp Gln Leu Arg Ala Ser Pro Asp Ile Val Ala 
    50                  55                  60                  


Glu Ile Gly Trp Asp Leu Pro Arg Gly Leu Leu Lys Phe Tyr Asn Lys 
65                  70                  75                  80  


Lys Asn Ile Asp Val Asp Ala Lys Leu Lys Ser Asn Pro Ile Val Gly 
                85                  90                  95      


Leu Val Met Gln Cys Phe Ser Glu Val Ala Leu Ser Gly Asn Pro Lys 
            100                 105                 110         


Glu Cys Leu Leu Thr Gly Cys Glu Ile Leu Ser Glu Leu Thr Thr Ile 
        115                 120                 125             


Gln Ile Asn Glu Gln Met Leu Glu Asp Asp Ser Lys Glu Glu Gly Asp 
    130                 135                 140                 


Val Thr Lys Asp Glu Lys Lys Thr Asp Glu Lys Gly Glu Trp Ile Pro 
145                 150                 155                 160 


Glu Pro Pro His Arg Asp Pro Val Glu Phe Phe Leu Tyr Leu Asn Ser 
                165                 170                 175     


Tyr Val Leu Phe Glu Leu Ile Gln Thr Ala Leu Lys Arg Ile Ala Ser 
            180                 185                 190         


Leu Tyr Pro Ser Lys Phe Leu Gly Met Ala Val Ser Ala Ile Tyr Lys 
        195                 200                 205             


Phe Val Arg Asn Asn Ile Asp Glu Val Tyr Asn Thr Pro Phe Ile Leu 
    210                 215                 220                 


Arg Arg Ile Tyr Thr Phe Cys Arg Gly Tyr Ile Pro Pro Glu Ile Pro 
225                 230                 235                 240 


Lys Gln Leu Leu Glu Asn Thr Lys Leu Glu Lys Lys Glu Leu Asp Lys 
                245                 250                 255     


Ile Thr Glu Asp Glu Ser Ile Leu Gln Gly Gln Leu Leu Arg Ser Leu 
            260                 265                 270         


Ser Thr Phe Ala Val Gly Glu Cys Leu Lys Asn Lys Ala Ser Arg Leu 
        275                 280                 285             


Asp Leu Glu Tyr Phe His Arg Leu Arg Asn Thr Glu Phe His Leu Ser 
    290                 295                 300                 


Glu Asn Asp Glu Glu Leu Val Leu Ile Ser Lys Arg Phe Tyr Gln Leu 
305                 310                 315                 320 


Met Phe Ser Phe Asp Leu Asp Val Lys Glu Gln Phe Leu Ser Phe Ile 
                325                 330                 335     


Glu Glu Thr Lys Gly Ile Tyr Lys Ala Leu Pro Pro Asp Ser Glu Ile 
            340                 345                 350         


Pro Asn Asp Glu Ala Arg Arg Ala Ile Gly Gln Val Val Tyr Gln Leu 
        355                 360                 365             


Ser Tyr Thr Tyr Gln Leu Gln Lys Leu Thr Lys Leu Lys His Leu Glu 
    370                 375                 380                 


Leu Asn Ser Asn Gly Ile Phe Ile Leu Ser Gly Leu His Tyr Gln Glu 
385                 390                 395                 400 


Thr Gln Lys His Leu Tyr Pro Glu Ile Ser Ile Lys Asp Thr Val Leu 
                405                 410                 415     


Leu Tyr Ile Arg Cys Ala Thr Pro Ser Leu Phe Ser Ser Thr Tyr Thr 
            420                 425                 430         


Asn Leu Tyr Ala Glu Gly Thr Ala Arg Tyr Trp Val Trp Val Ala Ile 
        435                 440                 445             


Thr Asn Asn Lys Val Gln Lys Leu Arg Glu Glu Leu Ser Glu Leu Pro 
    450                 455                 460                 


Ser Tyr Ile Arg Thr Val Phe Leu Gln Met Val Leu Met Gln Ser Cys 
465                 470                 475                 480 


Asn Gln Pro Asn Glu Glu Ala Arg Met Ile Ser Phe Thr Leu Leu Thr 
                485                 490                 495     


Arg Ile Met Cys Leu Met Pro Glu Asp Thr Ser Phe Glu Phe Val Leu 
            500                 505                 510         


Asp Thr Leu Leu Thr Cys Pro Phe Thr His Ala Lys Ile Ala Val Leu 
        515                 520                 525             


Gly Ile Leu Lys Asp Leu Met Leu Arg Asn Cys Gln Asn Lys Gln Ser 
    530                 535                 540                 


Leu Glu Glu Gln Phe Ser Asn Met Asn Leu Thr Ser Lys Asp Ser Asp 
545                 550                 555                 560 


Lys Arg Ser Thr Ser Thr Ser Pro Pro Ser Leu Pro Pro Arg Ala Tyr 
                565                 570                 575     


Ile Asp Ile Asn Glu Asp Arg Met Ala Ser Ile His Ser Ala Ala Met 
            580                 585                 590         


Met Thr Phe Gln Asp Gln Lys Ala Lys Gly Lys Asp Lys His Ile Leu 
        595                 600                 605             


Ile Leu Asn Phe Leu Asn Phe Phe Asn Gly Leu Ser Gln Lys Trp Asp 
    610                 615                 620                 


Lys Asn Leu Leu Gln Ala Val His Lys Glu Val Ala Leu Gln Tyr Asn 
625                 630                 635                 640 


Glu Lys Thr Lys Glu Asp Val Pro Glu Val Gly Phe Ile Lys Ile Ala 
                645                 650                 655     


Asn Glu Thr Leu Gly Lys His Leu 
            660                 


<210>  47
<211>  664
<212>  PRT
<213>  Candida albicans

<400>  47

Ser Glu Thr Asp His Ser Glu Thr Ser Glu Ser Thr Ile Glu Pro Phe 
1               5                   10                  15      


Gln Phe Glu Lys Val Met Glu Asn Leu Glu Ser Gly Ala Gln Asp Ala 
            20                  25                  30          


Leu Gln Ser Lys Asp Phe Leu Ser Tyr Ser Thr Leu Leu Asp Ile Tyr 
        35                  40                  45              


Leu Asn Asp Pro Thr Lys Tyr Ser Asn Glu Glu Lys Glu Gln Leu Leu 
    50                  55                  60                  


Gly His Ile Leu Thr Ile Leu Ser Glu Asn Lys Gln Leu Thr Tyr Glu 
65                  70                  75                  80  


Ile Gly Trp Asp Leu Pro Gln Leu Leu Ile Leu Tyr Val Asp Ser Asp 
                85                  90                  95      


Tyr Glu Phe Asn Gly Pro Ile Arg Asp Ser Pro Gly Val Tyr Lys Ile 
            100                 105                 110         


Leu Lys Ile Phe Glu Asn Leu Ala Ile Asn Gly Asn His Lys Glu Leu 
        115                 120                 125             


Phe Leu Lys Ser Cys Glu Leu Leu Asn Asp Leu Glu Leu Ser Gln Asp 
    130                 135                 140                 


Glu Asp Ile Glu Leu Leu Lys Arg Glu Asn Phe Phe Glu Ile Lys Leu 
145                 150                 155                 160 


Tyr Cys Val Phe Glu Leu Ile Asp Ala Cys Leu Lys Lys Ile His Thr 
                165                 170                 175     


Leu Tyr Pro Ser Arg Phe Leu Ala Met Thr Val Ser Ser Phe Asn Asn 
            180                 185                 190         


Leu Met Phe Lys Leu Thr Lys Gln His Gly Ser Leu Gly Asn Tyr His 
        195                 200                 205             


Phe Val Met Lys Arg Val Tyr Ser Phe Cys Arg Asn Tyr Ile Ser Pro 
    210                 215                 220                 


Pro Leu Pro Thr Asn Ala Lys Glu Met Pro Gln Glu Glu Leu Asp Lys 
225                 230                 235                 240 


Ile Val Lys Asp Glu Glu Tyr Leu Gln Arg Arg Leu Leu Thr Gly Phe 
                245                 250                 255     


Leu Thr Gln Val Ile Tyr Leu Ala Asn Ile Asn Gly Thr Glu Gly Tyr 
            260                 265                 270         


Ser Ile Glu His Phe Ser Trp Leu Gln Gln Gln Ser Lys Ser Lys Ile 
        275                 280                 285             


Lys Phe Val Phe Glu Arg Asp Gly Ala Phe Cys Asp Arg Phe Val Glu 
    290                 295                 300                 


Leu Ala Ser Ser Phe Asp Ile Asp Leu Leu Lys Cys Phe Gln Gly Phe 
305                 310                 315                 320 


Ile Thr Asp Ser His Lys Leu Leu Ile Gly Ile Asp Tyr Lys Asn Lys 
                325                 330                 335     


Asn Lys Ser Glu Asp Glu Ile Ile Glu Leu Leu Phe Glu Arg Val Val 
            340                 345                 350         


Val Asp Tyr Gln Lys Asn Val Leu Thr Ser Ile Val Asp Ser Asp Ala 
        355                 360                 365             


Lys Ala Ile Lys Asp Ser Ile Ile Gly Glu Leu Ile Leu Phe Thr His 
    370                 375                 380                 


Ser Ile Ala Gly Lys Lys Asn Phe Ala Lys Pro Thr Met Ser Ile His 
385                 390                 395                 400 


Asp Ser Leu Val Met Thr Leu Arg Leu Ile Ile Pro Gln Met Val Asn 
                405                 410                 415     


Pro Lys Phe Ile Asn Ala Gly Asn His Asp Val Val Val Phe Trp Val 
            420                 425                 430         


Trp Phe Ala Leu Tyr Gln Gln Gln Ile Ile Asn Ser Lys Asn Leu Gln 
        435                 440                 445             


Leu Glu Ile Ser Tyr Ile Pro Lys Val Leu Leu Thr Thr Phe Phe Gln 
    450                 455                 460                 


Cys Leu Leu Phe Ile Val Ile Lys Ser Glu Gly Lys Pro Asn Phe Lys 
465                 470                 475                 480 


Tyr Met Leu Leu Thr Leu Leu Thr Lys Leu Leu Thr Leu Ser Pro Asp 
                485                 490                 495     


Thr Gly Tyr Glu Phe Ile Lys Asp Ser Leu Asn Asn Cys Pro Tyr Glu 
            500                 505                 510         


Ser Val Tyr Pro Ser Leu Ile Gly Val Tyr Lys Gln Leu Leu Leu Asn 
        515                 520                 525             


Glu Lys Trp Asp Val Asn Ser Ile Glu Leu Glu Lys Leu Asn Ile Ser 
    530                 535                 540                 


Ser Ser Ser Ser Asn Thr Pro Pro Lys Leu Pro Pro Arg Asn Gly Ile 
545                 550                 555                 560 


Lys Arg Lys His Phe Ser Leu Thr Asn Glu Ser Leu Asn Asp Leu Val 
                565                 570                 575     


Asp Leu Ile Asn Asn Ser Ser Lys Asn Ala Phe Val Glu Asp Asn Ser 
            580                 585                 590         


Lys Ile Asp Pro Ser Lys Leu Ser Thr Ile Ala Ala Tyr Leu Asn Leu 
        595                 600                 605             


Leu Val Ala Ile Lys Lys Asp Pro Val Ile Val Glu Asn Lys Glu Lys 
    610                 615                 620                 


Leu Thr Thr Leu Ile Ser Ser Ile Glu Asn Lys Ile Lys Ser Val Lys 
625                 630                 635                 640 


Lys Ser Ser Gln Asn Gln Phe Glu Leu Asn Ala Ala Gly Met Leu Glu 
                645                 650                 655     


Ile Thr Ile Glu Arg Phe Asn Glu 
            660                 


