                         SEQUENCE LISTING

<110>  Chr. Hansen A/S
 
<120>  Glycosyltransferase - glycosylating flavokermesic acid and/or 
       kermesic acid

<130>  P4547PC00

<150>  EP13198110.2
<151>  2013-12-18

<160>  5     

<170>  PatentIn version 3.5

<210>  1
<211>  1548
<212>  DNA
<213>  Dactylopius coccus costa


<220>
<221>  CDS
<222>  (1)..(1548)

<400>  1
atg gaa ttt cgt tta cta atc ctg gct ctt ttt tct gta ctt atg agt         48
Met Glu Phe Arg Leu Leu Ile Leu Ala Leu Phe Ser Val Leu Met Ser           
1               5                   10                  15                

act tca aac gga gca gaa att tta gct ctt ttc cct att cac ggt atc         96
Thr Ser Asn Gly Ala Glu Ile Leu Ala Leu Phe Pro Ile His Gly Ile           
            20                  25                  30                    

agt aat tat aat gtt gct gaa gca ctg ctg aag acc tta gct aac cgg        144
Ser Asn Tyr Asn Val Ala Glu Ala Leu Leu Lys Thr Leu Ala Asn Arg           
        35                  40                  45                        

ggt cat aat gtt aca gtt gtc aca tct ttt cct caa aaa aaa cct gta        192
Gly His Asn Val Thr Val Val Thr Ser Phe Pro Gln Lys Lys Pro Val           
    50                  55                  60                            

cct aat ttg tac gaa att gac gta tct gga gct aaa ggc ttg gct act        240
Pro Asn Leu Tyr Glu Ile Asp Val Ser Gly Ala Lys Gly Leu Ala Thr           
65                  70                  75                  80            

aat tca ata cat ttt gaa aga tta caa acg att att caa gat gta aaa        288
Asn Ser Ile His Phe Glu Arg Leu Gln Thr Ile Ile Gln Asp Val Lys           
                85                  90                  95                

tcg aac ttt aag aac atg gta cga ctt agc aga aca tac tgt gag att        336
Ser Asn Phe Lys Asn Met Val Arg Leu Ser Arg Thr Tyr Cys Glu Ile           
            100                 105                 110                   

atg ttt tct gat ccg agg gtt ttg aac att cga gac aag aaa ttc gat        384
Met Phe Ser Asp Pro Arg Val Leu Asn Ile Arg Asp Lys Lys Phe Asp           
        115                 120                 125                       

ctc gta ata aac gcc gta ttt ggc agt gac tgc gat gcc gga ttc gca        432
Leu Val Ile Asn Ala Val Phe Gly Ser Asp Cys Asp Ala Gly Phe Ala           
    130                 135                 140                           

tgg aaa agt caa gct cca ttg att tca att ctc aat gct aga cat act        480
Trp Lys Ser Gln Ala Pro Leu Ile Ser Ile Leu Asn Ala Arg His Thr           
145                 150                 155                 160           

cct tgg gcc cta cac aga atg gga aat cca tca aat cca gcg tat atg        528
Pro Trp Ala Leu His Arg Met Gly Asn Pro Ser Asn Pro Ala Tyr Met           
                165                 170                 175               

cct gtc att cat tct aga ttt cct gta aaa atg aat ttc ttc caa aga        576
Pro Val Ile His Ser Arg Phe Pro Val Lys Met Asn Phe Phe Gln Arg           
            180                 185                 190                   

atg ata aat acg ggt tgg cat ttg tat ttt ctg tac atg tac ttt tat        624
Met Ile Asn Thr Gly Trp His Leu Tyr Phe Leu Tyr Met Tyr Phe Tyr           
        195                 200                 205                       

tat ggt aat gga gaa gat gcc aac aaa atg gcg aga aaa ttt ttt ggc        672
Tyr Gly Asn Gly Glu Asp Ala Asn Lys Met Ala Arg Lys Phe Phe Gly           
    210                 215                 220                           

aac gac atg ccc gac ata aat gaa atg gtt ttt aat aca tct tta tta        720
Asn Asp Met Pro Asp Ile Asn Glu Met Val Phe Asn Thr Ser Leu Leu           
225                 230                 235                 240           

ttc gta aat act cac ttt tcg gtt gat atg cca tat cct ttg gtt cca        768
Phe Val Asn Thr His Phe Ser Val Asp Met Pro Tyr Pro Leu Val Pro           
                245                 250                 255               

aac tgc att gaa ata gga gga ata cat gta aaa gag cca caa cca ctg        816
Asn Cys Ile Glu Ile Gly Gly Ile His Val Lys Glu Pro Gln Pro Leu           
            260                 265                 270                   

cct ttg gaa ata caa aaa ttc atg gac gaa gca gaa cat ggg gtc att        864
Pro Leu Glu Ile Gln Lys Phe Met Asp Glu Ala Glu His Gly Val Ile           
        275                 280                 285                       

ttc ttc acg cta gga tca atg gtg cgt act tcc acg ttt cca aat caa        912
Phe Phe Thr Leu Gly Ser Met Val Arg Thr Ser Thr Phe Pro Asn Gln           
    290                 295                 300                           

act att caa gca ttt aag gaa gct ttt gcc gaa tta cct caa aga gtc        960
Thr Ile Gln Ala Phe Lys Glu Ala Phe Ala Glu Leu Pro Gln Arg Val           
305                 310                 315                 320           

tta tgg aag ttt gag aat gaa aat gag gat atg cca tca aat gta ctc       1008
Leu Trp Lys Phe Glu Asn Glu Asn Glu Asp Met Pro Ser Asn Val Leu           
                325                 330                 335               

ata agg aaa tgg ttt cca caa aat gat ata ttc ggt cat aag aat atc       1056
Ile Arg Lys Trp Phe Pro Gln Asn Asp Ile Phe Gly His Lys Asn Ile           
            340                 345                 350                   

aaa gca ttc att agt cac ggt gga aat tct gga gct ctg gag gct gtt       1104
Lys Ala Phe Ile Ser His Gly Gly Asn Ser Gly Ala Leu Glu Ala Val           
        355                 360                 365                       

cat ttc gga gta ccg ata att gga att cct tta ttc tac gat cag tac       1152
His Phe Gly Val Pro Ile Ile Gly Ile Pro Leu Phe Tyr Asp Gln Tyr           
    370                 375                 380                           

agg aat att ttg agt ttc gtt aaa gaa ggt gtt gcc gtt ctt ttg gat       1200
Arg Asn Ile Leu Ser Phe Val Lys Glu Gly Val Ala Val Leu Leu Asp           
385                 390                 395                 400           

gtg aat gat ctg acg aaa gat aat att tta tct tct gtc agg act gtt       1248
Val Asn Asp Leu Thr Lys Asp Asn Ile Leu Ser Ser Val Arg Thr Val           
                405                 410                 415               

gtt aat gat aag agt tac tca gaa cgt atg aaa gca ttg tca caa cta       1296
Val Asn Asp Lys Ser Tyr Ser Glu Arg Met Lys Ala Leu Ser Gln Leu           
            420                 425                 430                   

ttc cga gat cga cca atg agt cct ctt gac aca gct gtt tac tgg aca       1344
Phe Arg Asp Arg Pro Met Ser Pro Leu Asp Thr Ala Val Tyr Trp Thr           
        435                 440                 445                       

gaa tat gtc atc cgc cat aga gga gcc cat cac ctc aag acc gct ggc       1392
Glu Tyr Val Ile Arg His Arg Gly Ala His His Leu Lys Thr Ala Gly           
    450                 455                 460                           

gca ttt ttg cat tgg tat cag tat tta ctt ttg gac gtt att acc ttc       1440
Ala Phe Leu His Trp Tyr Gln Tyr Leu Leu Leu Asp Val Ile Thr Phe           
465                 470                 475                 480           

tta tta gtc aca ttc tgc gct ttt tgt ttt att gtg aaa tat ata tgt       1488
Leu Leu Val Thr Phe Cys Ala Phe Cys Phe Ile Val Lys Tyr Ile Cys           
                485                 490                 495               

aaa gct ctc att cat cat tat tgg agc agt tcg aaa tct gaa aag ttg       1536
Lys Ala Leu Ile His His Tyr Trp Ser Ser Ser Lys Ser Glu Lys Leu           
            500                 505                 510                   

aaa aaa aat taa                                                       1548
Lys Lys Asn                                                               
        515                                                               


<210>  2
<211>  515
<212>  PRT
<213>  Dactylopius coccus costa

<400>  2

Met Glu Phe Arg Leu Leu Ile Leu Ala Leu Phe Ser Val Leu Met Ser 
1               5                   10                  15      


Thr Ser Asn Gly Ala Glu Ile Leu Ala Leu Phe Pro Ile His Gly Ile 
            20                  25                  30          


Ser Asn Tyr Asn Val Ala Glu Ala Leu Leu Lys Thr Leu Ala Asn Arg 
        35                  40                  45              


Gly His Asn Val Thr Val Val Thr Ser Phe Pro Gln Lys Lys Pro Val 
    50                  55                  60                  


Pro Asn Leu Tyr Glu Ile Asp Val Ser Gly Ala Lys Gly Leu Ala Thr 
65                  70                  75                  80  


Asn Ser Ile His Phe Glu Arg Leu Gln Thr Ile Ile Gln Asp Val Lys 
                85                  90                  95      


Ser Asn Phe Lys Asn Met Val Arg Leu Ser Arg Thr Tyr Cys Glu Ile 
            100                 105                 110         


Met Phe Ser Asp Pro Arg Val Leu Asn Ile Arg Asp Lys Lys Phe Asp 
        115                 120                 125             


Leu Val Ile Asn Ala Val Phe Gly Ser Asp Cys Asp Ala Gly Phe Ala 
    130                 135                 140                 


Trp Lys Ser Gln Ala Pro Leu Ile Ser Ile Leu Asn Ala Arg His Thr 
145                 150                 155                 160 


Pro Trp Ala Leu His Arg Met Gly Asn Pro Ser Asn Pro Ala Tyr Met 
                165                 170                 175     


Pro Val Ile His Ser Arg Phe Pro Val Lys Met Asn Phe Phe Gln Arg 
            180                 185                 190         


Met Ile Asn Thr Gly Trp His Leu Tyr Phe Leu Tyr Met Tyr Phe Tyr 
        195                 200                 205             


Tyr Gly Asn Gly Glu Asp Ala Asn Lys Met Ala Arg Lys Phe Phe Gly 
    210                 215                 220                 


Asn Asp Met Pro Asp Ile Asn Glu Met Val Phe Asn Thr Ser Leu Leu 
225                 230                 235                 240 


Phe Val Asn Thr His Phe Ser Val Asp Met Pro Tyr Pro Leu Val Pro 
                245                 250                 255     


Asn Cys Ile Glu Ile Gly Gly Ile His Val Lys Glu Pro Gln Pro Leu 
            260                 265                 270         


Pro Leu Glu Ile Gln Lys Phe Met Asp Glu Ala Glu His Gly Val Ile 
        275                 280                 285             


Phe Phe Thr Leu Gly Ser Met Val Arg Thr Ser Thr Phe Pro Asn Gln 
    290                 295                 300                 


Thr Ile Gln Ala Phe Lys Glu Ala Phe Ala Glu Leu Pro Gln Arg Val 
305                 310                 315                 320 


Leu Trp Lys Phe Glu Asn Glu Asn Glu Asp Met Pro Ser Asn Val Leu 
                325                 330                 335     


Ile Arg Lys Trp Phe Pro Gln Asn Asp Ile Phe Gly His Lys Asn Ile 
            340                 345                 350         


Lys Ala Phe Ile Ser His Gly Gly Asn Ser Gly Ala Leu Glu Ala Val 
        355                 360                 365             


His Phe Gly Val Pro Ile Ile Gly Ile Pro Leu Phe Tyr Asp Gln Tyr 
    370                 375                 380                 


Arg Asn Ile Leu Ser Phe Val Lys Glu Gly Val Ala Val Leu Leu Asp 
385                 390                 395                 400 


Val Asn Asp Leu Thr Lys Asp Asn Ile Leu Ser Ser Val Arg Thr Val 
                405                 410                 415     


Val Asn Asp Lys Ser Tyr Ser Glu Arg Met Lys Ala Leu Ser Gln Leu 
            420                 425                 430         


Phe Arg Asp Arg Pro Met Ser Pro Leu Asp Thr Ala Val Tyr Trp Thr 
        435                 440                 445             


Glu Tyr Val Ile Arg His Arg Gly Ala His His Leu Lys Thr Ala Gly 
    450                 455                 460                 


Ala Phe Leu His Trp Tyr Gln Tyr Leu Leu Leu Asp Val Ile Thr Phe 
465                 470                 475                 480 


Leu Leu Val Thr Phe Cys Ala Phe Cys Phe Ile Val Lys Tyr Ile Cys 
                485                 490                 495     


Lys Ala Leu Ile His His Tyr Trp Ser Ser Ser Lys Ser Glu Lys Leu 
            500                 505                 510         


Lys Lys Asn 
        515 


<210>  3
<211>  1548
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Yeast Optimized Sequence

<400>  3
atggaattca gattgttgat attggccttg ttctccgtat tgatgtctac ctctaatggt       60

gccgaaatct tggctttatt ccctattcat ggtatatcta actacaacgt agctgaagca      120

ttgttgaaga ctttggctaa cagaggtcac aacgttaccg ttgtaacttc atttccacaa      180

aagaaaccag ttcctaattt gtacgaaatt gatgtatcag gtgcaaaggg tttagccaca      240

aactccatcc atttcgaaag attgcaaacc atcatccaag atgtcaagag taacttcaag      300

aacatggtta gattgtctag aacatactgt gaaatcatgt tctcagaccc aagagttttg      360

aacatcagag ataaaaagtt tgacttggtt ataaacgccg tattcggttc agattgcgac      420

gctggttttg catggaaaag tcaagctcct ttaatatcta tcttgaatgc cagacataca      480

ccatgggctt tgcacagaat gggtaatcct tccaacccag catatatgcc tgtaatccat      540

agtagattcc cagtcaagat gaatttcttt caaagaatga taaacaccgg ttggcactta      600

tactttttgt acatgtactt ctactacggt aatggtgaag atgctaacaa aatggcaaga      660

aagtttttcg gtaatgatat gcctgacata aacgaaatgg tttttaacac ctccttgttg      720

ttcgtaaaca ctcatttcag tgtcgatatg ccataccctt tagtcccaaa ctgtatcgaa      780

atcggtggta tccatgttaa ggaaccacaa cctttgccat tggaaatcca aaagtttatg      840

gatgaagcag aacatggtgt aatctttttc accttgggta gtatggtcag aacttctaca      900

ttccctaatc aaactattca agcctttaaa gaagccttcg ctgaattacc acaaagagtt      960

ttgtggaagt tcgaaaacga aaacgaagat atgccttcca acgttttgat cagaaagtgg     1020

ttcccacaaa acgacatctt cggtcataag aacatcaagg ctttcatttc acacggtggt     1080

aattccggtg ccttggaagc tgtccatttc ggtgttccta tcataggtat cccattgttt     1140

tatgatcaat acagaaacat cttgtctttc gttaaagaag gtgtagctgt cttgttggat     1200

gtaaacgact taactaagga taacatcttg tcttcagtta gaacagtcgt taacgacaag     1260

tcatactccg aaagaatgaa ggcattgtct caattgttta gagatagacc tatgtcacca     1320

ttagacacag ctgtttattg gaccgaatac gtaattagac atagaggtgc acatcactta     1380

aaaactgcag gtgccttttt gcactggtat caatacttgt tgttggatgt catcacattt     1440

ttgttggtta cattctgtgc attctgcttc atcgttaagt acatctgcaa ggccttaatc     1500

catcactact ggtccagttc taaatctgaa aagttgaaaa agaattaa                  1548


<210>  4
<211>  492
<212>  PRT
<213>  Sorghum bicolor

<400>  4

Met Gly Ser Asn Ala Pro Pro Pro Pro Thr Pro His Val Val Leu Val 
1               5                   10                  15      


Pro Phe Pro Gly Gln Gly His Val Ala Pro Leu Met Gln Leu Ala Arg 
            20                  25                  30          


Leu Leu His Ala Arg Gly Ala Arg Val Thr Phe Val Tyr Thr Gln Tyr 
        35                  40                  45              


Asn Tyr Arg Arg Leu Leu Arg Ala Lys Gly Glu Ala Ala Val Arg Pro 
    50                  55                  60                  


Pro Ala Thr Ser Ser Ala Arg Phe Arg Ile Glu Val Ile Asp Asp Gly 
65                  70                  75                  80  


Leu Ser Leu Ser Val Pro Gln Asn Asp Val Gly Gly Leu Val Asp Ser 
                85                  90                  95      


Leu Arg Lys Asn Cys Leu His Pro Phe Arg Ala Leu Leu Arg Arg Leu 
            100                 105                 110         


Gly Gln Glu Val Glu Gly Gln Asp Ala Pro Pro Val Thr Cys Val Val 
        115                 120                 125             


Gly Asp Val Val Met Thr Phe Ala Ala Ala Ala Ala Arg Glu Ala Gly 
    130                 135                 140                 


Ile Pro Glu Val Gln Phe Phe Thr Ala Ser Ala Cys Gly Leu Leu Gly 
145                 150                 155                 160 


Tyr Leu His Tyr Gly Glu Leu Val Glu Arg Gly Leu Val Pro Phe Arg 
                165                 170                 175     


Asp Ala Ser Leu Leu Ala Asp Asp Asp Tyr Leu Asp Thr Pro Leu Glu 
            180                 185                 190         


Trp Val Pro Gly Met Ser His Met Arg Leu Arg Asp Met Pro Thr Phe 
        195                 200                 205             


Cys Arg Thr Thr Asp Pro Asp Asp Val Met Val Ser Ala Thr Leu Gln 
    210                 215                 220                 


Gln Met Glu Ser Ala Ala Gly Ser Lys Ala Leu Ile Leu Asn Thr Leu 
225                 230                 235                 240 


Tyr Glu Leu Glu Lys Asp Val Val Asp Ala Leu Ala Ala Phe Phe Pro 
                245                 250                 255     


Pro Ile Tyr Thr Val Gly Pro Leu Ala Glu Val Ile Ala Ser Ser Asp 
            260                 265                 270         


Ser Ala Ser Ala Gly Leu Ala Ala Met Asp Ile Ser Ile Trp Gln Glu 
        275                 280                 285             


Asp Thr Arg Cys Leu Ser Trp Leu Asp Gly Lys Pro Ala Gly Ser Val 
    290                 295                 300                 


Val Tyr Val Asn Phe Gly Ser Met Ala Val Met Thr Ala Ala Gln Ala 
305                 310                 315                 320 


Arg Glu Phe Ala Leu Gly Leu Ala Ser Cys Gly Ser Pro Phe Leu Trp 
                325                 330                 335     


Val Lys Arg Pro Asp Val Val Glu Gly Glu Glu Val Leu Leu Pro Glu 
            340                 345                 350         


Ala Leu Leu Asp Glu Val Ala Arg Gly Arg Gly Leu Val Val Pro Trp 
        355                 360                 365             


Cys Pro Gln Ala Ala Val Leu Lys His Ala Ala Val Gly Leu Phe Val 
    370                 375                 380                 


Ser His Cys Gly Trp Asn Ser Leu Leu Glu Ala Thr Ala Ala Gly Gln 
385                 390                 395                 400 


Pro Val Leu Ala Trp Pro Cys His Gly Glu Gln Thr Thr Asn Cys Arg 
                405                 410                 415     


Gln Leu Cys Glu Val Trp Gly Asn Gly Ala Gln Leu Pro Arg Glu Val 
            420                 425                 430         


Glu Ser Gly Ala Val Ala Arg Leu Val Arg Glu Met Met Val Gly Asp 
        435                 440                 445             


Leu Gly Lys Glu Lys Arg Ala Lys Ala Ala Glu Trp Lys Ala Ala Ala 
    450                 455                 460                 


Glu Ala Ala Ala Arg Lys Gly Gly Ala Ser Trp Arg Asn Val Glu Arg 
465                 470                 475                 480 


Val Val Asn Asp Leu Leu Leu Val Gly Gly Lys Gln 
                485                 490         


<210>  5
<211>  471
<212>  PRT
<213>  Oryza sativa

<400>  5

Met Pro Ser Ser Gly Asp Ala Ala Gly Arg Arg Pro His Val Val Leu 
1               5                   10                  15      


Ile Pro Ser Ala Gly Met Gly His Leu Val Pro Phe Gly Arg Leu Ala 
            20                  25                  30          


Val Ala Leu Ser Ser Gly His Gly Cys Asp Val Ser Leu Val Thr Val 
        35                  40                  45              


Leu Pro Thr Val Ser Thr Ala Glu Ser Lys His Leu Asp Ala Leu Phe 
    50                  55                  60                  


Asp Ala Phe Pro Ala Val Arg Arg Leu Asp Phe Glu Leu Ala Pro Phe 
65                  70                  75                  80  


Asp Ala Ser Glu Phe Pro Gly Ala Asp Pro Phe Phe Leu Arg Phe Glu 
                85                  90                  95      


Ala Met Arg Arg Ser Ala Pro Leu Leu Gly Pro Leu Leu Thr Gly Ala 
            100                 105                 110         


Gly Ala Ser Ala Leu Ala Thr Asp Ile Ala Leu Thr Ser Val Val Ile 
        115                 120                 125             


Pro Val Ala Lys Glu Gln Gly Leu Pro Cys His Ile Leu Phe Thr Ala 
    130                 135                 140                 


Ser Ala Ala Met Leu Ser Leu Cys Ala Tyr Phe Pro Thr Tyr Leu Asp 
145                 150                 155                 160 


Ala Asn Ala Gly Gly Gly Gly Gly Val Gly Asp Val Asp Ile Pro Gly 
                165                 170                 175     


Val Tyr Arg Ile Pro Lys Ala Ser Ile Pro Gln Ala Leu His Asp Pro 
            180                 185                 190         


Asn His Leu Phe Thr Arg Gln Phe Val Ala Asn Gly Arg Ser Leu Thr 
        195                 200                 205             


Ser Ala Ala Gly Ile Leu Val Asn Thr Phe Asp Ala Leu Glu Pro Glu 
    210                 215                 220                 


Ala Val Ala Ala Leu Gln Gln Gly Lys Val Ala Ser Gly Phe Pro Pro 
225                 230                 235                 240 


Val Phe Ala Val Gly Pro Leu Leu Pro Ala Ser Asn Gln Ala Lys Asp 
                245                 250                 255     


Pro Gln Ala Asn Tyr Met Glu Trp Leu Asp Ala Gln Pro Ala Arg Ser 
            260                 265                 270         


Val Val Tyr Val Ser Phe Gly Ser Arg Lys Ala Ile Ser Arg Glu Gln 
        275                 280                 285             


Leu Arg Glu Leu Ala Ala Gly Leu Glu Gly Ser Gly His Arg Phe Leu 
    290                 295                 300                 


Trp Val Val Lys Ser Thr Val Val Asp Arg Asp Asp Ala Ala Glu Leu 
305                 310                 315                 320 


Gly Glu Leu Leu Asp Glu Gly Phe Leu Glu Arg Val Glu Lys Arg Gly 
                325                 330                 335     


Leu Val Thr Lys Ala Trp Val Asp Gln Glu Glu Val Leu Lys His Glu 
            340                 345                 350         


Ser Val Ala Leu Phe Val Ser His Cys Gly Trp Asn Ser Val Thr Glu 
        355                 360                 365             


Ala Ala Ala Ser Gly Val Pro Val Leu Ala Leu Pro Arg Phe Gly Asp 
    370                 375                 380                 


Gln Arg Val Asn Ser Gly Val Val Ala Arg Ala Gly Leu Gly Val Trp 
385                 390                 395                 400 


Ala Asp Thr Trp Ser Trp Glu Gly Glu Ala Gly Val Ile Gly Ala Glu 
                405                 410                 415     


Glu Ile Ser Glu Lys Val Lys Ala Ala Met Ala Asp Glu Ala Leu Arg 
            420                 425                 430         


Met Lys Ala Ala Ser Leu Ala Glu Ala Ala Ala Lys Ala Val Ala Gly 
        435                 440                 445             


Gly Gly Ser Ser His Arg Cys Leu Ala Glu Phe Ala Arg Leu Cys Gln 
    450                 455                 460                 


Gly Gly Thr Cys Arg Thr Asn 
465                 470     


