                         SEQUENCE LISTING

<110>  Novozymes A/S
 
<120>  Polypeptides having peroxygenase activity

<130>  12491-WO-PCT

<160>  18    

<170>  PatentIn version 3.5

<210>  1
<211>  1025
<212>  DNA
<213>  Thielavia terrestris


<220>
<221>  CDS
<222>  (1)..(109)

<220>
<221>  sig_peptide
<222>  (1)..(51)

<220>
<221>  mat_peptide
<222>  (52)..(1022)

<220>
<221>  Intron
<222>  (110)..(174)

<220>
<221>  CDS
<222>  (175)..(418)

<220>
<221>  Intron
<222>  (419)..(496)

<220>
<221>  CDS
<222>  (497)..(771)

<220>
<221>  Intron
<222>  (772)..(852)

<220>
<221>  CDS
<222>  (853)..(1022)

<400>  1
atg aag ctc tcg atc ctc ctg acc ctg gca tca ggc caa ctg gct ttg         48
Met Lys Leu Ser Ile Leu Leu Thr Leu Ala Ser Gly Gln Leu Ala Leu           
        -15                 -10                 -5                        

agt tcg ccg gac tgg tcc agt ccc gag ttc tgg agc tgg cat cct cct         96
Ser Ser Pro Asp Trp Ser Ser Pro Glu Phe Trp Ser Trp His Pro Pro           
-1  1               5                   10                  15            

gct cca ggc gat g gtgggttgtg atcctggtgt accaagctcc ttcggcttat          149
Ala Pro Gly Asp                                                           
                                                                          

gctaaccatt ctcctcatcc gaaag at  cgg cgc ggg ccc tgt ccg atg ctg        200
                            Asp Arg Arg Gly Pro Cys Pro Met Leu           
                            20                  25                        

aac aca ttg gcc aac cac ggc ttt ctg cca cac aac ggg cgc aac atc        248
Asn Thr Leu Ala Asn His Gly Phe Leu Pro His Asn Gly Arg Asn Ile           
    30                  35                  40                            

acc aag gag atc acg gtg aat gct ctc aac tcc gcc ctg aac gtt aat        296
Thr Lys Glu Ile Thr Val Asn Ala Leu Asn Ser Ala Leu Asn Val Asn           
45                  50                  55                  60            

aag acg ctc ggc gag ctc ctg ttc aac ttc gcg gta acc acc aac ccc        344
Lys Thr Leu Gly Glu Leu Leu Phe Asn Phe Ala Val Thr Thr Asn Pro           
                65                  70                  75                

cag ccc aac gcg acc ttc ttc gac ctc gac cat ctc agc cgc cac aac        392
Gln Pro Asn Ala Thr Phe Phe Asp Leu Asp His Leu Ser Arg His Asn           
            80                  85                  90                    

att ctg gag cac gac gcc agc ttg ag  gtgagtgctc ctgctcttgg              438
Ile Leu Glu His Asp Ala Ser Leu Ser                                       
        95                  100                                           

gtcacgcgcc gacaggccga aaaccgcctt catactgatg ttcaaatcca cactgcag c      497

cgc gcg gac tac tac ttc ggt cac gac gac cac acg ttc aac caa act        545
Arg Ala Asp Tyr Tyr Phe Gly His Asp Asp His Thr Phe Asn Gln Thr           
            105                 110                 115                   

gtc ttc gac cag acc aag tcc tac tgg aag acc ccc atc atc gac gtc        593
Val Phe Asp Gln Thr Lys Ser Tyr Trp Lys Thr Pro Ile Ile Asp Val           
        120                 125                 130                       

cag cag gcg gcc aac gcc cgc ctg gcg cgc gtg ctg acc tcc aac gcg        641
Gln Gln Ala Ala Asn Ala Arg Leu Ala Arg Val Leu Thr Ser Asn Ala           
    135                 140                 145                           

acc aac ccc acc ttc gtg ctc tcc cag atc ggc gag gcg ttc agc ttc        689
Thr Asn Pro Thr Phe Val Leu Ser Gln Ile Gly Glu Ala Phe Ser Phe           
150                 155                 160                 165           

ggc gag acg gcc gcg tac atc ctc gcg ctg ggc gac cgc gtg tcc ggg        737
Gly Glu Thr Ala Ala Tyr Ile Leu Ala Leu Gly Asp Arg Val Ser Gly           
                170                 175                 180               

acg gtg ccc cgg cag tgg gtc gag tat ctc ttc g gttagtttgc               781
Thr Val Pro Arg Gln Trp Val Glu Tyr Leu Phe                               
            185                 190                                       

tgctggctgc ttgatctgcg tcgttgtctt ggcagtgctt gtgtctgaca gtggattcca      841

aattttgcca g ag  aac gaa cgc ctg ccg ctg gaa ctt ggc tgg cgg cgg       890
             Glu Asn Glu Arg Leu Pro Leu Glu Leu Gly Trp Arg Arg          
                     195                 200                 205          

gcg aag gag gtc att tca aac tcg gac ctc gac caa ctg acg aac cgg        938
Ala Lys Glu Val Ile Ser Asn Ser Asp Leu Asp Gln Leu Thr Asn Arg           
                210                 215                 220               

gtt atc aac gcc act ggg gcg ctg gca aac atc acg cga aag att aag        986
Val Ile Asn Ala Thr Gly Ala Leu Ala Asn Ile Thr Arg Lys Ile Lys           
            225                 230                 235                   

gtg cgg gac ttc cac gcc ggg agg ttt ccg ggg gag tga                   1025
Val Arg Asp Phe His Ala Gly Arg Phe Pro Gly Glu                           
        240                 245                                           


<210>  2
<211>  266
<212>  PRT
<213>  Thielavia terrestris

<400>  2

Met Lys Leu Ser Ile Leu Leu Thr Leu Ala Ser Gly Gln Leu Ala Leu 
        -15                 -10                 -5              


Ser Ser Pro Asp Trp Ser Ser Pro Glu Phe Trp Ser Trp His Pro Pro 
-1  1               5                   10                  15  


Ala Pro Gly Asp Asp Arg Arg Gly Pro Cys Pro Met Leu Asn Thr Leu 
                20                  25                  30      


Ala Asn His Gly Phe Leu Pro His Asn Gly Arg Asn Ile Thr Lys Glu 
            35                  40                  45          


Ile Thr Val Asn Ala Leu Asn Ser Ala Leu Asn Val Asn Lys Thr Leu 
        50                  55                  60              


Gly Glu Leu Leu Phe Asn Phe Ala Val Thr Thr Asn Pro Gln Pro Asn 
    65                  70                  75                  


Ala Thr Phe Phe Asp Leu Asp His Leu Ser Arg His Asn Ile Leu Glu 
80                  85                  90                  95  


His Asp Ala Ser Leu Ser Arg Ala Asp Tyr Tyr Phe Gly His Asp Asp 
                100                 105                 110     


His Thr Phe Asn Gln Thr Val Phe Asp Gln Thr Lys Ser Tyr Trp Lys 
            115                 120                 125         


Thr Pro Ile Ile Asp Val Gln Gln Ala Ala Asn Ala Arg Leu Ala Arg 
        130                 135                 140             


Val Leu Thr Ser Asn Ala Thr Asn Pro Thr Phe Val Leu Ser Gln Ile 
    145                 150                 155                 


Gly Glu Ala Phe Ser Phe Gly Glu Thr Ala Ala Tyr Ile Leu Ala Leu 
160                 165                 170                 175 


Gly Asp Arg Val Ser Gly Thr Val Pro Arg Gln Trp Val Glu Tyr Leu 
                180                 185                 190     


Phe Glu Asn Glu Arg Leu Pro Leu Glu Leu Gly Trp Arg Arg Ala Lys 
            195                 200                 205         


Glu Val Ile Ser Asn Ser Asp Leu Asp Gln Leu Thr Asn Arg Val Ile 
        210                 215                 220             


Asn Ala Thr Gly Ala Leu Ala Asn Ile Thr Arg Lys Ile Lys Val Arg 
    225                 230                 235                 


Asp Phe His Ala Gly Arg Phe Pro Gly Glu 
240                 245                 


<210>  3
<211>  8
<212>  PRT
<213>  Artificial

<220>
<223>  Motif sequence

<400>  3

Glu His Asp Gly Ser Leu Ser Arg 
1               5               


<210>  4
<211>  8
<212>  PRT
<213>  Artificial

<220>
<223>  Motif sequence

<400>  4

Glu His Asp Ala Ser Leu Ser Arg 
1               5               


<210>  5
<211>  8
<212>  PRT
<213>  Artificial

<220>
<223>  Motif sequence

<400>  5

Glu His Asp Gly Ser Ile Ser Arg 
1               5               


<210>  6
<211>  8
<212>  PRT
<213>  Artificial

<220>
<223>  Motif sequence

<400>  6

Glu His Asp Ala Ser Ile Ser Arg 
1               5               


<210>  7
<211>  8
<212>  PRT
<213>  Artificial

<220>
<223>  Motif sequence


<220>
<221>  MISC_FEATURE
<222>  (4)..(4)
<223>  Xaa is Gly or Ala

<220>
<221>  MISC_FEATURE
<222>  (6)..(6)
<223>  Xaa is Leu or Ile

<400>  7

Glu His Asp Xaa Ser Xaa Ser Arg 
1               5               


<210>  8
<211>  10
<212>  PRT
<213>  Artificial

<220>
<223>  Motif sequence


<220>
<221>  misc_feature
<222>  (6)..(6)
<223>  Xaa can be any naturally occurring amino acid

<400>  8

Arg Gly Pro Cys Pro Xaa Met Asn Ser Leu 
1               5                   10  


<210>  9
<211>  10
<212>  PRT
<213>  Artificial

<220>
<223>  Motif sequence


<220>
<221>  misc_feature
<222>  (6)..(6)
<223>  Xaa can be any naturally occurring amino acid

<400>  9

Arg Ala Pro Cys Pro Xaa Met Asn Ser Leu 
1               5                   10  


<210>  10
<211>  10
<212>  PRT
<213>  Artificial

<220>
<223>  Motif sequence


<220>
<221>  misc_feature
<222>  (6)..(6)
<223>  Xaa can be any naturally occurring amino acid

<400>  10

Arg Gly Pro Cys Pro Xaa Leu Asn Ser Leu 
1               5                   10  


<210>  11
<211>  10
<212>  PRT
<213>  Artificial

<220>
<223>  Motif sequence


<220>
<221>  misc_feature
<222>  (6)..(6)
<223>  Xaa can be any naturally occurring amino acid

<400>  11

Arg Ala Pro Cys Pro Xaa Leu Asn Ser Leu 
1               5                   10  


<210>  12
<211>  10
<212>  PRT
<213>  Artificial

<220>
<223>  Motif sequence


<220>
<221>  misc_feature
<222>  (6)..(6)
<223>  Xaa can be any naturally occurring amino acid

<400>  12

Arg Gly Pro Cys Pro Xaa Met Asn Thr Leu 
1               5                   10  


<210>  13
<211>  10
<212>  PRT
<213>  Artificial

<220>
<223>  Motif sequence


<220>
<221>  misc_feature
<222>  (6)..(6)
<223>  Xaa can be any naturally occurring amino acid

<400>  13

Arg Ala Pro Cys Pro Xaa Met Asn Thr Leu 
1               5                   10  


<210>  14
<211>  10
<212>  PRT
<213>  Artificial

<220>
<223>  Motif sequence


<220>
<221>  misc_feature
<222>  (6)..(6)
<223>  Xaa can be any naturally occurring amino acid

<400>  14

Arg Gly Pro Cys Pro Xaa Leu Asn Thr Leu 
1               5                   10  


<210>  15
<211>  10
<212>  PRT
<213>  Artificial

<220>
<223>  Motif sequence


<220>
<221>  misc_feature
<222>  (6)..(6)
<223>  Xaa can be any naturally occurring amino acid

<400>  15

Arg Ala Pro Cys Pro Xaa Leu Asn Thr Leu 
1               5                   10  


<210>  16
<211>  10
<212>  PRT
<213>  Artificial

<220>
<223>  Motif sequence


<220>
<221>  MISC_FEATURE
<222>  (2)..(2)
<223>  Xaa is Gly or Ala

<220>
<221>  MISC_FEATURE
<222>  (6)..(6)
<223>  Xaa is any amino acid

<220>
<221>  MISC_FEATURE
<222>  (7)..(7)
<223>  Xaa is Leu or Met

<220>
<221>  MISC_FEATURE
<222>  (9)..(9)
<223>  Xaa is Ser or Thr

<400>  16

Arg Xaa Pro Cys Pro Xaa Xaa Asn Xaa Leu 
1               5                   10  


<210>  17
<211>  43
<212>  DNA
<213>  Artificial

<220>
<223>  Primer forward

<400>  17
acacaactgg ggatccacca tgaagctctc gatcctcctg acc                         43


<210>  18
<211>  36
<212>  DNA
<213>  Artificial

<220>
<223>  Primer reverse

<400>  18
agatctcgag aagcttactc ccccggaaac ctcccg                                 36


