                         SEQUENCE LISTING

<110>  Trustees of the University of Pennsylvania
 
<120>  GENE THERAPY FOR OCULAR DISORDERS

<130>  UPN-17-8313PCT

<150>  US 62/2519821
<151>  2017-06-14

<160>  46    

<170>  PatentIn version 3.5

<210>  1
<211>  1962
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  codon-optimized sequence


<220>
<221>  CDS
<222>  (1)..(1962)

<400>  1
atg gct gat acc ctg ccc tct gaa ttc gac gtg att gtg att gga acc         48
Met Ala Asp Thr Leu Pro Ser Glu Phe Asp Val Ile Val Ile Gly Thr           
1               5                   10                  15                

gga ctc cct gaa tcg atc atc gcc gcg gcc tgt tcc cgg tcc ggt cgg         96
Gly Leu Pro Glu Ser Ile Ile Ala Ala Ala Cys Ser Arg Ser Gly Arg           
            20                  25                  30                    

cgc gtg ctg cac gtc gat tcg aga agc tac tac gga ggg aat tgg gcc        144
Arg Val Leu His Val Asp Ser Arg Ser Tyr Tyr Gly Gly Asn Trp Ala           
        35                  40                  45                        

tca ttc tcc ttc tcc gga ctg ctc tcc tgg ctg aag gag tat cag gag        192
Ser Phe Ser Phe Ser Gly Leu Leu Ser Trp Leu Lys Glu Tyr Gln Glu           
    50                  55                  60                            

aac tcc gac att gtc tcc gac tca cct gtg tgg cag gac cag atc ctg        240
Asn Ser Asp Ile Val Ser Asp Ser Pro Val Trp Gln Asp Gln Ile Leu           
65                  70                  75                  80            

gaa aac gag gaa gca ata gcc ctg agc cgg aag gac aag acc atc cag        288
Glu Asn Glu Glu Ala Ile Ala Leu Ser Arg Lys Asp Lys Thr Ile Gln           
                85                  90                  95                

cac gtg gag gtg ttc tgt tat gcc tcc caa gac ctc cat gag gac gtg        336
His Val Glu Val Phe Cys Tyr Ala Ser Gln Asp Leu His Glu Asp Val           
            100                 105                 110                   

gaa gag gct gga gcg ttg cag aag aat cat gcc ctc gtg acc tcc gct        384
Glu Glu Ala Gly Ala Leu Gln Lys Asn His Ala Leu Val Thr Ser Ala           
        115                 120                 125                       

aac tcc acc gag gca gcc gac agc gcc ttc ctg ccg acc gag gat gaa        432
Asn Ser Thr Glu Ala Ala Asp Ser Ala Phe Leu Pro Thr Glu Asp Glu           
    130                 135                 140                           

tcc ctg tca act atg tcg tgc gaa atg ctg acc gaa cag act ccg agc        480
Ser Leu Ser Thr Met Ser Cys Glu Met Leu Thr Glu Gln Thr Pro Ser           
145                 150                 155                 160           

tcc gac ccc gaa aac gcc ctg gaa gtg aac gga gcg gaa gtg acc ggc        528
Ser Asp Pro Glu Asn Ala Leu Glu Val Asn Gly Ala Glu Val Thr Gly           
                165                 170                 175               

gaa aag gag aac cat tgc gac gac aag act tgt gtc cca tcc act tcc        576
Glu Lys Glu Asn His Cys Asp Asp Lys Thr Cys Val Pro Ser Thr Ser           
            180                 185                 190                   

gcg gag gac atg tcc gag aat gtg cct atc gcc gag gac acc acc gaa        624
Ala Glu Asp Met Ser Glu Asn Val Pro Ile Ala Glu Asp Thr Thr Glu           
        195                 200                 205                       

cag ccc aag aag aac aga atc acg tac agc cag atc atc aag gag ggg        672
Gln Pro Lys Lys Asn Arg Ile Thr Tyr Ser Gln Ile Ile Lys Glu Gly           
    210                 215                 220                           

cgg agg ttt aac atc gat ctg gtg tcg aag ctg ctg tac agc cgc ggt        720
Arg Arg Phe Asn Ile Asp Leu Val Ser Lys Leu Leu Tyr Ser Arg Gly           
225                 230                 235                 240           

ctg ctg atc gat ctg ctc att aag tcg aac gtg tcg aga tac gcc gag        768
Leu Leu Ile Asp Leu Leu Ile Lys Ser Asn Val Ser Arg Tyr Ala Glu           
                245                 250                 255               

ttc aag aac atc aca agg att ctc gcc ttc cgg gaa gga aga gtg gaa        816
Phe Lys Asn Ile Thr Arg Ile Leu Ala Phe Arg Glu Gly Arg Val Glu           
            260                 265                 270                   

caa gtg ccg tgc tcc cgg gcc gac gtg ttc aac tca aag caa ctt acc        864
Gln Val Pro Cys Ser Arg Ala Asp Val Phe Asn Ser Lys Gln Leu Thr           
        275                 280                 285                       

atg gtg gaa aag cgc atg ctg atg aaa ttc ctg acc ttc tgc atg gag        912
Met Val Glu Lys Arg Met Leu Met Lys Phe Leu Thr Phe Cys Met Glu           
    290                 295                 300                           

tac gaa aag tac cct gat gag tac aag ggt tac gaa gaa att act ttc        960
Tyr Glu Lys Tyr Pro Asp Glu Tyr Lys Gly Tyr Glu Glu Ile Thr Phe           
305                 310                 315                 320           

tac gag tac ctc aag acc cag aag ctg acc ccg aat ctg cag tac att       1008
Tyr Glu Tyr Leu Lys Thr Gln Lys Leu Thr Pro Asn Leu Gln Tyr Ile           
                325                 330                 335               

gtg atg cac tca atc gca atg acc tcc gaa acc gcc tcc tcg acc atc       1056
Val Met His Ser Ile Ala Met Thr Ser Glu Thr Ala Ser Ser Thr Ile           
            340                 345                 350                   

gac ggg ctc aag gcc acc aag aac ttc ctg cac tgt ttg ggg cgc tac       1104
Asp Gly Leu Lys Ala Thr Lys Asn Phe Leu His Cys Leu Gly Arg Tyr           
        355                 360                 365                       

ggc aac act ccg ttc ctc ttc ccg ctg tac ggc cag gga gag ctg cct       1152
Gly Asn Thr Pro Phe Leu Phe Pro Leu Tyr Gly Gln Gly Glu Leu Pro           
    370                 375                 380                           

cag tgt ttc tgc cgg atg tgc gcc gtg ttc ggc gga atc tac tgt ctc       1200
Gln Cys Phe Cys Arg Met Cys Ala Val Phe Gly Gly Ile Tyr Cys Leu           
385                 390                 395                 400           

cgc cac tcg gtc cag tgc ctg gtg gtg gac aag gaa tcc agg aag tgc       1248
Arg His Ser Val Gln Cys Leu Val Val Asp Lys Glu Ser Arg Lys Cys           
                405                 410                 415               

aaa gcc att att gac cag ttc gga caa cgg atc att tcc gag cac ttt       1296
Lys Ala Ile Ile Asp Gln Phe Gly Gln Arg Ile Ile Ser Glu His Phe           
            420                 425                 430                   

ctt gtg gag gac tca tac ttc ccg gag aac atg tgc tct cgg gtc cag       1344
Leu Val Glu Asp Ser Tyr Phe Pro Glu Asn Met Cys Ser Arg Val Gln           
        435                 440                 445                       

tat cga cag att tcc agg gcg gtg ctc att act gac cgg agc gtc ctc       1392
Tyr Arg Gln Ile Ser Arg Ala Val Leu Ile Thr Asp Arg Ser Val Leu           
    450                 455                 460                           

aag acc gat agc gac cag cag atc tcc atc ctg acc gtg ccg gcg gaa       1440
Lys Thr Asp Ser Asp Gln Gln Ile Ser Ile Leu Thr Val Pro Ala Glu           
465                 470                 475                 480           

gaa ccc ggc act ttt gcc gtg cgc gtg atc gag ctt tgc tca tcc acc       1488
Glu Pro Gly Thr Phe Ala Val Arg Val Ile Glu Leu Cys Ser Ser Thr           
                485                 490                 495               

atg act tgc atg aaa ggc act tac ctg gtg cac ctg acg tgc acc tca       1536
Met Thr Cys Met Lys Gly Thr Tyr Leu Val His Leu Thr Cys Thr Ser           
            500                 505                 510                   

tcg aaa acc gct aga gag gac ctg gaa tcc gtc gtc caa aag ctg ttc       1584
Ser Lys Thr Ala Arg Glu Asp Leu Glu Ser Val Val Gln Lys Leu Phe           
        515                 520                 525                       

gtg cct tac acc gag atg gaa att gaa aac gaa caa gtg gag aag ccc       1632
Val Pro Tyr Thr Glu Met Glu Ile Glu Asn Glu Gln Val Glu Lys Pro           
    530                 535                 540                           

cgc atc ctt tgg gcc ctg tac ttt aac atg cgc gat tcc tcc gat atc       1680
Arg Ile Leu Trp Ala Leu Tyr Phe Asn Met Arg Asp Ser Ser Asp Ile           
545                 550                 555                 560           

tcg cgg tcc tgc tat aac gac ttg cct tcg aac gtc tac gtc tgc tcc       1728
Ser Arg Ser Cys Tyr Asn Asp Leu Pro Ser Asn Val Tyr Val Cys Ser           
                565                 570                 575               

ggg cca gac tgc ggt ctt ggc aac gac aat gcc gtg aag cag gcg gaa       1776
Gly Pro Asp Cys Gly Leu Gly Asn Asp Asn Ala Val Lys Gln Ala Glu           
            580                 585                 590                   

aca ctg ttc caa gag atc tgc cct aac gag gat ttt tgc ccg ccc ccc       1824
Thr Leu Phe Gln Glu Ile Cys Pro Asn Glu Asp Phe Cys Pro Pro Pro           
        595                 600                 605                       

cca aac ccc gag gat atc atc ttg gac gga gac agc ctg cag cca gaa       1872
Pro Asn Pro Glu Asp Ile Ile Leu Asp Gly Asp Ser Leu Gln Pro Glu           
    610                 615                 620                           

gca tcc gag tcc agc gcc atc ccg gag gcc aac agc gaa acc ttc aag       1920
Ala Ser Glu Ser Ser Ala Ile Pro Glu Ala Asn Ser Glu Thr Phe Lys           
625                 630                 635                 640           

gag agc act aac ctg ggc aac ctg gaa gag tcc agc gaa tga               1962
Glu Ser Thr Asn Leu Gly Asn Leu Glu Glu Ser Ser Glu                       
                645                 650                                   


<210>  2
<211>  653
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  2

Met Ala Asp Thr Leu Pro Ser Glu Phe Asp Val Ile Val Ile Gly Thr 
1               5                   10                  15      


Gly Leu Pro Glu Ser Ile Ile Ala Ala Ala Cys Ser Arg Ser Gly Arg 
            20                  25                  30          


Arg Val Leu His Val Asp Ser Arg Ser Tyr Tyr Gly Gly Asn Trp Ala 
        35                  40                  45              


Ser Phe Ser Phe Ser Gly Leu Leu Ser Trp Leu Lys Glu Tyr Gln Glu 
    50                  55                  60                  


Asn Ser Asp Ile Val Ser Asp Ser Pro Val Trp Gln Asp Gln Ile Leu 
65                  70                  75                  80  


Glu Asn Glu Glu Ala Ile Ala Leu Ser Arg Lys Asp Lys Thr Ile Gln 
                85                  90                  95      


His Val Glu Val Phe Cys Tyr Ala Ser Gln Asp Leu His Glu Asp Val 
            100                 105                 110         


Glu Glu Ala Gly Ala Leu Gln Lys Asn His Ala Leu Val Thr Ser Ala 
        115                 120                 125             


Asn Ser Thr Glu Ala Ala Asp Ser Ala Phe Leu Pro Thr Glu Asp Glu 
    130                 135                 140                 


Ser Leu Ser Thr Met Ser Cys Glu Met Leu Thr Glu Gln Thr Pro Ser 
145                 150                 155                 160 


Ser Asp Pro Glu Asn Ala Leu Glu Val Asn Gly Ala Glu Val Thr Gly 
                165                 170                 175     


Glu Lys Glu Asn His Cys Asp Asp Lys Thr Cys Val Pro Ser Thr Ser 
            180                 185                 190         


Ala Glu Asp Met Ser Glu Asn Val Pro Ile Ala Glu Asp Thr Thr Glu 
        195                 200                 205             


Gln Pro Lys Lys Asn Arg Ile Thr Tyr Ser Gln Ile Ile Lys Glu Gly 
    210                 215                 220                 


Arg Arg Phe Asn Ile Asp Leu Val Ser Lys Leu Leu Tyr Ser Arg Gly 
225                 230                 235                 240 


Leu Leu Ile Asp Leu Leu Ile Lys Ser Asn Val Ser Arg Tyr Ala Glu 
                245                 250                 255     


Phe Lys Asn Ile Thr Arg Ile Leu Ala Phe Arg Glu Gly Arg Val Glu 
            260                 265                 270         


Gln Val Pro Cys Ser Arg Ala Asp Val Phe Asn Ser Lys Gln Leu Thr 
        275                 280                 285             


Met Val Glu Lys Arg Met Leu Met Lys Phe Leu Thr Phe Cys Met Glu 
    290                 295                 300                 


Tyr Glu Lys Tyr Pro Asp Glu Tyr Lys Gly Tyr Glu Glu Ile Thr Phe 
305                 310                 315                 320 


Tyr Glu Tyr Leu Lys Thr Gln Lys Leu Thr Pro Asn Leu Gln Tyr Ile 
                325                 330                 335     


Val Met His Ser Ile Ala Met Thr Ser Glu Thr Ala Ser Ser Thr Ile 
            340                 345                 350         


Asp Gly Leu Lys Ala Thr Lys Asn Phe Leu His Cys Leu Gly Arg Tyr 
        355                 360                 365             


Gly Asn Thr Pro Phe Leu Phe Pro Leu Tyr Gly Gln Gly Glu Leu Pro 
    370                 375                 380                 


Gln Cys Phe Cys Arg Met Cys Ala Val Phe Gly Gly Ile Tyr Cys Leu 
385                 390                 395                 400 


Arg His Ser Val Gln Cys Leu Val Val Asp Lys Glu Ser Arg Lys Cys 
                405                 410                 415     


Lys Ala Ile Ile Asp Gln Phe Gly Gln Arg Ile Ile Ser Glu His Phe 
            420                 425                 430         


Leu Val Glu Asp Ser Tyr Phe Pro Glu Asn Met Cys Ser Arg Val Gln 
        435                 440                 445             


Tyr Arg Gln Ile Ser Arg Ala Val Leu Ile Thr Asp Arg Ser Val Leu 
    450                 455                 460                 


Lys Thr Asp Ser Asp Gln Gln Ile Ser Ile Leu Thr Val Pro Ala Glu 
465                 470                 475                 480 


Glu Pro Gly Thr Phe Ala Val Arg Val Ile Glu Leu Cys Ser Ser Thr 
                485                 490                 495     


Met Thr Cys Met Lys Gly Thr Tyr Leu Val His Leu Thr Cys Thr Ser 
            500                 505                 510         


Ser Lys Thr Ala Arg Glu Asp Leu Glu Ser Val Val Gln Lys Leu Phe 
        515                 520                 525             


Val Pro Tyr Thr Glu Met Glu Ile Glu Asn Glu Gln Val Glu Lys Pro 
    530                 535                 540                 


Arg Ile Leu Trp Ala Leu Tyr Phe Asn Met Arg Asp Ser Ser Asp Ile 
545                 550                 555                 560 


Ser Arg Ser Cys Tyr Asn Asp Leu Pro Ser Asn Val Tyr Val Cys Ser 
                565                 570                 575     


Gly Pro Asp Cys Gly Leu Gly Asn Asp Asn Ala Val Lys Gln Ala Glu 
            580                 585                 590         


Thr Leu Phe Gln Glu Ile Cys Pro Asn Glu Asp Phe Cys Pro Pro Pro 
        595                 600                 605             


Pro Asn Pro Glu Asp Ile Ile Leu Asp Gly Asp Ser Leu Gln Pro Glu 
    610                 615                 620                 


Ala Ser Glu Ser Ser Ala Ile Pro Glu Ala Asn Ser Glu Thr Phe Lys 
625                 630                 635                 640 


Glu Ser Thr Asn Leu Gly Asn Leu Glu Glu Ser Ser Glu 
                645                 650             


<210>  3
<211>  1962
<212>  DNA
<213>  Homo sapiens


<220>
<221>  CDS
<222>  (1)..(1962)

<400>  3
atg gcg gat act ctc cct tcg gag ttt gat gtg atc gta ata ggg acg         48
Met Ala Asp Thr Leu Pro Ser Glu Phe Asp Val Ile Val Ile Gly Thr           
1               5                   10                  15                

ggt ttg cct gaa tcc atc att gca gct gca tgt tca aga agt ggc cgg         96
Gly Leu Pro Glu Ser Ile Ile Ala Ala Ala Cys Ser Arg Ser Gly Arg           
            20                  25                  30                    

aga gtt ctg cat gtt gat tca aga agc tac tat gga gga aac tgg gcc        144
Arg Val Leu His Val Asp Ser Arg Ser Tyr Tyr Gly Gly Asn Trp Ala           
        35                  40                  45                        

agt ttt agc ttt tca gga cta ttg tcc tgg cta aag gaa tac cag gaa        192
Ser Phe Ser Phe Ser Gly Leu Leu Ser Trp Leu Lys Glu Tyr Gln Glu           
    50                  55                  60                            

aac agt gac att gta agt gac agt cca gtg tgg caa gac cag atc ctt        240
Asn Ser Asp Ile Val Ser Asp Ser Pro Val Trp Gln Asp Gln Ile Leu           
65                  70                  75                  80            

gaa aat gaa gaa gcc att gct ctt agc agg aag gac aaa act att caa        288
Glu Asn Glu Glu Ala Ile Ala Leu Ser Arg Lys Asp Lys Thr Ile Gln           
                85                  90                  95                

cat gtg gaa gta ttt tgt tat gcc agt cag gat ttg cat gaa gat gtc        336
His Val Glu Val Phe Cys Tyr Ala Ser Gln Asp Leu His Glu Asp Val           
            100                 105                 110                   

gaa gaa gct ggt gca ctg cag aaa aat cat gct ctt gtg aca tct gca        384
Glu Glu Ala Gly Ala Leu Gln Lys Asn His Ala Leu Val Thr Ser Ala           
        115                 120                 125                       

aac tcc aca gaa gct gca gat tct gcc ttc ctg cct acg gag gat gag        432
Asn Ser Thr Glu Ala Ala Asp Ser Ala Phe Leu Pro Thr Glu Asp Glu           
    130                 135                 140                           

tca tta agc act atg agc tgt gaa atg ctc aca gaa caa act cca agc        480
Ser Leu Ser Thr Met Ser Cys Glu Met Leu Thr Glu Gln Thr Pro Ser           
145                 150                 155                 160           

agc gat cca gag aat gcg cta gaa gta aat ggt gct gaa gtg aca ggg        528
Ser Asp Pro Glu Asn Ala Leu Glu Val Asn Gly Ala Glu Val Thr Gly           
                165                 170                 175               

gaa aaa gaa aac cat tgt gat gat aaa act tgt gtg cca tca act tca        576
Glu Lys Glu Asn His Cys Asp Asp Lys Thr Cys Val Pro Ser Thr Ser           
            180                 185                 190                   

gca gaa gac atg agt gaa aat gtg cct ata gca gaa gat acc aca gag        624
Ala Glu Asp Met Ser Glu Asn Val Pro Ile Ala Glu Asp Thr Thr Glu           
        195                 200                 205                       

caa cca aag aaa aac aga att act tac tca caa att att aaa gaa ggc        672
Gln Pro Lys Lys Asn Arg Ile Thr Tyr Ser Gln Ile Ile Lys Glu Gly           
    210                 215                 220                           

agg aga ttt aat att gat tta gta tca aag ctg ctg tat tct cga gga        720
Arg Arg Phe Asn Ile Asp Leu Val Ser Lys Leu Leu Tyr Ser Arg Gly           
225                 230                 235                 240           

tta cta att gat ctt cta atc aaa tct aat gtt agt cga tat gca gag        768
Leu Leu Ile Asp Leu Leu Ile Lys Ser Asn Val Ser Arg Tyr Ala Glu           
                245                 250                 255               

ttt aaa aat att acc agg att ctt gca ttt cga gaa gga cga gtg gaa        816
Phe Lys Asn Ile Thr Arg Ile Leu Ala Phe Arg Glu Gly Arg Val Glu           
            260                 265                 270                   

cag gtt ccg tgt tcc aga gca gat gtc ttt aat agc aaa caa ctt act        864
Gln Val Pro Cys Ser Arg Ala Asp Val Phe Asn Ser Lys Gln Leu Thr           
        275                 280                 285                       

atg gta gaa aag cga atg cta atg aaa ttt ctt aca ttt tgt atg gaa        912
Met Val Glu Lys Arg Met Leu Met Lys Phe Leu Thr Phe Cys Met Glu           
    290                 295                 300                           

tat gag aaa tat cct gat gaa tat aaa gga tat gaa gag atc aca ttt        960
Tyr Glu Lys Tyr Pro Asp Glu Tyr Lys Gly Tyr Glu Glu Ile Thr Phe           
305                 310                 315                 320           

tat gaa tat tta aag act caa aaa tta acc ccc aac ctc caa tat att       1008
Tyr Glu Tyr Leu Lys Thr Gln Lys Leu Thr Pro Asn Leu Gln Tyr Ile           
                325                 330                 335               

gtc atg cat tca att gca atg aca tca gag aca gcc agc agc acc ata       1056
Val Met His Ser Ile Ala Met Thr Ser Glu Thr Ala Ser Ser Thr Ile           
            340                 345                 350                   

gat ggt ctc aaa gct acc aaa aac ttt ctt cac tgt ctt ggg cgg tat       1104
Asp Gly Leu Lys Ala Thr Lys Asn Phe Leu His Cys Leu Gly Arg Tyr           
        355                 360                 365                       

ggc aac act cca ttt ttg ttt cct tta tat ggc caa gga gaa ctc ccc       1152
Gly Asn Thr Pro Phe Leu Phe Pro Leu Tyr Gly Gln Gly Glu Leu Pro           
    370                 375                 380                           

cag tgt ttc tgc agg atg tgt gct gtg ttt ggt gga att tat tgt ctt       1200
Gln Cys Phe Cys Arg Met Cys Ala Val Phe Gly Gly Ile Tyr Cys Leu           
385                 390                 395                 400           

cgc cat tca gta cag tgc ctt gta gtg gac aaa gaa tcc aga aaa tgt       1248
Arg His Ser Val Gln Cys Leu Val Val Asp Lys Glu Ser Arg Lys Cys           
                405                 410                 415               

aaa gca att ata gat cag ttt ggt cag aga ata atc tct gag cat ttc       1296
Lys Ala Ile Ile Asp Gln Phe Gly Gln Arg Ile Ile Ser Glu His Phe           
            420                 425                 430                   

ctc gtg gag gac agt tac ttt cct gag aac atg tgc tca cgt gtg caa       1344
Leu Val Glu Asp Ser Tyr Phe Pro Glu Asn Met Cys Ser Arg Val Gln           
        435                 440                 445                       

tac agg cag atc tcc agg gca gtg ctg att aca gat aga tct gtc cta       1392
Tyr Arg Gln Ile Ser Arg Ala Val Leu Ile Thr Asp Arg Ser Val Leu           
    450                 455                 460                           

aaa aca gat tca gat caa cag att tcc att ttg aca gtg cca gca gag       1440
Lys Thr Asp Ser Asp Gln Gln Ile Ser Ile Leu Thr Val Pro Ala Glu           
465                 470                 475                 480           

gaa cca gga act ttt gct gtt cgg gtc att gag tta tgt tct tca acg       1488
Glu Pro Gly Thr Phe Ala Val Arg Val Ile Glu Leu Cys Ser Ser Thr           
                485                 490                 495               

atg aca tgc atg aaa ggc acc tat ttg gtt cat ttg act tgc aca tct       1536
Met Thr Cys Met Lys Gly Thr Tyr Leu Val His Leu Thr Cys Thr Ser           
            500                 505                 510                   

tct aaa aca gca aga gaa gat tta gaa tca gtt gtg cag aaa ttg ttt       1584
Ser Lys Thr Ala Arg Glu Asp Leu Glu Ser Val Val Gln Lys Leu Phe           
        515                 520                 525                       

gtt cca tat act gaa atg gag ata gaa aat gaa caa gta gaa aag cca       1632
Val Pro Tyr Thr Glu Met Glu Ile Glu Asn Glu Gln Val Glu Lys Pro           
    530                 535                 540                           

aga att ctg tgg gct ctt tac ttc aat atg aga gat tcg tca gac atc       1680
Arg Ile Leu Trp Ala Leu Tyr Phe Asn Met Arg Asp Ser Ser Asp Ile           
545                 550                 555                 560           

agc agg agc tgt tat aat gat tta cca tcc aac gtt tat gtc tgc tct       1728
Ser Arg Ser Cys Tyr Asn Asp Leu Pro Ser Asn Val Tyr Val Cys Ser           
                565                 570                 575               

ggc cca gat tgt ggt tta gga aat gat aat gca gtc aaa cag gct gaa       1776
Gly Pro Asp Cys Gly Leu Gly Asn Asp Asn Ala Val Lys Gln Ala Glu           
            580                 585                 590                   

aca ctt ttc cag gaa atc tgc ccc aat gaa gat ttc tgt ccc cct cca       1824
Thr Leu Phe Gln Glu Ile Cys Pro Asn Glu Asp Phe Cys Pro Pro Pro           
        595                 600                 605                       

cca aat cct gaa gac att atc ctt gat gga gac agt tta cag cca gag       1872
Pro Asn Pro Glu Asp Ile Ile Leu Asp Gly Asp Ser Leu Gln Pro Glu           
    610                 615                 620                           

gct tca gaa tcc agt gcc ata cca gag gct aac tcg gag act ttc aag       1920
Ala Ser Glu Ser Ser Ala Ile Pro Glu Ala Asn Ser Glu Thr Phe Lys           
625                 630                 635                 640           

gaa agc aca aac ctt gga aac cta gag gag tcc tct gaa taa               1962
Glu Ser Thr Asn Leu Gly Asn Leu Glu Glu Ser Ser Glu                       
                645                 650                                   


<210>  4
<211>  653
<212>  PRT
<213>  Homo sapiens

<400>  4

Met Ala Asp Thr Leu Pro Ser Glu Phe Asp Val Ile Val Ile Gly Thr 
1               5                   10                  15      


Gly Leu Pro Glu Ser Ile Ile Ala Ala Ala Cys Ser Arg Ser Gly Arg 
            20                  25                  30          


Arg Val Leu His Val Asp Ser Arg Ser Tyr Tyr Gly Gly Asn Trp Ala 
        35                  40                  45              


Ser Phe Ser Phe Ser Gly Leu Leu Ser Trp Leu Lys Glu Tyr Gln Glu 
    50                  55                  60                  


Asn Ser Asp Ile Val Ser Asp Ser Pro Val Trp Gln Asp Gln Ile Leu 
65                  70                  75                  80  


Glu Asn Glu Glu Ala Ile Ala Leu Ser Arg Lys Asp Lys Thr Ile Gln 
                85                  90                  95      


His Val Glu Val Phe Cys Tyr Ala Ser Gln Asp Leu His Glu Asp Val 
            100                 105                 110         


Glu Glu Ala Gly Ala Leu Gln Lys Asn His Ala Leu Val Thr Ser Ala 
        115                 120                 125             


Asn Ser Thr Glu Ala Ala Asp Ser Ala Phe Leu Pro Thr Glu Asp Glu 
    130                 135                 140                 


Ser Leu Ser Thr Met Ser Cys Glu Met Leu Thr Glu Gln Thr Pro Ser 
145                 150                 155                 160 


Ser Asp Pro Glu Asn Ala Leu Glu Val Asn Gly Ala Glu Val Thr Gly 
                165                 170                 175     


Glu Lys Glu Asn His Cys Asp Asp Lys Thr Cys Val Pro Ser Thr Ser 
            180                 185                 190         


Ala Glu Asp Met Ser Glu Asn Val Pro Ile Ala Glu Asp Thr Thr Glu 
        195                 200                 205             


Gln Pro Lys Lys Asn Arg Ile Thr Tyr Ser Gln Ile Ile Lys Glu Gly 
    210                 215                 220                 


Arg Arg Phe Asn Ile Asp Leu Val Ser Lys Leu Leu Tyr Ser Arg Gly 
225                 230                 235                 240 


Leu Leu Ile Asp Leu Leu Ile Lys Ser Asn Val Ser Arg Tyr Ala Glu 
                245                 250                 255     


Phe Lys Asn Ile Thr Arg Ile Leu Ala Phe Arg Glu Gly Arg Val Glu 
            260                 265                 270         


Gln Val Pro Cys Ser Arg Ala Asp Val Phe Asn Ser Lys Gln Leu Thr 
        275                 280                 285             


Met Val Glu Lys Arg Met Leu Met Lys Phe Leu Thr Phe Cys Met Glu 
    290                 295                 300                 


Tyr Glu Lys Tyr Pro Asp Glu Tyr Lys Gly Tyr Glu Glu Ile Thr Phe 
305                 310                 315                 320 


Tyr Glu Tyr Leu Lys Thr Gln Lys Leu Thr Pro Asn Leu Gln Tyr Ile 
                325                 330                 335     


Val Met His Ser Ile Ala Met Thr Ser Glu Thr Ala Ser Ser Thr Ile 
            340                 345                 350         


Asp Gly Leu Lys Ala Thr Lys Asn Phe Leu His Cys Leu Gly Arg Tyr 
        355                 360                 365             


Gly Asn Thr Pro Phe Leu Phe Pro Leu Tyr Gly Gln Gly Glu Leu Pro 
    370                 375                 380                 


Gln Cys Phe Cys Arg Met Cys Ala Val Phe Gly Gly Ile Tyr Cys Leu 
385                 390                 395                 400 


Arg His Ser Val Gln Cys Leu Val Val Asp Lys Glu Ser Arg Lys Cys 
                405                 410                 415     


Lys Ala Ile Ile Asp Gln Phe Gly Gln Arg Ile Ile Ser Glu His Phe 
            420                 425                 430         


Leu Val Glu Asp Ser Tyr Phe Pro Glu Asn Met Cys Ser Arg Val Gln 
        435                 440                 445             


Tyr Arg Gln Ile Ser Arg Ala Val Leu Ile Thr Asp Arg Ser Val Leu 
    450                 455                 460                 


Lys Thr Asp Ser Asp Gln Gln Ile Ser Ile Leu Thr Val Pro Ala Glu 
465                 470                 475                 480 


Glu Pro Gly Thr Phe Ala Val Arg Val Ile Glu Leu Cys Ser Ser Thr 
                485                 490                 495     


Met Thr Cys Met Lys Gly Thr Tyr Leu Val His Leu Thr Cys Thr Ser 
            500                 505                 510         


Ser Lys Thr Ala Arg Glu Asp Leu Glu Ser Val Val Gln Lys Leu Phe 
        515                 520                 525             


Val Pro Tyr Thr Glu Met Glu Ile Glu Asn Glu Gln Val Glu Lys Pro 
    530                 535                 540                 


Arg Ile Leu Trp Ala Leu Tyr Phe Asn Met Arg Asp Ser Ser Asp Ile 
545                 550                 555                 560 


Ser Arg Ser Cys Tyr Asn Asp Leu Pro Ser Asn Val Tyr Val Cys Ser 
                565                 570                 575     


Gly Pro Asp Cys Gly Leu Gly Asn Asp Asn Ala Val Lys Gln Ala Glu 
            580                 585                 590         


Thr Leu Phe Gln Glu Ile Cys Pro Asn Glu Asp Phe Cys Pro Pro Pro 
        595                 600                 605             


Pro Asn Pro Glu Asp Ile Ile Leu Asp Gly Asp Ser Leu Gln Pro Glu 
    610                 615                 620                 


Ala Ser Glu Ser Ser Ala Ile Pro Glu Ala Asn Ser Glu Thr Phe Lys 
625                 630                 635                 640 


Glu Ser Thr Asn Leu Gly Asn Leu Glu Glu Ser Ser Glu 
                645                 650             


<210>  5
<211>  1985
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed plasmid


<220>
<221>  misc_feature
<222>  (1)..(8)
<223>  NotI restriction site for subcloning into proviral plasmid

<220>
<221>  misc_feature
<222>  (4)..(16)
<223>  Kozak consensus sequence

<220>
<221>  CDS
<222>  (13)..(1971)
<223>  codon-optimized open reading frame (ORF)

<220>
<221>  misc_feature
<222>  (1972)..(1977)
<223>  BclI restriction site with embedded stop codon; site for adding 
       an optional epitope tag

<220>
<221>  misc_feature
<222>  (1980)..(1985)
<223>  BamHI restriction site for subcloning into proviral plasmid

<400>  5
gcggccgcca cc atg gct gat acc ctg ccc tct gaa ttc gac gtg att gtg       51
              Met Ala Asp Thr Leu Pro Ser Glu Phe Asp Val Ile Val         
              1               5                   10                      

att gga acc gga ctc cct gaa tcg atc atc gcc gcg gcc tgt tcc cgg         99
Ile Gly Thr Gly Leu Pro Glu Ser Ile Ile Ala Ala Ala Cys Ser Arg           
    15                  20                  25                            

tcc ggt cgg cgc gtg ctg cac gtc gat tcg aga agc tac tac gga ggg        147
Ser Gly Arg Arg Val Leu His Val Asp Ser Arg Ser Tyr Tyr Gly Gly           
30                  35                  40                  45            

aat tgg gcc tca ttc tcc ttc tcc gga ctg ctc tcc tgg ctg aag gag        195
Asn Trp Ala Ser Phe Ser Phe Ser Gly Leu Leu Ser Trp Leu Lys Glu           
                50                  55                  60                

tat cag gag aac tcc gac att gtc tcc gac tca cct gtg tgg cag gac        243
Tyr Gln Glu Asn Ser Asp Ile Val Ser Asp Ser Pro Val Trp Gln Asp           
            65                  70                  75                    

cag atc ctg gaa aac gag gaa gca ata gcc ctg agc cgg aag gac aag        291
Gln Ile Leu Glu Asn Glu Glu Ala Ile Ala Leu Ser Arg Lys Asp Lys           
        80                  85                  90                        

acc atc cag cac gtg gag gtg ttc tgt tat gcc tcc caa gac ctc cat        339
Thr Ile Gln His Val Glu Val Phe Cys Tyr Ala Ser Gln Asp Leu His           
    95                  100                 105                           

gag gac gtg gaa gag gct gga gcg ttg cag aag aat cat gcc ctc gtg        387
Glu Asp Val Glu Glu Ala Gly Ala Leu Gln Lys Asn His Ala Leu Val           
110                 115                 120                 125           

acc tcc gct aac tcc acc gag gca gcc gac agc gcc ttc ctg ccg acc        435
Thr Ser Ala Asn Ser Thr Glu Ala Ala Asp Ser Ala Phe Leu Pro Thr           
                130                 135                 140               

gag gat gaa tcc ctg tca act atg tcg tgc gaa atg ctg acc gaa cag        483
Glu Asp Glu Ser Leu Ser Thr Met Ser Cys Glu Met Leu Thr Glu Gln           
            145                 150                 155                   

act ccg agc tcc gac ccc gaa aac gcc ctg gaa gtg aac gga gcg gaa        531
Thr Pro Ser Ser Asp Pro Glu Asn Ala Leu Glu Val Asn Gly Ala Glu           
        160                 165                 170                       

gtg acc ggc gaa aag gag aac cat tgc gac gac aag act tgt gtc cca        579
Val Thr Gly Glu Lys Glu Asn His Cys Asp Asp Lys Thr Cys Val Pro           
    175                 180                 185                           

tcc act tcc gcg gag gac atg tcc gag aat gtg cct atc gcc gag gac        627
Ser Thr Ser Ala Glu Asp Met Ser Glu Asn Val Pro Ile Ala Glu Asp           
190                 195                 200                 205           

acc acc gaa cag ccc aag aag aac aga atc acg tac agc cag atc atc        675
Thr Thr Glu Gln Pro Lys Lys Asn Arg Ile Thr Tyr Ser Gln Ile Ile           
                210                 215                 220               

aag gag ggg cgg agg ttt aac atc gat ctg gtg tcg aag ctg ctg tac        723
Lys Glu Gly Arg Arg Phe Asn Ile Asp Leu Val Ser Lys Leu Leu Tyr           
            225                 230                 235                   

agc cgc ggt ctg ctg atc gat ctg ctc att aag tcg aac gtg tcg aga        771
Ser Arg Gly Leu Leu Ile Asp Leu Leu Ile Lys Ser Asn Val Ser Arg           
        240                 245                 250                       

tac gcc gag ttc aag aac atc aca agg att ctc gcc ttc cgg gaa gga        819
Tyr Ala Glu Phe Lys Asn Ile Thr Arg Ile Leu Ala Phe Arg Glu Gly           
    255                 260                 265                           

aga gtg gaa caa gtg ccg tgc tcc cgg gcc gac gtg ttc aac tca aag        867
Arg Val Glu Gln Val Pro Cys Ser Arg Ala Asp Val Phe Asn Ser Lys           
270                 275                 280                 285           

caa ctt acc atg gtg gaa aag cgc atg ctg atg aaa ttc ctg acc ttc        915
Gln Leu Thr Met Val Glu Lys Arg Met Leu Met Lys Phe Leu Thr Phe           
                290                 295                 300               

tgc atg gag tac gaa aag tac cct gat gag tac aag ggt tac gaa gaa        963
Cys Met Glu Tyr Glu Lys Tyr Pro Asp Glu Tyr Lys Gly Tyr Glu Glu           
            305                 310                 315                   

att act ttc tac gag tac ctc aag acc cag aag ctg acc ccg aat ctg       1011
Ile Thr Phe Tyr Glu Tyr Leu Lys Thr Gln Lys Leu Thr Pro Asn Leu           
        320                 325                 330                       

cag tac att gtg atg cac tca atc gca atg acc tcc gaa acc gcc tcc       1059
Gln Tyr Ile Val Met His Ser Ile Ala Met Thr Ser Glu Thr Ala Ser           
    335                 340                 345                           

tcg acc atc gac ggg ctc aag gcc acc aag aac ttc ctg cac tgt ttg       1107
Ser Thr Ile Asp Gly Leu Lys Ala Thr Lys Asn Phe Leu His Cys Leu           
350                 355                 360                 365           

ggg cgc tac ggc aac act ccg ttc ctc ttc ccg ctg tac ggc cag gga       1155
Gly Arg Tyr Gly Asn Thr Pro Phe Leu Phe Pro Leu Tyr Gly Gln Gly           
                370                 375                 380               

gag ctg cct cag tgt ttc tgc cgg atg tgc gcc gtg ttc ggc gga atc       1203
Glu Leu Pro Gln Cys Phe Cys Arg Met Cys Ala Val Phe Gly Gly Ile           
            385                 390                 395                   

tac tgt ctc cgc cac tcg gtc cag tgc ctg gtg gtg gac aag gaa tcc       1251
Tyr Cys Leu Arg His Ser Val Gln Cys Leu Val Val Asp Lys Glu Ser           
        400                 405                 410                       

agg aag tgc aaa gcc att att gac cag ttc gga caa cgg atc att tcc       1299
Arg Lys Cys Lys Ala Ile Ile Asp Gln Phe Gly Gln Arg Ile Ile Ser           
    415                 420                 425                           

gag cac ttt ctt gtg gag gac tca tac ttc ccg gag aac atg tgc tct       1347
Glu His Phe Leu Val Glu Asp Ser Tyr Phe Pro Glu Asn Met Cys Ser           
430                 435                 440                 445           

cgg gtc cag tat cga cag att tcc agg gcg gtg ctc att act gac cgg       1395
Arg Val Gln Tyr Arg Gln Ile Ser Arg Ala Val Leu Ile Thr Asp Arg           
                450                 455                 460               

agc gtc ctc aag acc gat agc gac cag cag atc tcc atc ctg acc gtg       1443
Ser Val Leu Lys Thr Asp Ser Asp Gln Gln Ile Ser Ile Leu Thr Val           
            465                 470                 475                   

ccg gcg gaa gaa ccc ggc act ttt gcc gtg cgc gtg atc gag ctt tgc       1491
Pro Ala Glu Glu Pro Gly Thr Phe Ala Val Arg Val Ile Glu Leu Cys           
        480                 485                 490                       

tca tcc acc atg act tgc atg aaa ggc act tac ctg gtg cac ctg acg       1539
Ser Ser Thr Met Thr Cys Met Lys Gly Thr Tyr Leu Val His Leu Thr           
    495                 500                 505                           

tgc acc tca tcg aaa acc gct aga gag gac ctg gaa tcc gtc gtc caa       1587
Cys Thr Ser Ser Lys Thr Ala Arg Glu Asp Leu Glu Ser Val Val Gln           
510                 515                 520                 525           

aag ctg ttc gtg cct tac acc gag atg gaa att gaa aac gaa caa gtg       1635
Lys Leu Phe Val Pro Tyr Thr Glu Met Glu Ile Glu Asn Glu Gln Val           
                530                 535                 540               

gag aag ccc cgc atc ctt tgg gcc ctg tac ttt aac atg cgc gat tcc       1683
Glu Lys Pro Arg Ile Leu Trp Ala Leu Tyr Phe Asn Met Arg Asp Ser           
            545                 550                 555                   

tcc gat atc tcg cgg tcc tgc tat aac gac ttg cct tcg aac gtc tac       1731
Ser Asp Ile Ser Arg Ser Cys Tyr Asn Asp Leu Pro Ser Asn Val Tyr           
        560                 565                 570                       

gtc tgc tcc ggg cca gac tgc ggt ctt ggc aac gac aat gcc gtg aag       1779
Val Cys Ser Gly Pro Asp Cys Gly Leu Gly Asn Asp Asn Ala Val Lys           
    575                 580                 585                           

cag gcg gaa aca ctg ttc caa gag atc tgc cct aac gag gat ttt tgc       1827
Gln Ala Glu Thr Leu Phe Gln Glu Ile Cys Pro Asn Glu Asp Phe Cys           
590                 595                 600                 605           

ccg ccc ccc cca aac ccc gag gat atc atc ttg gac gga gac agc ctg       1875
Pro Pro Pro Pro Asn Pro Glu Asp Ile Ile Leu Asp Gly Asp Ser Leu           
                610                 615                 620               

cag cca gaa gca tcc gag tcc agc gcc atc ccg gag gcc aac agc gaa       1923
Gln Pro Glu Ala Ser Glu Ser Ser Ala Ile Pro Glu Ala Asn Ser Glu           
            625                 630                 635                   

acc ttc aag gag agc act aac ctg ggc aac ctg gaa gag tcc agc gaa       1971
Thr Phe Lys Glu Ser Thr Asn Leu Gly Asn Leu Glu Glu Ser Ser Glu           
        640                 645                 650                       

tgatcatagg atcc                                                       1985


<210>  6
<211>  653
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  6

Met Ala Asp Thr Leu Pro Ser Glu Phe Asp Val Ile Val Ile Gly Thr 
1               5                   10                  15      


Gly Leu Pro Glu Ser Ile Ile Ala Ala Ala Cys Ser Arg Ser Gly Arg 
            20                  25                  30          


Arg Val Leu His Val Asp Ser Arg Ser Tyr Tyr Gly Gly Asn Trp Ala 
        35                  40                  45              


Ser Phe Ser Phe Ser Gly Leu Leu Ser Trp Leu Lys Glu Tyr Gln Glu 
    50                  55                  60                  


Asn Ser Asp Ile Val Ser Asp Ser Pro Val Trp Gln Asp Gln Ile Leu 
65                  70                  75                  80  


Glu Asn Glu Glu Ala Ile Ala Leu Ser Arg Lys Asp Lys Thr Ile Gln 
                85                  90                  95      


His Val Glu Val Phe Cys Tyr Ala Ser Gln Asp Leu His Glu Asp Val 
            100                 105                 110         


Glu Glu Ala Gly Ala Leu Gln Lys Asn His Ala Leu Val Thr Ser Ala 
        115                 120                 125             


Asn Ser Thr Glu Ala Ala Asp Ser Ala Phe Leu Pro Thr Glu Asp Glu 
    130                 135                 140                 


Ser Leu Ser Thr Met Ser Cys Glu Met Leu Thr Glu Gln Thr Pro Ser 
145                 150                 155                 160 


Ser Asp Pro Glu Asn Ala Leu Glu Val Asn Gly Ala Glu Val Thr Gly 
                165                 170                 175     


Glu Lys Glu Asn His Cys Asp Asp Lys Thr Cys Val Pro Ser Thr Ser 
            180                 185                 190         


Ala Glu Asp Met Ser Glu Asn Val Pro Ile Ala Glu Asp Thr Thr Glu 
        195                 200                 205             


Gln Pro Lys Lys Asn Arg Ile Thr Tyr Ser Gln Ile Ile Lys Glu Gly 
    210                 215                 220                 


Arg Arg Phe Asn Ile Asp Leu Val Ser Lys Leu Leu Tyr Ser Arg Gly 
225                 230                 235                 240 


Leu Leu Ile Asp Leu Leu Ile Lys Ser Asn Val Ser Arg Tyr Ala Glu 
                245                 250                 255     


Phe Lys Asn Ile Thr Arg Ile Leu Ala Phe Arg Glu Gly Arg Val Glu 
            260                 265                 270         


Gln Val Pro Cys Ser Arg Ala Asp Val Phe Asn Ser Lys Gln Leu Thr 
        275                 280                 285             


Met Val Glu Lys Arg Met Leu Met Lys Phe Leu Thr Phe Cys Met Glu 
    290                 295                 300                 


Tyr Glu Lys Tyr Pro Asp Glu Tyr Lys Gly Tyr Glu Glu Ile Thr Phe 
305                 310                 315                 320 


Tyr Glu Tyr Leu Lys Thr Gln Lys Leu Thr Pro Asn Leu Gln Tyr Ile 
                325                 330                 335     


Val Met His Ser Ile Ala Met Thr Ser Glu Thr Ala Ser Ser Thr Ile 
            340                 345                 350         


Asp Gly Leu Lys Ala Thr Lys Asn Phe Leu His Cys Leu Gly Arg Tyr 
        355                 360                 365             


Gly Asn Thr Pro Phe Leu Phe Pro Leu Tyr Gly Gln Gly Glu Leu Pro 
    370                 375                 380                 


Gln Cys Phe Cys Arg Met Cys Ala Val Phe Gly Gly Ile Tyr Cys Leu 
385                 390                 395                 400 


Arg His Ser Val Gln Cys Leu Val Val Asp Lys Glu Ser Arg Lys Cys 
                405                 410                 415     


Lys Ala Ile Ile Asp Gln Phe Gly Gln Arg Ile Ile Ser Glu His Phe 
            420                 425                 430         


Leu Val Glu Asp Ser Tyr Phe Pro Glu Asn Met Cys Ser Arg Val Gln 
        435                 440                 445             


Tyr Arg Gln Ile Ser Arg Ala Val Leu Ile Thr Asp Arg Ser Val Leu 
    450                 455                 460                 


Lys Thr Asp Ser Asp Gln Gln Ile Ser Ile Leu Thr Val Pro Ala Glu 
465                 470                 475                 480 


Glu Pro Gly Thr Phe Ala Val Arg Val Ile Glu Leu Cys Ser Ser Thr 
                485                 490                 495     


Met Thr Cys Met Lys Gly Thr Tyr Leu Val His Leu Thr Cys Thr Ser 
            500                 505                 510         


Ser Lys Thr Ala Arg Glu Asp Leu Glu Ser Val Val Gln Lys Leu Phe 
        515                 520                 525             


Val Pro Tyr Thr Glu Met Glu Ile Glu Asn Glu Gln Val Glu Lys Pro 
    530                 535                 540                 


Arg Ile Leu Trp Ala Leu Tyr Phe Asn Met Arg Asp Ser Ser Asp Ile 
545                 550                 555                 560 


Ser Arg Ser Cys Tyr Asn Asp Leu Pro Ser Asn Val Tyr Val Cys Ser 
                565                 570                 575     


Gly Pro Asp Cys Gly Leu Gly Asn Asp Asn Ala Val Lys Gln Ala Glu 
            580                 585                 590         


Thr Leu Phe Gln Glu Ile Cys Pro Asn Glu Asp Phe Cys Pro Pro Pro 
        595                 600                 605             


Pro Asn Pro Glu Asp Ile Ile Leu Asp Gly Asp Ser Leu Gln Pro Glu 
    610                 615                 620                 


Ala Ser Glu Ser Ser Ala Ile Pro Glu Ala Asn Ser Glu Thr Phe Lys 
625                 630                 635                 640 


Glu Ser Thr Asn Leu Gly Asn Leu Glu Glu Ser Ser Glu 
                645                 650             


<210>  7
<211>  9187
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed plasmid


<220>
<221>  misc_feature
<222>  (1)..(145)
<223>  5' ITR

<220>
<221>  promoter
<222>  (169)..(1786)
<223>  CMV.CBA promoter

<220>
<221>  misc_feature
<222>  (1787)..(1794)
<223>  NotI cloning site, cuts at 1789

<220>
<221>  misc_feature
<222>  (1805)..(1810)
<223>  BamHI cloning site, cuts at 1806

<220>
<221>  polyA_signal
<222>  (1850)..(2052)
<223>  BGH PolyA

<220>
<221>  misc_feature
<222>  (2109)..(2252)
<223>  3' ITR

<220>
<221>  misc_feature
<222>  (2571)..(6624)
<223>  lambda stuffer

<220>
<221>  misc_feature
<222>  (7314)..(8126)
<223>  Kanamycin resistance (complementary)

<220>
<221>  misc_feature
<222>  (8485)..(9128)
<223>  Origin of replication (complementary)

<400>  7
tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt cgggcgacct ttggtcgccc       60

ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc aactccatca ctaggggttc      120

ctgcggccta gtaggctcag aggcacacag gagtttctgc aaatctagtg caggcgttac      180

ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc cattgacgtc      240

aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac gtcaatgggt      300

ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata tgccaagtac      360

gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc agtacatgac      420

cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta ttaacatggt      480

cgaggtgagc cccacgttct gcttcactct ccccatctcc cccccctccc cacccccaat      540

tttgtattta tttatttttt aattattttg tgcagcgatg ggggcggggg gggggggggg      600

gcgcgcgcca ggcggggcgg ggcggggcga ggggcggggc ggggcgaggc ggagaggtgc      660

ggcggcagcc aatcagagcg gcgcgctccg aaagtttcct tttatggcga ggcggcggcg      720

gcggcggccc tataaaaagc gaagcgcgcg gcgggcgggg agtcgctgcg acgctgcctt      780

cgccccgtgc cccgctccgc cgccgcctcg cgccgcccgc cccggctctg actgaccgcg      840

ttactcccac aggtgagcgg gcgggacggc ccttctcctc cgggctgtaa ttagcgcttg      900

gtttaatgac ggcttgtttc ttttctgtgg ctgcgtgaaa gccttgaggg gctccgggag      960

ggccctttgt gcggggggag cggctcgggg ggtgcgtgcg tgtgtgtgtg cgtggggagc     1020

gccgcgtgcg gctccgcgct gcccggcggc tgtgagcgct gcgggcgcgg cgcggggctt     1080

tgtgcgctcc gcagtgtgcg cgaggggagc gcggccgggg gcggtgcccc gcggtgcggg     1140

gggggctgcg aggggaacaa aggctgcgtg cggggtgtgt gcgtgggggg gtgagcaggg     1200

ggtgtgggcg cgtcggtcgg gctgcaaccc cccctgcacc cccctccccg agttgctgag     1260

cacggcccgg cttcgggtgc ggggctccgt acggggcgtg gcgcggggct cgccgtgccg     1320

ggcggggggt ggcggcaggt gggggtgccg ggcggggcgg ggccgcctcg ggccggggag     1380

ggctcggggg aggggcgcgg cggcccccgg agcgccggcg gctgtcgagg cgcggcgagc     1440

cgcagccatt gccttttatg gtaatcgtgc gagagggcgc agggacttcc tttgtcccaa     1500

atctgtgcgg agccgaaatc tgggaggcgc cgccgcaccc cctctagcgg gcgcggggcg     1560

aagcggtgcg gcgccggcag gaaggaaatg ggcggggagg gccttcgtgc gtcgccgcgc     1620

cgccgtcccc ttctccctct ccagcctcgg ggctgtccgc ggggggacgg ctgccttcgg     1680

gggggacggg gcagggcggg gttcggcttc tggcgtgtga ccggcggctc tagacaattg     1740

tactaacctt cttctctttc ctctcctgac aggttggtgt acactagcgg ccgcatagta     1800

ctgcggatcc tgcagatctc gagccgaatt cctgcagccc gggggatcag cctcgactgt     1860

gccttctagt tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga     1920

aggtgccact cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag     1980

taggtgtcat tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga     2040

agacaatagc aggcatgctg gggatgcggt gggctctatg gcttctgagg cggaaagaac     2100

cagctggggc tcgagatcca ctagggccgc aggaacccct agtgatggag ttggccactc     2160

cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc cgacgcccgg     2220

gctttgcccg ggcggcctca gtgagcgagc gacctgcagg ggcagcttga aggaaatact     2280

aaggcaaagg tactgcaagt gctcgcaaca ttcgcttatg cggattattg ccgtagtgcc     2340

gcgacgccgg gggcaagatg cagagattgc catggtacag gccgtgcggt tgatattgcc     2400

aaaacagagc tgtgggggag agttgtcgag aaagagtgcg gaagatgcaa aggcgtcggc     2460

tattcaagga tgccagcaag cgcagcatat cgcgctgtga cgatgctaat cccaaacctt     2520

acccaaccca cctggtcacg cactgttaag ccgctgtatg acgctctggt ggtgcaatgc     2580

cacaaagaag agtcaatcgc agacaacatt ttgaatgcgg tcacacgtta gcagcatgat     2640

tgccacggat ggcaacatat taacggcatg atattgactt attgaataaa attgggtaaa     2700

tttgactcaa cgatgggtta attcgctcgt tgtggtagtg agatgaaaag aggcggcgct     2760

tactaccgat tccgcctagt tggtcacttc gacgtatcgt ctggaactcc aaccatcgca     2820

ggcagagagg tctgcaaaat gcaatcccga aacagttcgc aggtaatagt tagagcctgc     2880

ataacggttt cgggattttt tatatctgca caacaggtaa gagcattgag tcgataatcg     2940

tgaagagtcg gcgagcctgg ttagccagtg ctctttccgt tgtgctgaat taagcgaata     3000

ccggaagcag aaccggatca ccaaatgcgt acaggcgtca tcgccgccca gcaacagcac     3060

aacccaaact gagccgtagc cactgtctgt cctgaattca ttagtaatag ttacgctgcg     3120

gccttttaca catgaccttc gtgaaagcgg gtggcaggag gtcgcgctaa caacctcctg     3180

ccgttttgcc cgtgcatatc ggtcacgaac aaatctgatt actaaacaca gtagcctgga     3240

tttgttctat cagtaatcga ccttattcct aattaaatag agcaaatccc cttattgggg     3300

gtaagacatg aagatgccag aaaaacatga cctgttggcc gccattctcg cggcaaagga     3360

acaaggcatc ggggcaatcc ttgcgtttgc aatggcgtac cttcgcggca gatataatgg     3420

cggtgcgttt acaaaaacag taatcgacgc aacgatgtgc gccattatcg cctggttcat     3480

tcgtgacctt ctcgacttcg ccggactaag tagcaatctc gcttatataa cgagcgtgtt     3540

tatcggctac atcggtactg actcgattgg ttcgcttatc aaacgcttcg ctgctaaaaa     3600

agccggagta gaagatggta gaaatcaata atcaacgtaa ggcgttcctc gatatgctgg     3660

cgtggtcgga gggaactgat aacggacgtc agaaaaccag aaatcatggt tatgacgtca     3720

ttgtaggcgg agagctattt actgattact ccgatcaccc tcgcaaactt gtcacgctaa     3780

acccaaaact caaatcaaca ggcgccggac gctaccagct tctttcccgt tggtgggatg     3840

cctaccgcaa gcagcttggc ctgaaagact tctctccgaa aagtcaggac gctgtggcat     3900

tgcagcagat taaggagcgt ggcgctttac ctatgattga tcgtggtgat atccgtcagg     3960

caatcgaccg ttgcagcaat atctgggctt cactgccggg cgctggttat ggtcagttcg     4020

agcataaggc tgacagcctg attgcaaaat tcaaagaagc gggcggaacg gtcagagaga     4080

ttgatgtatg agcagagtca ccgcgattat ctccgctctg gttatctgca tcatcgtctg     4140

cctgtcatgg gctgttaatc attaccgtga taacgccatt acctacaaag cccagcgcga     4200

caaaaatgcc agagaactga agctggcgaa cgcggcaatt actgacatgc agatgcgtca     4260

gcgtgatgtt gctgcgctcg atgcaaaata cacgaaggag ttagctgatg ctaaagctga     4320

aaatgatgct ctgcgtgatg atgttgccgc tggtcgtcgt cggttgcaca tcaaagcagt     4380

ctgtcagtca gtgcgtgaag ccaccaccgc ctccggcgtg gataatgcag cctccccccg     4440

actggcagac accgctgaac gggattattt caccctcaga gagaggctga tcactatgca     4500

aaaacaactg gaaggaaccc agaagtatat taatgagcag tgcagataga gttgcccata     4560

tcgatgggca actcatgcaa ttattgtgag caatacacac gcgcttccag cggagtataa     4620

atgcctaaag taataaaacc gagcaatcca tttacgaatg tttgctgggt ttctgtttta     4680

acaacatttt ctgcgccgcc acaaattttg gctgcatcga cagttttctt ctgcccaatt     4740

ccagaaacga agaaatgatg ggtgatggtt tcctttggtg ctactgctgc cggtttgttt     4800

tgaacagtaa acgtctgttg agcacatcct gtaataagca gggccagcgc agtagcgagt     4860

agcatttttt tcatggtgtt attcccgatg ctttttgaag ttcgcagaat cgtatgtgta     4920

gaaaattaaa caaaccctaa acaatgagtt gaaatttcat attgttaata tttattaatg     4980

tatgtcaggt gcgatgaatc gtcattgtat tcccggatta actatgtcca cagccctgac     5040

ggggaacttc tctgcgggag tgtccgggaa taattaaaac gatgcacaca gggtttagcg     5100

cgtacacgta ttgcattatg ccaacgcccc ggtgctgaca cggaagaaac cggacgttat     5160

gatttagcgt ggaaagattt gtgtagtgtt ctgaatgctc tcagtaaata gtaatgaatt     5220

atcaaaggta tagtaatatc ttttatgttc atggatattt gtaacccatc ggaaaactcc     5280

tgctttagca agattttccc tgtattgctg aaatgtgatt tctcttgatt tcaacctatc     5340

ataggacgtt tctataagat gcgtgtttct tgagaattta acatttacaa cctttttaag     5400

tccttttatt aacacggtgt tatcgttttc taacacgatg tgaatattat ctgtggctag     5460

atagtaaata taatgtgaga cgttgtgacg ttttagttca gaataaaaca attcacagtc     5520

taaatctttt cgcacttgat cgaatatttc tttaaaaatg gcaacctgag ccattggtaa     5580

aaccttccat gtgatacgag ggcgcgtagt ttgcattatc gtttttatcg tttcaatctg     5640

gtctgacctc cttgtgtttt gttgatgatt tatgtcaaat attaggaatg ttttcactta     5700

atagtattgg ttgcgtaaca aagtgcggtc ctgctggcat tctggaggga aatacaaccg     5760

acagatgtat gtaaggccaa cgtgctcaaa tcttcataca gaaagatttg aagtaatatt     5820

ttaaccgcta gatgaagagc aagcgcatgg agcgacaaaa tgaataaaga acaatctgct     5880

gatgatccct ccgtggatct gattcgtgta aaaaatatgc ttaatagcac catttctatg     5940

agttaccctg atgttgtaat tgcatgtata gaacataagg tgtctctgga agcattcaga     6000

gcaattgagg cagcgttggt gaagcacgat aataatatga aggattattc cctggtggtt     6060

gactgatcac cataactgct aatcattcaa actatttagt ctgtgacaga gccaacacgc     6120

agtctgtcac tgtcaggaaa gtggtaaaac tgcaactcaa ttactgcaat gccctcgtaa     6180

ttaagtgaat ttacaatatc gtcctgttcg gagggaagaa cgcgggatgt tcattcttca     6240

tcacttttaa ttgatgtata tgctctcttt tctgacgtta gtctccgacg gcaggcttca     6300

atgacccagg ctgagaaatt cccggaccct ttttgctcaa gagcgatgtt aatttgttca     6360

atcatttggt taggaaagcg gatgttgcgg gttgttgttc tgcgggttct gttcttcgtt     6420

gacatgaggt tgccccgtat tcagtgtcgc tgatttgtat tgtctgaagt tgtttttacg     6480

ttaagttgat gcagatcaat taatacgata cctgcgtcat aattgattat ttgacgtggt     6540

ttgatggcct ccacgcacgt tgtgatatgt agatgataat cattatcact ttacgggtcc     6600

tttccggtga tccgacaggt tacggcctga tgcggtattt tctccttacg catctgtgcg     6660

gtatttcaca ccgcatacgt caaagcaacc atagtacgcg ccctgtagcg gcgcattaag     6720

cgcggcgggt gtggtggtta cgcgcagcgt gaccgctaca cttgccagcg ccctagcgcc     6780

cgctcctttc gctttcttcc cttcctttct cgccacgttc gccggctttc cccgtcaagc     6840

tctaaatcgg gggctccctt tagggttccg atttagtgct ttacggcacc tcgaccccaa     6900

aaaacttgat ttgggtgatg gttcacgtag tgggccatcg ccctgataga cggtttttcg     6960

ccctttgacg ttggagtcca cgttctttaa tagtggactc ttgttccaaa ctggaacaac     7020

actcaaccct atctcgggct attcttttga tttagacctg caggcatgca agcttactgg     7080

ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt tacccaactt aatcgccttg     7140

cagcacatcc ccctttcgcc agctggcgta atagcgaaga ggcccgcacc gatcgccctt     7200

cccaacagtt gcgcagcctg aatggcgaat gcgatttatt caacaaagcc gccgtcccgt     7260

caagtcagcg taatgctctg ccagtgttac aaccaattaa ccaattctga ttagaaaaac     7320

tcatcgagca tcaaatgaaa ctgcaattta ttcatatcag gattatcaat accatatttt     7380

tgaaaaagcc gtttctgtaa tgaaggagaa aactcaccga ggcagttcca taggatggca     7440

agatcctggt atcggtctgc gattccgact cgtccaacat caatacaacc tattaatttc     7500

ccctcgtcaa aaataaggtt atcaagtgag aaatcaccat gagtgacgac tgaatccggt     7560

gagaatggca aaagcttatg catttctttc cagacttgtt caacaggcca gccattacgc     7620

tcgtcatcaa aatcactcgc atcaaccaaa ccgttattca ttcgtgattg cgcctgagcg     7680

agacgaaata cgcgatcgct gttaaaagga caattacaaa caggaatcga atgcaaccgg     7740

cgcaggaaca ctgccagcgc atcaacaata ttttcacctg aatcaggata ttcttctaat     7800

acctggaatg ctgttttccc ggggatcgca gtggtgagta accatgcatc atcaggagta     7860

cggataaaat gcttgatggt cggaagaggc ataaattccg tcagccagtt tagtctgacc     7920

atctcatctg taacatcatt ggcaacgcta cctttgccat gtttcagaaa caactctggc     7980

gcatcgggct tcccatacaa tcgatagatt gtcgcacctg attgcccgac attatcgcga     8040

gcccatttat acccatataa atcagcatcc atgttggaat ttaatcgcgg cttcgagcaa     8100

gacgtttccc gttgaatatg gctcataaca ccccttgtat tactgtttat gtaagcagac     8160

agttttattg ttcatgatga tatattttta tcttgtgcaa tgtaacatca gagattttga     8220

gacacaacgt ggctttgttg aataaatcga acttttgctg agttgaagga tcagatcacg     8280

catcttcccg acaacgcaga ccgttccgtg gcaaagcaaa agttcaaaat caccaactgg     8340

tccacctaca acaaagctct catcaaccgt ggctccctca ctttctggct ggatgatggg     8400

gcgattcagg cctggtatga gtcagcaaca ccttcttcac gaggcagacc tctcgacgga     8460

tcgttccact gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg agatcctttt     8520

tttctgcgcg taatctgctg cttgcaaaca aaaaaaccac cgctaccagc ggtggtttgt     8580

ttgccggatc aagagctacc aactcttttt ccgaaggtaa ctggcttcag cagagcgcag     8640

ataccaaata ctgtccttct agtgtagccg tagttaggcc accacttcaa gaactctgta     8700

gcaccgccta catacctcgc tctgctaatc ctgttaccag tggctgctgc cagtggcgat     8760

aagtcgtgtc ttaccgggtt ggactcaaga cgatagttac cggataaggc gcagcggtcg     8820

ggctgaacgg ggggttcgtg cacacagccc agcttggagc gaacgaccta caccgaactg     8880

agatacctac agcgtgagct atgagaaagc gccacgcttc ccgaagggag aaaggcggac     8940

aggtatccgg taagcggcag ggtcggaaca ggagagcgca cgagggagct tccaggggga     9000

aacgcctggt atctttatag tcctgtcggg tttcgccacc tctgacttga gcgtcgattt     9060

ttgtgatgct cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc ggccttttta     9120

cggttcctgg ccttttgctg gccttttgct cacatgtcct gcaggcagct gcgcgccagc     9180

tgcgcgc                                                               9187


<210>  8
<211>  11148
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed plasmid

<400>  8
tcgctcgctc actgaggccg cccgggcaaa gcccgggcgt cgggcgacct ttggtcgccc       60

ggcctcagtg agcgagcgag cgcgcagaga gggagtggcc aactccatca ctaggggttc      120

ctgcggccta gtaggctcag aggcacacag gagtttctgc aaatctagtg caggcgttac      180

ataacttacg gtaaatggcc cgcctggctg accgcccaac gacccccgcc cattgacgtc      240

aataatgacg tatgttccca tagtaacgcc aatagggact ttccattgac gtcaatgggt      300

ggagtattta cggtaaactg cccacttggc agtacatcaa gtgtatcata tgccaagtac      360

gccccctatt gacgtcaatg acggtaaatg gcccgcctgg cattatgccc agtacatgac      420

cttatgggac tttcctactt ggcagtacat ctacgtatta gtcatcgcta ttaacatggt      480

cgaggtgagc cccacgttct gcttcactct ccccatctcc cccccctccc cacccccaat      540

tttgtattta tttatttttt aattattttg tgcagcgatg ggggcggggg gggggggggg      600

gcgcgcgcca ggcggggcgg ggcggggcga ggggcggggc ggggcgaggc ggagaggtgc      660

ggcggcagcc aatcagagcg gcgcgctccg aaagtttcct tttatggcga ggcggcggcg      720

gcggcggccc tataaaaagc gaagcgcgcg gcgggcgggg agtcgctgcg acgctgcctt      780

cgccccgtgc cccgctccgc cgccgcctcg cgccgcccgc cccggctctg actgaccgcg      840

ttactcccac aggtgagcgg gcgggacggc ccttctcctc cgggctgtaa ttagcgcttg      900

gtttaatgac ggcttgtttc ttttctgtgg ctgcgtgaaa gccttgaggg gctccgggag      960

ggccctttgt gcggggggag cggctcgggg ggtgcgtgcg tgtgtgtgtg cgtggggagc     1020

gccgcgtgcg gctccgcgct gcccggcggc tgtgagcgct gcgggcgcgg cgcggggctt     1080

tgtgcgctcc gcagtgtgcg cgaggggagc gcggccgggg gcggtgcccc gcggtgcggg     1140

gggggctgcg aggggaacaa aggctgcgtg cggggtgtgt gcgtgggggg gtgagcaggg     1200

ggtgtgggcg cgtcggtcgg gctgcaaccc cccctgcacc cccctccccg agttgctgag     1260

cacggcccgg cttcgggtgc ggggctccgt acggggcgtg gcgcggggct cgccgtgccg     1320

ggcggggggt ggcggcaggt gggggtgccg ggcggggcgg ggccgcctcg ggccggggag     1380

ggctcggggg aggggcgcgg cggcccccgg agcgccggcg gctgtcgagg cgcggcgagc     1440

cgcagccatt gccttttatg gtaatcgtgc gagagggcgc agggacttcc tttgtcccaa     1500

atctgtgcgg agccgaaatc tgggaggcgc cgccgcaccc cctctagcgg gcgcggggcg     1560

aagcggtgcg gcgccggcag gaaggaaatg ggcggggagg gccttcgtgc gtcgccgcgc     1620

cgccgtcccc ttctccctct ccagcctcgg ggctgtccgc ggggggacgg ctgccttcgg     1680

gggggacggg gcagggcggg gttcggcttc tggcgtgtga ccggcggctc tagacaattg     1740

tactaacctt cttctctttc ctctcctgac aggttggtgt acactagcgg ccgccaccat     1800

ggctgatacc ctgccctctg aattcgacgt gattgtgatt ggaaccggac tccctgaatc     1860

gatcatcgcc gcggcctgtt cccggtccgg tcggcgcgtg ctgcacgtcg attcgagaag     1920

ctactacgga gggaattggg cctcattctc cttctccgga ctgctctcct ggctgaagga     1980

gtatcaggag aactccgaca ttgtctccga ctcacctgtg tggcaggacc agatcctgga     2040

aaacgaggaa gcaatagccc tgagccggaa ggacaagacc atccagcacg tggaggtgtt     2100

ctgttatgcc tcccaagacc tccatgagga cgtggaagag gctggagcgt tgcagaagaa     2160

tcatgccctc gtgacctccg ctaactccac cgaggcagcc gacagcgcct tcctgccgac     2220

cgaggatgaa tccctgtcaa ctatgtcgtg cgaaatgctg accgaacaga ctccgagctc     2280

cgaccccgaa aacgccctgg aagtgaacgg agcggaagtg accggcgaaa aggagaacca     2340

ttgcgacgac aagacttgtg tcccatccac ttccgcggag gacatgtccg agaatgtgcc     2400

tatcgccgag gacaccaccg aacagcccaa gaagaacaga atcacgtaca gccagatcat     2460

caaggagggg cggaggttta acatcgatct ggtgtcgaag ctgctgtaca gccgcggtct     2520

gctgatcgat ctgctcatta agtcgaacgt gtcgagatac gccgagttca agaacatcac     2580

aaggattctc gccttccggg aaggaagagt ggaacaagtg ccgtgctccc gggccgacgt     2640

gttcaactca aagcaactta ccatggtgga aaagcgcatg ctgatgaaat tcctgacctt     2700

ctgcatggag tacgaaaagt accctgatga gtacaagggt tacgaagaaa ttactttcta     2760

cgagtacctc aagacccaga agctgacccc gaatctgcag tacattgtga tgcactcaat     2820

cgcaatgacc tccgaaaccg cctcctcgac catcgacggg ctcaaggcca ccaagaactt     2880

cctgcactgt ttggggcgct acggcaacac tccgttcctc ttcccgctgt acggccaggg     2940

agagctgcct cagtgtttct gccggatgtg cgccgtgttc ggcggaatct actgtctccg     3000

ccactcggtc cagtgcctgg tggtggacaa ggaatccagg aagtgcaaag ccattattga     3060

ccagttcgga caacggatca tttccgagca ctttcttgtg gaggactcat acttcccgga     3120

gaacatgtgc tctcgggtcc agtatcgaca gatttccagg gcggtgctca ttactgaccg     3180

gagcgtcctc aagaccgata gcgaccagca gatctccatc ctgaccgtgc cggcggaaga     3240

acccggcact tttgccgtgc gcgtgatcga gctttgctca tccaccatga cttgcatgaa     3300

aggcacttac ctggtgcacc tgacgtgcac ctcatcgaaa accgctagag aggacctgga     3360

atccgtcgtc caaaagctgt tcgtgcctta caccgagatg gaaattgaaa acgaacaagt     3420

ggagaagccc cgcatccttt gggccctgta ctttaacatg cgcgattcct ccgatatctc     3480

gcggtcctgc tataacgact tgccttcgaa cgtctacgtc tgctccgggc cagactgcgg     3540

tcttggcaac gacaatgccg tgaagcaggc ggaaacactg ttccaagaga tctgccctaa     3600

cgaggatttt tgcccgcccc ccccaaaccc cgaggatatc atcttggacg gagacagcct     3660

gcagccagaa gcatccgagt ccagcgccat cccggaggcc aacagcgaaa ccttcaagga     3720

gagcactaac ctgggcaacc tggaagagtc cagcgaatga tcataggatc ctgcagatct     3780

cgagccgaat tcctgcagcc cgggggatca gcctcgactg tgccttctag ttgccagcca     3840

tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg aaggtgccac tcccactgtc     3900

ctttcctaat aaaatgagga aattgcatcg cattgtctga gtaggtgtca ttctattctg     3960

gggggtgggg tggggcagga cagcaagggg gaggattggg aagacaatag caggcatgct     4020

ggggatgcgg tgggctctat ggcttctgag gcggaaagaa ccagctgggg ctcgagatcc     4080

actagggccg caggaacccc tagtgatgga gttggccact ccctctctgc gcgctcgctc     4140

gctcactgag gccgggcgac caaaggtcgc ccgacgcccg ggctttgccc gggcggcctc     4200

agtgagcgag cgacctgcag gggcagcttg aaggaaatac taaggcaaag gtactgcaag     4260

tgctcgcaac attcgcttat gcggattatt gccgtagtgc cgcgacgccg ggggcaagat     4320

gcagagattg ccatggtaca ggccgtgcgg ttgatattgc caaaacagag ctgtggggga     4380

gagttgtcga gaaagagtgc ggaagatgca aaggcgtcgg ctattcaagg atgccagcaa     4440

gcgcagcata tcgcgctgtg acgatgctaa tcccaaacct tacccaaccc acctggtcac     4500

gcactgttaa gccgctgtat gacgctctgg tggtgcaatg ccacaaagaa gagtcaatcg     4560

cagacaacat tttgaatgcg gtcacacgtt agcagcatga ttgccacgga tggcaacata     4620

ttaacggcat gatattgact tattgaataa aattgggtaa atttgactca acgatgggtt     4680

aattcgctcg ttgtggtagt gagatgaaaa gaggcggcgc ttactaccga ttccgcctag     4740

ttggtcactt cgacgtatcg tctggaactc caaccatcgc aggcagagag gtctgcaaaa     4800

tgcaatcccg aaacagttcg caggtaatag ttagagcctg cataacggtt tcgggatttt     4860

ttatatctgc acaacaggta agagcattga gtcgataatc gtgaagagtc ggcgagcctg     4920

gttagccagt gctctttccg ttgtgctgaa ttaagcgaat accggaagca gaaccggatc     4980

accaaatgcg tacaggcgtc atcgccgccc agcaacagca caacccaaac tgagccgtag     5040

ccactgtctg tcctgaattc attagtaata gttacgctgc ggccttttac acatgacctt     5100

cgtgaaagcg ggtggcagga ggtcgcgcta acaacctcct gccgttttgc ccgtgcatat     5160

cggtcacgaa caaatctgat tactaaacac agtagcctgg atttgttcta tcagtaatcg     5220

accttattcc taattaaata gagcaaatcc ccttattggg ggtaagacat gaagatgcca     5280

gaaaaacatg acctgttggc cgccattctc gcggcaaagg aacaaggcat cggggcaatc     5340

cttgcgtttg caatggcgta ccttcgcggc agatataatg gcggtgcgtt tacaaaaaca     5400

gtaatcgacg caacgatgtg cgccattatc gcctggttca ttcgtgacct tctcgacttc     5460

gccggactaa gtagcaatct cgcttatata acgagcgtgt ttatcggcta catcggtact     5520

gactcgattg gttcgcttat caaacgcttc gctgctaaaa aagccggagt agaagatggt     5580

agaaatcaat aatcaacgta aggcgttcct cgatatgctg gcgtggtcgg agggaactga     5640

taacggacgt cagaaaacca gaaatcatgg ttatgacgtc attgtaggcg gagagctatt     5700

tactgattac tccgatcacc ctcgcaaact tgtcacgcta aacccaaaac tcaaatcaac     5760

aggcgccgga cgctaccagc ttctttcccg ttggtgggat gcctaccgca agcagcttgg     5820

cctgaaagac ttctctccga aaagtcagga cgctgtggca ttgcagcaga ttaaggagcg     5880

tggcgcttta cctatgattg atcgtggtga tatccgtcag gcaatcgacc gttgcagcaa     5940

tatctgggct tcactgccgg gcgctggtta tggtcagttc gagcataagg ctgacagcct     6000

gattgcaaaa ttcaaagaag cgggcggaac ggtcagagag attgatgtat gagcagagtc     6060

accgcgatta tctccgctct ggttatctgc atcatcgtct gcctgtcatg ggctgttaat     6120

cattaccgtg ataacgccat tacctacaaa gcccagcgcg acaaaaatgc cagagaactg     6180

aagctggcga acgcggcaat tactgacatg cagatgcgtc agcgtgatgt tgctgcgctc     6240

gatgcaaaat acacgaagga gttagctgat gctaaagctg aaaatgatgc tctgcgtgat     6300

gatgttgccg ctggtcgtcg tcggttgcac atcaaagcag tctgtcagtc agtgcgtgaa     6360

gccaccaccg cctccggcgt ggataatgca gcctcccccc gactggcaga caccgctgaa     6420

cgggattatt tcaccctcag agagaggctg atcactatgc aaaaacaact ggaaggaacc     6480

cagaagtata ttaatgagca gtgcagatag agttgcccat atcgatgggc aactcatgca     6540

attattgtga gcaatacaca cgcgcttcca gcggagtata aatgcctaaa gtaataaaac     6600

cgagcaatcc atttacgaat gtttgctggg tttctgtttt aacaacattt tctgcgccgc     6660

cacaaatttt ggctgcatcg acagttttct tctgcccaat tccagaaacg aagaaatgat     6720

gggtgatggt ttcctttggt gctactgctg ccggtttgtt ttgaacagta aacgtctgtt     6780

gagcacatcc tgtaataagc agggccagcg cagtagcgag tagcattttt ttcatggtgt     6840

tattcccgat gctttttgaa gttcgcagaa tcgtatgtgt agaaaattaa acaaacccta     6900

aacaatgagt tgaaatttca tattgttaat atttattaat gtatgtcagg tgcgatgaat     6960

cgtcattgta ttcccggatt aactatgtcc acagccctga cggggaactt ctctgcggga     7020

gtgtccggga ataattaaaa cgatgcacac agggtttagc gcgtacacgt attgcattat     7080

gccaacgccc cggtgctgac acggaagaaa ccggacgtta tgatttagcg tggaaagatt     7140

tgtgtagtgt tctgaatgct ctcagtaaat agtaatgaat tatcaaaggt atagtaatat     7200

cttttatgtt catggatatt tgtaacccat cggaaaactc ctgctttagc aagattttcc     7260

ctgtattgct gaaatgtgat ttctcttgat ttcaacctat cataggacgt ttctataaga     7320

tgcgtgtttc ttgagaattt aacatttaca acctttttaa gtccttttat taacacggtg     7380

ttatcgtttt ctaacacgat gtgaatatta tctgtggcta gatagtaaat ataatgtgag     7440

acgttgtgac gttttagttc agaataaaac aattcacagt ctaaatcttt tcgcacttga     7500

tcgaatattt ctttaaaaat ggcaacctga gccattggta aaaccttcca tgtgatacga     7560

gggcgcgtag tttgcattat cgtttttatc gtttcaatct ggtctgacct ccttgtgttt     7620

tgttgatgat ttatgtcaaa tattaggaat gttttcactt aatagtattg gttgcgtaac     7680

aaagtgcggt cctgctggca ttctggaggg aaatacaacc gacagatgta tgtaaggcca     7740

acgtgctcaa atcttcatac agaaagattt gaagtaatat tttaaccgct agatgaagag     7800

caagcgcatg gagcgacaaa atgaataaag aacaatctgc tgatgatccc tccgtggatc     7860

tgattcgtgt aaaaaatatg cttaatagca ccatttctat gagttaccct gatgttgtaa     7920

ttgcatgtat agaacataag gtgtctctgg aagcattcag agcaattgag gcagcgttgg     7980

tgaagcacga taataatatg aaggattatt ccctggtggt tgactgatca ccataactgc     8040

taatcattca aactatttag tctgtgacag agccaacacg cagtctgtca ctgtcaggaa     8100

agtggtaaaa ctgcaactca attactgcaa tgccctcgta attaagtgaa tttacaatat     8160

cgtcctgttc ggagggaaga acgcgggatg ttcattcttc atcactttta attgatgtat     8220

atgctctctt ttctgacgtt agtctccgac ggcaggcttc aatgacccag gctgagaaat     8280

tcccggaccc tttttgctca agagcgatgt taatttgttc aatcatttgg ttaggaaagc     8340

ggatgttgcg ggttgttgtt ctgcgggttc tgttcttcgt tgacatgagg ttgccccgta     8400

ttcagtgtcg ctgatttgta ttgtctgaag ttgtttttac gttaagttga tgcagatcaa     8460

ttaatacgat acctgcgtca taattgatta tttgacgtgg tttgatggcc tccacgcacg     8520

ttgtgatatg tagatgataa tcattatcac tttacgggtc ctttccggtg atccgacagg     8580

ttacggcctg atgcggtatt ttctccttac gcatctgtgc ggtatttcac accgcatacg     8640

tcaaagcaac catagtacgc gccctgtagc ggcgcattaa gcgcggcggg tgtggtggtt     8700

acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt cgctttcttc     8760

ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg ggggctccct     8820

ttagggttcc gatttagtgc tttacggcac ctcgacccca aaaaacttga tttgggtgat     8880

ggttcacgta gtgggccatc gccctgatag acggtttttc gccctttgac gttggagtcc     8940

acgttcttta atagtggact cttgttccaa actggaacaa cactcaaccc tatctcgggc     9000

tattcttttg atttagacct gcaggcatgc aagcttactg gccgtcgttt tacaacgtcg     9060

tgactgggaa aaccctggcg ttacccaact taatcgcctt gcagcacatc cccctttcgc     9120

cagctggcgt aatagcgaag aggcccgcac cgatcgccct tcccaacagt tgcgcagcct     9180

gaatggcgaa tgcgatttat tcaacaaagc cgccgtcccg tcaagtcagc gtaatgctct     9240

gccagtgtta caaccaatta accaattctg attagaaaaa ctcatcgagc atcaaatgaa     9300

actgcaattt attcatatca ggattatcaa taccatattt ttgaaaaagc cgtttctgta     9360

atgaaggaga aaactcaccg aggcagttcc ataggatggc aagatcctgg tatcggtctg     9420

cgattccgac tcgtccaaca tcaatacaac ctattaattt cccctcgtca aaaataaggt     9480

tatcaagtga gaaatcacca tgagtgacga ctgaatccgg tgagaatggc aaaagcttat     9540

gcatttcttt ccagacttgt tcaacaggcc agccattacg ctcgtcatca aaatcactcg     9600

catcaaccaa accgttattc attcgtgatt gcgcctgagc gagacgaaat acgcgatcgc     9660

tgttaaaagg acaattacaa acaggaatcg aatgcaaccg gcgcaggaac actgccagcg     9720

catcaacaat attttcacct gaatcaggat attcttctaa tacctggaat gctgttttcc     9780

cggggatcgc agtggtgagt aaccatgcat catcaggagt acggataaaa tgcttgatgg     9840

tcggaagagg cataaattcc gtcagccagt ttagtctgac catctcatct gtaacatcat     9900

tggcaacgct acctttgcca tgtttcagaa acaactctgg cgcatcgggc ttcccataca     9960

atcgatagat tgtcgcacct gattgcccga cattatcgcg agcccattta tacccatata    10020

aatcagcatc catgttggaa tttaatcgcg gcttcgagca agacgtttcc cgttgaatat    10080

ggctcataac accccttgta ttactgttta tgtaagcaga cagttttatt gttcatgatg    10140

atatattttt atcttgtgca atgtaacatc agagattttg agacacaacg tggctttgtt    10200

gaataaatcg aacttttgct gagttgaagg atcagatcac gcatcttccc gacaacgcag    10260

accgttccgt ggcaaagcaa aagttcaaaa tcaccaactg gtccacctac aacaaagctc    10320

tcatcaaccg tggctccctc actttctggc tggatgatgg ggcgattcag gcctggtatg    10380

agtcagcaac accttcttca cgaggcagac ctctcgacgg atcgttccac tgagcgtcag    10440

accccgtaga aaagatcaaa ggatcttctt gagatccttt ttttctgcgc gtaatctgct    10500

gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg tttgccggat caagagctac    10560

caactctttt tccgaaggta actggcttca gcagagcgca gataccaaat actgtccttc    10620

tagtgtagcc gtagttaggc caccacttca agaactctgt agcaccgcct acatacctcg    10680

ctctgctaat cctgttacca gtggctgctg ccagtggcga taagtcgtgt cttaccgggt    10740

tggactcaag acgatagtta ccggataagg cgcagcggtc gggctgaacg gggggttcgt    10800

gcacacagcc cagcttggag cgaacgacct acaccgaact gagataccta cagcgtgagc    10860

tatgagaaag cgccacgctt cccgaaggga gaaaggcgga caggtatccg gtaagcggca    10920

gggtcggaac aggagagcgc acgagggagc ttccaggggg aaacgcctgg tatctttata    10980

gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg    11040

ggcggagcct atggaaaaac gccagcaacg cggccttttt acggttcctg gccttttgct    11100

ggccttttgc tcacatgtcc tgcaggcagc tgcgcgccag ctgcgcgc                 11148


<210>  9
<211>  2085
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  codon optimized sequence


<220>
<221>  CDS
<222>  (1)..(2085)
<223>  codon-optimized ORF

<400>  9
atg gct aag att aac acc cag tac tca cat cca tcc cgc act cac ctc         48
Met Ala Lys Ile Asn Thr Gln Tyr Ser His Pro Ser Arg Thr His Leu           
1               5                   10                  15                

aaa gtc aag acc tcc gat cgg gat ctg aac cgg gct gag aat ggg ctg         96
Lys Val Lys Thr Ser Asp Arg Asp Leu Asn Arg Ala Glu Asn Gly Leu           
            20                  25                  30                    

tcg cgc gcc cac tcg tcg tcc gag gaa acc agc agc gtg ctc cag ccg        144
Ser Arg Ala His Ser Ser Ser Glu Glu Thr Ser Ser Val Leu Gln Pro           
        35                  40                  45                        

ggc atc gcc atg gaa act agg ggg ctg gcg gac tcc gga cag gga tcc        192
Gly Ile Ala Met Glu Thr Arg Gly Leu Ala Asp Ser Gly Gln Gly Ser           
    50                  55                  60                            

ttc act gga cag ggt att gcc cgg ctg agc aga ctg atc ttc ctg ctt        240
Phe Thr Gly Gln Gly Ile Ala Arg Leu Ser Arg Leu Ile Phe Leu Leu           
65                  70                  75                  80            

cgc cgc tgg gcg gcc aga cac gtg cac cat cag gac cag gga cct gat        288
Arg Arg Trp Ala Ala Arg His Val His His Gln Asp Gln Gly Pro Asp           
                85                  90                  95                

agc ttc ccc gac cgc ttt agg gga gcc gag ctg aaa gaa gtg tca agc        336
Ser Phe Pro Asp Arg Phe Arg Gly Ala Glu Leu Lys Glu Val Ser Ser           
            100                 105                 110                   

cag gag tca aac gcg cag gcc aac gtc ggc agc caa gag cct gca gac        384
Gln Glu Ser Asn Ala Gln Ala Asn Val Gly Ser Gln Glu Pro Ala Asp           
        115                 120                 125                       

cgg gga cgc tcg gca tgg ccg ctc gca aag tgc aac act aac act tcc        432
Arg Gly Arg Ser Ala Trp Pro Leu Ala Lys Cys Asn Thr Asn Thr Ser           
    130                 135                 140                           

aac aac acc gaa gag gaa aag aaa acc aag aag aag gat gca att gtg        480
Asn Asn Thr Glu Glu Glu Lys Lys Thr Lys Lys Lys Asp Ala Ile Val           
145                 150                 155                 160           

gtg gac cct tcc tcc aac ctg tac tac cgc tgg ttg acc gcc atc gcc        528
Val Asp Pro Ser Ser Asn Leu Tyr Tyr Arg Trp Leu Thr Ala Ile Ala           
                165                 170                 175               

ctc ccg gtc ttt tac aat tgg tat ctc ctt atc tgc cgg gcc tgc ttc        576
Leu Pro Val Phe Tyr Asn Trp Tyr Leu Leu Ile Cys Arg Ala Cys Phe           
            180                 185                 190                   

gac gaa ctg caa tca gag tac ctg atg ctg tgg ctg gtg ctg gac tat        624
Asp Glu Leu Gln Ser Glu Tyr Leu Met Leu Trp Leu Val Leu Asp Tyr           
        195                 200                 205                       

agc gcc gat gtg ctc tac gtc ctg gat gtg ctc gtg cgc gcc cgg acc        672
Ser Ala Asp Val Leu Tyr Val Leu Asp Val Leu Val Arg Ala Arg Thr           
    210                 215                 220                           

gga ttc ttg gaa caa ggc ctg atg gtg tcc gac acg aat aga ctg tgg        720
Gly Phe Leu Glu Gln Gly Leu Met Val Ser Asp Thr Asn Arg Leu Trp           
225                 230                 235                 240           

cag cac tat aag acc aca acc cag ttc aag ctt gac gtg ctc agc ctt        768
Gln His Tyr Lys Thr Thr Thr Gln Phe Lys Leu Asp Val Leu Ser Leu           
                245                 250                 255               

gtg ccg act gac ctg gcc tac ctg aaa gtc gga act aac tac ccg gaa        816
Val Pro Thr Asp Leu Ala Tyr Leu Lys Val Gly Thr Asn Tyr Pro Glu           
            260                 265                 270                   

gtc aga ttc aac cga ctc ctg aag ttc agc agg ctg ttc gag ttc ttt        864
Val Arg Phe Asn Arg Leu Leu Lys Phe Ser Arg Leu Phe Glu Phe Phe           
        275                 280                 285                       

gac cgc acc gag act cgg acc aac tac cct aac atg ttc cgg atc gga        912
Asp Arg Thr Glu Thr Arg Thr Asn Tyr Pro Asn Met Phe Arg Ile Gly           
    290                 295                 300                           

aat ctg gtg ctc tac ata ctg att atc atc cat tgg aac gcc tgt atc        960
Asn Leu Val Leu Tyr Ile Leu Ile Ile Ile His Trp Asn Ala Cys Ile           
305                 310                 315                 320           

tat ttc gcc att tcg aag ttc atc ggt ttc gga acc gat tcc tgg gtg       1008
Tyr Phe Ala Ile Ser Lys Phe Ile Gly Phe Gly Thr Asp Ser Trp Val           
                325                 330                 335               

tac ccc aac atc tcg atc ccc gaa cac ggt cgc ctg tcc cgg aag tac       1056
Tyr Pro Asn Ile Ser Ile Pro Glu His Gly Arg Leu Ser Arg Lys Tyr           
            340                 345                 350                   

atc tac tcc ctg tac tgg tcc act ctg act ctg acc acg atc ggg gaa       1104
Ile Tyr Ser Leu Tyr Trp Ser Thr Leu Thr Leu Thr Thr Ile Gly Glu           
        355                 360                 365                       

acc cct cca ccc gtg aag gac gaa gag tac ctg ttc gtg gtg gtg gac       1152
Thr Pro Pro Pro Val Lys Asp Glu Glu Tyr Leu Phe Val Val Val Asp           
    370                 375                 380                           

ttc ctg gtc gga gtg ttg att ttc gcc acc att gtg gga aac gtg ggc       1200
Phe Leu Val Gly Val Leu Ile Phe Ala Thr Ile Val Gly Asn Val Gly           
385                 390                 395                 400           

tcc atg atc tcc aac atg aac gcg tcg aga gct gag ttc caa gcc aag       1248
Ser Met Ile Ser Asn Met Asn Ala Ser Arg Ala Glu Phe Gln Ala Lys           
                405                 410                 415               

atc gac tcc att aag cag tac atg cag ttc aga aag gtc acc aag gac       1296
Ile Asp Ser Ile Lys Gln Tyr Met Gln Phe Arg Lys Val Thr Lys Asp           
            420                 425                 430                   

ctg gaa acc agg gtc atc cgc tgg ttc gac tac ctg tgg gcc aac aaa       1344
Leu Glu Thr Arg Val Ile Arg Trp Phe Asp Tyr Leu Trp Ala Asn Lys           
        435                 440                 445                       

aag act gtg gac gaa aag gaa gtg ctg aag tcg ctg ccg gat aag ctg       1392
Lys Thr Val Asp Glu Lys Glu Val Leu Lys Ser Leu Pro Asp Lys Leu           
    450                 455                 460                           

aag gcc gaa atc gcc att aac gtg cac ctt gac acc ctg aag aaa gtc       1440
Lys Ala Glu Ile Ala Ile Asn Val His Leu Asp Thr Leu Lys Lys Val           
465                 470                 475                 480           

cgg atc ttc caa gac tgt gaa gcc ggc ctc ctg gtg gag ctc gtg ctc       1488
Arg Ile Phe Gln Asp Cys Glu Ala Gly Leu Leu Val Glu Leu Val Leu           
                485                 490                 495               

aag ctg cgg ccc acc gtg ttc agc ccg gga gat tac att tgc aag aag       1536
Lys Leu Arg Pro Thr Val Phe Ser Pro Gly Asp Tyr Ile Cys Lys Lys           
            500                 505                 510                   

ggc gat atc ggc aaa gag atg tac atc atc aac gag gga aag ctg gcc       1584
Gly Asp Ile Gly Lys Glu Met Tyr Ile Ile Asn Glu Gly Lys Leu Ala           
        515                 520                 525                       

gtg gtc gcg gac gac ggc gtg acc cag ttc gtg gtg ctg tcc gac gga       1632
Val Val Ala Asp Asp Gly Val Thr Gln Phe Val Val Leu Ser Asp Gly           
    530                 535                 540                           

tcc tac ttc ggt gaa atc tca atc ctc aac atc aag ggg tcc aag tcc       1680
Ser Tyr Phe Gly Glu Ile Ser Ile Leu Asn Ile Lys Gly Ser Lys Ser           
545                 550                 555                 560           

ggc aac cgg aga act gcc aac att cgc tcc atc gga tac agc gac ctg       1728
Gly Asn Arg Arg Thr Ala Asn Ile Arg Ser Ile Gly Tyr Ser Asp Leu           
                565                 570                 575               

ttt tgc ctg tcc aag gat gac ctg atg gag gct ctg act gag tac cct       1776
Phe Cys Leu Ser Lys Asp Asp Leu Met Glu Ala Leu Thr Glu Tyr Pro           
            580                 585                 590                   

gaa gcg aag aag gct ttg gag gaa aag ggg cgg cag att ctg atg aag       1824
Glu Ala Lys Lys Ala Leu Glu Glu Lys Gly Arg Gln Ile Leu Met Lys           
        595                 600                 605                       

gac aat ttg atc gac gag gag ctc gca cgg gcc ggc gcc gac ccc aag       1872
Asp Asn Leu Ile Asp Glu Glu Leu Ala Arg Ala Gly Ala Asp Pro Lys           
    610                 615                 620                           

gat ctc gaa gag aag gtc gaa cag ctg ggt tct tcg ctt gat acc ctg       1920
Asp Leu Glu Glu Lys Val Glu Gln Leu Gly Ser Ser Leu Asp Thr Leu           
625                 630                 635                 640           

caa acc cga ttc gcg cgg ctg ctc gcc gag tac aac gcg acc cag atg       1968
Gln Thr Arg Phe Ala Arg Leu Leu Ala Glu Tyr Asn Ala Thr Gln Met           
                645                 650                 655               

aag atg aag cag aga ctg tca cag ttg gaa tcc caa gtc aag ggc gga       2016
Lys Met Lys Gln Arg Leu Ser Gln Leu Glu Ser Gln Val Lys Gly Gly           
            660                 665                 670                   

ggc gac aag ccg ctg gcg gac ggg gaa gtg ccc ggg gac gcc acc aag       2064
Gly Asp Lys Pro Leu Ala Asp Gly Glu Val Pro Gly Asp Ala Thr Lys           
        675                 680                 685                       

act gag gac aag cag cag tga                                           2085
Thr Glu Asp Lys Gln Gln                                                   
    690                                                                   


<210>  10
<211>  694
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  10

Met Ala Lys Ile Asn Thr Gln Tyr Ser His Pro Ser Arg Thr His Leu 
1               5                   10                  15      


Lys Val Lys Thr Ser Asp Arg Asp Leu Asn Arg Ala Glu Asn Gly Leu 
            20                  25                  30          


Ser Arg Ala His Ser Ser Ser Glu Glu Thr Ser Ser Val Leu Gln Pro 
        35                  40                  45              


Gly Ile Ala Met Glu Thr Arg Gly Leu Ala Asp Ser Gly Gln Gly Ser 
    50                  55                  60                  


Phe Thr Gly Gln Gly Ile Ala Arg Leu Ser Arg Leu Ile Phe Leu Leu 
65                  70                  75                  80  


Arg Arg Trp Ala Ala Arg His Val His His Gln Asp Gln Gly Pro Asp 
                85                  90                  95      


Ser Phe Pro Asp Arg Phe Arg Gly Ala Glu Leu Lys Glu Val Ser Ser 
            100                 105                 110         


Gln Glu Ser Asn Ala Gln Ala Asn Val Gly Ser Gln Glu Pro Ala Asp 
        115                 120                 125             


Arg Gly Arg Ser Ala Trp Pro Leu Ala Lys Cys Asn Thr Asn Thr Ser 
    130                 135                 140                 


Asn Asn Thr Glu Glu Glu Lys Lys Thr Lys Lys Lys Asp Ala Ile Val 
145                 150                 155                 160 


Val Asp Pro Ser Ser Asn Leu Tyr Tyr Arg Trp Leu Thr Ala Ile Ala 
                165                 170                 175     


Leu Pro Val Phe Tyr Asn Trp Tyr Leu Leu Ile Cys Arg Ala Cys Phe 
            180                 185                 190         


Asp Glu Leu Gln Ser Glu Tyr Leu Met Leu Trp Leu Val Leu Asp Tyr 
        195                 200                 205             


Ser Ala Asp Val Leu Tyr Val Leu Asp Val Leu Val Arg Ala Arg Thr 
    210                 215                 220                 


Gly Phe Leu Glu Gln Gly Leu Met Val Ser Asp Thr Asn Arg Leu Trp 
225                 230                 235                 240 


Gln His Tyr Lys Thr Thr Thr Gln Phe Lys Leu Asp Val Leu Ser Leu 
                245                 250                 255     


Val Pro Thr Asp Leu Ala Tyr Leu Lys Val Gly Thr Asn Tyr Pro Glu 
            260                 265                 270         


Val Arg Phe Asn Arg Leu Leu Lys Phe Ser Arg Leu Phe Glu Phe Phe 
        275                 280                 285             


Asp Arg Thr Glu Thr Arg Thr Asn Tyr Pro Asn Met Phe Arg Ile Gly 
    290                 295                 300                 


Asn Leu Val Leu Tyr Ile Leu Ile Ile Ile His Trp Asn Ala Cys Ile 
305                 310                 315                 320 


Tyr Phe Ala Ile Ser Lys Phe Ile Gly Phe Gly Thr Asp Ser Trp Val 
                325                 330                 335     


Tyr Pro Asn Ile Ser Ile Pro Glu His Gly Arg Leu Ser Arg Lys Tyr 
            340                 345                 350         


Ile Tyr Ser Leu Tyr Trp Ser Thr Leu Thr Leu Thr Thr Ile Gly Glu 
        355                 360                 365             


Thr Pro Pro Pro Val Lys Asp Glu Glu Tyr Leu Phe Val Val Val Asp 
    370                 375                 380                 


Phe Leu Val Gly Val Leu Ile Phe Ala Thr Ile Val Gly Asn Val Gly 
385                 390                 395                 400 


Ser Met Ile Ser Asn Met Asn Ala Ser Arg Ala Glu Phe Gln Ala Lys 
                405                 410                 415     


Ile Asp Ser Ile Lys Gln Tyr Met Gln Phe Arg Lys Val Thr Lys Asp 
            420                 425                 430         


Leu Glu Thr Arg Val Ile Arg Trp Phe Asp Tyr Leu Trp Ala Asn Lys 
        435                 440                 445             


Lys Thr Val Asp Glu Lys Glu Val Leu Lys Ser Leu Pro Asp Lys Leu 
    450                 455                 460                 


Lys Ala Glu Ile Ala Ile Asn Val His Leu Asp Thr Leu Lys Lys Val 
465                 470                 475                 480 


Arg Ile Phe Gln Asp Cys Glu Ala Gly Leu Leu Val Glu Leu Val Leu 
                485                 490                 495     


Lys Leu Arg Pro Thr Val Phe Ser Pro Gly Asp Tyr Ile Cys Lys Lys 
            500                 505                 510         


Gly Asp Ile Gly Lys Glu Met Tyr Ile Ile Asn Glu Gly Lys Leu Ala 
        515                 520                 525             


Val Val Ala Asp Asp Gly Val Thr Gln Phe Val Val Leu Ser Asp Gly 
    530                 535                 540                 


Ser Tyr Phe Gly Glu Ile Ser Ile Leu Asn Ile Lys Gly Ser Lys Ser 
545                 550                 555                 560 


Gly Asn Arg Arg Thr Ala Asn Ile Arg Ser Ile Gly Tyr Ser Asp Leu 
                565                 570                 575     


Phe Cys Leu Ser Lys Asp Asp Leu Met Glu Ala Leu Thr Glu Tyr Pro 
            580                 585                 590         


Glu Ala Lys Lys Ala Leu Glu Glu Lys Gly Arg Gln Ile Leu Met Lys 
        595                 600                 605             


Asp Asn Leu Ile Asp Glu Glu Leu Ala Arg Ala Gly Ala Asp Pro Lys 
    610                 615                 620                 


Asp Leu Glu Glu Lys Val Glu Gln Leu Gly Ser Ser Leu Asp Thr Leu 
625                 630                 635                 640 


Gln Thr Arg Phe Ala Arg Leu Leu Ala Glu Tyr Asn Ala Thr Gln Met 
                645                 650                 655     


Lys Met Lys Gln Arg Leu Ser Gln Leu Glu Ser Gln Val Lys Gly Gly 
            660                 665                 670         


Gly Asp Lys Pro Leu Ala Asp Gly Glu Val Pro Gly Asp Ala Thr Lys 
        675                 680                 685             


Thr Glu Asp Lys Gln Gln 
    690                 


<210>  11
<211>  2250
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  codon optimized sequence


<220>
<221>  CDS
<222>  (1)..(2250)
<223>  codon-optimized ORF

<400>  11
atg gct aag att aac acc cag tac tca cat cca tcc cgc act cac ctc         48
Met Ala Lys Ile Asn Thr Gln Tyr Ser His Pro Ser Arg Thr His Leu           
1               5                   10                  15                

aaa gtc aag acc tcc gat cgg gat ctg aac cgg gct gag aat ggg ctg         96
Lys Val Lys Thr Ser Asp Arg Asp Leu Asn Arg Ala Glu Asn Gly Leu           
            20                  25                  30                    

tcg cgc gcc cac tcg tcg tcc gag gaa acc agc agc gtg ctc cag ccg        144
Ser Arg Ala His Ser Ser Ser Glu Glu Thr Ser Ser Val Leu Gln Pro           
        35                  40                  45                        

ggc atc gcc atg gaa act agg ggg ctg gcg gac tcc gga cag gga tcc        192
Gly Ile Ala Met Glu Thr Arg Gly Leu Ala Asp Ser Gly Gln Gly Ser           
    50                  55                  60                            

ttc act gga cag ggt att gcc cgg ttc ggg cgg att cag aag aag tcc        240
Phe Thr Gly Gln Gly Ile Ala Arg Phe Gly Arg Ile Gln Lys Lys Ser           
65                  70                  75                  80            

cag ccg gag aag gtc gtg cgg gct gcc agc agg ggc agg cca ctc att        288
Gln Pro Glu Lys Val Val Arg Ala Ala Ser Arg Gly Arg Pro Leu Ile           
                85                  90                  95                

ggt tgg aca cag tgg tgc gct gag gat ggt gga gat gaa tcg gaa atg        336
Gly Trp Thr Gln Trp Cys Ala Glu Asp Gly Gly Asp Glu Ser Glu Met           
            100                 105                 110                   

gca ctg gcc ggc tct ccc gga tgc agc tcg ggc ccc caa ggg aga ctg        384
Ala Leu Ala Gly Ser Pro Gly Cys Ser Ser Gly Pro Gln Gly Arg Leu           
        115                 120                 125                       

agc aga ctg atc ttc ctg ctt cgc cgc tgg gcg gcc aga cac gtg cac        432
Ser Arg Leu Ile Phe Leu Leu Arg Arg Trp Ala Ala Arg His Val His           
    130                 135                 140                           

cat cag gac cag gga cct gat agc ttc ccc gac cgc ttt agg gga gcc        480
His Gln Asp Gln Gly Pro Asp Ser Phe Pro Asp Arg Phe Arg Gly Ala           
145                 150                 155                 160           

gag ctg aaa gaa gtg tca agc cag gag tca aac gcg cag gcc aac gtc        528
Glu Leu Lys Glu Val Ser Ser Gln Glu Ser Asn Ala Gln Ala Asn Val           
                165                 170                 175               

ggc agc caa gag cct gca gac cgg gga cgc tcg gca tgg ccg ctc gca        576
Gly Ser Gln Glu Pro Ala Asp Arg Gly Arg Ser Ala Trp Pro Leu Ala           
            180                 185                 190                   

aag tgc aac act aac act tcc aac aac acc gaa gag gaa aag aaa acc        624
Lys Cys Asn Thr Asn Thr Ser Asn Asn Thr Glu Glu Glu Lys Lys Thr           
        195                 200                 205                       

aag aag aag gat gca att gtg gtg gac cct tcc tcc aac ctg tac tac        672
Lys Lys Lys Asp Ala Ile Val Val Asp Pro Ser Ser Asn Leu Tyr Tyr           
    210                 215                 220                           

cgc tgg ttg acc gcc atc gcc ctc ccg gtc ttt tac aat tgg tat ctc        720
Arg Trp Leu Thr Ala Ile Ala Leu Pro Val Phe Tyr Asn Trp Tyr Leu           
225                 230                 235                 240           

ctt atc tgc cgg gcc tgc ttc gac gaa ctg caa tca gag tac ctg atg        768
Leu Ile Cys Arg Ala Cys Phe Asp Glu Leu Gln Ser Glu Tyr Leu Met           
                245                 250                 255               

ctg tgg ctg gtg ctg gac tat agc gcc gat gtg ctc tac gtc ctg gat        816
Leu Trp Leu Val Leu Asp Tyr Ser Ala Asp Val Leu Tyr Val Leu Asp           
            260                 265                 270                   

gtg ctc gtg cgc gcc cgg acc gga ttc ttg gaa caa ggc ctg atg gtg        864
Val Leu Val Arg Ala Arg Thr Gly Phe Leu Glu Gln Gly Leu Met Val           
        275                 280                 285                       

tcc gac acg aat aga ctg tgg cag cac tat aag acc aca acc cag ttc        912
Ser Asp Thr Asn Arg Leu Trp Gln His Tyr Lys Thr Thr Thr Gln Phe           
    290                 295                 300                           

aag ctt gac gtg ctc agc ctt gtg ccg act gac ctg gcc tac ctg aaa        960
Lys Leu Asp Val Leu Ser Leu Val Pro Thr Asp Leu Ala Tyr Leu Lys           
305                 310                 315                 320           

gtc gga act aac tac ccg gaa gtc aga ttc aac cga ctc ctg aag ttc       1008
Val Gly Thr Asn Tyr Pro Glu Val Arg Phe Asn Arg Leu Leu Lys Phe           
                325                 330                 335               

agc agg ctg ttc gag ttc ttt gac cgc acc gag act cgg acc aac tac       1056
Ser Arg Leu Phe Glu Phe Phe Asp Arg Thr Glu Thr Arg Thr Asn Tyr           
            340                 345                 350                   

cct aac atg ttc cgg atc gga aat ctg gtg ctc tac ata ctg att atc       1104
Pro Asn Met Phe Arg Ile Gly Asn Leu Val Leu Tyr Ile Leu Ile Ile           
        355                 360                 365                       

atc cat tgg aac gcc tgt atc tat ttc gcc att tcg aag ttc atc ggt       1152
Ile His Trp Asn Ala Cys Ile Tyr Phe Ala Ile Ser Lys Phe Ile Gly           
    370                 375                 380                           

ttc gga acc gat tcc tgg gtg tac ccc aac atc tcg atc ccc gaa cac       1200
Phe Gly Thr Asp Ser Trp Val Tyr Pro Asn Ile Ser Ile Pro Glu His           
385                 390                 395                 400           

ggt cgc ctg tcc cgg aag tac atc tac tcc ctg tac tgg tcc act ctg       1248
Gly Arg Leu Ser Arg Lys Tyr Ile Tyr Ser Leu Tyr Trp Ser Thr Leu           
                405                 410                 415               

act ctg acc acg atc ggg gaa acc cct cca ccc gtg aag gac gaa gag       1296
Thr Leu Thr Thr Ile Gly Glu Thr Pro Pro Pro Val Lys Asp Glu Glu           
            420                 425                 430                   

tac ctg ttc gtg gtg gtg gac ttc ctg gtc gga gtg ttg att ttc gcc       1344
Tyr Leu Phe Val Val Val Asp Phe Leu Val Gly Val Leu Ile Phe Ala           
        435                 440                 445                       

acc att gtg gga aac gtg ggc tcc atg atc tcc aac atg aac gcg tcg       1392
Thr Ile Val Gly Asn Val Gly Ser Met Ile Ser Asn Met Asn Ala Ser           
    450                 455                 460                           

aga gct gag ttc caa gcc aag atc gac tcc att aag cag tac atg cag       1440
Arg Ala Glu Phe Gln Ala Lys Ile Asp Ser Ile Lys Gln Tyr Met Gln           
465                 470                 475                 480           

ttc aga aag gtc acc aag gac ctg gaa acc agg gtc atc cgc tgg ttc       1488
Phe Arg Lys Val Thr Lys Asp Leu Glu Thr Arg Val Ile Arg Trp Phe           
                485                 490                 495               

gac tac ctg tgg gcc aac aaa aag act gtg gac gaa aag gaa gtg ctg       1536
Asp Tyr Leu Trp Ala Asn Lys Lys Thr Val Asp Glu Lys Glu Val Leu           
            500                 505                 510                   

aag tcg ctg ccg gat aag ctg aag gcc gaa atc gcc att aac gtg cac       1584
Lys Ser Leu Pro Asp Lys Leu Lys Ala Glu Ile Ala Ile Asn Val His           
        515                 520                 525                       

ctt gac acc ctg aag aaa gtc cgg atc ttc caa gac tgt gaa gcc ggc       1632
Leu Asp Thr Leu Lys Lys Val Arg Ile Phe Gln Asp Cys Glu Ala Gly           
    530                 535                 540                           

ctc ctg gtg gag ctc gtg ctc aag ctg cgg ccc acc gtg ttc agc ccg       1680
Leu Leu Val Glu Leu Val Leu Lys Leu Arg Pro Thr Val Phe Ser Pro           
545                 550                 555                 560           

gga gat tac att tgc aag aag ggc gat atc ggc aaa gag atg tac atc       1728
Gly Asp Tyr Ile Cys Lys Lys Gly Asp Ile Gly Lys Glu Met Tyr Ile           
                565                 570                 575               

atc aac gag gga aag ctg gcc gtg gtc gcg gac gac ggc gtg acc cag       1776
Ile Asn Glu Gly Lys Leu Ala Val Val Ala Asp Asp Gly Val Thr Gln           
            580                 585                 590                   

ttc gtg gtg ctg tcc gac gga tcc tac ttc ggt gaa atc tca atc ctc       1824
Phe Val Val Leu Ser Asp Gly Ser Tyr Phe Gly Glu Ile Ser Ile Leu           
        595                 600                 605                       

aac atc aag ggg tcc aag tcc ggc aac cgg aga act gcc aac att cgc       1872
Asn Ile Lys Gly Ser Lys Ser Gly Asn Arg Arg Thr Ala Asn Ile Arg           
    610                 615                 620                           

tcc atc gga tac agc gac ctg ttt tgc ctg tcc aag gat gac ctg atg       1920
Ser Ile Gly Tyr Ser Asp Leu Phe Cys Leu Ser Lys Asp Asp Leu Met           
625                 630                 635                 640           

gag gct ctg act gag tac cct gaa gcg aag aag gct ttg gag gaa aag       1968
Glu Ala Leu Thr Glu Tyr Pro Glu Ala Lys Lys Ala Leu Glu Glu Lys           
                645                 650                 655               

ggg cgg cag att ctg atg aag gac aat ttg atc gac gag gag ctc gca       2016
Gly Arg Gln Ile Leu Met Lys Asp Asn Leu Ile Asp Glu Glu Leu Ala           
            660                 665                 670                   

cgg gcc ggc gcc gac ccc aag gat ctc gaa gag aag gtc gaa cag ctg       2064
Arg Ala Gly Ala Asp Pro Lys Asp Leu Glu Glu Lys Val Glu Gln Leu           
        675                 680                 685                       

ggt tct tcg ctt gat acc ctg caa acc cga ttc gcg cgg ctg ctc gcc       2112
Gly Ser Ser Leu Asp Thr Leu Gln Thr Arg Phe Ala Arg Leu Leu Ala           
    690                 695                 700                           

gag tac aac gcg acc cag atg aag atg aag cag aga ctg tca cag ttg       2160
Glu Tyr Asn Ala Thr Gln Met Lys Met Lys Gln Arg Leu Ser Gln Leu           
705                 710                 715                 720           

gaa tcc caa gtc aag ggc gga ggc gac aag ccg ctg gcg gac ggg gaa       2208
Glu Ser Gln Val Lys Gly Gly Gly Asp Lys Pro Leu Ala Asp Gly Glu           
                725                 730                 735               

gtg ccc ggg gac gcc acc aag act gag gac aag cag cag tga               2250
Val Pro Gly Asp Ala Thr Lys Thr Glu Asp Lys Gln Gln                       
            740                 745                                       


<210>  12
<211>  749
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  12

Met Ala Lys Ile Asn Thr Gln Tyr Ser His Pro Ser Arg Thr His Leu 
1               5                   10                  15      


Lys Val Lys Thr Ser Asp Arg Asp Leu Asn Arg Ala Glu Asn Gly Leu 
            20                  25                  30          


Ser Arg Ala His Ser Ser Ser Glu Glu Thr Ser Ser Val Leu Gln Pro 
        35                  40                  45              


Gly Ile Ala Met Glu Thr Arg Gly Leu Ala Asp Ser Gly Gln Gly Ser 
    50                  55                  60                  


Phe Thr Gly Gln Gly Ile Ala Arg Phe Gly Arg Ile Gln Lys Lys Ser 
65                  70                  75                  80  


Gln Pro Glu Lys Val Val Arg Ala Ala Ser Arg Gly Arg Pro Leu Ile 
                85                  90                  95      


Gly Trp Thr Gln Trp Cys Ala Glu Asp Gly Gly Asp Glu Ser Glu Met 
            100                 105                 110         


Ala Leu Ala Gly Ser Pro Gly Cys Ser Ser Gly Pro Gln Gly Arg Leu 
        115                 120                 125             


Ser Arg Leu Ile Phe Leu Leu Arg Arg Trp Ala Ala Arg His Val His 
    130                 135                 140                 


His Gln Asp Gln Gly Pro Asp Ser Phe Pro Asp Arg Phe Arg Gly Ala 
145                 150                 155                 160 


Glu Leu Lys Glu Val Ser Ser Gln Glu Ser Asn Ala Gln Ala Asn Val 
                165                 170                 175     


Gly Ser Gln Glu Pro Ala Asp Arg Gly Arg Ser Ala Trp Pro Leu Ala 
            180                 185                 190         


Lys Cys Asn Thr Asn Thr Ser Asn Asn Thr Glu Glu Glu Lys Lys Thr 
        195                 200                 205             


Lys Lys Lys Asp Ala Ile Val Val Asp Pro Ser Ser Asn Leu Tyr Tyr 
    210                 215                 220                 


Arg Trp Leu Thr Ala Ile Ala Leu Pro Val Phe Tyr Asn Trp Tyr Leu 
225                 230                 235                 240 


Leu Ile Cys Arg Ala Cys Phe Asp Glu Leu Gln Ser Glu Tyr Leu Met 
                245                 250                 255     


Leu Trp Leu Val Leu Asp Tyr Ser Ala Asp Val Leu Tyr Val Leu Asp 
            260                 265                 270         


Val Leu Val Arg Ala Arg Thr Gly Phe Leu Glu Gln Gly Leu Met Val 
        275                 280                 285             


Ser Asp Thr Asn Arg Leu Trp Gln His Tyr Lys Thr Thr Thr Gln Phe 
    290                 295                 300                 


Lys Leu Asp Val Leu Ser Leu Val Pro Thr Asp Leu Ala Tyr Leu Lys 
305                 310                 315                 320 


Val Gly Thr Asn Tyr Pro Glu Val Arg Phe Asn Arg Leu Leu Lys Phe 
                325                 330                 335     


Ser Arg Leu Phe Glu Phe Phe Asp Arg Thr Glu Thr Arg Thr Asn Tyr 
            340                 345                 350         


Pro Asn Met Phe Arg Ile Gly Asn Leu Val Leu Tyr Ile Leu Ile Ile 
        355                 360                 365             


Ile His Trp Asn Ala Cys Ile Tyr Phe Ala Ile Ser Lys Phe Ile Gly 
    370                 375                 380                 


Phe Gly Thr Asp Ser Trp Val Tyr Pro Asn Ile Ser Ile Pro Glu His 
385                 390                 395                 400 


Gly Arg Leu Ser Arg Lys Tyr Ile Tyr Ser Leu Tyr Trp Ser Thr Leu 
                405                 410                 415     


Thr Leu Thr Thr Ile Gly Glu Thr Pro Pro Pro Val Lys Asp Glu Glu 
            420                 425                 430         


Tyr Leu Phe Val Val Val Asp Phe Leu Val Gly Val Leu Ile Phe Ala 
        435                 440                 445             


Thr Ile Val Gly Asn Val Gly Ser Met Ile Ser Asn Met Asn Ala Ser 
    450                 455                 460                 


Arg Ala Glu Phe Gln Ala Lys Ile Asp Ser Ile Lys Gln Tyr Met Gln 
465                 470                 475                 480 


Phe Arg Lys Val Thr Lys Asp Leu Glu Thr Arg Val Ile Arg Trp Phe 
                485                 490                 495     


Asp Tyr Leu Trp Ala Asn Lys Lys Thr Val Asp Glu Lys Glu Val Leu 
            500                 505                 510         


Lys Ser Leu Pro Asp Lys Leu Lys Ala Glu Ile Ala Ile Asn Val His 
        515                 520                 525             


Leu Asp Thr Leu Lys Lys Val Arg Ile Phe Gln Asp Cys Glu Ala Gly 
    530                 535                 540                 


Leu Leu Val Glu Leu Val Leu Lys Leu Arg Pro Thr Val Phe Ser Pro 
545                 550                 555                 560 


Gly Asp Tyr Ile Cys Lys Lys Gly Asp Ile Gly Lys Glu Met Tyr Ile 
                565                 570                 575     


Ile Asn Glu Gly Lys Leu Ala Val Val Ala Asp Asp Gly Val Thr Gln 
            580                 585                 590         


Phe Val Val Leu Ser Asp Gly Ser Tyr Phe Gly Glu Ile Ser Ile Leu 
        595                 600                 605             


Asn Ile Lys Gly Ser Lys Ser Gly Asn Arg Arg Thr Ala Asn Ile Arg 
    610                 615                 620                 


Ser Ile Gly Tyr Ser Asp Leu Phe Cys Leu Ser Lys Asp Asp Leu Met 
625                 630                 635                 640 


Glu Ala Leu Thr Glu Tyr Pro Glu Ala Lys Lys Ala Leu Glu Glu Lys 
                645                 650                 655     


Gly Arg Gln Ile Leu Met Lys Asp Asn Leu Ile Asp Glu Glu Leu Ala 
            660                 665                 670         


Arg Ala Gly Ala Asp Pro Lys Asp Leu Glu Glu Lys Val Glu Gln Leu 
        675                 680                 685             


Gly Ser Ser Leu Asp Thr Leu Gln Thr Arg Phe Ala Arg Leu Leu Ala 
    690                 695                 700                 


Glu Tyr Asn Ala Thr Gln Met Lys Met Lys Gln Arg Leu Ser Gln Leu 
705                 710                 715                 720 


Glu Ser Gln Val Lys Gly Gly Gly Asp Lys Pro Leu Ala Asp Gly Glu 
                725                 730                 735     


Val Pro Gly Asp Ala Thr Lys Thr Glu Asp Lys Gln Gln 
            740                 745                 


<210>  13
<211>  2085
<212>  DNA
<213>  Homo sapiens


<220>
<221>  CDS
<222>  (1)..(2085)
<223>  native open reading frame (ORF)

<400>  13
atg gcc aag atc aac acc caa tac tcc cac ccc tcc agg acc cac ctc         48
Met Ala Lys Ile Asn Thr Gln Tyr Ser His Pro Ser Arg Thr His Leu           
1               5                   10                  15                

aag gta aag acc tca gac cgg gat ctc aat cgc gct gaa aat ggc ctc         96
Lys Val Lys Thr Ser Asp Arg Asp Leu Asn Arg Ala Glu Asn Gly Leu           
            20                  25                  30                    

agc aga gcc cac tcg tca agt gag gag aca tcg tca gtg ctg cag ccg        144
Ser Arg Ala His Ser Ser Ser Glu Glu Thr Ser Ser Val Leu Gln Pro           
        35                  40                  45                        

ggg atc gcc atg gag acc aga gga ctg gct gac tcc ggg cag ggc tcc        192
Gly Ile Ala Met Glu Thr Arg Gly Leu Ala Asp Ser Gly Gln Gly Ser           
    50                  55                  60                            

ttc acc ggc cag ggg atc gcc agg ctg tcg cgc ctc atc ttc ttg ctg        240
Phe Thr Gly Gln Gly Ile Ala Arg Leu Ser Arg Leu Ile Phe Leu Leu           
65                  70                  75                  80            

cgc agg tgg gct gcc agg cat gtg cac cac cag gac cag gga ccg gac        288
Arg Arg Trp Ala Ala Arg His Val His His Gln Asp Gln Gly Pro Asp           
                85                  90                  95                

tct ttt cct gat cgt ttc cgt gga gcc gag ctt aag gag gtg tcc agc        336
Ser Phe Pro Asp Arg Phe Arg Gly Ala Glu Leu Lys Glu Val Ser Ser           
            100                 105                 110                   

caa gaa agc aat gcc cag gca aat gtg ggc agc cag gag cca gca gac        384
Gln Glu Ser Asn Ala Gln Ala Asn Val Gly Ser Gln Glu Pro Ala Asp           
        115                 120                 125                       

aga ggg aga agc gcc tgg ccc ctg gcc aaa tgc aac act aac acc agc        432
Arg Gly Arg Ser Ala Trp Pro Leu Ala Lys Cys Asn Thr Asn Thr Ser           
    130                 135                 140                           

aac aac acg gag gag gag aag aag acg aaa aag aag gat gcg atc gtg        480
Asn Asn Thr Glu Glu Glu Lys Lys Thr Lys Lys Lys Asp Ala Ile Val           
145                 150                 155                 160           

gtg gac ccg tcc agc aac ctg tac tac cgc tgg ctg acc gcc atc gcc        528
Val Asp Pro Ser Ser Asn Leu Tyr Tyr Arg Trp Leu Thr Ala Ile Ala           
                165                 170                 175               

ctg cct gtc ttc tat aac tgg tat ctg ctt att tgc agg gcc tgt ttc        576
Leu Pro Val Phe Tyr Asn Trp Tyr Leu Leu Ile Cys Arg Ala Cys Phe           
            180                 185                 190                   

gat gag ctg cag tcc gag tac ctg atg ctg tgg ctg gtc ctg gac tac        624
Asp Glu Leu Gln Ser Glu Tyr Leu Met Leu Trp Leu Val Leu Asp Tyr           
        195                 200                 205                       

tcg gca gat gtc ctg tat gtc ttg gat gtg ctt gta cga gct cgg aca        672
Ser Ala Asp Val Leu Tyr Val Leu Asp Val Leu Val Arg Ala Arg Thr           
    210                 215                 220                           

ggt ttt ctt gag caa ggc tta atg gtc agt gat acc aac agg ctg tgg        720
Gly Phe Leu Glu Gln Gly Leu Met Val Ser Asp Thr Asn Arg Leu Trp           
225                 230                 235                 240           

cag cat tac aag acg acc acg cag ttc aag ctg gat gtg ttg tcc ctg        768
Gln His Tyr Lys Thr Thr Thr Gln Phe Lys Leu Asp Val Leu Ser Leu           
                245                 250                 255               

gtc ccc acc gac ctg gct tac tta aag gtg ggc aca aac tac cca gaa        816
Val Pro Thr Asp Leu Ala Tyr Leu Lys Val Gly Thr Asn Tyr Pro Glu           
            260                 265                 270                   

gtg agg ttc aac cgc cta ctg aag ttt tcc cgg ctc ttt gaa ttc ttt        864
Val Arg Phe Asn Arg Leu Leu Lys Phe Ser Arg Leu Phe Glu Phe Phe           
        275                 280                 285                       

gac cgc aca gag aca agg acc aac tac ccc aat atg ttc agg att ggg        912
Asp Arg Thr Glu Thr Arg Thr Asn Tyr Pro Asn Met Phe Arg Ile Gly           
    290                 295                 300                           

aac ttg gtc ttg tac att ctc atc atc atc cac tgg aat gcc tgc atc        960
Asn Leu Val Leu Tyr Ile Leu Ile Ile Ile His Trp Asn Ala Cys Ile           
305                 310                 315                 320           

tac ttt gcc att tcc aag ttc att ggt ttt ggg aca gac tcc tgg gtc       1008
Tyr Phe Ala Ile Ser Lys Phe Ile Gly Phe Gly Thr Asp Ser Trp Val           
                325                 330                 335               

tac cca aac atc tca atc cca gag cat ggg cgc ctc tcc agg aag tac       1056
Tyr Pro Asn Ile Ser Ile Pro Glu His Gly Arg Leu Ser Arg Lys Tyr           
            340                 345                 350                   

att tac agt ctc tac tgg tcc acc ttg acc ctt acc acc att ggt gag       1104
Ile Tyr Ser Leu Tyr Trp Ser Thr Leu Thr Leu Thr Thr Ile Gly Glu           
        355                 360                 365                       

acc cca ccc ccc gtg aaa gat gag gag tat ctc ttt gtg gtc gta gac       1152
Thr Pro Pro Pro Val Lys Asp Glu Glu Tyr Leu Phe Val Val Val Asp           
    370                 375                 380                           

ttc ttg gtg ggt gtt ctg att ttt gcc acc att gtg ggc aat gtg ggc       1200
Phe Leu Val Gly Val Leu Ile Phe Ala Thr Ile Val Gly Asn Val Gly           
385                 390                 395                 400           

tcc atg atc tcg aat atg aat gcc tca cgg gca gag ttc cag gcc aag       1248
Ser Met Ile Ser Asn Met Asn Ala Ser Arg Ala Glu Phe Gln Ala Lys           
                405                 410                 415               

att gat tcc atc aag cag tac atg cag ttc cgc aag gtc acc aag gac       1296
Ile Asp Ser Ile Lys Gln Tyr Met Gln Phe Arg Lys Val Thr Lys Asp           
            420                 425                 430                   

ttg gag acg cgg gtt atc cgg tgg ttt gac tac ctg tgg gcc aac aag       1344
Leu Glu Thr Arg Val Ile Arg Trp Phe Asp Tyr Leu Trp Ala Asn Lys           
        435                 440                 445                       

aag acg gtg gat gag aag gag gtg ctc aag agc ctc cca gac aag ctg       1392
Lys Thr Val Asp Glu Lys Glu Val Leu Lys Ser Leu Pro Asp Lys Leu           
    450                 455                 460                           

aag gct gag atc gcc atc aac gtg cac ctg gac acg ctg aag aag gtt       1440
Lys Ala Glu Ile Ala Ile Asn Val His Leu Asp Thr Leu Lys Lys Val           
465                 470                 475                 480           

cgc atc ttc cag gac tgt gag gca ggg ctg ctg gtg gag ctg gtg ctg       1488
Arg Ile Phe Gln Asp Cys Glu Ala Gly Leu Leu Val Glu Leu Val Leu           
                485                 490                 495               

aag ctg cga ccc act gtg ttc agc cct ggg gat tat atc tgc aag aag       1536
Lys Leu Arg Pro Thr Val Phe Ser Pro Gly Asp Tyr Ile Cys Lys Lys           
            500                 505                 510                   

gga gat att ggg aag gag atg tac atc atc aac gag ggc aag ctg gcc       1584
Gly Asp Ile Gly Lys Glu Met Tyr Ile Ile Asn Glu Gly Lys Leu Ala           
        515                 520                 525                       

gtg gtg gct gat gat ggg gtc acc cag ttc gtg gtc ctc agc gat ggc       1632
Val Val Ala Asp Asp Gly Val Thr Gln Phe Val Val Leu Ser Asp Gly           
    530                 535                 540                           

agc tac ttc ggg gag atc agc att ctg aac atc aag ggg agc aag tcg       1680
Ser Tyr Phe Gly Glu Ile Ser Ile Leu Asn Ile Lys Gly Ser Lys Ser           
545                 550                 555                 560           

ggg aac cgc agg acg gcc aac atc cgc agc att ggc tac tca gac ctg       1728
Gly Asn Arg Arg Thr Ala Asn Ile Arg Ser Ile Gly Tyr Ser Asp Leu           
                565                 570                 575               

ttc tgc ctc tca aag gac gat ctc atg gag gcc ctc acc gag tac ccc       1776
Phe Cys Leu Ser Lys Asp Asp Leu Met Glu Ala Leu Thr Glu Tyr Pro           
            580                 585                 590                   

gaa gcc aag aag gcc ctg gag gag aaa gga cgg cag atc ctg atg aaa       1824
Glu Ala Lys Lys Ala Leu Glu Glu Lys Gly Arg Gln Ile Leu Met Lys           
        595                 600                 605                       

gac aac ctg atc gat gag gag ctg gcc agg gcg ggc gcg gac ccc aag       1872
Asp Asn Leu Ile Asp Glu Glu Leu Ala Arg Ala Gly Ala Asp Pro Lys           
    610                 615                 620                           

gac ctt gag gag aaa gtg gag cag ctg ggg tcc tcc ctg gac acc ctg       1920
Asp Leu Glu Glu Lys Val Glu Gln Leu Gly Ser Ser Leu Asp Thr Leu           
625                 630                 635                 640           

cag acc agg ttt gca cgc ctc ctg gct gag tac aac gcc acc cag atg       1968
Gln Thr Arg Phe Ala Arg Leu Leu Ala Glu Tyr Asn Ala Thr Gln Met           
                645                 650                 655               

aag atg aag cag cgt ctc agc caa ctg gaa agc cag gtg aag ggt ggt       2016
Lys Met Lys Gln Arg Leu Ser Gln Leu Glu Ser Gln Val Lys Gly Gly           
            660                 665                 670                   

ggg gac aag ccc ctg gct gat ggg gaa gtt ccc ggg gat gct aca aaa       2064
Gly Asp Lys Pro Leu Ala Asp Gly Glu Val Pro Gly Asp Ala Thr Lys           
        675                 680                 685                       

aca gag gac aaa caa cag tga                                           2085
Thr Glu Asp Lys Gln Gln                                                   
    690                                                                   


<210>  14
<211>  694
<212>  PRT
<213>  Homo sapiens

<400>  14

Met Ala Lys Ile Asn Thr Gln Tyr Ser His Pro Ser Arg Thr His Leu 
1               5                   10                  15      


Lys Val Lys Thr Ser Asp Arg Asp Leu Asn Arg Ala Glu Asn Gly Leu 
            20                  25                  30          


Ser Arg Ala His Ser Ser Ser Glu Glu Thr Ser Ser Val Leu Gln Pro 
        35                  40                  45              


Gly Ile Ala Met Glu Thr Arg Gly Leu Ala Asp Ser Gly Gln Gly Ser 
    50                  55                  60                  


Phe Thr Gly Gln Gly Ile Ala Arg Leu Ser Arg Leu Ile Phe Leu Leu 
65                  70                  75                  80  


Arg Arg Trp Ala Ala Arg His Val His His Gln Asp Gln Gly Pro Asp 
                85                  90                  95      


Ser Phe Pro Asp Arg Phe Arg Gly Ala Glu Leu Lys Glu Val Ser Ser 
            100                 105                 110         


Gln Glu Ser Asn Ala Gln Ala Asn Val Gly Ser Gln Glu Pro Ala Asp 
        115                 120                 125             


Arg Gly Arg Ser Ala Trp Pro Leu Ala Lys Cys Asn Thr Asn Thr Ser 
    130                 135                 140                 


Asn Asn Thr Glu Glu Glu Lys Lys Thr Lys Lys Lys Asp Ala Ile Val 
145                 150                 155                 160 


Val Asp Pro Ser Ser Asn Leu Tyr Tyr Arg Trp Leu Thr Ala Ile Ala 
                165                 170                 175     


Leu Pro Val Phe Tyr Asn Trp Tyr Leu Leu Ile Cys Arg Ala Cys Phe 
            180                 185                 190         


Asp Glu Leu Gln Ser Glu Tyr Leu Met Leu Trp Leu Val Leu Asp Tyr 
        195                 200                 205             


Ser Ala Asp Val Leu Tyr Val Leu Asp Val Leu Val Arg Ala Arg Thr 
    210                 215                 220                 


Gly Phe Leu Glu Gln Gly Leu Met Val Ser Asp Thr Asn Arg Leu Trp 
225                 230                 235                 240 


Gln His Tyr Lys Thr Thr Thr Gln Phe Lys Leu Asp Val Leu Ser Leu 
                245                 250                 255     


Val Pro Thr Asp Leu Ala Tyr Leu Lys Val Gly Thr Asn Tyr Pro Glu 
            260                 265                 270         


Val Arg Phe Asn Arg Leu Leu Lys Phe Ser Arg Leu Phe Glu Phe Phe 
        275                 280                 285             


Asp Arg Thr Glu Thr Arg Thr Asn Tyr Pro Asn Met Phe Arg Ile Gly 
    290                 295                 300                 


Asn Leu Val Leu Tyr Ile Leu Ile Ile Ile His Trp Asn Ala Cys Ile 
305                 310                 315                 320 


Tyr Phe Ala Ile Ser Lys Phe Ile Gly Phe Gly Thr Asp Ser Trp Val 
                325                 330                 335     


Tyr Pro Asn Ile Ser Ile Pro Glu His Gly Arg Leu Ser Arg Lys Tyr 
            340                 345                 350         


Ile Tyr Ser Leu Tyr Trp Ser Thr Leu Thr Leu Thr Thr Ile Gly Glu 
        355                 360                 365             


Thr Pro Pro Pro Val Lys Asp Glu Glu Tyr Leu Phe Val Val Val Asp 
    370                 375                 380                 


Phe Leu Val Gly Val Leu Ile Phe Ala Thr Ile Val Gly Asn Val Gly 
385                 390                 395                 400 


Ser Met Ile Ser Asn Met Asn Ala Ser Arg Ala Glu Phe Gln Ala Lys 
                405                 410                 415     


Ile Asp Ser Ile Lys Gln Tyr Met Gln Phe Arg Lys Val Thr Lys Asp 
            420                 425                 430         


Leu Glu Thr Arg Val Ile Arg Trp Phe Asp Tyr Leu Trp Ala Asn Lys 
        435                 440                 445             


Lys Thr Val Asp Glu Lys Glu Val Leu Lys Ser Leu Pro Asp Lys Leu 
    450                 455                 460                 


Lys Ala Glu Ile Ala Ile Asn Val His Leu Asp Thr Leu Lys Lys Val 
465                 470                 475                 480 


Arg Ile Phe Gln Asp Cys Glu Ala Gly Leu Leu Val Glu Leu Val Leu 
                485                 490                 495     


Lys Leu Arg Pro Thr Val Phe Ser Pro Gly Asp Tyr Ile Cys Lys Lys 
            500                 505                 510         


Gly Asp Ile Gly Lys Glu Met Tyr Ile Ile Asn Glu Gly Lys Leu Ala 
        515                 520                 525             


Val Val Ala Asp Asp Gly Val Thr Gln Phe Val Val Leu Ser Asp Gly 
    530                 535                 540                 


Ser Tyr Phe Gly Glu Ile Ser Ile Leu Asn Ile Lys Gly Ser Lys Ser 
545                 550                 555                 560 


Gly Asn Arg Arg Thr Ala Asn Ile Arg Ser Ile Gly Tyr Ser Asp Leu 
                565                 570                 575     


Phe Cys Leu Ser Lys Asp Asp Leu Met Glu Ala Leu Thr Glu Tyr Pro 
            580                 585                 590         


Glu Ala Lys Lys Ala Leu Glu Glu Lys Gly Arg Gln Ile Leu Met Lys 
        595                 600                 605             


Asp Asn Leu Ile Asp Glu Glu Leu Ala Arg Ala Gly Ala Asp Pro Lys 
    610                 615                 620                 


Asp Leu Glu Glu Lys Val Glu Gln Leu Gly Ser Ser Leu Asp Thr Leu 
625                 630                 635                 640 


Gln Thr Arg Phe Ala Arg Leu Leu Ala Glu Tyr Asn Ala Thr Gln Met 
                645                 650                 655     


Lys Met Lys Gln Arg Leu Ser Gln Leu Glu Ser Gln Val Lys Gly Gly 
            660                 665                 670         


Gly Asp Lys Pro Leu Ala Asp Gly Glu Val Pro Gly Asp Ala Thr Lys 
        675                 680                 685             


Thr Glu Asp Lys Gln Gln 
    690                 


<210>  15
<211>  2085
<212>  DNA
<213>  Homo sapiens

<400>  15
atggccaaga tcaacaccca atactcccac ccctccagga cccacctcaa ggtaaagacc       60

tcagaccgag atctcaatcg cgctgaaaat ggcctcagca gagcccactc gtcaagtgag      120

gagacatcgt cagtgctgca gccggggatc gccatggaga ccagaggact ggctgactcc      180

gggcagggct ccttcaccgg ccaggggatc gccaggctgt cgcgcctcat cttcttgctg      240

cgcaggtggg ctgccaggca tgtgcaccac caggaccagg gaccggactc ttttcctgat      300

cgtttccgtg gagccgagct taaggaggtg tccagccaag aaagcaatgc ccaggcaaat      360

gtgggcagcc aggagccagc agacagaggg agaagcgcct ggcccctggc caaatgcaac      420

actaacacca gcaacaacac ggaggaggag aagaagacga aaaagaagga tgcgatcgtg      480

gtggacccgt ccagcaacct gtactaccgc tggctgaccg ccatcgccct gcctgtcttc      540

tataactggt atctgcttat ttgcagggcc tgtttcgatg agctgcagtc cgagtacctg      600

atgctgtggc tggtcctgga ctactcggca gatgtcctgt atgtcttgga tgtgcttgta      660

cgagctcgga caggttttct cgagcaaggc ttaatggtca gtgataccaa caggctgtgg      720

cagcattaca agacgaccac gcagttcaag ctggatgtgt tgtccctggt ccccaccgac      780

ctggcttact taaaggtggg cacaaactac ccagaagtga ggttcaaccg cctactgaag      840

ttttcccggc tctttgaatt ctttgaccgc acagagacaa ggaccaacta ccccaatatg      900

ttcaggattg ggaacttggt cttgtacatt ctcatcatca tccactggaa tgcctgcatc      960

tactttgcca tttccaagtt cattggtttt gggacagact cctgggtcta cccaaacatc     1020

tcaatcccag agcatgggcg cctctccagg aagtacattt acagtctcta ctggtccacc     1080

ttgaccctta ccaccattgg tgagacccca ccccccgtga aagatgagga gtatctcttt     1140

gtggtcgtag acttcttggt gggtgttctg atttttgcca ccattgtggg caatgtgggc     1200

tccatgatct cgaatatgaa tgcctcacgg gcagagttcc aggccaagat tgattccatc     1260

aagcagtaca tgcagttccg caaggtcacc aaggacttgg agacgcgggt tatccggtgg     1320

tttgactacc tgtgggccaa caagaagacg gtggatgaga aggaggtgct caagagcctc     1380

ccagacaagc tgaaggctga gatcgccatc aacgtgcacc tggacacgct gaagaaggtt     1440

cgcatcttcc aggactgtga ggcagggctg ctggtggagc tggtgctgaa gctgcgaccc     1500

actgtgttca gccctgggga ttatatctgc aagaagggag atattgggaa ggagatgtac     1560

atcatcaacg agggcaagct ggccgtggtg gctgatgatg gggtcaccca gttcgtggtc     1620

ctcagcgatg gcagctactt cggggagatc agcattctga acatcaaggg gagcaagtcg     1680

gggaaccgca ggacggccaa catccgcagc attggctact cagacctgtt ctgcctctca     1740

aaggacgatc tcatggaggc cctcaccgag taccccgaag ccaagaaggc cctggaggag     1800

aaaggacggc agatcctgat gaaagacaac ctgatcgatg aggagctggc cagggcgggc     1860

gcggacccca aggaccttga ggagaaagtg gagcagctgg ggtcctccct ggacaccctg     1920

cagaccaggt ttgcacgcct cctggctgag tacaacgcca cccagatgaa gatgaagcag     1980

cgtctcagcc aactggaaag ccaggtgaag ggtggtgggg acaagcccct ggctgatggg     2040

gaagttcccg gggatgctac aaaaacagag gacaaacaac agtga                     2085


<210>  16
<211>  2107
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  16
gcggccgcca ccatggctaa gattaacacc cagtactcac atccatcccg cactcacctc       60

aaagtcaaga cctccgatcg ggatctgaac cgggctgaga atgggctgtc gcgcgcccac      120

tcgtcgtccg aggaaaccag cagcgtgctc cagccgggca tcgccatgga aactaggggg      180

ctggcggact ccggacaggg atccttcact ggacagggta ttgcccggct gagcagactg      240

atcttcctgc ttcgccgctg ggcggccaga cacgtgcacc atcaggacca gggacctgat      300

agcttccccg accgctttag gggagccgag ctgaaagaag tgtcaagcca ggagtcaaac      360

gcgcaggcca acgtcggcag ccaagagcct gcagaccggg gacgctcggc atggccgctc      420

gcaaagtgca acactaacac ttccaacaac accgaagagg aaaagaaaac caagaagaag      480

gatgcaattg tggtggaccc ttcctccaac ctgtactacc gctggttgac cgccatcgcc      540

ctcccggtct tttacaattg gtatctcctt atctgccggg cctgcttcga cgaactgcaa      600

tcagagtacc tgatgctgtg gctggtgctg gactatagcg ccgatgtgct ctacgtcctg      660

gatgtgctcg tgcgcgcccg gaccggattc ttggaacaag gcctgatggt gtccgacacg      720

aatagactgt ggcagcacta taagaccaca acccagttca agcttgacgt gctcagcctt      780

gtgccgactg acctggccta cctgaaagtc ggaactaact acccggaagt cagattcaac      840

cgactcctga agttcagcag gctgttcgag ttctttgacc gcaccgagac tcggaccaac      900

taccctaaca tgttccggat cggaaatctg gtgctctaca tactgattat catccattgg      960

aacgcctgta tctatttcgc catttcgaag ttcatcggtt tcggaaccga ttcctgggtg     1020

taccccaaca tctcgatccc cgaacacggt cgcctgtccc ggaagtacat ctactccctg     1080

tactggtcca ctctgactct gaccacgatc ggggaaaccc ctccacccgt gaaggacgaa     1140

gagtacctgt tcgtggtggt ggacttcctg gtcggagtgt tgattttcgc caccattgtg     1200

ggaaacgtgg gctccatgat ctccaacatg aacgcgtcga gagctgagtt ccaagccaag     1260

atcgactcca ttaagcagta catgcagttc agaaaggtca ccaaggacct ggaaaccagg     1320

gtcatccgct ggttcgacta cctgtgggcc aacaaaaaga ctgtggacga aaaggaagtg     1380

ctgaagtcgc tgccggataa gctgaaggcc gaaatcgcca ttaacgtgca ccttgacacc     1440

ctgaagaaag tccggatctt ccaagactgt gaagccggcc tcctggtgga gctcgtgctc     1500

aagctgcggc ccaccgtgtt cagcccggga gattacattt gcaagaaggg cgatatcggc     1560

aaagagatgt acatcatcaa cgagggaaag ctggccgtgg tcgcggacga cggcgtgacc     1620

cagttcgtgg tgctgtccga cggatcctac ttcggtgaaa tctcaatcct caacatcaag     1680

gggtccaagt ccggcaaccg gagaactgcc aacattcgct ccatcggata cagcgacctg     1740

ttttgcctgt ccaaggatga cctgatggag gctctgactg agtaccctga agcgaagaag     1800

gctttggagg aaaaggggcg gcagattctg atgaaggaca atttgatcga cgaggagctc     1860

gcacgggccg gcgccgaccc caaggatctc gaagagaagg tcgaacagct gggttcttcg     1920

cttgataccc tgcaaacccg attcgcgcgg ctgctcgccg agtacaacgc gacccagatg     1980

aagatgaagc agagactgtc acagttggaa tcccaagtca agggcggagg cgacaagccg     2040

ctggcggacg gggaagtgcc cggggacgcc accaagactg aggacaagca gcagtgatca     2100

tagatct                                                               2107


<210>  17
<211>  2272
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  17
gcggccgcca ccatggctaa gattaacacc cagtactcac atccatcccg cactcacctc       60

aaagtcaaga cctccgatcg ggatctgaac cgggctgaga atgggctgtc gcgcgcccac      120

tcgtcgtccg aggaaaccag cagcgtgctc cagccgggca tcgccatgga aactaggggg      180

ctggcggact ccggacaggg atccttcact ggacagggta ttgcccggtt cgggcggatt      240

cagaagaagt cccagccgga gaaggtcgtg cgggctgcca gcaggggcag gccactcatt      300

ggttggacac agtggtgcgc tgaggatggt ggagatgaat cggaaatggc actggccggc      360

tctcccggat gcagctcggg cccccaaggg agactgagca gactgatctt cctgcttcgc      420

cgctgggcgg ccagacacgt gcaccatcag gaccagggac ctgatagctt ccccgaccgc      480

tttaggggag ccgagctgaa agaagtgtca agccaggagt caaacgcgca ggccaacgtc      540

ggcagccaag agcctgcaga ccggggacgc tcggcatggc cgctcgcaaa gtgcaacact      600

aacacttcca acaacaccga agaggaaaag aaaaccaaga agaaggatgc aattgtggtg      660

gacccttcct ccaacctgta ctaccgctgg ttgaccgcca tcgccctccc ggtcttttac      720

aattggtatc tccttatctg ccgggcctgc ttcgacgaac tgcaatcaga gtacctgatg      780

ctgtggctgg tgctggacta tagcgccgat gtgctctacg tcctggatgt gctcgtgcgc      840

gcccggaccg gattcttgga acaaggcctg atggtgtccg acacgaatag actgtggcag      900

cactataaga ccacaaccca gttcaagctt gacgtgctca gccttgtgcc gactgacctg      960

gcctacctga aagtcggaac taactacccg gaagtcagat tcaaccgact cctgaagttc     1020

agcaggctgt tcgagttctt tgaccgcacc gagactcgga ccaactaccc taacatgttc     1080

cggatcggaa atctggtgct ctacatactg attatcatcc attggaacgc ctgtatctat     1140

ttcgccattt cgaagttcat cggtttcgga accgattcct gggtgtaccc caacatctcg     1200

atccccgaac acggtcgcct gtcccggaag tacatctact ccctgtactg gtccactctg     1260

actctgacca cgatcgggga aacccctcca cccgtgaagg acgaagagta cctgttcgtg     1320

gtggtggact tcctggtcgg agtgttgatt ttcgccacca ttgtgggaaa cgtgggctcc     1380

atgatctcca acatgaacgc gtcgagagct gagttccaag ccaagatcga ctccattaag     1440

cagtacatgc agttcagaaa ggtcaccaag gacctggaaa ccagggtcat ccgctggttc     1500

gactacctgt gggccaacaa aaagactgtg gacgaaaagg aagtgctgaa gtcgctgccg     1560

gataagctga aggccgaaat cgccattaac gtgcaccttg acaccctgaa gaaagtccgg     1620

atcttccaag actgtgaagc cggcctcctg gtggagctcg tgctcaagct gcggcccacc     1680

gtgttcagcc cgggagatta catttgcaag aagggcgata tcggcaaaga gatgtacatc     1740

atcaacgagg gaaagctggc cgtggtcgcg gacgacggcg tgacccagtt cgtggtgctg     1800

tccgacggat cctacttcgg tgaaatctca atcctcaaca tcaaggggtc caagtccggc     1860

aaccggagaa ctgccaacat tcgctccatc ggatacagcg acctgttttg cctgtccaag     1920

gatgacctga tggaggctct gactgagtac cctgaagcga agaaggcttt ggaggaaaag     1980

gggcggcaga ttctgatgaa ggacaatttg atcgacgagg agctcgcacg ggccggcgcc     2040

gaccccaagg atctcgaaga gaaggtcgaa cagctgggtt cttcgcttga taccctgcaa     2100

acccgattcg cgcggctgct cgccgagtac aacgcgaccc agatgaagat gaagcagaga     2160

ctgtcacagt tggaatccca agtcaagggc ggaggcgaca agccgctggc ggacggggaa     2220

gtgcccgggg acgccaccaa gactgaggac aagcagcagt gatcatagat ct             2272


<210>  18
<211>  2107
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  18
gcggccgcca ccatggccaa gatcaacacc caatactccc acccctccag gacccacctc       60

aaggtaaaga cctcagaccg ggatctcaat cgcgctgaaa atggcctcag cagagcccac      120

tcgtcaagtg aggagacatc gtcagtgctg cagccgggga tcgccatgga gaccagagga      180

ctggctgact ccgggcaggg ctccttcacc ggccagggga tcgccaggct gtcgcgcctc      240

atcttcttgc tgcgcaggtg ggctgccagg catgtgcacc accaggacca gggaccggac      300

tcttttcctg atcgtttccg tggagccgag cttaaggagg tgtccagcca agaaagcaat      360

gcccaggcaa atgtgggcag ccaggagcca gcagacagag ggagaagcgc ctggcccctg      420

gccaaatgca acactaacac cagcaacaac acggaggagg agaagaagac gaaaaagaag      480

gatgcgatcg tggtggaccc gtccagcaac ctgtactacc gctggctgac cgccatcgcc      540

ctgcctgtct tctataactg gtatctgctt atttgcaggg cctgtttcga tgagctgcag      600

tccgagtacc tgatgctgtg gctggtcctg gactactcgg cagatgtcct gtatgtcttg      660

gatgtgcttg tacgagctcg gacaggtttt cttgagcaag gcttaatggt cagtgatacc      720

aacaggctgt ggcagcatta caagacgacc acgcagttca agctggatgt gttgtccctg      780

gtccccaccg acctggctta cttaaaggtg ggcacaaact acccagaagt gaggttcaac      840

cgcctactga agttttcccg gctctttgaa ttctttgacc gcacagagac aaggaccaac      900

taccccaata tgttcaggat tgggaacttg gtcttgtaca ttctcatcat catccactgg      960

aatgcctgca tctactttgc catttccaag ttcattggtt ttgggacaga ctcctgggtc     1020

tacccaaaca tctcaatccc agagcatggg cgcctctcca ggaagtacat ttacagtctc     1080

tactggtcca ccttgaccct taccaccatt ggtgagaccc caccccccgt gaaagatgag     1140

gagtatctct ttgtggtcgt agacttcttg gtgggtgttc tgatttttgc caccattgtg     1200

ggcaatgtgg gctccatgat ctcgaatatg aatgcctcac gggcagagtt ccaggccaag     1260

attgattcca tcaagcagta catgcagttc cgcaaggtca ccaaggactt ggagacgcgg     1320

gttatccggt ggtttgacta cctgtgggcc aacaagaaga cggtggatga gaaggaggtg     1380

ctcaagagcc tcccagacaa gctgaaggct gagatcgcca tcaacgtgca cctggacacg     1440

ctgaagaagg ttcgcatctt ccaggactgt gaggcagggc tgctggtgga gctggtgctg     1500

aagctgcgac ccactgtgtt cagccctggg gattatatct gcaagaaggg agatattggg     1560

aaggagatgt acatcatcaa cgagggcaag ctggccgtgg tggctgatga tggggtcacc     1620

cagttcgtgg tcctcagcga tggcagctac ttcggggaga tcagcattct gaacatcaag     1680

gggagcaagt cggggaaccg caggacggcc aacatccgca gcattggcta ctcagacctg     1740

ttctgcctct caaaggacga tctcatggag gccctcaccg agtaccccga agccaagaag     1800

gccctggagg agaaaggacg gcagatcctg atgaaagaca acctgatcga tgaggagctg     1860

gccagggcgg gcgcggaccc caaggacctt gaggagaaag tggagcagct ggggtcctcc     1920

ctggacaccc tgcagaccag gtttgcacgc ctcctggctg agtacaacgc cacccagatg     1980

aagatgaagc agcgtctcag ccaactggaa agccaggtga agggtggtgg ggacaagccc     2040

ctggctgatg gggaagttcc cggggatgct acaaaaacag aggacaaaca acagtgatca     2100

tagatct                                                               2107


<210>  19
<211>  2430
<212>  DNA
<213>  Homo sapiens


<220>
<221>  CDS
<222>  (1)..(2430)

<400>  19
atg ttt aaa tcg ctg aca aaa gtc aac aag gtg aag cct ata gga gag         48
Met Phe Lys Ser Leu Thr Lys Val Asn Lys Val Lys Pro Ile Gly Glu           
1               5                   10                  15                

aac aat gag aat gaa caa agt tct cgt cgg aat gaa gaa ggc tct cac         96
Asn Asn Glu Asn Glu Gln Ser Ser Arg Arg Asn Glu Glu Gly Ser His           
            20                  25                  30                    

cca agt aat cag tct cag caa acc aca gca cag gaa gaa aac aaa ggt        144
Pro Ser Asn Gln Ser Gln Gln Thr Thr Ala Gln Glu Glu Asn Lys Gly           
        35                  40                  45                        

gaa gag aaa tct ctc aaa acc aag tca act cca gtc acg tct gaa gag        192
Glu Glu Lys Ser Leu Lys Thr Lys Ser Thr Pro Val Thr Ser Glu Glu           
    50                  55                  60                            

cca cac acc aac ata caa gac aaa ctc tcc aag aaa aat tcc tct gga        240
Pro His Thr Asn Ile Gln Asp Lys Leu Ser Lys Lys Asn Ser Ser Gly           
65                  70                  75                  80            

gat ctg acc aca aac cct gac cct caa aat gca gca gaa cca act gga        288
Asp Leu Thr Thr Asn Pro Asp Pro Gln Asn Ala Ala Glu Pro Thr Gly           
                85                  90                  95                

aca gtg cca gag cag aag gaa atg gac ccc ggg aaa gaa ggt cca aac        336
Thr Val Pro Glu Gln Lys Glu Met Asp Pro Gly Lys Glu Gly Pro Asn           
            100                 105                 110                   

agc cca caa aac aaa ccg cct gca gct cct gtt ata aat gag tat gcc        384
Ser Pro Gln Asn Lys Pro Pro Ala Ala Pro Val Ile Asn Glu Tyr Ala           
        115                 120                 125                       

gat gcc cag cta cac aac ctg gtg aaa aga atg cgt caa aga aca gcc        432
Asp Ala Gln Leu His Asn Leu Val Lys Arg Met Arg Gln Arg Thr Ala           
    130                 135                 140                           

ctc tac aag aaa aag ttg gta gag gga gat ctc tcc tca ccc gaa gcc        480
Leu Tyr Lys Lys Lys Leu Val Glu Gly Asp Leu Ser Ser Pro Glu Ala           
145                 150                 155                 160           

agc cca caa act gca aag ccc acg gct gta cca cca gta aaa gaa agc        528
Ser Pro Gln Thr Ala Lys Pro Thr Ala Val Pro Pro Val Lys Glu Ser           
                165                 170                 175               

gat gat aag cca aca gaa cat tac tac agg ctg ttg tgg ttc aaa gtc        576
Asp Asp Lys Pro Thr Glu His Tyr Tyr Arg Leu Leu Trp Phe Lys Val           
            180                 185                 190                   

aaa aag atg cct tta aca gag tac tta aag cga att aaa ctt cca aac        624
Lys Lys Met Pro Leu Thr Glu Tyr Leu Lys Arg Ile Lys Leu Pro Asn           
        195                 200                 205                       

agc ata gat tca tac aca gat cga ctc tat ctc ctg tgg ctc ttg ctt        672
Ser Ile Asp Ser Tyr Thr Asp Arg Leu Tyr Leu Leu Trp Leu Leu Leu           
    210                 215                 220                           

gtc act ctt gcc tat aac tgg aac tgc tgt ttt ata cca ctg cgc ctc        720
Val Thr Leu Ala Tyr Asn Trp Asn Cys Cys Phe Ile Pro Leu Arg Leu           
225                 230                 235                 240           

gtc ttc cca tat caa acc gca gac aac ata cac tac tgg ctt att gcg        768
Val Phe Pro Tyr Gln Thr Ala Asp Asn Ile His Tyr Trp Leu Ile Ala           
                245                 250                 255               

gac atc ata tgt gat atc atc tac ctt tat gat atg cta ttt atc cag        816
Asp Ile Ile Cys Asp Ile Ile Tyr Leu Tyr Asp Met Leu Phe Ile Gln           
            260                 265                 270                   

ccc aga ctc cag ttt gta aga gga gga gac ata ata gtg gat tca aat        864
Pro Arg Leu Gln Phe Val Arg Gly Gly Asp Ile Ile Val Asp Ser Asn           
        275                 280                 285                       

gag cta agg aaa cac tac agg act tct aca aaa ttt cag ttg gat gtc        912
Glu Leu Arg Lys His Tyr Arg Thr Ser Thr Lys Phe Gln Leu Asp Val           
    290                 295                 300                           

gca tca ata ata cca ttt gat att tgc tac ctc ttc ttt ggg ttt aat        960
Ala Ser Ile Ile Pro Phe Asp Ile Cys Tyr Leu Phe Phe Gly Phe Asn           
305                 310                 315                 320           

cca atg ttt aga gca aat agg atg tta aag tac act tca ttt ttt gaa       1008
Pro Met Phe Arg Ala Asn Arg Met Leu Lys Tyr Thr Ser Phe Phe Glu           
                325                 330                 335               

ttt aat cat cac cta gag tct ata atg gac aaa gca tat atc tac aga       1056
Phe Asn His His Leu Glu Ser Ile Met Asp Lys Ala Tyr Ile Tyr Arg           
            340                 345                 350                   

gtt att cga aca act gga tac ttg ctg ttt att ctg cac att aat gcc       1104
Val Ile Arg Thr Thr Gly Tyr Leu Leu Phe Ile Leu His Ile Asn Ala           
        355                 360                 365                       

tgt gtt tat tac tgg gct tca aac tat gaa gga att ggc act act aga       1152
Cys Val Tyr Tyr Trp Ala Ser Asn Tyr Glu Gly Ile Gly Thr Thr Arg           
    370                 375                 380                           

tgg gtg tat gat ggg gaa gga aac gag tat ctg aga tgt tat tat tgg       1200
Trp Val Tyr Asp Gly Glu Gly Asn Glu Tyr Leu Arg Cys Tyr Tyr Trp           
385                 390                 395                 400           

gca gtt cga act tta att acc att ggt ggc ctt cca gaa cca caa act       1248
Ala Val Arg Thr Leu Ile Thr Ile Gly Gly Leu Pro Glu Pro Gln Thr           
                405                 410                 415               

tta ttt gaa att gtt ttt caa ctc ttg aat ttt ttt tct gga gtt ttt       1296
Leu Phe Glu Ile Val Phe Gln Leu Leu Asn Phe Phe Ser Gly Val Phe           
            420                 425                 430                   

gtg ttc tcc agt tta att ggt cag atg aga gat gtg att gga gca gct       1344
Val Phe Ser Ser Leu Ile Gly Gln Met Arg Asp Val Ile Gly Ala Ala           
        435                 440                 445                       

aca gcc aat cag aac tac ttc cgc gcc tgc atg gat gac acc att gcc       1392
Thr Ala Asn Gln Asn Tyr Phe Arg Ala Cys Met Asp Asp Thr Ile Ala           
    450                 455                 460                           

tac atg aac aat tac tcc att cct aaa ctt gtg caa aag cga gtt cgg       1440
Tyr Met Asn Asn Tyr Ser Ile Pro Lys Leu Val Gln Lys Arg Val Arg           
465                 470                 475                 480           

act tgg tat gaa tat aca tgg gac tct caa aga atg cta gat gag tct       1488
Thr Trp Tyr Glu Tyr Thr Trp Asp Ser Gln Arg Met Leu Asp Glu Ser           
                485                 490                 495               

gat ttg ctt aag acc cta cca act acg gtc cag tta gcc ctc gcc att       1536
Asp Leu Leu Lys Thr Leu Pro Thr Thr Val Gln Leu Ala Leu Ala Ile           
            500                 505                 510                   

gat gtg aac ttc agc atc atc agc aaa gtc gac ttg ttc aag ggt tgt       1584
Asp Val Asn Phe Ser Ile Ile Ser Lys Val Asp Leu Phe Lys Gly Cys           
        515                 520                 525                       

gat aca cag atg att tat gac atg ttg cta aga ttg aaa tcc gtt ctc       1632
Asp Thr Gln Met Ile Tyr Asp Met Leu Leu Arg Leu Lys Ser Val Leu           
    530                 535                 540                           

tat ttg cct ggt gac ttt gtc tgc aaa aag gga gaa att ggc aag gaa       1680
Tyr Leu Pro Gly Asp Phe Val Cys Lys Lys Gly Glu Ile Gly Lys Glu           
545                 550                 555                 560           

atg tat atc atc aag cat gga gaa gtc caa gtt ctt gga ggc cct gat       1728
Met Tyr Ile Ile Lys His Gly Glu Val Gln Val Leu Gly Gly Pro Asp           
                565                 570                 575               

ggt act aaa gtt ctg gtt act ctg aaa gct ggg tcg gtg ttt gga gaa       1776
Gly Thr Lys Val Leu Val Thr Leu Lys Ala Gly Ser Val Phe Gly Glu           
            580                 585                 590                   

atc agc ctt cta gca gca gga gga gga aac cgt cga act gcc aat gtg       1824
Ile Ser Leu Leu Ala Ala Gly Gly Gly Asn Arg Arg Thr Ala Asn Val           
        595                 600                 605                       

gtg gcc cac ggg ttt gcc aat ctt tta act cta gac aaa aag acc ctc       1872
Val Ala His Gly Phe Ala Asn Leu Leu Thr Leu Asp Lys Lys Thr Leu           
    610                 615                 620                           

caa gaa att cta gtg cat tat cca gat tct gaa agg atc ctc atg aag       1920
Gln Glu Ile Leu Val His Tyr Pro Asp Ser Glu Arg Ile Leu Met Lys           
625                 630                 635                 640           

aaa gcc aga gtg ctt tta aag cag aag gct aag acc gca gaa gca acc       1968
Lys Ala Arg Val Leu Leu Lys Gln Lys Ala Lys Thr Ala Glu Ala Thr           
                645                 650                 655               

cct cca aga aaa gat ctt gcc ctc ctc ttc cca ccg aaa gaa gag aca       2016
Pro Pro Arg Lys Asp Leu Ala Leu Leu Phe Pro Pro Lys Glu Glu Thr           
            660                 665                 670                   

ccc aaa ctg ttt aaa act ctc cta gga ggc aca gga aaa gca agt ctt       2064
Pro Lys Leu Phe Lys Thr Leu Leu Gly Gly Thr Gly Lys Ala Ser Leu           
        675                 680                 685                       

gca aga cta ctc aaa ttg aag cga gag caa gca gct cag aag aaa gaa       2112
Ala Arg Leu Leu Lys Leu Lys Arg Glu Gln Ala Ala Gln Lys Lys Glu           
    690                 695                 700                           

aat tct gaa gga gga gag gaa gaa gga aaa gaa aat gaa gat aaa caa       2160
Asn Ser Glu Gly Gly Glu Glu Glu Gly Lys Glu Asn Glu Asp Lys Gln           
705                 710                 715                 720           

aaa gaa aat gaa gat aaa caa aaa gaa aat gaa gat aaa gga aaa gaa       2208
Lys Glu Asn Glu Asp Lys Gln Lys Glu Asn Glu Asp Lys Gly Lys Glu           
                725                 730                 735               

aat gaa gat aaa gat aaa gga aga gag cca gaa gag aag cca ctg gac       2256
Asn Glu Asp Lys Asp Lys Gly Arg Glu Pro Glu Glu Lys Pro Leu Asp           
            740                 745                 750                   

aga cct gaa tgt aca gca agt cct att gca gtg gag gaa gaa ccc cac       2304
Arg Pro Glu Cys Thr Ala Ser Pro Ile Ala Val Glu Glu Glu Pro His           
        755                 760                 765                       

tca gtt aga agg aca gtt tta ccc aga ggg act tct cgt caa tca ctc       2352
Ser Val Arg Arg Thr Val Leu Pro Arg Gly Thr Ser Arg Gln Ser Leu           
    770                 775                 780                           

att atc agc atg gct cct tct gct gag ggc gga gaa gag gtt ctt act       2400
Ile Ile Ser Met Ala Pro Ser Ala Glu Gly Gly Glu Glu Val Leu Thr           
785                 790                 795                 800           

att gaa gtc aaa gaa aag gct aag caa taa                               2430
Ile Glu Val Lys Glu Lys Ala Lys Gln                                       
                805                                                       


<210>  20
<211>  809
<212>  PRT
<213>  Homo sapiens

<400>  20

Met Phe Lys Ser Leu Thr Lys Val Asn Lys Val Lys Pro Ile Gly Glu 
1               5                   10                  15      


Asn Asn Glu Asn Glu Gln Ser Ser Arg Arg Asn Glu Glu Gly Ser His 
            20                  25                  30          


Pro Ser Asn Gln Ser Gln Gln Thr Thr Ala Gln Glu Glu Asn Lys Gly 
        35                  40                  45              


Glu Glu Lys Ser Leu Lys Thr Lys Ser Thr Pro Val Thr Ser Glu Glu 
    50                  55                  60                  


Pro His Thr Asn Ile Gln Asp Lys Leu Ser Lys Lys Asn Ser Ser Gly 
65                  70                  75                  80  


Asp Leu Thr Thr Asn Pro Asp Pro Gln Asn Ala Ala Glu Pro Thr Gly 
                85                  90                  95      


Thr Val Pro Glu Gln Lys Glu Met Asp Pro Gly Lys Glu Gly Pro Asn 
            100                 105                 110         


Ser Pro Gln Asn Lys Pro Pro Ala Ala Pro Val Ile Asn Glu Tyr Ala 
        115                 120                 125             


Asp Ala Gln Leu His Asn Leu Val Lys Arg Met Arg Gln Arg Thr Ala 
    130                 135                 140                 


Leu Tyr Lys Lys Lys Leu Val Glu Gly Asp Leu Ser Ser Pro Glu Ala 
145                 150                 155                 160 


Ser Pro Gln Thr Ala Lys Pro Thr Ala Val Pro Pro Val Lys Glu Ser 
                165                 170                 175     


Asp Asp Lys Pro Thr Glu His Tyr Tyr Arg Leu Leu Trp Phe Lys Val 
            180                 185                 190         


Lys Lys Met Pro Leu Thr Glu Tyr Leu Lys Arg Ile Lys Leu Pro Asn 
        195                 200                 205             


Ser Ile Asp Ser Tyr Thr Asp Arg Leu Tyr Leu Leu Trp Leu Leu Leu 
    210                 215                 220                 


Val Thr Leu Ala Tyr Asn Trp Asn Cys Cys Phe Ile Pro Leu Arg Leu 
225                 230                 235                 240 


Val Phe Pro Tyr Gln Thr Ala Asp Asn Ile His Tyr Trp Leu Ile Ala 
                245                 250                 255     


Asp Ile Ile Cys Asp Ile Ile Tyr Leu Tyr Asp Met Leu Phe Ile Gln 
            260                 265                 270         


Pro Arg Leu Gln Phe Val Arg Gly Gly Asp Ile Ile Val Asp Ser Asn 
        275                 280                 285             


Glu Leu Arg Lys His Tyr Arg Thr Ser Thr Lys Phe Gln Leu Asp Val 
    290                 295                 300                 


Ala Ser Ile Ile Pro Phe Asp Ile Cys Tyr Leu Phe Phe Gly Phe Asn 
305                 310                 315                 320 


Pro Met Phe Arg Ala Asn Arg Met Leu Lys Tyr Thr Ser Phe Phe Glu 
                325                 330                 335     


Phe Asn His His Leu Glu Ser Ile Met Asp Lys Ala Tyr Ile Tyr Arg 
            340                 345                 350         


Val Ile Arg Thr Thr Gly Tyr Leu Leu Phe Ile Leu His Ile Asn Ala 
        355                 360                 365             


Cys Val Tyr Tyr Trp Ala Ser Asn Tyr Glu Gly Ile Gly Thr Thr Arg 
    370                 375                 380                 


Trp Val Tyr Asp Gly Glu Gly Asn Glu Tyr Leu Arg Cys Tyr Tyr Trp 
385                 390                 395                 400 


Ala Val Arg Thr Leu Ile Thr Ile Gly Gly Leu Pro Glu Pro Gln Thr 
                405                 410                 415     


Leu Phe Glu Ile Val Phe Gln Leu Leu Asn Phe Phe Ser Gly Val Phe 
            420                 425                 430         


Val Phe Ser Ser Leu Ile Gly Gln Met Arg Asp Val Ile Gly Ala Ala 
        435                 440                 445             


Thr Ala Asn Gln Asn Tyr Phe Arg Ala Cys Met Asp Asp Thr Ile Ala 
    450                 455                 460                 


Tyr Met Asn Asn Tyr Ser Ile Pro Lys Leu Val Gln Lys Arg Val Arg 
465                 470                 475                 480 


Thr Trp Tyr Glu Tyr Thr Trp Asp Ser Gln Arg Met Leu Asp Glu Ser 
                485                 490                 495     


Asp Leu Leu Lys Thr Leu Pro Thr Thr Val Gln Leu Ala Leu Ala Ile 
            500                 505                 510         


Asp Val Asn Phe Ser Ile Ile Ser Lys Val Asp Leu Phe Lys Gly Cys 
        515                 520                 525             


Asp Thr Gln Met Ile Tyr Asp Met Leu Leu Arg Leu Lys Ser Val Leu 
    530                 535                 540                 


Tyr Leu Pro Gly Asp Phe Val Cys Lys Lys Gly Glu Ile Gly Lys Glu 
545                 550                 555                 560 


Met Tyr Ile Ile Lys His Gly Glu Val Gln Val Leu Gly Gly Pro Asp 
                565                 570                 575     


Gly Thr Lys Val Leu Val Thr Leu Lys Ala Gly Ser Val Phe Gly Glu 
            580                 585                 590         


Ile Ser Leu Leu Ala Ala Gly Gly Gly Asn Arg Arg Thr Ala Asn Val 
        595                 600                 605             


Val Ala His Gly Phe Ala Asn Leu Leu Thr Leu Asp Lys Lys Thr Leu 
    610                 615                 620                 


Gln Glu Ile Leu Val His Tyr Pro Asp Ser Glu Arg Ile Leu Met Lys 
625                 630                 635                 640 


Lys Ala Arg Val Leu Leu Lys Gln Lys Ala Lys Thr Ala Glu Ala Thr 
                645                 650                 655     


Pro Pro Arg Lys Asp Leu Ala Leu Leu Phe Pro Pro Lys Glu Glu Thr 
            660                 665                 670         


Pro Lys Leu Phe Lys Thr Leu Leu Gly Gly Thr Gly Lys Ala Ser Leu 
        675                 680                 685             


Ala Arg Leu Leu Lys Leu Lys Arg Glu Gln Ala Ala Gln Lys Lys Glu 
    690                 695                 700                 


Asn Ser Glu Gly Gly Glu Glu Glu Gly Lys Glu Asn Glu Asp Lys Gln 
705                 710                 715                 720 


Lys Glu Asn Glu Asp Lys Gln Lys Glu Asn Glu Asp Lys Gly Lys Glu 
                725                 730                 735     


Asn Glu Asp Lys Asp Lys Gly Arg Glu Pro Glu Glu Lys Pro Leu Asp 
            740                 745                 750         


Arg Pro Glu Cys Thr Ala Ser Pro Ile Ala Val Glu Glu Glu Pro His 
        755                 760                 765             


Ser Val Arg Arg Thr Val Leu Pro Arg Gly Thr Ser Arg Gln Ser Leu 
    770                 775                 780                 


Ile Ile Ser Met Ala Pro Ser Ala Glu Gly Gly Glu Glu Val Leu Thr 
785                 790                 795                 800 


Ile Glu Val Lys Glu Lys Ala Lys Gln 
                805                 


<210>  21
<211>  2430
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence


<220>
<221>  CDS
<222>  (1)..(2430)

<400>  21
atg ttt aaa tcg ctg aca aaa gtc aac aag gtg aag cct ata gga gag         48
Met Phe Lys Ser Leu Thr Lys Val Asn Lys Val Lys Pro Ile Gly Glu           
1               5                   10                  15                

aac aat gag aat gaa caa agt tct cgt cgg aat gaa gaa ggc tct cac         96
Asn Asn Glu Asn Glu Gln Ser Ser Arg Arg Asn Glu Glu Gly Ser His           
            20                  25                  30                    

cca agt aat cag tct cag caa acc aca gca cag gaa gaa aac aaa ggt        144
Pro Ser Asn Gln Ser Gln Gln Thr Thr Ala Gln Glu Glu Asn Lys Gly           
        35                  40                  45                        

gaa gag aaa tct ctc aaa acc aag tca act cca gtc acg tct gaa gag        192
Glu Glu Lys Ser Leu Lys Thr Lys Ser Thr Pro Val Thr Ser Glu Glu           
    50                  55                  60                            

cca cac acc aac ata caa gac aaa ctc tcc aag aaa aat tcc tct gga        240
Pro His Thr Asn Ile Gln Asp Lys Leu Ser Lys Lys Asn Ser Ser Gly           
65                  70                  75                  80            

gat ctg acc aca aac cct gac cct caa aat gca gca gaa cca act gga        288
Asp Leu Thr Thr Asn Pro Asp Pro Gln Asn Ala Ala Glu Pro Thr Gly           
                85                  90                  95                

aca gtg cca gag cag aag gaa atg gac ccc ggg aaa gaa ggt cca aac        336
Thr Val Pro Glu Gln Lys Glu Met Asp Pro Gly Lys Glu Gly Pro Asn           
            100                 105                 110                   

agc cca caa aac aaa ccg cca gca gct cct gtt ata aat gag tat gcc        384
Ser Pro Gln Asn Lys Pro Pro Ala Ala Pro Val Ile Asn Glu Tyr Ala           
        115                 120                 125                       

gat gcc cag cta cac aac ctg gtg aaa aga atg cgt caa aga aca gcc        432
Asp Ala Gln Leu His Asn Leu Val Lys Arg Met Arg Gln Arg Thr Ala           
    130                 135                 140                           

ctc tac aag aaa aag ttg gta gag gga gat ctc tcc tca ccc gaa gcc        480
Leu Tyr Lys Lys Lys Leu Val Glu Gly Asp Leu Ser Ser Pro Glu Ala           
145                 150                 155                 160           

agc cca caa act gca aag ccc acg gct gta cca cca gta aaa gaa agc        528
Ser Pro Gln Thr Ala Lys Pro Thr Ala Val Pro Pro Val Lys Glu Ser           
                165                 170                 175               

gat gat aag cca aca gaa cat tac tac agg ctg ttg tgg ttc aaa gtc        576
Asp Asp Lys Pro Thr Glu His Tyr Tyr Arg Leu Leu Trp Phe Lys Val           
            180                 185                 190                   

aaa aag atg cct tta aca gag tac tta aag cga att aaa ctt cca aac        624
Lys Lys Met Pro Leu Thr Glu Tyr Leu Lys Arg Ile Lys Leu Pro Asn           
        195                 200                 205                       

agc ata gat tca tac aca gat cga ctc tat ctc ctg tgg ctc ttg ctt        672
Ser Ile Asp Ser Tyr Thr Asp Arg Leu Tyr Leu Leu Trp Leu Leu Leu           
    210                 215                 220                           

gtc act ctt gcc tat aac tgg aac tgc tgt ttt ata cca ctg cgc ctc        720
Val Thr Leu Ala Tyr Asn Trp Asn Cys Cys Phe Ile Pro Leu Arg Leu           
225                 230                 235                 240           

gtc ttc cca tat caa acc gca gac aac ata cac tac tgg ctt att gcg        768
Val Phe Pro Tyr Gln Thr Ala Asp Asn Ile His Tyr Trp Leu Ile Ala           
                245                 250                 255               

gac atc atc tgt gat atc atc tac ctt tat gat atg cta ttt atc cag        816
Asp Ile Ile Cys Asp Ile Ile Tyr Leu Tyr Asp Met Leu Phe Ile Gln           
            260                 265                 270                   

ccc aga ctc cag ttt gta aga gga gga gac ata ata gtg gat tca aat        864
Pro Arg Leu Gln Phe Val Arg Gly Gly Asp Ile Ile Val Asp Ser Asn           
        275                 280                 285                       

gag cta agg aaa cac tac agg act tct aca aaa ttt cag ttg gat gtc        912
Glu Leu Arg Lys His Tyr Arg Thr Ser Thr Lys Phe Gln Leu Asp Val           
    290                 295                 300                           

gca tca ata ata cca ttt gat att tgc tac ctc ttc ttt ggg ttt aat        960
Ala Ser Ile Ile Pro Phe Asp Ile Cys Tyr Leu Phe Phe Gly Phe Asn           
305                 310                 315                 320           

cca atg ttt aga gca aat agg atg tta aag tac act tca ttt ttt gaa       1008
Pro Met Phe Arg Ala Asn Arg Met Leu Lys Tyr Thr Ser Phe Phe Glu           
                325                 330                 335               

ttt aat cat cac cta gag tct ata atg gac aaa gca tat atc tac aga       1056
Phe Asn His His Leu Glu Ser Ile Met Asp Lys Ala Tyr Ile Tyr Arg           
            340                 345                 350                   

gtt att cga aca act gga tac ttg ctg ttt att ctg cac att aat gcc       1104
Val Ile Arg Thr Thr Gly Tyr Leu Leu Phe Ile Leu His Ile Asn Ala           
        355                 360                 365                       

tgt gtt tat tac tgg gct tca aac tat gaa gga att ggc act act aga       1152
Cys Val Tyr Tyr Trp Ala Ser Asn Tyr Glu Gly Ile Gly Thr Thr Arg           
    370                 375                 380                           

tgg gtg tat gat ggg gaa gga aac gag tat ctg aga tgt tat tat tgg       1200
Trp Val Tyr Asp Gly Glu Gly Asn Glu Tyr Leu Arg Cys Tyr Tyr Trp           
385                 390                 395                 400           

gca gtt cga act tta att acc att ggt ggc ctt cca gaa cca caa act       1248
Ala Val Arg Thr Leu Ile Thr Ile Gly Gly Leu Pro Glu Pro Gln Thr           
                405                 410                 415               

tta ttt gaa att gtt ttt caa ctc ttg aat ttt ttt tct gga gtt ttt       1296
Leu Phe Glu Ile Val Phe Gln Leu Leu Asn Phe Phe Ser Gly Val Phe           
            420                 425                 430                   

gtg ttc tcc agt tta att ggt cag atg aga gat gtg att gga gca gct       1344
Val Phe Ser Ser Leu Ile Gly Gln Met Arg Asp Val Ile Gly Ala Ala           
        435                 440                 445                       

aca gcc aat cag aac tac ttc cgc gcc tgc atg gat gac acc att gcc       1392
Thr Ala Asn Gln Asn Tyr Phe Arg Ala Cys Met Asp Asp Thr Ile Ala           
    450                 455                 460                           

tac atg aac aat tac tcc att cct aaa ctt gtg caa aag cga gtt cgg       1440
Tyr Met Asn Asn Tyr Ser Ile Pro Lys Leu Val Gln Lys Arg Val Arg           
465                 470                 475                 480           

act tgg tat gaa tat aca tgg gac tct caa aga atg cta gat gag tct       1488
Thr Trp Tyr Glu Tyr Thr Trp Asp Ser Gln Arg Met Leu Asp Glu Ser           
                485                 490                 495               

gat ttg ctt aag acc cta cca act acg gtc cag tta gcc ctc gcc att       1536
Asp Leu Leu Lys Thr Leu Pro Thr Thr Val Gln Leu Ala Leu Ala Ile           
            500                 505                 510                   

gat gtg aac ttc agc atc atc agc aaa gtt gac ttg ttc aag ggt tgt       1584
Asp Val Asn Phe Ser Ile Ile Ser Lys Val Asp Leu Phe Lys Gly Cys           
        515                 520                 525                       

gat aca cag atg att tat gac atg ttg cta aga ttg aaa tcc gtt ctc       1632
Asp Thr Gln Met Ile Tyr Asp Met Leu Leu Arg Leu Lys Ser Val Leu           
    530                 535                 540                           

tat ttg cct ggt gac ttt gtc tgc aaa aag gga gaa att ggc aag gaa       1680
Tyr Leu Pro Gly Asp Phe Val Cys Lys Lys Gly Glu Ile Gly Lys Glu           
545                 550                 555                 560           

atg tat atc atc aag cat gga gaa gtc caa gtt ctt gga ggc cct gat       1728
Met Tyr Ile Ile Lys His Gly Glu Val Gln Val Leu Gly Gly Pro Asp           
                565                 570                 575               

ggt act aaa gtt ctg gtt act ctg aaa gct ggg tcg gtg ttt gga gaa       1776
Gly Thr Lys Val Leu Val Thr Leu Lys Ala Gly Ser Val Phe Gly Glu           
            580                 585                 590                   

atc agc ctt cta gca gca gga gga gga aac cgt cga act gcc aat gtg       1824
Ile Ser Leu Leu Ala Ala Gly Gly Gly Asn Arg Arg Thr Ala Asn Val           
        595                 600                 605                       

gtg gcc cac ggg ttt gcc aat ctt tta act cta gac aaa aag acc ctc       1872
Val Ala His Gly Phe Ala Asn Leu Leu Thr Leu Asp Lys Lys Thr Leu           
    610                 615                 620                           

caa gaa att cta gtg cat tat cca gat tct gaa aga atc ctc atg aag       1920
Gln Glu Ile Leu Val His Tyr Pro Asp Ser Glu Arg Ile Leu Met Lys           
625                 630                 635                 640           

aaa gcc aga gtg ctt tta aag cag aag gct aag acc gca gaa gca acc       1968
Lys Ala Arg Val Leu Leu Lys Gln Lys Ala Lys Thr Ala Glu Ala Thr           
                645                 650                 655               

cct cca aga aaa gat ctt gcc ctc ctc ttc cca ccg aaa gaa gag aca       2016
Pro Pro Arg Lys Asp Leu Ala Leu Leu Phe Pro Pro Lys Glu Glu Thr           
            660                 665                 670                   

ccc aaa ctg ttt aaa act ctc cta gga ggc aca gga aaa gca agt ctt       2064
Pro Lys Leu Phe Lys Thr Leu Leu Gly Gly Thr Gly Lys Ala Ser Leu           
        675                 680                 685                       

gca aga cta ctc aaa ttg aag cga gag caa gca gct cag aag aaa gaa       2112
Ala Arg Leu Leu Lys Leu Lys Arg Glu Gln Ala Ala Gln Lys Lys Glu           
    690                 695                 700                           

aat tct gaa gga gga gag gaa gaa gga aaa gaa aat gaa gat aaa caa       2160
Asn Ser Glu Gly Gly Glu Glu Glu Gly Lys Glu Asn Glu Asp Lys Gln           
705                 710                 715                 720           

aaa gaa aat gaa gat aaa caa aaa gaa aat gaa gat aaa gga aaa gaa       2208
Lys Glu Asn Glu Asp Lys Gln Lys Glu Asn Glu Asp Lys Gly Lys Glu           
                725                 730                 735               

aat gaa gat aaa gat aaa gga aga gag cca gaa gag aag cca ctg gac       2256
Asn Glu Asp Lys Asp Lys Gly Arg Glu Pro Glu Glu Lys Pro Leu Asp           
            740                 745                 750                   

aga cct gaa tgt aca gca agt cct att gca gtg gag gaa gaa ccc cac       2304
Arg Pro Glu Cys Thr Ala Ser Pro Ile Ala Val Glu Glu Glu Pro His           
        755                 760                 765                       

tca gtt aga agg aca gtt tta ccc aga ggg act tct cgt caa tca ctc       2352
Ser Val Arg Arg Thr Val Leu Pro Arg Gly Thr Ser Arg Gln Ser Leu           
    770                 775                 780                           

att atc agc atg gct cct tct gct gag ggc gga gaa gag gtt ctt act       2400
Ile Ile Ser Met Ala Pro Ser Ala Glu Gly Gly Glu Glu Val Leu Thr           
785                 790                 795                 800           

att gaa gtc aaa gaa aag gct aag caa tga                               2430
Ile Glu Val Lys Glu Lys Ala Lys Gln                                       
                805                                                       


<210>  22
<211>  809
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  22

Met Phe Lys Ser Leu Thr Lys Val Asn Lys Val Lys Pro Ile Gly Glu 
1               5                   10                  15      


Asn Asn Glu Asn Glu Gln Ser Ser Arg Arg Asn Glu Glu Gly Ser His 
            20                  25                  30          


Pro Ser Asn Gln Ser Gln Gln Thr Thr Ala Gln Glu Glu Asn Lys Gly 
        35                  40                  45              


Glu Glu Lys Ser Leu Lys Thr Lys Ser Thr Pro Val Thr Ser Glu Glu 
    50                  55                  60                  


Pro His Thr Asn Ile Gln Asp Lys Leu Ser Lys Lys Asn Ser Ser Gly 
65                  70                  75                  80  


Asp Leu Thr Thr Asn Pro Asp Pro Gln Asn Ala Ala Glu Pro Thr Gly 
                85                  90                  95      


Thr Val Pro Glu Gln Lys Glu Met Asp Pro Gly Lys Glu Gly Pro Asn 
            100                 105                 110         


Ser Pro Gln Asn Lys Pro Pro Ala Ala Pro Val Ile Asn Glu Tyr Ala 
        115                 120                 125             


Asp Ala Gln Leu His Asn Leu Val Lys Arg Met Arg Gln Arg Thr Ala 
    130                 135                 140                 


Leu Tyr Lys Lys Lys Leu Val Glu Gly Asp Leu Ser Ser Pro Glu Ala 
145                 150                 155                 160 


Ser Pro Gln Thr Ala Lys Pro Thr Ala Val Pro Pro Val Lys Glu Ser 
                165                 170                 175     


Asp Asp Lys Pro Thr Glu His Tyr Tyr Arg Leu Leu Trp Phe Lys Val 
            180                 185                 190         


Lys Lys Met Pro Leu Thr Glu Tyr Leu Lys Arg Ile Lys Leu Pro Asn 
        195                 200                 205             


Ser Ile Asp Ser Tyr Thr Asp Arg Leu Tyr Leu Leu Trp Leu Leu Leu 
    210                 215                 220                 


Val Thr Leu Ala Tyr Asn Trp Asn Cys Cys Phe Ile Pro Leu Arg Leu 
225                 230                 235                 240 


Val Phe Pro Tyr Gln Thr Ala Asp Asn Ile His Tyr Trp Leu Ile Ala 
                245                 250                 255     


Asp Ile Ile Cys Asp Ile Ile Tyr Leu Tyr Asp Met Leu Phe Ile Gln 
            260                 265                 270         


Pro Arg Leu Gln Phe Val Arg Gly Gly Asp Ile Ile Val Asp Ser Asn 
        275                 280                 285             


Glu Leu Arg Lys His Tyr Arg Thr Ser Thr Lys Phe Gln Leu Asp Val 
    290                 295                 300                 


Ala Ser Ile Ile Pro Phe Asp Ile Cys Tyr Leu Phe Phe Gly Phe Asn 
305                 310                 315                 320 


Pro Met Phe Arg Ala Asn Arg Met Leu Lys Tyr Thr Ser Phe Phe Glu 
                325                 330                 335     


Phe Asn His His Leu Glu Ser Ile Met Asp Lys Ala Tyr Ile Tyr Arg 
            340                 345                 350         


Val Ile Arg Thr Thr Gly Tyr Leu Leu Phe Ile Leu His Ile Asn Ala 
        355                 360                 365             


Cys Val Tyr Tyr Trp Ala Ser Asn Tyr Glu Gly Ile Gly Thr Thr Arg 
    370                 375                 380                 


Trp Val Tyr Asp Gly Glu Gly Asn Glu Tyr Leu Arg Cys Tyr Tyr Trp 
385                 390                 395                 400 


Ala Val Arg Thr Leu Ile Thr Ile Gly Gly Leu Pro Glu Pro Gln Thr 
                405                 410                 415     


Leu Phe Glu Ile Val Phe Gln Leu Leu Asn Phe Phe Ser Gly Val Phe 
            420                 425                 430         


Val Phe Ser Ser Leu Ile Gly Gln Met Arg Asp Val Ile Gly Ala Ala 
        435                 440                 445             


Thr Ala Asn Gln Asn Tyr Phe Arg Ala Cys Met Asp Asp Thr Ile Ala 
    450                 455                 460                 


Tyr Met Asn Asn Tyr Ser Ile Pro Lys Leu Val Gln Lys Arg Val Arg 
465                 470                 475                 480 


Thr Trp Tyr Glu Tyr Thr Trp Asp Ser Gln Arg Met Leu Asp Glu Ser 
                485                 490                 495     


Asp Leu Leu Lys Thr Leu Pro Thr Thr Val Gln Leu Ala Leu Ala Ile 
            500                 505                 510         


Asp Val Asn Phe Ser Ile Ile Ser Lys Val Asp Leu Phe Lys Gly Cys 
        515                 520                 525             


Asp Thr Gln Met Ile Tyr Asp Met Leu Leu Arg Leu Lys Ser Val Leu 
    530                 535                 540                 


Tyr Leu Pro Gly Asp Phe Val Cys Lys Lys Gly Glu Ile Gly Lys Glu 
545                 550                 555                 560 


Met Tyr Ile Ile Lys His Gly Glu Val Gln Val Leu Gly Gly Pro Asp 
                565                 570                 575     


Gly Thr Lys Val Leu Val Thr Leu Lys Ala Gly Ser Val Phe Gly Glu 
            580                 585                 590         


Ile Ser Leu Leu Ala Ala Gly Gly Gly Asn Arg Arg Thr Ala Asn Val 
        595                 600                 605             


Val Ala His Gly Phe Ala Asn Leu Leu Thr Leu Asp Lys Lys Thr Leu 
    610                 615                 620                 


Gln Glu Ile Leu Val His Tyr Pro Asp Ser Glu Arg Ile Leu Met Lys 
625                 630                 635                 640 


Lys Ala Arg Val Leu Leu Lys Gln Lys Ala Lys Thr Ala Glu Ala Thr 
                645                 650                 655     


Pro Pro Arg Lys Asp Leu Ala Leu Leu Phe Pro Pro Lys Glu Glu Thr 
            660                 665                 670         


Pro Lys Leu Phe Lys Thr Leu Leu Gly Gly Thr Gly Lys Ala Ser Leu 
        675                 680                 685             


Ala Arg Leu Leu Lys Leu Lys Arg Glu Gln Ala Ala Gln Lys Lys Glu 
    690                 695                 700                 


Asn Ser Glu Gly Gly Glu Glu Glu Gly Lys Glu Asn Glu Asp Lys Gln 
705                 710                 715                 720 


Lys Glu Asn Glu Asp Lys Gln Lys Glu Asn Glu Asp Lys Gly Lys Glu 
                725                 730                 735     


Asn Glu Asp Lys Asp Lys Gly Arg Glu Pro Glu Glu Lys Pro Leu Asp 
            740                 745                 750         


Arg Pro Glu Cys Thr Ala Ser Pro Ile Ala Val Glu Glu Glu Pro His 
        755                 760                 765             


Ser Val Arg Arg Thr Val Leu Pro Arg Gly Thr Ser Arg Gln Ser Leu 
    770                 775                 780                 


Ile Ile Ser Met Ala Pro Ser Ala Glu Gly Gly Glu Glu Val Leu Thr 
785                 790                 795                 800 


Ile Glu Val Lys Glu Lys Ala Lys Gln 
                805                 


<210>  23
<211>  2454
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence


<220>
<221>  misc_feature
<222>  (1)..(12)
<223>  modified end with NotI site and Kozak

<220>
<221>  misc_feature
<222>  (1)..(8)
<223>  NotI site for subcloning

<220>
<221>  CDS
<222>  (13)..(2448)
<223>  ORF with silent mutations (stop codon and restriction sites 
       BamHI, PstI, SalI, and NdeI)

<220>
<221>  misc_feature
<222>  (2440)..(2442)
<223>  modified stop codon

<220>
<221>  misc_feature
<222>  (2440)..(2445)
<223>  BclI site to facilitate addition of epitope tag

<220>
<221>  misc_feature
<222>  (2446)..(2448)
<223>  additional stop codon

<220>
<221>  misc_feature
<222>  (2449)..(2454)
<223>  PstI site for subcloning

<400>  23
gcggccgcca cc atg ttt aaa tcg ctg aca aaa gtc aac aag gtg aag cct       51
              Met Phe Lys Ser Leu Thr Lys Val Asn Lys Val Lys Pro         
              1               5                   10                      

ata gga gag aac aat gag aat gaa caa agt tct cgt cgg aat gaa gaa         99
Ile Gly Glu Asn Asn Glu Asn Glu Gln Ser Ser Arg Arg Asn Glu Glu           
    15                  20                  25                            

ggc tct cac cca agt aat cag tct cag caa acc aca gca cag gaa gaa        147
Gly Ser His Pro Ser Asn Gln Ser Gln Gln Thr Thr Ala Gln Glu Glu           
30                  35                  40                  45            

aac aaa ggt gaa gag aaa tct ctc aaa acc aag tca act cca gtc acg        195
Asn Lys Gly Glu Glu Lys Ser Leu Lys Thr Lys Ser Thr Pro Val Thr           
                50                  55                  60                

tct gaa gag cca cac acc aac ata caa gac aaa ctc tcc aag aaa aat        243
Ser Glu Glu Pro His Thr Asn Ile Gln Asp Lys Leu Ser Lys Lys Asn           
            65                  70                  75                    

tcc tct gga gat ctg acc aca aac cct gac cct caa aat gca gca gaa        291
Ser Ser Gly Asp Leu Thr Thr Asn Pro Asp Pro Gln Asn Ala Ala Glu           
        80                  85                  90                        

cca act gga aca gtg cca gag cag aag gaa atg gac ccc ggg aaa gaa        339
Pro Thr Gly Thr Val Pro Glu Gln Lys Glu Met Asp Pro Gly Lys Glu           
    95                  100                 105                           

ggt cca aac agc cca caa aac aaa ccg cca gca gct cct gtt ata aat        387
Gly Pro Asn Ser Pro Gln Asn Lys Pro Pro Ala Ala Pro Val Ile Asn           
110                 115                 120                 125           

gag tat gcc gat gcc cag cta cac aac ctg gtg aaa aga atg cgt caa        435
Glu Tyr Ala Asp Ala Gln Leu His Asn Leu Val Lys Arg Met Arg Gln           
                130                 135                 140               

aga aca gcc ctc tac aag aaa aag ttg gta gag gga gat ctc tcc tca        483
Arg Thr Ala Leu Tyr Lys Lys Lys Leu Val Glu Gly Asp Leu Ser Ser           
            145                 150                 155                   

ccc gaa gcc agc cca caa act gca aag ccc acg gct gta cca cca gta        531
Pro Glu Ala Ser Pro Gln Thr Ala Lys Pro Thr Ala Val Pro Pro Val           
        160                 165                 170                       

aaa gaa agc gat gat aag cca aca gaa cat tac tac agg ctg ttg tgg        579
Lys Glu Ser Asp Asp Lys Pro Thr Glu His Tyr Tyr Arg Leu Leu Trp           
    175                 180                 185                           

ttc aaa gtc aaa aag atg cct tta aca gag tac tta aag cga att aaa        627
Phe Lys Val Lys Lys Met Pro Leu Thr Glu Tyr Leu Lys Arg Ile Lys           
190                 195                 200                 205           

ctt cca aac agc ata gat tca tac aca gat cga ctc tat ctc ctg tgg        675
Leu Pro Asn Ser Ile Asp Ser Tyr Thr Asp Arg Leu Tyr Leu Leu Trp           
                210                 215                 220               

ctc ttg ctt gtc act ctt gcc tat aac tgg aac tgc tgt ttt ata cca        723
Leu Leu Leu Val Thr Leu Ala Tyr Asn Trp Asn Cys Cys Phe Ile Pro           
            225                 230                 235                   

ctg cgc ctc gtc ttc cca tat caa acc gca gac aac ata cac tac tgg        771
Leu Arg Leu Val Phe Pro Tyr Gln Thr Ala Asp Asn Ile His Tyr Trp           
        240                 245                 250                       

ctt att gcg gac atc atc tgt gat atc atc tac ctt tat gat atg cta        819
Leu Ile Ala Asp Ile Ile Cys Asp Ile Ile Tyr Leu Tyr Asp Met Leu           
    255                 260                 265                           

ttt atc cag ccc aga ctc cag ttt gta aga gga gga gac ata ata gtg        867
Phe Ile Gln Pro Arg Leu Gln Phe Val Arg Gly Gly Asp Ile Ile Val           
270                 275                 280                 285           

gat tca aat gag cta agg aaa cac tac agg act tct aca aaa ttt cag        915
Asp Ser Asn Glu Leu Arg Lys His Tyr Arg Thr Ser Thr Lys Phe Gln           
                290                 295                 300               

ttg gat gtc gca tca ata ata cca ttt gat att tgc tac ctc ttc ttt        963
Leu Asp Val Ala Ser Ile Ile Pro Phe Asp Ile Cys Tyr Leu Phe Phe           
            305                 310                 315                   

ggg ttt aat cca atg ttt aga gca aat agg atg tta aag tac act tca       1011
Gly Phe Asn Pro Met Phe Arg Ala Asn Arg Met Leu Lys Tyr Thr Ser           
        320                 325                 330                       

ttt ttt gaa ttt aat cat cac cta gag tct ata atg gac aaa gca tat       1059
Phe Phe Glu Phe Asn His His Leu Glu Ser Ile Met Asp Lys Ala Tyr           
    335                 340                 345                           

atc tac aga gtt att cga aca act gga tac ttg ctg ttt att ctg cac       1107
Ile Tyr Arg Val Ile Arg Thr Thr Gly Tyr Leu Leu Phe Ile Leu His           
350                 355                 360                 365           

att aat gcc tgt gtt tat tac tgg gct tca aac tat gaa gga att ggc       1155
Ile Asn Ala Cys Val Tyr Tyr Trp Ala Ser Asn Tyr Glu Gly Ile Gly           
                370                 375                 380               

act act aga tgg gtg tat gat ggg gaa gga aac gag tat ctg aga tgt       1203
Thr Thr Arg Trp Val Tyr Asp Gly Glu Gly Asn Glu Tyr Leu Arg Cys           
            385                 390                 395                   

tat tat tgg gca gtt cga act tta att acc att ggt ggc ctt cca gaa       1251
Tyr Tyr Trp Ala Val Arg Thr Leu Ile Thr Ile Gly Gly Leu Pro Glu           
        400                 405                 410                       

cca caa act tta ttt gaa att gtt ttt caa ctc ttg aat ttt ttt tct       1299
Pro Gln Thr Leu Phe Glu Ile Val Phe Gln Leu Leu Asn Phe Phe Ser           
    415                 420                 425                           

gga gtt ttt gtg ttc tcc agt tta att ggt cag atg aga gat gtg att       1347
Gly Val Phe Val Phe Ser Ser Leu Ile Gly Gln Met Arg Asp Val Ile           
430                 435                 440                 445           

gga gca gct aca gcc aat cag aac tac ttc cgc gcc tgc atg gat gac       1395
Gly Ala Ala Thr Ala Asn Gln Asn Tyr Phe Arg Ala Cys Met Asp Asp           
                450                 455                 460               

acc att gcc tac atg aac aat tac tcc att cct aaa ctt gtg caa aag       1443
Thr Ile Ala Tyr Met Asn Asn Tyr Ser Ile Pro Lys Leu Val Gln Lys           
            465                 470                 475                   

cga gtt cgg act tgg tat gaa tat aca tgg gac tct caa aga atg cta       1491
Arg Val Arg Thr Trp Tyr Glu Tyr Thr Trp Asp Ser Gln Arg Met Leu           
        480                 485                 490                       

gat gag tct gat ttg ctt aag acc cta cca act acg gtc cag tta gcc       1539
Asp Glu Ser Asp Leu Leu Lys Thr Leu Pro Thr Thr Val Gln Leu Ala           
    495                 500                 505                           

ctc gcc att gat gtg aac ttc agc atc atc agc aaa gtt gac ttg ttc       1587
Leu Ala Ile Asp Val Asn Phe Ser Ile Ile Ser Lys Val Asp Leu Phe           
510                 515                 520                 525           

aag ggt tgt gat aca cag atg att tat gac atg ttg cta aga ttg aaa       1635
Lys Gly Cys Asp Thr Gln Met Ile Tyr Asp Met Leu Leu Arg Leu Lys           
                530                 535                 540               

tcc gtt ctc tat ttg cct ggt gac ttt gtc tgc aaa aag gga gaa att       1683
Ser Val Leu Tyr Leu Pro Gly Asp Phe Val Cys Lys Lys Gly Glu Ile           
            545                 550                 555                   

ggc aag gaa atg tat atc atc aag cat gga gaa gtc caa gtt ctt gga       1731
Gly Lys Glu Met Tyr Ile Ile Lys His Gly Glu Val Gln Val Leu Gly           
        560                 565                 570                       

ggc cct gat ggt act aaa gtt ctg gtt act ctg aaa gct ggg tcg gtg       1779
Gly Pro Asp Gly Thr Lys Val Leu Val Thr Leu Lys Ala Gly Ser Val           
    575                 580                 585                           

ttt gga gaa atc agc ctt cta gca gca gga gga gga aac cgt cga act       1827
Phe Gly Glu Ile Ser Leu Leu Ala Ala Gly Gly Gly Asn Arg Arg Thr           
590                 595                 600                 605           

gcc aat gtg gtg gcc cac ggg ttt gcc aat ctt tta act cta gac aaa       1875
Ala Asn Val Val Ala His Gly Phe Ala Asn Leu Leu Thr Leu Asp Lys           
                610                 615                 620               

aag acc ctc caa gaa att cta gtg cat tat cca gat tct gaa aga atc       1923
Lys Thr Leu Gln Glu Ile Leu Val His Tyr Pro Asp Ser Glu Arg Ile           
            625                 630                 635                   

ctc atg aag aaa gcc aga gtg ctt tta aag cag aag gct aag acc gca       1971
Leu Met Lys Lys Ala Arg Val Leu Leu Lys Gln Lys Ala Lys Thr Ala           
        640                 645                 650                       

gaa gca acc cct cca aga aaa gat ctt gcc ctc ctc ttc cca ccg aaa       2019
Glu Ala Thr Pro Pro Arg Lys Asp Leu Ala Leu Leu Phe Pro Pro Lys           
    655                 660                 665                           

gaa gag aca ccc aaa ctg ttt aaa act ctc cta gga ggc aca gga aaa       2067
Glu Glu Thr Pro Lys Leu Phe Lys Thr Leu Leu Gly Gly Thr Gly Lys           
670                 675                 680                 685           

gca agt ctt gca aga cta ctc aaa ttg aag cga gag caa gca gct cag       2115
Ala Ser Leu Ala Arg Leu Leu Lys Leu Lys Arg Glu Gln Ala Ala Gln           
                690                 695                 700               

aag aaa gaa aat tct gaa gga gga gag gaa gaa gga aaa gaa aat gaa       2163
Lys Lys Glu Asn Ser Glu Gly Gly Glu Glu Glu Gly Lys Glu Asn Glu           
            705                 710                 715                   

gat aaa caa aaa gaa aat gaa gat aaa caa aaa gaa aat gaa gat aaa       2211
Asp Lys Gln Lys Glu Asn Glu Asp Lys Gln Lys Glu Asn Glu Asp Lys           
        720                 725                 730                       

gga aaa gaa aat gaa gat aaa gat aaa gga aga gag cca gaa gag aag       2259
Gly Lys Glu Asn Glu Asp Lys Asp Lys Gly Arg Glu Pro Glu Glu Lys           
    735                 740                 745                           

cca ctg gac aga cct gaa tgt aca gca agt cct att gca gtg gag gaa       2307
Pro Leu Asp Arg Pro Glu Cys Thr Ala Ser Pro Ile Ala Val Glu Glu           
750                 755                 760                 765           

gaa ccc cac tca gtt aga agg aca gtt tta ccc aga ggg act tct cgt       2355
Glu Pro His Ser Val Arg Arg Thr Val Leu Pro Arg Gly Thr Ser Arg           
                770                 775                 780               

caa tca ctc att atc agc atg gct cct tct gct gag ggc gga gaa gag       2403
Gln Ser Leu Ile Ile Ser Met Ala Pro Ser Ala Glu Gly Gly Glu Glu           
            785                 790                 795                   

gtt ctt act att gaa gtc aaa gaa aag gct aag caa tga tca taa           2448
Val Leu Thr Ile Glu Val Lys Glu Lys Ala Lys Gln     Ser                   
        800                 805                     810                   

ctgcag                                                                2454


<210>  24
<211>  809
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  24

Met Phe Lys Ser Leu Thr Lys Val Asn Lys Val Lys Pro Ile Gly Glu 
1               5                   10                  15      


Asn Asn Glu Asn Glu Gln Ser Ser Arg Arg Asn Glu Glu Gly Ser His 
            20                  25                  30          


Pro Ser Asn Gln Ser Gln Gln Thr Thr Ala Gln Glu Glu Asn Lys Gly 
        35                  40                  45              


Glu Glu Lys Ser Leu Lys Thr Lys Ser Thr Pro Val Thr Ser Glu Glu 
    50                  55                  60                  


Pro His Thr Asn Ile Gln Asp Lys Leu Ser Lys Lys Asn Ser Ser Gly 
65                  70                  75                  80  


Asp Leu Thr Thr Asn Pro Asp Pro Gln Asn Ala Ala Glu Pro Thr Gly 
                85                  90                  95      


Thr Val Pro Glu Gln Lys Glu Met Asp Pro Gly Lys Glu Gly Pro Asn 
            100                 105                 110         


Ser Pro Gln Asn Lys Pro Pro Ala Ala Pro Val Ile Asn Glu Tyr Ala 
        115                 120                 125             


Asp Ala Gln Leu His Asn Leu Val Lys Arg Met Arg Gln Arg Thr Ala 
    130                 135                 140                 


Leu Tyr Lys Lys Lys Leu Val Glu Gly Asp Leu Ser Ser Pro Glu Ala 
145                 150                 155                 160 


Ser Pro Gln Thr Ala Lys Pro Thr Ala Val Pro Pro Val Lys Glu Ser 
                165                 170                 175     


Asp Asp Lys Pro Thr Glu His Tyr Tyr Arg Leu Leu Trp Phe Lys Val 
            180                 185                 190         


Lys Lys Met Pro Leu Thr Glu Tyr Leu Lys Arg Ile Lys Leu Pro Asn 
        195                 200                 205             


Ser Ile Asp Ser Tyr Thr Asp Arg Leu Tyr Leu Leu Trp Leu Leu Leu 
    210                 215                 220                 


Val Thr Leu Ala Tyr Asn Trp Asn Cys Cys Phe Ile Pro Leu Arg Leu 
225                 230                 235                 240 


Val Phe Pro Tyr Gln Thr Ala Asp Asn Ile His Tyr Trp Leu Ile Ala 
                245                 250                 255     


Asp Ile Ile Cys Asp Ile Ile Tyr Leu Tyr Asp Met Leu Phe Ile Gln 
            260                 265                 270         


Pro Arg Leu Gln Phe Val Arg Gly Gly Asp Ile Ile Val Asp Ser Asn 
        275                 280                 285             


Glu Leu Arg Lys His Tyr Arg Thr Ser Thr Lys Phe Gln Leu Asp Val 
    290                 295                 300                 


Ala Ser Ile Ile Pro Phe Asp Ile Cys Tyr Leu Phe Phe Gly Phe Asn 
305                 310                 315                 320 


Pro Met Phe Arg Ala Asn Arg Met Leu Lys Tyr Thr Ser Phe Phe Glu 
                325                 330                 335     


Phe Asn His His Leu Glu Ser Ile Met Asp Lys Ala Tyr Ile Tyr Arg 
            340                 345                 350         


Val Ile Arg Thr Thr Gly Tyr Leu Leu Phe Ile Leu His Ile Asn Ala 
        355                 360                 365             


Cys Val Tyr Tyr Trp Ala Ser Asn Tyr Glu Gly Ile Gly Thr Thr Arg 
    370                 375                 380                 


Trp Val Tyr Asp Gly Glu Gly Asn Glu Tyr Leu Arg Cys Tyr Tyr Trp 
385                 390                 395                 400 


Ala Val Arg Thr Leu Ile Thr Ile Gly Gly Leu Pro Glu Pro Gln Thr 
                405                 410                 415     


Leu Phe Glu Ile Val Phe Gln Leu Leu Asn Phe Phe Ser Gly Val Phe 
            420                 425                 430         


Val Phe Ser Ser Leu Ile Gly Gln Met Arg Asp Val Ile Gly Ala Ala 
        435                 440                 445             


Thr Ala Asn Gln Asn Tyr Phe Arg Ala Cys Met Asp Asp Thr Ile Ala 
    450                 455                 460                 


Tyr Met Asn Asn Tyr Ser Ile Pro Lys Leu Val Gln Lys Arg Val Arg 
465                 470                 475                 480 


Thr Trp Tyr Glu Tyr Thr Trp Asp Ser Gln Arg Met Leu Asp Glu Ser 
                485                 490                 495     


Asp Leu Leu Lys Thr Leu Pro Thr Thr Val Gln Leu Ala Leu Ala Ile 
            500                 505                 510         


Asp Val Asn Phe Ser Ile Ile Ser Lys Val Asp Leu Phe Lys Gly Cys 
        515                 520                 525             


Asp Thr Gln Met Ile Tyr Asp Met Leu Leu Arg Leu Lys Ser Val Leu 
    530                 535                 540                 


Tyr Leu Pro Gly Asp Phe Val Cys Lys Lys Gly Glu Ile Gly Lys Glu 
545                 550                 555                 560 


Met Tyr Ile Ile Lys His Gly Glu Val Gln Val Leu Gly Gly Pro Asp 
                565                 570                 575     


Gly Thr Lys Val Leu Val Thr Leu Lys Ala Gly Ser Val Phe Gly Glu 
            580                 585                 590         


Ile Ser Leu Leu Ala Ala Gly Gly Gly Asn Arg Arg Thr Ala Asn Val 
        595                 600                 605             


Val Ala His Gly Phe Ala Asn Leu Leu Thr Leu Asp Lys Lys Thr Leu 
    610                 615                 620                 


Gln Glu Ile Leu Val His Tyr Pro Asp Ser Glu Arg Ile Leu Met Lys 
625                 630                 635                 640 


Lys Ala Arg Val Leu Leu Lys Gln Lys Ala Lys Thr Ala Glu Ala Thr 
                645                 650                 655     


Pro Pro Arg Lys Asp Leu Ala Leu Leu Phe Pro Pro Lys Glu Glu Thr 
            660                 665                 670         


Pro Lys Leu Phe Lys Thr Leu Leu Gly Gly Thr Gly Lys Ala Ser Leu 
        675                 680                 685             


Ala Arg Leu Leu Lys Leu Lys Arg Glu Gln Ala Ala Gln Lys Lys Glu 
    690                 695                 700                 


Asn Ser Glu Gly Gly Glu Glu Glu Gly Lys Glu Asn Glu Asp Lys Gln 
705                 710                 715                 720 


Lys Glu Asn Glu Asp Lys Gln Lys Glu Asn Glu Asp Lys Gly Lys Glu 
                725                 730                 735     


Asn Glu Asp Lys Asp Lys Gly Arg Glu Pro Glu Glu Lys Pro Leu Asp 
            740                 745                 750         


Arg Pro Glu Cys Thr Ala Ser Pro Ile Ala Val Glu Glu Glu Pro His 
        755                 760                 765             


Ser Val Arg Arg Thr Val Leu Pro Arg Gly Thr Ser Arg Gln Ser Leu 
    770                 775                 780                 


Ile Ile Ser Met Ala Pro Ser Ala Glu Gly Gly Glu Glu Val Leu Thr 
785                 790                 795                 800 


Ile Glu Val Lys Glu Lys Ala Lys Gln 
                805                 


<210>  25
<211>  11714
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence


<220>
<221>  misc_feature
<222>  (1)..(130)
<223>  5' ITR

<220>
<221>  misc_feature
<222>  (241)..(544)
<223>  CMV enhancer

<220>
<221>  misc_feature
<222>  (546)..(823)
<223>  chicken beta-actin promoter

<220>
<221>  misc_feature
<222>  (824)..(1795)
<223>  CBA exon 1 and intron

<220>
<221>  misc_feature
<222>  (1859)..(1864)
<223>  Kozak

<220>
<221>  misc_feature
<222>  (1865)..(3826)
<223>  human codon optimized CHM (REP-1)

<220>
<221>  misc_feature
<222>  (3847)..(4054)
<223>  bGH poly(A) signal

<220>
<221>  misc_feature
<222>  (4104)..(4233)
<223>  3' ITR

<400>  25
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg      240

cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt      300

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      360

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      420

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      480

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattaa      540

catggtcgag gtgagcccca cgttctgctt cactctcccc atctcccccc cctccccacc      600

cccaattttg tatttattta ttttttaatt attttgtgca gcgatggggg cggggggggg      660

gggggggcgc gcgccaggcg gggcggggcg gggcgagggg cggggcgggg cgaggcggag      720

aggtgcggcg gcagccaatc agagcggcgc gctccgaaag tttcctttta tggcgaggcg      780

gcggcggcgg cggccctata aaaagcgaag cgcgcggcgg gcggggagtc gctgcgacgc      840

tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg      900

accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag      960

cgcttggttt aatgacggct tgtttctttt ctgtggctgc gtgaaagcct tgaggggctc     1020

cgggagggcc ctttgtgcgg ggggagcggc tcggggggtg cgtgcgtgtg tgtgtgcgtg     1080

gggagcgccg cgtgcggctc cgcgctgccc ggcggctgtg agcgctgcgg gcgcggcgcg     1140

gggctttgtg cgctccgcag tgtgcgcgag gggagcgcgg ccgggggcgg tgccccgcgg     1200

tgcggggggg gctgcgaggg gaacaaaggc tgcgtgcggg gtgtgtgcgt gggggggtga     1260

gcagggggtg tgggcgcgtc ggtcgggctg caaccccccc tgcacccccc tccccgagtt     1320

gctgagcacg gcccggcttc gggtgcgggg ctccgtacgg ggcgtggcgc ggggctcgcc     1380

gtgccgggcg gggggtggcg gcaggtgggg gtgccgggcg gggcggggcc gcctcgggcc     1440

ggggagggct cgggggaggg gcgcggcggc ccccggagcg ccggcggctg tcgaggcgcg     1500

gcgagccgca gccattgcct tttatggtaa tcgtgcgaga gggcgcaggg acttcctttg     1560

tcccaaatct gtgcggagcc gaaatctggg aggcgccgcc gcaccccctc tagcgggcgc     1620

ggggcgaagc ggtgcggcgc cggcaggaag gaaatgggcg gggagggcct tcgtgcgtcg     1680

ccgcgccgcc gtccccttct ccctctccag cctcggggct gtccgcgggg ggacggctgc     1740

cttcgggggg gacggggcag ggcggggttc ggcttctggc gtgtgaccgg cggctctaga     1800

caattgtact aaccttcttc tctttcctct cctgacaggt tggtgtacac tagcggccgc     1860

caccatggct gataccctgc cctctgaatt cgacgtgatt gtgattggaa ccggactccc     1920

tgaatcgatc atcgccgcgg cctgttcccg gtccggtcgg cgcgtgctgc acgtcgattc     1980

gagaagctac tacggaggga attgggcctc attctccttc tccggactgc tctcctggct     2040

gaaggagtat caggagaact ccgacattgt ctccgactca cctgtgtggc aggaccagat     2100

cctggaaaac gaggaagcaa tagccctgag ccggaaggac aagaccatcc agcacgtgga     2160

ggtgttctgt tatgcctccc aagacctcca tgaggacgtg gaagaggctg gagcgttgca     2220

gaagaatcat gccctcgtga cctccgctaa ctccaccgag gcagccgaca gcgccttcct     2280

gccgaccgag gatgaatccc tgtcaactat gtcgtgcgaa atgctgaccg aacagactcc     2340

gagctccgac cccgaaaacg ccctggaagt gaacggagcg gaagtgaccg gcgaaaagga     2400

gaaccattgc gacgacaaga cttgtgtccc atccacttcc gcggaggaca tgtccgagaa     2460

tgtgcctatc gccgaggaca ccaccgaaca gcccaagaag aacagaatca cgtacagcca     2520

gatcatcaag gaggggcgga ggtttaacat cgatctggtg tcgaagctgc tgtacagccg     2580

cggtctgctg atcgatctgc tcattaagtc gaacgtgtcg agatacgccg agttcaagaa     2640

catcacaagg attctcgcct tccgggaagg aagagtggaa caagtgccgt gctcccgggc     2700

cgacgtgttc aactcaaagc aacttaccat ggtggaaaag cgcatgctga tgaaattcct     2760

gaccttctgc atggagtacg aaaagtaccc tgatgagtac aagggttacg aagaaattac     2820

tttctacgag tacctcaaga cccagaagct gaccccgaat ctgcagtaca ttgtgatgca     2880

ctcaatcgca atgacctccg aaaccgcctc ctcgaccatc gacgggctca aggccaccaa     2940

gaacttcctg cactgtttgg ggcgctacgg caacactccg ttcctcttcc cgctgtacgg     3000

ccagggagag ctgcctcagt gtttctgccg gatgtgcgcc gtgttcggcg gaatctactg     3060

tctccgccac tcggtccagt gcctggtggt ggacaaggaa tccaggaagt gcaaagccat     3120

tattgaccag ttcggacaac ggatcatttc cgagcacttt cttgtggagg actcatactt     3180

cccggagaac atgtgctctc gggtccagta tcgacagatt tccagggcgg tgctcattac     3240

tgaccggagc gtcctcaaga ccgatagcga ccagcagatc tccatcctga ccgtgccggc     3300

ggaagaaccc ggcacttttg ccgtgcgcgt gatcgagctt tgctcatcca ccatgacttg     3360

catgaaaggc acttacctgg tgcacctgac gtgcacctca tcgaaaaccg ctagagagga     3420

cctggaatcc gtcgtccaaa agctgttcgt gccttacacc gagatggaaa ttgaaaacga     3480

acaagtggag aagccccgca tcctttgggc cctgtacttt aacatgcgcg attcctccga     3540

tatctcgcgg tcctgctata acgacttgcc ttcgaacgtc tacgtctgct ccgggccaga     3600

ctgcggtctt ggcaacgaca atgccgtgaa gcaggcggaa acactgttcc aagagatctg     3660

ccctaacgag gatttttgcc cgcccccccc aaaccccgag gatatcatct tggacggaga     3720

cagcctgcag ccagaagcat ccgagtccag cgccatcccg gaggccaaca gcgaaacctt     3780

caaggagagc actaacctgg gcaacctgga agagtccagc gaatgatcat aggatctctg     3840

cctcgactgt gccttctagt tgccagccat ctgttgtttg cccctccccc gtgccttcct     3900

tgaccctgga aggtgccact cccactgtcc tttcctaata aaatgaggaa attgcatcgc     3960

attgtctgag taggtgtcat tctattctgg ggggtggggt ggggcaggac agcaaggggg     4020

aggattggga agacaatagc aggcatgctg gggactcgag ttctacgtag ataagtagca     4080

tggcgggtta atcattaact acaaggaacc cctagtgatg gagttggcca ctccctctct     4140

gcgcgctcgc tcgctcactg aggccgggcg accaaaggtc gcccgacgcc cgggctttgc     4200

ccgggcggcc tcagtgagcg agcgagcgcg cagccttata aggatatggt gcactctcag     4260

tacaatctgc tctgatgccg catagttaag ccagccccga cacccgccaa cacccgctga     4320

cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc     4380

cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga gacgaaaggg     4440

cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt cttagacgtc     4500

aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca     4560

ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa     4620

aaggaagagt atgagccata ttcaacggga aacgtcgagg ccgcgattaa attccaacat     4680

ggatgctgat ttatatgggt ataaatgggc tcgcgataat gtcgggcaat caggtgcgac     4740

aatctatcgc ttgtatggga agcccgatgc gccagagttg tttctgaaac atggcaaagg     4800

tagcgttgcc aatgatgtta cagatgagat ggtcagacta aactggctga cggaatttat     4860

gccacttccg accatcaagc attttatccg tactcctgat gatgcatggt tactcaccac     4920

tgcgatcccc ggaaaaacag cgttccaggt attagaagaa tatcctgatt caggtgaaaa     4980

tattgttgat gcgctggcag tgttcctgcg ccggttgcac tcgattcctg tttgtaattg     5040

tccttttaac agcgatcgcg tatttcgcct cgctcaggcg caatcacgaa tgaataacgg     5100

tttggttgat gcgagtgatt ttgatgacga gcgtaatggc tggcctgttg aacaagtctg     5160

gaaagaaatg cataaacttt tgccattctc accggattca gtcgtcactc atggtgattt     5220

ctcacttgat aaccttattt ttgacgaggg gaaattaata ggttgtattg atgttggacg     5280

agtcggaatc gcagaccgat accaggatct tgccatccta tggaactgcc tcggtgagtt     5340

ttctccttca ttacagaaac ggctttttca aaaatatggt attgataatc ctgatatgaa     5400

taaattgcag tttcatttga tgctcgatga gtttttctaa actgtcagac caagtttact     5460

catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga     5520

tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt     5580

cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct     5640

gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc     5700

taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgttc     5760

ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc     5820

tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg     5880

ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt     5940

cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg     6000

agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg     6060

gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt     6120

atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag     6180

gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt     6240

gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta     6300

ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt     6360

cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc     6420

cgattcatta atgcaggcgc ctgttgattt gagttttggg tttagcgtga caagtttgcg     6480

agggtgatcg gagtaatcag taaatagctc tccgcctaca atgacgtcat aaccatgatt     6540

tctggttttc tgacgtccgt tatcagttcc ctccgaccac gccagcatat cgaggaacgc     6600

cttacgttga ttattgattt ctaccatctt ctactccggc ttttttagca gcgaagcgtt     6660

tgataagcga accaatcgag tcagtaccga tgtagccgat aaacacgctc gttatataag     6720

cgagattgct acttagtccg gcgaagtcga gaaggtcacg aatgaaccag gcgataatgg     6780

cgcacatcgt tgcgtcgatt actgtttttg taaacgcacc gccattatat ctgccgcgaa     6840

ggtacgccat tgcaaacgca aggattgccc cgatgccttg ttcctttgcc gcgagaatgg     6900

cggccaacag gtcatgtttt tctggcatct tcatgtctta cccccaataa ggggatttgc     6960

tctatttaat taggaataag gtcgattact gatagaacaa atccaggcta ctgtgtttag     7020

taatcagatt tgttcgtgac cgatatgcac gggcaaaacg gcaggaggtt gttagcgcga     7080

cctcctgcca cccgctttca cgaaggtcat gtgtaaaagg ccgcagcgta actattacta     7140

atgaattcag gacagacagt ggctacggct cagtttgggt tgtgctgttg ctgggcggcg     7200

atgacgcctg tacgcatttg gtgatccggt tctgcttccg gtattcgctt aattcagcac     7260

aacggaaaga gcactggcta accaggctcg ccgactcttc acgattatcg actcaatgct     7320

cttacctgtt gtgcagatat aaaaaatccc gaaaccgtta tgcaggctct aactattacc     7380

tgcgaactgt ttcgggattg cattttgcag acctctctgc ctgcgatggt tggagttcca     7440

gacgatacgt cgaagtgacc aactaggcgg aatcggtagt aagcgccgcc tcttttcatc     7500

tcactaccac aacgagcgaa ttaacccatc gttgagtcaa atttacccaa ttttattcaa     7560

taagtcaata tcatgccgtt aatatgttgc catccgtggc aatcatgctg ctaacgtgtg     7620

accgcattca aaatgttgtc tgcgattgac tcttctttgt ggcattgcac caccagagcg     7680

tcatacagcg gcttaacagt gcgtgaccag gtgggttggg taaggtttgg gattagcatc     7740

gtcacagcgc gatatgctgc gcttgctggc atccttgaat agccgacgcc tttgcatctt     7800

ccgcactctt tctcgacaac tctcccccac agctctgttt tggcaatatc aaccgcacgg     7860

cctgtaccat ggcaatctct gcatcttgcc cccggcgtcg cggcactacg gcaataatcc     7920

gcataagcga atgttgcgag cacttgcagt acctttgcct tagtatttcc ttcaagcttt     7980

gccacaccac ggtatttccc cgataccttg tgtgcaaatt gcatcagata gttgatagcc     8040

ttttgtttgt cgttctggct gagttcgtgc ttaccgcaga atgcagccat accgaatccg     8100

gcttgtgatt gcgccatccc catagcagcc atcacatcag taccggaaag agagtcagaa     8160

gccgtggccc gtggtgagtc gctcatcatc gggctttttg gcgaatgaaa tttagctacg     8220

ctttcgagtc tcatgcgcct tctccctgta cctgaatcaa tgttaggttt ccgcagaaca     8280

ctgcgccggt atcgatatac atttggttgg caaacttgag tggtttcact gctggcgtat     8340

gaccaaagat gaacgtgtcc gcgcctttga tttctttcac gatcccgttt tgtgagttgc     8400

tgattcgttc gcggttccag attacctgct gatgatcaac tggctttcca aactcgtatt     8460

cgtcaaaggg ataatcggcg tggcagataa catatttttt atctttgctc accagttcga     8520

tgattaacgg aagttcatct gctttatggg caagagcttt agccagaatt tctttgtcgt     8580

aatcgagatt aaagaaccag ccaccgccat taagcagcca gtgattaacg tttccacgct     8640

ctgataagcc atcaatcatc atttgctcat ggtttccacg tacagctctg aaccagggga     8700

atgtgattaa ttccaggcat tcaacgttct ctgcaccacg atcaaccaaa tcgcccaccg     8760

agataagcag gtcttttttg ttgtcgaatc caatcgtatc cagtttgttc atcaggttcg     8820

tgtagcatcc gtgcagatcg ccaactaccc aaatatttcg gtatttgctg ccatcaattt     8880

tttcgtaata gcgcatctct ttcactccat ccgcgatgaa ccatgagaac gtcgttgacg     8940

atggcgtgca ttttcccgtc tttatcatca acgtattttc tgaccgtacc gcgactacat     9000

ttcagtctgc gtgctacttc tgtctgattt ccgtatgctt caacgagcat gtctggaatg     9060

gtttttactg agaacgtcat gcggcctcac ttctgctatt tcgcaggtct ttgagtttct     9120

gttggtactc tgccttgatc gccttgcact cttcgatagt ccagcgatgg cggttatggt     9180

ttgattcgat ttcgtctact gcttcctgcc cgatgcggct aatcagttcg acgcgatacg     9240

gaacgagatt tccgcttttg tgctggttgc acaccacgca ttgcttgtga atattgcgtt     9300

cattaaatcg gagttgaggt gccgcagcag ttgtccggta atgtccggca tcccactgag     9360

cagacgtgag cgttccgcac gagatacatg gtaagtcgcg gtctctttct ctgatgaagg     9420

cgtttacggc ttgttgggct tgtttaatcc agtaactgcg gggctttaag gcgagttttc     9480

gaatcttaag tttatctttc tgtttctgct cctctcgtcg tcgtttcttc tctgctgctt     9540

tttccgcttt ttcgcgttct ttacttcgtc gttcgagtgc tatcttggtt ccacactctg     9600

gagagcacca ccactgatta gcgaatgcag ggtgaaacca ttcccggcat tcatcgtttt     9660

tacatcgtct tcgcgctggt ttagccatca tcttcttcct cgtgcatcga gctattcgga     9720

tcgctcatca gttctgcgca gcagtgctca cacacgtgaa cttccagcac atgcagcttc     9780

tgaccgcagt tagcgcacgt taaagctcgc tcgacgcttt cttgttcgta acttcgattt     9840

tggtcaatca ccttgttttc ctcgcacgac gtcttagcca ccggatatcc cacaggtgag     9900

ccgtgtagtt gaaggttttt acgtcagatt cttttgggat tggcttgggt ttatttctgg     9960

tgcgtttcgt tggaaggtat ttgcagtttt cgcagattat gtcggtgata cttcgtcgct    10020

gtctcgccac acgtcctcct tttcctgcgg tagtggtaac acccctgttg gtgttctttc    10080

acaccggaga caccatcgat tccagtaagg ttgatttggt cggaagcggt tatcttcttt    10140

gcattcaccg caccgataac atcgcatcat gcagcttccc tcccgaagtc gaaatcaagc    10200

tgccctccaa atatttcgca tgactcagaa caagagccgg tatcgaatct tttagctcgt    10260

accatgtcct gatacagggc ttgataatca ttttctgaat acattttcgc gataccgtcc    10320

agcgacattc ttcctcggta cataatctcc tttggcgttt cccgatgtcc gtcacgcaca    10380

tgggatcccg tgatgacctc attaaaaaca cgctgcaatc cctcctcatc tttgcaggca    10440

agtccgattt tttgcgttga ttttttaatg cagaatatgc agttaccgag atgttccggt    10500

atttgcaaat cgaatggttg ttgcttccac catgcgagga tatcttcctt ctcaaagtct    10560

gacagttcag caagatatct gattccaggc tttggcttta gccgcttcgg ttcatcagct    10620

ctgatgccaa tccacgtggt gtaattccct cgcccgaaat ggtcatcaca gtatttggtg    10680

aagggaacga gttttaatct gtcagtgcag aacgcgccgc cgacgtatgg agtgccatat    10740

ttctttacca tatcgataaa tggcttcaga acaggcattc gcgtctgaat atcctttggt    10800

tcccataccg tataaccatt tggctgtcca agctccgggt tgatatcaac ctgcaatacg    10860

gtgagcggta tatcccagaa cttcacaact tccctgacaa accgatatgt cattggatgt    10920

tcacaacctg tatccatgaa aacgtaatgc acgtctttac ctgcccgtcg cttttgctcc    10980

attagccaga gcaaatatgc tgacgtcctg ccaccggaga aactaacgac atttatcatg    11040

cagccctgtc tccccatctc gctttccact ccagagccag tctcgcttcg tctgaccact    11100

taacgccacg ctctgtaccg aatgcctgta taagctctaa tagctccgca aattcgccta    11160

cacgcatcct gctggttgac tggcctatta ccacaaagcc attcccggca aggttaggaa    11220

caacatcctg ctgctttaat gctgcggtaa acacacactt ccagctttct gcatccagcc    11280

agcgaccatg ccattcaacc tgacgagaga cgtcacctaa gcaggcccat agcttcctgt    11340

tttggtctaa gctgcggttg cgttcctgaa tggttactac gattggtttg gttgggtctg    11400

gaaggatttg ctgtactgcg tgaatagcgt tttgctgatg tgctggagat cgaatttcaa    11460

aggttagttt tttcatgact tccctctccc ccaaataaaa aggctggcac gacaggtttc    11520

ccgactggaa agcgggcagt gagcgcaacg caattaatgt gagttagctc actcattagg    11580

caccccaggc tttacacttt atgcttccgg ctcgtatgtt gtgtggaatt gtgagcggat    11640

aacaatttca cacaggaaac agctatgacc atgattacgc caagctgtcg actctagagg    11700

atcccctaat aagg                                                      11714


<210>  26
<211>  6647
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence


<220>
<221>  misc_feature
<222>  (1)..(130)
<223>  5' ITR

<220>
<221>  misc_feature
<222>  (241)..(544)
<223>  CMV enhancer

<220>
<221>  misc_feature
<222>  (546)..(823)
<223>  chicken beta-actin promoter

<220>
<221>  misc_feature
<222>  (824)..(1795)
<223>  CBA exon 1 and intron

<220>
<221>  misc_feature
<222>  (1859)..(1864)
<223>  Kozak

<220>
<221>  misc_feature
<222>  (1865)..(3826)
<223>  human codon optimized CHM (REP-1)

<220>
<221>  misc_feature
<222>  (3847)..(4054)
<223>  bGH poly(A) signal

<220>
<221>  misc_feature
<222>  (4104)..(4233)
<223>  3' ITR

<400>  26
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg      240

cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt      300

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      360

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      420

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      480

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattaa      540

catggtcgag gtgagcccca cgttctgctt cactctcccc atctcccccc cctccccacc      600

cccaattttg tatttattta ttttttaatt attttgtgca gcgatggggg cggggggggg      660

gggggggcgc gcgccaggcg gggcggggcg gggcgagggg cggggcgggg cgaggcggag      720

aggtgcggcg gcagccaatc agagcggcgc gctccgaaag tttcctttta tggcgaggcg      780

gcggcggcgg cggccctata aaaagcgaag cgcgcggcgg gcggggagtc gctgcgacgc      840

tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg      900

accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag      960

cgcttggttt aatgacggct tgtttctttt ctgtggctgc gtgaaagcct tgaggggctc     1020

cgggagggcc ctttgtgcgg ggggagcggc tcggggggtg cgtgcgtgtg tgtgtgcgtg     1080

gggagcgccg cgtgcggctc cgcgctgccc ggcggctgtg agcgctgcgg gcgcggcgcg     1140

gggctttgtg cgctccgcag tgtgcgcgag gggagcgcgg ccgggggcgg tgccccgcgg     1200

tgcggggggg gctgcgaggg gaacaaaggc tgcgtgcggg gtgtgtgcgt gggggggtga     1260

gcagggggtg tgggcgcgtc ggtcgggctg caaccccccc tgcacccccc tccccgagtt     1320

gctgagcacg gcccggcttc gggtgcgggg ctccgtacgg ggcgtggcgc ggggctcgcc     1380

gtgccgggcg gggggtggcg gcaggtgggg gtgccgggcg gggcggggcc gcctcgggcc     1440

ggggagggct cgggggaggg gcgcggcggc ccccggagcg ccggcggctg tcgaggcgcg     1500

gcgagccgca gccattgcct tttatggtaa tcgtgcgaga gggcgcaggg acttcctttg     1560

tcccaaatct gtgcggagcc gaaatctggg aggcgccgcc gcaccccctc tagcgggcgc     1620

ggggcgaagc ggtgcggcgc cggcaggaag gaaatgggcg gggagggcct tcgtgcgtcg     1680

ccgcgccgcc gtccccttct ccctctccag cctcggggct gtccgcgggg ggacggctgc     1740

cttcgggggg gacggggcag ggcggggttc ggcttctggc gtgtgaccgg cggctctaga     1800

caattgtact aaccttcttc tctttcctct cctgacaggt tggtgtacac tagcggccgc     1860

caccatggct gataccctgc cctctgaatt cgacgtgatt gtgattggaa ccggactccc     1920

tgaatcgatc atcgccgcgg cctgttcccg gtccggtcgg cgcgtgctgc acgtcgattc     1980

gagaagctac tacggaggga attgggcctc attctccttc tccggactgc tctcctggct     2040

gaaggagtat caggagaact ccgacattgt ctccgactca cctgtgtggc aggaccagat     2100

cctggaaaac gaggaagcaa tagccctgag ccggaaggac aagaccatcc agcacgtgga     2160

ggtgttctgt tatgcctccc aagacctcca tgaggacgtg gaagaggctg gagcgttgca     2220

gaagaatcat gccctcgtga cctccgctaa ctccaccgag gcagccgaca gcgccttcct     2280

gccgaccgag gatgaatccc tgtcaactat gtcgtgcgaa atgctgaccg aacagactcc     2340

gagctccgac cccgaaaacg ccctggaagt gaacggagcg gaagtgaccg gcgaaaagga     2400

gaaccattgc gacgacaaga cttgtgtccc atccacttcc gcggaggaca tgtccgagaa     2460

tgtgcctatc gccgaggaca ccaccgaaca gcccaagaag aacagaatca cgtacagcca     2520

gatcatcaag gaggggcgga ggtttaacat cgatctggtg tcgaagctgc tgtacagccg     2580

cggtctgctg atcgatctgc tcattaagtc gaacgtgtcg agatacgccg agttcaagaa     2640

catcacaagg attctcgcct tccgggaagg aagagtggaa caagtgccgt gctcccgggc     2700

cgacgtgttc aactcaaagc aacttaccat ggtggaaaag cgcatgctga tgaaattcct     2760

gaccttctgc atggagtacg aaaagtaccc tgatgagtac aagggttacg aagaaattac     2820

tttctacgag tacctcaaga cccagaagct gaccccgaat ctgcagtaca ttgtgatgca     2880

ctcaatcgca atgacctccg aaaccgcctc ctcgaccatc gacgggctca aggccaccaa     2940

gaacttcctg cactgtttgg ggcgctacgg caacactccg ttcctcttcc cgctgtacgg     3000

ccagggagag ctgcctcagt gtttctgccg gatgtgcgcc gtgttcggcg gaatctactg     3060

tctccgccac tcggtccagt gcctggtggt ggacaaggaa tccaggaagt gcaaagccat     3120

tattgaccag ttcggacaac ggatcatttc cgagcacttt cttgtggagg actcatactt     3180

cccggagaac atgtgctctc gggtccagta tcgacagatt tccagggcgg tgctcattac     3240

tgaccggagc gtcctcaaga ccgatagcga ccagcagatc tccatcctga ccgtgccggc     3300

ggaagaaccc ggcacttttg ccgtgcgcgt gatcgagctt tgctcatcca ccatgacttg     3360

catgaaaggc acttacctgg tgcacctgac gtgcacctca tcgaaaaccg ctagagagga     3420

cctggaatcc gtcgtccaaa agctgttcgt gccttacacc gagatggaaa ttgaaaacga     3480

acaagtggag aagccccgca tcctttgggc cctgtacttt aacatgcgcg attcctccga     3540

tatctcgcgg tcctgctata acgacttgcc ttcgaacgtc tacgtctgct ccgggccaga     3600

ctgcggtctt ggcaacgaca atgccgtgaa gcaggcggaa acactgttcc aagagatctg     3660

ccctaacgag gatttttgcc cgcccccccc aaaccccgag gatatcatct tggacggaga     3720

cagcctgcag ccagaagcat ccgagtccag cgccatcccg gaggccaaca gcgaaacctt     3780

caaggagagc actaacctgg gcaacctgga agagtccagc gaatgatcat aggatctctg     3840

cctcgactgt gccttctagt tgccagccat ctgttgtttg cccctccccc gtgccttcct     3900

tgaccctgga aggtgccact cccactgtcc tttcctaata aaatgaggaa attgcatcgc     3960

attgtctgag taggtgtcat tctattctgg ggggtggggt ggggcaggac agcaaggggg     4020

aggattggga agacaatagc aggcatgctg gggactcgag ttctacgtag ataagtagca     4080

tggcgggtta atcattaact acaaggaacc cctagtgatg gagttggcca ctccctctct     4140

gcgcgctcgc tcgctcactg aggccgggcg accaaaggtc gcccgacgcc cgggctttgc     4200

ccgggcggcc tcagtgagcg agcgagcgcg cagccttata aggatatggt gcactctcag     4260

tacaatctgc tctgatgccg catagttaag ccagccccga cacccgccaa cacccgctga     4320

cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc     4380

cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga gacgaaaggg     4440

cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt cttagacgtc     4500

aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca     4560

ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa     4620

aaggaagagt atgagccata ttcaacggga aacgtcgagg ccgcgattaa attccaacat     4680

ggatgctgat ttatatgggt ataaatgggc tcgcgataat gtcgggcaat caggtgcgac     4740

aatctatcgc ttgtatggga agcccgatgc gccagagttg tttctgaaac atggcaaagg     4800

tagcgttgcc aatgatgtta cagatgagat ggtcagacta aactggctga cggaatttat     4860

gccacttccg accatcaagc attttatccg tactcctgat gatgcatggt tactcaccac     4920

tgcgatcccc ggaaaaacag cgttccaggt attagaagaa tatcctgatt caggtgaaaa     4980

tattgttgat gcgctggcag tgttcctgcg ccggttgcac tcgattcctg tttgtaattg     5040

tccttttaac agcgatcgcg tatttcgcct cgctcaggcg caatcacgaa tgaataacgg     5100

tttggttgat gcgagtgatt ttgatgacga gcgtaatggc tggcctgttg aacaagtctg     5160

gaaagaaatg cataaacttt tgccattctc accggattca gtcgtcactc atggtgattt     5220

ctcacttgat aaccttattt ttgacgaggg gaaattaata ggttgtattg atgttggacg     5280

agtcggaatc gcagaccgat accaggatct tgccatccta tggaactgcc tcggtgagtt     5340

ttctccttca ttacagaaac ggctttttca aaaatatggt attgataatc ctgatatgaa     5400

taaattgcag tttcatttga tgctcgatga gtttttctaa actgtcagac caagtttact     5460

catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga     5520

tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt     5580

cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct     5640

gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc     5700

taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgttc     5760

ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc     5820

tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg     5880

ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt     5940

cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg     6000

agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg     6060

gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt     6120

atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag     6180

gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt     6240

gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta     6300

ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt     6360

cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc     6420

cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca     6480

acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc     6540

cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg     6600

accatgatta cgccaagctg tcgactctag aggatcccct aataagg                   6647


<210>  27
<211>  11971
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence


<220>
<221>  misc_feature
<222>  (1)..(130)
<223>  5' ITR

<220>
<221>  misc_feature
<222>  (241)..(544)
<223>  CMV enhancer

<220>
<221>  misc_feature
<222>  (546)..(823)
<223>  chicken beta-actin promoter

<220>
<221>  misc_feature
<222>  (824)..(1795)
<223>  CBA exon 1 and intron

<220>
<221>  misc_feature
<222>  (1859)..(1864)
<223>  Kozak

<220>
<221>  misc_feature
<222>  (1865)..(3826)
<223>  human codon optimized CHM (REP-1)

<220>
<221>  misc_feature
<222>  (3847)..(4054)
<223>  bGH poly(A) signal

<220>
<221>  misc_feature
<222>  (4104)..(4233)
<223>  3' ITR

<400>  27
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg      240

cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt      300

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      360

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      420

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      480

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattaa      540

catggtcgag gtgagcccca cgttctgctt cactctcccc atctcccccc cctccccacc      600

cccaattttg tatttattta ttttttaatt attttgtgca gcgatggggg cggggggggg      660

gggggggcgc gcgccaggcg gggcggggcg gggcgagggg cggggcgggg cgaggcggag      720

aggtgcggcg gcagccaatc agagcggcgc gctccgaaag tttcctttta tggcgaggcg      780

gcggcggcgg cggccctata aaaagcgaag cgcgcggcgg gcggggagtc gctgcgacgc      840

tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg      900

accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag      960

cgcttggttt aatgacggct tgtttctttt ctgtggctgc gtgaaagcct tgaggggctc     1020

cgggagggcc ctttgtgcgg ggggagcggc tcggggggtg cgtgcgtgtg tgtgtgcgtg     1080

gggagcgccg cgtgcggctc cgcgctgccc ggcggctgtg agcgctgcgg gcgcggcgcg     1140

gggctttgtg cgctccgcag tgtgcgcgag gggagcgcgg ccgggggcgg tgccccgcgg     1200

tgcggggggg gctgcgaggg gaacaaaggc tgcgtgcggg gtgtgtgcgt gggggggtga     1260

gcagggggtg tgggcgcgtc ggtcgggctg caaccccccc tgcacccccc tccccgagtt     1320

gctgagcacg gcccggcttc gggtgcgggg ctccgtacgg ggcgtggcgc ggggctcgcc     1380

gtgccgggcg gggggtggcg gcaggtgggg gtgccgggcg gggcggggcc gcctcgggcc     1440

ggggagggct cgggggaggg gcgcggcggc ccccggagcg ccggcggctg tcgaggcgcg     1500

gcgagccgca gccattgcct tttatggtaa tcgtgcgaga gggcgcaggg acttcctttg     1560

tcccaaatct gtgcggagcc gaaatctggg aggcgccgcc gcaccccctc tagcgggcgc     1620

ggggcgaagc ggtgcggcgc cggcaggaag gaaatgggcg gggagggcct tcgtgcgtcg     1680

ccgcgccgcc gtccccttct ccctctccag cctcggggct gtccgcgggg ggacggctgc     1740

cttcgggggg gacggggcag ggcggggttc ggcttctggc gtgtgaccgg cggctctaga     1800

caattgtact aaccttcttc tctttcctct cctgacaggt tggtgtacac tagcggccgc     1860

caccatggct gataccctgc cctctgaatt cgacgtgatt gtgattggaa ccggactccc     1920

tgaatcgatc atcgccgcgg cctgttcccg gtccggtcgg cgcgtgctgc acgtcgattc     1980

gagaagctac tacggaggga attgggcctc attctccttc tccggactgc tctcctggct     2040

gaaggagtat caggagaact ccgacattgt ctccgactca cctgtgtggc aggaccagat     2100

cctggaaaac gaggaagcaa tagccctgag ccggaaggac aagaccatcc agcacgtgga     2160

ggtgttctgt tatgcctccc aagacctcca tgaggacgtg gaagaggctg gagcgttgca     2220

gaagaatcat gccctcgtga cctccgctaa ctccaccgag gcagccgaca gcgccttcct     2280

gccgaccgag gatgaatccc tgtcaactat gtcgtgcgaa atgctgaccg aacagactcc     2340

gagctccgac cccgaaaacg ccctggaagt gaacggagcg gaagtgaccg gcgaaaagga     2400

gaaccattgc gacgacaaga cttgtgtccc atccacttcc gcggaggaca tgtccgagaa     2460

tgtgcctatc gccgaggaca ccaccgaaca gcccaagaag aacagaatca cgtacagcca     2520

gatcatcaag gaggggcgga ggtttaacat cgatctggtg tcgaagctgc tgtacagccg     2580

cggtctgctg atcgatctgc tcattaagtc gaacgtgtcg agatacgccg agttcaagaa     2640

catcacaagg attctcgcct tccgggaagg aagagtggaa caagtgccgt gctcccgggc     2700

cgacgtgttc aactcaaagc aacttaccat ggtggaaaag cgcatgctga tgaaattcct     2760

gaccttctgc atggagtacg aaaagtaccc tgatgagtac aagggttacg aagaaattac     2820

tttctacgag tacctcaaga cccagaagct gaccccgaat ctgcagtaca ttgtgatgca     2880

ctcaatcgca atgacctccg aaaccgcctc ctcgaccatc gacgggctca aggccaccaa     2940

gaacttcctg cactgtttgg ggcgctacgg caacactccg ttcctcttcc cgctgtacgg     3000

ccagggagag ctgcctcagt gtttctgccg gatgtgcgcc gtgttcggcg gaatctactg     3060

tctccgccac tcggtccagt gcctggtggt ggacaaggaa tccaggaagt gcaaagccat     3120

tattgaccag ttcggacaac ggatcatttc cgagcacttt cttgtggagg actcatactt     3180

cccggagaac atgtgctctc gggtccagta tcgacagatt tccagggcgg tgctcattac     3240

tgaccggagc gtcctcaaga ccgatagcga ccagcagatc tccatcctga ccgtgccggc     3300

ggaagaaccc ggcacttttg ccgtgcgcgt gatcgagctt tgctcatcca ccatgacttg     3360

catgaaaggc acttacctgg tgcacctgac gtgcacctca tcgaaaaccg ctagagagga     3420

cctggaatcc gtcgtccaaa agctgttcgt gccttacacc gagatggaaa ttgaaaacga     3480

acaagtggag aagccccgca tcctttgggc cctgtacttt aacatgcgcg attcctccga     3540

tatctcgcgg tcctgctata acgacttgcc ttcgaacgtc tacgtctgct ccgggccaga     3600

ctgcggtctt ggcaacgaca atgccgtgaa gcaggcggaa acactgttcc aagagatctg     3660

ccctaacgag gatttttgcc cgcccccccc aaaccccgag gatatcatct tggacggaga     3720

cagcctgcag ccagaagcat ccgagtccag cgccatcccg gaggccaaca gcgaaacctt     3780

caaggagagc actaacctgg gcaacctgga agagtccagc gaatgatcat aggatctctg     3840

cctcgactgt gccttctagt tgccagccat ctgttgtttg cccctccccc gtgccttcct     3900

tgaccctgga aggtgccact cccactgtcc tttcctaata aaatgaggaa attgcatcgc     3960

attgtctgag taggtgtcat tctattctgg ggggtggggt ggggcaggac agcaaggggg     4020

aggattggga agacaatagc aggcatgctg gggactcgag ttctacgtag ataagtagca     4080

tggcgggtta atcattaact acaaggaacc cctagtgatg gagttggcca ctccctctct     4140

gcgcgctcgc tcgctcactg aggccgggcg accaaaggtc gcccgacgcc cgggctttgc     4200

ccgggcggcc tcagtgagcg agcgagcgcg cagccttaat taacctaata aggaaaatga     4260

agggaagttc ctatactttc tagagaatag gaacttctat agggagtcga ataagggcga     4320

cacaaaaggt attctaaatg cataataaat actgataaca tcttatagtt tgtattatat     4380

tttgtattat cgttgacatg tataattttg atatcaaaaa ctgattttcc ctttattatt     4440

ttcgagattt attttcttaa ttctctttaa caaactagaa atattgtata tacaaaaaat     4500

cataaataat agatgaatag tttaattata ggtgttcatc aatcgaaaaa gcaacgtatc     4560

ttatttaaag tgcgttgctt ttttctcatt tataaggtta aataattctc atatatcaag     4620

caaagtgaca ggcgccctta aatattctga caaatgctct ttccctaaac tccccccata     4680

aaaaaacccg ccgaagcggg tttttacgtt atttgcggat taacgattac tcgttatcag     4740

aaccgcccag gatgcctggc agttccctac tctcgccgct gcgctcggtc gttcggctgc     4800

gggacctcag cgctagcgga gtgtatactg gcttactatg ttggcactga tgagggtgtc     4860

agtgaagtgc ttcatgtggc aggagaaaaa aggctgcacc ggtgcgtcag cagaatatgt     4920

gatacaggat atattccgct tcctcgctca ctgactcgct acgctcggtc gttcgactgc     4980

ggcgagcgga aatggcttac gaacggggcg gagatttcct ggaagatgcc aggaagatac     5040

ttaacaggga agtgagaggg ccgcggcaaa gccgtttttc cataggctcc gcccccctga     5100

caagcatcac gaaatctgac gctcaaatca gtggtggcga aacccgacag gactataaag     5160

ataccaggcg tttccccctg gcggctccct cgtgcgctct cctgttcctg cctttcggtt     5220

taccggtgtc attccgctgt tatggccgcg tttgtctcat tccacgcctg acactcagtt     5280

ccgggtaggc agttcgctcc aagctggact gtatgcacga accccccgtt cagtccgacc     5340

gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggaaagacat gcaaaagcac     5400

cactggcagc agccactggt aattgattta gaggagttag tcttgaagtc atgcgccggt     5460

taaggctaaa ctgaaaggac aagttttggt gactgcgctc ctccaagcca gttacctcgg     5520

ttcaaagagt tggtagctca gagaaccttc gaaaaaccgc cctgcaaggc ggttttttcg     5580

ttttcagagc aagagattac gcgcagacca aaacgatctc aagaagatca tcttattaag     5640

ctccttttta tttgggggag agggaagtca tgaaaaaact aacctttgaa attcgatctc     5700

cagcacatca gcaaaacgct attcacgcag tacagcaaat ccttccagac ccaaccaaac     5760

caatcgtagt aaccattcag gaacgcaacc gcagcttaga ccaaaacagg aagctatggg     5820

cctgcttagg tgacgtctct cgtcaggttg aatggcatgg tcgctggctg gatgcagaaa     5880

gctggaagtg tgtgtttacc gcagcattaa agcagcagga tgttgttcct aaccttgccg     5940

ggaatggctt tgtggtaata ggccagtcaa ccagcaggat gcgtgtaggc gaatttgcgg     6000

agctattaga gcttatacag gcattcggta cagagcgtgg cgttaagtgg tcagacgaag     6060

cgagactggc tctggagtgg aaagcgagat ggggagacag ggctgcatga taaatgtcgt     6120

tagtttctcc ggtggcagga cgtcagcata tttgctctgg ctaatggagc aaaagcgacg     6180

ggcaggtaaa gacgtgcatt acgttttcat ggatacaggt tgtgaacatc caatgacata     6240

tcggtttgtc agggaagttg tgaagttctg ggatataccg ctcaccgtat tgcaggttga     6300

tatcaacccg gagcttggac agccaaatgg ttatacggta tgggaaccaa aggatattca     6360

gacgcgaatg cctgttctga agccatttat cgatatggta aagaaatatg gcactccata     6420

cgtcggcggc gcgttctgca ctgacagatt aaaactcgtt cccttcacca aatactgtga     6480

tgaccatttc gggcgaggga attacaccac gtggattggc atcagagctg atgaaccgaa     6540

gcggctaaag ccaaagcctg gaatcagata tcttgctgaa ctgtcagact ttgagaagga     6600

agatatcctc gcatggtgga agcaacaacc attcgatttg caaataccgg aacatctcgg     6660

taactgcata ttctgcatta aaaaatcaac gcaaaaaatc ggacttgcct gcaaagatga     6720

ggagggattg cagcgtgttt ttaatgaggt catcacggga tcccatgtgc gtgacggaca     6780

tcgggaaacg ccaaaggaga ttatgtaccg aggaagaatg tcgctggacg gtatcgcgaa     6840

aatgtattca gaaaatgatt atcaagccct gtatcaggac atggtacgag ctaaaagatt     6900

cgataccggc tcttgttctg agtcatgcga aatatttgga gggcagcttg atttcgactt     6960

cgggagggaa gctgcatgat gcgatgttat cggtgcggtg aatgcaaaga agataaccgc     7020

ttccgaccaa atcaacctta ctggaatcga tggtgtctcc ggtgtgaaag aacaccaaca     7080

ggggtgttac cactaccgca ggaaaaggag gacgtgtggc gagacagcga cgaagtatca     7140

ccgacataat ctgcgaaaac tgcaaatacc ttccaacgaa acgcaccaga aataaaccca     7200

agccaatccc aaaagaatct gacgtaaaaa ccttcaacta cacggctcac ctgtgggata     7260

tccggtggct aagacgtcgt gcgaggaaaa caaggtgatt gaccaaaatc gaagttacga     7320

acaagaaagc gtcgagcgag ctttaacgtg cgctaactgc ggtcagaagc tgcatgtgct     7380

ggaagttcac gtgtgtgagc actgctgcgc agaactgatg agcgatccga atagctcgat     7440

gcacgaggaa gaagatgatg gctaaaccag cgcgaagacg atgtaaaaac gatgaatgcc     7500

gggaatggtt tcaccctgca ttcgctaatc agtggtggtg ctctccagag tgtggaacca     7560

agatagcact cgaacgacga agtaaagaac gcgaaaaagc ggaaaaagca gcagagaaga     7620

aacgacgacg agaggagcag aaacagaaag ataaacttaa gattcgaaaa ctcgccttaa     7680

agccccgcag ttactggatt aaacaagccc aacaagccgt aaacgccttc atcagagaaa     7740

gagaccgcga cttaccatgt atctcgtgcg gaacgctcac gtctgctcag tgggatgccg     7800

gacattaccg gacaactgct gcggcacctc aactccgatt taatgaacgc aatattcaca     7860

agcaatgcgt ggtgtgcaac cagcacaaaa gcggaaatct cgttccgtat cgcgtcgaac     7920

tgattagccg catcgggcag gaagcagtag acgaaatcga atcaaaccat aaccgccatc     7980

gctggactat cgaagagtgc aaggcgatca aggcagagta ccaacagaaa ctcaaagacc     8040

tgcgaaatag cagaagtgag gccgcatgac gttctcagta aaaaccattc cagacatgct     8100

cgttgaagca tacggaaatc agacagaagt agcacgcaga ctgaaatgta gtcgcggtac     8160

ggtcagaaaa tacgttgatg ataaagacgg gaaaatgcac gccatcgtca acgacgttct     8220

catggttcat cgcggatgga gtgaaagaga tgcgctatta cgaaaaaatt gatggcagca     8280

aataccgaaa tatttgggta gttggcgatc tgcacggatg ctacacgaac ctgatgaaca     8340

aactggatac gattggattc gacaacaaaa aagacctgct tatctcggtg ggcgatttgg     8400

ttgatcgtgg tgcagagaac gttgaatgcc tggaattaat cacattcccc tggttcagag     8460

ctgtacgtgg aaaccatgag caaatgatga ttgatggctt atcagagcgt ggaaacgtta     8520

atcactggct gcttaatggc ggtggctggt tctttaatct cgattacgac aaagaaattc     8580

tggctaaagc tcttgcccat aaagcagatg aacttccgtt aatcatcgaa ctggtgagca     8640

aagataaaaa atatgttatc tgccacgccg attatccctt tgacgaatac gagtttggaa     8700

agccagttga tcatcagcag gtaatctgga accgcgaacg aatcagcaac tcacaaaacg     8760

ggatcgtgaa agaaatcaaa ggcgcggaca cgttcatctt tggtcatacg ccagcagtga     8820

aaccactcaa gtttgccaac caaatgtata tcgataccgg cgcagtgttc tgcggaaacc     8880

taacattgat tcaggtacag ggagaaggcg catgagactc gaaagcgtag ctaaatttca     8940

ttcgccaaaa agcccgatga tgagcgactc accacgggcc acggcttctg actctctttc     9000

cggtactgat gtgatggctg ctatggggat ggcgcaatca caagccggat tcggtatggc     9060

tgcattctgc ggtaagcacg aactcagcca gaacgacaaa caaaaggcta tcaactatct     9120

gatgcaattt gcacacaagg tatcggggaa ataccgtggt gtggcaaagc ttgaaggaaa     9180

tactaaggca aaggtactgc aagtgctcgc aacattcgct tatgcggatt attgccgtag     9240

tgccgcgacg ccgggggcaa gatgcagaga ttgccatggt acaggccgtg cggttgatat     9300

tgccaaaaca gagctgtggg ggagagttgt cgagaaagag tgcggaagat gcaaaggcgt     9360

cggctattca aggatgccag caagcgcagc atatcgcgct gtgacgatgc taatcccaaa     9420

ccttacccaa cccacctggt cacgcactgt taagccgctg tatgacgctc tggtggtgca     9480

atgccacaaa gaagagtcaa tcgcagacaa cattttgaat gcggtcacac gttagcagca     9540

tgattgccac ggatggcaac atattaacgg catgatattg acttattgaa taaaattggg     9600

taaatttgac tcaacgatgg gttaattcgc tcgttgtggt agtgagatga aaagaggcgg     9660

cgcttactac cgattccgcc tagttggtca cttcgacgta tcgtctggaa ctccaaccat     9720

cgcaggcaga gaggtctgca aaatgcaatc ccgaaacagt tcgcaggtaa tagttagagc     9780

ctgcataacg gtttcgggat tttttatatc tgcacaacag gtaagagcat tgagtcgata     9840

atcgtgaaga gtcggcgagc ctggttagcc agtgctcttt ccgttgtgct gaattaagcg     9900

aataccggaa gcagaaccgg atcaccaaat gcgtacaggc gtcatcgccg cccagcaaca     9960

gcacaaccca aactgagccg tagccactgt ctgtcctgaa ttcattagta atagttacgc    10020

tgcggccttt tacacatgac cttcgtgaaa gcgggtggca ggaggtcgcg ctaacaacct    10080

cctgccgttt tgcccgtgca tatcggtcac gaacaaatct gattactaaa cacagtagcc    10140

tggatttgtt ctatcagtaa tcgaccttat tcctaattaa atagagcaaa tccccttatt    10200

gggggtaaga catgaagatg ccagaaaaac atgacctgtt ggccgccatt ctcgcggcaa    10260

aggaacaagg catcggggca atccttgcgt ttgcaatggc gtaccttcgc ggcagatata    10320

atggcggtgc gtttacaaaa acagtaatcg acgcaacgat gtgcgccatt atcgcctggt    10380

tcattcgtga ccttctcgac ttcgccggac taagtagcaa tctcgcttat ataacgagcg    10440

tgtttatcgg ctacatcggt actgactcga ttggttcgct tatcaaacgc ttcgctgcta    10500

aaaaagccgg agtagaagat ggtagaaatc aataatcaac gtaaggcgtt cctcgatatg    10560

ctggcgtggt cggagggaac tgataacgga cgtcagaaaa ccagaaatca tggttatgac    10620

gtcattgtag gcggagagct atttactgat tactccgatc accctcgcaa acttgtcacg    10680

ctaaacccaa aactcaaatc aacaggcgca gcttttagaa aaactcatcg agcatcaaat    10740

gaaactgcaa tttattcata tcaggattat caataccata tttttgaaaa agccgtttct    10800

gtaatgaagg agaaaactca ccgaggcagt tccataggat ggcaagatcc tggtatcggt    10860

ctgcgattcc gactcgtcca acatcaatac aacctattaa tttcccctcg tcaaaaataa    10920

ggttatcaag tgagaaatca ccatgagtga cgactgaatc cggtgagaat ggcaaaagtt    10980

tatgcatttc tttccagact tgttcaacag gccagccatt acgctcgtca tcaaaatcac    11040

tcgcatcaac caaaccgtta ttcattcgtg attgcgcctg agcgaggcga aatacgcgat    11100

cgctgttaaa aggacaatta caaacaggaa tcgagtgcaa ccggcgcagg aacactgcca    11160

gcgcatcaac aatattttca cctgaatcag gatattcttc taatacctgg aacgctgttt    11220

ttccggggat cgcagtggtg agtaaccatg catcatcagg agtacggata aaatgcttga    11280

tggtcggaag tggcataaat tccgtcagcc agtttagtct gaccatctca tctgtaacat    11340

cattggcaac gctacctttg ccatgtttca gaaacaactc tggcgcatcg ggcttcccat    11400

acaagcgata gattgtcgca cctgattgcc cgacattatc gcgagcccat ttatacccat    11460

ataaatcagc atccatgttg gaatttaatc gcggcctcga cgtttcccgt tgaatatggc    11520

tcatattctt cctttttcaa tattattgaa gcatttatca gggttattgt ctcatgagcg    11580

gatacatatt tgaatgtatt tagaaaaata aacaaatagg ggtcagtgtt acaaccaatt    11640

aaccaattct gaacattatc gcgagcccat ttatacctga atatggctca taacacccct    11700

tgtttgcctg gcggcagtag cgcggtggtc ccacctgacc ccatgccgaa ctcagaagtg    11760

aaacgccgta gcgccgatgg tagtgtgggg actccccatg cgagagtagg gaactgccag    11820

gcatcaaata aaacgaaagg ctcagtcgaa agactgggcc tttcgcccgg gctaattagg    11880

gggtgtcgcc cttattcgac tctataggga agttcctatt ctctagaaag tataggaact    11940

tctgaagggg ggtcgatcga cttaattaag g                                   11971


<210>  28
<211>  6900
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence


<220>
<221>  misc_feature
<222>  (1)..(130)
<223>  5' ITR

<220>
<221>  misc_feature
<222>  (241)..(544)
<223>  CMV enhancer

<220>
<221>  misc_feature
<222>  (546)..(823)
<223>  chicken beta actin promoter

<220>
<221>  misc_feature
<222>  (824)..(1795)
<223>  CBA exon 1 and intron

<220>
<221>  misc_feature
<222>  (1859)..(1864)
<223>  Kozak sequence

<220>
<221>  misc_feature
<222>  (1865)..(3826)
<223>  human codon optimized CHM (REP-1)

<220>
<221>  misc_feature
<222>  (3847)..(4054)
<223>  bGH poly(A) signal

<220>
<221>  misc_feature
<222>  (4104)..(4233)
<223>  3' ITR

<400>  28
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg      240

cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt      300

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      360

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      420

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      480

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattaa      540

catggtcgag gtgagcccca cgttctgctt cactctcccc atctcccccc cctccccacc      600

cccaattttg tatttattta ttttttaatt attttgtgca gcgatggggg cggggggggg      660

gggggggcgc gcgccaggcg gggcggggcg gggcgagggg cggggcgggg cgaggcggag      720

aggtgcggcg gcagccaatc agagcggcgc gctccgaaag tttcctttta tggcgaggcg      780

gcggcggcgg cggccctata aaaagcgaag cgcgcggcgg gcggggagtc gctgcgacgc      840

tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg      900

accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag      960

cgcttggttt aatgacggct tgtttctttt ctgtggctgc gtgaaagcct tgaggggctc     1020

cgggagggcc ctttgtgcgg ggggagcggc tcggggggtg cgtgcgtgtg tgtgtgcgtg     1080

gggagcgccg cgtgcggctc cgcgctgccc ggcggctgtg agcgctgcgg gcgcggcgcg     1140

gggctttgtg cgctccgcag tgtgcgcgag gggagcgcgg ccgggggcgg tgccccgcgg     1200

tgcggggggg gctgcgaggg gaacaaaggc tgcgtgcggg gtgtgtgcgt gggggggtga     1260

gcagggggtg tgggcgcgtc ggtcgggctg caaccccccc tgcacccccc tccccgagtt     1320

gctgagcacg gcccggcttc gggtgcgggg ctccgtacgg ggcgtggcgc ggggctcgcc     1380

gtgccgggcg gggggtggcg gcaggtgggg gtgccgggcg gggcggggcc gcctcgggcc     1440

ggggagggct cgggggaggg gcgcggcggc ccccggagcg ccggcggctg tcgaggcgcg     1500

gcgagccgca gccattgcct tttatggtaa tcgtgcgaga gggcgcaggg acttcctttg     1560

tcccaaatct gtgcggagcc gaaatctggg aggcgccgcc gcaccccctc tagcgggcgc     1620

ggggcgaagc ggtgcggcgc cggcaggaag gaaatgggcg gggagggcct tcgtgcgtcg     1680

ccgcgccgcc gtccccttct ccctctccag cctcggggct gtccgcgggg ggacggctgc     1740

cttcgggggg gacggggcag ggcggggttc ggcttctggc gtgtgaccgg cggctctaga     1800

caattgtact aaccttcttc tctttcctct cctgacaggt tggtgtacac tagcggccgc     1860

caccatggct gataccctgc cctctgaatt cgacgtgatt gtgattggaa ccggactccc     1920

tgaatcgatc atcgccgcgg cctgttcccg gtccggtcgg cgcgtgctgc acgtcgattc     1980

gagaagctac tacggaggga attgggcctc attctccttc tccggactgc tctcctggct     2040

gaaggagtat caggagaact ccgacattgt ctccgactca cctgtgtggc aggaccagat     2100

cctggaaaac gaggaagcaa tagccctgag ccggaaggac aagaccatcc agcacgtgga     2160

ggtgttctgt tatgcctccc aagacctcca tgaggacgtg gaagaggctg gagcgttgca     2220

gaagaatcat gccctcgtga cctccgctaa ctccaccgag gcagccgaca gcgccttcct     2280

gccgaccgag gatgaatccc tgtcaactat gtcgtgcgaa atgctgaccg aacagactcc     2340

gagctccgac cccgaaaacg ccctggaagt gaacggagcg gaagtgaccg gcgaaaagga     2400

gaaccattgc gacgacaaga cttgtgtccc atccacttcc gcggaggaca tgtccgagaa     2460

tgtgcctatc gccgaggaca ccaccgaaca gcccaagaag aacagaatca cgtacagcca     2520

gatcatcaag gaggggcgga ggtttaacat cgatctggtg tcgaagctgc tgtacagccg     2580

cggtctgctg atcgatctgc tcattaagtc gaacgtgtcg agatacgccg agttcaagaa     2640

catcacaagg attctcgcct tccgggaagg aagagtggaa caagtgccgt gctcccgggc     2700

cgacgtgttc aactcaaagc aacttaccat ggtggaaaag cgcatgctga tgaaattcct     2760

gaccttctgc atggagtacg aaaagtaccc tgatgagtac aagggttacg aagaaattac     2820

tttctacgag tacctcaaga cccagaagct gaccccgaat ctgcagtaca ttgtgatgca     2880

ctcaatcgca atgacctccg aaaccgcctc ctcgaccatc gacgggctca aggccaccaa     2940

gaacttcctg cactgtttgg ggcgctacgg caacactccg ttcctcttcc cgctgtacgg     3000

ccagggagag ctgcctcagt gtttctgccg gatgtgcgcc gtgttcggcg gaatctactg     3060

tctccgccac tcggtccagt gcctggtggt ggacaaggaa tccaggaagt gcaaagccat     3120

tattgaccag ttcggacaac ggatcatttc cgagcacttt cttgtggagg actcatactt     3180

cccggagaac atgtgctctc gggtccagta tcgacagatt tccagggcgg tgctcattac     3240

tgaccggagc gtcctcaaga ccgatagcga ccagcagatc tccatcctga ccgtgccggc     3300

ggaagaaccc ggcacttttg ccgtgcgcgt gatcgagctt tgctcatcca ccatgacttg     3360

catgaaaggc acttacctgg tgcacctgac gtgcacctca tcgaaaaccg ctagagagga     3420

cctggaatcc gtcgtccaaa agctgttcgt gccttacacc gagatggaaa ttgaaaacga     3480

acaagtggag aagccccgca tcctttgggc cctgtacttt aacatgcgcg attcctccga     3540

tatctcgcgg tcctgctata acgacttgcc ttcgaacgtc tacgtctgct ccgggccaga     3600

ctgcggtctt ggcaacgaca atgccgtgaa gcaggcggaa acactgttcc aagagatctg     3660

ccctaacgag gatttttgcc cgcccccccc aaaccccgag gatatcatct tggacggaga     3720

cagcctgcag ccagaagcat ccgagtccag cgccatcccg gaggccaaca gcgaaacctt     3780

caaggagagc actaacctgg gcaacctgga agagtccagc gaatgatcat aggatctctg     3840

cctcgactgt gccttctagt tgccagccat ctgttgtttg cccctccccc gtgccttcct     3900

tgaccctgga aggtgccact cccactgtcc tttcctaata aaatgaggaa attgcatcgc     3960

attgtctgag taggtgtcat tctattctgg ggggtggggt ggggcaggac agcaaggggg     4020

aggattggga agacaatagc aggcatgctg gggactcgag ttctacgtag ataagtagca     4080

tggcgggtta atcattaact acaaggaacc cctagtgatg gagttggcca ctccctctct     4140

gcgcgctcgc tcgctcactg aggccgggcg accaaaggtc gcccgacgcc cgggctttgc     4200

ccgggcggcc tcagtgagcg agcgagcgcg cagccttaat taacctaata aggaaaatga     4260

agggaagttc ctatactttc tagagaatag gaacttctat agggagtcga ataagggcga     4320

cacaaaaggt attctaaatg cataataaat actgataaca tcttatagtt tgtattatat     4380

tttgtattat cgttgacatg tataattttg atatcaaaaa ctgattttcc ctttattatt     4440

ttcgagattt attttcttaa ttctctttaa caaactagaa atattgtata tacaaaaaat     4500

cataaataat agatgaatag tttaattata ggtgttcatc aatcgaaaaa gcaacgtatc     4560

ttatttaaag tgcgttgctt ttttctcatt tataaggtta aataattctc atatatcaag     4620

caaagtgaca ggcgccctta aatattctga caaatgctct ttccctaaac tccccccata     4680

aaaaaacccg ccgaagcggg tttttacgtt atttgcggat taacgattac tcgttatcag     4740

aaccgcccag gatgcctggc agttccctac tctcgccgct gcgctcggtc gttcggctgc     4800

gggacctcag cgctagcgga gtgtatactg gcttactatg ttggcactga tgagggtgtc     4860

agtgaagtgc ttcatgtggc aggagaaaaa aggctgcacc ggtgcgtcag cagaatatgt     4920

gatacaggat atattccgct tcctcgctca ctgactcgct acgctcggtc gttcgactgc     4980

ggcgagcgga aatggcttac gaacggggcg gagatttcct ggaagatgcc aggaagatac     5040

ttaacaggga agtgagaggg ccgcggcaaa gccgtttttc cataggctcc gcccccctga     5100

caagcatcac gaaatctgac gctcaaatca gtggtggcga aacccgacag gactataaag     5160

ataccaggcg tttccccctg gcggctccct cgtgcgctct cctgttcctg cctttcggtt     5220

taccggtgtc attccgctgt tatggccgcg tttgtctcat tccacgcctg acactcagtt     5280

ccgggtaggc agttcgctcc aagctggact gtatgcacga accccccgtt cagtccgacc     5340

gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggaaagacat gcaaaagcac     5400

cactggcagc agccactggt aattgattta gaggagttag tcttgaagtc atgcgccggt     5460

taaggctaaa ctgaaaggac aagttttggt gactgcgctc ctccaagcca gttacctcgg     5520

ttcaaagagt tggtagctca gagaaccttc gaaaaaccgc cctgcaaggc ggttttttcg     5580

ttttcagagc aagagattac gcgcagacca aaacgatctc aagaagatca tcttattaag     5640

cttttagaaa aactcatcga gcatcaaatg aaactgcaat ttattcatat caggattatc     5700

aataccatat ttttgaaaaa gccgtttctg taatgaagga gaaaactcac cgaggcagtt     5760

ccataggatg gcaagatcct ggtatcggtc tgcgattccg actcgtccaa catcaataca     5820

acctattaat ttcccctcgt caaaaataag gttatcaagt gagaaatcac catgagtgac     5880

gactgaatcc ggtgagaatg gcaaaagttt atgcatttct ttccagactt gttcaacagg     5940

ccagccatta cgctcgtcat caaaatcact cgcatcaacc aaaccgttat tcattcgtga     6000

ttgcgcctga gcgaggcgaa atacgcgatc gctgttaaaa ggacaattac aaacaggaat     6060

cgagtgcaac cggcgcagga acactgccag cgcatcaaca atattttcac ctgaatcagg     6120

atattcttct aatacctgga acgctgtttt tccggggatc gcagtggtga gtaaccatgc     6180

atcatcagga gtacggataa aatgcttgat ggtcggaagt ggcataaatt ccgtcagcca     6240

gtttagtctg accatctcat ctgtaacatc attggcaacg ctacctttgc catgtttcag     6300

aaacaactct ggcgcatcgg gcttcccata caagcgatag attgtcgcac ctgattgccc     6360

gacattatcg cgagcccatt tatacccata taaatcagca tccatgttgg aatttaatcg     6420

cggcctcgac gtttcccgtt gaatatggct catattcttc ctttttcaat attattgaag     6480

catttatcag ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa     6540

acaaataggg gtcagtgtta caaccaatta accaattctg aacattatcg cgagcccatt     6600

tatacctgaa tatggctcat aacacccctt gtttgcctgg cggcagtagc gcggtggtcc     6660

cacctgaccc catgccgaac tcagaagtga aacgccgtag cgccgatggt agtgtgggga     6720

ctccccatgc gagagtaggg aactgccagg catcaaataa aacgaaaggc tcagtcgaaa     6780

gactgggcct ttcgcccggg ctaattaggg ggtgtcgccc ttattcgact ctatagggaa     6840

gttcctattc tctagaaagt ataggaactt ctgaaggggg gtcgatcgac ttaattaagg     6900


<210>  29
<211>  12074
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  29
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg      240

cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt      300

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      360

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      420

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      480

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattaa      540

catggtcgag gtgagcccca cgttctgctt cactctcccc atctcccccc cctccccacc      600

cccaattttg tatttattta ttttttaatt attttgtgca gcgatggggg cggggggggg      660

gggggggcgc gcgccaggcg gggcggggcg gggcgagggg cggggcgggg cgaggcggag      720

aggtgcggcg gcagccaatc agagcggcgc gctccgaaag tttcctttta tggcgaggcg      780

gcggcggcgg cggccctata aaaagcgaag cgcgcggcgg gcggggagtc gctgcgacgc      840

tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg      900

accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag      960

cgcttggttt aatgacggct tgtttctttt ctgtggctgc gtgaaagcct tgaggggctc     1020

cgggagggcc ctttgtgcgg ggggagcggc tcggggggtg cgtgcgtgtg tgtgtgcgtg     1080

gggagcgccg cgtgcggctc cgcgctgccc ggcggctgtg agcgctgcgg gcgcggcgcg     1140

gggctttgtg cgctccgcag tgtgcgcgag gggagcgcgg ccgggggcgg tgccccgcgg     1200

tgcggggggg gctgcgaggg gaacaaaggc tgcgtgcggg gtgtgtgcgt gggggggtga     1260

gcagggggtg tgggcgcgtc ggtcgggctg caaccccccc tgcacccccc tccccgagtt     1320

gctgagcacg gcccggcttc gggtgcgggg ctccgtacgg ggcgtggcgc ggggctcgcc     1380

gtgccgggcg gggggtggcg gcaggtgggg gtgccgggcg gggcggggcc gcctcgggcc     1440

ggggagggct cgggggaggg gcgcggcggc ccccggagcg ccggcggctg tcgaggcgcg     1500

gcgagccgca gccattgcct tttatggtaa tcgtgcgaga gggcgcaggg acttcctttg     1560

tcccaaatct gtgcggagcc gaaatctggg aggcgccgcc gcaccccctc tagcgggcgc     1620

ggggcgaagc ggtgcggcgc cggcaggaag gaaatgggcg gggagggcct tcgtgcgtcg     1680

ccgcgccgcc gtccccttct ccctctccag cctcggggct gtccgcgggg ggacggctgc     1740

cttcgggggg gacggggcag ggcggggttc ggcttctggc gtgtgaccgg cggctctaga     1800

caattgtact aaccttcttc tctttcctct cctgacaggt tggtgtacac tagcggccgc     1860

atggcggata ctctcccttc ggagtttgat gtgatcgtaa tagggacggg tttgcctgaa     1920

tccatcattg cagctgcatg ttcaagaagt ggccggagag ttctgcatgt tgattcaaga     1980

agctactatg gaggaaactg ggccagtttt agcttttcag gactattgtc ctggctaaag     2040

gaataccagg aaaacagtga cattgtaagt gacagtccag tgtggcaaga ccagatcctt     2100

gaaaatgaag aagccattgc tcttagcagg aaggacaaaa ctattcaaca tgtggaagta     2160

ttttgttatg ccagtcagga tttgcatgaa gatgtcgaag aagctggtgc actgcagaaa     2220

aatcatgctc ttgtgacatc tgcaaactcc acagaagctg cagattctgc cttcctgcct     2280

acggaggatg agtcattaag cactatgagc tgtgaaatgc tcacagaaca aactccaagc     2340

agcgatccag agaatgcgct agaagtaaat ggtgctgaag tgacagggga aaaagaaaac     2400

cattgtgatg ataaaacttg tgtgccatca acttcagcag aagacatgag tgaaaatgtg     2460

cctatagcag aagataccac agagcaacca aagaaaaaca gaattactta ctcacaaatt     2520

attaaagaag gcaggagatt taatattgat ttagtatcaa agctgctgta ttctcgagga     2580

ttactaattg atcttctaat caaatctaat gttagtcgat atgcagagtt taaaaatatt     2640

accaggattc ttgcatttcg agaaggacga gtggaacagg ttccgtgttc cagagcagat     2700

gtctttaata gcaaacaact tactatggta gaaaagcgaa tgctaatgaa atttcttaca     2760

ttttgtatgg aatatgagaa atatcctgat gaatataaag gatatgaaga gatcacattt     2820

tatgaatatt taaagactca aaaattaacc cccaacctcc aatatattgt catgcattca     2880

attgcaatga catcagagac agccagcagc accatagatg gtctcaaagc taccaaaaac     2940

tttcttcact gtcttgggcg gtatggcaac actccatttt tgtttccttt atatggccaa     3000

ggagaactcc cccagtgttt ctgcaggatg tgtgctgtgt ttggtggaat ttattgtctt     3060

cgccattcag tacagtgcct tgtagtggac aaagaatcca gaaaatgtaa agcaattata     3120

gatcagtttg gtcagagaat aatctctgag catttcctcg tggaggacag ttactttcct     3180

gagaacatgt gctcacgtgt gcaatacagg cagatctcca gggcagtgct gattacagat     3240

agatctgtcc taaaaacaga ttcagatcaa cagatttcca ttttgacagt gccagcagag     3300

gaaccaggaa cttttgctgt tcgggtcatt gagttatgtt cttcaacgat gacatgcatg     3360

aaaggcacct atttggttca tttgacttgc acatcttcta aaacagcaag agaagattta     3420

gaatcagttg tgcagaaatt gtttgttcca tatactgaaa tggagataga aaatgaacaa     3480

gtagaaaagc caagaattct gtgggctctt tacttcaata tgagagattc gtcagacatc     3540

agcaggagct gttataatga tttaccatcc aacgtttatg tctgctctgg cccagattgt     3600

ggtttaggaa atgataatgc agtcaaacag gctgaaacac ttttccagga aatctgcccc     3660

aatgaagatt tctgtccccc tccaccaaat cctgaagaca ttatccttga tggagacagt     3720

ttacagccag aggcttcaga atccagtgcc ataccagagg ctaactcgga gactttcaag     3780

gaaagcacaa accttggaaa cctagaggag tcctctgaat aaggatctgc ctcgactgtg     3840

ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa     3900

ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt     3960

aggtgtcatt ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa     4020

gacaatagca ggcatgctgg ggactcgagt tctacgtaga taagtagcat ggcgggttaa     4080

tcattaacta caaggaaccc ctagtgatgg agttggccac tccctctctg cgcgctcgct     4140

cgctcactga ggccgggcga ccaaaggtcg cccgacgccc gggctttgcc cgggcggcct     4200

cagtgagcga gcgagcgcgc agccttaatt aacctaagga aaatgaagtg aagttcctat     4260

actttctaga gaataggaac ttctatagtg agtcgaataa gggcgacaca aaatttattc     4320

taaatgcata ataaatactg ataacatctt atagtttgta ttatattttg tattatcgtt     4380

gacatgtata attttgatat caaaaactga ttttcccttt attattttcg agatttattt     4440

tcttaattct ctttaacaaa ctagaaatat tgtatataca aaaaatcata aataatagat     4500

gaatagttta attataggtg ttcatcaatc gaaaaagcaa cgtatcttat ttaaagtgcg     4560

ttgctttttt ctcatttata aggttaaata attctcatat atcaagcaaa gtgacaggcg     4620

cccttaaata ttctgacaaa tgctctttcc ctaaactccc cccataaaaa aacccgccga     4680

agcgggtttt tacgttattt gcggattaac gattactcgt tatcagaacc gcccaggggg     4740

cccgagctta acctttttat ttgggggaga gggaagtcat gaaaaaacta acctttgaaa     4800

ttcgatctcc agcacatcag caaaacgcta ttcacgcagt acagcaaatc cttccagacc     4860

caaccaaacc aatcgtagta accattcagg aacgcaaccg cagcttagac caaaacagga     4920

agctatgggc ctgcttaggt gacgtctctc gtcaggttga atggcatggt cgctggctgg     4980

atgcagaaag ctggaagtgt gtgtttaccg cagcattaaa gcagcaggat gttgttccta     5040

accttgccgg gaatggcttt gtggtaatag gccagtcaac cagcaggatg cgtgtaggcg     5100

aatttgcgga gctattagag cttatacagg cattcggtac agagcgtggc gttaagtggt     5160

cagacgaagc gagactggct ctggagtgga aagcgagatg gggagacagg gctgcatgat     5220

aaatgtcgtt agtttctccg gtggcaggac gtcagcatat ttgctctggc taatggagca     5280

aaagcgacgg gcaggtaaag acgtgcatta cgttttcatg gatacaggtt gtgaacatcc     5340

aatgacatat cggtttgtca gggaagttgt gaagttctgg gatataccgc tcaccgtatt     5400

gcaggttgat atcaacccgg agcttggaca gccaaatggt tatacggtat gggaaccaaa     5460

ggatattcag acgcgaatgc ctgttctgaa gccatttatc gatatggtaa agaaatatgg     5520

cactccatac gtcggcggcg cgttctgcac tgacagatta aaactcgttc ccttcaccaa     5580

atactgtgat gaccatttcg ggcgagggaa ttacaccacg tggattggca tcagagctga     5640

tgaaccgaag cggctaaagc caaagcctgg aatcagatat cttgctgaac tgtcagactt     5700

tgagaaggaa gatatcctcg catggtggaa gcaacaacca ttcgatttgc aaataccgga     5760

acatctcggt aactgcatat tctgcattaa aaaatcaacg caaaaaatcg gacttgcctg     5820

caaagatgag gagggattgc agcgtgtttt taatgaggtc atcacgggat cccatgtgcg     5880

tgacggacat cgggaaacgc caaaggagat tatgtaccga ggaagaatgt cgctggacgg     5940

tatcgcgaaa atgtattcag aaaatgatta tcaagccctg tatcaggaca tggtacgagc     6000

taaaagattc gataccggct cttgttctga gtcatgcgaa atatttggag ggcagcttga     6060

tttcgacttc gggagggaag ctgcatgatg cgatgttatc ggtgcggtga atgcaaagaa     6120

gataaccgct tccgaccaaa tcaaccttac tggaatcgat ggtgtctccg gtgtgaaaga     6180

acaccaacag gggtgttacc actaccgcag gaaaaggagg acgtgtggcg agacagcgac     6240

gaagtatcac cgacataatc tgcgaaaact gcaaatacct tccaacgaaa cgcaccagaa     6300

ataaacccaa gccaatccca aaagaatctg acgtaaaaac cttcaactac acggctcacc     6360

tgtgggatat ccggtggcta agacgtcgtg cgaggaaaac aaggtgattg accaaaatcg     6420

aagttacgaa caagaaagcg tcgagcgagc tttaacgtgc gctaactgcg gtcagaagct     6480

gcatgtgctg gaagttcacg tgtgtgagca ctgctgcgca gaactgatga gcgatccgaa     6540

tagctcgatg cacgaggaag aagatgatgg ctaaaccagc gcgaagacga tgtaaaaacg     6600

atgaatgccg ggaatggttt caccctgcat tcgctaatca gtggtggtgc tctccagagt     6660

gtggaaccaa gatagcactc gaacgacgaa gtaaagaacg cgaaaaagcg gaaaaagcag     6720

cagagaagaa acgacgacga gaggagcaga aacagaaaga taaacttaag attcgaaaac     6780

tcgccttaaa gccccgcagt tactggatta aacaagccca acaagccgta aacgccttca     6840

tcagagaaag agaccgcgac ttaccatgta tctcgtgcgg aacgctcacg tctgctcagt     6900

gggatgccgg acattaccgg acaactgctg cggcacctca actccgattt aatgaacgca     6960

atattcacaa gcaatgcgtg gtgtgcaacc agcacaaaag cggaaatctc gttccgtatc     7020

gcgtcgaact gattagccgc atcgggcagg aagcagtaga cgaaatcgaa tcaaaccata     7080

accgccatcg ctggactatc gaagagtgca aggcgatcaa ggcagagtac caacagaaac     7140

tcaaagacct gcgaaatagc agaagtgagg ccgcatgacg ttctcagtaa aaaccattcc     7200

agacatgctc gttgaagcat acggaaatca gacagaagta gcacgcagac tgaaatgtag     7260

tcgcggtacg gtcagaaaat acgttgatga taaagacggg aaaatgcacg ccatcgtcaa     7320

cgacgttctc atggttcatc gcggatggag tgaaagagat gcgctattac gaaaaaattg     7380

atggcagcaa ataccgaaat atttgggtag ttggcgatct gcacggatgc tacacgaacc     7440

tgatgaacaa actggatacg attggattcg acaacaaaaa agacctgctt atctcggtgg     7500

gcgatttggt tgatcgtggt gcagagaacg ttgaatgcct ggaattaatc acattcccct     7560

ggttcagagc tgtacgtgga aaccatgagc aaatgatgat tgatggctta tcagagcgtg     7620

gaaacgttaa tcactggctg cttaatggcg gtggctggtt ctttaatctc gattacgaca     7680

aagaaattct ggctaaagct cttgcccata aagcagatga acttccgtta atcatcgaac     7740

tggtgagcaa agataaaaaa tatgttatct gccacgccga ttatcccttt gacgaatacg     7800

agtttggaaa gccagttgat catcagcagg taatctggaa ccgcgaacga atcagcaact     7860

cacaaaacgg gatcgtgaaa gaaatcaaag gcgcggacac gttcatcttt ggtcatacgc     7920

cagcagtgaa accactcaag tttgccaacc aaatgtatat cgataccggc gcagtgttct     7980

gcggaaacct aacattgatt caggtacagg gagaaggcgc atgagactcg aaagcgtagc     8040

taaatttcat tcgccaaaaa gcccgatgat gagcgactca ccacgggcca cggcttctga     8100

ctctctttcc ggtactgatg tgatggctgc tatggggatg gcgcaatcac aagccggatt     8160

cggtatggct gcattctgcg gtaagcacga actcagccag aacgacaaac aaaaggctat     8220

caactatctg atgcaatttg cacacaaggt atcggggaaa taccgtggtg tggcaaagct     8280

tgaaggaaat actaaggcaa aggtactgca agtgctcgca acattcgctt atgcggatta     8340

ttgccgtagt gccgcgacgc cgggggcaag atgcagagat tgccatggta caggccgtgc     8400

ggttgatatt gccaaaacag agctgtgggg gagagttgtc gagaaagagt gcggaagatg     8460

caaaggcgtc ggctattcaa ggatgccagc aagcgcagca tatcgcgctg tgacgatgct     8520

aatcccaaac cttacccaac ccacctggtc acgcactgtt aagccgctgt atgacgctct     8580

ggtggtgcaa tgccacaaag aagagtcaat cgcagacaac attttgaatg cggtcacacg     8640

ttagcagcat gattgccacg gatggcaaca tattaacggc atgatattga cttattgaat     8700

aaaattgggt aaatttgact caacgatggg ttaattcgct cgttgtggta gtgagatgaa     8760

aagaggcggc gcttactacc gattccgcct agttggtcac ttcgacgtat cgtctggaac     8820

tccaaccatc gcaggcagag aggtctgcaa aatgcaatcc cgaaacagtt cgcaggtaat     8880

agttagagcc tgcataacgg tttcgggatt ttttatatct gcacaacagg taagagcatt     8940

gagtcgataa tcgtgaagag tcggcgagcc tggttagcca gtgctctttc cgttgtgctg     9000

aattaagcga ataccggaag cagaaccgga tcaccaaatg cgtacaggcg tcatcgccgc     9060

ccagcaacag cacaacccaa actgagccgt agccactgtc tgtcctgaat tcattagtaa     9120

tagttacgct gcggcctttt acacatgacc ttcgtgaaag cgggtggcag gaggtcgcgc     9180

taacaacctc ctgccgtttt gcccgtgcat atcggtcacg aacaaatctg attactaaac     9240

acagtagcct ggatttgttc tatcagtaat cgaccttatt cctaattaaa tagagcaaat     9300

ccccttattg ggggtaagac atgaagatgc cagaaaaaca tgacctgttg gccgccattc     9360

tcgcggcaaa ggaacaaggc atcggggcaa tccttgcgtt tgcaatggcg taccttcgcg     9420

gcagatataa tggcggtgcg tttacaaaaa cagtaatcga cgcaacgatg tgcgccatta     9480

tcgcctggtt cattcgtgac cttctcgact tcgccggact aagtagcaat ctcgcttata     9540

taacgagcgt gtttatcggc tacatcggta ctgactcgat tggttcgctt atcaaacgct     9600

tcgctgctaa aaaagccgga gtagaagatg gtagaaatca ataatcaacg taaggcgttc     9660

ctcgatatgc tggcgtggtc ggagggaact gataacggac gtcagaaaac cagaaatcat     9720

ggttatgacg tcattgtagg cggagagcta tttactgatt actccgatca ccctcgcaaa     9780

cttgtcacgc taaacccaaa actcaaatca acaggcgctt aagactggcc gtcgttttac     9840

aacacagaaa gagtttgtag aaacgcaaaa aggccatccg tcaggggcct tctgcttagt     9900

ttgatgcctg gcagttccct actctcgcct tccgcttcct cgctcactga ctcgctgcgc     9960

tcggtcgttc ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc    10020

acagaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg    10080

aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat    10140

cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag    10200

gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga    10260

tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg    10320

tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt    10380

cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac    10440

gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc    10500

ggtgctacag agttcttgaa gtggtgggct aactacggct acactagaag aacagtattt    10560

ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc    10620

ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc    10680

agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg    10740

aacgacgcgc gcgtaactca cgttaaggga ttttggtcat gagcttgcgc cgtcccgtca    10800

agtcagcgta atgctctgct tttagaaaaa ctcatcgagc atcaaatgaa actgcaattt    10860

attcatatca ggattatcaa taccatattt ttgaaaaagc cgtttctgta atgaaggaga    10920

aaactcaccg aggcagttcc ataggatggc aagatcctgg tatcggtctg cgattccgac    10980

tcgtccaaca tcaatacaac ctattaattt cccctcgtca aaaataaggt tatcaagtga    11040

gaaatcacca tgagtgacga ctgaatccgg tgagaatggc aaaagtttat gcatttcttt    11100

ccagacttgt tcaacaggcc agccattacg ctcgtcatca aaatcactcg catcaaccaa    11160

accgttattc attcgtgatt gcgcctgagc gaggcgaaat acgcgatcgc tgttaaaagg    11220

acaattacaa acaggaatcg agtgcaaccg gcgcaggaac actgccagcg catcaacaat    11280

attttcacct gaatcaggat attcttctaa tacctggaac gctgtttttc cggggatcgc    11340

agtggtgagt aaccatgcat catcaggagt acggataaaa tgcttgatgg tcggaagtgg    11400

cataaattcc gtcagccagt ttagtctgac catctcatct gtaacatcat tggcaacgct    11460

acctttgcca tgtttcagaa acaactctgg cgcatcgggc ttcccataca agcgatagat    11520

tgtcgcacct gattgcccga cattatcgcg agcccattta tacccatata aatcagcatc    11580

catgttggaa tttaatcgcg gcctcgacgt ttcccgttga atatggctca tattcttcct    11640

ttttcaatat tattgaagca tttatcaggg ttattgtctc atgagcggat acatatttga    11700

atgtatttag aaaaataaac aaataggggt cagtgttaca accaattaac caattctgaa    11760

cattatcgcg agcccattta tacctgaata tggctcataa caccccttgt ttgcctggcg    11820

gcagtagcgc ggtggtccca cctgacccca tgccgaactc agaagtgaaa cgccgtagcg    11880

ccgatggtag tgtggggact ccccatgcga gagtagggaa ctgccaggca tcaaataaaa    11940

cgaaaggctc agtcgaaaga ctgggccttt cgcccgggct aattaggggg tgtcgccctt    12000

attcgactct atagtgaagt tcctattctc tagaaagtat aggaacttct gaagtggggt    12060

cgacttaatt aagg                                                      12074


<210>  30
<211>  11041
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  30
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

aagatccaag ctcagatctc gatcgagttg ggccccagaa gcctggtggt tgtttgtcct      240

tctcagggga aaagtgaggc ggccccttgg aggaaggggc cgggcagaat gatctaatcg      300

gattccaagc agctcagggg attgtctttt tctagcacct tcttgccact cctaagcgtc      360

ctccgtgacc ccggctggga tttagcctgg tgctgtgtca gccccggtct cccaggggct      420

tcccagtggt ccccaggaac cctcgacagg gcccggtctc tctcgtccag caagggcagg      480

gacgggccac aggccaaggg ccctcgatcg aggaactgaa aaaccagaaa gttaactggt      540

aagtttagtc tttttgtctt ttatttcagg tcccggatcc ggtggtggtg caaatcaaag      600

aactgctcct cagtggatgt tgcctttact tctaggcctg tacggaagtg ttacttctgc      660

tctaaaagct gcggaattgt acccgcggcc gccaccatgg ccaagatcaa cacccaatac      720

tcccacccct ccaggaccca cctcaaggta aagacctcag accgggatct caatcgcgct      780

gaaaatggcc tcagcagagc ccactcgtca agtgaggaga catcgtcagt gctgcagccg      840

gggatcgcca tggagaccag aggactggct gactccgggc agggctcctt caccggccag      900

gggatcgcca ggctgtcgcg cctcatcttc ttgctgcgca ggtgggctgc caggcatgtg      960

caccaccagg accagggacc ggactctttt cctgatcgtt tccgtggagc cgagcttaag     1020

gaggtgtcca gccaagaaag caatgcccag gcaaatgtgg gcagccagga gccagcagac     1080

agagggagaa gcgcctggcc cctggccaaa tgcaacacta acaccagcaa caacacggag     1140

gaggagaaga agacgaaaaa gaaggatgcg atcgtggtgg acccgtccag caacctgtac     1200

taccgctggc tgaccgccat cgccctgcct gtcttctata actggtatct gcttatttgc     1260

agggcctgtt tcgatgagct gcagtccgag tacctgatgc tgtggctggt cctggactac     1320

tcggcagatg tcctgtatgt cttggatgtg cttgtacgag ctcggacagg ttttcttgag     1380

caaggcttaa tggtcagtga taccaacagg ctgtggcagc attacaagac gaccacgcag     1440

ttcaagctgg atgtgttgtc cctggtcccc accgacctgg cttacttaaa ggtgggcaca     1500

aactacccag aagtgaggtt caaccgccta ctgaagtttt cccggctctt tgaattcttt     1560

gaccgcacag agacaaggac caactacccc aatatgttca ggattgggaa cttggtcttg     1620

tacattctca tcatcatcca ctggaatgcc tgcatctact ttgccatttc caagttcatt     1680

ggttttggga cagactcctg ggtctaccca aacatctcaa tcccagagca tgggcgcctc     1740

tccaggaagt acatttacag tctctactgg tccaccttga cccttaccac cattggtgag     1800

accccacccc ccgtgaaaga tgaggagtat ctctttgtgg tcgtagactt cttggtgggt     1860

gttctgattt ttgccaccat tgtgggcaat gtgggctcca tgatctcgaa tatgaatgcc     1920

tcacgggcag agttccaggc caagattgat tccatcaagc agtacatgca gttccgcaag     1980

gtcaccaagg acttggagac gcgggttatc cggtggtttg actacctgtg ggccaacaag     2040

aagacggtgg atgagaagga ggtgctcaag agcctcccag acaagctgaa ggctgagatc     2100

gccatcaacg tgcacctgga cacgctgaag aaggttcgca tcttccagga ctgtgaggca     2160

gggctgctgg tggagctggt gctgaagctg cgacccactg tgttcagccc tggggattat     2220

atctgcaaga agggagatat tgggaaggag atgtacatca tcaacgaggg caagctggcc     2280

gtggtggctg atgatggggt cacccagttc gtggtcctca gcgatggcag ctacttcggg     2340

gagatcagca ttctgaacat caaggggagc aagtcgggga accgcaggac ggccaacatc     2400

cgcagcattg gctactcaga cctgttctgc ctctcaaagg acgatctcat ggaggccctc     2460

accgagtacc ccgaagccaa gaaggccctg gaggagaaag gacggcagat cctgatgaaa     2520

gacaacctga tcgatgagga gctggccagg gcgggcgcgg accccaagga ccttgaggag     2580

aaagtggagc agctggggtc ctccctggac accctgcaga ccaggtttgc acgcctcctg     2640

gctgagtaca acgccaccca gatgaagatg aagcagcgtc tcagccaact ggaaagccag     2700

gtgaagggtg gtggggacaa gcccctggct gatggggaag ttcccgggga tgctacaaaa     2760

acagaggaca aacaacagtg atcatagatc gatctgcctc gactgtgcct tctagttgcc     2820

agccatctgt tgtttgcccc tcccccgtgc cttccttgac cctggaaggt gccactccca     2880

ctgtcctttc ctaataaaat gaggaaattg catcgcattg tctgagtagg tgtcattcta     2940

ttctgggggg tggggtgggg caggacagca agggggagga ttgggaagac aatagcaggc     3000

atgctgggga ctcgagttct acgtagataa gtagcatggc gggttaatca ttaactacaa     3060

ggaaccccta gtgatggagt tggccactcc ctctctgcgc gctcgctcgc tcactgaggc     3120

cgggcgacca aaggtcgccc gacgcccggg ctttgcccgg gcggcctcag tgagcgagcg     3180

agcgcgcagc cttaattaac ctaaggaaaa tgaagtgaag ttcctatact ttctagagaa     3240

taggaacttc tatagtgagt cgaataaggg cgacacaaaa tttattctaa atgcataata     3300

aatactgata acatcttata gtttgtatta tattttgtat tatcgttgac atgtataatt     3360

ttgatatcaa aaactgattt tccctttatt attttcgaga tttattttct taattctctt     3420

taacaaacta gaaatattgt atatacaaaa aatcataaat aatagatgaa tagtttaatt     3480

ataggtgttc atcaatcgaa aaagcaacgt atcttattta aagtgcgttg cttttttctc     3540

atttataagg ttaaataatt ctcatatatc aagcaaagtg acaggcgccc ttaaatattc     3600

tgacaaatgc tctttcccta aactcccccc ataaaaaaac ccgccgaagc gggtttttac     3660

gttatttgcg gattaacgat tactcgttat cagaaccgcc cagggggccc gagcttaacc     3720

tttttatttg ggggagaggg aagtcatgaa aaaactaacc tttgaaattc gatctccagc     3780

acatcagcaa aacgctattc acgcagtaca gcaaatcctt ccagacccaa ccaaaccaat     3840

cgtagtaacc attcaggaac gcaaccgcag cttagaccaa aacaggaagc tatgggcctg     3900

cttaggtgac gtctctcgtc aggttgaatg gcatggtcgc tggctggatg cagaaagctg     3960

gaagtgtgtg tttaccgcag cattaaagca gcaggatgtt gttcctaacc ttgccgggaa     4020

tggctttgtg gtaataggcc agtcaaccag caggatgcgt gtaggcgaat ttgcggagct     4080

attagagctt atacaggcat tcggtacaga gcgtggcgtt aagtggtcag acgaagcgag     4140

actggctctg gagtggaaag cgagatgggg agacagggct gcatgataaa tgtcgttagt     4200

ttctccggtg gcaggacgtc agcatatttg ctctggctaa tggagcaaaa gcgacgggca     4260

ggtaaagacg tgcattacgt tttcatggat acaggttgtg aacatccaat gacatatcgg     4320

tttgtcaggg aagttgtgaa gttctgggat ataccgctca ccgtattgca ggttgatatc     4380

aacccggagc ttggacagcc aaatggttat acggtatggg aaccaaagga tattcagacg     4440

cgaatgcctg ttctgaagcc atttatcgat atggtaaaga aatatggcac tccatacgtc     4500

ggcggcgcgt tctgcactga cagattaaaa ctcgttccct tcaccaaata ctgtgatgac     4560

catttcgggc gagggaatta caccacgtgg attggcatca gagctgatga accgaagcgg     4620

ctaaagccaa agcctggaat cagatatctt gctgaactgt cagactttga gaaggaagat     4680

atcctcgcat ggtggaagca acaaccattc gatttgcaaa taccggaaca tctcggtaac     4740

tgcatattct gcattaaaaa atcaacgcaa aaaatcggac ttgcctgcaa agatgaggag     4800

ggattgcagc gtgtttttaa tgaggtcatc acgggatccc atgtgcgtga cggacatcgg     4860

gaaacgccaa aggagattat gtaccgagga agaatgtcgc tggacggtat cgcgaaaatg     4920

tattcagaaa atgattatca agccctgtat caggacatgg tacgagctaa aagattcgat     4980

accggctctt gttctgagtc atgcgaaata tttggagggc agcttgattt cgacttcggg     5040

agggaagctg catgatgcga tgttatcggt gcggtgaatg caaagaagat aaccgcttcc     5100

gaccaaatca accttactgg aatcgatggt gtctccggtg tgaaagaaca ccaacagggg     5160

tgttaccact accgcaggaa aaggaggacg tgtggcgaga cagcgacgaa gtatcaccga     5220

cataatctgc gaaaactgca aataccttcc aacgaaacgc accagaaata aacccaagcc     5280

aatcccaaaa gaatctgacg taaaaacctt caactacacg gctcacctgt gggatatccg     5340

gtggctaaga cgtcgtgcga ggaaaacaag gtgattgacc aaaatcgaag ttacgaacaa     5400

gaaagcgtcg agcgagcttt aacgtgcgct aactgcggtc agaagctgca tgtgctggaa     5460

gttcacgtgt gtgagcactg ctgcgcagaa ctgatgagcg atccgaatag ctcgatgcac     5520

gaggaagaag atgatggcta aaccagcgcg aagacgatgt aaaaacgatg aatgccggga     5580

atggtttcac cctgcattcg ctaatcagtg gtggtgctct ccagagtgtg gaaccaagat     5640

agcactcgaa cgacgaagta aagaacgcga aaaagcggaa aaagcagcag agaagaaacg     5700

acgacgagag gagcagaaac agaaagataa acttaagatt cgaaaactcg ccttaaagcc     5760

ccgcagttac tggattaaac aagcccaaca agccgtaaac gccttcatca gagaaagaga     5820

ccgcgactta ccatgtatct cgtgcggaac gctcacgtct gctcagtggg atgccggaca     5880

ttaccggaca actgctgcgg cacctcaact ccgatttaat gaacgcaata ttcacaagca     5940

atgcgtggtg tgcaaccagc acaaaagcgg aaatctcgtt ccgtatcgcg tcgaactgat     6000

tagccgcatc gggcaggaag cagtagacga aatcgaatca aaccataacc gccatcgctg     6060

gactatcgaa gagtgcaagg cgatcaaggc agagtaccaa cagaaactca aagacctgcg     6120

aaatagcaga agtgaggccg catgacgttc tcagtaaaaa ccattccaga catgctcgtt     6180

gaagcatacg gaaatcagac agaagtagca cgcagactga aatgtagtcg cggtacggtc     6240

agaaaatacg ttgatgataa agacgggaaa atgcacgcca tcgtcaacga cgttctcatg     6300

gttcatcgcg gatggagtga aagagatgcg ctattacgaa aaaattgatg gcagcaaata     6360

ccgaaatatt tgggtagttg gcgatctgca cggatgctac acgaacctga tgaacaaact     6420

ggatacgatt ggattcgaca acaaaaaaga cctgcttatc tcggtgggcg atttggttga     6480

tcgtggtgca gagaacgttg aatgcctgga attaatcaca ttcccctggt tcagagctgt     6540

acgtggaaac catgagcaaa tgatgattga tggcttatca gagcgtggaa acgttaatca     6600

ctggctgctt aatggcggtg gctggttctt taatctcgat tacgacaaag aaattctggc     6660

taaagctctt gcccataaag cagatgaact tccgttaatc atcgaactgg tgagcaaaga     6720

taaaaaatat gttatctgcc acgccgatta tccctttgac gaatacgagt ttggaaagcc     6780

agttgatcat cagcaggtaa tctggaaccg cgaacgaatc agcaactcac aaaacgggat     6840

cgtgaaagaa atcaaaggcg cggacacgtt catctttggt catacgccag cagtgaaacc     6900

actcaagttt gccaaccaaa tgtatatcga taccggcgca gtgttctgcg gaaacctaac     6960

attgattcag gtacagggag aaggcgcatg agactcgaaa gcgtagctaa atttcattcg     7020

ccaaaaagcc cgatgatgag cgactcacca cgggccacgg cttctgactc tctttccggt     7080

actgatgtga tggctgctat ggggatggcg caatcacaag ccggattcgg tatggctgca     7140

ttctgcggta agcacgaact cagccagaac gacaaacaaa aggctatcaa ctatctgatg     7200

caatttgcac acaaggtatc ggggaaatac cgtggtgtgg caaagcttga aggaaatact     7260

aaggcaaagg tactgcaagt gctcgcaaca ttcgcttatg cggattattg ccgtagtgcc     7320

gcgacgccgg gggcaagatg cagagattgc catggtacag gccgtgcggt tgatattgcc     7380

aaaacagagc tgtgggggag agttgtcgag aaagagtgcg gaagatgcaa aggcgtcggc     7440

tattcaagga tgccagcaag cgcagcatat cgcgctgtga cgatgctaat cccaaacctt     7500

acccaaccca cctggtcacg cactgttaag ccgctgtatg acgctctggt ggtgcaatgc     7560

cacaaagaag agtcaatcgc agacaacatt ttgaatgcgg tcacacgtta gcagcatgat     7620

tgccacggat ggcaacatat taacggcatg atattgactt attgaataaa attgggtaaa     7680

tttgactcaa cgatgggtta attcgctcgt tgtggtagtg agatgaaaag aggcggcgct     7740

tactaccgat tccgcctagt tggtcacttc gacgtatcgt ctggaactcc aaccatcgca     7800

ggcagagagg tctgcaaaat gcaatcccga aacagttcgc aggtaatagt tagagcctgc     7860

ataacggttt cgggattttt tatatctgca caacaggtaa gagcattgag tcgataatcg     7920

tgaagagtcg gcgagcctgg ttagccagtg ctctttccgt tgtgctgaat taagcgaata     7980

ccggaagcag aaccggatca ccaaatgcgt acaggcgtca tcgccgccca gcaacagcac     8040

aacccaaact gagccgtagc cactgtctgt cctgaattca ttagtaatag ttacgctgcg     8100

gccttttaca catgaccttc gtgaaagcgg gtggcaggag gtcgcgctaa caacctcctg     8160

ccgttttgcc cgtgcatatc ggtcacgaac aaatctgatt actaaacaca gtagcctgga     8220

tttgttctat cagtaatcga ccttattcct aattaaatag agcaaatccc cttattgggg     8280

gtaagacatg aagatgccag aaaaacatga cctgttggcc gccattctcg cggcaaagga     8340

acaaggcatc ggggcaatcc ttgcgtttgc aatggcgtac cttcgcggca gatataatgg     8400

cggtgcgttt acaaaaacag taatcgacgc aacgatgtgc gccattatcg cctggttcat     8460

tcgtgacctt ctcgacttcg ccggactaag tagcaatctc gcttatataa cgagcgtgtt     8520

tatcggctac atcggtactg actcgattgg ttcgcttatc aaacgcttcg ctgctaaaaa     8580

agccggagta gaagatggta gaaatcaata atcaacgtaa ggcgttcctc gatatgctgg     8640

cgtggtcgga gggaactgat aacggacgtc agaaaaccag aaatcatggt tatgacgtca     8700

ttgtaggcgg agagctattt actgattact ccgatcaccc tcgcaaactt gtcacgctaa     8760

acccaaaact caaatcaaca ggcgcttaag actggccgtc gttttacaac acagaaagag     8820

tttgtagaaa cgcaaaaagg ccatccgtca ggggccttct gcttagtttg atgcctggca     8880

gttccctact ctcgccttcc gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc     8940

tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca gaatcagggg     9000

ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg     9060

ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac aaaaatcgac     9120

gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg tttccccctg     9180

gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac ctgtccgcct     9240

ttctcccttc gggaagcgtg gcgctttctc atagctcacg ctgtaggtat ctcagttcgg     9300

tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct     9360

gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac ttatcgccac     9420

tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt gctacagagt     9480

tcttgaagtg gtgggctaac tacggctaca ctagaagaac agtatttggt atctgcgctc     9540

tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc aaacaaacca     9600

ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga aaaaaaggat     9660

ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac gacgcgcgcg     9720

taactcacgt taagggattt tggtcatgag cttgcgccgt cccgtcaagt cagcgtaatg     9780

ctctgctttt agaaaaactc atcgagcatc aaatgaaact gcaatttatt catatcagga     9840

ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa ctcaccgagg     9900

cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg tccaacatca     9960

atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa atcaccatga    10020

gtgacgactg aatccggtga gaatggcaaa agtttatgca tttctttcca gacttgttca    10080

acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc gttattcatt    10140

cgtgattgcg cctgagcgag gcgaaatacg cgatcgctgt taaaaggaca attacaaaca    10200

ggaatcgagt gcaaccggcg caggaacact gccagcgcat caacaatatt ttcacctgaa    10260

tcaggatatt cttctaatac ctggaacgct gtttttccgg ggatcgcagt ggtgagtaac    10320

catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagtggcat aaattccgtc    10380

agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc tttgccatgt    10440

ttcagaaaca actctggcgc atcgggcttc ccatacaagc gatagattgt cgcacctgat    10500

tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat gttggaattt    10560

aatcgcggcc tcgacgtttc ccgttgaata tggctcatat tcttcctttt tcaatattat    10620

tgaagcattt atcagggtta ttgtctcatg agcggataca tatttgaatg tatttagaaa    10680

aataaacaaa taggggtcag tgttacaacc aattaaccaa ttctgaacat tatcgcgagc    10740

ccatttatac ctgaatatgg ctcataacac cccttgtttg cctggcggca gtagcgcggt    10800

ggtcccacct gaccccatgc cgaactcaga agtgaaacgc cgtagcgccg atggtagtgt    10860

ggggactccc catgcgagag tagggaactg ccaggcatca aataaaacga aaggctcagt    10920

cgaaagactg ggcctttcgc ccgggctaat tagggggtgt cgcccttatt cgactctata    10980

gtgaagttcc tattctctag aaagtatagg aacttctgaa gtggggtcga cttaattaag    11040

g                                                                    11041


<210>  31
<211>  11041
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  31
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

aagatccaag ctcagatctc gatcgagttg ggccccagaa gcctggtggt tgtttgtcct      240

tctcagggga aaagtgaggc ggccccttgg aggaaggggc cgggcagaat gatctaatcg      300

gattccaagc agctcagggg attgtctttt tctagcacct tcttgccact cctaagcgtc      360

ctccgtgacc ccggctggga tttagcctgg tgctgtgtca gccccggtct cccaggggct      420

tcccagtggt ccccaggaac cctcgacagg gcccggtctc tctcgtccag caagggcagg      480

gacgggccac aggccaaggg ccctcgatcg aggaactgaa aaaccagaaa gttaactggt      540

aagtttagtc tttttgtctt ttatttcagg tcccggatcc ggtggtggtg caaatcaaag      600

aactgctcct cagtggatgt tgcctttact tctaggcctg tacggaagtg ttacttctgc      660

tctaaaagct gcggaattgt acccgcggcc gccaccatgg ctaagattaa cacccagtac      720

tcacatccat cccgcactca cctcaaagtc aagacctccg atcgggatct gaaccgggct      780

gagaatgggc tgtcgcgcgc ccactcgtcg tccgaggaaa ccagcagcgt gctccagccg      840

ggcatcgcca tggaaactag ggggctggcg gactccggac agggatcctt cactggacag      900

ggtattgccc ggctgagcag actgatcttc ctgcttcgcc gctgggcggc cagacacgtg      960

caccatcagg accagggacc tgatagcttc cccgaccgct ttaggggagc cgagctgaaa     1020

gaagtgtcaa gccaggagtc aaacgcgcag gccaacgtcg gcagccaaga gcctgcagac     1080

cggggacgct cggcatggcc gctcgcaaag tgcaacacta acacttccaa caacaccgaa     1140

gaggaaaaga aaaccaagaa gaaggatgca attgtggtgg acccttcctc caacctgtac     1200

taccgctggt tgaccgccat cgccctcccg gtcttttaca attggtatct ccttatctgc     1260

cgggcctgct tcgacgaact gcaatcagag tacctgatgc tgtggctggt gctggactat     1320

agcgccgatg tgctctacgt cctggatgtg ctcgtgcgcg cccggaccgg attcttggaa     1380

caaggcctga tggtgtccga cacgaataga ctgtggcagc actataagac cacaacccag     1440

ttcaagcttg acgtgctcag ccttgtgccg actgacctgg cctacctgaa agtcggaact     1500

aactacccgg aagtcagatt caaccgactc ctgaagttca gcaggctgtt cgagttcttt     1560

gaccgcaccg agactcggac caactaccct aacatgttcc ggatcggaaa tctggtgctc     1620

tacatactga ttatcatcca ttggaacgcc tgtatctatt tcgccatttc gaagttcatc     1680

ggtttcggaa ccgattcctg ggtgtacccc aacatctcga tccccgaaca cggtcgcctg     1740

tcccggaagt acatctactc cctgtactgg tccactctga ctctgaccac gatcggggaa     1800

acccctccac ccgtgaagga cgaagagtac ctgttcgtgg tggtggactt cctggtcgga     1860

gtgttgattt tcgccaccat tgtgggaaac gtgggctcca tgatctccaa catgaacgcg     1920

tcgagagctg agttccaagc caagatcgac tccattaagc agtacatgca gttcagaaag     1980

gtcaccaagg acctggaaac cagggtcatc cgctggttcg actacctgtg ggccaacaaa     2040

aagactgtgg acgaaaagga agtgctgaag tcgctgccgg ataagctgaa ggccgaaatc     2100

gccattaacg tgcaccttga caccctgaag aaagtccgga tcttccaaga ctgtgaagcc     2160

ggcctcctgg tggagctcgt gctcaagctg cggcccaccg tgttcagccc gggagattac     2220

atttgcaaga agggcgatat cggcaaagag atgtacatca tcaacgaggg aaagctggcc     2280

gtggtcgcgg acgacggcgt gacccagttc gtggtgctgt ccgacggatc ctacttcggt     2340

gaaatctcaa tcctcaacat caaggggtcc aagtccggca accggagaac tgccaacatt     2400

cgctccatcg gatacagcga cctgttttgc ctgtccaagg atgacctgat ggaggctctg     2460

actgagtacc ctgaagcgaa gaaggctttg gaggaaaagg ggcggcagat tctgatgaag     2520

gacaatttga tcgacgagga gctcgcacgg gccggcgccg accccaagga tctcgaagag     2580

aaggtcgaac agctgggttc ttcgcttgat accctgcaaa cccgattcgc gcggctgctc     2640

gccgagtaca acgcgaccca gatgaagatg aagcagagac tgtcacagtt ggaatcccaa     2700

gtcaagggcg gaggcgacaa gccgctggcg gacggggaag tgcccgggga cgccaccaag     2760

actgaggaca agcagcagtg atcatagatc gatctgcctc gactgtgcct tctagttgcc     2820

agccatctgt tgtttgcccc tcccccgtgc cttccttgac cctggaaggt gccactccca     2880

ctgtcctttc ctaataaaat gaggaaattg catcgcattg tctgagtagg tgtcattcta     2940

ttctgggggg tggggtgggg caggacagca agggggagga ttgggaagac aatagcaggc     3000

atgctgggga ctcgagttct acgtagataa gtagcatggc gggttaatca ttaactacaa     3060

ggaaccccta gtgatggagt tggccactcc ctctctgcgc gctcgctcgc tcactgaggc     3120

cgggcgacca aaggtcgccc gacgcccggg ctttgcccgg gcggcctcag tgagcgagcg     3180

agcgcgcagc cttaattaac ctaaggaaaa tgaagtgaag ttcctatact ttctagagaa     3240

taggaacttc tatagtgagt cgaataaggg cgacacaaaa tttattctaa atgcataata     3300

aatactgata acatcttata gtttgtatta tattttgtat tatcgttgac atgtataatt     3360

ttgatatcaa aaactgattt tccctttatt attttcgaga tttattttct taattctctt     3420

taacaaacta gaaatattgt atatacaaaa aatcataaat aatagatgaa tagtttaatt     3480

ataggtgttc atcaatcgaa aaagcaacgt atcttattta aagtgcgttg cttttttctc     3540

atttataagg ttaaataatt ctcatatatc aagcaaagtg acaggcgccc ttaaatattc     3600

tgacaaatgc tctttcccta aactcccccc ataaaaaaac ccgccgaagc gggtttttac     3660

gttatttgcg gattaacgat tactcgttat cagaaccgcc cagggggccc gagcttaacc     3720

tttttatttg ggggagaggg aagtcatgaa aaaactaacc tttgaaattc gatctccagc     3780

acatcagcaa aacgctattc acgcagtaca gcaaatcctt ccagacccaa ccaaaccaat     3840

cgtagtaacc attcaggaac gcaaccgcag cttagaccaa aacaggaagc tatgggcctg     3900

cttaggtgac gtctctcgtc aggttgaatg gcatggtcgc tggctggatg cagaaagctg     3960

gaagtgtgtg tttaccgcag cattaaagca gcaggatgtt gttcctaacc ttgccgggaa     4020

tggctttgtg gtaataggcc agtcaaccag caggatgcgt gtaggcgaat ttgcggagct     4080

attagagctt atacaggcat tcggtacaga gcgtggcgtt aagtggtcag acgaagcgag     4140

actggctctg gagtggaaag cgagatgggg agacagggct gcatgataaa tgtcgttagt     4200

ttctccggtg gcaggacgtc agcatatttg ctctggctaa tggagcaaaa gcgacgggca     4260

ggtaaagacg tgcattacgt tttcatggat acaggttgtg aacatccaat gacatatcgg     4320

tttgtcaggg aagttgtgaa gttctgggat ataccgctca ccgtattgca ggttgatatc     4380

aacccggagc ttggacagcc aaatggttat acggtatggg aaccaaagga tattcagacg     4440

cgaatgcctg ttctgaagcc atttatcgat atggtaaaga aatatggcac tccatacgtc     4500

ggcggcgcgt tctgcactga cagattaaaa ctcgttccct tcaccaaata ctgtgatgac     4560

catttcgggc gagggaatta caccacgtgg attggcatca gagctgatga accgaagcgg     4620

ctaaagccaa agcctggaat cagatatctt gctgaactgt cagactttga gaaggaagat     4680

atcctcgcat ggtggaagca acaaccattc gatttgcaaa taccggaaca tctcggtaac     4740

tgcatattct gcattaaaaa atcaacgcaa aaaatcggac ttgcctgcaa agatgaggag     4800

ggattgcagc gtgtttttaa tgaggtcatc acgggatccc atgtgcgtga cggacatcgg     4860

gaaacgccaa aggagattat gtaccgagga agaatgtcgc tggacggtat cgcgaaaatg     4920

tattcagaaa atgattatca agccctgtat caggacatgg tacgagctaa aagattcgat     4980

accggctctt gttctgagtc atgcgaaata tttggagggc agcttgattt cgacttcggg     5040

agggaagctg catgatgcga tgttatcggt gcggtgaatg caaagaagat aaccgcttcc     5100

gaccaaatca accttactgg aatcgatggt gtctccggtg tgaaagaaca ccaacagggg     5160

tgttaccact accgcaggaa aaggaggacg tgtggcgaga cagcgacgaa gtatcaccga     5220

cataatctgc gaaaactgca aataccttcc aacgaaacgc accagaaata aacccaagcc     5280

aatcccaaaa gaatctgacg taaaaacctt caactacacg gctcacctgt gggatatccg     5340

gtggctaaga cgtcgtgcga ggaaaacaag gtgattgacc aaaatcgaag ttacgaacaa     5400

gaaagcgtcg agcgagcttt aacgtgcgct aactgcggtc agaagctgca tgtgctggaa     5460

gttcacgtgt gtgagcactg ctgcgcagaa ctgatgagcg atccgaatag ctcgatgcac     5520

gaggaagaag atgatggcta aaccagcgcg aagacgatgt aaaaacgatg aatgccggga     5580

atggtttcac cctgcattcg ctaatcagtg gtggtgctct ccagagtgtg gaaccaagat     5640

agcactcgaa cgacgaagta aagaacgcga aaaagcggaa aaagcagcag agaagaaacg     5700

acgacgagag gagcagaaac agaaagataa acttaagatt cgaaaactcg ccttaaagcc     5760

ccgcagttac tggattaaac aagcccaaca agccgtaaac gccttcatca gagaaagaga     5820

ccgcgactta ccatgtatct cgtgcggaac gctcacgtct gctcagtggg atgccggaca     5880

ttaccggaca actgctgcgg cacctcaact ccgatttaat gaacgcaata ttcacaagca     5940

atgcgtggtg tgcaaccagc acaaaagcgg aaatctcgtt ccgtatcgcg tcgaactgat     6000

tagccgcatc gggcaggaag cagtagacga aatcgaatca aaccataacc gccatcgctg     6060

gactatcgaa gagtgcaagg cgatcaaggc agagtaccaa cagaaactca aagacctgcg     6120

aaatagcaga agtgaggccg catgacgttc tcagtaaaaa ccattccaga catgctcgtt     6180

gaagcatacg gaaatcagac agaagtagca cgcagactga aatgtagtcg cggtacggtc     6240

agaaaatacg ttgatgataa agacgggaaa atgcacgcca tcgtcaacga cgttctcatg     6300

gttcatcgcg gatggagtga aagagatgcg ctattacgaa aaaattgatg gcagcaaata     6360

ccgaaatatt tgggtagttg gcgatctgca cggatgctac acgaacctga tgaacaaact     6420

ggatacgatt ggattcgaca acaaaaaaga cctgcttatc tcggtgggcg atttggttga     6480

tcgtggtgca gagaacgttg aatgcctgga attaatcaca ttcccctggt tcagagctgt     6540

acgtggaaac catgagcaaa tgatgattga tggcttatca gagcgtggaa acgttaatca     6600

ctggctgctt aatggcggtg gctggttctt taatctcgat tacgacaaag aaattctggc     6660

taaagctctt gcccataaag cagatgaact tccgttaatc atcgaactgg tgagcaaaga     6720

taaaaaatat gttatctgcc acgccgatta tccctttgac gaatacgagt ttggaaagcc     6780

agttgatcat cagcaggtaa tctggaaccg cgaacgaatc agcaactcac aaaacgggat     6840

cgtgaaagaa atcaaaggcg cggacacgtt catctttggt catacgccag cagtgaaacc     6900

actcaagttt gccaaccaaa tgtatatcga taccggcgca gtgttctgcg gaaacctaac     6960

attgattcag gtacagggag aaggcgcatg agactcgaaa gcgtagctaa atttcattcg     7020

ccaaaaagcc cgatgatgag cgactcacca cgggccacgg cttctgactc tctttccggt     7080

actgatgtga tggctgctat ggggatggcg caatcacaag ccggattcgg tatggctgca     7140

ttctgcggta agcacgaact cagccagaac gacaaacaaa aggctatcaa ctatctgatg     7200

caatttgcac acaaggtatc ggggaaatac cgtggtgtgg caaagcttga aggaaatact     7260

aaggcaaagg tactgcaagt gctcgcaaca ttcgcttatg cggattattg ccgtagtgcc     7320

gcgacgccgg gggcaagatg cagagattgc catggtacag gccgtgcggt tgatattgcc     7380

aaaacagagc tgtgggggag agttgtcgag aaagagtgcg gaagatgcaa aggcgtcggc     7440

tattcaagga tgccagcaag cgcagcatat cgcgctgtga cgatgctaat cccaaacctt     7500

acccaaccca cctggtcacg cactgttaag ccgctgtatg acgctctggt ggtgcaatgc     7560

cacaaagaag agtcaatcgc agacaacatt ttgaatgcgg tcacacgtta gcagcatgat     7620

tgccacggat ggcaacatat taacggcatg atattgactt attgaataaa attgggtaaa     7680

tttgactcaa cgatgggtta attcgctcgt tgtggtagtg agatgaaaag aggcggcgct     7740

tactaccgat tccgcctagt tggtcacttc gacgtatcgt ctggaactcc aaccatcgca     7800

ggcagagagg tctgcaaaat gcaatcccga aacagttcgc aggtaatagt tagagcctgc     7860

ataacggttt cgggattttt tatatctgca caacaggtaa gagcattgag tcgataatcg     7920

tgaagagtcg gcgagcctgg ttagccagtg ctctttccgt tgtgctgaat taagcgaata     7980

ccggaagcag aaccggatca ccaaatgcgt acaggcgtca tcgccgccca gcaacagcac     8040

aacccaaact gagccgtagc cactgtctgt cctgaattca ttagtaatag ttacgctgcg     8100

gccttttaca catgaccttc gtgaaagcgg gtggcaggag gtcgcgctaa caacctcctg     8160

ccgttttgcc cgtgcatatc ggtcacgaac aaatctgatt actaaacaca gtagcctgga     8220

tttgttctat cagtaatcga ccttattcct aattaaatag agcaaatccc cttattgggg     8280

gtaagacatg aagatgccag aaaaacatga cctgttggcc gccattctcg cggcaaagga     8340

acaaggcatc ggggcaatcc ttgcgtttgc aatggcgtac cttcgcggca gatataatgg     8400

cggtgcgttt acaaaaacag taatcgacgc aacgatgtgc gccattatcg cctggttcat     8460

tcgtgacctt ctcgacttcg ccggactaag tagcaatctc gcttatataa cgagcgtgtt     8520

tatcggctac atcggtactg actcgattgg ttcgcttatc aaacgcttcg ctgctaaaaa     8580

agccggagta gaagatggta gaaatcaata atcaacgtaa ggcgttcctc gatatgctgg     8640

cgtggtcgga gggaactgat aacggacgtc agaaaaccag aaatcatggt tatgacgtca     8700

ttgtaggcgg agagctattt actgattact ccgatcaccc tcgcaaactt gtcacgctaa     8760

acccaaaact caaatcaaca ggcgcttaag actggccgtc gttttacaac acagaaagag     8820

tttgtagaaa cgcaaaaagg ccatccgtca ggggccttct gcttagtttg atgcctggca     8880

gttccctact ctcgccttcc gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc     8940

tgcggcgagc ggtatcagct cactcaaagg cggtaatacg gttatccaca gaatcagggg     9000

ataacgcagg aaagaacatg tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg     9060

ccgcgttgct ggcgtttttc cataggctcc gcccccctga cgagcatcac aaaaatcgac     9120

gctcaagtca gaggtggcga aacccgacag gactataaag ataccaggcg tttccccctg     9180

gaagctccct cgtgcgctct cctgttccga ccctgccgct taccggatac ctgtccgcct     9240

ttctcccttc gggaagcgtg gcgctttctc atagctcacg ctgtaggtat ctcagttcgg     9300

tgtaggtcgt tcgctccaag ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct     9360

gcgccttatc cggtaactat cgtcttgagt ccaacccggt aagacacgac ttatcgccac     9420

tggcagcagc cactggtaac aggattagca gagcgaggta tgtaggcggt gctacagagt     9480

tcttgaagtg gtgggctaac tacggctaca ctagaagaac agtatttggt atctgcgctc     9540

tgctgaagcc agttaccttc ggaaaaagag ttggtagctc ttgatccggc aaacaaacca     9600

ccgctggtag cggtggtttt tttgtttgca agcagcagat tacgcgcaga aaaaaaggat     9660

ctcaagaaga tcctttgatc ttttctacgg ggtctgacgc tcagtggaac gacgcgcgcg     9720

taactcacgt taagggattt tggtcatgag cttgcgccgt cccgtcaagt cagcgtaatg     9780

ctctgctttt agaaaaactc atcgagcatc aaatgaaact gcaatttatt catatcagga     9840

ttatcaatac catatttttg aaaaagccgt ttctgtaatg aaggagaaaa ctcaccgagg     9900

cagttccata ggatggcaag atcctggtat cggtctgcga ttccgactcg tccaacatca     9960

atacaaccta ttaatttccc ctcgtcaaaa ataaggttat caagtgagaa atcaccatga    10020

gtgacgactg aatccggtga gaatggcaaa agtttatgca tttctttcca gacttgttca    10080

acaggccagc cattacgctc gtcatcaaaa tcactcgcat caaccaaacc gttattcatt    10140

cgtgattgcg cctgagcgag gcgaaatacg cgatcgctgt taaaaggaca attacaaaca    10200

ggaatcgagt gcaaccggcg caggaacact gccagcgcat caacaatatt ttcacctgaa    10260

tcaggatatt cttctaatac ctggaacgct gtttttccgg ggatcgcagt ggtgagtaac    10320

catgcatcat caggagtacg gataaaatgc ttgatggtcg gaagtggcat aaattccgtc    10380

agccagttta gtctgaccat ctcatctgta acatcattgg caacgctacc tttgccatgt    10440

ttcagaaaca actctggcgc atcgggcttc ccatacaagc gatagattgt cgcacctgat    10500

tgcccgacat tatcgcgagc ccatttatac ccatataaat cagcatccat gttggaattt    10560

aatcgcggcc tcgacgtttc ccgttgaata tggctcatat tcttcctttt tcaatattat    10620

tgaagcattt atcagggtta ttgtctcatg agcggataca tatttgaatg tatttagaaa    10680

aataaacaaa taggggtcag tgttacaacc aattaaccaa ttctgaacat tatcgcgagc    10740

ccatttatac ctgaatatgg ctcataacac cccttgtttg cctggcggca gtagcgcggt    10800

ggtcccacct gaccccatgc cgaactcaga agtgaaacgc cgtagcgccg atggtagtgt    10860

ggggactccc catgcgagag tagggaactg ccaggcatca aataaaacga aaggctcagt    10920

cgaaagactg ggcctttcgc ccgggctaat tagggggtgt cgcccttatt cgactctata    10980

gtgaagttcc tattctctag aaagtatagg aacttctgaa gtggggtcga cttaattaag    11040

g                                                                    11041


<210>  32
<211>  11206
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  32
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

aagatccaag ctcagatctc gatcgagttg ggccccagaa gcctggtggt tgtttgtcct      240

tctcagggga aaagtgaggc ggccccttgg aggaaggggc cgggcagaat gatctaatcg      300

gattccaagc agctcagggg attgtctttt tctagcacct tcttgccact cctaagcgtc      360

ctccgtgacc ccggctggga tttagcctgg tgctgtgtca gccccggtct cccaggggct      420

tcccagtggt ccccaggaac cctcgacagg gcccggtctc tctcgtccag caagggcagg      480

gacgggccac aggccaaggg ccctcgatcg aggaactgaa aaaccagaaa gttaactggt      540

aagtttagtc tttttgtctt ttatttcagg tcccggatcc ggtggtggtg caaatcaaag      600

aactgctcct cagtggatgt tgcctttact tctaggcctg tacggaagtg ttacttctgc      660

tctaaaagct gcggaattgt acccgcggcc gccaccatgg ctaagattaa cacccagtac      720

tcacatccat cccgcactca cctcaaagtc aagacctccg atcgggatct gaaccgggct      780

gagaatgggc tgtcgcgcgc ccactcgtcg tccgaggaaa ccagcagcgt gctccagccg      840

ggcatcgcca tggaaactag ggggctggcg gactccggac agggatcctt cactggacag      900

ggtattgccc ggttcgggcg gattcagaag aagtcccagc cggagaaggt cgtgcgggct      960

gccagcaggg gcaggccact cattggttgg acacagtggt gcgctgagga tggtggagat     1020

gaatcggaaa tggcactggc cggctctccc ggatgcagct cgggccccca agggagactg     1080

agcagactga tcttcctgct tcgccgctgg gcggccagac acgtgcacca tcaggaccag     1140

ggacctgata gcttccccga ccgctttagg ggagccgagc tgaaagaagt gtcaagccag     1200

gagtcaaacg cgcaggccaa cgtcggcagc caagagcctg cagaccgggg acgctcggca     1260

tggccgctcg caaagtgcaa cactaacact tccaacaaca ccgaagagga aaagaaaacc     1320

aagaagaagg atgcaattgt ggtggaccct tcctccaacc tgtactaccg ctggttgacc     1380

gccatcgccc tcccggtctt ttacaattgg tatctcctta tctgccgggc ctgcttcgac     1440

gaactgcaat cagagtacct gatgctgtgg ctggtgctgg actatagcgc cgatgtgctc     1500

tacgtcctgg atgtgctcgt gcgcgcccgg accggattct tggaacaagg cctgatggtg     1560

tccgacacga atagactgtg gcagcactat aagaccacaa cccagttcaa gcttgacgtg     1620

ctcagccttg tgccgactga cctggcctac ctgaaagtcg gaactaacta cccggaagtc     1680

agattcaacc gactcctgaa gttcagcagg ctgttcgagt tctttgaccg caccgagact     1740

cggaccaact accctaacat gttccggatc ggaaatctgg tgctctacat actgattatc     1800

atccattgga acgcctgtat ctatttcgcc atttcgaagt tcatcggttt cggaaccgat     1860

tcctgggtgt accccaacat ctcgatcccc gaacacggtc gcctgtcccg gaagtacatc     1920

tactccctgt actggtccac tctgactctg accacgatcg gggaaacccc tccacccgtg     1980

aaggacgaag agtacctgtt cgtggtggtg gacttcctgg tcggagtgtt gattttcgcc     2040

accattgtgg gaaacgtggg ctccatgatc tccaacatga acgcgtcgag agctgagttc     2100

caagccaaga tcgactccat taagcagtac atgcagttca gaaaggtcac caaggacctg     2160

gaaaccaggg tcatccgctg gttcgactac ctgtgggcca acaaaaagac tgtggacgaa     2220

aaggaagtgc tgaagtcgct gccggataag ctgaaggccg aaatcgccat taacgtgcac     2280

cttgacaccc tgaagaaagt ccggatcttc caagactgtg aagccggcct cctggtggag     2340

ctcgtgctca agctgcggcc caccgtgttc agcccgggag attacatttg caagaagggc     2400

gatatcggca aagagatgta catcatcaac gagggaaagc tggccgtggt cgcggacgac     2460

ggcgtgaccc agttcgtggt gctgtccgac ggatcctact tcggtgaaat ctcaatcctc     2520

aacatcaagg ggtccaagtc cggcaaccgg agaactgcca acattcgctc catcggatac     2580

agcgacctgt tttgcctgtc caaggatgac ctgatggagg ctctgactga gtaccctgaa     2640

gcgaagaagg ctttggagga aaaggggcgg cagattctga tgaaggacaa tttgatcgac     2700

gaggagctcg cacgggccgg cgccgacccc aaggatctcg aagagaaggt cgaacagctg     2760

ggttcttcgc ttgataccct gcaaacccga ttcgcgcggc tgctcgccga gtacaacgcg     2820

acccagatga agatgaagca gagactgtca cagttggaat cccaagtcaa gggcggaggc     2880

gacaagccgc tggcggacgg ggaagtgccc ggggacgcca ccaagactga ggacaagcag     2940

cagtgatcat agatcgatct gcctcgactg tgccttctag ttgccagcca tctgttgttt     3000

gcccctcccc cgtgccttcc ttgaccctgg aaggtgccac tcccactgtc ctttcctaat     3060

aaaatgagga aattgcatcg cattgtctga gtaggtgtca ttctattctg gggggtgggg     3120

tggggcagga cagcaagggg gaggattggg aagacaatag caggcatgct ggggactcga     3180

gttctacgta gataagtagc atggcgggtt aatcattaac tacaaggaac ccctagtgat     3240

ggagttggcc actccctctc tgcgcgctcg ctcgctcact gaggccgggc gaccaaaggt     3300

cgcccgacgc ccgggctttg cccgggcggc ctcagtgagc gagcgagcgc gcagccttaa     3360

ttaacctaag gaaaatgaag tgaagttcct atactttcta gagaatagga acttctatag     3420

tgagtcgaat aagggcgaca caaaatttat tctaaatgca taataaatac tgataacatc     3480

ttatagtttg tattatattt tgtattatcg ttgacatgta taattttgat atcaaaaact     3540

gattttccct ttattatttt cgagatttat tttcttaatt ctctttaaca aactagaaat     3600

attgtatata caaaaaatca taaataatag atgaatagtt taattatagg tgttcatcaa     3660

tcgaaaaagc aacgtatctt atttaaagtg cgttgctttt ttctcattta taaggttaaa     3720

taattctcat atatcaagca aagtgacagg cgcccttaaa tattctgaca aatgctcttt     3780

ccctaaactc cccccataaa aaaacccgcc gaagcgggtt tttacgttat ttgcggatta     3840

acgattactc gttatcagaa ccgcccaggg ggcccgagct taaccttttt atttggggga     3900

gagggaagtc atgaaaaaac taacctttga aattcgatct ccagcacatc agcaaaacgc     3960

tattcacgca gtacagcaaa tccttccaga cccaaccaaa ccaatcgtag taaccattca     4020

ggaacgcaac cgcagcttag accaaaacag gaagctatgg gcctgcttag gtgacgtctc     4080

tcgtcaggtt gaatggcatg gtcgctggct ggatgcagaa agctggaagt gtgtgtttac     4140

cgcagcatta aagcagcagg atgttgttcc taaccttgcc gggaatggct ttgtggtaat     4200

aggccagtca accagcagga tgcgtgtagg cgaatttgcg gagctattag agcttataca     4260

ggcattcggt acagagcgtg gcgttaagtg gtcagacgaa gcgagactgg ctctggagtg     4320

gaaagcgaga tggggagaca gggctgcatg ataaatgtcg ttagtttctc cggtggcagg     4380

acgtcagcat atttgctctg gctaatggag caaaagcgac gggcaggtaa agacgtgcat     4440

tacgttttca tggatacagg ttgtgaacat ccaatgacat atcggtttgt cagggaagtt     4500

gtgaagttct gggatatacc gctcaccgta ttgcaggttg atatcaaccc ggagcttgga     4560

cagccaaatg gttatacggt atgggaacca aaggatattc agacgcgaat gcctgttctg     4620

aagccattta tcgatatggt aaagaaatat ggcactccat acgtcggcgg cgcgttctgc     4680

actgacagat taaaactcgt tcccttcacc aaatactgtg atgaccattt cgggcgaggg     4740

aattacacca cgtggattgg catcagagct gatgaaccga agcggctaaa gccaaagcct     4800

ggaatcagat atcttgctga actgtcagac tttgagaagg aagatatcct cgcatggtgg     4860

aagcaacaac cattcgattt gcaaataccg gaacatctcg gtaactgcat attctgcatt     4920

aaaaaatcaa cgcaaaaaat cggacttgcc tgcaaagatg aggagggatt gcagcgtgtt     4980

tttaatgagg tcatcacggg atcccatgtg cgtgacggac atcgggaaac gccaaaggag     5040

attatgtacc gaggaagaat gtcgctggac ggtatcgcga aaatgtattc agaaaatgat     5100

tatcaagccc tgtatcagga catggtacga gctaaaagat tcgataccgg ctcttgttct     5160

gagtcatgcg aaatatttgg agggcagctt gatttcgact tcgggaggga agctgcatga     5220

tgcgatgtta tcggtgcggt gaatgcaaag aagataaccg cttccgacca aatcaacctt     5280

actggaatcg atggtgtctc cggtgtgaaa gaacaccaac aggggtgtta ccactaccgc     5340

aggaaaagga ggacgtgtgg cgagacagcg acgaagtatc accgacataa tctgcgaaaa     5400

ctgcaaatac cttccaacga aacgcaccag aaataaaccc aagccaatcc caaaagaatc     5460

tgacgtaaaa accttcaact acacggctca cctgtgggat atccggtggc taagacgtcg     5520

tgcgaggaaa acaaggtgat tgaccaaaat cgaagttacg aacaagaaag cgtcgagcga     5580

gctttaacgt gcgctaactg cggtcagaag ctgcatgtgc tggaagttca cgtgtgtgag     5640

cactgctgcg cagaactgat gagcgatccg aatagctcga tgcacgagga agaagatgat     5700

ggctaaacca gcgcgaagac gatgtaaaaa cgatgaatgc cgggaatggt ttcaccctgc     5760

attcgctaat cagtggtggt gctctccaga gtgtggaacc aagatagcac tcgaacgacg     5820

aagtaaagaa cgcgaaaaag cggaaaaagc agcagagaag aaacgacgac gagaggagca     5880

gaaacagaaa gataaactta agattcgaaa actcgcctta aagccccgca gttactggat     5940

taaacaagcc caacaagccg taaacgcctt catcagagaa agagaccgcg acttaccatg     6000

tatctcgtgc ggaacgctca cgtctgctca gtgggatgcc ggacattacc ggacaactgc     6060

tgcggcacct caactccgat ttaatgaacg caatattcac aagcaatgcg tggtgtgcaa     6120

ccagcacaaa agcggaaatc tcgttccgta tcgcgtcgaa ctgattagcc gcatcgggca     6180

ggaagcagta gacgaaatcg aatcaaacca taaccgccat cgctggacta tcgaagagtg     6240

caaggcgatc aaggcagagt accaacagaa actcaaagac ctgcgaaata gcagaagtga     6300

ggccgcatga cgttctcagt aaaaaccatt ccagacatgc tcgttgaagc atacggaaat     6360

cagacagaag tagcacgcag actgaaatgt agtcgcggta cggtcagaaa atacgttgat     6420

gataaagacg ggaaaatgca cgccatcgtc aacgacgttc tcatggttca tcgcggatgg     6480

agtgaaagag atgcgctatt acgaaaaaat tgatggcagc aaataccgaa atatttgggt     6540

agttggcgat ctgcacggat gctacacgaa cctgatgaac aaactggata cgattggatt     6600

cgacaacaaa aaagacctgc ttatctcggt gggcgatttg gttgatcgtg gtgcagagaa     6660

cgttgaatgc ctggaattaa tcacattccc ctggttcaga gctgtacgtg gaaaccatga     6720

gcaaatgatg attgatggct tatcagagcg tggaaacgtt aatcactggc tgcttaatgg     6780

cggtggctgg ttctttaatc tcgattacga caaagaaatt ctggctaaag ctcttgccca     6840

taaagcagat gaacttccgt taatcatcga actggtgagc aaagataaaa aatatgttat     6900

ctgccacgcc gattatccct ttgacgaata cgagtttgga aagccagttg atcatcagca     6960

ggtaatctgg aaccgcgaac gaatcagcaa ctcacaaaac gggatcgtga aagaaatcaa     7020

aggcgcggac acgttcatct ttggtcatac gccagcagtg aaaccactca agtttgccaa     7080

ccaaatgtat atcgataccg gcgcagtgtt ctgcggaaac ctaacattga ttcaggtaca     7140

gggagaaggc gcatgagact cgaaagcgta gctaaatttc attcgccaaa aagcccgatg     7200

atgagcgact caccacgggc cacggcttct gactctcttt ccggtactga tgtgatggct     7260

gctatgggga tggcgcaatc acaagccgga ttcggtatgg ctgcattctg cggtaagcac     7320

gaactcagcc agaacgacaa acaaaaggct atcaactatc tgatgcaatt tgcacacaag     7380

gtatcgggga aataccgtgg tgtggcaaag cttgaaggaa atactaaggc aaaggtactg     7440

caagtgctcg caacattcgc ttatgcggat tattgccgta gtgccgcgac gccgggggca     7500

agatgcagag attgccatgg tacaggccgt gcggttgata ttgccaaaac agagctgtgg     7560

gggagagttg tcgagaaaga gtgcggaaga tgcaaaggcg tcggctattc aaggatgcca     7620

gcaagcgcag catatcgcgc tgtgacgatg ctaatcccaa accttaccca acccacctgg     7680

tcacgcactg ttaagccgct gtatgacgct ctggtggtgc aatgccacaa agaagagtca     7740

atcgcagaca acattttgaa tgcggtcaca cgttagcagc atgattgcca cggatggcaa     7800

catattaacg gcatgatatt gacttattga ataaaattgg gtaaatttga ctcaacgatg     7860

ggttaattcg ctcgttgtgg tagtgagatg aaaagaggcg gcgcttacta ccgattccgc     7920

ctagttggtc acttcgacgt atcgtctgga actccaacca tcgcaggcag agaggtctgc     7980

aaaatgcaat cccgaaacag ttcgcaggta atagttagag cctgcataac ggtttcggga     8040

ttttttatat ctgcacaaca ggtaagagca ttgagtcgat aatcgtgaag agtcggcgag     8100

cctggttagc cagtgctctt tccgttgtgc tgaattaagc gaataccgga agcagaaccg     8160

gatcaccaaa tgcgtacagg cgtcatcgcc gcccagcaac agcacaaccc aaactgagcc     8220

gtagccactg tctgtcctga attcattagt aatagttacg ctgcggcctt ttacacatga     8280

ccttcgtgaa agcgggtggc aggaggtcgc gctaacaacc tcctgccgtt ttgcccgtgc     8340

atatcggtca cgaacaaatc tgattactaa acacagtagc ctggatttgt tctatcagta     8400

atcgacctta ttcctaatta aatagagcaa atccccttat tgggggtaag acatgaagat     8460

gccagaaaaa catgacctgt tggccgccat tctcgcggca aaggaacaag gcatcggggc     8520

aatccttgcg tttgcaatgg cgtaccttcg cggcagatat aatggcggtg cgtttacaaa     8580

aacagtaatc gacgcaacga tgtgcgccat tatcgcctgg ttcattcgtg accttctcga     8640

cttcgccgga ctaagtagca atctcgctta tataacgagc gtgtttatcg gctacatcgg     8700

tactgactcg attggttcgc ttatcaaacg cttcgctgct aaaaaagccg gagtagaaga     8760

tggtagaaat caataatcaa cgtaaggcgt tcctcgatat gctggcgtgg tcggagggaa     8820

ctgataacgg acgtcagaaa accagaaatc atggttatga cgtcattgta ggcggagagc     8880

tatttactga ttactccgat caccctcgca aacttgtcac gctaaaccca aaactcaaat     8940

caacaggcgc ttaagactgg ccgtcgtttt acaacacaga aagagtttgt agaaacgcaa     9000

aaaggccatc cgtcaggggc cttctgctta gtttgatgcc tggcagttcc ctactctcgc     9060

cttccgcttc ctcgctcact gactcgctgc gctcggtcgt tcggctgcgg cgagcggtat     9120

cagctcactc aaaggcggta atacggttat ccacagaatc aggggataac gcaggaaaga     9180

acatgtgagc aaaaggccag caaaaggcca ggaaccgtaa aaaggccgcg ttgctggcgt     9240

ttttccatag gctccgcccc cctgacgagc atcacaaaaa tcgacgctca agtcagaggt     9300

ggcgaaaccc gacaggacta taaagatacc aggcgtttcc ccctggaagc tccctcgtgc     9360

gctctcctgt tccgaccctg ccgcttaccg gatacctgtc cgcctttctc ccttcgggaa     9420

gcgtggcgct ttctcatagc tcacgctgta ggtatctcag ttcggtgtag gtcgttcgct     9480

ccaagctggg ctgtgtgcac gaaccccccg ttcagcccga ccgctgcgcc ttatccggta     9540

actatcgtct tgagtccaac ccggtaagac acgacttatc gccactggca gcagccactg     9600

gtaacaggat tagcagagcg aggtatgtag gcggtgctac agagttcttg aagtggtggg     9660

ctaactacgg ctacactaga agaacagtat ttggtatctg cgctctgctg aagccagtta     9720

ccttcggaaa aagagttggt agctcttgat ccggcaaaca aaccaccgct ggtagcggtg     9780

gtttttttgt ttgcaagcag cagattacgc gcagaaaaaa aggatctcaa gaagatcctt     9840

tgatcttttc tacggggtct gacgctcagt ggaacgacgc gcgcgtaact cacgttaagg     9900

gattttggtc atgagcttgc gccgtcccgt caagtcagcg taatgctctg cttttagaaa     9960

aactcatcga gcatcaaatg aaactgcaat ttattcatat caggattatc aataccatat    10020

ttttgaaaaa gccgtttctg taatgaagga gaaaactcac cgaggcagtt ccataggatg    10080

gcaagatcct ggtatcggtc tgcgattccg actcgtccaa catcaataca acctattaat    10140

ttcccctcgt caaaaataag gttatcaagt gagaaatcac catgagtgac gactgaatcc    10200

ggtgagaatg gcaaaagttt atgcatttct ttccagactt gttcaacagg ccagccatta    10260

cgctcgtcat caaaatcact cgcatcaacc aaaccgttat tcattcgtga ttgcgcctga    10320

gcgaggcgaa atacgcgatc gctgttaaaa ggacaattac aaacaggaat cgagtgcaac    10380

cggcgcagga acactgccag cgcatcaaca atattttcac ctgaatcagg atattcttct    10440

aatacctgga acgctgtttt tccggggatc gcagtggtga gtaaccatgc atcatcagga    10500

gtacggataa aatgcttgat ggtcggaagt ggcataaatt ccgtcagcca gtttagtctg    10560

accatctcat ctgtaacatc attggcaacg ctacctttgc catgtttcag aaacaactct    10620

ggcgcatcgg gcttcccata caagcgatag attgtcgcac ctgattgccc gacattatcg    10680

cgagcccatt tatacccata taaatcagca tccatgttgg aatttaatcg cggcctcgac    10740

gtttcccgtt gaatatggct catattcttc ctttttcaat attattgaag catttatcag    10800

ggttattgtc tcatgagcgg atacatattt gaatgtattt agaaaaataa acaaataggg    10860

gtcagtgtta caaccaatta accaattctg aacattatcg cgagcccatt tatacctgaa    10920

tatggctcat aacacccctt gtttgcctgg cggcagtagc gcggtggtcc cacctgaccc    10980

catgccgaac tcagaagtga aacgccgtag cgccgatggt agtgtgggga ctccccatgc    11040

gagagtaggg aactgccagg catcaaataa aacgaaaggc tcagtcgaaa gactgggcct    11100

ttcgcccggg ctaattaggg ggtgtcgccc ttattcgact ctatagtgaa gttcctattc    11160

tctagaaagt ataggaactt ctgaagtggg gtcgacttaa ttaagg                   11206


<210>  33
<211>  11435
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  33
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

ctgaagagac agaaatatct ctaattccat gagcggtcat acgaggcaag agaagccgct      240

tagagcatgg acttagttag tttcagggat tggacagagt caagagctgg ggtgaggagg      300

ttaccctcgg taggggtgac acagatgtca accgcctatt ccctccacat gcatgtcctg      360

ccagaagaac ctgtccctgg gctgggaatc ttatattacc ttcctctcca atgagaagag      420

aagttcaagg ctcacagaca tgtgcataca caagctcaat gcactcaaga ttcccctcca      480

ccactcctgc ccccactacc tacaggagat tgactcctgc tgtgcacata agctgggata      540

atcagggttt ctaaacatca gcttcaaaag tccaatgtcc aaagtggtgg ggggccgggg      600

aacgaggtac tctttccata cccttggctt ttgtgtggcc tggagccgct gatatagaga      660

ttggagtggg acacgaggta ttcctttcaa aaacacaaag gcctatactt tgagccctcc      720

catttcaatc ccccaccatg cttcaccttt aagacctcca actccacttt gatcccagtt      780

ctcaggttca ggcctcacaa ggccaaaatc ctgaagttac ccttctcaaa ctcccttgcc      840

tttaacatca tcagaatcaa cctcctaccc ccactctgtc ccagcagcaa tagcctgcta      900

atcttttagc actaatcttt taggcactaa tctgctttcc aaactcttgg cacctgaact      960

atttataagc agtgttttat gcccccccac caaagaaccc tattcttttc ccatgacccc     1020

accaatcaaa acactcagag gactgtgggt ataagaggct ggggaggcag gcatagcagc     1080

ggccgccacc atggccaaga tcaacaccca atactcccac ccctccagga cccacctcaa     1140

ggtaaagacc tcagaccggg atctcaatcg cgctgaaaat ggcctcagca gagcccactc     1200

gtcaagtgag gagacatcgt cagtgctgca gccggggatc gccatggaga ccagaggact     1260

ggctgactcc gggcagggct ccttcaccgg ccaggggatc gccaggctgt cgcgcctcat     1320

cttcttgctg cgcaggtggg ctgccaggca tgtgcaccac caggaccagg gaccggactc     1380

ttttcctgat cgtttccgtg gagccgagct taaggaggtg tccagccaag aaagcaatgc     1440

ccaggcaaat gtgggcagcc aggagccagc agacagaggg agaagcgcct ggcccctggc     1500

caaatgcaac actaacacca gcaacaacac ggaggaggag aagaagacga aaaagaagga     1560

tgcgatcgtg gtggacccgt ccagcaacct gtactaccgc tggctgaccg ccatcgccct     1620

gcctgtcttc tataactggt atctgcttat ttgcagggcc tgtttcgatg agctgcagtc     1680

cgagtacctg atgctgtggc tggtcctgga ctactcggca gatgtcctgt atgtcttgga     1740

tgtgcttgta cgagctcgga caggttttct tgagcaaggc ttaatggtca gtgataccaa     1800

caggctgtgg cagcattaca agacgaccac gcagttcaag ctggatgtgt tgtccctggt     1860

ccccaccgac ctggcttact taaaggtggg cacaaactac ccagaagtga ggttcaaccg     1920

cctactgaag ttttcccggc tctttgaatt ctttgaccgc acagagacaa ggaccaacta     1980

ccccaatatg ttcaggattg ggaacttggt cttgtacatt ctcatcatca tccactggaa     2040

tgcctgcatc tactttgcca tttccaagtt cattggtttt gggacagact cctgggtcta     2100

cccaaacatc tcaatcccag agcatgggcg cctctccagg aagtacattt acagtctcta     2160

ctggtccacc ttgaccctta ccaccattgg tgagacccca ccccccgtga aagatgagga     2220

gtatctcttt gtggtcgtag acttcttggt gggtgttctg atttttgcca ccattgtggg     2280

caatgtgggc tccatgatct cgaatatgaa tgcctcacgg gcagagttcc aggccaagat     2340

tgattccatc aagcagtaca tgcagttccg caaggtcacc aaggacttgg agacgcgggt     2400

tatccggtgg tttgactacc tgtgggccaa caagaagacg gtggatgaga aggaggtgct     2460

caagagcctc ccagacaagc tgaaggctga gatcgccatc aacgtgcacc tggacacgct     2520

gaagaaggtt cgcatcttcc aggactgtga ggcagggctg ctggtggagc tggtgctgaa     2580

gctgcgaccc actgtgttca gccctgggga ttatatctgc aagaagggag atattgggaa     2640

ggagatgtac atcatcaacg agggcaagct ggccgtggtg gctgatgatg gggtcaccca     2700

gttcgtggtc ctcagcgatg gcagctactt cggggagatc agcattctga acatcaaggg     2760

gagcaagtcg gggaaccgca ggacggccaa catccgcagc attggctact cagacctgtt     2820

ctgcctctca aaggacgatc tcatggaggc cctcaccgag taccccgaag ccaagaaggc     2880

cctggaggag aaaggacggc agatcctgat gaaagacaac ctgatcgatg aggagctggc     2940

cagggcgggc gcggacccca aggaccttga ggagaaagtg gagcagctgg ggtcctccct     3000

ggacaccctg cagaccaggt ttgcacgcct cctggctgag tacaacgcca cccagatgaa     3060

gatgaagcag cgtctcagcc aactggaaag ccaggtgaag ggtggtgggg acaagcccct     3120

ggctgatggg gaagttcccg gggatgctac aaaaacagag gacaaacaac agtgatcata     3180

gatcgatctg cctcgactgt gccttctagt tgccagccat ctgttgtttg cccctccccc     3240

gtgccttcct tgaccctgga aggtgccact cccactgtcc tttcctaata aaatgaggaa     3300

attgcatcgc attgtctgag taggtgtcat tctattctgg ggggtggggt ggggcaggac     3360

agcaaggggg aggattggga agacaatagc aggcatgctg gggactcgag ttctacgtag     3420

ataagtagca tggcgggtta atcattaact acaaggaacc cctagtgatg gagttggcca     3480

ctccctctct gcgcgctcgc tcgctcactg aggccgggcg accaaaggtc gcccgacgcc     3540

cgggctttgc ccgggcggcc tcagtgagcg agcgagcgcg cagccttaat taacctaagg     3600

aaaatgaagt gaagttccta tactttctag agaataggaa cttctatagt gagtcgaata     3660

agggcgacac aaaatttatt ctaaatgcat aataaatact gataacatct tatagtttgt     3720

attatatttt gtattatcgt tgacatgtat aattttgata tcaaaaactg attttccctt     3780

tattattttc gagatttatt ttcttaattc tctttaacaa actagaaata ttgtatatac     3840

aaaaaatcat aaataataga tgaatagttt aattataggt gttcatcaat cgaaaaagca     3900

acgtatctta tttaaagtgc gttgcttttt tctcatttat aaggttaaat aattctcata     3960

tatcaagcaa agtgacaggc gcccttaaat attctgacaa atgctctttc cctaaactcc     4020

ccccataaaa aaacccgccg aagcgggttt ttacgttatt tgcggattaa cgattactcg     4080

ttatcagaac cgcccagggg gcccgagctt aaccttttta tttgggggag agggaagtca     4140

tgaaaaaact aacctttgaa attcgatctc cagcacatca gcaaaacgct attcacgcag     4200

tacagcaaat ccttccagac ccaaccaaac caatcgtagt aaccattcag gaacgcaacc     4260

gcagcttaga ccaaaacagg aagctatggg cctgcttagg tgacgtctct cgtcaggttg     4320

aatggcatgg tcgctggctg gatgcagaaa gctggaagtg tgtgtttacc gcagcattaa     4380

agcagcagga tgttgttcct aaccttgccg ggaatggctt tgtggtaata ggccagtcaa     4440

ccagcaggat gcgtgtaggc gaatttgcgg agctattaga gcttatacag gcattcggta     4500

cagagcgtgg cgttaagtgg tcagacgaag cgagactggc tctggagtgg aaagcgagat     4560

ggggagacag ggctgcatga taaatgtcgt tagtttctcc ggtggcagga cgtcagcata     4620

tttgctctgg ctaatggagc aaaagcgacg ggcaggtaaa gacgtgcatt acgttttcat     4680

ggatacaggt tgtgaacatc caatgacata tcggtttgtc agggaagttg tgaagttctg     4740

ggatataccg ctcaccgtat tgcaggttga tatcaacccg gagcttggac agccaaatgg     4800

ttatacggta tgggaaccaa aggatattca gacgcgaatg cctgttctga agccatttat     4860

cgatatggta aagaaatatg gcactccata cgtcggcggc gcgttctgca ctgacagatt     4920

aaaactcgtt cccttcacca aatactgtga tgaccatttc gggcgaggga attacaccac     4980

gtggattggc atcagagctg atgaaccgaa gcggctaaag ccaaagcctg gaatcagata     5040

tcttgctgaa ctgtcagact ttgagaagga agatatcctc gcatggtgga agcaacaacc     5100

attcgatttg caaataccgg aacatctcgg taactgcata ttctgcatta aaaaatcaac     5160

gcaaaaaatc ggacttgcct gcaaagatga ggagggattg cagcgtgttt ttaatgaggt     5220

catcacggga tcccatgtgc gtgacggaca tcgggaaacg ccaaaggaga ttatgtaccg     5280

aggaagaatg tcgctggacg gtatcgcgaa aatgtattca gaaaatgatt atcaagccct     5340

gtatcaggac atggtacgag ctaaaagatt cgataccggc tcttgttctg agtcatgcga     5400

aatatttgga gggcagcttg atttcgactt cgggagggaa gctgcatgat gcgatgttat     5460

cggtgcggtg aatgcaaaga agataaccgc ttccgaccaa atcaacctta ctggaatcga     5520

tggtgtctcc ggtgtgaaag aacaccaaca ggggtgttac cactaccgca ggaaaaggag     5580

gacgtgtggc gagacagcga cgaagtatca ccgacataat ctgcgaaaac tgcaaatacc     5640

ttccaacgaa acgcaccaga aataaaccca agccaatccc aaaagaatct gacgtaaaaa     5700

ccttcaacta cacggctcac ctgtgggata tccggtggct aagacgtcgt gcgaggaaaa     5760

caaggtgatt gaccaaaatc gaagttacga acaagaaagc gtcgagcgag ctttaacgtg     5820

cgctaactgc ggtcagaagc tgcatgtgct ggaagttcac gtgtgtgagc actgctgcgc     5880

agaactgatg agcgatccga atagctcgat gcacgaggaa gaagatgatg gctaaaccag     5940

cgcgaagacg atgtaaaaac gatgaatgcc gggaatggtt tcaccctgca ttcgctaatc     6000

agtggtggtg ctctccagag tgtggaacca agatagcact cgaacgacga agtaaagaac     6060

gcgaaaaagc ggaaaaagca gcagagaaga aacgacgacg agaggagcag aaacagaaag     6120

ataaacttaa gattcgaaaa ctcgccttaa agccccgcag ttactggatt aaacaagccc     6180

aacaagccgt aaacgccttc atcagagaaa gagaccgcga cttaccatgt atctcgtgcg     6240

gaacgctcac gtctgctcag tgggatgccg gacattaccg gacaactgct gcggcacctc     6300

aactccgatt taatgaacgc aatattcaca agcaatgcgt ggtgtgcaac cagcacaaaa     6360

gcggaaatct cgttccgtat cgcgtcgaac tgattagccg catcgggcag gaagcagtag     6420

acgaaatcga atcaaaccat aaccgccatc gctggactat cgaagagtgc aaggcgatca     6480

aggcagagta ccaacagaaa ctcaaagacc tgcgaaatag cagaagtgag gccgcatgac     6540

gttctcagta aaaaccattc cagacatgct cgttgaagca tacggaaatc agacagaagt     6600

agcacgcaga ctgaaatgta gtcgcggtac ggtcagaaaa tacgttgatg ataaagacgg     6660

gaaaatgcac gccatcgtca acgacgttct catggttcat cgcggatgga gtgaaagaga     6720

tgcgctatta cgaaaaaatt gatggcagca aataccgaaa tatttgggta gttggcgatc     6780

tgcacggatg ctacacgaac ctgatgaaca aactggatac gattggattc gacaacaaaa     6840

aagacctgct tatctcggtg ggcgatttgg ttgatcgtgg tgcagagaac gttgaatgcc     6900

tggaattaat cacattcccc tggttcagag ctgtacgtgg aaaccatgag caaatgatga     6960

ttgatggctt atcagagcgt ggaaacgtta atcactggct gcttaatggc ggtggctggt     7020

tctttaatct cgattacgac aaagaaattc tggctaaagc tcttgcccat aaagcagatg     7080

aacttccgtt aatcatcgaa ctggtgagca aagataaaaa atatgttatc tgccacgccg     7140

attatccctt tgacgaatac gagtttggaa agccagttga tcatcagcag gtaatctgga     7200

accgcgaacg aatcagcaac tcacaaaacg ggatcgtgaa agaaatcaaa ggcgcggaca     7260

cgttcatctt tggtcatacg ccagcagtga aaccactcaa gtttgccaac caaatgtata     7320

tcgataccgg cgcagtgttc tgcggaaacc taacattgat tcaggtacag ggagaaggcg     7380

catgagactc gaaagcgtag ctaaatttca ttcgccaaaa agcccgatga tgagcgactc     7440

accacgggcc acggcttctg actctctttc cggtactgat gtgatggctg ctatggggat     7500

ggcgcaatca caagccggat tcggtatggc tgcattctgc ggtaagcacg aactcagcca     7560

gaacgacaaa caaaaggcta tcaactatct gatgcaattt gcacacaagg tatcggggaa     7620

ataccgtggt gtggcaaagc ttgaaggaaa tactaaggca aaggtactgc aagtgctcgc     7680

aacattcgct tatgcggatt attgccgtag tgccgcgacg ccgggggcaa gatgcagaga     7740

ttgccatggt acaggccgtg cggttgatat tgccaaaaca gagctgtggg ggagagttgt     7800

cgagaaagag tgcggaagat gcaaaggcgt cggctattca aggatgccag caagcgcagc     7860

atatcgcgct gtgacgatgc taatcccaaa ccttacccaa cccacctggt cacgcactgt     7920

taagccgctg tatgacgctc tggtggtgca atgccacaaa gaagagtcaa tcgcagacaa     7980

cattttgaat gcggtcacac gttagcagca tgattgccac ggatggcaac atattaacgg     8040

catgatattg acttattgaa taaaattggg taaatttgac tcaacgatgg gttaattcgc     8100

tcgttgtggt agtgagatga aaagaggcgg cgcttactac cgattccgcc tagttggtca     8160

cttcgacgta tcgtctggaa ctccaaccat cgcaggcaga gaggtctgca aaatgcaatc     8220

ccgaaacagt tcgcaggtaa tagttagagc ctgcataacg gtttcgggat tttttatatc     8280

tgcacaacag gtaagagcat tgagtcgata atcgtgaaga gtcggcgagc ctggttagcc     8340

agtgctcttt ccgttgtgct gaattaagcg aataccggaa gcagaaccgg atcaccaaat     8400

gcgtacaggc gtcatcgccg cccagcaaca gcacaaccca aactgagccg tagccactgt     8460

ctgtcctgaa ttcattagta atagttacgc tgcggccttt tacacatgac cttcgtgaaa     8520

gcgggtggca ggaggtcgcg ctaacaacct cctgccgttt tgcccgtgca tatcggtcac     8580

gaacaaatct gattactaaa cacagtagcc tggatttgtt ctatcagtaa tcgaccttat     8640

tcctaattaa atagagcaaa tccccttatt gggggtaaga catgaagatg ccagaaaaac     8700

atgacctgtt ggccgccatt ctcgcggcaa aggaacaagg catcggggca atccttgcgt     8760

ttgcaatggc gtaccttcgc ggcagatata atggcggtgc gtttacaaaa acagtaatcg     8820

acgcaacgat gtgcgccatt atcgcctggt tcattcgtga ccttctcgac ttcgccggac     8880

taagtagcaa tctcgcttat ataacgagcg tgtttatcgg ctacatcggt actgactcga     8940

ttggttcgct tatcaaacgc ttcgctgcta aaaaagccgg agtagaagat ggtagaaatc     9000

aataatcaac gtaaggcgtt cctcgatatg ctggcgtggt cggagggaac tgataacgga     9060

cgtcagaaaa ccagaaatca tggttatgac gtcattgtag gcggagagct atttactgat     9120

tactccgatc accctcgcaa acttgtcacg ctaaacccaa aactcaaatc aacaggcgct     9180

taagactggc cgtcgtttta caacacagaa agagtttgta gaaacgcaaa aaggccatcc     9240

gtcaggggcc ttctgcttag tttgatgcct ggcagttccc tactctcgcc ttccgcttcc     9300

tcgctcactg actcgctgcg ctcggtcgtt cggctgcggc gagcggtatc agctcactca     9360

aaggcggtaa tacggttatc cacagaatca ggggataacg caggaaagaa catgtgagca     9420

aaaggccagc aaaaggccag gaaccgtaaa aaggccgcgt tgctggcgtt tttccatagg     9480

ctccgccccc ctgacgagca tcacaaaaat cgacgctcaa gtcagaggtg gcgaaacccg     9540

acaggactat aaagatacca ggcgtttccc cctggaagct ccctcgtgcg ctctcctgtt     9600

ccgaccctgc cgcttaccgg atacctgtcc gcctttctcc cttcgggaag cgtggcgctt     9660

tctcatagct cacgctgtag gtatctcagt tcggtgtagg tcgttcgctc caagctgggc     9720

tgtgtgcacg aaccccccgt tcagcccgac cgctgcgcct tatccggtaa ctatcgtctt     9780

gagtccaacc cggtaagaca cgacttatcg ccactggcag cagccactgg taacaggatt     9840

agcagagcga ggtatgtagg cggtgctaca gagttcttga agtggtgggc taactacggc     9900

tacactagaa gaacagtatt tggtatctgc gctctgctga agccagttac cttcggaaaa     9960

agagttggta gctcttgatc cggcaaacaa accaccgctg gtagcggtgg tttttttgtt    10020

tgcaagcagc agattacgcg cagaaaaaaa ggatctcaag aagatccttt gatcttttct    10080

acggggtctg acgctcagtg gaacgacgcg cgcgtaactc acgttaaggg attttggtca    10140

tgagcttgcg ccgtcccgtc aagtcagcgt aatgctctgc ttttagaaaa actcatcgag    10200

catcaaatga aactgcaatt tattcatatc aggattatca ataccatatt tttgaaaaag    10260

ccgtttctgt aatgaaggag aaaactcacc gaggcagttc cataggatgg caagatcctg    10320

gtatcggtct gcgattccga ctcgtccaac atcaatacaa cctattaatt tcccctcgtc    10380

aaaaataagg ttatcaagtg agaaatcacc atgagtgacg actgaatccg gtgagaatgg    10440

caaaagttta tgcatttctt tccagacttg ttcaacaggc cagccattac gctcgtcatc    10500

aaaatcactc gcatcaacca aaccgttatt cattcgtgat tgcgcctgag cgaggcgaaa    10560

tacgcgatcg ctgttaaaag gacaattaca aacaggaatc gagtgcaacc ggcgcaggaa    10620

cactgccagc gcatcaacaa tattttcacc tgaatcagga tattcttcta atacctggaa    10680

cgctgttttt ccggggatcg cagtggtgag taaccatgca tcatcaggag tacggataaa    10740

atgcttgatg gtcggaagtg gcataaattc cgtcagccag tttagtctga ccatctcatc    10800

tgtaacatca ttggcaacgc tacctttgcc atgtttcaga aacaactctg gcgcatcggg    10860

cttcccatac aagcgataga ttgtcgcacc tgattgcccg acattatcgc gagcccattt    10920

atacccatat aaatcagcat ccatgttgga atttaatcgc ggcctcgacg tttcccgttg    10980

aatatggctc atattcttcc tttttcaata ttattgaagc atttatcagg gttattgtct    11040

catgagcgga tacatatttg aatgtattta gaaaaataaa caaatagggg tcagtgttac    11100

aaccaattaa ccaattctga acattatcgc gagcccattt atacctgaat atggctcata    11160

acaccccttg tttgcctggc ggcagtagcg cggtggtccc acctgacccc atgccgaact    11220

cagaagtgaa acgccgtagc gccgatggta gtgtggggac tccccatgcg agagtaggga    11280

actgccaggc atcaaataaa acgaaaggct cagtcgaaag actgggcctt tcgcccgggc    11340

taattagggg gtgtcgccct tattcgactc tatagtgaag ttcctattct ctagaaagta    11400

taggaacttc tgaagtgggg tcgacttaat taagg                               11435


<210>  34
<211>  11432
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  34
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

ctgaagagac agaaatatct ctaattccat gagcggtcat acgaggcaag agaagccgct      240

tagagcatgg acttagttag tttcagggat tggacagagt caagagctgg ggtgaggagg      300

ttaccctcgg taggggtgac acagatgtca accgcctatt ccctccacat gcatgtcctg      360

ccagaagaac ctgtccctgg gctgggaatc ttatattacc ttcctctcca atgagaagag      420

aagttcaagg ctcacagaca tgtgcataca caagctcaat gcactcaaga ttcccctcca      480

ccactcctgc ccccactacc tacaggagat tgactcctgc tgtgcacata agctgggata      540

atcagggttt ctaaacatca gcttcaaaag tccaatgtcc aaagtggtgg ggggccgggg      600

aacgaggtac tctttccata cccttggctt ttgtgtggcc tggagccgct gatatagaga      660

ttggagtggg acacgaggta ttcctttcaa aaacacaaag gcctatactt tgagccctcc      720

catttcaatc ccccaccatg cttcaccttt aagacctcca actccacttt gatcccagtt      780

ctcaggttca ggcctcacaa ggccaaaatc ctgaagttac ccttctcaaa ctcccttgcc      840

tttaacatca tcagaatcaa cctcctaccc ccactctgtc ccagcagcaa tagcctgcta      900

atcttttagc actaatcttt taggcactaa tctgctttcc aaactcttgg cacctgaact      960

atttataagc agtgttttat gcccccccac caaagaaccc tattcttttc ccatgacccc     1020

accaatcaaa acactcagag gactgtgggt ataagaggct ggggaggcag gcatagcagc     1080

ggccgccacc atggctaaga ttaacaccca gtactcacat ccatcccgca ctcacctcaa     1140

agtcaagacc tccgatcggg atctgaaccg ggctgagaat gggctgtcgc gcgcccactc     1200

gtcgtccgag gaaaccagca gcgtgctcca gccgggcatc gccatggaaa ctagggggct     1260

ggcggactcc ggacagggat ccttcactgg acagggtatt gcccggctga gcagactgat     1320

cttcctgctt cgccgctggg cggccagaca cgtgcaccat caggaccagg gacctgatag     1380

cttccccgac cgctttaggg gagccgagct gaaagaagtg tcaagccagg agtcaaacgc     1440

gcaggccaac gtcggcagcc aagagcctgc agaccgggga cgctcggcat ggccgctcgc     1500

aaagtgcaac actaacactt ccaacaacac cgaagaggaa aagaaaacca agaagaagga     1560

tgcaattgtg gtggaccctt cctccaacct gtactaccgc tggttgaccg ccatcgccct     1620

cccggtcttt tacaattggt atctccttat ctgccgggcc tgcttcgacg aactgcaatc     1680

agagtacctg atgctgtggc tggtgctgga ctatagcgcc gatgtgctct acgtcctgga     1740

tgtgctcgtg cgcgcccgga ccggattctt ggaacaaggc ctgatggtgt ccgacacgaa     1800

tagactgtgg cagcactata agaccacaac ccagttcaag cttgacgtgc tcagccttgt     1860

gccgactgac ctggcctacc tgaaagtcgg aactaactac ccggaagtca gattcaaccg     1920

actcctgaag ttcagcaggc tgttcgagtt ctttgaccgc accgagactc ggaccaacta     1980

ccctaacatg ttccggatcg gaaatctggt gctctacata ctgattatca tccattggaa     2040

cgcctgtatc tatttcgcca tttcgaagtt catcggtttc ggaaccgatt cctgggtgta     2100

ccccaacatc tcgatccccg aacacggtcg cctgtcccgg aagtacatct actccctgta     2160

ctggtccact ctgactctga ccacgatcgg ggaaacccct ccacccgtga aggacgaaga     2220

gtacctgttc gtggtggtgg acttcctggt cggagtgttg attttcgcca ccattgtggg     2280

aaacgtgggc tccatgatct ccaacatgaa cgcgtcgaga gctgagttcc aagccaagat     2340

cgactccatt aagcagtaca tgcagttcag aaaggtcacc aaggacctgg aaaccagggt     2400

catccgctgg ttcgactacc tgtgggccaa caaaaagact gtggacgaaa aggaagtgct     2460

gaagtcgctg ccggataagc tgaaggccga aatcgccatt aacgtgcacc ttgacaccct     2520

gaagaaagtc cggatcttcc aagactgtga agccggcctc ctggtggagc tcgtgctcaa     2580

gctgcggccc accgtgttca gcccgggaga ttacatttgc aagaagggcg atatcggcaa     2640

agagatgtac atcatcaacg agggaaagct ggccgtggtc gcggacgacg gcgtgaccca     2700

gttcgtggtg ctgtccgacg gatcctactt cggtgaaatc tcaatcctca acatcaaggg     2760

gtccaagtcc ggcaaccgga gaactgccaa cattcgctcc atcggataca gcgacctgtt     2820

ttgcctgtcc aaggatgacc tgatggaggc tctgactgag taccctgaag cgaagaaggc     2880

tttggaggaa aaggggcggc agattctgat gaaggacaat ttgatcgacg aggagctcgc     2940

acgggccggc gccgacccca aggatctcga agagaaggtc gaacagctgg gttcttcgct     3000

tgataccctg caaacccgat tcgcgcggct gctcgccgag tacaacgcga cccagatgaa     3060

gatgaagcag agactgtcac agttggaatc ccaagtcaag ggcggaggcg acaagccgct     3120

ggcggacggg gaagtgcccg gggacgccac caagactgag gacaagcagc agtgatcata     3180

gatcctgcct cgactgtgcc ttctagttgc cagccatctg ttgtttgccc ctcccccgtg     3240

ccttccttga ccctggaagg tgccactccc actgtccttt cctaataaaa tgaggaaatt     3300

gcatcgcatt gtctgagtag gtgtcattct attctggggg gtggggtggg gcaggacagc     3360

aagggggagg attgggaaga caatagcagg catgctgggg actcgagttc tacgtagata     3420

agtagcatgg cgggttaatc attaactaca aggaacccct agtgatggag ttggccactc     3480

cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc cgacgcccgg     3540

gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag ccttaattaa cctaaggaaa     3600

atgaagtgaa gttcctatac tttctagaga ataggaactt ctatagtgag tcgaataagg     3660

gcgacacaaa atttattcta aatgcataat aaatactgat aacatcttat agtttgtatt     3720

atattttgta ttatcgttga catgtataat tttgatatca aaaactgatt ttccctttat     3780

tattttcgag atttattttc ttaattctct ttaacaaact agaaatattg tatatacaaa     3840

aaatcataaa taatagatga atagtttaat tataggtgtt catcaatcga aaaagcaacg     3900

tatcttattt aaagtgcgtt gcttttttct catttataag gttaaataat tctcatatat     3960

caagcaaagt gacaggcgcc cttaaatatt ctgacaaatg ctctttccct aaactccccc     4020

cataaaaaaa cccgccgaag cgggttttta cgttatttgc ggattaacga ttactcgtta     4080

tcagaaccgc ccagggggcc cgagcttaac ctttttattt gggggagagg gaagtcatga     4140

aaaaactaac ctttgaaatt cgatctccag cacatcagca aaacgctatt cacgcagtac     4200

agcaaatcct tccagaccca accaaaccaa tcgtagtaac cattcaggaa cgcaaccgca     4260

gcttagacca aaacaggaag ctatgggcct gcttaggtga cgtctctcgt caggttgaat     4320

ggcatggtcg ctggctggat gcagaaagct ggaagtgtgt gtttaccgca gcattaaagc     4380

agcaggatgt tgttcctaac cttgccggga atggctttgt ggtaataggc cagtcaacca     4440

gcaggatgcg tgtaggcgaa tttgcggagc tattagagct tatacaggca ttcggtacag     4500

agcgtggcgt taagtggtca gacgaagcga gactggctct ggagtggaaa gcgagatggg     4560

gagacagggc tgcatgataa atgtcgttag tttctccggt ggcaggacgt cagcatattt     4620

gctctggcta atggagcaaa agcgacgggc aggtaaagac gtgcattacg ttttcatgga     4680

tacaggttgt gaacatccaa tgacatatcg gtttgtcagg gaagttgtga agttctggga     4740

tataccgctc accgtattgc aggttgatat caacccggag cttggacagc caaatggtta     4800

tacggtatgg gaaccaaagg atattcagac gcgaatgcct gttctgaagc catttatcga     4860

tatggtaaag aaatatggca ctccatacgt cggcggcgcg ttctgcactg acagattaaa     4920

actcgttccc ttcaccaaat actgtgatga ccatttcggg cgagggaatt acaccacgtg     4980

gattggcatc agagctgatg aaccgaagcg gctaaagcca aagcctggaa tcagatatct     5040

tgctgaactg tcagactttg agaaggaaga tatcctcgca tggtggaagc aacaaccatt     5100

cgatttgcaa ataccggaac atctcggtaa ctgcatattc tgcattaaaa aatcaacgca     5160

aaaaatcgga cttgcctgca aagatgagga gggattgcag cgtgttttta atgaggtcat     5220

cacgggatcc catgtgcgtg acggacatcg ggaaacgcca aaggagatta tgtaccgagg     5280

aagaatgtcg ctggacggta tcgcgaaaat gtattcagaa aatgattatc aagccctgta     5340

tcaggacatg gtacgagcta aaagattcga taccggctct tgttctgagt catgcgaaat     5400

atttggaggg cagcttgatt tcgacttcgg gagggaagct gcatgatgcg atgttatcgg     5460

tgcggtgaat gcaaagaaga taaccgcttc cgaccaaatc aaccttactg gaatcgatgg     5520

tgtctccggt gtgaaagaac accaacaggg gtgttaccac taccgcagga aaaggaggac     5580

gtgtggcgag acagcgacga agtatcaccg acataatctg cgaaaactgc aaataccttc     5640

caacgaaacg caccagaaat aaacccaagc caatcccaaa agaatctgac gtaaaaacct     5700

tcaactacac ggctcacctg tgggatatcc ggtggctaag acgtcgtgcg aggaaaacaa     5760

ggtgattgac caaaatcgaa gttacgaaca agaaagcgtc gagcgagctt taacgtgcgc     5820

taactgcggt cagaagctgc atgtgctgga agttcacgtg tgtgagcact gctgcgcaga     5880

actgatgagc gatccgaata gctcgatgca cgaggaagaa gatgatggct aaaccagcgc     5940

gaagacgatg taaaaacgat gaatgccggg aatggtttca ccctgcattc gctaatcagt     6000

ggtggtgctc tccagagtgt ggaaccaaga tagcactcga acgacgaagt aaagaacgcg     6060

aaaaagcgga aaaagcagca gagaagaaac gacgacgaga ggagcagaaa cagaaagata     6120

aacttaagat tcgaaaactc gccttaaagc cccgcagtta ctggattaaa caagcccaac     6180

aagccgtaaa cgccttcatc agagaaagag accgcgactt accatgtatc tcgtgcggaa     6240

cgctcacgtc tgctcagtgg gatgccggac attaccggac aactgctgcg gcacctcaac     6300

tccgatttaa tgaacgcaat attcacaagc aatgcgtggt gtgcaaccag cacaaaagcg     6360

gaaatctcgt tccgtatcgc gtcgaactga ttagccgcat cgggcaggaa gcagtagacg     6420

aaatcgaatc aaaccataac cgccatcgct ggactatcga agagtgcaag gcgatcaagg     6480

cagagtacca acagaaactc aaagacctgc gaaatagcag aagtgaggcc gcatgacgtt     6540

ctcagtaaaa accattccag acatgctcgt tgaagcatac ggaaatcaga cagaagtagc     6600

acgcagactg aaatgtagtc gcggtacggt cagaaaatac gttgatgata aagacgggaa     6660

aatgcacgcc atcgtcaacg acgttctcat ggttcatcgc ggatggagtg aaagagatgc     6720

gctattacga aaaaattgat ggcagcaaat accgaaatat ttgggtagtt ggcgatctgc     6780

acggatgcta cacgaacctg atgaacaaac tggatacgat tggattcgac aacaaaaaag     6840

acctgcttat ctcggtgggc gatttggttg atcgtggtgc agagaacgtt gaatgcctgg     6900

aattaatcac attcccctgg ttcagagctg tacgtggaaa ccatgagcaa atgatgattg     6960

atggcttatc agagcgtgga aacgttaatc actggctgct taatggcggt ggctggttct     7020

ttaatctcga ttacgacaaa gaaattctgg ctaaagctct tgcccataaa gcagatgaac     7080

ttccgttaat catcgaactg gtgagcaaag ataaaaaata tgttatctgc cacgccgatt     7140

atccctttga cgaatacgag tttggaaagc cagttgatca tcagcaggta atctggaacc     7200

gcgaacgaat cagcaactca caaaacggga tcgtgaaaga aatcaaaggc gcggacacgt     7260

tcatctttgg tcatacgcca gcagtgaaac cactcaagtt tgccaaccaa atgtatatcg     7320

ataccggcgc agtgttctgc ggaaacctaa cattgattca ggtacaggga gaaggcgcat     7380

gagactcgaa agcgtagcta aatttcattc gccaaaaagc ccgatgatga gcgactcacc     7440

acgggccacg gcttctgact ctctttccgg tactgatgtg atggctgcta tggggatggc     7500

gcaatcacaa gccggattcg gtatggctgc attctgcggt aagcacgaac tcagccagaa     7560

cgacaaacaa aaggctatca actatctgat gcaatttgca cacaaggtat cggggaaata     7620

ccgtggtgtg gcaaagcttg aaggaaatac taaggcaaag gtactgcaag tgctcgcaac     7680

attcgcttat gcggattatt gccgtagtgc cgcgacgccg ggggcaagat gcagagattg     7740

ccatggtaca ggccgtgcgg ttgatattgc caaaacagag ctgtggggga gagttgtcga     7800

gaaagagtgc ggaagatgca aaggcgtcgg ctattcaagg atgccagcaa gcgcagcata     7860

tcgcgctgtg acgatgctaa tcccaaacct tacccaaccc acctggtcac gcactgttaa     7920

gccgctgtat gacgctctgg tggtgcaatg ccacaaagaa gagtcaatcg cagacaacat     7980

tttgaatgcg gtcacacgtt agcagcatga ttgccacgga tggcaacata ttaacggcat     8040

gatattgact tattgaataa aattgggtaa atttgactca acgatgggtt aattcgctcg     8100

ttgtggtagt gagatgaaaa gaggcggcgc ttactaccga ttccgcctag ttggtcactt     8160

cgacgtatcg tctggaactc caaccatcgc aggcagagag gtctgcaaaa tgcaatcccg     8220

aaacagttcg caggtaatag ttagagcctg cataacggtt tcgggatttt ttatatctgc     8280

acaacaggta agagcattga gtcgataatc gtgaagagtc ggcgagcctg gttagccagt     8340

gctctttccg ttgtgctgaa ttaagcgaat accggaagca gaaccggatc accaaatgcg     8400

tacaggcgtc atcgccgccc agcaacagca caacccaaac tgagccgtag ccactgtctg     8460

tcctgaattc attagtaata gttacgctgc ggccttttac acatgacctt cgtgaaagcg     8520

ggtggcagga ggtcgcgcta acaacctcct gccgttttgc ccgtgcatat cggtcacgaa     8580

caaatctgat tactaaacac agtagcctgg atttgttcta tcagtaatcg accttattcc     8640

taattaaata gagcaaatcc ccttattggg ggtaagacat gaagatgcca gaaaaacatg     8700

acctgttggc cgccattctc gcggcaaagg aacaaggcat cggggcaatc cttgcgtttg     8760

caatggcgta ccttcgcggc agatataatg gcggtgcgtt tacaaaaaca gtaatcgacg     8820

caacgatgtg cgccattatc gcctggttca ttcgtgacct tctcgacttc gccggactaa     8880

gtagcaatct cgcttatata acgagcgtgt ttatcggcta catcggtact gactcgattg     8940

gttcgcttat caaacgcttc gctgctaaaa aagccggagt agaagatggt agaaatcaat     9000

aatcaacgta aggcgttcct cgatatgctg gcgtggtcgg agggaactga taacggacgt     9060

cagaaaacca gaaatcatgg ttatgacgtc attgtaggcg gagagctatt tactgattac     9120

tccgatcacc ctcgcaaact tgtcacgcta aacccaaaac tcaaatcaac aggcgcttaa     9180

gactggccgt cgttttacaa cacagaaaga gtttgtagaa acgcaaaaag gccatccgtc     9240

aggggccttc tgcttagttt gatgcctggc agttccctac tctcgccttc cgcttcctcg     9300

ctcactgact cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag     9360

gcggtaatac ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa     9420

ggccagcaaa aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc     9480

cgcccccctg acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca     9540

ggactataaa gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg     9600

accctgccgc ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct     9660

catagctcac gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt     9720

gtgcacgaac cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag     9780

tccaacccgg taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc     9840

agagcgaggt atgtaggcgg tgctacagag ttcttgaagt ggtgggctaa ctacggctac     9900

actagaagaa cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga     9960

gttggtagct cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc    10020

aagcagcaga ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg    10080

gggtctgacg ctcagtggaa cgacgcgcgc gtaactcacg ttaagggatt ttggtcatga    10140

gcttgcgccg tcccgtcaag tcagcgtaat gctctgcttt tagaaaaact catcgagcat    10200

caaatgaaac tgcaatttat tcatatcagg attatcaata ccatattttt gaaaaagccg    10260

tttctgtaat gaaggagaaa actcaccgag gcagttccat aggatggcaa gatcctggta    10320

tcggtctgcg attccgactc gtccaacatc aatacaacct attaatttcc cctcgtcaaa    10380

aataaggtta tcaagtgaga aatcaccatg agtgacgact gaatccggtg agaatggcaa    10440

aagtttatgc atttctttcc agacttgttc aacaggccag ccattacgct cgtcatcaaa    10500

atcactcgca tcaaccaaac cgttattcat tcgtgattgc gcctgagcga ggcgaaatac    10560

gcgatcgctg ttaaaaggac aattacaaac aggaatcgag tgcaaccggc gcaggaacac    10620

tgccagcgca tcaacaatat tttcacctga atcaggatat tcttctaata cctggaacgc    10680

tgtttttccg gggatcgcag tggtgagtaa ccatgcatca tcaggagtac ggataaaatg    10740

cttgatggtc ggaagtggca taaattccgt cagccagttt agtctgacca tctcatctgt    10800

aacatcattg gcaacgctac ctttgccatg tttcagaaac aactctggcg catcgggctt    10860

cccatacaag cgatagattg tcgcacctga ttgcccgaca ttatcgcgag cccatttata    10920

cccatataaa tcagcatcca tgttggaatt taatcgcggc ctcgacgttt cccgttgaat    10980

atggctcata ttcttccttt ttcaatatta ttgaagcatt tatcagggtt attgtctcat    11040

gagcggatac atatttgaat gtatttagaa aaataaacaa ataggggtca gtgttacaac    11100

caattaacca attctgaaca ttatcgcgag cccatttata cctgaatatg gctcataaca    11160

ccccttgttt gcctggcggc agtagcgcgg tggtcccacc tgaccccatg ccgaactcag    11220

aagtgaaacg ccgtagcgcc gatggtagtg tggggactcc ccatgcgaga gtagggaact    11280

gccaggcatc aaataaaacg aaaggctcag tcgaaagact gggcctttcg cccgggctaa    11340

ttagggggtg tcgcccttat tcgactctat agtgaagttc ctattctcta gaaagtatag    11400

gaacttctga agtggggtcg acttaattaa gg                                  11432


<210>  35
<211>  11600
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  35
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

ctgaagagac agaaatatct ctaattccat gagcggtcat acgaggcaag agaagccgct      240

tagagcatgg acttagttag tttcagggat tggacagagt caagagctgg ggtgaggagg      300

ttaccctcgg taggggtgac acagatgtca accgcctatt ccctccacat gcatgtcctg      360

ccagaagaac ctgtccctgg gctgggaatc ttatattacc ttcctctcca atgagaagag      420

aagttcaagg ctcacagaca tgtgcataca caagctcaat gcactcaaga ttcccctcca      480

ccactcctgc ccccactacc tacaggagat tgactcctgc tgtgcacata agctgggata      540

atcagggttt ctaaacatca gcttcaaaag tccaatgtcc aaagtggtgg ggggccgggg      600

aacgaggtac tctttccata cccttggctt ttgtgtggcc tggagccgct gatatagaga      660

ttggagtggg acacgaggta ttcctttcaa aaacacaaag gcctatactt tgagccctcc      720

catttcaatc ccccaccatg cttcaccttt aagacctcca actccacttt gatcccagtt      780

ctcaggttca ggcctcacaa ggccaaaatc ctgaagttac ccttctcaaa ctcccttgcc      840

tttaacatca tcagaatcaa cctcctaccc ccactctgtc ccagcagcaa tagcctgcta      900

atcttttagc actaatcttt taggcactaa tctgctttcc aaactcttgg cacctgaact      960

atttataagc agtgttttat gcccccccac caaagaaccc tattcttttc ccatgacccc     1020

accaatcaaa acactcagag gactgtgggt ataagaggct ggggaggcag gcatagcagc     1080

ggccgccacc atggctaaga ttaacaccca gtactcacat ccatcccgca ctcacctcaa     1140

agtcaagacc tccgatcggg atctgaaccg ggctgagaat gggctgtcgc gcgcccactc     1200

gtcgtccgag gaaaccagca gcgtgctcca gccgggcatc gccatggaaa ctagggggct     1260

ggcggactcc ggacagggat ccttcactgg acagggtatt gcccggttcg ggcggattca     1320

gaagaagtcc cagccggaga aggtcgtgcg ggctgccagc aggggcaggc cactcattgg     1380

ttggacacag tggtgcgctg aggatggtgg agatgaatcg gaaatggcac tggccggctc     1440

tcccggatgc agctcgggcc cccaagggag actgagcaga ctgatcttcc tgcttcgccg     1500

ctgggcggcc agacacgtgc accatcagga ccagggacct gatagcttcc ccgaccgctt     1560

taggggagcc gagctgaaag aagtgtcaag ccaggagtca aacgcgcagg ccaacgtcgg     1620

cagccaagag cctgcagacc ggggacgctc ggcatggccg ctcgcaaagt gcaacactaa     1680

cacttccaac aacaccgaag aggaaaagaa aaccaagaag aaggatgcaa ttgtggtgga     1740

cccttcctcc aacctgtact accgctggtt gaccgccatc gccctcccgg tcttttacaa     1800

ttggtatctc cttatctgcc gggcctgctt cgacgaactg caatcagagt acctgatgct     1860

gtggctggtg ctggactata gcgccgatgt gctctacgtc ctggatgtgc tcgtgcgcgc     1920

ccggaccgga ttcttggaac aaggcctgat ggtgtccgac acgaatagac tgtggcagca     1980

ctataagacc acaacccagt tcaagcttga cgtgctcagc cttgtgccga ctgacctggc     2040

ctacctgaaa gtcggaacta actacccgga agtcagattc aaccgactcc tgaagttcag     2100

caggctgttc gagttctttg accgcaccga gactcggacc aactacccta acatgttccg     2160

gatcggaaat ctggtgctct acatactgat tatcatccat tggaacgcct gtatctattt     2220

cgccatttcg aagttcatcg gtttcggaac cgattcctgg gtgtacccca acatctcgat     2280

ccccgaacac ggtcgcctgt cccggaagta catctactcc ctgtactggt ccactctgac     2340

tctgaccacg atcggggaaa cccctccacc cgtgaaggac gaagagtacc tgttcgtggt     2400

ggtggacttc ctggtcggag tgttgatttt cgccaccatt gtgggaaacg tgggctccat     2460

gatctccaac atgaacgcgt cgagagctga gttccaagcc aagatcgact ccattaagca     2520

gtacatgcag ttcagaaagg tcaccaagga cctggaaacc agggtcatcc gctggttcga     2580

ctacctgtgg gccaacaaaa agactgtgga cgaaaaggaa gtgctgaagt cgctgccgga     2640

taagctgaag gccgaaatcg ccattaacgt gcaccttgac accctgaaga aagtccggat     2700

cttccaagac tgtgaagccg gcctcctggt ggagctcgtg ctcaagctgc ggcccaccgt     2760

gttcagcccg ggagattaca tttgcaagaa gggcgatatc ggcaaagaga tgtacatcat     2820

caacgaggga aagctggccg tggtcgcgga cgacggcgtg acccagttcg tggtgctgtc     2880

cgacggatcc tacttcggtg aaatctcaat cctcaacatc aaggggtcca agtccggcaa     2940

ccggagaact gccaacattc gctccatcgg atacagcgac ctgttttgcc tgtccaagga     3000

tgacctgatg gaggctctga ctgagtaccc tgaagcgaag aaggctttgg aggaaaaggg     3060

gcggcagatt ctgatgaagg acaatttgat cgacgaggag ctcgcacggg ccggcgccga     3120

ccccaaggat ctcgaagaga aggtcgaaca gctgggttct tcgcttgata ccctgcaaac     3180

ccgattcgcg cggctgctcg ccgagtacaa cgcgacccag atgaagatga agcagagact     3240

gtcacagttg gaatcccaag tcaagggcgg aggcgacaag ccgctggcgg acggggaagt     3300

gcccggggac gccaccaaga ctgaggacaa gcagcagtga tcatagatcg atctgcctcg     3360

actgtgcctt ctagttgcca gccatctgtt gtttgcccct cccccgtgcc ttccttgacc     3420

ctggaaggtg ccactcccac tgtcctttcc taataaaatg aggaaattgc atcgcattgt     3480

ctgagtaggt gtcattctat tctggggggt ggggtggggc aggacagcaa gggggaggat     3540

tgggaagaca atagcaggca tgctggggac tcgagttcta cgtagataag tagcatggcg     3600

ggttaatcat taactacaag gaacccctag tgatggagtt ggccactccc tctctgcgcg     3660

ctcgctcgct cactgaggcc gggcgaccaa aggtcgcccg acgcccgggc tttgcccggg     3720

cggcctcagt gagcgagcga gcgcgcagcc ttaattaacc taaggaaaat gaagtgaagt     3780

tcctatactt tctagagaat aggaacttct atagtgagtc gaataagggc gacacaaaat     3840

ttattctaaa tgcataataa atactgataa catcttatag tttgtattat attttgtatt     3900

atcgttgaca tgtataattt tgatatcaaa aactgatttt ccctttatta ttttcgagat     3960

ttattttctt aattctcttt aacaaactag aaatattgta tatacaaaaa atcataaata     4020

atagatgaat agtttaatta taggtgttca tcaatcgaaa aagcaacgta tcttatttaa     4080

agtgcgttgc ttttttctca tttataaggt taaataattc tcatatatca agcaaagtga     4140

caggcgccct taaatattct gacaaatgct ctttccctaa actcccccca taaaaaaacc     4200

cgccgaagcg ggtttttacg ttatttgcgg attaacgatt actcgttatc agaaccgccc     4260

agggggcccg agcttaacct ttttatttgg gggagaggga agtcatgaaa aaactaacct     4320

ttgaaattcg atctccagca catcagcaaa acgctattca cgcagtacag caaatccttc     4380

cagacccaac caaaccaatc gtagtaacca ttcaggaacg caaccgcagc ttagaccaaa     4440

acaggaagct atgggcctgc ttaggtgacg tctctcgtca ggttgaatgg catggtcgct     4500

ggctggatgc agaaagctgg aagtgtgtgt ttaccgcagc attaaagcag caggatgttg     4560

ttcctaacct tgccgggaat ggctttgtgg taataggcca gtcaaccagc aggatgcgtg     4620

taggcgaatt tgcggagcta ttagagctta tacaggcatt cggtacagag cgtggcgtta     4680

agtggtcaga cgaagcgaga ctggctctgg agtggaaagc gagatgggga gacagggctg     4740

catgataaat gtcgttagtt tctccggtgg caggacgtca gcatatttgc tctggctaat     4800

ggagcaaaag cgacgggcag gtaaagacgt gcattacgtt ttcatggata caggttgtga     4860

acatccaatg acatatcggt ttgtcaggga agttgtgaag ttctgggata taccgctcac     4920

cgtattgcag gttgatatca acccggagct tggacagcca aatggttata cggtatggga     4980

accaaaggat attcagacgc gaatgcctgt tctgaagcca tttatcgata tggtaaagaa     5040

atatggcact ccatacgtcg gcggcgcgtt ctgcactgac agattaaaac tcgttccctt     5100

caccaaatac tgtgatgacc atttcgggcg agggaattac accacgtgga ttggcatcag     5160

agctgatgaa ccgaagcggc taaagccaaa gcctggaatc agatatcttg ctgaactgtc     5220

agactttgag aaggaagata tcctcgcatg gtggaagcaa caaccattcg atttgcaaat     5280

accggaacat ctcggtaact gcatattctg cattaaaaaa tcaacgcaaa aaatcggact     5340

tgcctgcaaa gatgaggagg gattgcagcg tgtttttaat gaggtcatca cgggatccca     5400

tgtgcgtgac ggacatcggg aaacgccaaa ggagattatg taccgaggaa gaatgtcgct     5460

ggacggtatc gcgaaaatgt attcagaaaa tgattatcaa gccctgtatc aggacatggt     5520

acgagctaaa agattcgata ccggctcttg ttctgagtca tgcgaaatat ttggagggca     5580

gcttgatttc gacttcggga gggaagctgc atgatgcgat gttatcggtg cggtgaatgc     5640

aaagaagata accgcttccg accaaatcaa ccttactgga atcgatggtg tctccggtgt     5700

gaaagaacac caacaggggt gttaccacta ccgcaggaaa aggaggacgt gtggcgagac     5760

agcgacgaag tatcaccgac ataatctgcg aaaactgcaa ataccttcca acgaaacgca     5820

ccagaaataa acccaagcca atcccaaaag aatctgacgt aaaaaccttc aactacacgg     5880

ctcacctgtg ggatatccgg tggctaagac gtcgtgcgag gaaaacaagg tgattgacca     5940

aaatcgaagt tacgaacaag aaagcgtcga gcgagcttta acgtgcgcta actgcggtca     6000

gaagctgcat gtgctggaag ttcacgtgtg tgagcactgc tgcgcagaac tgatgagcga     6060

tccgaatagc tcgatgcacg aggaagaaga tgatggctaa accagcgcga agacgatgta     6120

aaaacgatga atgccgggaa tggtttcacc ctgcattcgc taatcagtgg tggtgctctc     6180

cagagtgtgg aaccaagata gcactcgaac gacgaagtaa agaacgcgaa aaagcggaaa     6240

aagcagcaga gaagaaacga cgacgagagg agcagaaaca gaaagataaa cttaagattc     6300

gaaaactcgc cttaaagccc cgcagttact ggattaaaca agcccaacaa gccgtaaacg     6360

ccttcatcag agaaagagac cgcgacttac catgtatctc gtgcggaacg ctcacgtctg     6420

ctcagtggga tgccggacat taccggacaa ctgctgcggc acctcaactc cgatttaatg     6480

aacgcaatat tcacaagcaa tgcgtggtgt gcaaccagca caaaagcgga aatctcgttc     6540

cgtatcgcgt cgaactgatt agccgcatcg ggcaggaagc agtagacgaa atcgaatcaa     6600

accataaccg ccatcgctgg actatcgaag agtgcaaggc gatcaaggca gagtaccaac     6660

agaaactcaa agacctgcga aatagcagaa gtgaggccgc atgacgttct cagtaaaaac     6720

cattccagac atgctcgttg aagcatacgg aaatcagaca gaagtagcac gcagactgaa     6780

atgtagtcgc ggtacggtca gaaaatacgt tgatgataaa gacgggaaaa tgcacgccat     6840

cgtcaacgac gttctcatgg ttcatcgcgg atggagtgaa agagatgcgc tattacgaaa     6900

aaattgatgg cagcaaatac cgaaatattt gggtagttgg cgatctgcac ggatgctaca     6960

cgaacctgat gaacaaactg gatacgattg gattcgacaa caaaaaagac ctgcttatct     7020

cggtgggcga tttggttgat cgtggtgcag agaacgttga atgcctggaa ttaatcacat     7080

tcccctggtt cagagctgta cgtggaaacc atgagcaaat gatgattgat ggcttatcag     7140

agcgtggaaa cgttaatcac tggctgctta atggcggtgg ctggttcttt aatctcgatt     7200

acgacaaaga aattctggct aaagctcttg cccataaagc agatgaactt ccgttaatca     7260

tcgaactggt gagcaaagat aaaaaatatg ttatctgcca cgccgattat ccctttgacg     7320

aatacgagtt tggaaagcca gttgatcatc agcaggtaat ctggaaccgc gaacgaatca     7380

gcaactcaca aaacgggatc gtgaaagaaa tcaaaggcgc ggacacgttc atctttggtc     7440

atacgccagc agtgaaacca ctcaagtttg ccaaccaaat gtatatcgat accggcgcag     7500

tgttctgcgg aaacctaaca ttgattcagg tacagggaga aggcgcatga gactcgaaag     7560

cgtagctaaa tttcattcgc caaaaagccc gatgatgagc gactcaccac gggccacggc     7620

ttctgactct ctttccggta ctgatgtgat ggctgctatg gggatggcgc aatcacaagc     7680

cggattcggt atggctgcat tctgcggtaa gcacgaactc agccagaacg acaaacaaaa     7740

ggctatcaac tatctgatgc aatttgcaca caaggtatcg gggaaatacc gtggtgtggc     7800

aaagcttgaa ggaaatacta aggcaaaggt actgcaagtg ctcgcaacat tcgcttatgc     7860

ggattattgc cgtagtgccg cgacgccggg ggcaagatgc agagattgcc atggtacagg     7920

ccgtgcggtt gatattgcca aaacagagct gtgggggaga gttgtcgaga aagagtgcgg     7980

aagatgcaaa ggcgtcggct attcaaggat gccagcaagc gcagcatatc gcgctgtgac     8040

gatgctaatc ccaaacctta cccaacccac ctggtcacgc actgttaagc cgctgtatga     8100

cgctctggtg gtgcaatgcc acaaagaaga gtcaatcgca gacaacattt tgaatgcggt     8160

cacacgttag cagcatgatt gccacggatg gcaacatatt aacggcatga tattgactta     8220

ttgaataaaa ttgggtaaat ttgactcaac gatgggttaa ttcgctcgtt gtggtagtga     8280

gatgaaaaga ggcggcgctt actaccgatt ccgcctagtt ggtcacttcg acgtatcgtc     8340

tggaactcca accatcgcag gcagagaggt ctgcaaaatg caatcccgaa acagttcgca     8400

ggtaatagtt agagcctgca taacggtttc gggatttttt atatctgcac aacaggtaag     8460

agcattgagt cgataatcgt gaagagtcgg cgagcctggt tagccagtgc tctttccgtt     8520

gtgctgaatt aagcgaatac cggaagcaga accggatcac caaatgcgta caggcgtcat     8580

cgccgcccag caacagcaca acccaaactg agccgtagcc actgtctgtc ctgaattcat     8640

tagtaatagt tacgctgcgg ccttttacac atgaccttcg tgaaagcggg tggcaggagg     8700

tcgcgctaac aacctcctgc cgttttgccc gtgcatatcg gtcacgaaca aatctgatta     8760

ctaaacacag tagcctggat ttgttctatc agtaatcgac cttattccta attaaataga     8820

gcaaatcccc ttattggggg taagacatga agatgccaga aaaacatgac ctgttggccg     8880

ccattctcgc ggcaaaggaa caaggcatcg gggcaatcct tgcgtttgca atggcgtacc     8940

ttcgcggcag atataatggc ggtgcgttta caaaaacagt aatcgacgca acgatgtgcg     9000

ccattatcgc ctggttcatt cgtgaccttc tcgacttcgc cggactaagt agcaatctcg     9060

cttatataac gagcgtgttt atcggctaca tcggtactga ctcgattggt tcgcttatca     9120

aacgcttcgc tgctaaaaaa gccggagtag aagatggtag aaatcaataa tcaacgtaag     9180

gcgttcctcg atatgctggc gtggtcggag ggaactgata acggacgtca gaaaaccaga     9240

aatcatggtt atgacgtcat tgtaggcgga gagctattta ctgattactc cgatcaccct     9300

cgcaaacttg tcacgctaaa cccaaaactc aaatcaacag gcgcttaaga ctggccgtcg     9360

ttttacaaca cagaaagagt ttgtagaaac gcaaaaaggc catccgtcag gggccttctg     9420

cttagtttga tgcctggcag ttccctactc tcgccttccg cttcctcgct cactgactcg     9480

ctgcgctcgg tcgttcggct gcggcgagcg gtatcagctc actcaaaggc ggtaatacgg     9540

ttatccacag aatcagggga taacgcagga aagaacatgt gagcaaaagg ccagcaaaag     9600

gccaggaacc gtaaaaaggc cgcgttgctg gcgtttttcc ataggctccg cccccctgac     9660

gagcatcaca aaaatcgacg ctcaagtcag aggtggcgaa acccgacagg actataaaga     9720

taccaggcgt ttccccctgg aagctccctc gtgcgctctc ctgttccgac cctgccgctt     9780

accggatacc tgtccgcctt tctcccttcg ggaagcgtgg cgctttctca tagctcacgc     9840

tgtaggtatc tcagttcggt gtaggtcgtt cgctccaagc tgggctgtgt gcacgaaccc     9900

cccgttcagc ccgaccgctg cgccttatcc ggtaactatc gtcttgagtc caacccggta     9960

agacacgact tatcgccact ggcagcagcc actggtaaca ggattagcag agcgaggtat    10020

gtaggcggtg ctacagagtt cttgaagtgg tgggctaact acggctacac tagaagaaca    10080

gtatttggta tctgcgctct gctgaagcca gttaccttcg gaaaaagagt tggtagctct    10140

tgatccggca aacaaaccac cgctggtagc ggtggttttt ttgtttgcaa gcagcagatt    10200

acgcgcagaa aaaaaggatc tcaagaagat cctttgatct tttctacggg gtctgacgct    10260

cagtggaacg acgcgcgcgt aactcacgtt aagggatttt ggtcatgagc ttgcgccgtc    10320

ccgtcaagtc agcgtaatgc tctgctttta gaaaaactca tcgagcatca aatgaaactg    10380

caatttattc atatcaggat tatcaatacc atatttttga aaaagccgtt tctgtaatga    10440

aggagaaaac tcaccgaggc agttccatag gatggcaaga tcctggtatc ggtctgcgat    10500

tccgactcgt ccaacatcaa tacaacctat taatttcccc tcgtcaaaaa taaggttatc    10560

aagtgagaaa tcaccatgag tgacgactga atccggtgag aatggcaaaa gtttatgcat    10620

ttctttccag acttgttcaa caggccagcc attacgctcg tcatcaaaat cactcgcatc    10680

aaccaaaccg ttattcattc gtgattgcgc ctgagcgagg cgaaatacgc gatcgctgtt    10740

aaaaggacaa ttacaaacag gaatcgagtg caaccggcgc aggaacactg ccagcgcatc    10800

aacaatattt tcacctgaat caggatattc ttctaatacc tggaacgctg tttttccggg    10860

gatcgcagtg gtgagtaacc atgcatcatc aggagtacgg ataaaatgct tgatggtcgg    10920

aagtggcata aattccgtca gccagtttag tctgaccatc tcatctgtaa catcattggc    10980

aacgctacct ttgccatgtt tcagaaacaa ctctggcgca tcgggcttcc catacaagcg    11040

atagattgtc gcacctgatt gcccgacatt atcgcgagcc catttatacc catataaatc    11100

agcatccatg ttggaattta atcgcggcct cgacgtttcc cgttgaatat ggctcatatt    11160

cttccttttt caatattatt gaagcattta tcagggttat tgtctcatga gcggatacat    11220

atttgaatgt atttagaaaa ataaacaaat aggggtcagt gttacaacca attaaccaat    11280

tctgaacatt atcgcgagcc catttatacc tgaatatggc tcataacacc ccttgtttgc    11340

ctggcggcag tagcgcggtg gtcccacctg accccatgcc gaactcagaa gtgaaacgcc    11400

gtagcgccga tggtagtgtg gggactcccc atgcgagagt agggaactgc caggcatcaa    11460

ataaaacgaa aggctcagtc gaaagactgg gcctttcgcc cgggctaatt agggggtgtc    11520

gcccttattc gactctatag tgaagttcct attctctaga aagtatagga acttctgaag    11580

tggggtcgac ttaattaagg                                                11600


<210>  36
<211>  12209
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  36
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg      240

cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt      300

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      360

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      420

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      480

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattaa      540

catggtcgag gtgagcccca cgttctgctt cactctcccc atctcccccc cctccccacc      600

cccaattttg tatttattta ttttttaatt attttgtgca gcgatggggg cggggggggg      660

gggggggcgc gcgccaggcg gggcggggcg gggcgagggg cggggcgggg cgaggcggag      720

aggtgcggcg gcagccaatc agagcggcgc gctccgaaag tttcctttta tggcgaggcg      780

gcggcggcgg cggccctata aaaagcgaag cgcgcggcgg gcggggagtc gctgcgacgc      840

tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg      900

accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag      960

cgcttggttt aatgacggct tgtttctttt ctgtggctgc gtgaaagcct tgaggggctc     1020

cgggagggcc ctttgtgcgg ggggagcggc tcggggggtg cgtgcgtgtg tgtgtgcgtg     1080

gggagcgccg cgtgcggctc cgcgctgccc ggcggctgtg agcgctgcgg gcgcggcgcg     1140

gggctttgtg cgctccgcag tgtgcgcgag gggagcgcgg ccgggggcgg tgccccgcgg     1200

tgcggggggg gctgcgaggg gaacaaaggc tgcgtgcggg gtgtgtgcgt gggggggtga     1260

gcagggggtg tgggcgcgtc ggtcgggctg caaccccccc tgcacccccc tccccgagtt     1320

gctgagcacg gcccggcttc gggtgcgggg ctccgtacgg ggcgtggcgc ggggctcgcc     1380

gtgccgggcg gggggtggcg gcaggtgggg gtgccgggcg gggcggggcc gcctcgggcc     1440

ggggagggct cgggggaggg gcgcggcggc ccccggagcg ccggcggctg tcgaggcgcg     1500

gcgagccgca gccattgcct tttatggtaa tcgtgcgaga gggcgcaggg acttcctttg     1560

tcccaaatct gtgcggagcc gaaatctggg aggcgccgcc gcaccccctc tagcgggcgc     1620

ggggcgaagc ggtgcggcgc cggcaggaag gaaatgggcg gggagggcct tcgtgcgtcg     1680

ccgcgccgcc gtccccttct ccctctccag cctcggggct gtccgcgggg ggacggctgc     1740

cttcgggggg gacggggcag ggcggggttc ggcttctggc gtgtgaccgg cggctctaga     1800

caattgtact aaccttcttc tctttcctct cctgacaggt tggtgtacac tagcggccgc     1860

caccatggcc aagatcaaca cccaatactc ccacccctcc aggacccacc tcaaggtaaa     1920

gacctcagac cgggatctca atcgcgctga aaatggcctc agcagagccc actcgtcaag     1980

tgaggagaca tcgtcagtgc tgcagccggg gatcgccatg gagaccagag gactggctga     2040

ctccgggcag ggctccttca ccggccaggg gatcgccagg ctgtcgcgcc tcatcttctt     2100

gctgcgcagg tgggctgcca ggcatgtgca ccaccaggac cagggaccgg actcttttcc     2160

tgatcgtttc cgtggagccg agcttaagga ggtgtccagc caagaaagca atgcccaggc     2220

aaatgtgggc agccaggagc cagcagacag agggagaagc gcctggcccc tggccaaatg     2280

caacactaac accagcaaca acacggagga ggagaagaag acgaaaaaga aggatgcgat     2340

cgtggtggac ccgtccagca acctgtacta ccgctggctg accgccatcg ccctgcctgt     2400

cttctataac tggtatctgc ttatttgcag ggcctgtttc gatgagctgc agtccgagta     2460

cctgatgctg tggctggtcc tggactactc ggcagatgtc ctgtatgtct tggatgtgct     2520

tgtacgagct cggacaggtt ttcttgagca aggcttaatg gtcagtgata ccaacaggct     2580

gtggcagcat tacaagacga ccacgcagtt caagctggat gtgttgtccc tggtccccac     2640

cgacctggct tacttaaagg tgggcacaaa ctacccagaa gtgaggttca accgcctact     2700

gaagttttcc cggctctttg aattctttga ccgcacagag acaaggacca actaccccaa     2760

tatgttcagg attgggaact tggtcttgta cattctcatc atcatccact ggaatgcctg     2820

catctacttt gccatttcca agttcattgg ttttgggaca gactcctggg tctacccaaa     2880

catctcaatc ccagagcatg ggcgcctctc caggaagtac atttacagtc tctactggtc     2940

caccttgacc cttaccacca ttggtgagac cccacccccc gtgaaagatg aggagtatct     3000

ctttgtggtc gtagacttct tggtgggtgt tctgattttt gccaccattg tgggcaatgt     3060

gggctccatg atctcgaata tgaatgcctc acgggcagag ttccaggcca agattgattc     3120

catcaagcag tacatgcagt tccgcaaggt caccaaggac ttggagacgc gggttatccg     3180

gtggtttgac tacctgtggg ccaacaagaa gacggtggat gagaaggagg tgctcaagag     3240

cctcccagac aagctgaagg ctgagatcgc catcaacgtg cacctggaca cgctgaagaa     3300

ggttcgcatc ttccaggact gtgaggcagg gctgctggtg gagctggtgc tgaagctgcg     3360

acccactgtg ttcagccctg gggattatat ctgcaagaag ggagatattg ggaaggagat     3420

gtacatcatc aacgagggca agctggccgt ggtggctgat gatggggtca cccagttcgt     3480

ggtcctcagc gatggcagct acttcgggga gatcagcatt ctgaacatca aggggagcaa     3540

gtcggggaac cgcaggacgg ccaacatccg cagcattggc tactcagacc tgttctgcct     3600

ctcaaaggac gatctcatgg aggccctcac cgagtacccc gaagccaaga aggccctgga     3660

ggagaaagga cggcagatcc tgatgaaaga caacctgatc gatgaggagc tggccagggc     3720

gggcgcggac cccaaggacc ttgaggagaa agtggagcag ctggggtcct ccctggacac     3780

cctgcagacc aggtttgcac gcctcctggc tgagtacaac gccacccaga tgaagatgaa     3840

gcagcgtctc agccaactgg aaagccaggt gaagggtggt ggggacaagc ccctggctga     3900

tggggaagtt cccggggatg ctacaaaaac agaggacaaa caacagtgat catagatcga     3960

tctgcctcga ctgtgccttc tagttgccag ccatctgttg tttgcccctc ccccgtgcct     4020

tccttgaccc tggaaggtgc cactcccact gtcctttcct aataaaatga ggaaattgca     4080

tcgcattgtc tgagtaggtg tcattctatt ctggggggtg gggtggggca ggacagcaag     4140

ggggaggatt gggaagacaa tagcaggcat gctggggact cgagttctac gtagataagt     4200

agcatggcgg gttaatcatt aactacaagg aacccctagt gatggagttg gccactccct     4260

ctctgcgcgc tcgctcgctc actgaggccg ggcgaccaaa ggtcgcccga cgcccgggct     4320

ttgcccgggc ggcctcagtg agcgagcgag cgcgcagcct taattaacct aaggaaaatg     4380

aagtgaagtt cctatacttt ctagagaata ggaacttcta tagtgagtcg aataagggcg     4440

acacaaaatt tattctaaat gcataataaa tactgataac atcttatagt ttgtattata     4500

ttttgtatta tcgttgacat gtataatttt gatatcaaaa actgattttc cctttattat     4560

tttcgagatt tattttctta attctcttta acaaactaga aatattgtat atacaaaaaa     4620

tcataaataa tagatgaata gtttaattat aggtgttcat caatcgaaaa agcaacgtat     4680

cttatttaaa gtgcgttgct tttttctcat ttataaggtt aaataattct catatatcaa     4740

gcaaagtgac aggcgccctt aaatattctg acaaatgctc tttccctaaa ctccccccat     4800

aaaaaaaccc gccgaagcgg gtttttacgt tatttgcgga ttaacgatta ctcgttatca     4860

gaaccgccca gggggcccga gcttaacctt tttatttggg ggagagggaa gtcatgaaaa     4920

aactaacctt tgaaattcga tctccagcac atcagcaaaa cgctattcac gcagtacagc     4980

aaatccttcc agacccaacc aaaccaatcg tagtaaccat tcaggaacgc aaccgcagct     5040

tagaccaaaa caggaagcta tgggcctgct taggtgacgt ctctcgtcag gttgaatggc     5100

atggtcgctg gctggatgca gaaagctgga agtgtgtgtt taccgcagca ttaaagcagc     5160

aggatgttgt tcctaacctt gccgggaatg gctttgtggt aataggccag tcaaccagca     5220

ggatgcgtgt aggcgaattt gcggagctat tagagcttat acaggcattc ggtacagagc     5280

gtggcgttaa gtggtcagac gaagcgagac tggctctgga gtggaaagcg agatggggag     5340

acagggctgc atgataaatg tcgttagttt ctccggtggc aggacgtcag catatttgct     5400

ctggctaatg gagcaaaagc gacgggcagg taaagacgtg cattacgttt tcatggatac     5460

aggttgtgaa catccaatga catatcggtt tgtcagggaa gttgtgaagt tctgggatat     5520

accgctcacc gtattgcagg ttgatatcaa cccggagctt ggacagccaa atggttatac     5580

ggtatgggaa ccaaaggata ttcagacgcg aatgcctgtt ctgaagccat ttatcgatat     5640

ggtaaagaaa tatggcactc catacgtcgg cggcgcgttc tgcactgaca gattaaaact     5700

cgttcccttc accaaatact gtgatgacca tttcgggcga gggaattaca ccacgtggat     5760

tggcatcaga gctgatgaac cgaagcggct aaagccaaag cctggaatca gatatcttgc     5820

tgaactgtca gactttgaga aggaagatat cctcgcatgg tggaagcaac aaccattcga     5880

tttgcaaata ccggaacatc tcggtaactg catattctgc attaaaaaat caacgcaaaa     5940

aatcggactt gcctgcaaag atgaggaggg attgcagcgt gtttttaatg aggtcatcac     6000

gggatcccat gtgcgtgacg gacatcggga aacgccaaag gagattatgt accgaggaag     6060

aatgtcgctg gacggtatcg cgaaaatgta ttcagaaaat gattatcaag ccctgtatca     6120

ggacatggta cgagctaaaa gattcgatac cggctcttgt tctgagtcat gcgaaatatt     6180

tggagggcag cttgatttcg acttcgggag ggaagctgca tgatgcgatg ttatcggtgc     6240

ggtgaatgca aagaagataa ccgcttccga ccaaatcaac cttactggaa tcgatggtgt     6300

ctccggtgtg aaagaacacc aacaggggtg ttaccactac cgcaggaaaa ggaggacgtg     6360

tggcgagaca gcgacgaagt atcaccgaca taatctgcga aaactgcaaa taccttccaa     6420

cgaaacgcac cagaaataaa cccaagccaa tcccaaaaga atctgacgta aaaaccttca     6480

actacacggc tcacctgtgg gatatccggt ggctaagacg tcgtgcgagg aaaacaaggt     6540

gattgaccaa aatcgaagtt acgaacaaga aagcgtcgag cgagctttaa cgtgcgctaa     6600

ctgcggtcag aagctgcatg tgctggaagt tcacgtgtgt gagcactgct gcgcagaact     6660

gatgagcgat ccgaatagct cgatgcacga ggaagaagat gatggctaaa ccagcgcgaa     6720

gacgatgtaa aaacgatgaa tgccgggaat ggtttcaccc tgcattcgct aatcagtggt     6780

ggtgctctcc agagtgtgga accaagatag cactcgaacg acgaagtaaa gaacgcgaaa     6840

aagcggaaaa agcagcagag aagaaacgac gacgagagga gcagaaacag aaagataaac     6900

ttaagattcg aaaactcgcc ttaaagcccc gcagttactg gattaaacaa gcccaacaag     6960

ccgtaaacgc cttcatcaga gaaagagacc gcgacttacc atgtatctcg tgcggaacgc     7020

tcacgtctgc tcagtgggat gccggacatt accggacaac tgctgcggca cctcaactcc     7080

gatttaatga acgcaatatt cacaagcaat gcgtggtgtg caaccagcac aaaagcggaa     7140

atctcgttcc gtatcgcgtc gaactgatta gccgcatcgg gcaggaagca gtagacgaaa     7200

tcgaatcaaa ccataaccgc catcgctgga ctatcgaaga gtgcaaggcg atcaaggcag     7260

agtaccaaca gaaactcaaa gacctgcgaa atagcagaag tgaggccgca tgacgttctc     7320

agtaaaaacc attccagaca tgctcgttga agcatacgga aatcagacag aagtagcacg     7380

cagactgaaa tgtagtcgcg gtacggtcag aaaatacgtt gatgataaag acgggaaaat     7440

gcacgccatc gtcaacgacg ttctcatggt tcatcgcgga tggagtgaaa gagatgcgct     7500

attacgaaaa aattgatggc agcaaatacc gaaatatttg ggtagttggc gatctgcacg     7560

gatgctacac gaacctgatg aacaaactgg atacgattgg attcgacaac aaaaaagacc     7620

tgcttatctc ggtgggcgat ttggttgatc gtggtgcaga gaacgttgaa tgcctggaat     7680

taatcacatt cccctggttc agagctgtac gtggaaacca tgagcaaatg atgattgatg     7740

gcttatcaga gcgtggaaac gttaatcact ggctgcttaa tggcggtggc tggttcttta     7800

atctcgatta cgacaaagaa attctggcta aagctcttgc ccataaagca gatgaacttc     7860

cgttaatcat cgaactggtg agcaaagata aaaaatatgt tatctgccac gccgattatc     7920

cctttgacga atacgagttt ggaaagccag ttgatcatca gcaggtaatc tggaaccgcg     7980

aacgaatcag caactcacaa aacgggatcg tgaaagaaat caaaggcgcg gacacgttca     8040

tctttggtca tacgccagca gtgaaaccac tcaagtttgc caaccaaatg tatatcgata     8100

ccggcgcagt gttctgcgga aacctaacat tgattcaggt acagggagaa ggcgcatgag     8160

actcgaaagc gtagctaaat ttcattcgcc aaaaagcccg atgatgagcg actcaccacg     8220

ggccacggct tctgactctc tttccggtac tgatgtgatg gctgctatgg ggatggcgca     8280

atcacaagcc ggattcggta tggctgcatt ctgcggtaag cacgaactca gccagaacga     8340

caaacaaaag gctatcaact atctgatgca atttgcacac aaggtatcgg ggaaataccg     8400

tggtgtggca aagcttgaag gaaatactaa ggcaaaggta ctgcaagtgc tcgcaacatt     8460

cgcttatgcg gattattgcc gtagtgccgc gacgccgggg gcaagatgca gagattgcca     8520

tggtacaggc cgtgcggttg atattgccaa aacagagctg tgggggagag ttgtcgagaa     8580

agagtgcgga agatgcaaag gcgtcggcta ttcaaggatg ccagcaagcg cagcatatcg     8640

cgctgtgacg atgctaatcc caaaccttac ccaacccacc tggtcacgca ctgttaagcc     8700

gctgtatgac gctctggtgg tgcaatgcca caaagaagag tcaatcgcag acaacatttt     8760

gaatgcggtc acacgttagc agcatgattg ccacggatgg caacatatta acggcatgat     8820

attgacttat tgaataaaat tgggtaaatt tgactcaacg atgggttaat tcgctcgttg     8880

tggtagtgag atgaaaagag gcggcgctta ctaccgattc cgcctagttg gtcacttcga     8940

cgtatcgtct ggaactccaa ccatcgcagg cagagaggtc tgcaaaatgc aatcccgaaa     9000

cagttcgcag gtaatagtta gagcctgcat aacggtttcg ggatttttta tatctgcaca     9060

acaggtaaga gcattgagtc gataatcgtg aagagtcggc gagcctggtt agccagtgct     9120

ctttccgttg tgctgaatta agcgaatacc ggaagcagaa ccggatcacc aaatgcgtac     9180

aggcgtcatc gccgcccagc aacagcacaa cccaaactga gccgtagcca ctgtctgtcc     9240

tgaattcatt agtaatagtt acgctgcggc cttttacaca tgaccttcgt gaaagcgggt     9300

ggcaggaggt cgcgctaaca acctcctgcc gttttgcccg tgcatatcgg tcacgaacaa     9360

atctgattac taaacacagt agcctggatt tgttctatca gtaatcgacc ttattcctaa     9420

ttaaatagag caaatcccct tattgggggt aagacatgaa gatgccagaa aaacatgacc     9480

tgttggccgc cattctcgcg gcaaaggaac aaggcatcgg ggcaatcctt gcgtttgcaa     9540

tggcgtacct tcgcggcaga tataatggcg gtgcgtttac aaaaacagta atcgacgcaa     9600

cgatgtgcgc cattatcgcc tggttcattc gtgaccttct cgacttcgcc ggactaagta     9660

gcaatctcgc ttatataacg agcgtgttta tcggctacat cggtactgac tcgattggtt     9720

cgcttatcaa acgcttcgct gctaaaaaag ccggagtaga agatggtaga aatcaataat     9780

caacgtaagg cgttcctcga tatgctggcg tggtcggagg gaactgataa cggacgtcag     9840

aaaaccagaa atcatggtta tgacgtcatt gtaggcggag agctatttac tgattactcc     9900

gatcaccctc gcaaacttgt cacgctaaac ccaaaactca aatcaacagg cgcttaagac     9960

tggccgtcgt tttacaacac agaaagagtt tgtagaaacg caaaaaggcc atccgtcagg    10020

ggccttctgc ttagtttgat gcctggcagt tccctactct cgccttccgc ttcctcgctc    10080

actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg    10140

gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc    10200

cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc    10260

ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga    10320

ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc    10380

ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat    10440

agctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg    10500

cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc    10560

aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga    10620

gcgaggtatg taggcggtgc tacagagttc ttgaagtggt gggctaacta cggctacact    10680

agaagaacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt    10740

ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag    10800

cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg    10860

tctgacgctc agtggaacga cgcgcgcgta actcacgtta agggattttg gtcatgagct    10920

tgcgccgtcc cgtcaagtca gcgtaatgct ctgcttttag aaaaactcat cgagcatcaa    10980

atgaaactgc aatttattca tatcaggatt atcaatacca tatttttgaa aaagccgttt    11040

ctgtaatgaa ggagaaaact caccgaggca gttccatagg atggcaagat cctggtatcg    11100

gtctgcgatt ccgactcgtc caacatcaat acaacctatt aatttcccct cgtcaaaaat    11160

aaggttatca agtgagaaat caccatgagt gacgactgaa tccggtgaga atggcaaaag    11220

tttatgcatt tctttccaga cttgttcaac aggccagcca ttacgctcgt catcaaaatc    11280

actcgcatca accaaaccgt tattcattcg tgattgcgcc tgagcgaggc gaaatacgcg    11340

atcgctgtta aaaggacaat tacaaacagg aatcgagtgc aaccggcgca ggaacactgc    11400

cagcgcatca acaatatttt cacctgaatc aggatattct tctaatacct ggaacgctgt    11460

ttttccgggg atcgcagtgg tgagtaacca tgcatcatca ggagtacgga taaaatgctt    11520

gatggtcgga agtggcataa attccgtcag ccagtttagt ctgaccatct catctgtaac    11580

atcattggca acgctacctt tgccatgttt cagaaacaac tctggcgcat cgggcttccc    11640

atacaagcga tagattgtcg cacctgattg cccgacatta tcgcgagccc atttataccc    11700

atataaatca gcatccatgt tggaatttaa tcgcggcctc gacgtttccc gttgaatatg    11760

gctcatattc ttcctttttc aatattattg aagcatttat cagggttatt gtctcatgag    11820

cggatacata tttgaatgta tttagaaaaa taaacaaata ggggtcagtg ttacaaccaa    11880

ttaaccaatt ctgaacatta tcgcgagccc atttatacct gaatatggct cataacaccc    11940

cttgtttgcc tggcggcagt agcgcggtgg tcccacctga ccccatgccg aactcagaag    12000

tgaaacgccg tagcgccgat ggtagtgtgg ggactcccca tgcgagagta gggaactgcc    12060

aggcatcaaa taaaacgaaa ggctcagtcg aaagactggg cctttcgccc gggctaatta    12120

gggggtgtcg cccttattcg actctatagt gaagttccta ttctctagaa agtataggaa    12180

cttctgaagt ggggtcgact taattaagg                                      12209


<210>  37
<211>  12209
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  37
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg      240

cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt      300

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      360

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      420

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      480

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattaa      540

catggtcgag gtgagcccca cgttctgctt cactctcccc atctcccccc cctccccacc      600

cccaattttg tatttattta ttttttaatt attttgtgca gcgatggggg cggggggggg      660

gggggggcgc gcgccaggcg gggcggggcg gggcgagggg cggggcgggg cgaggcggag      720

aggtgcggcg gcagccaatc agagcggcgc gctccgaaag tttcctttta tggcgaggcg      780

gcggcggcgg cggccctata aaaagcgaag cgcgcggcgg gcggggagtc gctgcgacgc      840

tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg      900

accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag      960

cgcttggttt aatgacggct tgtttctttt ctgtggctgc gtgaaagcct tgaggggctc     1020

cgggagggcc ctttgtgcgg ggggagcggc tcggggggtg cgtgcgtgtg tgtgtgcgtg     1080

gggagcgccg cgtgcggctc cgcgctgccc ggcggctgtg agcgctgcgg gcgcggcgcg     1140

gggctttgtg cgctccgcag tgtgcgcgag gggagcgcgg ccgggggcgg tgccccgcgg     1200

tgcggggggg gctgcgaggg gaacaaaggc tgcgtgcggg gtgtgtgcgt gggggggtga     1260

gcagggggtg tgggcgcgtc ggtcgggctg caaccccccc tgcacccccc tccccgagtt     1320

gctgagcacg gcccggcttc gggtgcgggg ctccgtacgg ggcgtggcgc ggggctcgcc     1380

gtgccgggcg gggggtggcg gcaggtgggg gtgccgggcg gggcggggcc gcctcgggcc     1440

ggggagggct cgggggaggg gcgcggcggc ccccggagcg ccggcggctg tcgaggcgcg     1500

gcgagccgca gccattgcct tttatggtaa tcgtgcgaga gggcgcaggg acttcctttg     1560

tcccaaatct gtgcggagcc gaaatctggg aggcgccgcc gcaccccctc tagcgggcgc     1620

ggggcgaagc ggtgcggcgc cggcaggaag gaaatgggcg gggagggcct tcgtgcgtcg     1680

ccgcgccgcc gtccccttct ccctctccag cctcggggct gtccgcgggg ggacggctgc     1740

cttcgggggg gacggggcag ggcggggttc ggcttctggc gtgtgaccgg cggctctaga     1800

caattgtact aaccttcttc tctttcctct cctgacaggt tggtgtacac tagcggccgc     1860

caccatggct aagattaaca cccagtactc acatccatcc cgcactcacc tcaaagtcaa     1920

gacctccgat cgggatctga accgggctga gaatgggctg tcgcgcgccc actcgtcgtc     1980

cgaggaaacc agcagcgtgc tccagccggg catcgccatg gaaactaggg ggctggcgga     2040

ctccggacag ggatccttca ctggacaggg tattgcccgg ctgagcagac tgatcttcct     2100

gcttcgccgc tgggcggcca gacacgtgca ccatcaggac cagggacctg atagcttccc     2160

cgaccgcttt aggggagccg agctgaaaga agtgtcaagc caggagtcaa acgcgcaggc     2220

caacgtcggc agccaagagc ctgcagaccg gggacgctcg gcatggccgc tcgcaaagtg     2280

caacactaac acttccaaca acaccgaaga ggaaaagaaa accaagaaga aggatgcaat     2340

tgtggtggac ccttcctcca acctgtacta ccgctggttg accgccatcg ccctcccggt     2400

cttttacaat tggtatctcc ttatctgccg ggcctgcttc gacgaactgc aatcagagta     2460

cctgatgctg tggctggtgc tggactatag cgccgatgtg ctctacgtcc tggatgtgct     2520

cgtgcgcgcc cggaccggat tcttggaaca aggcctgatg gtgtccgaca cgaatagact     2580

gtggcagcac tataagacca caacccagtt caagcttgac gtgctcagcc ttgtgccgac     2640

tgacctggcc tacctgaaag tcggaactaa ctacccggaa gtcagattca accgactcct     2700

gaagttcagc aggctgttcg agttctttga ccgcaccgag actcggacca actaccctaa     2760

catgttccgg atcggaaatc tggtgctcta catactgatt atcatccatt ggaacgcctg     2820

tatctatttc gccatttcga agttcatcgg tttcggaacc gattcctggg tgtaccccaa     2880

catctcgatc cccgaacacg gtcgcctgtc ccggaagtac atctactccc tgtactggtc     2940

cactctgact ctgaccacga tcggggaaac ccctccaccc gtgaaggacg aagagtacct     3000

gttcgtggtg gtggacttcc tggtcggagt gttgattttc gccaccattg tgggaaacgt     3060

gggctccatg atctccaaca tgaacgcgtc gagagctgag ttccaagcca agatcgactc     3120

cattaagcag tacatgcagt tcagaaaggt caccaaggac ctggaaacca gggtcatccg     3180

ctggttcgac tacctgtggg ccaacaaaaa gactgtggac gaaaaggaag tgctgaagtc     3240

gctgccggat aagctgaagg ccgaaatcgc cattaacgtg caccttgaca ccctgaagaa     3300

agtccggatc ttccaagact gtgaagccgg cctcctggtg gagctcgtgc tcaagctgcg     3360

gcccaccgtg ttcagcccgg gagattacat ttgcaagaag ggcgatatcg gcaaagagat     3420

gtacatcatc aacgagggaa agctggccgt ggtcgcggac gacggcgtga cccagttcgt     3480

ggtgctgtcc gacggatcct acttcggtga aatctcaatc ctcaacatca aggggtccaa     3540

gtccggcaac cggagaactg ccaacattcg ctccatcgga tacagcgacc tgttttgcct     3600

gtccaaggat gacctgatgg aggctctgac tgagtaccct gaagcgaaga aggctttgga     3660

ggaaaagggg cggcagattc tgatgaagga caatttgatc gacgaggagc tcgcacgggc     3720

cggcgccgac cccaaggatc tcgaagagaa ggtcgaacag ctgggttctt cgcttgatac     3780

cctgcaaacc cgattcgcgc ggctgctcgc cgagtacaac gcgacccaga tgaagatgaa     3840

gcagagactg tcacagttgg aatcccaagt caagggcgga ggcgacaagc cgctggcgga     3900

cggggaagtg cccggggacg ccaccaagac tgaggacaag cagcagtgat catagatcga     3960

tctgcctcga ctgtgccttc tagttgccag ccatctgttg tttgcccctc ccccgtgcct     4020

tccttgaccc tggaaggtgc cactcccact gtcctttcct aataaaatga ggaaattgca     4080

tcgcattgtc tgagtaggtg tcattctatt ctggggggtg gggtggggca ggacagcaag     4140

ggggaggatt gggaagacaa tagcaggcat gctggggact cgagttctac gtagataagt     4200

agcatggcgg gttaatcatt aactacaagg aacccctagt gatggagttg gccactccct     4260

ctctgcgcgc tcgctcgctc actgaggccg ggcgaccaaa ggtcgcccga cgcccgggct     4320

ttgcccgggc ggcctcagtg agcgagcgag cgcgcagcct taattaacct aaggaaaatg     4380

aagtgaagtt cctatacttt ctagagaata ggaacttcta tagtgagtcg aataagggcg     4440

acacaaaatt tattctaaat gcataataaa tactgataac atcttatagt ttgtattata     4500

ttttgtatta tcgttgacat gtataatttt gatatcaaaa actgattttc cctttattat     4560

tttcgagatt tattttctta attctcttta acaaactaga aatattgtat atacaaaaaa     4620

tcataaataa tagatgaata gtttaattat aggtgttcat caatcgaaaa agcaacgtat     4680

cttatttaaa gtgcgttgct tttttctcat ttataaggtt aaataattct catatatcaa     4740

gcaaagtgac aggcgccctt aaatattctg acaaatgctc tttccctaaa ctccccccat     4800

aaaaaaaccc gccgaagcgg gtttttacgt tatttgcgga ttaacgatta ctcgttatca     4860

gaaccgccca gggggcccga gcttaacctt tttatttggg ggagagggaa gtcatgaaaa     4920

aactaacctt tgaaattcga tctccagcac atcagcaaaa cgctattcac gcagtacagc     4980

aaatccttcc agacccaacc aaaccaatcg tagtaaccat tcaggaacgc aaccgcagct     5040

tagaccaaaa caggaagcta tgggcctgct taggtgacgt ctctcgtcag gttgaatggc     5100

atggtcgctg gctggatgca gaaagctgga agtgtgtgtt taccgcagca ttaaagcagc     5160

aggatgttgt tcctaacctt gccgggaatg gctttgtggt aataggccag tcaaccagca     5220

ggatgcgtgt aggcgaattt gcggagctat tagagcttat acaggcattc ggtacagagc     5280

gtggcgttaa gtggtcagac gaagcgagac tggctctgga gtggaaagcg agatggggag     5340

acagggctgc atgataaatg tcgttagttt ctccggtggc aggacgtcag catatttgct     5400

ctggctaatg gagcaaaagc gacgggcagg taaagacgtg cattacgttt tcatggatac     5460

aggttgtgaa catccaatga catatcggtt tgtcagggaa gttgtgaagt tctgggatat     5520

accgctcacc gtattgcagg ttgatatcaa cccggagctt ggacagccaa atggttatac     5580

ggtatgggaa ccaaaggata ttcagacgcg aatgcctgtt ctgaagccat ttatcgatat     5640

ggtaaagaaa tatggcactc catacgtcgg cggcgcgttc tgcactgaca gattaaaact     5700

cgttcccttc accaaatact gtgatgacca tttcgggcga gggaattaca ccacgtggat     5760

tggcatcaga gctgatgaac cgaagcggct aaagccaaag cctggaatca gatatcttgc     5820

tgaactgtca gactttgaga aggaagatat cctcgcatgg tggaagcaac aaccattcga     5880

tttgcaaata ccggaacatc tcggtaactg catattctgc attaaaaaat caacgcaaaa     5940

aatcggactt gcctgcaaag atgaggaggg attgcagcgt gtttttaatg aggtcatcac     6000

gggatcccat gtgcgtgacg gacatcggga aacgccaaag gagattatgt accgaggaag     6060

aatgtcgctg gacggtatcg cgaaaatgta ttcagaaaat gattatcaag ccctgtatca     6120

ggacatggta cgagctaaaa gattcgatac cggctcttgt tctgagtcat gcgaaatatt     6180

tggagggcag cttgatttcg acttcgggag ggaagctgca tgatgcgatg ttatcggtgc     6240

ggtgaatgca aagaagataa ccgcttccga ccaaatcaac cttactggaa tcgatggtgt     6300

ctccggtgtg aaagaacacc aacaggggtg ttaccactac cgcaggaaaa ggaggacgtg     6360

tggcgagaca gcgacgaagt atcaccgaca taatctgcga aaactgcaaa taccttccaa     6420

cgaaacgcac cagaaataaa cccaagccaa tcccaaaaga atctgacgta aaaaccttca     6480

actacacggc tcacctgtgg gatatccggt ggctaagacg tcgtgcgagg aaaacaaggt     6540

gattgaccaa aatcgaagtt acgaacaaga aagcgtcgag cgagctttaa cgtgcgctaa     6600

ctgcggtcag aagctgcatg tgctggaagt tcacgtgtgt gagcactgct gcgcagaact     6660

gatgagcgat ccgaatagct cgatgcacga ggaagaagat gatggctaaa ccagcgcgaa     6720

gacgatgtaa aaacgatgaa tgccgggaat ggtttcaccc tgcattcgct aatcagtggt     6780

ggtgctctcc agagtgtgga accaagatag cactcgaacg acgaagtaaa gaacgcgaaa     6840

aagcggaaaa agcagcagag aagaaacgac gacgagagga gcagaaacag aaagataaac     6900

ttaagattcg aaaactcgcc ttaaagcccc gcagttactg gattaaacaa gcccaacaag     6960

ccgtaaacgc cttcatcaga gaaagagacc gcgacttacc atgtatctcg tgcggaacgc     7020

tcacgtctgc tcagtgggat gccggacatt accggacaac tgctgcggca cctcaactcc     7080

gatttaatga acgcaatatt cacaagcaat gcgtggtgtg caaccagcac aaaagcggaa     7140

atctcgttcc gtatcgcgtc gaactgatta gccgcatcgg gcaggaagca gtagacgaaa     7200

tcgaatcaaa ccataaccgc catcgctgga ctatcgaaga gtgcaaggcg atcaaggcag     7260

agtaccaaca gaaactcaaa gacctgcgaa atagcagaag tgaggccgca tgacgttctc     7320

agtaaaaacc attccagaca tgctcgttga agcatacgga aatcagacag aagtagcacg     7380

cagactgaaa tgtagtcgcg gtacggtcag aaaatacgtt gatgataaag acgggaaaat     7440

gcacgccatc gtcaacgacg ttctcatggt tcatcgcgga tggagtgaaa gagatgcgct     7500

attacgaaaa aattgatggc agcaaatacc gaaatatttg ggtagttggc gatctgcacg     7560

gatgctacac gaacctgatg aacaaactgg atacgattgg attcgacaac aaaaaagacc     7620

tgcttatctc ggtgggcgat ttggttgatc gtggtgcaga gaacgttgaa tgcctggaat     7680

taatcacatt cccctggttc agagctgtac gtggaaacca tgagcaaatg atgattgatg     7740

gcttatcaga gcgtggaaac gttaatcact ggctgcttaa tggcggtggc tggttcttta     7800

atctcgatta cgacaaagaa attctggcta aagctcttgc ccataaagca gatgaacttc     7860

cgttaatcat cgaactggtg agcaaagata aaaaatatgt tatctgccac gccgattatc     7920

cctttgacga atacgagttt ggaaagccag ttgatcatca gcaggtaatc tggaaccgcg     7980

aacgaatcag caactcacaa aacgggatcg tgaaagaaat caaaggcgcg gacacgttca     8040

tctttggtca tacgccagca gtgaaaccac tcaagtttgc caaccaaatg tatatcgata     8100

ccggcgcagt gttctgcgga aacctaacat tgattcaggt acagggagaa ggcgcatgag     8160

actcgaaagc gtagctaaat ttcattcgcc aaaaagcccg atgatgagcg actcaccacg     8220

ggccacggct tctgactctc tttccggtac tgatgtgatg gctgctatgg ggatggcgca     8280

atcacaagcc ggattcggta tggctgcatt ctgcggtaag cacgaactca gccagaacga     8340

caaacaaaag gctatcaact atctgatgca atttgcacac aaggtatcgg ggaaataccg     8400

tggtgtggca aagcttgaag gaaatactaa ggcaaaggta ctgcaagtgc tcgcaacatt     8460

cgcttatgcg gattattgcc gtagtgccgc gacgccgggg gcaagatgca gagattgcca     8520

tggtacaggc cgtgcggttg atattgccaa aacagagctg tgggggagag ttgtcgagaa     8580

agagtgcgga agatgcaaag gcgtcggcta ttcaaggatg ccagcaagcg cagcatatcg     8640

cgctgtgacg atgctaatcc caaaccttac ccaacccacc tggtcacgca ctgttaagcc     8700

gctgtatgac gctctggtgg tgcaatgcca caaagaagag tcaatcgcag acaacatttt     8760

gaatgcggtc acacgttagc agcatgattg ccacggatgg caacatatta acggcatgat     8820

attgacttat tgaataaaat tgggtaaatt tgactcaacg atgggttaat tcgctcgttg     8880

tggtagtgag atgaaaagag gcggcgctta ctaccgattc cgcctagttg gtcacttcga     8940

cgtatcgtct ggaactccaa ccatcgcagg cagagaggtc tgcaaaatgc aatcccgaaa     9000

cagttcgcag gtaatagtta gagcctgcat aacggtttcg ggatttttta tatctgcaca     9060

acaggtaaga gcattgagtc gataatcgtg aagagtcggc gagcctggtt agccagtgct     9120

ctttccgttg tgctgaatta agcgaatacc ggaagcagaa ccggatcacc aaatgcgtac     9180

aggcgtcatc gccgcccagc aacagcacaa cccaaactga gccgtagcca ctgtctgtcc     9240

tgaattcatt agtaatagtt acgctgcggc cttttacaca tgaccttcgt gaaagcgggt     9300

ggcaggaggt cgcgctaaca acctcctgcc gttttgcccg tgcatatcgg tcacgaacaa     9360

atctgattac taaacacagt agcctggatt tgttctatca gtaatcgacc ttattcctaa     9420

ttaaatagag caaatcccct tattgggggt aagacatgaa gatgccagaa aaacatgacc     9480

tgttggccgc cattctcgcg gcaaaggaac aaggcatcgg ggcaatcctt gcgtttgcaa     9540

tggcgtacct tcgcggcaga tataatggcg gtgcgtttac aaaaacagta atcgacgcaa     9600

cgatgtgcgc cattatcgcc tggttcattc gtgaccttct cgacttcgcc ggactaagta     9660

gcaatctcgc ttatataacg agcgtgttta tcggctacat cggtactgac tcgattggtt     9720

cgcttatcaa acgcttcgct gctaaaaaag ccggagtaga agatggtaga aatcaataat     9780

caacgtaagg cgttcctcga tatgctggcg tggtcggagg gaactgataa cggacgtcag     9840

aaaaccagaa atcatggtta tgacgtcatt gtaggcggag agctatttac tgattactcc     9900

gatcaccctc gcaaacttgt cacgctaaac ccaaaactca aatcaacagg cgcttaagac     9960

tggccgtcgt tttacaacac agaaagagtt tgtagaaacg caaaaaggcc atccgtcagg    10020

ggccttctgc ttagtttgat gcctggcagt tccctactct cgccttccgc ttcctcgctc    10080

actgactcgc tgcgctcggt cgttcggctg cggcgagcgg tatcagctca ctcaaaggcg    10140

gtaatacggt tatccacaga atcaggggat aacgcaggaa agaacatgtg agcaaaaggc    10200

cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg cgtttttcca taggctccgc    10260

ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga ggtggcgaaa cccgacagga    10320

ctataaagat accaggcgtt tccccctgga agctccctcg tgcgctctcc tgttccgacc    10380

ctgccgctta ccggatacct gtccgccttt ctcccttcgg gaagcgtggc gctttctcat    10440

agctcacgct gtaggtatct cagttcggtg taggtcgttc gctccaagct gggctgtgtg    10500

cacgaacccc ccgttcagcc cgaccgctgc gccttatccg gtaactatcg tcttgagtcc    10560

aacccggtaa gacacgactt atcgccactg gcagcagcca ctggtaacag gattagcaga    10620

gcgaggtatg taggcggtgc tacagagttc ttgaagtggt gggctaacta cggctacact    10680

agaagaacag tatttggtat ctgcgctctg ctgaagccag ttaccttcgg aaaaagagtt    10740

ggtagctctt gatccggcaa acaaaccacc gctggtagcg gtggtttttt tgtttgcaag    10800

cagcagatta cgcgcagaaa aaaaggatct caagaagatc ctttgatctt ttctacgggg    10860

tctgacgctc agtggaacga cgcgcgcgta actcacgtta agggattttg gtcatgagct    10920

tgcgccgtcc cgtcaagtca gcgtaatgct ctgcttttag aaaaactcat cgagcatcaa    10980

atgaaactgc aatttattca tatcaggatt atcaatacca tatttttgaa aaagccgttt    11040

ctgtaatgaa ggagaaaact caccgaggca gttccatagg atggcaagat cctggtatcg    11100

gtctgcgatt ccgactcgtc caacatcaat acaacctatt aatttcccct cgtcaaaaat    11160

aaggttatca agtgagaaat caccatgagt gacgactgaa tccggtgaga atggcaaaag    11220

tttatgcatt tctttccaga cttgttcaac aggccagcca ttacgctcgt catcaaaatc    11280

actcgcatca accaaaccgt tattcattcg tgattgcgcc tgagcgaggc gaaatacgcg    11340

atcgctgtta aaaggacaat tacaaacagg aatcgagtgc aaccggcgca ggaacactgc    11400

cagcgcatca acaatatttt cacctgaatc aggatattct tctaatacct ggaacgctgt    11460

ttttccgggg atcgcagtgg tgagtaacca tgcatcatca ggagtacgga taaaatgctt    11520

gatggtcgga agtggcataa attccgtcag ccagtttagt ctgaccatct catctgtaac    11580

atcattggca acgctacctt tgccatgttt cagaaacaac tctggcgcat cgggcttccc    11640

atacaagcga tagattgtcg cacctgattg cccgacatta tcgcgagccc atttataccc    11700

atataaatca gcatccatgt tggaatttaa tcgcggcctc gacgtttccc gttgaatatg    11760

gctcatattc ttcctttttc aatattattg aagcatttat cagggttatt gtctcatgag    11820

cggatacata tttgaatgta tttagaaaaa taaacaaata ggggtcagtg ttacaaccaa    11880

ttaaccaatt ctgaacatta tcgcgagccc atttatacct gaatatggct cataacaccc    11940

cttgtttgcc tggcggcagt agcgcggtgg tcccacctga ccccatgccg aactcagaag    12000

tgaaacgccg tagcgccgat ggtagtgtgg ggactcccca tgcgagagta gggaactgcc    12060

aggcatcaaa taaaacgaaa ggctcagtcg aaagactggg cctttcgccc gggctaatta    12120

gggggtgtcg cccttattcg actctatagt gaagttccta ttctctagaa agtataggaa    12180

cttctgaagt ggggtcgact taattaagg                                      12209


<210>  38
<211>  12374
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  38
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg      240

cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt      300

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      360

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      420

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      480

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattaa      540

catggtcgag gtgagcccca cgttctgctt cactctcccc atctcccccc cctccccacc      600

cccaattttg tatttattta ttttttaatt attttgtgca gcgatggggg cggggggggg      660

gggggggcgc gcgccaggcg gggcggggcg gggcgagggg cggggcgggg cgaggcggag      720

aggtgcggcg gcagccaatc agagcggcgc gctccgaaag tttcctttta tggcgaggcg      780

gcggcggcgg cggccctata aaaagcgaag cgcgcggcgg gcggggagtc gctgcgacgc      840

tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg      900

accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag      960

cgcttggttt aatgacggct tgtttctttt ctgtggctgc gtgaaagcct tgaggggctc     1020

cgggagggcc ctttgtgcgg ggggagcggc tcggggggtg cgtgcgtgtg tgtgtgcgtg     1080

gggagcgccg cgtgcggctc cgcgctgccc ggcggctgtg agcgctgcgg gcgcggcgcg     1140

gggctttgtg cgctccgcag tgtgcgcgag gggagcgcgg ccgggggcgg tgccccgcgg     1200

tgcggggggg gctgcgaggg gaacaaaggc tgcgtgcggg gtgtgtgcgt gggggggtga     1260

gcagggggtg tgggcgcgtc ggtcgggctg caaccccccc tgcacccccc tccccgagtt     1320

gctgagcacg gcccggcttc gggtgcgggg ctccgtacgg ggcgtggcgc ggggctcgcc     1380

gtgccgggcg gggggtggcg gcaggtgggg gtgccgggcg gggcggggcc gcctcgggcc     1440

ggggagggct cgggggaggg gcgcggcggc ccccggagcg ccggcggctg tcgaggcgcg     1500

gcgagccgca gccattgcct tttatggtaa tcgtgcgaga gggcgcaggg acttcctttg     1560

tcccaaatct gtgcggagcc gaaatctggg aggcgccgcc gcaccccctc tagcgggcgc     1620

ggggcgaagc ggtgcggcgc cggcaggaag gaaatgggcg gggagggcct tcgtgcgtcg     1680

ccgcgccgcc gtccccttct ccctctccag cctcggggct gtccgcgggg ggacggctgc     1740

cttcgggggg gacggggcag ggcggggttc ggcttctggc gtgtgaccgg cggctctaga     1800

caattgtact aaccttcttc tctttcctct cctgacaggt tggtgtacac tagcggccgc     1860

caccatggct aagattaaca cccagtactc acatccatcc cgcactcacc tcaaagtcaa     1920

gacctccgat cgggatctga accgggctga gaatgggctg tcgcgcgccc actcgtcgtc     1980

cgaggaaacc agcagcgtgc tccagccggg catcgccatg gaaactaggg ggctggcgga     2040

ctccggacag ggatccttca ctggacaggg tattgcccgg ttcgggcgga ttcagaagaa     2100

gtcccagccg gagaaggtcg tgcgggctgc cagcaggggc aggccactca ttggttggac     2160

acagtggtgc gctgaggatg gtggagatga atcggaaatg gcactggccg gctctcccgg     2220

atgcagctcg ggcccccaag ggagactgag cagactgatc ttcctgcttc gccgctgggc     2280

ggccagacac gtgcaccatc aggaccaggg acctgatagc ttccccgacc gctttagggg     2340

agccgagctg aaagaagtgt caagccagga gtcaaacgcg caggccaacg tcggcagcca     2400

agagcctgca gaccggggac gctcggcatg gccgctcgca aagtgcaaca ctaacacttc     2460

caacaacacc gaagaggaaa agaaaaccaa gaagaaggat gcaattgtgg tggacccttc     2520

ctccaacctg tactaccgct ggttgaccgc catcgccctc ccggtctttt acaattggta     2580

tctccttatc tgccgggcct gcttcgacga actgcaatca gagtacctga tgctgtggct     2640

ggtgctggac tatagcgccg atgtgctcta cgtcctggat gtgctcgtgc gcgcccggac     2700

cggattcttg gaacaaggcc tgatggtgtc cgacacgaat agactgtggc agcactataa     2760

gaccacaacc cagttcaagc ttgacgtgct cagccttgtg ccgactgacc tggcctacct     2820

gaaagtcgga actaactacc cggaagtcag attcaaccga ctcctgaagt tcagcaggct     2880

gttcgagttc tttgaccgca ccgagactcg gaccaactac cctaacatgt tccggatcgg     2940

aaatctggtg ctctacatac tgattatcat ccattggaac gcctgtatct atttcgccat     3000

ttcgaagttc atcggtttcg gaaccgattc ctgggtgtac cccaacatct cgatccccga     3060

acacggtcgc ctgtcccgga agtacatcta ctccctgtac tggtccactc tgactctgac     3120

cacgatcggg gaaacccctc cacccgtgaa ggacgaagag tacctgttcg tggtggtgga     3180

cttcctggtc ggagtgttga ttttcgccac cattgtggga aacgtgggct ccatgatctc     3240

caacatgaac gcgtcgagag ctgagttcca agccaagatc gactccatta agcagtacat     3300

gcagttcaga aaggtcacca aggacctgga aaccagggtc atccgctggt tcgactacct     3360

gtgggccaac aaaaagactg tggacgaaaa ggaagtgctg aagtcgctgc cggataagct     3420

gaaggccgaa atcgccatta acgtgcacct tgacaccctg aagaaagtcc ggatcttcca     3480

agactgtgaa gccggcctcc tggtggagct cgtgctcaag ctgcggccca ccgtgttcag     3540

cccgggagat tacatttgca agaagggcga tatcggcaaa gagatgtaca tcatcaacga     3600

gggaaagctg gccgtggtcg cggacgacgg cgtgacccag ttcgtggtgc tgtccgacgg     3660

atcctacttc ggtgaaatct caatcctcaa catcaagggg tccaagtccg gcaaccggag     3720

aactgccaac attcgctcca tcggatacag cgacctgttt tgcctgtcca aggatgacct     3780

gatggaggct ctgactgagt accctgaagc gaagaaggct ttggaggaaa aggggcggca     3840

gattctgatg aaggacaatt tgatcgacga ggagctcgca cgggccggcg ccgaccccaa     3900

ggatctcgaa gagaaggtcg aacagctggg ttcttcgctt gataccctgc aaacccgatt     3960

cgcgcggctg ctcgccgagt acaacgcgac ccagatgaag atgaagcaga gactgtcaca     4020

gttggaatcc caagtcaagg gcggaggcga caagccgctg gcggacgggg aagtgcccgg     4080

ggacgccacc aagactgagg acaagcagca gtgatcatag atcgatctgc ctcgactgtg     4140

ccttctagtt gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa     4200

ggtgccactc ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt     4260

aggtgtcatt ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa     4320

gacaatagca ggcatgctgg ggactcgagt tctacgtaga taagtagcat ggcgggttaa     4380

tcattaacta caaggaaccc ctagtgatgg agttggccac tccctctctg cgcgctcgct     4440

cgctcactga ggccgggcga ccaaaggtcg cccgacgccc gggctttgcc cgggcggcct     4500

cagtgagcga gcgagcgcgc agccttaatt aacctaagga aaatgaagtg aagttcctat     4560

actttctaga gaataggaac ttctatagtg agtcgaataa gggcgacaca aaatttattc     4620

taaatgcata ataaatactg ataacatctt atagtttgta ttatattttg tattatcgtt     4680

gacatgtata attttgatat caaaaactga ttttcccttt attattttcg agatttattt     4740

tcttaattct ctttaacaaa ctagaaatat tgtatataca aaaaatcata aataatagat     4800

gaatagttta attataggtg ttcatcaatc gaaaaagcaa cgtatcttat ttaaagtgcg     4860

ttgctttttt ctcatttata aggttaaata attctcatat atcaagcaaa gtgacaggcg     4920

cccttaaata ttctgacaaa tgctctttcc ctaaactccc cccataaaaa aacccgccga     4980

agcgggtttt tacgttattt gcggattaac gattactcgt tatcagaacc gcccaggggg     5040

cccgagctta acctttttat ttgggggaga gggaagtcat gaaaaaacta acctttgaaa     5100

ttcgatctcc agcacatcag caaaacgcta ttcacgcagt acagcaaatc cttccagacc     5160

caaccaaacc aatcgtagta accattcagg aacgcaaccg cagcttagac caaaacagga     5220

agctatgggc ctgcttaggt gacgtctctc gtcaggttga atggcatggt cgctggctgg     5280

atgcagaaag ctggaagtgt gtgtttaccg cagcattaaa gcagcaggat gttgttccta     5340

accttgccgg gaatggcttt gtggtaatag gccagtcaac cagcaggatg cgtgtaggcg     5400

aatttgcgga gctattagag cttatacagg cattcggtac agagcgtggc gttaagtggt     5460

cagacgaagc gagactggct ctggagtgga aagcgagatg gggagacagg gctgcatgat     5520

aaatgtcgtt agtttctccg gtggcaggac gtcagcatat ttgctctggc taatggagca     5580

aaagcgacgg gcaggtaaag acgtgcatta cgttttcatg gatacaggtt gtgaacatcc     5640

aatgacatat cggtttgtca gggaagttgt gaagttctgg gatataccgc tcaccgtatt     5700

gcaggttgat atcaacccgg agcttggaca gccaaatggt tatacggtat gggaaccaaa     5760

ggatattcag acgcgaatgc ctgttctgaa gccatttatc gatatggtaa agaaatatgg     5820

cactccatac gtcggcggcg cgttctgcac tgacagatta aaactcgttc ccttcaccaa     5880

atactgtgat gaccatttcg ggcgagggaa ttacaccacg tggattggca tcagagctga     5940

tgaaccgaag cggctaaagc caaagcctgg aatcagatat cttgctgaac tgtcagactt     6000

tgagaaggaa gatatcctcg catggtggaa gcaacaacca ttcgatttgc aaataccgga     6060

acatctcggt aactgcatat tctgcattaa aaaatcaacg caaaaaatcg gacttgcctg     6120

caaagatgag gagggattgc agcgtgtttt taatgaggtc atcacgggat cccatgtgcg     6180

tgacggacat cgggaaacgc caaaggagat tatgtaccga ggaagaatgt cgctggacgg     6240

tatcgcgaaa atgtattcag aaaatgatta tcaagccctg tatcaggaca tggtacgagc     6300

taaaagattc gataccggct cttgttctga gtcatgcgaa atatttggag ggcagcttga     6360

tttcgacttc gggagggaag ctgcatgatg cgatgttatc ggtgcggtga atgcaaagaa     6420

gataaccgct tccgaccaaa tcaaccttac tggaatcgat ggtgtctccg gtgtgaaaga     6480

acaccaacag gggtgttacc actaccgcag gaaaaggagg acgtgtggcg agacagcgac     6540

gaagtatcac cgacataatc tgcgaaaact gcaaatacct tccaacgaaa cgcaccagaa     6600

ataaacccaa gccaatccca aaagaatctg acgtaaaaac cttcaactac acggctcacc     6660

tgtgggatat ccggtggcta agacgtcgtg cgaggaaaac aaggtgattg accaaaatcg     6720

aagttacgaa caagaaagcg tcgagcgagc tttaacgtgc gctaactgcg gtcagaagct     6780

gcatgtgctg gaagttcacg tgtgtgagca ctgctgcgca gaactgatga gcgatccgaa     6840

tagctcgatg cacgaggaag aagatgatgg ctaaaccagc gcgaagacga tgtaaaaacg     6900

atgaatgccg ggaatggttt caccctgcat tcgctaatca gtggtggtgc tctccagagt     6960

gtggaaccaa gatagcactc gaacgacgaa gtaaagaacg cgaaaaagcg gaaaaagcag     7020

cagagaagaa acgacgacga gaggagcaga aacagaaaga taaacttaag attcgaaaac     7080

tcgccttaaa gccccgcagt tactggatta aacaagccca acaagccgta aacgccttca     7140

tcagagaaag agaccgcgac ttaccatgta tctcgtgcgg aacgctcacg tctgctcagt     7200

gggatgccgg acattaccgg acaactgctg cggcacctca actccgattt aatgaacgca     7260

atattcacaa gcaatgcgtg gtgtgcaacc agcacaaaag cggaaatctc gttccgtatc     7320

gcgtcgaact gattagccgc atcgggcagg aagcagtaga cgaaatcgaa tcaaaccata     7380

accgccatcg ctggactatc gaagagtgca aggcgatcaa ggcagagtac caacagaaac     7440

tcaaagacct gcgaaatagc agaagtgagg ccgcatgacg ttctcagtaa aaaccattcc     7500

agacatgctc gttgaagcat acggaaatca gacagaagta gcacgcagac tgaaatgtag     7560

tcgcggtacg gtcagaaaat acgttgatga taaagacggg aaaatgcacg ccatcgtcaa     7620

cgacgttctc atggttcatc gcggatggag tgaaagagat gcgctattac gaaaaaattg     7680

atggcagcaa ataccgaaat atttgggtag ttggcgatct gcacggatgc tacacgaacc     7740

tgatgaacaa actggatacg attggattcg acaacaaaaa agacctgctt atctcggtgg     7800

gcgatttggt tgatcgtggt gcagagaacg ttgaatgcct ggaattaatc acattcccct     7860

ggttcagagc tgtacgtgga aaccatgagc aaatgatgat tgatggctta tcagagcgtg     7920

gaaacgttaa tcactggctg cttaatggcg gtggctggtt ctttaatctc gattacgaca     7980

aagaaattct ggctaaagct cttgcccata aagcagatga acttccgtta atcatcgaac     8040

tggtgagcaa agataaaaaa tatgttatct gccacgccga ttatcccttt gacgaatacg     8100

agtttggaaa gccagttgat catcagcagg taatctggaa ccgcgaacga atcagcaact     8160

cacaaaacgg gatcgtgaaa gaaatcaaag gcgcggacac gttcatcttt ggtcatacgc     8220

cagcagtgaa accactcaag tttgccaacc aaatgtatat cgataccggc gcagtgttct     8280

gcggaaacct aacattgatt caggtacagg gagaaggcgc atgagactcg aaagcgtagc     8340

taaatttcat tcgccaaaaa gcccgatgat gagcgactca ccacgggcca cggcttctga     8400

ctctctttcc ggtactgatg tgatggctgc tatggggatg gcgcaatcac aagccggatt     8460

cggtatggct gcattctgcg gtaagcacga actcagccag aacgacaaac aaaaggctat     8520

caactatctg atgcaatttg cacacaaggt atcggggaaa taccgtggtg tggcaaagct     8580

tgaaggaaat actaaggcaa aggtactgca agtgctcgca acattcgctt atgcggatta     8640

ttgccgtagt gccgcgacgc cgggggcaag atgcagagat tgccatggta caggccgtgc     8700

ggttgatatt gccaaaacag agctgtgggg gagagttgtc gagaaagagt gcggaagatg     8760

caaaggcgtc ggctattcaa ggatgccagc aagcgcagca tatcgcgctg tgacgatgct     8820

aatcccaaac cttacccaac ccacctggtc acgcactgtt aagccgctgt atgacgctct     8880

ggtggtgcaa tgccacaaag aagagtcaat cgcagacaac attttgaatg cggtcacacg     8940

ttagcagcat gattgccacg gatggcaaca tattaacggc atgatattga cttattgaat     9000

aaaattgggt aaatttgact caacgatggg ttaattcgct cgttgtggta gtgagatgaa     9060

aagaggcggc gcttactacc gattccgcct agttggtcac ttcgacgtat cgtctggaac     9120

tccaaccatc gcaggcagag aggtctgcaa aatgcaatcc cgaaacagtt cgcaggtaat     9180

agttagagcc tgcataacgg tttcgggatt ttttatatct gcacaacagg taagagcatt     9240

gagtcgataa tcgtgaagag tcggcgagcc tggttagcca gtgctctttc cgttgtgctg     9300

aattaagcga ataccggaag cagaaccgga tcaccaaatg cgtacaggcg tcatcgccgc     9360

ccagcaacag cacaacccaa actgagccgt agccactgtc tgtcctgaat tcattagtaa     9420

tagttacgct gcggcctttt acacatgacc ttcgtgaaag cgggtggcag gaggtcgcgc     9480

taacaacctc ctgccgtttt gcccgtgcat atcggtcacg aacaaatctg attactaaac     9540

acagtagcct ggatttgttc tatcagtaat cgaccttatt cctaattaaa tagagcaaat     9600

ccccttattg ggggtaagac atgaagatgc cagaaaaaca tgacctgttg gccgccattc     9660

tcgcggcaaa ggaacaaggc atcggggcaa tccttgcgtt tgcaatggcg taccttcgcg     9720

gcagatataa tggcggtgcg tttacaaaaa cagtaatcga cgcaacgatg tgcgccatta     9780

tcgcctggtt cattcgtgac cttctcgact tcgccggact aagtagcaat ctcgcttata     9840

taacgagcgt gtttatcggc tacatcggta ctgactcgat tggttcgctt atcaaacgct     9900

tcgctgctaa aaaagccgga gtagaagatg gtagaaatca ataatcaacg taaggcgttc     9960

ctcgatatgc tggcgtggtc ggagggaact gataacggac gtcagaaaac cagaaatcat    10020

ggttatgacg tcattgtagg cggagagcta tttactgatt actccgatca ccctcgcaaa    10080

cttgtcacgc taaacccaaa actcaaatca acaggcgctt aagactggcc gtcgttttac    10140

aacacagaaa gagtttgtag aaacgcaaaa aggccatccg tcaggggcct tctgcttagt    10200

ttgatgcctg gcagttccct actctcgcct tccgcttcct cgctcactga ctcgctgcgc    10260

tcggtcgttc ggctgcggcg agcggtatca gctcactcaa aggcggtaat acggttatcc    10320

acagaatcag gggataacgc aggaaagaac atgtgagcaa aaggccagca aaaggccagg    10380

aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc tccgcccccc tgacgagcat    10440

cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga caggactata aagataccag    10500

gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc cgaccctgcc gcttaccgga    10560

tacctgtccg cctttctccc ttcgggaagc gtggcgcttt ctcatagctc acgctgtagg    10620

tatctcagtt cggtgtaggt cgttcgctcc aagctgggct gtgtgcacga accccccgtt    10680

cagcccgacc gctgcgcctt atccggtaac tatcgtcttg agtccaaccc ggtaagacac    10740

gacttatcgc cactggcagc agccactggt aacaggatta gcagagcgag gtatgtaggc    10800

ggtgctacag agttcttgaa gtggtgggct aactacggct acactagaag aacagtattt    10860

ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa gagttggtag ctcttgatcc    10920

ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt gcaagcagca gattacgcgc    10980

agaaaaaaag gatctcaaga agatcctttg atcttttcta cggggtctga cgctcagtgg    11040

aacgacgcgc gcgtaactca cgttaaggga ttttggtcat gagcttgcgc cgtcccgtca    11100

agtcagcgta atgctctgct tttagaaaaa ctcatcgagc atcaaatgaa actgcaattt    11160

attcatatca ggattatcaa taccatattt ttgaaaaagc cgtttctgta atgaaggaga    11220

aaactcaccg aggcagttcc ataggatggc aagatcctgg tatcggtctg cgattccgac    11280

tcgtccaaca tcaatacaac ctattaattt cccctcgtca aaaataaggt tatcaagtga    11340

gaaatcacca tgagtgacga ctgaatccgg tgagaatggc aaaagtttat gcatttcttt    11400

ccagacttgt tcaacaggcc agccattacg ctcgtcatca aaatcactcg catcaaccaa    11460

accgttattc attcgtgatt gcgcctgagc gaggcgaaat acgcgatcgc tgttaaaagg    11520

acaattacaa acaggaatcg agtgcaaccg gcgcaggaac actgccagcg catcaacaat    11580

attttcacct gaatcaggat attcttctaa tacctggaac gctgtttttc cggggatcgc    11640

agtggtgagt aaccatgcat catcaggagt acggataaaa tgcttgatgg tcggaagtgg    11700

cataaattcc gtcagccagt ttagtctgac catctcatct gtaacatcat tggcaacgct    11760

acctttgcca tgtttcagaa acaactctgg cgcatcgggc ttcccataca agcgatagat    11820

tgtcgcacct gattgcccga cattatcgcg agcccattta tacccatata aatcagcatc    11880

catgttggaa tttaatcgcg gcctcgacgt ttcccgttga atatggctca tattcttcct    11940

ttttcaatat tattgaagca tttatcaggg ttattgtctc atgagcggat acatatttga    12000

atgtatttag aaaaataaac aaataggggt cagtgttaca accaattaac caattctgaa    12060

cattatcgcg agcccattta tacctgaata tggctcataa caccccttgt ttgcctggcg    12120

gcagtagcgc ggtggtccca cctgacccca tgccgaactc agaagtgaaa cgccgtagcg    12180

ccgatggtag tgtggggact ccccatgcga gagtagggaa ctgccaggca tcaaataaaa    12240

cgaaaggctc agtcgaaaga ctgggccttt cgcccgggct aattaggggg tgtcgccctt    12300

attcgactct atagtgaagt tcctattctc tagaaagtat aggaacttct gaagtggggt    12360

cgacttaatt aagg                                                      12374


<210>  39
<211>  11389
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  39
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

aagatccaag ctcagatctc gatcgagttg ggccccagaa gcctggtggt tgtttgtcct      240

tctcagggga aaagtgaggc ggccccttgg aggaaggggc cgggcagaat gatctaatcg      300

gattccaagc agctcagggg attgtctttt tctagcacct tcttgccact cctaagcgtc      360

ctccgtgacc ccggctggga tttagcctgg tgctgtgtca gccccggtct cccaggggct      420

tcccagtggt ccccaggaac cctcgacagg gcccggtctc tctcgtccag caagggcagg      480

gacgggccac aggccaaggg ccctcgatcg aggaactgaa aaaccagaaa gttaactggt      540

aagtttagtc tttttgtctt ttatttcagg tcccggatcc ggtggtggtg caaatcaaag      600

aactgctcct cagtggatgt tgcctttact tctaggcctg tacggaagtg ttacttctgc      660

tctaaaagct gcggaattgt acccggcggc cgccaccatg tttaaatcgc tgacaaaagt      720

caacaaggtg aagcctatag gagagaacaa tgagaatgaa caaagttctc gtcggaatga      780

agaaggctct cacccaagta atcagtctca gcaaaccaca gcacaggaag aaaacaaagg      840

tgaagagaaa tctctcaaaa ccaagtcaac tccagtcacg tctgaagagc cacacaccaa      900

catacaagac aaactctcca agaaaaattc ctctggagat ctgaccacaa accctgaccc      960

tcaaaatgca gcagaaccaa ctggaacagt gccagagcag aaggaaatgg accccgggaa     1020

agaaggtcca aacagcccac aaaacaaacc gccagcagct cctgttataa atgagtatgc     1080

cgatgcccag ctacacaacc tggtgaaaag aatgcgtcaa agaacagccc tctacaagaa     1140

aaagttggta gagggagatc tctcctcacc cgaagccagc ccacaaactg caaagcccac     1200

ggctgtacca ccagtaaaag aaagcgatga taagccaaca gaacattact acaggctgtt     1260

gtggttcaaa gtcaaaaaga tgcctttaac agagtactta aagcgaatta aacttccaaa     1320

cagcatagat tcatacacag atcgactcta tctcctgtgg ctcttgcttg tcactcttgc     1380

ctataactgg aactgctgtt ttataccact gcgcctcgtc ttcccatatc aaaccgcaga     1440

caacatacac tactggctta ttgcggacat catctgtgat atcatctacc tttatgatat     1500

gctatttatc cagcccagac tccagtttgt aagaggagga gacataatag tggattcaaa     1560

tgagctaagg aaacactaca ggacttctac aaaatttcag ttggatgtcg catcaataat     1620

accatttgat atttgctacc tcttctttgg gtttaatcca atgtttagag caaataggat     1680

gttaaagtac acttcatttt ttgaatttaa tcatcaccta gagtctataa tggacaaagc     1740

atatatctac agagttattc gaacaactgg atacttgctg tttattctgc acattaatgc     1800

ctgtgtttat tactgggctt caaactatga aggaattggc actactagat gggtgtatga     1860

tggggaagga aacgagtatc tgagatgtta ttattgggca gttcgaactt taattaccat     1920

tggtggcctt ccagaaccac aaactttatt tgaaattgtt tttcaactct tgaatttttt     1980

ttctggagtt tttgtgttct ccagtttaat tggtcagatg agagatgtga ttggagcagc     2040

tacagccaat cagaactact tccgcgcctg catggatgac accattgcct acatgaacaa     2100

ttactccatt cctaaacttg tgcaaaagcg agttcggact tggtatgaat atacatggga     2160

ctctcaaaga atgctagatg agtctgattt gcttaagacc ctaccaacta cggtccagtt     2220

agccctcgcc attgatgtga acttcagcat catcagcaaa gttgacttgt tcaagggttg     2280

tgatacacag atgatttatg acatgttgct aagattgaaa tccgttctct atttgcctgg     2340

tgactttgtc tgcaaaaagg gagaaattgg caaggaaatg tatatcatca agcatggaga     2400

agtccaagtt cttggaggcc ctgatggtac taaagttctg gttactctga aagctgggtc     2460

ggtgtttgga gaaatcagcc ttctagcagc aggaggagga aaccgtcgaa ctgccaatgt     2520

ggtggcccac gggtttgcca atcttttaac tctagacaaa aagaccctcc aagaaattct     2580

agtgcattat ccagattctg aaagaatcct catgaagaaa gccagagtgc ttttaaagca     2640

gaaggctaag accgcagaag caacccctcc aagaaaagat cttgccctcc tcttcccacc     2700

gaaagaagag acacccaaac tgtttaaaac tctcctagga ggcacaggaa aagcaagtct     2760

tgcaagacta ctcaaattga agcgagagca agcagctcag aagaaagaaa attctgaagg     2820

aggagaggaa gaaggaaaag aaaatgaaga taaacaaaaa gaaaatgaag ataaacaaaa     2880

agaaaatgaa gataaaggaa aagaaaatga agataaagat aaaggaagag agccagaaga     2940

gaagccactg gacagacctg aatgtacagc aagtcctatt gcagtggagg aagaacccca     3000

ctcagttaga aggacagttt tacccagagg gacttctcgt caatcactca ttatcagcat     3060

ggctccttct gctgagggcg gagaagaggt tcttactatt gaagtcaaag aaaaggctaa     3120

gcaatgatca taactgcaga tctgcctcga ctgtgccttc tagttgccag ccatctgttg     3180

tttgcccctc ccccgtgcct tccttgaccc tggaaggtgc cactcccact gtcctttcct     3240

aataaaatga ggaaattgca tcgcattgtc tgagtaggtg tcattctatt ctggggggtg     3300

gggtggggca ggacagcaag ggggaggatt gggaagacaa tagcaggcat gctggggact     3360

cgagttctac gtagataagt agcatggcgg gttaatcatt aactacaagg aacccctagt     3420

gatggagttg gccactccct ctctgcgcgc tcgctcgctc actgaggccg ggcgaccaaa     3480

ggtcgcccga cgcccgggct ttgcccgggc ggcctcagtg agcgagcgag cgcgcagcct     3540

taattaacct aaggaaaatg aagtgaagtt cctatacttt ctagagaata ggaacttcta     3600

tagtgagtcg aataagggcg acacaaaatt tattctaaat gcataataaa tactgataac     3660

atcttatagt ttgtattata ttttgtatta tcgttgacat gtataatttt gatatcaaaa     3720

actgattttc cctttattat tttcgagatt tattttctta attctcttta acaaactaga     3780

aatattgtat atacaaaaaa tcataaataa tagatgaata gtttaattat aggtgttcat     3840

caatcgaaaa agcaacgtat cttatttaaa gtgcgttgct tttttctcat ttataaggtt     3900

aaataattct catatatcaa gcaaagtgac aggcgccctt aaatattctg acaaatgctc     3960

tttccctaaa ctccccccat aaaaaaaccc gccgaagcgg gtttttacgt tatttgcgga     4020

ttaacgatta ctcgttatca gaaccgccca gggggcccga gcttaacctt tttatttggg     4080

ggagagggaa gtcatgaaaa aactaacctt tgaaattcga tctccagcac atcagcaaaa     4140

cgctattcac gcagtacagc aaatccttcc agacccaacc aaaccaatcg tagtaaccat     4200

tcaggaacgc aaccgcagct tagaccaaaa caggaagcta tgggcctgct taggtgacgt     4260

ctctcgtcag gttgaatggc atggtcgctg gctggatgca gaaagctgga agtgtgtgtt     4320

taccgcagca ttaaagcagc aggatgttgt tcctaacctt gccgggaatg gctttgtggt     4380

aataggccag tcaaccagca ggatgcgtgt aggcgaattt gcggagctat tagagcttat     4440

acaggcattc ggtacagagc gtggcgttaa gtggtcagac gaagcgagac tggctctgga     4500

gtggaaagcg agatggggag acagggctgc atgataaatg tcgttagttt ctccggtggc     4560

aggacgtcag catatttgct ctggctaatg gagcaaaagc gacgggcagg taaagacgtg     4620

cattacgttt tcatggatac aggttgtgaa catccaatga catatcggtt tgtcagggaa     4680

gttgtgaagt tctgggatat accgctcacc gtattgcagg ttgatatcaa cccggagctt     4740

ggacagccaa atggttatac ggtatgggaa ccaaaggata ttcagacgcg aatgcctgtt     4800

ctgaagccat ttatcgatat ggtaaagaaa tatggcactc catacgtcgg cggcgcgttc     4860

tgcactgaca gattaaaact cgttcccttc accaaatact gtgatgacca tttcgggcga     4920

gggaattaca ccacgtggat tggcatcaga gctgatgaac cgaagcggct aaagccaaag     4980

cctggaatca gatatcttgc tgaactgtca gactttgaga aggaagatat cctcgcatgg     5040

tggaagcaac aaccattcga tttgcaaata ccggaacatc tcggtaactg catattctgc     5100

attaaaaaat caacgcaaaa aatcggactt gcctgcaaag atgaggaggg attgcagcgt     5160

gtttttaatg aggtcatcac gggatcccat gtgcgtgacg gacatcggga aacgccaaag     5220

gagattatgt accgaggaag aatgtcgctg gacggtatcg cgaaaatgta ttcagaaaat     5280

gattatcaag ccctgtatca ggacatggta cgagctaaaa gattcgatac cggctcttgt     5340

tctgagtcat gcgaaatatt tggagggcag cttgatttcg acttcgggag ggaagctgca     5400

tgatgcgatg ttatcggtgc ggtgaatgca aagaagataa ccgcttccga ccaaatcaac     5460

cttactggaa tcgatggtgt ctccggtgtg aaagaacacc aacaggggtg ttaccactac     5520

cgcaggaaaa ggaggacgtg tggcgagaca gcgacgaagt atcaccgaca taatctgcga     5580

aaactgcaaa taccttccaa cgaaacgcac cagaaataaa cccaagccaa tcccaaaaga     5640

atctgacgta aaaaccttca actacacggc tcacctgtgg gatatccggt ggctaagacg     5700

tcgtgcgagg aaaacaaggt gattgaccaa aatcgaagtt acgaacaaga aagcgtcgag     5760

cgagctttaa cgtgcgctaa ctgcggtcag aagctgcatg tgctggaagt tcacgtgtgt     5820

gagcactgct gcgcagaact gatgagcgat ccgaatagct cgatgcacga ggaagaagat     5880

gatggctaaa ccagcgcgaa gacgatgtaa aaacgatgaa tgccgggaat ggtttcaccc     5940

tgcattcgct aatcagtggt ggtgctctcc agagtgtgga accaagatag cactcgaacg     6000

acgaagtaaa gaacgcgaaa aagcggaaaa agcagcagag aagaaacgac gacgagagga     6060

gcagaaacag aaagataaac ttaagattcg aaaactcgcc ttaaagcccc gcagttactg     6120

gattaaacaa gcccaacaag ccgtaaacgc cttcatcaga gaaagagacc gcgacttacc     6180

atgtatctcg tgcggaacgc tcacgtctgc tcagtgggat gccggacatt accggacaac     6240

tgctgcggca cctcaactcc gatttaatga acgcaatatt cacaagcaat gcgtggtgtg     6300

caaccagcac aaaagcggaa atctcgttcc gtatcgcgtc gaactgatta gccgcatcgg     6360

gcaggaagca gtagacgaaa tcgaatcaaa ccataaccgc catcgctgga ctatcgaaga     6420

gtgcaaggcg atcaaggcag agtaccaaca gaaactcaaa gacctgcgaa atagcagaag     6480

tgaggccgca tgacgttctc agtaaaaacc attccagaca tgctcgttga agcatacgga     6540

aatcagacag aagtagcacg cagactgaaa tgtagtcgcg gtacggtcag aaaatacgtt     6600

gatgataaag acgggaaaat gcacgccatc gtcaacgacg ttctcatggt tcatcgcgga     6660

tggagtgaaa gagatgcgct attacgaaaa aattgatggc agcaaatacc gaaatatttg     6720

ggtagttggc gatctgcacg gatgctacac gaacctgatg aacaaactgg atacgattgg     6780

attcgacaac aaaaaagacc tgcttatctc ggtgggcgat ttggttgatc gtggtgcaga     6840

gaacgttgaa tgcctggaat taatcacatt cccctggttc agagctgtac gtggaaacca     6900

tgagcaaatg atgattgatg gcttatcaga gcgtggaaac gttaatcact ggctgcttaa     6960

tggcggtggc tggttcttta atctcgatta cgacaaagaa attctggcta aagctcttgc     7020

ccataaagca gatgaacttc cgttaatcat cgaactggtg agcaaagata aaaaatatgt     7080

tatctgccac gccgattatc cctttgacga atacgagttt ggaaagccag ttgatcatca     7140

gcaggtaatc tggaaccgcg aacgaatcag caactcacaa aacgggatcg tgaaagaaat     7200

caaaggcgcg gacacgttca tctttggtca tacgccagca gtgaaaccac tcaagtttgc     7260

caaccaaatg tatatcgata ccggcgcagt gttctgcgga aacctaacat tgattcaggt     7320

acagggagaa ggcgcatgag actcgaaagc gtagctaaat ttcattcgcc aaaaagcccg     7380

atgatgagcg actcaccacg ggccacggct tctgactctc tttccggtac tgatgtgatg     7440

gctgctatgg ggatggcgca atcacaagcc ggattcggta tggctgcatt ctgcggtaag     7500

cacgaactca gccagaacga caaacaaaag gctatcaact atctgatgca atttgcacac     7560

aaggtatcgg ggaaataccg tggtgtggca aagcttgaag gaaatactaa ggcaaaggta     7620

ctgcaagtgc tcgcaacatt cgcttatgcg gattattgcc gtagtgccgc gacgccgggg     7680

gcaagatgca gagattgcca tggtacaggc cgtgcggttg atattgccaa aacagagctg     7740

tgggggagag ttgtcgagaa agagtgcgga agatgcaaag gcgtcggcta ttcaaggatg     7800

ccagcaagcg cagcatatcg cgctgtgacg atgctaatcc caaaccttac ccaacccacc     7860

tggtcacgca ctgttaagcc gctgtatgac gctctggtgg tgcaatgcca caaagaagag     7920

tcaatcgcag acaacatttt gaatgcggtc acacgttagc agcatgattg ccacggatgg     7980

caacatatta acggcatgat attgacttat tgaataaaat tgggtaaatt tgactcaacg     8040

atgggttaat tcgctcgttg tggtagtgag atgaaaagag gcggcgctta ctaccgattc     8100

cgcctagttg gtcacttcga cgtatcgtct ggaactccaa ccatcgcagg cagagaggtc     8160

tgcaaaatgc aatcccgaaa cagttcgcag gtaatagtta gagcctgcat aacggtttcg     8220

ggatttttta tatctgcaca acaggtaaga gcattgagtc gataatcgtg aagagtcggc     8280

gagcctggtt agccagtgct ctttccgttg tgctgaatta agcgaatacc ggaagcagaa     8340

ccggatcacc aaatgcgtac aggcgtcatc gccgcccagc aacagcacaa cccaaactga     8400

gccgtagcca ctgtctgtcc tgaattcatt agtaatagtt acgctgcggc cttttacaca     8460

tgaccttcgt gaaagcgggt ggcaggaggt cgcgctaaca acctcctgcc gttttgcccg     8520

tgcatatcgg tcacgaacaa atctgattac taaacacagt agcctggatt tgttctatca     8580

gtaatcgacc ttattcctaa ttaaatagag caaatcccct tattgggggt aagacatgaa     8640

gatgccagaa aaacatgacc tgttggccgc cattctcgcg gcaaaggaac aaggcatcgg     8700

ggcaatcctt gcgtttgcaa tggcgtacct tcgcggcaga tataatggcg gtgcgtttac     8760

aaaaacagta atcgacgcaa cgatgtgcgc cattatcgcc tggttcattc gtgaccttct     8820

cgacttcgcc ggactaagta gcaatctcgc ttatataacg agcgtgttta tcggctacat     8880

cggtactgac tcgattggtt cgcttatcaa acgcttcgct gctaaaaaag ccggagtaga     8940

agatggtaga aatcaataat caacgtaagg cgttcctcga tatgctggcg tggtcggagg     9000

gaactgataa cggacgtcag aaaaccagaa atcatggtta tgacgtcatt gtaggcggag     9060

agctatttac tgattactcc gatcaccctc gcaaacttgt cacgctaaac ccaaaactca     9120

aatcaacagg cgcttaagac tggccgtcgt tttacaacac agaaagagtt tgtagaaacg     9180

caaaaaggcc atccgtcagg ggccttctgc ttagtttgat gcctggcagt tccctactct     9240

cgccttccgc ttcctcgctc actgactcgc tgcgctcggt cgttcggctg cggcgagcgg     9300

tatcagctca ctcaaaggcg gtaatacggt tatccacaga atcaggggat aacgcaggaa     9360

agaacatgtg agcaaaaggc cagcaaaagg ccaggaaccg taaaaaggcc gcgttgctgg     9420

cgtttttcca taggctccgc ccccctgacg agcatcacaa aaatcgacgc tcaagtcaga     9480

ggtggcgaaa cccgacagga ctataaagat accaggcgtt tccccctgga agctccctcg     9540

tgcgctctcc tgttccgacc ctgccgctta ccggatacct gtccgccttt ctcccttcgg     9600

gaagcgtggc gctttctcat agctcacgct gtaggtatct cagttcggtg taggtcgttc     9660

gctccaagct gggctgtgtg cacgaacccc ccgttcagcc cgaccgctgc gccttatccg     9720

gtaactatcg tcttgagtcc aacccggtaa gacacgactt atcgccactg gcagcagcca     9780

ctggtaacag gattagcaga gcgaggtatg taggcggtgc tacagagttc ttgaagtggt     9840

gggctaacta cggctacact agaagaacag tatttggtat ctgcgctctg ctgaagccag     9900

ttaccttcgg aaaaagagtt ggtagctctt gatccggcaa acaaaccacc gctggtagcg     9960

gtggtttttt tgtttgcaag cagcagatta cgcgcagaaa aaaaggatct caagaagatc    10020

ctttgatctt ttctacgggg tctgacgctc agtggaacga cgcgcgcgta actcacgtta    10080

agggattttg gtcatgagct tgcgccgtcc cgtcaagtca gcgtaatgct ctgcttttag    10140

aaaaactcat cgagcatcaa atgaaactgc aatttattca tatcaggatt atcaatacca    10200

tatttttgaa aaagccgttt ctgtaatgaa ggagaaaact caccgaggca gttccatagg    10260

atggcaagat cctggtatcg gtctgcgatt ccgactcgtc caacatcaat acaacctatt    10320

aatttcccct cgtcaaaaat aaggttatca agtgagaaat caccatgagt gacgactgaa    10380

tccggtgaga atggcaaaag tttatgcatt tctttccaga cttgttcaac aggccagcca    10440

ttacgctcgt catcaaaatc actcgcatca accaaaccgt tattcattcg tgattgcgcc    10500

tgagcgaggc gaaatacgcg atcgctgtta aaaggacaat tacaaacagg aatcgagtgc    10560

aaccggcgca ggaacactgc cagcgcatca acaatatttt cacctgaatc aggatattct    10620

tctaatacct ggaacgctgt ttttccgggg atcgcagtgg tgagtaacca tgcatcatca    10680

ggagtacgga taaaatgctt gatggtcgga agtggcataa attccgtcag ccagtttagt    10740

ctgaccatct catctgtaac atcattggca acgctacctt tgccatgttt cagaaacaac    10800

tctggcgcat cgggcttccc atacaagcga tagattgtcg cacctgattg cccgacatta    10860

tcgcgagccc atttataccc atataaatca gcatccatgt tggaatttaa tcgcggcctc    10920

gacgtttccc gttgaatatg gctcatattc ttcctttttc aatattattg aagcatttat    10980

cagggttatt gtctcatgag cggatacata tttgaatgta tttagaaaaa taaacaaata    11040

ggggtcagtg ttacaaccaa ttaaccaatt ctgaacatta tcgcgagccc atttatacct    11100

gaatatggct cataacaccc cttgtttgcc tggcggcagt agcgcggtgg tcccacctga    11160

ccccatgccg aactcagaag tgaaacgccg tagcgccgat ggtagtgtgg ggactcccca    11220

tgcgagagta gggaactgcc aggcatcaaa taaaacgaaa ggctcagtcg aaagactggg    11280

cctttcgccc gggctaatta gggggtgtcg cccttattcg actctatagt gaagttccta    11340

ttctctagaa agtataggaa cttctgaagt ggggtcgact taattaagg                11389


<210>  40
<211>  11388
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  40
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

aagatccaag ctcagatctc gatcgagttg ggccccagaa gcctggtggt tgtttgtcct      240

tctcagggga aaagtgaggc ggccccttgg aggaaggggc cgggcagaat gatctaatcg      300

gattccaagc agctcagggg attgtctttt tctagcacct tcttgccact cctaagcgtc      360

ctccgtgacc ccggctggga tttagcctgg tgctgtgtca gccccggtct cccaggggct      420

tcccagtggt ccccaggaac cctcgacagg gcccggtctc tctcgtccag caagggcagg      480

gacgggccac aggccaaggg ccctcgatcg aggaactgaa aaaccagaaa gttaactggt      540

aagtttagtc tttttgtctt ttatttcagg tcccggatcc ggtggtggtg caaatcaaag      600

aactgctcct cagtggatgt tgcctttact tctaggcctg tacggaagtg ttacttctgc      660

tctaaaagct gcggaattgt acccgcggcc gccaccatgt tcaagtccct caccaaagtc      720

aacaaggtca agcccatcgg agagaacaac gagaatgagc agagctctcg gcgcaacgaa      780

gaaggatccc atccgtcgaa ccagtcacag cagactaccg cacaggagga gaacaaggga      840

gaagaaaagt cgctcaagac taagtccacc cccgtgacct cggaagaacc gcacacgaac      900

attcaggaca agctgtccaa gaagaactcc tccggcgatc tcacgactaa cccggacccc      960

cagaatgccg ctgaacctac tgggaccgtg cctgagcaaa aggagatgga ccccggaaag     1020

gagggtccta actcccccca aaacaagccc ccggccgcgc cggtcatcaa tgagtacgcg     1080

gacgcgcaac tgcataacct cgtgaagcgg atgcggcaaa gaaccgccct ctacaagaag     1140

aaactggtgg agggcgacct gagctcacct gaagccagcc cacagaccgc caaacccacc     1200

gccgtgccgc ctgtgaagga gtccgatgac aagcctaccg agcactacta ccgcctgctg     1260

tggttcaagg tcaagaagat gcccctgacc gaatacctca agcggatcaa gctgccgaac     1320

agcatcgaca gctacaccga ccggctttac ttgctctggc tgctgcttgt gaccctggct     1380

tacaactgga actgttgttt cattcccctg cggctggtgt tcccttacca aaccgcggat     1440

aacattcact actggctgat tgccgacatc atttgcgaca tcatctacct gtacgatatg     1500

ctttttatcc aaccgcggct gcaattcgtc cgcgggggag acatcattgt ggactccaac     1560

gagctgcgca agcattaccg gacctcgaca aagttccagc tggatgtggc ctccatcatc     1620

ccgttcgata tctgttacct gttctttggc ttcaacccga tgttcagggc gaacaggatg     1680

ctgaagtaca cttccttctt cgaattcaac caccacctgg agtccatcat ggacaaggct     1740

tacatctacc gcgtgatccg gaccactggt tacctcctgt tcatcctgca catcaacgcc     1800

tgcgtctatt actgggcctc aaactacgaa ggcattggta ccacccgctg ggtgtacgac     1860

ggggagggaa acgagtatct gcgctgctac tactgggccg tgcgaaccct cataactatt     1920

ggcggcctcc cggaaccgca gaccctgttc gagatcgtgt tccaactcct caacttcttc     1980

tcgggagtgt tcgtgttttc aagcttgatt ggacagatgc gggacgtgat cggtgcagca     2040

actgccaacc agaactactt tcgcgcctgc atggacgaca ctatcgcgta catgaacaac     2100

tattcgatcc ccaagctggt gcagaaacgc gtgcggactt ggtatgagta cacttgggac     2160

tcccagagaa tgcttgacga gtccgatctg ctcaagaccc tgcctactac cgtgcagctg     2220

gcactcgcca tcgatgtgaa cttctccatt atctcgaaag tcgatctgtt caagggctgc     2280

gacacccaga tgatctacga catgctgctg agactcaagt ccgtgttgta cctccctggc     2340

gacttcgtgt gcaagaaggg cgaaatcggg aaggagatgt acattatcaa gcacggagaa     2400

gtccaggtgc tggggggacc agacggtacc aaggtccttg tcaccctgaa ggccgggtcc     2460

gtgttcggcg aaatttccct gttggccgcc ggcggtggca acaggagaac cgcaaatgtg     2520

gtggcccacg gcttcgcaaa ccttctgacc ctggacaaga aaaccctcca ggaaatcctc     2580

gtgcactacc cggatagcga gcggatcctg atgaagaaag cccgggtgct gctgaagcaa     2640

aaggccaaga ccgccgaagc caccccgcct cggaaggacc tggctctgct gttcccaccc     2700

aaggaggaga ctcccaaact gtttaagacc ctcttgggcg ggacgggaaa ggcctccctc     2760

gctcgcttgc ttaagttgaa gagggagcag gccgcgcaga agaaggaaaa ctccgaagga     2820

ggggaagaag agggaaagga aaacgaagat aagcagaagg agaacgagga taagcaaaag     2880

gaaaatgagg acaaggggaa agaaaacgag gacaaggata agggtcgcga acctgaagag     2940

aagccgctgg atcggccaga gtgcactgcc tcgcctatcg cggtcgaaga ggaaccccat     3000

agcgtgcgca gaaccgtgct gcctagaggc acatcgaggc agtcactgat tatctctatg     3060

gcaccaagcg ccgagggagg agaggaagtg ctcaccatcg aggtcaagga aaaagcgaag     3120

cagtgatcat aactgcagat ctgcctcgac tgtgccttct agttgccagc catctgttgt     3180

ttgcccctcc cccgtgcctt ccttgaccct ggaaggtgcc actcccactg tcctttccta     3240

ataaaatgag gaaattgcat cgcattgtct gagtaggtgt cattctattc tggggggtgg     3300

ggtggggcag gacagcaagg gggaggattg ggaagacaat agcaggcatg ctggggactc     3360

gagttctacg tagataagta gcatggcggg ttaatcatta actacaagga acccctagtg     3420

atggagttgg ccactccctc tctgcgcgct cgctcgctca ctgaggccgg gcgaccaaag     3480

gtcgcccgac gcccgggctt tgcccgggcg gcctcagtga gcgagcgagc gcgcagcctt     3540

aattaaccta aggaaaatga agtgaagttc ctatactttc tagagaatag gaacttctat     3600

agtgagtcga ataagggcga cacaaaattt attctaaatg cataataaat actgataaca     3660

tcttatagtt tgtattatat tttgtattat cgttgacatg tataattttg atatcaaaaa     3720

ctgattttcc ctttattatt ttcgagattt attttcttaa ttctctttaa caaactagaa     3780

atattgtata tacaaaaaat cataaataat agatgaatag tttaattata ggtgttcatc     3840

aatcgaaaaa gcaacgtatc ttatttaaag tgcgttgctt ttttctcatt tataaggtta     3900

aataattctc atatatcaag caaagtgaca ggcgccctta aatattctga caaatgctct     3960

ttccctaaac tccccccata aaaaaacccg ccgaagcggg tttttacgtt atttgcggat     4020

taacgattac tcgttatcag aaccgcccag ggggcccgag cttaaccttt ttatttgggg     4080

gagagggaag tcatgaaaaa actaaccttt gaaattcgat ctccagcaca tcagcaaaac     4140

gctattcacg cagtacagca aatccttcca gacccaacca aaccaatcgt agtaaccatt     4200

caggaacgca accgcagctt agaccaaaac aggaagctat gggcctgctt aggtgacgtc     4260

tctcgtcagg ttgaatggca tggtcgctgg ctggatgcag aaagctggaa gtgtgtgttt     4320

accgcagcat taaagcagca ggatgttgtt cctaaccttg ccgggaatgg ctttgtggta     4380

ataggccagt caaccagcag gatgcgtgta ggcgaatttg cggagctatt agagcttata     4440

caggcattcg gtacagagcg tggcgttaag tggtcagacg aagcgagact ggctctggag     4500

tggaaagcga gatggggaga cagggctgca tgataaatgt cgttagtttc tccggtggca     4560

ggacgtcagc atatttgctc tggctaatgg agcaaaagcg acgggcaggt aaagacgtgc     4620

attacgtttt catggataca ggttgtgaac atccaatgac atatcggttt gtcagggaag     4680

ttgtgaagtt ctgggatata ccgctcaccg tattgcaggt tgatatcaac ccggagcttg     4740

gacagccaaa tggttatacg gtatgggaac caaaggatat tcagacgcga atgcctgttc     4800

tgaagccatt tatcgatatg gtaaagaaat atggcactcc atacgtcggc ggcgcgttct     4860

gcactgacag attaaaactc gttcccttca ccaaatactg tgatgaccat ttcgggcgag     4920

ggaattacac cacgtggatt ggcatcagag ctgatgaacc gaagcggcta aagccaaagc     4980

ctggaatcag atatcttgct gaactgtcag actttgagaa ggaagatatc ctcgcatggt     5040

ggaagcaaca accattcgat ttgcaaatac cggaacatct cggtaactgc atattctgca     5100

ttaaaaaatc aacgcaaaaa atcggacttg cctgcaaaga tgaggaggga ttgcagcgtg     5160

tttttaatga ggtcatcacg ggatcccatg tgcgtgacgg acatcgggaa acgccaaagg     5220

agattatgta ccgaggaaga atgtcgctgg acggtatcgc gaaaatgtat tcagaaaatg     5280

attatcaagc cctgtatcag gacatggtac gagctaaaag attcgatacc ggctcttgtt     5340

ctgagtcatg cgaaatattt ggagggcagc ttgatttcga cttcgggagg gaagctgcat     5400

gatgcgatgt tatcggtgcg gtgaatgcaa agaagataac cgcttccgac caaatcaacc     5460

ttactggaat cgatggtgtc tccggtgtga aagaacacca acaggggtgt taccactacc     5520

gcaggaaaag gaggacgtgt ggcgagacag cgacgaagta tcaccgacat aatctgcgaa     5580

aactgcaaat accttccaac gaaacgcacc agaaataaac ccaagccaat cccaaaagaa     5640

tctgacgtaa aaaccttcaa ctacacggct cacctgtggg atatccggtg gctaagacgt     5700

cgtgcgagga aaacaaggtg attgaccaaa atcgaagtta cgaacaagaa agcgtcgagc     5760

gagctttaac gtgcgctaac tgcggtcaga agctgcatgt gctggaagtt cacgtgtgtg     5820

agcactgctg cgcagaactg atgagcgatc cgaatagctc gatgcacgag gaagaagatg     5880

atggctaaac cagcgcgaag acgatgtaaa aacgatgaat gccgggaatg gtttcaccct     5940

gcattcgcta atcagtggtg gtgctctcca gagtgtggaa ccaagatagc actcgaacga     6000

cgaagtaaag aacgcgaaaa agcggaaaaa gcagcagaga agaaacgacg acgagaggag     6060

cagaaacaga aagataaact taagattcga aaactcgcct taaagccccg cagttactgg     6120

attaaacaag cccaacaagc cgtaaacgcc ttcatcagag aaagagaccg cgacttacca     6180

tgtatctcgt gcggaacgct cacgtctgct cagtgggatg ccggacatta ccggacaact     6240

gctgcggcac ctcaactccg atttaatgaa cgcaatattc acaagcaatg cgtggtgtgc     6300

aaccagcaca aaagcggaaa tctcgttccg tatcgcgtcg aactgattag ccgcatcggg     6360

caggaagcag tagacgaaat cgaatcaaac cataaccgcc atcgctggac tatcgaagag     6420

tgcaaggcga tcaaggcaga gtaccaacag aaactcaaag acctgcgaaa tagcagaagt     6480

gaggccgcat gacgttctca gtaaaaacca ttccagacat gctcgttgaa gcatacggaa     6540

atcagacaga agtagcacgc agactgaaat gtagtcgcgg tacggtcaga aaatacgttg     6600

atgataaaga cgggaaaatg cacgccatcg tcaacgacgt tctcatggtt catcgcggat     6660

ggagtgaaag agatgcgcta ttacgaaaaa attgatggca gcaaataccg aaatatttgg     6720

gtagttggcg atctgcacgg atgctacacg aacctgatga acaaactgga tacgattgga     6780

ttcgacaaca aaaaagacct gcttatctcg gtgggcgatt tggttgatcg tggtgcagag     6840

aacgttgaat gcctggaatt aatcacattc ccctggttca gagctgtacg tggaaaccat     6900

gagcaaatga tgattgatgg cttatcagag cgtggaaacg ttaatcactg gctgcttaat     6960

ggcggtggct ggttctttaa tctcgattac gacaaagaaa ttctggctaa agctcttgcc     7020

cataaagcag atgaacttcc gttaatcatc gaactggtga gcaaagataa aaaatatgtt     7080

atctgccacg ccgattatcc ctttgacgaa tacgagtttg gaaagccagt tgatcatcag     7140

caggtaatct ggaaccgcga acgaatcagc aactcacaaa acgggatcgt gaaagaaatc     7200

aaaggcgcgg acacgttcat ctttggtcat acgccagcag tgaaaccact caagtttgcc     7260

aaccaaatgt atatcgatac cggcgcagtg ttctgcggaa acctaacatt gattcaggta     7320

cagggagaag gcgcatgaga ctcgaaagcg tagctaaatt tcattcgcca aaaagcccga     7380

tgatgagcga ctcaccacgg gccacggctt ctgactctct ttccggtact gatgtgatgg     7440

ctgctatggg gatggcgcaa tcacaagccg gattcggtat ggctgcattc tgcggtaagc     7500

acgaactcag ccagaacgac aaacaaaagg ctatcaacta tctgatgcaa tttgcacaca     7560

aggtatcggg gaaataccgt ggtgtggcaa agcttgaagg aaatactaag gcaaaggtac     7620

tgcaagtgct cgcaacattc gcttatgcgg attattgccg tagtgccgcg acgccggggg     7680

caagatgcag agattgccat ggtacaggcc gtgcggttga tattgccaaa acagagctgt     7740

gggggagagt tgtcgagaaa gagtgcggaa gatgcaaagg cgtcggctat tcaaggatgc     7800

cagcaagcgc agcatatcgc gctgtgacga tgctaatccc aaaccttacc caacccacct     7860

ggtcacgcac tgttaagccg ctgtatgacg ctctggtggt gcaatgccac aaagaagagt     7920

caatcgcaga caacattttg aatgcggtca cacgttagca gcatgattgc cacggatggc     7980

aacatattaa cggcatgata ttgacttatt gaataaaatt gggtaaattt gactcaacga     8040

tgggttaatt cgctcgttgt ggtagtgaga tgaaaagagg cggcgcttac taccgattcc     8100

gcctagttgg tcacttcgac gtatcgtctg gaactccaac catcgcaggc agagaggtct     8160

gcaaaatgca atcccgaaac agttcgcagg taatagttag agcctgcata acggtttcgg     8220

gattttttat atctgcacaa caggtaagag cattgagtcg ataatcgtga agagtcggcg     8280

agcctggtta gccagtgctc tttccgttgt gctgaattaa gcgaataccg gaagcagaac     8340

cggatcacca aatgcgtaca ggcgtcatcg ccgcccagca acagcacaac ccaaactgag     8400

ccgtagccac tgtctgtcct gaattcatta gtaatagtta cgctgcggcc ttttacacat     8460

gaccttcgtg aaagcgggtg gcaggaggtc gcgctaacaa cctcctgccg ttttgcccgt     8520

gcatatcggt cacgaacaaa tctgattact aaacacagta gcctggattt gttctatcag     8580

taatcgacct tattcctaat taaatagagc aaatcccctt attgggggta agacatgaag     8640

atgccagaaa aacatgacct gttggccgcc attctcgcgg caaaggaaca aggcatcggg     8700

gcaatccttg cgtttgcaat ggcgtacctt cgcggcagat ataatggcgg tgcgtttaca     8760

aaaacagtaa tcgacgcaac gatgtgcgcc attatcgcct ggttcattcg tgaccttctc     8820

gacttcgccg gactaagtag caatctcgct tatataacga gcgtgtttat cggctacatc     8880

ggtactgact cgattggttc gcttatcaaa cgcttcgctg ctaaaaaagc cggagtagaa     8940

gatggtagaa atcaataatc aacgtaaggc gttcctcgat atgctggcgt ggtcggaggg     9000

aactgataac ggacgtcaga aaaccagaaa tcatggttat gacgtcattg taggcggaga     9060

gctatttact gattactccg atcaccctcg caaacttgtc acgctaaacc caaaactcaa     9120

atcaacaggc gcttaagact ggccgtcgtt ttacaacaca gaaagagttt gtagaaacgc     9180

aaaaaggcca tccgtcaggg gccttctgct tagtttgatg cctggcagtt ccctactctc     9240

gccttccgct tcctcgctca ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt     9300

atcagctcac tcaaaggcgg taatacggtt atccacagaa tcaggggata acgcaggaaa     9360

gaacatgtga gcaaaaggcc agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc     9420

gtttttccat aggctccgcc cccctgacga gcatcacaaa aatcgacgct caagtcagag     9480

gtggcgaaac ccgacaggac tataaagata ccaggcgttt ccccctggaa gctccctcgt     9540

gcgctctcct gttccgaccc tgccgcttac cggatacctg tccgcctttc tcccttcggg     9600

aagcgtggcg ctttctcata gctcacgctg taggtatctc agttcggtgt aggtcgttcg     9660

ctccaagctg ggctgtgtgc acgaaccccc cgttcagccc gaccgctgcg ccttatccgg     9720

taactatcgt cttgagtcca acccggtaag acacgactta tcgccactgg cagcagccac     9780

tggtaacagg attagcagag cgaggtatgt aggcggtgct acagagttct tgaagtggtg     9840

ggctaactac ggctacacta gaagaacagt atttggtatc tgcgctctgc tgaagccagt     9900

taccttcgga aaaagagttg gtagctcttg atccggcaaa caaaccaccg ctggtagcgg     9960

tggttttttt gtttgcaagc agcagattac gcgcagaaaa aaaggatctc aagaagatcc    10020

tttgatcttt tctacggggt ctgacgctca gtggaacgac gcgcgcgtaa ctcacgttaa    10080

gggattttgg tcatgagctt gcgccgtccc gtcaagtcag cgtaatgctc tgcttttaga    10140

aaaactcatc gagcatcaaa tgaaactgca atttattcat atcaggatta tcaataccat    10200

atttttgaaa aagccgtttc tgtaatgaag gagaaaactc accgaggcag ttccatagga    10260

tggcaagatc ctggtatcgg tctgcgattc cgactcgtcc aacatcaata caacctatta    10320

atttcccctc gtcaaaaata aggttatcaa gtgagaaatc accatgagtg acgactgaat    10380

ccggtgagaa tggcaaaagt ttatgcattt ctttccagac ttgttcaaca ggccagccat    10440

tacgctcgtc atcaaaatca ctcgcatcaa ccaaaccgtt attcattcgt gattgcgcct    10500

gagcgaggcg aaatacgcga tcgctgttaa aaggacaatt acaaacagga atcgagtgca    10560

accggcgcag gaacactgcc agcgcatcaa caatattttc acctgaatca ggatattctt    10620

ctaatacctg gaacgctgtt tttccgggga tcgcagtggt gagtaaccat gcatcatcag    10680

gagtacggat aaaatgcttg atggtcggaa gtggcataaa ttccgtcagc cagtttagtc    10740

tgaccatctc atctgtaaca tcattggcaa cgctaccttt gccatgtttc agaaacaact    10800

ctggcgcatc gggcttccca tacaagcgat agattgtcgc acctgattgc ccgacattat    10860

cgcgagccca tttataccca tataaatcag catccatgtt ggaatttaat cgcggcctcg    10920

acgtttcccg ttgaatatgg ctcatattct tcctttttca atattattga agcatttatc    10980

agggttattg tctcatgagc ggatacatat ttgaatgtat ttagaaaaat aaacaaatag    11040

gggtcagtgt tacaaccaat taaccaattc tgaacattat cgcgagccca tttatacctg    11100

aatatggctc ataacacccc ttgtttgcct ggcggcagta gcgcggtggt cccacctgac    11160

cccatgccga actcagaagt gaaacgccgt agcgccgatg gtagtgtggg gactccccat    11220

gcgagagtag ggaactgcca ggcatcaaat aaaacgaaag gctcagtcga aagactgggc    11280

ctttcgcccg ggctaattag ggggtgtcgc ccttattcga ctctatagtg aagttcctat    11340

tctctagaaa gtataggaac ttctgaagtg gggtcgactt aattaagg                 11388


<210>  41
<211>  11782
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  41
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

ctgaagagac agaaatatct ctaattccat gagcggtcat acgaggcaag agaagccgct      240

tagagcatgg acttagttag tttcagggat tggacagagt caagagctgg ggtgaggagg      300

ttaccctcgg taggggtgac acagatgtca accgcctatt ccctccacat gcatgtcctg      360

ccagaagaac ctgtccctgg gctgggaatc ttatattacc ttcctctcca atgagaagag      420

aagttcaagg ctcacagaca tgtgcataca caagctcaat gcactcaaga ttcccctcca      480

ccactcctgc ccccactacc tacaggagat tgactcctgc tgtgcacata agctgggata      540

atcagggttt ctaaacatca gcttcaaaag tccaatgtcc aaagtggtgg ggggccgggg      600

aacgaggtac tctttccata cccttggctt ttgtgtggcc tggagccgct gatatagaga      660

ttggagtggg acacgaggta ttcctttcaa aaacacaaag gcctatactt tgagccctcc      720

catttcaatc ccccaccatg cttcaccttt aagacctcca actccacttt gatcccagtt      780

ctcaggttca ggcctcacaa ggccaaaatc ctgaagttac ccttctcaaa ctcccttgcc      840

tttaacatca tcagaatcaa cctcctaccc ccactctgtc ccagcagcaa tagcctgcta      900

atcttttagc actaatcttt taggcactaa tctgctttcc aaactcttgg cacctgaact      960

atttataagc agtgttttat gcccccccac caaagaaccc tattcttttc ccatgacccc     1020

accaatcaaa acactcagag gactgtgggt ataagaggct ggggaggcag gcatagcagc     1080

ggccgccacc atgtttaaat cgctgacaaa agtcaacaag gtgaagccta taggagagaa     1140

caatgagaat gaacaaagtt ctcgtcggaa tgaagaaggc tctcacccaa gtaatcagtc     1200

tcagcaaacc acagcacagg aagaaaacaa aggtgaagag aaatctctca aaaccaagtc     1260

aactccagtc acgtctgaag agccacacac caacatacaa gacaaactct ccaagaaaaa     1320

ttcctctgga gatctgacca caaaccctga ccctcaaaat gcagcagaac caactggaac     1380

agtgccagag cagaaggaaa tggaccccgg gaaagaaggt ccaaacagcc cacaaaacaa     1440

accgccagca gctcctgtta taaatgagta tgccgatgcc cagctacaca acctggtgaa     1500

aagaatgcgt caaagaacag ccctctacaa gaaaaagttg gtagagggag atctctcctc     1560

acccgaagcc agcccacaaa ctgcaaagcc cacggctgta ccaccagtaa aagaaagcga     1620

tgataagcca acagaacatt actacaggct gttgtggttc aaagtcaaaa agatgccttt     1680

aacagagtac ttaaagcgaa ttaaacttcc aaacagcata gattcataca cagatcgact     1740

ctatctcctg tggctcttgc ttgtcactct tgcctataac tggaactgct gttttatacc     1800

actgcgcctc gtcttcccat atcaaaccgc agacaacata cactactggc ttattgcgga     1860

catcatctgt gatatcatct acctttatga tatgctattt atccagccca gactccagtt     1920

tgtaagagga ggagacataa tagtggattc aaatgagcta aggaaacact acaggacttc     1980

tacaaaattt cagttggatg tcgcatcaat aataccattt gatatttgct acctcttctt     2040

tgggtttaat ccaatgttta gagcaaatag gatgttaaag tacacttcat tttttgaatt     2100

taatcatcac ctagagtcta taatggacaa agcatatatc tacagagtta ttcgaacaac     2160

tggatacttg ctgtttattc tgcacattaa tgcctgtgtt tattactggg cttcaaacta     2220

tgaaggaatt ggcactacta gatgggtgta tgatggggaa ggaaacgagt atctgagatg     2280

ttattattgg gcagttcgaa ctttaattac cattggtggc cttccagaac cacaaacttt     2340

atttgaaatt gtttttcaac tcttgaattt tttttctgga gtttttgtgt tctccagttt     2400

aattggtcag atgagagatg tgattggagc agctacagcc aatcagaact acttccgcgc     2460

ctgcatggat gacaccattg cctacatgaa caattactcc attcctaaac ttgtgcaaaa     2520

gcgagttcgg acttggtatg aatatacatg ggactctcaa agaatgctag atgagtctga     2580

tttgcttaag accctaccaa ctacggtcca gttagccctc gccattgatg tgaacttcag     2640

catcatcagc aaagttgact tgttcaaggg ttgtgataca cagatgattt atgacatgtt     2700

gctaagattg aaatccgttc tctatttgcc tggtgacttt gtctgcaaaa agggagaaat     2760

tggcaaggaa atgtatatca tcaagcatgg agaagtccaa gttcttggag gccctgatgg     2820

tactaaagtt ctggttactc tgaaagctgg gtcggtgttt ggagaaatca gccttctagc     2880

agcaggagga ggaaaccgtc gaactgccaa tgtggtggcc cacgggtttg ccaatctttt     2940

aactctagac aaaaagaccc tccaagaaat tctagtgcat tatccagatt ctgaaagaat     3000

cctcatgaag aaagccagag tgcttttaaa gcagaaggct aagaccgcag aagcaacccc     3060

tccaagaaaa gatcttgccc tcctcttccc accgaaagaa gagacaccca aactgtttaa     3120

aactctccta ggaggcacag gaaaagcaag tcttgcaaga ctactcaaat tgaagcgaga     3180

gcaagcagct cagaagaaag aaaattctga aggaggagag gaagaaggaa aagaaaatga     3240

agataaacaa aaagaaaatg aagataaaca aaaagaaaat gaagataaag gaaaagaaaa     3300

tgaagataaa gataaaggaa gagagccaga agagaagcca ctggacagac ctgaatgtac     3360

agcaagtcct attgcagtgg aggaagaacc ccactcagtt agaaggacag ttttacccag     3420

agggacttct cgtcaatcac tcattatcag catggctcct tctgctgagg gcggagaaga     3480

ggttcttact attgaagtca aagaaaaggc taagcaatga tcataactgc agatctgcct     3540

cgactgtgcc ttctagttgc cagccatctg ttgtttgccc ctcccccgtg ccttccttga     3600

ccctggaagg tgccactccc actgtccttt cctaataaaa tgaggaaatt gcatcgcatt     3660

gtctgagtag gtgtcattct attctggggg gtggggtggg gcaggacagc aagggggagg     3720

attgggaaga caatagcagg catgctgggg actcgagttc tacgtagata agtagcatgg     3780

cgggttaatc attaactaca aggaacccct agtgatggag ttggccactc cctctctgcg     3840

cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg     3900

ggcggcctca gtgagcgagc gagcgcgcag ccttaattaa cctaaggaaa atgaagtgaa     3960

gttcctatac tttctagaga ataggaactt ctatagtgag tcgaataagg gcgacacaaa     4020

atttattcta aatgcataat aaatactgat aacatcttat agtttgtatt atattttgta     4080

ttatcgttga catgtataat tttgatatca aaaactgatt ttccctttat tattttcgag     4140

atttattttc ttaattctct ttaacaaact agaaatattg tatatacaaa aaatcataaa     4200

taatagatga atagtttaat tataggtgtt catcaatcga aaaagcaacg tatcttattt     4260

aaagtgcgtt gcttttttct catttataag gttaaataat tctcatatat caagcaaagt     4320

gacaggcgcc cttaaatatt ctgacaaatg ctctttccct aaactccccc cataaaaaaa     4380

cccgccgaag cgggttttta cgttatttgc ggattaacga ttactcgtta tcagaaccgc     4440

ccagggggcc cgagcttaac ctttttattt gggggagagg gaagtcatga aaaaactaac     4500

ctttgaaatt cgatctccag cacatcagca aaacgctatt cacgcagtac agcaaatcct     4560

tccagaccca accaaaccaa tcgtagtaac cattcaggaa cgcaaccgca gcttagacca     4620

aaacaggaag ctatgggcct gcttaggtga cgtctctcgt caggttgaat ggcatggtcg     4680

ctggctggat gcagaaagct ggaagtgtgt gtttaccgca gcattaaagc agcaggatgt     4740

tgttcctaac cttgccggga atggctttgt ggtaataggc cagtcaacca gcaggatgcg     4800

tgtaggcgaa tttgcggagc tattagagct tatacaggca ttcggtacag agcgtggcgt     4860

taagtggtca gacgaagcga gactggctct ggagtggaaa gcgagatggg gagacagggc     4920

tgcatgataa atgtcgttag tttctccggt ggcaggacgt cagcatattt gctctggcta     4980

atggagcaaa agcgacgggc aggtaaagac gtgcattacg ttttcatgga tacaggttgt     5040

gaacatccaa tgacatatcg gtttgtcagg gaagttgtga agttctggga tataccgctc     5100

accgtattgc aggttgatat caacccggag cttggacagc caaatggtta tacggtatgg     5160

gaaccaaagg atattcagac gcgaatgcct gttctgaagc catttatcga tatggtaaag     5220

aaatatggca ctccatacgt cggcggcgcg ttctgcactg acagattaaa actcgttccc     5280

ttcaccaaat actgtgatga ccatttcggg cgagggaatt acaccacgtg gattggcatc     5340

agagctgatg aaccgaagcg gctaaagcca aagcctggaa tcagatatct tgctgaactg     5400

tcagactttg agaaggaaga tatcctcgca tggtggaagc aacaaccatt cgatttgcaa     5460

ataccggaac atctcggtaa ctgcatattc tgcattaaaa aatcaacgca aaaaatcgga     5520

cttgcctgca aagatgagga gggattgcag cgtgttttta atgaggtcat cacgggatcc     5580

catgtgcgtg acggacatcg ggaaacgcca aaggagatta tgtaccgagg aagaatgtcg     5640

ctggacggta tcgcgaaaat gtattcagaa aatgattatc aagccctgta tcaggacatg     5700

gtacgagcta aaagattcga taccggctct tgttctgagt catgcgaaat atttggaggg     5760

cagcttgatt tcgacttcgg gagggaagct gcatgatgcg atgttatcgg tgcggtgaat     5820

gcaaagaaga taaccgcttc cgaccaaatc aaccttactg gaatcgatgg tgtctccggt     5880

gtgaaagaac accaacaggg gtgttaccac taccgcagga aaaggaggac gtgtggcgag     5940

acagcgacga agtatcaccg acataatctg cgaaaactgc aaataccttc caacgaaacg     6000

caccagaaat aaacccaagc caatcccaaa agaatctgac gtaaaaacct tcaactacac     6060

ggctcacctg tgggatatcc ggtggctaag acgtcgtgcg aggaaaacaa ggtgattgac     6120

caaaatcgaa gttacgaaca agaaagcgtc gagcgagctt taacgtgcgc taactgcggt     6180

cagaagctgc atgtgctgga agttcacgtg tgtgagcact gctgcgcaga actgatgagc     6240

gatccgaata gctcgatgca cgaggaagaa gatgatggct aaaccagcgc gaagacgatg     6300

taaaaacgat gaatgccggg aatggtttca ccctgcattc gctaatcagt ggtggtgctc     6360

tccagagtgt ggaaccaaga tagcactcga acgacgaagt aaagaacgcg aaaaagcgga     6420

aaaagcagca gagaagaaac gacgacgaga ggagcagaaa cagaaagata aacttaagat     6480

tcgaaaactc gccttaaagc cccgcagtta ctggattaaa caagcccaac aagccgtaaa     6540

cgccttcatc agagaaagag accgcgactt accatgtatc tcgtgcggaa cgctcacgtc     6600

tgctcagtgg gatgccggac attaccggac aactgctgcg gcacctcaac tccgatttaa     6660

tgaacgcaat attcacaagc aatgcgtggt gtgcaaccag cacaaaagcg gaaatctcgt     6720

tccgtatcgc gtcgaactga ttagccgcat cgggcaggaa gcagtagacg aaatcgaatc     6780

aaaccataac cgccatcgct ggactatcga agagtgcaag gcgatcaagg cagagtacca     6840

acagaaactc aaagacctgc gaaatagcag aagtgaggcc gcatgacgtt ctcagtaaaa     6900

accattccag acatgctcgt tgaagcatac ggaaatcaga cagaagtagc acgcagactg     6960

aaatgtagtc gcggtacggt cagaaaatac gttgatgata aagacgggaa aatgcacgcc     7020

atcgtcaacg acgttctcat ggttcatcgc ggatggagtg aaagagatgc gctattacga     7080

aaaaattgat ggcagcaaat accgaaatat ttgggtagtt ggcgatctgc acggatgcta     7140

cacgaacctg atgaacaaac tggatacgat tggattcgac aacaaaaaag acctgcttat     7200

ctcggtgggc gatttggttg atcgtggtgc agagaacgtt gaatgcctgg aattaatcac     7260

attcccctgg ttcagagctg tacgtggaaa ccatgagcaa atgatgattg atggcttatc     7320

agagcgtgga aacgttaatc actggctgct taatggcggt ggctggttct ttaatctcga     7380

ttacgacaaa gaaattctgg ctaaagctct tgcccataaa gcagatgaac ttccgttaat     7440

catcgaactg gtgagcaaag ataaaaaata tgttatctgc cacgccgatt atccctttga     7500

cgaatacgag tttggaaagc cagttgatca tcagcaggta atctggaacc gcgaacgaat     7560

cagcaactca caaaacggga tcgtgaaaga aatcaaaggc gcggacacgt tcatctttgg     7620

tcatacgcca gcagtgaaac cactcaagtt tgccaaccaa atgtatatcg ataccggcgc     7680

agtgttctgc ggaaacctaa cattgattca ggtacaggga gaaggcgcat gagactcgaa     7740

agcgtagcta aatttcattc gccaaaaagc ccgatgatga gcgactcacc acgggccacg     7800

gcttctgact ctctttccgg tactgatgtg atggctgcta tggggatggc gcaatcacaa     7860

gccggattcg gtatggctgc attctgcggt aagcacgaac tcagccagaa cgacaaacaa     7920

aaggctatca actatctgat gcaatttgca cacaaggtat cggggaaata ccgtggtgtg     7980

gcaaagcttg aaggaaatac taaggcaaag gtactgcaag tgctcgcaac attcgcttat     8040

gcggattatt gccgtagtgc cgcgacgccg ggggcaagat gcagagattg ccatggtaca     8100

ggccgtgcgg ttgatattgc caaaacagag ctgtggggga gagttgtcga gaaagagtgc     8160

ggaagatgca aaggcgtcgg ctattcaagg atgccagcaa gcgcagcata tcgcgctgtg     8220

acgatgctaa tcccaaacct tacccaaccc acctggtcac gcactgttaa gccgctgtat     8280

gacgctctgg tggtgcaatg ccacaaagaa gagtcaatcg cagacaacat tttgaatgcg     8340

gtcacacgtt agcagcatga ttgccacgga tggcaacata ttaacggcat gatattgact     8400

tattgaataa aattgggtaa atttgactca acgatgggtt aattcgctcg ttgtggtagt     8460

gagatgaaaa gaggcggcgc ttactaccga ttccgcctag ttggtcactt cgacgtatcg     8520

tctggaactc caaccatcgc aggcagagag gtctgcaaaa tgcaatcccg aaacagttcg     8580

caggtaatag ttagagcctg cataacggtt tcgggatttt ttatatctgc acaacaggta     8640

agagcattga gtcgataatc gtgaagagtc ggcgagcctg gttagccagt gctctttccg     8700

ttgtgctgaa ttaagcgaat accggaagca gaaccggatc accaaatgcg tacaggcgtc     8760

atcgccgccc agcaacagca caacccaaac tgagccgtag ccactgtctg tcctgaattc     8820

attagtaata gttacgctgc ggccttttac acatgacctt cgtgaaagcg ggtggcagga     8880

ggtcgcgcta acaacctcct gccgttttgc ccgtgcatat cggtcacgaa caaatctgat     8940

tactaaacac agtagcctgg atttgttcta tcagtaatcg accttattcc taattaaata     9000

gagcaaatcc ccttattggg ggtaagacat gaagatgcca gaaaaacatg acctgttggc     9060

cgccattctc gcggcaaagg aacaaggcat cggggcaatc cttgcgtttg caatggcgta     9120

ccttcgcggc agatataatg gcggtgcgtt tacaaaaaca gtaatcgacg caacgatgtg     9180

cgccattatc gcctggttca ttcgtgacct tctcgacttc gccggactaa gtagcaatct     9240

cgcttatata acgagcgtgt ttatcggcta catcggtact gactcgattg gttcgcttat     9300

caaacgcttc gctgctaaaa aagccggagt agaagatggt agaaatcaat aatcaacgta     9360

aggcgttcct cgatatgctg gcgtggtcgg agggaactga taacggacgt cagaaaacca     9420

gaaatcatgg ttatgacgtc attgtaggcg gagagctatt tactgattac tccgatcacc     9480

ctcgcaaact tgtcacgcta aacccaaaac tcaaatcaac aggcgcttaa gactggccgt     9540

cgttttacaa cacagaaaga gtttgtagaa acgcaaaaag gccatccgtc aggggccttc     9600

tgcttagttt gatgcctggc agttccctac tctcgccttc cgcttcctcg ctcactgact     9660

cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac     9720

ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa     9780

aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg     9840

acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa     9900

gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc     9960

ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac    10020

gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac    10080

cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg    10140

taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt    10200

atgtaggcgg tgctacagag ttcttgaagt ggtgggctaa ctacggctac actagaagaa    10260

cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct    10320

cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga    10380

ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg    10440

ctcagtggaa cgacgcgcgc gtaactcacg ttaagggatt ttggtcatga gcttgcgccg    10500

tcccgtcaag tcagcgtaat gctctgcttt tagaaaaact catcgagcat caaatgaaac    10560

tgcaatttat tcatatcagg attatcaata ccatattttt gaaaaagccg tttctgtaat    10620

gaaggagaaa actcaccgag gcagttccat aggatggcaa gatcctggta tcggtctgcg    10680

attccgactc gtccaacatc aatacaacct attaatttcc cctcgtcaaa aataaggtta    10740

tcaagtgaga aatcaccatg agtgacgact gaatccggtg agaatggcaa aagtttatgc    10800

atttctttcc agacttgttc aacaggccag ccattacgct cgtcatcaaa atcactcgca    10860

tcaaccaaac cgttattcat tcgtgattgc gcctgagcga ggcgaaatac gcgatcgctg    10920

ttaaaaggac aattacaaac aggaatcgag tgcaaccggc gcaggaacac tgccagcgca    10980

tcaacaatat tttcacctga atcaggatat tcttctaata cctggaacgc tgtttttccg    11040

gggatcgcag tggtgagtaa ccatgcatca tcaggagtac ggataaaatg cttgatggtc    11100

ggaagtggca taaattccgt cagccagttt agtctgacca tctcatctgt aacatcattg    11160

gcaacgctac ctttgccatg tttcagaaac aactctggcg catcgggctt cccatacaag    11220

cgatagattg tcgcacctga ttgcccgaca ttatcgcgag cccatttata cccatataaa    11280

tcagcatcca tgttggaatt taatcgcggc ctcgacgttt cccgttgaat atggctcata    11340

ttcttccttt ttcaatatta ttgaagcatt tatcagggtt attgtctcat gagcggatac    11400

atatttgaat gtatttagaa aaataaacaa ataggggtca gtgttacaac caattaacca    11460

attctgaaca ttatcgcgag cccatttata cctgaatatg gctcataaca ccccttgttt    11520

gcctggcggc agtagcgcgg tggtcccacc tgaccccatg ccgaactcag aagtgaaacg    11580

ccgtagcgcc gatggtagtg tggggactcc ccatgcgaga gtagggaact gccaggcatc    11640

aaataaaacg aaaggctcag tcgaaagact gggcctttcg cccgggctaa ttagggggtg    11700

tcgcccttat tcgactctat agtgaagttc ctattctcta gaaagtatag gaacttctga    11760

agtggggtcg acttaattaa gg                                             11782


<210>  42
<211>  11782
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  42
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

ctgaagagac agaaatatct ctaattccat gagcggtcat acgaggcaag agaagccgct      240

tagagcatgg acttagttag tttcagggat tggacagagt caagagctgg ggtgaggagg      300

ttaccctcgg taggggtgac acagatgtca accgcctatt ccctccacat gcatgtcctg      360

ccagaagaac ctgtccctgg gctgggaatc ttatattacc ttcctctcca atgagaagag      420

aagttcaagg ctcacagaca tgtgcataca caagctcaat gcactcaaga ttcccctcca      480

ccactcctgc ccccactacc tacaggagat tgactcctgc tgtgcacata agctgggata      540

atcagggttt ctaaacatca gcttcaaaag tccaatgtcc aaagtggtgg ggggccgggg      600

aacgaggtac tctttccata cccttggctt ttgtgtggcc tggagccgct gatatagaga      660

ttggagtggg acacgaggta ttcctttcaa aaacacaaag gcctatactt tgagccctcc      720

catttcaatc ccccaccatg cttcaccttt aagacctcca actccacttt gatcccagtt      780

ctcaggttca ggcctcacaa ggccaaaatc ctgaagttac ccttctcaaa ctcccttgcc      840

tttaacatca tcagaatcaa cctcctaccc ccactctgtc ccagcagcaa tagcctgcta      900

atcttttagc actaatcttt taggcactaa tctgctttcc aaactcttgg cacctgaact      960

atttataagc agtgttttat gcccccccac caaagaaccc tattcttttc ccatgacccc     1020

accaatcaaa acactcagag gactgtgggt ataagaggct ggggaggcag gcatagcagc     1080

ggccgccacc atgttcaagt ccctcaccaa agtcaacaag gtcaagccca tcggagagaa     1140

caacgagaat gagcagagct ctcggcgcaa cgaagaagga tcccatccgt cgaaccagtc     1200

acagcagact accgcacagg aggagaacaa gggagaagaa aagtcgctca agactaagtc     1260

cacccccgtg acctcggaag aaccgcacac gaacattcag gacaagctgt ccaagaagaa     1320

ctcctccggc gatctcacga ctaacccgga cccccagaat gccgctgaac ctactgggac     1380

cgtgcctgag caaaaggaga tggaccccgg aaaggagggt cctaactccc cccaaaacaa     1440

gcccccggcc gcgccggtca tcaatgagta cgcggacgcg caactgcata acctcgtgaa     1500

gcggatgcgg caaagaaccg ccctctacaa gaagaaactg gtggagggcg acctgagctc     1560

acctgaagcc agcccacaga ccgccaaacc caccgccgtg ccgcctgtga aggagtccga     1620

tgacaagcct accgagcact actaccgcct gctgtggttc aaggtcaaga agatgcccct     1680

gaccgaatac ctcaagcgga tcaagctgcc gaacagcatc gacagctaca ccgaccggct     1740

ttacttgctc tggctgctgc ttgtgaccct ggcttacaac tggaactgtt gtttcattcc     1800

cctgcggctg gtgttccctt accaaaccgc ggataacatt cactactggc tgattgccga     1860

catcatttgc gacatcatct acctgtacga tatgcttttt atccaaccgc ggctgcaatt     1920

cgtccgcggg ggagacatca ttgtggactc caacgagctg cgcaagcatt accggacctc     1980

gacaaagttc cagctggatg tggcctccat catcccgttc gatatctgtt acctgttctt     2040

tggcttcaac ccgatgttca gggcgaacag gatgctgaag tacacttcct tcttcgaatt     2100

caaccaccac ctggagtcca tcatggacaa ggcttacatc taccgcgtga tccggaccac     2160

tggttacctc ctgttcatcc tgcacatcaa cgcctgcgtc tattactggg cctcaaacta     2220

cgaaggcatt ggtaccaccc gctgggtgta cgacggggag ggaaacgagt atctgcgctg     2280

ctactactgg gccgtgcgaa ccctcataac tattggcggc ctcccggaac cgcagaccct     2340

gttcgagatc gtgttccaac tcctcaactt cttctcggga gtgttcgtgt tttcaagctt     2400

gattggacag atgcgggacg tgatcggtgc agcaactgcc aaccagaact actttcgcgc     2460

ctgcatggac gacactatcg cgtacatgaa caactattcg atccccaagc tggtgcagaa     2520

acgcgtgcgg acttggtatg agtacacttg ggactcccag agaatgcttg acgagtccga     2580

tctgctcaag accctgccta ctaccgtgca gctggcactc gccatcgatg tgaacttctc     2640

cattatctcg aaagtcgatc tgttcaaggg ctgcgacacc cagatgatct acgacatgct     2700

gctgagactc aagtccgtgt tgtacctccc tggcgacttc gtgtgcaaga agggcgaaat     2760

cgggaaggag atgtacatta tcaagcacgg agaagtccag gtgctggggg gaccagacgg     2820

taccaaggtc cttgtcaccc tgaaggccgg gtccgtgttc ggcgaaattt ccctgttggc     2880

cgccggcggt ggcaacagga gaaccgcaaa tgtggtggcc cacggcttcg caaaccttct     2940

gaccctggac aagaaaaccc tccaggaaat cctcgtgcac tacccggata gcgagcggat     3000

cctgatgaag aaagcccggg tgctgctgaa gcaaaaggcc aagaccgccg aagccacccc     3060

gcctcggaag gacctggctc tgctgttccc acccaaggag gagactccca aactgtttaa     3120

gaccctcttg ggcgggacgg gaaaggcctc cctcgctcgc ttgcttaagt tgaagaggga     3180

gcaggccgcg cagaagaagg aaaactccga aggaggggaa gaagagggaa aggaaaacga     3240

agataagcag aaggagaacg aggataagca aaaggaaaat gaggacaagg ggaaagaaaa     3300

cgaggacaag gataagggtc gcgaacctga agagaagccg ctggatcggc cagagtgcac     3360

tgcctcgcct atcgcggtcg aagaggaacc ccatagcgtg cgcagaaccg tgctgcctag     3420

aggcacatcg aggcagtcac tgattatctc tatggcacca agcgccgagg gaggagagga     3480

agtgctcacc atcgaggtca aggaaaaagc gaagcagtga tcataactgc agatctgcct     3540

cgactgtgcc ttctagttgc cagccatctg ttgtttgccc ctcccccgtg ccttccttga     3600

ccctggaagg tgccactccc actgtccttt cctaataaaa tgaggaaatt gcatcgcatt     3660

gtctgagtag gtgtcattct attctggggg gtggggtggg gcaggacagc aagggggagg     3720

attgggaaga caatagcagg catgctgggg actcgagttc tacgtagata agtagcatgg     3780

cgggttaatc attaactaca aggaacccct agtgatggag ttggccactc cctctctgcg     3840

cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg     3900

ggcggcctca gtgagcgagc gagcgcgcag ccttaattaa cctaaggaaa atgaagtgaa     3960

gttcctatac tttctagaga ataggaactt ctatagtgag tcgaataagg gcgacacaaa     4020

atttattcta aatgcataat aaatactgat aacatcttat agtttgtatt atattttgta     4080

ttatcgttga catgtataat tttgatatca aaaactgatt ttccctttat tattttcgag     4140

atttattttc ttaattctct ttaacaaact agaaatattg tatatacaaa aaatcataaa     4200

taatagatga atagtttaat tataggtgtt catcaatcga aaaagcaacg tatcttattt     4260

aaagtgcgtt gcttttttct catttataag gttaaataat tctcatatat caagcaaagt     4320

gacaggcgcc cttaaatatt ctgacaaatg ctctttccct aaactccccc cataaaaaaa     4380

cccgccgaag cgggttttta cgttatttgc ggattaacga ttactcgtta tcagaaccgc     4440

ccagggggcc cgagcttaac ctttttattt gggggagagg gaagtcatga aaaaactaac     4500

ctttgaaatt cgatctccag cacatcagca aaacgctatt cacgcagtac agcaaatcct     4560

tccagaccca accaaaccaa tcgtagtaac cattcaggaa cgcaaccgca gcttagacca     4620

aaacaggaag ctatgggcct gcttaggtga cgtctctcgt caggttgaat ggcatggtcg     4680

ctggctggat gcagaaagct ggaagtgtgt gtttaccgca gcattaaagc agcaggatgt     4740

tgttcctaac cttgccggga atggctttgt ggtaataggc cagtcaacca gcaggatgcg     4800

tgtaggcgaa tttgcggagc tattagagct tatacaggca ttcggtacag agcgtggcgt     4860

taagtggtca gacgaagcga gactggctct ggagtggaaa gcgagatggg gagacagggc     4920

tgcatgataa atgtcgttag tttctccggt ggcaggacgt cagcatattt gctctggcta     4980

atggagcaaa agcgacgggc aggtaaagac gtgcattacg ttttcatgga tacaggttgt     5040

gaacatccaa tgacatatcg gtttgtcagg gaagttgtga agttctggga tataccgctc     5100

accgtattgc aggttgatat caacccggag cttggacagc caaatggtta tacggtatgg     5160

gaaccaaagg atattcagac gcgaatgcct gttctgaagc catttatcga tatggtaaag     5220

aaatatggca ctccatacgt cggcggcgcg ttctgcactg acagattaaa actcgttccc     5280

ttcaccaaat actgtgatga ccatttcggg cgagggaatt acaccacgtg gattggcatc     5340

agagctgatg aaccgaagcg gctaaagcca aagcctggaa tcagatatct tgctgaactg     5400

tcagactttg agaaggaaga tatcctcgca tggtggaagc aacaaccatt cgatttgcaa     5460

ataccggaac atctcggtaa ctgcatattc tgcattaaaa aatcaacgca aaaaatcgga     5520

cttgcctgca aagatgagga gggattgcag cgtgttttta atgaggtcat cacgggatcc     5580

catgtgcgtg acggacatcg ggaaacgcca aaggagatta tgtaccgagg aagaatgtcg     5640

ctggacggta tcgcgaaaat gtattcagaa aatgattatc aagccctgta tcaggacatg     5700

gtacgagcta aaagattcga taccggctct tgttctgagt catgcgaaat atttggaggg     5760

cagcttgatt tcgacttcgg gagggaagct gcatgatgcg atgttatcgg tgcggtgaat     5820

gcaaagaaga taaccgcttc cgaccaaatc aaccttactg gaatcgatgg tgtctccggt     5880

gtgaaagaac accaacaggg gtgttaccac taccgcagga aaaggaggac gtgtggcgag     5940

acagcgacga agtatcaccg acataatctg cgaaaactgc aaataccttc caacgaaacg     6000

caccagaaat aaacccaagc caatcccaaa agaatctgac gtaaaaacct tcaactacac     6060

ggctcacctg tgggatatcc ggtggctaag acgtcgtgcg aggaaaacaa ggtgattgac     6120

caaaatcgaa gttacgaaca agaaagcgtc gagcgagctt taacgtgcgc taactgcggt     6180

cagaagctgc atgtgctgga agttcacgtg tgtgagcact gctgcgcaga actgatgagc     6240

gatccgaata gctcgatgca cgaggaagaa gatgatggct aaaccagcgc gaagacgatg     6300

taaaaacgat gaatgccggg aatggtttca ccctgcattc gctaatcagt ggtggtgctc     6360

tccagagtgt ggaaccaaga tagcactcga acgacgaagt aaagaacgcg aaaaagcgga     6420

aaaagcagca gagaagaaac gacgacgaga ggagcagaaa cagaaagata aacttaagat     6480

tcgaaaactc gccttaaagc cccgcagtta ctggattaaa caagcccaac aagccgtaaa     6540

cgccttcatc agagaaagag accgcgactt accatgtatc tcgtgcggaa cgctcacgtc     6600

tgctcagtgg gatgccggac attaccggac aactgctgcg gcacctcaac tccgatttaa     6660

tgaacgcaat attcacaagc aatgcgtggt gtgcaaccag cacaaaagcg gaaatctcgt     6720

tccgtatcgc gtcgaactga ttagccgcat cgggcaggaa gcagtagacg aaatcgaatc     6780

aaaccataac cgccatcgct ggactatcga agagtgcaag gcgatcaagg cagagtacca     6840

acagaaactc aaagacctgc gaaatagcag aagtgaggcc gcatgacgtt ctcagtaaaa     6900

accattccag acatgctcgt tgaagcatac ggaaatcaga cagaagtagc acgcagactg     6960

aaatgtagtc gcggtacggt cagaaaatac gttgatgata aagacgggaa aatgcacgcc     7020

atcgtcaacg acgttctcat ggttcatcgc ggatggagtg aaagagatgc gctattacga     7080

aaaaattgat ggcagcaaat accgaaatat ttgggtagtt ggcgatctgc acggatgcta     7140

cacgaacctg atgaacaaac tggatacgat tggattcgac aacaaaaaag acctgcttat     7200

ctcggtgggc gatttggttg atcgtggtgc agagaacgtt gaatgcctgg aattaatcac     7260

attcccctgg ttcagagctg tacgtggaaa ccatgagcaa atgatgattg atggcttatc     7320

agagcgtgga aacgttaatc actggctgct taatggcggt ggctggttct ttaatctcga     7380

ttacgacaaa gaaattctgg ctaaagctct tgcccataaa gcagatgaac ttccgttaat     7440

catcgaactg gtgagcaaag ataaaaaata tgttatctgc cacgccgatt atccctttga     7500

cgaatacgag tttggaaagc cagttgatca tcagcaggta atctggaacc gcgaacgaat     7560

cagcaactca caaaacggga tcgtgaaaga aatcaaaggc gcggacacgt tcatctttgg     7620

tcatacgcca gcagtgaaac cactcaagtt tgccaaccaa atgtatatcg ataccggcgc     7680

agtgttctgc ggaaacctaa cattgattca ggtacaggga gaaggcgcat gagactcgaa     7740

agcgtagcta aatttcattc gccaaaaagc ccgatgatga gcgactcacc acgggccacg     7800

gcttctgact ctctttccgg tactgatgtg atggctgcta tggggatggc gcaatcacaa     7860

gccggattcg gtatggctgc attctgcggt aagcacgaac tcagccagaa cgacaaacaa     7920

aaggctatca actatctgat gcaatttgca cacaaggtat cggggaaata ccgtggtgtg     7980

gcaaagcttg aaggaaatac taaggcaaag gtactgcaag tgctcgcaac attcgcttat     8040

gcggattatt gccgtagtgc cgcgacgccg ggggcaagat gcagagattg ccatggtaca     8100

ggccgtgcgg ttgatattgc caaaacagag ctgtggggga gagttgtcga gaaagagtgc     8160

ggaagatgca aaggcgtcgg ctattcaagg atgccagcaa gcgcagcata tcgcgctgtg     8220

acgatgctaa tcccaaacct tacccaaccc acctggtcac gcactgttaa gccgctgtat     8280

gacgctctgg tggtgcaatg ccacaaagaa gagtcaatcg cagacaacat tttgaatgcg     8340

gtcacacgtt agcagcatga ttgccacgga tggcaacata ttaacggcat gatattgact     8400

tattgaataa aattgggtaa atttgactca acgatgggtt aattcgctcg ttgtggtagt     8460

gagatgaaaa gaggcggcgc ttactaccga ttccgcctag ttggtcactt cgacgtatcg     8520

tctggaactc caaccatcgc aggcagagag gtctgcaaaa tgcaatcccg aaacagttcg     8580

caggtaatag ttagagcctg cataacggtt tcgggatttt ttatatctgc acaacaggta     8640

agagcattga gtcgataatc gtgaagagtc ggcgagcctg gttagccagt gctctttccg     8700

ttgtgctgaa ttaagcgaat accggaagca gaaccggatc accaaatgcg tacaggcgtc     8760

atcgccgccc agcaacagca caacccaaac tgagccgtag ccactgtctg tcctgaattc     8820

attagtaata gttacgctgc ggccttttac acatgacctt cgtgaaagcg ggtggcagga     8880

ggtcgcgcta acaacctcct gccgttttgc ccgtgcatat cggtcacgaa caaatctgat     8940

tactaaacac agtagcctgg atttgttcta tcagtaatcg accttattcc taattaaata     9000

gagcaaatcc ccttattggg ggtaagacat gaagatgcca gaaaaacatg acctgttggc     9060

cgccattctc gcggcaaagg aacaaggcat cggggcaatc cttgcgtttg caatggcgta     9120

ccttcgcggc agatataatg gcggtgcgtt tacaaaaaca gtaatcgacg caacgatgtg     9180

cgccattatc gcctggttca ttcgtgacct tctcgacttc gccggactaa gtagcaatct     9240

cgcttatata acgagcgtgt ttatcggcta catcggtact gactcgattg gttcgcttat     9300

caaacgcttc gctgctaaaa aagccggagt agaagatggt agaaatcaat aatcaacgta     9360

aggcgttcct cgatatgctg gcgtggtcgg agggaactga taacggacgt cagaaaacca     9420

gaaatcatgg ttatgacgtc attgtaggcg gagagctatt tactgattac tccgatcacc     9480

ctcgcaaact tgtcacgcta aacccaaaac tcaaatcaac aggcgcttaa gactggccgt     9540

cgttttacaa cacagaaaga gtttgtagaa acgcaaaaag gccatccgtc aggggccttc     9600

tgcttagttt gatgcctggc agttccctac tctcgccttc cgcttcctcg ctcactgact     9660

cgctgcgctc ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac     9720

ggttatccac agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa     9780

aggccaggaa ccgtaaaaag gccgcgttgc tggcgttttt ccataggctc cgcccccctg     9840

acgagcatca caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa     9900

gataccaggc gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc     9960

ttaccggata cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac    10020

gctgtaggta tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac    10080

cccccgttca gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg    10140

taagacacga cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt    10200

atgtaggcgg tgctacagag ttcttgaagt ggtgggctaa ctacggctac actagaagaa    10260

cagtatttgg tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct    10320

cttgatccgg caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga    10380

ttacgcgcag aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg    10440

ctcagtggaa cgacgcgcgc gtaactcacg ttaagggatt ttggtcatga gcttgcgccg    10500

tcccgtcaag tcagcgtaat gctctgcttt tagaaaaact catcgagcat caaatgaaac    10560

tgcaatttat tcatatcagg attatcaata ccatattttt gaaaaagccg tttctgtaat    10620

gaaggagaaa actcaccgag gcagttccat aggatggcaa gatcctggta tcggtctgcg    10680

attccgactc gtccaacatc aatacaacct attaatttcc cctcgtcaaa aataaggtta    10740

tcaagtgaga aatcaccatg agtgacgact gaatccggtg agaatggcaa aagtttatgc    10800

atttctttcc agacttgttc aacaggccag ccattacgct cgtcatcaaa atcactcgca    10860

tcaaccaaac cgttattcat tcgtgattgc gcctgagcga ggcgaaatac gcgatcgctg    10920

ttaaaaggac aattacaaac aggaatcgag tgcaaccggc gcaggaacac tgccagcgca    10980

tcaacaatat tttcacctga atcaggatat tcttctaata cctggaacgc tgtttttccg    11040

gggatcgcag tggtgagtaa ccatgcatca tcaggagtac ggataaaatg cttgatggtc    11100

ggaagtggca taaattccgt cagccagttt agtctgacca tctcatctgt aacatcattg    11160

gcaacgctac ctttgccatg tttcagaaac aactctggcg catcgggctt cccatacaag    11220

cgatagattg tcgcacctga ttgcccgaca ttatcgcgag cccatttata cccatataaa    11280

tcagcatcca tgttggaatt taatcgcggc ctcgacgttt cccgttgaat atggctcata    11340

ttcttccttt ttcaatatta ttgaagcatt tatcagggtt attgtctcat gagcggatac    11400

atatttgaat gtatttagaa aaataaacaa ataggggtca gtgttacaac caattaacca    11460

attctgaaca ttatcgcgag cccatttata cctgaatatg gctcataaca ccccttgttt    11520

gcctggcggc agtagcgcgg tggtcccacc tgaccccatg ccgaactcag aagtgaaacg    11580

ccgtagcgcc gatggtagtg tggggactcc ccatgcgaga gtagggaact gccaggcatc    11640

aaataaaacg aaaggctcag tcgaaagact gggcctttcg cccgggctaa ttagggggtg    11700

tcgcccttat tcgactctat agtgaagttc ctattctcta gaaagtatag gaacttctga    11760

agtggggtcg acttaattaa gg                                             11782


<210>  43
<211>  12556
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  43
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg      240

cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt      300

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      360

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      420

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      480

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattaa      540

catggtcgag gtgagcccca cgttctgctt cactctcccc atctcccccc cctccccacc      600

cccaattttg tatttattta ttttttaatt attttgtgca gcgatggggg cggggggggg      660

gggggggcgc gcgccaggcg gggcggggcg gggcgagggg cggggcgggg cgaggcggag      720

aggtgcggcg gcagccaatc agagcggcgc gctccgaaag tttcctttta tggcgaggcg      780

gcggcggcgg cggccctata aaaagcgaag cgcgcggcgg gcggggagtc gctgcgacgc      840

tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg      900

accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag      960

cgcttggttt aatgacggct tgtttctttt ctgtggctgc gtgaaagcct tgaggggctc     1020

cgggagggcc ctttgtgcgg ggggagcggc tcggggggtg cgtgcgtgtg tgtgtgcgtg     1080

gggagcgccg cgtgcggctc cgcgctgccc ggcggctgtg agcgctgcgg gcgcggcgcg     1140

gggctttgtg cgctccgcag tgtgcgcgag gggagcgcgg ccgggggcgg tgccccgcgg     1200

tgcggggggg gctgcgaggg gaacaaaggc tgcgtgcggg gtgtgtgcgt gggggggtga     1260

gcagggggtg tgggcgcgtc ggtcgggctg caaccccccc tgcacccccc tccccgagtt     1320

gctgagcacg gcccggcttc gggtgcgggg ctccgtacgg ggcgtggcgc ggggctcgcc     1380

gtgccgggcg gggggtggcg gcaggtgggg gtgccgggcg gggcggggcc gcctcgggcc     1440

ggggagggct cgggggaggg gcgcggcggc ccccggagcg ccggcggctg tcgaggcgcg     1500

gcgagccgca gccattgcct tttatggtaa tcgtgcgaga gggcgcaggg acttcctttg     1560

tcccaaatct gtgcggagcc gaaatctggg aggcgccgcc gcaccccctc tagcgggcgc     1620

ggggcgaagc ggtgcggcgc cggcaggaag gaaatgggcg gggagggcct tcgtgcgtcg     1680

ccgcgccgcc gtccccttct ccctctccag cctcggggct gtccgcgggg ggacggctgc     1740

cttcgggggg gacggggcag ggcggggttc ggcttctggc gtgtgaccgg cggctctaga     1800

caattgtact aaccttcttc tctttcctct cctgacaggt tggtgtacac tagcggccgc     1860

caccatgttt aaatcgctga caaaagtcaa caaggtgaag cctataggag agaacaatga     1920

gaatgaacaa agttctcgtc ggaatgaaga aggctctcac ccaagtaatc agtctcagca     1980

aaccacagca caggaagaaa acaaaggtga agagaaatct ctcaaaacca agtcaactcc     2040

agtcacgtct gaagagccac acaccaacat acaagacaaa ctctccaaga aaaattcctc     2100

tggagatctg accacaaacc ctgaccctca aaatgcagca gaaccaactg gaacagtgcc     2160

agagcagaag gaaatggacc ccgggaaaga aggtccaaac agcccacaaa acaaaccgcc     2220

agcagctcct gttataaatg agtatgccga tgcccagcta cacaacctgg tgaaaagaat     2280

gcgtcaaaga acagccctct acaagaaaaa gttggtagag ggagatctct cctcacccga     2340

agccagccca caaactgcaa agcccacggc tgtaccacca gtaaaagaaa gcgatgataa     2400

gccaacagaa cattactaca ggctgttgtg gttcaaagtc aaaaagatgc ctttaacaga     2460

gtacttaaag cgaattaaac ttccaaacag catagattca tacacagatc gactctatct     2520

cctgtggctc ttgcttgtca ctcttgccta taactggaac tgctgtttta taccactgcg     2580

cctcgtcttc ccatatcaaa ccgcagacaa catacactac tggcttattg cggacatcat     2640

ctgtgatatc atctaccttt atgatatgct atttatccag cccagactcc agtttgtaag     2700

aggaggagac ataatagtgg attcaaatga gctaaggaaa cactacagga cttctacaaa     2760

atttcagttg gatgtcgcat caataatacc atttgatatt tgctacctct tctttgggtt     2820

taatccaatg tttagagcaa ataggatgtt aaagtacact tcattttttg aatttaatca     2880

tcacctagag tctataatgg acaaagcata tatctacaga gttattcgaa caactggata     2940

cttgctgttt attctgcaca ttaatgcctg tgtttattac tgggcttcaa actatgaagg     3000

aattggcact actagatggg tgtatgatgg ggaaggaaac gagtatctga gatgttatta     3060

ttgggcagtt cgaactttaa ttaccattgg tggccttcca gaaccacaaa ctttatttga     3120

aattgttttt caactcttga attttttttc tggagttttt gtgttctcca gtttaattgg     3180

tcagatgaga gatgtgattg gagcagctac agccaatcag aactacttcc gcgcctgcat     3240

ggatgacacc attgcctaca tgaacaatta ctccattcct aaacttgtgc aaaagcgagt     3300

tcggacttgg tatgaatata catgggactc tcaaagaatg ctagatgagt ctgatttgct     3360

taagacccta ccaactacgg tccagttagc cctcgccatt gatgtgaact tcagcatcat     3420

cagcaaagtt gacttgttca agggttgtga tacacagatg atttatgaca tgttgctaag     3480

attgaaatcc gttctctatt tgcctggtga ctttgtctgc aaaaagggag aaattggcaa     3540

ggaaatgtat atcatcaagc atggagaagt ccaagttctt ggaggccctg atggtactaa     3600

agttctggtt actctgaaag ctgggtcggt gtttggagaa atcagccttc tagcagcagg     3660

aggaggaaac cgtcgaactg ccaatgtggt ggcccacggg tttgccaatc ttttaactct     3720

agacaaaaag accctccaag aaattctagt gcattatcca gattctgaaa gaatcctcat     3780

gaagaaagcc agagtgcttt taaagcagaa ggctaagacc gcagaagcaa cccctccaag     3840

aaaagatctt gccctcctct tcccaccgaa agaagagaca cccaaactgt ttaaaactct     3900

cctaggaggc acaggaaaag caagtcttgc aagactactc aaattgaagc gagagcaagc     3960

agctcagaag aaagaaaatt ctgaaggagg agaggaagaa ggaaaagaaa atgaagataa     4020

acaaaaagaa aatgaagata aacaaaaaga aaatgaagat aaaggaaaag aaaatgaaga     4080

taaagataaa ggaagagagc cagaagagaa gccactggac agacctgaat gtacagcaag     4140

tcctattgca gtggaggaag aaccccactc agttagaagg acagttttac ccagagggac     4200

ttctcgtcaa tcactcatta tcagcatggc tccttctgct gagggcggag aagaggttct     4260

tactattgaa gtcaaagaaa aggctaagca atgatcataa ctgcagatct gcctcgactg     4320

tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg     4380

aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga     4440

gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg     4500

aagacaatag caggcatgct ggggactcga gttctacgta gataagtagc atggcgggtt     4560

aatcattaac tacaaggaac ccctagtgat ggagttggcc actccctctc tgcgcgctcg     4620

ctcgctcact gaggccgggc gaccaaaggt cgcccgacgc ccgggctttg cccgggcggc     4680

ctcagtgagc gagcgagcgc gcagccttaa ttaacctaag gaaaatgaag tgaagttcct     4740

atactttcta gagaatagga acttctatag tgagtcgaat aagggcgaca caaaatttat     4800

tctaaatgca taataaatac tgataacatc ttatagtttg tattatattt tgtattatcg     4860

ttgacatgta taattttgat atcaaaaact gattttccct ttattatttt cgagatttat     4920

tttcttaatt ctctttaaca aactagaaat attgtatata caaaaaatca taaataatag     4980

atgaatagtt taattatagg tgttcatcaa tcgaaaaagc aacgtatctt atttaaagtg     5040

cgttgctttt ttctcattta taaggttaaa taattctcat atatcaagca aagtgacagg     5100

cgcccttaaa tattctgaca aatgctcttt ccctaaactc cccccataaa aaaacccgcc     5160

gaagcgggtt tttacgttat ttgcggatta acgattactc gttatcagaa ccgcccaggg     5220

ggcccgagct taaccttttt atttggggga gagggaagtc atgaaaaaac taacctttga     5280

aattcgatct ccagcacatc agcaaaacgc tattcacgca gtacagcaaa tccttccaga     5340

cccaaccaaa ccaatcgtag taaccattca ggaacgcaac cgcagcttag accaaaacag     5400

gaagctatgg gcctgcttag gtgacgtctc tcgtcaggtt gaatggcatg gtcgctggct     5460

ggatgcagaa agctggaagt gtgtgtttac cgcagcatta aagcagcagg atgttgttcc     5520

taaccttgcc gggaatggct ttgtggtaat aggccagtca accagcagga tgcgtgtagg     5580

cgaatttgcg gagctattag agcttataca ggcattcggt acagagcgtg gcgttaagtg     5640

gtcagacgaa gcgagactgg ctctggagtg gaaagcgaga tggggagaca gggctgcatg     5700

ataaatgtcg ttagtttctc cggtggcagg acgtcagcat atttgctctg gctaatggag     5760

caaaagcgac gggcaggtaa agacgtgcat tacgttttca tggatacagg ttgtgaacat     5820

ccaatgacat atcggtttgt cagggaagtt gtgaagttct gggatatacc gctcaccgta     5880

ttgcaggttg atatcaaccc ggagcttgga cagccaaatg gttatacggt atgggaacca     5940

aaggatattc agacgcgaat gcctgttctg aagccattta tcgatatggt aaagaaatat     6000

ggcactccat acgtcggcgg cgcgttctgc actgacagat taaaactcgt tcccttcacc     6060

aaatactgtg atgaccattt cgggcgaggg aattacacca cgtggattgg catcagagct     6120

gatgaaccga agcggctaaa gccaaagcct ggaatcagat atcttgctga actgtcagac     6180

tttgagaagg aagatatcct cgcatggtgg aagcaacaac cattcgattt gcaaataccg     6240

gaacatctcg gtaactgcat attctgcatt aaaaaatcaa cgcaaaaaat cggacttgcc     6300

tgcaaagatg aggagggatt gcagcgtgtt tttaatgagg tcatcacggg atcccatgtg     6360

cgtgacggac atcgggaaac gccaaaggag attatgtacc gaggaagaat gtcgctggac     6420

ggtatcgcga aaatgtattc agaaaatgat tatcaagccc tgtatcagga catggtacga     6480

gctaaaagat tcgataccgg ctcttgttct gagtcatgcg aaatatttgg agggcagctt     6540

gatttcgact tcgggaggga agctgcatga tgcgatgtta tcggtgcggt gaatgcaaag     6600

aagataaccg cttccgacca aatcaacctt actggaatcg atggtgtctc cggtgtgaaa     6660

gaacaccaac aggggtgtta ccactaccgc aggaaaagga ggacgtgtgg cgagacagcg     6720

acgaagtatc accgacataa tctgcgaaaa ctgcaaatac cttccaacga aacgcaccag     6780

aaataaaccc aagccaatcc caaaagaatc tgacgtaaaa accttcaact acacggctca     6840

cctgtgggat atccggtggc taagacgtcg tgcgaggaaa acaaggtgat tgaccaaaat     6900

cgaagttacg aacaagaaag cgtcgagcga gctttaacgt gcgctaactg cggtcagaag     6960

ctgcatgtgc tggaagttca cgtgtgtgag cactgctgcg cagaactgat gagcgatccg     7020

aatagctcga tgcacgagga agaagatgat ggctaaacca gcgcgaagac gatgtaaaaa     7080

cgatgaatgc cgggaatggt ttcaccctgc attcgctaat cagtggtggt gctctccaga     7140

gtgtggaacc aagatagcac tcgaacgacg aagtaaagaa cgcgaaaaag cggaaaaagc     7200

agcagagaag aaacgacgac gagaggagca gaaacagaaa gataaactta agattcgaaa     7260

actcgcctta aagccccgca gttactggat taaacaagcc caacaagccg taaacgcctt     7320

catcagagaa agagaccgcg acttaccatg tatctcgtgc ggaacgctca cgtctgctca     7380

gtgggatgcc ggacattacc ggacaactgc tgcggcacct caactccgat ttaatgaacg     7440

caatattcac aagcaatgcg tggtgtgcaa ccagcacaaa agcggaaatc tcgttccgta     7500

tcgcgtcgaa ctgattagcc gcatcgggca ggaagcagta gacgaaatcg aatcaaacca     7560

taaccgccat cgctggacta tcgaagagtg caaggcgatc aaggcagagt accaacagaa     7620

actcaaagac ctgcgaaata gcagaagtga ggccgcatga cgttctcagt aaaaaccatt     7680

ccagacatgc tcgttgaagc atacggaaat cagacagaag tagcacgcag actgaaatgt     7740

agtcgcggta cggtcagaaa atacgttgat gataaagacg ggaaaatgca cgccatcgtc     7800

aacgacgttc tcatggttca tcgcggatgg agtgaaagag atgcgctatt acgaaaaaat     7860

tgatggcagc aaataccgaa atatttgggt agttggcgat ctgcacggat gctacacgaa     7920

cctgatgaac aaactggata cgattggatt cgacaacaaa aaagacctgc ttatctcggt     7980

gggcgatttg gttgatcgtg gtgcagagaa cgttgaatgc ctggaattaa tcacattccc     8040

ctggttcaga gctgtacgtg gaaaccatga gcaaatgatg attgatggct tatcagagcg     8100

tggaaacgtt aatcactggc tgcttaatgg cggtggctgg ttctttaatc tcgattacga     8160

caaagaaatt ctggctaaag ctcttgccca taaagcagat gaacttccgt taatcatcga     8220

actggtgagc aaagataaaa aatatgttat ctgccacgcc gattatccct ttgacgaata     8280

cgagtttgga aagccagttg atcatcagca ggtaatctgg aaccgcgaac gaatcagcaa     8340

ctcacaaaac gggatcgtga aagaaatcaa aggcgcggac acgttcatct ttggtcatac     8400

gccagcagtg aaaccactca agtttgccaa ccaaatgtat atcgataccg gcgcagtgtt     8460

ctgcggaaac ctaacattga ttcaggtaca gggagaaggc gcatgagact cgaaagcgta     8520

gctaaatttc attcgccaaa aagcccgatg atgagcgact caccacgggc cacggcttct     8580

gactctcttt ccggtactga tgtgatggct gctatgggga tggcgcaatc acaagccgga     8640

ttcggtatgg ctgcattctg cggtaagcac gaactcagcc agaacgacaa acaaaaggct     8700

atcaactatc tgatgcaatt tgcacacaag gtatcgggga aataccgtgg tgtggcaaag     8760

cttgaaggaa atactaaggc aaaggtactg caagtgctcg caacattcgc ttatgcggat     8820

tattgccgta gtgccgcgac gccgggggca agatgcagag attgccatgg tacaggccgt     8880

gcggttgata ttgccaaaac agagctgtgg gggagagttg tcgagaaaga gtgcggaaga     8940

tgcaaaggcg tcggctattc aaggatgcca gcaagcgcag catatcgcgc tgtgacgatg     9000

ctaatcccaa accttaccca acccacctgg tcacgcactg ttaagccgct gtatgacgct     9060

ctggtggtgc aatgccacaa agaagagtca atcgcagaca acattttgaa tgcggtcaca     9120

cgttagcagc atgattgcca cggatggcaa catattaacg gcatgatatt gacttattga     9180

ataaaattgg gtaaatttga ctcaacgatg ggttaattcg ctcgttgtgg tagtgagatg     9240

aaaagaggcg gcgcttacta ccgattccgc ctagttggtc acttcgacgt atcgtctgga     9300

actccaacca tcgcaggcag agaggtctgc aaaatgcaat cccgaaacag ttcgcaggta     9360

atagttagag cctgcataac ggtttcggga ttttttatat ctgcacaaca ggtaagagca     9420

ttgagtcgat aatcgtgaag agtcggcgag cctggttagc cagtgctctt tccgttgtgc     9480

tgaattaagc gaataccgga agcagaaccg gatcaccaaa tgcgtacagg cgtcatcgcc     9540

gcccagcaac agcacaaccc aaactgagcc gtagccactg tctgtcctga attcattagt     9600

aatagttacg ctgcggcctt ttacacatga ccttcgtgaa agcgggtggc aggaggtcgc     9660

gctaacaacc tcctgccgtt ttgcccgtgc atatcggtca cgaacaaatc tgattactaa     9720

acacagtagc ctggatttgt tctatcagta atcgacctta ttcctaatta aatagagcaa     9780

atccccttat tgggggtaag acatgaagat gccagaaaaa catgacctgt tggccgccat     9840

tctcgcggca aaggaacaag gcatcggggc aatccttgcg tttgcaatgg cgtaccttcg     9900

cggcagatat aatggcggtg cgtttacaaa aacagtaatc gacgcaacga tgtgcgccat     9960

tatcgcctgg ttcattcgtg accttctcga cttcgccgga ctaagtagca atctcgctta    10020

tataacgagc gtgtttatcg gctacatcgg tactgactcg attggttcgc ttatcaaacg    10080

cttcgctgct aaaaaagccg gagtagaaga tggtagaaat caataatcaa cgtaaggcgt    10140

tcctcgatat gctggcgtgg tcggagggaa ctgataacgg acgtcagaaa accagaaatc    10200

atggttatga cgtcattgta ggcggagagc tatttactga ttactccgat caccctcgca    10260

aacttgtcac gctaaaccca aaactcaaat caacaggcgc ttaagactgg ccgtcgtttt    10320

acaacacaga aagagtttgt agaaacgcaa aaaggccatc cgtcaggggc cttctgctta    10380

gtttgatgcc tggcagttcc ctactctcgc cttccgcttc ctcgctcact gactcgctgc    10440

gctcggtcgt tcggctgcgg cgagcggtat cagctcactc aaaggcggta atacggttat    10500

ccacagaatc aggggataac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca    10560

ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag gctccgcccc cctgacgagc    10620

atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta taaagatacc    10680

aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg    10740

gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc tcacgctgta    10800

ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg    10860

ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac    10920

acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg aggtatgtag    10980

gcggtgctac agagttcttg aagtggtggg ctaactacgg ctacactaga agaacagtat    11040

ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt agctcttgat    11100

ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag cagattacgc    11160

gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct gacgctcagt    11220

ggaacgacgc gcgcgtaact cacgttaagg gattttggtc atgagcttgc gccgtcccgt    11280

caagtcagcg taatgctctg cttttagaaa aactcatcga gcatcaaatg aaactgcaat    11340

ttattcatat caggattatc aataccatat ttttgaaaaa gccgtttctg taatgaagga    11400

gaaaactcac cgaggcagtt ccataggatg gcaagatcct ggtatcggtc tgcgattccg    11460

actcgtccaa catcaataca acctattaat ttcccctcgt caaaaataag gttatcaagt    11520

gagaaatcac catgagtgac gactgaatcc ggtgagaatg gcaaaagttt atgcatttct    11580

ttccagactt gttcaacagg ccagccatta cgctcgtcat caaaatcact cgcatcaacc    11640

aaaccgttat tcattcgtga ttgcgcctga gcgaggcgaa atacgcgatc gctgttaaaa    11700

ggacaattac aaacaggaat cgagtgcaac cggcgcagga acactgccag cgcatcaaca    11760

atattttcac ctgaatcagg atattcttct aatacctgga acgctgtttt tccggggatc    11820

gcagtggtga gtaaccatgc atcatcagga gtacggataa aatgcttgat ggtcggaagt    11880

ggcataaatt ccgtcagcca gtttagtctg accatctcat ctgtaacatc attggcaacg    11940

ctacctttgc catgtttcag aaacaactct ggcgcatcgg gcttcccata caagcgatag    12000

attgtcgcac ctgattgccc gacattatcg cgagcccatt tatacccata taaatcagca    12060

tccatgttgg aatttaatcg cggcctcgac gtttcccgtt gaatatggct catattcttc    12120

ctttttcaat attattgaag catttatcag ggttattgtc tcatgagcgg atacatattt    12180

gaatgtattt agaaaaataa acaaataggg gtcagtgtta caaccaatta accaattctg    12240

aacattatcg cgagcccatt tatacctgaa tatggctcat aacacccctt gtttgcctgg    12300

cggcagtagc gcggtggtcc cacctgaccc catgccgaac tcagaagtga aacgccgtag    12360

cgccgatggt agtgtgggga ctccccatgc gagagtaggg aactgccagg catcaaataa    12420

aacgaaaggc tcagtcgaaa gactgggcct ttcgcccggg ctaattaggg ggtgtcgccc    12480

ttattcgact ctatagtgaa gttcctattc tctagaaagt ataggaactt ctgaagtggg    12540

gtcgacttaa ttaagg                                                    12556


<210>  44
<211>  12556
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  44
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

tagttattaa tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg      240

cgttacataa cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt      300

gacgtcaata atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca      360

atgggtggag tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc      420

aagtacgccc cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta      480

catgacctta tgggactttc ctacttggca gtacatctac gtattagtca tcgctattaa      540

catggtcgag gtgagcccca cgttctgctt cactctcccc atctcccccc cctccccacc      600

cccaattttg tatttattta ttttttaatt attttgtgca gcgatggggg cggggggggg      660

gggggggcgc gcgccaggcg gggcggggcg gggcgagggg cggggcgggg cgaggcggag      720

aggtgcggcg gcagccaatc agagcggcgc gctccgaaag tttcctttta tggcgaggcg      780

gcggcggcgg cggccctata aaaagcgaag cgcgcggcgg gcggggagtc gctgcgacgc      840

tgccttcgcc ccgtgccccg ctccgccgcc gcctcgcgcc gcccgccccg gctctgactg      900

accgcgttac tcccacaggt gagcgggcgg gacggccctt ctcctccggg ctgtaattag      960

cgcttggttt aatgacggct tgtttctttt ctgtggctgc gtgaaagcct tgaggggctc     1020

cgggagggcc ctttgtgcgg ggggagcggc tcggggggtg cgtgcgtgtg tgtgtgcgtg     1080

gggagcgccg cgtgcggctc cgcgctgccc ggcggctgtg agcgctgcgg gcgcggcgcg     1140

gggctttgtg cgctccgcag tgtgcgcgag gggagcgcgg ccgggggcgg tgccccgcgg     1200

tgcggggggg gctgcgaggg gaacaaaggc tgcgtgcggg gtgtgtgcgt gggggggtga     1260

gcagggggtg tgggcgcgtc ggtcgggctg caaccccccc tgcacccccc tccccgagtt     1320

gctgagcacg gcccggcttc gggtgcgggg ctccgtacgg ggcgtggcgc ggggctcgcc     1380

gtgccgggcg gggggtggcg gcaggtgggg gtgccgggcg gggcggggcc gcctcgggcc     1440

ggggagggct cgggggaggg gcgcggcggc ccccggagcg ccggcggctg tcgaggcgcg     1500

gcgagccgca gccattgcct tttatggtaa tcgtgcgaga gggcgcaggg acttcctttg     1560

tcccaaatct gtgcggagcc gaaatctggg aggcgccgcc gcaccccctc tagcgggcgc     1620

ggggcgaagc ggtgcggcgc cggcaggaag gaaatgggcg gggagggcct tcgtgcgtcg     1680

ccgcgccgcc gtccccttct ccctctccag cctcggggct gtccgcgggg ggacggctgc     1740

cttcgggggg gacggggcag ggcggggttc ggcttctggc gtgtgaccgg cggctctaga     1800

caattgtact aaccttcttc tctttcctct cctgacaggt tggtgtacac tagcggccgc     1860

caccatgttc aagtccctca ccaaagtcaa caaggtcaag cccatcggag agaacaacga     1920

gaatgagcag agctctcggc gcaacgaaga aggatcccat ccgtcgaacc agtcacagca     1980

gactaccgca caggaggaga acaagggaga agaaaagtcg ctcaagacta agtccacccc     2040

cgtgacctcg gaagaaccgc acacgaacat tcaggacaag ctgtccaaga agaactcctc     2100

cggcgatctc acgactaacc cggaccccca gaatgccgct gaacctactg ggaccgtgcc     2160

tgagcaaaag gagatggacc ccggaaagga gggtcctaac tccccccaaa acaagccccc     2220

ggccgcgccg gtcatcaatg agtacgcgga cgcgcaactg cataacctcg tgaagcggat     2280

gcggcaaaga accgccctct acaagaagaa actggtggag ggcgacctga gctcacctga     2340

agccagccca cagaccgcca aacccaccgc cgtgccgcct gtgaaggagt ccgatgacaa     2400

gcctaccgag cactactacc gcctgctgtg gttcaaggtc aagaagatgc ccctgaccga     2460

atacctcaag cggatcaagc tgccgaacag catcgacagc tacaccgacc ggctttactt     2520

gctctggctg ctgcttgtga ccctggctta caactggaac tgttgtttca ttcccctgcg     2580

gctggtgttc ccttaccaaa ccgcggataa cattcactac tggctgattg ccgacatcat     2640

ttgcgacatc atctacctgt acgatatgct ttttatccaa ccgcggctgc aattcgtccg     2700

cgggggagac atcattgtgg actccaacga gctgcgcaag cattaccgga cctcgacaaa     2760

gttccagctg gatgtggcct ccatcatccc gttcgatatc tgttacctgt tctttggctt     2820

caacccgatg ttcagggcga acaggatgct gaagtacact tccttcttcg aattcaacca     2880

ccacctggag tccatcatgg acaaggctta catctaccgc gtgatccgga ccactggtta     2940

cctcctgttc atcctgcaca tcaacgcctg cgtctattac tgggcctcaa actacgaagg     3000

cattggtacc acccgctggg tgtacgacgg ggagggaaac gagtatctgc gctgctacta     3060

ctgggccgtg cgaaccctca taactattgg cggcctcccg gaaccgcaga ccctgttcga     3120

gatcgtgttc caactcctca acttcttctc gggagtgttc gtgttttcaa gcttgattgg     3180

acagatgcgg gacgtgatcg gtgcagcaac tgccaaccag aactactttc gcgcctgcat     3240

ggacgacact atcgcgtaca tgaacaacta ttcgatcccc aagctggtgc agaaacgcgt     3300

gcggacttgg tatgagtaca cttgggactc ccagagaatg cttgacgagt ccgatctgct     3360

caagaccctg cctactaccg tgcagctggc actcgccatc gatgtgaact tctccattat     3420

ctcgaaagtc gatctgttca agggctgcga cacccagatg atctacgaca tgctgctgag     3480

actcaagtcc gtgttgtacc tccctggcga cttcgtgtgc aagaagggcg aaatcgggaa     3540

ggagatgtac attatcaagc acggagaagt ccaggtgctg gggggaccag acggtaccaa     3600

ggtccttgtc accctgaagg ccgggtccgt gttcggcgaa atttccctgt tggccgccgg     3660

cggtggcaac aggagaaccg caaatgtggt ggcccacggc ttcgcaaacc ttctgaccct     3720

ggacaagaaa accctccagg aaatcctcgt gcactacccg gatagcgagc ggatcctgat     3780

gaagaaagcc cgggtgctgc tgaagcaaaa ggccaagacc gccgaagcca ccccgcctcg     3840

gaaggacctg gctctgctgt tcccacccaa ggaggagact cccaaactgt ttaagaccct     3900

cttgggcggg acgggaaagg cctccctcgc tcgcttgctt aagttgaaga gggagcaggc     3960

cgcgcagaag aaggaaaact ccgaaggagg ggaagaagag ggaaaggaaa acgaagataa     4020

gcagaaggag aacgaggata agcaaaagga aaatgaggac aaggggaaag aaaacgagga     4080

caaggataag ggtcgcgaac ctgaagagaa gccgctggat cggccagagt gcactgcctc     4140

gcctatcgcg gtcgaagagg aaccccatag cgtgcgcaga accgtgctgc ctagaggcac     4200

atcgaggcag tcactgatta tctctatggc accaagcgcc gagggaggag aggaagtgct     4260

caccatcgag gtcaaggaaa aagcgaagca gtgatcataa ctgcagatct gcctcgactg     4320

tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg     4380

aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga     4440

gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg     4500

aagacaatag caggcatgct ggggactcga gttctacgta gataagtagc atggcgggtt     4560

aatcattaac tacaaggaac ccctagtgat ggagttggcc actccctctc tgcgcgctcg     4620

ctcgctcact gaggccgggc gaccaaaggt cgcccgacgc ccgggctttg cccgggcggc     4680

ctcagtgagc gagcgagcgc gcagccttaa ttaacctaag gaaaatgaag tgaagttcct     4740

atactttcta gagaatagga acttctatag tgagtcgaat aagggcgaca caaaatttat     4800

tctaaatgca taataaatac tgataacatc ttatagtttg tattatattt tgtattatcg     4860

ttgacatgta taattttgat atcaaaaact gattttccct ttattatttt cgagatttat     4920

tttcttaatt ctctttaaca aactagaaat attgtatata caaaaaatca taaataatag     4980

atgaatagtt taattatagg tgttcatcaa tcgaaaaagc aacgtatctt atttaaagtg     5040

cgttgctttt ttctcattta taaggttaaa taattctcat atatcaagca aagtgacagg     5100

cgcccttaaa tattctgaca aatgctcttt ccctaaactc cccccataaa aaaacccgcc     5160

gaagcgggtt tttacgttat ttgcggatta acgattactc gttatcagaa ccgcccaggg     5220

ggcccgagct taaccttttt atttggggga gagggaagtc atgaaaaaac taacctttga     5280

aattcgatct ccagcacatc agcaaaacgc tattcacgca gtacagcaaa tccttccaga     5340

cccaaccaaa ccaatcgtag taaccattca ggaacgcaac cgcagcttag accaaaacag     5400

gaagctatgg gcctgcttag gtgacgtctc tcgtcaggtt gaatggcatg gtcgctggct     5460

ggatgcagaa agctggaagt gtgtgtttac cgcagcatta aagcagcagg atgttgttcc     5520

taaccttgcc gggaatggct ttgtggtaat aggccagtca accagcagga tgcgtgtagg     5580

cgaatttgcg gagctattag agcttataca ggcattcggt acagagcgtg gcgttaagtg     5640

gtcagacgaa gcgagactgg ctctggagtg gaaagcgaga tggggagaca gggctgcatg     5700

ataaatgtcg ttagtttctc cggtggcagg acgtcagcat atttgctctg gctaatggag     5760

caaaagcgac gggcaggtaa agacgtgcat tacgttttca tggatacagg ttgtgaacat     5820

ccaatgacat atcggtttgt cagggaagtt gtgaagttct gggatatacc gctcaccgta     5880

ttgcaggttg atatcaaccc ggagcttgga cagccaaatg gttatacggt atgggaacca     5940

aaggatattc agacgcgaat gcctgttctg aagccattta tcgatatggt aaagaaatat     6000

ggcactccat acgtcggcgg cgcgttctgc actgacagat taaaactcgt tcccttcacc     6060

aaatactgtg atgaccattt cgggcgaggg aattacacca cgtggattgg catcagagct     6120

gatgaaccga agcggctaaa gccaaagcct ggaatcagat atcttgctga actgtcagac     6180

tttgagaagg aagatatcct cgcatggtgg aagcaacaac cattcgattt gcaaataccg     6240

gaacatctcg gtaactgcat attctgcatt aaaaaatcaa cgcaaaaaat cggacttgcc     6300

tgcaaagatg aggagggatt gcagcgtgtt tttaatgagg tcatcacggg atcccatgtg     6360

cgtgacggac atcgggaaac gccaaaggag attatgtacc gaggaagaat gtcgctggac     6420

ggtatcgcga aaatgtattc agaaaatgat tatcaagccc tgtatcagga catggtacga     6480

gctaaaagat tcgataccgg ctcttgttct gagtcatgcg aaatatttgg agggcagctt     6540

gatttcgact tcgggaggga agctgcatga tgcgatgtta tcggtgcggt gaatgcaaag     6600

aagataaccg cttccgacca aatcaacctt actggaatcg atggtgtctc cggtgtgaaa     6660

gaacaccaac aggggtgtta ccactaccgc aggaaaagga ggacgtgtgg cgagacagcg     6720

acgaagtatc accgacataa tctgcgaaaa ctgcaaatac cttccaacga aacgcaccag     6780

aaataaaccc aagccaatcc caaaagaatc tgacgtaaaa accttcaact acacggctca     6840

cctgtgggat atccggtggc taagacgtcg tgcgaggaaa acaaggtgat tgaccaaaat     6900

cgaagttacg aacaagaaag cgtcgagcga gctttaacgt gcgctaactg cggtcagaag     6960

ctgcatgtgc tggaagttca cgtgtgtgag cactgctgcg cagaactgat gagcgatccg     7020

aatagctcga tgcacgagga agaagatgat ggctaaacca gcgcgaagac gatgtaaaaa     7080

cgatgaatgc cgggaatggt ttcaccctgc attcgctaat cagtggtggt gctctccaga     7140

gtgtggaacc aagatagcac tcgaacgacg aagtaaagaa cgcgaaaaag cggaaaaagc     7200

agcagagaag aaacgacgac gagaggagca gaaacagaaa gataaactta agattcgaaa     7260

actcgcctta aagccccgca gttactggat taaacaagcc caacaagccg taaacgcctt     7320

catcagagaa agagaccgcg acttaccatg tatctcgtgc ggaacgctca cgtctgctca     7380

gtgggatgcc ggacattacc ggacaactgc tgcggcacct caactccgat ttaatgaacg     7440

caatattcac aagcaatgcg tggtgtgcaa ccagcacaaa agcggaaatc tcgttccgta     7500

tcgcgtcgaa ctgattagcc gcatcgggca ggaagcagta gacgaaatcg aatcaaacca     7560

taaccgccat cgctggacta tcgaagagtg caaggcgatc aaggcagagt accaacagaa     7620

actcaaagac ctgcgaaata gcagaagtga ggccgcatga cgttctcagt aaaaaccatt     7680

ccagacatgc tcgttgaagc atacggaaat cagacagaag tagcacgcag actgaaatgt     7740

agtcgcggta cggtcagaaa atacgttgat gataaagacg ggaaaatgca cgccatcgtc     7800

aacgacgttc tcatggttca tcgcggatgg agtgaaagag atgcgctatt acgaaaaaat     7860

tgatggcagc aaataccgaa atatttgggt agttggcgat ctgcacggat gctacacgaa     7920

cctgatgaac aaactggata cgattggatt cgacaacaaa aaagacctgc ttatctcggt     7980

gggcgatttg gttgatcgtg gtgcagagaa cgttgaatgc ctggaattaa tcacattccc     8040

ctggttcaga gctgtacgtg gaaaccatga gcaaatgatg attgatggct tatcagagcg     8100

tggaaacgtt aatcactggc tgcttaatgg cggtggctgg ttctttaatc tcgattacga     8160

caaagaaatt ctggctaaag ctcttgccca taaagcagat gaacttccgt taatcatcga     8220

actggtgagc aaagataaaa aatatgttat ctgccacgcc gattatccct ttgacgaata     8280

cgagtttgga aagccagttg atcatcagca ggtaatctgg aaccgcgaac gaatcagcaa     8340

ctcacaaaac gggatcgtga aagaaatcaa aggcgcggac acgttcatct ttggtcatac     8400

gccagcagtg aaaccactca agtttgccaa ccaaatgtat atcgataccg gcgcagtgtt     8460

ctgcggaaac ctaacattga ttcaggtaca gggagaaggc gcatgagact cgaaagcgta     8520

gctaaatttc attcgccaaa aagcccgatg atgagcgact caccacgggc cacggcttct     8580

gactctcttt ccggtactga tgtgatggct gctatgggga tggcgcaatc acaagccgga     8640

ttcggtatgg ctgcattctg cggtaagcac gaactcagcc agaacgacaa acaaaaggct     8700

atcaactatc tgatgcaatt tgcacacaag gtatcgggga aataccgtgg tgtggcaaag     8760

cttgaaggaa atactaaggc aaaggtactg caagtgctcg caacattcgc ttatgcggat     8820

tattgccgta gtgccgcgac gccgggggca agatgcagag attgccatgg tacaggccgt     8880

gcggttgata ttgccaaaac agagctgtgg gggagagttg tcgagaaaga gtgcggaaga     8940

tgcaaaggcg tcggctattc aaggatgcca gcaagcgcag catatcgcgc tgtgacgatg     9000

ctaatcccaa accttaccca acccacctgg tcacgcactg ttaagccgct gtatgacgct     9060

ctggtggtgc aatgccacaa agaagagtca atcgcagaca acattttgaa tgcggtcaca     9120

cgttagcagc atgattgcca cggatggcaa catattaacg gcatgatatt gacttattga     9180

ataaaattgg gtaaatttga ctcaacgatg ggttaattcg ctcgttgtgg tagtgagatg     9240

aaaagaggcg gcgcttacta ccgattccgc ctagttggtc acttcgacgt atcgtctgga     9300

actccaacca tcgcaggcag agaggtctgc aaaatgcaat cccgaaacag ttcgcaggta     9360

atagttagag cctgcataac ggtttcggga ttttttatat ctgcacaaca ggtaagagca     9420

ttgagtcgat aatcgtgaag agtcggcgag cctggttagc cagtgctctt tccgttgtgc     9480

tgaattaagc gaataccgga agcagaaccg gatcaccaaa tgcgtacagg cgtcatcgcc     9540

gcccagcaac agcacaaccc aaactgagcc gtagccactg tctgtcctga attcattagt     9600

aatagttacg ctgcggcctt ttacacatga ccttcgtgaa agcgggtggc aggaggtcgc     9660

gctaacaacc tcctgccgtt ttgcccgtgc atatcggtca cgaacaaatc tgattactaa     9720

acacagtagc ctggatttgt tctatcagta atcgacctta ttcctaatta aatagagcaa     9780

atccccttat tgggggtaag acatgaagat gccagaaaaa catgacctgt tggccgccat     9840

tctcgcggca aaggaacaag gcatcggggc aatccttgcg tttgcaatgg cgtaccttcg     9900

cggcagatat aatggcggtg cgtttacaaa aacagtaatc gacgcaacga tgtgcgccat     9960

tatcgcctgg ttcattcgtg accttctcga cttcgccgga ctaagtagca atctcgctta    10020

tataacgagc gtgtttatcg gctacatcgg tactgactcg attggttcgc ttatcaaacg    10080

cttcgctgct aaaaaagccg gagtagaaga tggtagaaat caataatcaa cgtaaggcgt    10140

tcctcgatat gctggcgtgg tcggagggaa ctgataacgg acgtcagaaa accagaaatc    10200

atggttatga cgtcattgta ggcggagagc tatttactga ttactccgat caccctcgca    10260

aacttgtcac gctaaaccca aaactcaaat caacaggcgc ttaagactgg ccgtcgtttt    10320

acaacacaga aagagtttgt agaaacgcaa aaaggccatc cgtcaggggc cttctgctta    10380

gtttgatgcc tggcagttcc ctactctcgc cttccgcttc ctcgctcact gactcgctgc    10440

gctcggtcgt tcggctgcgg cgagcggtat cagctcactc aaaggcggta atacggttat    10500

ccacagaatc aggggataac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca    10560

ggaaccgtaa aaaggccgcg ttgctggcgt ttttccatag gctccgcccc cctgacgagc    10620

atcacaaaaa tcgacgctca agtcagaggt ggcgaaaccc gacaggacta taaagatacc    10680

aggcgtttcc ccctggaagc tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg    10740

gatacctgtc cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc tcacgctgta    10800

ggtatctcag ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg    10860

ttcagcccga ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac    10920

acgacttatc gccactggca gcagccactg gtaacaggat tagcagagcg aggtatgtag    10980

gcggtgctac agagttcttg aagtggtggg ctaactacgg ctacactaga agaacagtat    11040

ttggtatctg cgctctgctg aagccagtta ccttcggaaa aagagttggt agctcttgat    11100

ccggcaaaca aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag cagattacgc    11160

gcagaaaaaa aggatctcaa gaagatcctt tgatcttttc tacggggtct gacgctcagt    11220

ggaacgacgc gcgcgtaact cacgttaagg gattttggtc atgagcttgc gccgtcccgt    11280

caagtcagcg taatgctctg cttttagaaa aactcatcga gcatcaaatg aaactgcaat    11340

ttattcatat caggattatc aataccatat ttttgaaaaa gccgtttctg taatgaagga    11400

gaaaactcac cgaggcagtt ccataggatg gcaagatcct ggtatcggtc tgcgattccg    11460

actcgtccaa catcaataca acctattaat ttcccctcgt caaaaataag gttatcaagt    11520

gagaaatcac catgagtgac gactgaatcc ggtgagaatg gcaaaagttt atgcatttct    11580

ttccagactt gttcaacagg ccagccatta cgctcgtcat caaaatcact cgcatcaacc    11640

aaaccgttat tcattcgtga ttgcgcctga gcgaggcgaa atacgcgatc gctgttaaaa    11700

ggacaattac aaacaggaat cgagtgcaac cggcgcagga acactgccag cgcatcaaca    11760

atattttcac ctgaatcagg atattcttct aatacctgga acgctgtttt tccggggatc    11820

gcagtggtga gtaaccatgc atcatcagga gtacggataa aatgcttgat ggtcggaagt    11880

ggcataaatt ccgtcagcca gtttagtctg accatctcat ctgtaacatc attggcaacg    11940

ctacctttgc catgtttcag aaacaactct ggcgcatcgg gcttcccata caagcgatag    12000

attgtcgcac ctgattgccc gacattatcg cgagcccatt tatacccata taaatcagca    12060

tccatgttgg aatttaatcg cggcctcgac gtttcccgtt gaatatggct catattcttc    12120

ctttttcaat attattgaag catttatcag ggttattgtc tcatgagcgg atacatattt    12180

gaatgtattt agaaaaataa acaaataggg gtcagtgtta caaccaatta accaattctg    12240

aacattatcg cgagcccatt tatacctgaa tatggctcat aacacccctt gtttgcctgg    12300

cggcagtagc gcggtggtcc cacctgaccc catgccgaac tcagaagtga aacgccgtag    12360

cgccgatggt agtgtgggga ctccccatgc gagagtaggg aactgccagg catcaaataa    12420

aacgaaaggc tcagtcgaaa gactgggcct ttcgcccggg ctaattaggg ggtgtcgccc    12480

ttattcgact ctatagtgaa gttcctattc tctagaaagt ataggaactt ctgaagtggg    12540

gtcgacttaa ttaagg                                                    12556


<210>  45
<211>  2450
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  45
gccgccacca tgttcaagtc cctcaccaaa gtcaacaagg tcaagcccat cggagagaac       60

aacgagaatg agcagagctc tcggcgcaac gaagaaggat cccatccgtc gaaccagtca      120

cagcagacta ccgcacagga ggagaacaag ggagaagaaa agtcgctcaa gactaagtcc      180

acccccgtga cctcggaaga accgcacacg aacattcagg acaagctgtc caagaagaac      240

tcctccggcg atctcacgac taacccggac ccccagaatg ccgctgaacc tactgggacc      300

gtgcctgagc aaaaggagat ggaccccgga aaggagggtc ctaactcccc ccaaaacaag      360

cccccggccg cgccggtcat caatgagtac gcggacgcgc aactgcataa cctcgtgaag      420

cggatgcggc aaagaaccgc cctctacaag aagaaactgg tggagggcga cctgagctca      480

cctgaagcca gcccacagac cgccaaaccc accgccgtgc cgcctgtgaa ggagtccgat      540

gacaagccta ccgagcacta ctaccgcctg ctgtggttca aggtcaagaa gatgcccctg      600

accgaatacc tcaagcggat caagctgccg aacagcatcg acagctacac cgaccggctt      660

tacttgctct ggctgctgct tgtgaccctg gcttacaact ggaactgttg tttcattccc      720

ctgcggctgg tgttccctta ccaaaccgcg gataacattc actactggct gattgccgac      780

atcatttgcg acatcatcta cctgtacgat atgcttttta tccaaccgcg gctgcaattc      840

gtccgcgggg gagacatcat tgtggactcc aacgagctgc gcaagcatta ccggacctcg      900

acaaagttcc agctggatgt ggcctccatc atcccgttcg atatctgtta cctgttcttt      960

ggcttcaacc cgatgttcag ggcgaacagg atgctgaagt acacttcctt cttcgaattc     1020

aaccaccacc tggagtccat catggacaag gcttacatct accgcgtgat ccggaccact     1080

ggttacctcc tgttcatcct gcacatcaac gcctgcgtct attactgggc ctcaaactac     1140

gaaggcattg gtaccacccg ctgggtgtac gacggggagg gaaacgagta tctgcgctgc     1200

tactactggg ccgtgcgaac cctcataact attggcggcc tcccggaacc gcagaccctg     1260

ttcgagatcg tgttccaact cctcaacttc ttctcgggag tgttcgtgtt ttcaagcttg     1320

attggacaga tgcgggacgt gatcggtgca gcaactgcca accagaacta ctttcgcgcc     1380

tgcatggacg acactatcgc gtacatgaac aactattcga tccccaagct ggtgcagaaa     1440

cgcgtgcgga cttggtatga gtacacttgg gactcccaga gaatgcttga cgagtccgat     1500

ctgctcaaga ccctgcctac taccgtgcag ctggcactcg ccatcgatgt gaacttctcc     1560

attatctcga aagtcgatct gttcaagggc tgcgacaccc agatgatcta cgacatgctg     1620

ctgagactca agtccgtgtt gtacctccct ggcgacttcg tgtgcaagaa gggcgaaatc     1680

gggaaggaga tgtacattat caagcacgga gaagtccagg tgctgggggg accagacggt     1740

accaaggtcc ttgtcaccct gaaggccggg tccgtgttcg gcgaaatttc cctgttggcc     1800

gccggcggtg gcaacaggag aaccgcaaat gtggtggccc acggcttcgc aaaccttctg     1860

accctggaca agaaaaccct ccaggaaatc ctcgtgcact acccggatag cgagcggatc     1920

ctgatgaaga aagcccgggt gctgctgaag caaaaggcca agaccgccga agccaccccg     1980

cctcggaagg acctggctct gctgttccca cccaaggagg agactcccaa actgtttaag     2040

accctcttgg gcgggacggg aaaggcctcc ctcgctcgct tgcttaagtt gaagagggag     2100

caggccgcgc agaagaagga aaactccgaa ggaggggaag aagagggaaa ggaaaacgaa     2160

gataagcaga aggagaacga ggataagcaa aaggaaaatg aggacaaggg gaaagaaaac     2220

gaggacaagg ataagggtcg cgaacctgaa gagaagccgc tggatcggcc agagtgcact     2280

gcctcgccta tcgcggtcga agaggaaccc catagcgtgc gcagaaccgt gctgcctaga     2340

ggcacatcga ggcagtcact gattatctct atggcaccaa gcgccgaggg aggagaggaa     2400

gtgctcacca tcgaggtcaa ggaaaaagcg aagcagtgat cataactgca                2450


<210>  46
<211>  12006
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  constructed sequence

<400>  46
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gcaagctagc      180

ctgaagagac agaaatatct ctaattccat gagcggtcat acgaggcaag agaagccgct      240

tagagcatgg acttagttag tttcagggat tggacagagt caagagctgg ggtgaggagg      300

ttaccctcgg taggggtgac acagatgtca accgcctatt ccctccacat gcatgtcctg      360

ccagaagaac ctgtccctgg gctgggaatc ttatattacc ttcctctcca atgagaagag      420

aagttcaagg ctcacagaca tgtgcataca caagctcaat gcactcaaga ttcccctcca      480

ccactcctgc ccccactacc tacaggagat tgactcctgc tgtgcacata agctgggata      540

atcagggttt ctaaacatca gcttcaaaag tccaatgtcc aaagtggtgg ggggccgggg      600

aacgaggtac tctttccata cccttggctt ttgtgtggcc tggagccgct gatatagaga      660

ttggagtggg acacgaggta ttcctttcaa aaacacaaag gcctatactt tgagccctcc      720

catttcaatc ccccaccatg cttcaccttt aagacctcca actccacttt gatcccagtt      780

ctcaggttca ggcctcacaa ggccaaaatc ctgaagttac ccttctcaaa ctcccttgcc      840

tttaacatca tcagaatcaa cctcctaccc ccactctgtc ccagcagcaa tagcctgcta      900

atcttttagc actaatcttt taggcactaa tctgctttcc aaactcttgg cacctgaact      960

atttataagc agtgttttat gcccccccac caaagaaccc tattcttttc ccatgacccc     1020

accaatcaaa acactcagag gactgtgggt ataagaggct ggggaggcag gcatagcagc     1080

ggccgccacc atggccaaga tcaacaccca atactcccac ccctccagga cccacctcaa     1140

ggtaaagacc tcagaccggg atctcaatcg cgctgaaaat ggcctcagca gagcccactc     1200

gtcaagtgag gagacatcgt cagtgctgca gccggggatc gccatggaga ccagaggact     1260

ggctgactcc gggcagggct ccttcaccgg ccaggggatc gccaggctgt cgcgcctcat     1320

cttcttgctg cgcaggtggg ctgccaggca tgtgcaccac caggaccagg gaccggactc     1380

ttttcctgat cgtttccgtg gagccgagct taaggaggtg tccagccaag aaagcaatgc     1440

ccaggcaaat gtgggcagcc aggagccagc agacagaggg agaagcgcct ggcccctggc     1500

caaatgcaac actaacacca gcaacaacac ggaggaggag aagaagacga aaaagaagga     1560

tgcgatcgtg gtggacccgt ccagcaacct gtactaccgc tggctgaccg ccatcgccct     1620

gcctgtcttc tataactggt atctgcttat ttgcagggcc tgtttcgatg agctgcagtc     1680

cgagtacctg atgctgtggc tggtcctgga ctactcggca gatgtcctgt atgtcttgga     1740

tgtgcttgta cgagctcgga caggttttct tgagcaaggc ttaatggtca gtgataccaa     1800

caggctgtgg cagcattaca agacgaccac gcagttcaag ctggatgtgt tgtccctggt     1860

ccccaccgac ctggcttact taaaggtggg cacaaactac ccagaagtga ggttcaaccg     1920

cctactgaag ttttcccggc tctttgaatt ctttgaccgc acagagacaa ggaccaacta     1980

ccccaatatg ttcaggattg ggaacttggt cttgtacatt ctcatcatca tccactggaa     2040

tgcctgcatc tactttgcca tttccaagtt cattggtttt gggacagact cctgggtcta     2100

cccaaacatc tcaatcccag agcatgggcg cctctccagg aagtacattt acagtctcta     2160

ctggtccacc ttgaccctta ccaccattgg tgagacccca ccccccgtga aagatgagga     2220

gtatctcttt gtggtcgtag acttcttggt gggtgttctg atttttgcca ccattgtggg     2280

caatgtgggc tccatgatct cgaatatgaa tgcctcacgg gcagagttcc aggccaagat     2340

tgattccatc aagcagtaca tgcagttccg caaggtcacc aaggacttgg agacgcgggt     2400

tatccggtgg tttgactacc tgtgggccaa caagaagacg gtggatgaga aggaggtgct     2460

caagagcctc ccagacaagc tgaaggctga gatcgccatc aacgtgcacc tggacacgct     2520

gaagaaggtt cgcatcttcc aggactgtga ggcagggctg ctggtggagc tggtgctgaa     2580

gctgcgaccc actgtgttca gccctgggga ttatatctgc aagaagggag atattgggaa     2640

ggagatgtac atcatcaacg agggcaagct ggccgtggtg gctgatgatg gggtcaccca     2700

gttcgtggtc ctcagcgatg gcagctactt cggggagatc agcattctga acatcaaggg     2760

gagcaagtcg gggaaccgca ggacggccaa catccgcagc attggctact cagacctgtt     2820

ctgcctctca aaggacgatc tcatggaggc cctcaccgag taccccgaag ccaagaaggc     2880

cctggaggag aaaggacggc agatcctgat gaaagacaac ctgatcgatg aggagctggc     2940

cagggcgggc gcggacccca aggaccttga ggagaaagtg gagcagctgg ggtcctccct     3000

ggacaccctg cagaccaggt ttgcacgcct cctggctgag tacaacgcca cccagatgaa     3060

gatgaagcag cgtctcagcc aactggaaag ccaggtgaag ggtggtgggg acaagcccct     3120

ggctgatggg gaagttcccg gggatgctac aaaaacagag gacaaacaac agtgatcata     3180

gggccgcata gtactgcgga tccgatccaa tcaacctctg gattacaaaa tttgtgaaag     3240

attgactggt attcttaact atgttgctcc ttttacgcta tgtggatacg ctgctttaat     3300

gcctttgtat catgctattg cttcccgtat ggctttcatt ttctcctcct tgtataaatc     3360

ctggttgctg tctctttatg aggagttgtg gcccgttgtc aggcaacgtg gcgtggtgtg     3420

cactgtgttt gctgacgcaa cccccactgg ttggggcatt gccaccacct gtcagctcct     3480

ttccgggact ttcgctttcc ccctccctat tgccacggcg gaactcatcg ccgcctgcct     3540

tgcccgctgc tggacagggg ctcggctgtt gggcactgac aattccgtgg tgttgtcggg     3600

gaaatcatcg tcctttcctt ggctgctcgc ctgtgttgcc acctggattc tgcgcgggac     3660

gtccttctgc tacgtccctt cggccctcaa tccagcggac cttccttccc gcggcctgct     3720

gccggctctg cggcctcttc cgcgtcttcg agatcgatct gcctcgactg tgccttctag     3780

ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg aaggtgccac     3840

tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga gtaggtgtca     3900

ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg aagacaatag     3960

caggcatgct ggggactcga gttctacgta gataagtagc atggcgggtt aatcattaac     4020

tacaaggaac ccctagtgat ggagttggcc actccctctc tgcgcgctcg ctcgctcact     4080

gaggccgggc gaccaaaggt cgcccgacgc ccgggctttg cccgggcggc ctcagtgagc     4140

gagcgagcgc gcagccttaa ttaacctaag gaaaatgaag tgaagttcct atactttcta     4200

gagaatagga acttctatag tgagtcgaat aagggcgaca caaaatttat tctaaatgca     4260

taataaatac tgataacatc ttatagtttg tattatattt tgtattatcg ttgacatgta     4320

taattttgat atcaaaaact gattttccct ttattatttt cgagatttat tttcttaatt     4380

ctctttaaca aactagaaat attgtatata caaaaaatca taaataatag atgaatagtt     4440

taattatagg tgttcatcaa tcgaaaaagc aacgtatctt atttaaagtg cgttgctttt     4500

ttctcattta taaggttaaa taattctcat atatcaagca aagtgacagg cgcccttaaa     4560

tattctgaca aatgctcttt ccctaaactc cccccataaa aaaacccgcc gaagcgggtt     4620

tttacgttat ttgcggatta acgattactc gttatcagaa ccgcccaggg ggcccgagct     4680

taaccttttt atttggggga gagggaagtc atgaaaaaac taacctttga aattcgatct     4740

ccagcacatc agcaaaacgc tattcacgca gtacagcaaa tccttccaga cccaaccaaa     4800

ccaatcgtag taaccattca ggaacgcaac cgcagcttag accaaaacag gaagctatgg     4860

gcctgcttag gtgacgtctc tcgtcaggtt gaatggcatg gtcgctggct ggatgcagaa     4920

agctggaagt gtgtgtttac cgcagcatta aagcagcagg atgttgttcc taaccttgcc     4980

gggaatggct ttgtggtaat aggccagtca accagcagga tgcgtgtagg cgaatttgcg     5040

gagctattag agcttataca ggcattcggt acagagcgtg gcgttaagtg gtcagacgaa     5100

gcgagactgg ctctggagtg gaaagcgaga tggggagaca gggctgcatg ataaatgtcg     5160

ttagtttctc cggtggcagg acgtcagcat atttgctctg gctaatggag caaaagcgac     5220

gggcaggtaa agacgtgcat tacgttttca tggatacagg ttgtgaacat ccaatgacat     5280

atcggtttgt cagggaagtt gtgaagttct gggatatacc gctcaccgta ttgcaggttg     5340

atatcaaccc ggagcttgga cagccaaatg gttatacggt atgggaacca aaggatattc     5400

agacgcgaat gcctgttctg aagccattta tcgatatggt aaagaaatat ggcactccat     5460

acgtcggcgg cgcgttctgc actgacagat taaaactcgt tcccttcacc aaatactgtg     5520

atgaccattt cgggcgaggg aattacacca cgtggattgg catcagagct gatgaaccga     5580

agcggctaaa gccaaagcct ggaatcagat atcttgctga actgtcagac tttgagaagg     5640

aagatatcct cgcatggtgg aagcaacaac cattcgattt gcaaataccg gaacatctcg     5700

gtaactgcat attctgcatt aaaaaatcaa cgcaaaaaat cggacttgcc tgcaaagatg     5760

aggagggatt gcagcgtgtt tttaatgagg tcatcacggg atcccatgtg cgtgacggac     5820

atcgggaaac gccaaaggag attatgtacc gaggaagaat gtcgctggac ggtatcgcga     5880

aaatgtattc agaaaatgat tatcaagccc tgtatcagga catggtacga gctaaaagat     5940

tcgataccgg ctcttgttct gagtcatgcg aaatatttgg agggcagctt gatttcgact     6000

tcgggaggga agctgcatga tgcgatgtta tcggtgcggt gaatgcaaag aagataaccg     6060

cttccgacca aatcaacctt actggaatcg atggtgtctc cggtgtgaaa gaacaccaac     6120

aggggtgtta ccactaccgc aggaaaagga ggacgtgtgg cgagacagcg acgaagtatc     6180

accgacataa tctgcgaaaa ctgcaaatac cttccaacga aacgcaccag aaataaaccc     6240

aagccaatcc caaaagaatc tgacgtaaaa accttcaact acacggctca cctgtgggat     6300

atccggtggc taagacgtcg tgcgaggaaa acaaggtgat tgaccaaaat cgaagttacg     6360

aacaagaaag cgtcgagcga gctttaacgt gcgctaactg cggtcagaag ctgcatgtgc     6420

tggaagttca cgtgtgtgag cactgctgcg cagaactgat gagcgatccg aatagctcga     6480

tgcacgagga agaagatgat ggctaaacca gcgcgaagac gatgtaaaaa cgatgaatgc     6540

cgggaatggt ttcaccctgc attcgctaat cagtggtggt gctctccaga gtgtggaacc     6600

aagatagcac tcgaacgacg aagtaaagaa cgcgaaaaag cggaaaaagc agcagagaag     6660

aaacgacgac gagaggagca gaaacagaaa gataaactta agattcgaaa actcgcctta     6720

aagccccgca gttactggat taaacaagcc caacaagccg taaacgcctt catcagagaa     6780

agagaccgcg acttaccatg tatctcgtgc ggaacgctca cgtctgctca gtgggatgcc     6840

ggacattacc ggacaactgc tgcggcacct caactccgat ttaatgaacg caatattcac     6900

aagcaatgcg tggtgtgcaa ccagcacaaa agcggaaatc tcgttccgta tcgcgtcgaa     6960

ctgattagcc gcatcgggca ggaagcagta gacgaaatcg aatcaaacca taaccgccat     7020

cgctggacta tcgaagagtg caaggcgatc aaggcagagt accaacagaa actcaaagac     7080

ctgcgaaata gcagaagtga ggccgcatga cgttctcagt aaaaaccatt ccagacatgc     7140

tcgttgaagc atacggaaat cagacagaag tagcacgcag actgaaatgt agtcgcggta     7200

cggtcagaaa atacgttgat gataaagacg ggaaaatgca cgccatcgtc aacgacgttc     7260

tcatggttca tcgcggatgg agtgaaagag atgcgctatt acgaaaaaat tgatggcagc     7320

aaataccgaa atatttgggt agttggcgat ctgcacggat gctacacgaa cctgatgaac     7380

aaactggata cgattggatt cgacaacaaa aaagacctgc ttatctcggt gggcgatttg     7440

gttgatcgtg gtgcagagaa cgttgaatgc ctggaattaa tcacattccc ctggttcaga     7500

gctgtacgtg gaaaccatga gcaaatgatg attgatggct tatcagagcg tggaaacgtt     7560

aatcactggc tgcttaatgg cggtggctgg ttctttaatc tcgattacga caaagaaatt     7620

ctggctaaag ctcttgccca taaagcagat gaacttccgt taatcatcga actggtgagc     7680

aaagataaaa aatatgttat ctgccacgcc gattatccct ttgacgaata cgagtttgga     7740

aagccagttg atcatcagca ggtaatctgg aaccgcgaac gaatcagcaa ctcacaaaac     7800

gggatcgtga aagaaatcaa aggcgcggac acgttcatct ttggtcatac gccagcagtg     7860

aaaccactca agtttgccaa ccaaatgtat atcgataccg gcgcagtgtt ctgcggaaac     7920

ctaacattga ttcaggtaca gggagaaggc gcatgagact cgaaagcgta gctaaatttc     7980

attcgccaaa aagcccgatg atgagcgact caccacgggc cacggcttct gactctcttt     8040

ccggtactga tgtgatggct gctatgggga tggcgcaatc acaagccgga ttcggtatgg     8100

ctgcattctg cggtaagcac gaactcagcc agaacgacaa acaaaaggct atcaactatc     8160

tgatgcaatt tgcacacaag gtatcgggga aataccgtgg tgtggcaaag cttgaaggaa     8220

atactaaggc aaaggtactg caagtgctcg caacattcgc ttatgcggat tattgccgta     8280

gtgccgcgac gccgggggca agatgcagag attgccatgg tacaggccgt gcggttgata     8340

ttgccaaaac agagctgtgg gggagagttg tcgagaaaga gtgcggaaga tgcaaaggcg     8400

tcggctattc aaggatgcca gcaagcgcag catatcgcgc tgtgacgatg ctaatcccaa     8460

accttaccca acccacctgg tcacgcactg ttaagccgct gtatgacgct ctggtggtgc     8520

aatgccacaa agaagagtca atcgcagaca acattttgaa tgcggtcaca cgttagcagc     8580

atgattgcca cggatggcaa catattaacg gcatgatatt gacttattga ataaaattgg     8640

gtaaatttga ctcaacgatg ggttaattcg ctcgttgtgg tagtgagatg aaaagaggcg     8700

gcgcttacta ccgattccgc ctagttggtc acttcgacgt atcgtctgga actccaacca     8760

tcgcaggcag agaggtctgc aaaatgcaat cccgaaacag ttcgcaggta atagttagag     8820

cctgcataac ggtttcggga ttttttatat ctgcacaaca ggtaagagca ttgagtcgat     8880

aatcgtgaag agtcggcgag cctggttagc cagtgctctt tccgttgtgc tgaattaagc     8940

gaataccgga agcagaaccg gatcaccaaa tgcgtacagg cgtcatcgcc gcccagcaac     9000

agcacaaccc aaactgagcc gtagccactg tctgtcctga attcattagt aatagttacg     9060

ctgcggcctt ttacacatga ccttcgtgaa agcgggtggc aggaggtcgc gctaacaacc     9120

tcctgccgtt ttgcccgtgc atatcggtca cgaacaaatc tgattactaa acacagtagc     9180

ctggatttgt tctatcagta atcgacctta ttcctaatta aatagagcaa atccccttat     9240

tgggggtaag acatgaagat gccagaaaaa catgacctgt tggccgccat tctcgcggca     9300

aaggaacaag gcatcggggc aatccttgcg tttgcaatgg cgtaccttcg cggcagatat     9360

aatggcggtg cgtttacaaa aacagtaatc gacgcaacga tgtgcgccat tatcgcctgg     9420

ttcattcgtg accttctcga cttcgccgga ctaagtagca atctcgctta tataacgagc     9480

gtgtttatcg gctacatcgg tactgactcg attggttcgc ttatcaaacg cttcgctgct     9540

aaaaaagccg gagtagaaga tggtagaaat caataatcaa cgtaaggcgt tcctcgatat     9600

gctggcgtgg tcggagggaa ctgataacgg acgtcagaaa accagaaatc atggttatga     9660

cgtcattgta ggcggagagc tatttactga ttactccgat caccctcgca aacttgtcac     9720

gctaaaccca aaactcaaat caacaggcgc ttaagactgg ccgtcgtttt acaacacaga     9780

aagagtttgt agaaacgcaa aaaggccatc cgtcaggggc cttctgctta gtttgatgcc     9840

tggcagttcc ctactctcgc cttccgcttc ctcgctcact gactcgctgc gctcggtcgt     9900

tcggctgcgg cgagcggtat cagctcactc aaaggcggta atacggttat ccacagaatc     9960

aggggataac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca ggaaccgtaa    10020

aaaggccgcg ttgctggcgt ttttccatag gctccgcccc cctgacgagc atcacaaaaa    10080

tcgacgctca agtcagaggt ggcgaaaccc gacaggacta taaagatacc aggcgtttcc    10140

ccctggaagc tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg gatacctgtc    10200

cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc tcacgctgta ggtatctcag    10260

ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg ttcagcccga    10320

ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac acgacttatc    10380

gccactggca gcagccactg gtaacaggat tagcagagcg aggtatgtag gcggtgctac    10440

agagttcttg aagtggtggg ctaactacgg ctacactaga agaacagtat ttggtatctg    10500

cgctctgctg aagccagtta ccttcggaaa aagagttggt agctcttgat ccggcaaaca    10560

aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag cagattacgc gcagaaaaaa    10620

aggatctcaa gaagatcctt tgatcttttc tacggggtct gacgctcagt ggaacgacgc    10680

gcgcgtaact cacgttaagg gattttggtc atgagcttgc gccgtcccgt caagtcagcg    10740

taatgctctg cttttagaaa aactcatcga gcatcaaatg aaactgcaat ttattcatat    10800

caggattatc aataccatat ttttgaaaaa gccgtttctg taatgaagga gaaaactcac    10860

cgaggcagtt ccataggatg gcaagatcct ggtatcggtc tgcgattccg actcgtccaa    10920

catcaataca acctattaat ttcccctcgt caaaaataag gttatcaagt gagaaatcac    10980

catgagtgac gactgaatcc ggtgagaatg gcaaaagttt atgcatttct ttccagactt    11040

gttcaacagg ccagccatta cgctcgtcat caaaatcact cgcatcaacc aaaccgttat    11100

tcattcgtga ttgcgcctga gcgaggcgaa atacgcgatc gctgttaaaa ggacaattac    11160

aaacaggaat cgagtgcaac cggcgcagga acactgccag cgcatcaaca atattttcac    11220

ctgaatcagg atattcttct aatacctgga acgctgtttt tccggggatc gcagtggtga    11280

gtaaccatgc atcatcagga gtacggataa aatgcttgat ggtcggaagt ggcataaatt    11340

ccgtcagcca gtttagtctg accatctcat ctgtaacatc attggcaacg ctacctttgc    11400

catgtttcag aaacaactct ggcgcatcgg gcttcccata caagcgatag attgtcgcac    11460

ctgattgccc gacattatcg cgagcccatt tatacccata taaatcagca tccatgttgg    11520

aatttaatcg cggcctcgac gtttcccgtt gaatatggct catattcttc ctttttcaat    11580

attattgaag catttatcag ggttattgtc tcatgagcgg atacatattt gaatgtattt    11640

agaaaaataa acaaataggg gtcagtgtta caaccaatta accaattctg aacattatcg    11700

cgagcccatt tatacctgaa tatggctcat aacacccctt gtttgcctgg cggcagtagc    11760

gcggtggtcc cacctgaccc catgccgaac tcagaagtga aacgccgtag cgccgatggt    11820

agtgtgggga ctccccatgc gagagtaggg aactgccagg catcaaataa aacgaaaggc    11880

tcagtcgaaa gactgggcct ttcgcccggg ctaattaggg ggtgtcgccc ttattcgact    11940

ctatagtgaa gttcctattc tctagaaagt ataggaactt ctgaagtggg gtcgacttaa    12000

ttaagg                                                               12006


