                         SEQUENCE LISTING

<110>  The Trustees of the University of Pennsylvania
 
<120>  COMPOSITIONS USEFUL FOR TREATING GM1 GANDLIOSIDOSIS

<130>  18-8537PCT

<150>  US 62/739,811
<151>  2018-10-01

<150>  US 62/835,178
<151>  2019-04-17

<160>  26    

<170>  PatentIn version 3.5

<210>  1
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAVhu68 vp1 capsid of Homo Sapiens origin


<220>
<221>  CDS
<222>  (1)..(2211)

<400>  1
atg gct gcc gat ggt tat ctt cca gat tgg ctc gag gac aac ctc agt         48
Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser           
1               5                   10                  15                

gaa ggc att cgc gag tgg tgg gct ttg aaa cct gga gcc cct caa ccc         96
Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Gln Pro           
            20                  25                  30                    

aag gca aat caa caa cat caa gac aac gct cgg ggt ctt gtg ctt ccg        144
Lys Ala Asn Gln Gln His Gln Asp Asn Ala Arg Gly Leu Val Leu Pro           
        35                  40                  45                        

ggt tac aaa tac ctt gga ccc ggc aac gga ctc gac aag ggg gag ccg        192
Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro           
    50                  55                  60                            

gtc aac gaa gca gac gcg gcg gcc ctc gag cac gac aag gcc tac gac        240
Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp           
65                  70                  75                  80            

cag cag ctc aag gcc gga gac aac ccg tac ctc aag tac aac cac gcc        288
Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala           
                85                  90                  95                

gac gcc gag ttc cag gag cgg ctc aaa gaa gat acg tct ttt ggg ggc        336
Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly           
            100                 105                 110                   

aac ctc ggg cga gca gtc ttc cag gcc aaa aag agg ctt ctt gaa cct        384
Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro           
        115                 120                 125                       

ctt ggt ctg gtt gag gaa gcg gct aag acg gct cct gga aag aag agg        432
Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg           
    130                 135                 140                           

cct gta gag cag tct cct cag gaa ccg gac tcc tcc gtg ggt att ggc        480
Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Val Gly Ile Gly           
145                 150                 155                 160           

aaa tcg ggt gca cag ccc gct aaa aag aga ctc aat ttc ggt cag act        528
Lys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr           
                165                 170                 175               

ggc gac aca gag tca gtc ccc gac cct caa cca atc gga gaa cct ccc        576
Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro           
            180                 185                 190                   

gca gcc ccc tca ggt gtg gga tct ctt aca atg gct tca ggt ggt ggc        624
Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly           
        195                 200                 205                       

gca cca gtg gca gac aat aac gaa ggt gcc gat gga gtg ggt agt tcc        672
Ala Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser           
    210                 215                 220                           

tcg gga aat tgg cat tgc gat tcc caa tgg ctg ggg gac aga gtc atc        720
Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile           
225                 230                 235                 240           

acc acc agc acc cga acc tgg gcc ctg ccc acc tac aac aat cac ctc        768
Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu           
                245                 250                 255               

tac aag caa atc tcc aac agc aca tct gga gga tct tca aat gac aac        816
Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly Gly Ser Ser Asn Asp Asn           
            260                 265                 270                   

gcc tac ttc ggc tac agc acc ccc tgg ggg tat ttt gac ttc aac aga        864
Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg           
        275                 280                 285                       

ttc cac tgc cac ttc tca cca cgt gac tgg caa aga ctc atc aac aac        912
Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn           
    290                 295                 300                           

aac tgg gga ttc cgg cct aag cga ctc aac ttc aag ctc ttc aac att        960
Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile           
305                 310                 315                 320           

cag gtc aaa gag gtt acg gac aac aat gga gtc aag acc atc gct aat       1008
Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn           
                325                 330                 335               

aac ctt acc agc acg gtc cag gtc ttc acg gac tca gac tat cag ctc       1056
Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu           
            340                 345                 350                   

ccg tac gtg ctc ggg tcg gct cac gag ggc tgc ctc ccg ccg ttc cca       1104
Pro Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro           
        355                 360                 365                       

gcg gac gtt ttc atg att cct cag tac ggg tat cta acg ctt aat gat       1152
Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp           
    370                 375                 380                           

gga agc caa gcc gtg ggt cgt tcg tcc ttt tac tgc ctg gaa tat ttc       1200
Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe           
385                 390                 395                 400           

ccg tcg caa atg cta aga acg ggt aac aac ttc cag ttc agc tac gag       1248
Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Glu           
                405                 410                 415               

ttt gag aac gta cct ttc cat agc agc tat gct cac agc caa agc ctg       1296
Phe Glu Asn Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu           
            420                 425                 430                   

gac cga ctc atg aat cca ctc atc gac caa tac ttg tac tat ctc tca       1344
Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser           
        435                 440                 445                       

aag act att aac ggt tct gga cag aat caa caa acg cta aaa ttc agt       1392
Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser           
    450                 455                 460                           

gtg gcc gga ccc agc aac atg gct gtc cag gga aga aac tac ata cct       1440
Val Ala Gly Pro Ser Asn Met Ala Val Gln Gly Arg Asn Tyr Ile Pro           
465                 470                 475                 480           

gga ccc agc tac cga caa caa cgt gtc tca acc act gtg act caa aac       1488
Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser Thr Thr Val Thr Gln Asn           
                485                 490                 495               

aac aac agc gaa ttt gct tgg cct gga gct tct tct tgg gct ctc aat       1536
Asn Asn Ser Glu Phe Ala Trp Pro Gly Ala Ser Ser Trp Ala Leu Asn           
            500                 505                 510                   

gga cgt aat agc ttg atg aat cct gga cct gct atg gcc agc cac aaa       1584
Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys           
        515                 520                 525                       

gaa gga gag gac cgt ttc ttt cct ttg tct gga tct tta att ttt ggc       1632
Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly           
    530                 535                 540                           

aaa caa gga act gga aga gac aac gtg gat gcg gac aaa gtc atg ata       1680
Lys Gln Gly Thr Gly Arg Asp Asn Val Asp Ala Asp Lys Val Met Ile           
545                 550                 555                 560           

acc aac gaa gaa gaa att aaa act acc aac cca gta gca acg gag tcc       1728
Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Ser           
                565                 570                 575               

tat gga caa gtg gcc aca aac cac cag agt gcc caa gca cag gcg cag       1776
Tyr Gly Gln Val Ala Thr Asn His Gln Ser Ala Gln Ala Gln Ala Gln           
            580                 585                 590                   

acc ggc tgg gtt caa aac caa gga ata ctt ccg ggt atg gtt tgg cag       1824
Thr Gly Trp Val Gln Asn Gln Gly Ile Leu Pro Gly Met Val Trp Gln           
        595                 600                 605                       

gac aga gat gtg tac ctg caa gga ccc att tgg gcc aaa att cct cac       1872
Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His           
    610                 615                 620                           

acg gac ggc aac ttt cac cct tct ccg ctg atg gga ggg ttt gga atg       1920
Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Met           
625                 630                 635                 640           

aag cac ccg cct cct cag atc ctc atc aaa aac aca cct gta cct gcg       1968
Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala           
                645                 650                 655               

gat cct cca acg gct ttc aac aag gac aag ctg aac tct ttc atc acc       2016
Asp Pro Pro Thr Ala Phe Asn Lys Asp Lys Leu Asn Ser Phe Ile Thr           
            660                 665                 670                   

cag tat tct act ggc caa gtc agc gtg gag att gag tgg gag ctg cag       2064
Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln           
        675                 680                 685                       

aag gaa aac agc aag cgc tgg aac ccg gag atc cag tac act tcc aac       2112
Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn           
    690                 695                 700                           

tat tac aag tct aat aat gtt gaa ttt gct gtt aat act gaa ggt gtt       2160
Tyr Tyr Lys Ser Asn Asn Val Glu Phe Ala Val Asn Thr Glu Gly Val           
705                 710                 715                 720           

tat tct gaa ccc cgc ccc att ggc acc aga tac ctg act cgt aat ctg       2208
Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu           
                725                 730                 735               

taa                                                                   2211


<210>  2
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic Construct

<400>  2

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Gln Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln His Gln Asp Asn Ala Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Val Gly Ile Gly 
145                 150                 155                 160 


Lys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly Gly Ser Ser Asn Asp Asn 
            260                 265                 270         


Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Glu 
                405                 410                 415     


Phe Glu Asn Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Val Ala Gly Pro Ser Asn Met Ala Val Gln Gly Arg Asn Tyr Ile Pro 
465                 470                 475                 480 


Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser Thr Thr Val Thr Gln Asn 
                485                 490                 495     


Asn Asn Ser Glu Phe Ala Trp Pro Gly Ala Ser Ser Trp Ala Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Gly Thr Gly Arg Asp Asn Val Asp Ala Asp Lys Val Met Ile 
545                 550                 555                 560 


Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Ser 
                565                 570                 575     


Tyr Gly Gln Val Ala Thr Asn His Gln Ser Ala Gln Ala Gln Ala Gln 
            580                 585                 590         


Thr Gly Trp Val Gln Asn Gln Gly Ile Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Met 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Ala Phe Asn Lys Asp Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Asn Asn Val Glu Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
                725                 730                 735     


<210>  3
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  modified hu68vp1


<220>
<221>  MISC_FEATURE
<222>  (23)..(23)
<223>  Xaa may be W (Trp, tryptophan), or oxidated W.

<220>
<221>  MISC_FEATURE
<222>  (35)..(35)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (57)..(57)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (66)..(66)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (94)..(94)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (97)..(97)
<223>  Xaa may be D (asp, aspartic acid), or isomerized D.

<220>
<221>  MISC_FEATURE
<222>  (107)..(107)
<223>  Xaa may be D (asp, aspartic acid), or isomerized D.

<220>
<221>  misc_feature
<222>  (113)..(113)
<223>  Xaa can be any naturally occurring amino acid

<220>
<221>  MISC_FEATURE
<222>  (149)..(149)
<223>  Xaa may be S (Ser, serine), or Phosphorilated S

<220>
<221>  MISC_FEATURE
<222>  (149)..(149)
<223>  Xaa may be S (Ser, serine), or Phosphorylated S

<220>
<221>  MISC_FEATURE
<222>  (247)..(247)
<223>  Xaa may be W (Trp, tryptophan), or oxidated W (e.g., kynurenine).

<220>
<221>  MISC_FEATURE
<222>  (253)..(253)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (259)..(259)
<223>  Xaa represents Q, or Q deamidated to glutamic acid 
       (alpha-glutamic acid), gamma-glutamic acid (Glu), or a blend of 
       alpha- and gamma-glutamic acid

<220>
<221>  MISC_FEATURE
<222>  (270)..(270)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (297)..(297)
<223>  Xaa represents D (Asp, aspartic acid) or amindated D to N (Asn, 
       asparagine)

<220>
<221>  MISC_FEATURE
<222>  (304)..(304)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (306)..(306)
<223>  Xaa may be W (Trp, tryptophan), or oxidated W (e.g., kynurenine).

<220>
<221>  MISC_FEATURE
<222>  (314)..(314)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (319)..(319)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (329)..(329)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (332)..(332)
<223>  Xaa may be K (lys, lysine), or acetylated K

<220>
<221>  MISC_FEATURE
<222>  (336)..(336)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (384)..(384)
<223>  Xaa may be D (asp, aspartic acid), or isomerized D.

<220>
<221>  MISC_FEATURE
<222>  (404)..(404)
<223>  Xaa may be M (Met, Methionine), or oxidated M.

<220>
<221>  MISC_FEATURE
<222>  (409)..(409)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (436)..(436)
<223>  Xaa may be M (Met, Methionine), or oxidated M.

<220>
<221>  MISC_FEATURE
<222>  (452)..(452)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (477)..(477)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (499)..(499)
<223>  Xaa may be S (Ser, serine), or Phosphorylated S

<220>
<221>  MISC_FEATURE
<222>  (512)..(512)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (515)..(515)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (518)..(518)
<223>  Xaa may be M (Met, Methionine), or oxidated M.

<220>
<221>  MISC_FEATURE
<222>  (524)..(524)
<223>  Xaa may be M (Met, Methionine), or oxidated M.

<220>
<221>  MISC_FEATURE
<222>  (559)..(559)
<223>  Xaa may be M (Met, Methionine), or oxidated M.

<220>
<221>  MISC_FEATURE
<222>  (569)..(569)
<223>  Xaa may be T (Thr, threonine), or Phosphorylated T

<220>
<221>  MISC_FEATURE
<222>  (586)..(586)
<223>  Xaa may be S (Ser, serine), or Phosphorylated S

<220>
<221>  MISC_FEATURE
<222>  (599)..(599)
<223>  Xaa represents Q, or Q deamidated to glutamic acid 
       (alpha-glutamic acid), gamma-glutamic acid (Glu), or a blend of 
       alpha- and gamma-glutamic acid

<220>
<221>  MISC_FEATURE
<222>  (605)..(605)
<223>  Xaa may be M (Met, Methionine), or oxidated M.

<220>
<221>  MISC_FEATURE
<222>  (619)..(619)
<223>  Xaa may be W (Trp, tryptophan), or oxidated W (e.g., kynurenine).

<220>
<221>  MISC_FEATURE
<222>  (628)..(628)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (640)..(640)
<223>  Xaa may be M (Met, Methionine), or oxidated M.

<220>
<221>  MISC_FEATURE
<222>  (651)..(651)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (663)..(663)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (666)..(666)
<223>  Xaa may be K (lys, lysine), or acetylated K

<220>
<221>  MISC_FEATURE
<222>  (689)..(689)
<223>  Xaa may be K (lys, lysine), or acetylated K

<220>
<221>  MISC_FEATURE
<222>  (693)..(693)
<223>  Xaa may be K (lys, lysine), or acetylated K

<220>
<221>  MISC_FEATURE
<222>  (695)..(695)
<223>  Xaa may be W (Trp, tryptophan), or oxidated W.

<220>
<221>  MISC_FEATURE
<222>  (709)..(709)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<220>
<221>  MISC_FEATURE
<222>  (735)..(735)
<223>  Xaa may be Asn, or deamidated to Asp, isoAsp, or Asp/isoAsp

<400>  3

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Xaa Ala Leu Lys Pro Gly Ala Pro Gln Pro 
            20                  25                  30          


Lys Ala Xaa Gln Gln His Gln Asp Asn Ala Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Gly Xaa Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Xaa Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Xaa His Ala 
                85                  90                  95      


Xaa Ala Glu Phe Gln Glu Arg Leu Lys Glu Xaa Thr Ser Phe Gly Gly 
            100                 105                 110         


Xaa Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Xaa Pro Gln Glu Pro Asp Ser Ser Val Gly Ile Gly 
145                 150                 155                 160 


Lys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Xaa Ala Leu Pro Thr Tyr Xaa Asn His Leu 
                245                 250                 255     


Tyr Lys Xaa Ile Ser Asn Ser Thr Ser Gly Gly Ser Ser Xaa Asp Asn 
            260                 265                 270         


Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Xaa Trp Gln Arg Leu Ile Asn Xaa 
    290                 295                 300                 


Asn Xaa Gly Phe Arg Pro Lys Arg Leu Xaa Phe Lys Leu Phe Xaa Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Xaa Gly Val Xaa Thr Ile Ala Xaa 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Xaa 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Xaa Leu Arg Thr Gly Xaa Asn Phe Gln Phe Ser Tyr Glu 
                405                 410                 415     


Phe Glu Asn Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Xaa Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Xaa Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Val Ala Gly Pro Ser Asn Met Ala Val Gln Gly Arg Xaa Tyr Ile Pro 
465                 470                 475                 480 


Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser Thr Thr Val Thr Gln Asn 
                485                 490                 495     


Asn Asn Xaa Glu Phe Ala Trp Pro Gly Ala Ser Ser Trp Ala Leu Xaa 
            500                 505                 510         


Gly Arg Xaa Ser Leu Xaa Asn Pro Gly Pro Ala Xaa Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Gly Thr Gly Arg Asp Asn Val Asp Ala Asp Lys Val Xaa Ile 
545                 550                 555                 560 


Thr Asn Glu Glu Glu Ile Lys Thr Xaa Asn Pro Val Ala Thr Glu Ser 
                565                 570                 575     


Tyr Gly Gln Val Ala Thr Asn His Gln Xaa Ala Gln Ala Gln Ala Gln 
            580                 585                 590         


Thr Gly Trp Val Gln Asn Gln Gly Ile Leu Pro Gly Xaa Val Trp Gln 
        595                 600                 605             


Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Xaa Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Xaa Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Xaa 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Xaa Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Ala Phe Xaa Lys Asp Xaa Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Xaa Glu Asn Ser Xaa Arg Xaa Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Xaa Asn Val Glu Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Xaa Leu 
                725                 730                 735     


<210>  4
<211>  677
<212>  PRT
<213>  Homo sapiens


<220>
<221>  SIGNAL
<222>  (1)..(23)

<220>
<221>  mat_peptide
<222>  (24)..(677)

<400>  4

Met Pro Gly Phe Leu Val Arg Ile Leu Leu Leu Leu Leu Val Leu Leu 
            -20                 -15                 -10         


Leu Leu Gly Pro Thr Arg Gly Leu Arg Asn Ala Thr Gln Arg Met Phe 
        -5              -1  1               5                   


Glu Ile Asp Tyr Ser Arg Asp Ser Phe Leu Lys Asp Gly Gln Pro Phe 
10                  15                  20                  25  


Arg Tyr Ile Ser Gly Ser Ile His Tyr Ser Arg Val Pro Arg Phe Tyr 
                30                  35                  40      


Trp Lys Asp Arg Leu Leu Lys Met Lys Met Ala Gly Leu Asn Ala Ile 
            45                  50                  55          


Gln Thr Tyr Val Pro Trp Asn Phe His Glu Pro Trp Pro Gly Gln Tyr 
        60                  65                  70              


Gln Phe Ser Glu Asp His Asp Val Glu Tyr Phe Leu Arg Leu Ala His 
    75                  80                  85                  


Glu Leu Gly Leu Leu Val Ile Leu Arg Pro Gly Pro Tyr Ile Cys Ala 
90                  95                  100                 105 


Glu Trp Glu Met Gly Gly Leu Pro Ala Trp Leu Leu Glu Lys Glu Ser 
                110                 115                 120     


Ile Leu Leu Arg Ser Ser Asp Pro Asp Tyr Leu Ala Ala Val Asp Lys 
            125                 130                 135         


Trp Leu Gly Val Leu Leu Pro Lys Met Lys Pro Leu Leu Tyr Gln Asn 
        140                 145                 150             


Gly Gly Pro Val Ile Thr Val Gln Val Glu Asn Glu Tyr Gly Ser Tyr 
    155                 160                 165                 


Phe Ala Cys Asp Phe Asp Tyr Leu Arg Phe Leu Gln Lys Arg Phe Arg 
170                 175                 180                 185 


His His Leu Gly Asp Asp Val Val Leu Phe Thr Thr Asp Gly Ala His 
                190                 195                 200     


Lys Thr Phe Leu Lys Cys Gly Ala Leu Gln Gly Leu Tyr Thr Thr Val 
            205                 210                 215         


Asp Phe Gly Thr Gly Ser Asn Ile Thr Asp Ala Phe Leu Ser Gln Arg 
        220                 225                 230             


Lys Cys Glu Pro Lys Gly Pro Leu Ile Asn Ser Glu Phe Tyr Thr Gly 
    235                 240                 245                 


Trp Leu Asp His Trp Gly Gln Pro His Ser Thr Ile Lys Thr Glu Ala 
250                 255                 260                 265 


Val Ala Ser Ser Leu Tyr Asp Ile Leu Ala Arg Gly Ala Ser Val Asn 
                270                 275                 280     


Leu Tyr Met Phe Ile Gly Gly Thr Asn Phe Ala Tyr Trp Asn Gly Ala 
            285                 290                 295         


Asn Ser Pro Tyr Ala Ala Gln Pro Thr Ser Tyr Asp Tyr Asp Ala Pro 
        300                 305                 310             


Leu Ser Glu Ala Gly Asp Leu Thr Glu Lys Tyr Phe Ala Leu Arg Asn 
    315                 320                 325                 


Ile Ile Gln Lys Phe Glu Lys Val Pro Glu Gly Pro Ile Pro Pro Ser 
330                 335                 340                 345 


Thr Pro Lys Phe Ala Tyr Gly Lys Val Thr Leu Glu Lys Leu Lys Thr 
                350                 355                 360     


Val Gly Ala Ala Leu Asp Ile Leu Cys Pro Ser Gly Pro Ile Lys Ser 
            365                 370                 375         


Leu Tyr Pro Leu Thr Phe Ile Gln Val Lys Gln His Tyr Gly Phe Val 
        380                 385                 390             


Leu Tyr Arg Thr Thr Leu Pro Gln Asp Cys Ser Asn Pro Ala Pro Leu 
    395                 400                 405                 


Ser Ser Pro Leu Asn Gly Val His Asp Arg Ala Tyr Val Ala Val Asp 
410                 415                 420                 425 


Gly Ile Pro Gln Gly Val Leu Glu Arg Asn Asn Val Ile Thr Leu Asn 
                430                 435                 440     


Ile Thr Gly Lys Ala Gly Ala Thr Leu Asp Leu Leu Val Glu Asn Met 
            445                 450                 455         


Gly Arg Val Asn Tyr Gly Ala Tyr Ile Asn Asp Phe Lys Gly Leu Val 
        460                 465                 470             


Ser Asn Leu Thr Leu Ser Ser Asn Ile Leu Thr Asp Trp Thr Ile Phe 
    475                 480                 485                 


Pro Leu Asp Thr Glu Asp Ala Val Arg Ser His Leu Gly Gly Trp Gly 
490                 495                 500                 505 


His Arg Asp Ser Gly His His Asp Glu Ala Trp Ala His Asn Ser Ser 
                510                 515                 520     


Asn Tyr Thr Leu Pro Ala Phe Tyr Met Gly Asn Phe Ser Ile Pro Ser 
            525                 530                 535         


Gly Ile Pro Asp Leu Pro Gln Asp Thr Phe Ile Gln Phe Pro Gly Trp 
        540                 545                 550             


Thr Lys Gly Gln Val Trp Ile Asn Gly Phe Asn Leu Gly Arg Tyr Trp 
    555                 560                 565                 


Pro Ala Arg Gly Pro Gln Leu Thr Leu Phe Val Pro Gln His Ile Leu 
570                 575                 580                 585 


Met Thr Ser Ala Pro Asn Thr Ile Thr Val Leu Glu Leu Glu Trp Ala 
                590                 595                 600     


Pro Cys Ser Ser Asp Asp Pro Glu Leu Cys Ala Val Thr Phe Val Asp 
            605                 610                 615         


Arg Pro Val Ile Gly Ser Ser Val Thr Tyr Asp His Pro Ser Lys Pro 
        620                 625                 630             


Val Glu Lys Arg Leu Met Pro Pro Pro Pro Gln Lys Asn Lys Asp Ser 
    635                 640                 645                 


Trp Leu Asp His Val 
650                 


<210>  5
<211>  2034
<212>  DNA
<213>  Homo sapiens

<400>  5
atgccggggt tcctggttcg catcctcctt ctgctgctgg ttctgctgct tctgggccct       60

acgcgcggct tgcgcaatgc cacccagagg atgtttgaaa ttgactatag ccgggactcc      120

ttcctcaagg atggccagcc atttcgctac atctcaggaa gcattcacta ctcccgtgtg      180

ccccgcttct actggaagga ccggctgctg aagatgaaga tggctgggct gaacgccatc      240

cagacgtatg tgccctggaa ctttcatgag ccctggccag gacagtacca gttttctgag      300

gaccatgatg tggaatattt tcttcggctg gctcatgagc tgggactgct ggttatcctg      360

aggcccgggc cctacatctg tgcagagtgg gaaatgggag gattacctgc ttggctgcta      420

gagaaagagt ctattcttct ccgctcctcc gacccagatt acctggcagc tgtggacaag      480

tggttgggag tccttctgcc caagatgaag cctctcctct atcagaatgg agggccagtt      540

ataacagtgc aggttgaaaa tgaatatggc agctactttg cctgtgattt tgactacctg      600

cgcttcctgc agaagcgctt tcgccaccat ctgggggatg atgtggttct gtttaccact      660

gatggagcac ataaaacatt cctgaaatgt ggggccctgc agggcctcta caccacggtg      720

gactttggaa caggcagcaa catcacagat gctttcctaa gccagaggaa gtgtgagccc      780

aaaggaccct tgatcaattc tgaattctat actggctggc tagatcactg gggccaacct      840

cactccacaa tcaagaccga agcagtggct tcctccctct atgatatact tgcccgtggg      900

gcgagtgtga acttgtacat gtttataggt gggaccaatt ttgcctattg gaatggggcc      960

aactcaccct atgcagcaca gcccaccagc tacgactatg atgccccact gagtgaggct     1020

ggggacctca ctgagaagta ttttgctctg cgaaacatca tccagaagtt tgaaaaagta     1080

ccagaaggtc ctatccctcc atctacacca aagtttgcat atggaaaggt cactttggaa     1140

aagttaaaga cagtgggagc agctctggac attctgtgtc cctctgggcc catcaaaagc     1200

ctttatccct tgacatttat ccaggtgaaa cagcattatg ggtttgtgct gtaccggaca     1260

acacttcctc aagattgcag caacccagca cctctctctt cacccctcaa tggagtccac     1320

gatcgagcat atgttgctgt ggatgggatc ccccagggag tccttgagcg aaacaatgtg     1380

atcactctga acataacagg gaaagctgga gccactctgg accttctggt agagaacatg     1440

ggacgtgtga actatggtgc atatatcaac gattttaagg gtttggtttc taacctgact     1500

ctcagttcca atatcctcac ggactggacg atctttccac tggacactga ggatgcagtg     1560

cgcagccacc tggggggctg gggacaccgt gacagtggcc accatgatga agcctgggcc     1620

cacaactcat ccaactacac gctcccggcc ttttatatgg ggaacttctc cattcccagt     1680

gggatcccag acttgcccca ggacaccttt atccagtttc ctggatggac caagggccag     1740

gtctggatta atggctttaa ccttggccgc tattggccag cccggggccc tcagttgacc     1800

ttgtttgtgc cccagcacat cctgatgacc tcggccccaa acaccatcac cgtgctggaa     1860

ctggagtggg caccctgcag cagtgatgat ccagaactat gtgctgtgac gttcgtggac     1920

aggccagtta ttggctcatc tgtgacctac gatcatccct ccaaacctgt tgaaaaaaga     1980

ctcatgcccc cacccccgca aaaaaacaaa gattcatggc tggaccatgt atga           2034


<210>  6
<211>  2031
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Engineered coding sequence for human GLB1

<400>  6
atgccgggct ttctggtgcg cattctgctg ctgctgctgg tgctgctgct gctgggcccg       60

acccgcggcc tgcgcaacgc gacccagcgc atgtttgaaa ttgattatag ccgcgatagc      120

tttctgaaag atggccagcc gtttcgctat attagcggca gcattcatta tagccgcgtg      180

ccgcgctttt attggaaaga tcgcctgctg aaaatgaaaa tggcgggcct gaacgcgatt      240

cagacctatg tgccgtggaa ctttcatgaa ccgtggccgg gccagtatca gtttagcgaa      300

gatcatgatg tggaatattt tctgcgcctg gcgcatgaac tgggcctgct ggtgattctg      360

cgcccgggcc cgtatatttg cgcggaatgg gaaatgggcg gcctgccggc gtggctgctg      420

gaaaaagaaa gcattctgct gcgcagcagc gatccggatt atctggcggc ggtggataaa      480

tggctgggcg tgctgctgcc gaaaatgaaa ccgctgctgt atcagaacgg cggcccggtg      540

attaccgtgc aggtggaaaa cgaatatggc agctattttg cgtgcgattt tgattatctg      600

cgctttctgc agaaacgctt tcgccatcat ctgggcgatg atgtggtgct gtttaccacc      660

gatggcgcgc ataaaacctt tctgaaatgc ggcgcgctgc agggcctgta taccaccgtg      720

gattttggca ccggcagcaa cattaccgat gcgtttctga gccagcgcaa atgcgaaccg      780

aaaggcccgc tgattaacag cgaattttat accggctggc tggatcattg gggccagccg      840

catagcacca ttaaaaccga agcggtggcg agcagcctgt atgatattct ggcgcgcggc      900

gcgagcgtga acctgtatat gtttattggc ggcaccaact ttgcgtattg gaacggcgcg      960

aacagcccgt atgcggcgca gccgaccagc tatgattatg atgcgccgct gagcgaagcg     1020

ggcgatctga ccgaaaaata ttttgcgctg cgcaacatta ttcagaaatt tgaaaaagtg     1080

ccggaaggcc cgattccgcc gagcaccccg aaatttgcgt atggcaaagt gaccctggaa     1140

aaactgaaaa ccgtgggcgc ggcgctggat attctgtgcc cgagcggccc gattaaaagc     1200

ctgtatccgc tgacctttat tcaggtgaaa cagcattatg gctttgtgct gtatcgcacc     1260

accctgccgc aggattgcag caacccggcg ccgctgagca gcccgctgaa cggcgtgcat     1320

gatcgcgcgt atgtggcggt ggatggcatt ccgcagggcg tgctggaacg caacaacgtg     1380

attaccctga acattaccgg caaagcgggc gcgaccctgg atctgctggt ggaaaacatg     1440

ggccgcgtga actatggcgc gtatattaac gattttaaag gcctggtgag caacctgacc     1500

ctgagcagca acattctgac cgattggacc atttttccgc tggataccga agatgcggtg     1560

cgcagccatc tgggcggctg gggccatcgc gatagcggcc atcatgatga agcgtgggcg     1620

cataacagca gcaactatac cctgccggcg ttttatatgg gcaactttag cattccgagc     1680

ggcattccgg atctgccgca ggataccttt attcagtttc cgggctggac caaaggccag     1740

gtgtggatta acggctttaa cctgggccgc tattggccgg cgcgcggccc gcagctgacc     1800

ctgtttgtgc cgcagcatat tctgatgacc agcgcgccga acaccattac cgtgctggaa     1860

ctggaatggg cgccgtgcag cagcgatgat ccggaactgt gcgcggtgac ctttgtggat     1920

cgcccggtga ttggcagcag cgtgacctat gatcatccga gcaaaccggt ggaaaaacgc     1980

ctgatgccgc cgccgccgca gaaaaacaaa gatagctggc tggatcatgt g              2031


<210>  7
<211>  2031
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Engineered coding sequence for human GLB1


<220>
<221>  misc_feature
<222>  (6)..(6)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (9)..(9)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (15)..(15)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (18)..(18)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (21)..(21)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (27)..(27)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (30)..(30)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (33)..(33)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (36)..(36)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (39)..(39)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (42)..(42)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (45)..(45)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (48)..(48)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (51)..(51)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (54)..(54)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (57)..(57)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (60)..(60)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (63)..(63)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (66)..(66)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (69)..(69)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (72)..(72)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (75)..(75)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (81)..(81)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (84)..(84)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (90)..(90)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (111)..(111)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (114)..(114)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (120)..(120)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (126)..(126)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (135)..(135)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (141)..(141)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (147)..(147)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (156)..(156)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (159)..(159)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (162)..(162)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (174)..(174)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (177)..(177)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (180)..(180)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (183)..(183)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (186)..(186)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (204)..(204)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (207)..(207)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (210)..(210)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (225)..(225)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (228)..(228)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (231)..(231)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (237)..(237)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (246)..(246)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (252)..(252)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (255)..(255)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (273)..(273)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (279)..(279)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (282)..(282)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (297)..(297)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (312)..(312)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (324)..(324)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (327)..(327)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (330)..(330)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (333)..(333)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (342)..(342)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (345)..(345)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (348)..(348)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (351)..(351)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (354)..(354)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (360)..(360)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (363)..(363)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (366)..(366)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (369)..(369)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (372)..(372)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (384)..(384)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (399)..(399)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (402)..(402)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (405)..(405)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (408)..(408)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (411)..(411)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (417)..(417)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (420)..(420)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (432)..(432)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (438)..(438)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (441)..(441)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (444)..(444)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (447)..(447)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (450)..(450)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (456)..(456)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (465)..(465)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (468)..(468)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (471)..(471)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (474)..(474)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (486)..(486)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (489)..(489)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (492)..(492)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (495)..(495)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (498)..(498)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (501)..(501)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (513)..(513)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (516)..(516)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (519)..(519)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (531)..(531)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (534)..(534)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (537)..(537)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (540)..(540)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (546)..(546)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (549)..(549)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (555)..(555)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (570)..(570)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (573)..(573)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (582)..(582)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (600)..(600)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (603)..(603)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (609)..(609)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (618)..(618)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (624)..(624)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (633)..(633)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (636)..(636)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (645)..(645)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (648)..(648)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (651)..(651)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (657)..(657)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (660)..(660)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (666)..(666)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (669)..(669)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (678)..(678)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (684)..(684)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (693)..(693)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (696)..(696)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (699)..(699)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (705)..(705)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (708)..(708)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (714)..(714)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (717)..(717)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (720)..(720)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (729)..(729)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (732)..(732)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (735)..(735)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (738)..(738)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (747)..(747)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (753)..(753)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (759)..(759)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (762)..(762)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (768)..(768)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (780)..(780)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (786)..(786)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (789)..(789)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (792)..(792)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (801)..(801)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (813)..(813)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (816)..(816)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (822)..(822)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (834)..(834)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (840)..(840)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (846)..(846)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (849)..(849)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (858)..(858)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (864)..(864)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (867)..(867)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (870)..(870)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (873)..(873)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (876)..(876)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (879)..(879)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (891)..(891)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (894)..(894)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (897)..(897)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (900)..(900)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (903)..(903)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (906)..(906)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (909)..(909)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (915)..(915)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (930)..(930)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (933)..(933)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (936)..(936)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (945)..(945)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (957)..(957)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (960)..(960)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (966)..(966)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (969)..(969)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (975)..(975)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (978)..(978)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (984)..(984)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (987)..(987)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (990)..(990)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1005)..(1005)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1008)..(1008)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1011)..(1011)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1014)..(1014)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1020)..(1020)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1023)..(1023)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1029)..(1029)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1032)..(1032)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1047)..(1047)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1050)..(1050)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1053)..(1053)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1080)..(1080)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1083)..(1083)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1089)..(1089)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1092)..(1092)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1098)..(1098)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1101)..(1101)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1104)..(1104)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1107)..(1107)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1110)..(1110)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1119)..(1119)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1125)..(1125)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1131)..(1131)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1134)..(1134)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1137)..(1137)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1146)..(1146)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1152)..(1152)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1155)..(1155)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1158)..(1158)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1161)..(1161)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1164)..(1164)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1167)..(1167)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1176)..(1176)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1182)..(1182)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1185)..(1185)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1188)..(1188)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1191)..(1191)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1200)..(1200)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1203)..(1203)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1209)..(1209)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1212)..(1212)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1215)..(1215)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1227)..(1227)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1242)..(1242)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1248)..(1248)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1251)..(1251)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1257)..(1257)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1260)..(1260)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1263)..(1263)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1266)..(1266)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1269)..(1269)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1281)..(1281)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1287)..(1287)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1290)..(1290)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1293)..(1293)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1296)..(1296)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1299)..(1299)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1302)..(1302)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1305)..(1305)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1308)..(1308)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1314)..(1314)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1317)..(1317)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1326)..(1326)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1329)..(1329)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1335)..(1335)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1338)..(1338)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1341)..(1341)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1347)..(1347)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1353)..(1353)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1359)..(1359)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1362)..(1362)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1365)..(1365)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1371)..(1371)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1380)..(1380)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1386)..(1386)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1389)..(1389)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1398)..(1398)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1401)..(1401)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1407)..(1407)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1410)..(1410)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1413)..(1413)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1416)..(1416)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1419)..(1419)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1425)..(1425)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1428)..(1428)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1431)..(1431)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1443)..(1443)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1446)..(1446)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1449)..(1449)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1458)..(1458)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1461)..(1461)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1482)..(1482)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1485)..(1485)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1488)..(1488)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1491)..(1491)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1497)..(1497)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1500)..(1500)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1503)..(1503)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1506)..(1506)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1509)..(1509)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1518)..(1518)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1521)..(1521)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1530)..(1530)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1539)..(1539)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1542)..(1542)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1548)..(1548)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1557)..(1557)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1560)..(1560)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1563)..(1563)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1566)..(1566)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1572)..(1572)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1575)..(1575)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1578)..(1578)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1584)..(1584)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1590)..(1590)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1596)..(1596)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1599)..(1599)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1614)..(1614)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1620)..(1620)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1629)..(1629)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1632)..(1632)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1641)..(1641)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1644)..(1644)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1647)..(1647)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1650)..(1650)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1662)..(1662)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1671)..(1671)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1677)..(1677)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1680)..(1680)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1683)..(1683)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1689)..(1689)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1695)..(1695)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1698)..(1698)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1707)..(1707)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1722)..(1722)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1725)..(1725)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1731)..(1731)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1737)..(1737)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1743)..(1743)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1755)..(1755)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1764)..(1764)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1767)..(1767)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1770)..(1770)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1779)..(1779)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1782)..(1782)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1785)..(1785)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1788)..(1788)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1791)..(1791)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1797)..(1797)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1800)..(1800)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1803)..(1803)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1809)..(1809)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1812)..(1812)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1824)..(1824)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1830)..(1830)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1833)..(1833)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1836)..(1836)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1839)..(1839)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1845)..(1845)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1851)..(1851)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1854)..(1854)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1857)..(1857)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1863)..(1863)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1872)..(1872)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1875)..(1875)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1881)..(1881)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1884)..(1884)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1893)..(1893)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1899)..(1899)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1905)..(1905)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1908)..(1908)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1911)..(1911)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1917)..(1917)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1923)..(1923)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1926)..(1926)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1929)..(1929)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1935)..(1935)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1938)..(1938)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1941)..(1941)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1944)..(1944)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1947)..(1947)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1959)..(1959)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1962)..(1962)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1968)..(1968)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1971)..(1971)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1980)..(1980)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1983)..(1983)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1989)..(1989)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1992)..(1992)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1995)..(1995)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (1998)..(1998)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (2016)..(2016)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (2022)..(2022)
<223>  n is a, c, g, or t

<220>
<221>  misc_feature
<222>  (2031)..(2031)
<223>  n is a, c, g, or t

<400>  7
atgccnggnt tyytngtnmg nathytnytn ytnytnytng tnytnytnyt nytnggnccn       60

acnmgnggny tnmgnaaygc nacncarmgn atgttygara thgaytayws nmgngaywsn      120

ttyytnaarg ayggncarcc nttymgntay athwsnggnw snathcayta ywsnmgngtn      180

ccnmgnttyt aytggaarga ymgnytnytn aaratgaara tggcnggnyt naaygcnath      240

caracntayg tnccntggaa yttycaygar ccntggccng gncartayca rttywsngar      300

gaycaygayg tngartaytt yytnmgnytn gcncaygary tnggnytnyt ngtnathytn      360

mgnccnggnc cntayathtg ygcngartgg garatgggng gnytnccngc ntggytnytn      420

garaargarw snathytnyt nmgnwsnwsn gayccngayt ayytngcngc ngtngayaar      480

tggytnggng tnytnytncc naaratgaar ccnytnytnt aycaraaygg nggnccngtn      540

athacngtnc argtngaraa ygartayggn wsntayttyg cntgygaytt ygaytayytn      600

mgnttyytnc araarmgntt ymgncaycay ytnggngayg aygtngtnyt nttyacnacn      660

gayggngcnc ayaaracntt yytnaartgy ggngcnytnc arggnytnta yacnacngtn      720

gayttyggna cnggnwsnaa yathacngay gcnttyytnw sncarmgnaa rtgygarccn      780

aarggnccny tnathaayws ngarttytay acnggntggy tngaycaytg gggncarccn      840

caywsnacna thaaracnga rgcngtngcn wsnwsnytnt aygayathyt ngcnmgnggn      900

gcnwsngtna ayytntayat gttyathggn ggnacnaayt tygcntaytg gaayggngcn      960

aaywsnccnt aygcngcnca rccnacnwsn taygaytayg aygcnccnyt nwsngargcn     1020

ggngayytna cngaraarta yttygcnytn mgnaayatha thcaraartt ygaraargtn     1080

ccngarggnc cnathccncc nwsnacnccn aarttygcnt ayggnaargt nacnytngar     1140

aarytnaara cngtnggngc ngcnytngay athytntgyc cnwsnggncc nathaarwsn     1200

ytntayccny tnacnttyat hcargtnaar carcaytayg gnttygtnyt ntaymgnacn     1260

acnytnccnc argaytgyws naayccngcn ccnytnwsnw snccnytnaa yggngtncay     1320

gaymgngcnt aygtngcngt ngayggnath ccncarggng tnytngarmg naayaaygtn     1380

athacnytna ayathacngg naargcnggn gcnacnytng ayytnytngt ngaraayatg     1440

ggnmgngtna aytayggngc ntayathaay gayttyaarg gnytngtnws naayytnacn     1500

ytnwsnwsna ayathytnac ngaytggacn athttyccny tngayacnga rgaygcngtn     1560

mgnwsncayy tnggnggntg gggncaymgn gaywsnggnc aycaygayga rgcntgggcn     1620

cayaaywsnw snaaytayac nytnccngcn ttytayatgg gnaayttyws nathccnwsn     1680

ggnathccng ayytnccnca rgayacntty athcarttyc cnggntggac naarggncar     1740

gtntggatha ayggnttyaa yytnggnmgn taytggccng cnmgnggncc ncarytnacn     1800

ytnttygtnc cncarcayat hytnatgacn wsngcnccna ayacnathac ngtnytngar     1860

ytngartggg cnccntgyws nwsngaygay ccngarytnt gygcngtnac nttygtngay     1920

mgnccngtna thggnwsnws ngtnacntay gaycayccnw snaarccngt ngaraarmgn     1980

ytnatgccnc cnccnccnca raaraayaar gaywsntggy tngaycaygt n              2031


<210>  8
<211>  2034
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Engineered coding sequence for human GLB1

<400>  8
atgcccggct ttctcgtgcg gattctcctg ctgctgctgg tgcttctgct gctgggccct       60

accagaggcc tgagaaacgc cacccagcgg atgttcgaga tcgactacag ccgggacagc      120

ttcctgaagg acggccagcc cttccggtac atcagcggca gcatccacta cagcagagtg      180

ccccggttct actggaagga ccggctgctg aagatgaaga tggccggcct gaacgccatc      240

cagacctacg tgccctggaa cttccacgag ccttggcctg gccagtacca gttcagcgag      300

gaccacgacg tggaatactt tctgcggctg gcccacgagc tgggcctgct cgtgattctg      360

aggcctggcc cttacatctg cgccgagtgg gagatgggag gactgcctgc ttggctgctg      420

gaaaaagaga gcatcctgct gcggagcagc gaccccgatt atctggccgc cgtggataag      480

tggctgggcg tgctgctgcc caagatgaag cccctgctgt accagaacgg cggacccgtg      540

atcaccgtgc aggtggaaaa cgagtacggc agctacttcg cctgcgactt cgactacctg      600

cggttcctgc agaagcggtt cagacaccac ctgggcgacg acgtggtgct gttcacaaca      660

gacggcgccc acaagacctt tctgaagtgt ggcgctctgc agggcctgta caccaccgtg      720

gattttggca ccggcagcaa tatcaccgac gcctttctga gccagcggaa gtgcgagcca      780

aagggccccc tgatcaacag cgagttctac accggctggc tggaccactg gggccagcct      840

cacagcacca tcaagacaga ggccgtggcc agcagcctgt acgacatcct ggctagaggc      900

gccagcgtga acctgtacat gtttatcggc ggcaccaact tcgcctactg gaacggcgcc      960

aacagccctt atgccgccca gcccaccagc tacgactacg atgcccctct gtctgaggcc     1020

ggcgacctga ccgagaagta ctttgccctg cggaacatca tccagaaatt cgagaaggtg     1080

cccgagggcc ccatcccccc tagcacacct aagttcgcct acggcaaagt gaccctggaa     1140

aagctgaaaa ccgtgggagc cgccctggac atcctgtgtc ctagcggccc tatcaagagc     1200

ctgtaccccc tgaccttcat ccaagtgaag cagcactacg gcttcgtgct gtaccggacc     1260

accctgcccc aggactgtag caatcctgcc ccactgagca gccccctgaa cggcgtgcac     1320

gatagagcct acgtggccgt ggatggcatc ccacaggggg tgctggaacg gaacaatgtg     1380

atcaccctga acatcaccgg caaggctggc gccaccctgg acctgctggt ggaaaacatg     1440

ggcagagtga actacggcgc ctacatcaac gacttcaagg gcctggtgtc caacctgacc     1500

ctgagcagca acatcctgac cgactggacc atcttcccac tggacaccga ggatgccgtg     1560

cggagccatc tgggaggatg gggacacaga gatagcggcc accacgatga agcctgggcc     1620

cacaacagca gcaactacac cctgcctgcc ttctacatgg gcaacttcag catccccagc     1680

ggcatccccg acctgccaca ggacaccttt atccagttcc ccggctggac aaagggacaa     1740

gtgtggatca atggcttcaa cctgggcaga tactggcccg ccagaggccc tcagctgacc     1800

ctgtttgtgc cccagcacat tctgatgacc agcgccccca acaccatcac cgtgctggaa     1860

ctggaatggg ccccctgcag cagcgacgac cctgaactgt gtgccgtgac cttcgtggac     1920

aggcccgtga tcggcagcag cgtgacctac gaccacccca gcaagcccgt ggaaaagcgg     1980

ctgatgcctc ccccacccca gaagaacaag gactcctggc tggatcacgt gtga           2034


<210>  9
<211>  1229
<212>  DNA
<213>  Homo sapiens

<400>  9
ggcctccgcg ccgggttttg gcgcctcccg cgggcgcccc cctcctcacg gcgagcgctg       60

ccacgtcaga cgaagggcgc agcgagcgtc ctgatccttc cgcccggacg ctcaggacag      120

cggcccgctg ctcataagac tcggccttag aaccccagta tcagcagaag gacattttag      180

gacgggactt gggtgactct agggcactgg ttttctttcc agagagcgga acaggcgagg      240

aaaagtagtc ccttctcggc gattctgcgg agggatctcc gtggggcggt gaacgccgat      300

gattatataa ggacgcgccg ggtgtggcac agctagttcc gtcgcagccg ggatttgggt      360

cgcggttctt gtttgtggat cgctgtgatc gtcacttggt gagtagcggg ctgctgggct      420

ggccggggct ttcgtggccg ccgggccgct cggtgggacg gaagcgtgtg gagagaccgc      480

caagggctgt agtctgggtc cgcgagcaag gttgccctga actgggggtt ggggggagcg      540

cagcaaaatg gcggctgttc ccgagtcttg aatggaagac gcttgtgagg cgggctgtga      600

ggtcgttgaa acaaggtggg gggcatggtg ggcggcaaga acccaaggtc ttgaggcctt      660

cgctaatgcg ggaaagctct tattcgggtg agatgggctg gggcaccatc tggggaccct      720

gacgtgaagt ttgtcactga ctggagaact cggtttgtcg tctgttgcgg gggcggcagt      780

tatggcggtg ccgttgggca gtgcacccgt acctttggga gcgcgcgccc tcgtcgtgtc      840

gtgacgtcac ccgttctgtt ggcttataat gcagggtggg gccacctgcc ggtaggtgtg      900

cggtaggctt ttctccgtcg caggacgcag ggttcgggcc tagggtaggc tctcctgaat      960

cgacaggcgc cggacctctg gtgaggggag ggataagtga ggcgtcagtt tctttggtcg     1020

gttttatgta cctatcttct taagtagctg aagctccggt tttgaactat gcgctcgggg     1080

ttggcgagtg tgttttgtga agttttttag gcaccttttg aaatgtaatc atttgggtca     1140

atatgtaatt ttcagtgtta gactagtaaa ttgtccgcta aattctggcc gtttttggct     1200

tttttgttag acgaagcttt attgcggta                                       1229


<210>  10
<211>  666
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  chicken beta actin promoter with a cytomegalovirus enhancer (CB7)

<400>  10
ctagtcgaca ttgattattg actagttatt aatagtaatc aattacgggg tcattagttc       60

atagcccata tatggagttc cgcgttacat aacttacggt aaatggcccg cctggctgac      120

cgcccaacga cccccgccca ttgacgtcaa taatgacgta tgttcccata gtaacgccaa      180

tagggacttt ccattgacgt caatgggtgg actatttacg gtaaactgcc cacttggcag      240

tacatcaagt gtatcatatg ccaagtacgc cccctattga cgtcaatgac ggtaaatggc      300

ccgcctggca ttatgcccag tacatgacct tatgggactt tcctacttgg cagtacatct      360

acgtattagt catcgctatt accatggtcg aggtgagccc cacgttctgc ttcactctcc      420

ccatctcccc cccctcccca cccccaattt tgtatttatt tattttttaa ttattttgtg      480

cagcgatggg ggcggggggg gggggggggc gcgcgccagg cggggcgggg cggggcgagg      540

ggcggggcgg ggcgaggcgg agaggtgcgg cggcagccaa tcagagcggc gcgctccgaa      600

agtttccttt tatggcgagg cggcggcggc ggcggcccta taaaaagcga agcgcgcggc      660

gggcgg                                                                 666


<210>  11
<211>  1180
<212>  DNA
<213>  Artificial sequence

<220>
<223>  human elongation initiation factor 1 alpha promoter (EF1a)

<400>  11
tggctccggt gcccgtcagt gggcagagcg cacatcgccc acagtccccg agaagttggg       60

gggaggggtc ggcaattgaa ccggtgccta gagaaggtgg cgcggggtaa actgggaaag      120

tgatgtcgtg tactggctcc gcctttttcc cgagggtggg ggagaaccgt atataagtgc      180

agtagtcgcc gtgaacgttc tttttcgcaa cgggtttgcc gccagaacac aggtaagtgc      240

cgtgtgtggt tcccgcgggc ctggcctctt tacgggttat ggcccttgcg tgccttgaat      300

tacttccacc tggctgcagt acgtgattct tgatcccgag cttcgggttg gaagtgggtg      360

ggagagttcg aggccttgcg cttaaggagc cccttcgcct cgtgcttgag ttgaggcctg      420

gcctgggcgc tggggccgcc gcgtgcgaat ctggtggcac cttcgcgcct gtctcgctgc      480

tttcgataag tctctagcca tttaaaattt ttgatgacct gctgcgacgc tttttttctg      540

gcaagatagt cttgtaaatg cgggccaaga tctgcacact ggtatttcgg tttttggggc      600

cgcgggcggc gacggggccc gtgcgtccca gcgcacatgt tcggcgaggc ggggcctgcg      660

agcgcggcca ccgagaatcg gacgggggta gtctcaagct ggccggcctg ctctggtgcc      720

tggcctcgcg ccgccgtgta tcgccccgcc ctgggcggca aggctggccc ggtcggcacc      780

agttgcgtga gcggaaagat ggccgcttcc cggccctgct gcagggagct caaaatggag      840

gacgcggcgc tcgggagagc gggcgggtga gtcacccaca caaaggaaaa gggcctttcc      900

gtcctcagcc gtcgcttcat gtgactccac ggagtaccgg gcgccgtcca ggcacctcga      960

ttagttctcg agcttttgga gtacgtcgtc tttaggttgg ggggaggggt tttatgcgat     1020

ggagtttccc cacactgagt gggtggagac tgaagttagg ccagcttggc acttgatgta     1080

attctccttg gaatttgccc tttttgagtt tggatcttgg ttcattctca agcctcagac     1140

agtggttcaa agtttttttc ttccatttca ggtgtcgtga                           1180


<210>  12
<211>  4205
<212>  DNA
<213>  Artificial sequence

<220>
<223>  UbC.GLB1.SV40 vector genome

<400>  12
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gccatgctct      180

aggaagatct ggcctccgcg ccgggttttg gcgcctcccg cgggcgcccc cctcctcacg      240

gcgagcgctg ccacgtcaga cgaagggcgc agcgagcgtc ctgatccttc cgcccggacg      300

ctcaggacag cggcccgctg ctcataagac tcggccttag aaccccagta tcagcagaag      360

gacattttag gacgggactt gggtgactct agggcactgg ttttctttcc agagagcgga      420

acaggcgagg aaaagtagtc ccttctcggc gattctgcgg agggatctcc gtggggcggt      480

gaacgccgat gattatataa ggacgcgccg ggtgtggcac agctagttcc gtcgcagccg      540

ggatttgggt cgcggttctt gtttgtggat cgctgtgatc gtcacttggt gagtagcggg      600

ctgctgggct ggccggggct ttcgtggccg ccgggccgct cggtgggacg gaagcgtgtg      660

gagagaccgc caagggctgt agtctgggtc cgcgagcaag gttgccctga actgggggtt      720

ggggggagcg cagcaaaatg gcggctgttc ccgagtcttg aatggaagac gcttgtgagg      780

cgggctgtga ggtcgttgaa acaaggtggg gggcatggtg ggcggcaaga acccaaggtc      840

ttgaggcctt cgctaatgcg ggaaagctct tattcgggtg agatgggctg gggcaccatc      900

tggggaccct gacgtgaagt ttgtcactga ctggagaact cggtttgtcg tctgttgcgg      960

gggcggcagt tatggcggtg ccgttgggca gtgcacccgt acctttggga gcgcgcgccc     1020

tcgtcgtgtc gtgacgtcac ccgttctgtt ggcttataat gcagggtggg gccacctgcc     1080

ggtaggtgtg cggtaggctt ttctccgtcg caggacgcag ggttcgggcc tagggtaggc     1140

tctcctgaat cgacaggcgc cggacctctg gtgaggggag ggataagtga ggcgtcagtt     1200

tctttggtcg gttttatgta cctatcttct taagtagctg aagctccggt tttgaactat     1260

gcgctcgggg ttggcgagtg tgttttgtga agttttttag gcaccttttg aaatgtaatc     1320

atttgggtca atatgtaatt ttcagtgtta gactagtaaa ttgtccgcta aattctggcc     1380

gtttttggct tttttgttag acgaagcttt attgcggtag tttatcacag ttaaattgct     1440

aacgcagtca gtgcttctga cacaacagtc tcgaacttaa gctgcagaag ttggtcgtga     1500

ggcactgggc aggtaagtat caaggttaca agacaggttt aaggagacca atagaaactg     1560

ggcttgtcga gacagagaag actcttgcgt ttctgatagg cacctattgg tcttactgac     1620

atccactttg cctttctctc cacaggtgtc cactcccagt tcaattacag ctcttaaggc     1680

tagagtactt aatacgactc actataggct agaattcacg cgtgccacca tgcccggctt     1740

tctcgtgcgg attctcctgc tgctgctggt gcttctgctg ctgggcccta ccagaggcct     1800

gagaaacgcc acccagcgga tgttcgagat cgactacagc cgggacagct tcctgaagga     1860

cggccagccc ttccggtaca tcagcggcag catccactac agcagagtgc cccggttcta     1920

ctggaaggac cggctgctga agatgaagat ggccggcctg aacgccatcc agacctacgt     1980

gccctggaac ttccacgagc cttggcctgg ccagtaccag ttcagcgagg accacgacgt     2040

ggaatacttt ctgcggctgg cccacgagct gggcctgctc gtgattctga ggcctggccc     2100

ttacatctgc gccgagtggg agatgggagg actgcctgct tggctgctgg aaaaagagag     2160

catcctgctg cggagcagcg accccgatta tctggccgcc gtggataagt ggctgggcgt     2220

gctgctgccc aagatgaagc ccctgctgta ccagaacggc ggacccgtga tcaccgtgca     2280

ggtggaaaac gagtacggca gctacttcgc ctgcgacttc gactacctgc ggttcctgca     2340

gaagcggttc agacaccacc tgggcgacga cgtggtgctg ttcacaacag acggcgccca     2400

caagaccttt ctgaagtgtg gcgctctgca gggcctgtac accaccgtgg attttggcac     2460

cggcagcaat atcaccgacg cctttctgag ccagcggaag tgcgagccaa agggccccct     2520

gatcaacagc gagttctaca ccggctggct ggaccactgg ggccagcctc acagcaccat     2580

caagacagag gccgtggcca gcagcctgta cgacatcctg gctagaggcg ccagcgtgaa     2640

cctgtacatg tttatcggcg gcaccaactt cgcctactgg aacggcgcca acagccctta     2700

tgccgcccag cccaccagct acgactacga tgcccctctg tctgaggccg gcgacctgac     2760

cgagaagtac tttgccctgc ggaacatcat ccagaaattc gagaaggtgc ccgagggccc     2820

catcccccct agcacaccta agttcgccta cggcaaagtg accctggaaa agctgaaaac     2880

cgtgggagcc gccctggaca tcctgtgtcc tagcggccct atcaagagcc tgtaccccct     2940

gaccttcatc caagtgaagc agcactacgg cttcgtgctg taccggacca ccctgcccca     3000

ggactgtagc aatcctgccc cactgagcag ccccctgaac ggcgtgcacg atagagccta     3060

cgtggccgtg gatggcatcc cacagggggt gctggaacgg aacaatgtga tcaccctgaa     3120

catcaccggc aaggctggcg ccaccctgga cctgctggtg gaaaacatgg gcagagtgaa     3180

ctacggcgcc tacatcaacg acttcaaggg cctggtgtcc aacctgaccc tgagcagcaa     3240

catcctgacc gactggacca tcttcccact ggacaccgag gatgccgtgc ggagccatct     3300

gggaggatgg ggacacagag atagcggcca ccacgatgaa gcctgggccc acaacagcag     3360

caactacacc ctgcctgcct tctacatggg caacttcagc atccccagcg gcatccccga     3420

cctgccacag gacaccttta tccagttccc cggctggaca aagggacaag tgtggatcaa     3480

tggcttcaac ctgggcagat actggcccgc cagaggccct cagctgaccc tgtttgtgcc     3540

ccagcacatt ctgatgacca gcgcccccaa caccatcacc gtgctggaac tggaatgggc     3600

cccctgcagc agcgacgacc ctgaactgtg tgccgtgacc ttcgtggaca ggcccgtgat     3660

cggcagcagc gtgacctacg accaccccag caagcccgtg gaaaagcggc tgatgcctcc     3720

cccaccccag aagaacaagg actcctggct ggatcacgtg tgatgactcg aggccgcttc     3780

gagcagacat gataagatac attgatgagt ttggacaaac cacaactaga atgcagtgaa     3840

aaaaatgctt tatttgtgaa atttgtgatg ctattgcttt atttgtaacc attataagct     3900

gcaataaaca agttaacaac aacaattgca ttcattttat gtttcaggtt cagggggaga     3960

tgtgggaggt tttttaaagc aagtaaaacc tctacaaatg tggtaaaatc gataaggatc     4020

ttcctagagc atggctacgt agataagtag catggcgggt taatcattaa ctacaaggaa     4080

cccctagtga tggagttggc cactccctct ctgcgcgctc gctcgctcac tgaggccggg     4140

cgaccaaagg tcgcccgacg cccgggcttt gcccgggcgg cctcagtgag cgagcgagcg     4200

cgcag                                                                 4205


<210>  13
<211>  4081
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  EF1a.GLB1.SV40 vector genome

<400>  13
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gccatgctct      180

aggaagatcc gatgtacggg ccagatatac gcgttgacat tgattattga ctaggctttt      240

gcaaaaagct ttgcaaagat ggataaagtt ttaaacagag aggaatcttt gcagctaatg      300

gaccttctag gtcttgaaag gagtgggaat tggctccggt gcccgtcagt gggcagagcg      360

cacatcgccc acagtccccg agaagttggg gggaggggtc ggcaattgaa ccggtgccta      420

gagaaggtgg cgcggggtaa actgggaaag tgatgtcgtg tactggctcc gcctttttcc      480

cgagggtggg ggagaaccgt atataagtgc agtagtcgcc gtgaacgttc tttttcgcaa      540

cgggtttgcc gccagaacac aggtaagtgc cgtgtgtggt tcccgcgggc ctggcctctt      600

tacgggttat ggcccttgcg tgccttgaat tacttccacc tggctgcagt acgtgattct      660

tgatcccgag cttcgggttg gaagtgggtg ggagagttcg aggccttgcg cttaaggagc      720

cccttcgcct cgtgcttgag ttgaggcctg gcctgggcgc tggggccgcc gcgtgcgaat      780

ctggtggcac cttcgcgcct gtctcgctgc tttcgataag tctctagcca tttaaaattt      840

ttgatgacct gctgcgacgc tttttttctg gcaagatagt cttgtaaatg cgggccaaga      900

tctgcacact ggtatttcgg tttttggggc cgcgggcggc gacggggccc gtgcgtccca      960

gcgcacatgt tcggcgaggc ggggcctgcg agcgcggcca ccgagaatcg gacgggggta     1020

gtctcaagct ggccggcctg ctctggtgcc tggcctcgcg ccgccgtgta tcgccccgcc     1080

ctgggcggca aggctggccc ggtcggcacc agttgcgtga gcggaaagat ggccgcttcc     1140

cggccctgct gcagggagct caaaatggag gacgcggcgc tcgggagagc gggcgggtga     1200

gtcacccaca caaaggaaaa gggcctttcc gtcctcagcc gtcgcttcat gtgactccac     1260

ggagtaccgg gcgccgtcca ggcacctcga ttagttctcg agcttttgga gtacgtcgtc     1320

tttaggttgg ggggaggggt tttatgcgat ggagtttccc cacactgagt gggtggagac     1380

tgaagttagg ccagcttggc acttgatgta attctccttg gaatttgccc tttttgagtt     1440

tggatcttgg ttcattctca agcctcagac agtggttcaa agtttttttc ttccatttca     1500

ggtgtcgtga ggaattagct tggtactaat acgactcact atagggagac ccaagctggc     1560

taggtaagct tggtaccgag ctcggatcaa ttcacgcgtg ccaccatgcc cggctttctc     1620

gtgcggattc tcctgctgct gctggtgctt ctgctgctgg gccctaccag aggcctgaga     1680

aacgccaccc agcggatgtt cgagatcgac tacagccggg acagcttcct gaaggacggc     1740

cagcccttcc ggtacatcag cggcagcatc cactacagca gagtgccccg gttctactgg     1800

aaggaccggc tgctgaagat gaagatggcc ggcctgaacg ccatccagac ctacgtgccc     1860

tggaacttcc acgagccttg gcctggccag taccagttca gcgaggacca cgacgtggaa     1920

tactttctgc ggctggccca cgagctgggc ctgctcgtga ttctgaggcc tggcccttac     1980

atctgcgccg agtgggagat gggaggactg cctgcttggc tgctggaaaa agagagcatc     2040

ctgctgcgga gcagcgaccc cgattatctg gccgccgtgg ataagtggct gggcgtgctg     2100

ctgcccaaga tgaagcccct gctgtaccag aacggcggac ccgtgatcac cgtgcaggtg     2160

gaaaacgagt acggcagcta cttcgcctgc gacttcgact acctgcggtt cctgcagaag     2220

cggttcagac accacctggg cgacgacgtg gtgctgttca caacagacgg cgcccacaag     2280

acctttctga agtgtggcgc tctgcagggc ctgtacacca ccgtggattt tggcaccggc     2340

agcaatatca ccgacgcctt tctgagccag cggaagtgcg agccaaaggg ccccctgatc     2400

aacagcgagt tctacaccgg ctggctggac cactggggcc agcctcacag caccatcaag     2460

acagaggccg tggccagcag cctgtacgac atcctggcta gaggcgccag cgtgaacctg     2520

tacatgttta tcggcggcac caacttcgcc tactggaacg gcgccaacag cccttatgcc     2580

gcccagccca ccagctacga ctacgatgcc cctctgtctg aggccggcga cctgaccgag     2640

aagtactttg ccctgcggaa catcatccag aaattcgaga aggtgcccga gggccccatc     2700

ccccctagca cacctaagtt cgcctacggc aaagtgaccc tggaaaagct gaaaaccgtg     2760

ggagccgccc tggacatcct gtgtcctagc ggccctatca agagcctgta ccccctgacc     2820

ttcatccaag tgaagcagca ctacggcttc gtgctgtacc ggaccaccct gccccaggac     2880

tgtagcaatc ctgccccact gagcagcccc ctgaacggcg tgcacgatag agcctacgtg     2940

gccgtggatg gcatcccaca gggggtgctg gaacggaaca atgtgatcac cctgaacatc     3000

accggcaagg ctggcgccac cctggacctg ctggtggaaa acatgggcag agtgaactac     3060

ggcgcctaca tcaacgactt caagggcctg gtgtccaacc tgaccctgag cagcaacatc     3120

ctgaccgact ggaccatctt cccactggac accgaggatg ccgtgcggag ccatctggga     3180

ggatggggac acagagatag cggccaccac gatgaagcct gggcccacaa cagcagcaac     3240

tacaccctgc ctgccttcta catgggcaac ttcagcatcc ccagcggcat ccccgacctg     3300

ccacaggaca cctttatcca gttccccggc tggacaaagg gacaagtgtg gatcaatggc     3360

ttcaacctgg gcagatactg gcccgccaga ggccctcagc tgaccctgtt tgtgccccag     3420

cacattctga tgaccagcgc ccccaacacc atcaccgtgc tggaactgga atgggccccc     3480

tgcagcagcg acgaccctga actgtgtgcc gtgaccttcg tggacaggcc cgtgatcggc     3540

agcagcgtga cctacgacca ccccagcaag cccgtggaaa agcggctgat gcctccccca     3600

ccccagaaga acaaggactc ctggctggat cacgtgtgat gactcgaggc cgcttcgagc     3660

agacatgata agatacattg atgagtttgg acaaaccaca actagaatgc agtgaaaaaa     3720

atgctttatt tgtgaaattt gtgatgctat tgctttattt gtaaccatta taagctgcaa     3780

taaacaagtt aacaacaaca attgcattca ttttatgttt caggttcagg gggagatgtg     3840

ggaggttttt taaagcaagt aaaacctcta caaatgtggt aaaatcgata aggatcttcc     3900

tagagcatgg ctacgtagat aagtagcatg gcgggttaat cattaactac aaggaacccc     3960

tagtgatgga gttggccact ccctctctgc gcgctcgctc gctcactgag gccgggcgac     4020

caaaggtcgc ccgacgcccg ggctttgccc gggcggcctc agtgagcgag cgagcgcgca     4080

g                                                                     4081


<210>  14
<211>  4202
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  UbC.GLB1.SV40 - 2

<400>  14
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gccatgctct      180

aggaagatct ggcctccgcg ccgggttttg gcgcctcccg cgggcgcccc cctcctcacg      240

gcgagcgctg ccacgtcaga cgaagggcgc agcgagcgtc ctgatccttc cgcccggacg      300

ctcaggacag cggcccgctg ctcataagac tcggccttag aaccccagta tcagcagaag      360

gacattttag gacgggactt gggtgactct agggcactgg ttttctttcc agagagcgga      420

acaggcgagg aaaagtagtc ccttctcggc gattctgcgg agggatctcc gtggggcggt      480

gaacgccgat gattatataa ggacgcgccg ggtgtggcac agctagttcc gtcgcagccg      540

ggatttgggt cgcggttctt gtttgtggat cgctgtgatc gtcacttggt gagtagcggg      600

ctgctgggct ggccggggct ttcgtggccg ccgggccgct cggtgggacg gaagcgtgtg      660

gagagaccgc caagggctgt agtctgggtc cgcgagcaag gttgccctga actgggggtt      720

ggggggagcg cagcaaaatg gcggctgttc ccgagtcttg aatggaagac gcttgtgagg      780

cgggctgtga ggtcgttgaa acaaggtggg gggcatggtg ggcggcaaga acccaaggtc      840

ttgaggcctt cgctaatgcg ggaaagctct tattcgggtg agatgggctg gggcaccatc      900

tggggaccct gacgtgaagt ttgtcactga ctggagaact cggtttgtcg tctgttgcgg      960

gggcggcagt tatggcggtg ccgttgggca gtgcacccgt acctttggga gcgcgcgccc     1020

tcgtcgtgtc gtgacgtcac ccgttctgtt ggcttataat gcagggtggg gccacctgcc     1080

ggtaggtgtg cggtaggctt ttctccgtcg caggacgcag ggttcgggcc tagggtaggc     1140

tctcctgaat cgacaggcgc cggacctctg gtgaggggag ggataagtga ggcgtcagtt     1200

tctttggtcg gttttatgta cctatcttct taagtagctg aagctccggt tttgaactat     1260

gcgctcgggg ttggcgagtg tgttttgtga agttttttag gcaccttttg aaatgtaatc     1320

atttgggtca atatgtaatt ttcagtgtta gactagtaaa ttgtccgcta aattctggcc     1380

gtttttggct tttttgttag acgaagcttt attgcggtag tttatcacag ttaaattgct     1440

aacgcagtca gtgcttctga cacaacagtc tcgaacttaa gctgcagaag ttggtcgtga     1500

ggcactgggc aggtaagtat caaggttaca agacaggttt aaggagacca atagaaactg     1560

ggcttgtcga gacagagaag actcttgcgt ttctgatagg cacctattgg tcttactgac     1620

atccactttg cctttctctc cacaggtgtc cactcccagt tcaattacag ctcttaaggc     1680

tagagtactt aatacgactc actataggct agaattcacg cgtgccacca tgccgggctt     1740

tctggtgcgc attctgctgc tgctgctggt gctgctgctg ctgggcccga cccgcggcct     1800

gcgcaacgcg acccagcgca tgtttgaaat tgattatagc cgcgatagct ttctgaaaga     1860

tggccagccg tttcgctata ttagcggcag cattcattat agccgcgtgc cgcgctttta     1920

ttggaaagat cgcctgctga aaatgaaaat ggcgggcctg aacgcgattc agacctatgt     1980

gccgtggaac tttcatgaac cgtggccggg ccagtatcag tttagcgaag atcatgatgt     2040

ggaatatttt ctgcgcctgg cgcatgaact gggcctgctg gtgattctgc gcccgggccc     2100

gtatatttgc gcggaatggg aaatgggcgg cctgccggcg tggctgctgg aaaaagaaag     2160

cattctgctg cgcagcagcg atccggatta tctggcggcg gtggataaat ggctgggcgt     2220

gctgctgccg aaaatgaaac cgctgctgta tcagaacggc ggcccggtga ttaccgtgca     2280

ggtggaaaac gaatatggca gctattttgc gtgcgatttt gattatctgc gctttctgca     2340

gaaacgcttt cgccatcatc tgggcgatga tgtggtgctg tttaccaccg atggcgcgca     2400

taaaaccttt ctgaaatgcg gcgcgctgca gggcctgtat accaccgtgg attttggcac     2460

cggcagcaac attaccgatg cgtttctgag ccagcgcaaa tgcgaaccga aaggcccgct     2520

gattaacagc gaattttata ccggctggct ggatcattgg ggccagccgc atagcaccat     2580

taaaaccgaa gcggtggcga gcagcctgta tgatattctg gcgcgcggcg cgagcgtgaa     2640

cctgtatatg tttattggcg gcaccaactt tgcgtattgg aacggcgcga acagcccgta     2700

tgcggcgcag ccgaccagct atgattatga tgcgccgctg agcgaagcgg gcgatctgac     2760

cgaaaaatat tttgcgctgc gcaacattat tcagaaattt gaaaaagtgc cggaaggccc     2820

gattccgccg agcaccccga aatttgcgta tggcaaagtg accctggaaa aactgaaaac     2880

cgtgggcgcg gcgctggata ttctgtgccc gagcggcccg attaaaagcc tgtatccgct     2940

gacctttatt caggtgaaac agcattatgg ctttgtgctg tatcgcacca ccctgccgca     3000

ggattgcagc aacccggcgc cgctgagcag cccgctgaac ggcgtgcatg atcgcgcgta     3060

tgtggcggtg gatggcattc cgcagggcgt gctggaacgc aacaacgtga ttaccctgaa     3120

cattaccggc aaagcgggcg cgaccctgga tctgctggtg gaaaacatgg gccgcgtgaa     3180

ctatggcgcg tatattaacg attttaaagg cctggtgagc aacctgaccc tgagcagcaa     3240

cattctgacc gattggacca tttttccgct ggataccgaa gatgcggtgc gcagccatct     3300

gggcggctgg ggccatcgcg atagcggcca tcatgatgaa gcgtgggcgc ataacagcag     3360

caactatacc ctgccggcgt tttatatggg caactttagc attccgagcg gcattccgga     3420

tctgccgcag gataccttta ttcagtttcc gggctggacc aaaggccagg tgtggattaa     3480

cggctttaac ctgggccgct attggccggc gcgcggcccg cagctgaccc tgtttgtgcc     3540

gcagcatatt ctgatgacca gcgcgccgaa caccattacc gtgctggaac tggaatgggc     3600

gccgtgcagc agcgatgatc cggaactgtg cgcggtgacc tttgtggatc gcccggtgat     3660

tggcagcagc gtgacctatg atcatccgag caaaccggtg gaaaaacgcc tgatgccgcc     3720

gccgccgcag aaaaacaaag atagctggct ggatcatgtg tgactcgagg ccgcttcgag     3780

cagacatgat aagatacatt gatgagtttg gacaaaccac aactagaatg cagtgaaaaa     3840

aatgctttat ttgtgaaatt tgtgatgcta ttgctttatt tgtaaccatt ataagctgca     3900

ataaacaagt taacaacaac aattgcattc attttatgtt tcaggttcag ggggagatgt     3960

gggaggtttt ttaaagcaag taaaacctct acaaatgtgg taaaatcgat aaggatcttc     4020

ctagagcatg gctacgtaga taagtagcat ggcgggttaa tcattaacta caaggaaccc     4080

ctagtgatgg agttggccac tccctctctg cgcgctcgct cgctcactga ggccgggcga     4140

ccaaaggtcg cccgacgccc gggctttgcc cgggcggcct cagtgagcga gcgagcgcgc     4200

ag                                                                    4202


<210>  15
<211>  4206
<212>  DNA
<213>  Artificial sequence

<220>
<223>  UbC.GLB1.SV40 - 3

<400>  15
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctacgta gccatgctct      180

aggaagatct ggcctccgcg ccgggttttg gcgcctcccg cgggcgcccc cctcctcacg      240

gcgagcgctg ccacgtcaga cgaagggcgc agcgagcgtc ctgatccttc cgcccggacg      300

ctcaggacag cggcccgctg ctcataagac tcggccttag aaccccagta tcagcagaag      360

gacattttag gacgggactt gggtgactct agggcactgg ttttctttcc agagagcgga      420

acaggcgagg aaaagtagtc ccttctcggc gattctgcgg agggatctcc gtggggcggt      480

gaacgccgat gattatataa ggacgcgccg ggtgtggcac agctagttcc gtcgcagccg      540

ggatttgggt cgcggttctt gtttgtggat cgctgtgatc gtcacttggt gagtagcggg      600

ctgctgggct ggccggggct ttcgtggccg ccgggccgct cggtgggacg gaagcgtgtg      660

gagagaccgc caagggctgt agtctgggtc cgcgagcaag gttgccctga actgggggtt      720

ggggggagcg cagcaaaatg gcggctgttc ccgagtcttg aatggaagac gcttgtgagg      780

cgggctgtga ggtcgttgaa acaaggtggg gggcatggtg ggcggcaaga acccaaggtc      840

ttgaggcctt cgctaatgcg ggaaagctct tattcgggtg agatgggctg gggcaccatc      900

tggggaccct gacgtgaagt ttgtcactga ctggagaact cggtttgtcg tctgttgcgg      960

gggcggcagt tatggcggtg ccgttgggca gtgcacccgt acctttggga gcgcgcgccc     1020

tcgtcgtgtc gtgacgtcac ccgttctgtt ggcttataat gcagggtggg gccacctgcc     1080

ggtaggtgtg cggtaggctt ttctccgtcg caggacgcag ggttcgggcc tagggtaggc     1140

tctcctgaat cgacaggcgc cggacctctg gtgaggggag ggataagtga ggcgtcagtt     1200

tctttggtcg gttttatgta cctatcttct taagtagctg aagctccggt tttgaactat     1260

gcgctcgggg ttggcgagtg tgttttgtga agttttttag gcaccttttg aaatgtaatc     1320

atttgggtca atatgtaatt ttcagtgtta gactagtaaa ttgtccgcta aattctggcc     1380

gtttttggct tttttgttag acgaagcttt attgcggtag tttatcacag ttaaattgct     1440

aacgcagtca gtgcttctga cacaacagtc tcgaacttaa gctgcagaag ttggtcgtga     1500

ggcactgggc aggtaagtat caaggttaca agacaggttt aaggagacca atagaaactg     1560

ggcttgtcga gacagagaag actcttgcgt ttctgatagg cacctattgg tcttactgac     1620

atccactttg cctttctctc cacaggtgtc cactcccagt tcaattacag ctcttaaggc     1680

tagagtactt aatacgactc actataggct agaattcacg cgtgccacca tgccggggtt     1740

cctggttcgc atcctccttc tgctgctggt tctgctgctt ctgggcccta cgcgcggctt     1800

gcgcaatgcc acccagagga tgtttgaaat tgactatagc cgggactcct tcctcaagga     1860

tggccagcca tttcgctaca tctcaggaag cattcactac tcccgtgtgc cccgcttcta     1920

ctggaaggac cggctgctga agatgaagat ggctgggctg aacgccatcc agacgtatgt     1980

gccctggaac tttcatgagc cctggccagg acagtaccag ttttctgagg accatgatgt     2040

ggaatatttt cttcggctgg ctcatgagct gggactgctg gttatcctga ggcccgggcc     2100

ctacatctgt gcagagtggg aaatgggagg attacctgct tggctgctag agaaagagtc     2160

tattcttctc cgctcctccg acccagatta cctggcagct gtggacaagt ggttgggagt     2220

ccttctgccc aagatgaagc ctctcctcta tcagaatgga gggccagtta taacagtgca     2280

ggttgaaaat gaatatggca gctactttgc ctgtgatttt gactacctgc gcttcctgca     2340

gaagcgcttt cgccaccatc tgggggatga tgtggttctg tttaccactg atggagcaca     2400

taaaacattc ctgaaatgtg gggccctgca gggcctctac accacggtgg actttggaac     2460

aggcagcaac atcacagatg ctttcctaag ccagaggaag tgtgagccca aaggaccctt     2520

gatcaattct gaattctata ctggctggct agatcactgg ggccaacctc actccacaat     2580

caagaccgaa gcagtggctt cctccctcta tgatatactt gcccgtgggg cgagtgtgaa     2640

cttgtacatg tttataggtg ggaccaattt tgcctattgg aatggggcca actcacccta     2700

tgcagcacag cccaccagct acgactatga tgccccactg agtgaggctg gggacctcac     2760

tgagaagtat tttgctctgc gaaacatcat ccagaagttt gaaaaagtac cagaaggtcc     2820

tatccctcca tctacaccaa agtttgcata tggaaaggtc actttggaaa agttaaagac     2880

agtgggagca gctctggaca ttctgtgtcc ctctgggccc atcaaaagcc tttatccctt     2940

gacatttatc caggtgaaac agcattatgg gtttgtgctg taccggacaa cacttcctca     3000

agattgcagc aacccagcac ctctctcttc acccctcaat ggagtccacg atcgagcata     3060

tgttgctgtg gatgggatcc cccagggagt ccttgagcga aacaatgtga tcactctgaa     3120

cataacaggg aaagctggag ccactctgga ccttctggta gagaacatgg gacgtgtgaa     3180

ctatggtgca tatatcaacg attttaaggg tttggtttct aacctgactc tcagttccaa     3240

tatcctcacg gactggacga tctttccact ggacactgag gatgcagtgc gcagccacct     3300

ggggggctgg ggacaccgtg acagtggcca ccatgatgaa gcctgggccc acaactcatc     3360

caactacacg ctcccggcct tttatatggg gaacttctcc attcccagtg ggatcccaga     3420

cttgccccag gacaccttta tccagtttcc tggatggacc aagggccagg tctggattaa     3480

tggctttaac cttggccgct attggccagc ccggggccct cagttgacct tgtttgtgcc     3540

ccagcacatc ctgatgacct cggccccaaa caccatcacc gtgctggaac tggagtgggc     3600

accctgcagc agtgatgatc cagaactatg tgctgtgacg ttcgtggaca ggccagttat     3660

tggctcatct gtgacctacg atcatccctc caaacctgtt gaaaaaagac tcatgccccc     3720

acccccgcaa aaaaacaaag attcatggct ggaccatgta tgaatgactc gaggccgctt     3780

cgagcagaca tgataagata cattgatgag tttggacaaa ccacaactag aatgcagtga     3840

aaaaaatgct ttatttgtga aatttgtgat gctattgctt tatttgtaac cattataagc     3900

tgcaataaac aagttaacaa caacaattgc attcatttta tgtttcaggt tcagggggag     3960

atgtgggagg ttttttaaag caagtaaaac ctctacaaat gtggtaaaat cgataaggat     4020

cttcctagag catggctacg tagataagta gcatggcggg ttaatcatta actacaagga     4080

acccctagtg atggagttgg ccactccctc tctgcgcgct cgctcgctca ctgaggccgg     4140

gcgaccaaag gtcgcccgac gcccgggctt tgcccgggcg gcctcagtga gcgagcgagc     4200

gcgcag                                                                4206


<210>  16
<211>  4362
<212>  DNA
<213>  Artificial sequence

<220>
<223>  Vector genome CB7.CI.GLB1.RBG


<220>
<221>  repeat_region
<222>  (1)..(130)
<223>  5" ITR from AAV2

<220>
<221>  repeat_region
<222>  (4232)..(4362)
<223>  5" ITR from AAV2

<400>  16
ctgcgcgctc gctcgctcac tgaggccgcc cgggcaaagc ccgggcgtcg ggcgaccttt       60

ggtcgcccgg cctcagtgag cgagcgagcg cgcagagagg gagtggccaa ctccatcact      120

aggggttcct tgtagttaat gattaacccg ccatgctact tatctaccag ggtaatgggg      180

atcctctaga actatagcta gtcgacattg attattgact agttattaat agtaatcaat      240

tacggggtca ttagttcata gcccatatat ggagttccgc gttacataac ttacggtaaa      300

tggcccgcct ggctgaccgc ccaacgaccc ccgcccattg acgtcaataa tgacgtatgt      360

tcccatagta acgccaatag ggactttcca ttgacgtcaa tgggtggact atttacggta      420

aactgcccac ttggcagtac atcaagtgta tcatatgcca agtacgcccc ctattgacgt      480

caatgacggt aaatggcccg cctggcatta tgcccagtac atgaccttat gggactttcc      540

tacttggcag tacatctacg tattagtcat cgctattacc atggtcgagg tgagccccac      600

gttctgcttc actctcccca tctccccccc ctccccaccc ccaattttgt atttatttat      660

tttttaatta ttttgtgcag cgatgggggc gggggggggg ggggggcgcg cgccaggcgg      720

ggcggggcgg ggcgaggggc ggggcggggc gaggcggaga ggtgcggcgg cagccaatca      780

gagcggcgcg ctccgaaagt ttccttttat ggcgaggcgg cggcggcggc ggccctataa      840

aaagcgaagc gcgcggcggg cggggagtcg ctgcgacgct gccttcgccc cgtgccccgc      900

tccgccgccg cctcgcgccg cccgccccgg ctctgactga ccgcgttact cccacaggtg      960

agcgggcggg acggcccttc tcctccgggc tgtaattagc gcttggttta atgacggctt     1020

gtttcttttc tgtggctgcg tgaaagcctt gaggggctcc gggagggccc tttgtgcggg     1080

gggagcggct cggggggtgc gtgcgtgtgt gtgtgcgtgg ggagcgccgc gtgcggctcc     1140

gcgctgcccg gcggctgtga gcgctgcggg cgcggcgcgg ggctttgtgc gctccgcagt     1200

gtgcgcgagg ggagcgcggc cgggggcggt gccccgcggt gcgggggggg ctgcgagggg     1260

aacaaaggct gcgtgcgggg tgtgtgcgtg ggggggtgag cagggggtgt gggcgcgtcg     1320

gtcgggctgc aaccccccct gcacccccct ccccgagttg ctgagcacgg cccggcttcg     1380

ggtgcggggc tccgtacggg gcgtggcgcg gggctcgccg tgccgggcgg ggggtggcgg     1440

caggtggggg tgccgggcgg ggcggggccg cctcgggccg gggagggctc gggggagggg     1500

cgcggcggcc cccggagcgc cggcggctgt cgaggcgcgg cgagccgcag ccattgcctt     1560

ttatggtaat cgtgcgagag ggcgcaggga cttcctttgt cccaaatctg tgcggagccg     1620

aaatctggga ggcgccgccg caccccctct agcgggcgcg gggcgaagcg gtgcggcgcc     1680

ggcaggaagg aaatgggcgg ggagggcctt cgtgcgtcgc cgcgccgccg tccccttctc     1740

cctctccagc ctcggggctg tccgcggggg gacggctgcc ttcggggggg acggggcagg     1800

gcggggttcg gcttctggcg tgtgaccggc ggctctagag cctctgctaa ccatgttcat     1860

gccttcttct ttttcctaca gctcctgggc aacgtgctgg ttattgtgct gtctcatcat     1920

tttggcaaag aattcacgcg tgccaccatg cccggctttc tcgtgcggat tctcctgctg     1980

ctgctggtgc ttctgctgct gggccctacc agaggcctga gaaacgccac ccagcggatg     2040

ttcgagatcg actacagccg ggacagcttc ctgaaggacg gccagccctt ccggtacatc     2100

agcggcagca tccactacag cagagtgccc cggttctact ggaaggaccg gctgctgaag     2160

atgaagatgg ccggcctgaa cgccatccag acctacgtgc cctggaactt ccacgagcct     2220

tggcctggcc agtaccagtt cagcgaggac cacgacgtgg aatactttct gcggctggcc     2280

cacgagctgg gcctgctcgt gattctgagg cctggccctt acatctgcgc cgagtgggag     2340

atgggaggac tgcctgcttg gctgctggaa aaagagagca tcctgctgcg gagcagcgac     2400

cccgattatc tggccgccgt ggataagtgg ctgggcgtgc tgctgcccaa gatgaagccc     2460

ctgctgtacc agaacggcgg acccgtgatc accgtgcagg tggaaaacga gtacggcagc     2520

tacttcgcct gcgacttcga ctacctgcgg ttcctgcaga agcggttcag acaccacctg     2580

ggcgacgacg tggtgctgtt cacaacagac ggcgcccaca agacctttct gaagtgtggc     2640

gctctgcagg gcctgtacac caccgtggat tttggcaccg gcagcaatat caccgacgcc     2700

tttctgagcc agcggaagtg cgagccaaag ggccccctga tcaacagcga gttctacacc     2760

ggctggctgg accactgggg ccagcctcac agcaccatca agacagaggc cgtggccagc     2820

agcctgtacg acatcctggc tagaggcgcc agcgtgaacc tgtacatgtt tatcggcggc     2880

accaacttcg cctactggaa cggcgccaac agcccttatg ccgcccagcc caccagctac     2940

gactacgatg cccctctgtc tgaggccggc gacctgaccg agaagtactt tgccctgcgg     3000

aacatcatcc agaaattcga gaaggtgccc gagggcccca tcccccctag cacacctaag     3060

ttcgcctacg gcaaagtgac cctggaaaag ctgaaaaccg tgggagccgc cctggacatc     3120

ctgtgtccta gcggccctat caagagcctg taccccctga ccttcatcca agtgaagcag     3180

cactacggct tcgtgctgta ccggaccacc ctgccccagg actgtagcaa tcctgcccca     3240

ctgagcagcc ccctgaacgg cgtgcacgat agagcctacg tggccgtgga tggcatccca     3300

cagggggtgc tggaacggaa caatgtgatc accctgaaca tcaccggcaa ggctggcgcc     3360

accctggacc tgctggtgga aaacatgggc agagtgaact acggcgccta catcaacgac     3420

ttcaagggcc tggtgtccaa cctgaccctg agcagcaaca tcctgaccga ctggaccatc     3480

ttcccactgg acaccgagga tgccgtgcgg agccatctgg gaggatgggg acacagagat     3540

agcggccacc acgatgaagc ctgggcccac aacagcagca actacaccct gcctgccttc     3600

tacatgggca acttcagcat ccccagcggc atccccgacc tgccacagga cacctttatc     3660

cagttccccg gctggacaaa gggacaagtg tggatcaatg gcttcaacct gggcagatac     3720

tggcccgcca gaggccctca gctgaccctg tttgtgcccc agcacattct gatgaccagc     3780

gcccccaaca ccatcaccgt gctggaactg gaatgggccc cctgcagcag cgacgaccct     3840

gaactgtgtg ccgtgacctt cgtggacagg cccgtgatcg gcagcagcgt gacctacgac     3900

caccccagca agcccgtgga aaagcggctg atgcctcccc caccccagaa gaacaaggac     3960

tcctggctgg atcacgtgtg atgactcgag gacggggtga actacgcctg aggatccgat     4020

ctttttccct ctgccaaaaa ttatggggac atcatgaagc cccttgagca tctgacttct     4080

ggctaataaa ggaaatttat tttcattgca atagtgtgtt ggaatttttt gtgtctctca     4140

ctcggaagca attcgttgat ctgaatttcg accacccata atacccatta ccctggtaga     4200

taagtagcat ggcgggttaa tcattaacta caaggaaccc ctagtgatgg agttggccac     4260

tccctctctg cgcgctcgct cgctcactga ggccgggcga ccaaaggtcg cccgacgccc     4320

gggctttgcc cgggcggcct cagtgagcga gcgagcgcgc ag                        4362


<210>  17
<211>  973
<212>  DNA
<213>  Artificial sequence

<220>
<223>  chicken beta-actin intron

<400>  17
gtgagcgggc gggacggccc ttctcctccg ggctgtaatt agcgcttggt ttaatgacgg       60

cttgtttctt ttctgtggct gcgtgaaagc cttgaggggc tccgggaggg ccctttgtgc      120

ggggggagcg gctcgggggg tgcgtgcgtg tgtgtgtgcg tggggagcgc cgcgtgcggc      180

tccgcgctgc ccggcggctg tgagcgctgc gggcgcggcg cggggctttg tgcgctccgc      240

agtgtgcgcg aggggagcgc ggccgggggc ggtgccccgc ggtgcggggg gggctgcgag      300

gggaacaaag gctgcgtgcg gggtgtgtgc gtgggggggt gagcaggggg tgtgggcgcg      360

tcggtcgggc tgcaaccccc cctgcacccc cctccccgag ttgctgagca cggcccggct      420

tcgggtgcgg ggctccgtac ggggcgtggc gcggggctcg ccgtgccggg cggggggtgg      480

cggcaggtgg gggtgccggg cggggcgggg ccgcctcggg ccggggaggg ctcgggggag      540

gggcgcggcg gcccccggag cgccggcggc tgtcgaggcg cggcgagccg cagccattgc      600

cttttatggt aatcgtgcga gagggcgcag ggacttcctt tgtcccaaat ctgtgcggag      660

ccgaaatctg ggaggcgccg ccgcaccccc tctagcgggc gcggggcgaa gcggtgcggc      720

gccggcagga aggaaatggg cggggagggc cttcgtgcgt cgccgcgccg ccgtcccctt      780

ctccctctcc agcctcgggg ctgtccgcgg ggggacggct gccttcgggg gggacggggc      840

agggcggggt tcggcttctg gcgtgtgacc ggcggctcta gagcctctgc taaccatgtt      900

catgccttct tctttttcct acagctcctg ggcaacgtgc tggttattgt gctgtctcat      960

cattttggca aag                                                         973


<210>  18
<211>  282
<212>  DNA
<213>  Artificial sequence

<220>
<223>  CB promoter

<400>  18
tggtcgaggt gagccccacg ttctgcttca ctctccccat ctcccccccc tccccacccc       60

caattttgta tttatttatt ttttaattat tttgtgcagc gatgggggcg gggggggggg      120

gggggcgcgc gccaggcggg gcggggcggg gcgaggggcg gggcggggcg aggcggagag      180

gtgcggcggc agccaatcag agcggcgcgc tccgaaagtt tccttttatg gcgaggcggc      240

ggcggcggcg gccctataaa aagcgaagcg cgcggcgggc gg                         282


<210>  19
<211>  382
<212>  DNA
<213>  Artificial sequence

<220>
<223>  CMV Immediate early Promoter

<400>  19
ctagtcgaca ttgattattg actagttatt aatagtaatc aattacgggg tcattagttc       60

atagcccata tatggagttc cgcgttacat aacttacggt aaatggcccg cctggctgac      120

cgcccaacga cccccgccca ttgacgtcaa taatgacgta tgttcccata gtaacgccaa      180

tagggacttt ccattgacgt caatgggtgg actatttacg gtaaactgcc cacttggcag      240

tacatcaagt gtatcatatg ccaagtacgc cccctattga cgtcaatgac ggtaaatggc      300

ccgcctggca ttatgcccag tacatgacct tatgggactt tcctacttgg cagtacatct      360

acgtattagt catcgctatt ac                                               382


<210>  20
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Encoded AAV9 vp1 amino acid sequence

<400>  20

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Gln Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln His Gln Asp Asn Ala Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile Gly 
145                 150                 155                 160 


Lys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly Gly Ser Ser Asn Asp Asn 
            260                 265                 270         


Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Glu 
                405                 410                 415     


Phe Glu Asn Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Val Ala Gly Pro Ser Asn Met Ala Val Gln Gly Arg Asn Tyr Ile Pro 
465                 470                 475                 480 


Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser Thr Thr Val Thr Gln Asn 
                485                 490                 495     


Asn Asn Ser Glu Phe Ala Trp Pro Gly Ala Ser Ser Trp Ala Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Gly Thr Gly Arg Asp Asn Val Asp Ala Asp Lys Val Met Ile 
545                 550                 555                 560 


Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Ser 
                565                 570                 575     


Tyr Gly Gln Val Ala Thr Asn His Gln Ser Ala Gln Ala Gln Ala Gln 
            580                 585                 590         


Thr Gly Trp Val Gln Asn Gln Gly Ile Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Met 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Ala Phe Asn Lys Asp Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Asn Asn Val Glu Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
                725                 730                 735     


<210>  21
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Encoded AAVhu31 vp1 amino acid sequence

<400>  21

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Thr Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Gln Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro Pro 
            20                  25                  30          


Lys Pro Ala Glu Arg His Lys Asp Asp Ser Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile Gly 
145                 150                 155                 160 


Lys Ser Gly Ser Gln Pro Ala Lys Lys Lys Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly Gly Ser Ser Asn Asp Asn 
            260                 265                 270         


Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Gly Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Glu 
                405                 410                 415     


Phe Glu Asn Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Val Ala Gly Pro Ser Asn Met Ala Val Gln Gly Arg Asn Tyr Ile Pro 
465                 470                 475                 480 


Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser Thr Thr Val Thr Gln Asn 
                485                 490                 495     


Asn Asn Ser Glu Phe Ala Trp Pro Gly Ala Ser Ser Trp Ala Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Gly Thr Gly Arg Asp Asn Val Asp Ala Asp Lys Val Met Ile 
545                 550                 555                 560 


Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Ser 
                565                 570                 575     


Tyr Gly Gln Val Ala Thr Asn His Gln Ser Ala Gln Ala Gln Ala Gln 
            580                 585                 590         


Thr Gly Trp Val Gln Asn Gln Gly Ile Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Met 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Ala Phe Asn Lys Asp Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Asn Asn Val Glu Phe Ala Val Ser Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
                725                 730                 735     


<210>  22
<211>  736
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Encoded AAVhu32 vp1 amino acid sequence

<400>  22

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Thr Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Gln Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro Pro 
            20                  25                  30          


Lys Pro Ala Glu Arg His Lys Asp Asp Ser Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile Gly 
145                 150                 155                 160 


Lys Ser Gly Ser Gln Pro Ala Lys Lys Lys Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly Gly Ser Ser Asn Asp Asn 
            260                 265                 270         


Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Glu 
                405                 410                 415     


Phe Glu Asn Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Val Ala Gly Pro Ser Asn Met Ala Val Gln Gly Arg Asn Tyr Ile Pro 
465                 470                 475                 480 


Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser Thr Thr Val Thr Gln Asn 
                485                 490                 495     


Asn Asn Ser Glu Phe Ala Trp Pro Gly Ala Ser Ser Trp Ala Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Gly Thr Gly Arg Asp Asn Val Asp Ala Asp Lys Val Met Ile 
545                 550                 555                 560 


Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Ser 
                565                 570                 575     


Tyr Gly Gln Val Ala Thr Asn His Gln Ser Ala Gln Ala Gln Ala Gln 
            580                 585                 590         


Thr Gly Trp Val Gln Asn Gln Gly Ile Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Met 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Ala Phe Asn Lys Asp Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Asn Asn Val Glu Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
                725                 730                 735     


<210>  23
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAV9 vp1 coding sequence

<400>  23
atggctgccg atggttatct tccagattgg ctcgaggaca accttagtga aggaattcgc       60

gagtggtggg ctttgaaacc tggagcccct caacccaagg caaatcaaca acatcaagac      120

aacgctcgag gtcttgtgct tccgggttac aaataccttg gacccggcaa cggactcgac      180

aagggggagc cggtcaacgc agcagacgcg gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aggccggaga caacccgtac ctcaagtaca accacgccga cgccgagttc      300

caggagcggc tcaaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaaaaaga ggcttcttga acctcttggt ctggttgagg aagcggctaa gacggctcct      420

ggaaagaaga ggcctgtaga gcagtctcct caggaaccgg actcctccgc gggtattggc      480

aaatcgggtg cacagcccgc taaaaagaga ctcaatttcg gtcagactgg cgacacagag      540

tcagtcccag accctcaacc aatcggagaa cctcccgcag ccccctcagg tgtgggatct      600

cttacaatgg cttcaggtgg tggcgcacca gtggcagaca ataacgaagg tgccgatgga      660

gtgggtagtt cctcgggaaa ttggcattgc gattcccaat ggctggggga cagagtcatc      720

accaccagca cccgaacctg ggccctgccc acctacaaca atcacctcta caagcaaatc      780

tccaacagca catctggagg atcttcaaat gacaacgcct acttcggcta cagcaccccc      840

tgggggtatt ttgacttcaa cagattccac tgccacttct caccacgtga ctggcagcga      900

ctcatcaaca acaactgggg attccggcct aagcgactca acttcaagct cttcaacatt      960

caggtcaaag aggttacgga caacaatgga gtcaagacca tcgccaataa ccttaccagc     1020

acggtccagg tcttcacgga ctcagactat cagctcccgt acgtgctcgg gtcggctcac     1080

gagggctgcc tcccgccgtt cccagcggac gttttcatga ttcctcagta cgggtatctg     1140

acgcttaatg atggaagcca ggccgtgggt cgttcgtcct tttactgcct ggaatatttc     1200

ccgtcgcaaa tgctaagaac gggtaacaac ttccagttca gctacgagtt tgagaacgta     1260

cctttccata gcagctacgc tcacagccaa agcctggacc gactaatgaa tccactcatc     1320

gaccaatact tgtactatct ctcaaagact attaacggtt ctggacagaa tcaacaaacg     1380

ctaaaattca gtgtggccgg acccagcaac atggctgtcc agggaagaaa ctacatacct     1440

ggacccagct accgacaaca acgtgtctca accactgtga ctcaaaacaa caacagcgaa     1500

tttgcttggc ctggagcttc ttcttgggct ctcaatggac gtaatagctt gatgaatcct     1560

ggacctgcta tggccagcca caaagaagga gaggaccgtt tctttccttt gtctggatct     1620

ttaatttttg gcaaacaagg aactggaaga gacaacgtgg atgcggacaa agtcatgata     1680

accaacgaag aagaaattaa aactactaac ccggtagcaa cggagtccta tggacaagtg     1740

gccacaaacc accagagtgc ccaagcacag gcgcagaccg gctgggttca aaaccaagga     1800

atacttccgg gtatggtttg gcaggacaga gatgtgtacc tgcaaggacc catttgggcc     1860

aaaattcctc acacggacgg caactttcac ccttctccgc tgatgggagg gtttggaatg     1920

aagcacccgc ctcctcagat cctcatcaaa aacacacctg tacctgcgga tcctccaacg     1980

gccttcaaca aggacaagct gaactctttc atcacccagt attctactgg ccaagtcagc     2040

gtggagatcg agtgggagct gcagaaggaa aacagcaagc gctggaaccc ggagatccag     2100

tacacttcca actattacaa gtctaataat gttgaatttg ctgttaatac tgaaggtgta     2160

tatagtgaac cccgccccat tggcaccaga tacctgactc gtaatctgta a              2211


<210>  24
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAVhu31 vp1 coding sequence

<400>  24
atggctgccg atggttatct tccagattgg ctcgaggaca accttagtga aggaattcgc       60

gagtggtggg ctttgaaacc tggagcccct caacccaagg caaatcaaca acatcaagac      120

aacgctcgag gtcttgtgct tccgggttac aaataccttg gacccggcaa cggactcgac      180

aagggggagc cggtcaacgc agcagacgcg gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aggccggaga caacccgtac ctcaagtaca accacgccga cgccgagttc      300

caggagcggc tcaaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaaaaaga ggcttcttga acctcttggt ctggttgagg aagcggctaa gacggctcct      420

ggaaagaaga ggcctgtaga gcagtctcct caggaaccgg actcctccgc gggtattggc      480

aaatcgggtg cacagcccgc taaaaagaga ctcaatttcg gtcagactgg cgacacagag      540

tcagtcccag accctcaacc aatcggagaa cctcccgcag ccccctcagg tgtgggatct      600

cttacaatgg cttcaggtgg tggcgcacca gtggcagaca ataacgaagg tgccgatgga      660

gtgggtagtt cctcgggaaa ttggcattgc gattcccaat ggctggggga cagagtcatc      720

accaccagca cccgaacctg ggccctgccc acctacaaca atcacctcta caagcaaatc      780

tccaacagca catctggagg atcttcaaat gacaacgcct acttcggcta cagcaccccc      840

tgggggtatt ttgacttcaa cagattccac tgccacttct caccacgtga ctggcagcga      900

ctcatcaaca acaactgggg attccggcct aagcgactca acttcaagct cttcaacatt      960

caggtcaaag aggttacgga caacaatgga gtcaagacca tcgccaataa ccttaccagc     1020

acggtccagg tcttcacgga ctcagactat cagctcccgt acgtgctcgg gtcggctcac     1080

gagggctgcc tcccgccgtt cccagcggac gttttcatga ttcctcagta cgggtatctg     1140

acgcttaatg atggaagcca ggccgtgggt cgttcgtcct tttactgcct ggaatatttc     1200

ccgtcgcaaa tgctaagaac gggtaacaac ttccagttca gctacgagtt tgagaacgta     1260

cctttccata gcagctacgc tcacagccaa agcctggacc gactaatgaa tccactcatc     1320

gaccaatact tgtactatct ctcaaagact attaacggtt ctggacagaa tcaacaaacg     1380

ctaaaattca gtgtggccgg acccagcaac atggctgtcc agggaagaaa ctacatacct     1440

ggacccagct accgacaaca acgtgtctca accactgtga ctcaaaacaa caacagcgaa     1500

tttgcttggc ctggagcttc ttcttgggct ctcaatggac gtaatagctt gatgaatcct     1560

ggacctgcta tggccagcca caaagaagga gaggaccgtt tctttccttt gtctggatct     1620

ttaatttttg gcaaacaagg aactggaaga gacaacgtgg atgcggacaa agtcatgata     1680

accaacgaag aagaaattaa aactactaac ccggtagcaa cggagtccta tggacaagtg     1740

gccacaaacc accagagtgc ccaagcacag gcgcagaccg gctgggttca aaaccaagga     1800

atacttccgg gtatggtttg gcaggacaga gatgtgtacc tgcaaggacc catttgggcc     1860

aaaattcctc acacggacgg caactttcac ccttctccgc tgatgggagg gtttggaatg     1920

aagcacccgc ctcctcagat cctcatcaaa aacacacctg tacctgcgga tcctccaacg     1980

gccttcaaca aggacaagct gaactctttc atcacccagt attctactgg ccaagtcagc     2040

gtggagatcg agtgggagct gcagaaggaa aacagcaagc gctggaaccc ggagatccag     2100

tacacttcca actattacaa gtctaataat gttgaatttg ctgttaatac tgaaggtgta     2160

tatagtgaac cccgccccat tggcaccaga tacctgactc gtaatctgta a              2211


<210>  25
<211>  2211
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  AAVhu32 vp1 coding sequence

<400>  25
atggctgccg atggttatct tccagattgg ctcgaggaca ctctctctga aggaataaga       60

cagtggtgga agctcaaacc tggcccacca ccaccaaagc ccgcagagcg gcataaggac      120

gacagcaggg gtcttgtgct tcctgggtac aagtacctcg gacccggcaa cggactcgac      180

aagggggagc cggtcaacgc agcagacgcg gcggccctcg agcacgacaa ggcctacgac      240

cagcagctca aggccggaga caacccgtac ctcaagtaca accacgccga cgccgagttc      300

caggagcggc tcaaagaaga tacgtctttt gggggcaacc tcgggcgagc agtcttccag      360

gccaaaaaga ggcttcttga acctcttggt ctggttgagg aagcggctaa gacggctcct      420

ggaaagaaga ggcctgtaga gcagtctcct caggaaccgg actcctccgc gggtattggc      480

aaatcgggtt cacagcccgc taaaaagaaa ctcaatttcg gtcagactgg cgacacagag      540

tcagtccccg accctcaacc aatcggagaa cctcccgcag ccccctcagg tgtgggatct      600

cttacaatgg cttcaggtgg tggcgcacca gtggcagaca ataacgaagg tgccgatgga      660

gtgggtagtt cctcgggaaa ttggcattgc gattcccaat ggctggggga cagagtcatc      720

accaccagca cccgaacctg ggccctgccc acctacaaca atcacctcta caagcaaatc      780

tccaacagca catctggagg atcttcaaat gacaacgcct acttcggcta cagcaccccc      840

tgggggtatt ttgacttcaa cagattccac tgccacttct caccacgtga ctggcagcga      900

ctcatcaaca acaactgggg attccggcct aagcgactca acttcaagct cttcaacatt      960

caggtcaaag aggttacgga caacaatgga gtcaagacca tcgccaataa ccttaccagc     1020

acggtccagg tcttcacgga ctcagactat cagctcccgt acgtgctcgg gtcggctcac     1080

gagggctgcc tcccgccgtt cccagcggac gttttcatga ttcctcagta cgggtatctg     1140

acgcttaatg atgggagcca ggccgtgggt cgttcgtcct tttactgcct ggaatatttc     1200

ccgtcgcaaa tgctaagaac gggtaacaac ttccagttca gctacgagtt tgagaacgta     1260

cctttccata gcagctacgc tcacagccaa agcctggacc gactaatgaa tccactcatc     1320

gaccaatact tgtactatct ctcaaagact attaacggtt ctggacagaa tcaacaaacg     1380

ctaaaattca gcgtggccgg acccagcaac atggctgtcc agggaagaaa ctacatacct     1440

ggacccagct accgacaaca acgtgtctca accactgtga ctcaaaacaa caacagcgaa     1500

tttgcttggc ctggagcttc ttcttgggct ctcaatggac gtaatagctt gatgaatcct     1560

ggacctgcta tggccagcca caaagaagga gaggaccgtt tctttccttt gtctggatct     1620

ttaatttttg gcaaacaagg aactggaaga gacaacgtgg atgcggacaa agtcatgata     1680

accaacgaag aagaaattaa aactactaac ccggtagcaa cggagtccta tggacaagtg     1740

gccacaaacc accagagtgc ccaagcacag gcgcagaccg gctgggttca aaaccaagga     1800

atacttccgg gtatggtttg gcaggacaga gatgtgtacc tgcaaggacc catttgggcc     1860

aaaattcctc acacggacgg caactttcac ccttctccgc taatgggagg gtttggaatg     1920

aagcacccgc ctcctcagat cctcatcaaa aacacacctg tacctgcgga tcctccaacg     1980

gctttcaata aggacaagct gaactctttc atcacccagt attctactgg ccaagtcagc     2040

gtggagattg agtgggagct gcagaaggaa aacagcaagc gctggaaccc ggagatccag     2100

tacacttcca actattacaa gtctaataat gttgaatttg ctgttaatac tgaaggtgta     2160

tatagtgaac cccgccccat tggcaccaga tacctgactc gtaatctgta a              2211


<210>  26
<211>  546
<212>  PRT
<213>  Homo sapiens

<400>  26

Met Pro Gly Phe Leu Val Arg Ile Leu Pro Leu Leu Leu Val Leu Leu 
1               5                   10                  15      


Leu Leu Gly Pro Thr Arg Gly Leu Arg Asn Ala Thr Gln Arg Met Phe 
            20                  25                  30          


Glu Ile Asp Tyr Ser Arg Asp Ser Phe Leu Lys Asp Gly Gln Pro Phe 
        35                  40                  45              


Arg Tyr Ile Ser Gly Ser Ile His Tyr Ser Arg Val Pro Arg Phe Tyr 
    50                  55                  60                  


Trp Lys Asp Arg Leu Leu Lys Met Lys Met Ala Gly Leu Asn Ala Ile 
65                  70                  75                  80  


Gln Thr Leu Pro Gly Ser Cys Gly Gln Val Val Gly Ser Pro Ser Ala 
                85                  90                  95      


Gln Asp Glu Ala Ser Pro Leu Ser Glu Trp Arg Ala Ser Tyr Asn Ser 
            100                 105                 110         


Ala Gly Ser Asn Ile Thr Asp Ala Phe Leu Ser Gln Arg Lys Cys Glu 
        115                 120                 125             


Pro Lys Gly Pro Leu Ile Asn Ser Glu Phe Tyr Thr Gly Trp Leu Asp 
    130                 135                 140                 


His Trp Gly Gln Pro His Ser Thr Ile Lys Thr Glu Ala Val Ala Ser 
145                 150                 155                 160 


Ser Leu Tyr Asp Ile Leu Ala Arg Gly Ala Ser Val Asn Leu Tyr Met 
                165                 170                 175     


Phe Ile Gly Gly Thr Asn Phe Ala Tyr Trp Asn Gly Ala Asn Ser Pro 
            180                 185                 190         


Tyr Ala Ala Gln Pro Thr Ser Tyr Asp Tyr Asp Ala Pro Leu Ser Glu 
        195                 200                 205             


Ala Gly Asp Leu Thr Glu Lys Tyr Phe Ala Leu Arg Asn Ile Ile Gln 
    210                 215                 220                 


Lys Phe Glu Lys Val Pro Glu Gly Pro Ile Pro Pro Ser Thr Pro Lys 
225                 230                 235                 240 


Phe Ala Tyr Gly Lys Val Thr Leu Glu Lys Leu Lys Thr Val Gly Ala 
                245                 250                 255     


Ala Leu Asp Ile Leu Cys Pro Ser Gly Pro Ile Lys Ser Leu Tyr Pro 
            260                 265                 270         


Leu Thr Phe Ile Gln Val Lys Gln His Tyr Gly Phe Val Leu Tyr Arg 
        275                 280                 285             


Thr Thr Leu Pro Gln Asp Cys Ser Asn Pro Ala Pro Leu Ser Ser Pro 
    290                 295                 300                 


Leu Asn Gly Val His Asp Arg Ala Tyr Val Ala Val Asp Gly Ile Pro 
305                 310                 315                 320 


Gln Gly Val Leu Glu Arg Asn Asn Val Ile Thr Leu Asn Ile Thr Gly 
                325                 330                 335     


Lys Ala Gly Ala Thr Leu Asp Leu Leu Val Glu Asn Met Gly Arg Val 
            340                 345                 350         


Asn Tyr Gly Ala Tyr Ile Asn Asp Phe Lys Gly Leu Val Ser Asn Leu 
        355                 360                 365             


Thr Leu Ser Ser Asn Ile Leu Thr Asp Trp Thr Ile Phe Pro Leu Asp 
    370                 375                 380                 


Thr Glu Asp Ala Val Arg Ser His Leu Gly Gly Trp Gly His Arg Asp 
385                 390                 395                 400 


Ser Gly His His Asp Glu Ala Trp Ala His Asn Ser Ser Asn Tyr Thr 
                405                 410                 415     


Leu Pro Ala Phe Tyr Met Gly Asn Phe Ser Ile Pro Ser Gly Ile Pro 
            420                 425                 430         


Asp Leu Pro Gln Asp Thr Phe Ile Gln Phe Pro Gly Trp Thr Lys Gly 
        435                 440                 445             


Gln Val Trp Ile Asn Gly Phe Asn Leu Gly Arg Tyr Trp Pro Ala Arg 
    450                 455                 460                 


Gly Pro Gln Leu Thr Leu Phe Val Pro Gln His Ile Leu Met Thr Ser 
465                 470                 475                 480 


Ala Pro Asn Thr Ile Thr Val Leu Glu Leu Glu Trp Ala Pro Cys Ser 
                485                 490                 495     


Ser Asp Asp Pro Glu Leu Cys Ala Val Thr Phe Val Asp Arg Pro Val 
            500                 505                 510         


Ile Gly Ser Ser Val Thr Tyr Asp His Pro Ser Lys Pro Val Glu Lys 
        515                 520                 525             


Arg Leu Met Pro Pro Pro Pro Gln Lys Asn Lys Asp Ser Trp Leu Asp 
    530                 535                 540                 


His Val 
545     


