                         SEQUENCE LISTING

<110>  Allen Institute
 
<120>  RESCUING VOLTAGE-GATED SODIUM CHANNEL FUNCTION IN INHIBITORY 
       NEURONS

<130>  A166-0004PCT

<150>  62/655,043
<151>  2018-04-09

<150>  62/742,835
<151>  2018-10-08

<150>  62/810,281
<151>  2019-02-25

<160>  57    

<170>  PatentIn version 3.5

<210>  1
<211>  528
<212>  DNA
<213>  artificial sequence

<220>
<223>  hDLX I56i enhancer

<400>  1
tatgcactca cagtggtttg gcatgcatct ggtgaatttt ttttaacgaa aaattagtgt       60

tggtttcgat gtatggtagc attctcccta acgtaatttg aataattcag caaagcccca      120

ctaccagctg tacttctgca gcctcttcca ttcttttcag cattataatt ttggttaatt      180

ttcaatttta ggtcctacgt ctctgcaatt tgtgtatgaa taacagaata atttccctct      240

tttgtttcgc ctttcctgtt cctgaatcta aataaagatg gctttttagt attaaaagtg      300

gaagaaaatt acaggtaatt atctttgacg gtaaaaacgc tgtaatcagc gggctacatg      360

aaaaattact ctaattatgg ctgcatttaa gagaatggaa aaaaaccttc ttgtggataa      420

aaaccttaaa ttgtccccaa tgtctgcttc aaattggatg gcactgcagc tggaggcttt      480

gttcagaatt gatcctgggg agctacgaac ccaaagtttc acagtagg                   528


<210>  2
<211>  131
<212>  DNA
<213>  artificial sequence

<220>
<223>  Core of the hDLX I56i enhancer

<400>  2
ctaaataaag atggcttttt agtattaaaa gtggaagaaa attacaggta attatctttg       60

acggtaaaaa cgctgtaatc agcgggctac atgaaaaatt actctaatta tggctgcatt      120

taagagaatg g                                                           131


<210>  3
<211>  393
<212>  DNA
<213>  artificial sequence

<220>
<223>  3xhI56iCore, Triply Concatemerized Core of the hDLX I56i enhancer

<400>  3
ctaaataaag atggcttttt agtattaaaa gtggaagaaa attacaggta attatctttg       60

acggtaaaaa cgctgtaatc agcgggctac atgaaaaatt actctaatta tggctgcatt      120

taagagaatg gctaaataaa gatggctttt tagtattaaa agtggaagaa aattacaggt      180

aattatcttt gacggtaaaa acgctgtaat cagcgggcta catgaaaaat tactctaatt      240

atggctgcat ttaagagaat ggctaaataa agatggcttt ttagtattaa aagtggaaga      300

aaattacagg taattatctt tgacggtaaa aacgctgtaa tcagcgggct acatgaaaaa      360

ttactctaat tatggctgca tttaagagaa tgg                                   393


<210>  4
<211>  527
<212>  DNA
<213>  Mus musculus

<400>  4
tatacactca cagtggtttg gcatatattt ggtgaaattt tttaaggaaa aattagtgtt       60

ggtttcgata tatggtagct ttttctctaa cataatttga ataattcagc aaagccctac      120

taccagctgt acttctgcag cctcttccat tctttccagc attataattt tggttaattt      180

tcaattttag gtcctacgtc tctgcaattt gtgtatgaat aacagaataa tttccctctt      240

ttgtttcgcc tttcctgttc ctgaatctaa ataaagatgg ctttttagta ttaaaagtgg      300

aagaaaatta caggtaatta tctttgacgg taaaaacgct gtaatcagcg ggctacatga      360

aaaattactc taattatggc tgcatttaag agaatggaaa aaaaccttct tgtggataaa      420

aaccttaaat tgtccccaat gtctgcttca aattggatgg cactgcagct ggaggctttg      480

ttcagaattg atcctgggga gctacgaacc caaagtttca cagtagg                    527


<210>  5
<211>  281
<212>  DNA
<213>  zebrafish

<400>  5
acattgtaat tttagataat atcccaagcg ttcactctcc tcggcaattt gtacatgaat       60

aaccgaataa tttcatcttt tgtttcgtct ttgccacttc aaatccaaat aaagatgcct      120

tttagtatta aaagtggtag aaaattacag gtaattatct ttgacggtaa aaacgctgta      180

atcagcgggc tacatcaaaa attaccctaa ttatgtctgc atttatgaga atggaaaaaa      240

accctctctt ggataaaacc cataaattgt cccaaatatc t                          281


<210>  6
<211>  130
<212>  DNA
<213>  zebrafish

<400>  6
ccaaataaag atgcctttta gtattaaaag tggtagaaaa ttacaggtaa ttatctttga       60

cggtaaaaac gctgtaatca gcgggctaca tcaaaaatta ccctaattat gtctgcattt      120

atgagaatgg                                                             130


<210>  7
<211>  390
<212>  DNA
<213>  zebrafish

<400>  7
ccaaataaag atgcctttta gtattaaaag tggtagaaaa ttacaggtaa ttatctttga       60

cggtaaaaac gctgtaatca gcgggctaca tcaaaaatta ccctaattat gtctgcattt      120

atgagaatgg ccaaataaag atgcctttta gtattaaaag tggtagaaaa ttacaggtaa      180

ttatctttga cggtaaaaac gctgtaatca gcgggctaca tcaaaaatta ccctaattat      240

gtctgcattt atgagaatgg ccaaataaag atgcctttta gtattaaaag tggtagaaaa      300

ttacaggtaa ttatctttga cggtaaaaac gctgtaatca gcgggctaca tcaaaaatta      360

ccctaattat gtctgcattt atgagaatgg                                       390


<210>  8
<211>  376
<212>  DNA
<213>  artificial sequence

<220>
<223>  hDLX I12b enhancer

<400>  8
cagctgcaaa cccaagaggg tcagcatcat ttcactgtat tctcttcttg attacaagcc       60

gggcccatca aacacaacat aattacagta atttcaggtt tatttattct aatgcagttt      120

ccccatctct ctggtaatta tgagcaattt tttcgcccag ggaatctttt tgcattaaca      180

aaagagataa cgcactgaaa gccaaatttg ctgtgcattg agaaaaggaa aaaaaaaaat      240

caaataggtg cgagctgcca tctctgcaat tctctggtac cggagccggc aaattgcttg      300

caggtgtatg gagcaagctt gtcaatggcc aggcctccaa attagcaaat gcacagcagc      360

aaagtaatga agacag                                                      376


<210>  9
<211>  984
<212>  DNA
<213>  artificial sequence

<220>
<223>  NavSheP-D60N, codon optimized, with N-terminal 3x HA tag

<400>  9
atggtttacc cgtatgatgt cccggattac gctggcagct acccatacga tgtacccgac       60

tatgccggca gttatcccta cgacgtccct gactacgcat ctacgtccct tttgaatgcg      120

cctaccggcc ttcaagctag agtcattaat ctcgtcgaac aaaactggtt tggacacttt      180

atactgactc tcatactcat taatgctgtg cagcttggaa tggaaactag cgccagcctc      240

atggcacaat atggcgcgct gcttatgtcc ttgaataagg tccttctctc tgtgttcgtg      300

gtcgaactgc tgctccggat ttatgcgtat cggggcaagt tttttaagga cccgtggaat      360

gtgtttgact tcactgttat tgttattgct ctgattcctg catctggccc attggctgtc      420

ctccgctccc tccgagttct ccgcgtcttg agggttctga cgattgtccc cagcatgaaa      480

agagtagtgt cagcactgct tgggagcttg cccgggttgg cctccattgc aaccgtgctt      540

ctgttgatct attacgtttt cgctgtgatc gccactaaaa ttttcgggga tgcttttccg      600

gaatggttcg ggacgatagc ggactccttc tatacccttt ttcaaattat gaccttggaa      660

agttggtcta tggggatctc taggccagtg atggaggtgt acccttacgc ttgggtattc      720

tttgtgccct ttattcttgt tgctactttt accatgctta accttttcat cgccatcata      780

gtgaatacta tgcagacatt ctctgacgag gaacatgctc tggagcgaga gcaagataaa      840

cagatcttgg aacaggagca gagacaaatg cacgaggaac tgaaggccat tcgactcgag      900

cttcagcaac tccaaaccct tttgcgaaat gcggctgggg actcctccaa tgtctccaca      960

aagggcaata tcggctcaga ctaa                                             984


<210>  10
<211>  888
<212>  DNA
<213>  artificial sequence

<220>
<223>  NavSheP endogenous sequence

<400>  10
atgagtacat ctttacttaa cgcgccaacg ggtttgcagg cacgagtgat taacttggtt       60

gagcaaaact ggtttggtca ttttattttg acattgattt taatcaacgc ggtgcagtta      120

ggtatggaga cctcagccag cctgatggcg caatacggtg ctttgttgat gagtcttgat      180

aaggtgctgc tgagtgtatt tgtggtggag ttattgctgc ggatttatgc ctacaggggg      240

aaatttttta aagacccttg gaacgtgttc gattttaccg tgatagtgat agcactgatc      300

cctgcatctg ggccattggc tgtcctgcgt tcgctcaggg tattgcgggt gctgagagtg      360

ttaacaattg tgccatcaat gaaacgggtg gtgtctgcgc tgttgggatc acttcctgga      420

ttggcatcga tcgccacagt attactgctg atttattatg tgtttgcggt gatcgctacc      480

aaaatttttg gcgatgcatt ccctgaatgg tttggcacta ttgctgactc attttatacc      540

ctatttcaaa taatgacgct tgaaagctgg tctatgggaa tttcgcggcc agtgatggaa      600

gtctaccctt atgcttgggt atttttcgta ccatttattc tggtagcgac tttcacaatg      660

ctaaatttgt ttattgcgat tatcgtcaat accatgcaaa ccttcagcga cgaagagcat      720

gcattagagc gtgagcaaga caaacaaatc ttagagcagg aacaaagaca aatgcacgag      780

gagttgaaag ccatcagact cgagctacaa caattacaaa ccttgctgcg caatgctgct      840

ggtgattctt ctaatgtgtc gacaaaggga aacattggtt ctgactaa                   888


<210>  11
<211>  855
<212>  DNA
<213>  artificial sequence

<220>
<223>  NavBp, endogenous sequence

<400>  11
atggaaaaca atccagccga acaacaagtt ccaccattag tagccttagc tcagcgtatc       60

gtctttcata aggcctttac cccaactatt attaccttga ttatcattaa tgccattatt      120

gtaggccttg aaacatatcc tactgtttat caaggttata atgattggtt ctacgcagca      180

gatttagcct tactttggat ttttacaatt gagattacac tgcgttttat cgcagcgaga      240

ccgactaaat ctttttttaa aagcagctgg aactggtttg atttattaat cgttcttgcc      300

ggtcatgtct ttgccggtgc tcattttgta acggttcttc gtatcctgcg cgttcttcgc      360

gtattacgtg ccatttctgt cattccttct ctgcgtcgtt tagtcgatgc tttgctgatg      420

accatcccgg ctttaggaaa cattatgatc ctgatgggaa ttattttcta tattttcgct      480

gtgattggaa cgatgttatt tgcttctgta gcacctgagt actttggtaa cttacagctt      540

tctttattaa cattattcca agttgttaca cttgaatctt gggcaagcgg tgtcatgagg      600

ccgatttttg cagaggtttg gtggtcttgg atttattttg tcatctttat tttagtaggg      660

acatttattg tctttaactt atttatcggt gttatcgtta ataacgttga aaaagcaaac      720

gaagaagaac tcaaatcaga attagatgat aaagaggcag atacaaaaga agagcttgct      780

tctctgcgta atgaagtagc agagatgaaa gacctcatta aacaaatgca taaacagcaa      840

acaaaaaaag ggtaa                                                       855


<210>  12
<211>  951
<212>  DNA
<213>  artificial sequence

<220>
<223>  NavBp, codon optimized, with N-terminal 3x HA tag

<400>  12
atggtttacc cgtatgatgt cccggattac gctggcagct acccatacga tgtacccgac       60

tatgccggca gttatcccta cgacgtccct gactacgcag aaaacaaccc agccgaacag      120

caagtcccac ccctcgtggc gctcgcccaa cgcatagtat ttcacaaggc gtttacgccg      180

acgataatca ccctcatcat tattaatgcg atcattgtgg gactcgagac atacccaacg      240

gtttaccagg gttacaatga ttggttctat gctgccgacc ttgctttgtt gtggatattc      300

actattgaaa tcacgctccg attcatcgcc gcccgaccga cgaagagttt cttcaagtct      360

agctggaact ggtttgatct gcttatcgta ttggcgggcc acgtcttcgc tggcgcccat      420

tttgttacgg tgcttaggat cctccgcgtc ctgagggtcc tcagagctat ctcagtcata      480

cccagtctcc ggcggctggt tgacgcactt ttgatgacaa tcccagcact cggtaacatc      540

atgatactga tggggattat tttttacata ttcgcggtta tcgggacgat gctctttgca      600

tcagtagcgc cagaatactt tggcaatttg cagctgtctc tgcttacact gttccaagtg      660

gttacgctgg aaagttgggc tagtggggtt atgcgaccta tttttgccga agtctggtgg      720

tcttggatct attttgtaat ctttattctc gtgggaactt tcatagtatt taaccttttc      780

attggcgtca tcgtgaacaa tgtggaaaaa gctaacgaag aggaactgaa aagcgaactg      840

gatgataaag aggctgatac aaaagaagaa ctggcatcat tgcgaaacga ggtggcagaa      900

atgaaggatc tcataaaaca gatgcataaa cagcaaacaa aaaagggtta a               951


<210>  13
<211>  825
<212>  DNA
<213>  artificial sequence

<220>
<223>  NavMs, endogenous sequence

<400>  13
atgtcacgca aaataagaga tttaatcgaa tccaaacgct ttcaaaacgt catcaccgcc       60

attattgtgc tcaatggcgc tgtgctgggt ctgctgaccg atacaaccct atcggcctcc      120

agccaaaacc tgctggagcg tgtggatcaa ctttgtctga ctatctttat tgttgaaata      180

tccctgaaaa tatacgccta tggcgtgcga ggctttttcc gcagcggctg gaatctgttt      240

gattttgtga ttgtggccat cgcgcttatg cccgcccagg gtagcctatc ggtgctgcga      300

accttccgta tattccgcgt catgcggctc gtatcggtca taccaaccat gcgaagagtg      360

gtgcaaggca tgctcttggc actgcccggc gtgggatcgg tagcggcact gttgacggtg      420

gtcttctata ttgcggctgt catggccacc aatctctacg gggcaacctt ccctgaatgg      480

tttggtgatc ttagcaagag cctgtacaca ctatttcagg tgatgacctt agagtcatgg      540

tctatgggca ttgtgcgtcc agtgatgaac gttcatccca acgcatgggt ttttttcatc      600

cccttcatca tgctcaccac ctttaccgtg ctcaacctgt ttattggcat tattgtagat      660

gccatggcca tcaccaagga acaggaggaa gaggccaaaa ccggccacca ccaagagcct      720

attagccaaa cattgctcca tctgggagat cgcctagata ggatcgaaaa gcagcttgcg      780

caaaacaacg agctcttaca acgacaacag ccgcaaaaaa aatag                      825


<210>  14
<211>  954
<212>  DNA
<213>  artificial sequence

<220>
<223>  NavMs, codon optimized, with N-terminal 3x HA tag and linker

<400>  14
atggtttatc cgtatgatgt tcctgactat gcaggatcct atccttatga tgttcccgat       60

tacgctggtt cttaccctta cgatgttccc gattatgcca gttctggatt ggtgccacga      120

ggcagccaca tgagccggaa gatcagagat cttatcgaat ctaagagatt tcagaatgtt      180

attaccgcga taatcgtact caacggggcg gtgctcggtc tcctcaccga taccacattg      240

agcgcttcta gccagaacct gctcgaaagg gttgaccaac tgtgcctgac aatttttatc      300

gtggaaatta gcttgaaaat ttacgcctac ggcgttcgcg gttttttccg gagcggttgg      360

aatctttttg acttcgttat cgttgccatc gcgctcatgc ccgcacaggg ttctttgtct      420

gtgttgagga cattccgaat atttcgcgtg atgcgcttgg tatccgtgat ccctacgatg      480

cgccgcgtcg tacaaggaat gttgctggct ctccccggcg tcgggagcgt tgctgccctc      540

cttaccgtgg tattttacat agcggcggtt atggctacta atctttacgg agctaccttc      600

ccggagtggt tcggggattt gtccaagagc ctctatacat tgtttcaagt tatgaccctg      660

gagtcctggt ctatgggcat tgtccggccc gtaatgaacg tacacccaaa tgcgtgggtg      720

tttttcattc cattcatcat gctgactacc tttaccgtgc tgaacttgtt cattgggatt      780

atcgtggatg cgatggccat cactaaggag caagaagaag aggctaaaac tggccaccac      840

caagagccaa tttctcaaac cctcttgcat ctcggggacc gactggaccg cattgagaag      900

caactcgcgc agaacaatga gctgttgcag cgacagcaac ctcaaaaaaa ataa            954


<210>  15
<211>  885
<212>  DNA
<213>  artificial sequence

<220>
<223>  NavMs, codon optimized, with N-terminal His tag and linker

<400>  15
atgggcagca gccatcatca tcatcatcac agcagcggcc tggtgccgcg cggcagccat       60

atgtcacgca aaatccgcga tttaatcgaa tccaaacgct ttcaaaacgt catcaccgcc      120

attattgtgc tcaatggcgc tgtgctgggt ctgctgaccg atacaaccct gtcggcctcc      180

agccaaaacc tgctggagcg tgtggatcaa ctttgtctga ctatctttat tgttgaaatc      240

tccctgaaaa tctacgccta tggcgtgcgc ggctttttcc gcagcggctg gaatctgttt      300

gattttgtga ttgtggccat cgcgcttatg ccggcccagg gtagcctgtc ggtgctgcgt      360

accttccgta tcttccgcgt catgcgcctc gtatcggtca tcccaaccat gcgccgtgtg      420

gtgcaaggca tgctcttggc actgccgggc gtgggctcgg tagcggcact gttgacggtg      480

gtcttctata ttgcggctgt catggccacc aatctctacg gggcaacctt ccctgaatgg      540

tttggtgatc ttagcaagag cctgtacaca ctgtttcagg tgatgacctt agagtcatgg      600

tctatgggca ttgtgcgtcc agtgatgaac gttcatccga acgcatgggt ttttttcatc      660

ccgttcatca tgctcaccac ctttaccgtg ctcaacctgt ttattggcat tattgtagat      720

gcaatggcaa tcaccaagga acaggaggaa gaggccaaaa ccggtcacca tcaagaacct      780

atttctcaaa ctcttcttca tcttggtgat cgtcttgatc gtattgaaaa acaacttgct      840

caaaataatg aacttcttca acgtcaacaa cctcaaaaaa aataa                      885


<210>  16
<211>  6027
<212>  DNA
<213>  Homo sapiens

<400>  16
atggagcaaa cagtgcttgt accaccagga cctgacagct tcaacttctt caccagagaa       60

tctcttgcgg ctattgaaag acgcattgca gaagaaaagg caaagaatcc caaaccagac      120

aaaaaagatg acgacgaaaa tggcccaaag ccaaatagtg acttggaagc tggaaagaac      180

cttccattta tttatggaga cattcctcca gagatggtgt cagagcccct ggaggacctg      240

gacccctact atatcaataa gaaaactttt atagtattga ataaagggaa ggccatcttc      300

cggttcagtg ccacctctgc cctgtacatt ttaactccct tcaatcctct taggaaaata      360

gctattaaga ttttggtaca ttcattattc agcatgctaa ttatgtgcac tattttgaca      420

aactgtgtgt ttatgacaat gagtaaccct cctgattgga caaagaatgt agaatacacc      480

ttcacaggaa tatatacttt tgaatcactt ataaaaatta ttgcaagggg attctgttta      540

gaagatttta ctttccttcg ggatccatgg aactggctcg atttcactgt cattacattt      600

gcgtacgtca cagagtttgt ggacctgggc aatgtctcgg cattgagaac attcagagtt      660

ctccgagcat tgaagacgat ttcagtcatt ccaggcctga aaaccattgt gggagccctg      720

atccagtctg tgaagaagct ctcagatgta atgatcctga ctgtgttctg tctgagcgta      780

tttgctctaa ttgggctgca gctgttcatg ggcaacctga ggaataaatg tatacaatgg      840

cctcccacca atgcttcctt ggaggaacat agtatagaaa agaatataac tgtgaattat      900

aatggtacac ttataaatga aactgtcttt gagtttgact ggaagtcata tattcaagat      960

tcaagatatc attatttcct ggagggtttt ttagatgcac tactatgtgg aaatagctct     1020

gatgcaggcc aatgtccaga gggatatatg tgtgtgaaag ctggtagaaa tcccaattat     1080

ggctacacaa gctttgatac cttcagttgg gcttttttgt ccttgtttcg actaatgact     1140

caggacttct gggaaaatct ttatcaactg acattacgtg ctgctgggaa aacgtacatg     1200

atattttttg tattggtcat tttcttgggc tcattctacc taataaattt gatcctggct     1260

gtggtggcca tggcctacga ggaacagaat caggccacct tggaagaagc agaacagaaa     1320

gaggccgaat ttcagcagat gattgaacag cttaaaaagc aacaggaggc agctcagcag     1380

gcagcaacgg caactgcctc agaacattcc agagagccca gtgcagcagg caggctctca     1440

gacagctcat ctgaagcctc taagttgagt tccaagagtg ctaaggaaag aagaaatcgg     1500

aggaagaaaa gaaaacagaa agagcagtct ggtggggaag agaaagatga ggatgaattc     1560

caaaaatctg aatctgagga cagcatcagg aggaaaggtt ttcgcttctc cattgaaggg     1620

aaccgattga catatgaaaa gaggtactcc tccccacacc agtctttgtt gagcatccgt     1680

ggctccctat tttcaccaag gcgaaatagc agaacaagcc ttttcagctt tagagggcga     1740

gcaaaggatg tgggatctga gaacgacttc gcagatgatg agcacagcac ctttgaggat     1800

aacgagagcc gtagagattc cttgtttgtg ccccgacgac acggagagag acgcaacagc     1860

aacctgagtc agaccagtag gtcatcccgg atgctggcag tgtttccagc gaatgggaag     1920

atgcacagca ctgtggattg caatggtgtg gtttccttgg ttggtggacc ttcagttcct     1980

acatcgcctg ttggacagct tctgccagag gtgataatag ataagccagc tactgatgac     2040

aatggaacaa ccactgaaac tgaaatgaga aagagaaggt caagttcttt ccacgtttcc     2100

atggactttc tagaagatcc ttcccaaagg caacgagcaa tgagtatagc cagcattcta     2160

acaaatacag tagaagaact tgaagaatcc aggcagaaat gcccaccctg ttggtataaa     2220

ttttccaaca tattcttaat ctgggactgt tctccatatt ggttaaaagt gaaacatgtt     2280

gtcaacctgg ttgtgatgga cccatttgtt gacctggcca tcaccatctg tattgtctta     2340

aatactcttt tcatggccat ggagcactat ccaatgacgg accatttcaa taatgtgctt     2400

acagtaggaa acttggtttt cactgggatc tttacagcag aaatgtttct gaaaattatt     2460

gccatggatc cttactatta tttccaagaa ggctggaata tctttgacgg ttttattgtg     2520

acgcttagcc tggtagaact tggactcgcc aatgtggaag gattatctgt tctccgttca     2580

tttcgattgc tgcgagtttt caagttggca aaatcttggc caacgttaaa tatgctaata     2640

aagatcatcg gcaattccgt gggggctctg ggaaatttaa ccctcgtctt ggccatcatc     2700

gtcttcattt ttgccgtggt cggcatgcag ctctttggta aaagctacaa agattgtgtc     2760

tgcaagatcg ccagtgattg tcaactccca cgctggcaca tgaatgactt cttccactcc     2820

ttcctgattg tgttccgcgt gctgtgtggg gagtggatag agaccatgtg ggactgtatg     2880

gaggttgctg gtcaagccat gtgccttact gtcttcatga tggtcatggt gattggaaac     2940

ctagtggtcc tgaatctctt tctggccttg cttctgagct catttagtgc agacaacctt     3000

gcagccactg atgatgataa tgaaatgaat aatctccaaa ttgctgtgga taggatgcac     3060

aaaggagtag cttatgtgaa aagaaaaata tatgaattta ttcaacagtc cttcattagg     3120

aaacaaaaga ttttagatga aattaaacca cttgatgatc taaacaacaa gaaagacagt     3180

tgtatgtcca atcatacagc agaaattggg aaagatcttg actatcttaa agatgtaaat     3240

ggaactacaa gtggtatagg aactggcagc agtgttgaat acattattga tgaaagtgat     3300

tacatgtcat tcataaacaa ccccagtctt actgtgactg taccaattgc tgtaggagaa     3360

tctgactttg aaaatttaaa cacggaagac tttagtagtg aatcggatct ggaagaaagc     3420

aaagagaaac tgaatgaaag cagtagctca tcagaaggta gcactgtgga catcggcgca     3480

cctgtagaag aacagcccgt agtggaacct gaagaaactc ttgaaccaga agcttgtttc     3540

actgaaggct gtgtacaaag attcaagtgt tgtcaaatca atgtggaaga aggcagagga     3600

aaacaatggt ggaacctgag aaggacgtgt ttccgaatag ttgaacataa ctggtttgag     3660

accttcattg ttttcatgat tctccttagt agtggtgctc tggcatttga agatatatat     3720

attgatcagc gaaagacgat taagacgatg ttggaatatg ctgacaaggt tttcacttac     3780

attttcattc tggaaatgct tctaaaatgg gtggcatatg gctatcaaac atatttcacc     3840

aatgcctggt gttggctgga cttcttaatt gttgatgttt cattggtcag tttaacagca     3900

aatgccttgg gttactcaga acttggagcc atcaaatctc tcaggacact aagagctctg     3960

agacctctaa gagccttatc tcgatttgaa gggatgaggg tggttgtgaa tgccctttta     4020

ggagcaattc catccatcat gaatgtgctt ctggtttgtc ttatattctg gctaattttc     4080

agcatcatgg gcgtaaattt gtttgctggc aaattctacc actgtattaa caccacaact     4140

ggtgacaggt ttgacatcga agacgtgaat aatcatactg attgcctaaa actaatagaa     4200

agaaatgaga ctgctcgatg gaaaaatgtg aaagtaaact ttgataatgt aggatttggg     4260

tatctctctt tgcttcaagt tgccacattc aaaggatgga tggatataat gtatgcagca     4320

gttgattcca gaaatgtgga actccagcct aagtatgaag aaagtctgta catgtatctt     4380

tactttgtta ttttcatcat ctttgggtcc ttcttcacct tgaacctgtt tattggtgtc     4440

atcatagata atttcaacca gcagaaaaag aagtttggag gtcaagacat ctttatgaca     4500

gaagaacaga agaaatacta taatgcaatg aaaaaattag gatcgaaaaa accgcaaaag     4560

cctatacctc gaccaggaaa caaatttcaa ggaatggtct ttgacttcgt aaccagacaa     4620

gtttttgaca taagcatcat gattctcatc tgtcttaaca tggtcacaat gatggtggaa     4680

acagatgacc agagtgaata tgtgactacc attttgtcac gcatcaatct ggtgttcatt     4740

gtgctattta ctggagagtg tgtactgaaa ctcatctctc tacgccatta ttattttacc     4800

attggatgga atatttttga ttttgtggtt gtcattctct ccattgtagg tatgtttctt     4860

gccgagctga tagaaaagta tttcgtgtcc cctaccctgt tccgagtgat ccgtcttgct     4920

aggattggcc gaatcctacg tctgatcaaa ggagcaaagg ggatccgcac gctgctcttt     4980

gctttgatga tgtcccttcc tgcgttgttt aacatcggcc tcctactctt cctagtcatg     5040

ttcatctacg ccatctttgg gatgtccaac tttgcctatg ttaagaggga agttgggatc     5100

gatgacatgt tcaactttga gacctttggc aacagcatga tctgcctatt ccaaattaca     5160

acctctgctg gctgggatgg attgctagca cccattctca acagtaagcc acccgactgt     5220

gaccctaata aagttaaccc tggaagctca gttaagggag actgtgggaa cccatctgtt     5280

ggaattttct tttttgtcag ttacatcatc atatccttcc tggttgtggt gaacatgtac     5340

atcgcggtca tcctggagaa cttcagtgtt gctactgaag aaagtgcaga gcctctgagt     5400

gaggatgact ttgagatgtt ctatgaggtt tgggagaagt ttgatcccga tgcaactcag     5460

ttcatggaat ttgaaaaatt atctcagttt gcagctgcgc ttgaaccgcc tctcaatctg     5520

ccacaaccaa acaaactcca gctcattgcc atggatttgc ccatggtgag tggtgaccgg     5580

atccactgtc ttgatatctt atttgctttt acaaagcggg ttctaggaga gagtggagag     5640

atggatgctc tacgaataca gatggaagag cgattcatgg cttccaatcc ttccaaggtc     5700

tcctatcagc caatcactac tactttaaaa cgaaaacaag aggaagtatc tgctgtcatt     5760

attcagcgtg cttacagacg ccacctttta aagcgaactg taaaacaagc ttcctttacg     5820

tacaataaaa acaaaatcaa aggtggggct aatcttctta taaaagaaga catgataatt     5880

gacagaataa atgaaaactc tattacagaa aaaactgatc tgaccatgtc cactgcagct     5940

tgtccacctt cctatgaccg ggtgacaaag ccaattgtgg aaaaacatga gcaagaaggc     6000

aaagatgaaa aagccaaagg gaaataa                                         6027


<210>  17
<211>  720
<212>  DNA
<213>  artificial sequence

<220>
<223>  SYFP2

<400>  17
atggtcagca agggcgagga gctgttcacc ggggtggtgc ccatcctggt cgagctggac       60

ggcgacgtca atggccacaa gttcagcgtg tccggcgagg gcgagggcga tgccacctac      120

ggcaagctga ccctgaagct gatctgcacc accggcaagc tgcccgtgcc ctggcccacc      180

ctcgtgacca ccctgggcta cggcgtgcag tgcttcgccc gctaccccga ccacatgaag      240

cagcacgact tcttcaagtc cgccatgccc gaaggctacg tccaggagcg caccatcttc      300

ttcaaagacg acggcaacta caagacccgc gccgaggtga agttcgaggg cgacaccctg      360

gtgaaccgca tcgagctgaa gggcatcgac ttcaaggagg acggcaacat cctggggcac      420

aagctggagt acaactacaa cagccacaac gtctatatca ccgccgacaa gcagaagaac      480

ggcatcaagg ccaacttcaa gatccgccac aacatcgagg acggcggcgt gcagctcgcc      540

gaccactacc agcagaacac ccccatcggc gacggccccg tgctgctgcc cgacaaccac      600

tacctgagct accagtccaa gctgagcaaa gaccccaacg agaagcgcga tcacatggtc      660

ctgctggagt tcgtgaccgc cgccgggatc actctcggca tggacgagct gtacaaataa      720


<210>  18
<211>  78
<212>  DNA
<213>  artificial sequence

<220>
<223>  P2A Encoding Sequence

<400>  18
ggcagcggcg ccaccaactt cagcctgctg aagcaggccg gcgacgtgga ggagaacccc       60

ggccccggag ctagcgga                                                     78


<210>  19
<211>  246
<212>  DNA
<213>  artificial sequence

<220>
<223>  WPRE3

<400>  19
ataatcaacc tctggattac aaaatttgtg aaagattgac tggtattctt aactatgttg       60

ctccttttac gctatgtgga tacgctgctt taatgccttt gtatcatgct attgcttccc      120

gtatggcttt cattttctcc tccttgtata aatcctggtt agttcttgcc acggcggaac      180

tcatcgccgc ctgccttgcc cgctgctgga caggggctcg gctgttgggc actgacaatt      240

ccgtgg                                                                 246


<210>  20
<211>  204
<212>  DNA
<213>  artificial sequence

<220>
<223>  BGHpA

<400>  20
cgactgtgcc ttctagttgc cagccatctg ttgtttgccc ctcccccgtg ccttccttga       60

ccctggaagg tgccactccc actgtccttt cctaataaaa tgaggaaatt gcatcgcatt      120

gtctgagtag gtgtcattct attctggggg gtggggtggg gcaggacagc aagggggagg      180

attgggaaga caatagcagg catg                                             204


<210>  21
<211>  33
<212>  PRT
<213>  artificial sequence

<220>
<223>  N-terminal 3XHA tag (Protein)

<400>  21

Met Val Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Gly Ser Tyr Pro Tyr 
1               5                   10                  15      


Asp Val Pro Asp Tyr Ala Gly Ser Tyr Pro Tyr Asp Val Pro Asp Tyr 
            20                  25                  30          


Ala 
    


<210>  22
<211>  99
<212>  DNA
<213>  artificial sequence

<220>
<223>  N-terminal 3XHA tag

<400>  22
atggtttacc cgtatgatgt cccggattac gctggcagct acccatacga tgtacccgac       60

tatgccggca gttatcccta cgacgtccct gactacgca                              99


<210>  23
<211>  3133
<212>  DNA
<213>  artificial sequence

<220>
<223>  hSCN1A N-term of two-part expression system

<400>  23
atggagcaaa cagtgcttgt accaccagga cctgacagct tcaacttctt caccagagaa       60

tctcttgcgg ctattgaaag acgcattgca gaagaaaagg caaagaatcc caaaccagac      120

aaaaaagatg acgacgaaaa tggcccaaag ccaaatagtg acttggaagc tggaaagaac      180

cttccattta tttatggaga cattcctcca gagatggtgt cagagcccct ggaggacctg      240

gacccctact atatcaataa gaaaactttt atagtattga ataaagggaa ggccatcttc      300

cggttcagtg ccacctctgc cctgtacatt ttaactccct tcaatcctct taggaaaata      360

gctattaaga ttttggtaca ttcattattc agcatgctaa ttatgtgcac tattttgaca      420

aactgtgtgt ttatgacaat gagtaaccct cctgattgga caaagaatgt agaatacacc      480

ttcacaggaa tatatacttt tgaatcactt ataaaaatta ttgcaagggg attctgttta      540

gaagatttta ctttccttcg ggatccatgg aactggctcg atttcactgt cattacattt      600

gcgtacgtca cagagtttgt ggacctgggc aatgtctcgg cattgagaac attcagagtt      660

ctccgagcat tgaagacgat ttcagtcatt ccaggcctga aaaccattgt gggagccctg      720

atccagtctg tgaagaagct ctcagatgta atgatcctga ctgtgttctg tctgagcgta      780

tttgctctaa ttgggctgca gctgttcatg ggcaacctga ggaataaatg tatacaatgg      840

cctcccacca atgcttcctt ggaggaacat agtatagaaa agaatataac tgtgaattat      900

aatggtacac ttataaatga aactgtcttt gagtttgact ggaagtcata tattcaagat      960

tcaagatatc attatttcct ggagggtttt ttagatgcac tactatgtgg aaatagctct     1020

gatgcaggcc aatgtccaga gggatatatg tgtgtgaaag ctggtagaaa tcccaattat     1080

ggctacacaa gctttgatac cttcagttgg gcttttttgt ccttgtttcg actaatgact     1140

caggacttct gggaaaatct ttatcaactg acattacgtg ctgctgggaa aacgtacatg     1200

atattttttg tattggtcat tttcttgggc tcattctacc taataaattt gatcctggct     1260

gtggtggcca tggcctacga ggaacagaat caggccacct tggaagaagc agaacagaaa     1320

gaggccgaat ttcagcagat gattgaacag cttaaaaagc aacaggaggc agctcagcag     1380

gcagcaacgg caactgcctc agaacattcc agagagccca gtgcagcagg caggctctca     1440

gacagctcat ctgaagcctc taagttgagt tccaagagtg ctaaggaaag aagaaatcgg     1500

aggaagaaaa gaaaacagaa agagcagtct ggtggggaag agaaagatga ggatgaattc     1560

caaaaatctg aatctgagga cagcatcagg aggaaaggtt ttcgcttctc cattgaaggg     1620

aaccgattga catatgaaaa gaggtactcc tccccacacc agtctttgtt gagcatccgt     1680

ggctccctat tttcaccaag gcgaaatagc agaacaagcc ttttcagctt tagagggcga     1740

gcaaaggatg tgggatctga gaacgacttc gcagatgatg agcacagcac ctttgaggat     1800

aacgagagcc gtagagattc cttgtttgtg ccccgacgac acggagagag acgcaacagc     1860

aacctgagtc agaccagtag gtcatcccgg atgctggcag tgtttccagc gaatgggaag     1920

atgcacagca ctgtggattg caatggtgtg gtttccttgg ttggtggacc ttcagttcct     1980

acatcgcctg ttggacagct tctgccagag gtgataatag ataagccagc tactgatgac     2040

aatggaacaa ccactgaaac tgaaatgaga aagagaaggt caagttcttt ccacgtttcc     2100

atggactttc tagaagatcc ttcccaaagg caacgagcaa tgagtatagc cagcattcta     2160

acaaatacag tagaagaact tgaagaatcc aggcagaaat gcccaccctg ttggtataaa     2220

ttttccaaca tattcttaat ctgggactgt tctccatatt ggttaaaagt gaaacatgtt     2280

gtcaacctgg ttgtgatgga cccatttgtt gacctggcca tcaccatctg tattgtctta     2340

aatactcttt tcatggccat ggagcactat ccaatgacgg accatttcaa taatgtgctt     2400

acagtaggaa acttggtttt cactgggatc tttacagcag aaatgtttct gaaaattatt     2460

gccatggatc cttactatta tttccaagaa ggctggaata tctttgacgg ttttattgtg     2520

acgcttagcc tggtagaact tggactcgcc aatgtggaag gattatctgt tctccgttca     2580

tttcgattgc tgcgagtttt caagttggca aaatcttggc caacgttaaa tatgctaata     2640

aagatcatcg gcaattccgt gggggctctg ggaaatttaa ccctcgtctt ggccatcatc     2700

gtcttcattt ttgccgtggt cggcatgcag ctctttggta aaagctacaa agattgtgtc     2760

tgcaagatcg ccagtgattg tcaactccca cgctggcaca tgaatgactt cttccactcc     2820

ttcctgattg tgttccgcgt gctgtgtggg gagtggatag agaccatgtg ggactgtatg     2880

gaggttgctg gtcaagccat gtgccttact gtcttcatga tggtcatggt gattggaaac     2940

ctagtggtcc tgaatctctt tctggccttg cttctgagct catttagtgc agacaacctt     3000

gcagccactg atgatgataa tgaaatgaat aatctccaaa ttgctgtgga taggatgcac     3060

aaaggagtag cttatgtgaa aagaaaaata tatgaattta ttcaacagtc cttcattagg     3120

aaacaaaaga ttt                                                        3133


<210>  24
<211>  3639
<212>  DNA
<213>  artificial sequence

<220>
<223>  hSCN1A C-term of two-part expression system with C-terminal 3XHA 
       sequence

<400>  24
ctggtagaac ttggactcgc caatgtggaa ggattatctg ttctccgttc atttcgattg       60

ctgcgagttt tcaagttggc aaaatcttgg ccaacgttaa atatgctaat aaagatcatc      120

ggcaattccg tgggggctct gggaaattta accctcgtct tggccatcat cgtcttcatt      180

tttgccgtgg tcggcatgca gctctttggt aaaagctaca aagattgtgt ctgcaagatc      240

gccagtgatt gtcaactccc acgctggcac atgaatgact tcttccactc cttcctgatt      300

gtgttccgcg tgctgtgtgg ggagtggata gagaccatgt gggactgtat ggaggttgct      360

ggtcaagcca tgtgccttac tgtcttcatg atggtcatgg tgattggaaa cctagtggtc      420

ctgaatctct ttctggcctt gcttctgagc tcatttagtg cagacaacct tgcagccact      480

gatgatgata atgaaatgaa taatctccaa attgctgtgg ataggatgca caaaggagta      540

gcttatgtga aaagaaaaat atatgaattt attcaacagt ccttcattag gaaacaaaag      600

attttagatg aaattaaacc acttgatgat ctaaacaaca agaaagacag ttgtatgtcc      660

aatcatacag cagaaattgg gaaagatctt gactatctta aagatgtaaa tggaactaca      720

agtggtatag gaactggcag cagtgttgaa tacattattg atgaaagtga ttacatgtca      780

ttcataaaca accccagtct tactgtgact gtaccaattg ctgtaggaga atctgacttt      840

gaaaatttaa acacggaaga ctttagtagt gaatcggatc tggaagaaag caaagagaaa      900

ctgaatgaaa gcagtagctc atcagaaggt agcactgtgg acatcggcgc acctgtagaa      960

gaacagcccg tagtggaacc tgaagaaact cttgaaccag aagcttgttt cactgaaggc     1020

tgtgtacaaa gattcaagtg ttgtcaaatc aatgtggaag aaggcagagg aaaacaatgg     1080

tggaacctga gaaggacgtg tttccgaata gttgaacata actggtttga gaccttcatt     1140

gttttcatga ttctccttag tagtggtgct ctggcatttg aagatatata tattgatcag     1200

cgaaagacga ttaagacgat gttggaatat gctgacaagg ttttcactta cattttcatt     1260

ctggaaatgc ttctaaaatg ggtggcatat ggctatcaaa catatttcac caatgcctgg     1320

tgttggctgg acttcttaat tgttgatgtt tcattggtca gtttaacagc aaatgccttg     1380

ggttactcag aacttggagc catcaaatct ctcaggacac taagagctct gagacctcta     1440

agagccttat ctcgatttga agggatgagg gtggttgtga atgccctttt aggagcaatt     1500

ccatccatca tgaatgtgct tctggtttgt cttatattct ggctaatttt cagcatcatg     1560

ggcgtaaatt tgtttgctgg caaattctac cactgtatta acaccacaac tggtgacagg     1620

tttgacatcg aagacgtgaa taatcatact gattgcctaa aactaataga aagaaatgag     1680

actgctcgat ggaaaaatgt gaaagtaaac tttgataatg taggatttgg gtatctctct     1740

ttgcttcaag ttgccacatt caaaggatgg atggatataa tgtatgcagc agttgattcc     1800

agaaatgtgg aactccagcc taagtatgaa gaaagtctgt acatgtatct ttactttgtt     1860

attttcatca tctttgggtc cttcttcacc ttgaacctgt ttattggtgt catcatagat     1920

aatttcaacc agcagaaaaa gaagtttgga ggtcaagaca tctttatgac agaagaacag     1980

aagaaatact ataatgcaat gaaaaaatta ggatcgaaaa aaccgcaaaa gcctatacct     2040

cgaccaggaa acaaatttca aggaatggtc tttgacttcg taaccagaca agtttttgac     2100

ataagcatca tgattctcat ctgtcttaac atggtcacaa tgatggtgga aacagatgac     2160

cagagtgaat atgtgactac cattttgtca cgcatcaatc tggtgttcat tgtgctattt     2220

actggagagt gtgtactgaa actcatctct ctacgccatt attattttac cattggatgg     2280

aatatttttg attttgtggt tgtcattctc tccattgtag gtatgtttct tgccgagctg     2340

atagaaaagt atttcgtgtc ccctaccctg ttccgagtga tccgtcttgc taggattggc     2400

cgaatcctac gtctgatcaa aggagcaaag gggatccgca cgctgctctt tgctttgatg     2460

atgtcccttc ctgcgttgtt taacatcggc ctcctactct tcctagtcat gttcatctac     2520

gccatctttg ggatgtccaa ctttgcctat gttaagaggg aagttgggat cgatgacatg     2580

ttcaactttg agacctttgg caacagcatg atctgcctat tccaaattac aacctctgct     2640

ggctgggatg gattgctagc acccattctc aacagtaagc cacccgactg tgaccctaat     2700

aaagttaacc ctggaagctc agttaaggga gactgtggga acccatctgt tggaattttc     2760

ttttttgtca gttacatcat catatccttc ctggttgtgg tgaacatgta catcgcggtc     2820

atcctggaga acttcagtgt tgctactgaa gaaagtgcag agcctctgag tgaggatgac     2880

tttgagatgt tctatgaggt ttgggagaag tttgatcccg atgcaactca gttcatggaa     2940

tttgaaaaat tatctcagtt tgcagctgcg cttgaaccgc ctctcaatct gccacaacca     3000

aacaaactcc agctcattgc catggatttg cccatggtga gtggtgaccg gatccactgt     3060

cttgatatct tatttgcttt tacaaagcgg gttctaggag agagtggaga gatggatgct     3120

ctacgaatac agatggaaga gcgattcatg gcttccaatc cttccaaggt ctcctatcag     3180

ccaatcacta ctactttaaa acgaaaacaa gaggaagtat ctgctgtcat tattcagcgt     3240

gcttacagac gccacctttt aaagcgaact gtaaaacaag cttcctttac gtacaataaa     3300

aacaaaatca aaggtggggc taatcttctt ataaaagaag acatgataat tgacagaata     3360

aatgaaaact ctattacaga aaaaactgat ctgaccatgt ccactgcagc ttgtccacct     3420

tcctatgacc gggtgacaaa gccaattgtg gaaaaacatg agcaagaagg caaagatgaa     3480

aaagccaaag ggaaaggagg tggtggttca ggtgggggcg gctcagagta cccctatgat     3540

gtccctgatt atgcggcgga atacccctat gacgtgccgg actacgcggc tgaatatccg     3600

tatgacgttc ccgattatgc ggctaagctc gaataatga                            3639


<210>  25
<211>  604
<212>  DNA
<213>  artificial sequence

<220>
<223>  604 bp homology region of hSCN1A N term and C term that can be 
       used in a two-part expression system

<400>  25
ctggtagaac ttggactcgc caatgtggaa ggattatctg ttctccgttc atttcgattg       60

ctgcgagttt tcaagttggc aaaatcttgg ccaacgttaa atatgctaat aaagatcatc      120

ggcaattccg tgggggctct gggaaattta accctcgtct tggccatcat cgtcttcatt      180

tttgccgtgg tcggcatgca gctctttggt aaaagctaca aagattgtgt ctgcaagatc      240

gccagtgatt gtcaactccc acgctggcac atgaatgact tcttccactc cttcctgatt      300

gtgttccgcg tgctgtgtgg ggagtggata gagaccatgt gggactgtat ggaggttgct      360

ggtcaagcca tgtgccttac tgtcttcatg atggtcatgg tgattggaaa cctagtggtc      420

ctgaatctct ttctggcctt gcttctgagc tcatttagtg cagacaacct tgcagccact      480

gatgatgata atgaaatgaa taatctccaa attgctgtgg ataggatgca caaaggagta      540

gcttatgtga aaagaaaaat atatgaattt attcaacagt ccttcattag gaaacaaaag      600

attt                                                                   604


<210>  26
<211>  26
<212>  PRT
<213>  artificial sequence

<220>
<223>  P2A Translation from CN1498

<400>  26

Gly Ser Gly Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val 
1               5                   10                  15      


Glu Glu Asn Pro Gly Pro Gly Ala Ser Gly 
            20                  25      


<210>  27
<211>  21
<212>  PRT
<213>  artificial sequence

<220>
<223>  T2A

<400>  27

Gly Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp Val Glu 
1               5                   10                  15      


Glu Asn Pro Gly Pro 
            20      


<210>  28
<211>  24
<212>  PRT
<213>  artificial sequence

<220>
<223>  E2A

<400>  28

Gly Ser Gly Gln Cys Thr Asn Tyr Ala Leu Leu Lys Leu Ala Gly Asp 
1               5                   10                  15      


Val Glu Ser Asn Pro Gly Pro Pro 
            20                  


<210>  29
<211>  25
<212>  PRT
<213>  artificial sequence

<220>
<223>  F2A

<400>  29

Gly Ser Gly Val Lys Gln Thr Leu Asn Phe Asp Leu Leu Lys Leu Ala 
1               5                   10                  15      


Gly Asp Val Glu Ser Asn Pro Gly Pro 
            20                  25  


<210>  30
<211>  53
<212>  DNA
<213>  artificial sequence

<220>
<223>  MinBglobin

<400>  30
gggctgggca taaaagtcag ggcagagcca tctattgctt acatttgctt ctg              53


<210>  31
<211>  68
<212>  DNA
<213>  artificial sequence

<220>
<223>  minCMV

<400>  31
gaggtaggcg tgtacggtgg gaggcctata taagcagagc tcgtttagtg aaccgtcaga       60

tcgcctgg                                                                68


<210>  32
<211>  11
<212>  PRT
<213>  artificial sequence

<220>
<223>  PHP.eB capsid: Corresponds to the AAV9 VP1 capsid sequence with 
       a modification that starts at amino acid residue 586 where S AQ A
       are changed to S DGTLAVPFK A.

<400>  32

Ser Asp Gly Thr Leu Ala Val Pro Phe Lys Ala 
1               5                   10      


<210>  33
<211>  2843
<212>  DNA
<213>  artificial sequence

<220>
<223>  CN1367 - The portion between L-ITR and R-ITR: positions 142-2984

<400>  33
gcggccgcac gcgtataggt accgagctct atgcactcac agtggtttgg catgcatctg       60

gtgaattttt tttaacgaaa aattagtgtt ggtttcgatg tatggtagca ttctccctaa      120

cgtaatttga ataattcagc aaagccccac taccagctgt acttctgcag cctcttccat      180

tcttttcagc attataattt tggttaattt tcaattttag gtcctacgtc tctgcaattt      240

gtgtatgaat aacagaataa tttccctctt ttgtttcgcc tttcctgttc ctgaatctaa      300

ataaagatgg ctttttagta ttaaaagtgg aagaaaatta caggtaatta tctttgacgg      360

taaaaacgct gtaatcagcg ggctacatga aaaattactc taattatggc tgcatttaag      420

agaatggaaa aaaaccttct tgtggataaa aaccttaaat tgtccccaat gtctgcttca      480

aattggatgg cactgcagct ggaggctttg ttcagaattg atcctgggga gctacgaacc      540

caaagtttca cagtagggag ctcgggctgg gcataaaagt cagggcagag ccatctattg      600

cttacatttg cttctgggat ccagatcttt cgaagctagc gctaccggtc gccaccatgg      660

gcagcagcca tcatcatcat catcacagca gcggcctggt gccgcgcggc agccatatgt      720

cacgcaaaat ccgcgattta atcgaatcca aacgctttca aaacgtcatc accgccatta      780

ttgtgctcaa tggcgctgtg ctgggtctgc tgaccgatac aaccctgtcg gcctccagcc      840

aaaacctgct ggagcgtgtg gatcaacttt gtctgactat ctttattgtt gaaatctccc      900

tgaaaatcta cgcctatggc gtgcgcggct ttttccgcag cggctggaat ctgtttgatt      960

ttgtgattgt ggccatcgcg cttatgccgg cccagggtag cctgtcggtg ctgcgtacct     1020

tccgtatctt ccgcgtcatg cgcctcgtat cggtcatccc aaccatgcgc cgtgtggtgc     1080

aaggcatgct cttggcactg ccgggcgtgg gctcggtagc ggcactgttg acggtggtct     1140

tctatattgc ggctgtcatg gccaccaatc tctacggggc aaccttccct gaatggtttg     1200

gtgatcttag caagagcctg tacacactgt ttcaggtgat gaccttagag tcatggtcta     1260

tgggcattgt gcgtccagtg atgaacgttc atccgaacgc atgggttttt ttcatcccgt     1320

tcatcatgct caccaccttt accgtgctca acctgtttat tggcattatt gtagatgcaa     1380

tggcaatcac caaggaacag gaggaagagg ccaaaaccgg tcaccatcaa gaacctattt     1440

ctcaaactct tcttcatctt ggtgatcgtc ttgatcgtat tgaaaaacaa cttgctcaaa     1500

ataatgaact tcttcaacgt caacaacctc aaaaaaaagg cagcggcgcc accaacttca     1560

gcctgctgaa gcaggccggc gacgtggagg agaaccccgg ccccatggtg agcaagggcg     1620

aggagctgtt caccggggtg gtgcccatcc tggtcgagct ggacggcgac gtaaacggcc     1680

acaagttcag cgtgtccggc gagggcgagg gcgatgccac ctacggcaag ctgaccctga     1740

agctgatctg caccaccggc aagctgcccg tgccctggcc caccctcgtg accaccctgg     1800

gctacggcgt gcagtgcttc gcccgctacc ccgaccacat gaagcagcac gacttcttca     1860

agtccgccat gcccgaaggc tacgtccagg agcgcaccat cttcttcaag gacgacggca     1920

actacaagac ccgcgccgag gtgaagttcg agggcgacac cctggtgaac cgcatcgagc     1980

tgaagggcat cgacttcaag gaggacggca acatcctggg gcacaagctg gagtacaact     2040

acaacagcca caacgtctat atcaccgccg acaagcagaa gaacggcatc aaggccaact     2100

tcaagatccg ccacaacatc gaggacggcg gcgtgcagct cgccgaccac taccagcaga     2160

acacccccat cggcgacggc cccgtgctgc tgcccgacaa ccactacctg agctaccagt     2220

ccaagctgag caaagacccc aacgagaagc gcgatcacat ggtcctgctg gagttcgtga     2280

ccgccgccgg gatcactctc ggcatggacg agctgtacaa gtaagtcgac ggcgcgccgc     2340

ggccgcgaat tcgatatcat aatcaacctc tggattacaa aatttgtgaa agattgactg     2400

gtattcttaa ctatgttgct ccttttacgc tatgtggata cgctgcttta atgcctttgt     2460

atcatgctat tgcttcccgt atggctttca ttttctcctc cttgtataaa tcctggttag     2520

ttcttgccac ggcggaactc atcgccgcct gccttgcccg ctgctggaca ggggctcggc     2580

tgttgggcac tgacaattcc gtggctcgag agatcttcga ctgtgccttc tagttgccag     2640

ccatctgttg tttgcccctc ccccgtgcct tccttgaccc tggaaggtgc cactcccact     2700

gtcctttcct aataaaatga ggaaattgca tcgcattgtc tgagtaggtg tcattctatt     2760

ctggggggtg gggtggggca ggacagcaag ggggaggatt gggaagacaa tagcaggcat     2820

gcacgtgcgg accgagcggc cgc                                             2843


<210>  34
<211>  2835
<212>  DNA
<213>  artificial sequence

<220>
<223>  CN1500 - The portion between L-ITR and R-ITR: positions 142-2976

<400>  34
gcggccgcac gcgtggtacc ctaaataaag atggcttttt agtattaaaa gtggaagaaa       60

attacaggta attatctttg acggtaaaaa cgctgtaatc agcgggctac atgaaaaatt      120

actctaatta tggctgcatt taagagaatg gctaaataaa gatggctttt tagtattaaa      180

agtggaagaa aattacaggt aattatcttt gacggtaaaa acgctgtaat cagcgggcta      240

catgaaaaat tactctaatt atggctgcat ttaagagaat ggctaaataa agatggcttt      300

ttagtattaa aagtggaaga aaattacagg taattatctt tgacggtaaa aacgctgtaa      360

tcagcgggct acatgaaaaa ttactctaat tatggctgca tttaagagaa tggagctcgg      420

gctggtcgac acaattggag gtaggcgtgt acggtgggag gcctatataa gcagagctcg      480

tttagtgaac cgtcagatcg cctggaggat ccttcgaaaa gcttgctacc ggtcgccacc      540

atggtcagca agggcgagga gctgttcacc ggggtggtgc ccatcctggt cgagctggac      600

ggcgacgtca atggccacaa gttcagcgtg tccggcgagg gcgagggcga tgccacctac      660

ggcaagctga ccctgaagct gatctgcacc accggcaagc tgcccgtgcc ctggcccacc      720

ctcgtgacca ccctgggcta cggcgtgcag tgcttcgccc gctaccccga ccacatgaag      780

cagcacgact tcttcaagtc cgccatgccc gaaggctacg tccaggagcg caccatcttc      840

ttcaaagacg acggcaacta caagacccgc gccgaggtga agttcgaggg cgacaccctg      900

gtgaaccgca tcgagctgaa gggcatcgac ttcaaggagg acggcaacat cctggggcac      960

aagctggagt acaactacaa cagccacaac gtctatatca ccgccgacaa gcagaagaac     1020

ggcatcaagg ccaacttcaa gatccgccac aacatcgagg acggcggcgt gcagctcgcc     1080

gaccactacc agcagaacac ccccatcggc gacggccccg tgctgctgcc cgacaaccac     1140

tacctgagct accagtccaa gctgagcaaa gaccccaacg agaagcgcga tcacatggtc     1200

ctgctggagt tcgtgaccgc cgccgggatc actctcggca tggacgagct gtacaaaggc     1260

agcggcgcca ccaacttcag cctgctgaag caggccggcg acgtggagga gaaccccggc     1320

cccggagcta gcggaatggt ttacccgtat gatgtcccgg attacgctgg cagctaccca     1380

tacgatgtac ccgactatgc cggcagttat ccctacgacg tccctgacta cgcatctacg     1440

tcccttttga atgcgcctac cggccttcaa gctagagtca ttaatctcgt cgaacaaaac     1500

tggtttggac actttatact gactctcata ctcattaatg ctgtgcagct tggaatggaa     1560

actagcgcca gcctcatggc acaatatggc gcgctgctta tgtccttgaa taaggtcctt     1620

ctctctgtgt tcgtggtcga actgctgctc cggatttatg cgtatcgggg caagtttttt     1680

aaggacccgt ggaatgtgtt tgacttcact gttattgtta ttgctctgat tcctgcatct     1740

ggcccattgg ctgtcctccg ctccctccga gttctccgcg tcttgagggt tctgacgatt     1800

gtccccagca tgaaaagagt agtgtcagca ctgcttggga gcttgcccgg gttggcctcc     1860

attgcaaccg tgcttctgtt gatctattac gttttcgctg tgatcgccac taaaattttc     1920

ggggatgctt ttccggaatg gttcgggacg atagcggact ccttctatac cctttttcaa     1980

attatgacct tggaaagttg gtctatgggg atctctaggc cagtgatgga ggtgtaccct     2040

tacgcttggg tattctttgt gccctttatt cttgttgcta cttttaccat gcttaacctt     2100

ttcatcgcca tcatagtgaa tactatgcag acattctctg acgaggaaca tgctctggag     2160

cgagagcaag ataaacagat cttggaacag gagcagagac aaatgcacga ggaactgaag     2220

gccattcgac tcgagcttca gcaactccaa acccttttgc gaaatgcggc tggggactcc     2280

tccaatgtct ccacaaaggg caatatcggc tcagactaat gaccgcggcc gcgaattcga     2340

tatcataatc aacctctgga ttacaaaatt tgtgaaagat tgactggtat tcttaactat     2400

gttgctcctt ttacgctatg tggatacgct gctttaatgc ctttgtatca tgctattgct     2460

tcccgtatgg ctttcatttt ctcctccttg tataaatcct ggttagttct tgccacggcg     2520

gaactcatcg ccgcctgcct tgcccgctgc tggacagggg ctcggctgtt gggcactgac     2580

aattccgtgg ctcgagagat cttcgactgt gccttctagt tgccagccat ctgttgtttg     2640

cccctccccc gtgccttcct tgaccctgga aggtgccact cccactgtcc tttcctaata     2700

aaatgaggaa attgcatcgc attgtctgag taggtgtcat tctattctgg ggggtggggt     2760

ggggcaggac agcaaggggg aggattggga agacaatagc aggcatgaga tctcacgtgc     2820

ggaccgagcg gccgc                                                      2835


<210>  35
<211>  5681
<212>  DNA
<213>  artificial sequence

<220>
<223>  CN1498 - The portion between L-ITR and R-ITR: positions 142-2943

<400>  35
cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc       60

gggcgacctt tggtcgcccg gcctcagtga gcgagcgagc gcgcagagag ggagtggcca      120

actccatcac taggggttcc tgcggccgca cgcgtggtac cctaaataaa gatggctttt      180

tagtattaaa agtggaagaa aattacaggt aattatcttt gacggtaaaa acgctgtaat      240

cagcgggcta catgaaaaat tactctaatt atggctgcat ttaagagaat ggctaaataa      300

agatggcttt ttagtattaa aagtggaaga aaattacagg taattatctt tgacggtaaa      360

aacgctgtaa tcagcgggct acatgaaaaa ttactctaat tatggctgca tttaagagaa      420

tggctaaata aagatggctt tttagtatta aaagtggaag aaaattacag gtaattatct      480

ttgacggtaa aaacgctgta atcagcgggc tacatgaaaa attactctaa ttatggctgc      540

atttaagaga atggagctcg ggctggtcga cacaattgga ggtaggcgtg tacggtggga      600

ggcctatata agcagagctc gtttagtgaa ccgtcagatc gcctggagga tccttcgaaa      660

agcttgctac cggtcgccac catggtcagc aagggcgagg agctgttcac cggggtggtg      720

cccatcctgg tcgagctgga cggcgacgtc aatggccaca agttcagcgt gtccggcgag      780

ggcgagggcg atgccaccta cggcaagctg accctgaagc tgatctgcac caccggcaag      840

ctgcccgtgc cctggcccac cctcgtgacc accctgggct acggcgtgca gtgcttcgcc      900

cgctaccccg accacatgaa gcagcacgac ttcttcaagt ccgccatgcc cgaaggctac      960

gtccaggagc gcaccatctt cttcaaagac gacggcaact acaagacccg cgccgaggtg     1020

aagttcgagg gcgacaccct ggtgaaccgc atcgagctga agggcatcga cttcaaggag     1080

gacggcaaca tcctggggca caagctggag tacaactaca acagccacaa cgtctatatc     1140

accgccgaca agcagaagaa cggcatcaag gccaacttca agatccgcca caacatcgag     1200

gacggcggcg tgcagctcgc cgaccactac cagcagaaca cccccatcgg cgacggcccc     1260

gtgctgctgc ccgacaacca ctacctgagc taccagtcca agctgagcaa agaccccaac     1320

gagaagcgcg atcacatggt cctgctggag ttcgtgaccg ccgccgggat cactctcggc     1380

atggacgagc tgtacaaagg cagcggcgcc accaacttca gcctgctgaa gcaggccggc     1440

gacgtggagg agaaccccgg ccccggagct agcggaatgg tttacccgta tgatgtcccg     1500

gattacgctg gcagctaccc atacgatgta cccgactatg ccggcagtta tccctacgac     1560

gtccctgact acgcagaaaa caacccagcc gaacagcaag tcccacccct cgtggcgctc     1620

gcccaacgca tagtatttca caaggcgttt acgccgacga taatcaccct catcattatt     1680

aatgcgatca ttgtgggact cgagacatac ccaacggttt accagggtta caatgattgg     1740

ttctatgctg ccgaccttgc tttgttgtgg atattcacta ttgaaatcac gctccgattc     1800

atcgccgccc gaccgacgaa gagtttcttc aagtctagct ggaactggtt tgatctgctt     1860

atcgtattgg cgggccacgt cttcgctggc gcccattttg ttacggtgct taggatcctc     1920

cgcgtcctga gggtcctcag agctatctca gtcataccca gtctccggcg gctggttgac     1980

gcacttttga tgacaatccc agcactcggt aacatcatga tactgatggg gattattttt     2040

tacatattcg cggttatcgg gacgatgctc tttgcatcag tagcgccaga atactttggc     2100

aatttgcagc tgtctctgct tacactgttc caagtggtta cgctggaaag ttgggctagt     2160

ggggttatgc gacctatttt tgccgaagtc tggtggtctt ggatctattt tgtaatcttt     2220

attctcgtgg gaactttcat agtatttaac cttttcattg gcgtcatcgt gaacaatgtg     2280

gaaaaagcta acgaagagga actgaaaagc gaactggatg ataaagaggc tgatacaaaa     2340

gaagaactgg catcattgcg aaacgaggtg gcagaaatga aggatctcat aaaacagatg     2400

cataaacagc aaacaaaaaa gggttaatga ccgcggccgc gaattcgata tcataatcaa     2460

cctctggatt acaaaatttg tgaaagattg actggtattc ttaactatgt tgctcctttt     2520

acgctatgtg gatacgctgc tttaatgcct ttgtatcatg ctattgcttc ccgtatggct     2580

ttcattttct cctccttgta taaatcctgg ttagttcttg ccacggcgga actcatcgcc     2640

gcctgccttg cccgctgctg gacaggggct cggctgttgg gcactgacaa ttccgtggct     2700

cgagagatct tcgactgtgc cttctagttg ccagccatct gttgtttgcc cctcccccgt     2760

gccttccttg accctggaag gtgccactcc cactgtcctt tcctaataaa atgaggaaat     2820

tgcatcgcat tgtctgagta ggtgtcattc tattctgggg ggtggggtgg ggcaggacag     2880

caagggggag gattgggaag acaatagcag gcatgagatc tcacgtgcgg accgagcggc     2940

cgcaggaacc cctagtgatg gagttggcca ctccctctct gcgcgctcgc tcgctcactg     3000

aggccgggcg accaaaggtc gcccgacgcc cgggctttgc ccgggcggcc tcagtgagcg     3060

agcgagcgcg cagctgcctg caggggcgcc tgatgcggta ttttctcctt acgcatctgt     3120

gcggtatttc acaccgcata cgtcaaagca accatagtac gcgccctgta gcggcgcatt     3180

aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca gcgccctagc     3240

gcccgctcct ttcgctttct tcccttcctt tctcgccacg ttcgccggct ttccccgtca     3300

agctctaaat cgggggctcc ctttagggtt ccgatttagt gctttacggc acctcgaccc     3360

caaaaaactt gatttgggtg atggttcacg tagtgggcca tcgccctgat agacggtttt     3420

tcgccctttg acgttggagt ccacgttctt taatagtgga ctcttgttcc aaactggaac     3480

aacactcaac cctatctcgg gctattcttt tgatttataa gggattttgc cgatttcggc     3540

ctattggtta aaaaatgagc tgatttaaca aaaatttaac gcgaatttta acaaaatatt     3600

aacgtttaca attttatggt gcactctcag tacaatctgc tctgatgccg catagttaag     3660

ccagccccga cacccgccaa cacccgctga cgcgccctga cgggcttgtc tgctcccggc     3720

atccgcttac agacaagctg tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc     3780

gtcatcaccg aaacgcgcga gacgaaaggg cctcgtgata cgcctatttt tataggttaa     3840

tgtcatgata ataatggttt cttagacgtc aggtggcact tttcggggaa atgtgcgcgg     3900

aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca tgagacaata     3960

accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc aacatttccg     4020

tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc acccagaaac     4080

gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt acatcgaact     4140

ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt ttccaatgat     4200

gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga     4260

gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact caccagtcac     4320

agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg ccataaccat     4380

gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga aggagctaac     4440

cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg aaccggagct     4500

gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa tggcaacaac     4560

gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac aattaataga     4620

ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc cggctggctg     4680

gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca ttgcagcact     4740

ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga gtcaggcaac     4800

tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta agcattggta     4860

actgtcagac caagtttact catatatact ttagattgat ttaaaacttc atttttaatt     4920

taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc cttaacgtga     4980

gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt cttgagatcc     5040

tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt     5100

ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct tcagcagagc     5160

gcagatacca aatactgtcc ttctagtgta gccgtagtta ggccaccact tcaagaactc     5220

tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg ctgccagtgg     5280

cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata aggcgcagcg     5340

gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga cctacaccga     5400

actgagatac ctacagcgtg agctatgaga aagcgccacg cttcccgaag ggagaaaggc     5460

ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg agcttccagg     5520

gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg     5580

atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca acgcggcctt     5640

tttacggttc ctggcctttt gctggccttt tgctcacatg t                         5681


<210>  36
<211>  5684
<212>  DNA
<213>  artificial sequence

<220>
<223>  CN1499 - The portion between L-ITR and R-ITR: positions 142-2946

<400>  36
cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc       60

gggcgacctt tggtcgcccg gcctcagtga gcgagcgagc gcgcagagag ggagtggcca      120

actccatcac taggggttcc tgcggccgca cgcgtggtac cctaaataaa gatggctttt      180

tagtattaaa agtggaagaa aattacaggt aattatcttt gacggtaaaa acgctgtaat      240

cagcgggcta catgaaaaat tactctaatt atggctgcat ttaagagaat ggctaaataa      300

agatggcttt ttagtattaa aagtggaaga aaattacagg taattatctt tgacggtaaa      360

aacgctgtaa tcagcgggct acatgaaaaa ttactctaat tatggctgca tttaagagaa      420

tggctaaata aagatggctt tttagtatta aaagtggaag aaaattacag gtaattatct      480

ttgacggtaa aaacgctgta atcagcgggc tacatgaaaa attactctaa ttatggctgc      540

atttaagaga atggagctcg ggctggtcga cacaattgga ggtaggcgtg tacggtggga      600

ggcctatata agcagagctc gtttagtgaa ccgtcagatc gcctggagga tccttcgaaa      660

agcttgctac cggtcgccac catggtcagc aagggcgagg agctgttcac cggggtggtg      720

cccatcctgg tcgagctgga cggcgacgtc aatggccaca agttcagcgt gtccggcgag      780

ggcgagggcg atgccaccta cggcaagctg accctgaagc tgatctgcac caccggcaag      840

ctgcccgtgc cctggcccac cctcgtgacc accctgggct acggcgtgca gtgcttcgcc      900

cgctaccccg accacatgaa gcagcacgac ttcttcaagt ccgccatgcc cgaaggctac      960

gtccaggagc gcaccatctt cttcaaagac gacggcaact acaagacccg cgccgaggtg     1020

aagttcgagg gcgacaccct ggtgaaccgc atcgagctga agggcatcga cttcaaggag     1080

gacggcaaca tcctggggca caagctggag tacaactaca acagccacaa cgtctatatc     1140

accgccgaca agcagaagaa cggcatcaag gccaacttca agatccgcca caacatcgag     1200

gacggcggcg tgcagctcgc cgaccactac cagcagaaca cccccatcgg cgacggcccc     1260

gtgctgctgc ccgacaacca ctacctgagc taccagtcca agctgagcaa agaccccaac     1320

gagaagcgcg atcacatggt cctgctggag ttcgtgaccg ccgccgggat cactctcggc     1380

atggacgagc tgtacaaagg cagcggcgcc accaacttca gcctgctgaa gcaggccggc     1440

gacgtggagg agaaccccgg ccccggagct agcggaatgg tttatccgta tgatgttcct     1500

gactatgcag gatcctatcc ttatgatgtt cccgattacg ctggttctta cccttacgat     1560

gttcccgatt atgccagttc tggattggtg ccacgaggca gccacatgag ccggaagatc     1620

agagatctta tcgaatctaa gagatttcag aatgttatta ccgcgataat cgtactcaac     1680

ggggcggtgc tcggtctcct caccgatacc acattgagcg cttctagcca gaacctgctc     1740

gaaagggttg accaactgtg cctgacaatt tttatcgtgg aaattagctt gaaaatttac     1800

gcctacggcg ttcgcggttt tttccggagc ggttggaatc tttttgactt cgttatcgtt     1860

gccatcgcgc tcatgcccgc acagggttct ttgtctgtgt tgaggacatt ccgaatattt     1920

cgcgtgatgc gcttggtatc cgtgatccct acgatgcgcc gcgtcgtaca aggaatgttg     1980

ctggctctcc ccggcgtcgg gagcgttgct gccctcctta ccgtggtatt ttacatagcg     2040

gcggttatgg ctactaatct ttacggagct accttcccgg agtggttcgg ggatttgtcc     2100

aagagcctct atacattgtt tcaagttatg accctggagt cctggtctat gggcattgtc     2160

cggcccgtaa tgaacgtaca cccaaatgcg tgggtgtttt tcattccatt catcatgctg     2220

actaccttta ccgtgctgaa cttgttcatt gggattatcg tggatgcgat ggccatcact     2280

aaggagcaag aagaagaggc taaaactggc caccaccaag agccaatttc tcaaaccctc     2340

ttgcatctcg gggaccgact ggaccgcatt gagaagcaac tcgcgcagaa caatgagctg     2400

ttgcagcgac agcaacctca aaaaaaataa tgaccgcggc cgcgaattcg atatcataat     2460

caacctctgg attacaaaat ttgtgaaaga ttgactggta ttcttaacta tgttgctcct     2520

tttacgctat gtggatacgc tgctttaatg cctttgtatc atgctattgc ttcccgtatg     2580

gctttcattt tctcctcctt gtataaatcc tggttagttc ttgccacggc ggaactcatc     2640

gccgcctgcc ttgcccgctg ctggacaggg gctcggctgt tgggcactga caattccgtg     2700

gctcgagaga tcttcgactg tgccttctag ttgccagcca tctgttgttt gcccctcccc     2760

cgtgccttcc ttgaccctgg aaggtgccac tcccactgtc ctttcctaat aaaatgagga     2820

aattgcatcg cattgtctga gtaggtgtca ttctattctg gggggtgggg tggggcagga     2880

cagcaagggg gaggattggg aagacaatag caggcatgag atctcacgtg cggaccgagc     2940

ggccgcagga acccctagtg atggagttgg ccactccctc tctgcgcgct cgctcgctca     3000

ctgaggccgg gcgaccaaag gtcgcccgac gcccgggctt tgcccgggcg gcctcagtga     3060

gcgagcgagc gcgcagctgc ctgcaggggc gcctgatgcg gtattttctc cttacgcatc     3120

tgtgcggtat ttcacaccgc atacgtcaaa gcaaccatag tacgcgccct gtagcggcgc     3180

attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg ccagcgccct     3240

agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg gctttccccg     3300

tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac ggcacctcga     3360

ccccaaaaaa cttgatttgg gtgatggttc acgtagtggg ccatcgccct gatagacggt     3420

ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt tccaaactgg     3480

aacaacactc aaccctatct cgggctattc ttttgattta taagggattt tgccgatttc     3540

ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt ttaacaaaat     3600

attaacgttt acaattttat ggtgcactct cagtacaatc tgctctgatg ccgcatagtt     3660

aagccagccc cgacacccgc caacacccgc tgacgcgccc tgacgggctt gtctgctccc     3720

ggcatccgct tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc agaggttttc     3780

accgtcatca ccgaaacgcg cgagacgaaa gggcctcgtg atacgcctat ttttataggt     3840

taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg gaaatgtgcg     3900

cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc tcatgagaca     3960

ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta ttcaacattt     4020

ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg ctcacccaga     4080

aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg gttacatcga     4140

actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac gttttccaat     4200

gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg acgccgggca     4260

agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt actcaccagt     4320

cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg ctgccataac     4380

catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac cgaaggagct     4440

aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt gggaaccgga     4500

gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag caatggcaac     4560

aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc aacaattaat     4620

agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc ttccggctgg     4680

ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta tcattgcagc     4740

actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg ggagtcaggc     4800

aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga ttaagcattg     4860

gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac ttcattttta     4920

atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa tcccttaacg     4980

tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga     5040

tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt     5100

ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg gcttcagcag     5160

agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc acttcaagaa     5220

ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg ctgctgccag     5280

tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg ataaggcgca     5340

gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa cgacctacac     5400

cgaactgaga tacctacagc gtgagctatg agaaagcgcc acgcttcccg aagggagaaa     5460

ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga gggagcttcc     5520

agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct gacttgagcg     5580

tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca gcaacgcggc     5640

ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgt                      5684


<210>  37
<211>  4780
<212>  DNA
<213>  artificial sequence

<220>
<223>  CN1244 - The portion between L-ITR and R-ITR: positions 142-2042

<400>  37
cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc       60

gggcgacctt tggtcgcccg gcctcagtga gcgagcgagc gcgcagagag ggagtggcca      120

actccatcac taggggttcc tgcggccgca cgcgtatagg taccgagctc tatgcactca      180

cagtggtttg gcatgcatct ggtgaatttt ttttaacgaa aaattagtgt tggtttcgat      240

gtatggtagc attctcccta acgtaatttg aataattcag caaagcccca ctaccagctg      300

tacttctgca gcctcttcca ttcttttcag cattataatt ttggttaatt ttcaatttta      360

ggtcctacgt ctctgcaatt tgtgtatgaa taacagaata atttccctct tttgtttcgc      420

ctttcctgtt cctgaatcta aataaagatg gctttttagt attaaaagtg gaagaaaatt      480

acaggtaatt atctttgacg gtaaaaacgc tgtaatcagc gggctacatg aaaaattact      540

ctaattatgg ctgcatttaa gagaatggaa aaaaaccttc ttgtggataa aaaccttaaa      600

ttgtccccaa tgtctgcttc aaattggatg gcactgcagc tggaggcttt gttcagaatt      660

gatcctgggg agctacgaac ccaaagtttc acagtaggga gctcgggctg ggcataaaag      720

tcagggcaga gccatctatt gcttacattt gcttctggga tccagatctt tcgaagctag      780

cgctaccggt cgccaccatg gtgagcaagg gcgaggagct gttcaccggg gtggtgccca      840

tcctggtcga gctggacggc gacgtaaacg gccacaagtt cagcgtgtcc ggcgagggcg      900

agggcgatgc cacctacggc aagctgaccc tgaagctgat ctgcaccacc ggcaagctgc      960

ccgtgccctg gcccaccctc gtgaccaccc tgggctacgg cgtgcagtgc ttcgcccgct     1020

accccgacca catgaagcag cacgacttct tcaagtccgc catgcccgaa ggctacgtcc     1080

aggagcgcac catcttcttc aaggacgacg gcaactacaa gacccgcgcc gaggtgaagt     1140

tcgagggcga caccctggtg aaccgcatcg agctgaaggg catcgacttc aaggaggacg     1200

gcaacatcct ggggcacaag ctggagtaca actacaacag ccacaacgtc tatatcaccg     1260

ccgacaagca gaagaacggc atcaaggcca acttcaagat ccgccacaac atcgaggacg     1320

gcggcgtgca gctcgccgac cactaccagc agaacacccc catcggcgac ggccccgtgc     1380

tgctgcccga caaccactac ctgagctacc agtccaagct gagcaaagac cccaacgaga     1440

agcgcgatca catggtcctg ctggagttcg tgaccgccgc cgggatcact ctcggcatgg     1500

acgagctgta caagtaagtc gacggcgcgc cgcggccgcg aattcgatat cataatcaac     1560

ctctggatta caaaatttgt gaaagattga ctggtattct taactatgtt gctcctttta     1620

cgctatgtgg atacgctgct ttaatgcctt tgtatcatgc tattgcttcc cgtatggctt     1680

tcattttctc ctccttgtat aaatcctggt tagttcttgc cacggcggaa ctcatcgccg     1740

cctgccttgc ccgctgctgg acaggggctc ggctgttggg cactgacaat tccgtggctc     1800

gagagatctt cgactgtgcc ttctagttgc cagccatctg ttgtttgccc ctcccccgtg     1860

ccttccttga ccctggaagg tgccactccc actgtccttt cctaataaaa tgaggaaatt     1920

gcatcgcatt gtctgagtag gtgtcattct attctggggg gtggggtggg gcaggacagc     1980

aagggggagg attgggaaga caatagcagg catgagatct cacgtgcgga ccgagcggcc     2040

gcaggaaccc ctagtgatgg agttggccac tccctctctg cgcgctcgct cgctcactga     2100

ggccgggcga ccaaaggtcg cccgacgccc gggctttgcc cgggcggcct cagtgagcga     2160

gcgagcgcgc agctgcctgc aggggcgcct gatgcggtat tttctcctta cgcatctgtg     2220

cggtatttca caccgcatac gtcaaagcaa ccatagtacg cgccctgtag cggcgcatta     2280

agcgcggcgg gtgtggtggt tacgcgcagc gtgaccgcta cacttgccag cgccctagcg     2340

cccgctcctt tcgctttctt cccttccttt ctcgccacgt tcgccggctt tccccgtcaa     2400

gctctaaatc gggggctccc tttagggttc cgatttagtg ctttacggca cctcgacccc     2460

aaaaaacttg atttgggtga tggttcacgt agtgggccat cgccctgata gacggttttt     2520

cgccctttga cgttggagtc cacgttcttt aatagtggac tcttgttcca aactggaaca     2580

acactcaacc ctatctcggg ctattctttt gatttataag ggattttgcc gatttcggcc     2640

tattggttaa aaaatgagct gatttaacaa aaatttaacg cgaattttaa caaaatatta     2700

acgtttacaa ttttatggtg cactctcagt acaatctgct ctgatgccgc atagttaagc     2760

cagccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct gctcccggca     2820

tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag gttttcaccg     2880

tcatcaccga aacgcgcgag acgaaagggc ctcgtgatac gcctattttt ataggttaat     2940

gtcatgataa taatggtttc ttagacgtca ggtggcactt ttcggggaaa tgtgcgcgga     3000

acccctattt gtttattttt ctaaatacat tcaaatatgt atccgctcat gagacaataa     3060

ccctgataaa tgcttcaata atattgaaaa aggaagagta tgagtattca acatttccgt     3120

gtcgccctta ttcccttttt tgcggcattt tgccttcctg tttttgctca cccagaaacg     3180

ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac gagtgggtta catcgaactg     3240

gatctcaaca gcggtaagat ccttgagagt tttcgccccg aagaacgttt tccaatgatg     3300

agcactttta aagttctgct atgtggcgcg gtattatccc gtattgacgc cgggcaagag     3360

caactcggtc gccgcataca ctattctcag aatgacttgg ttgagtactc accagtcaca     3420

gaaaagcatc ttacggatgg catgacagta agagaattat gcagtgctgc cataaccatg     3480

agtgataaca ctgcggccaa cttacttctg acaacgatcg gaggaccgaa ggagctaacc     3540

gcttttttgc acaacatggg ggatcatgta actcgccttg atcgttggga accggagctg     3600

aatgaagcca taccaaacga cgagcgtgac accacgatgc ctgtagcaat ggcaacaacg     3660

ttgcgcaaac tattaactgg cgaactactt actctagctt cccggcaaca attaatagac     3720

tggatggagg cggataaagt tgcaggacca cttctgcgct cggcccttcc ggctggctgg     3780

tttattgctg ataaatctgg agccggtgag cgtgggtctc gcggtatcat tgcagcactg     3840

gggccagatg gtaagccctc ccgtatcgta gttatctaca cgacggggag tcaggcaact     3900

atggatgaac gaaatagaca gatcgctgag ataggtgcct cactgattaa gcattggtaa     3960

ctgtcagacc aagtttactc atatatactt tagattgatt taaaacttca tttttaattt     4020

aaaaggatct aggtgaagat cctttttgat aatctcatga ccaaaatccc ttaacgtgag     4080

ttttcgttcc actgagcgtc agaccccgta gaaaagatca aaggatcttc ttgagatcct     4140

ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac caccgctacc agcggtggtt     4200

tgtttgccgg atcaagagct accaactctt tttccgaagg taactggctt cagcagagcg     4260

cagataccaa atactgtcct tctagtgtag ccgtagttag gccaccactt caagaactct     4320

gtagcaccgc ctacatacct cgctctgcta atcctgttac cagtggctgc tgccagtggc     4380

gataagtcgt gtcttaccgg gttggactca agacgatagt taccggataa ggcgcagcgg     4440

tcgggctgaa cggggggttc gtgcacacag cccagcttgg agcgaacgac ctacaccgaa     4500

ctgagatacc tacagcgtga gctatgagaa agcgccacgc ttcccgaagg gagaaaggcg     4560

gacaggtatc cggtaagcgg cagggtcgga acaggagagc gcacgaggga gcttccaggg     4620

ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc acctctgact tgagcgtcga     4680

tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa acgccagcaa cgcggccttt     4740

ttacggttcc tggccttttg ctggcctttt gctcacatgt                           4780


<210>  38
<211>  4398
<212>  DNA
<213>  artificial sequence

<220>
<223>  CN1389 - The portion between L-ITR and R-ITR corresponds to
       positions 142-1660

<400>  38
cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc       60

gggcgacctt tggtcgcccg gcctcagtga gcgagcgagc gcgcagagag ggagtggcca      120

actccatcac taggggttcc tgcggccgca cgcgttcgcc tttcctgttc ctgaatctaa      180

ataaagatgg ctttttagta ttaaaagtgg aagaaaatta caggtaatta tctttgacgg      240

taaaaacgct gtaatcagcg ggctacatga aaaattactc taattatggc tgcatttaag      300

agaatggacc tgcagggagc tcgggctggg cataaaagtc agggcagagc catctattgc      360

ttacatttgc ttctgggatc cagatctttc gaagctagcg ctaccggtcg ccaccatggt      420

gagcaagggc gaggagctgt tcaccggggt ggtgcccatc ctggtcgagc tggacggcga      480

cgtaaacggc cacaagttca gcgtgtccgg cgagggcgag ggcgatgcca cctacggcaa      540

gctgaccctg aagctgatct gcaccaccgg caagctgccc gtgccctggc ccaccctcgt      600

gaccaccctg ggctacggcg tgcagtgctt cgcccgctac cccgaccaca tgaagcagca      660

cgacttcttc aagtccgcca tgcccgaagg ctacgtccag gagcgcacca tcttcttcaa      720

ggacgacggc aactacaaga cccgcgccga ggtgaagttc gagggcgaca ccctggtgaa      780

ccgcatcgag ctgaagggca tcgacttcaa ggaggacggc aacatcctgg ggcacaagct      840

ggagtacaac tacaacagcc acaacgtcta tatcaccgcc gacaagcaga agaacggcat      900

caaggccaac ttcaagatcc gccacaacat cgaggacggc ggcgtgcagc tcgccgacca      960

ctaccagcag aacaccccca tcggcgacgg ccccgtgctg ctgcccgaca accactacct     1020

gagctaccag tccaagctga gcaaagaccc caacgagaag cgcgatcaca tggtcctgct     1080

ggagttcgtg accgccgccg ggatcactct cggcatggac gagctgtaca agtaagtcga     1140

cggcgcgccg cggccgcgaa ttcgatatca taatcaacct ctggattaca aaatttgtga     1200

aagattgact ggtattctta actatgttgc tccttttacg ctatgtggat acgctgcttt     1260

aatgcctttg tatcatgcta ttgcttcccg tatggctttc attttctcct ccttgtataa     1320

atcctggtta gttcttgcca cggcggaact catcgccgcc tgccttgccc gctgctggac     1380

aggggctcgg ctgttgggca ctgacaattc cgtggctcga gagatcttcg actgtgcctt     1440

ctagttgcca gccatctgtt gtttgcccct cccccgtgcc ttccttgacc ctggaaggtg     1500

ccactcccac tgtcctttcc taataaaatg aggaaattgc atcgcattgt ctgagtaggt     1560

gtcattctat tctggggggt ggggtggggc aggacagcaa gggggaggat tgggaagaca     1620

atagcaggca tgagatctca cgtgcggacc gagcggccgc aggaacccct agtgatggag     1680

ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc     1740

cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag ctgcctgcag     1800

gggcgcctga tgcggtattt tctccttacg catctgtgcg gtatttcaca ccgcatacgt     1860

caaagcaacc atagtacgcg ccctgtagcg gcgcattaag cgcggcgggt gtggtggtta     1920

cgcgcagcgt gaccgctaca cttgccagcg ccctagcgcc cgctcctttc gctttcttcc     1980

cttcctttct cgccacgttc gccggctttc cccgtcaagc tctaaatcgg gggctccctt     2040

tagggttccg atttagtgct ttacggcacc tcgaccccaa aaaacttgat ttgggtgatg     2100

gttcacgtag tgggccatcg ccctgataga cggtttttcg ccctttgacg ttggagtcca     2160

cgttctttaa tagtggactc ttgttccaaa ctggaacaac actcaaccct atctcgggct     2220

attcttttga tttataaggg attttgccga tttcggccta ttggttaaaa aatgagctga     2280

tttaacaaaa atttaacgcg aattttaaca aaatattaac gtttacaatt ttatggtgca     2340

ctctcagtac aatctgctct gatgccgcat agttaagcca gccccgacac ccgccaacac     2400

ccgctgacgc gccctgacgg gcttgtctgc tcccggcatc cgcttacaga caagctgtga     2460

ccgtctccgg gagctgcatg tgtcagaggt tttcaccgtc atcaccgaaa cgcgcgagac     2520

gaaagggcct cgtgatacgc ctatttttat aggttaatgt catgataata atggtttctt     2580

agacgtcagg tggcactttt cggggaaatg tgcgcggaac ccctatttgt ttatttttct     2640

aaatacattc aaatatgtat ccgctcatga gacaataacc ctgataaatg cttcaataat     2700

attgaaaaag gaagagtatg agtattcaac atttccgtgt cgcccttatt cccttttttg     2760

cggcattttg ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta aaagatgctg     2820

aagatcagtt gggtgcacga gtgggttaca tcgaactgga tctcaacagc ggtaagatcc     2880

ttgagagttt tcgccccgaa gaacgttttc caatgatgag cacttttaaa gttctgctat     2940

gtggcgcggt attatcccgt attgacgccg ggcaagagca actcggtcgc cgcatacact     3000

attctcagaa tgacttggtt gagtactcac cagtcacaga aaagcatctt acggatggca     3060

tgacagtaag agaattatgc agtgctgcca taaccatgag tgataacact gcggccaact     3120

tacttctgac aacgatcgga ggaccgaagg agctaaccgc ttttttgcac aacatggggg     3180

atcatgtaac tcgccttgat cgttgggaac cggagctgaa tgaagccata ccaaacgacg     3240

agcgtgacac cacgatgcct gtagcaatgg caacaacgtt gcgcaaacta ttaactggcg     3300

aactacttac tctagcttcc cggcaacaat taatagactg gatggaggcg gataaagttg     3360

caggaccact tctgcgctcg gcccttccgg ctggctggtt tattgctgat aaatctggag     3420

ccggtgagcg tgggtctcgc ggtatcattg cagcactggg gccagatggt aagccctccc     3480

gtatcgtagt tatctacacg acggggagtc aggcaactat ggatgaacga aatagacaga     3540

tcgctgagat aggtgcctca ctgattaagc attggtaact gtcagaccaa gtttactcat     3600

atatacttta gattgattta aaacttcatt tttaatttaa aaggatctag gtgaagatcc     3660

tttttgataa tctcatgacc aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag     3720

accccgtaga aaagatcaaa ggatcttctt gagatccttt ttttctgcgc gtaatctgct     3780

gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg tttgccggat caagagctac     3840

caactctttt tccgaaggta actggcttca gcagagcgca gataccaaat actgtccttc     3900

tagtgtagcc gtagttaggc caccacttca agaactctgt agcaccgcct acatacctcg     3960

ctctgctaat cctgttacca gtggctgctg ccagtggcga taagtcgtgt cttaccgggt     4020

tggactcaag acgatagtta ccggataagg cgcagcggtc gggctgaacg gggggttcgt     4080

gcacacagcc cagcttggag cgaacgacct acaccgaact gagataccta cagcgtgagc     4140

tatgagaaag cgccacgctt cccgaaggga gaaaggcgga caggtatccg gtaagcggca     4200

gggtcggaac aggagagcgc acgagggagc ttccaggggg aaacgcctgg tatctttata     4260

gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg     4320

ggcggagcct atggaaaaac gccagcaacg cggccttttt acggttcctg gccttttgct     4380

ggccttttgc tcacatgt                                                   4398


<210>  39
<211>  4635
<212>  DNA
<213>  artificial sequence

<220>
<223>  CN1390 - The portion between L-ITR and R-ITR corresponds to
       positions 142-1897

<400>  39
cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc       60

gggcgacctt tggtcgcccg gcctcagtga gcgagcgagc gcgcagagag ggagtggcca      120

actccatcac taggggttcc tgcggccgca cgcgtggtac cctaaataaa gatggctttt      180

tagtattaaa agtggaagaa aattacaggt aattatcttt gacggtaaaa acgctgtaat      240

cagcgggcta catgaaaaat tactctaatt atggctgcat ttaagagaat ggctaaataa      300

agatggcttt ttagtattaa aagtggaaga aaattacagg taattatctt tgacggtaaa      360

aacgctgtaa tcagcgggct acatgaaaaa ttactctaat tatggctgca tttaagagaa      420

tggctaaata aagatggctt tttagtatta aaagtggaag aaaattacag gtaattatct      480

ttgacggtaa aaacgctgta atcagcgggc tacatgaaaa attactctaa ttatggctgc      540

atttaagaga atggagctcg ggctgggcat aaaagtcagg gcagagccat ctattgctta      600

catttgcttc tgggatccag atctttcgaa gctagcgcta ccggtcgcca ccatggtgag      660

caagggcgag gagctgttca ccggggtggt gcccatcctg gtcgagctgg acggcgacgt      720

aaacggccac aagttcagcg tgtccggcga gggcgagggc gatgccacct acggcaagct      780

gaccctgaag ctgatctgca ccaccggcaa gctgcccgtg ccctggccca ccctcgtgac      840

caccctgggc tacggcgtgc agtgcttcgc ccgctacccc gaccacatga agcagcacga      900

cttcttcaag tccgccatgc ccgaaggcta cgtccaggag cgcaccatct tcttcaagga      960

cgacggcaac tacaagaccc gcgccgaggt gaagttcgag ggcgacaccc tggtgaaccg     1020

catcgagctg aagggcatcg acttcaagga ggacggcaac atcctggggc acaagctgga     1080

gtacaactac aacagccaca acgtctatat caccgccgac aagcagaaga acggcatcaa     1140

ggccaacttc aagatccgcc acaacatcga ggacggcggc gtgcagctcg ccgaccacta     1200

ccagcagaac acccccatcg gcgacggccc cgtgctgctg cccgacaacc actacctgag     1260

ctaccagtcc aagctgagca aagaccccaa cgagaagcgc gatcacatgg tcctgctgga     1320

gttcgtgacc gccgccggga tcactctcgg catggacgag ctgtacaagt aagtcgacgg     1380

cgcgccgcgg ccgcgaattc gatatcataa tcaacctctg gattacaaaa tttgtgaaag     1440

attgactggt attcttaact atgttgctcc ttttacgcta tgtggatacg ctgctttaat     1500

gcctttgtat catgctattg cttcccgtat ggctttcatt ttctcctcct tgtataaatc     1560

ctggttagtt cttgccacgg cggaactcat cgccgcctgc cttgcccgct gctggacagg     1620

ggctcggctg ttgggcactg acaattccgt ggctcgagag atcttcgact gtgccttcta     1680

gttgccagcc atctgttgtt tgcccctccc ccgtgccttc cttgaccctg gaaggtgcca     1740

ctcccactgt cctttcctaa taaaatgagg aaattgcatc gcattgtctg agtaggtgtc     1800

attctattct ggggggtggg gtggggcagg acagcaaggg ggaggattgg gaagacaata     1860

gcaggcatga gatctcacgt gcggaccgag cggccgcagg aacccctagt gatggagttg     1920

gccactccct ctctgcgcgc tcgctcgctc actgaggccg ggcgaccaaa ggtcgcccga     1980

cgcccgggct ttgcccgggc ggcctcagtg agcgagcgag cgcgcagctg cctgcagggg     2040

cgcctgatgc ggtattttct ccttacgcat ctgtgcggta tttcacaccg catacgtcaa     2100

agcaaccata gtacgcgccc tgtagcggcg cattaagcgc ggcgggtgtg gtggttacgc     2160

gcagcgtgac cgctacactt gccagcgccc tagcgcccgc tcctttcgct ttcttccctt     2220

cctttctcgc cacgttcgcc ggctttcccc gtcaagctct aaatcggggg ctccctttag     2280

ggttccgatt tagtgcttta cggcacctcg accccaaaaa acttgatttg ggtgatggtt     2340

cacgtagtgg gccatcgccc tgatagacgg tttttcgccc tttgacgttg gagtccacgt     2400

tctttaatag tggactcttg ttccaaactg gaacaacact caaccctatc tcgggctatt     2460

cttttgattt ataagggatt ttgccgattt cggcctattg gttaaaaaat gagctgattt     2520

aacaaaaatt taacgcgaat tttaacaaaa tattaacgtt tacaatttta tggtgcactc     2580

tcagtacaat ctgctctgat gccgcatagt taagccagcc ccgacacccg ccaacacccg     2640

ctgacgcgcc ctgacgggct tgtctgctcc cggcatccgc ttacagacaa gctgtgaccg     2700

tctccgggag ctgcatgtgt cagaggtttt caccgtcatc accgaaacgc gcgagacgaa     2760

agggcctcgt gatacgccta tttttatagg ttaatgtcat gataataatg gtttcttaga     2820

cgtcaggtgg cacttttcgg ggaaatgtgc gcggaacccc tatttgttta tttttctaaa     2880

tacattcaaa tatgtatccg ctcatgagac aataaccctg ataaatgctt caataatatt     2940

gaaaaaggaa gagtatgagt attcaacatt tccgtgtcgc ccttattccc ttttttgcgg     3000

cattttgcct tcctgttttt gctcacccag aaacgctggt gaaagtaaaa gatgctgaag     3060

atcagttggg tgcacgagtg ggttacatcg aactggatct caacagcggt aagatccttg     3120

agagttttcg ccccgaagaa cgttttccaa tgatgagcac ttttaaagtt ctgctatgtg     3180

gcgcggtatt atcccgtatt gacgccgggc aagagcaact cggtcgccgc atacactatt     3240

ctcagaatga cttggttgag tactcaccag tcacagaaaa gcatcttacg gatggcatga     3300

cagtaagaga attatgcagt gctgccataa ccatgagtga taacactgcg gccaacttac     3360

ttctgacaac gatcggagga ccgaaggagc taaccgcttt tttgcacaac atgggggatc     3420

atgtaactcg ccttgatcgt tgggaaccgg agctgaatga agccatacca aacgacgagc     3480

gtgacaccac gatgcctgta gcaatggcaa caacgttgcg caaactatta actggcgaac     3540

tacttactct agcttcccgg caacaattaa tagactggat ggaggcggat aaagttgcag     3600

gaccacttct gcgctcggcc cttccggctg gctggtttat tgctgataaa tctggagccg     3660

gtgagcgtgg gtctcgcggt atcattgcag cactggggcc agatggtaag ccctcccgta     3720

tcgtagttat ctacacgacg gggagtcagg caactatgga tgaacgaaat agacagatcg     3780

ctgagatagg tgcctcactg attaagcatt ggtaactgtc agaccaagtt tactcatata     3840

tactttagat tgatttaaaa cttcattttt aatttaaaag gatctaggtg aagatccttt     3900

ttgataatct catgaccaaa atcccttaac gtgagttttc gttccactga gcgtcagacc     3960

ccgtagaaaa gatcaaagga tcttcttgag atcctttttt tctgcgcgta atctgctgct     4020

tgcaaacaaa aaaaccaccg ctaccagcgg tggtttgttt gccggatcaa gagctaccaa     4080

ctctttttcc gaaggtaact ggcttcagca gagcgcagat accaaatact gtccttctag     4140

tgtagccgta gttaggccac cacttcaaga actctgtagc accgcctaca tacctcgctc     4200

tgctaatcct gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt accgggttgg     4260

actcaagacg atagttaccg gataaggcgc agcggtcggg ctgaacgggg ggttcgtgca     4320

cacagcccag cttggagcga acgacctaca ccgaactgag atacctacag cgtgagctat     4380

gagaaagcgc cacgcttccc gaagggagaa aggcggacag gtatccggta agcggcaggg     4440

tcggaacagg agagcgcacg agggagcttc cagggggaaa cgcctggtat ctttatagtc     4500

ctgtcgggtt tcgccacctc tgacttgagc gtcgattttt gtgatgctcg tcaggggggc     4560

ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc ttttgctggc     4620

cttttgctca catgt                                                      4635


<210>  40
<211>  4841
<212>  DNA
<213>  artificial sequence

<220>
<223>  CN1203 - The portion between L-ITR and R-ITR corresponds to 
       positions 183-2052

<400>  40
aaagcttccc ggggggatct gggccactcc ctctctgcgc gctcgctcgc tcactgaggc       60

cgggcgacca aaggtcgccc gacgcccggg ctttgcccgg gcggcctcag tgagcgagcg      120

agcgcgcaga gagggagtgg ccaactccat cactaggggt tcctggaggg gtggagtcgt      180

gacctaggac gcgtataggt accgagctct atgcactcac agtggtttgg catgcatctg      240

gtgaattttt tttaacgaaa aattagtgtt ggtttcgatg tatggtagca ttctccctaa      300

cgtaatttga ataattcagc aaagccccac taccagctgt acttctgcag cctcttccat      360

tcttttcagc attataattt tggttaattt tcaattttag gtcctacgtc tctgcaattt      420

gtgtatgaat aacagaataa tttccctctt ttgtttcgcc tttcctgttc ctgaatctaa      480

ataaagatgg ctttttagta ttaaaagtgg aagaaaatta caggtaatta tctttgacgg      540

taaaaacgct gtaatcagcg ggctacatga aaaattactc taattatggc tgcatttaag      600

agaatggaaa aaaaccttct tgtggataaa aaccttaaat tgtccccaat gtctgcttca      660

aattggatgg cactgcagct ggaggctttg ttcagaattg atcctgggga gctacgaacc      720

caaagtttca cagtagggag ctcgggctgg gcataaaagt cagggcagag ccatctattg      780

cttacatttg cttctgggat ccagatcttt cgaagctagc gctaccggtc gccaccatgg      840

tgagcaaggg cgaggagctg ttcaccgggg tggtgcccat cctggtcgag ctggacggcg      900

acgtaaacgg ccacaagttc agcgtgtccg gcgagggcga gggcgatgcc acctacggca      960

agctgaccct gaagctgatc tgcaccaccg gcaagctgcc cgtgccctgg cccaccctcg     1020

tgaccaccct gggctacggc gtgcagtgct tcgcccgcta ccccgaccac atgaagcagc     1080

acgacttctt caagtccgcc atgcccgaag gctacgtcca ggagcgcacc atcttcttca     1140

aggacgacgg caactacaag acccgcgccg aggtgaagtt cgagggcgac accctggtga     1200

accgcatcga gctgaagggc atcgacttca aggaggacgg caacatcctg gggcacaagc     1260

tggagtacaa ctacaacagc cacaacgtct atatcaccgc cgacaagcag aagaacggca     1320

tcaaggccaa cttcaagatc cgccacaaca tcgaggacgg cggcgtgcag ctcgccgacc     1380

actaccagca gaacaccccc atcggcgacg gccccgtgct gctgcccgac aaccactacc     1440

tgagctacca gtccaagctg agcaaagacc ccaacgagaa gcgcgatcac atggtcctgc     1500

tggagttcgt gaccgccgcc gggatcactc tcggcatgga cgagctgtac aagtaagtcg     1560

acggcgcgcc gcggccgcga attcgatatc ataatcaacc tctggattac aaaatttgtg     1620

aaagattgac tggtattctt aactatgttg ctccttttac gctatgtgga tacgctgctt     1680

taatgccttt gtatcatgct attgcttccc gtatggcttt cattttctcc tccttgtata     1740

aatcctggtt agttcttgcc acggcggaac tcatcgccgc ctgccttgcc cgctgctgga     1800

caggggctcg gctgttgggc actgacaatt ccgtggctcg agcgactgtg ccttctagtt     1860

gccagccatc tgttgtttgc ccctcccccg tgccttcctt gaccctggaa ggtgccactc     1920

ccactgtcct ttcctaataa aatgaggaaa ttgcatcgca ttgtctgagt aggtgtcatt     1980

ctattctggg gggtggggtg gggcaggaca gcaaggggga ggattgggaa gacaatagca     2040

ggcatgacta gtccactccc tctctgcgcg ctcgctcgct cactgaggcc gggcgaccaa     2100

aggtcgcccg acgcccgggc tttgcccggg cggcctcagt gagcgagcga gcgcgcagag     2160

agggacagat ccgggcccgc atgcgtcgac aattcactgg ccgtcgtttt acaacgtcgt     2220

gactgggaaa accctggcgt tacccaactt aatcgccttg cagcacatcc ccctttcgcc     2280

agctggcgta atagcgaaga ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg     2340

aatggcgaat ggcgcctgat gcggtatttt ctccttacgc atctgtgcgg tatttcacac     2400

cgcatatggt gcactctcag tacaatctgc tctgatgccg catagttaag ccagccccga     2460

cacccgccaa cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgcttac     2520

agacaagctg tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg     2580

aaacgcgcga gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata     2640

ataatggttt cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt     2700

tgtttatttt tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa     2760

atgcttcaat aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt     2820

attccctttt ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa     2880

gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac     2940

agcggtaaga tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt     3000

aaagttctgc tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt     3060

cgccgcatac actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat     3120

cttacggatg gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac     3180

actgcggcca acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg     3240

cacaacatgg gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc     3300

ataccaaacg acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa     3360

ctattaactg gcgaactact tactctagct tcccggcaac aattaataga ctggatggag     3420

gcggataaag ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct     3480

gataaatctg gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat     3540

ggtaagccct cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa     3600

cgaaatagac agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac     3660

caagtttact catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc     3720

taggtgaaga tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc     3780

cactgagcgt cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg     3840

cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg     3900

gatcaagagc taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca     3960

aatactgttc ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg     4020

cctacatacc tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg     4080

tgtcttaccg ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga     4140

acggggggtt cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac     4200

ctacagcgtg agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat     4260

ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc     4320

tggtatcttt atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga     4380

tgctcgtcag gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc     4440

ctggcctttt gctggccttt tgctcacatg ttctttcctg cgttatcccc tgattctgtg     4500

gataaccgta ttaccgcctt tgagtgagct gataccgctc gccgcagccg aacgaccgag     4560

cgcagcgagt cagtgagcga ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc     4620

gcgcgttggc cgattcatta atgcagctgg cacgacaggt ttcccgactg gaaagcgggc     4680

agtgagcgca acgcaattaa tgtgagttag ctcactcatt aggcacccca ggctttacac     4740

tttatgcttc cggctcgtat gttgtgtgga attgtgagcg gataacaatt tcacacagga     4800

aacagctatg accatgatta cgccaagctc tcgagatcta g                         4841


<210>  41
<211>  4680
<212>  DNA
<213>  artificial sequence

<220>
<223>  CN1180 - The portion between L-ITR and R-ITR corresponds to 
       positions 183-1891

<400>  41
aaagcttccc ggggggatct gggccactcc ctctctgcgc gctcgctcgc tcactgaggc       60

cgggcgacca aaggtcgccc gacgcccggg ctttgcccgg gcggcctcag tgagcgagcg      120

agcgcgcaga gagggagtgg ccaactccat cactaggggt tcctggaggg gtggagtcgt      180

gacctaggac gcgtcagctg caaacccaag agggtcagca tcatttcact gtattctctt      240

cttgattaca agccgggccc atcaaacaca acataattac agtaatttca ggtttattta      300

ttctaatgca gtttccccat ctctctggta attatgagca attttttcgc ccagggaatc      360

tttttgcatt aacaaaagag ataacgcact gaaagccaaa tttgctgtgc attgagaaaa      420

ggaaaaaaaa aaatcaaata ggtgcgagct gccatctctg caattctctg gtaccggagc      480

cggcaaattg cttgcaggtg tatggagcaa gcttgtcaat ggccaggcct ccaaattagc      540

aaatgcacag cagcaaagta atgaagacag gagctcgggc tgggcataaa agtcagggca      600

gagccatcta ttgcttacat ttgcttctgg gatccagatc tttcgaagct agcgctaccg      660

gtcgccacca tggtgagcaa gggcgaggag ctgttcaccg gggtggtgcc catcctggtc      720

gagctggacg gcgacgtaaa cggccacaag ttcagcgtgt ccggcgaggg cgagggcgat      780

gccacctacg gcaagctgac cctgaagctg atctgcacca ccggcaagct gcccgtgccc      840

tggcccaccc tcgtgaccac cctgggctac ggcgtgcagt gcttcgcccg ctaccccgac      900

cacatgaagc agcacgactt cttcaagtcc gccatgcccg aaggctacgt ccaggagcgc      960

accatcttct tcaaggacga cggcaactac aagacccgcg ccgaggtgaa gttcgagggc     1020

gacaccctgg tgaaccgcat cgagctgaag ggcatcgact tcaaggagga cggcaacatc     1080

ctggggcaca agctggagta caactacaac agccacaacg tctatatcac cgccgacaag     1140

cagaagaacg gcatcaaggc caacttcaag atccgccaca acatcgagga cggcggcgtg     1200

cagctcgccg accactacca gcagaacacc cccatcggcg acggccccgt gctgctgccc     1260

gacaaccact acctgagcta ccagtccaag ctgagcaaag accccaacga gaagcgcgat     1320

cacatggtcc tgctggagtt cgtgaccgcc gccgggatca ctctcggcat ggacgagctg     1380

tacaagtaag tcgacggcgc gccgcggccg cgaattcgat atcataatca acctctggat     1440

tacaaaattt gtgaaagatt gactggtatt cttaactatg ttgctccttt tacgctatgt     1500

ggatacgctg ctttaatgcc tttgtatcat gctattgctt cccgtatggc tttcattttc     1560

tcctccttgt ataaatcctg gttagttctt gccacggcgg aactcatcgc cgcctgcctt     1620

gcccgctgct ggacaggggc tcggctgttg ggcactgaca attccgtggc tcgagcgact     1680

gtgccttcta gttgccagcc atctgttgtt tgcccctccc ccgtgccttc cttgaccctg     1740

gaaggtgcca ctcccactgt cctttcctaa taaaatgagg aaattgcatc gcattgtctg     1800

agtaggtgtc attctattct ggggggtggg gtggggcagg acagcaaggg ggaggattgg     1860

gaagacaata gcaggcatga ctagtgcatg cccactccct ctctgcgcgc tcgctcgctc     1920

actgaggccg ggcgaccaaa ggtcgcccga cgcccgggct ttgcccgggc ggcctcagtg     1980

agcgagcgag cgcgcagaga gggacagatc cgggcccgca tgcgtcgaca attcactggc     2040

cgtcgtttta caacgtcgtg actgggaaaa ccctggcgtt acccaactta atcgccttgc     2100

agcacatccc cctttcgcca gctggcgtaa tagcgaagag gcccgcaccg atcgcccttc     2160

ccaacagttg cgcagcctga atggcgaatg gcgcctgatg cggtattttc tccttacgca     2220

tctgtgcggt atttcacacc gcatatggtg cactctcagt acaatctgct ctgatgccgc     2280

atagttaagc cagccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct     2340

gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag     2400

gttttcaccg tcatcaccga aacgcgcgag acgaaagggc ctcgtgatac gcctattttt     2460

ataggttaat gtcatgataa taatggtttc ttagacgtca ggtggcactt ttcggggaaa     2520

tgtgcgcgga acccctattt gtttattttt ctaaatacat tcaaatatgt atccgctcat     2580

gagacaataa ccctgataaa tgcttcaata atattgaaaa aggaagagta tgagtattca     2640

acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg tttttgctca     2700

cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac gagtgggtta     2760

catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg aagaacgttt     2820

tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc gtattgacgc     2880

cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg ttgagtactc     2940

accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat gcagtgctgc     3000

cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg gaggaccgaa     3060

ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg atcgttggga     3120

accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc ctgtagcaat     3180

ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt cccggcaaca     3240

attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct cggcccttcc     3300

ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc gcggtatcat     3360

tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca cgacggggag     3420

tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct cactgattaa     3480

gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt taaaacttca     3540

tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga ccaaaatccc     3600

ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca aaggatcttc     3660

ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac caccgctacc     3720

agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg taactggctt     3780

cagcagagcg cagataccaa atactgttct tctagtgtag ccgtagttag gccaccactt     3840

caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac cagtggctgc     3900

tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt taccggataa     3960

ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg agcgaacgac     4020

ctacaccgaa ctgagatacc tacagcgtga gctatgagaa agcgccacgc ttcccgaagg     4080

gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc gcacgaggga     4140

gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc acctctgact     4200

tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa acgccagcaa     4260

cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt tctttcctgc     4320

gttatcccct gattctgtgg ataaccgtat taccgccttt gagtgagctg ataccgctcg     4380

ccgcagccga acgaccgagc gcagcgagtc agtgagcgag gaagcggaag agcgcccaat     4440

acgcaaaccg cctctccccg cgcgttggcc gattcattaa tgcagctggc acgacaggtt     4500

tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat gtgagttagc tcactcatta     4560

ggcaccccag gctttacact ttatgcttcc ggctcgtatg ttgtgtggaa ttgtgagcgg     4620

ataacaattt cacacaggaa acagctatga ccatgattac gccaagctct cgagatctag     4680


<210>  42
<211>  4761
<212>  DNA
<213>  artificial sequence

<220>
<223>  CN2001 - The portion between L-ITR and R-ITR corresponds to 
       positions 142-2023

<400>  42
cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc       60

gggcgacctt tggtcgcccg gcctcagtga gcgagcgagc gcgcagagag ggagtggcca      120

actccatcac taggggttcc tgcggccgca cgcgtggtac cctaaataaa gatggctttt      180

tagtattaaa agtggaagaa aattacaggt aattatcttt gacggtaaaa acgctgtaat      240

cagcgggcta catgaaaaat tactctaatt atggctgcat ttaagagaat ggctaaataa      300

agatggcttt ttagtattaa aagtggaaga aaattacagg taattatctt tgacggtaaa      360

aacgctgtaa tcagcgggct acatgaaaaa ttactctaat tatggctgca tttaagagaa      420

tggctaaata aagatggctt tttagtatta aaagtggaag aaaattacag gtaattatct      480

ttgacggtaa aaacgctgta atcagcgggc tacatgaaaa attactctaa ttatggctgc      540

atttaagaga atggagctcg ggctgggcat aaaagtcagg gcagagccat ctattgctta      600

catttgcttc tgggatccag atctttcgaa gctagcgcta ccaccatgga aaacaaccca      660

gccgaacagc aagtcccacc cctcgtggcg ctcgcccaac gcatagtatt tcacaaggcg      720

tttacgccga cgataatcac cctcatcatt attaatgcga tcattgtggg actcgagaca      780

tacccaacgg tttaccaggg ttacaatgat tggttctatg ctgccgacct tgctttgttg      840

tggatattca ctattgaaat cacgctccga ttcatcgccg cccgaccgac gaagagtttc      900

ttcaagtcta gctggaactg gtttgatctg cttatcgtat tggcgggcca cgtcttcgct      960

ggcgcccatt ttgttacggt gcttaggatc ctccgcgtcc tgagggtcct cagagctatc     1020

tcagtcatac ccagtctccg gcggctggtt gacgcacttt tgatgacaat cccagcactc     1080

ggtaacatca tgatactgat ggggattatt ttttacatat tcgcggttat cgggacgatg     1140

ctctttgcat cagtagcgcc agaatacttt ggcaatttgc agctgtctct gcttacactg     1200

ttccaagtgg ttacgctgga aagttgggct agtggggtta tgcgacctat ttttgccgaa     1260

gtctggtggt cttggatcta ttttgtaatc tttattctcg tgggaacttt catagtattt     1320

aaccttttca ttggcgtcat cgtgaacaat gtggaaaaag ctaacgaaga ggaactgaaa     1380

agcgaactgg atgataaaga ggctgataca aaagaagaac tggcatcatt gcgaaacgag     1440

gtggcagaaa tgaaggatct cataaaacag atgcataaac agcaaacaaa aaagggttaa     1500

tgacggcgcg ccgcggccgc gaattcgata tcataatcaa cctctggatt acaaaatttg     1560

tgaaagattg actggtattc ttaactatgt tgctcctttt acgctatgtg gatacgctgc     1620

tttaatgcct ttgtatcatg ctattgcttc ccgtatggct ttcattttct cctccttgta     1680

taaatcctgg ttagttcttg ccacggcgga actcatcgcc gcctgccttg cccgctgctg     1740

gacaggggct cggctgttgg gcactgacaa ttccgtggct cgagagatct tcgactgtgc     1800

cttctagttg ccagccatct gttgtttgcc cctcccccgt gccttccttg accctggaag     1860

gtgccactcc cactgtcctt tcctaataaa atgaggaaat tgcatcgcat tgtctgagta     1920

ggtgtcattc tattctgggg ggtggggtgg ggcaggacag caagggggag gattgggaag     1980

acaatagcag gcatgagatc tcacgtgcgg accgagcggc cgcaggaacc cctagtgatg     2040

gagttggcca ctccctctct gcgcgctcgc tcgctcactg aggccgggcg accaaaggtc     2100

gcccgacgcc cgggctttgc ccgggcggcc tcagtgagcg agcgagcgcg cagctgcctg     2160

caggggcgcc tgatgcggta ttttctcctt acgcatctgt gcggtatttc acaccgcata     2220

cgtcaaagca accatagtac gcgccctgta gcggcgcatt aagcgcggcg ggtgtggtgg     2280

ttacgcgcag cgtgaccgct acacttgcca gcgccctagc gcccgctcct ttcgctttct     2340

tcccttcctt tctcgccacg ttcgccggct ttccccgtca agctctaaat cgggggctcc     2400

ctttagggtt ccgatttagt gctttacggc acctcgaccc caaaaaactt gatttgggtg     2460

atggttcacg tagtgggcca tcgccctgat agacggtttt tcgccctttg acgttggagt     2520

ccacgttctt taatagtgga ctcttgttcc aaactggaac aacactcaac cctatctcgg     2580

gctattcttt tgatttataa gggattttgc cgatttcggc ctattggtta aaaaatgagc     2640

tgatttaaca aaaatttaac gcgaatttta acaaaatatt aacgtttaca attttatggt     2700

gcactctcag tacaatctgc tctgatgccg catagttaag ccagccccga cacccgccaa     2760

cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg     2820

tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga     2880

gacgaaaggg cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt     2940

cttagacgtc aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt     3000

tctaaataca ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat     3060

aatattgaaa aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt     3120

ttgcggcatt ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg     3180

ctgaagatca gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga     3240

tccttgagag ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc     3300

tatgtggcgc ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac     3360

actattctca gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg     3420

gcatgacagt aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca     3480

acttacttct gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg     3540

gggatcatgt aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg     3600

acgagcgtga caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg     3660

gcgaactact tactctagct tcccggcaac aattaataga ctggatggag gcggataaag     3720

ttgcaggacc acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg     3780

gagccggtga gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct     3840

cccgtatcgt agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac     3900

agatcgctga gataggtgcc tcactgatta agcattggta actgtcagac caagtttact     3960

catatatact ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga     4020

tcctttttga taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt     4080

cagaccccgt agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct     4140

gctgcttgca aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc     4200

taccaactct ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc     4260

ttctagtgta gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc     4320

tcgctctgct aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg     4380

ggttggactc aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt     4440

cgtgcacaca gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg     4500

agctatgaga aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg     4560

gcagggtcgg aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt     4620

atagtcctgt cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag     4680

gggggcggag cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt     4740

gctggccttt tgctcacatg t                                               4761


<210>  43
<211>  4732
<212>  DNA
<213>  artificial sequence

<220>
<223>  CN2002 - The portion between L-ITR and R-ITR corresponds to 
       positions 142-1993

<400>  43
cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc       60

gggcgacctt tggtcgcccg gcctcagtga gcgagcgagc gcgcagagag ggagtggcca      120

actccatcac taggggttcc tgcggccgca cgcgtggtac cctaaataaa gatggctttt      180

tagtattaaa agtggaagaa aattacaggt aattatcttt gacggtaaaa acgctgtaat      240

cagcgggcta catgaaaaat tactctaatt atggctgcat ttaagagaat ggctaaataa      300

agatggcttt ttagtattaa aagtggaaga aaattacagg taattatctt tgacggtaaa      360

aacgctgtaa tcagcgggct acatgaaaaa ttactctaat tatggctgca tttaagagaa      420

tggctaaata aagatggctt tttagtatta aaagtggaag aaaattacag gtaattatct      480

ttgacggtaa aaacgctgta atcagcgggc tacatgaaaa attactctaa ttatggctgc      540

atttaagaga atggagctcg ggctgggcat aaaagtcagg gcagagccat ctattgctta      600

catttgcttc tgggatccag atctttcgaa gctagcgcta ccaccatgag ccggaagatc      660

agagatctta tcgaatctaa gagatttcag aatgttatta ccgcgataat cgtactcaac      720

ggggcggtgc tcggtctcct caccgatacc acattgagcg cttctagcca gaacctgctc      780

gaaagggttg accaactgtg cctgacaatt tttatcgtgg aaattagctt gaaaatttac      840

gcctacggcg ttcgcggttt tttccggagc ggttggaatc tttttgactt cgttatcgtt      900

gccatcgcgc tcatgcccgc acagggttct ttgtctgtgt tgaggacatt ccgaatattt      960

cgcgtgatgc gcttggtatc cgtgatccct acgatgcgcc gcgtcgtaca aggaatgttg     1020

ctggctctcc ccggcgtcgg gagcgttgct gccctcctta ccgtggtatt ttacatagcg     1080

gcggttatgg ctactaatct ttacggagct accttcccgg agtggttcgg ggatttgtcc     1140

aagagcctct atacattgtt tcaagttatg accctggagt cctggtctat gggcattgtc     1200

cggcccgtaa tgaacgtaca cccaaatgcg tgggtgtttt tcattccatt catcatgctg     1260

actaccttta ccgtgctgaa cttgttcatt gggattatcg tggatgcgat ggccatcact     1320

aaggagcaag aagaagaggc taaaactggc caccaccaag agccaatttc tcaaaccctc     1380

ttgcatctcg gggaccgact ggaccgcatt gagaagcaac tcgcgcagaa caatgagctg     1440

ttgcagcgac agcaacctca aaaaaaataa tgacggcgcg ccgcggccgc gaattcgata     1500

tcataatcaa cctctggatt acaaaatttg tgaaagattg actggtattc ttaactatgt     1560

tgctcctttt acgctatgtg gatacgctgc tttaatgcct ttgtatcatg ctattgcttc     1620

ccgtatggct ttcattttct cctccttgta taaatcctgg ttagttcttg ccacggcgga     1680

actcatcgcc gcctgccttg cccgctgctg gacaggggct cggctgttgg gcactgacaa     1740

ttccgtggct cgagagatct tcgactgtgc cttctagttg ccagccatct gttgtttgcc     1800

cctcccccgt gccttccttg accctggaag gtgccactcc cactgtcctt tcctaataaa     1860

atgaggaaat tgcatcgcat tgtctgagta ggtgtcattc tattctgggg ggtggggtgg     1920

ggcaggacag caagggggag gattgggaag acaatagcag gcatgagatc tcacgtgcgg     1980

accgagcggc cgcaggaacc cctagtgatg gagttggcca ctccctctct gcgcgctcgc     2040

tcgctcactg aggccgggcg accaaaggtc gcccgacgcc cgggctttgc ccgggcggcc     2100

tcagtgagcg agcgagcgcg cagctgcctg caggggcgcc tgatgcggta ttttctcctt     2160

acgcatctgt gcggtatttc acaccgcata cgtcaaagca accatagtac gcgccctgta     2220

gcggcgcatt aagcgcggcg ggtgtggtgg ttacgcgcag cgtgaccgct acacttgcca     2280

gcgccctagc gcccgctcct ttcgctttct tcccttcctt tctcgccacg ttcgccggct     2340

ttccccgtca agctctaaat cgggggctcc ctttagggtt ccgatttagt gctttacggc     2400

acctcgaccc caaaaaactt gatttgggtg atggttcacg tagtgggcca tcgccctgat     2460

agacggtttt tcgccctttg acgttggagt ccacgttctt taatagtgga ctcttgttcc     2520

aaactggaac aacactcaac cctatctcgg gctattcttt tgatttataa gggattttgc     2580

cgatttcggc ctattggtta aaaaatgagc tgatttaaca aaaatttaac gcgaatttta     2640

acaaaatatt aacgtttaca attttatggt gcactctcag tacaatctgc tctgatgccg     2700

catagttaag ccagccccga cacccgccaa cacccgctga cgcgccctga cgggcttgtc     2760

tgctcccggc atccgcttac agacaagctg tgaccgtctc cgggagctgc atgtgtcaga     2820

ggttttcacc gtcatcaccg aaacgcgcga gacgaaaggg cctcgtgata cgcctatttt     2880

tataggttaa tgtcatgata ataatggttt cttagacgtc aggtggcact tttcggggaa     2940

atgtgcgcgg aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca     3000

tgagacaata accctgataa atgcttcaat aatattgaaa aaggaagagt atgagtattc     3060

aacatttccg tgtcgccctt attccctttt ttgcggcatt ttgccttcct gtttttgctc     3120

acccagaaac gctggtgaaa gtaaaagatg ctgaagatca gttgggtgca cgagtgggtt     3180

acatcgaact ggatctcaac agcggtaaga tccttgagag ttttcgcccc gaagaacgtt     3240

ttccaatgat gagcactttt aaagttctgc tatgtggcgc ggtattatcc cgtattgacg     3300

ccgggcaaga gcaactcggt cgccgcatac actattctca gaatgacttg gttgagtact     3360

caccagtcac agaaaagcat cttacggatg gcatgacagt aagagaatta tgcagtgctg     3420

ccataaccat gagtgataac actgcggcca acttacttct gacaacgatc ggaggaccga     3480

aggagctaac cgcttttttg cacaacatgg gggatcatgt aactcgcctt gatcgttggg     3540

aaccggagct gaatgaagcc ataccaaacg acgagcgtga caccacgatg cctgtagcaa     3600

tggcaacaac gttgcgcaaa ctattaactg gcgaactact tactctagct tcccggcaac     3660

aattaataga ctggatggag gcggataaag ttgcaggacc acttctgcgc tcggcccttc     3720

cggctggctg gtttattgct gataaatctg gagccggtga gcgtgggtct cgcggtatca     3780

ttgcagcact ggggccagat ggtaagccct cccgtatcgt agttatctac acgacgggga     3840

gtcaggcaac tatggatgaa cgaaatagac agatcgctga gataggtgcc tcactgatta     3900

agcattggta actgtcagac caagtttact catatatact ttagattgat ttaaaacttc     3960

atttttaatt taaaaggatc taggtgaaga tcctttttga taatctcatg accaaaatcc     4020

cttaacgtga gttttcgttc cactgagcgt cagaccccgt agaaaagatc aaaggatctt     4080

cttgagatcc tttttttctg cgcgtaatct gctgcttgca aacaaaaaaa ccaccgctac     4140

cagcggtggt ttgtttgccg gatcaagagc taccaactct ttttccgaag gtaactggct     4200

tcagcagagc gcagatacca aatactgtcc ttctagtgta gccgtagtta ggccaccact     4260

tcaagaactc tgtagcaccg cctacatacc tcgctctgct aatcctgtta ccagtggctg     4320

ctgccagtgg cgataagtcg tgtcttaccg ggttggactc aagacgatag ttaccggata     4380

aggcgcagcg gtcgggctga acggggggtt cgtgcacaca gcccagcttg gagcgaacga     4440

cctacaccga actgagatac ctacagcgtg agctatgaga aagcgccacg cttcccgaag     4500

ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg aacaggagag cgcacgaggg     4560

agcttccagg gggaaacgcc tggtatcttt atagtcctgt cgggtttcgc cacctctgac     4620

ttgagcgtcg atttttgtga tgctcgtcag gggggcggag cctatggaaa aacgccagca     4680

acgcggcctt tttacggttc ctggcctttt gctggccttt tgctcacatg ts             4732


<210>  44
<211>  4794
<212>  DNA
<213>  artificial sequence

<220>
<223>  CN2003 - The portion between L-ITR and R-ITR corresponds to 
       positions 142-2056

<400>  44
cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc       60

gggcgacctt tggtcgcccg gcctcagtga gcgagcgagc gcgcagagag ggagtggcca      120

actccatcac taggggttcc tgcggccgca cgcgtggtac cctaaataaa gatggctttt      180

tagtattaaa agtggaagaa aattacaggt aattatcttt gacggtaaaa acgctgtaat      240

cagcgggcta catgaaaaat tactctaatt atggctgcat ttaagagaat ggctaaataa      300

agatggcttt ttagtattaa aagtggaaga aaattacagg taattatctt tgacggtaaa      360

aacgctgtaa tcagcgggct acatgaaaaa ttactctaat tatggctgca tttaagagaa      420

tggctaaata aagatggctt tttagtatta aaagtggaag aaaattacag gtaattatct      480

ttgacggtaa aaacgctgta atcagcgggc tacatgaaaa attactctaa ttatggctgc      540

atttaagaga atggagctcg ggctgggcat aaaagtcagg gcagagccat ctattgctta      600

catttgcttc tgggatccag atctttcgaa gctagcgcta ccaccatgtc tacgtccctt      660

ttgaatgcgc ctaccggcct tcaagctaga gtcattaatc tcgtcgaaca aaactggttt      720

ggacacttta tactgactct catactcatt aatgctgtgc agcttggaat ggaaactagc      780

gccagcctca tggcacaata tggcgcgctg cttatgtcct tgaataaggt ccttctctct      840

gtgttcgtgg tcgaactgct gctccggatt tatgcgtatc ggggcaagtt ttttaaggac      900

ccgtggaatg tgtttgactt cactgttatt gttattgctc tgattcctgc atctggccca      960

ttggctgtcc tccgctccct ccgagttctc cgcgtcttga gggttctgac gattgtcccc     1020

agcatgaaaa gagtagtgtc agcactgctt gggagcttgc ccgggttggc ctccattgca     1080

accgtgcttc tgttgatcta ttacgttttc gctgtgatcg ccactaaaat tttcggggat     1140

gcttttccgg aatggttcgg gacgatagcg gactccttct ataccctttt tcaaattatg     1200

accttggaaa gttggtctat ggggatctct aggccagtga tggaggtgta cccttacgct     1260

tgggtattct ttgtgccctt tattcttgtt gctactttta ccatgcttaa ccttttcatc     1320

gccatcatag tgaatactat gcagacattc tctgacgagg aacatgctct ggagcgagag     1380

caagataaac agatcttgga acaggagcag agacaaatgc acgaggaact gaaggccatt     1440

cgactcgagc ttcagcaact ccaaaccctt ttgcgaaatg cggctgggga ctcctccaat     1500

gtctccacaa agggcaatat cggctcagac taatgacggc gcgccgcggc cgcgaattcg     1560

atatcataat caacctctgg attacaaaat ttgtgaaaga ttgactggta ttcttaacta     1620

tgttgctcct tttacgctat gtggatacgc tgctttaatg cctttgtatc atgctattgc     1680

ttcccgtatg gctttcattt tctcctcctt gtataaatcc tggttagttc ttgccacggc     1740

ggaactcatc gccgcctgcc ttgcccgctg ctggacaggg gctcggctgt tgggcactga     1800

caattccgtg gctcgagaga tcttcgactg tgccttctag ttgccagcca tctgttgttt     1860

gcccctcccc cgtgccttcc ttgaccctgg aaggtgccac tcccactgtc ctttcctaat     1920

aaaatgagga aattgcatcg cattgtctga gtaggtgtca ttctattctg gggggtgggg     1980

tggggcagga cagcaagggg gaggattggg aagacaatag caggcatgag atctcacgtg     2040

cggaccgagc ggccgcagga acccctagtg atggagttgg ccactccctc tctgcgcgct     2100

cgctcgctca ctgaggccgg gcgaccaaag gtcgcccgac gcccgggctt tgcccgggcg     2160

gcctcagtga gcgagcgagc gcgcagctgc ctgcaggggc gcctgatgcg gtattttctc     2220

cttacgcatc tgtgcggtat ttcacaccgc atacgtcaaa gcaaccatag tacgcgccct     2280

gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg cagcgtgacc gctacacttg     2340

ccagcgccct agcgcccgct cctttcgctt tcttcccttc ctttctcgcc acgttcgccg     2400

gctttccccg tcaagctcta aatcgggggc tccctttagg gttccgattt agtgctttac     2460

ggcacctcga ccccaaaaaa cttgatttgg gtgatggttc acgtagtggg ccatcgccct     2520

gatagacggt ttttcgccct ttgacgttgg agtccacgtt ctttaatagt ggactcttgt     2580

tccaaactgg aacaacactc aaccctatct cgggctattc ttttgattta taagggattt     2640

tgccgatttc ggcctattgg ttaaaaaatg agctgattta acaaaaattt aacgcgaatt     2700

ttaacaaaat attaacgttt acaattttat ggtgcactct cagtacaatc tgctctgatg     2760

ccgcatagtt aagccagccc cgacacccgc caacacccgc tgacgcgccc tgacgggctt     2820

gtctgctccc ggcatccgct tacagacaag ctgtgaccgt ctccgggagc tgcatgtgtc     2880

agaggttttc accgtcatca ccgaaacgcg cgagacgaaa gggcctcgtg atacgcctat     2940

ttttataggt taatgtcatg ataataatgg tttcttagac gtcaggtggc acttttcggg     3000

gaaatgtgcg cggaacccct atttgtttat ttttctaaat acattcaaat atgtatccgc     3060

tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag agtatgagta     3120

ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt cctgtttttg     3180

ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt gcacgagtgg     3240

gttacatcga actggatctc aacagcggta agatccttga gagttttcgc cccgaagaac     3300

gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta tcccgtattg     3360

acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac ttggttgagt     3420

actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa ttatgcagtg     3480

ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg atcggaggac     3540

cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc cttgatcgtt     3600

gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg atgcctgtag     3660

caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta gcttcccggc     3720

aacaattaat agactggatg gaggcggata aagttgcagg accacttctg cgctcggccc     3780

ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg tctcgcggta     3840

tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc tacacgacgg     3900

ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt gcctcactga     3960

ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt gatttaaaac     4020

ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc atgaccaaaa     4080

tcccttaacg tgagttttcg ttccactgag cgtcagaccc cgtagaaaag atcaaaggat     4140

cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa aaaccaccgc     4200

taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg aaggtaactg     4260

gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag ttaggccacc     4320

acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg ttaccagtgg     4380

ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga tagttaccgg     4440

ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc ttggagcgaa     4500

cgacctacac cgaactgaga tacctacagc gtgagctatg agaaagcgcc acgcttcccg     4560

aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga gagcgcacga     4620

gggagcttcc agggggaaac gcctggtatc tttatagtcc tgtcgggttt cgccacctct     4680

gacttgagcg tcgatttttg tgatgctcgt caggggggcg gagcctatgg aaaaacgcca     4740

gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttttgctcac atgt           4794


<210>  45
<211>  7368
<212>  DNA
<213>  artificial sequence

<220>
<223>  CN1504 - The portion between L-ITR and R-ITR corresponds to 
       positions 142-4489

<400>  45
cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc       60

gggcgacctt tggtcgcccg gcctcagtga gcgagcgagc gcgcagagag ggagtggcca      120

actccatcac taggggttcc tgcggccgca cgcgtggtac cctaaataaa gatggctttt      180

tagtattaaa agtggaagaa aattacaggt aattatcttt gacggtaaaa acgctgtaat      240

cagcgggcta catgaaaaat tactctaatt atggctgcat ttaagagaat ggctaaataa      300

agatggcttt ttagtattaa aagtggaaga aaattacagg taattatctt tgacggtaaa      360

aacgctgtaa tcagcgggct acatgaaaaa ttactctaat tatggctgca tttaagagaa      420

tggctaaata aagatggctt tttagtatta aaagtggaag aaaattacag gtaattatct      480

ttgacggtaa aaacgctgta atcagcgggc tacatgaaaa attactctaa ttatggctgc      540

atttaagaga atggagctcg ggctggtcga cacaattgga ggtaggcgtg tacggtggga      600

ggcctatata agcagagctc gtttagtgaa ccgtcagatc gcctggagga tccttcgaaa      660

agcttgctac cggtgccacc atggtcagca agggcgagga gctgttcacc ggggtggtgc      720

ccatcctggt cgagctggac ggcgacgtca atggccacaa gttcagcgtg tccggcgagg      780

gcgagggcga tgccacctac ggcaagctga ccctgaagct gatctgcacc accggcaagc      840

tgcccgtgcc ctggcccacc ctcgtgacca ccctgggcta cggcgtgcag tgcttcgccc      900

gctaccccga ccacatgaag cagcacgact tcttcaagtc cgccatgccc gaaggctacg      960

tccaggagcg caccatcttc ttcaaagacg acggcaacta caagacccgc gccgaggtga     1020

agttcgaggg cgacaccctg gtgaaccgca tcgagctgaa gggcatcgac ttcaaggagg     1080

acggcaacat cctggggcac aagctggagt acaactacaa cagccacaac gtctatatca     1140

ccgccgacaa gcagaagaac ggcatcaagg ccaacttcaa gatccgccac aacatcgagg     1200

acggcggcgt gcagctcgcc gaccactacc agcagaacac ccccatcggc gacggccccg     1260

tgctgctgcc cgacaaccac tacctgagct accagtccaa gctgagcaaa gaccccaacg     1320

agaagcgcga tcacatggtc ctgctggagt tcgtgaccgc cgccgggatc actctcggca     1380

tggacgagct gtacaaaggc agcggcgcca ccaacttcag cctgctgaag caggccggcg     1440

acgtggagga gaaccccggc cccggaacta gtggtatgga gcaaacagtg cttgtaccac     1500

caggacctga cagcttcaac ttcttcacca gagaatctct tgcggctatt gaaagacgca     1560

ttgcagaaga aaaggcaaag aatcccaaac cagacaaaaa agatgacgac gaaaatggcc     1620

caaagccaaa tagtgacttg gaagctggaa agaaccttcc atttatttat ggagacattc     1680

ctccagagat ggtgtcagag cccctggagg acctggaccc ctactatatc aataagaaaa     1740

cttttatagt attgaataaa gggaaggcca tcttccggtt cagtgccacc tctgccctgt     1800

acattttaac tcccttcaat cctcttagga aaatagctat taagattttg gtacattcat     1860

tattcagcat gctaattatg tgcactattt tgacaaactg tgtgtttatg acaatgagta     1920

accctcctga ttggacaaag aatgtagaat acaccttcac aggaatatat acttttgaat     1980

cacttataaa aattattgca aggggattct gtttagaaga ttttactttc cttcgggatc     2040

catggaactg gctcgatttc actgtcatta catttgcgta cgtcacagag tttgtggacc     2100

tgggcaatgt ctcggcattg agaacattca gagttctccg agcattgaag acgatttcag     2160

tcattccagg cctgaaaacc attgtgggag ccctgatcca gtctgtgaag aagctctcag     2220

atgtaatgat cctgactgtg ttctgtctga gcgtatttgc tctaattggg ctgcagctgt     2280

tcatgggcaa cctgaggaat aaatgtatac aatggcctcc caccaatgct tccttggagg     2340

aacatagtat agaaaagaat ataactgtga attataatgg tacacttata aatgaaactg     2400

tctttgagtt tgactggaag tcatatattc aagattcaag atatcattat ttcctggagg     2460

gttttttaga tgcactacta tgtggaaata gctctgatgc aggccaatgt ccagagggat     2520

atatgtgtgt gaaagctggt agaaatccca attatggcta cacaagcttt gataccttca     2580

gttgggcttt tttgtccttg tttcgactaa tgactcagga cttctgggaa aatctttatc     2640

aactgacatt acgtgctgct gggaaaacgt acatgatatt ttttgtattg gtcattttct     2700

tgggctcatt ctacctaata aatttgatcc tggctgtggt ggccatggcc tacgaggaac     2760

agaatcaggc caccttggaa gaagcagaac agaaagaggc cgaatttcag cagatgattg     2820

aacagcttaa aaagcaacag gaggcagctc agcaggcagc aacggcaact gcctcagaac     2880

attccagaga gcccagtgca gcaggcaggc tctcagacag ctcatctgaa gcctctaagt     2940

tgagttccaa gagtgctaag gaaagaagaa atcggaggaa gaaaagaaaa cagaaagagc     3000

agtctggtgg ggaagagaaa gatgaggatg aattccaaaa atctgaatct gaggacagca     3060

tcaggaggaa aggttttcgc ttctccattg aagggaaccg attgacatat gaaaagaggt     3120

actcctcccc acaccagtct ttgttgagca tccgtggctc cctattttca ccaaggcgaa     3180

atagcagaac aagccttttc agctttagag ggcgagcaaa ggatgtggga tctgagaacg     3240

acttcgcaga tgatgagcac agcacctttg aggataacga gagccgtaga gattccttgt     3300

ttgtgccccg acgacacgga gagagacgca acagcaacct gagtcagacc agtaggtcat     3360

cccggatgct ggcagtgttt ccagcgaatg ggaagatgca cagcactgtg gattgcaatg     3420

gtgtggtttc cttggttggt ggaccttcag ttcctacatc gcctgttgga cagcttctgc     3480

cagaggtgat aatagataag ccagctactg atgacaatgg aacaaccact gaaactgaaa     3540

tgagaaagag aaggtcaagt tctttccacg tttccatgga ctttctagaa gatccttccc     3600

aaaggcaacg agcaatgagt atagccagca ttctaacaaa tacagtagaa gaacttgaag     3660

aatccaggca gaaatgccca ccctgttggt ataaattttc caacatattc ttaatctggg     3720

actgttctcc atattggtta aaagtgaaac atgttgtcaa cctggttgtg atggacccat     3780

ttgttgacct ggccatcacc atctgtattg tcttaaatac tcttttcatg gccatggagc     3840

actatccaat gacggaccat ttcaataatg tgcttacagt aggaaacttg gttttcactg     3900

ggatctttac agcagaaatg tttctgaaaa ttattgccat ggatccttac tattatttcc     3960

aagaaggctg gaatatcttt gacggtttta ttgtgacgct tagcctggta gaacttggac     4020

tcgccaatgt ggaaggatta tctgttctcc gttcatttcg attgctgcga gttttcaagt     4080

tggcaaaatc ttggccaacg ttaaatatgc taataaagat catcggcaat tccgtggggg     4140

ctctgggaaa tttaaccctc gtcttggcca tcatcgtctt catttttgcc gtggtcggca     4200

tgcagctctt tggtaaaagc tacaaagatt gtgtctgcaa gatcgccagt gattgtcaac     4260

tcccacgctg gcacatgaat gacttcttcc actccttcct gattgtgttc cgcgtgctgt     4320

gtggggagtg gatagagacc atgtgggact gtatggaggt tgctggtcaa gccatgtgcc     4380

ttactgtctt catgatggtc atggtgattg gaaacctagt ggtcctgaat ctctttctgg     4440

ccttgcttct gagctcattt agtgcagaca accttgcagc cactgatgat gataatgaaa     4500

tgaataatct ccaaattgct gtggatagga tgcacaaagg agtagcttat gtgaaaagaa     4560

aaatatatga atttattcaa cagtccttca ttaggaaaca aaagatctca cgtgcggacc     4620

gagcggccgc aggaacccct agtgatggag ttggccactc cctctctgcg cgctcgctcg     4680

ctcactgagg ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg ggcggcctca     4740

gtgagcgagc gagcgcgcag ctgcctgcag gggcgcctga tgcggtattt tctccttacg     4800

catctgtgcg gtatttcaca ccgcatacgt caaagcaacc atagtacgcg ccctgtagcg     4860

gcgcattaag cgcggcgggt gtggtggtta cgcgcagcgt gaccgctaca cttgccagcg     4920

ccctagcgcc cgctcctttc gctttcttcc cttcctttct cgccacgttc gccggctttc     4980

cccgtcaagc tctaaatcgg gggctccctt tagggttccg atttagtgct ttacggcacc     5040

tcgaccccaa aaaacttgat ttgggtgatg gttcacgtag tgggccatcg ccctgataga     5100

cggtttttcg ccctttgacg ttggagtcca cgttctttaa tagtggactc ttgttccaaa     5160

ctggaacaac actcaaccct atctcgggct attcttttga tttataaggg attttgccga     5220

tttcggccta ttggttaaaa aatgagctga tttaacaaaa atttaacgcg aattttaaca     5280

aaatattaac gtttacaatt ttatggtgca ctctcagtac aatctgctct gatgccgcat     5340

agttaagcca gccccgacac ccgccaacac ccgctgacgc gccctgacgg gcttgtctgc     5400

tcccggcatc cgcttacaga caagctgtga ccgtctccgg gagctgcatg tgtcagaggt     5460

tttcaccgtc atcaccgaaa cgcgcgagac gaaagggcct cgtgatacgc ctatttttat     5520

aggttaatgt catgataata atggtttctt agacgtcagg tggcactttt cggggaaatg     5580

tgcgcggaac ccctatttgt ttatttttct aaatacattc aaatatgtat ccgctcatga     5640

gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg agtattcaac     5700

atttccgtgt cgcccttatt cccttttttg cggcattttg ccttcctgtt tttgctcacc     5760

cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggtgcacga gtgggttaca     5820

tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc     5880

caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcccgt attgacgccg     5940

ggcaagagca actcggtcgc cgcatacact attctcagaa tgacttggtt gagtactcac     6000

cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc agtgctgcca     6060

taaccatgag tgataacact gcggccaact tacttctgac aacgatcgga ggaccgaagg     6120

agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgccttgat cgttgggaac     6180

cggagctgaa tgaagccata ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg     6240

caacaacgtt gcgcaaacta ttaactggcg aactacttac tctagcttcc cggcaacaat     6300

taatagactg gatggaggcg gataaagttg caggaccact tctgcgctcg gcccttccgg     6360

ctggctggtt tattgctgat aaatctggag ccggtgagcg tgggtctcgc ggtatcattg     6420

cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg acggggagtc     6480

aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca ctgattaagc     6540

attggtaact gtcagaccaa gtttactcat atatacttta gattgattta aaacttcatt     6600

tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc aaaatccctt     6660

aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa ggatcttctt     6720

gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag     6780

cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta actggcttca     6840

gcagagcgca gataccaaat actgtccttc tagtgtagcc gtagttaggc caccacttca     6900

agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca gtggctgctg     6960

ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta ccggataagg     7020

cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag cgaacgacct     7080

acaccgaact gagataccta cagcgtgagc tatgagaaag cgccacgctt cccgaaggga     7140

gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc acgagggagc     7200

ttccaggggg aaacgcctgg tatctttata gtcctgtcgg gtttcgccac ctctgacttg     7260

agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac gccagcaacg     7320

cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgt                  7368


<210>  46
<211>  7044
<212>  DNA
<213>  artificial sequence

<220>
<223>  CN1512 - The portion between L-ITR and R-ITR corresponds to 
       positions 142-4165

<400>  46
cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc       60

gggcgacctt tggtcgcccg gcctcagtga gcgagcgagc gcgcagagag ggagtggcca      120

actccatcac taggggttcc tgcggccgca cgcgtatagg taccctggta gaacttggac      180

tcgccaatgt ggaaggatta tctgttctcc gttcatttcg attgctgcga gttttcaagt      240

tggcaaaatc ttggccaacg ttaaatatgc taataaagat catcggcaat tccgtggggg      300

ctctgggaaa tttaaccctc gtcttggcca tcatcgtctt catttttgcc gtggtcggca      360

tgcagctctt tggtaaaagc tacaaagatt gtgtctgcaa gatcgccagt gattgtcaac      420

tcccacgctg gcacatgaat gacttcttcc actccttcct gattgtgttc cgcgtgctgt      480

gtggggagtg gatagagacc atgtgggact gtatggaggt tgctggtcaa gccatgtgcc      540

ttactgtctt catgatggtc atggtgattg gaaacctagt ggtcctgaat ctctttctgg      600

ccttgcttct gagctcattt agtgcagaca accttgcagc cactgatgat gataatgaaa      660

tgaataatct ccaaattgct gtggatagga tgcacaaagg agtagcttat gtgaaaagaa      720

aaatatatga atttattcaa cagtccttca ttaggaaaca aaagatttta gatgaaatta      780

aaccacttga tgatctaaac aacaagaaag acagttgtat gtccaatcat acagcagaaa      840

ttgggaaaga tcttgactat cttaaagatg taaatggaac tacaagtggt ataggaactg      900

gcagcagtgt tgaatacatt attgatgaaa gtgattacat gtcattcata aacaacccca      960

gtcttactgt gactgtacca attgctgtag gagaatctga ctttgaaaat ttaaacacgg     1020

aagactttag tagtgaatcg gatctggaag aaagcaaaga gaaactgaat gaaagcagta     1080

gctcatcaga aggtagcact gtggacatcg gcgcacctgt agaagaacag cccgtagtgg     1140

aacctgaaga aactcttgaa ccagaagctt gtttcactga aggctgtgta caaagattca     1200

agtgttgtca aatcaatgtg gaagaaggca gaggaaaaca atggtggaac ctgagaagga     1260

cgtgtttccg aatagttgaa cataactggt ttgagacctt cattgttttc atgattctcc     1320

ttagtagtgg tgctctggca tttgaagata tatatattga tcagcgaaag acgattaaga     1380

cgatgttgga atatgctgac aaggttttca cttacatttt cattctggaa atgcttctaa     1440

aatgggtggc atatggctat caaacatatt tcaccaatgc ctggtgttgg ctggacttct     1500

taattgttga tgtttcattg gtcagtttaa cagcaaatgc cttgggttac tcagaacttg     1560

gagccatcaa atctctcagg acactaagag ctctgagacc tctaagagcc ttatctcgat     1620

ttgaagggat gagggtggtt gtgaatgccc ttttaggagc aattccatcc atcatgaatg     1680

tgcttctggt ttgtcttata ttctggctaa ttttcagcat catgggcgta aatttgtttg     1740

ctggcaaatt ctaccactgt attaacacca caactggtga caggtttgac atcgaagacg     1800

tgaataatca tactgattgc ctaaaactaa tagaaagaaa tgagactgct cgatggaaaa     1860

atgtgaaagt aaactttgat aatgtaggat ttgggtatct ctctttgctt caagttgcca     1920

cattcaaagg atggatggat ataatgtatg cagcagttga ttccagaaat gtggaactcc     1980

agcctaagta tgaagaaagt ctgtacatgt atctttactt tgttattttc atcatctttg     2040

ggtccttctt caccttgaac ctgtttattg gtgtcatcat agataatttc aaccagcaga     2100

aaaagaagtt tggaggtcaa gacatcttta tgacagaaga acagaagaaa tactataatg     2160

caatgaaaaa attaggatcg aaaaaaccgc aaaagcctat acctcgacca ggaaacaaat     2220

ttcaaggaat ggtctttgac ttcgtaacca gacaagtttt tgacataagc atcatgattc     2280

tcatctgtct taacatggtc acaatgatgg tggaaacaga tgaccagagt gaatatgtga     2340

ctaccatttt gtcacgcatc aatctggtgt tcattgtgct atttactgga gagtgtgtac     2400

tgaaactcat ctctctacgc cattattatt ttaccattgg atggaatatt tttgattttg     2460

tggttgtcat tctctccatt gtaggtatgt ttcttgccga gctgatagaa aagtatttcg     2520

tgtcccctac cctgttccga gtgatccgtc ttgctaggat tggccgaatc ctacgtctga     2580

tcaaaggagc aaaggggatc cgcacgctgc tctttgcttt gatgatgtcc cttcctgcgt     2640

tgtttaacat cggcctccta ctcttcctag tcatgttcat ctacgccatc tttgggatgt     2700

ccaactttgc ctatgttaag agggaagttg ggatcgatga catgttcaac tttgagacct     2760

ttggcaacag catgatctgc ctattccaaa ttacaacctc tgctggctgg gatggattgc     2820

tagcacccat tctcaacagt aagccacccg actgtgaccc taataaagtt aaccctggaa     2880

gctcagttaa gggagactgt gggaacccat ctgttggaat tttctttttt gtcagttaca     2940

tcatcatatc cttcctggtt gtggtgaaca tgtacatcgc ggtcatcctg gagaacttca     3000

gtgttgctac tgaagaaagt gcagagcctc tgagtgagga tgactttgag atgttctatg     3060

aggtttggga gaagtttgat cccgatgcaa ctcagttcat ggaatttgaa aaattatctc     3120

agtttgcagc tgcgcttgaa ccgcctctca atctgccaca accaaacaaa ctccagctca     3180

ttgccatgga tttgcccatg gtgagtggtg accggatcca ctgtcttgat atcttatttg     3240

cttttacaaa gcgggttcta ggagagagtg gagagatgga tgctctacga atacagatgg     3300

aagagcgatt catggcttcc aatccttcca aggtctccta tcagccaatc actactactt     3360

taaaacgaaa acaagaggaa gtatctgctg tcattattca gcgtgcttac agacgccacc     3420

ttttaaagcg aactgtaaaa caagcttcct ttacgtacaa taaaaacaaa atcaaaggtg     3480

gggctaatct tcttataaaa gaagacatga taattgacag aataaatgaa aactctatta     3540

cagaaaaaac tgatctgacc atgtccactg cagcttgtcc accttcctat gaccgggtga     3600

caaagccaat tgtggaaaaa catgagcaag aaggcaaaga tgaaaaagcc aaagggaaag     3660

gaggtggtgg ttcaggtggg ggcggctcag agtaccccta tgatgtccct gattatgcgg     3720

cggaataccc ctatgacgtg ccggactacg cggctgaata tccgtatgac gttcccgatt     3780

atgcggctaa gctcgaataa tgatgagaat tcatcataat caacctctgg attacaaaat     3840

ttgtgaaaga ttgactggta ttcttaacta tgttgctcct tttacgctat gtggatacgc     3900

tgctttaatg cctttgtatc atgctattgc ttcccgtatg gctttcattt tctcctcctt     3960

gtataaatcc tggttagttc ttgccacggc ggaactcatc gccgcctgcc ttgcccgctg     4020

ctggacaggg gctcggctgt tgggcactga caattccgtg gctcgagaga tcttcgactg     4080

tgccttctag ttgccagcca tctgttgttt gcccctcccc cgtgccttcc ttgaccctgg     4140

aaggtgccac tcccactgtc ctttcctaat aaaatgagga aattgcatcg cattgtctga     4200

gtaggtgtca ttctattctg gggggtgggg tggggcagga cagcaagggg gaggattggg     4260

aagacaatag caggcatgag atctcacgtg cggaccgagc ggccgcagga acccctagtg     4320

atggagttgg ccactccctc tctgcgcgct cgctcgctca ctgaggccgg gcgaccaaag     4380

gtcgcccgac gcccgggctt tgcccgggcg gcctcagtga gcgagcgagc gcgcagctgc     4440

ctgcaggggc gcctgatgcg gtattttctc cttacgcatc tgtgcggtat ttcacaccgc     4500

atacgtcaaa gcaaccatag tacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg     4560

tggttacgcg cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt     4620

tcttcccttc ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc     4680

tccctttagg gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgatttgg     4740

gtgatggttc acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg     4800

agtccacgtt ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct     4860

cgggctattc ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg     4920

agctgattta acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaattttat     4980

ggtgcactct cagtacaatc tgctctgatg ccgcatagtt aagccagccc cgacacccgc     5040

caacacccgc tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag     5100

ctgtgaccgt ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg     5160

cgagacgaaa gggcctcgtg atacgcctat ttttataggt taatgtcatg ataataatgg     5220

tttcttagac gtcaggtggc acttttcggg gaaatgtgcg cggaacccct atttgtttat     5280

ttttctaaat acattcaaat atgtatccgc tcatgagaca ataaccctga taaatgcttc     5340

aataatattg aaaaaggaag agtatgagta ttcaacattt ccgtgtcgcc cttattccct     5400

tttttgcggc attttgcctt cctgtttttg ctcacccaga aacgctggtg aaagtaaaag     5460

atgctgaaga tcagttgggt gcacgagtgg gttacatcga actggatctc aacagcggta     5520

agatccttga gagttttcgc cccgaagaac gttttccaat gatgagcact tttaaagttc     5580

tgctatgtgg cgcggtatta tcccgtattg acgccgggca agagcaactc ggtcgccgca     5640

tacactattc tcagaatgac ttggttgagt actcaccagt cacagaaaag catcttacgg     5700

atggcatgac agtaagagaa ttatgcagtg ctgccataac catgagtgat aacactgcgg     5760

ccaacttact tctgacaacg atcggaggac cgaaggagct aaccgctttt ttgcacaaca     5820

tgggggatca tgtaactcgc cttgatcgtt gggaaccgga gctgaatgaa gccataccaa     5880

acgacgagcg tgacaccacg atgcctgtag caatggcaac aacgttgcgc aaactattaa     5940

ctggcgaact acttactcta gcttcccggc aacaattaat agactggatg gaggcggata     6000

aagttgcagg accacttctg cgctcggccc ttccggctgg ctggtttatt gctgataaat     6060

ctggagccgg tgagcgtggg tctcgcggta tcattgcagc actggggcca gatggtaagc     6120

cctcccgtat cgtagttatc tacacgacgg ggagtcaggc aactatggat gaacgaaata     6180

gacagatcgc tgagataggt gcctcactga ttaagcattg gtaactgtca gaccaagttt     6240

actcatatat actttagatt gatttaaaac ttcattttta atttaaaagg atctaggtga     6300

agatcctttt tgataatctc atgaccaaaa tcccttaacg tgagttttcg ttccactgag     6360

cgtcagaccc cgtagaaaag atcaaaggat cttcttgaga tccttttttt ctgcgcgtaa     6420

tctgctgctt gcaaacaaaa aaaccaccgc taccagcggt ggtttgtttg ccggatcaag     6480

agctaccaac tctttttccg aaggtaactg gcttcagcag agcgcagata ccaaatactg     6540

tccttctagt gtagccgtag ttaggccacc acttcaagaa ctctgtagca ccgcctacat     6600

acctcgctct gctaatcctg ttaccagtgg ctgctgccag tggcgataag tcgtgtctta     6660

ccgggttgga ctcaagacga tagttaccgg ataaggcgca gcggtcgggc tgaacggggg     6720

gttcgtgcac acagcccagc ttggagcgaa cgacctacac cgaactgaga tacctacagc     6780

gtgagctatg agaaagcgcc acgcttcccg aagggagaaa ggcggacagg tatccggtaa     6840

gcggcagggt cggaacagga gagcgcacga gggagcttcc agggggaaac gcctggtatc     6900

tttatagtcc tgtcgggttt cgccacctct gacttgagcg tcgatttttg tgatgctcgt     6960

caggggggcg gagcctatgg aaaaacgcca gcaacgcggc ctttttacgg ttcctggcct     7020

tttgctggcc ttttgctcac atgt                                            7044


<210>  47
<211>  6530
<212>  DNA
<213>  artificial sequence

<220>
<223>  CN2004 - The portion between L-ITR and R-ITR corresponds to 
       positions 142-3792

<400>  47
cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc       60

gggcgacctt tggtcgcccg gcctcagtga gcgagcgagc gcgcagagag ggagtggcca      120

actccatcac taggggttcc tgcggccgca cgcgtggtac cctaaataaa gatggctttt      180

tagtattaaa agtggaagaa aattacaggt aattatcttt gacggtaaaa acgctgtaat      240

cagcgggcta catgaaaaat tactctaatt atggctgcat ttaagagaat ggctaaataa      300

agatggcttt ttagtattaa aagtggaaga aaattacagg taattatctt tgacggtaaa      360

aacgctgtaa tcagcgggct acatgaaaaa ttactctaat tatggctgca tttaagagaa      420

tggctaaata aagatggctt tttagtatta aaagtggaag aaaattacag gtaattatct      480

ttgacggtaa aaacgctgta atcagcgggc tacatgaaaa attactctaa ttatggctgc      540

atttaagaga atggagctcg ggctgggcat aaaagtcagg gcagagccat ctattgctta      600

catttgcttc tgggatccag atctttcgaa gctagcgcta atggagcaaa cagtgcttgt      660

accaccagga cctgacagct tcaacttctt caccagagaa tctcttgcgg ctattgaaag      720

acgcattgca gaagaaaagg caaagaatcc caaaccagac aaaaaagatg acgacgaaaa      780

tggcccaaag ccaaatagtg acttggaagc tggaaagaac cttccattta tttatggaga      840

cattcctcca gagatggtgt cagagcccct ggaggacctg gacccctact atatcaataa      900

gaaaactttt atagtattga ataaagggaa ggccatcttc cggttcagtg ccacctctgc      960

cctgtacatt ttaactccct tcaatcctct taggaaaata gctattaaga ttttggtaca     1020

ttcattattc agcatgctaa ttatgtgcac tattttgaca aactgtgtgt ttatgacaat     1080

gagtaaccct cctgattgga caaagaatgt agaatacacc ttcacaggaa tatatacttt     1140

tgaatcactt ataaaaatta ttgcaagggg attctgttta gaagatttta ctttccttcg     1200

ggatccatgg aactggctcg atttcactgt cattacattt gcgtacgtca cagagtttgt     1260

ggacctgggc aatgtctcgg cattgagaac attcagagtt ctccgagcat tgaagacgat     1320

ttcagtcatt ccaggcctga aaaccattgt gggagccctg atccagtctg tgaagaagct     1380

ctcagatgta atgatcctga ctgtgttctg tctgagcgta tttgctctaa ttgggctgca     1440

gctgttcatg ggcaacctga ggaataaatg tatacaatgg cctcccacca atgcttcctt     1500

ggaggaacat agtatagaaa agaatataac tgtgaattat aatggtacac ttataaatga     1560

aactgtcttt gagtttgact ggaagtcata tattcaagat tcaagatatc attatttcct     1620

ggagggtttt ttagatgcac tactatgtgg aaatagctct gatgcaggcc aatgtccaga     1680

gggatatatg tgtgtgaaag ctggtagaaa tcccaattat ggctacacaa gctttgatac     1740

cttcagttgg gcttttttgt ccttgtttcg actaatgact caggacttct gggaaaatct     1800

ttatcaactg acattacgtg ctgctgggaa aacgtacatg atattttttg tattggtcat     1860

tttcttgggc tcattctacc taataaattt gatcctggct gtggtggcca tggcctacga     1920

ggaacagaat caggccacct tggaagaagc agaacagaaa gaggccgaat ttcagcagat     1980

gattgaacag cttaaaaagc aacaggaggc agctcagcag gcagcaacgg caactgcctc     2040

agaacattcc agagagccca gtgcagcagg caggctctca gacagctcat ctgaagcctc     2100

taagttgagt tccaagagtg ctaaggaaag aagaaatcgg aggaagaaaa gaaaacagaa     2160

agagcagtct ggtggggaag agaaagatga ggatgaattc caaaaatctg aatctgagga     2220

cagcatcagg aggaaaggtt ttcgcttctc cattgaaggg aaccgattga catatgaaaa     2280

gaggtactcc tccccacacc agtctttgtt gagcatccgt ggctccctat tttcaccaag     2340

gcgaaatagc agaacaagcc ttttcagctt tagagggcga gcaaaggatg tgggatctga     2400

gaacgacttc gcagatgatg agcacagcac ctttgaggat aacgagagcc gtagagattc     2460

cttgtttgtg ccccgacgac acggagagag acgcaacagc aacctgagtc agaccagtag     2520

gtcatcccgg atgctggcag tgtttccagc gaatgggaag atgcacagca ctgtggattg     2580

caatggtgtg gtttccttgg ttggtggacc ttcagttcct acatcgcctg ttggacagct     2640

tctgccagag gtgataatag ataagccagc tactgatgac aatggaacaa ccactgaaac     2700

tgaaatgaga aagagaaggt caagttcttt ccacgtttcc atggactttc tagaagatcc     2760

ttcccaaagg caacgagcaa tgagtatagc cagcattcta acaaatacag tagaagaact     2820

tgaagaatcc aggcagaaat gcccaccctg ttggtataaa ttttccaaca tattcttaat     2880

ctgggactgt tctccatatt ggttaaaagt gaaacatgtt gtcaacctgg ttgtgatgga     2940

cccatttgtt gacctggcca tcaccatctg tattgtctta aatactcttt tcatggccat     3000

ggagcactat ccaatgacgg accatttcaa taatgtgctt acagtaggaa acttggtttt     3060

cactgggatc tttacagcag aaatgtttct gaaaattatt gccatggatc cttactatta     3120

tttccaagaa ggctggaata tctttgacgg ttttattgtg acgcttagcc tggtagaact     3180

tggactcgcc aatgtggaag gattatctgt tctccgttca tttcgattgc tgcgagtttt     3240

caagttggca aaatcttggc caacgttaaa tatgctaata aagatcatcg gcaattccgt     3300

gggggctctg ggaaatttaa ccctcgtctt ggccatcatc gtcttcattt ttgccgtggt     3360

cggcatgcag ctctttggta aaagctacaa agattgtgtc tgcaagatcg ccagtgattg     3420

tcaactccca cgctggcaca tgaatgactt cttccactcc ttcctgattg tgttccgcgt     3480

gctgtgtggg gagtggatag agaccatgtg ggactgtatg gaggttgctg gtcaagccat     3540

gtgccttact gtcttcatga tggtcatggt gattggaaac ctagtggtcc tgaatctctt     3600

tctggccttg cttctgagct catttagtgc agacaacctt gcagccactg atgatgataa     3660

tgaaatgaat aatctccaaa ttgctgtgga taggatgcac aaaggagtag cttatgtgaa     3720

aagaaaaata tatgaattta ttcaacagtc cttcattagg aaacaaaaga tctgtgcgga     3780

ccgagcggcc gcaggaaccc ctagtgatgg agttggccac tccctctctg cgcgctcgct     3840

cgctcactga ggccgggcga ccaaaggtcg cccgacgccc gggctttgcc cgggcggcct     3900

cagtgagcga gcgagcgcgc agctgcctgc aggggcgcct gatgcggtat tttctcctta     3960

cgcatctgtg cggtatttca caccgcatac gtcaaagcaa ccatagtacg cgccctgtag     4020

cggcgcatta agcgcggcgg gtgtggtggt tacgcgcagc gtgaccgcta cacttgccag     4080

cgccctagcg cccgctcctt tcgctttctt cccttccttt ctcgccacgt tcgccggctt     4140

tccccgtcaa gctctaaatc gggggctccc tttagggttc cgatttagtg ctttacggca     4200

cctcgacccc aaaaaacttg atttgggtga tggttcacgt agtgggccat cgccctgata     4260

gacggttttt cgccctttga cgttggagtc cacgttcttt aatagtggac tcttgttcca     4320

aactggaaca acactcaacc ctatctcggg ctattctttt gatttataag ggattttgcc     4380

gatttcggcc tattggttaa aaaatgagct gatttaacaa aaatttaacg cgaattttaa     4440

caaaatatta acgtttacaa ttttatggtg cactctcagt acaatctgct ctgatgccgc     4500

atagttaagc cagccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct     4560

gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag     4620

gttttcaccg tcatcaccga aacgcgcgag acgaaagggc ctcgtgatac gcctattttt     4680

ataggttaat gtcatgataa taatggtttc ttagacgtca ggtggcactt ttcggggaaa     4740

tgtgcgcgga acccctattt gtttattttt ctaaatacat tcaaatatgt atccgctcat     4800

gagacaataa ccctgataaa tgcttcaata atattgaaaa aggaagagta tgagtattca     4860

acatttccgt gtcgccctta ttcccttttt tgcggcattt tgccttcctg tttttgctca     4920

cccagaaacg ctggtgaaag taaaagatgc tgaagatcag ttgggtgcac gagtgggtta     4980

catcgaactg gatctcaaca gcggtaagat ccttgagagt tttcgccccg aagaacgttt     5040

tccaatgatg agcactttta aagttctgct atgtggcgcg gtattatccc gtattgacgc     5100

cgggcaagag caactcggtc gccgcataca ctattctcag aatgacttgg ttgagtactc     5160

accagtcaca gaaaagcatc ttacggatgg catgacagta agagaattat gcagtgctgc     5220

cataaccatg agtgataaca ctgcggccaa cttacttctg acaacgatcg gaggaccgaa     5280

ggagctaacc gcttttttgc acaacatggg ggatcatgta actcgccttg atcgttggga     5340

accggagctg aatgaagcca taccaaacga cgagcgtgac accacgatgc ctgtagcaat     5400

ggcaacaacg ttgcgcaaac tattaactgg cgaactactt actctagctt cccggcaaca     5460

attaatagac tggatggagg cggataaagt tgcaggacca cttctgcgct cggcccttcc     5520

ggctggctgg tttattgctg ataaatctgg agccggtgag cgtgggtctc gcggtatcat     5580

tgcagcactg gggccagatg gtaagccctc ccgtatcgta gttatctaca cgacggggag     5640

tcaggcaact atggatgaac gaaatagaca gatcgctgag ataggtgcct cactgattaa     5700

gcattggtaa ctgtcagacc aagtttactc atatatactt tagattgatt taaaacttca     5760

tttttaattt aaaaggatct aggtgaagat cctttttgat aatctcatga ccaaaatccc     5820

ttaacgtgag ttttcgttcc actgagcgtc agaccccgta gaaaagatca aaggatcttc     5880

ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa acaaaaaaac caccgctacc     5940

agcggtggtt tgtttgccgg atcaagagct accaactctt tttccgaagg taactggctt     6000

cagcagagcg cagataccaa atactgtcct tctagtgtag ccgtagttag gccaccactt     6060

caagaactct gtagcaccgc ctacatacct cgctctgcta atcctgttac cagtggctgc     6120

tgccagtggc gataagtcgt gtcttaccgg gttggactca agacgatagt taccggataa     6180

ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag cccagcttgg agcgaacgac     6240

ctacaccgaa ctgagatacc tacagcgtga gctatgagaa agcgccacgc ttcccgaagg     6300

gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga acaggagagc gcacgaggga     6360

gcttccaggg ggaaacgcct ggtatcttta tagtcctgtc gggtttcgcc acctctgact     6420

tgagcgtcga tttttgtgat gctcgtcagg ggggcggagc ctatggaaaa acgccagcaa     6480

cgcggccttt ttacggttcc tggccttttg ctggcctttt gctcacatgt                6530


<210>  48
<211>  6898
<212>  DNA
<213>  artificial sequence

<220>
<223>  CN2005 - The portion between L-ITR and R-ITR corresponds to 
       positions 142-4160

<400>  48
cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc       60

gggcgacctt tggtcgcccg gcctcagtga gcgagcgagc gcgcagagag ggagtggcca      120

actccatcac taggggttcc tgcggccgca cgcgtatagg taccctggta gaacttggac      180

tcgccaatgt ggaaggatta tctgttctcc gttcatttcg attgctgcga gttttcaagt      240

tggcaaaatc ttggccaacg ttaaatatgc taataaagat catcggcaat tccgtggggg      300

ctctgggaaa tttaaccctc gtcttggcca tcatcgtctt catttttgcc gtggtcggca      360

tgcagctctt tggtaaaagc tacaaagatt gtgtctgcaa gatcgccagt gattgtcaac      420

tcccacgctg gcacatgaat gacttcttcc actccttcct gattgtgttc cgcgtgctgt      480

gtggggagtg gatagagacc atgtgggact gtatggaggt tgctggtcaa gccatgtgcc      540

ttactgtctt catgatggtc atggtgattg gaaacctagt ggtcctgaat ctctttctgg      600

ccttgcttct gagctcattt agtgcagaca accttgcagc cactgatgat gataatgaaa      660

tgaataatct ccaaattgct gtggatagga tgcacaaagg agtagcttat gtgaaaagaa      720

aaatatatga atttattcaa cagtccttca ttaggaaaca aaagatttta gatgaaatta      780

aaccacttga tgatctaaac aacaagaaag acagttgtat gtccaatcat acagcagaaa      840

ttgggaaaga tcttgactat cttaaagatg taaatggaac tacaagtggt ataggaactg      900

gcagcagtgt tgaatacatt attgatgaaa gtgattacat gtcattcata aacaacccca      960

gtcttactgt gactgtacca attgctgtag gagaatctga ctttgaaaat ttaaacacgg     1020

aagactttag tagtgaatcg gatctggaag aaagcaaaga gaaactgaat gaaagcagta     1080

gctcatcaga aggtagcact gtggacatcg gcgcacctgt agaagaacag cccgtagtgg     1140

aacctgaaga aactcttgaa ccagaagctt gtttcactga aggctgtgta caaagattca     1200

agtgttgtca aatcaatgtg gaagaaggca gaggaaaaca atggtggaac ctgagaagga     1260

cgtgtttccg aatagttgaa cataactggt ttgagacctt cattgttttc atgattctcc     1320

ttagtagtgg tgctctggca tttgaagata tatatattga tcagcgaaag acgattaaga     1380

cgatgttgga atatgctgac aaggttttca cttacatttt cattctggaa atgcttctaa     1440

aatgggtggc atatggctat caaacatatt tcaccaatgc ctggtgttgg ctggacttct     1500

taattgttga tgtttcattg gtcagtttaa cagcaaatgc cttgggttac tcagaacttg     1560

gagccatcaa atctctcagg acactaagag ctctgagacc tctaagagcc ttatctcgat     1620

ttgaagggat gagggtggtt gtgaatgccc ttttaggagc aattccatcc atcatgaatg     1680

tgcttctggt ttgtcttata ttctggctaa ttttcagcat catgggcgta aatttgtttg     1740

ctggcaaatt ctaccactgt attaacacca caactggtga caggtttgac atcgaagacg     1800

tgaataatca tactgattgc ctaaaactaa tagaaagaaa tgagactgct cgatggaaaa     1860

atgtgaaagt aaactttgat aatgtaggat ttgggtatct ctctttgctt caagttgcca     1920

cattcaaagg atggatggat ataatgtatg cagcagttga ttccagaaat gtggaactcc     1980

agcctaagta tgaagaaagt ctgtacatgt atctttactt tgttattttc atcatctttg     2040

ggtccttctt caccttgaac ctgtttattg gtgtcatcat agataatttc aaccagcaga     2100

aaaagaagtt tggaggtcaa gacatcttta tgacagaaga acagaagaaa tactataatg     2160

caatgaaaaa attaggatcg aaaaaaccgc aaaagcctat acctcgacca ggaaacaaat     2220

ttcaaggaat ggtctttgac ttcgtaacca gacaagtttt tgacataagc atcatgattc     2280

tcatctgtct taacatggtc acaatgatgg tggaaacaga tgaccagagt gaatatgtga     2340

ctaccatttt gtcacgcatc aatctggtgt tcattgtgct atttactgga gagtgtgtac     2400

tgaaactcat ctctctacgc cattattatt ttaccattgg atggaatatt tttgattttg     2460

tggttgtcat tctctccatt gtaggtatgt ttcttgccga gctgatagaa aagtatttcg     2520

tgtcccctac cctgttccga gtgatccgtc ttgctaggat tggccgaatc ctacgtctga     2580

tcaaaggagc aaaggggatc cgcacgctgc tctttgcttt gatgatgtcc cttcctgcgt     2640

tgtttaacat cggcctccta ctcttcctag tcatgttcat ctacgccatc tttgggatgt     2700

ccaactttgc ctatgttaag agggaagttg ggatcgatga catgttcaac tttgagacct     2760

ttggcaacag catgatctgc ctattccaaa ttacaacctc tgctggctgg gatggattgc     2820

tagcacccat tctcaacagt aagccacccg actgtgaccc taataaagtt aaccctggaa     2880

gctcagttaa gggagactgt gggaacccat ctgttggaat tttctttttt gtcagttaca     2940

tcatcatatc cttcctggtt gtggtgaaca tgtacatcgc ggtcatcctg gagaacttca     3000

gtgttgctac tgaagaaagt gcagagcctc tgagtgagga tgactttgag atgttctatg     3060

aggtttggga gaagtttgat cccgatgcaa ctcagttcat ggaatttgaa aaattatctc     3120

agtttgcagc tgcgcttgaa ccgcctctca atctgccaca accaaacaaa ctccagctca     3180

ttgccatgga tttgcccatg gtgagtggtg accggatcca ctgtcttgat atcttatttg     3240

cttttacaaa gcgggttcta ggagagagtg gagagatgga tgctctacga atacagatgg     3300

aagagcgatt catggcttcc aatccttcca aggtctccta tcagccaatc actactactt     3360

taaaacgaaa acaagaggaa gtatctgctg tcattattca gcgtgcttac agacgccacc     3420

ttttaaagcg aactgtaaaa caagcttcct ttacgtacaa taaaaacaaa atcaaaggtg     3480

gggctaatct tcttataaaa gaagacatga taattgacag aataaatgaa aactctatta     3540

cagaaaaaac tgatctgacc atgtccactg cagcttgtcc accttcctat gaccgggtga     3600

caaagccaat tgtggaaaaa catgagcaag aaggcaaaga tgaaaaagcc aaagggaaat     3660

aatgacatca taatcaacct ctggattaca aaatttgtga aagattgact ggtattctta     3720

actatgttgc tccttttacg ctatgtggat acgctgcttt aatgcctttg tatcatgcta     3780

ttgcttcccg tatggctttc attttctcct ccttgtataa atcctggtta gttcttgcca     3840

cggcggaact catcgccgcc tgccttgccc gctgctggac aggggctcgg ctgttgggca     3900

ctgacaattc cgtggctcga gagatcttcg actgtgcctt ctagttgcca gccatctgtt     3960

gtttgcccct cccccgtgcc ttccttgacc ctggaaggtg ccactcccac tgtcctttcc     4020

taataaaatg aggaaattgc atcgcattgt ctgagtaggt gtcattctat tctggggggt     4080

ggggtggggc aggacagcaa gggggaggat tgggaagaca atagcaggca tgagatctca     4140

cgtgcggacc gagcggccgc aggaacccct agtgatggag ttggccactc cctctctgcg     4200

cgctcgctcg ctcactgagg ccgggcgacc aaaggtcgcc cgacgcccgg gctttgcccg     4260

ggcggcctca gtgagcgagc gagcgcgcag ctgcctgcag gggcgcctga tgcggtattt     4320

tctccttacg catctgtgcg gtatttcaca ccgcatacgt caaagcaacc atagtacgcg     4380

ccctgtagcg gcgcattaag cgcggcgggt gtggtggtta cgcgcagcgt gaccgctaca     4440

cttgccagcg ccctagcgcc cgctcctttc gctttcttcc cttcctttct cgccacgttc     4500

gccggctttc cccgtcaagc tctaaatcgg gggctccctt tagggttccg atttagtgct     4560

ttacggcacc tcgaccccaa aaaacttgat ttgggtgatg gttcacgtag tgggccatcg     4620

ccctgataga cggtttttcg ccctttgacg ttggagtcca cgttctttaa tagtggactc     4680

ttgttccaaa ctggaacaac actcaaccct atctcgggct attcttttga tttataaggg     4740

attttgccga tttcggccta ttggttaaaa aatgagctga tttaacaaaa atttaacgcg     4800

aattttaaca aaatattaac gtttacaatt ttatggtgca ctctcagtac aatctgctct     4860

gatgccgcat agttaagcca gccccgacac ccgccaacac ccgctgacgc gccctgacgg     4920

gcttgtctgc tcccggcatc cgcttacaga caagctgtga ccgtctccgg gagctgcatg     4980

tgtcagaggt tttcaccgtc atcaccgaaa cgcgcgagac gaaagggcct cgtgatacgc     5040

ctatttttat aggttaatgt catgataata atggtttctt agacgtcagg tggcactttt     5100

cggggaaatg tgcgcggaac ccctatttgt ttatttttct aaatacattc aaatatgtat     5160

ccgctcatga gacaataacc ctgataaatg cttcaataat attgaaaaag gaagagtatg     5220

agtattcaac atttccgtgt cgcccttatt cccttttttg cggcattttg ccttcctgtt     5280

tttgctcacc cagaaacgct ggtgaaagta aaagatgctg aagatcagtt gggtgcacga     5340

gtgggttaca tcgaactgga tctcaacagc ggtaagatcc ttgagagttt tcgccccgaa     5400

gaacgttttc caatgatgag cacttttaaa gttctgctat gtggcgcggt attatcccgt     5460

attgacgccg ggcaagagca actcggtcgc cgcatacact attctcagaa tgacttggtt     5520

gagtactcac cagtcacaga aaagcatctt acggatggca tgacagtaag agaattatgc     5580

agtgctgcca taaccatgag tgataacact gcggccaact tacttctgac aacgatcgga     5640

ggaccgaagg agctaaccgc ttttttgcac aacatggggg atcatgtaac tcgccttgat     5700

cgttgggaac cggagctgaa tgaagccata ccaaacgacg agcgtgacac cacgatgcct     5760

gtagcaatgg caacaacgtt gcgcaaacta ttaactggcg aactacttac tctagcttcc     5820

cggcaacaat taatagactg gatggaggcg gataaagttg caggaccact tctgcgctcg     5880

gcccttccgg ctggctggtt tattgctgat aaatctggag ccggtgagcg tgggtctcgc     5940

ggtatcattg cagcactggg gccagatggt aagccctccc gtatcgtagt tatctacacg     6000

acggggagtc aggcaactat ggatgaacga aatagacaga tcgctgagat aggtgcctca     6060

ctgattaagc attggtaact gtcagaccaa gtttactcat atatacttta gattgattta     6120

aaacttcatt tttaatttaa aaggatctag gtgaagatcc tttttgataa tctcatgacc     6180

aaaatccctt aacgtgagtt ttcgttccac tgagcgtcag accccgtaga aaagatcaaa     6240

ggatcttctt gagatccttt ttttctgcgc gtaatctgct gcttgcaaac aaaaaaacca     6300

ccgctaccag cggtggtttg tttgccggat caagagctac caactctttt tccgaaggta     6360

actggcttca gcagagcgca gataccaaat actgtccttc tagtgtagcc gtagttaggc     6420

caccacttca agaactctgt agcaccgcct acatacctcg ctctgctaat cctgttacca     6480

gtggctgctg ccagtggcga taagtcgtgt cttaccgggt tggactcaag acgatagtta     6540

ccggataagg cgcagcggtc gggctgaacg gggggttcgt gcacacagcc cagcttggag     6600

cgaacgacct acaccgaact gagataccta cagcgtgagc tatgagaaag cgccacgctt     6660

cccgaaggga gaaaggcgga caggtatccg gtaagcggca gggtcggaac aggagagcgc     6720

acgagggagc ttccaggggg aaacgcctgg tatctttata gtcctgtcgg gtttcgccac     6780

ctctgacttg agcgtcgatt tttgtgatgc tcgtcagggg ggcggagcct atggaaaaac     6840

gccagcaacg cggccttttt acggttcctg gccttttgct ggccttttgc tcacatgt       6898


<210>  49
<211>  7528
<212>  DNA
<213>  artificial sequence

<220>
<223>  CN2006 - The portion between L-ITR and R-ITR corresponds to 
       positions 142-4790

<400>  49
cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc       60

gggcgacctt tggtcgcccg gcctcagtga gcgagcgagc gcgcagagag ggagtggcca      120

actccatcac taggggttcc tgcggccgca cgcgtggtac cctaaataaa gatggctttt      180

tagtattaaa agtggaagaa aattacaggt aattatcttt gacggtaaaa acgctgtaat      240

cagcgggcta catgaaaaat tactctaatt atggctgcat ttaagagaat ggctaaataa      300

agatggcttt ttagtattaa aagtggaaga aaattacagg taattatctt tgacggtaaa      360

aacgctgtaa tcagcgggct acatgaaaaa ttactctaat tatggctgca tttaagagaa      420

tggctaaata aagatggctt tttagtatta aaagtggaag aaaattacag gtaattatct      480

ttgacggtaa aaacgctgta atcagcgggc tacatgaaaa attactctaa ttatggctgc      540

atttaagaga atggagctcg ggctggtcga cacaattgga ggtaggcgtg tacggtggga      600

ggcctatata agcagagctc gtttagtgaa ccgtcagatc gcctggagga tccttcgaaa      660

agcttgctac cggtgccacc atggtcagca agggcgagga gctgttcacc ggggtggtgc      720

ccatcctggt cgagctggac ggcgacgtca atggccacaa gttcagcgtg tccggcgagg      780

gcgagggcga tgccacctac ggcaagctga ccctgaagct gatctgcacc accggcaagc      840

tgcccgtgcc ctggcccacc ctcgtgacca ccctgggcta cggcgtgcag tgcttcgccc      900

gctaccccga ccacatgaag cagcacgact tcttcaagtc cgccatgccc gaaggctacg      960

tccaggagcg caccatcttc ttcaaagacg acggcaacta caagacccgc gccgaggtga     1020

agttcgaggg cgacaccctg gtgaaccgca tcgagctgaa gggcatcgac ttcaaggagg     1080

acggcaacat cctggggcac aagctggagt acaactacaa cagccacaac gtctatatca     1140

ccgccgacaa gcagaagaac ggcatcaagg ccaacttcaa gatccgccac aacatcgagg     1200

acggcggcgt gcagctcgcc gaccactacc agcagaacac ccccatcggc gacggccccg     1260

tgctgctgcc cgacaaccac tacctgagct accagtccaa gctgagcaaa gaccccaacg     1320

agaagcgcga tcacatggtc ctgctggagt tcgtgaccgc cgccgggatc actctcggca     1380

tggacgagct gtacaaaggc agcggcgcca ccaacttcag cctgctgaag caggccggcg     1440

acgtggagga gaaccccggc cccggaacta gtggtatgga gcaaacagtg cttgtaccac     1500

caggacctga cagcttcaac ttcttcacca gagaatctct tgcggctatt gaaagacgca     1560

ttgcagaaga aaaggcaaag aatcccaaac cagacaaaaa agatgacgac gaaaatggcc     1620

caaagccaaa tagtgacttg gaagctggaa agaaccttcc atttatttat ggagacattc     1680

ctccagagat ggtgtcagag cccctggagg acctggaccc ctactatatc aataagaaaa     1740

cttttatagt attgaataaa gggaaggcca tcttccggtt cagtgccacc tctgccctgt     1800

acattttaac tcccttcaat cctcttagga aaatagctat taagattttg gtacattcat     1860

tattcagcat gctaattatg tgcactattt tgacaaactg tgtgtttatg acaatgagta     1920

accctcctga ttggacaaag aatgtagaat acaccttcac aggaatatat acttttgaat     1980

cacttataaa aattattgca aggggattct gtttagaaga ttttactttc cttcgggatc     2040

catggaactg gctcgatttc actgtcatta catttgcgta cgtcacagag tttgtggacc     2100

tgggcaatgt ctcggcattg agaacattca gagttctccg agcattgaag acgatttcag     2160

tcattccagg cctgaaaacc attgtgggag ccctgatcca gtctgtgaag aagctctcag     2220

atgtaatgat cctgactgtg ttctgtctga gcgtatttgc tctaattggg ctgcagctgt     2280

tcatgggcaa cctgaggaat aaatgtatac aatggcctcc caccaatgct tccttggagg     2340

aacatagtat agaaaagaat ataactgtga attataatgg tacacttata aatgaaactg     2400

tctttgagtt tgactggaag tcatatattc aagattcaag atatcattat ttcctggagg     2460

gttttttaga tgcactacta tgtggaaata gctctgatgc aggccaatgt ccagagggat     2520

atatgtgtgt gaaagctggt agaaatccca attatggcta cacaagcttt gataccttca     2580

gttgggcttt tttgtccttg tttcgactaa tgactcagga cttctgggaa aatctttatc     2640

aactgacatt acgtgctgct gggaaaacgt acatgatatt ttttgtattg gtcattttct     2700

tgggctcatt ctacctaata aatttgatcc tggctgtggt ggccatggcc tacgaggaac     2760

agaatcaggc caccttggaa gaagcagaac agaaagaggc cgaatttcag cagatgattg     2820

aacagcttaa aaagcaacag gaggcagctc agcaggcagc aacggcaact gcctcagaac     2880

attccagaga gcccagtgca gcaggcaggc tctcagacag ctcatctgaa gcctctaagt     2940

tgagttccaa gagtgctaag gaaagaagaa atcggaggaa gaaaagaaaa cagaaagagc     3000

agtctggtgg ggaagagaaa gatgaggatg aattccaaaa atctgaatct gaggacagca     3060

tcaggaggaa aggttttcgc ttctccattg aagggaaccg attgacatat gaaaagaggt     3120

actcctcccc acaccagtct ttgttgagca tccgtggctc cctattttca ccaaggcgaa     3180

atagcagaac aagccttttc agctttagag ggcgagcaaa ggatgtggga tctgagaacg     3240

acttcgcaga tgatgagcac agcacctttg aggataacga gagccgtaga gattccttgt     3300

ttgtgccccg acgacacgga gagagacgca acagcaacct gagtcagacc agtaggtcat     3360

cccggatgct ggcagtgttt ccagcgaatg ggaagatgca cagcactgtg gattgcaatg     3420

gtgtggtttc cttggttggt ggaccttcag ttcctacatc gcctgttgga cagcttctgc     3480

cagaggtgat aatagataag ccagctactg atgacaatgg aacaaccact gaaactgaaa     3540

tgagaaagag aaggtcaagt tctttccacg tttccatgga ctttctagaa gatccttccc     3600

aaaggcaacg agcaatgagt atagccagca ttctaacaaa tacagtagaa gaacttgaag     3660

aatccaggca gaaatgccca ccctgttggt ataaattttc caacatattc ttaatctggg     3720

actgttctcc atattggtta aaagtgaaac atgttgtcaa cctggttgtg atggacccat     3780

ttgttgacct ggccatcacc atctgtattg tcttaaatac tcttttcatg gccatggagc     3840

actatccaat gacggaccat ttcaataatg tgcttacagt aggaaacttg gttttcactg     3900

ggatctttac agcagaaatg tttctgaaaa ttattgccat ggatccttac tattatttcc     3960

aagaaggctg gaatatcttt gacggtttta ttgtgacgct tagcctggta gaacttggac     4020

tcgccaatgt ggaaggatta tctgttctcc gttcatttcg attgctgcga gttttcaagt     4080

tggcaaaatc ttggccaacg ttaaatatgc taataaagat catcggcaat tccgtggggg     4140

ctctgggaaa tttaaccctc gtcttggcca tcatcgtctt catttttgcc gtggtcgtga     4200

gtttggggac ccttgattgt tctttctttt tcgctattgt aaaattcatg ttatatggag     4260

ggggcaaagt tttcagggtg ttgtttagaa tgggaagatg tcccttgtat caccatggac     4320

cctcatgata attttgtttc tttcactttc tactctgttg acaaccattg tctcctctta     4380

ttttcttttc attttctgta actttttcgt taaactttag cttgcatttg taacgaattt     4440

ttaaattcac ttttgtttat ttgtcagatt gtaagtactt tctctaatca cttttttttc     4500

aaggcaatca gggtatatta tattgtactt cagcacagtt ttagagaaca attgttataa     4560

ttaaatgata aggtagaata tttctgcata taaattctgg ctggcgtgga aatattctta     4620

ttggtagaaa caactacacc ctggtcatca tcctgccttt ctctttatgg ttacaatgat     4680

atacactgtt tgagatgagg ataaaatact ctgagtccaa accgggcccc tctgctaacc     4740

atgttcatgc cttcttctct ttcctactca cgtgcggacc gagcggccgc aggaacccct     4800

agtgatggag ttggccactc cctctctgcg cgctcgctcg ctcactgagg ccgggcgacc     4860

aaaggtcgcc cgacgcccgg gctttgcccg ggcggcctca gtgagcgagc gagcgcgcag     4920

ctgcctgcag gggcgcctga tgcggtattt tctccttacg catctgtgcg gtatttcaca     4980

ccgcatacgt caaagcaacc atagtacgcg ccctgtagcg gcgcattaag cgcggcgggt     5040

gtggtggtta cgcgcagcgt gaccgctaca cttgccagcg ccctagcgcc cgctcctttc     5100

gctttcttcc cttcctttct cgccacgttc gccggctttc cccgtcaagc tctaaatcgg     5160

gggctccctt tagggttccg atttagtgct ttacggcacc tcgaccccaa aaaacttgat     5220

ttgggtgatg gttcacgtag tgggccatcg ccctgataga cggtttttcg ccctttgacg     5280

ttggagtcca cgttctttaa tagtggactc ttgttccaaa ctggaacaac actcaaccct     5340

atctcgggct attcttttga tttataaggg attttgccga tttcggccta ttggttaaaa     5400

aatgagctga tttaacaaaa atttaacgcg aattttaaca aaatattaac gtttacaatt     5460

ttatggtgca ctctcagtac aatctgctct gatgccgcat agttaagcca gccccgacac     5520

ccgccaacac ccgctgacgc gccctgacgg gcttgtctgc tcccggcatc cgcttacaga     5580

caagctgtga ccgtctccgg gagctgcatg tgtcagaggt tttcaccgtc atcaccgaaa     5640

cgcgcgagac gaaagggcct cgtgatacgc ctatttttat aggttaatgt catgataata     5700

atggtttctt agacgtcagg tggcactttt cggggaaatg tgcgcggaac ccctatttgt     5760

ttatttttct aaatacattc aaatatgtat ccgctcatga gacaataacc ctgataaatg     5820

cttcaataat attgaaaaag gaagagtatg agtattcaac atttccgtgt cgcccttatt     5880

cccttttttg cggcattttg ccttcctgtt tttgctcacc cagaaacgct ggtgaaagta     5940

aaagatgctg aagatcagtt gggtgcacga gtgggttaca tcgaactgga tctcaacagc     6000

ggtaagatcc ttgagagttt tcgccccgaa gaacgttttc caatgatgag cacttttaaa     6060

gttctgctat gtggcgcggt attatcccgt attgacgccg ggcaagagca actcggtcgc     6120

cgcatacact attctcagaa tgacttggtt gagtactcac cagtcacaga aaagcatctt     6180

acggatggca tgacagtaag agaattatgc agtgctgcca taaccatgag tgataacact     6240

gcggccaact tacttctgac aacgatcgga ggaccgaagg agctaaccgc ttttttgcac     6300

aacatggggg atcatgtaac tcgccttgat cgttgggaac cggagctgaa tgaagccata     6360

ccaaacgacg agcgtgacac cacgatgcct gtagcaatgg caacaacgtt gcgcaaacta     6420

ttaactggcg aactacttac tctagcttcc cggcaacaat taatagactg gatggaggcg     6480

gataaagttg caggaccact tctgcgctcg gcccttccgg ctggctggtt tattgctgat     6540

aaatctggag ccggtgagcg tgggtctcgc ggtatcattg cagcactggg gccagatggt     6600

aagccctccc gtatcgtagt tatctacacg acggggagtc aggcaactat ggatgaacga     6660

aatagacaga tcgctgagat aggtgcctca ctgattaagc attggtaact gtcagaccaa     6720

gtttactcat atatacttta gattgattta aaacttcatt tttaatttaa aaggatctag     6780

gtgaagatcc tttttgataa tctcatgacc aaaatccctt aacgtgagtt ttcgttccac     6840

tgagcgtcag accccgtaga aaagatcaaa ggatcttctt gagatccttt ttttctgcgc     6900

gtaatctgct gcttgcaaac aaaaaaacca ccgctaccag cggtggtttg tttgccggat     6960

caagagctac caactctttt tccgaaggta actggcttca gcagagcgca gataccaaat     7020

actgtccttc tagtgtagcc gtagttaggc caccacttca agaactctgt agcaccgcct     7080

acatacctcg ctctgctaat cctgttacca gtggctgctg ccagtggcga taagtcgtgt     7140

cttaccgggt tggactcaag acgatagtta ccggataagg cgcagcggtc gggctgaacg     7200

gggggttcgt gcacacagcc cagcttggag cgaacgacct acaccgaact gagataccta     7260

cagcgtgagc tatgagaaag cgccacgctt cccgaaggga gaaaggcgga caggtatccg     7320

gtaagcggca gggtcggaac aggagagcgc acgagggagc ttccaggggg aaacgcctgg     7380

tatctttata gtcctgtcgg gtttcgccac ctctgacttg agcgtcgatt tttgtgatgc     7440

tcgtcagggg ggcggagcct atggaaaaac gccagcaacg cggccttttt acggttcctg     7500

gccttttgct ggccttttgc tcacatgt                                        7528


<210>  50
<211>  7409
<212>  DNA
<213>  artificial sequence

<220>
<223>  CN2007 - The portion between L-ITR and R-ITR corresponds to 
       positions 142-4671

<400>  50
cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc       60

gggcgacctt tggtcgcccg gcctcagtga gcgagcgagc gcgcagagag ggagtggcca      120

actccatcac taggggttcc tgcggccgca gagtttgggg acccttgatt gttctttctt      180

tttcgctatt gtaaaattca tgttatatgg agggggcaaa gttttcaggg tgttgtttag      240

aatgggaaga tgtcccttgt atcaccatgg accctcatga taattttgtt tctttcactt      300

tctactctgt tgacaaccat tgtctcctct tattttcttt tcattttctg taactttttc      360

gttaaacttt agcttgcatt tgtaacgaat ttttaaattc acttttgttt atttgtcaga      420

ttgtaagtac tttctctaat cacttttttt tcaaggcaat cagggtatat tatattgtac      480

ttcagcacag ttttagagaa caattgttat aattaaatga taaggtagaa tatttctgca      540

tataaattct ggctggcgtg gaaatattct tattggtaga aacaactaca ccctggtcat      600

catcctgcct ttctctttat ggttacaatg atatacactg tttgagatga ggataaaata      660

ctctgagtcc aaaccgggcc cctctgctaa ccatgttcat gccttcttct ctttcctaca      720

gggcatgcag ctctttggta aaagctacaa agattgtgtc tgcaagatcg ccagtgattg      780

tcaactccca cgctggcaca tgaatgactt cttccactcc ttcctgattg tgttccgcgt      840

gctgtgtggg gagtggatag agaccatgtg ggactgtatg gaggttgctg gtcaagccat      900

gtgccttact gtcttcatga tggtcatggt gattggaaac ctagtggtcc tgaatctctt      960

tctggccttg cttctgagct catttagtgc agacaacctt gcagccactg atgatgataa     1020

tgaaatgaat aatctccaaa ttgctgtgga taggatgcac aaaggagtag cttatgtgaa     1080

aagaaaaata tatgaattta ttcaacagtc cttcattagg aaacaaaaga ttttagatga     1140

aattaaacca cttgatgatc taaacaacaa gaaagacagt tgtatgtcca atcatacagc     1200

agaaattggg aaagatcttg actatcttaa agatgtaaat ggaactacaa gtggtatagg     1260

aactggcagc agtgttgaat acattattga tgaaagtgat tacatgtcat tcataaacaa     1320

ccccagtctt actgtgactg taccaattgc tgtaggagaa tctgactttg aaaatttaaa     1380

cacggaagac tttagtagtg aatcggatct ggaagaaagc aaagagaaac tgaatgaaag     1440

cagtagctca tcagaaggta gcactgtgga catcggcgca cctgtagaag aacagcccgt     1500

agtggaacct gaagaaactc ttgaaccaga agcttgtttc actgaaggct gtgtacaaag     1560

attcaagtgt tgtcaaatca atgtggaaga aggcagagga aaacaatggt ggaacctgag     1620

aaggacgtgt ttccgaatag ttgaacataa ctggtttgag accttcattg ttttcatgat     1680

tctccttagt agtggtgctc tggcatttga agatatatat attgatcagc gaaagacgat     1740

taagacgatg ttggaatatg ctgacaaggt tttcacttac attttcattc tggaaatgct     1800

tctaaaatgg gtggcatatg gctatcaaac atatttcacc aatgcctggt gttggctgga     1860

cttcttaatt gttgatgttt cattggtcag tttaacagca aatgccttgg gttactcaga     1920

acttggagcc atcaaatctc tcaggacact aagagctctg agacctctaa gagccttatc     1980

tcgatttgaa gggatgaggg tggttgtgaa tgccctttta ggagcaattc catccatcat     2040

gaatgtgctt ctggtttgtc ttatattctg gctaattttc agcatcatgg gcgtaaattt     2100

gtttgctggc aaattctacc actgtattaa caccacaact ggtgacaggt ttgacatcga     2160

agacgtgaat aatcatactg attgcctaaa actaatagaa agaaatgaga ctgctcgatg     2220

gaaaaatgtg aaagtaaact ttgataatgt aggatttggg tatctctctt tgcttcaagt     2280

tgccacattc aaaggatgga tggatataat gtatgcagca gttgattcca gaaatgtgga     2340

actccagcct aagtatgaag aaagtctgta catgtatctt tactttgtta ttttcatcat     2400

ctttgggtcc ttcttcacct tgaacctgtt tattggtgtc atcatagata atttcaacca     2460

gcagaaaaag aagtttggag gtcaagacat ctttatgaca gaagaacaga agaaatacta     2520

taatgcaatg aaaaaattag gatcgaaaaa accgcaaaag cctatacctc gaccaggaaa     2580

caaatttcaa ggaatggtct ttgacttcgt aaccagacaa gtttttgaca taagcatcat     2640

gattctcatc tgtcttaaca tggtcacaat gatggtggaa acagatgacc agagtgaata     2700

tgtgactacc attttgtcac gcatcaatct ggtgttcatt gtgctattta ctggagagtg     2760

tgtactgaaa ctcatctctc tacgccatta ttattttacc attggatgga atatttttga     2820

ttttgtggtt gtcattctct ccattgtagg tatgtttctt gccgagctga tagaaaagta     2880

tttcgtgtcc cctaccctgt tccgagtgat ccgtcttgct aggattggcc gaatcctacg     2940

tctgatcaaa ggagcaaagg ggatccgcac gctgctcttt gctttgatga tgtcccttcc     3000

tgcgttgttt aacatcggcc tcctactctt cctagtcatg ttcatctacg ccatctttgg     3060

gatgtccaac tttgcctatg ttaagaggga agttgggatc gatgacatgt tcaactttga     3120

gacctttggc aacagcatga tctgcctatt ccaaattaca acctctgctg gctgggatgg     3180

attgctagca cccattctca acagtaagcc acccgactgt gaccctaata aagttaaccc     3240

tggaagctca gttaagggag actgtgggaa cccatctgtt ggaattttct tttttgtcag     3300

ttacatcatc atatccttcc tggttgtggt gaacatgtac atcgcggtca tcctggagaa     3360

cttcagtgtt gctactgaag aaagtgcaga gcctctgagt gaggatgact ttgagatgtt     3420

ctatgaggtt tgggagaagt ttgatcccga tgcaactcag ttcatggaat ttgaaaaatt     3480

atctcagttt gcagctgcgc ttgaaccgcc tctcaatctg ccacaaccaa acaaactcca     3540

gctcattgcc atggatttgc ccatggtgag tggtgaccgg atccactgtc ttgatatctt     3600

atttgctttt acaaagcggg ttctaggaga gagtggagag atggatgctc tacgaataca     3660

gatggaagag cgattcatgg cttccaatcc ttccaaggtc tcctatcagc caatcactac     3720

tactttaaaa cgaaaacaag aggaagtatc tgctgtcatt attcagcgtg cttacagacg     3780

ccacctttta aagcgaactg taaaacaagc ttcctttacg tacaataaaa acaaaatcaa     3840

aggtggggct aatcttctta taaaagaaga catgataatt gacagaataa atgaaaactc     3900

tattacagaa aaaactgatc tgaccatgtc cactgcagct tgtccacctt cctatgaccg     3960

ggtgacaaag ccaattgtgg aaaaacatga gcaagaaggc aaagatgaaa aagccaaagg     4020

gaaaggaggt ggtggttcag gtgggggcgg ctcagagtac ccctatgatg tccctgatta     4080

tgcggcggaa tacccctatg acgtgccgga ctacgcggct gaatatccgt atgacgttcc     4140

cgattatgcg gctaagctcg aataatgatg agaattcatc ataatcaacc tctggattac     4200

aaaatttgtg aaagattgac tggtattctt aactatgttg ctccttttac gctatgtgga     4260

tacgctgctt taatgccttt gtatcatgct attgcttccc gtatggcttt cattttctcc     4320

tccttgtata aatcctggtt agttcttgcc acggcggaac tcatcgccgc ctgccttgcc     4380

cgctgctgga caggggctcg gctgttgggc actgacaatt ccgtggctcg agagatcttc     4440

gactgtgcct tctagttgcc agccatctgt tgtttgcccc tcccccgtgc cttccttgac     4500

cctggaaggt gccactccca ctgtcctttc ctaataaaat gaggaaattg catcgcattg     4560

tctgagtagg tgtcattcta ttctgggggg tggggtgggg caggacagca agggggagga     4620

ttgggaagac aatagcaggc atgagatctc acgtgcggac cgagcggccg caggaacccc     4680

tagtgatgga gttggccact ccctctctgc gcgctcgctc gctcactgag gccgggcgac     4740

caaaggtcgc ccgacgcccg ggctttgccc gggcggcctc agtgagcgag cgagcgcgca     4800

gctgcctgca ggggcgcctg atgcggtatt ttctccttac gcatctgtgc ggtatttcac     4860

accgcatacg tcaaagcaac catagtacgc gccctgtagc ggcgcattaa gcgcggcggg     4920

tgtggtggtt acgcgcagcg tgaccgctac acttgccagc gccctagcgc ccgctccttt     4980

cgctttcttc ccttcctttc tcgccacgtt cgccggcttt ccccgtcaag ctctaaatcg     5040

ggggctccct ttagggttcc gatttagtgc tttacggcac ctcgacccca aaaaacttga     5100

tttgggtgat ggttcacgta gtgggccatc gccctgatag acggtttttc gccctttgac     5160

gttggagtcc acgttcttta atagtggact cttgttccaa actggaacaa cactcaaccc     5220

tatctcgggc tattcttttg atttataagg gattttgccg atttcggcct attggttaaa     5280

aaatgagctg atttaacaaa aatttaacgc gaattttaac aaaatattaa cgtttacaat     5340

tttatggtgc actctcagta caatctgctc tgatgccgca tagttaagcc agccccgaca     5400

cccgccaaca cccgctgacg cgccctgacg ggcttgtctg ctcccggcat ccgcttacag     5460

acaagctgtg accgtctccg ggagctgcat gtgtcagagg ttttcaccgt catcaccgaa     5520

acgcgcgaga cgaaagggcc tcgtgatacg cctattttta taggttaatg tcatgataat     5580

aatggtttct tagacgtcag gtggcacttt tcggggaaat gtgcgcggaa cccctatttg     5640

tttatttttc taaatacatt caaatatgta tccgctcatg agacaataac cctgataaat     5700

gcttcaataa tattgaaaaa ggaagagtat gagtattcaa catttccgtg tcgcccttat     5760

tccctttttt gcggcatttt gccttcctgt ttttgctcac ccagaaacgc tggtgaaagt     5820

aaaagatgct gaagatcagt tgggtgcacg agtgggttac atcgaactgg atctcaacag     5880

cggtaagatc cttgagagtt ttcgccccga agaacgtttt ccaatgatga gcacttttaa     5940

agttctgcta tgtggcgcgg tattatcccg tattgacgcc gggcaagagc aactcggtcg     6000

ccgcatacac tattctcaga atgacttggt tgagtactca ccagtcacag aaaagcatct     6060

tacggatggc atgacagtaa gagaattatg cagtgctgcc ataaccatga gtgataacac     6120

tgcggccaac ttacttctga caacgatcgg aggaccgaag gagctaaccg cttttttgca     6180

caacatgggg gatcatgtaa ctcgccttga tcgttgggaa ccggagctga atgaagccat     6240

accaaacgac gagcgtgaca ccacgatgcc tgtagcaatg gcaacaacgt tgcgcaaact     6300

attaactggc gaactactta ctctagcttc ccggcaacaa ttaatagact ggatggaggc     6360

ggataaagtt gcaggaccac ttctgcgctc ggcccttccg gctggctggt ttattgctga     6420

taaatctgga gccggtgagc gtgggtctcg cggtatcatt gcagcactgg ggccagatgg     6480

taagccctcc cgtatcgtag ttatctacac gacggggagt caggcaacta tggatgaacg     6540

aaatagacag atcgctgaga taggtgcctc actgattaag cattggtaac tgtcagacca     6600

agtttactca tatatacttt agattgattt aaaacttcat ttttaattta aaaggatcta     6660

ggtgaagatc ctttttgata atctcatgac caaaatccct taacgtgagt tttcgttcca     6720

ctgagcgtca gaccccgtag aaaagatcaa aggatcttct tgagatcctt tttttctgcg     6780

cgtaatctgc tgcttgcaaa caaaaaaacc accgctacca gcggtggttt gtttgccgga     6840

tcaagagcta ccaactcttt ttccgaaggt aactggcttc agcagagcgc agataccaaa     6900

tactgtcctt ctagtgtagc cgtagttagg ccaccacttc aagaactctg tagcaccgcc     6960

tacatacctc gctctgctaa tcctgttacc agtggctgct gccagtggcg ataagtcgtg     7020

tcttaccggg ttggactcaa gacgatagtt accggataag gcgcagcggt cgggctgaac     7080

ggggggttcg tgcacacagc ccagcttgga gcgaacgacc tacaccgaac tgagatacct     7140

acagcgtgag ctatgagaaa gcgccacgct tcccgaaggg agaaaggcgg acaggtatcc     7200

ggtaagcggc agggtcggaa caggagagcg cacgagggag cttccagggg gaaacgcctg     7260

gtatctttat agtcctgtcg ggtttcgcca cctctgactt gagcgtcgat ttttgtgatg     7320

ctcgtcaggg gggcggagcc tatggaaaaa cgccagcaac gcggcctttt tacggttcct     7380

ggccttttgc tggccttttg ctcacatgt                                       7409


<210>  51
<211>  6733
<212>  DNA
<213>  artificial sequence

<220>
<223>  CN2008 - The portion between L-ITR and R-ITR corresponds to 
       positions 142-3995

<400>  51
cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc       60

gggcgacctt tggtcgcccg gcctcagtga gcgagcgagc gcgcagagag ggagtggcca      120

actccatcac taggggttcc tgcggccgca cgcgtggtac cctaaataaa gatggctttt      180

tagtattaaa agtggaagaa aattacaggt aattatcttt gacggtaaaa acgctgtaat      240

cagcgggcta catgaaaaat tactctaatt atggctgcat ttaagagaat ggctaaataa      300

agatggcttt ttagtattaa aagtggaaga aaattacagg taattatctt tgacggtaaa      360

aacgctgtaa tcagcgggct acatgaaaaa ttactctaat tatggctgca tttaagagaa      420

tggctaaata aagatggctt tttagtatta aaagtggaag aaaattacag gtaattatct      480

ttgacggtaa aaacgctgta atcagcgggc tacatgaaaa attactctaa ttatggctgc      540

atttaagaga atggagctcg ggctggtcga cacaattgga ggtaggcgtg tacggtggga      600

ggcctatata agcagagctc gtttagtgaa ccgtcagatc gcctggagga tccttcgaaa      660

agcttgctac cggtgccacc atggagcaaa cagtgcttgt accaccagga cctgacagct      720

tcaacttctt caccagagaa tctcttgcgg ctattgaaag acgcattgca gaagaaaagg      780

caaagaatcc caaaccagac aaaaaagatg acgacgaaaa tggcccaaag ccaaatagtg      840

acttggaagc tggaaagaac cttccattta tttatggaga cattcctcca gagatggtgt      900

cagagcccct ggaggacctg gacccctact atatcaataa gaaaactttt atagtattga      960

ataaagggaa ggccatcttc cggttcagtg ccacctctgc cctgtacatt ttaactccct     1020

tcaatcctct taggaaaata gctattaaga ttttggtaca ttcattattc agcatgctaa     1080

ttatgtgcac tattttgaca aactgtgtgt ttatgacaat gagtaaccct cctgattgga     1140

caaagaatgt agaatacacc ttcacaggaa tatatacttt tgaatcactt ataaaaatta     1200

ttgcaagggg attctgttta gaagatttta ctttccttcg ggatccatgg aactggctcg     1260

atttcactgt cattacattt gcgtacgtca cagagtttgt ggacctgggc aatgtctcgg     1320

cattgagaac attcagagtt ctccgagcat tgaagacgat ttcagtcatt ccaggcctga     1380

aaaccattgt gggagccctg atccagtctg tgaagaagct ctcagatgta atgatcctga     1440

ctgtgttctg tctgagcgta tttgctctaa ttgggctgca gctgttcatg ggcaacctga     1500

ggaataaatg tatacaatgg cctcccacca atgcttcctt ggaggaacat agtatagaaa     1560

agaatataac tgtgaattat aatggtacac ttataaatga aactgtcttt gagtttgact     1620

ggaagtcata tattcaagat tcaagatatc attatttcct ggagggtttt ttagatgcac     1680

tactatgtgg aaatagctct gatgcaggcc aatgtccaga gggatatatg tgtgtgaaag     1740

ctggtagaaa tcccaattat ggctacacaa gctttgatac cttcagttgg gcttttttgt     1800

ccttgtttcg actaatgact caggacttct gggaaaatct ttatcaactg acattacgtg     1860

ctgctgggaa aacgtacatg atattttttg tattggtcat tttcttgggc tcattctacc     1920

taataaattt gatcctggct gtggtggcca tggcctacga ggaacagaat caggccacct     1980

tggaagaagc agaacagaaa gaggccgaat ttcagcagat gattgaacag cttaaaaagc     2040

aacaggaggc agctcagcag gcagcaacgg caactgcctc agaacattcc agagagccca     2100

gtgcagcagg caggctctca gacagctcat ctgaagcctc taagttgagt tccaagagtg     2160

ctaaggaaag aagaaatcgg aggaagaaaa gaaaacagaa agagcagtct ggtggggaag     2220

agaaagatga ggatgaattc caaaaatctg aatctgagga cagcatcagg aggaaaggtt     2280

ttcgcttctc cattgaaggg aaccgattga catatgaaaa gaggtactcc tccccacacc     2340

agtctttgtt gagcatccgt ggctccctat tttcaccaag gcgaaatagc agaacaagcc     2400

ttttcagctt tagagggcga gcaaaggatg tgggatctga gaacgacttc gcagatgatg     2460

agcacagcac ctttgaggat aacgagagcc gtagagattc cttgtttgtg ccccgacgac     2520

acggagagag acgcaacagc aacctgagtc agaccagtag gtcatcccgg atgctggcag     2580

tgtttccagc gaatgggaag atgcacagca ctgtggattg caatggtgtg gtttccttgg     2640

ttggtggacc ttcagttcct acatcgcctg ttggacagct tctgccagag gtgataatag     2700

ataagccagc tactgatgac aatggaacaa ccactgaaac tgaaatgaga aagagaaggt     2760

caagttcttt ccacgtttcc atggactttc tagaagatcc ttcccaaagg caacgagcaa     2820

tgagtatagc cagcattcta acaaatacag tagaagaact tgaagaatcc aggcagaaat     2880

gcccaccctg ttggtataaa ttttccaaca tattcttaat ctgggactgt tctccatatt     2940

ggttaaaagt gaaacatgtt gtcaacctgg ttgtgatgga cccatttgtt gacctggcca     3000

tcaccatctg tattgtctta aatactcttt tcatggccat ggagcactat ccaatgacgg     3060

accatttcaa taatgtgctt acagtaggaa acttggtttt cactgggatc tttacagcag     3120

aaatgtttct gaaaattatt gccatggatc cttactatta tttccaagaa ggctggaata     3180

tctttgacgg ttttattgtg acgcttagcc tggtagaact tggactcgcc aatgtggaag     3240

gattatctgt tctccgttca tttcgattgc tgcgagtttt caagttggca aaatcttggc     3300

caacgttaaa tatgctaata aagatcatcg gcaattccgt gggggctctg ggaaatttaa     3360

ccctcgtctt ggccatcatc gtcttcattt ttgccgtggt cgtgagtttg gggacccttg     3420

attgttcttt ctttttcgct attgtaaaat tcatgttata tggagggggc aaagttttca     3480

gggtgttgtt tagaatggga agatgtccct tgtatcacca tggaccctca tgataatttt     3540

gtttctttca ctttctactc tgttgacaac cattgtctcc tcttattttc ttttcatttt     3600

ctgtaacttt ttcgttaaac tttagcttgc atttgtaacg aatttttaaa ttcacttttg     3660

tttatttgtc agattgtaag tactttctct aatcactttt ttttcaaggc aatcagggta     3720

tattatattg tacttcagca cagttttaga gaacaattgt tataattaaa tgataaggta     3780

gaatatttct gcatataaat tctggctggc gtggaaatat tcttattggt agaaacaact     3840

acaccctggt catcatcctg cctttctctt tatggttaca atgatataca ctgtttgaga     3900

tgaggataaa atactctgag tccaaaccgg gcccctctgc taaccatgtt catgccttct     3960

tctctttcct actcacgtgc ggaccgagcg gccgcaggaa cccctagtga tggagttggc     4020

cactccctct ctgcgcgctc gctcgctcac tgaggccggg cgaccaaagg tcgcccgacg     4080

cccgggcttt gcccgggcgg cctcagtgag cgagcgagcg cgcagctgcc tgcaggggcg     4140

cctgatgcgg tattttctcc ttacgcatct gtgcggtatt tcacaccgca tacgtcaaag     4200

caaccatagt acgcgccctg tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc     4260

agcgtgaccg ctacacttgc cagcgcccta gcgcccgctc ctttcgcttt cttcccttcc     4320

tttctcgcca cgttcgccgg ctttccccgt caagctctaa atcgggggct ccctttaggg     4380

ttccgattta gtgctttacg gcacctcgac cccaaaaaac ttgatttggg tgatggttca     4440

cgtagtgggc catcgccctg atagacggtt tttcgccctt tgacgttgga gtccacgttc     4500

tttaatagtg gactcttgtt ccaaactgga acaacactca accctatctc gggctattct     4560

tttgatttat aagggatttt gccgatttcg gcctattggt taaaaaatga gctgatttaa     4620

caaaaattta acgcgaattt taacaaaata ttaacgttta caattttatg gtgcactctc     4680

agtacaatct gctctgatgc cgcatagtta agccagcccc gacacccgcc aacacccgct     4740

gacgcgccct gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc     4800

tccgggagct gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gagacgaaag     4860

ggcctcgtga tacgcctatt tttataggtt aatgtcatga taataatggt ttcttagacg     4920

tcaggtggca cttttcgggg aaatgtgcgc ggaaccccta tttgtttatt tttctaaata     4980

cattcaaata tgtatccgct catgagacaa taaccctgat aaatgcttca ataatattga     5040

aaaaggaaga gtatgagtat tcaacatttc cgtgtcgccc ttattccctt ttttgcggca     5100

ttttgccttc ctgtttttgc tcacccagaa acgctggtga aagtaaaaga tgctgaagat     5160

cagttgggtg cacgagtggg ttacatcgaa ctggatctca acagcggtaa gatccttgag     5220

agttttcgcc ccgaagaacg ttttccaatg atgagcactt ttaaagttct gctatgtggc     5280

gcggtattat cccgtattga cgccgggcaa gagcaactcg gtcgccgcat acactattct     5340

cagaatgact tggttgagta ctcaccagtc acagaaaagc atcttacgga tggcatgaca     5400

gtaagagaat tatgcagtgc tgccataacc atgagtgata acactgcggc caacttactt     5460

ctgacaacga tcggaggacc gaaggagcta accgcttttt tgcacaacat gggggatcat     5520

gtaactcgcc ttgatcgttg ggaaccggag ctgaatgaag ccataccaaa cgacgagcgt     5580

gacaccacga tgcctgtagc aatggcaaca acgttgcgca aactattaac tggcgaacta     5640

cttactctag cttcccggca acaattaata gactggatgg aggcggataa agttgcagga     5700

ccacttctgc gctcggccct tccggctggc tggtttattg ctgataaatc tggagccggt     5760

gagcgtgggt ctcgcggtat cattgcagca ctggggccag atggtaagcc ctcccgtatc     5820

gtagttatct acacgacggg gagtcaggca actatggatg aacgaaatag acagatcgct     5880

gagataggtg cctcactgat taagcattgg taactgtcag accaagttta ctcatatata     5940

ctttagattg atttaaaact tcatttttaa tttaaaagga tctaggtgaa gatccttttt     6000

gataatctca tgaccaaaat cccttaacgt gagttttcgt tccactgagc gtcagacccc     6060

gtagaaaaga tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg     6120

caaacaaaaa aaccaccgct accagcggtg gtttgtttgc cggatcaaga gctaccaact     6180

ctttttccga aggtaactgg cttcagcaga gcgcagatac caaatactgt ccttctagtg     6240

tagccgtagt taggccacca cttcaagaac tctgtagcac cgcctacata cctcgctctg     6300

ctaatcctgt taccagtggc tgctgccagt ggcgataagt cgtgtcttac cgggttggac     6360

tcaagacgat agttaccgga taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca     6420

cagcccagct tggagcgaac gacctacacc gaactgagat acctacagcg tgagctatga     6480

gaaagcgcca cgcttcccga agggagaaag gcggacaggt atccggtaag cggcagggtc     6540

ggaacaggag agcgcacgag ggagcttcca gggggaaacg cctggtatct ttatagtcct     6600

gtcgggtttc gccacctctg acttgagcgt cgatttttgt gatgctcgtc aggggggcgg     6660

agcctatgga aaaacgccag caacgcggcc tttttacggt tcctggcctt ttgctggcct     6720

tttgctcaca tgt                                                        6733


<210>  52
<211>  7263
<212>  DNA
<213>  artificial sequence

<220>
<223>  CN2009 - The portion between L-ITR and R-ITR corresponds to 
       positions 142-4525

<400>  52
cctgcaggca gctgcgcgct cgctcgctca ctgaggccgc ccgggcaaag cccgggcgtc       60

gggcgacctt tggtcgcccg gcctcagtga gcgagcgagc gcgcagagag ggagtggcca      120

actccatcac taggggttcc tgcggccgca gagtttgggg acccttgatt gttctttctt      180

tttcgctatt gtaaaattca tgttatatgg agggggcaaa gttttcaggg tgttgtttag      240

aatgggaaga tgtcccttgt atcaccatgg accctcatga taattttgtt tctttcactt      300

tctactctgt tgacaaccat tgtctcctct tattttcttt tcattttctg taactttttc      360

gttaaacttt agcttgcatt tgtaacgaat ttttaaattc acttttgttt atttgtcaga      420

ttgtaagtac tttctctaat cacttttttt tcaaggcaat cagggtatat tatattgtac      480

ttcagcacag ttttagagaa caattgttat aattaaatga taaggtagaa tatttctgca      540

tataaattct ggctggcgtg gaaatattct tattggtaga aacaactaca ccctggtcat      600

catcctgcct ttctctttat ggttacaatg atatacactg tttgagatga ggataaaata      660

ctctgagtcc aaaccgggcc cctctgctaa ccatgttcat gccttcttct ctttcctaca      720

gggcatgcag ctctttggta aaagctacaa agattgtgtc tgcaagatcg ccagtgattg      780

tcaactccca cgctggcaca tgaatgactt cttccactcc ttcctgattg tgttccgcgt      840

gctgtgtggg gagtggatag agaccatgtg ggactgtatg gaggttgctg gtcaagccat      900

gtgccttact gtcttcatga tggtcatggt gattggaaac ctagtggtcc tgaatctctt      960

tctggccttg cttctgagct catttagtgc agacaacctt gcagccactg atgatgataa     1020

tgaaatgaat aatctccaaa ttgctgtgga taggatgcac aaaggagtag cttatgtgaa     1080

aagaaaaata tatgaattta ttcaacagtc cttcattagg aaacaaaaga ttttagatga     1140

aattaaacca cttgatgatc taaacaacaa gaaagacagt tgtatgtcca atcatacagc     1200

agaaattggg aaagatcttg actatcttaa agatgtaaat ggaactacaa gtggtatagg     1260

aactggcagc agtgttgaat acattattga tgaaagtgat tacatgtcat tcataaacaa     1320

ccccagtctt actgtgactg taccaattgc tgtaggagaa tctgactttg aaaatttaaa     1380

cacggaagac tttagtagtg aatcggatct ggaagaaagc aaagagaaac tgaatgaaag     1440

cagtagctca tcagaaggta gcactgtgga catcggcgca cctgtagaag aacagcccgt     1500

agtggaacct gaagaaactc ttgaaccaga agcttgtttc actgaaggct gtgtacaaag     1560

attcaagtgt tgtcaaatca atgtggaaga aggcagagga aaacaatggt ggaacctgag     1620

aaggacgtgt ttccgaatag ttgaacataa ctggtttgag accttcattg ttttcatgat     1680

tctccttagt agtggtgctc tggcatttga agatatatat attgatcagc gaaagacgat     1740

taagacgatg ttggaatatg ctgacaaggt tttcacttac attttcattc tggaaatgct     1800

tctaaaatgg gtggcatatg gctatcaaac atatttcacc aatgcctggt gttggctgga     1860

cttcttaatt gttgatgttt cattggtcag tttaacagca aatgccttgg gttactcaga     1920

acttggagcc atcaaatctc tcaggacact aagagctctg agacctctaa gagccttatc     1980

tcgatttgaa gggatgaggg tggttgtgaa tgccctttta ggagcaattc catccatcat     2040

gaatgtgctt ctggtttgtc ttatattctg gctaattttc agcatcatgg gcgtaaattt     2100

gtttgctggc aaattctacc actgtattaa caccacaact ggtgacaggt ttgacatcga     2160

agacgtgaat aatcatactg attgcctaaa actaatagaa agaaatgaga ctgctcgatg     2220

gaaaaatgtg aaagtaaact ttgataatgt aggatttggg tatctctctt tgcttcaagt     2280

tgccacattc aaaggatgga tggatataat gtatgcagca gttgattcca gaaatgtgga     2340

actccagcct aagtatgaag aaagtctgta catgtatctt tactttgtta ttttcatcat     2400

ctttgggtcc ttcttcacct tgaacctgtt tattggtgtc atcatagata atttcaacca     2460

gcagaaaaag aagtttggag gtcaagacat ctttatgaca gaagaacaga agaaatacta     2520

taatgcaatg aaaaaattag gatcgaaaaa accgcaaaag cctatacctc gaccaggaaa     2580

caaatttcaa ggaatggtct ttgacttcgt aaccagacaa gtttttgaca taagcatcat     2640

gattctcatc tgtcttaaca tggtcacaat gatggtggaa acagatgacc agagtgaata     2700

tgtgactacc attttgtcac gcatcaatct ggtgttcatt gtgctattta ctggagagtg     2760

tgtactgaaa ctcatctctc tacgccatta ttattttacc attggatgga atatttttga     2820

ttttgtggtt gtcattctct ccattgtagg tatgtttctt gccgagctga tagaaaagta     2880

tttcgtgtcc cctaccctgt tccgagtgat ccgtcttgct aggattggcc gaatcctacg     2940

tctgatcaaa ggagcaaagg ggatccgcac gctgctcttt gctttgatga tgtcccttcc     3000

tgcgttgttt aacatcggcc tcctactctt cctagtcatg ttcatctacg ccatctttgg     3060

gatgtccaac tttgcctatg ttaagaggga agttgggatc gatgacatgt tcaactttga     3120

gacctttggc aacagcatga tctgcctatt ccaaattaca acctctgctg gctgggatgg     3180

attgctagca cccattctca acagtaagcc acccgactgt gaccctaata aagttaaccc     3240

tggaagctca gttaagggag actgtgggaa cccatctgtt ggaattttct tttttgtcag     3300

ttacatcatc atatccttcc tggttgtggt gaacatgtac atcgcggtca tcctggagaa     3360

cttcagtgtt gctactgaag aaagtgcaga gcctctgagt gaggatgact ttgagatgtt     3420

ctatgaggtt tgggagaagt ttgatcccga tgcaactcag ttcatggaat ttgaaaaatt     3480

atctcagttt gcagctgcgc ttgaaccgcc tctcaatctg ccacaaccaa acaaactcca     3540

gctcattgcc atggatttgc ccatggtgag tggtgaccgg atccactgtc ttgatatctt     3600

atttgctttt acaaagcggg ttctaggaga gagtggagag atggatgctc tacgaataca     3660

gatggaagag cgattcatgg cttccaatcc ttccaaggtc tcctatcagc caatcactac     3720

tactttaaaa cgaaaacaag aggaagtatc tgctgtcatt attcagcgtg cttacagacg     3780

ccacctttta aagcgaactg taaaacaagc ttcctttacg tacaataaaa acaaaatcaa     3840

aggtggggct aatcttctta taaaagaaga catgataatt gacagaataa atgaaaactc     3900

tattacagaa aaaactgatc tgaccatgtc cactgcagct tgtccacctt cctatgaccg     3960

ggtgacaaag ccaattgtgg aaaaacatga gcaagaaggc aaagatgaaa aagccaaagg     4020

gaaataatga catcataatc aacctctgga ttacaaaatt tgtgaaagat tgactggtat     4080

tcttaactat gttgctcctt ttacgctatg tggatacgct gctttaatgc ctttgtatca     4140

tgctattgct tcccgtatgg ctttcatttt ctcctccttg tataaatcct ggttagttct     4200

tgccacggcg gaactcatcg ccgcctgcct tgcccgctgc tggacagggg ctcggctgtt     4260

gggcactgac aattccgtgg ctcgagagat cttcgactgt gccttctagt tgccagccat     4320

ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact cccactgtcc     4380

tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat tctattctgg     4440

ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc aggcatgaga     4500

tctcacgtgc ggaccgagcg gccgcaggaa cccctagtga tggagttggc cactccctct     4560

ctgcgcgctc gctcgctcac tgaggccggg cgaccaaagg tcgcccgacg cccgggcttt     4620

gcccgggcgg cctcagtgag cgagcgagcg cgcagctgcc tgcaggggcg cctgatgcgg     4680

tattttctcc ttacgcatct gtgcggtatt tcacaccgca tacgtcaaag caaccatagt     4740

acgcgccctg tagcggcgca ttaagcgcgg cgggtgtggt ggttacgcgc agcgtgaccg     4800

ctacacttgc cagcgcccta gcgcccgctc ctttcgcttt cttcccttcc tttctcgcca     4860

cgttcgccgg ctttccccgt caagctctaa atcgggggct ccctttaggg ttccgattta     4920

gtgctttacg gcacctcgac cccaaaaaac ttgatttggg tgatggttca cgtagtgggc     4980

catcgccctg atagacggtt tttcgccctt tgacgttgga gtccacgttc tttaatagtg     5040

gactcttgtt ccaaactgga acaacactca accctatctc gggctattct tttgatttat     5100

aagggatttt gccgatttcg gcctattggt taaaaaatga gctgatttaa caaaaattta     5160

acgcgaattt taacaaaata ttaacgttta caattttatg gtgcactctc agtacaatct     5220

gctctgatgc cgcatagtta agccagcccc gacacccgcc aacacccgct gacgcgccct     5280

gacgggcttg tctgctcccg gcatccgctt acagacaagc tgtgaccgtc tccgggagct     5340

gcatgtgtca gaggttttca ccgtcatcac cgaaacgcgc gagacgaaag ggcctcgtga     5400

tacgcctatt tttataggtt aatgtcatga taataatggt ttcttagacg tcaggtggca     5460

cttttcgggg aaatgtgcgc ggaaccccta tttgtttatt tttctaaata cattcaaata     5520

tgtatccgct catgagacaa taaccctgat aaatgcttca ataatattga aaaaggaaga     5580

gtatgagtat tcaacatttc cgtgtcgccc ttattccctt ttttgcggca ttttgccttc     5640

ctgtttttgc tcacccagaa acgctggtga aagtaaaaga tgctgaagat cagttgggtg     5700

cacgagtggg ttacatcgaa ctggatctca acagcggtaa gatccttgag agttttcgcc     5760

ccgaagaacg ttttccaatg atgagcactt ttaaagttct gctatgtggc gcggtattat     5820

cccgtattga cgccgggcaa gagcaactcg gtcgccgcat acactattct cagaatgact     5880

tggttgagta ctcaccagtc acagaaaagc atcttacgga tggcatgaca gtaagagaat     5940

tatgcagtgc tgccataacc atgagtgata acactgcggc caacttactt ctgacaacga     6000

tcggaggacc gaaggagcta accgcttttt tgcacaacat gggggatcat gtaactcgcc     6060

ttgatcgttg ggaaccggag ctgaatgaag ccataccaaa cgacgagcgt gacaccacga     6120

tgcctgtagc aatggcaaca acgttgcgca aactattaac tggcgaacta cttactctag     6180

cttcccggca acaattaata gactggatgg aggcggataa agttgcagga ccacttctgc     6240

gctcggccct tccggctggc tggtttattg ctgataaatc tggagccggt gagcgtgggt     6300

ctcgcggtat cattgcagca ctggggccag atggtaagcc ctcccgtatc gtagttatct     6360

acacgacggg gagtcaggca actatggatg aacgaaatag acagatcgct gagataggtg     6420

cctcactgat taagcattgg taactgtcag accaagttta ctcatatata ctttagattg     6480

atttaaaact tcatttttaa tttaaaagga tctaggtgaa gatccttttt gataatctca     6540

tgaccaaaat cccttaacgt gagttttcgt tccactgagc gtcagacccc gtagaaaaga     6600

tcaaaggatc ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa     6660

aaccaccgct accagcggtg gtttgtttgc cggatcaaga gctaccaact ctttttccga     6720

aggtaactgg cttcagcaga gcgcagatac caaatactgt ccttctagtg tagccgtagt     6780

taggccacca cttcaagaac tctgtagcac cgcctacata cctcgctctg ctaatcctgt     6840

taccagtggc tgctgccagt ggcgataagt cgtgtcttac cgggttggac tcaagacgat     6900

agttaccgga taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca cagcccagct     6960

tggagcgaac gacctacacc gaactgagat acctacagcg tgagctatga gaaagcgcca     7020

cgcttcccga agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag     7080

agcgcacgag ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc     7140

gccacctctg acttgagcgt cgatttttgt gatgctcgtc aggggggcgg agcctatgga     7200

aaaacgccag caacgcggcc tttttacggt tcctggcctt ttgctggcct tttgctcaca     7260

tgt                                                                   7263


<210>  53
<211>  7
<212>  PRT
<213>  artificial sequence

<220>
<223>  AAV-BR1

<400>  53

Asn Arg Gly Thr Glu Trp Asp 
1               5           


<210>  54
<211>  7
<212>  PRT
<213>  artificial sequence

<220>
<223>  AAV-PHP.S

<400>  54

Gln Ala Val Arg Thr Ser Leu 
1               5           


<210>  55
<211>  7
<212>  PRT
<213>  artificial sequence

<220>
<223>  AAV-PHP.B

<400>  55

Thr Leu Ala Val Pro Phe Lys 
1               5           


<210>  56
<211>  7
<212>  PRT
<213>  artificial sequence

<220>
<223>  AAV-PPS

<400>  56

Asp Ser Pro Ala His Pro Ser 
1               5           


<210>  57
<211>  736
<212>  PRT
<213>  artificial sequence

<220>
<223>  Capsid protein VP1 from Adeno-associated virus 9 (AAV9)

<400>  57

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1               5                   10                  15      


Glu Gly Ile Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Gln Pro 
            20                  25                  30          


Lys Ala Asn Gln Gln His Gln Asp Asn Ala Arg Gly Leu Val Leu Pro 
        35                  40                  45              


Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 
    50                  55                  60                  


Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65                  70                  75                  80  


Gln Gln Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
                85                  90                  95      


Asp Ala Glu Phe Gln Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
            100                 105                 110         


Asn Leu Gly Arg Ala Val Phe Gln Ala Lys Lys Arg Leu Leu Glu Pro 
        115                 120                 125             


Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Arg 
    130                 135                 140                 


Pro Val Glu Gln Ser Pro Gln Glu Pro Asp Ser Ser Ala Gly Ile Gly 
145                 150                 155                 160 


Lys Ser Gly Ala Gln Pro Ala Lys Lys Arg Leu Asn Phe Gly Gln Thr 
                165                 170                 175     


Gly Asp Thr Glu Ser Val Pro Asp Pro Gln Pro Ile Gly Glu Pro Pro 
            180                 185                 190         


Ala Ala Pro Ser Gly Val Gly Ser Leu Thr Met Ala Ser Gly Gly Gly 
        195                 200                 205             


Ala Pro Val Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser Ser 
    210                 215                 220                 


Ser Gly Asn Trp His Cys Asp Ser Gln Trp Leu Gly Asp Arg Val Ile 
225                 230                 235                 240 


Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
                245                 250                 255     


Tyr Lys Gln Ile Ser Asn Ser Thr Ser Gly Gly Ser Ser Asn Asp Asn 
            260                 265                 270         


Ala Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
        275                 280                 285             


Phe His Cys His Phe Ser Pro Arg Asp Trp Gln Arg Leu Ile Asn Asn 
    290                 295                 300                 


Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn Ile 
305                 310                 315                 320 


Gln Val Lys Glu Val Thr Asp Asn Asn Gly Val Lys Thr Ile Ala Asn 
                325                 330                 335     


Asn Leu Thr Ser Thr Val Gln Val Phe Thr Asp Ser Asp Tyr Gln Leu 
            340                 345                 350         


Pro Tyr Val Leu Gly Ser Ala His Glu Gly Cys Leu Pro Pro Phe Pro 
        355                 360                 365             


Ala Asp Val Phe Met Ile Pro Gln Tyr Gly Tyr Leu Thr Leu Asn Asp 
    370                 375                 380                 


Gly Ser Gln Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385                 390                 395                 400 


Pro Ser Gln Met Leu Arg Thr Gly Asn Asn Phe Gln Phe Ser Tyr Glu 
                405                 410                 415     


Phe Glu Asn Val Pro Phe His Ser Ser Tyr Ala His Ser Gln Ser Leu 
            420                 425                 430         


Asp Arg Leu Met Asn Pro Leu Ile Asp Gln Tyr Leu Tyr Tyr Leu Ser 
        435                 440                 445             


Lys Thr Ile Asn Gly Ser Gly Gln Asn Gln Gln Thr Leu Lys Phe Ser 
    450                 455                 460                 


Val Ala Gly Pro Ser Asn Met Ala Val Gln Gly Arg Asn Tyr Ile Pro 
465                 470                 475                 480 


Gly Pro Ser Tyr Arg Gln Gln Arg Val Ser Thr Thr Val Thr Gln Asn 
                485                 490                 495     


Asn Asn Ser Glu Phe Ala Trp Pro Gly Ala Ser Ser Trp Ala Leu Asn 
            500                 505                 510         


Gly Arg Asn Ser Leu Met Asn Pro Gly Pro Ala Met Ala Ser His Lys 
        515                 520                 525             


Glu Gly Glu Asp Arg Phe Phe Pro Leu Ser Gly Ser Leu Ile Phe Gly 
    530                 535                 540                 


Lys Gln Gly Thr Gly Arg Asp Asn Val Asp Ala Asp Lys Val Met Ile 
545                 550                 555                 560 


Thr Asn Glu Glu Glu Ile Lys Thr Thr Asn Pro Val Ala Thr Glu Ser 
                565                 570                 575     


Tyr Gly Gln Val Ala Thr Asn His Gln Ser Ala Gln Ala Gln Ala Gln 
            580                 585                 590         


Thr Gly Trp Val Gln Asn Gln Gly Ile Leu Pro Gly Met Val Trp Gln 
        595                 600                 605             


Asp Arg Asp Val Tyr Leu Gln Gly Pro Ile Trp Ala Lys Ile Pro His 
    610                 615                 620                 


Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Met 
625                 630                 635                 640 


Lys His Pro Pro Pro Gln Ile Leu Ile Lys Asn Thr Pro Val Pro Ala 
                645                 650                 655     


Asp Pro Pro Thr Ala Phe Asn Lys Asp Lys Leu Asn Ser Phe Ile Thr 
            660                 665                 670         


Gln Tyr Ser Thr Gly Gln Val Ser Val Glu Ile Glu Trp Glu Leu Gln 
        675                 680                 685             


Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Ile Gln Tyr Thr Ser Asn 
    690                 695                 700                 


Tyr Tyr Lys Ser Asn Asn Val Glu Phe Ala Val Asn Thr Glu Gly Val 
705                 710                 715                 720 


Tyr Ser Glu Pro Arg Pro Ile Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
                725                 730                 735     


