                         SEQUENCE LISTING

<110>  GeoVax Labs, Inc.
       Duke University
 
<120>  RECOMBINANT MVA-BASED HIV IMMUNOGENS AND USES THEREOF

<130>  GEO-1200

<150>  62/487,939
<151>  2017-04-20

<160>  16    

<170>  PatentIn version 3.5

<210>  1
<211>  4794
<212>  DNA
<213>  Human immunodeficiency virus type 1

<400>  1
gaattccctg ggacatacgt atatttctat gatctgtctt atatgaagtc tatacagcga       60

atagattcag aatttctaca taattatata ttgtacgcta ataagtttaa tctaacactc      120

cccgaagatt tgtttataat ccctacaaat ttggatattc tatggcgtac aaaggaatat      180

atagactcgt tcgatattag tacagaaaca tggaataaat tattatccaa ttattatatg      240

aagatgatag agtatgctaa actttatgta ctaagtccta ttctcgctga ggagttggat      300

aattttgaga ggacgggaga attaactagt attgtacaag aagccatttt atctctaaat      360

ttacgaatta agattttaaa ttttaaacat aaagatgatg atacgtatat acacttttgt      420

aaaatattat tcggtgtcta taacggaaca aacgctacta tatattatca tagacctcta      480

acgggatata tgaatatgat ttcagatact atatttgttc ctgtagataa taactaaggc      540

gcgcctttca ttttgttttt ttctatgcta taaatggtga gcaagggcga ggagctgttc      600

accggggtgg tgcccatcct ggtcgagctg gacggcgacg taaacggcca caagttcagc      660

gtgtccggcg agggcgaggg cgatgccacc tacggcaagc tgaccctgaa gttcatctgc      720

accaccggca agctgcccgt gccctggccc accctcgtga ccaccctgac ctacggcgtg      780

cagtgcttca gccgctaccc cgaccacatg aagcagcacg acttcttcaa gtccgccatg      840

cccgaaggct acgtccagga gcgcaccatc ttcttcaagg acgacggcaa ctacaagacc      900

cgcgccgagg tgaagttcga gggcgacacc ctggtgaacc gcatcgagct gaagggcatc      960

gacttcaagg aggacggcaa catcctgggg cacaagctgg agtacaacta caacagccac     1020

aacgtctata tcatggccga caagcagaag aacggcatca aggtgaactt caagatccgc     1080

cacaacatcg aggacggcag cgtgcagctc gccgaccact accagcagaa cacccccatc     1140

ggcgacggcc ccgtgctgct gcccgacaac cactacctga gcacccagtc cgccctgagc     1200

aaagacccca acgagaagcg cgatcacatg gtcctgctgg agttcgtgac cgccgccggg     1260

atcactctcg gcatgcacga gctgtacaag taagagctcg aggacgggag aattaactag     1320

tattgtacaa gaagccattt tatctctaaa tttacgaatt aagattttaa attttaaaca     1380

taaagatgat gatacgtata tacacttttg taaaatatta ttcggtgtct ataacggaac     1440

aaacgctact atatattatc atagacctct aacgggatat atgaatatga tttcagatac     1500

tatatttgtt cctgtagata ataactaact cgaggccgct ggtacccaac ctaaaaattg     1560

aaaataaata caaaggttct tgagggttgt gttaaattga aagcgagaaa taatcataaa     1620

taagcccggg atgctcgagc ggccgcacca tgagagtaat gggaattcaa agaaattatc     1680

cacaatggtg gatttggtct atgctaggat tttggatgct aatgatatgt aatggaatgt     1740

gggttacagt atattatggt gtaccagtat ggaaagaagc gaaaacaaca ctattttgtg     1800

cgtctgatgc gaaagcgtat gaaaaagaag ttcataatgt ttgggcgaca catgcgtgtg     1860

taccaacaga tccaaatcct caagaaatgg tattgaaaaa tgtaacagaa aactttaata     1920

tgtggaaaaa tgatatggta gatcaaatgc atgaagatgt aatttctcta tgggatcaat     1980

ctctaaaacc atgtgtaaaa ctaacaccac tatgtgtaac attgaattgt acaaatgcga     2040

cagcgtctaa ttcttctatt attgaaggaa tgaaaaattg ttcttttaat attacaacag     2100

aactaagaga taaaagagaa aaaaaaaatg ctttgtttta taaactagat atagtacaac     2160

tagatggaaa ttcttctcaa tatagattga ttaattgtaa tacttctgta attactcaag     2220

cgtgtccaaa agtatctttt gatccaattc caattcatta ttgtgcgcca gcgggatatg     2280

cgattttgaa atgtaataat aaaactttta ctggaacagg accttgtaat aatgtatcta     2340

cagtacaatg tactcatggt attaaaccag tagtatctac acaactacta ttgaatggat     2400

ctctagcgga aggagaaatt attattagat ctgaaaatat tacaaataat gtaaaaacaa     2460

ttatagtaca tctaaatgaa tctgtaaaaa ttgaatgtac tagaccaaat aataaaacaa     2520

gaacatctat tagaattgga ccaggacaag cgttttatgc gacaggacaa gtaattggag     2580

atattagaga agcgtattgt aatattaatg aatctaaatg gaatgaaaca ctacaaagag     2640

tatctaaaaa actaaaagaa tattttcctc ataaaaatat tacttttcaa ccatcttctg     2700

gtggagatct agaaattact actcatagtt ttaattgtgg tggagaattc ttttattgta     2760

atacttcatc tttgtttaat agaacatata tggcgaattc tacagatatg gctaatagta     2820

cagaaacaaa ttctacaaga actattacta ttcattgtag aattaaacaa attataaata     2880

tgtggcaaga agttggtaga gctatgtatg cgccaccaat tgcgggaaat attacatgta     2940

tatctaatat tactggacta ctactaacaa gagatggtgg aaaaaataat actgaaactt     3000

ttagaccagg tggtggaaat atgaaagata attggagaag tgaattgtat aaatataaag     3060

tagtagaagt aaaaccattg ggagtagcgc caactaatgc gagaagaaga gtagttgaaa     3120

gagaaaaaag agctgtagga atgggagcgg tatttttggg atttctaggt gctgcgggat     3180

ctacaatggg agctgcgagt attacactaa cagtacaagc gagacaacta ttgtctggaa     3240

tagttcaaca acaatctaat ctattgaaag cgattgaagc gcaacaacat atgctaaaac     3300

taactgtatg gggtattaaa caattgcaag cgagagtact agcgttggaa agatatttga     3360

aagatcaaca attgctagga atgtggggat gttctggaaa actaatatgt actacaaatg     3420

tatattggaa ttcttcttgg tcaaataaaa catatggaga tatatgggat aatatgacat     3480

ggatgcaatg ggaaagagaa atatctaatt atacagaaat tatatatgaa ctactagaag     3540

aatctcaaaa tcaacaagaa aaaaatgaac aagatctatt ggcgctagat agatggaata     3600

gtctatggaa ttggtttaat attactaatt ggctatggta tataaaaatt ttcattatga     3660

tagttggagg actaattgga ctaagaatta tttttgcggt attgtctcta gtaaatagag     3720

ttagacaagg atattctcca ctatctctac aaacactaat tccatctcca agaggaccag     3780

atagacctgg tggaattgaa gaagaaggtg gagaacaaga tagaaattaa tttttatgtc     3840

gacctgcagt caaactctaa tgaccacatc tttttttaga gatgaaaaat tttccacatc     3900

tccttttgta gacacgacta aacattttgc agaaaaaagt ttattagtgt ttagataatc     3960

gtatacttca tcagtgtaga tagtaaatgt gaacagataa aaggtattct tgctcaatag     4020

attggtaaat tccatagaat atattaatcc tttcttcttg agatcccaca tcatttcaac     4080

cagagacgtt ttatccaatg atttacctcg tactatacca catacaaaac tagattttgc     4140

agtgacgtcg tatctggtat tcctaccaaa caaaatttta cttttagttc ttttagaaaa     4200

ttctaaggta gaatctctat ttgccaatat gtcatctatg gaattaccac tagcaaaaaa     4260

tgatagaaat atatattgat acatcgcagc tggttttgat ctactatact ttaaaaacga     4320

atcagattcc ataattgcct gtatatcatc agctgaaaaa ctatgtttta cacgtattcc     4380

ttcggcattt ctttttaatg atatatcttg tttagacaat gataaagtta tcatgtccat     4440

gagagacgcg tctccgtatc gtataaatat ttcattagat gttagacgct tcattagggg     4500

tatacttcta taaggtttct taatcagtcc atcattggtt gcgtcaagaa caagcttgtc     4560

tccctatagt gagtcgtatt agagcttggc gtaatcatgg tcatagctgt ttcctgtgtg     4620

aaattgttat ccgctcacaa ttccacacaa catacgagcc ggaagcataa agtgtaaagc     4680

ctggggtgcc taatgagtga gctaactcac attaattgcg ttgcgctcac tgcccgcttt     4740

cgagtcggga aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg cggg           4794


<210>  2
<211>  4646
<212>  DNA
<213>  Human immunodeficiency virus type-1

<400>  2
gagaggcggt ttgcgtattg ggcgctcttc cgcttcctcg ctcactgact cgctgcgctc       60

ggtcgttcgg ctgcggcgag cggtatcagc tcactcaaag gcggtaatac ggttatccac      120

agaatcaggg gataacgcag gaaagaacat gtgagcaaaa ggccagcaaa aggccaggaa      180

ccgtaaaaag gccgcgttgc tggcgttttt cgataggctc cgcccccctg acgagcatca      240

caaaaatcga cgctcaagtc agaggtggcg aaacccgaca ggactataaa gataccaggc      300

gtttccccct ggaagctccc tcgtgcgctc tcctgttccg accctgccgc ttaccggata      360

cctgtccgcc tttctccctt cgggaagcgt ggcgctttct catagctcac gctgtaggta      420

tctcagttcg gtgtaggtcg ttcgctccaa gctgggctgt gtgcacgaac cccccgttca      480

gcccgaccgc tgcgccttat ccggtaacta tcgtcttgag tccaacccgg taagacacga      540

cttatcgcca ctggcagcag ccactggtaa caggattagc agagcgaggt atgtaggcgg      600

tgctacagag ttcttgaagt ggtggcctaa ctacggctac actagaagga cagtatttgg      660

tatctgcgct ctgctgaagc cagttacctt cggaaaaaga gttggtagct cttgatccgg      720

caaacaaacc accgctggta gcggtggttt ttttgtttgc aagcagcaga ttacgcgcag      780

aaaaaaagga tctcaagaag atcctttgat cttttctacg gggtctgacg ctcagtggaa      840

cgaaaactca cgttaaggga ttttggtcat gagattatca aaaaggatct tcacctagat      900

ccttttaaat taaaaatgaa gttttaaatc aatctaaagt atatatgagt aaacttggtc      960

tgacagttac caatgcttaa tcagtgaggc acctatctca gcgatctgtc tatttcgttc     1020

atccatagtt gcctgactcc ccgtcgtgta gataactacg atacgggagg gcttaccatc     1080

tggccccagt gctgcaatga taccgcgaga cccacgctca ccggctccag atttatcagc     1140

aataaaccag ccagccggaa gggccgagcg cagaagtggt cctgcaactt tatccgcctc     1200

catccagtct attaattgtt gccgggaagc tagagtaagt agttcgccag ttaatagttt     1260

gcgcaacgtt gttggcattg ctacaggcat cgtggtgtca cgctcgtcgt ttggtatggc     1320

ttcattcagc tccggttccc aacgatcaag gcgagttaca tgatccccca tgttgtgcaa     1380

aaaagcggtt agctccttcg gtcctccgat cgttgtcaga agtaagttgg ccgcagtgtt     1440

atcactcatg gttatggcag cactgcataa ttctcttact gtcatgccat ccgtaagatg     1500

cttttctgtg actggtgagt actcaaccaa gtcattctga gaatagtgta tgcggcgacc     1560

gagttgctct tgcccggcgt caatacggga taataccgcg ccacatagca gaactttaaa     1620

agtgctcatc attggaaaac gttcttcggg gcgaaaactc tcaaggatct taccgctgtt     1680

gagatccagt tcgatgtaac ccactcgtgc acccaactga tcttcagcat cttttacttt     1740

caccagcgtt tctgggtgag caaaaacagg aaggcaaaat gccgcaaaaa agggaataag     1800

ggcgacacgg aaatgttgaa tactcatact cttccttttt caatattatt gaagcattta     1860

tcagggttat tgtctcatga gcggatacat atttgaatgt atttagaaaa ataaacaaat     1920

aggggttccg cgcacatttc cccgaaaagt gccacctgac gtctaagaaa ccattattat     1980

catgacatta acctataaaa ataggcgtat cacgaggccc tttcgtctcg cgcgtttcgg     2040

tgatgacggt gaaaacctct gacacatgca gctcccggag acggtcacag cttgtctgta     2100

agcggatgcc gggagcagac aagcccgtca gggcgcgtca gcgggtgttg gcgggtgtcg     2160

gggctggctt aactatgcgg catcagagca gattgtactg agagtgcacc atatgcggtg     2220

tgaaataccg cacagatgcg taaggagaaa ataccgcatc aggcgccatt cgccattcag     2280

gctgcgcaac tgttgggaag ggcgatcggt gcgggcctct tcgctattac gccagctggc     2340

gaaaggggga tgtgctgcaa ggcgattaag ttgggtaacg ccagggtttt cccagtcacg     2400

acgttgtaaa acgacggcca gtgaattgga tttaggtgac actataatgc tcgagcggcc     2460

gcaccatgag agtaatggga attcaaagaa attatccaca atggtggatt tggtctatgc     2520

taggattttg gatgctaatg atatgtaatg gaatgtgggt tacagtatat tatggtgtac     2580

cagtatggaa agaagcgaaa acaacactat tttgtgcgtc tgatgcgaaa gcgtatgaaa     2640

aagaagttca taatgtttgg gcgacacatg cgtgtgtacc aacagatcca aatcctcaag     2700

aaatggtatt gaaaaatgta acagaaaact ttaatatgtg gaaaaatgat atggtagatc     2760

aaatgcatga agatgtaatt tctctatggg atcaatctct aaaaccatgt gtaaaactaa     2820

caccactatg tgtaacattg aattgtacaa atgcgacagc gtctaattct tctattattg     2880

aaggaatgaa aaattgttct tttaatatta caacagaact aagagataaa agagaaaaaa     2940

aaaatgcttt gttttataaa ctagatatag tacaactaga tggaaattct tctcaatata     3000

gattgattaa ttgtaatact tctgtaatta ctcaagcgtg tccaaaagta tcttttgatc     3060

caattccaat tcattattgt gcgccagcgg gatatgcgat tttgaaatgt aataataaaa     3120

cttttactgg aacaggacct tgtaataatg tatctacagt acaatgtact catggtatta     3180

aaccagtagt atctacacaa ctactattga atggatctct agcggaagga gaaattatta     3240

ttagatctga aaatattaca aataatgtaa aaacaattat agtacatcta aatgaatctg     3300

taaaaattga atgtactaga ccaaataata aaacaagaac atctattaga attggaccag     3360

gacaagcgtt ttatgcgaca ggacaagtaa ttggagatat tagagaagcg tattgtaata     3420

ttaatgaatc taaatggaat gaaacactac aaagagtatc taaaaaacta aaagaatatt     3480

ttcctcataa aaatattact tttcaaccat cttctggtgg agatctagaa attactactc     3540

atagttttaa ttgtggtgga gaattctttt attgtaatac ttcatctttg tttaatagaa     3600

catatatggc gaattctaca gatatggcta atagtacaga aacaaattct acaagaacta     3660

ttactattca ttgtagaatt aaacaaatta taaatatgtg gcaagaagtt ggtagagcta     3720

tgtatgcgcc accaattgcg ggaaatatta catgtatatc taatattact ggactactac     3780

taacaagaga tggtggaaaa aataatactg aaacttttag accaggtggt ggaaatatga     3840

aagataattg gagaagtgaa ttgtataaat ataaagtagt agaagtaaaa ccattgggag     3900

tagcgccaac taatgcgaga agaagagtag ttgaaagaga aaaaagagct gtaggaatgg     3960

gagcggtatt tttgggattt ctaggtgctg cgggatctac aatgggagct gcgagtatta     4020

cactaacagt acaagcgaga caactattgt ctggaatagt tcaacaacaa tctaatctat     4080

tgaaagcgat tgaagcgcaa caacatatgc taaaactaac tgtatggggt attaaacaat     4140

tgcaagcgag agtactagcg ttggaaagat atttgaaaga tcaacaattg ctaggaatgt     4200

ggggatgttc tggaaaacta atatgtacta caaatgtata ttggaattct tcttggtcaa     4260

ataaaacata tggagatata tgggataata tgacatggat gcaatgggaa agagaaatat     4320

ctaattatac agaaattata tatgaactac tagaagaatc tcaaaatcaa caagaaaaaa     4380

atgaacaaga tctattggcg ctagatagat ggaatagtct atggaattgg tttaatatta     4440

ctaattggct atggtatata aaaattttca ttatgatagt tggaggacta attggactaa     4500

gaattatttt tgcggtattg tctctagtaa atagagttag acaaggatat tctccactat     4560

ctctacaaac actaattcca tctccaagag gaccagatag acctggtgga attgaagaag     4620

aaggtggaga acaagataga aattaa                                          4646


<210>  3
<211>  726
<212>  PRT
<213>  Human immunodeficiency virus type-1

<400>  3

Met Arg Val Met Gly Ile Gln Arg Asn Tyr Pro Gln Trp Trp Ile Trp 
1               5                   10                  15      


Ser Met Leu Gly Phe Trp Met Leu Met Ile Cys Asn Gly Met Trp Val 
            20                  25                  30          


Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Lys Thr Thr Leu 
        35                  40                  45              


Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Lys Glu Val His Asn Val 
    50                  55                  60                  


Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Met 
65                  70                  75                  80  


Val Leu Lys Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asp Met 
                85                  90                  95      


Val Asp Gln Met His Glu Asp Val Ile Ser Leu Trp Asp Gln Ser Leu 
            100                 105                 110         


Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys Thr 
        115                 120                 125             


Asn Ala Thr Ala Ser Asn Ser Ser Ile Ile Glu Gly Met Lys Asn Cys 
    130                 135                 140                 


Ser Phe Asn Ile Thr Thr Glu Leu Arg Asp Lys Arg Glu Lys Lys Asn 
145                 150                 155                 160 


Ala Leu Phe Tyr Lys Leu Asp Ile Val Gln Leu Asp Gly Asn Ser Ser 
                165                 170                 175     


Gln Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile Thr Gln Ala Cys 
            180                 185                 190         


Pro Lys Val Ser Phe Asp Pro Ile Pro Ile His Tyr Cys Ala Pro Ala 
        195                 200                 205             


Gly Tyr Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Thr Gly Thr Gly 
    210                 215                 220                 


Pro Cys Asn Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro 
225                 230                 235                 240 


Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Gly Glu 
                245                 250                 255     


Ile Ile Ile Arg Ser Glu Asn Ile Thr Asn Asn Val Lys Thr Ile Ile 
            260                 265                 270         


Val His Leu Asn Glu Ser Val Lys Ile Glu Cys Thr Arg Pro Asn Asn 
        275                 280                 285             


Lys Thr Arg Thr Ser Ile Arg Ile Gly Pro Gly Gln Ala Phe Tyr Ala 
    290                 295                 300                 


Thr Gly Gln Val Ile Gly Asp Ile Arg Glu Ala Tyr Cys Asn Ile Asn 
305                 310                 315                 320 


Glu Ser Lys Trp Asn Glu Thr Leu Gln Arg Val Ser Lys Lys Leu Lys 
                325                 330                 335     


Glu Tyr Phe Pro His Lys Asn Ile Thr Phe Gln Pro Ser Ser Gly Gly 
            340                 345                 350         


Asp Leu Glu Ile Thr Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe 
        355                 360                 365             


Tyr Cys Asn Thr Ser Ser Leu Phe Asn Arg Thr Tyr Met Ala Asn Ser 
    370                 375                 380                 


Thr Asp Met Ala Asn Ser Thr Glu Thr Asn Ser Thr Arg Thr Ile Thr 
385                 390                 395                 400 


Ile His Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly 
                405                 410                 415     


Arg Ala Met Tyr Ala Pro Pro Ile Ala Gly Asn Ile Thr Cys Ile Ser 
            420                 425                 430         


Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Lys Asn Asn Thr 
        435                 440                 445             


Glu Thr Phe Arg Pro Gly Gly Gly Asn Met Lys Asp Asn Trp Arg Ser 
    450                 455                 460                 


Glu Leu Tyr Lys Tyr Lys Val Val Glu Val Lys Pro Leu Gly Val Ala 
465                 470                 475                 480 


Pro Thr Asn Ala Arg Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val 
                485                 490                 495     


Gly Met Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
            500                 505                 510         


Met Gly Ala Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln Leu Leu 
        515                 520                 525             


Ser Gly Ile Val Gln Gln Gln Ser Asn Leu Leu Lys Ala Ile Glu Ala 
    530                 535                 540                 


Gln Gln His Met Leu Lys Leu Thr Val Trp Gly Ile Lys Gln Leu Gln 
545                 550                 555                 560 


Ala Arg Val Leu Ala Leu Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu 
                565                 570                 575     


Gly Met Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Asn Val Tyr 
            580                 585                 590         


Trp Asn Ser Ser Trp Ser Asn Lys Thr Tyr Gly Asp Ile Trp Asp Asn 
        595                 600                 605             


Met Thr Trp Met Gln Trp Glu Arg Glu Ile Ser Asn Tyr Thr Glu Ile 
    610                 615                 620                 


Ile Tyr Glu Leu Leu Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu 
625                 630                 635                 640 


Gln Asp Leu Leu Ala Leu Asp Arg Trp Asn Ser Leu Trp Asn Trp Phe 
                645                 650                 655     


Asn Ile Thr Asn Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile Val 
            660                 665                 670         


Gly Gly Leu Ile Gly Leu Arg Ile Ile Phe Ala Val Leu Ser Leu Val 
        675                 680                 685             


Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Leu Gln Thr Leu Ile 
    690                 695                 700                 


Pro Ser Pro Arg Gly Pro Asp Arg Pro Gly Gly Ile Glu Glu Glu Gly 
705                 710                 715                 720 


Gly Glu Gln Asp Arg Asn 
                725     


<210>  4
<211>  2239
<212>  DNA
<213>  Human immunodeficiency virus type-1

<400>  4
atgctcgagc ggccgcacca tgagagtaat gggaattcaa agaaattatc cacaatggtg       60

gatttggtct atgctaggat tttggatgct aatgatatgt aatggaatgt gggttacagt      120

atattatggt gtaccagtat ggaaagaagc gaaaacaaca ctattttgtg cgtctgatgc      180

gaaagcgtat gaaaaagaag ttcataatgt ttgggcgaca catgcgtgtg taccaacaga      240

tccaaatcct caagaaatgg tattgaaaaa tgtaacagaa aactttaata tgtggaaaaa      300

tgatatggta gatcaaatgc atgaagatgt aatttctcta tgggatcaat ctctaaaacc      360

atgtgtaaaa ctaacaccac tatgtgtaac attgaattgt acaaatgcga atgcgacagc      420

gtctaattct tctattattg aaggaatgaa ttctagtata attgaaggaa tgaaaaattg      480

ttcttttaat attacaacag aactaagaga taaaagagaa aaaaaaaatg ctttgtttta      540

taaactagat atagtacaac tagatggaaa ttcttctcaa tatagattga ttaattgtaa      600

tacttctgta attactcaag cgtgtccaaa agtatctttt gatccaattc caattcatta      660

ttgtgcgcca gcgggatatg cgattttgaa atgtaataat aaaactttta atggaacagg      720

accttgtaat aatgtatcta cagtacaatg tactcatggt attaaaccag tagtatctac      780

acaactacta ttgaatggat ctctagcgga aggagaaatt attattagat ctgaaaatat      840

tacagataat ggaaaaacaa ttatagtaca tctaaatgaa tctgtaaaaa ttgaatgtac      900

tagaccaagt aataatacaa gaacatctat tagaattgga ccaggacaag cgttttatgc      960

gacaggacaa gtaattggag atattagaga agcgcattgt aatattagtg aatctaaatg     1020

gaatgaaaca ctacaaagag tatcagaaaa attgaaagaa tattttcctc ataaaaatat     1080

tacttttcaa ccatcttctg gtggagatct agaaattact actcatagtt ttaattgtgg     1140

tggagaattc ttttattgta atacttcatc tttgtttaat agaacatata tggcgacatc     1200

tacagatatg gcgaattcta ctgaaacaaa ttctacaaga attattacta ttagatgtag     1260

aattaaacaa attataaata tgtggcaaga agttggtaga gctatgtatg cgccaccaat     1320

tgcgggaaat attacatgta tttcaaatat tactggacta ctactaacaa gagatggtgg     1380

aaaaaataat actgaaacat ttgaaacttt tagaccaggt ggtggaaata tgaaagataa     1440

ttggagaagt gaattgtata aatataaagt agtagaagta aaaccattgg gagtagcgcc     1500

aactaatgcg agaagaagag tagttgaaag agaaaaaaga gctgtaggaa tgggagcggt     1560

atttttggga tttctaggtg ctgcgggatc tacaatggga gctgcgagta ttacactaac     1620

agtacaagcg agacaactat tgtctggaat agttcaacaa caatctaatc tattgaaagc     1680

gattgaagcg caacaacata tgctaaaact aactgtatgg ggtattaaac aattgcaagc     1740

gagagtacta gcgttggaaa gatatctaaa agatcaacaa ttgctaggaa tgtggggatg     1800

ttctggaaaa ctaatatgta ctacaaatgt atattggaat tcttcttggt ctaataaaac     1860

atatggagat atatgggata atatgacatg gatgcaatgg gaaagagaaa tatctaatta     1920

tacagaaatt atatatgaac tactagaaga aagtcaaaat caacaagaaa aaaatgaaca     1980

agatctattg gcgctagata gatggaatag tctatggaat tggtttaata ttactaattg     2040

gctatggtat ataaaaattt tcattatgat agttggagga ctaattggat tgagaattat     2100

ttttgcggta ttgtctctag taaatagagt tagacaagga tattctccac tatctctaca     2160

aacactaatt ccatctccaa gaggaccaga tagacctggt ggaattgaag aagaaggtgg     2220

agaacaagat agaaattaa                                                  2239


<210>  5
<211>  739
<212>  PRT
<213>  Human immunodeficiency virus type-1

<400>  5

Met Arg Val Met Gly Ile Gln Arg Asn Tyr Pro Gln Trp Trp Ile Trp 
1               5                   10                  15      


Ser Met Leu Gly Phe Trp Met Leu Met Ile Cys Asn Gly Met Trp Val 
            20                  25                  30          


Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Lys Thr Thr Leu 
        35                  40                  45              


Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Lys Glu Val His Asn Val 
    50                  55                  60                  


Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Met 
65                  70                  75                  80  


Val Leu Lys Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asp Met 
                85                  90                  95      


Val Asp Gln Met His Glu Asp Val Ile Ser Leu Trp Asp Gln Ser Leu 
            100                 105                 110         


Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys Thr 
        115                 120                 125             


Asn Ala Asn Ala Thr Ala Ser Asn Ser Ser Ile Ile Glu Gly Met Asn 
    130                 135                 140                 


Ser Ser Ile Ile Glu Gly Met Lys Asn Cys Ser Phe Asn Ile Thr Thr 
145                 150                 155                 160 


Glu Leu Arg Asp Lys Arg Glu Lys Lys Asn Ala Leu Phe Tyr Lys Leu 
                165                 170                 175     


Asp Ile Val Gln Leu Asp Gly Asn Ser Ser Gln Tyr Arg Leu Ile Asn 
            180                 185                 190         


Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Asp 
        195                 200                 205             


Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Tyr Ala Ile Leu Lys 
    210                 215                 220                 


Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Asn Asn Val Ser 
225                 230                 235                 240 


Thr Val Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu 
                245                 250                 255     


Leu Leu Asn Gly Ser Leu Ala Glu Gly Glu Ile Ile Ile Arg Ser Glu 
            260                 265                 270         


Asn Ile Thr Asp Asn Gly Lys Thr Ile Ile Val His Leu Asn Glu Ser 
        275                 280                 285             


Val Lys Ile Glu Cys Thr Arg Pro Ser Asn Asn Thr Arg Thr Ser Ile 
    290                 295                 300                 


Arg Ile Gly Pro Gly Gln Ala Phe Tyr Ala Thr Gly Gln Val Ile Gly 
305                 310                 315                 320 


Asp Ile Arg Glu Ala His Cys Asn Ile Ser Glu Ser Lys Trp Asn Glu 
                325                 330                 335     


Thr Leu Gln Arg Val Ser Glu Lys Leu Lys Glu Tyr Phe Pro His Lys 
            340                 345                 350         


Asn Ile Thr Phe Gln Pro Ser Ser Gly Gly Asp Leu Glu Ile Thr Thr 
        355                 360                 365             


His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Ser Ser 
    370                 375                 380                 


Leu Phe Asn Arg Thr Tyr Met Ala Thr Ser Thr Asp Met Ala Asn Ser 
385                 390                 395                 400 


Thr Glu Thr Asn Ser Thr Arg Ile Ile Thr Ile Arg Cys Arg Ile Lys 
                405                 410                 415     


Gln Ile Ile Asn Met Trp Gln Glu Val Gly Arg Ala Met Tyr Ala Pro 
            420                 425                 430         


Pro Ile Ala Gly Asn Ile Thr Cys Ile Ser Asn Ile Thr Gly Leu Leu 
        435                 440                 445             


Leu Thr Arg Asp Gly Gly Lys Asn Asn Thr Glu Thr Phe Glu Thr Phe 
    450                 455                 460                 


Arg Pro Gly Gly Gly Asn Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr 
465                 470                 475                 480 


Lys Tyr Lys Val Val Glu Val Lys Pro Leu Gly Val Ala Pro Thr Asn 
                485                 490                 495     


Ala Arg Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly Met Gly 
            500                 505                 510         


Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala 
        515                 520                 525             


Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser Gly Ile 
    530                 535                 540                 


Val Gln Gln Gln Ser Asn Leu Leu Lys Ala Ile Glu Ala Gln Gln His 
545                 550                 555                 560 


Met Leu Lys Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val 
                565                 570                 575     


Leu Ala Leu Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly Met Trp 
            580                 585                 590         


Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Asn Val Tyr Trp Asn Ser 
        595                 600                 605             


Ser Trp Ser Asn Lys Thr Tyr Gly Asp Ile Trp Asp Asn Met Thr Trp 
    610                 615                 620                 


Met Gln Trp Glu Arg Glu Ile Ser Asn Tyr Thr Glu Ile Ile Tyr Glu 
625                 630                 635                 640 


Leu Leu Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Gln Asp Leu 
                645                 650                 655     


Leu Ala Leu Asp Arg Trp Asn Ser Leu Trp Asn Trp Phe Asn Ile Thr 
            660                 665                 670         


Asn Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu 
        675                 680                 685             


Ile Gly Leu Arg Ile Ile Phe Ala Val Leu Ser Leu Val Asn Arg Val 
    690                 695                 700                 


Arg Gln Gly Tyr Ser Pro Leu Ser Leu Gln Thr Leu Ile Pro Ser Pro 
705                 710                 715                 720 


Arg Gly Pro Asp Arg Pro Gly Gly Ile Glu Glu Glu Gly Gly Glu Gln 
                725                 730                 735     


Asp Arg Asn 
            


<210>  6
<211>  2206
<212>  DNA
<213>  Human immunodeficiency virus type-1

<400>  6
atgctcgagc ggccgcacca tgagagtaac tggaattcaa agaaattatc cacaatggtg       60

gatttggtct atgctaggac tatggatgct aatgatatgt aatggaatgt gggttacagt      120

atattatggt gtaccagtat ggaaagaagc gaaaacaaca ctattttgtg cgtctgatgc      180

gaaagcgtat gaaaaagaag ttcataatgt ttgggcgaca catgcgtgtg taccaacaga      240

tccaaatcct caagaaatgg tattgaaaaa tgtaacagaa aactttaata tgtggaaaaa      300

tgatatggcg gatcaaatgc atgaagatgt aatttctcta tgggatcaat ctctaaaacc      360

atgtgtaaaa ctaacaccac tatgtgtaac attgaattgt attgatgcga atgcgacagc      420

gtctaatgcg actgcgagta attcttctat tattgaagga atgaaaaatt gttcttttaa      480

tattacaaca gaactaagag acaaaattga aaaaaaaaat gctttgtttt ataaactaga      540

tatagtacaa ctagatggaa attcttctca atatagattg attaattgta atacttctgt      600

aattactcaa gcgtgtccaa aagtatcttt tgatccaatt ccaattcatt attgtgcgcc      660

agcgggatat gcgattttga aatgtaataa taaaactttt aatggaacag gaccttgtaa      720

taatgtatct acagtacaat gtactcatgg tattaaacca gtagtatcta cacaactact      780

attgaatgga tctctagcgg aaggagaaat tattattaga tctgaaaata ttactaattc      840

tgcgaaaact attatagtac atctaaatga atctgtaaaa attgaatgta caagaccaag      900

taataataca agaacatcta ttagaattgg accaggacaa gcgttttatg cgacaggaca      960

agtaattgga gatattagaa aagcgcattg taatatttca gaaagtaaat ggaatgaaac     1020

actacaaaga gtatctaaaa aactaaaaga atattttcct cataaaaata ttacttttca     1080

accatcttct ggtggagatc tagaaattac tactcatagt tttaattgtg gtggagaatt     1140

cttctattgt aatacttcat ctttgtttaa tagaacatat atggcgaatt ctacagaaac     1200

aaattctaca agaactatta cactacattg tagaattaaa caaattataa atatgtggca     1260

agaagttggt agagctatgt atgcgccacc aattgcggga aatattacat gtatttctaa     1320

tattactgga ctactactaa caagagatgg tggaaataat aatactactg aaacttttag     1380

accaggtggt ggaaatatga aagataattg gagaagtgaa ttgtataaat ataaagtagt     1440

agaaattaaa ccattgggag tagcgccaac aaatgcgaga agaagagtag ttgaaagaga     1500

aaaaagagct gtaggaatgg gagcggtatt tctaggattt ttgggagcgg cgggatctac     1560

aatgggagct gcttctatta cattgacagt acaagcgaga caactattgt ctggaatagt     1620

tcaacaacaa tctaatctat tgaaagcgat tgaagcgcaa caacatatgc taaaactaac     1680

tgtatggggt attaaacaat tgcaagcgag agtactagcg ttggaaagat atttgaaaga     1740

tcaacaattg ctaggaatgt ggggatgttc tggaaaacta atatgtacta caaatgtata     1800

ttggaattct tcttggtcta ataaaacata tggagatata tgggataata tgacatggat     1860

gcaatgggaa agagaaatat ctgattatac agaaattata tatgaactac tagaagaatc     1920

tcaaaatcaa caagaaaaaa atgaacaaga tctattggcg ctagatagat ggaatagtct     1980

atggaattgg tttaatatta caaattggct atggtatata aaaattttca ttatgatagt     2040

tggaggacta attggactaa gaattatttt tgcggtattg tctctagtaa atagagttag     2100

acaaggatat tctccactat ctctacaaac actaactcca tctccaagag gaccagatag     2160

acctggtgga attgaagaag aaggtggaga acaagataga aattaa                    2206


<210>  7
<211>  728
<212>  PRT
<213>  Human immunodeficiency virus type-1

<400>  7

Met Arg Val Thr Gly Ile Gln Arg Asn Tyr Pro Gln Trp Trp Ile Trp 
1               5                   10                  15      


Ser Met Leu Gly Leu Trp Met Leu Met Ile Cys Asn Gly Met Trp Val 
            20                  25                  30          


Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Lys Thr Thr Leu 
        35                  40                  45              


Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Lys Glu Val His Asn Val 
    50                  55                  60                  


Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Met 
65                  70                  75                  80  


Val Leu Lys Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asp Met 
                85                  90                  95      


Ala Asp Gln Met His Glu Asp Val Ile Ser Leu Trp Asp Gln Ser Leu 
            100                 105                 110         


Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys Ile 
        115                 120                 125             


Asp Ala Asn Ala Thr Ala Ser Asn Ala Thr Ala Ser Asn Ser Ser Ile 
    130                 135                 140                 


Ile Glu Gly Met Lys Asn Cys Ser Phe Asn Ile Thr Thr Glu Leu Arg 
145                 150                 155                 160 


Asp Lys Ile Glu Lys Lys Asn Ala Leu Phe Tyr Lys Leu Asp Ile Val 
                165                 170                 175     


Gln Leu Asp Gly Asn Ser Ser Gln Tyr Arg Leu Ile Asn Cys Asn Thr 
            180                 185                 190         


Ser Val Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Asp Pro Ile Pro 
        195                 200                 205             


Ile His Tyr Cys Ala Pro Ala Gly Tyr Ala Ile Leu Lys Cys Asn Asn 
    210                 215                 220                 


Lys Thr Phe Asn Gly Thr Gly Pro Cys Asn Asn Val Ser Thr Val Gln 
225                 230                 235                 240 


Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu Asn 
                245                 250                 255     


Gly Ser Leu Ala Glu Gly Glu Ile Ile Ile Arg Ser Glu Asn Ile Thr 
            260                 265                 270         


Asn Ser Ala Lys Thr Ile Ile Val His Leu Asn Glu Ser Val Lys Ile 
        275                 280                 285             


Glu Cys Thr Arg Pro Ser Asn Asn Thr Arg Thr Ser Ile Arg Ile Gly 
    290                 295                 300                 


Pro Gly Gln Ala Phe Tyr Ala Thr Gly Gln Val Ile Gly Asp Ile Arg 
305                 310                 315                 320 


Lys Ala His Cys Asn Ile Ser Glu Ser Lys Trp Asn Glu Thr Leu Gln 
                325                 330                 335     


Arg Val Ser Lys Lys Leu Lys Glu Tyr Phe Pro His Lys Asn Ile Thr 
            340                 345                 350         


Phe Gln Pro Ser Ser Gly Gly Asp Leu Glu Ile Thr Thr His Ser Phe 
        355                 360                 365             


Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Ser Ser Leu Phe Asn 
    370                 375                 380                 


Arg Thr Tyr Met Ala Asn Ser Thr Glu Thr Asn Ser Thr Arg Thr Ile 
385                 390                 395                 400 


Thr Leu His Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val 
                405                 410                 415     


Gly Arg Ala Met Tyr Ala Pro Pro Ile Ala Gly Asn Ile Thr Cys Ile 
            420                 425                 430         


Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Asn Asn 
        435                 440                 445             


Thr Thr Glu Thr Phe Arg Pro Gly Gly Gly Asn Met Lys Asp Asn Trp 
    450                 455                 460                 


Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Glu Ile Lys Pro Leu Gly 
465                 470                 475                 480 


Val Ala Pro Thr Asn Ala Arg Arg Arg Val Val Glu Arg Glu Lys Arg 
                485                 490                 495     


Ala Val Gly Met Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly 
            500                 505                 510         


Ser Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln 
        515                 520                 525             


Leu Leu Ser Gly Ile Val Gln Gln Gln Ser Asn Leu Leu Lys Ala Ile 
    530                 535                 540                 


Glu Ala Gln Gln His Met Leu Lys Leu Thr Val Trp Gly Ile Lys Gln 
545                 550                 555                 560 


Leu Gln Ala Arg Val Leu Ala Leu Glu Arg Tyr Leu Lys Asp Gln Gln 
                565                 570                 575     


Leu Leu Gly Met Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Asn 
            580                 585                 590         


Val Tyr Trp Asn Ser Ser Trp Ser Asn Lys Thr Tyr Gly Asp Ile Trp 
        595                 600                 605             


Asp Asn Met Thr Trp Met Gln Trp Glu Arg Glu Ile Ser Asp Tyr Thr 
    610                 615                 620                 


Glu Ile Ile Tyr Glu Leu Leu Glu Glu Ser Gln Asn Gln Gln Glu Lys 
625                 630                 635                 640 


Asn Glu Gln Asp Leu Leu Ala Leu Asp Arg Trp Asn Ser Leu Trp Asn 
                645                 650                 655     


Trp Phe Asn Ile Thr Asn Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met 
            660                 665                 670         


Ile Val Gly Gly Leu Ile Gly Leu Arg Ile Ile Phe Ala Val Leu Ser 
        675                 680                 685             


Leu Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Leu Gln Thr 
    690                 695                 700                 


Leu Thr Pro Ser Pro Arg Gly Pro Asp Arg Pro Gly Gly Ile Glu Glu 
705                 710                 715                 720 


Glu Gly Gly Glu Gln Asp Arg Asn 
                725             


<210>  8
<211>  2251
<212>  DNA
<213>  Human immunodeficiency virus type-1

<400>  8
atgctcgagc ggccgcacca tgaaagttag aggaattcaa agaaattatc cacaatggtg       60

gatttggtct atgctaggac tatggatgct aatgatatgt aatggaatgt gggttacagt      120

atattatggt gtaccagtat ggaaagaagc gaaaacaaca ctattttgtg cgtctgatgc      180

gaaagcgtat gaaaaagaag ttcataatgt ttgggcgaca catgcgtgtg taccaacaga      240

tccaaatcct caagaaatgg tactagaaaa tgtaacagaa aactttaata tgtggaaaaa      300

tgatatggcg gatcaaatgc atgaagatgt aatttctcta tgggatcaat ctctaaaacc      360

atgtgtaaaa ctaacaccac tatgtgtaac attgaattgt acagatgcga atgcgacagc      420

gtctaataca aatgcgactg cgagtaatat taatgctact gcgtctaaat cttctattat      480

tgaagaaatg aaaaattgtt cttttaatat tacaacagaa ctaagagata aaagagaaaa      540

aaaatatgct ttgttttata aactagatat agtacaacta gatggaaatt cttctcaata      600

tagattgatt aattgtaata cttctgtaat tactcaagcg tgtccaaaag tatcttttga      660

tccaattcca attcattatt gtgcgccagc gggatatgcg attttgaaat gtaataataa      720

aacttttaat ggaacaggac cttgtaataa tgtatctaca gtacaatgta ctcatggtat      780

taaaccagta gtatctacac aactactatt gaatggatct ctagcggaag gagaaattat      840

tattagatct gaaaatatta cagataattc taaaacaatt atagtacatc taaatgaatc      900

tgtaaaaatt gaatgtacaa gaccaagtaa taatacaaga acatctatta gaattggacc      960

aggacaagcg ttttatgcga caggacaagt aattggagat attagagaag cgcattgtaa     1020

tattagtgaa tctaaatgga atgaaacact acaaagagta tctaaaaaac taaaagaata     1080

ttttcctgat aaaaatatta cttttcaacc atcttctggt ggagatccag aaattactac     1140

tcatagtttt aattgtggtg gagaattctt ctattgtaat acttcatctt tgtttaatag     1200

aacatatatg gcgaattcta cagaaacaaa ttctacaaga actattacac tacattgtag     1260

aattaaacaa attataaata tgtggcaaga agttggtaga gctatgtatg cgccaccaat     1320

tgcgggaaat attacatgta tttcaaatat tactggacta ctactaacaa gagatggtgg     1380

agaaaatact agagatggtg gaaataataa tactgaaact tttagacctg aaggtggaaa     1440

tatgaaagat aattggagaa gtgaattgta taaatataaa gtagtagaag taaaaccatt     1500

gggagtagcg ccaacaaaag cgagaagaag agtagttgaa agagaaaaaa gagctgtagg     1560

aatgggagcg gtatttctag gatttttggg agcggcggga tctacaatgg gagctgcttc     1620

tattacattg acagtacaag cgagacaact attgtctgga atagttcaac aacaatctaa     1680

tctattgaaa gcgattgaag cgcaacaaca tatgctaaaa ctaactgtat ggggtattaa     1740

acaattgcaa gcgagagtac tagcgttgga aagatatttg aaagatcaac aattgctagg     1800

aatgtgggga tgttctggaa aactaatatg tactacaaat gtatattgga attcttcttg     1860

gtctaataaa acatatggag atatatggga taatatgaca tggatgcaat gggaaagaga     1920

aatatctaat tatacagata ttatatatga tctactagaa gaatctcaaa atcaacaaga     1980

aaaaaatgaa caagatctat tggcgctaga tagatggaat agtctatgga attggtttaa     2040

tattacaaaa tggctatggt atataaaaat tttcattatg atagttggag gactaattgg     2100

actaagaatt atttttgcgg tattgtctct agtaaataga gttagacaag gatattctcc     2160

tctatctcta caaacactaa ttccatctcc aagaggacca gatagaccag gtggaattga     2220

agaagaaggt ggagaacaag atagaaatta a                                    2251


<210>  9
<211>  743
<212>  PRT
<213>  Human immunodeficiency virus type-1

<400>  9

Met Lys Val Arg Gly Ile Gln Arg Asn Tyr Pro Gln Trp Trp Ile Trp 
1               5                   10                  15      


Ser Met Leu Gly Leu Trp Met Leu Met Ile Cys Asn Gly Met Trp Val 
            20                  25                  30          


Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Lys Thr Thr Leu 
        35                  40                  45              


Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Lys Glu Val His Asn Val 
    50                  55                  60                  


Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Met 
65                  70                  75                  80  


Val Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asp Met 
                85                  90                  95      


Ala Asp Gln Met His Glu Asp Val Ile Ser Leu Trp Asp Gln Ser Leu 
            100                 105                 110         


Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys Thr 
        115                 120                 125             


Asp Ala Asn Ala Thr Ala Ser Asn Thr Asn Ala Thr Ala Ser Asn Ile 
    130                 135                 140                 


Asn Ala Thr Ala Ser Lys Ser Ser Ile Ile Glu Glu Met Lys Asn Cys 
145                 150                 155                 160 


Ser Phe Asn Ile Thr Thr Glu Leu Arg Asp Lys Arg Glu Lys Lys Tyr 
                165                 170                 175     


Ala Leu Phe Tyr Lys Leu Asp Ile Val Gln Leu Asp Gly Asn Ser Ser 
            180                 185                 190         


Gln Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile Thr Gln Ala Cys 
        195                 200                 205             


Pro Lys Val Ser Phe Asp Pro Ile Pro Ile His Tyr Cys Ala Pro Ala 
    210                 215                 220                 


Gly Tyr Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly 
225                 230                 235                 240 


Pro Cys Asn Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro 
                245                 250                 255     


Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Gly Glu 
            260                 265                 270         


Ile Ile Ile Arg Ser Glu Asn Ile Thr Asp Asn Ser Lys Thr Ile Ile 
        275                 280                 285             


Val His Leu Asn Glu Ser Val Lys Ile Glu Cys Thr Arg Pro Ser Asn 
    290                 295                 300                 


Asn Thr Arg Thr Ser Ile Arg Ile Gly Pro Gly Gln Ala Phe Tyr Ala 
305                 310                 315                 320 


Thr Gly Gln Val Ile Gly Asp Ile Arg Glu Ala His Cys Asn Ile Ser 
                325                 330                 335     


Glu Ser Lys Trp Asn Glu Thr Leu Gln Arg Val Ser Lys Lys Leu Lys 
            340                 345                 350         


Glu Tyr Phe Pro Asp Lys Asn Ile Thr Phe Gln Pro Ser Ser Gly Gly 
        355                 360                 365             


Asp Pro Glu Ile Thr Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe 
    370                 375                 380                 


Tyr Cys Asn Thr Ser Ser Leu Phe Asn Arg Thr Tyr Met Ala Asn Ser 
385                 390                 395                 400 


Thr Glu Thr Asn Ser Thr Arg Thr Ile Thr Leu His Cys Arg Ile Lys 
                405                 410                 415     


Gln Ile Ile Asn Met Trp Gln Glu Val Gly Arg Ala Met Tyr Ala Pro 
            420                 425                 430         


Pro Ile Ala Gly Asn Ile Thr Cys Ile Ser Asn Ile Thr Gly Leu Leu 
        435                 440                 445             


Leu Thr Arg Asp Gly Gly Glu Asn Thr Arg Asp Gly Gly Asn Asn Asn 
    450                 455                 460                 


Thr Glu Thr Phe Arg Pro Glu Gly Gly Asn Met Lys Asp Asn Trp Arg 
465                 470                 475                 480 


Ser Glu Leu Tyr Lys Tyr Lys Val Val Glu Val Lys Pro Leu Gly Val 
                485                 490                 495     


Ala Pro Thr Lys Ala Arg Arg Arg Val Val Glu Arg Glu Lys Arg Ala 
            500                 505                 510         


Val Gly Met Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser 
        515                 520                 525             


Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln Leu 
    530                 535                 540                 


Leu Ser Gly Ile Val Gln Gln Gln Ser Asn Leu Leu Lys Ala Ile Glu 
545                 550                 555                 560 


Ala Gln Gln His Met Leu Lys Leu Thr Val Trp Gly Ile Lys Gln Leu 
                565                 570                 575     


Gln Ala Arg Val Leu Ala Leu Glu Arg Tyr Leu Lys Asp Gln Gln Leu 
            580                 585                 590         


Leu Gly Met Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Asn Val 
        595                 600                 605             


Tyr Trp Asn Ser Ser Trp Ser Asn Lys Thr Tyr Gly Asp Ile Trp Asp 
    610                 615                 620                 


Asn Met Thr Trp Met Gln Trp Glu Arg Glu Ile Ser Asn Tyr Thr Asp 
625                 630                 635                 640 


Ile Ile Tyr Asp Leu Leu Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn 
                645                 650                 655     


Glu Gln Asp Leu Leu Ala Leu Asp Arg Trp Asn Ser Leu Trp Asn Trp 
            660                 665                 670         


Phe Asn Ile Thr Lys Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile 
        675                 680                 685             


Val Gly Gly Leu Ile Gly Leu Arg Ile Ile Phe Ala Val Leu Ser Leu 
    690                 695                 700                 


Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Leu Gln Thr Leu 
705                 710                 715                 720 


Ile Pro Ser Pro Arg Gly Pro Asp Arg Pro Gly Gly Ile Glu Glu Glu 
                725                 730                 735     


Gly Gly Glu Gln Asp Arg Asn 
            740             


<210>  10
<211>  2024
<212>  DNA
<213>  Human immunodeficiency virus type-1

<400>  10
atgctcgagc ggccgcacca tgagagtaat gggaattcaa agaaattatc cacaatggtg       60

gatttggtct atgctaggat tttggatgct aatgatatgt aatggaatgt gggttacagt      120

atattatggt gtaccagtat ggaaagaagc gaaaacaaca ctattttgtg cgtctgatgc      180

gaaagcgtat gaaaaagaag ttcataatgt ttgggcgaca catgcgtgtg taccaacaga      240

tccaaatcct caagaaatgg tattgaaaaa tgtaacagaa aactttaata tgtggaaaaa      300

tgatatggta gatcaaatgc atgaagatgt aatttctcta tgggatcaat ctctaaaacc      360

atgtgtaaaa ctaacaccac tatgtgtaac attgaattgt acaaatgcga cagcgtctaa      420

ttcttctatt attgaaggaa tgaaaaattg ttcttttaat attacaacag aactaagaga      480

taaaagagaa aaaaaaaatg ctttgtttta taaactagat atagtacaac tagatggaaa      540

ttcttctcaa tatagattga ttaattgtaa tacttctgta attactcaag cgtgtccaaa      600

agtatctttt gatccaattc caattcatta ttgtgcgcca gcgggatatg cgattttgaa      660

atgtaataat aaaactttta ctggaacagg accttgtaat aatgtatcta cagtacaatg      720

tactcatggt attaaaccag tagtatctac acaactacta ttgaatggat ctctagcgga      780

aggagaaatt attattagat ctgaaaatat tacaaataat gtaaaaacaa ttatagtaca      840

tctaaatgaa tctgtaaaaa ttgaatgtac tagaccaaat aataaaacaa gaacatctat      900

tagaattgga ccaggacaag cgttttatgc gacaggacaa gtaattggag atattagaga      960

agcgtattgt aatattaatg aatctaaatg gaatgaaaca ctacaaagag tatctaaaaa     1020

actaaaagaa tattttcctc ataaaaatat tacttttcaa ccatcttctg gtggagatct     1080

agaaattact actcatagtt ttaattgtgg tggagaattc ttttattgta atacttcatc     1140

tttgtttaat agaacatata tggcgaattc tacagatatg gctaatagta cagaaacaaa     1200

ttctacaaga actattacta ttcattgtag aattaaacaa attataaata tgtggcaaga     1260

agttggtaga gctatgtatg cgccaccaat tgcgggaaat attacatgta tatctaatat     1320

tactggacta ctactaacaa gagatggtgg aaaaaataat actgaaactt ttagaccagg     1380

tggtggaaat atgaaagata attggagaag tgaattgtat aaatataaag tagtagaagt     1440

aaaaccattg ggagtagcgc caactaatgc gagaagaaga gtagttgaaa gagaaaaaag     1500

agctgtagga atgggagcgg tatttttggg atttctaggt gctgcgggat ctacaatggg     1560

agctgcgagt attacactaa cagtacaagc gagacaacta ttgtctggaa tagttcaaca     1620

acaatctaat ctattgaaag cgattgaagc gcaacaacat atgctaaaac taactgtatg     1680

gggtattaaa caattgcaag cgagagtact agcgttggaa agatatttga aagatcaaca     1740

attgctagga atgtggggat gttctggaaa actaatatgt actacaaatg tatattggaa     1800

ttcttcttgg tcaaataaaa catatggaga tatatgggat aatatgacat ggatgcaatg     1860

ggaaagagaa atatctaatt atacagaaat tatatatgaa ctactagaag aatctcaaaa     1920

tcaacaagaa aaaaatgaac aagatctatt ggcgctagat agatggaata gtctatggaa     1980

ttggtttaat attactaatt ggctatggta tatataattt ttat                      2024


<210>  11
<211>  665
<212>  PRT
<213>  Human immunodeficiency virus type-1

<400>  11

Met Arg Val Met Gly Ile Gln Arg Asn Tyr Pro Gln Trp Trp Ile Trp 
1               5                   10                  15      


Ser Met Leu Gly Phe Trp Met Leu Met Ile Cys Asn Gly Met Trp Val 
            20                  25                  30          


Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Lys Thr Thr Leu 
        35                  40                  45              


Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Lys Glu Val His Asn Val 
    50                  55                  60                  


Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Met 
65                  70                  75                  80  


Val Leu Lys Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asp Met 
                85                  90                  95      


Val Asp Gln Met His Glu Asp Val Ile Ser Leu Trp Asp Gln Ser Leu 
            100                 105                 110         


Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asn Cys Thr 
        115                 120                 125             


Asn Ala Thr Ala Ser Asn Ser Ser Ile Ile Glu Gly Met Lys Asn Cys 
    130                 135                 140                 


Ser Phe Asn Ile Thr Thr Glu Leu Arg Asp Lys Arg Glu Lys Lys Asn 
145                 150                 155                 160 


Ala Leu Phe Tyr Lys Leu Asp Ile Val Gln Leu Asp Gly Asn Ser Ser 
                165                 170                 175     


Gln Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile Thr Gln Ala Cys 
            180                 185                 190         


Pro Lys Val Ser Phe Asp Pro Ile Pro Ile His Tyr Cys Ala Pro Ala 
        195                 200                 205             


Gly Tyr Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Thr Gly Thr Gly 
    210                 215                 220                 


Pro Cys Asn Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro 
225                 230                 235                 240 


Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Gly Glu 
                245                 250                 255     


Ile Ile Ile Arg Ser Glu Asn Ile Thr Asn Asn Val Lys Thr Ile Ile 
            260                 265                 270         


Val His Leu Asn Glu Ser Val Lys Ile Glu Cys Thr Arg Pro Asn Asn 
        275                 280                 285             


Lys Thr Arg Thr Ser Ile Arg Ile Gly Pro Gly Gln Ala Phe Tyr Ala 
    290                 295                 300                 


Thr Gly Gln Val Ile Gly Asp Ile Arg Glu Ala Tyr Cys Asn Ile Asn 
305                 310                 315                 320 


Glu Ser Lys Trp Asn Glu Thr Leu Gln Arg Val Ser Lys Lys Leu Lys 
                325                 330                 335     


Glu Tyr Phe Pro His Lys Asn Ile Thr Phe Gln Pro Ser Ser Gly Gly 
            340                 345                 350         


Asp Leu Glu Ile Thr Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe 
        355                 360                 365             


Tyr Cys Asn Thr Ser Ser Leu Phe Asn Arg Thr Tyr Met Ala Asn Ser 
    370                 375                 380                 


Thr Asp Met Ala Asn Ser Thr Glu Thr Asn Ser Thr Arg Thr Ile Thr 
385                 390                 395                 400 


Ile His Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly 
                405                 410                 415     


Arg Ala Met Tyr Ala Pro Pro Ile Ala Gly Asn Ile Thr Cys Ile Ser 
            420                 425                 430         


Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Lys Asn Asn Thr 
        435                 440                 445             


Glu Thr Phe Arg Pro Gly Gly Gly Asn Met Lys Asp Asn Trp Arg Ser 
    450                 455                 460                 


Glu Leu Tyr Lys Tyr Lys Val Val Glu Val Lys Pro Leu Gly Val Ala 
465                 470                 475                 480 


Pro Thr Asn Ala Arg Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val 
                485                 490                 495     


Gly Met Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr 
            500                 505                 510         


Met Gly Ala Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln Leu Leu 
        515                 520                 525             


Ser Gly Ile Val Gln Gln Gln Ser Asn Leu Leu Lys Ala Ile Glu Ala 
    530                 535                 540                 


Gln Gln His Met Leu Lys Leu Thr Val Trp Gly Ile Lys Gln Leu Gln 
545                 550                 555                 560 


Ala Arg Val Leu Ala Leu Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu 
                565                 570                 575     


Gly Met Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Asn Val Tyr 
            580                 585                 590         


Trp Asn Ser Ser Trp Ser Asn Lys Thr Tyr Gly Asp Ile Trp Asp Asn 
        595                 600                 605             


Met Thr Trp Met Gln Trp Glu Arg Glu Ile Ser Asn Tyr Thr Glu Ile 
    610                 615                 620                 


Ile Tyr Glu Leu Leu Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu 
625                 630                 635                 640 


Gln Asp Leu Leu Ala Leu Asp Arg Trp Asn Ser Leu Trp Asn Trp Phe 
                645                 650                 655     


Asn Ile Thr Asn Trp Leu Trp Tyr Ile 
            660                 665 


<210>  12
<211>  6387
<212>  DNA
<213>  Human immunodeficiency virus type-1

<400>  12
gaattcggag tatacgaacc gggaaagaga agatggttaa aaataaagcg agactatttg       60

aacgagggtt ccatggcaga ttctgccgat ttagtagtac taggtgctta ctatggtaaa      120

ggagcaaagg gtggtatcat ggcagtcttt ctaatgggtt gttacgacga tgaatccggt      180

aaatggaaga cggttaccaa gtgttcagga cacgatgata atacgttaag ggagttgcaa      240

gaccaattaa agatgattaa aattaacaag gatcccaaaa aaattccaga gtggttagta      300

gttaataaaa tctatattcc cgattttgta gtagaggatc caaaacaatc tcagatatgg      360

gaaatttcag gagcagagtt tacatcttcc aagtcccata ccgcaaatgg aatatccatt      420

agatttccta gatttactag gataagagag gataaaacgt ggaaagaatc tactcatcta      480

aacgatttag taaacttgac taaatcttaa tttttatggc gcgcctttca ttttgttttt      540

ttctatgcta taaatggtga gcaagggcga ggagctgttc accggggtgg tgcccatcct      600

ggtcgagctg gacggcgacg taaacggcca caagttcagc gtgtccggcg agggcgaggg      660

cgatgccacc tacggcaagc tgaccctgaa gttcatctgc accaccggca agctgcccgt      720

gccctggccc accctcgtga ccaccctgac ctacggcgtg cagtgcttca gccgctaccc      780

cgaccacatg aagcagcacg acttcttcaa gtccgccatg cccgaaggct acgtccagga      840

gcgcaccatc ttcttcaagg acgacggcaa ctacaagacc cgcgccgagg tgaagttcga      900

gggcgacacc ctggtgaacc gcatcgagct gaagggcatc gacttcaagg aggacggcaa      960

catcctgggg cacaagctgg agtacaacta caacagccac aacgtctata tcatggccga     1020

caagcagaag aacggcatca aggtgaactt caagatccgc cacaacatcg aggacggcag     1080

cgtgcagctc gccgaccact accagcagaa cacccccatc ggcgacggcc ccgtgctgct     1140

gcccgacaac cactacctga gcacccagtc cgccctgagc aaagacccca acgagaagcg     1200

cgatcacatg gtcctgctgg agttcgtgac cgccgccggg atcactctcg gcatgcacga     1260

gctgtacaag taagagctcc ccgattttgt agtagaggat ccaaaacaat ctcagatatg     1320

ggaaatttca ggagcagagt ttacatcttc caagtcccat accgcaaatg gaatatccat     1380

tagatttcct agatttacta ggataagaga ggataaaacg tggaaagaat ctactcatct     1440

aaacgattta gtaaacttga ctaaatctta atttttatct cgaggccgct ggtacccaac     1500

ctaaaaattg aaaataaata caaaggttct tgagggttgt gttaaattga aagcgagaaa     1560

taatcataaa taagcccggg atgggagcga gagcgagtat tctaagaggt ggaaaactag     1620

ataaatggga aagaattaga ctaagacctg gtggaaaaaa atgttatatg attaaacatc     1680

tagtatgggc gtctagagaa ctagaaagat ttgcgctaaa tccaggacta ctagaaacat     1740

ctgaaggatg tagacaaatt attaaacaac tacaaccatc tctacaaact ggaacagaag     1800

aactaagatc tttgtataat actgtagtaa cactatattg tgtacatgaa gaaattgaag     1860

ttagagatac aaaagaagcg ctagataaac tagaagaaga acaaaataaa tgtcaacaaa     1920

aagcgcaaca agcggaagcg gcggataaag gaaaagtatc tcaaaattat ccaatagtac     1980

aaaatctaca aggacaaatg gtacatcaac cactatctcc aagaacattg aatgcgtggg     2040

ttaaagtaat tgaagaaaaa ggttttaatc ctgaagtaat tccaatgttt tctgcgctat     2100

ctgaaggtgc tacaccacaa gatctaaata caatgctaaa tactgttgga ggacatcaag     2160

cggctatgca aatgctaaaa gatacaatta atgaagaagc tgcggaatgg gatagactac     2220

atccagtaca tgctggacca attgcgcctg gacaaatgag agaacctaga ggatctgata     2280

ttgcgggaac aacatctaat ctacaagaac aaattgcttg gatgacagcg aatccaccaa     2340

ttccagttgg agaattgtat aaaagatgga ttattctagg attgaacaaa attgttagaa     2400

tgtattctcc agtatctatt ctagatatta aacaaggacc aaaagaacct tttagagatt     2460

atgtagatag attctttaaa acactaagag cggaacaagc gacacaagat gtaaaaaatt     2520

ggatgactga tacactacta acacaaaatg cgaatccaga ttgtaaaact attttgagag     2580

cgctaggacc aggtgctact ctagaagaaa tgatgactgc gtgtcaaggt gtaggtggac     2640

catctcataa agcgagagta ctagcggaag ctatgtctca agtaaatcat ccaaatatta     2700

tgatgcaaag aaataacttt aaaggtccaa aaagaatagt taaatgtttt aattgtggaa     2760

aagaaggaca tattgcgaga aattgtagag cgccaagaaa aagaggatgt tggaaatgtg     2820

gaaaagaagg acatcaaatg aaagattgta cagaaagaca agcgaatttt ctaggaaaaa     2880

tttggccaag tcataaagga agacctggaa actttattca aaatagacta gaacctacag     2940

cgccaccagc ggaaagtttt agatttgaag aaacaacacc atctttgaaa caagaaccta     3000

aagaaagaga accaccacta acaagtttga aatctctatt tggatctgat ccattgtctc     3060

aataattttt atgtcgacga cctgcagcta atgtattagt taaatattaa aacttaccac     3120

gtaaaactta aaatttaaaa tgatatttca ttgacagata gatcacacat tatgaacttt     3180

caaggacttg tgttaactga caattgcaaa aatcaatggg tcgttggacc attaatagga     3240

aaaggtggat ttggtagtat ttatactact aatgacaata attatgtagt aaaaatagag     3300

cccaaagcta acggatcatt atttaccgaa caggcatttt atactagagt acttaaacca     3360

tccgttatcg aagaatggaa aaaatctcac aatataaagc acgtaggtct tatcacgtgc     3420

aaggcatttg gtctatacaa atccattaat gtggaatatc gattcttggt aattaataga     3480

ttaggtgcag atctagatgc ggtgatcaga gccaataata atagattacc aaaaaggtcg     3540

gtgatgttga tcggaatcga aatcttaaat accatacaat ttatgcacga gcaaggatat     3600

tctcacggag atattaaagc gagtaatata gtcttggatc aaatagataa gaataaatta     3660

tatctagtgg attacggatt ggtttctaaa ttcatgtcaa gcttgtctcc ctatagtgag     3720

tcgtattaga gcttggcgta atcatggtca tagctgtttc ctgtgtgaaa ttgttatccg     3780

ctcacaattc cacacaacat acgagccgga agcataaagt gtaaagcctg gggtgcctaa     3840

tgagtgagct aactcacatt aattgcgttg cgctcactgc ccgctttcga gtcgggaaac     3900

ctgtcgtgcc agctgcatta atgaatcggc caacgcgcgg ggagaggcgg tttgcgtatt     3960

gggcgctctt ccgcttcctc gctcactgac tcgctgcgct cggtcgttcg gctgcggcga     4020

gcggtatcag ctcactcaaa ggcggtaata cggttatcca cagaatcagg ggataacgca     4080

ggaaagaaca tgtgagcaaa aggccagcaa aaggccagga accgtaaaaa ggccgcgttg     4140

ctggcgtttt tcgataggct ccgcccccct gacgagcatc acaaaaatcg acgctcaagt     4200

cagaggtggc gaaacccgac aggactataa agataccagg cgtttccccc tggaagctcc     4260

ctcgtgcgct ctcctgttcc gaccctgccg cttaccggat acctgtccgc ctttctccct     4320

tcgggaagcg tggcgctttc tcatagctca cgctgtaggt atctcagttc ggtgtaggtc     4380

gttcgctcca agctgggctg tgtgcacgaa ccccccgttc agcccgaccg ctgcgcctta     4440

tccggtaact atcgtcttga gtccaacccg gtaagacacg acttatcgcc actggcagca     4500

gccactggta acaggattag cagagcgagg tatgtaggcg gtgctacaga gttcttgaag     4560

tggtggccta actacggcta cactagaagg acagtatttg gtatctgcgc tctgctgaag     4620

ccagttacct tcggaaaaag agttggtagc tcttgatccg gcaaacaaac caccgctggt     4680

agcggtggtt tttttgtttg caagcagcag attacgcgca gaaaaaaagg atctcaagaa     4740

gatcctttga tcttttctac ggggtctgac gctcagtgga acgaaaactc acgttaaggg     4800

attttggtca tgagattatc aaaaaggatc ttcacctaga tccttttaaa ttaaaaatga     4860

agttttaaat caatctaaag tatatatgag taaacttggt ctgacagtta ccaatgctta     4920

atcagtgagg cacctatctc agcgatctgt ctatttcgtt catccatagt tgcctgactc     4980

cccgtcgtgt agataactac gatacgggag ggcttaccat ctggccccag tgctgcaatg     5040

ataccgcgag acccacgctc accggctcca gatttatcag caataaacca gccagccgga     5100

agggccgagc gcagaagtgg tcctgcaact ttatccgcct ccatccagtc tattaattgt     5160

tgccgggaag ctagagtaag tagttcgcca gttaatagtt tgcgcaacgt tgttggcatt     5220

gctacaggca tcgtggtgtc acgctcgtcg tttggtatgg cttcattcag ctccggttcc     5280

caacgatcaa ggcgagttac atgatccccc atgttgtgca aaaaagcggt tagctccttc     5340

ggtcctccga tcgttgtcag aagtaagttg gccgcagtgt tatcactcat ggttatggca     5400

gcactgcata attctcttac tgtcatgcca tccgtaagat gcttttctgt gactggtgag     5460

tactcaacca agtcattctg agaatagtgt atgcggcgac cgagttgctc ttgcccggcg     5520

tcaatacggg ataataccgc gccacatagc agaactttaa aagtgctcat cattggaaaa     5580

cgttcttcgg ggcgaaaact ctcaaggatc ttaccgctgt tgagatccag ttcgatgtaa     5640

cccactcgtg cacccaactg atcttcagca tcttttactt tcaccagcgt ttctgggtga     5700

gcaaaaacag gaaggcaaaa tgccgcaaaa aagggaataa gggcgacacg gaaatgttga     5760

atactcatac tcttcctttt tcaatattat tgaagcattt atcagggtta ttgtctcatg     5820

agcggataca tatttgaatg tatttagaaa aataaacaaa taggggttcc gcgcacattt     5880

ccccgaaaag tgccacctga cgtctaagaa accattatta tcatgacatt aacctataaa     5940

aataggcgta tcacgaggcc ctttcgtctc gcgcgtttcg gtgatgacgg tgaaaacctc     6000

tgacacatgc agctcccgga gacggtcaca gcttgtctgt aagcggatgc cgggagcaga     6060

caagcccgtc agggcgcgtc agcgggtgtt ggcgggtgtc ggggctggct taactatgcg     6120

gcatcagagc agattgtact gagagtgcac catatgcggt gtgaaatacc gcacagatgc     6180

gtaaggagaa aataccgcat caggcgccat tcgccattca ggctgcgcaa ctgttgggaa     6240

gggcgatcgg tgcgggcctc ttcgctatta cgccagctgg cgaaaggggg atgtgctgca     6300

aggcgattaa gttgggtaac gccagggttt tcccagtcac gacgttgtaa aacgacggcc     6360

agtgaattgg atttaggtga cactata                                         6387


<210>  13
<211>  1485
<212>  DNA
<213>  Human immunodeficiency virus type-1

<400>  13
atgggagcga gagcgagtat tctaagaggt ggaaaactag ataaatggga aagaattaga       60

ctaagacctg gtggaaaaaa atgttatatg attaaacatc tagtatgggc gtctagagaa      120

ctagaaagat ttgcgctaaa tccaggacta ctagaaacat ctgaaggatg tagacaaatt      180

attaaacaac tacaaccatc tctacaaact ggaacagaag aactaagatc tttgtataat      240

actgtagtaa cactatattg tgtacatgaa gaaattgaag ttagagatac aaaagaagcg      300

ctagataaac tagaagaaga acaaaataaa tgtcaacaaa aagcgcaaca agcggaagcg      360

gcggataaag gaaaagtatc tcaaaattat ccaatagtac aaaatctaca aggacaaatg      420

gtacatcaac cactatctcc aagaacattg aatgcgtggg ttaaagtaat tgaagaaaaa      480

ggttttaatc ctgaagtaat tccaatgttt tctgcgctat ctgaaggtgc tacaccacaa      540

gatctaaata caatgctaaa tactgttgga ggacatcaag cggctatgca aatgctaaaa      600

gatacaatta atgaagaagc tgcggaatgg gatagactac atccagtaca tgctggacca      660

attgcgcctg gacaaatgag agaacctaga ggatctgata ttgcgggaac aacatctaat      720

ctacaagaac aaattgcttg gatgacagcg aatccaccaa ttccagttgg agaattgtat      780

aaaagatgga ttattctagg attgaacaaa attgttagaa tgtattctcc agtatctatt      840

ctagatatta aacaaggacc aaaagaacct tttagagatt atgtagatag attctttaaa      900

acactaagag cggaacaagc gacacaagat gtaaaaaatt ggatgactga tacactacta      960

acacaaaatg cgaatccaga ttgtaaaact attttgagag cgctaggacc aggtgctact     1020

ctagaagaaa tgatgactgc gtgtcaaggt gtaggtggac catctcataa agcgagagta     1080

ctagcggaag ctatgtctca agtaaatcat ccaaatatta tgatgcaaag aaataacttt     1140

aaaggtccaa aaagaatagt taaatgtttt aattgtggaa aagaaggaca tattgcgaga     1200

aattgtagag cgccaagaaa aagaggatgt tggaaatgtg gaaaagaagg acatcaaatg     1260

aaagattgta cagaaagaca agcgaatttt ctaggaaaaa tttggccaag tcataaagga     1320

agacctggaa actttattca aaatagacta gaacctacag cgccaccagc ggaaagtttt     1380

agatttgaag aaacaacacc atctttgaaa caagaaccta aagaaagaga accaccacta     1440

acaagtttga aatctctatt tggatctgat ccattgtctc aataa                     1485


<210>  14
<211>  494
<212>  PRT
<213>  Human immunodeficiency virus type-1

<400>  14

Met Gly Ala Arg Ala Ser Ile Leu Arg Gly Gly Lys Leu Asp Lys Trp 
1               5                   10                  15      


Glu Arg Ile Arg Leu Arg Pro Gly Gly Lys Lys Cys Tyr Met Ile Lys 
            20                  25                  30          


His Leu Val Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Leu Asn Pro 
        35                  40                  45              


Gly Leu Leu Glu Thr Ser Glu Gly Cys Arg Gln Ile Ile Lys Gln Leu 
    50                  55                  60                  


Gln Pro Ser Leu Gln Thr Gly Thr Glu Glu Leu Arg Ser Leu Tyr Asn 
65                  70                  75                  80  


Thr Val Val Thr Leu Tyr Cys Val His Glu Glu Ile Glu Val Arg Asp 
                85                  90                  95      


Thr Lys Glu Ala Leu Asp Lys Leu Glu Glu Glu Gln Asn Lys Cys Gln 
            100                 105                 110         


Gln Lys Ala Gln Gln Ala Glu Ala Ala Asp Lys Gly Lys Val Ser Gln 
        115                 120                 125             


Asn Tyr Pro Ile Val Gln Asn Leu Gln Gly Gln Met Val His Gln Pro 
    130                 135                 140                 


Leu Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Ile Glu Glu Lys 
145                 150                 155                 160 


Gly Phe Asn Pro Glu Val Ile Pro Met Phe Ser Ala Leu Ser Glu Gly 
                165                 170                 175     


Ala Thr Pro Gln Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly His 
            180                 185                 190         


Gln Ala Ala Met Gln Met Leu Lys Asp Thr Ile Asn Glu Glu Ala Ala 
        195                 200                 205             


Glu Trp Asp Arg Leu His Pro Val His Ala Gly Pro Ile Ala Pro Gly 
    210                 215                 220                 


Gln Met Arg Glu Pro Arg Gly Ser Asp Ile Ala Gly Thr Thr Ser Asn 
225                 230                 235                 240 


Leu Gln Glu Gln Ile Ala Trp Met Thr Ala Asn Pro Pro Ile Pro Val 
                245                 250                 255     


Gly Glu Leu Tyr Lys Arg Trp Ile Ile Leu Gly Leu Asn Lys Ile Val 
            260                 265                 270         


Arg Met Tyr Ser Pro Val Ser Ile Leu Asp Ile Lys Gln Gly Pro Lys 
        275                 280                 285             


Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Phe Lys Thr Leu Arg Ala 
    290                 295                 300                 


Glu Gln Ala Thr Gln Asp Val Lys Asn Trp Met Thr Asp Thr Leu Leu 
305                 310                 315                 320 


Thr Gln Asn Ala Asn Pro Asp Cys Lys Thr Ile Leu Arg Ala Leu Gly 
                325                 330                 335     


Pro Gly Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gln Gly Val Gly 
            340                 345                 350         


Gly Pro Ser His Lys Ala Arg Val Leu Ala Glu Ala Met Ser Gln Val 
        355                 360                 365             


Asn His Pro Asn Ile Met Met Gln Arg Asn Asn Phe Lys Gly Pro Lys 
    370                 375                 380                 


Arg Ile Val Lys Cys Phe Asn Cys Gly Lys Glu Gly His Ile Ala Arg 
385                 390                 395                 400 


Asn Cys Arg Ala Pro Arg Lys Arg Gly Cys Trp Lys Cys Gly Lys Glu 
                405                 410                 415     


Gly His Gln Met Lys Asp Cys Thr Glu Arg Gln Ala Asn Phe Leu Gly 
            420                 425                 430         


Lys Ile Trp Pro Ser His Lys Gly Arg Pro Gly Asn Phe Ile Gln Asn 
        435                 440                 445             


Arg Leu Glu Pro Thr Ala Pro Pro Ala Glu Ser Phe Arg Phe Glu Glu 
    450                 455                 460                 


Thr Thr Pro Ser Leu Lys Gln Glu Pro Lys Glu Arg Glu Pro Pro Leu 
465                 470                 475                 480 


Thr Ser Leu Lys Ser Leu Phe Gly Ser Asp Pro Leu Ser Gln 
                485                 490                 


<210>  15
<211>  7826
<212>  DNA
<213>  Human immunodeficiency virus type-1


<220>
<221>  promoter
<222>  (1)..(690)
<223>  CMV promoter

<220>
<221>  Intron
<222>  (691)..(1638)
<223>  CMV intron A

<220>
<221>  misc_feature
<222>  (1692)..(1695)
<223>  Splice Donor 1

<220>
<221>  gene
<222>  (1743)..(3227)
<223>  gag

<220>
<221>  misc_feature
<222>  (3277)..(3282)
<223>  Internal BamHI site

<220>
<221>  misc_feature
<222>  (3378)..(3380)
<223>  Splice Acceptor 3

<220>
<221>  gene
<222>  (3434)..(3648)
<223>  tat

<220>
<221>  misc_feature
<222>  (3537)..(3539)
<223>  Splice Acceptor 4C

<220>
<221>  misc_feature
<222>  (3556)..(3557)
<223>  Splice Acceptor 4A

<220>
<221>  misc_feature
<222>  (3562)..(3563)
<223>  Splice Acceptor 4B

<220>
<221>  gene
<222>  (3573)..(3648)
<223>  rev

<220>
<221>  misc_feature
<222>  (3578)..(3579)
<223>  Splice Acceptor 5

<220>
<221>  misc_feature
<222>  (3649)..(3673)
<223>  Splice Donor 4

<220>
<221>  gene
<222>  (3673)..(3850)
<223>  vpu

<220>
<221>  gene
<222>  (3851)..(6031)
<223>  gp150 env

<220>
<221>  gene
<222>  (5954)..(6031)
<223>  tat

<220>
<221>  gene
<222>  (5954)..(6031)
<223>  rev

<220>
<221>  polyA_signal
<222>  (6072)..(6294)
<223>  BGH polyA

<220>
<221>  terminator
<222>  (6295)..(6329)

<220>
<221>  gene
<222>  (6350)..(7144)
<223>  KanR

<400>  15
ggcccgcctg gcattatgcc cagtacatga ccttacggga ctttcctact tggcagtaca       60

tctacggtat tagtcatcgg ctattaccat ggtgatgcgg ttttggcagt acaccaatgg      120

gcgtggatag cggtttgact cacggggatt tccaagtctc caccccattg acgtcaatgg      180

gagtttgttt tggcaccaaa atcaacggga ctttccaaaa tgtcgtaata accccgcccc      240

gttgacgcaa atgggcggta ggcgtgtacg gtgggaggtc tatataagca gagctcgttt      300

agtgaaccgt cagatcgcct ggagacgcca tccacgctgt tttgacctcc atagaagaca      360

ccgggaccga tccagcctcc gcggccggga acggtgcatt ggaacgcgga ttccccgtgc      420

caagagtgac gtaagtaccg cctatagact ctataggcac acccctttgg ctcttatgca      480

tgctatactg tttttggctt ggggcctata cacccccgct tccttatgct ataggtgatg      540

gtatagctta gcctataggt gtgggttatt gaccattatt gaccactccc ctattggtga      600

cgatactttc cattactaat ccataacatg gctctttgcc acaactatct ctattggcta      660

tatgccaata ctctgtcctt cagagactga cacggactct gtatttttac aggatggggt      720

cccatttatt atttacaaat tcacatatac aacaacgccg tcccccgtgc ccgcagtttt      780

tattaaacat agcgtgggat ctccacgcga atctcgggta ccgtgttccg gacatgggyt      840

cttctccggt agcggcggag cttccacatc cgagccctgg tcccatgcct ccagcggctc      900

atggtcgctc ggcagctcct tgctcctaac agtggaggcc agacttaggc acagcacaat      960

gcccaccacc accagtgtgc cgcacaaggc cgtggcggta gggtatgtgt ctgaaaatga     1020

gctcggagat tgggctcgca ccgctgacgc agatggaaga cttaaggcag cggcagaaga     1080

agatgcaggc agctgagttg ttgtattctg ataagagtca gaggtaactc ccgttgcggt     1140

gctgttaacg gtggagggca gtgtagtctg agcagtactc gttgctgccg cgcgcgccac     1200

cagacataat agctgacaga ctaacagact gttcctttcc atgggtcttt tctgcagtca     1260

ccatcgatgg cttgctgaag tgcactcggc aagaggcgag agcggcggct ggtgagtacg     1320

ccaaatttta tttgactagc ggaggctaga aggagagaga tgggtgcgag agcgtcaata     1380

ttaagagggg gaaaattaga taaatgggaa agaattaggt taaggccagg gggaaagaaa     1440

tgctatatga taaaacactt agtatgggca agcagggagt tggaaagatt tgcacttaat     1500

cctggcctct tagaaacatc agaaggctgt agacaaataa taaagcagct acaaccatct     1560

cttcagacag gaacagagga acttagatca ttatataaca cagtagtaac tctctattgt     1620

gtacatgaag agatagaagt acgagacacc aaagaagcct tagacaaact agaggaagaa     1680

caaaacaaat gtcagcaaaa agcacagcaa gcagaggcgg ctgacaaagg aaaggtcagc     1740

caaaattatc ctatagtaca gaatctccaa gggcaaatgg tacaccagcc cctatcacct     1800

agaactttga atgcatgggt gaaagtaata gaagagaagg gttttaaccc agaggtaata     1860

cccatgtttt cagcattatc agaaggagcc accccacaag acttaaacac catgttaaat     1920

acagtagggg gacatcaagc agccatgcaa atgttgaaag ataccatcaa tgaggaggct     1980

gcagaatggg atagattaca tccagtccat gcagggccta ttgcaccagg ccaaatgaga     2040

gaaccaaggg gaagtgatat agcaggaaca actagcaacc ttcaggaaca aatagcatgg     2100

atgacagcta acccacctat cccagtggga gaattgtata aaagatggat aattctggga     2160

ttaaataaaa tagtaagaat gtatagccct gtcagcattt tggacataaa acaagggcca     2220

aaagaaccct ttagagacta tgtagaccgg ttctttaaaa ctttgagagc tgaacaagct     2280

acacaagatg taaaaaattg gatgacagac accttgttga cccaaaatgc gaacccagat     2340

tgtaagacca ttttaagagc attaggacca ggggctacat tagaagaaat gatgacagca     2400

tgccaaggag tgggaggacc tagccacaaa gcaagagtgc tagctgaagc aatgagccaa     2460

gtaaatcatc caaacataat gatgcagaga aacaatttta aaggaccaaa aagaattgtt     2520

aaaagcttca acagtggcaa ggaagggcac atagccagaa attgcagggc acctaggaaa     2580

aggggcagtt ggaaaagtgg aaaggaagga caccaaatga aagactgtac tgaaaggcag     2640

gctaattttt tagggaaaat ttggccttcc cacaagggga ggccagggaa tttcatccag     2700

aacaggctag agcccacagc cccaccagca gagagtttca ggttcgagga gacaaccccc     2760

agtctgaagc aggagccgaa ggagagggaa ccacccttaa cttccctcaa atcactcttt     2820

ggcagcgacc ccttgtctca ataagagtag ggggccagat aaaggaggct ctcttagaca     2880

caggagcaga tgaggatcct cattaggaca acatatctat gatacctatg gggatacttg     2940

gacaggagtt gaagctataa taagaatact tcaacagtta ctgtttactc atttcagaat     3000

tgggtgccaa catagcagaa taggcattct gcgacagaga agagcaagaa atggagccag     3060

tagaccctaa cctagagccc tggaatcatc caggaagtca gcccaaaact ccttgtaata     3120

agtgttattg taagcgatgc tgctatcatt gtctagtttg ctttcagaca aaaggcttag     3180

gcatttccca tggcaggaag aagcggagac agcgacgaag cgctcctcca agcagtgaga     3240

atcatcaaaa tcctttatca aagcagtgag tattcaataa gcatatgtaa tgtttgattt     3300

atatgcaaga gtagattata gaataggagt aggagcattg gcaatagcac taatcatagc     3360

aataatagtg tggaccatag tatatataga atataggaaa ttagtaagac aaagaaaaat     3420

agaccagtta attaaaagaa ttagggaaag agcagaagac agtggcaatg agagtgatgg     3480

ggatacagag gaattatcca caatggtgga tatggagcat gttaggcttt tggatgctaa     3540

tgatttgtaa tgggatgtgg gtcacagtct actatggggt acctgtgtgg aaagaagcaa     3600

aaactactct attttgtgca tcagatgcta aagcatatga gaaagaagtg cataatgtct     3660

gggctacaca tgcctgtgta cccacagacc ccaatccaca agaaatggtt ttaaaaaatg     3720

taacagaaaa tttcaacatg tggaaaaatg acatggtgga tcagatgcat gaagatgtaa     3780

ttagtttatg ggatcaaagc ctcaagccat gtgtaaagtt gaccccactc tgtgtcactc     3840

taaactgtac caatgctact gccagcaata gcagtataat agagggaatg aaaaattgct     3900

ctttcaatat aaccacagaa ttaagagata agagagagaa aaagaatgca cttttttata     3960

aacttgatat agtacaacta gatggcaact ctagtcagta tagattaata aattgtaata     4020

cctcagtcat aacacaagcc tgtccaaagg tctcttttga cccaattcct atacattatt     4080

gtgctccagc tggttatgcg attctaaagt gtaataataa gacattcact ggaacaggac     4140

cgtgtaataa tgtcagcaca gtacaatgta cacatggaat taagccagtg gtttcaactc     4200

aactattgtt aaatggtagc ctagcagaag gagagataat aattagatct gaaaatataa     4260

caaacaatgt caaaacaata atagtacatc tcaatgaatc tgtaaagatt gagtgtacga     4320

gacccaataa taaaacaaga acaagtataa gaataggacc aggacaagca ttttatgcaa     4380

caggacaagt aataggagac ataagagaag catattgtaa cattaatgaa agtaaatgga     4440

atgaaacttt acaaagggta agtaaaaaat taaaagaata cttccctcat aagaatataa     4500

catttcaacc atcctcagga ggggacctag aaattacaac acatagcttt aattgtggag     4560

gagaattttt ctattgcaat acatcaagcc tgtttaatag gacatatatg gctaatagta     4620

cagatatggc taatagtaca gaaactaaca gtacacgaac catcacaatc cactgcagaa     4680

taaaacaaat tataaacatg tggcaggagg tgggacgagc aatgtatgcc cctcccattg     4740

caggaaacat aacatgtata tcaaatatca caggactact attgacaagg gatggaggaa     4800

aaaacaatac ggagacattc agacctggag gaggaaatat gaaggacaat tggagaagtg     4860

aattatataa atataaagtg gtagaagtta agccattagg agtagcaccc actaatgcaa     4920

gaaggagagt ggtggagaga gaaaaaagag cagtgggaat gggagctgtg ttccttgggt     4980

tcttgggagc ggcaggaagc actatgggcg cagcatcaat aacgctgacg gtacaggcca     5040

gacaattatt gtctggtata gtgcaacagc aaagcaattt gctgaaggct atagaggctc     5100

aacagcatat gttgaaactc acggtctggg gcattaaaca gctccaggca agagtcctgg     5160

ccttggaaag atacctaaag gatcaacagc tcctagggat gtggggctgc tctggaaaac     5220

tcatctgcac cactaatgta tattggaact ctagttggag taataaaact tatggtgata     5280

tttgggataa catgacctgg atgcagtggg agagagaaat tagcaattat acagaaataa     5340

tatatgaatt gcttgaagaa tcacaaaacc agcaggaaaa gaatgaacaa gatttactag     5400

cattggacag atggaacagt ctgtggaatt ggtttaacat aacaaattgg ctgtggtata     5460

taaaaatatt cataatgata gtaggaggct tgataggttt aagaataatt tttgctgtgc     5520

tttctttagt aaatagagtt aggcagggat actcacctct gtcgttgcag acccttatcc     5580

caagcccgag gggaccagac aggcccggag gaatcgaaga agaaggtgga gagcaagaca     5640

gaaactaaaa tggctagccc cgggtgataa acggaccgcg caatccctag gctgtgcctt     5700

ctagttgcca gccatctgtt gtttgcccct cccccgtgcc ttccttgacc ctggaaggtg     5760

ccactcccac tgtcctttcc taataaaatg aggaaattgc atcgcattgt ctgagtaggt     5820

gtcattctat tctggggggt ggggtggggc aggacagcaa gggggaggat tgggaagaca     5880

atagcaggca tgctggggat gcggtgggct ctatataaaa aacgcccggc ggcaaccgag     5940

cgttctgaac gctagagtcg acaaattcag aagaactcgt caagaaggcg atagaaggcg     6000

atgcgctgcg aatcgggagc ggcgataccg taaagcacga ggaagcggtc agcccattcg     6060

ccgccaagct cttcagcaat atcacgggta gccaacgcta tgtcctgata gcggtctgcc     6120

acacccagcc ggccacagtc gatgaatcca gaaaagcggc cattttccac catgatattc     6180

ggcaagcagg catcgccatg ggtcacgacg agatcctcgc cgtcgggcat gctcgccttg     6240

agcctggcga acagttcggc tggcgcgagc ccctgatgct cttcgtccag atcatcctga     6300

tcgacaagac cggcttccat ccgagtacgt gctcgctcga tgcgatgttt cgcttggtgg     6360

tcgaatgggc aggtagccgg atcaagcgta tgcagccgcc gcattgcatc agccatgatg     6420

gatactttct cggcaggagc aaggtgagat gacaggagat cctgccccgg cacttcgccc     6480

aatagcagcc agtcccttcc cgcttcagtg acaacgtcga gcacagctgc gcaaggaacg     6540

cccgtcgtgg ccagccacga tagccgcgct gcctcgtctt gcagttcatt cagggcaccg     6600

gacaggtcgg tcttgacaaa aagaaccggg cgcccctgcg ctgacagccg gaacacggcg     6660

gcatcagagc agccgattgt ctgttgtgcc cagtcatagc cgaatagcct ctccacccaa     6720

gcggccggag aacctgcgtg caatccatct tgttcaatca tgcgaaacga tcctcatcct     6780

gtctcttgat cagatcttga tcccctgcgc catcagatcc ttggcggcaa gaaagccatc     6840

cagtttactt tgcagggctt cccaacctta ccagagggcg ccccagctgg caattccggt     6900

tcgcttgctg tccataaaac cgcccagtct agctatcgcc atgtaagccc actgcaagct     6960

acctgctttc tctttgcgct tgcgttttcc cttgtccaga tagcccagta gctgacattc     7020

atccggggtc agcaccgttt ctgcggactg gctttctacg tgaaaaggat ctaggtgaag     7080

atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg     7140

tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc     7200

tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag     7260

ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtt     7320

cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac     7380

ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc     7440

gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt     7500

tcgtgcacac agcccagctt ggagcgaacg acctacaccc gaactgagat acctacagcg     7560

tgagctatga gaaagcgcca cgcttcccga agggagaaag gcggacaggt atccggtaag     7620

cggcagggtc ggaacaggag agcgcacgag ggagcttcca gggggaaacg cctggtatct     7680

ttatagtcct gtcgggtttc gccacctctg acttgagcgt cgatttttgt gatgctcgtc     7740

aggggggcgg agcctatgga aaaacgccag caacgcggcc cttttacggt tcctggcctt     7800

ttgctggcct tttgctcaca tgttgt                                          7826


<210>  16
<211>  8186
<212>  DNA
<213>  Human immunodeficiency virus type-1


<220>
<221>  promoter
<222>  (1)..(690)
<223>  CMV promoter

<220>
<221>  Intron
<222>  (691)..(1638)
<223>  CMV intron A

<220>
<221>  misc_feature
<222>  (1692)..(1695)
<223>  Splice Donor 1

<220>
<221>  gene
<222>  (1743)..(3227)
<223>  gag

<220>
<221>  misc_feature
<222>  (3277)..(3282)
<223>  Internal BamHI site

<220>
<221>  misc_feature
<222>  (3378)..(3380)
<223>  Splice Acceptor 3

<220>
<221>  gene
<222>  (3434)..(3648)
<223>  tat

<220>
<221>  misc_feature
<222>  (3537)..(3539)
<223>  Splice Acceptor 4C

<220>
<221>  misc_feature
<222>  (3556)..(3557)
<223>  Splice Acceptor 4A

<220>
<221>  misc_feature
<222>  (3562)..(3563)
<223>  Splice Acceptor 4B

<220>
<221>  gene
<222>  (3573)..(3648)
<223>  rev

<220>
<221>  misc_feature
<222>  (3578)..(3579)
<223>  Splice Acceptor 5

<220>
<221>  misc_feature
<222>  (3649)..(3673)
<223>  Splice Donor 4

<220>
<221>  gene
<222>  (3673)..(3933)
<223>  vpu

<220>
<221>  gene
<222>  (3851)..(6391)
<223>  gp160 env

<220>
<221>  misc_feature
<222>  (5952)..(5953)
<223>  Splice Acceptor 7

<220>
<221>  gene
<222>  (5954)..(6044)
<223>  tat

<220>
<221>  gene
<222>  (5954)..(6201)
<223>  rev

<220>
<221>  polyA_signal
<222>  (6432)..(6654)
<223>  BGH polyA

<220>
<221>  terminator
<222>  (6655)..(6689)

<220>
<221>  gene
<222>  (6710)..(7504)
<223>  KanR

<400>  16
ggcccgcctg gcattatgcc cagtacatga ccttacggga ctttcctact tggcagtaca       60

tctacggtat tagtcatcgg ctattaccat ggtgatgcgg ttttggcagt acaccaatgg      120

gcgtggatag cggtttgact cacggggatt tccaagtctc caccccattg acgtcaatgg      180

gagtttgttt tggcaccaaa atcaacggga ctttccaaaa tgtcgtaata accccgcccc      240

gttgacgcaa atgggcggta ggcgtgtacg gtgggaggtc tatataagca gagctcgttt      300

agtgaaccgt cagatcgcct ggagacgcca tccacgctgt tttgacctcc atagaagaca      360

ccgggaccga tccagcctcc gcggccggga acggtgcatt ggaacgcgga ttccccgtgc      420

caagagtgac gtaagtaccg cctatagact ctataggcac acccctttgg ctcttatgca      480

tgctatactg tttttggctt ggggcctata cacccccgct tccttatgct ataggtgatg      540

gtatagctta gcctataggt gtgggttatt gaccattatt gaccactccc ctattggtga      600

cgatactttc cattactaat ccataacatg gctctttgcc acaactatct ctattggcta      660

tatgccaata ctctgtcctt cagagactga cacggactct gtatttttac aggatggggt      720

cccatttatt atttacaaat tcacatatac aacaacgccg tcccccgtgc ccgcagtttt      780

tattaaacat agcgtgggat ctccacgcga atctcgggta ccgtgttccg gacatgggyt      840

cttctccggt agcggcggag cttccacatc cgagccctgg tcccatgcct ccagcggctc      900

atggtcgctc ggcagctcct tgctcctaac agtggaggcc agacttaggc acagcacaat      960

gcccaccacc accagtgtgc cgcacaaggc cgtggcggta gggtatgtgt ctgaaaatga     1020

gctcggagat tgggctcgca ccgctgacgc agatggaaga cttaaggcag cggcagaaga     1080

agatgcaggc agctgagttg ttgtattctg ataagagtca gaggtaactc ccgttgcggt     1140

gctgttaacg gtggagggca gtgtagtctg agcagtactc gttgctgccg cgcgcgccac     1200

cagacataat agctgacaga ctaacagact gttcctttcc atgggtcttt tctgcagtca     1260

ccatcgatgg cttgctgaag tgcactcggc aagaggcgag agcggcggct ggtgagtacg     1320

ccaaatttta tttgactagc ggaggctaga aggagagaga tgggtgcgag agcgtcaata     1380

ttaagagggg gaaaattaga taaatgggaa agaattaggt taaggccagg gggaaagaaa     1440

tgctatatga taaaacactt agtatgggca agcagggagt tggaaagatt tgcacttaat     1500

cctggcctct tagaaacatc agaaggctgt agacaaataa taaagcagct acaaccatct     1560

cttcagacag gaacagagga acttagatca ttatataaca cagtagtaac tctctattgt     1620

gtacatgaag agatagaagt acgagacacc aaagaagcct tagacaaact agaggaagaa     1680

caaaacaaat gtcagcaaaa agcacagcaa gcagaggcgg ctgacaaagg aaaggtcagc     1740

caaaattatc ctatagtaca gaatctccaa gggcaaatgg tacaccagcc cctatcacct     1800

agaactttga atgcatgggt gaaagtaata gaagagaagg gttttaaccc agaggtaata     1860

cccatgtttt cagcattatc agaaggagcc accccacaag acttaaacac catgttaaat     1920

acagtagggg gacatcaagc agccatgcaa atgttgaaag ataccatcaa tgaggaggct     1980

gcagaatggg atagattaca tccagtccat gcagggccta ttgcaccagg ccaaatgaga     2040

gaaccaaggg gaagtgatat agcaggaaca actagcaacc ttcaggaaca aatagcatgg     2100

atgacagcta acccacctat cccagtggga gaattgtata aaagatggat aattctggga     2160

ttaaataaaa tagtaagaat gtatagccct gtcagcattt tggacataaa acaagggcca     2220

aaagaaccct ttagagacta tgtagaccgg ttctttaaaa ctttgagagc tgaacaagct     2280

acacaagatg taaaaaattg gatgacagac accttgttga cccaaaatgc gaacccagat     2340

tgtaagacca ttttaagagc attaggacca ggggctacat tagaagaaat gatgacagca     2400

tgccaaggag tgggaggacc tagccacaaa gcaagagtgc tagctgaagc aatgagccaa     2460

gtaaatcatc caaacataat gatgcagaga aacaatttta aaggaccaaa aagaattgtt     2520

aaaagcttca acagtggcaa ggaagggcac atagccagaa attgcagggc acctaggaaa     2580

aggggcagtt ggaaaagtgg aaaggaagga caccaaatga aagactgtac tgaaaggcag     2640

gctaattttt tagggaaaat ttggccttcc cacaagggga ggccagggaa tttcatccag     2700

aacaggctag agcccacagc cccaccagca gagagtttca ggttcgagga gacaaccccc     2760

agtctgaagc aggagccgaa ggagagggaa ccacccttaa cttccctcaa atcactcttt     2820

ggcagcgacc ccttgtctca ataagagtag ggggccagat aaaggaggct ctcttagaca     2880

caggagcaga tgaggatcct cattaggaca acatatctat gatacctatg gggatacttg     2940

gacaggagtt gaagctataa taagaatact tcaacagtta ctgtttactc atttcagaat     3000

tgggtgccaa catagcagaa taggcattct gcgacagaga agagcaagaa atggagccag     3060

tagaccctaa cctagagccc tggaatcatc caggaagtca gcccaaaact ccttgtaata     3120

agtgttattg taagcgatgc tgctatcatt gtctagtttg ctttcagaca aaaggcttag     3180

gcatttccca tggcaggaag aagcggagac agcgacgaag cgctcctcca agcagtgaga     3240

atcatcaaaa tcctttatca aagcagtgag tattcaataa gcatatgtaa tgtttgattt     3300

atatgcaaga gtagattata gaataggagt aggagcattg gcaatagcac taatcatagc     3360

aataatagtg tggaccatag tatatataga atataggaaa ttagtaagac aaagaaaaat     3420

agaccagtta attaaaagaa ttagggaaag agcagaagac agtggcaatg agagtgatgg     3480

ggatacagag gaattatcca caatggtgga tatggagcat gttaggcttt tggatgctaa     3540

tgatttgtaa tgggatgtgg gtcacagtct actatggggt acctgtgtgg aaagaagcaa     3600

aaactactct attttgtgca tcagatgcta aagcatatga gaaagaagtg cataatgtct     3660

gggctacaca tgcctgtgta cccacagacc ccaatccaca agaaatggtt ttaaaaaatg     3720

taacagaaaa tttcaacatg tggaaaaatg acatggtgga tcagatgcat gaagatgtaa     3780

ttagtttatg ggatcaaagc ctcaagccat gtgtaaagtt gaccccactc tgtgtcactc     3840

taaactgtac caatgctact gccagcaata gcagtataat agagggaatg aaaaattgct     3900

ctttcaatat aaccacagaa ttaagagata agagagagaa aaagaatgca cttttttata     3960

aacttgatat agtacaacta gatggcaact ctagtcagta tagattaata aattgtaata     4020

cctcagtcat aacacaagcc tgtccaaagg tctcttttga cccaattcct atacattatt     4080

gtgctccagc tggttatgcg attctaaagt gtaataataa gacattcact ggaacaggac     4140

cgtgtaataa tgtcagcaca gtacaatgta cacatggaat taagccagtg gtttcaactc     4200

aactattgtt aaatggtagc ctagcagaag gagagataat aattagatct gaaaatataa     4260

caaacaatgt caaaacaata atagtacatc tcaatgaatc tgtaaagatt gagtgtacga     4320

gacccaataa taaaacaaga acaagtataa gaataggacc aggacaagca ttttatgcaa     4380

caggacaagt aataggagac ataagagaag catattgtaa cattaatgaa agtaaatgga     4440

atgaaacttt acaaagggta agtaaaaaat taaaagaata cttccctcat aagaatataa     4500

catttcaacc atcctcagga ggggacctag aaattacaac acatagcttt aattgtggag     4560

gagaattttt ctattgcaat acatcaagcc tgtttaatag gacatatatg gctaatagta     4620

cagatatggc taatagtaca gaaactaaca gtacacgaac catcacaatc cactgcagaa     4680

taaaacaaat tataaacatg tggcaggagg tgggacgagc aatgtatgcc cctcccattg     4740

caggaaacat aacatgtata tcaaatatca caggactact attgacaagg gatggaggaa     4800

aaaacaatac ggagacattc agacctggag gaggaaatat gaaggacaat tggagaagtg     4860

aattatataa atataaagtg gtagaagtta agccattagg agtagcaccc actaatgcaa     4920

gaaggagagt ggtggagaga gaaaaaagag cagtgggaat gggagctgtg ttccttgggt     4980

tcttgggagc ggcaggaagc actatgggcg cagcatcaat aacgctgacg gtacaggcca     5040

gacaattatt gtctggtata gtgcaacagc aaagcaattt gctgaaggct atagaggctc     5100

aacagcatat gttgaaactc acggtctggg gcattaaaca gctccaggca agagtcctgg     5160

ccttggaaag atacctaaag gatcaacagc tcctagggat gtggggctgc tctggaaaac     5220

tcatctgcac cactaatgta tattggaact ctagttggag taataaaact tatggtgata     5280

tttgggataa catgacctgg atgcagtggg agagagaaat tagcaattat acagaaataa     5340

tatatgaatt gcttgaagaa tcacaaaacc agcaggaaaa gaatgaacaa gatttactag     5400

cattggacag atggaacagt ctgtggaatt ggtttaacat aacaaattgg ctgtggtata     5460

taaaaatatt cataatgata gtaggaggct tgataggttt aagaataatt tttgctgtgc     5520

tttctttagt aaatagagtt aggcagggat actcacctct gtcgttgcag acccttatcc     5580

caagcccgag gggaccagac aggcccggag gaatcgaaga agaaggtgga gagcaagaca     5640

gaaacagatc aacgcgatta gtgagcggat tcttagcgct tgtctgggac gacctgcgga     5700

gcctgtgcct tttcatctac caccgattga gagacttcat attaattgca gcgagagcgg     5760

gggaacttct gggacgcagc agtctcaagg gactacggag aggatgggaa gcccttaagt     5820

atctgggaag tcttgtgcag tattggggcc tggaactaaa aaggagtgct attagtctat     5880

tggataccct agcaatagca gtaggtgaag gaacagatag gattctagaa tttgtattag     5940

gaatttgtag agctatccgc aacataccta caagaataag acagggcttt gaaacagctt     6000

tgctataaaa tggctagccc cgggtgataa acggaccgcg caatccctag gctgtgcctt     6060

ctagttgcca gccatctgtt gtttgcccct cccccgtgcc ttccttgacc ctggaaggtg     6120

ccactcccac tgtcctttcc taataaaatg aggaaattgc atcgcattgt ctgagtaggt     6180

gtcattctat tctggggggt ggggtggggc aggacagcaa gggggaggat tgggaagaca     6240

atagcaggca tgctggggat gcggtgggct ctatataaaa aacgcccggc ggcaaccgag     6300

cgttctgaac gctagagtcg acaaattcag aagaactcgt caagaaggcg atagaaggcg     6360

atgcgctgcg aatcgggagc ggcgataccg taaagcacga ggaagcggtc agcccattcg     6420

ccgccaagct cttcagcaat atcacgggta gccaacgcta tgtcctgata gcggtctgcc     6480

acacccagcc ggccacagtc gatgaatcca gaaaagcggc cattttccac catgatattc     6540

ggcaagcagg catcgccatg ggtcacgacg agatcctcgc cgtcgggcat gctcgccttg     6600

agcctggcga acagttcggc tggcgcgagc ccctgatgct cttcgtccag atcatcctga     6660

tcgacaagac cggcttccat ccgagtacgt gctcgctcga tgcgatgttt cgcttggtgg     6720

tcgaatgggc aggtagccgg atcaagcgta tgcagccgcc gcattgcatc agccatgatg     6780

gatactttct cggcaggagc aaggtgagat gacaggagat cctgccccgg cacttcgccc     6840

aatagcagcc agtcccttcc cgcttcagtg acaacgtcga gcacagctgc gcaaggaacg     6900

cccgtcgtgg ccagccacga tagccgcgct gcctcgtctt gcagttcatt cagggcaccg     6960

gacaggtcgg tcttgacaaa aagaaccggg cgcccctgcg ctgacagccg gaacacggcg     7020

gcatcagagc agccgattgt ctgttgtgcc cagtcatagc cgaatagcct ctccacccaa     7080

gcggccggag aacctgcgtg caatccatct tgttcaatca tgcgaaacga tcctcatcct     7140

gtctcttgat cagatcttga tcccctgcgc catcagatcc ttggcggcaa gaaagccatc     7200

cagtttactt tgcagggctt cccaacctta ccagagggcg ccccagctgg caattccggt     7260

tcgcttgctg tccataaaac cgcccagtct agctatcgcc atgtaagccc actgcaagct     7320

acctgctttc tctttgcgct tgcgttttcc cttgtccaga tagcccagta gctgacattc     7380

atccggggtc agcaccgttt ctgcggactg gctttctacg tgaaaaggat ctaggtgaag     7440

atcctttttg ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg     7500

tcagaccccg tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc     7560

tgctgcttgc aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag     7620

ctaccaactc tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtt     7680

cttctagtgt agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac     7740

ctcgctctgc taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc     7800

gggttggact caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt     7860

tcgtgcacac agcccagctt ggagcgaacg acctacaccc gaactgagat acctacagcg     7920

tgagctatga gaaagcgcca cgcttcccga agggagaaag gcggacaggt atccggtaag     7980

cggcagggtc ggaacaggag agcgcacgag ggagcttcca gggggaaacg cctggtatct     8040

ttatagtcct gtcgggtttc gccacctctg acttgagcgt cgatttttgt gatgctcgtc     8100

aggggggcgg agcctatgga aaaacgccag caacgcggcc cttttacggt tcctggcctt     8160

ttgctggcct tttgctcaca tgttgt                                          8186


