
                                SEQUENCE LISTING

<110> Binley, James

<120> ENV TRIMER IMMUNOGENS
  

<130> TPIMS.006WO

<150> 61/360067           
<151> 2010-06-30  

<160> 46

<170> FastSEQ for Windows Version 4.0

<210> 1
<211> 844
<212> PRT
<213> HIV-1

<220> 
<221> VARIANT        
<222> (0)...(0)
<223> Xaa = any amino acid

<400> 1
Met Lys Val Arg Gly Ile Gln Arg Asn Tyr Gln His Leu Leu Thr Trp
 1               5                  10                  15      
Gly Thr Met Ile Leu Gly Ile Leu Gly Phe Cys Asn Ala Ala Glu Asn
            20                  25                  30          
Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Asp Ala Glu
        35                  40                  45              
Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Ser Thr Glu Lys
    50                  55                  60                  
His Xaa Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro
65                  70                  75                  80  
Gln Glu Ile His Leu Glu Asn Val Thr Glu Glu Phe Asn Met Trp Lys
                85                  90                  95      
Asn Asn Met Val Asp Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp
            100                 105                 110         
Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
        115                 120                 125             
Asn Cys Asn Asn Ile Asn Asp Ser Lys Ile Ile Asp Lys Glu Met Lys
    130                 135                 140                 
Gly Gln Ile Lys Asn Cys Ser Tyr Asn Met Thr Thr Glu Leu Arg Asp
145                 150                 155                 160 
Lys Lys Lys Gln Val Tyr Ser Leu Phe Tyr Lys Val Asp Val Val Pro
                165                 170                 175     
Ile Glu Glu Asn Asn Gly Asn Ser Asn Ser Ser Glu Tyr Arg Leu Ile
            180                 185                 190         
Asn Cys Asn Thr Ser Ala Ile Thr Gln Ala Cys Pro Lys Val Ser Phe
        195                 200                 205             
Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu
    210                 215                 220                 
Lys Cys Arg Asp Arg Glu Phe Asn Gly Thr Gly Pro Cys Lys Asn Val
225                 230                 235                 240 
Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln
                245                 250                 255     
Leu Leu Leu Asn Gly Ser Leu Ser Glu Lys Glu Ile Ile Ile Arg Ala
            260                 265                 270         
Glu Asn Ile Thr Asn Asn Ala Lys Ile Ile Ile Val Gln Leu Asn Glu
        275                 280                 285             
Ser Val Trp Ile Asn Cys Ser Arg Pro Asn Asn Asn Thr Arg Lys Ser
    290                 295                 300                 
Val Arg Ile Gly Pro Gly Gln Ala Phe Phe Ala Thr Gly Glu Ile Ile
305                 310                 315                 320 
Gly Asp Ile Arg Gln Ala Gln Cys Asn Ile Ser Arg Ser Lys Trp Asn
                325                 330                 335     
Glu Thr Leu Gln Arg Val Lys Gly Lys Leu Lys Asp Tyr Phe Lys Asn
            340                 345                 350         
Asn Ile Thr Phe Asp Asn Ser Ser Gly Gly Asp Leu Glu Ile Thr Thr
        355                 360                 365             
His Ser Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys Asn Thr Thr Gly
    370                 375                 380                 
Leu Phe Asn Glu Ser Leu Leu Asn Asn Ser Thr Asn Glu Asn Ile Thr
385                 390                 395                 400 
Leu Pro Cys Lys Ile Lys Gln Ile Val Arg Met Trp Gln Arg Val Gly
                405                 410                 415     
Gln Ala Met Tyr Ala Pro Pro Ile Ala Gly Lys Leu Glu Cys Arg Ser
            420                 425                 430         
Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Lys Glu Asn Gln
        435                 440                 445             
Thr Glu Asn Asn Pro Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg
    450                 455                 460                 
Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu
465                 470                 475                 480 
Pro Leu Gly Val Ala Pro Thr Lys Ala Arg Arg Arg Val Val Glu Arg
                485                 490                 495     
Glu Lys Arg Ala Val Gly Met Gly Ala Leu Phe Leu Gly Phe Leu Gly
            500                 505                 510         
Thr Ala Gly Ser Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val Gln
        515                 520                 525             
Ala Arg Xaa Leu Leu Ser Gly Ile Val Gln Gln Gln Asn Asn Leu Leu
    530                 535                 540                 
Lys Ala Ile Glu Ala Gln Gln His Leu Leu Lys Leu Thr Val Trp Gly
545                 550                 555                 560 
Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Lys
                565                 570                 575     
Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys
            580                 585                 590         
Thr Thr Asn Val Pro Trp Asn Ser Ser Trp Ser Asn Lys Thr Glu Gly
        595                 600                 605             
Glu Ile Trp Asp Asn Met Thr Trp Leu Gln Trp Asp Lys Glu Ile Ser
    610                 615                 620                 
Asn Tyr Thr Gln Ile Ile Tyr Asn Leu Leu Glu Glu Ser Gln Asn Gln
625                 630                 635                 640 
Gln Glu Lys Asn Glu Gln Asp Leu Leu Ala Leu Asp Lys Trp Asp Ser
                645                 650                 655     
Leu Trp Asn Trp Phe Ser Ile Ser Lys Trp Leu Trp Tyr Ile Lys Ile
            660                 665                 670         
Phe Ile Met Ile Val Gly Gly Leu Ile Gly Leu Arg Ile Val Phe Ala
        675                 680                 685             
Val Ile Ser Val Ile Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser
    690                 695                 700                 
Phe Gln Ile His Thr Pro Asn Pro Arg Gly Pro Asp Arg Pro Gly Arg
705                 710                 715                 720 
Ile Glu Glu Glu Gly Gly Glu Pro Gly Arg Asp Arg Ser Thr Arg Leu
                725                 730                 735     
Val Asn Gly Phe Leu Ala Leu Val Trp Asp Asp Leu Arg Ser Leu Phe
            740                 745                 750         
Leu Phe Ser Tyr His His Leu Arg Asp Phe Ile Leu Ile Ala Ala Arg
        755                 760                 765             
Thr Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Ala Leu Lys Tyr Leu
    770                 775                 780                 
Gly Asn Leu Leu Leu Tyr Trp Gly Arg Glu Leu Lys Ile Ser Ala Ile
785                 790                 795                 800 
Asn Leu Leu Asp Thr Ile Ala Ile Ala Val Ala Gly Trp Thr Asp Arg
                805                 810                 815     
Ala Ile Glu Val Gly Gln Arg Ile Xaa Xaa Ala Val Leu Asn Ile Pro
            820                 825                 830         
Arg Arg Ile Arg Gln Xaa Phe Glu Arg Ala Leu Leu
        835                 840                 


<210> 2
<211> 861
<212> PRT
<213> HIV-1

<220> 
<221> VARIANT        
<222> (0)...(0)
<223> Xaa = any amino acid

<400> 2
Met Arg Val Met Gly Xaa Gln Met Asn Trp Gln Gly Leu Trp Arg Trp
 1               5                  10                  15      
Gly Thr Met Ile Leu Gly Met Ile Ile Ile Cys Ser Ala Ala Asp Asn
            20                  25                  30          
Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Arg Asp Ala Asp
        35                  40                  45              
Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Xaa Glu Ala
    50                  55                  60                  
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro
65                  70                  75                  80  
Gln Glu Xaa His Leu Lys Asn Val Thr Glu Glu Phe Asn Met Trp Lys
                85                  90                  95      
Asn Asn Met Val Glu Gln Met His Thr Asp Ile Ile Ser Leu Trp Asp
            100                 105                 110         
Gln Xaa Leu Lys Pro Cys Val Lys Leu Thr Pro Xaa Cys Val Thr Leu
        115                 120                 125             
Asx Cys Val Asn Asn Ile Thr Phe Tyr Asn Asn Ser Ser Pro Gln Phe
    130                 135                 140                 
Thr Asn Ser Ser Asp Met Arg Asn Xaa Ser Phe Asn Met Thr Thr Glu
145                 150                 155                 160 
Leu Arg Asp Lys Xaa Gln Xaa Val His Ser Leu Phe Tyr Lys Leu Asp
                165                 170                 175     
Ile Val Pro Ile Gly Gly Thr Asn Asn Xaa Asp Gly Gln Tyr Arg Leu
            180                 185                 190         
Ile Asn Cys Asn Thr Ser Ala Ile Thr Gln Ala Cys Pro Lys Val Ser
        195                 200                 205             
Phe Glu Pro Ile Pro Ile His Tyr Xaa Thr Pro Ala Gly Phe Ala Ile
    210                 215                 220                 
Leu Leu Cys Asn Asp Lys Xaa Phe Asn Gly Thr Gly Pro Cys Lys Asn
225                 230                 235                 240 
Val Ser Ser Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr
                245                 250                 255     
Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Arg Ile Arg
            260                 265                 270         
Ser Glu Xaa Leu Thr Asp Asn Ala Lys Xaa Ile Ile Val Gln Leu Xaa
        275                 280                 285             
Xaa Pro Val Gln Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys
    290                 295                 300                 
Ser Val His Ile Gly Pro Gly Gln Ala Phe Phe Ala Thr Gly Asp Ile
305                 310                 315                 320 
Ile Gly Asp Ile Arg Glu Ala Phe Cys Glu Val Asn Thr Lys Lys Trp
                325                 330                 335     
Asn Ala Thr Leu Gln Lys Val Ala Xaa Gln Leu Lys Asn Tyr Phe Asn
            340                 345                 350         
Lys Thr Ile Ile Phe Asn Ser Ser Ser Gly Gly Asp Leu Glu Ile Thr
        355                 360                 365             
Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Ser
    370                 375                 380                 
Arg Leu Xaa Asn Ser Thr Trp Met Phe Asn Gly Thr Trp Gly Asn Asn
385                 390                 395                 400 
Thr Val Glu Xaa Glu Lys Ser Asn Asp Thr Leu Xaa Leu Pro Xaa Lys
                405                 410                 415     
Ile Lys Gln Ile Ile Arg Met Trp Gln Arg Ala Gly Gln Ala Met Tyr
            420                 425                 430         
Ala Pro Pro Ile Gln Gly Val Ile Xaa Cys Val Ser Asn Ile Thr Gly
        435                 440                 445             
Leu Leu Leu Thr Arg Asx Gly Gly Lys Xaa Ser Asn Glu Ser Glu Thr
    450                 455                 460                 
Phe Arg Pro Glu Gly Gly Asn Met Arg Asp Asn Trp Arg Ser Glu Leu
465                 470                 475                 480 
Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr
                485                 490                 495     
Xaa Ala Lys Arg Arg Val Val Gln Arg Glu Xaa Arg Ala Ala Ile Gly
            500                 505                 510         
Leu Gly Ala Val Phe Xaa Gly Phe Leu Gly Ala Ala Gly Ser Thr Met
        515                 520                 525             
Gly Ala Ala Ser Met Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser
    530                 535                 540                 
Gly Ile Val Gln Gln Gln Ser Asn Leu Leu Xaa Ala Ile Glu Ala Gln
545                 550                 555                 560 
Gln His Leu Leu Lys Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala
                565                 570                 575     
Arg Val Leu Ala Val Glu Arg Tyr Leu Arg Asp Gln Gln Leu Leu Gly
            580                 585                 590         
Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala Val Pro Trp
        595                 600                 605             
Asn Ser Thr Trp Ser Lys Lys Asn Gln Ser Glu Ile Trp Asp Asn Met
    610                 615                 620                 
Thr Trp Leu Gln Trp Asp Lys Glu Ile Ser Asn Tyr Thr Asp Ile Ile
625                 630                 635                 640 
Tyr Asn Leu Leu Glu Glu Xaa Gln Asn Gln Gln Glu Lys Asn Glu Gln
                645                 650                 655     
Asp Leu Leu Ala Leu Asp Lys Trp Ala Xaa Leu Trp Asn Trp Phe Asp
            660                 665                 670         
Ile Ser Lys Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile Val Gly
        675                 680                 685             
Gly Leu Ile Gly Leu Arg Ile Val Phe Ala Val Leu Ser Ile Ile Asn
    690                 695                 700                 
Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Ile His Xaa Pro
705                 710                 715                 720 
Asn Pro Glu Ala Leu Asp Arg Pro Glu Xaa Ile Glu Glu Glu Gly Gly
                725                 730                 735     
Glu Gln Gly Arg Asp Arg Ser Ile Arg Leu Val Ser Gly Phe Leu Ala
            740                 745                 750         
Leu Ala Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe Ser Tyr His Arg
        755                 760                 765             
Leu Arg Asp Phe Ile Leu Ile Val Thr Arg Thr Val Glu Leu Leu Gly
    770                 775                 780                 
His Ser Ser Leu Lys Gly Leu Arg Leu Gly Trp Glu Gly Leu Lys Tyr
785                 790                 795                 800 
Leu Gly Asn Leu Leu Thr Xaa Trp Gly Gln Glu Leu Lys Xaa Ser Ala
                805                 810                 815     
Ile Asn Leu Leu Asp Thr Xaa Ala Ile Ala Val Ala Gly Trp Thr Asp
            820                 825                 830         
Arg Val Ile Glu Ile Val Gln Arg Ile Cys Arg Ala Phe Leu Asn Ile
        835                 840                 845             
Pro Arg Arg Ile Arg Gln Gly Leu Glu Arg Ile Leu Leu
    850                 855                 860     


<210> 3
<211> 863
<212> PRT
<213> HIV-1

<400> 3
Met Ile Val Met Gly Thr Gln Arg Asn Tyr Gln His Leu Leu Arg Trp
 1               5                  10                  15      
Gly Thr Met Ile Leu Gly Leu Ile Ile Ile Cys Ser Ala Ala Asp Asn
            20                  25                  30          
Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Asp Ala Glu
        35                  40                  45              
Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Thr Glu Lys
    50                  55                  60                  
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro
65                  70                  75                  80  
Gln Glu Ile Pro Leu Glu Asn Val Thr Glu Glu Phe Asn Met Trp Lys
                85                  90                  95      
Asn Lys Met Val Glu Gln Met His Thr Asp Ile Ile Ser Leu Trp Asp
            100                 105                 110         
Gln Ser Leu Gln Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
        115                 120                 125             
Asn Cys Thr Asp Ala Thr Asn Gly Thr Ile Gly Asn Ile Thr Asp Glu
    130                 135                 140                 
Met Lys Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Glu Ile
145                 150                 155                 160 
Arg Asp Lys Lys Gln Lys Val Tyr Ser Leu Phe Tyr Arg Leu Asp Val
                165                 170                 175     
Val Pro Ile Glu Pro Asp Ser Ser Asn Ser Ser Arg Asn Ser Ser Glu
            180                 185                 190         
Tyr Arg Leu Ile Asn Cys Asn Thr Ser Ala Ile Thr Gln Ala Cys Pro
        195                 200                 205             
Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly
    210                 215                 220                 
Phe Ala Ile Leu Lys Cys Arg Asp Lys Glu Phe Asn Gly Thr Gly Lys
225                 230                 235                 240 
Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro Val
                245                 250                 255     
Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Gly Glu Val
            260                 265                 270         
Arg Ile Arg Ser Glu Asn Ile Thr Asn Asn Ala Lys Thr Ile Ile Val
        275                 280                 285             
Gln Leu Val Glu Pro Val Arg Ile Asn Cys Thr Arg Pro Asn Asn Asn
    290                 295                 300                 
Thr Arg Glu Ser Val Arg Ile Gly Pro Gly Gln Ala Phe Phe Ala Thr
305                 310                 315                 320 
Gly Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Val Ser Arg
                325                 330                 335     
Ser Gln Trp Asn Lys Thr Leu Gln Gln Val Ala Ala Gln Leu Gly Glu
            340                 345                 350         
His Phe Lys Asn Lys Ala Ile Thr Phe Asn Ser Ser Ser Gly Gly Asp
        355                 360                 365             
Leu Glu Ile Thr Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr
    370                 375                 380                 
Cys Asn Thr Ser Gly Leu Phe Asn Ser Thr Trp Lys Ala Asn Asn Gly
385                 390                 395                 400 
Thr Trp Lys Ala Asn Ile Ser Glu Ser Asn Asn Thr Glu Ile Thr Leu
                405                 410                 415     
Gln Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Arg Thr Gly Gln
            420                 425                 430         
Ala Ile Tyr Ala Pro Pro Ile Gln Gly Val Ile Arg Cys Glu Ser Asn
        435                 440                 445             
Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Glu Gly Asn Asn Glu
    450                 455                 460                 
Ser Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg
465                 470                 475                 480 
Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val
                485                 490                 495     
Ala Pro Thr Arg Ala Arg Arg Arg Val Val Gly Arg Glu Lys Arg Ala
            500                 505                 510         
Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser
        515                 520                 525             
Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln Leu
    530                 535                 540                 
Leu Ser Gly Ile Val Gln Gln Gln Ser Asn Leu Leu Arg Ala Ile Glu
545                 550                 555                 560 
Ala Gln Gln His Met Leu Lys Leu Thr Val Trp Gly Ile Lys Gln Leu
                565                 570                 575     
Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Arg Asp Gln Gln Leu
            580                 585                 590         
Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Asn Val
        595                 600                 605             
Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser His Asp Glu Ile Trp Asn
    610                 615                 620                 
Asn Met Thr Trp Leu Gln Trp Asp Lys Glu Ile Ser Asn Tyr Thr Asn
625                 630                 635                 640 
Leu Ile Tyr Ser Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn
                645                 650                 655     
Glu Gln Asp Leu Leu Ala Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp
            660                 665                 670         
Phe Asp Ile Ser Lys Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile
        675                 680                 685             
Val Gly Gly Leu Ile Gly Leu Arg Ile Val Phe Ala Val Leu Ala Val
    690                 695                 700                 
Ile Lys Arg Val Arg Gln Gly Tyr Ser Pro Val Ser Phe Gln Ile His
705                 710                 715                 720 
Asn Pro Asn Pro Gly Gly Leu Asp Arg Pro Gly Arg Ile Glu Glu Glu
                725                 730                 735     
Gly Gly Glu Pro Gly Arg Gly Arg Ser Ile Arg Leu Val Ser Gly Phe
            740                 745                 750         
Leu Ala Leu Ala Trp Asp Asp Leu Arg Asn Leu Cys Leu Phe Ser Tyr
        755                 760                 765             
His Arg Leu Arg Asp Phe Ala Leu Ile Val Ala Arg Thr Val Glu Leu
    770                 775                 780                 
Leu Gly His Ser Ser Leu Lys Gly Leu Arg Leu Gly Trp Glu Gly Leu
785                 790                 795                 800 
Lys Tyr Leu Trp Asn Leu Leu Val Tyr Trp Ser Gln Glu Leu Lys Thr
                805                 810                 815     
Ser Ala Ile Asn Leu Val Asp Thr Ile Ala Ile Ala Val Ala Gly Trp
            820                 825                 830         
Thr Asp Arg Val Ile Glu Ile Gly Gln Gly Ile Gly Arg Ala Phe Leu
        835                 840                 845             
His Ile Pro Arg Arg Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu
    850                 855                 860             


<210> 4
<211> 2592
<212> DNA
<213> HIV-1

<400> 4
atgatagtga tggggacaca gaggaattat cagcacttat tgagatgggg aactatgatc 60
ttgggattga taataatctg tagtgctgca gacaacttgt gggttactgt ctattatggg 120
gtacctgtgt ggaaagatgc agagaccacc ttattttgtg catcagatgc taaagcatat 180
gagacagaaa agcataatgt ttgggctaca catgcctgtg tgcccacaga ccccaaccca 240
caagaaatac ctttggaaaa tgtgacagaa gagtttaaca tgtggaaaaa taaaatggta 300
gaacaaatgc atacagatat aatcagtcta tgggaccaaa gcctacagcc atgtgtaaag 360
ttaacccctc tctgtgttac tttaaattgt acggatgcta ctaatggtac gattggcaac 420
atcaccgatg aaatgaaggg agaaataaaa aactgctctt tcaatataac cacagaaata 480
agggataaga aacagaaagt atattcactt ttttatagac ttgatgtagt accaatagag 540
ccagatagta gtaatagtag tagaaacagt agtgagtata gattaataaa ttgtaatacc 600
tcagccatta cacaagcctg cccaaaggta agctttgagc caattcccat acattattgt 660
gccccagctg gttttgcgat cctgaagtgt agggataaag agttcaatgg aacagggaaa 720
tgcaagaatg tcagcacagt ccaatgcaca catggaatca agccagtagt atcaactcaa 780
ctgctgttaa atggcagtct agcagaagga gaggtaagaa ttagatctga aaatatcaca 840
aacaatgcca aaactataat agtacaactt gtcgagcctg tgagaattaa ttgtactaga 900
cctaataaca atacaagaga gagtgtgcgt atagggccag gacaagcatt ctttgcaaca 960
ggtgacataa taggggatat aagacaagca cattgtaatg tcagtagatc acaatggaat 1020
aagactttac aacaggtagc tgcacaatta ggagaacact ttaaaaacaa agcaataaca 1080
tttaacagtt cctcaggagg agatctagaa atcacaacac atagttttaa ttgtggagga 1140
gaatttttct attgtaatac atcaggtctg ttcaatagca cctggaaggc caacaatggc 1200
acctggaagg ccaacatatc agagtcaaat aacacggaga taactctcca atgcagaata 1260
aagcaaatta taaatatgtg gcagagaaca ggacaagcaa tatatgcccc tcccatccag 1320
ggagtgataa ggtgtgaatc aaacatcaca ggactactgt taacaagaga tggtggggag 1380
gggaacaatg aaagtgagat cttcagacct ggaggaggag atatgaggga caactggaga 1440
agtgaattat ataagtataa agtagtaaaa attgaaccac taggagtagc acccaccagg 1500
gcaaggagaa gagtggtggg aagagaaaaa agagcagttg gaataggagc tgttttcctt 1560
gggttcttag gagcagcagg aagcactatg ggcgcggcgt caataacgct gacggtacag 1620
gccaggcaat tattgtctgg catagtgcaa cagcaaagca atttgctgag ggctatagag 1680
gctcaacaac atatgttgaa actcacggtc tggggcatta aacagctcca ggcaagagtc 1740
cttgctgtgg agagatacct aagggatcaa cagctcctag gaatttgggg ctgctctgga 1800
aaactcatct gcaccactaa tgtgccctgg aactctagtt ggagtaataa atctcatgat 1860
gaaatatgga acaacatgac ctggctgcaa tgggataaag aaattagcaa ttacacaaac 1920
ctaatatata gtctaattga agaatcgcaa aaccagcagg aaaagaatga acaagattta 1980
ttggcattgg acaagtgggc aagtctgtgg aattggtttg acatatcaaa gtggctgtgg 2040
tatataaaaa tatttataat gatagtagga ggtttaatag gattaagaat agtttttgct 2100
gtgcttgctg taataaagag agttaggcag ggatactcac ctgtgtcatt tcagatccac 2160
aacccaaacc cagggggtct cgacaggccc ggaagaatcg aagaagaagg tggagagcca 2220
ggcagaggca gatcgattcg attagtgagc ggattcttag cacttgcctg ggacgatctg 2280
aggaacctgt gcctcttcag ctaccatcgc ttgagagact tcgccttgat tgttgcgagg 2340
actgtggaac ttctgggaca cagcagtctc aaggggttga gactggggtg ggaaggcctc 2400
aagtatctgt ggaatctcct ggtatactgg agtcaggaac tgaaaactag tgctattaat 2460
ttggttgata ctatagcaat agcagtagct ggctggacag atagggttat agaaatagga 2520
caaggaattg gtagagcttt tctccacata cctagaagaa tcagacaggg cttagaaagg 2580
gcattgctgt aa                                                     2592

<210> 5
<211> 847
<212> PRT
<213> HIV-1

<400> 5
Met Arg Val Lys Gly Ile Arg Lys Ser Tyr Gln Tyr Leu Trp Lys Gly
 1               5                  10                  15      
Gly Thr Leu Leu Leu Gly Ile Leu Met Ile Cys Ser Ala Val Glu Lys
            20                  25                  30          
Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr
        35                  40                  45              
Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val
    50                  55                  60                  
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro
65                  70                  75                  80  
Gln Glu Val Val Leu Glu Asn Val Thr Glu His Phe Asn Met Trp Lys
                85                  90                  95      
Asn Asn Met Val Glu Gln Met Gln Glu Asp Ile Ile Ser Leu Trp Asp
            100                 105                 110         
Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
        115                 120                 125             
Asn Cys Lys Asp Val Asn Ala Thr Asn Thr Thr Asn Asp Ser Glu Gly
    130                 135                 140                 
Thr Met Glu Arg Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr
145                 150                 155                 160 
Ser Ile Arg Asp Glu Val Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu
                165                 170                 175     
Asp Val Val Pro Ile Asp Asn Asn Asn Thr Ser Tyr Arg Leu Ile Ser
            180                 185                 190         
Cys Asp Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Ile Ser Phe Glu
        195                 200                 205             
Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys
    210                 215                 220                 
Cys Asn Asp Lys Thr Phe Asn Gly Lys Gly Pro Cys Lys Asn Val Ser
225                 230                 235                 240 
Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu
                245                 250                 255     
Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Asp
            260                 265                 270         
Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln Leu Lys Glu Ser
        275                 280                 285             
Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile
    290                 295                 300                 
His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly Glu Ile Ile Gly
305                 310                 315                 320 
Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Ala Lys Trp Asn Asp
                325                 330                 335     
Thr Leu Lys Gln Ile Val Ile Lys Leu Arg Glu Gln Phe Glu Asn Lys
            340                 345                 350         
Thr Ile Val Phe Asn His Ser Ser Gly Gly Asp Pro Glu Ile Val Met
        355                 360                 365             
His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln
    370                 375                 380                 
Leu Phe Asn Ser Thr Trp Asn Asn Asn Thr Glu Gly Ser Asn Asn Thr
385                 390                 395                 400 
Glu Gly Asn Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn
                405                 410                 415     
Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly
            420                 425                 430         
Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp
        435                 440                 445             
Gly Gly Ile Asn Glu Asn Gly Thr Glu Ile Phe Arg Pro Gly Gly Gly
    450                 455                 460                 
Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val
465                 470                 475                 480 
Lys Ile Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val
                485                 490                 495     
Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly
            500                 505                 510         
Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Met Thr Leu
        515                 520                 525             
Thr Val Gln Ala Arg Leu Leu Leu Ser Gly Ile Val Gln Gln Gln Asn
    530                 535                 540                 
Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln Arg Met Leu Gln Leu Thr
545                 550                 555                 560 
Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg
                565                 570                 575     
Tyr Leu Gly Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys
            580                 585                 590         
Leu Ile Cys Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys
        595                 600                 605             
Ser Leu Asp Arg Ile Trp Asn Asn Met Thr Trp Met Glu Trp Glu Arg
    610                 615                 620                 
Glu Ile Asp Asn Tyr Thr Ser Glu Ile Tyr Thr Leu Ile Glu Glu Ser
625                 630                 635                 640 
Gln Asn Gln Gln Glu Lys Asn Glu Gln Glu Leu Leu Glu Leu Asp Lys
                645                 650                 655     
Trp Ala Ser Leu Trp Asn Trp Phe Asp Ile Thr Lys Trp Leu Trp Tyr
            660                 665                 670         
Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu Val Gly Leu Arg Leu
        675                 680                 685             
Val Phe Thr Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr Ser
    690                 695                 700                 
Pro Leu Ser Phe Gln Thr Leu Leu Pro Ala Pro Arg Gly Pro Asp Arg
705                 710                 715                 720 
Pro Glu Gly Ile Glu Glu Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser
                725                 730                 735     
Gly Arg Leu Val Asn Gly Phe Leu Ala Leu Ile Trp Val Asp Leu Arg
            740                 745                 750         
Ser Leu Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Leu Leu Leu Thr
        755                 760                 765             
Val Thr Arg Ile Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Val Leu
    770                 775                 780                 
Lys Tyr Trp Trp Asn Leu Leu Gln Tyr Trp Ser Gln Glu Leu Lys Asn
785                 790                 795                 800 
Ser Ala Val Ser Leu Leu Asn Ala Thr Ala Ile Ala Val Ala Glu Gly
                805                 810                 815     
Thr Asp Arg Ile Ile Glu Ala Leu Gln Arg Thr Tyr Arg Ala Ile Leu
            820                 825                 830         
His Ile Pro Thr Arg Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu
        835                 840                 845         


<210> 6
<211> 2541
<212> DNA
<213> HIV-1

<400> 6
atgagagtga aggggatcag gaagagttat cagtacttgt ggaaaggggg caccttgctc 60
cttgggatat taatgatctg tagtgctgta gaaaagttgt gggtcacagt ctattatggg 120
gtacctgtgt ggaaagaagc aaccaccact ctattttgtg catcagatgc taaagcatat 180
gatacagagg tacataatgt ttgggccaca catgcctgtg tacccacaga ccccaaccca 240
caagaagtag tattggaaaa tgtaacagaa cattttaaca tgtggaaaaa taacatggta 300
gaacagatgc aggaggatat aatcagttta tgggatcaaa gcctaaagcc atgtgtaaaa 360
ttaaccccac tctgtgttac tttaaattgc aaggatgtga atgctactaa taccactaat 420
gatagcgagg gaacgatgga gagaggagaa ataaaaaact gctctttcaa tatcaccaca 480
agcataagag atgaggtgca gaaagaatat gctctttttt ataaacttga tgtagtacca 540
atagataata ataataccag ctataggttg ataagttgtg acacctcagt cattacacag 600
gcctgtccaa agatatcctt tgagccaatt cccatacatt attgtgcccc ggctggtttt 660
gcgattctaa agtgtaatga taagacgttc aatggaaaag gaccatgtaa aaatgtcagc 720
acagtacaat gtacacatgg aattaggcca gtagtatcaa ctcaactgct gctaaatggc 780
agtctagcag aagaagaggt agtaattaga tctgacaatt tcacgaacaa tgctaaaacc 840
ataatagtac agctgaaaga atctgtagaa attaattgta caagacccaa caacaataca 900
agaaaaagta tacatatagg accagggaga gcattttata ctacaggaga aataatagga 960
gatataagac aagcacattg taacattagt agagcaaaat ggaatgacac tttaaaacag 1020
atagttataa aattaagaga acaatttgag aataaaacaa tagtctttaa tcactcctca 1080
ggaggggacc cagaaattgt aatgcacagt tttaattgtg gaggagaatt tttctactgt 1140
aattcaacac aactgtttaa tagtacttgg aataataata ctgaagggtc aaataacact 1200
gaaggaaata ctatcacact cccatgcaga ataaaacaaa ttataaacat gtggcaggaa 1260
gtaggaaaag caatgtatgc ccctcccatc agaggacaaa ttagatgttc atcaaatatt 1320
acagggctgc tattaacaag agatggtggt attaatgaga atgggaccga gatcttcaga 1380
cctggaggag gagatatgag ggacaattgg agaagtgaat tatataaata taaagtagta 1440
aaaattgaac cattaggagt agcacccacc aaggcaaaga gaagagtggt gcaaagagaa 1500
aaaagagcag tgggaatagg agctgtgttc cttgggttct tgggagcagc aggaagcact 1560
atgggcgcag cgtcaatgac actgacggta caggccagac tattattgtc tggtatagtg 1620
caacagcaga acaatttgct gagggctatt gaggcgcaac agcgtatgtt gcaactcaca 1680
gtctggggca tcaagcagct ccaggcaaga gtcctggctg tggaaagata cctaggggat 1740
caacagctcc tggggatttg gggttgctct ggaaaactca tttgcaccac tgctgtgcct 1800
tggaatgcta gttggagtaa taaatctctg gataggattt ggaataacat gacctggatg 1860
gagtgggaaa gagaaattga caattacaca agcgaaatat acaccctaat tgaagaatcg 1920
cagaaccaac aagaaaagaa tgaacaagaa ttattggaat tagataaatg ggcaagtttg 1980
tggaattggt ttgacataac aaaatggctg tggtatataa aaatattcat aatgatagta 2040
ggaggcttag taggtttaag actagttttt actgtacttt ctatagtgaa tagagttagg 2100
cagggatact caccattatc gtttcagacc ctcctcccag ccccgagggg acccgacagg 2160
cccgaaggaa tcgaagaaga aggtggagag agagacagag acagatccgg acgattagtg 2220
aacggattct tagcacttat ctgggtcgac ctgcggagcc tgtgcctctt cagctaccac 2280
cgcttgagag acttactctt gactgtaacg aggattgtgg aacttctggg acgcaggggg 2340
tgggaagtcc tgaaatattg gtggaatctc ctacagtatt ggagtcagga actaaagaat 2400
agtgctgtta gcttgctcaa tgccacagcc atagcagtag ctgaggggac agataggatt 2460
atagaagcat tacaaagaac ttatagagct attctccaca tacctacaag aataagacag 2520
ggcttggaaa gggctttgct a                                           2541

<210> 7
<211> 855
<212> PRT
<213> HIV-1

<400> 7
Met Arg Val Lys Glu Lys Tyr Gln His Leu Trp Arg Trp Gly Trp Arg
 1               5                  10                  15      
Trp Gly Thr Met Leu Leu Gly Met Leu Met Ile Cys Ser Ala Thr Glu
            20                  25                  30          
Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala
        35                  40                  45              
Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu
    50                  55                  60                  
Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn
65                  70                  75                  80  
Pro Gln Glu Val Glu Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp
                85                  90                  95      
Lys Asn Asn Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp
            100                 105                 110         
Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr
        115                 120                 125             
Leu Asn Cys Thr Asp Leu Arg Asn Ala Thr Asn Gly Asn Asp Thr Asn
    130                 135                 140                 
Thr Thr Ser Ser Ser Arg Glu Met Met Gly Gly Gly Glu Met Lys Asn
145                 150                 155                 160 
Cys Ser Phe Lys Ile Thr Thr Asn Ile Arg Gly Lys Val Gln Lys Glu
                165                 170                 175     
Tyr Ala Leu Phe Tyr Glu Leu Asp Ile Val Pro Ile Asp Asn Asn Ser
            180                 185                 190         
Asn Asn Arg Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln
        195                 200                 205             
Ala Cys Pro Lys Ile Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala
    210                 215                 220                 
Pro Ala Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys Lys Phe Asn Gly
225                 230                 235                 240 
Lys Gly Pro Cys Ser Asn Val Ser Thr Val Gln Cys Thr His Gly Ile
                245                 250                 255     
Arg Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu
            260                 265                 270         
Glu Glu Val Val Ile Arg Ser Glu Asn Phe Ala Asp Asn Ala Lys Thr
        275                 280                 285             
Ile Ile Val Gln Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro
    290                 295                 300                 
Asn Asn Asn Thr Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Leu
305                 310                 315                 320 
Tyr Thr Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn
                325                 330                 335     
Leu Ser Arg Ala Lys Trp Asn Asp Thr Leu Asn Lys Ile Val Ile Lys
            340                 345                 350         
Leu Arg Glu Gln Phe Gly Asn Lys Thr Ile Val Phe Lys His Ser Ser
        355                 360                 365             
Gly Gly Asp Pro Glu Ile Val Thr His Ser Phe Asn Cys Gly Gly Glu
    370                 375                 380                 
Phe Phe Tyr Cys Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Asn Val
385                 390                 395                 400 
Thr Glu Glu Ser Asn Asn Thr Val Glu Asn Asn Thr Ile Thr Leu Pro
                405                 410                 415     
Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Lys Val Gly Arg Ala
            420                 425                 430         
Met Tyr Ala Pro Pro Ile Arg Gly Gln Ile Arg Cys Ser Ser Asn Ile
        435                 440                 445             
Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Pro Glu Asp Asn Lys Thr
    450                 455                 460                 
Glu Val Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser
465                 470                 475                 480 
Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala
                485                 490                 495     
Pro Thr Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val
            500                 505                 510         
Gly Ile Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr
        515                 520                 525             
Met Gly Ala Ala Ser Met Thr Leu Thr Val Gln Ala Arg Leu Leu Leu
    530                 535                 540                 
Ser Gly Ile Val Gln Gln Gln Asn Asn Leu Leu Arg Ala Ile Glu Ala
545                 550                 555                 560 
Gln Gln His Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln
                565                 570                 575     
Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Arg Asp Gln Gln Leu Leu
            580                 585                 590         
Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala Val Pro
        595                 600                 605             
Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu Asn Lys Ile Trp Asp Asn
    610                 615                 620                 
Met Thr Trp Met Glu Trp Asp Arg Glu Ile Asn Asn Tyr Thr Ser Ile
625                 630                 635                 640 
Ile Tyr Ser Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu
                645                 650                 655     
Gln Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe
            660                 665                 670         
Glu Ile Thr Glu Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile Ile
        675                 680                 685             
Gly Gly Leu Ile Gly Leu Arg Ile Val Phe Ser Val Leu Ser Ile Met
    690                 695                 700                 
Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr His Leu
705                 710                 715                 720 
Pro Ala Ser Arg Gly Pro Asp Arg Pro Gly Gly Ile Glu Glu Glu Gly
                725                 730                 735     
Gly Glu Arg Asp Arg Asp Arg Ser Gly Arg Leu Val Asn Gly Ser Leu
            740                 745                 750         
Ala Leu Ile Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe Ser Tyr His
        755                 760                 765             
Arg Leu Arg Asp Leu Leu Leu Ile Val Thr Arg Ile Val Glu Leu Leu
    770                 775                 780                 
Gly Arg Arg Gly Trp Glu Ala Leu Lys Tyr Trp Trp Asn Leu Leu Gln
785                 790                 795                 800 
Tyr Trp Ser Gln Glu Leu Lys Asn Ser Ala Val Ser Leu Leu Asn Ala
                805                 810                 815     
Thr Ala Ile Ala Val Ala Glu Gly Thr Asp Arg Val Ile Glu Val Val
            820                 825                 830         
Gln Gly Ala Cys Arg Ala Ile Arg His Ile Pro Arg Arg Ile Arg Gln
        835                 840                 845             
Gly Leu Glu Arg Ile Leu Leu
    850                 855 


<210> 8
<211> 2568
<212> DNA
<213> HIV-1

<400> 8
atgagagtga aggagaaata tcagcacttg tggagatggg ggtggagatg gggcaccatg 60
ctccttggga tgttgatgat ctgtagtgct acagaaaaat tgtgggtcac agtctattat 120
ggggtacctg tgtggaaaga agcaaccacc actctatttt gtgcatcaga tgctaaagca 180
tatgatacag aggtacataa tgtttgggcc acacatgcct gtgtacccac agaccccaac 240
ccacaagaag tagaattgga aaatgtgaca gaaaatttta acatgtggaa aaataacatg 300
gtagaacaga tgcatgagga tataatcagt ttatgggatc aaagcctaaa gccatgtgta 360
aaattaactc cactctgtgt tactttaaat tgcactgatt tgaggaatgc tactaatggg 420
aatgacacta ataccactag tagtagcagg gaaatgatgg ggggaggaga aatgaaaaat 480
tgctctttca aaatcaccac aaacataaga ggtaaggtgc agaaagaata tgcacttttt 540
tatgaacttg atatagtacc aatagataat aatagtaata atagatatag gttgataagt 600
tgtaacacct cagtcattac acaggcctgt ccaaagatat cctttgagcc aattcccata 660
cattattgtg ccccggctgg ttttgcgatt ctaaagtgta aagataagaa gttcaatgga 720
aaaggaccat gttcaaatgt cagcacagta caatgtacac atgggattag gccagtagta 780
tcaactcaac tgctgttaaa tggcagtcta gcagaagaag aggtagtaat tagatccgaa 840
aatttcgcgg acaatgctaa aaccataata gtacagctga atgaatctgt agaaattaat 900
tgtacaagac ccaacaacaa tacaagaaaa agtatacata taggaccagg cagagcatta 960
tatacaacag gagaaataat aggagatata agacaagcac attgtaacct tagtagagca 1020
aaatggaatg acactttaaa taagatagtt ataaaattaa gagaacaatt tgggaataaa 1080
acaatagtct ttaagcattc ctcaggaggg gacccagaaa ttgtgacgca cagttttaat 1140
tgtggagggg aatttttcta ctgtaattca acacaactgt ttaatagtac ttggaatgtt 1200
actgaagagt caaataacac tgtagaaaat aacacaatca cactcccatg cagaataaaa 1260
caaattataa acatgtggca gaaagtagga agagcaatgt atgcccctcc catcagagga 1320
caaattagat gttcatcaaa tattacaggg ctgctattaa caagagatgg tggtccagag 1380
gacaacaaga ccgaggtctt cagacctgga ggaggagata tgagggacaa ttggagaagt 1440
gaattatata aatataaagt agtaaaaatt gaaccattag gagtagcacc caccaaggca 1500
aagagaagag tggtgcagag agaaaaaaga gcagtgggaa taggagctgt gttccttggg 1560
ttcttgggag cagcaggaag cactatgggc gcagcgtcaa tgacgctgac ggtacaggcc 1620
agactattat tgtctggtat agtgcaacag cagaacaatc tgctgagggc tattgaggcg 1680
caacagcatc tgttgcaact cacagtctgg ggcatcaagc agctccaggc aagagtcctg 1740
gctgtggaaa gatacctaag ggatcaacag ctcctgggaa tttggggttg ctctggaaaa 1800
ctcatttgca ccactgctgt gccttggaat gctagttgga gtaataaatc tctgaataag 1860
atttgggata acatgacctg gatggagtgg gacagagaaa ttaacaatta cacaagcata 1920
atatatagct taattgaaga atcgcagaac caacaagaaa agaatgaaca agaattatta 1980
gaattagaca aatgggcaag tttgtggaat tggtttgaaa taacagaatg gctgtggtat 2040
ataaaaatat tcataatgat aataggaggc ttgataggtt taagaatagt tttttctgta 2100
ctttctataa tgaatagagt taggcaggga tactcaccat tatcgtttca gacccacctc 2160
ccagcctcga ggggacccga caggcccgga ggaatcgaag aagaaggtgg agagagagac 2220
agagacagat ccggtcgatt agtgaacgga tccttagcac ttatctggga cgatctgcgg 2280
agcctgtgcc tcttcagcta ccaccgcttg agagacttac tcttgattgt aacgaggatt 2340
gtggaacttc tgggacgcag ggggtgggaa gccctcaaat attggtggaa tctcctacaa 2400
tattggagtc aggagctaaa gaatagtgct gttagcttgc tcaatgccac agccatagca 2460
gtagctgagg ggacagatag ggttatagaa gtagtacaag gagcttgtag agctattcgc 2520
cacataccta gaagaataag acagggcttg gaaaggattt tgctataa              2568

<210> 9
<211> 844
<212> PRT
<213> HIV-1

<400> 9
Met Arg Val Arg Gly Ile Pro Arg Asn Trp Pro Gln Trp Trp Ile Trp
 1               5                  10                  15      
Gly Ile Leu Gly Phe Trp Met Ile Ile Ile Cys Arg Val Val Gly Asn
            20                  25                  30          
Leu Asn Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu
        35                  40                  45              
Ala Lys Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Lys
    50                  55                  60                  
Glu Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro
65                  70                  75                  80  
Asn Pro Gln Glu Ile Val Leu Glu Asn Val Thr Glu Asn Phe Asn Met
                85                  90                  95      
Trp Lys Asn Asp Met Val Asp Gln Met His Glu Asp Ile Ile Ser Leu
            100                 105                 110         
Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val
        115                 120                 125             
Thr Leu Asn Cys Thr Asn Ala Thr Ala Tyr Asn Asn Ser Met His Gly
    130                 135                 140                 
Glu Met Lys Asn Cys Ser Phe Asn Thr Thr Thr Glu Ile Arg Asp Arg
145                 150                 155                 160 
Lys Gln Lys Ala Tyr Ala Leu Phe Tyr Lys Pro Asp Val Val Pro Leu
                165                 170                 175     
Asn Arg Arg Glu Glu Asn Asn Gly Thr Gly Glu Tyr Ile Leu Ile Asn
            180                 185                 190         
Cys Asn Ser Ser Thr Ile Thr Gln Ala Cys Pro Lys Val Thr Phe Asp
        195                 200                 205             
Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Tyr Ala Ile Leu Lys
    210                 215                 220                 
Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Asn Asn Val Ser
225                 230                 235                 240 
Thr Val Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu
                245                 250                 255     
Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Ile Ile Ile Arg Ser Glu
            260                 265                 270         
Asn Leu Thr Asn Asn Ile Lys Thr Ile Ile Val His Leu Asn Lys Ser
        275                 280                 285             
Val Glu Ile Val Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile
    290                 295                 300                 
Arg Ile Gly Pro Gly Gln Thr Phe Tyr Ala Thr Gly Glu Ile Ile Gly
305                 310                 315                 320 
Asn Ile Arg Glu Ala His Cys Asn Ile Ser Lys Ser Ser Trp Thr Ser
                325                 330                 335     
Thr Leu Glu Gln Val Lys Lys Lys Leu Lys Glu His Tyr Asn Lys Thr
            340                 345                 350         
Ile Glu Phe Lys Pro Pro Ser Gly Gly Asp Leu Glu Val Thr Thr His
        355                 360                 365             
Ser Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys Asn Thr Thr Lys Leu
    370                 375                 380                 
Phe Ser Asn Asn Ser Asp Ser Asn Asn Glu Thr Ile Thr Leu Pro Cys
385                 390                 395                 400 
Lys Ile Lys Gln Ile Ile Asn Met Trp Gln Lys Val Gly Arg Ala Met
                405                 410                 415     
Tyr Ala Pro Pro Ile Glu Gly Asn Ile Thr Cys Lys Ser Asn Ile Thr
            420                 425                 430         
Gly Leu Leu Leu Thr Arg Asp Gly Gly Lys Asn Thr Thr Asn Glu Ile
        435                 440                 445             
Phe Arg Pro Gly Gly Gly Asn Met Lys Asp Asn Trp Arg Ser Glu Leu
    450                 455                 460                 
Tyr Lys Tyr Lys Val Val Glu Ile Glu Pro Leu Gly Val Ala Pro Thr
465                 470                 475                 480 
Lys Ser Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly Leu
                485                 490                 495     
Gly Ala Val Leu Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly
            500                 505                 510         
Ala Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser Gly
        515                 520                 525             
Ile Val Gln Gln Gln Ser Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln
    530                 535                 540                 
His Met Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Thr Arg
545                 550                 555                 560 
Val Leu Ala Ile Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly Leu
                565                 570                 575     
Trp Gly Cys Ser Gly Lys Ile Ile Cys Thr Thr Ala Val Pro Trp Asn
            580                 585                 590         
Ser Ser Trp Ser Asn Lys Ser Gln Glu Asp Ile Trp Asp Asn Met Thr
        595                 600                 605             
Trp Met Gln Trp Asp Arg Glu Ile Ser Asn Tyr Thr Gly Thr Ile Tyr
    610                 615                 620                 
Arg Leu Leu Glu Asp Ser Gln Asn Gln Gln Glu Lys Asn Glu Lys Asp
625                 630                 635                 640 
Leu Leu Ala Leu Asp Ser Trp Lys Asn Leu Trp Asn Trp Phe Asp Ile
                645                 650                 655     
Thr Asn Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile Val Gly Gly
            660                 665                 670         
Leu Ile Gly Leu Arg Ile Ile Phe Gly Val Leu Ala Ile Val Lys Arg
        675                 680                 685             
Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr Leu Thr Pro Ser
    690                 695                 700                 
Pro Arg Gly Pro Asp Arg Leu Gly Arg Ile Glu Glu Glu Gly Gly Glu
705                 710                 715                 720 
Gln Asp Lys Asn Arg Ser Ile Arg Leu Val Ser Gly Phe Leu Ala Leu
                725                 730                 735     
Ala Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe Gly Tyr His Gln Leu
            740                 745                 750         
Arg Asp Phe Ile Leu Ile Ala Ala Arg Ala Ala Glu Leu Leu Gly Arg
        755                 760                 765             
Ser Ser Leu Arg Gly Leu Gln Arg Gly Trp Glu Ala Leu Lys Tyr Leu
    770                 775                 780                 
Gly Asn Leu Val Gln Tyr Gly Gly Leu Glu Leu Lys Arg Ser Ala Ile
785                 790                 795                 800 
Lys Leu Phe Asp Thr Ile Ala Ile Ala Val Ala Glu Gly Thr Asp Arg
                805                 810                 815     
Ile Leu Glu Val Ile Arg Arg Ile Cys Arg Ala Ile Arg His Ile Pro
            820                 825                 830         
Ile Arg Ile Arg Gln Gly Phe Glu Ala Ala Leu Leu
        835                 840                 


<210> 10
<211> 2535
<212> DNA
<213> HIV-1

<400> 10
atgagagtga gggggatacc gaggaattgg ccacaatggt ggatatgggg catcttaggc 60
ttttggatga taataatttg tagggtggtg gggaacttga acttgtgggt cacagtctat 120
tatggggtac ctgtgtggaa agaagcaaaa actactctat tctgtgcatc agatgctaaa 180
gcatatgata aagaagtaca taatgtctgg gctacacatg cctgtgtacc cacagacccc 240
aacccacaag aaatagtttt ggaaaatgta acagaaaatt ttaacatgtg gaaaaatgac 300
atggtggatc agatgcatga ggatataatc agtttatggg atcaaagcct aaaaccatgt 360
gtaaagttga ccccactctg tgtcacttta aattgtacaa atgcaactgc ctacaataat 420
agcatgcatg gagaaatgaa aaattgctct ttcaatacaa ccacagaaat aagagatagg 480
aaacagaaag cgtatgcact tttttataaa cctgatgtag tgccacttaa taggagagaa 540
gagaataatg ggacaggaga gtatatatta ataaattgca attcctcaac cataacacaa 600
gcctgtccaa aggtcacttt tgacccaatt cctatacatt attgtgctcc agctggttat 660
gcgattctaa agtgtaataa taagacattc aatgggacag gaccatgcaa taatgtcagc 720
acagtacaat gtacacatgg aattaagcca gtggtatcaa ctcaattact gttaaatggt 780
agcctagcag aagaagagat aataattaga tctgaaaatc tgacaaacaa tatcaaaaca 840
ataatagtcc accttaataa atctgtagaa attgtgtgta caagacccaa caataataca 900
agaaaaagta taaggatagg accaggacaa acattctatg caacaggtga aataatagga 960
aacataagag aagcacattg taacattagt aaaagtagct ggaccagtac tttagaacag 1020
gtaaagaaaa aattaaaaga acactacaat aagacaatag aatttaaacc accctcagga 1080
ggggatctag aagttacaac acatagcttt aattgtagag gagaattttt ctattgcaat 1140
acaacaaaac tgttttcaaa caacagtgat tcaaacaacg aaaccatcac actcccatgc 1200
aagataaaac aaattataaa catgtggcag aaggtaggac gagcaatgta tgcccctccc 1260
attgaaggaa acataacatg taaatcaaat atcacaggac tactattgac acgtgatgga 1320
ggaaagaata caacaaatga gatattcaga ccgggaggag gaaatatgaa ggacaattgg 1380
agaagtgaat tatataaata taaagtggta gaaattgagc cattgggagt agcacccact 1440
aaatcaaaaa ggagagtggt ggagagagaa aaaagagcag tgggactagg agctgtactc 1500
cttgggttct tgggagcagc aggaagcact atgggcgcgg cgtcaataac gctgacggta 1560
caggccagac aactgttgtc tggtatagtg caacagcaaa gcaatttgct gagagctata 1620
gaggcgcaac agcatatgtt gcaactcacg gtctggggca ttaagcagct ccagacaaga 1680
gtcttggcta tagagagata cctaaaggat caacagctcc tagggctttg gggctgctct 1740
ggaaaaatca tctgcaccac tgctgtgcct tggaactcca gttggagtaa taaatctcaa 1800
gaagatattt gggataacat gacctggatg cagtgggata gagaaattag taattacaca 1860
ggcacaatat ataggttact tgaagactcg caaaaccagc aggagaaaaa tgaaaaagat 1920
ttattagcat tggacagttg gaaaaacttg tggaattggt ttgacataac aaattggctg 1980
tggtatataa aaatattcat catgatagta ggaggcttga taggtttgag aataattttt 2040
ggtgtactcg ctatagtgaa aagagttagg cagggatact cacctttgtc gtttcagacc 2100
cttaccccaa gcccgagggg tcccgacagg ctcggaagaa tcgaagaaga aggtggagag 2160
caagacaaaa acagatccat tcgattagtg agcggattct tagcacttgc ctgggacgat 2220
ctgcggagcc tgtgcctctt cggttaccac caattgagag acttcatatt gattgcagcg 2280
agagcagcgg aacttctggg acgcagcagt ctcaggggac tgcagagagg gtgggaagcc 2340
cttaagtatc tgggaaatct tgtgcagtat gggggtctgg agctaaaaag aagtgctatt 2400
aaactgtttg ataccatagc aatagcagta gctgaaggaa cagataggat tcttgaagta 2460
atccgaagaa tttgtagagc tatccgccac atacctataa gaataagaca gggctttgaa 2520
gcagctttgc tataa                                                  2535

<210> 11
<211> 857
<212> PRT
<213> HIV-1

<400> 11
Met Arg Val Arg Gly Thr Leu Arg Asn Tyr Gln Gln Trp Trp Ile Trp
 1               5                  10                  15      
Gly Val Leu Gly Phe Trp Met Leu Met Ile Cys Asn Gly Gly Gly Asn
            20                  25                  30          
Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Lys
        35                  40                  45              
Thr Thr Leu Leu Cys Ala Ser Asp Ala Lys Ala Tyr Glu Arg Glu Val
    50                  55                  60                  
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro
65                  70                  75                  80  
Gln Glu Ile Val Leu Gly Asn Val Thr Glu Asn Phe Asn Met Trp Lys
                85                  90                  95      
Asn Asp Met Val Asp Gln Met His Glu Asp Val Ile Ser Leu Trp Asp
            100                 105                 110         
Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
        115                 120                 125             
Glu Cys Arg Asn Val Ser Arg Asn Val Ser Ser Tyr Asn Thr Tyr Asn
    130                 135                 140                 
Gly Ser Val Glu Glu Ile Lys Asn Cys Ser Phe Asn Ala Thr Pro Glu
145                 150                 155                 160 
Val Arg Asp Arg Lys Gln Arg Met Tyr Ala Leu Phe Tyr Gly Leu Asp
                165                 170                 175     
Ile Val Pro Leu Asn Lys Lys Asn Ser Ser Glu Asn Ser Ser Glu Tyr
            180                 185                 190         
Arg Leu Ile Asn Cys Asn Thr Ser Ala Ile Thr Gln Ala Cys Pro Lys
        195                 200                 205             
Val Thr Phe Asp Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Tyr
    210                 215                 220                 
Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys
225                 230                 235                 240 
Asn Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro Val Val
                245                 250                 255     
Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Gly Glu Ile Ile
            260                 265                 270         
Ile Arg Ser Glu Asn Leu Thr Asn Asn Val Lys Thr Ile Ile Val His
        275                 280                 285             
Leu Asn Gln Ser Val Glu Ile Val Cys Thr Arg Pro Asn Asn Asn Thr
    290                 295                 300                 
Arg Lys Ser Ile Arg Ile Gly Pro Gly Gln Thr Phe Tyr Ala Thr Gly
305                 310                 315                 320 
Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Asp
                325                 330                 335     
Lys Trp Asn Glu Thr Leu Gln Arg Val Gly Lys Lys Leu Ala Glu His
            340                 345                 350         
Phe His Asn Lys Thr Ile Lys Phe Ala Ser Ser Ser Gly Gly Asp Leu
        355                 360                 365             
Glu Ile Thr Thr His Ser Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys
    370                 375                 380                 
Asn Thr Ser Gly Leu Phe Asn Gly Thr Tyr Met Pro Thr Tyr Met Pro
385                 390                 395                 400 
Asn Gly Thr Glu Ser Asn Ser Asn Ser Thr Ile Thr Ile Pro Cys Arg
                405                 410                 415     
Ile Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Arg Ala Met Tyr
            420                 425                 430         
Ala Pro Pro Ile Ala Gly Asn Ile Thr Cys Thr Ser Asn Ile Thr Gly
        435                 440                 445             
Leu Leu Leu Val His Asp Gly Gly Ile Lys Glu Asn Asp Thr Glu Asn
    450                 455                 460                 
Lys Thr Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp
465                 470                 475                 480 
Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Glu Ile Lys Pro Leu Gly
                485                 490                 495     
Val Ala Pro Thr Ala Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg
            500                 505                 510         
Ala Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly
        515                 520                 525             
Ser Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Ala Gln Ala Arg Gln
    530                 535                 540                 
Leu Leu Ser Gly Ile Val Gln Gln Gln Ser Asn Leu Leu Arg Ala Ile
545                 550                 555                 560 
Glu Ala Gln Gln His Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln
                565                 570                 575     
Leu Gln Thr Arg Val Leu Ala Ile Glu Arg Tyr Leu Lys Asp Gln Gln
            580                 585                 590         
Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala
        595                 600                 605             
Val Pro Trp Asn Ser Ser Trp Ser Asn Lys Thr Gln Ser Glu Ile Trp
    610                 615                 620                 
Asn Asn Met Thr Trp Met Gln Trp Asp Arg Glu Val Ser Asn Tyr Thr
625                 630                 635                 640 
Asn Ile Ile Tyr Ser Leu Leu Glu Glu Ser Gln Asn Gln Gln Glu Lys
                645                 650                 655     
Asn Glu Lys Asp Leu Leu Ala Leu Asp Ser Trp Lys Asn Leu Trp Ser
            660                 665                 670         
Trp Phe Asp Ile Thr Asn Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met
        675                 680                 685             
Ile Val Gly Gly Leu Ile Gly Leu Arg Ile Ile Phe Ala Val Leu Ser
    690                 695                 700                 
Ile Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr
705                 710                 715                 720 
Leu Thr Pro Asn Pro Arg Gly Pro Asp Arg Leu Gly Arg Ile Glu Glu
                725                 730                 735     
Glu Gly Gly Glu Gln Asp Lys Asp Arg Ser Ile Arg Leu Val Asn Gly
            740                 745                 750         
Phe Leu Ala Leu Ala Trp Asp Asp Leu Arg Asn Leu Cys Leu Phe Ser
        755                 760                 765             
Tyr His Arg Leu Arg Asp Phe Ile Ser Val Ala Ala Arg Val Val Glu
    770                 775                 780                 
Leu Leu Gly Arg Ser Ser Trp Glu Ala Leu Lys Tyr Leu Gly Ser Leu
785                 790                 795                 800 
Val Gln Tyr Trp Gly Leu Glu Leu Lys Lys Ser Ala Ile Ser Leu Phe
                805                 810                 815     
Asp Ser Ile Ala Ile Val Val Ala Glu Gly Thr Asp Arg Ile Ile Glu
            820                 825                 830         
Leu Val Gln Gly Phe Cys Arg Ala Ile Arg Asn Ile Pro Thr Arg Ile
        835                 840                 845             
Arg Gln Gly Phe Glu Ala Ala Leu Gln
    850                 855         


<210> 12
<211> 2574
<212> DNA
<213> HIV-1

<400> 12
atgagagtga gggggacact gaggaattat caacaatggt ggatatgggg cgtcttaggc 60
ttttggatgt taatgatttg taatggggga ggaaacttgt gggtcacagt ctattatggg 120
gtacctgtgt ggaaagaagc aaaaaccact ctactctgtg catcagatgc caaagcatat 180
gagagggaag tgcataatgt ctgggctaca catgcctgtg tacccacaga ccccaaccca 240
caagaaatag ttttgggaaa tgtaacagaa aattttaaca tgtggaaaaa tgacatggtg 300
gatcagatgc atgaggatgt aatcagttta tgggatcaaa gcctaaagcc atgtgtaaaa 360
ttgaccccac tctgtgtcac tttagaatgt agaaatgtta gcagaaatgt tagcagttat 420
aatacctaca atgggagcgt ggaggaaata aaaaattgct ctttcaatgc aaccccagaa 480
gtaagagata ggaagcagag aatgtatgct ctcttttatg gacttgatat agtaccactt 540
aataagaaga actctagtga gaactccagt gagtatagat taataaattg taatacctca 600
gccataacac aagcctgtcc aaaggtcact tttgatccaa ttcctataca ctattgtgct 660
ccggctggtt atgcgattct aaagtgtaat aataagacat tcaatgggac aggaccatgc 720
aataatgtta gtacagtaca atgtacacat ggaattaagc cagtagtatc aactcaacta 780
ctgttaaatg gtagcctagc agaaggagag ataataatta gatctgaaaa tctgacaaac 840
aatgtcaaaa caataatagt acatcttaat caatctgtag aaattgtgtg tacaagaccc 900
aataataata caagaaaaag tataaggata ggaccaggac aaacattcta tgcaacagga 960
gacataatag gagacataag acaagcacat tgtaacatta gtagagataa atggaatgaa 1020
actttacaaa gggtaggtaa aaaattagca gaacacttcc ataataagac aataaaattt 1080
gcatcatcct caggagggga cctagaaatt acaacacata gctttaattg tagaggagaa 1140
tttttctatt gtaatacatc aggcctgttt aatggtacat acatgcctac atacatgcct 1200
aatggtacag aaagtaattc aaactcaact atcacaatcc catgcagaat aaagcaaatt 1260
ataaacatgt ggcaggaggt aggacgagca atgtatgccc ctcccattgc aggaaacata 1320
acatgtacat caaatatcac aggactacta ttggtacatg atggaggaat aaaggaaaat 1380
gatacagaga ataagacaga gatatttaga cctggaggag gagatatgag ggacaattgg 1440
agaagtgaat tatataaata taaagtggta gaaattaagc cattgggagt agcacccact 1500
gcagcaaaaa ggagagtggt ggagagagaa aaaagagcag tgggaatagg agctgtgttc 1560
cttgggttct tgggagcagc aggaagcact atgggcgcgg cgtcaataac gctgacggca 1620
caggccagac aattgttgtc tggtatagtg caacagcaaa gcaatttgct gagggctata 1680
gaggcgcaac agcatctgtt gcaactcaca gtctggggca ttaagcagct ccagacaaga 1740
gtcctggcta tagagagata cctaaaggat caacagctcc tagggatttg gggctgctct 1800
ggaaaactca tctgcactac tgctgtacct tggaactcca gttggagtaa caaaactcaa 1860
agtgagattt ggaataacat gacctggatg cagtgggata gagaagttag taattacaca 1920
aacataatat acagcttgct tgaagaatcg caaaaccagc aggaaaaaaa tgaaaaagat 1980
ttattagcat tggacagttg gaaaaatcta tggagttggt ttgacataac aaattggctg 2040
tggtatataa aaatattcat aatgatagta ggaggcttga taggtttaag aataattttt 2100
gctgtgctct ctatagtgaa tagagttagg cagggatact cacctttgtc gtttcagacc 2160
cttaccccga acccaagggg acccgacagg ctcggaagaa tcgaagaaga aggtggagag 2220
caagacaaag acagatccat tcgattagtg aacggattct tagcacttgc ctgggacgat 2280
ctacggaacc tgtgcctctt cagctaccac cgattgagag acttcatatc ggtggcagcg 2340
agagtggtgg aacttctggg acgcagcagt tgggaagccc ttaaatatct gggaagtctt 2400
gtgcagtatt ggggtctgga gctaaaaaag agtgctatta gtctgtttga tagcatagca 2460
atagtagtag ctgaaggaac agataggatt atagaattag tacaaggatt ttgtagagct 2520
atccgcaaca tacctacaag aataagacag ggctttgaag cagctttgca ataa       2574

<210> 13
<211> 859
<212> PRT
<213> HIV-1

<400> 13
Met Arg Val Arg Glu Ile Glu Arg Asn Tyr Leu Cys Leu Trp Arg Trp
 1               5                  10                  15      
Gly Ile Met Leu Leu Gly Met Leu Met Thr Tyr Ser Val Ala Glu Lys
            20                  25                  30          
Lys Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr
        35                  40                  45              
Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ser Tyr Lys Thr Glu Val
    50                  55                  60                  
His Asn Ile Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro
65                  70                  75                  80  
Arg Glu Ile Glu Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys
                85                  90                  95      
Asn Asn Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp
            100                 105                 110         
Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
        115                 120                 125             
Asn Cys Thr Asp Ala Arg Arg Asn Glu Thr Arg Asn Asn Ile Thr Gly
    130                 135                 140                 
Met Glu Asn Asn Asp Gln Ile Glu Met Lys Asn Cys Ser Phe Asn Ile
145                 150                 155                 160 
Thr Thr Lys Leu Ile Asp Lys Lys Lys Gln Val His Ala Leu Phe Tyr
                165                 170                 175     
Arg Leu Asp Val Val Gln Ile Asp Asn Asp Thr Ser Asn Ser Asn Tyr
            180                 185                 190         
Ser Asn Tyr Arg Leu Ile Asn Cys Asn Thr Ser Ala Ile Thr Gln Ala
        195                 200                 205             
Cys Pro Lys Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro
    210                 215                 220                 
Ala Gly Phe Ala Ile Leu Lys Cys Arg Asp Lys Lys Phe Asn Gly Thr
225                 230                 235                 240 
Gly Pro Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg
                245                 250                 255     
Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu
            260                 265                 270         
Glu Ile Ile Ile Arg Ser Glu Asn Leu Thr Asn Asn Ala Lys Thr Leu
        275                 280                 285             
Ile Val Gln Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Tyr
    290                 295                 300                 
Tyr Asn Gln Ile Arg Gln Arg Thr Ser Ile Gly Gln Gly Gln Ala Leu
305                 310                 315                 320 
Tyr Thr Thr Arg Val Thr Gly Asp Ile Arg Lys Ala Tyr Cys Asn Ile
                325                 330                 335     
Ser Lys Ala Gly Trp Asn Lys Thr Leu Gln Gln Val Ala Lys Lys Leu
            340                 345                 350         
Gly Asp Leu Phe Asn Gln Thr Thr Ile Ile Phe Lys Pro Ser Ser Gly
        355                 360                 365             
Gly Asp Pro Glu Ile Thr Thr His Ser Phe Asn Cys Gly Gly Glu Phe
    370                 375                 380                 
Phe Tyr Cys Asn Thr Ser Lys Leu Phe Asn Ser Ala Trp Asn Asp Ser
385                 390                 395                 400 
Thr Trp Asn Ile Gly Asn Asn Asn Thr Gly Ser Asp Asn Glu Thr Ile
                405                 410                 415     
Ile Ile Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Gly Val
            420                 425                 430         
Gly Lys Ala Met Tyr Ala Pro Pro Ile Glu Gly Trp Ile Asn Cys Ala
        435                 440                 445             
Ser Asn Ile Thr Gly Leu Leu Leu Val Arg Asp Gly Gly Gly Ala Asn
    450                 455                 460                 
Asp Ser Gln Asn Glu Thr Phe Arg Pro Gln Gly Gly Asp Met Arg Asp
465                 470                 475                 480 
Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro
                485                 490                 495     
Leu Gly Ile Ala Pro Thr Lys Ala Lys Arg Arg Val Val Glu Arg Glu
            500                 505                 510         
Lys Arg Ala Ile Gly Leu Gly Ala Met Phe Leu Gly Phe Leu Gly Ala
        515                 520                 525             
Ala Gly Ser Thr Met Gly Ala Ala Ser Leu Thr Leu Thr Val Gln Ala
    530                 535                 540                 
Arg Gln Leu Leu Ser Gly Ile Val Gln His Gln Asn Asn Leu Leu Met
545                 550                 555                 560 
Ala Ile Glu Ala Gln Gln His Leu Leu Gln Leu Thr Val Trp Gly Ile
                565                 570                 575     
Lys Gln Leu Gln Ala Arg Ile Leu Ala Val Glu Arg Tyr Leu Gln Asp
            580                 585                 590         
Gln Gln Leu Leu Gly Ser Trp Gly Cys Ser Gly Arg His Ile Cys Thr
        595                 600                 605             
Thr Thr Val Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser Ile Asp Asp
    610                 615                 620                 
Ile Trp Asn Asn Met Thr Trp Met Glu Trp Glu Lys Glu Ile Asp Asn
625                 630                 635                 640 
Tyr Thr Gly Val Ile Tyr Arg Leu Ile Glu Glu Ser Gln Thr Gln Gln
                645                 650                 655     
Glu Lys Asn Glu Gln Glu Leu Leu Gln Leu Asp Lys Trp Ala Ser Leu
            660                 665                 670         
Trp Asn Trp Phe Ser Ile Thr Lys Trp Leu Trp Tyr Ile Lys Ile Phe
        675                 680                 685             
Ile Met Ile Val Gly Gly Leu Ile Gly Leu Arg Ile Val Phe Thr Val
    690                 695                 700                 
Leu Ser Leu Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe
705                 710                 715                 720 
Gln Thr Leu Phe Pro Ala Pro Arg Gly Pro Asp Arg Pro Glu Glu Ile
                725                 730                 735     
Glu Glu Gly Gly Gly Glu Gln Gly Arg Gly Arg Ser Thr Arg Leu Val
            740                 745                 750         
Asn Gly Phe Ser Thr Leu Ile Trp Asp Asp Leu Arg Asn Leu Cys Leu
        755                 760                 765             
Phe Ser Tyr His Arg Leu Arg Asp Leu Ile Leu Ile Ala Thr Arg Ile
    770                 775                 780                 
Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Ala Ile Lys Tyr Leu Trp
785                 790                 795                 800 
Asn Leu Leu Gln Tyr Trp Ser Gln Glu Leu Lys Thr Ser Ala Ile Ser
                805                 810                 815     
Leu Phe Asn Ala Thr Ala Val Ala Val Ala Glu Gly Thr Asp Arg Val
            820                 825                 830         
Ile Glu Val Val Gln Arg Phe Phe Arg Gly Ile Leu Asn Val Pro Thr
        835                 840                 845             
Arg Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu
    850                 855                 


<210> 14
<211> 2580
<212> DNA
<213> HIV-1

<400> 14
atgagagtga gggagataga gaggaattat ctatgcttgt ggagatgggg catcatgctc 60
cttgggatgt tgatgacata tagtgttgca gagaagaagt gggtcacagt gtattatggg 120
gtacctgtgt ggaaagaagc aacaaccact ctattttgtg catcagatgc taaatcatat 180
aaaacagagg tacataatat ctgggctaca catgcctgtg taccaacaga ccccaaccca 240
cgagaaatag aactggaaaa tgtcacagaa aactttaaca tgtggaaaaa taacatggtg 300
gagcagatgc atgaggatat catcagttta tgggatcaaa gcctaaaacc atgtgtaaaa 360
ttaaccccac tctgtgtcac tttaaactgc actgatgcaa ggaggaatga gactaggaat 420
aatattacag gaatggaaaa caatgatcaa atagaaatga aaaactgctc tttcaatata 480
accacaaaat taatagataa gaagaagcaa gtacatgcac ttttttatag acttgatgtg 540
gtacaaatag ataatgatac tagtaatagc aactatagca actatagatt aataaattgc 600
aatacctcag ccattacaca ggcttgtcca aaggtaactt ttgagccaat tcccatacat 660
tattgtgccc cagctggttt tgcaattcta aagtgtagag ataagaagtt caatggaaca 720
ggaccatgca aaaatgtcag cacagtacaa tgcacacatg gaattaggcc agtagtgtca 780
acccaactgc tgttgaatgg cagtctagca gaagaagaga taataattag atctgaaaat 840
ctcacaaaca atgctaaaac cctaatagta cagcttaatg agtctgtaga aatcaattgt 900
acaaggccct actacaacca gataagacaa agaacatcta taggacaagg gcaagcactc 960
tatacaacaa gagtaacggg agatataaga aaagcatatt gcaatattag taaagcagga 1020
tggaataaaa ctttacagca ggtagcaaaa aaattaggag acctctttaa ccagacaaca 1080
ataattttta aaccatcctc gggaggagac ccagaaatta caacacacag ctttaattgt 1140
ggaggggaat ttttctactg caatacatca aaactgttta acagtgcatg gaatgacagt 1200
acatggaata tagggaataa taatacaggg tcagataatg agacaatcat tatcccatgc 1260
agaataaaac aaattataaa catgtggcag ggagtaggaa aagcaatgta tgcccctccc 1320
atcgaaggat ggatcaattg tgcatcaaat attacagggc tcttactggt aagggatggt 1380
ggtggtgcaa atgatagtca gaacgagacc ttcagacctc aaggaggaga tatgagagac 1440
aattggagaa gtgaattata caagtataaa gtagtaaaaa ttgaaccact aggaatagca 1500
cccaccaagg caaagagaag agtggtggaa agagaaaaaa gagcaatagg actaggagct 1560
atgttccttg ggttcttggg agcagcagga agcacgatgg gcgcagcgtc attgacgctg 1620
acggtacagg ccagacaatt attgtctggt atagtgcaac atcaaaacaa tttgctgatg 1680
gctatagagg cgcaacagca tctgttgcaa ctcacagtct ggggcattaa acagctccag 1740
gcaagaatcc tggctgtgga aagataccta caggatcaac agctcctagg aagttggggg 1800
tgctctggaa gacacatttg caccactact gtgccctgga actctagttg gagtaataaa 1860
tctatagatg acatttggaa taacatgacc tggatggagt gggaaaaaga aattgacaat 1920
tacacaggtg taatatacag attaattgag gaatcgcaaa cccagcaaga aaagaatgaa 1980
caagaactat tgcaattgga caaatgggca agtttgtgga attggtttag cataacaaaa 2040
tggctgtggt atataaaaat attcataatg atagtaggag gcttaatagg gttaagaata 2100
gtttttactg tgctttcttt agtaaataga gttaggcagg gatactcacc tctatcgttt 2160
cagaccctct tcccagcccc gaggggaccc gacaggcccg aagaaataga agaaggaggt 2220
ggagagcaag gcagaggcag atccactcga ttggtgaacg gattctcaac acttatctgg 2280
gacgatctga ggaacctgtg cctcttcagc taccaccgct tgagagactt aatcttaatt 2340
gcaacgagga ttgtggaact tctgggacgc agggggtggg aagccatcaa atatttgtgg 2400
aatctcctgc agtattggag tcaggaactg aagactagtg ctattagctt gtttaacgct 2460
acagcagtag cagtagctga ggggacagat agggttatag aagtagtaca aagatttttt 2520
agaggtattc ttaacgtacc cacacgaata agacagggct tggaaagggc gttactataa 2580


<210> 15
<211> 851
<212> PRT
<213> HIV-1

<400> 15
Met Arg Val Arg Glu Ile Glu Arg Asn Tyr Gln His Leu Trp Arg Trp
 1               5                  10                  15      
Ile Thr Met Leu Leu Gly Met Leu Met Ile Cys Ser Val Thr Gly Gln
            20                  25                  30          
Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr
        35                  40                  45              
Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Lys Ala Glu Ala
    50                  55                  60                  
His Asn Ile Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro
65                  70                  75                  80  
Gln Glu Ile Gln Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys
                85                  90                  95      
Asn Asn Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp
            100                 105                 110         
Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
        115                 120                 125             
Asn Cys Thr Glu Trp Glu Asn Thr Asn Arg Thr Asn Asn Asn Val Thr
    130                 135                 140                 
Asn Glu Glu Ile Gly Met Lys Asn Cys Ser Phe Asn Thr Thr Thr Glu
145                 150                 155                 160 
Val Arg Asp Arg Lys Gln Gln Val His Ala Leu Phe Tyr Lys Leu Asp
                165                 170                 175     
Val Val Pro Met Asn Asp Asn Asn Ser Thr Asp Ile Asn Tyr Thr Asn
            180                 185                 190         
Tyr Arg Leu Ile Asn Cys Asn Thr Ser Ala Ile Thr Gln Ala Cys Pro
        195                 200                 205             
Lys Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly
    210                 215                 220                 
Phe Ala Ile Leu Lys Cys Asn Asn Lys Lys Phe Asn Gly Met Gly Ser
225                 230                 235                 240 
Cys Asn Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro Val
                245                 250                 255     
Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Ile
            260                 265                 270         
Ile Ile Arg Thr Glu Asn Ile Ser Asp Asn Ala Lys Ile Ile Ile Val
        275                 280                 285             
Gln Leu Asn Glu Ser Val Thr Ile Asn Cys Thr Arg Pro Tyr Asn Asn
    290                 295                 300                 
Thr Arg Lys Gly Thr His Ile Gly Pro Gly Arg Ala Trp Tyr Thr Thr
305                 310                 315                 320 
Gly Ile Val Gly Asp Ile Arg Gln Ala His Cys Lys Val Asn Lys Thr
                325                 330                 335     
Glu Trp Asn Lys Thr Leu Glu Arg Val Ala Lys Lys Leu Arg Asp Leu
            340                 345                 350         
Ile Asn Lys Thr Thr Ile Lys Phe Ser Pro Pro Ser Gly Gly Asp Leu
        355                 360                 365             
Glu Ile Thr Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys
    370                 375                 380                 
Asn Thr Ser Arg Leu Phe Asn Ser Thr Trp Gly Asp Asn Asn Thr Ser
385                 390                 395                 400 
Ser Asp Thr Glu Glu Gly Asn Ile Thr Ile Pro Cys Arg Ile Lys Gln
                405                 410                 415     
Ile Ile Asn Met Trp Gln Gly Val Gly Lys Ala Met Tyr Ala Pro Pro
            420                 425                 430         
Ile Glu Gly Leu Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu
        435                 440                 445             
Thr Tyr Asp Gly Gly Val Asn Asn Asn Ser Gln Ser Glu Ile Phe Arg
    450                 455                 460                 
Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys
465                 470                 475                 480 
Tyr Lys Val Val Arg Leu Glu Pro Leu Gly Leu Ala Pro Thr Lys Ala
                485                 490                 495     
Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala Ile Gly Leu Gly Ala
            500                 505                 510         
Met Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala
        515                 520                 525             
Ser Leu Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser Gly Ile Val
    530                 535                 540                 
Gln Gln Gln Asn Asn Leu Leu Met Ala Ile Glu Ala Gln Gln His Leu
545                 550                 555                 560 
Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val Leu
                565                 570                 575     
Ala Val Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly Ile Trp Gly
            580                 585                 590         
Cys Ser Gly Arg Leu Ile Cys Thr Thr Asn Val Pro Trp Asn Ser Ser
        595                 600                 605             
Trp Ser Asn Lys Ser Ile Asp Glu Ile Trp Asn Asn Met Thr Trp Met
    610                 615                 620                 
Gln Trp Glu Ser Glu Ile Asp Asn Tyr Thr Gly Leu Ile Tyr Lys Leu
625                 630                 635                 640 
Ile Glu Glu Ser Gln Ile Gln Gln Asn Lys Asn Glu Lys Glu Leu Leu
                645                 650                 655     
Glu Leu Asp Lys Trp Ala Ser Leu Trp Thr Trp Phe Asp Ile Thr Asn
            660                 665                 670         
Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu Ile
        675                 680                 685             
Gly Leu Arg Ile Val Phe Ala Val Leu Ser Leu Val Asn Arg Val Arg
    690                 695                 700                 
Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr Leu Leu Pro Val Pro Arg
705                 710                 715                 720 
Gly Pro Asp Arg Pro Glu Glu Ile Glu Glu Gly Gly Gly Glu Gln Asp
                725                 730                 735     
Arg Gly Arg Ser Val Arg Leu Val Asn Gly Phe Ser Ala Leu Ile Trp
            740                 745                 750         
Asp Asp Leu Arg Asn Leu Cys Leu Phe Ser Tyr His Arg Leu Arg Asp
        755                 760                 765             
Leu Ile Leu Ile Ala Thr Arg Ile Val Glu Leu Leu Gly Arg Arg Gly
    770                 775                 780                 
Trp Glu Thr Leu Lys Tyr Leu Trp Asn Leu Leu Gln Tyr Trp Ile Gln
785                 790                 795                 800 
Glu Leu Lys Asn Ser Ala Ile Ser Leu Phe Asn Thr Thr Ala Ile Val
                805                 810                 815     
Val Ala Glu Gly Thr Asp Arg Phe Leu Glu Ile Ile Gln Arg Ile Gly
            820                 825                 830         
Arg Ala Ile Leu Asn Ile Pro Thr Arg Ile Arg Gln Gly Phe Glu Arg
        835                 840                 845             
Ala Leu Leu
    850     


<210> 16
<211> 2556
<212> DNA
<213> HIV-1

<400> 16
atgagagtga gggagataga gaggaattat caacacttgt ggagatggat caccatgctc 60
cttgggatgt tgatgatatg tagtgttaca ggacagttat gggtcacagt ttattatggg 120
gtacctgtgt ggaaagaagc aaccactact ctattttgtg catcagatgc taaagcatat 180
aaagcagagg cacataatat ctgggctaca catgcctgtg taccaacaga ccccaaccca 240
caagaaatac aattagaaaa tgtcacagaa aattttaaca tgtggaaaaa taacatggtg 300
gaacaaatgc atgaggatat aatcagttta tgggatcaaa gcctaaaacc atgtgtaaaa 360
ttaaccccac tctgtgtcac tttaaactgc actgaatggg aaaatacaaa tagaactaac 420
aacaacgtca ctaatgagga aataggaatg aaaaactgct ctttcaatac aaccacagaa 480
gtaagagata ggaagcagca agtacatgca cttttttata aacttgatgt ggtaccaatg 540
aatgataata atagtactga tatcaattat accaattata gactaataaa ttgtaatacc 600
tcagccatta cacaggcgtg tccaaaggta acctttgagc caattcccat acattattgt 660
gccccagctg gatttgcaat tctaaagtgt aacaataaga agttcaatgg gatgggatca 720
tgcaacaatg tcagcacagt acagtgtaca catgggatta agccagtagt gtcaacccaa 780
ttgttgttga atggtagtct agcagaggag gaaataataa ttagaactga aaatatctca 840
gataatgcaa aaatcataat agtacagctt aatgagtctg taacaattaa ttgcacaagg 900
ccctacaaca atacaagaaa aggtacacac ataggaccag ggcgagcatg gtatacaaca 960
ggaatagtag gagatataag acaagcacac tgtaaggtta ataaaacaga atggaataaa 1020
actttagaac gggtagctaa aaaattaaga gaccttatta ataagacaac aataaagttt 1080
agtccaccct cgggagggga cctagaaatt acaacacaca gctttaattg tggaggggaa 1140
tttttctact gcaatacatc aagactgttt aatagtacat ggggggataa taatacatca 1200
agtgatacag aggaaggtaa catcaccatc ccatgtagaa taaaacaaat tataaacatg 1260
tggcaaggag taggaaaagc aatgtatgcc cctcccattg aaggactaat cagatgttca 1320
tcaaatatta caggattact gttaacatat gatgggggtg taaataataa tagtcagagt 1380
gagatcttca gacctggagg aggagatatg agagacaatt ggagaagtga attatacaaa 1440
tataaagtag taagacttga accactaggt ctagcaccca ccaaggcaaa aagaagagtg 1500
gtggaaagag aaaaaagagc aataggccta ggagctatgt tccttgggtt cttgggagca 1560
gcaggaagca cgatgggcgc agcgtcattg acgctgacgg tacaggccag acaattattg 1620
tctggtatag tgcagcagca aaacaatctg ctgatggcta tagaggcgca acagcatctg 1680
ttgcaactca cagtctgggg cattaaacag ctccaggcaa gagtcctggc tgtggaaaga 1740
tacctaaagg accaacagct cctaggaatt tggggttgct ctggaagact catttgcacc 1800
actaatgtgc catggaactc tagctggagt aataaatcca tagatgagat ttggaataac 1860
atgacctgga tgcagtggga aagtgaaatt gacaattaca caggtttaat atataaatta 1920
attgaagaat cgcaaatcca gcaaaacaaa aatgaaaaag aactattgga attggacaaa 1980
tgggcaagtt tgtggacttg gtttgacata acaaactggc tgtggtatat aaaaatattc 2040
ataatgatag taggaggctt gataggttta agaatagttt ttgctgtgct ttctttagta 2100
aatagagtta ggcagggata ttcacctctg tcttttcaga ccctcctccc agtcccgagg 2160
ggacccgaca ggcccgaaga aatagaagaa ggaggtggag agcaagacag aggcagatca 2220
gtgcgattgg tgaacggatt ctcagcactt atctgggacg atctgaggaa cctgtgcctc 2280
ttcagctacc accgcttgag agacttaatc ttaattgcaa cgaggattgt ggaacttctg 2340
ggacgcaggg ggtgggaaac cctcaaatat ctgtggaatc tcctgcagta ctggattcag 2400
gaactaaaga atagtgctat tagcttgttt aataccacag caatagtagt agctgagggg 2460
acagataggt ttttagaaat aatacaaaga attggtagag ctattcttaa tatacccacg 2520
cgaataagac agggctttga aagagcttta ctataa                           2556

<210> 17
<211> 857
<212> PRT
<213> HIV-1

<400> 17
Met Arg Val Lys Glu Thr Gln Met Asn Trp Pro Asn Leu Trp Lys Trp
 1               5                  10                  15      
Gly Thr Leu Ile Leu Gly Leu Val Ile Ile Cys Ser Ala Ser Asp Asn
            20                  25                  30          
Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Asp Ala Asp
        35                  40                  45              
Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala His Glu Thr Glu Val
    50                  55                  60                  
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro
65                  70                  75                  80  
Gln Glu Ile His Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys
                85                  90                  95      
Asn Asn Met Val Glu Gln Met Gln Glu Asp Val Ile Ser Leu Trp Asp
            100                 105                 110         
Gln Ser Leu Gln Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
        115                 120                 125             
His Cys Thr Thr Ala Lys Leu Thr Asn Val Thr Asn Ile Thr Asn Val
    130                 135                 140                 
Pro Asn Ile Gly Asn Ile Thr Asp Glu Val Arg Asn Cys Ser Phe Asn
145                 150                 155                 160 
Met Thr Thr Glu Ile Arg Asp Lys Lys Gln Lys Val His Ala Leu Phe
                165                 170                 175     
Tyr Lys Leu Asp Ile Val Gln Ile Glu Asp Lys Asn Asp Ser Ser Lys
            180                 185                 190         
Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile Lys Gln Ala Cys Pro
        195                 200                 205             
Lys Ile Ser Phe Asp Pro Ile Pro Ile His Tyr Cys Thr Pro Ala Gly
    210                 215                 220                 
Tyr Val Ile Leu Lys Cys Asn Asp Lys Asn Phe Asn Gly Thr Gly Pro
225                 230                 235                 240 
Cys Lys Asn Val Ser Ser Val Gln Cys Thr His Gly Ile Lys Pro Val
                245                 250                 255     
Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Ile
            260                 265                 270         
Ile Ile Arg Ser Glu Asn Leu Thr Asn Asn Ala Lys Thr Ile Ile Val
        275                 280                 285             
His Leu Asn Lys Ser Val Glu Ile Asn Cys Thr Arg Pro Ser Asn Asn
    290                 295                 300                 
Met Arg Thr Ser Met Arg Ile Gly Pro Gly Gln Val Phe Tyr Arg Thr
305                 310                 315                 320 
Gly Ser Ile Thr Gly Asp Ile Arg Lys Ala Tyr Cys Glu Ile Asn Gly
                325                 330                 335     
Thr Lys Trp Asn Lys Val Leu Lys Gln Val Thr Glu Lys Leu Lys Glu
            340                 345                 350         
His Phe Asn Asn Lys Thr Ile Ile Phe Gln Pro Pro Ser Gly Gly Asp
        355                 360                 365             
Leu Glu Ile Thr Met His His Phe Asn Cys Arg Gly Glu Phe Phe Tyr
    370                 375                 380                 
Cys Asn Thr Thr Gln Leu Phe Asn Asn Thr Cys Ile Gly Asn Glu Thr
385                 390                 395                 400 
Met Lys Gly Cys Asn Gly Thr Ile Thr Leu Pro Cys Lys Ile Lys Gln
                405                 410                 415     
Ile Ile Asn Met Trp Gln Gly Thr Gly Gln Ala Met Tyr Ala Pro Pro
            420                 425                 430         
Ile Asp Gly Lys Ile Asn Cys Val Ser Asn Ile Thr Gly Ile Leu Leu
        435                 440                 445             
Thr Arg Asp Gly Gly Ala Asn Asn Thr Ser Asn Glu Thr Phe Arg Pro
    450                 455                 460                 
Gly Gly Gly Asn Ile Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr
465                 470                 475                 480 
Lys Val Val Gln Ile Glu Pro Leu Gly Ile Ala Pro Thr Arg Ala Lys
                485                 490                 495     
Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Met
            500                 505                 510         
Ile Phe Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser
        515                 520                 525             
Ile Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser Gly Ile Val Gln
    530                 535                 540                 
Gln Gln Ser Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln His Leu Leu
545                 550                 555                 560 
Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala
                565                 570                 575     
Val Glu Arg Tyr Leu Lys Asp Gln Lys Phe Leu Gly Leu Trp Gly Cys
            580                 585                 590         
Ser Gly Lys Ile Ile Cys Thr Thr Ala Val Pro Trp Asn Ser Thr Trp
        595                 600                 605             
Ser Asn Lys Ser Phe Glu Glu Ile Trp Asn Asn Met Thr Trp Ile Glu
    610                 615                 620                 
Trp Glu Arg Glu Ile Ser Asn Tyr Thr Asn Gln Ile Tyr Glu Ile Leu
625                 630                 635                 640 
Thr Glu Ser Gln Asn Gln Gln Asp Arg Asn Glu Lys Asp Leu Leu Glu
                645                 650                 655     
Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asp Ile Thr Asn Trp
            660                 665                 670         
Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu Ile Gly
        675                 680                 685             
Leu Arg Ile Ile Phe Ala Val Leu Ser Ile Val Asn Arg Val Arg Gln
    690                 695                 700                 
Gly Tyr Ser Pro Leu Ser Phe Gln Thr Pro Ile His His Gln Arg Glu
705                 710                 715                 720 
Pro Asp Arg Pro Glu Arg Ile Glu Glu Gly Gly Gly Glu Gln Gly Arg
                725                 730                 735     
Asp Arg Ser Val Arg Leu Val Ser Gly Phe Leu Ser Leu Ala Trp Asp
            740                 745                 750         
Asp Leu Arg Ser Leu Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Phe
        755                 760                 765             
Ile Leu Ile Ala Thr Arg Thr Val Glu Leu Leu Gly His Ser Ser Leu
    770                 775                 780                 
Lys Gly Leu Arg Arg Gly Trp Glu Gly Leu Lys Tyr Leu Gly Asn Leu
785                 790                 795                 800 
Leu Leu Tyr Trp Gly Gln Glu Leu Lys Ile Ser Ala Ile Ser Leu Leu
                805                 810                 815     
Asn Thr Thr Ala Ile Ala Val Ala Gly Trp Thr Asp Arg Val Ile Glu
            820                 825                 830         
Val Ala Gln Gly Ala Trp Arg Ala Ile Leu His Ile Pro Arg Arg Ile
        835                 840                 845             
Arg Gln Gly Leu Glu Arg Thr Leu Leu
    850                 855         


<210> 18
<211> 2574
<212> DNA
<213> HIV-1

<400> 18
atgagagtga aggagacaca gatgaattgg ccaaacttgt ggaaatgggg gactttgatc 60
cttgggttgg tgataatttg tagtgcctca gacaacttgt gggttacagt ttattatggg 120
gttcctgtgt ggaaagatgc agataccacc ctattttgtg catcagatgc caaagcacat 180
gagacagaag tgcacaatgt ctgggccaca catgcctgtg tacccacaga ccccaaccca 240
caagaaatac acctggaaaa tgtaacagaa aattttaaca tgtggaaaaa taacatggta 300
gagcagatgc aggaggatgt aatcagttta tgggatcaaa gtctacagcc atgtgtaaag 360
ttaactcctc tctgcgttac tttacattgt accactgcta aattgaccaa tgtcactaac 420
ataaccaatg tccctaacat aggaaatata acagatgaag taagaaactg ttcttttaat 480
atgaccacag aaataagaga taagaagcag aaggtccatg cactttttta taagcttgat 540
atagtacaaa ttgaagataa gaatgatagt agtaagtata ggttaataaa ttgtaatact 600
tcagtcatta agcaggcttg tccaaagata tcctttgatc caattcctat acactattgt 660
actccagctg gttatgtgat tttaaagtgt aatgataaga atttcaatgg gacagggcca 720
tgtaaaaatg tcagctcagt acaatgcaca catggaatta agccagtggt atcaactcaa 780
ttgctgttaa atggcagtct agcagaagaa gagataataa tcagatctga aaatctcaca 840
aacaatgcca aaaccataat agtgcacctt aataaatctg tagaaatcaa ttgtaccaga 900
ccctccaaca atatgagaac aagtatgcgt ataggaccag gacaagtatt ctatagaaca 960
ggaagcataa caggagatat aagaaaagca tattgtgaga ttaatggaac aaaatggaat 1020
aaagttttaa aacaggtaac tgaaaaatta aaagagcact ttaataataa gacaataatc 1080
tttcaaccac cctcaggagg agatctagaa attacaatgc atcattttaa ttgtagaggg 1140
gaatttttct attgcaatac aacacaactg tttaataata cttgcatagg aaatgaaacc 1200
atgaaggggt gtaatggcac tatcacactt ccatgcaaga taaagcaaat tataaacatg 1260
tggcagggaa caggacaagc aatgtatgct cctcccatcg atggaaaaat taattgtgta 1320
tcaaatatta caggaatact attgacaaga gatggtggtg ctaataatac gagtaacgag 1380
accttcagac ctggaggagg aaatataaag gacaattgga gaagtgaatt atataaatat 1440
aaagtagtac aaattgaacc actaggaata gcacccacca gggcaaagag aagagtggtg 1500
gagagagaaa aaagagcagt gggaatagga gctatgatct ttgggttctt aggagcagcc 1560
ggaagcacta tgggcgcggc gtcaataacg ctgacggtac aggccagaca attattgtct 1620
ggtatagtgc aacagcaaag caatttgctg agggctatag aggcgcagca gcatctgttg 1680
caactcacag tctggggcat taaacagctc caggcaagag tcctggctgt ggaaagatac 1740
ctaaaggatc aaaagttcct aggactttgg ggctgctctg gaaaaatcat ctgcaccact 1800
gctgtgccct ggaactccac ttggagtaat aaatcttttg aagagatttg gaacaacatg 1860
acatggatag aatgggagag agaaattagc aattacacaa accaaatata tgagatactt 1920
acagaatcgc agaaccagca ggataggaat gaaaaggatt tgttagaatt ggataaatgg 1980
gcaagtctgt ggaattggtt tgacataaca aattggctgt ggtatataaa aatatttata 2040
atgatagtag gaggtttaat aggtttaaga ataatttttg ctgtgctttc tatagtgaat 2100
agagttaggc agggatactc acctttgtct ttccagaccc ctatccatca tcagagggaa 2160
cccgacagac ccgaaagaat cgaagaagga ggtggcgagc aaggcagaga cagatccgtg 2220
cgattagtga gcggattctt atcacttgcc tgggacgatc tacggagcct gtgcctcttc 2280
agctaccacc gcttgagaga cttcatcttg attgcaacga ggactgtgga acttctggga 2340
cacagcagtc tcaagggact gagacggggg tgggaaggcc tcaaatatct ggggaatctt 2400
ctgttatatt ggggccagga actaaaaatt agtgctattt ctttgcttaa tactacagca 2460
atagcagtag cggggtggac agatagggtt atagaagtag cacaaggagc ttggagagcc 2520
attctccaca tacctagaag aatcagacag ggcttagaaa ggactttgct ataa       2574

<210> 19
<211> 862
<212> PRT
<213> HIV-1

<400> 19
Met Arg Val Lys Glu Thr Gln Met Ile Trp Pro Asn Leu Trp Lys Trp
 1               5                  10                  15      
Gly Thr Leu Ile Leu Gly Leu Val Ile Ile Cys Ser Ala Ser Asp Asn
            20                  25                  30          
Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Arg Asp Ala Glu
        35                  40                  45              
Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala His Glu Thr Glu Val
    50                  55                  60                  
His Asn Ile Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro
65                  70                  75                  80  
Gln Glu Ile Arg Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys
                85                  90                  95      
Asn Asn Met Val Glu Gln Met Gln Glu Asp Val Ile Ser Leu Trp Asp
            100                 105                 110         
Gln Ser Leu Gln Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
        115                 120                 125             
Asn Cys Thr Asn Ala Thr Leu Asn Ala Asn Leu Thr Tyr Val Asn Asn
    130                 135                 140                 
Ile Thr Asn Gly His Asn Thr Ile Gly Asn Ile Thr Asp Glu Val Lys
145                 150                 155                 160 
Asn Cys Ser Phe Lys Met Thr Thr Glu Leu Arg Asp Lys Arg Lys Lys
                165                 170                 175     
Val His Ala Leu Phe Tyr Lys Leu Asp Ile Val Gln Leu Lys Gly Lys
            180                 185                 190         
Gly Asn Lys Asn Lys Asn Asn Asn Phe Ser Gln Tyr Arg Leu Met Ser
        195                 200                 205             
Cys Asn Thr Ser Val Ile Lys Gln Ala Cys Pro Lys Ile Thr Phe Asp
    210                 215                 220                 
Pro Ile Pro Ile His Tyr Cys Thr Pro Ala Gly Tyr Ala Ile Leu Lys
225                 230                 235                 240 
Cys Asn Asp Lys Asn Phe Asn Gly Thr Gly Pro Cys Lys Asp Val Ser
                245                 250                 255     
Ser Val Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu
            260                 265                 270         
Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Ile Val Ile Arg Ser Glu
        275                 280                 285             
Asn Leu Thr Asn Asn Ala Lys Thr Ile Ile Val His Leu Asn Lys Ser
    290                 295                 300                 
Val Glu Ile Asn Cys Thr Arg Pro Ser Asn Asn Thr Arg Ala Ser Thr
305                 310                 315                 320 
Arg Ile Gly Pro Gly Gln Val Phe Tyr Arg Thr Gly Asp Ile Ile Gly
                325                 330                 335     
Asp Ile Arg Lys Ala Tyr Cys Glu Ile Asn Gly Thr Lys Trp Asn Glu
            340                 345                 350         
Ile Leu Lys Gln Val Ala Glu Lys Leu Lys Glu His Phe Asn Asn Lys
        355                 360                 365             
Thr Ile Ile Phe Gln Pro Pro Ser Gly Gly Asp Pro Glu Val Thr Met
    370                 375                 380                 
His His Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys Asn Thr Ser Lys
385                 390                 395                 400 
Met Phe Asn Ser Thr Leu Gly Gly Leu Asn Gly Thr Ile Ile Leu Pro
                405                 410                 415     
Cys Lys Ile Lys Gln Ile Ile Asn Met Trp Gln Arg Val Gly Gln Ala
            420                 425                 430         
Val Tyr Ala Pro Pro Ile Ser Gly Arg Ile Asn Cys Val Ser Asn Ile
        435                 440                 445             
Thr Gly Ile Leu Leu Thr Arg Asp Gly Gly Ala Asn Asn Ala Thr Asn
    450                 455                 460                 
Glu Thr Phe Arg Pro Gly Gly Gly Asn Ile Lys Asp Asn Trp Arg Ser
465                 470                 475                 480 
Glu Leu Tyr Lys Tyr Lys Val Val Gln Ile Glu Pro Leu Gly Ile Ala
                485                 490                 495     
Pro Thr Arg Ala Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala Ala
            500                 505                 510         
Gly Ile Gly Ala Met Ile Phe Gly Phe Leu Gly Ala Ala Gly Ser Thr
        515                 520                 525             
Met Gly Ala Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln Leu Leu
    530                 535                 540                 
Ser Gly Ile Val Gln Gln Gln Ser Asn Leu Leu Arg Ala Ile Glu Ala
545                 550                 555                 560 
Gln Gln His Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln
                565                 570                 575     
Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Asn Asp Gln Lys Phe Leu
            580                 585                 590         
Gly Leu Trp Gly Cys Ser Gly Lys Ile Ile Cys Thr Thr Ala Val Pro
        595                 600                 605             
Trp Asn Ser Thr Trp Ser Asn Lys Ser Tyr Glu Glu Ile Trp Asn Asn
    610                 615                 620                 
Met Thr Trp Val Glu Trp Glu Arg Glu Ile Ser Asn Tyr Thr Asn Gln
625                 630                 635                 640 
Ile Tyr Asp Ile Leu Thr Glu Ser Gln Asn Gln Gln Asp Lys Asn Glu
                645                 650                 655     
Lys Asp Leu Leu Glu Leu Asp Lys Trp Ala Asn Leu Trp Ser Trp Phe
            660                 665                 670         
Asn Ile Thr Asn Trp Leu Trp Tyr Ile Lys Ile Phe Ile Ile Ile Val
        675                 680                 685             
Gly Gly Leu Ile Gly Leu Arg Ile Val Phe Ala Val Leu Ser Ile Val
    690                 695                 700                 
Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr Pro Ser
705                 710                 715                 720 
His His Gln Arg Glu Thr Asp Arg Pro Glu Gly Ile Glu Glu Glu Gly
                725                 730                 735     
Gly Glu Gln Gly Arg Asp Arg Ser Val Arg Leu Val Ser Gly Phe Leu
            740                 745                 750         
Ala Leu Val Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe Ser Tyr His
        755                 760                 765             
Arg Leu Arg Asp Phe Ile Leu Ile Ala Ala Arg Thr Leu Glu Ile Leu
    770                 775                 780                 
Gly His Ser Ser Leu Lys Gly Leu Arg Arg Gly Trp Glu Val Leu Lys
785                 790                 795                 800 
Tyr Leu Gly Asn Leu Leu Ser Tyr Trp Gly Gln Glu Leu Lys Thr Ser
                805                 810                 815     
Ala Ile Ser Leu Leu Asp Ala Thr Ala Ile Ala Val Ala Gly Trp Thr
            820                 825                 830         
Asp Arg Val Ile Glu Val Ala Arg Arg Ala Trp Arg Ala Phe Ile His
        835                 840                 845             
Ile Pro Arg Arg Ile Arg Gln Gly Phe Glu Arg Ala Leu Leu
    850                 855                 860         


<210> 20
<211> 2589
<212> DNA
<213> HIV-1

<400> 20
atgagagtga aggagacaca gatgatttgg ccaaacttgt ggaaatgggg gactttgatc 60
cttggattgg taataatttg tagtgcctca gacaacttgt gggttacagt ttattatggg 120
gttcctgtgt ggagagatgc agaaaccacc ctattttgtg catcagatgc caaagcacat 180
gagacagaag tgcacaatat ttgggccaca catgcctgtg tacccacaga ccccaaccca 240
caagaaatac gtctggaaaa tgtaacagag aattttaaca tgtggaaaaa taacatggta 300
gagcagatgc aggaggatgt aatcagttta tgggatcaaa gtctacagcc atgtgtaaag 360
ttaactcctc tctgcgttac tttaaattgt accaatgcta cgttgaatgc taatttgacc 420
tatgtcaata acataactaa tggccataat acaataggaa atataacaga tgaagtaaaa 480
aactgttctt ttaagatgac cacagaacta agagataaga ggaagaaagt ccatgcactt 540
ttttataagc ttgatatagt acaacttaaa ggtaaaggta ataaaaataa gaataataat 600
tttagtcagt ataggttaat gagttgtaat acttcagtca ttaagcaggc ttgtccaaag 660
ataacctttg atccaattcc tatacattat tgtactccag ctggttatgc gattttaaag 720
tgtaatgata agaatttcaa tgggacaggg ccctgtaaag atgtcagctc agtacaatgc 780
acacatggaa tcaagccagt ggtatcaact cagttgttgt taaatggcag tctagcagag 840
gaagagatag taatcagatc tgaaaatctc acaaacaatg ccaaaaccat aatagtgcac 900
cttaataaat ctgtagaaat caattgtacc agaccctcca acaatacaag agcaagtaca 960
cgtataggac caggacaagt attctataga acaggagaca taattggaga tataaggaaa 1020
gcatattgtg agattaatgg aacaaaatgg aatgaaattt taaaacaggt agctgaaaaa 1080
ttaaaagagc attttaataa taagacaata atctttcaac caccctcagg aggagatcca 1140
gaagttacaa tgcatcattt taattgtaga ggggaatttt tctattgcaa tacatcaaaa 1200
atgtttaata gtactttggg ggggcttaat ggcactatca tacttccatg caagataaag 1260
caaataataa atatgtggca gagagtagga caagcagtgt atgctcctcc catcagtgga 1320
agaattaatt gtgtatcaaa tattacagga atactattga caagagatgg tggtgctaat 1380
aatgcaacta acgagacctt tagacctgga ggaggaaata taaaggacaa ttggagaagt 1440
gaattataca aatataaagt agtacaaatt gaacctctag gaatagcacc caccagggca 1500
aagagaagag tggtggagag agaaaaaaga gcagcaggaa taggagctat gatctttggg 1560
ttcttgggag cagcaggaag cactatgggc gcagcgtcaa taacgctgac ggtacaggcc 1620
agacaattat tgtctggtat agtgcaacag caaagcaatt tgctgagagc tatagaggcg 1680
cagcagcatc tgttgcaact cacagtctgg ggcattaaac agctccaggc aagagtcctg 1740
gctgtggaaa gatacctaaa tgatcaaaag ttcctaggac tttggggctg ctctggaaaa 1800
atcatctgca ccactgctgt gccctggaac tccacttgga gtaataaatc ttatgaagag 1860
atttggaaca acatgacatg ggtagaatgg gagagagaaa ttagcaatta cacaaaccaa 1920
atatatgaca tacttacaga atcacagaac cagcaggaca agaatgaaaa ggatttgttg 1980
gaattggata aatgggcaaa tctgtggagt tggtttaaca taacaaactg gctgtggtat 2040
ataaaaatat ttataataat agtaggaggc ttaataggtt taagaatagt ttttgctgtg 2100
ctttctatag taaatagagt taggcaggga tactcacctt tgtctttcca gaccccctcc 2160
catcaccaga gggaaaccga cagacccgaa ggaatcgaag aagaaggtgg cgagcaaggc 2220
agagacagat ccgtgcgctt agtgagcgga ttcttagctc ttgtctggga cgatctacgg 2280
agcctgtgcc tcttcagcta ccaccgcttg agagacttca tcttgattgc agcgaggact 2340
ctggaaattc tgggacacag cagtctcaag ggactgagac gggggtggga agtcctcaaa 2400
tatctgggga atcttctgtc atattggggc caggagctaa aaactagtgc tatttctttg 2460
cttgatgcta cagcaatagc agtagcgggg tggacagata gggttataga agtagcacga 2520
agagcttgga gagcttttat ccacatacct aggagaatca gacagggctt tgaaagggct 2580
ttgctataa                                                         2589

<210> 21
<211> 858
<212> PRT
<213> HIV-1

<400> 21
Met Arg Val Arg Gly Met Gln Arg Ser Trp Gln His Leu Gly Lys Trp
 1               5                  10                  15      
Gly Leu Leu Phe Leu Gly Ile Leu Ile Ile Cys Asn Ala Thr Glu Asn
            20                  25                  30          
Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr
        35                  40                  45              
Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Ser Glu Val
    50                  55                  60                  
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asp Pro
65                  70                  75                  80  
Gln Glu Ile Lys Leu Asn Val Thr Glu Asn Phe Asp Met Trp Lys Asn
                85                  90                  95      
Asn Met Val Glu Gln Met His Thr Asp Ile Ile Ser Leu Trp Asp Gln
            100                 105                 110         
Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Asp
        115                 120                 125             
Cys Thr Asp Val Pro Ile Asn Ser Thr Ser Thr Leu Lys Glu Asp Glu
    130                 135                 140                 
Gly Ala Ile Lys Asn Cys Ser Phe Asn Met Thr Thr Glu Val Arg Asp
145                 150                 155                 160 
Lys Gln Gln Lys Val Gln Ala Leu Phe Tyr Lys Leu Asp Met Val Pro
                165                 170                 175     
Ile Ser Asp Asp Ser Thr Arg Asp Ser Asn Val Ser Ser Asn Gly Thr
            180                 185                 190         
Arg Gln Tyr Arg Leu Ile His Cys Asn Thr Ser Thr Ile Thr Gln Ala
        195                 200                 205             
Cys Pro Lys Ile Ser Trp Asp Pro Ile Pro Ile His Tyr Cys Ala Pro
    210                 215                 220                 
Ala Gly Tyr Ala Ile Leu Lys Cys Tyr Asp Thr Glu Phe Asn Gly Thr
225                 230                 235                 240 
Gly Pro Cys Arg Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys
                245                 250                 255     
Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Gly Pro
            260                 265                 270         
Glu Ile Ile Ile Arg Ser Gln Asn Ile Ser Asp Asn Ala Lys Thr Ile
        275                 280                 285             
Ile Val His Leu Ser Glu Ser Val Trp Ile Asn Cys Thr Arg Pro Asn
    290                 295                 300                 
Asn Asn Thr Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe His
305                 310                 315                 320 
Thr Thr Asp Arg Ile Val Gly Asp Ile Arg Lys Ala His Cys Asn Val
                325                 330                 335     
Ser Arg Gly Glu Trp Ser Lys Thr Leu Gly Gln Val Arg Lys Ala Leu
            340                 345                 350         
Gly Ser His Phe Pro Asn Gly Thr Thr Ile Lys Phe Asn Ser Ser Ser
        355                 360                 365             
Gly Gly Asp Pro Glu Val Thr Met His Met Phe Asn Cys Arg Gly Glu
    370                 375                 380                 
Phe Phe Tyr Cys Asn Thr Ser Arg Leu Phe Asn Asp Thr Glu Phe Phe
385                 390                 395                 400 
Asn Asp Thr Asp Ser Asn Ser Thr Asp Pro Ile Thr Leu Pro Cys Arg
                405                 410                 415     
Ile Arg Gln Ile Val Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr
            420                 425                 430         
Ala Ala Pro Ile Ala Gly Ser Ile Thr Cys Asn Ser Thr Ile Thr Gly
        435                 440                 445             
Leu Leu Leu Thr Arg Asp Gly Gly Asn Asn Ala Asn Lys Thr Glu Asn
    450                 455                 460                 
Arg Thr Glu Thr Phe Arg Pro Gly Gly Gly Asn Met Lys Asp Asn Trp
465                 470                 475                 480 
Arg Asn Glu Leu Tyr Lys Tyr Lys Val Val Glu Ile Glu Pro Leu Gly
                485                 490                 495     
Val Ala Pro Thr Arg Ala Lys Arg Gln Val Val Lys Arg Glu Lys Arg
            500                 505                 510         
Ala Val Gly Met Leu Gly Ala Met Phe Leu Gly Phe Leu Gly Ala Ala
        515                 520                 525             
Gly Ser Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val Gln Ala Arg
    530                 535                 540                 
Gln Leu Leu Ser Gly Ile Val Gln Gln Gln Asn Asn Leu Leu Arg Ala
545                 550                 555                 560 
Ile Glu Ala Gln Gln His Leu Leu Gln Leu Thr Val Trp Gly Ile Lys
                565                 570                 575     
Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Gln Asp Gln
            580                 585                 590         
Gln Phe Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr
        595                 600                 605             
Thr Val Pro Trp Asn Ser Ser Trp Ser Asn Lys Thr Tyr Asn Glu Ile
    610                 615                 620                 
Trp Asp Lys Met Thr Trp Met Gln Trp Glu Lys Glu Ile Ser Asn Tyr
625                 630                 635                 640 
Ser Thr Glu Ile Tyr Arg Leu Ile Glu Glu Ser Gln Tyr Gln Gln Glu
                645                 650                 655     
Lys Asn Glu Gln Glu Leu Leu Ser Leu Asp Lys Trp Ala Ser Leu Trp
            660                 665                 670         
Asn Trp Phe Asp Ile Ser Asn Trp Leu Trp Tyr Ile Lys Ile Phe Ile
        675                 680                 685             
Met Ile Val Gly Gly Leu Ile Gly Leu Arg Ile Val Phe Ala Val Leu
    690                 695                 700                 
Ser Ile Val Asn Arg Val Arg Lys Gly Tyr Ser Pro Leu Ser Phe Gln
705                 710                 715                 720 
Thr His Ile Pro Ser Pro Arg Glu Pro Asp Arg Pro Glu Gly Ile Glu
                725                 730                 735     
Glu Gly Gly Gly Glu Gln Asp Lys Asp Arg Ser Val Arg Leu Val Ser
            740                 745                 750         
Gly Phe Leu Ser Leu Val Trp Asp Asp Leu Arg Asn Leu Cys Leu Phe
        755                 760                 765             
Ser Tyr Arg His Leu Arg Asp Phe Ile Leu Ile Ala Ala Arg Ile Val
    770                 775                 780                 
Asp Arg Gly Leu Lys Arg Gly Trp Glu Ala Leu Lys Tyr Leu Gly Asn
785                 790                 795                 800 
Leu Thr Gln Tyr Trp Gly Leu Glu Leu Lys Asn Ser Ala Ile Ser Leu
                805                 810                 815     
Leu Asn Thr Thr Ala Ile Val Val Ala Glu Gly Thr Asp Arg Ile Ile
            820                 825                 830         
Glu Ala Leu Gln Arg Ala Gly Arg Ala Val Leu Asn Ile Pro Arg Arg
        835                 840                 845             
Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu
    850                 855             


<210> 22
<211> 2577
<212> DNA
<213> HIV-1

<400> 22
atgagagtga gggggatgca gaggagctgg cagcacttgg ggaagtgggg ccttttattc 60
ctgggaatat taataatctg taatgctaca gaaaacttgt gggtcacagt ctattatggg 120
gtacctgtgt ggaaagaagc aaccactact ctattctgtg catcagatgc taaagcatat 180
gaaagtgagg tgcataatgt ctgggccaca catgcctgtg tacccacaga tcccgatcca 240
caagaaataa agctaaatgt aacagaaaat tttgatatgt ggaaaaataa catggtagaa 300
caaatgcata cagatataat tagtttatgg gatcaaagcc taaagccatg tgtgaagtta 360
accccactct gtgttacttt ggattgtact gatgtcccta tcaactccac cagcaccctg 420
aaggaagacg aaggggcaat aaaaaactgc tctttcaata tgaccacaga agtaagagat 480
aaacagcaga aagtacaggc acttttttat aaacttgata tggtaccaat cagtgatgac 540
agtactcgtg atagcaatgt cagtagtaat ggcactagac aatacaggct catacattgt 600
aatacttcaa ccattacaca ggcttgtcca aagatatctt gggatccaat tcccatacat 660
tattgtgctc cagctggtta tgcgattcta aagtgttatg atacagaatt caatgggacg 720
gggccatgca ggaatgtcag cacagtgcaa tgtacacatg gaattaaacc agtggtatcc 780
actcaattgt tgttaaatgg cagcctagca ggaccagaga taataatcag gtctcaaaat 840
atttcagata atgcaaaaac cataatagta catcttagtg agtctgtatg gattaattgt 900
acaagaccca ataacaatac aagaaaaagt atacatatag gaccaggacg tgcatttcat 960
acaacagaca gaatagtagg agacatcaga aaagcacatt gtaacgttag tagaggagaa 1020
tggagtaaaa cattaggaca ggtaaggaaa gcgttagggt ctcatttccc taatggaaca 1080
acaataaaat ttaactcatc ctcaggaggg gacccagaag ttacaatgca tatgtttaat 1140
tgtagaggag aatttttcta ctgcaataca tcaagattgt ttaatgacac agaatttttc 1200
aatgacacag attccaatag cactgaccct atcactctcc catgtcgaat aagacaaatt 1260
gtaaacatgt ggcaggaagt aggaaaagca atgtatgccg ctcccattgc aggaagcatt 1320
acctgtaact caactattac aggcttacta ttgacaagag atggtggtaa taatgctaat 1380
aagactgaaa atcggactga aaccttcaga cctgggggag gaaatatgaa agacaattgg 1440
agaaatgaat tatataaata taaagtagta gaaattgaac cactaggagt agcacccacc 1500
agggcaaaaa gacaagtggt gaagagagaa aaaagagcag tgggaatgct aggagctatg 1560
ttccttgggt tcttgggagc agcaggaagc actatgggcg cagcgtcaat aacgctgacg 1620
gtacaggcca gacaattatt gtctggaata gtgcaacagc agaacaatct gctgagggct 1680
attgaagcgc aacagcatct gttgcagctc acagtctggg gcattaaaca gctccaggca 1740
agagtcctgg ctgtggaaag atacctacag gatcaacagt tcctagggat ttggggctgc 1800
tctggaaaac tcatctgcac cactactgtg ccctggaact ctagttggag taataaaact 1860
tataatgaaa tttgggataa gatgacctgg atgcaatggg aaaaggagat tagcaattac 1920
tcaacagaaa tatacaggtt aattgaagaa tcgcagtacc agcaggaaaa gaatgaacaa 1980
gaattactgt cattggacaa atgggcaagt ctgtggaatt ggtttgacat atcaaattgg 2040
ctatggtata taaaaatatt cataatgata gtaggaggct taataggctt aagaatagtt 2100
tttgctgtgc tttctatagt aaatagagtt aggaagggat actcaccttt gtcatttcag 2160
acccatatcc caagcccaag ggaaccagac aggcccgaag gaatcgaaga aggaggtgga 2220
gagcaagaca aagacagatc cgtgagatta gtgagcggat tcttgagtct tgtctgggac 2280
gacctgcgga acctgtgcct cttcagctac cgccacttga gagacttcat attaattgca 2340
gcgaggattg tggacagggg actgaagagg gggtgggaag ccctcaaata cctggggaat 2400
ctcacacagt attggggtct ggaactaaag aatagtgcta ttagcttgct taacaccaca 2460
gcaatagtag tagctgaggg gacagataga attatagaag ctttgcaaag agctggtaga 2520
gctgttctca acatacctag aagaataaga cagggcttgg aaagagcttt gctttaa    2577

<210> 23
<211> 849
<212> PRT
<213> HIV-1

<400> 23
Met Arg Val Arg Glu Met Gln Arg Asn Trp Gln His Leu Gly Lys Trp
 1               5                  10                  15      
Gly Leu Leu Phe Leu Gly Ile Leu Ile Ile Cys Asn Ala Ala Asp Asn
            20                  25                  30          
Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr
        35                  40                  45              
Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Arg Glu Ile
    50                  55                  60                  
His Asn Val Trp Ala Thr Tyr Ala Cys Val Pro Thr Asp Pro Asn Pro
65                  70                  75                  80  
Gln Glu Leu Val Leu Gly Asn Val Thr Glu Asn Phe Asn Met Trp Lys
                85                  90                  95      
Asn Asn Met Val Asp Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp
            100                 105                 110         
Gln Ser Leu Lys Pro Cys Val Gln Ile Thr Pro Leu Cys Val Thr Leu
        115                 120                 125             
Asn Cys Thr Asp Val Pro Val Asn Ile Thr Asn Gly Asn Ser Thr Leu
    130                 135                 140                 
Asp Asn Ile Thr Leu Glu Glu Gln Gly Glu Ile Lys Asn Cys Ser Phe
145                 150                 155                 160 
Asn Ile Thr Thr Glu Ile Asn Asp Ile Lys Lys Lys Glu Ser Ala Ile
                165                 170                 175     
Phe Tyr Arg Leu Asp Val Val Pro Ile Asn Asn Ser Thr Ser Glu Tyr
            180                 185                 190         
Arg Leu Leu Ser Cys Asn Thr Ser Thr Val Thr Gln Ala Cys Pro Lys
        195                 200                 205             
Val Ser Phe Asp Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe
    210                 215                 220                 
Ala Ile Leu Lys Cys Asn Asp Lys Glu Phe Asn Gly Thr Gly Leu Cys
225                 230                 235                 240 
Arg Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro Val Val
                245                 250                 255     
Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Gly Asp Ile Val
            260                 265                 270         
Ile Arg Ser Glu Asn Ile Ser Asp Asn Ala Lys Thr Ile Ile Val Gln
        275                 280                 285             
Phe Asn Arg Ser Val Ala Ile Asn Cys Thr Arg Pro Thr Asn Ile Thr
    290                 295                 300                 
Arg Arg Ser Met Arg Ile Gly Pro Gly Arg Val Phe Tyr Ala Thr Gly
305                 310                 315                 320 
Thr Val Leu Gly Asp Ile Arg Lys Ala Tyr Cys Thr Ile Asn Gly Thr
                325                 330                 335     
Leu Trp Asn Lys Thr Leu Glu Gly Val Ala Lys Glu Val Gln Ser His
            340                 345                 350         
Leu Asn Lys Ser Ile Thr Phe Ala Pro Ser Ser Gly Gly Asp Leu Glu
        355                 360                 365             
Val Thr Thr His Ser Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys Asn
    370                 375                 380                 
Thr Val Ala Leu Phe Asn Ala Thr Asn Met Thr Asn Ala Met Asn Arg
385                 390                 395                 400 
Ser Asn Gly Ile Ile Thr Leu Pro Cys Arg Ile Arg Gln Ile Val Asn
                405                 410                 415     
Met Trp Gln Arg Val Gly Arg Ala Met Tyr Ala Ala Pro Ile Ala Gly
            420                 425                 430         
Gln Ile Gln Cys Asn Ser Ser Ile Thr Gly Leu Ile Leu Thr Arg Asp
        435                 440                 445             
Gly Gly Lys Asn Asn Thr Asn Asn Asp Thr Leu Arg Pro Gly Gly Gly
    450                 455                 460                 
Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val
465                 470                 475                 480 
Lys Ile Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Gln Val
                485                 490                 495     
Val Lys Arg Glu Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Leu
            500                 505                 510         
Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Met
        515                 520                 525             
Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser Gly Ile Val Gln Gln
    530                 535                 540                 
Gln Asn Asn Leu Leu Lys Ala Ile Glu Ala Gln Gln His Leu Leu Gln
545                 550                 555                 560 
Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Ile Leu Ala Val
                565                 570                 575     
Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser
            580                 585                 590         
Gly Lys Leu Ile Cys Thr Thr Asn Val Pro Trp Asn Ser Ser Trp Ser
        595                 600                 605             
Asn Lys Ser Gln Asn Glu Ile Trp Glu Asn Met Thr Trp Met Gln Trp
    610                 615                 620                 
Glu Lys Glu Ile Ser Asn Tyr Thr Gly Thr Ile Tyr Lys Leu Ile Glu
625                 630                 635                 640 
Asn Ala Gln Asn Gln Gln Glu Lys Asn Glu Gln Asp Leu Leu Ala Leu
                645                 650                 655     
Asp Lys Trp Asp Asn Leu Trp Ser Trp Phe Thr Ile Thr Asn Trp Leu
            660                 665                 670         
Trp Tyr Ile Lys Leu Phe Ile Met Ile Val Gly Gly Leu Ile Gly Leu
        675                 680                 685             
Arg Ile Val Phe Ala Val Leu Ala Val Ile Asn Arg Val Arg Gln Gly
    690                 695                 700                 
Tyr Ser Pro Leu Ser Leu Gln Thr Leu Thr Pro Ser Arg Arg Glu Pro
705                 710                 715                 720 
Glu Arg Pro Gly Gly Ile Glu Glu Glu Gly Gly Glu Gln Asp Lys Thr
                725                 730                 735     
Arg Ser Val Arg Leu Val Ser Gly Phe Leu Ala Leu Ala Trp Asp Asp
            740                 745                 750         
Leu Arg Ser Leu Cys Leu Phe Ser Tyr Arg His Leu Arg Asp Phe Ile
        755                 760                 765             
Leu Ile Ala Ala Arg Thr Val Asn Lys Gly Leu Ile Arg Gly Trp Glu
    770                 775                 780                 
Ile Leu Lys Tyr Leu Gly Asn Leu Ala Gln Tyr Trp Gly Arg Glu Ile
785                 790                 795                 800 
Lys Asn Ser Ala Ile Asp Leu Leu Asn Thr Thr Ala Ile Val Val Ala
                805                 810                 815     
Glu Gly Thr Asp Arg Ile Ile Glu Val Leu Gln Arg Ala Gly Arg Ala
            820                 825                 830         
Ile Leu His Ile Pro Arg Arg Ile Arg Gln Gly Ala Glu Arg Ala Leu
        835                 840                 845             
Leu
    


<210> 24
<211> 2550
<212> DNA
<213> HIV-1

<400> 24
atgagagtga gggagatgca gaggaattgg cagcacttgg ggaaatgggg ccttttattc 60
ctggggatat taataatctg taatgctgca gataacttgt gggtcacagt ctattatgga 120
gtacctgtgt ggaaagaagc aaccactact ctattttgtg catcagatgc taaagcatat 180
gaaagagaga tacataatgt ctgggctaca tatgcctgtg tacctacaga ccccaaccca 240
caagaattag ttctgggaaa tgtaacagaa aattttaaca tgtggaaaaa taacatggta 300
gaccagatgc atgaagatat aatcagttta tgggatcaaa gcctaaagcc atgtgtacag 360
ataaccccac tctgtgtaac cttaaattgt actgatgttc ctgttaacat cactaatggg 420
aacagcaccc tggataacat caccctggaa gaacaagggg aaataaaaaa ctgttctttc 480
aatatcacca cagagataaa cgatattaag aaaaaagaat ctgcaatttt ttataggctt 540
gatgtagtac caatcaataa tagtactagt gaatataggc tactaagttg taatacctca 600
accgttacac aggcttgtcc aaaggtgtcc tttgatccaa ttccaataca ttattgtgct 660
cctgctggtt ttgcgattct aaagtgtaat gataaagagt tcaatgggac agggttatgt 720
aggaatgtca gcacagtaca atgtacacat ggaattaaac cagtagtgtc aactcaacta 780
ctgttaaatg gcagcctagc agaaggagat atagtaatta gatctgaaaa tatctcagat 840
aatgcaaaaa ccataatagt acagtttaat agatctgtag caattaactg tacaagaccc 900
accaacatta caagaagaag tatgcgtata ggaccaggac gagtatttta tgcaacaggt 960
accgtactag gagatataag aaaggcatat tgtaccatta atggaacact gtggaataaa 1020
actttagaag gagtagctaa agaggtccaa agccacctta ataaatcaat aacatttgcg 1080
ccatcatcag gaggggacct agaagttaca acacatagtt ttaattgtag aggagagttt 1140
ttctactgca acacagtagc tctgtttaat gcaactaaca tgactaatgc aatgaacagg 1200
tccaatggca ttatcactct tccatgtaga ataagacaaa ttgtaaacat gtggcaaaga 1260
gtaggacgag caatgtatgc cgctcccatt gctggacaaa ttcagtgtaa ctcaagcatc 1320
acaggtctaa tattgacaag agatggtggg aaaaataata ccaataatga caccctcaga 1380
cctggagggg gagatatgag agacaattgg agaagtgaac tgtataaata taaggtagta 1440
aaaattgaac cactaggagt agcacccacc aaggcaaaaa gacaagtggt gaagagagaa 1500
agagaaaaaa gagcagtggg aataggagct gtgctccttg ggttcttggg agcagcagga 1560
agcactatgg gcgcggcgtc aatgacgctg acggtacagg ccagacaatt attgtctggt 1620
atagtgcaac agcaaaacaa tttgctgaag gctatagaag cgcaacagca tctgttgcag 1680
ctcacagtct ggggcattaa acagctccag gcgagaatcc tggctgtgga aagataccta 1740
aaggaccaac agctcctagg gatttgggga tgctctggaa aactcatctg caccactaat 1800
gtgccctgga attctagttg gagtaataaa tctcagaatg aaatttggga gaacatgacc 1860
tggatgcagt gggaaaaaga gatcagtaat tacacaggca caatatacaa attaatagaa 1920
aatgcacaaa accagcagga aaagaatgaa caggacttat tggcattgga caaatgggac 1980
aatctgtgga gttggtttac tataacaaat tggttgtggt acataaaatt attcataatg 2040
atagtaggag gcttgatagg attaagaata gtttttgctg tgcttgctgt aataaatagg 2100
gttaggcagg gatactcacc tttgtcgtta cagaccctta ccccaagccg gagggaaccc 2160
gaacggcccg gaggaatcga agaagaaggt ggagagcaag acaaaaccag atccgtcaga 2220
ttagtgagcg gattcttagc acttgcctgg gacgacctac ggagcctgtg cctcttcagc 2280
taccgccact tgagagactt catattaatt gcagcgagga ctgtgaacaa gggactaata 2340
agggggtggg aaatcctcaa atatctgggg aatctcgcgc agtattgggg ccgggaaata 2400
aagaatagtg ctattgatct gcttaatacc acagcaatag tggtagctga agggacagat 2460
agaatcatag aagttctgca aagagctggt agagctattc tccacatacc tagaagaata 2520
agacagggtg cagaaagggc tttgctataa                                  2550

<210> 25
<211> 863
<212> PRT
<213> HIV-1

<400> 25
Met Arg Val Lys Gly Ile Gln Thr Asn Trp Gln His Leu Trp Lys Trp
 1               5                  10                  15      
Gly Thr Leu Ile Leu Gly Leu Val Ile Ile Cys Ser Ala Ser Asp Lys
            20                  25                  30          
Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Glu Asp Ala Asp
        35                  40                  45              
Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ser Tyr Ser Ser Glu Lys
    50                  55                  60                  
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro
65                  70                  75                  80  
Gln Glu Ile Val Met Lys Asn Val Thr Glu Tyr Phe Asn Met Trp Lys
                85                  90                  95      
Asn Asn Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp
            100                 105                 110         
Glu Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu
        115                 120                 125             
Asn Cys Thr Asp Ile Val Tyr Ser Asn Cys Thr Arg Lys His Pro Asn
    130                 135                 140                 
Gly Thr Val Asp Tyr Asn Ser Thr Val Asp Asn Ser Ser Cys Glu Ile
145                 150                 155                 160 
Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Glu Leu Arg Asp Lys Ser
                165                 170                 175     
Lys Lys Glu Tyr Ala Leu Phe Tyr Arg Pro Asp Val Val Pro Ile Asp
            180                 185                 190         
Gly Asn Asp Asn Ser Thr Tyr Ser Asp Tyr Arg Leu Ile Asn Cys Asn
        195                 200                 205             
Val Ser Ser Ile Lys Gln Ala Cys Pro Lys Val Ile Phe Asp Pro Ile
    210                 215                 220                 
Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Gly
225                 230                 235                 240 
Asp Lys Lys Val Gln Ile Gly Thr Gly Thr Tyr Val Thr Asn Val Ser
                245                 250                 255     
Thr Val Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu
            260                 265                 270         
Leu Leu Asn Gly Ser Val Thr Glu Glu Glu Ile Ile Ile Gln Ser Glu
        275                 280                 285             
Asn Val Thr Asp Asn Ser Lys Val Ile Ile Val Gln Phe Asn Asp Thr
    290                 295                 300                 
Val Glu Ile Asn Cys Thr Arg Pro Gly Asn Asn Thr Arg Arg Ser Ile
305                 310                 315                 320 
Arg Phe Gly Pro Gly Gln Ala Phe Tyr Ala Thr Gly Asp Ile Ile Gly
                325                 330                 335     
Asn Ile Arg Gln Ala His Cys Ile Val Asn Gly Glu Lys Trp Asn Gln
            340                 345                 350         
Val Ile Gln Lys Val Lys Thr His Leu Glu Glu Ile Tyr Asn Lys Thr
        355                 360                 365             
Ile Ile Phe Ser Ser Ser Ala Gly Gly Asp Leu Glu Ile Thr Thr His
    370                 375                 380                 
Ser Phe Asn Cys Arg Gly Glu Phe Phe Tyr Cys Asn Thr Ser Arg Leu
385                 390                 395                 400 
Phe Asn Asn Glu Thr Gly Asn Ser Thr Asn Thr Asn Ile His Ser His
                405                 410                 415     
Val Asn Lys Gln Ile Val Glu Cys Gly Arg Glu Trp Asp Lys Gln Cys
            420                 425                 430         
Met Pro Leu Pro Ser Lys Glu Lys Leu Asp Val Thr Ser Thr Ile Thr
        435                 440                 445             
Gly Leu Ile Leu Pro Arg Asp Gly Gly Asn Ser Thr Asn Thr Asn Asn
    450                 455                 460                 
Thr Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg
465                 470                 475                 480 
Ser Glu Leu Tyr Lys Tyr Lys Thr Val Lys Ile Lys Ser Leu Gly Ile
                485                 490                 495     
Ala Pro Thr Arg Ala Arg Arg Arg Val Val Glu Arg Glu Lys Arg Ala
            500                 505                 510         
Val Gly Leu Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser
        515                 520                 525             
Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val Gln Val Arg Gln Leu
    530                 535                 540                 
Leu Tyr Gly Ile Val Gln Gln Gln Ser Asn Leu Leu Arg Ala Ile Glu
545                 550                 555                 560 
Ala Gln Gln His Met Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu
                565                 570                 575     
Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Arg Asp Gln Gln Leu
            580                 585                 590         
Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Asn Val
        595                 600                 605             
Pro Trp Asn Thr Ser Trp Ser Asn Lys Thr Tyr Asn Glu Ile Trp Asp
    610                 615                 620                 
Asn Met Thr Trp Ile Gln Trp Glu Lys Glu Ile Asn Asn Tyr Thr Lys
625                 630                 635                 640 
Gln Ile Tyr Ser Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn
                645                 650                 655     
Glu Gln Glu Leu Leu Thr Leu Asp Lys Trp Ala Ser Leu Trp Ser Trp
            660                 665                 670         
Phe Asp Ile Thr Lys Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile
        675                 680                 685             
Val Gly Gly Leu Ile Gly Leu Arg Ile Val Phe Thr Val Leu Ser Ile
    690                 695                 700                 
Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr Leu
705                 710                 715                 720 
Thr His Gln Gln Arg Glu Pro Asp Arg Leu Glu Arg Thr Glu Glu Gly
                725                 730                 735     
Gly Gly Glu Gln Asp Arg Asp Arg Ser Val Arg Leu Val Ser Gly Phe
            740                 745                 750         
Leu Ala Leu Ala Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe Ser Tyr
        755                 760                 765             
His Arg Leu Arg Asp Phe Ala Leu Ile Ala Ala Arg Thr Val Glu Leu
    770                 775                 780                 
Leu Gly Arg Ser Ser Leu Lys Gly Leu Arg Leu Gly Trp Glu Gly Leu
785                 790                 795                 800 
Lys Tyr Leu Trp Asn Leu Leu Leu Tyr Trp Val Gln Glu Leu Lys Asn
                805                 810                 815     
Ser Ala Ile Ser Leu Leu Asp Thr Val Ala Ile Ala Val Ala Asn Trp
            820                 825                 830         
Thr Asp Arg Val Ile Glu Val Ile Gln Arg Ala Gly Arg Ala Ile Leu
        835                 840                 845             
Asn Ile Pro Thr Arg Ile Arg Gln Gly Phe Glu Arg Ala Leu Leu
    850                 855                 860             


<210> 26
<211> 2592
<212> DNA
<213> HIV-1

<400> 26
atgagagtga aggggataca gacgaattgg cagcacttgt ggaaatgggg gactttgatc 60
cttgggttgg tgataatctg tagtgcctca gacaaattgt gggtcacagt ctattatggg 120
gtacctgtgt gggaagatgc agataccact ctattctgtg catctgatgc taaatcatat 180
agttctgaaa aacataatgt ctgggctaca catgcctgtg tacccacaga ccccaatccg 240
caagaaatag ttatgaaaaa tgtaactgaa tattttaaca tgtggaaaaa taacatggta 300
gaacagatgc atgaagatat aatcagtcta tgggatgaaa gcctaaagcc atgtgtaaag 360
ctaacccctc tctgtgttac tctaaactgt actgacattg tctattctaa ttgtactaga 420
aagcatccca atggcactgt ggattacaat agcactgtgg ataacagtag ctgtgaaata 480
ataaaaaact gctctttcaa tataactaca gaactaagag acaaaagtaa gaaagagtac 540
gcgctcttct ataggcctga tgtagtacca attgatggta atgataatag tacttatagt 600
gattataggc taataaattg taatgtctca tctatcaagc aggcttgtcc aaaggtaatt 660
tttgacccaa ttcccataca ctattgtgct ccagctggtt ttgcgatttt aaaatgtggg 720
gataagaaag ttcaaattgg aacagggact tatgtaacca atgttagtac agtacaatgc 780
acacatggaa ttaagccagt ggtatcaacc caactactgc tgaatggcag tgtgacagag 840
gaagaaataa taattcaatc tgaaaatgtc acagacaata gtaaagtcat aatagtacag 900
tttaatgata ctgtagaaat taattgtacc agacccggca acaatacaag aagaagtata 960
agattcggac caggacaagc attctatgca acaggtgaca taataggaaa cataagacaa 1020
gcacattgta ttgttaatgg agaaaaatgg aatcaggtga tacaaaaggt caaaacacat 1080
ctagaggaaa tctataataa gaccataatc tttagctcat ccgcaggagg ggacctagaa 1140
attacaacac atagtttcaa ttgtagagga gaattcttct attgtaatac atcaaggctg 1200
tttaataatg aaactgggaa tagcacaaac acaaatatcc actcccatgt aaataaacaa 1260
attgtagaat gtggcagaga gtgggacaag caatgtatgc ccctcccatc gaaggagaaa 1320
ttagatgtga cctcaacaat tacaggacta atattaccaa gagatggtgg gaacagtacc 1380
aatactaata atacagagac cttcagacct ggaggaggag atatgaggga caattggaga 1440
agtgaattat ataagtacaa aacagtaaaa atcaaatcac taggaatagc acccaccagg 1500
gcaaggagac gagtggtgga gagagaaaaa agagcagttg gactgggagc tgtcttcctt 1560
gggttcttag gagcagcagg aagcactatg ggcgcggcgt caataacgct gacggtacag 1620
gtcaggcaat tattgtacgg catagtgcaa cagcaaagca atttgctgag ggctatagag 1680
gcgcaacagc atatgttgca actcacagtc tggggcatta aacagctcca ggcaagagtc 1740
ctagctgtgg aaagatacct aagggatcaa cagctcctag ggatttgggg ctgctctgga 1800
aaactcatct gcaccactaa tgtaccctgg aacactagtt ggagtaataa aacttataat 1860
gagatttggg ataacatgac ttggatacaa tgggaaaagg aaattaacaa ttacacaaaa 1920
caaatataca gcctaattga agaatcgcag aaccagcagg aaaagaatga acaagaatta 1980
ttaacattgg acaaatgggc aagtttgtgg agttggtttg acataacaaa atggctatgg 2040
tatataaaaa tattcataat gatagtagga ggtttaatag gtttaagaat agtttttact 2100
gtgctttcta tagtaaatag agttaggcag gggtactcac ctttgtcatt ccagaccctt 2160
acccatcagc agagggaacc agacaggctc gaaagaaccg aagaaggagg tggcgagcaa 2220
gacagagaca ggtccgtgcg cttagtgagc ggtttcttag cgcttgcctg ggacgaccta 2280
cggagcctgt gcctcttcag ctaccaccgc ttgagagact ttgccttgat tgcagcgagg 2340
acagtggaac ttctgggacg cagcagtctc aagggactga gactggggtg ggaaggcctc 2400
aaatatctgt ggaatctcct gttgtattgg gttcaggaac taaagaatag tgctattagt 2460
ttgcttgata cagtagcaat agcagtagct aactggacag atagggttat agaagtaata 2520
caaagagctg gcagagctat tcttaatata cctacaagga taagacaagg ctttgaaaga 2580
gctttgctat aa                                                     2592

<210> 27
<211> 852
<212> PRT
<213> HIV-1

<220> 
<221> VARIANT        
<222> (0)...(0)
<223> Xaa = any amino acid

<400> 27
Met Glu Thr Gln Arg Asn Tyr Pro His Leu Leu Ser Gly Gly Ile Leu
 1               5                  10                  15      
Ile Leu Gly Met Leu Leu Met Cys Ser Thr Gly Glu Asn Leu Trp Val
            20                  25                  30          
Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Lys Thr Thr Leu
        35                  40                  45              
Phe Cys Ala Ser Asp Ala Lys Ala Tyr Ser Thr Glu Lys His Asn Val
    50                  55                  60                  
Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Ser Pro Gln Glu Met
65                  70                  75                  80  
Val Leu Glu Asn Val Thr Glu Asn Phe Asn Met Trp Glu Asn Asn Met
                85                  90                  95      
Val Asp Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu
            100                 105                 110         
Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu Glu Cys Gly
        115                 120                 125             
Glu Val Gly Asn Lys Thr Ser Asn Ser Asn Ser Ser Ala Ser Leu Glu
    130                 135                 140                 
Gly Glu Gln Lys Leu Thr Asn Cys Ser Phe Asn Val Thr Thr Ala Ile
145                 150                 155                 160 
Arg Asp Arg Lys Lys Lys Val Gln Ala Leu Phe Tyr Arg Ile Asp Val
                165                 170                 175     
Val Pro Ile Asn Asp Glu Asp Tyr Asn Gly Thr Asn Gly Thr Asn Arg
            180                 185                 190         
Thr Leu Tyr Arg Leu Ile Asn Cys Asn Thr Ser Val Ile Thr Gln Ala
        195                 200                 205             
Cys Pro Lys Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro
    210                 215                 220                 
Ala Gly Phe Ala Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr
225                 230                 235                 240 
Gly Pro Cys Thr Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys
                245                 250                 255     
Pro Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Gly
            260                 265                 270         
Gln Val Ile Ile Arg Ser Lys Asn Ile Thr Asp Asn Thr Lys Asn Ile
        275                 280                 285             
Ile Val Gln Phe Asn Glu Ser Val Gln Ile Asn Cys Thr Arg Pro Asn
    290                 295                 300                 
Asn Asn Thr Arg Lys Gly Ile His Ile Gly Pro Gly Gln Ala Phe Tyr
305                 310                 315                 320 
Val Thr Gly Glu Val Ile Gly Asn Ile Arg Gln Ala His Cys Asn Ile
                325                 330                 335     
Ser Gly Thr Gln Trp Asn Lys Thr Leu Tyr Asn Val Val Asn Gln Leu
            340                 345                 350         
Arg Lys His Phe Asn Lys Thr Ile Ile Phe Glu Pro Ser Ser Gly Gly
        355                 360                 365             
Asp Ile Glu Ile Thr Ser His Thr Phe Asn Cys Gly Gly Glu Phe Phe
    370                 375                 380                 
Tyr Cys Asn Thr Ser Arg Leu Phe Asn Ser Thr Trp Asn Ser Asn Asp
385                 390                 395                 400 
Thr Gly Asn Gly Thr Asp Asn Ser Thr Ile Thr Leu Pro Cys Lys Ile
                405                 410                 415     
Lys Gln Ile Ile Asn Met Trp Gln Arg Val Gly Gln Ala Met Tyr Ala
            420                 425                 430         
Pro Pro Ile Gln Gly Asn Ile Thr Cys Val Ser Asn Ile Thr Gly Leu
        435                 440                 445             
Ile Leu Thr Leu Asp Arg Tyr Val Asp Asn Gly Thr Asn Val Thr Leu
    450                 455                 460                 
Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr
465                 470                 475                 480 
Lys Tyr Lys Val Val Gln Ile Glu Pro Leu Gly Ile Ala Pro Thr Lys
                485                 490                 495     
Ala Arg Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly Met Gly
            500                 505                 510         
Ala Phe Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala
        515                 520                 525             
Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser Gly Ile
    530                 535                 540                 
Val Gln Gln Gln Ser Asn Leu Leu Arg Ala Ile Gln Ala Gln Gln His
545                 550                 555                 560 
Met Leu Gln Leu Thr Val Trp Gly Val Lys Gln Leu Gln Ala Arg Val
                565                 570                 575     
Leu Ala Val Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly Ile Trp
            580                 585                 590         
Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Asn Val Pro Trp Asn Ser
        595                 600                 605             
Ser Trp Ser Asn Lys Ser Tyr Xaa Xaa Ile Trp Xaa Asn Met Thr Trp
    610                 615                 620                 
Met Glu Trp Glu Arg Gln Ile Asp Asn Tyr Thr Xaa Glu Ile Tyr Xaa
625                 630                 635                 640 
Leu Leu Glu Ile Ser Gln Asn Gln Gln Glu Lys Asn Glu Gln Asp Leu
                645                 650                 655     
Leu Ser Leu Asp Lys Trp Ser Ser Leu Trp Asn Trp Phe Asp Ile Ser
            660                 665                 670         
Xaa Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Val Ile Gly Gly Leu
        675                 680                 685             
Ile Gly Leu Arg Ile Val Phe Thr Val Leu Ser Ile Ile Asn Ser Val
    690                 695                 700                 
Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr Leu Thr Pro Asn Ala
705                 710                 715                 720 
Arg Arg Pro Asp Arg Pro Glu Gly Ile Glu Glu Glu Gly Gly Glu Gln
                725                 730                 735     
Gly Arg Asp Arg Ser Thr Pro Leu Val Ser Gly Phe Phe Ile Leu Val
            740                 745                 750         
Trp Gln Asp Leu Trp Asn Leu Cys Leu Phe Ser Tyr Arg Arg Leu Arg
        755                 760                 765             
Asp Leu Leu Leu Ile Val Ala Arg Thr Val Glu Leu Leu Gly Arg Arg
    770                 775                 780                 
Gly Trp Glu Ala Leu Lys Tyr Leu Trp Asn Leu Leu Gln Tyr Trp Gly
785                 790                 795                 800 
Gln Glu Leu Lys Asn Ser Ala Val Asn Leu Leu Asn Thr Thr Ala Ile
                805                 810                 815     
Val Val Ala Glu Gly Thr Asp Arg Ile Ile Glu Leu Val Gln Arg Ala
            820                 825                 830         
Gly Arg Ala Ile Ile His Ile Pro Arg Arg Ile Arg Gln Gly Phe Glu
        835                 840                 845             
Arg Ala Leu Leu
    850         


<210> 28
<211> 790
<212> PRT
<213> HIV-1

<220> 
<221> VARIANT        
<222> (0)...(0)
<223> Xaa = any amino acid

<400> 28
Met Glu Thr Gln Lys Asn Trp Gln Thr Leu Trp Arg Gly Gly Leu Met
 1               5                  10                  15      
Ile Phe Gly Met Leu Met Ile Cys Lys Ala Lys Glu Asp Leu Trp Val
            20                  25                  30          
Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Asp Ala Lys Thr Thr Leu
        35                  40                  45              
Phe Cys Ala Ser Asp Ala Lys Ala Tyr Ser Thr Glu Lys His Asn Val
    50                  55                  60                  
Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Ser Pro Gln Glu Met
65                  70                  75                  80  
Asn Leu Pro Asn Val Thr Glu Asn Phe Asn Met Trp Lys Asn Asp Met
                85                  90                  95      
Val Asp Gln Met Gln Glu Asp Ile Ile Ser Val Trp Asp Glu Ser Leu
            100                 105                 110         
Lys Pro Cys Val Lys Ile Thr Pro Leu Cys Val Thr Leu Asn Cys Ser
        115                 120                 125             
Asn Ile Thr Ser Asn Ser Asn Thr Thr Ser Asn Ser Ser Val Ser Ser
    130                 135                 140                 
Pro Asp Ile Met Thr Asn Cys Ser Phe Asn Ile Thr Thr Glu Ile Arg
145                 150                 155                 160 
Asn Lys Arg Lys Gln Glu Tyr Ala Leu Phe Tyr Arg Gln Asp Val Val
                165                 170                 175     
Pro Ile Asp Ser Asn Asn Lys Asn Tyr Ile Leu Ile Asn Cys Asn Thr
            180                 185                 190         
Ser Val Ile Lys Gln Ala Cys Pro Lys Val Ser Phe Gln Pro Ile Pro
        195                 200                 205             
Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn Asp
    210                 215                 220                 
Lys Asn Phe Asn Gly Thr Gly Ser Cys Lys Asn Val Ser Thr Val Gln
225                 230                 235                 240 
Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu Asn
                245                 250                 255     
Gly Ser Ile Ala Glu Gly Asp Ile Ile Ile Arg Ser Glu Asn Ile Ser
            260                 265                 270         
Asp Asn Ala Lys Asn Ile Ile Val Gln Leu Asn Lys Thr Val Glu Ile
        275                 280                 285             
Val Cys Tyr Arg Pro Asn Asn Asn Thr Arg Lys Gly Ile His Met Gly
    290                 295                 300                 
Pro Gly Gln Val Leu Tyr Ala Thr Gly Glu Ile Ile Gly Asn Ile Arg
305                 310                 315                 320 
Glu Thr His Cys Asn Ile Ser Glu Arg Asp Trp Ser Asn Thr Leu Arg
                325                 330                 335     
Arg Val Ala Thr Lys Leu Arg Glu His Phe Asn Lys Thr Ile Asn Phe
            340                 345                 350         
Thr Ser Pro Ser Gly Gly Asp Ile Glu Ile Val Thr His Ser Phe Asn
        355                 360                 365             
Cys Gly Gly Glu Phe Leu Tyr Cys Asn Thr Ser Lys Leu Phe Asn Ser
    370                 375                 380                 
Ser Trp Asp Lys Asn Ser Ile Glu Ala Thr Asn Asp Thr Ser Xaa Ala
385                 390                 395                 400 
Thr Ile Thr Ile Pro Cys Lys Ile Lys Gln Ile Val Arg Met Trp Gln
                405                 410                 415     
Arg Thr Gly Gln Ala Ile Tyr Ala Pro Pro Ile Ala Gly Asn Ile Thr
            420                 425                 430         
Cys Thr Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn
        435                 440                 445             
Arg Gly Asn Gly Ser Glu Asn Gly Thr Glu Thr Phe Arg Pro Thr Gly
    450                 455                 460                 
Gly Asn Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val
465                 470                 475                 480 
Val Glu Ile Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg
                485                 490                 495     
Val Val Glu Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu
            500                 505                 510         
Gly Phe Leu Gly Thr Ala Gly Ser Thr Met Gly Ala Ala Ser Ile Thr
        515                 520                 525             
Leu Thr Val Gln Val Arg Gln Leu Leu Ser Gly Ile Val Gln Gln Gln
    530                 535                 540                 
Ser Asn Leu Leu Lys Ala Ile Glu Ala Gln Gln His Leu Leu Lys Leu
545                 550                 555                 560 
Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu
                565                 570                 575     
Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly
            580                 585                 590         
Lys Leu Ile Cys Thr Thr Asn Val Pro Trp Asn Ala Ser Trp Ser Asn
        595                 600                 605             
Lys Ser Tyr Glu Asp Ile Trp Glu Asn Met Thr Trp Ile Gln Trp Glu
    610                 615                 620                 
Gly Leu Ile Gly Leu Arg Ile Ile Phe Ala Val Leu Ala Ile Val Asn
625                 630                 635                 640 
Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr Leu Ile Pro
                645                 650                 655     
Asn Pro Thr Glu Ala Asp Arg Pro Gly Gly Ile Glu Glu Gly Gly Gly
            660                 665                 670         
Glu Gln Gly Arg Thr Arg Ser Ile Arg Leu Val Asn Gly Phe Leu Ala
        675                 680                 685             
Leu Ala Trp Asp Asp Leu Arg Asn Leu Cys Leu Phe Ser Tyr His Arg
    690                 695                 700                 
Leu Arg Asp Phe Val Leu Ile Ala Ala Arg Thr Val Gly Thr Leu Gly
705                 710                 715                 720 
Leu Arg Gly Trp Glu Ile Leu Lys Tyr Leu Val Asn Leu Val Trp Tyr
                725                 730                 735     
Trp Gly Gln Glu Leu Lys Asn Ser Ala Ile Ser Leu Leu Asn Thr Thr
            740                 745                 750         
Ala Ile Ala Val Ala Glu Gly Thr Asp Arg Ile Ile Glu Ile Ala Gln
        755                 760                 765             
Arg Ala Phe Arg Ala Ile Leu His Ile Pro Arg Arg Ile Arg Gln Gly
    770                 775                 780                 
Leu Glu Arg Ala Leu Leu
785                 790 


<210> 29
<211> 842
<212> PRT
<213> HIV-1

<400> 29
Met Arg Val Arg Gly Met Gln Arg Asn Trp Gln Thr Leu Gly Asn Trp
 1               5                  10                  15      
Gly Ile Leu Phe Leu Gly Ile Leu Ile Ile Cys Ser Asn Ala Asp Lys
            20                  25                  30          
Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr
        35                  40                  45              
Pro Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Lys Glu Val
    50                  55                  60                  
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro
65                  70                  75                  80  
Gln Glu Val Glu Met Glu Asn Val Thr Glu Asn Phe Asn Met Trp Lys
                85                  90                  95      
Asn Asn Met Val Glu Gln Met His Thr Asp Ile Ile Ser Leu Trp Asp
            100                 105                 110         
Glu Ser Leu Lys Pro Cys Val Glu Leu Thr Pro Leu Cys Val Thr Leu
        115                 120                 125             
Asn Cys Thr Asp Tyr Lys Gly Thr Asn Ser Thr Asn Asn Ala Thr Ser
    130                 135                 140                 
Thr Val Val Ser Pro Ala Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr
145                 150                 155                 160 
Thr Glu Ile Lys Asp Lys Lys Lys Lys Glu Ser Ala Leu Phe Tyr Arg
                165                 170                 175     
Leu Asp Val Leu Pro Leu Asn Gly Glu Gly Asn Asn Ser Ser Thr Glu
            180                 185                 190         
Tyr Arg Leu Ile Asn Cys Asn Thr Ser Thr Ile Thr Gln Thr Cys Pro
        195                 200                 205             
Lys Val Thr Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly
    210                 215                 220                 
Phe Ala Ile Leu Lys Cys Lys Asp Lys Arg Phe Asn Gly Thr Gly Pro
225                 230                 235                 240 
Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro Val
                245                 250                 255     
Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Ile
            260                 265                 270         
Ile Ile Arg Ser Glu Asn Ile Thr Asp Asn Thr Lys Asn Ile Ile Val
        275                 280                 285             
Gln Leu Asn Glu Thr Val Gln Ile Asn Cys Thr Arg Pro Asn Asn Asn
    290                 295                 300                 
Thr Arg Lys Ser Ile His Met Gly Pro Gly Lys Ala Phe Tyr Thr Thr
305                 310                 315                 320 
Gly Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Ile Ser Gly
                325                 330                 335     
Glu Lys Trp Asn Met Thr Leu Ser Arg Val Lys Glu Lys Leu Lys Glu
            340                 345                 350         
His Phe Lys Asn Gly Thr Ile Thr Phe Lys Pro Pro Asn Pro Gly Gly
        355                 360                 365             
Asp Pro Glu Ile Leu Thr His Met Phe Asn Cys Ala Gly Glu Phe Phe
    370                 375                 380                 
Tyr Cys Asn Thr Thr Lys Leu Phe Asn Glu Thr Gly Glu Asn Gly Thr
385                 390                 395                 400 
Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Lys
                405                 410                 415     
Val Gly Lys Ala Ile Tyr Ala Pro Pro Ile Ala Gly Ser Ile Asn Cys
            420                 425                 430         
Ser Ser Asn Ile Thr Gly Met Ile Leu Thr Arg Asp Gly Gly Asn Asn
        435                 440                 445             
Thr His Asn Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn
    450                 455                 460                 
Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Gln Ile Glu Pro Leu
465                 470                 475                 480 
Gly Ile Ala Pro Thr Arg Ala Arg Arg Arg Val Val Gln Arg Glu Lys
                485                 490                 495     
Arg Ala Val Gly Leu Gly Ala Val Phe Phe Gly Phe Leu Gly Ala Ala
            500                 505                 510         
Gly Ser Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val Gln Ala Arg
        515                 520                 525             
Gln Leu Leu Ser Gly Ile Val Gln Gln Gln Ser Asn Leu Leu Arg Ala
    530                 535                 540                 
Ile Glu Ala Gln Gln His Leu Leu Gln Leu Thr Val Trp Gly Ile Lys
545                 550                 555                 560 
Gln Leu Arg Ala Arg Ile Leu Ala Val Glu Arg Tyr Leu Lys Asp Gln
                565                 570                 575     
Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr
            580                 585                 590         
Asn Val Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser Trp Glu Glu Ile
        595                 600                 605             
Trp Asn Asn Met Thr Trp Met Glu Trp Glu Lys Glu Ile Gly Asn Tyr
    610                 615                 620                 
Ser Asp Thr Ile Tyr Lys Leu Ile Glu Glu Ser Gln Thr Gln Gln Glu
625                 630                 635                 640 
Lys Asn Glu Gln Asp Leu Leu Ala Leu Asp Lys Trp Ala Ser Leu Trp
                645                 650                 655     
Asn Trp Phe Asp Ile Thr Lys Trp Leu Trp Tyr Ile Lys Ile Phe Ile
            660                 665                 670         
Met Ile Ile Gly Gly Leu Ile Gly Leu Arg Ile Ala Phe Ala Val Leu
        675                 680                 685             
Ser Val Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln
    690                 695                 700                 
Thr Leu Ile Pro Thr Ser Arg Gly Ala Asp Arg Pro Glu Gly Ile Glu
705                 710                 715                 720 
Glu Glu Gly Gly Glu Gln Asp Lys Asn Arg Ser Val Arg Leu Val Ser
                725                 730                 735     
Gly Phe Leu Ala Leu Ala Trp Asp Asp Leu Arg Asn Leu Cys Leu Phe
            740                 745                 750         
Ser Tyr Arg Gln Leu Arg Asn Leu Ile Leu Ile Val Thr Arg Ile Leu
        755                 760                 765             
Glu Arg Gly Leu Arg Gly Gly Trp Glu Ala Leu Lys Tyr Leu Trp Asn
    770                 775                 780                 
Leu Val Gln Tyr Trp Ser Gln Glu Leu Lys Asn Ser Ala Ile Ser Leu
785                 790                 795                 800 
Leu Asn Thr Thr Ala Ile Ala Val Ala Gly Gly Thr Asp Arg Ile Ile
                805                 810                 815     
Glu Ile Gly Gln Arg Ala Phe Arg Ala Leu Leu His Ile Pro Arg Arg
            820                 825                 830         
Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu
        835                 840         


<210> 30
<211> 2553
<212> DNA
<213> HIV-1

<400> 30
gagaaagagc agaagacagt ggcaatgaga gtgaggggga tgcagaggaa ttggcagaca 60
ttggggaact ggggcatctt attcttggga atattgataa tctgtagcaa tgcagacaaa 120
ttgtgggtca cagtctatta tggggtacct gtgtggaaag aagcaacacc cactctattc 180
tgtgcatcag atgctaaagc atatgagaag gaggtacata atgtctgggc tacccatgcc 240
tgtgtaccca cggaccccaa cccacaagaa gtagagatgg aaaatgtaac agaaaacttt 300
aacatgtgga aaaataacat ggtagagcag atgcatacag atataatcag tttatgggat 360
gaaagcctaa aaccttgtgt agagttaacc cctctctgtg tcactttaaa ttgtactgat 420
tacaaaggaa ccaatagcac caataatgca actagcactg tggtaagccc agcagaaata 480
aaaaattgct ctttcaatat aaccacagaa ataaaagata agaagaaaaa ggaatctgca 540
cttttctata gacttgatgt actgccactt aatggcgagg gtaataacag tagtactgaa 600
tacaggctaa taaattgtaa tacctcaacc attacacaga cttgtccaaa agtaaccttt 660
gagccaattc ccatacatta ttgtgcccca gctggctttg cgattctaaa gtgtaaggat 720
aaaaggttca atgggacagg accatgtaaa aatgtcagca cagtacaatg tacacatgga 780
attaaaccag tggtatcaac tcaactgctg ttaaatggca gcctagcaga agaagagata 840
ataattaggt ctgaaaatat tacagataat acaaaaaaca taatagtaca gcttaatgaa 900
actgtacaaa ttaattgtac aaggccaaac aacaatacaa gaaaaagtat acatatggga 960
ccaggaaaag cattctatac aacaggtgat ataataggag atataagaca ggcacattgc 1020
aacattagtg gagaaaaatg gaacatgact ttaagcagag taaaggaaaa gctaaaagaa 1080
cattttaaga atggaacaat aacatttaaa ccaccaaacc caggaggaga cccagaaatt 1140
ctaacgcaca tgtttaattg tgcaggagaa tttttctact gcaatacaac aaaactgttt 1200
aatgagacag gggagaatgg tactatcaca ctcccatgta gaataaagca gattataaac 1260
atgtggcaga aagtgggaaa agcaatatat gcccctccca ttgcaggaag tattaactgt 1320
agctcaaata ttacaggaat gatattgaca agagatggtg gtaataatac tcataatgag 1380
accttcagac ctggaggagg agacatgagg gacaattgga gaagtgaact gtataaatat 1440
aaagtagtac agattgaacc actaggaata gcacccacca gggccaggag aagagtggtg 1500
cagagagaaa aaagagcagt aggattagga gctgtgttct ttggattctt gggagcagca 1560
ggaagcacta tgggcgcggc gtcaataacg ctgacggtac aggccagaca attattgtct 1620
ggtatagtgc aacagcaaag caatttgctg agagctatag aagcgcaaca acatctgtta 1680
cagctcacgg tctggggcat taaacagctc cgggcaagaa tcctggctgt agaaagatac 1740
ctaaaggatc aacagctcct agggatttgg ggctgctctg gaaaactcat ctgcaccact 1800
aatgtgccct ggaactctag ctggagtaat aaatcttggg aagagatttg gaacaacatg 1860
acctggatgg agtgggaaaa agagattggc aattactcag acacaatata taagttaatt 1920
gaagaatcac aaacccagca ggaaaagaat gaacaagatt tattggcatt ggacaaatgg 1980
gcaagtctgt ggaattggtt tgacataaca aaatggctat ggtatataaa aatattcata 2040
atgataatag gaggcttgat aggtttaaga atagcttttg ctgtgctttc tgtagtaaat 2100
agagtcaggc agggatactc acctttgtca tttcagaccc ttatcccaac ctcgagggga 2160
gcagacagac ccgaaggaat cgaagaagaa ggtggagagc aagacaaaaa cagatcagtt 2220
cgattagtga gcggcttctt agcgcttgcc tgggacgatc tgcggaacct gtgcctcttc 2280
agctaccgcc aattgagaaa cttaatctta attgtgacga ggatcctgga aaggggactg 2340
agggggggtt gggaagccct caaatatctg tggaaccttg tacagtattg gagtcaggaa 2400
ctaaagaata gtgccattag cttgcttaat accacagcaa tagcagtagc tggaggaaca 2460
gatagaatta tagaaatagg acaaagagct tttagagctt tacttcacat acctagaaga 2520
ataagacagg gtctcgaaag agctttacta taa                              2553

<210> 31
<211> 842
<212> PRT
<213> HIV-1

<400> 31
Met Gly Met Lys Ser Gly Trp Leu Leu Phe Tyr Leu Leu Val Ser Leu
 1               5                  10                  15      
Ile Lys Val Ile Gly Ser Glu Gln His Trp Val Thr Val Tyr Tyr Gly
            20                  25                  30          
Val Pro Val Trp Arg Glu Ala Glu Thr Thr Leu Phe Cys Ala Ser Asp
        35                  40                  45              
Ala Lys Ala His Ser Thr Glu Ala His Asn Ile Trp Ala Thr Gln Ala
    50                  55                  60                  
Cys Val Pro Thr Asp Pro Asn Pro Gln Glu Val Leu Leu Pro Asn Val
65                  70                  75                  80  
Thr Glu Lys Phe Asn Met Trp Glu Asn Lys Met Ala Asp Gln Met Gln
                85                  90                  95      
Glu Asp Ile Ile Ser Leu Trp Glu Gln Ser Leu Lys Pro Cys Val Lys
            100                 105                 110         
Leu Thr Pro Leu Cys Val Thr Met Leu Cys Asn Asp Ser Tyr Gly Glu
        115                 120                 125             
Glu Arg Asn Asn Thr Asn Met Thr Thr Arg Glu Pro Asp Ile Gly Tyr
    130                 135                 140                 
Lys Gln Met Lys Asn Cys Ser Phe Asn Ala Thr Thr Glu Leu Thr Asp
145                 150                 155                 160 
Lys Lys Lys Gln Val Tyr Ser Leu Phe Tyr Val Glu Asp Val Val Pro
                165                 170                 175     
Ile Asn Ala Tyr Asn Lys Thr Tyr Arg Leu Ile Asn Cys Asn Thr Thr
            180                 185                 190         
Ala Val Thr Gln Ala Cys Pro Lys Thr Ser Phe Glu Pro Ile Pro Ile
        195                 200                 205             
His Tyr Cys Ala Pro Pro Gly Phe Ala Ile Met Lys Cys Asn Glu Gly
    210                 215                 220                 
Asn Phe Ser Gly Asn Gly Ser Cys Thr Asn Val Ser Thr Val Gln Cys
225                 230                 235                 240 
Thr His Gly Ile Lys Pro Val Ile Ser Thr Gln Leu Ile Leu Asn Gly
                245                 250                 255     
Ser Leu Asn Thr Asp Gly Ile Val Ile Arg Asn Asp Ser His Ser Asn
            260                 265                 270         
Leu Leu Val Gln Trp Asn Glu Thr Val Pro Ile Asn Cys Thr Arg Pro
        275                 280                 285             
Gly Asn Asn Thr Gly Gly Gln Val Gln Ile Gly Pro Ala Met Thr Phe
    290                 295                 300                 
Tyr Asn Ile Glu Lys Ile Val Gly Asp Ile Arg Gln Ala Tyr Cys Asn
305                 310                 315                 320 
Val Ser Lys Glu Leu Trp Glu Pro Met Trp Asn Arg Thr Arg Glu Glu
                325                 330                 335     
Ile Lys Lys Ile Leu Gly Lys Asn Asn Ile Thr Phe Arg Ala Arg Glu
            340                 345                 350         
Arg Asn Glu Gly Asp Leu Glu Val Thr His Leu Met Phe Asn Cys Arg
        355                 360                 365             
Gly Glu Phe Phe Tyr Cys Asn Thr Ser Lys Leu Phe Asn Glu Glu Leu
    370                 375                 380                 
Leu Asn Glu Thr Gly Glu Pro Ile Thr Leu Pro Cys Arg Ile Arg Gln
385                 390                 395                 400 
Ile Val Asn Leu Trp Thr Arg Val Gly Lys Gly Ile Tyr Ala Pro Pro
                405                 410                 415     
Ile Arg Gly Val Leu Asn Cys Thr Ser Asn Ile Thr Gly Leu Val Leu
            420                 425                 430         
Glu Tyr Ser Gly Gly Pro Asp Thr Lys Glu Thr Ile Val Tyr Pro Ser
        435                 440                 445             
Gly Gly Asn Met Val Asn Leu Trp Arg Gln Glu Leu Tyr Lys Tyr Lys
    450                 455                 460                 
Val Val Ser Ile Glu Pro Ile Gly Val Ala Pro Gly Lys Ala Lys Arg
465                 470                 475                 480 
Arg Thr Val Ser Arg Glu Lys Arg Ala Ala Phe Gly Leu Gly Ala Leu
                485                 490                 495     
Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser
            500                 505                 510         
Ile Thr Leu Thr Val Gln Ala Arg Thr Leu Leu Ser Gly Ile Val Gln
        515                 520                 525             
Gln Gln Asn Ile Leu Leu Arg Ala Ile Glu Ala Gln Gln His Leu Leu
    530                 535                 540                 
Gln Leu Ser Ile Trp Gly Ile Lys Gln Leu Gln Ala Lys Val Leu Ala
545                 550                 555                 560 
Ile Glu Arg Tyr Leu Arg Asp Gln Gln Ile Leu Ser Leu Trp Gly Cys
                565                 570                 575     
Ser Gly Lys Thr Ile Cys Tyr Thr Thr Val Pro Trp Asn Glu Thr Trp
            580                 585                 590         
Ser Asn Asn Thr Ser Tyr Asp Thr Ile Trp Asn Asn Leu Thr Trp Gln
        595                 600                 605             
Gln Trp Asp Glu Lys Val Arg Asn Tyr Ser Gly Val Ile Phe Gly Leu
    610                 615                 620                 
Ile Glu Gln Ala Gln Glu Gln Gln Asn Thr Asn Glu Lys Ser Leu Leu
625                 630                 635                 640 
Glu Leu Asp Gln Trp Asp Ser Leu Trp Ser Trp Phe Gly Ile Thr Lys
                645                 650                 655     
Trp Leu Trp Tyr Ile Lys Ile Ala Ile Met Ile Val Ala Gly Ile Val
            660                 665                 670         
Gly Ile Arg Ile Ile Ser Ile Val Ile Thr Ile Ile Ala Arg Val Arg
        675                 680                 685             
Gln Gly Tyr Ser Pro Leu Ser Leu Gln Thr Leu Ile Pro Thr Ala Arg
    690                 695                 700                 
Gly Pro Asp Arg Pro Glu Glu Thr Glu Gly Gly Val Gly Glu Gln Asp
705                 710                 715                 720 
Arg Gly Arg Ser Val Arg Leu Val Ser Gly Phe Ser Ala Leu Val Trp
                725                 730                 735     
Glu Asp Leu Arg Asn Leu Leu Ile Phe Leu Tyr His Arg Leu Thr Asp
            740                 745                 750         
Ser Leu Leu Ile Leu Arg Arg Thr Leu Glu Leu Leu Gly Gln Ser Leu
        755                 760                 765             
Ser Arg Gly Leu Gln Leu Leu Asn Glu Leu Arg Thr His Leu Trp Gly
    770                 775                 780                 
Ile Leu Ala Tyr Trp Gly Lys Glu Leu Arg Asp Ser Ala Ile Ser Leu
785                 790                 795                 800 
Leu Asn Thr Thr Ala Ile Val Val Ala Glu Gly Thr Asp Arg Ile Ile
                805                 810                 815     
Glu Leu Ala Gln Arg Ile Gly Arg Gly Ile Leu His Ile Pro Arg Arg
            820                 825                 830         
Ile Arg Gln Gly Leu Glu Arg Ala Leu Ile
        835                 840         


<210> 32
<211> 2529
<212> DNA
<213> HIV-1

<400> 32
atggggatga agagtggttg gttactcttc tatcttctag taagcttgat caaggtaatt 60
gggtctgaac aacattgggt aacagtgtac tatggggtac cagtatggag agaagcagag 120
acaactcttt tctgtgcttc agatgctaaa gcccatagta cagaggctca caacatctgg 180
gccacacaag catgtgttcc tactgatccc aatccacaag aagtgctatt acccaatgta 240
actgaaaaat ttaatatgtg ggaaaataaa atggcagacc aaatgcaaga ggatattatc 300
agtctgtggg aacagagctt aaagccctgt gttaaattaa ccccattatg tgtaactatg 360
ctttgtaacg atagctatgg ggaggaaagg aacaatacaa atatgacaac aagagaacca 420
gacataggat acaaacaaat gaaaaattgc tcattcaatg caaccactga gctaacagat 480
aaaaagaagc aagtttactc tctgttttat gtagaagatg tagtaccaat caatgcctat 540
aataaaacat ataggctaat aaattgtaat accacagctg tgacacaagc ttgtcctaag 600
acttcctttg agccaattcc aatacattac tgtgcaccac caggctttgc cattatgaaa 660
tgtaatgaag gaaactttag tggaaatgga agctgtacaa atgtgagtac tgtacaatgc 720
acacatggaa taaagccagt gatatccact cagttaatcc taaatggaag cttaaataca 780
gatggaattg ttattagaaa tgatagtcac agtaatctgt tggtgcaatg gaatgagaca 840
gtgccaataa attgtacaag gccaggaaat aatacaggag gacaggtgca gataggacct 900
gctatgacat tttataacat agaaaaaata gtaggagaca ttagacaagc atactgtaat 960
gtctctaaag aactatggga accaatgtgg aatagaacaa gagaggaaat aaagaaaatc 1020
ctggggaaaa acaacataac cttcagggct cgagagagga atgaaggaga cctagaagtg 1080
acacacttaa tgttcaattg tagaggagag tttttctatt gtaacacttc caaattattt 1140
aatgaggaat tacttaacga gacaggtgag cctattactc tgccttgtag aataagacag 1200
attgtaaatt tgtggacaag ggtaggaaaa ggaatttatg caccaccaat tcggggagtt 1260
cttaactgta cctccaatat tactggactg gttctagaat atagtggtgg gcctgacacc 1320
aaggaaacaa tagtatatcc ctcaggagga aacatggtta atctctggag acaagagttg 1380
tataagtaca aagtagttag catagaaccc ataggagtag caccaggtaa agctaaaaga 1440
cgcacagtga gtagagaaaa aagagcagcc tttggactag gtgcgctgtt tcttgggttt 1500
cttggagcag cagggagcac tatgggcgca gcgtcaataa cgctgacggt acaggcccgg 1560
acattattat ctgggatagt gcaacagcag aatattctgt tgagagcaat agaggcgcaa 1620
caacatttgt tgcaactctc aatctggggc attaaacagc tccaggcaaa agtccttgct 1680
atagaaagat accttaggga tcagcaaatc ctaagtctat ggggctgctc aggaaaaaca 1740
atatgctata ccactgtgcc ttggaatgag acttggagca acaatacctc ttatgataca 1800
atctggaata atttaacctg gcaacaatgg gatgagaaag taagaaacta ttcaggtgtc 1860
atttttggac ttatagaaca ggcacaagaa caacagaaca caaatgagaa atcactcttg 1920
gaattggatc aatgggacag tctgtggagc tggtttggta ttacaaaatg gctgtggtat 1980
ataaaaatag ctataatgat agtagcaggc attgtaggca taagaatcat aagtatagta 2040
ataactataa tagcaagagt taggcaggga tattctcccc tttcgttgca gacccttatc 2100
ccaacagcaa ggggaccaga caggccagaa gaaacagaag gaggcgttgg agagcaagac 2160
agaggcagat ccgtgcgatt agtgagcgga ttctcagctc ttgtctggga ggacctccgg 2220
aacctgttga tcttcctcta ccaccgcttg acagactcac tcttgatact gaggaggact 2280
ctggaactcc tgggacagag tctcagcagg ggactgcaac tactgaatga actcagaaca 2340
cacttgtggg gaatacttgc atattgggga aaagagttaa gggatagtgc tatcagcttg 2400
cttaatacaa cagctattgt agtagcagaa ggaacagata ggattataga attagcacaa 2460
agaataggaa ggggaatatt acacatacct agaagaatca gacaaggcct agaaagagca 2520
ctgatataa                                                         2529

<210> 33
<211> 878
<212> PRT
<213> HIV-1

<400> 33
Met Thr Val Met Glu Lys Lys Ser Lys Lys Ser Trp Ile Leu Cys Ile
 1               5                  10                  15      
Ala Met Ala Leu Ile Ile Pro Cys Leu Ser Gly Arg Gln Leu Tyr Ile
            20                  25                  30          
Thr Val Tyr Ser Gly Val Pro Val Trp Glu Asp Ala Thr Pro Val Leu
        35                  40                  45              
Phe Cys Ala Ser Asp Ala Asn Leu Thr Ser Thr Glu Lys His Asn Val
    50                  55                  60                  
Trp Ala Ser Gln Ala Cys Val Pro Thr Asp Pro Thr Pro His Glu Tyr
65                  70                  75                  80  
Pro Leu Val Asn Val Thr Asp Lys Phe Asp Ile Trp Lys Asn Tyr Met
                85                  90                  95      
Val Asp Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln Ser Leu
            100                 105                 110         
Lys Pro Cys Val Gln Met Thr Phe Leu Cys Val Gln Met Asn Cys Thr
        115                 120                 125             
Glu Leu Ser Asn Ser Ser Ala Ser Asn Thr Thr Asp Leu Arg Ile Ala
    130                 135                 140                 
Gly Leu Glu Glu Ile Pro Met Lys Asn Cys Ser Phe Asn Val Thr Thr
145                 150                 155                 160 
Phe Leu Asn Asp Arg Lys Glu Lys Arg Gln Ala Leu Phe Tyr Val Ser
                165                 170                 175     
Asp Leu Val Lys Ile Asp Asn Ser Ser Thr Ile Tyr Arg Leu Thr Asn
            180                 185                 190         
Cys Asn Ser Thr Thr Ile Arg Gln Ala Cys Pro Lys Val Ser Phe Glu
        195                 200                 205             
Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Tyr Ala Ile Phe Lys
    210                 215                 220                 
Cys Asn Asn Thr Asp Phe Asn Gly Thr Gly Leu Cys Pro Asn Ile Ser
225                 230                 235                 240 
Val Val Thr Cys Thr His Gly Ile Lys Pro Thr Val Ser Thr Gln Leu
                245                 250                 255     
Ile Met Asn Gly Thr Leu Ser Arg Gly Lys Ile Arg Ile Met Gly Arg
            260                 265                 270         
Asn Ile Thr Asp Asn Thr Lys Asn Ile Val Val Thr Leu Asn Thr Ser
        275                 280                 285             
Ile Asn Met Thr Cys Thr Arg Lys Gly Arg Gly Lys Ile Gln Arg Ile
    290                 295                 300                 
Ala Thr Gly Pro Leu Arg Trp Val Ser Met Ala Ala Lys Thr Glu Ser
305                 310                 315                 320 
Gln Asn Thr Gly Ser Arg Ile Ala Tyr Cys Met Tyr Asn Asn Thr Glu
                325                 330                 335     
Trp Ile Asn Thr Leu Lys Gln Thr Ala Glu Arg Tyr Leu Glu Leu Val
            340                 345                 350         
Asn Lys Thr Arg Asn Leu Ser Lys Thr Arg Asn Phe Ser Ile Ile Phe
        355                 360                 365             
Asn His Ser Ile Ser Gly Gly Asp Ile Glu Ala Ser Ser Leu His Phe
    370                 375                 380                 
Asn Cys His Gly Glu Phe Phe Tyr Cys Asp Thr Ser Arg Leu Phe Asn
385                 390                 395                 400 
Tyr Thr Phe Lys Cys Asn Gly Ser Gln Cys Asn Glu Thr Asn Lys Thr
                405                 410                 415     
Gln Thr Asn Lys Thr Gln Asp Thr Ile Ile Pro Cys Lys Ile Arg Gln
            420                 425                 430         
Val Val Arg Ser Trp Ile Lys Gly Glu Leu Gly Leu Tyr Ala Pro Pro
        435                 440                 445             
Ile Pro Gly Asp Leu Thr Cys Lys Ser Asn Ile Thr Gly Met Ile Leu
    450                 455                 460                 
Gln Leu Asp Thr Pro Tyr Asn Ser Ser Cys Asp Asn Val Thr Phe Arg
465                 470                 475                 480 
Pro Thr Gly Gly Asp Met Arg Asp Ile Trp Arg Thr Glu Leu Tyr Asn
                485                 490                 495     
Tyr Lys Val Ile Gln Val Lys Pro Phe Ser Val Ala Pro Thr Lys Ile
            500                 505                 510         
Ser Arg Pro Ile Ile Gly Leu Asn Thr Thr His Arg Gly Lys Arg Ala
        515                 520                 525             
Val Gly Leu Gly Met Leu Phe Leu Gly Val Leu Ser Ala Ala Gly Ser
    530                 535                 540                 
Thr Met Gly Ala Ala Ala Thr Thr Leu Ala Val Arg Thr Gln Gly Val
545                 550                 555                 560 
Leu Lys Gly Ile Val Gln Gln Gln Asp Asn Leu Leu Arg Ala Ile Gln
                565                 570                 575     
Ala Gln Gln His Leu Leu Arg Leu Ser Val Trp Gly Ile Arg Gln Leu
            580                 585                 590         
Arg Ala Arg Leu Gln Ala Leu Glu Thr Leu Ile Gln Asn Gln Gln Arg
        595                 600                 605             
Leu Ser Leu Trp Gly Cys Lys Gly Arg Ile Ile Cys Tyr Thr Ser Ala
    610                 615                 620                 
Lys Trp Asn Asn Thr Trp Gly Asn Trp Thr Asp Ser Ala Trp Asn Asn
625                 630                 635                 640 
Leu Thr Trp Gln Gln Trp Asp Gln Gln Ile Asp Glu Tyr Ser Thr Thr
                645                 650                 655     
Ile Tyr Thr Lys Ile Gln Glu Ala Gln Asp Gln Gln Glu Gln Asn Glu
            660                 665                 670         
Lys Thr Leu Leu Glu Leu Asp Glu Trp Ala Ser Leu Trp Asn Trp Phe
        675                 680                 685             
Asp Ile Thr Lys Trp Leu Trp Tyr Ile Lys Ile Ala Ile Ile Ile Val
    690                 695                 700                 
Gly Ala Leu Ile Gly Ile Arg Val Val Met Ile Val Leu Asn Leu Val
705                 710                 715                 720 
Lys Asn Ile Arg Gln Gly Tyr Gln Pro Leu Ser Leu Gln Ile Pro Ile
                725                 730                 735     
Pro His Gln Glu Glu Ala Glu Thr Pro Gly Arg Thr Gly Glu Glu Gly
            740                 745                 750         
Gly Gly Val Asp Arg Arg Lys Trp Thr Pro Leu Gln Pro Gly Phe Leu
        755                 760                 765             
Gln Leu Leu Tyr Thr Asp Leu Arg Thr Ile Ile Leu Trp Thr Tyr His
    770                 775                 780                 
Leu Leu Ser Asn Leu Ala Ser Gly Ile Gln Arg Leu Ile Ser Tyr Leu
785                 790                 795                 800 
Gly Leu Gly Leu Trp Ile Leu Gly Gln Lys Thr Ile Glu Ala Cys Arg
                805                 810                 815     
Leu Cys Gly Ala Val Thr Gln Tyr Trp Leu Gln Glu Leu Arg Ala Ser
            820                 825                 830         
Ala Thr Asn Leu Leu Asp Thr Ile Ala Val Ala Val Gly Asn Trp Thr
        835                 840                 845             
Asp Ser Ile Ile Leu Gly Ile Gln Arg Ile Gly Arg Gly Phe Leu Asn
    850                 855                 860                 
Ile Pro Arg Arg Ile Arg Gln Gly Ala Glu Arg Ala Leu Asn
865                 870                 875             


<210> 34
<211> 2637
<212> DNA
<213> HIV-1

<400> 34
atgacagtaa tggagaagaa gagcaagaag tcatggatct tatgcatagc catggctttg 60
ataatcccat gtttgagtgg tagacaattg tatatcacag tctattctgg ggtacctgta 120
tgggaagatg caacaccagt actattctgt gcttcagatg ctaatttgac aagcactgaa 180
aagcataatg tttgggcatc acaagcctgc gttcccacag accccactcc acatgaatat 240
ccactagtca atgtgacaga taaatttgat atatggaaaa attacatggt ggaccaaatg 300
catgaagaca ttattagttt atgggatcaa agtttaaagc cttgtgtgca aatgactttc 360
ttatgtgtac aaatgaactg tacagagctg agtaatagca gtgcatcaaa cacaaccgac 420
ctaaggatcg caggcctaga agaaataccc atgaaaaatt gtagttttaa tgtaactaca 480
ttcctcaatg acagaaagga gaaaaggcag gctctattct atgtatcaga tttggttaag 540
attgacaaca gctcaacaat atatagatta actaattgta attccacaac catcagacaa 600
gcctgtccga aggtaagctt tgagcccatc cccatacatt attgtgctcc agcaggatat 660
gccatcttta agtgtaataa cacagacttt aatggaacag gcctatgtcc caatatttca 720
gtggttacat gtacacatgg catcaagcca acagtaagta ctcaattaat aatgaatggg 780
acactctcta gagggaagat aagaattatg ggaagaaata ttacagacaa tacaaagaat 840
attgtagtaa ccctaaacac ttctataaac atgacatgta cgagaaaagg aagaggtaaa 900
atacaaagga tagcgacagg tccactgcga tgggtcagta tggcagctaa aacagagtca 960
cagaacacag ggtcaaggat agcttattgt atgtataaca acactgaatg gataaatacc 1020
ttaaaacaaa cagctgaaag atatttagaa ctagtaaaca agacaagaaa tttaagcaag 1080
acaagaaatt ttagcataat attcaaccac agtataagtg gtggagacat agaagcaagc 1140
tctttacatt ttaactgtca tggagaattc ttttattgtg acacatctcg gctgtttaac 1200
tatactttta agtgtaatgg ttcccaatgt aatgagacca ataaaactca gacaaataaa 1260
actcaggata ctataatacc ttgcaagata agacaggtag taagatcatg gataaaggga 1320
gagttaggac tctatgcacc tcccatccca ggtgatctaa catgtaaatc caacataact 1380
gggatgattt tacaactaga tacaccctac aactcctcat gtgacaatgt cacatttaga 1440
ccaacagggg gagatatgag agatatatgg agaactgaat tgtacaacta caaagtaata 1500
caggtaaaac cttttagtgt agcacctaca aaaatttcaa gaccaataat aggccttaac 1560
accacccaca gaggaaaaag agcagtagga ttgggaatgc tattcttagg ggttctaagc 1620
gcagcaggta gcactatggg cgcagcggca acaacgctgg cggtacggac ccaaggtgta 1680
ctaaagggta tagtgcaaca gcaggacaac ctgctgagag cgatacaggc ccagcaacat 1740
ttgctgaggt tatctgtatg gggtattaga caactccgag ctcgcctgca agccttagaa 1800
acccttatac agaatcagca acgcctaagc ctatggggat gtaaaggaag gataatatgt 1860
tacacatcag caaaatggaa caacacatgg ggaaactgga ctgacagtgc ttggaacaac 1920
ttgacatggc agcaatggga ccaacaaata gatgaatata gcaccactat atacactaaa 1980
atacaagaag cacaggacca acaggaacag aatgaaaaga cattgttaga gctagatgaa 2040
tgggcttctc tttggaattg gtttgacata actaaatggt tgtggtatat aaaaatagct 2100
ataatcatag taggagcact aataggtata agagttgtca tgatagtact taatctagtg 2160
aaaaacatta ggcagggata tcaacccctc tcgttgcaga tccccatccc acaccaggag 2220
gaagcagaaa cgccaggaag aacaggagaa gaaggtggag gcgtagacag gcgcaagtgg 2280
acacccttgc aaccaggatt cttacaactg ttgtacacgg atctcaggac aataatcttg 2340
tggacttacc acctcttgag caacttagca tcagggatcc agaggttgat cagctacctg 2400
ggacttggac tgtggatcct gggacaaaag acaattgaag cttgcagact ttgtggagct 2460
gtaacacaat actggttaca agaattgcgg gctagtgcta caaatctgct tgatactatt 2520
gcagtggcag ttggcaattg gactgacagc atcatcttag gtatacaaag aatagggcga 2580
ggattcctca acatcccaag aagaattaga caaggtgcag aaagagctct aaattaa    2637

<210> 35
<211> 884
<212> PRT
<213> HIV-1

<220> 
<221> VARIANT        
<222> (0)...(0)
<223> Xaa = any amino acid

<400> 35
Met Gly Gly Met Arg Ala Met Lys Lys Lys Lys Asn Ser Leu Gly Asn
 1               5                  10                  15      
Leu Glu Ile Cys Leu Ala Leu Val Ile Tyr Phe Asn Ala Ile Ser Cys
            20                  25                  30          
Ala Ser Gly Ile His Tyr Val Thr Val Tyr Tyr Gly Val Pro Val Trp
        35                  40                  45              
Arg Asn Ala Glu Val Thr Leu Phe Cys Ala Ala Asp Ala Ser Leu Thr
    50                  55                  60                  
Ser Lys Glu Gln His Asn Ile Trp Ala Thr Gln Ala Cys Val Pro Thr
65                  70                  75                  80  
Asp Pro Thr Pro Ile Glu Val Lys Ile Asn Val Thr Glu Ser Phe Asn
                85                  90                  95      
Ile Trp Lys Asn Tyr Met Val Thr Gln Met Gln Glu Asp Ile Ile Ser
            100                 105                 110         
Leu Trp Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Ile Leu Cys
        115                 120                 125             
Val Thr Met Asn Cys Ser Glu Cys His Lys Gln Ala Lys Cys Asn Asn
    130                 135                 140                 
Pro Ser Glu Asn Arg Thr Ala Ala Asn Pro Leu Glu Leu Phe Lys Cys
145                 150                 155                 160 
Ser Phe Asn Thr Thr Thr Val Leu Lys Asp Lys Lys Gln Glu Gln Gln
                165                 170                 175     
Ala Leu Phe Tyr Arg Thr Asp Leu Ile Ala Leu Asn Glu Thr Ala Asn
            180                 185                 190         
Asn Thr Leu Tyr Arg Leu Ile Asn Cys Asn Thr Thr Thr Ile Thr Gln
        195                 200                 205             
Ala Cys Pro Lys Val Thr Phe Glu Pro Leu Pro Ile Gln Tyr Cys Ala
    210                 215                 220                 
Pro Ala Gly Tyr Ala Leu Met Lys Cys Xaa Gln Xaa Gly Phe Asn Gly
225                 230                 235                 240 
Thr Gly Pro Cys Asn Gln Thr Val Ile Thr His Cys Thr His Gly Ile
                245                 250                 255     
Lys Pro Thr Val Ser Thr Gln Leu Ile Leu Asn Gly Thr Leu Ala Glu
            260                 265                 270         
Xaa Glu Pro Leu Val Ile Thr Gln Asn Val Ser Asp Thr Arg Tyr Val
        275                 280                 285             
Ile Ile Val Lys Leu Asn Lys Asn Val Ser Leu Thr Cys Val Arg Pro
    290                 295                 300                 
Gly Asn Asn Thr Arg Xaa Gln Val Gln Ile Gly Pro Met Thr Trp Tyr
305                 310                 315                 320 
Asn Met Lys Phe Tyr Thr Gly Asp Ile Arg Lys Ala Tyr Cys Asn Xaa
                325                 330                 335     
Ser Met Gln Xaa Trp Thr Lys Thr Leu Glu Ser Val Ser Lys Ala Ile
            340                 345                 350         
Trp Lys Ala Tyr Pro Gln Pro Pro Asn Gln Asn His Thr Phe Val Phe
        355                 360                 365             
Arg Asn Ser Thr Gly Gly Asp Pro Glu Val Ser Phe Leu His Phe Ser
    370                 375                 380                 
Cys His Gly Glu Phe Phe Tyr Cys Asn Thr Ser Ser Leu Phe Asn Tyr
385                 390                 395                 400 
Ser Tyr Thr Cys Asn Glu Lys Gly Val Cys Ser Ile Asn Asn His Thr
                405                 410                 415     
Gly Asn Tyr Thr Gly Glu Asn Ile Ile Thr Leu Pro Cys Arg Leu Lys
            420                 425                 430         
Gln Val Val Asn Ser Trp Met Arg Val Gly Ser Gly Leu Phe Ala Pro
        435                 440                 445             
Pro Ile Glu Gly Gln Leu Gln Cys His Ser Asn Ile Thr Gly Leu Ile
    450                 455                 460                 
Leu Asp Arg Ala Ser Pro Tyr Asn Ala Asn Ser Ser Ser Asn Thr Thr
465                 470                 475                 480 
Leu Ser Pro Thr Gly Gly Asp Met Arg His Ile Trp Arg Ser Glu Leu
                485                 490                 495     
Tyr Pro Tyr Lys Val Val Gln Val Lys Ala Leu Ala Val Ala Pro Thr
            500                 505                 510         
Arg Val Ser Arg Pro Thr Ile Met Xaa His Asp Ala His Arg Lys Lys
        515                 520                 525             
Arg Gly Ala Gly Leu Gly Met Leu Phe Leu Gly Phe Met Ser Ala Ala
    530                 535                 540                 
Gly Ser Thr Met Gly Ala Ala Ala Val Thr Leu Thr Val Gln Ala Arg
545                 550                 555                 560 
Gln Val Leu His Gly Ile Val Gln Gln Gln Asn Asn Met Leu Arg Ala
                565                 570                 575     
Ile Glu Ala Gln Gln Glu Leu Leu Arg Leu Ser Val Trp Gly Ile Arg
            580                 585                 590         
Gln Leu Arg Ala Arg Leu Leu Ala Ile Glu Thr Tyr Leu Arg Asp Gln
        595                 600                 605             
Gln Leu Leu Gly Leu Trp Gly Cys Ser Gly Gln Ile Val Cys Tyr Thr
    610                 615                 620                 
Asn Val Pro Trp Asn Arg Ser Trp Thr Asn Lys Ser Glu Thr Glu Leu
625                 630                 635                 640 
Asp Gly Xaa Trp Thr Asn Leu Thr Trp Gln Glu Trp Asp Lys Leu Val
                645                 650                 655     
Asp Asn Tyr Thr Asp Thr Ile Tyr Leu Glu Ile Gln Arg Ala Gln Asp
            660                 665                 670         
Gln Gln Lys Ala Asn Glu Lys Lys Leu Leu Glu Leu Asp Gln Trp Ala
        675                 680                 685             
Gln Leu Trp Asn Trp Leu Asp Ile Thr Gln Trp Leu Trp Tyr Ile Lys
    690                 695                 700                 
Ile Phe Ile Met Ile Val Gly Gly Ile Ile Gly Leu Arg Ile Leu Leu
705                 710                 715                 720 
Ala Xaa Xaa Asn Val Val Arg Arg Ile Arg Gln Gly Tyr Ser Pro Val
                725                 730                 735     
Ser Leu Gln Thr Leu Gly Leu Asn Gly Asp Pro Ala Gly Ile Ala Pro
            740                 745                 750         
Gly Thr Asn Glu Glu Gly Gly Glu Ala Gly Asn Gly Arg Ser Ile Arg
        755                 760                 765             
Leu Leu Asp Gly Phe Leu Pro Leu Val Trp Asp Asp Leu Lys Asn Leu
    770                 775                 780                 
Val Val Gln Ile Tyr Gln Ile Leu Val Gly Cys Ile Leu Gly Ile Lys
785                 790                 795                 800 
Asp Leu Leu Thr Ile Leu Trp Ile His Leu Gly Gln Leu Leu Thr Arg
                805                 810                 815     
Gly Leu Asn Cys Leu Arg Asp Cys Phe Ala Ala Cys Gly Tyr Trp Thr
            820                 825                 830         
Gln Glu Leu Lys Gln Ser Ala Thr Ser Leu Leu Asp Thr Val Ala Ile
        835                 840                 845             
Ser Val Ala Gly Trp Thr Asp Gln Val Ile Ile Val Gly Gln Gln Ile
    850                 855                 860                 
Gly Arg Gly Phe Leu Asn Ile Pro Arg Arg Ile Arg Gln Gly Ile Glu
865                 870                 875                 880 
Arg Ser Leu Leu
                


<210> 36
<211> 847
<212> PRT
<213> HIV-1

<400> 36
Met Arg Val Lys Glu Ile Gln Arg Asn Tyr Gln His Leu Trp Lys Trp
 1               5                  10                  15      
Ser Leu Ile Ile Leu Gly Met Ile Met Ile Cys Lys Ala Ile Glu Lys
            20                  25                  30          
Ser Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Asp Ala Glu
        35                  40                  45              
Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Lys Glu Ser
    50                  55                  60                  
His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Ser Pro
65                  70                  75                  80  
Gln Glu Leu Val Leu Gly Asn Val Thr Glu Asn Phe Asn Met Trp Lys
                85                  90                  95      
Asn Lys Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp
            100                 105                 110         
Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Phe Leu Cys Val Thr Leu
        115                 120                 125             
Asn Cys Ile Asp Val Lys Asn Ser Thr Asn Asn Asn Thr Glu Glu Ala
    130                 135                 140                 
Thr Ile Thr Asn Cys Ser Phe Lys Val Pro Thr Glu Leu Lys Asp Lys
145                 150                 155                 160 
Thr Glu Thr Val His Thr Leu Phe Tyr Lys Leu Asp Val Val Pro Leu
                165                 170                 175     
Asn Val Thr Asn Asn Ser Ser Ile Ser Ser Thr Tyr Arg Leu Ile Asn
            180                 185                 190         
Cys Asn Thr Ser Thr Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu
        195                 200                 205             
Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys
    210                 215                 220                 
Cys Asn Asp Lys Lys Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser
225                 230                 235                 240 
Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu
                245                 250                 255     
Leu Leu Asn Gly Ser Leu Ser Glu Glu Glu Val Ile Ile Arg Ser Glu
            260                 265                 270         
Asn Ile Thr Asn Asn Ala Lys Thr Ile Ile Val Gln Leu Asn Glu Thr
        275                 280                 285             
Val Lys Ile Asn Cys Thr Arg Pro Gly Ser Asp Lys Lys Ile Arg Gln
    290                 295                 300                 
Ser Ile Arg Ile Gly Pro Gly Lys Val Phe Tyr Ala Lys Gly Gly Ile
305                 310                 315                 320 
Thr Gly Gln Ala His Cys Asn Ile Thr Asp Gly Glu Trp Arg Asn Thr
                325                 330                 335     
Leu Gln Gln Val Ala Ile Ala Leu Arg Arg Gln Phe Asn Asn Lys Ser
            340                 345                 350         
Ile Ile Phe Asn Ser Ser Ser Gly Gly Asp Ile Glu Ile Thr Thr His
        355                 360                 365             
Thr Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Ser Glu Leu
    370                 375                 380                 
Phe Thr Gly Ile Trp Asn Gly Thr Trp Asp Lys Asn Cys Thr Ser Thr
385                 390                 395                 400 
Glu Ser Asn Cys Thr Gly Asn Ile Thr Leu Pro Cys Arg Ile Lys Gln
                405                 410                 415     
Val Val Arg Thr Trp Gln Gly Val Gly Gln Ala Met Tyr Ala Pro Pro
            420                 425                 430         
Ile Glu Gly Thr Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu
        435                 440                 445             
Thr Arg Asp Gly Gly Asn Gly Asn Ala Thr Gln Asn Glu Thr Phe Arg
    450                 455                 460                 
Pro Gly Gly Gly Asp Met Lys Asp Asn Trp Arg Ser Glu Leu Tyr Lys
465                 470                 475                 480 
Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr Arg Ala
                485                 490                 495     
Lys Arg Arg Val Val Glu Arg Glu Lys Arg Ala Val Gly Met Gly Ala
            500                 505                 510         
Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala
        515                 520                 525             
Ser Met Ala Leu Thr Ala Gln Ala Arg Gln Leu Leu Ser Gly Ile Val
    530                 535                 540                 
Gln Gln Gln Asn Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln His Leu
545                 550                 555                 560 
Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val Leu
                565                 570                 575     
Ala Val Glu Arg Tyr Leu Glu Ser Gln Gln Leu Leu Gly Leu Trp Gly
            580                 585                 590         
Cys Ser Gly Lys Leu Ile Cys Thr Thr Thr Val Pro Trp Asn Ser Ser
        595                 600                 605             
Trp Ser Asn Lys Ser Leu Asp Asn Ile Trp Asp Asn Leu Thr Trp Met
    610                 615                 620                 
Glu Trp Asp Arg Glu Ile Ser Asn Tyr Thr Gln Val Ile Tyr Gly Leu
625                 630                 635                 640 
Leu Glu Asp Ser Gln Lys Gln Gln Glu Lys Ser Glu Lys Asp Leu Leu
                645                 650                 655     
Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asp Ile Thr Asn
            660                 665                 670         
Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu Ile
        675                 680                 685             
Gly Leu Arg Ile Val Phe Thr Val Phe Ser Ile Ile Asn Arg Val Arg
    690                 695                 700                 
Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr Leu Leu Pro Thr Pro Arg
705                 710                 715                 720 
Gly Pro Asp Arg Pro Gly Arg Thr Glu Glu Glu Gly Gly Glu Glu Asp
                725                 730                 735     
Asn Asn Arg Ser Val Arg Leu Val Asn Gly Phe Leu Ala Leu Ala Trp
            740                 745                 750         
Glu Asp Leu Arg Ser Leu Cys Ile Phe Ser Tyr His Arg Leu Arg Asp
        755                 760                 765             
Leu Ile Leu Ile Val Val Lys Gly Leu Arg Arg Gly Trp Glu Ala Leu
    770                 775                 780                 
Lys Tyr Leu Gly Asn Leu Val Leu Tyr Trp Gly Gln Glu Leu Lys Asn
785                 790                 795                 800 
Ser Ala Ile Ser Leu Leu Asn Ala Thr Ala Ile Val Val Ala Glu Gly
                805                 810                 815     
Thr Asp Arg Ile Ile Glu Val Gly Gln Arg Ile Cys Arg Ala Ile Leu
            820                 825                 830         
Asn Ile Pro Arg Arg Ile Arg Gln Gly Phe Glu Arg Ala Leu Leu
        835                 840                 845         


<210> 37
<211> 2544
<212> DNA
<213> HIV-1

<400> 37
atgagagtga aggagataca gaggaattat caacacttgt ggaaatggag cctgataatt 60
ttaggaatga taatgatatg taaagctata gaaaaatcgt gggtcacagt ctattatggg 120
gtacctgtgt ggaaagatgc agaaaccact ctattttgtg catcagatgc taaggcatat 180
gagaaagaat cgcataatgt ctgggctaca catgcctgtg tacccacaga ccccagccca 240
caagagctag ttttgggaaa tgtaacagaa aactttaaca tgtggaagaa taaaatggta 300
gagcagatgc atgaggatat aatcagttta tgggatcaaa gccttaagcc atgtgtaaag 360
ttaacctttc tttgtgtcac tttaaactgt attgatgtaa agaatagtac taacaataac 420
actgaagaag ctaccatcac aaattgctcc ttcaaggtac ccacagaact gaaagataag 480
acggagacag tacatacact tttttataaa ctggatgtag tgccacttaa tgtgacaaat 540
aattctagta taagtagtac ctataggtta ataaattgta atacctcaac cattacacag 600
gcttgtccaa aggtatcctt tgagccaatt cctatacatt attgtgcccc tgctggtttt 660
gcgattctaa agtgtaatga taagaagttc aatggaacag gaccatgcaa aaatgtcagc 720
acagtacaat gcacacatgg aattaggcca gtggtgtcaa ctcaattact attaaatggc 780
agtttatcag aagaagaggt aataattaga tctgaaaata tcacaaacaa tgccaaaacc 840
ataatagtac agcttaatga gactgtaaaa attaattgta ccagacccgg atccgacaag 900
aagataagac aaagtatacg tataggacca ggaaaagtat tctatgcaaa aggtggaata 960
acaggacaag cacattgtaa cattacagat ggggaatgga ggaatacttt acaacaggta 1020
gctatcgcat taagaagaca atttaataat aaatcaataa tatttaactc atcctcagga 1080
ggggacatag agattacaac acatactttt aactgtggag gagagttttt ctattgcaac 1140
acatcagagc tgtttactgg tatttggaat ggtacttggg ataagaattg cactagcact 1200
gagagtaatt gcactggaaa tattacactc ccatgcagga taaaacaagt ggtaagaaca 1260
tggcagggag taggacaagc aatgtatgcc cctcctatcg aagggacaat taggtgctca 1320
tcaaatatta caggtctact attgacaaga gatggtggta atggcaatgc aactcaaaat 1380
gagaccttta gacctggagg aggagacatg aaagataatt ggagaagtga attgtataag 1440
tataaagtag taaaaattga accactagga gtagcaccca ccagggcaaa aagaagagtg 1500
gtggagagag aaaaaagagc agtggggatg ggagctttgt ttctcgggtt cttgggagca 1560
gccggaagca ctatgggcgc ggcgtcaatg gcgctgacgg cacaggccag acaattattg 1620
tctggtatag tgcagcagca aaacaatttg ctgagggcta tagaggcgca acagcatctg 1680
ttgcaactca cagtctgggg cattaaacag ctccaggcaa gagtcctggc tgtggaaaga 1740
tacctagaga gtcaacagct cctagggctt tggggctgct ctggaaaact catctgcacc 1800
actactgtgc cctggaactc tagctggagt aataaatcct tggataacat ttgggacaat 1860
ctgacctgga tggagtggga tagagaaatt agcaattaca cacaagtaat atatgggttg 1920
cttgaagact cacaaaaaca gcaggaaaag agtgaaaaag atttactgga attggataag 1980
tgggcaagtc tgtggaactg gtttgacata acaaattggt tgtggtatat aaaaatattc 2040
ataatgatag taggaggctt gataggctta agaatagttt ttactgtgtt ttctataata 2100
aatagagtta ggcagggata ctcacctttg tctttccaga ccctcctccc aaccccgagg 2160
ggacccgaca ggccaggaag aaccgaagaa gaaggtggag aagaagacaa caacagatcc 2220
gttcgattag tgaacggatt cttagcactt gcctgggaag acctgcggag cctgtgcatc 2280
ttcagctacc accgcttgag agacttaatc ttgattgtag taaagggact gcgacggggg 2340
tgggaagcac tcaaatacct ggggaatctt gtgctgtatt ggggtcagga actaaagaat 2400
agtgctatta gtttgcttaa tgccacagca atagtagtag ctgagggaac agatagaatt 2460
atagaagtgg gacaaagaat ttgtagggct attctcaata tacctagaag aataagacag 2520
ggtttcgaaa gggctttact gtaa                                        2544

<210> 38
<211> 859
<212> PRT
<213> HIV-1

<400> 38
Met Arg Val Met Gly Ile Gln Lys Asn Tyr Pro Leu Leu Trp Arg Trp
 1               5                  10                  15      
Gly Met Ile Ile Phe Trp Ile Met Thr Ile Cys Ser Ala Gly Asn Leu
            20                  25                  30          
Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Arg Asp Ala Glu Thr
        35                  40                  45              
Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val His
    50                  55                  60                  
Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro Gln
65                  70                  75                  80  
Glu Ile His Leu Gly Asn Val Thr Glu Asp Phe Asn Met Trp Lys Asn
                85                  90                  95      
Ser Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp Gln
            100                 105                 110         
Ser Leu Lys Pro Cys Val Gln Leu Thr Pro Leu Cys Val Thr Leu His
        115                 120                 125             
Cys Gln Asp Asn Leu Thr Ser Ser Gly Asn Ile Ser Glu Asn Met Gln
    130                 135                 140                 
Gly Glu Ile Lys Asn Cys Ser Phe Asn Met Thr Thr Glu Leu Arg Asp
145                 150                 155                 160 
Lys Lys Gln Lys Val Tyr Ala Leu Phe Tyr Arg Tyr Asp Val Val Gln
                165                 170                 175     
Ile Asn Glu Thr Gly Asp Asn Ile Gln Tyr Arg Leu Ile Asn Cys Asn
            180                 185                 190         
Thr Ser Ala Ile Thr Gln Ala Cys Pro Lys Val Ser Phe Glu Pro Ile
        195                 200                 205             
Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys Cys Asn
    210                 215                 220                 
Asp Glu Lys Phe Asn Gly Thr Gly Pro Cys Lys Asn Val Ser Thr Val
225                 230                 235                 240 
Gln Cys Thr His Gly Ile Lys Pro Val Val Ser Thr Gln Leu Leu Leu
                245                 250                 255     
Asn Gly Ser Leu Ala Glu Glu Glu Ile Val Ile Arg Ser Glu Asn Phe
            260                 265                 270         
Thr Asn Asn Ala Lys Ile Ile Ile Val Gln Leu His Glu Ser Val Lys
        275                 280                 285             
Ile Asn Cys Thr Arg Pro Gly Asn Asn Thr Arg Lys Ser Val Arg Ile
    290                 295                 300                 
Gly Pro Gly Gln Thr Phe Tyr Ala Thr Gly Asp Ile Ile Gly Asp Ile
305                 310                 315                 320 
Arg Gln Ala His Cys Asn Val Ser Trp Gln Gln Trp Asn Lys Thr Leu
                325                 330                 335     
His Asp Val Ala Thr Lys Leu Arg Glu Tyr Phe Asn Asn Thr Thr Ile
            340                 345                 350         
Ile Phe Asp Glu Pro Ser Gly Gly Asp Leu Glu Ile Thr Thr His Ser
        355                 360                 365             
Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Thr Ser Asn Leu Phe
    370                 375                 380                 
Asn Arg Thr Trp Asn His Asn Gly Thr Trp Asn Ala Pro Gly Pro Phe
385                 390                 395                 400 
Asn Asp Thr Glu Asp Lys Thr Ile Asn Gly Thr Glu Asp Lys Thr Ile
                405                 410                 415     
Thr Leu Gln Cys Arg Ile Lys Gln Ile Val Arg Met Trp Gln Lys Val
            420                 425                 430         
Gly Gln Ala Met Tyr Ala Pro Pro Ile Pro Gly Glu Ile Arg Cys Glu
        435                 440                 445             
Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Asn Asp Asn
    450                 455                 460                 
Asn Asn Thr Glu Thr Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn
465                 470                 475                 480 
Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu
                485                 490                 495     
Gly Val Ala Pro Ser His Ala Lys Arg Arg Val Val Glu Arg Glu Lys
            500                 505                 510         
Arg Ala Leu Val Gly Leu Gly Ala Phe Phe Phe Gly Phe Leu Gly Ala
        515                 520                 525             
Ala Gly Ser Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val Gln Ala
    530                 535                 540                 
Arg Gln Leu Leu Ser Gly Ile Val Gln Gln Gln Ser Asn Leu Leu Lys
545                 550                 555                 560 
Ala Ile Glu Ala Gln Gln His Leu Leu Arg Leu Thr Val Trp Gly Ile
                565                 570                 575     
Lys Gln Leu Gln Ala Arg Val Leu Ala Leu Glu Ala Tyr Leu Lys Asp
            580                 585                 590         
Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr
        595                 600                 605             
Thr Thr Val Pro Trp Asn Ser Ser Trp Ser Asn Lys Thr Tyr Asp His
    610                 615                 620                 
Ile Trp Gly Asn Met Thr Trp Leu Gln Trp Asp Lys Glu Ile Ser Asn
625                 630                 635                 640 
Tyr Thr His Ile Ile Tyr Asp Leu Ile Glu Glu Ser Gln Asn Gln Gln
                645                 650                 655     
Glu Lys Asn Glu Gln Asp Leu Leu Ala Leu Asp Lys Trp Ala Ser Leu
            660                 665                 670         
Trp Asp Trp Phe Ser Ile Ser Ser Trp Leu Trp Tyr Ile Arg Ile Phe
        675                 680                 685             
Ile Ile Ile Val Gly Gly Leu Ile Gly Leu Arg Ile Val Phe Ala Val
    690                 695                 700                 
Leu Ala Ile Ile Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe
705                 710                 715                 720 
Gln Thr Leu Thr His His Gln Arg Glu Pro Gly Arg Pro Glu Arg Ile
                725                 730                 735     
Glu Glu Gly Gly Gly Gly Gln Asp Arg Asp Arg Ser Val Arg Leu Val
            740                 745                 750         
Ser Gly Phe Leu Ala Leu Ala Trp Asp Asp Leu Arg Ser Leu Cys Leu
        755                 760                 765             
Phe Ser Tyr His Arg Leu Arg Asp Phe Val Ser Ile Val Ala Arg Thr
    770                 775                 780                 
Val Glu Leu Leu Gly His Arg Gly Trp Glu Ala Leu Lys Tyr Leu Trp
785                 790                 795                 800 
Asn Leu Leu Ser Tyr Trp Gly Gln Glu Leu Lys Asn Ser Ala Ile Ser
                805                 810                 815     
Leu Leu Asp Thr Ile Ala Ile Val Val Ala Asn Trp Thr Asp Arg Val
            820                 825                 830         
Ile Glu Leu Val Gln Arg Ala Gly Arg Ala Ile Leu Asn Ile Pro Arg
        835                 840                 845             
Arg Ile Arg Gln Gly Phe Glu Arg Ala Leu Leu
    850                 855                 


<210> 39
<211> 2580
<212> DNA
<213> HIV-1

<400> 39
atgagagtga tggggataca gaagaattat ccactcttat ggagatgggg tatgataata 60
ttttggataa tgacaatttg tagtgctgga aatttgtggg tcacggtcta ttatggggta 120
cctgtgtgga gagacgcaga gaccacccta ttttgtgcat cagatgctaa agcatatgat 180
acagaagtac ataatgtctg ggctacacat gcctgcgtac ccacagaccc taacccacaa 240
gaaatacatt tgggaaatgt aacagaagat tttaacatgt ggaaaaatag catggtagag 300
cagatgcatg aagatataat tagtctatgg gatcaaagcc taaagccatg tgtacagtta 360
acccctctct gtgttacttt acattgtcag gataacctca ctagcagcgg caacatatcg 420
gaaaacatgc aaggagaaat aaaaaactgc tctttcaata tgaccacaga actaagagat 480
aagaaacaga aagtgtatgc acttttttat agatatgatg tagtacaaat taatgaaact 540
ggggataaca ttcaatatag gttaataaat tgtaatacct cagccattac acaggcttgt 600
ccaaaggtat cctttgagcc aattcccata cattattgtg ccccagctgg ctttgcaatt 660
ctaaagtgta atgatgagaa gttcaatgga acagggccat gcaagaatgt cagcacagta 720
caatgcacac atggaatcaa gccagtagta tcaactcaac tgttattaaa tggcagccta 780
gcagaagaag agatagtgat tagatctgaa aattttacaa acaatgccaa aatcataata 840
gtacagttgc atgaatctgt aaaaattaat tgtaccagac ctggcaacaa tacaagaaaa 900
agtgtacgta taggaccagg gcaaacattc tatgcaacag gtgacataat aggggatata 960
agacaagcac attgtaatgt cagctggcaa caatggaaca aaactttaca cgatgtggct 1020
acaaaattaa gggagtattt taataatacc acaataatct ttgatgaacc ctcaggaggg 1080
gatttagaaa ttacaacaca tagttttaat tgtggaggag aatttttcta ttgcaataca 1140
tcaaatctgt ttaatagaac ttggaatcat aatggcactt ggaatgcacc aggaccgttt 1200
aatgacactg aggataaaac aataaatggc actgaggata aaacaataac tctccaatgc 1260
agaataaagc aaattgtgcg tatgtggcag aaagtaggac aagcaatgta tgcccctccc 1320
atcccaggag aaataaggtg tgaatcaaac attacaggac tactattaac aagagatgga 1380
gggaatgata ataataatac agagaccttc aggcctggag gaggagatat gagggacaat 1440
tggagaagtg aattatataa atataaagta gtaaaaattg aaccactagg tgtagcaccc 1500
tcccatgcaa aaagaagagt ggtggagaga gaaaaaagag cacttgttgg actgggagct 1560
ttcttctttg ggttcttagg agcagcagga agcactatgg gcgcggcgtc aataacgctg 1620
acggtacagg ccagacaatt attgtctggt atagtgcaac agcagagcaa tctgctgaag 1680
gctatagagg ctcaacaaca tctgttgaga ctcacggtct ggggcattaa acagctccag 1740
gcaagagtcc tggctctaga agcataccta aaggatcaac agctcctagg aatttggggc 1800
tgctctggaa aactcatctg caccactact gtaccctgga actctagttg gagtaataaa 1860
acttatgatc acatatgggg taacatgacc tggctgcaat gggataaaga aattagtaac 1920
tacacacaca taatatatga tctaattgaa gaatcgcaga accagcagga aaagaatgaa 1980
caagacttat tggcattgga caagtgggca agtctgtggg attggtttag catatcaagt 2040
tggctatggt atataagaat atttataata atagtaggag gtttaatagg cttaagaata 2100
gtctttgctg tacttgctat aataaataga gttaggcagg gatactcacc tttgtctttc 2160
cagaccctta cccaccacca gagggaaccc ggcaggcccg aaagaatcga agaaggaggt 2220
ggcgggcaag acagagacag atccgtgcga ttagtgagcg gattcttagc acttgcctgg 2280
gacgatctgc ggagcctgtg cctcttcagc taccaccgat tgagagactt cgtctcgatt 2340
gtagcgagga ctgtggaact tctgggacac agggggtggg aagccctcaa atatctgtgg 2400
aatcttctat cgtactgggg tcaggaacta aagaatagtg ctattagttt gcttgatacc 2460
atagcaatag tagtagctaa ttggacagac agagttatag aactagtaca aagagctggt 2520
agagctattc tcaacatacc taggagaatc agacagggct ttgaaagggc tttgctataa 2580


<210> 40
<211> 859
<212> PRT
<213> HIV-2

<400> 40
Met Cys Gly Arg Asn Gln Leu Phe Val Ala Ser Leu Leu Ala Ser Ala
 1               5                  10                  15      
Cys Leu Ile Tyr Cys Val Gln Tyr Val Thr Val Phe Tyr Gly Val Pro
            20                  25                  30          
Val Trp Arg Asn Ala Ser Ile Pro Leu Phe Cys Ala Thr Lys Asn Arg
        35                  40                  45              
Asp Thr Trp Gly Thr Ile Gln Cys Leu Pro Asp Asn Asp Asp Tyr Gln
    50                  55                  60                  
Glu Ile Ala Leu Asn Val Thr Glu Ala Phe Asp Ala Trp Asn Asn Thr
65                  70                  75                  80  
Val Thr Glu Gln Ala Val Glu Asp Val Trp Ser Leu Phe Glu Thr Ser
                85                  90                  95      
Ile Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Ala Met Arg Cys
            100                 105                 110         
Asn Ser Thr Thr Ala Lys Asn Thr Thr Ser Thr Pro Thr Thr Thr Thr
        115                 120                 125             
Thr Ala Asn Thr Thr Ile Gly Glu Asn Ser Ser Cys Ile Arg Thr Asp
    130                 135                 140                 
Asn Cys Thr Gly Leu Gly Glu Glu Glu Met Val Asp Cys Gln Phe Asn
145                 150                 155                 160 
Met Thr Gly Leu Glu Arg Asp Lys Lys Lys Leu Tyr Asn Glu Thr Trp
                165                 170                 175     
Tyr Ser Lys Asp Val Val Cys Glu Ser Asn Asp Thr Lys Lys Glu Lys
            180                 185                 190         
Thr Cys Tyr Met Asn His Cys Asn Thr Ser Val Ile Thr Glu Ser Cys
        195                 200                 205             
Asp Lys His Tyr Trp Asp Thr Met Arg Phe Arg Tyr Cys Ala Pro Pro
    210                 215                 220                 
Gly Phe Ala Leu Leu Arg Cys Asn Asp Thr Asn Tyr Ser Gly Phe Glu
225                 230                 235                 240 
Pro Asn Cys Ser Lys Val Val Ala Ala Thr Cys Thr Arg Met Met Glu
                245                 250                 255     
Thr Gln Thr Ser Thr Trp Phe Gly Phe Asn Gly Thr Arg Ala Glu Asn
            260                 265                 270         
Arg Thr Tyr Ile Tyr Trp His Gly Arg Asp Asn Arg Thr Ile Ile Ser
        275                 280                 285             
Leu Asn Lys Phe Tyr Asn Leu Thr Val His Cys Lys Arg Pro Gly Asn
    290                 295                 300                 
Lys Thr Val Val Pro Ile Thr Leu Met Ser Gly Leu Val Phe His Ser
305                 310                 315                 320 
Gln Pro Ile Asn Arg Arg Pro Arg Gln Ala Trp Cys Trp Phe Lys Gly
                325                 330                 335     
Glu Trp Lys Glu Ala Met Lys Glu Val Lys Leu Thr Leu Ala Lys His
            340                 345                 350         
Pro Arg Tyr Lys Gly Thr Asn Asp Thr Glu Lys Ile Arg Phe Ile Ala
        355                 360                 365             
Pro Gly Glu Arg Ser Asp Pro Glu Val Ala Tyr Met Trp Thr Asn Cys
    370                 375                 380                 
Arg Gly Glu Phe Leu Tyr Cys Asn Met Thr Trp Phe Leu Asn Trp Val
385                 390                 395                 400 
Glu Asn Arg Thr Asn Gln Thr Gln His Asn Tyr Val Pro Cys His Ile
                405                 410                 415     
Lys Gln Ile Ile Asn Thr Trp His Lys Val Gly Lys Asn Val Tyr Leu
            420                 425                 430         
Pro Pro Arg Glu Gly Gln Leu Thr Cys Asn Ser Thr Val Thr Ser Ile
        435                 440                 445             
Ile Ala Asn Ile Asp Gly Gly Glu Asn Gln Thr Asn Ile Thr Phe Ser
    450                 455                 460                 
Ala Glu Val Ala Glu Leu Tyr Arg Leu Glu Leu Gly Asp Tyr Lys Leu
465                 470                 475                 480 
Ile Glu Val Thr Pro Ile Gly Phe Ala Pro Thr Pro Val Lys Arg Tyr
                485                 490                 495     
Ser Ser Ala Pro Val Arg Asn Lys Arg Gly Val Phe Val Leu Gly Phe
            500                 505                 510         
Leu Gly Phe Leu Thr Thr Ala Gly Ala Ala Met Gly Ala Ala Ser Leu
        515                 520                 525             
Thr Leu Ser Ala Gln Ser Arg Thr Leu Leu Ala Gly Ile Val Gln Gln
    530                 535                 540                 
Gln Gln Gln Leu Leu Asp Val Val Lys Arg Gln Gln Glu Met Leu Arg
545                 550                 555                 560 
Leu Thr Val Trp Gly Thr Lys Asn Leu Gln Ala Arg Val Thr Ala Ile
                565                 570                 575     
Glu Lys Tyr Leu Lys Asp Gln Ala Gln Leu Asn Ser Trp Gly Cys Ala
            580                 585                 590         
Phe Arg Gln Val Cys His Thr Thr Val Pro Trp Val Asn Asp Thr Leu
        595                 600                 605             
Thr Pro Asp Trp Asn Asn Met Thr Trp Gln Glu Trp Glu Gln Arg Ile
    610                 615                 620                 
Arg Asn Leu Glu Ala Asn Ile Ser Glu Ser Leu Glu Gln Ala Gln Ile
625                 630                 635                 640 
Gln Gln Glu Lys Asn Met Tyr Glu Leu Gln Lys Leu Asn Ser Trp Asp
                645                 650                 655     
Val Phe Gly Asn Trp Phe Asp Leu Thr Ser Trp Ile Lys Tyr Ile Gln
            660                 665                 670         
Tyr Gly Val Tyr Ile Val Val Gly Ile Ile Val Leu Arg Ile Val Ile
        675                 680                 685             
Tyr Val Val Gln Met Leu Ser Arg Leu Arg Lys Gly Tyr Arg Pro Val
    690                 695                 700                 
Phe Ser Ser Pro Pro Ala Tyr Phe Gln Gln Ile His Ile His Lys Asp
705                 710                 715                 720 
Arg Glu Gln Pro Ala Arg Glu Glu Thr Glu Glu Asp Val Gly Asn Ser
                725                 730                 735     
Val Gly Asp Asn Trp Trp Pro Trp Pro Ile Arg Tyr Ile His Phe Leu
            740                 745                 750         
Ile Arg Gln Leu Ile Arg Leu Leu Asn Arg Leu Tyr Asn Ile Cys Arg
        755                 760                 765             
Asp Leu Leu Ser Arg Ser Phe Gln Thr Leu Gln Leu Ile Ser Gln Ser
    770                 775                 780                 
Leu Arg Arg Ala Leu Thr Ala Val Arg Asp Trp Leu Arg Phe Asn Thr
785                 790                 795                 800 
Ala Tyr Leu Gln Tyr Gly Gly Glu Trp Ile Gln Glu Ala Phe Arg Ala
                805                 810                 815     
Phe Ala Arg Ala Thr Gly Glu Thr Leu Thr Asn Ala Trp Arg Gly Phe
            820                 825                 830         
Trp Gly Thr Leu Gly Gln Ile Gly Arg Gly Ile Leu Ala Val Pro Arg
        835                 840                 845             
Arg Ile Arg Gln Gly Ala Glu Ile Ala Leu Leu
    850                 855                 


<210> 41
<211> 2580
<212> DNA
<213> HIV-2

<400> 41
atgtgtggta ggaatcaact atttgttgcc agcttgctag ctagtgcttg cttaatatat 60
tgcgtccaat atgtgactgt tttctatggc gtgcccgtgt ggagaaatgc atccattccc 120
ctcttttgtg caactaaaaa tagagatact tggggaacca tacagtgctt gccagacaat 180
gatgactatc aggaaatagc tttaaatgtg acagaggcct tcgacgcatg gaataataca 240
gtaacagaac aagcagtaga agatgtctgg agtctatttg agacatcaat aaaaccatgc 300
gtcaaactaa cacccttatg tgtagcaatg cgttgtaaca gcacaactgc aaaaaacaca 360
acctccacac caacaaccac cacaacagca aacacaacaa taggagagaa ttcttcatgc 420
atacgcacag acaactgcac agggttggga gaagaagaga tggtcgactg tcagttcaat 480
atgacaggat tagagaggga taagaaaaaa ctatataatg aaacatggta ctcaaaagat 540
gtagtctgtg aatcaaatga caccaagaaa gagaaaacat gttacatgaa ccactgcaac 600
acatcagtca tcacagagtc atgtgacaag cactattggg atactatgag gtttagatat 660
tgtgcaccac cgggttttgc cctgctaaga tgcaatgata ccaattattc aggctttgag 720
cccaattgtt ctaaggtagt agctgctaca tgtacaagga tgatggaaac gcaaacctcc 780
acttggtttg gctttaatgg cactagggca gaaaatagaa catatatcta ttggcatggt 840
agggataata gaactatcat tagcttaaac aagttttata atctcaccgt acattgtaag 900
aggccaggaa acaagacagt tgtaccaata acactcatgt cagggttagt gtttcactcc 960
cagccaatca atagaagacc caggcaagca tggtgctggt tcaaaggcga gtggaaggaa 1020
gccatgaagg aggtgaagct aacccttgca aaacatccca ggtataaagg aaccaacgac 1080
acagaaaaaa ttcgttttat agcgccagga gaacgctcag acccagaagt ggcatacatg 1140
tggactaact gcagaggaga atttctctac tgcaatatga cttggttcct caattgggta 1200
gaaaacagaa cgaatcagac acagcacaat tatgtgccat gccatataaa gcaaataatt 1260
aatacctggc acaaggtagg gaaaaatgta tatttgcctc ctagggaagg acagttaacc 1320
tgcaactcta cagtgaccag cataattgct aacattgacg gaggagagaa ccagacaaat 1380
attaccttta gtgcagaggt ggcagaacta taccgattag aattggggga ttataaattg 1440
atagaagtaa caccaattgg ctttgcacct acaccagtaa aaagatactc ctctgctcca 1500
gtgaggaata aaagaggtgt attcgtgcta gggttcttag gttttctcac gacagcagga 1560
gctgcaatgg gcgcggcgtc cttgacgctg tcggctcagt ctcggacttt attggccggg 1620
atagtgcagc aacagcaaca gctgttggac gtggtcaaga gacaacaaga aatgttgcga 1680
ctgaccgtct ggggaacaaa aaatctccag gcaagagtca ctgctatcga gaaatactta 1740
aaggaccagg cgcaactaaa ttcatgggga tgtgcgttta gacaagtctg ccacactact 1800
gtaccatggg taaatgacac cttaacgcct gattggaaca acatgacatg gcaggaatgg 1860
gagcaacgaa tccgcaacct agaggcaaat atcagtgaaa gtttagaaca ggcacaaatc 1920
cagcaagaaa agaacatgta tgaactacaa aaattaaata gctgggatgt ttttggcaac 1980
tggtttgatt taacctcctg gatcaaatat attcagtatg gagtttatat agtagtagga 2040
ataatagttt taagaatagt aatatatgta gtacaaatgt taagtagact tagaaagggc 2100
tataggcctg ttttctcttc cccccccgct tacttccaac agatccatat ccacaaggac 2160
cgggaacagc cagccagaga agaaacagaa gaagacgttg gaaacagcgt tggagacaat 2220
tggtggccct ggccgataag atatatacat ttcctgatcc gccagctgat tcgcctcttg 2280
aacagactat acaacatctg cagggactta ctatccagga gcttccagac cctccaacta 2340
atctcccaga gtcttcggag agcattgaca gcagtcagag actggctgag atttaacaca 2400
gcctacctgc aatatggggg cgagtggatc caagaagcgt tccgagcctt cgcgagggct 2460
acgggagaga ctcttacaaa cgcctggaga ggcttctggg ggacactggg acaaattggg 2520
aggggaatac ttgcagtccc aagaaggatc aggcaggggg cagaaatcgc cctcctgtga 2580


<210> 42
<211> 727
<212> PRT
<213> SIV

<400> 42
Glu Ser Phe Asp Ala Trp Glu Asn Thr Val Thr Glu Gln Ala Ile Glu
 1               5                  10                  15      
Asp Val Trp Gln Leu Phe Glu Thr Ser Ile Lys Pro Cys Val Lys Leu
            20                  25                  30          
Ser Pro Leu Cys Ile Thr Met Arg Cys Asn Lys Ser Glu Thr Asp Lys
        35                  40                  45              
Trp Gly Leu Thr Lys Ser Ser Thr Thr Thr Thr Ala Thr Thr Ala Thr
    50                  55                  60                  
Pro Ala Ser Thr Thr Arg Thr Thr Ser Ala Lys Ile Asp Met Val Asn
65                  70                  75                  80  
Glu Thr Ser Ser Cys Ile Thr His Asn Asn Cys Thr Gly Leu Glu Gln
                85                  90                  95      
Glu Gln Met Ile Ser Cys Lys Phe Asn Met Thr Gly Leu Lys Arg Asp
            100                 105                 110         
Lys Lys Lys Glu Tyr Asn Glu Thr Trp Tyr Ser Thr Asp Leu Val Cys
        115                 120                 125             
Glu Gln Gly Asn Ser Thr Asp Asn Glu Ser Arg Cys Tyr Met Asn His
    130                 135                 140                 
Cys Asn Thr Ser Val Ile Gln Glu Ser Cys Asp Lys His Tyr Trp Asp
145                 150                 155                 160 
Thr Ile Arg Phe Arg Tyr Cys Ala Pro Pro Gly Tyr Ala Leu Leu Arg
                165                 170                 175     
Cys Asn Asp Thr Asn Tyr Ser Gly Phe Met Pro Lys Cys Ser Lys Val
            180                 185                 190         
Val Val Ser Ser Cys Thr Arg Met Met Glu Thr Gln Thr Ser Thr Trp
        195                 200                 205             
Phe Gly Phe Asn Gly Thr Arg Ala Glu Asn Arg Thr Tyr Ile Tyr Trp
    210                 215                 220                 
His Gly Lys Asp Asn Arg Thr Ile Ile Ser Leu Asn Lys Tyr Tyr Asn
225                 230                 235                 240 
Leu Thr Met Lys Cys Arg Arg Pro Gly Asn Lys Thr Val Leu Pro Val
                245                 250                 255     
Thr Ile Met Ser Gly Leu Val Phe His Ser Gln Pro Ile Asn Asp Arg
            260                 265                 270         
Pro Lys Gln Ala Trp Cys Trp Phe Gly Gly Asn Trp Lys Asp Ala Ile
        275                 280                 285             
Lys Glu Val Lys Gln Thr Ile Val Lys His Pro Arg Tyr Thr Gly Thr
    290                 295                 300                 
Asn Asn Thr Asp Lys Ile Asn Leu Thr Ala Pro Arg Gly Gly Asp Pro
305                 310                 315                 320 
Glu Val Thr Phe Met Trp Thr Asn Cys Arg Gly Glu Phe Leu Tyr Cys
                325                 330                 335     
Lys Met Asn Trp Phe Leu Asn Trp Val Glu Asp Arg Asn Leu Thr Asn
            340                 345                 350         
Lys Lys Ser Lys Glu Gln His Lys Arg Asn Tyr Val Pro Cys His Ile
        355                 360                 365             
Arg Gln Ile Ile Asn Thr Trp His Lys Val Gly Lys Asn Val Tyr Leu
    370                 375                 380                 
Pro Pro Arg Glu Gly Asp Leu Thr Cys Asn Ser Thr Val Thr Ser Leu
385                 390                 395                 400 
Ile Ala Asn Ile Asp Trp Thr Asp Gly Asn Gln Thr Asn Ile Thr Met
                405                 410                 415     
Ser Ala Glu Val Ala Glu Leu Tyr Arg Leu Glu Leu Gly Asp Tyr Lys
            420                 425                 430         
Leu Val Glu Ile Thr Pro Ile Gly Leu Ala Pro Thr Asp Val Lys Arg
        435                 440                 445             
Tyr Thr Thr Gly Gly Thr Ser Arg Asn Lys Arg Gly Val Phe Val Leu
    450                 455                 460                 
Gly Phe Leu Gly Phe Leu Ala Thr Ala Gly Ser Ala Met Gly Ala Ala
465                 470                 475                 480 
Ser Leu Thr Leu Thr Ala Gln Ser Arg Thr Leu Leu Ala Gly Ile Val
                485                 490                 495     
Gln Gln Gln Gln Gln Leu Leu Asp Val Val Lys Arg Gln Gln Glu Leu
            500                 505                 510         
Leu Arg Leu Thr Val Trp Gly Thr Lys Asn Leu Gln Thr Arg Val Thr
        515                 520                 525             
Ala Ile Glu Lys Tyr Leu Lys Asp Gln Ala Gln Leu Asn Ala Trp Gly
    530                 535                 540                 
Cys Ala Phe Arg Gln Val Cys His Thr Thr Val Pro Trp Pro Asn Ala
545                 550                 555                 560 
Ser Leu Thr Pro Asp Trp Asn Asn Asp Thr Trp Gln Glu Trp Glu Arg
                565                 570                 575     
Lys Val Asp Phe Leu Glu Glu Asn Ile Thr Ala Leu Leu Glu Glu Ala
            580                 585                 590         
Gln Ile Gln Gln Glu Lys Asn Met Tyr Glu Leu Gln Lys Leu Asn Ser
        595                 600                 605             
Trp Asp Val Phe Gly Asn Trp Phe Asp Leu Ala Ser Trp Ile Arg Tyr
    610                 615                 620                 
Ile Gln Tyr Gly Ile Tyr Ile Val Val Gly Val Ile Leu Leu Arg Ile
625                 630                 635                 640 
Val Ile Tyr Ile Val Gln Ile Leu Ala Lys Leu Arg Gln Gly Tyr Arg
                645                 650                 655     
Pro Val Phe Ser Ser Pro Pro Ser Tyr Ser Gln Gln Thr His Ile Gln
            660                 665                 670         
Gln Asp Pro Ala Leu Pro Thr Arg Glu Gly Lys Glu Gly Asp Gly Gly
        675                 680                 685             
Glu Ser Gly Gly Asn Ser Ser Trp Pro Trp Gln Ile Glu Tyr Ile His
    690                 695                 700                 
Phe Leu Ile Arg Gln Leu Ile Arg Leu Leu Thr Trp Leu Phe Asn Asn
705                 710                 715                 720 
Cys Arg Thr Leu Leu Ser Arg
                725         


<210> 43
<211> 2182
<212> DNA
<213> SIV

<400> 43
gaaagctttg atgcttggga gaatacagtc acagaacagg caatagagga tgtatggcaa 60
ctctttgaga cctcaataaa gccttgtgta aaattatccc cattatgcat tactatgaga 120
tgcaataaaa gtgagacaga taaatgggga ttaacaaaat catcaacaac aacaacagca 180
acaacagcaa caccagcatc aacaacaagg acaacatcag caaaaataga catggtcaat 240
gagactagtt cttgtataac tcataataat tgcacaggct tggaacaaga gcaaatgata 300
agctgtaagt tcaacatgac agggttaaaa agagacaaga aaaaggagta caatgaaact 360
tggtactcta cagatttggt ttgtgaacaa gggaatagca ctgataatga aagtagatgc 420
tacatgaatc actgtaacac ttctgttatc caagagtctt gtgacaagca ttattgggat 480
actattagat ttaggtattg tgcacctcca ggttatgctt tgcttagatg taatgacaca 540
aattattcag gctttatgcc taaatgttct aaggtggtgg tctcttcatg cacaaggatg 600
atggagacac agacttctac ttggtttggc tttaatggaa ctagagcaga aaatagaact 660
tatatttact ggcatggtaa agataatagg actataatta gtttaaataa gtattataat 720
ctaacaatga aatgtagaag accaggaaat aagacagttt taccagtcac cattatgtct 780
ggattggttt tccactcaca accaatcaat gataggccaa agcaggcatg gtgttggttt 840
ggaggaaatt ggaaggatgc aataaaagag gtgaagcaga ccattgtcaa acatcccagg 900
tatactggaa ctaacaatac tgataagatc aatttgacgg ctcctagagg aggagatccg 960
gaagttacct tcatgtggac aaattgtaga ggagagtttc tctactgtaa aatgaattgg 1020
tttctaaatt gggtagaaga taggaatcta actaacaaga agtcaaagga acagcataaa 1080
aggaattacg tgccatgtca tattagacaa ataatcaaca cttggcataa agtaggcaaa 1140
aatgtttatt tgcctccaag agagggagac ctcacgtgta actccacagt gaccagtctc 1200
atagcaaaca tagattggac tgatggaaac caaactaata tcaccatgag tgcagaggtg 1260
gcagaactgt atcgattgga attgggagat tataaattag tagagatcac tccaattggc 1320
ttggccccca cagatgtgaa gaggtacact actggtggca cctcaagaaa taaaagaggg 1380
gtctttgtgc tagggttctt gggttttctc gcaacggcag gttctgcaat gggcgcggcg 1440
tcgttgacac tgaccgctca gtcccggact ttattggctg ggatagtgca gcaacagcaa 1500
cagctgttgg acgtggtcaa gagacaacaa gaattgttgc gactgaccgt ctggggaaca 1560
aagaacctcc agactagggt cactgccatc gagaagtact taaaggacca ggcgcagctg 1620
aatgcttggg gatgtgcgtt tagacaagtc tgccacacta ctgtaccatg gccaaatgca 1680
agtctaacac cagactggaa caatgatact tggcaagagt gggagcgaaa ggttgacttc 1740
ttggaggaaa atataacggc ccttctagaa gaggcacaaa ttcaacaaga gaagaacatg 1800
tatgaattac aaaagttgaa tagctgggat gtgtttggca attggtttga ccttgcttct 1860
tggataaggt atatacaata tggaatttat atagttgtag gagtaatact gttaagaata 1920
gtgatctata tagtacaaat actagctaag ttaaggcagg ggtataggcc agtgttctct 1980
tccccaccct cttattccca gcagacccat atccaacagg acccggcact gccaaccaga 2040
gaaggcaaag aaggagacgg tggagaaagc ggtggcaaca gctcctggcc ttggcagata 2100
gaatatattc atttcctgat ccgccaactg atacgcctct tgacttggct attcaacaac 2160
tgcagaacct tgctatcgag ag                                          2182

<210> 44
<211> 858
<212> PRT
<213> SIV

<220> 
<221> VARIANT        
<222> (0)...(0)
<223> Xaa = any amino acid

<400> 44
Met Arg Lys Pro Ile His Ile Ile Trp Gly Leu Ala Leu Leu Ile Gln
 1               5                  10                  15      
Phe Ile Glu Lys Gly Thr Asn Glu Asp Tyr Val Thr Val Phe Tyr Gly
            20                  25                  30          
Val Pro Val Trp Arg Asn Ala Thr Pro Thr Leu Phe Cys Ala Thr Asn
        35                  40                  45              
Ala Ser Met Thr Ser Thr Glu Val His Asn Val Trp Ala Thr Thr Ser
    50                  55                  60                  
Cys Val Pro Ile Asp Pro Asp Pro Ile Val Val Arg Leu Asn Thr Ser
65                  70                  75                  80  
Val Trp Phe Asn Ala Tyr Lys Asn Tyr Met Val Glu Ser Met Thr Glu
                85                  90                  95      
Asp Met Xaa Gln Leu Phe Gln Gln Ser His Lys Pro Cys Val Lys Leu
            100                 105                 110         
Thr Pro Met Cys Ile Lys Met Asn Cys Thr Gly Tyr Asn Gly Thr Pro
        115                 120                 125             
Thr Thr Pro Ser Thr Thr Thr Ser Thr Val Thr Pro Lys Thr Thr Thr
    130                 135                 140                 
Pro Ile Val Asp Gly Met Lys Leu Gln Glu Cys Asn Phe Asn Gln Ser
145                 150                 155                 160 
Thr Gly Phe Lys Asp Lys Lys Gln Lys Met Lys Ala Ile Phe Tyr Lys
                165                 170                 175     
Gly Asp Leu Met Lys Cys Gln Asp Asn Asn Glu Thr Asn Cys Tyr Tyr
            180                 185                 190         
Leu Trp His Cys Asn Thr Thr Thr Ile Thr Gln Ser Cys Glu Lys Ser
        195                 200                 205             
Thr Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Tyr Ala
    210                 215                 220                 
Ile Leu Arg Cys Glu Asp Glu Asp Phe Thr Gly Val Gly Met Cys Lys
225                 230                 235                 240 
Asn Val Ser Val Val His Cys Thr His Gly Ile Ser Pro Met Val Ala
                245                 250                 255     
Thr Trp Leu Leu Leu Asn Gly Thr Tyr Gln Thr Asn Thr Ser Val Val
            260                 265                 270         
Met Asn Gly Arg Lys Asn Glu Ser Val Leu Val Arg Phe Gly Lys Glu
        275                 280                 285             
Phe Glu Asn Leu Thr Ile Thr Cys Ile Arg Pro Gly Asn Arg Thr Val
    290                 295                 300                 
Arg Asn Leu Gln Ile Gly Pro Gly Met Thr Phe Tyr Asn Val Glu Ile
305                 310                 315                 320 
Ala Thr Gly Asp Thr Arg Lys Ala Phe Cys Thr Val Asn Lys Thr Leu
                325                 330                 335     
Trp Glu Gln Ala Arg Asn Lys Thr Glu His Val Leu Ala Glu His Trp
            340                 345                 350         
Lys Lys Val Asp Asn Lys Thr Asn Ala Lys Thr Ile Trp Thr Phe Gln
        355                 360                 365             
Asp Gly Asp Pro Glu Val Lys Val His Trp Phe Asn Cys Gln Gly Glu
    370                 375                 380                 
Phe Phe Tyr Cys Asp Ile Thr Pro Trp Phe Asn Ala Thr Tyr Thr Gly
385                 390                 395                 400 
Asn Leu Ile Thr Asn Gly Ala Leu Ile Ala His Cys Arg Ile Lys Gln
                405                 410                 415     
Ile Val Asn His Trp Gly Ile Val Ser Lys Gly Ile Tyr Leu Ala Pro
            420                 425                 430         
Arg Arg Gly Asn Val Ser Cys Thr Ser Ser Ile Thr Gly Ile Met Leu
        435                 440                 445             
Glu Gly Gln Ile Tyr Asn Glu Thr Val Lys Val Ser Pro Ala Ala Arg
    450                 455                 460                 
Val Ala Asp Gln Trp Arg Ala Glu Leu Ser Arg Tyr Gln Val Val Glu
465                 470                 475                 480 
Ile Xaa Pro Leu Ser Val Ala Pro Thr Thr Gly Lys Arg Pro Glu Ile
                485                 490                 495     
Lys Gln His Ser Arg Gln Lys Arg Gly Ile Gly Ile Gly Leu Phe Phe
            500                 505                 510         
Leu Gly Leu Leu Ser Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Ile
        515                 520                 525             
Ala Leu Thr Ala Gln Thr Arg Asn Leu Xaa His Gly Ile Val Gln Gln
    530                 535                 540                 
Gln Ala Asn Leu Leu Gln Ala Ile Glu Thr Gln Gln His Leu Leu Gln
545                 550                 555                 560 
Leu Ser Val Trp Gly Val Lys Gln Leu Gln Ala Arg Met Leu Ala Val
                565                 570                 575     
Glu Lys Tyr Leu Arg Asp Gln Gln Leu Leu Ser Leu Trp Gly Cys Ala
            580                 585                 590         
Asp Lys Val Thr Cys His Thr Thr Val Pro Trp Asn Asn Ser Trp Val
        595                 600                 605             
Asn Phe Thr Gln Thr Cys Ala Lys Asn Ser Ser Asp Ile Gln Cys Ile
    610                 615                 620                 
Trp Glu Asn Met Thr Trp Gln Glu Trp Asp Arg Leu Val Gln Asn Ser
625                 630                 635                 640 
Thr Gly Gln Ile Tyr Asn Ile Leu Gln Ile Ala His Glu Gln Gln Glu
                645                 650                 655     
Arg Asn Lys Lys Glu Leu Tyr Glu Leu Asp Lys Trp Ser Ser Leu Trp
            660                 665                 670         
Asn Trp Phe Asp Ile Thr Gln Trp Leu Trp Tyr Ile Lys Ile Phe Ile
        675                 680                 685             
Met Ile Val Gly Ala Ile Val Gly Leu Arg Ile Leu Leu Val Leu Val
    690                 695                 700                 
Ser Cys Leu Arg Lys Val Arg Gln Gly Tyr His Pro Leu Ser Phe Gln
705                 710                 715                 720 
Ile Pro Thr Gln Asn Gln Gln Asp Pro Glu Gln Pro Glu Glu Ile Arg
                725                 730                 735     
Glu Glu Gly Gly Arg Lys Asp Arg Ile Arg Trp Arg Ala Leu Gln His
            740                 745                 750         
Gly Phe Phe Ala Leu Leu Trp Val Asp Leu Thr Ser Ile Ile Gln Trp
        755                 760                 765             
Ile Tyr Gln Ile Cys Arg Thr Cys Leu Leu Asn Leu Trp Ala Val Leu
    770                 775                 780                 
Gln His Leu Cys Arg Ile Thr Phe Arg Leu Cys Asn His Leu Glu Asn
785                 790                 795                 800 
Asn Leu Ser Thr Leu Trp Thr Ile Ile Arg Thr Glu Ile Ile Lys Asn
                805                 810                 815     
Ile Asp Arg Leu Ala Ile Trp Val Gly Glu Lys Thr Asp Ser Ile Leu
            820                 825                 830         
Leu Ala Leu Gln Thr Ile Val Arg Ile Ile Arg Glu Val Pro Arg Arg
        835                 840                 845             
Ile Arg Gln Gly Leu Glu Ile Ala Leu Asn
    850                 855             


<210> 45
<211> 882
<212> PRT
<213> SIV

<400> 45
Met Gly Cys Leu Gly Asn Gln Leu Leu Ile Ala Ile Leu Phe Leu Ser
 1               5                  10                  15      
Ala Tyr Gly Ile Tyr Cys Ile Gln Tyr Val Thr Val Phe Tyr Gly Val
            20                  25                  30          
Pro Ala Trp Arg Asn Ala Thr Ile Pro Leu Phe Cys Val Thr Arg Asn
        35                  40                  45              
Arg Asp Thr Trp Gly Thr Thr Gln Cys Leu Pro Asp Asn Asp Asp Tyr
    50                  55                  60                  
Ser Glu Leu Ala Leu Asn Ile Thr Glu Ser Phe Asp Ala Trp Glu Asn
65                  70                  75                  80  
Thr Val Thr Glu Gln Ala Ile Glu Asp Val Trp His Leu Phe Glu Thr
                85                  90                  95      
Ser Ile Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Ile Thr Met Lys
            100                 105                 110         
Cys Asn Lys Ser Glu Thr Asp Lys Trp Gly Leu Thr Lys Ser Ser Thr
        115                 120                 125             
Thr Thr Ala Ala Pro Thr Thr Lys Thr Thr Thr Thr Lys Glu Ile Glu
    130                 135                 140                 
Val Val Asn Glu Asn Ser Thr Cys Val Asn Arg Asp Asn Cys Thr Gly
145                 150                 155                 160 
Leu Glu Gln Glu Pro Met Ile Ser Cys Lys Phe Asn Met Thr Gly Leu
                165                 170                 175     
Lys Arg Asp Lys Lys Arg Glu Tyr Asn Glu Thr Trp Tyr Ser Ala Asp
            180                 185                 190         
Leu Val Cys Glu Gln Gly Asn Ser Thr Glu Asp Glu Ser Arg Cys Tyr
        195                 200                 205             
Met Asn His Cys Asn Thr Ser Val Ile Gln Glu Ser Cys Asp Lys His
    210                 215                 220                 
Tyr Trp Asp Ala Ile Arg Phe Arg Tyr Cys Ala Pro Pro Gly Tyr Ala
225                 230                 235                 240 
Leu Leu Arg Cys Asn Asp Thr Lys Tyr Ser Gly Phe Met Pro Asn Cys
                245                 250                 255     
Ser Lys Val Val Val Ser Ser Cys Thr Arg Met Met Glu Thr Gln Thr
            260                 265                 270         
Ser Thr Trp Phe Gly Phe Asn Gly Thr Arg Ala Glu Asn Arg Thr Tyr
        275                 280                 285             
Ile Tyr Trp His Ser Lys Asp Asn Arg Thr Ile Ile Ser Leu Asn Lys
    290                 295                 300                 
Tyr Asn Asn Leu Thr Met Lys Cys Arg Arg Pro Gly Asn Lys Thr Val
305                 310                 315                 320 
Leu Pro Val Thr Ile Met Ser Gly Leu Val Phe His Ser Gln Pro Ile
                325                 330                 335     
Asn Glu Arg Pro Lys Gln Ala Trp Cys Arg Phe Glu Gly Asn Trp Lys
            340                 345                 350         
Glu Ala Ile Lys Glu Val Lys Gln Thr Ile Val Lys His Pro Arg Tyr
        355                 360                 365             
Thr Gly Thr Asn Asn Thr Asp Lys Ile Asn Leu Thr Ala Pro Arg Gly
    370                 375                 380                 
Gly Asp Pro Glu Val Thr Phe Met Trp Thr Asn Cys Arg Gly Glu Phe
385                 390                 395                 400 
Leu Tyr Cys Lys Met Asn Trp Phe Leu Asn Trp Val Glu Asp Lys Asn
                405                 410                 415     
Leu Thr Gly Thr Thr Gln Lys Pro Gln Glu Gln His Lys Arg Asn Tyr
            420                 425                 430         
Val Pro Cys His Ile Arg Gln Ile Ile Asn Thr Trp His Lys Val Gly
        435                 440                 445             
Lys Asn Val Tyr Leu Pro Pro Arg Glu Gly Asp Leu Thr Cys Asn Ser
    450                 455                 460                 
Thr Val Thr Ser Leu Ile Ala Asn Ile Asp Trp Ile Asp Gly Asn Gln
465                 470                 475                 480 
Thr Asn Ile Thr Met Ser Ala Glu Val Ala Glu Leu Tyr Arg Leu Glu
                485                 490                 495     
Leu Gly Asp Tyr Lys Leu Val Glu Ile Thr Pro Ile Gly Leu Ala Pro
            500                 505                 510         
Thr Asn Val Lys Arg Tyr Thr Thr Gly Gly Thr Pro Arg Asn Lys Arg
        515                 520                 525             
Gly Val Phe Val Leu Gly Phe Leu Gly Phe Leu Ala Thr Ala Gly Ser
    530                 535                 540                 
Ala Met Gly Ala Ala Ser Leu Thr Leu Thr Ala Gln Ser Arg Thr Leu
545                 550                 555                 560 
Leu Ala Gly Ile Val Gln Gln Gln Gln Gln Leu Leu Asp Val Val Lys
                565                 570                 575     
Arg Gln Gln Glu Leu Leu Arg Leu Thr Val Trp Gly Thr Lys Asn Leu
            580                 585                 590         
Gln Thr Arg Val Thr Ala Ile Glu Lys Tyr Leu Lys Asp Gln Ala Gln
        595                 600                 605             
Leu Asn Ala Trp Gly Cys Ala Phe Arg Gln Val Cys His Thr Thr Val
    610                 615                 620                 
Pro Trp Pro Asn Ala Ser Leu Thr Pro Asn Trp Asn Asn Glu Thr Trp
625                 630                 635                 640 
Gln Glu Trp Glu Arg Lys Val Asp Phe Leu Glu Glu Asn Ile Thr Ala
                645                 650                 655     
Leu Leu Glu Glu Ala Gln Ile Gln Gln Glu Lys Asn Met Tyr Glu Leu
            660                 665                 670         
Gln Lys Leu Asn Ser Trp Asp Val Phe Gly Asn Trp Phe Asp Leu Ala
        675                 680                 685             
Ser Trp Ile Arg Tyr Ile Gln Tyr Gly Val Tyr Ile Val Val Gly Val
    690                 695                 700                 
Ile Leu Leu Arg Ile Val Ile Tyr Ile Val Gln Met Leu Ala Lys Leu
705                 710                 715                 720 
Arg Gln Gly Tyr Arg Pro Val Phe Ser Ser Pro Pro Ser Tyr Phe Gln
                725                 730                 735     
Gln Thr His Ile Arg Gln Asp Gln Ala Leu Pro Thr Lys Glu Gly Thr
            740                 745                 750         
Glu Gly Asp Gly Gly Asp Ser Gly Gly Asn Ser Ser Trp Pro Trp Gln
        755                 760                 765             
Ile Glu Tyr Ile His Phe Leu Ile Arg Gln Leu Ile Arg Leu Leu Thr
    770                 775                 780                 
Trp Leu Phe Ser Asn Cys Arg Thr Leu Leu Ser Arg Ala Tyr Gln Ile
785                 790                 795                 800 
Leu Gln Pro Ile Phe Gln Arg Phe Ser Thr Thr Leu Gln Arg Val Arg
                805                 810                 815     
Glu Val Leu Arg Thr Glu Leu Thr Tyr Leu Gln Tyr Gly Trp Ser Tyr
            820                 825                 830         
Phe Gln Glu Ala Val Gln Val Ala Trp Arg Ser Ala Thr Glu Thr Leu
        835                 840                 845             
Ala Gly Ala Trp Gly Asp Leu Trp Glu Thr Leu Gly Arg Val Gly Arg
    850                 855                 860                 
Trp Ile Leu Ala Ile Pro Arg Arg Ile Arg Gln Glu Leu Glu Leu Thr
865                 870                 875                 880 
Leu Leu
        


<210> 46
<211> 2649
<212> DNA
<213> SIV

<400> 46
atgggatgtc ttgggaatca gctgcttatc gccatcttgt ttctaagtgc ctatgggatc 60
tattgcattc aatatgtcac agtcttttat ggtgtaccag cttggaggaa tgcgacaatt 120
cccctcttct gtgtaaccag gaatagggat acttggggaa caactcagtg cctaccagat 180
aatgatgatt attcagaatt ggcccttaat attacagaaa gctttgatgc ttgggagaat 240
acagtcacag aacaggcaat agaggatgta tggcatctct ttgagacctc aataaagcct 300
tgtgtaaaat taaccccatt atgcattact atgaaatgca acaaaagtga aacagataaa 360
tggggattga caaaatcatc aacaacaaca gcagcaccaa caacaaaaac aacaacaaca 420
aaggaaatag aagtggtcaa tgaaaatagt acttgtgtaa atcgtgataa ttgcacaggc 480
ttggaacaag agccaatgat aagctgtaaa ttcaacatga cagggttaaa aagagacaag 540
aaaagagagt acaatgaaac ttggtactct gcagatttgg tttgtgaaca aggtaatagc 600
actgaagatg aaagtagatg ttacatgaat cactgtaaca cttctgttat tcaagaatct 660
tgtgacaaac attattggga tgctattaga tttaggtatt gtgcacctcc aggttatgct 720
ttgcttagat gtaatgacac aaagtattca ggctttatgc ctaactgttc taaggtggtg 780
gtctcttcat gcacaagaat gatggagaca cagacttcta cttggtttgg ctttaatgga 840
actagagcag aaaatagaac ttatatttac tggcatagca aagataatag gactataatt 900
agtttgaata agtataataa tctaacaatg aaatgtagaa gaccaggaaa taagacagtt 960
ttaccagtca ccattatgtc tggattggtt ttccactcac aaccaatcaa tgaaaggcca 1020
aaacaggcat ggtgtaggtt tgaaggaaat tggaaggagg caataaaaga ggtgaagcag 1080
accattgtca aacatcccag gtatactgga actaacaata ctgataaaat caatttgacg 1140
gctcctcgag gaggagatcc ggaagttacc ttcatgtgga caaattgcag aggagagttt 1200
ctctactgta aaatgaattg gtttctaaat tgggtagaag ataagaatct gactggaact 1260
acccagaagc cacaggaaca gcataaaagg aattacgtgc catgtcatat tagacaaata 1320
atcaacactt ggcataaagt aggcaaaaat gtttatttgc ctccaagaga gggagacctc 1380
acgtgtaact ccacagtaac cagtctcata gcaaacatag attggattga tggaaaccaa 1440
actaatatca ccatgagtgc agaggtggca gaactgtatc gattggaatt gggagattat 1500
aaattagtag agatcactcc aattggcttg gcccccacaa atgtgaagag gtacactact 1560
ggtggcaccc caagaaataa aagaggggtc tttgtgctag ggttcttagg ttttctcgca 1620
acggcaggtt ctgcaatggg cgcggcgtcg ttgacgctga ccgctcagtc ccggacttta 1680
ttggctggga tagtgcagca acagcaacag ctgttggacg tggtcaagag acaacaagaa 1740
ttgttgcgac tgaccgtctg gggaacaaag aacctccaga ctagagtcac tgccatcgag 1800
aagtacttaa aggaccaggc gcagctaaat gcttggggat gtgcatttag acaagtctgc 1860
catactactg taccatggcc aaatgcaagt ctaacaccaa attggaacaa tgagacttgg 1920
caagagtggg agcgaaaggt tgacttcttg gaggaaaata taacggccct tctagaagag 1980
gcacaaattc aacaagaaaa gaacatgtat gaattacaaa agttgaatag ctgggatgtg 2040
tttggcaatt ggtttgacct tgcttcttgg ataaggtata tacaatacgg agtttatata 2100
gttgtaggag taatactgtt aagaatagtc atctatatag tacaaatgct agctaagtta 2160
aggcaagggt ataggccagt gttctcttcc ccaccttctt atttccagca gacccatatc 2220
cgacaggacc aagcactgcc aaccaaagaa ggaacagaag gagacggtgg agacagcggt 2280
ggcaacagtt cctggccttg gcagatagag tatattcatt tcctgatccg ccaactgata 2340
cgcctcttga cttggctatt cagcaactgc agaaccttgc tatcgagagc ataccagatc 2400
ctccaaccaa tattccagag attctccacg accctacaga gagtccgaga agtcctcagg 2460
actgaactaa cctacctaca atatgggtgg agctacttcc aagaagcggt ccaagtcgcc 2520
tggagatctg cgacagagac tcttgcgggc gcgtggggag acttatggga gactctggga 2580
agggttggaa gatggatact cgcaatccct aggaggatca gacaagagct tgagcttact 2640
ctcttgtga                                                         2649
