                         SEQUENCE LISTING

<110>  Fred Hutchinson Cancer Research Center
       University of Washington
       Dingens, Adam S.
       Dusenbury, Katharine
       Radford, Caelan
       Bloom, Jesse
 
<120>  Cell-stored barcoded deep mutational scanning libraries and uses 
       of the same

<130>  F053-0080PCT/18-078-WO-PCT

<150>  62/692,398
<151>  2018-06-29

<160>  58    

<170>  PatentIn version 3.5

<210>  1
<211>  63
<212>  DNA
<213>  artificial sequence

<220>
<223>  thosea asigna virus T2A

<400>  1
ggcagcggcg aaggccgcgg cagcctgctg acctgcggcg atgtggaaga aaacccgggc       60

ccg                                                                     63


<210>  2
<211>  66
<212>  DNA
<213>  artificial sequence

<220>
<223>  porcine teschovirus-1 P2A

<400>  2
ggcagcggcg cgaccaactt tagcctgctg aaacaggcgg gcgatgtgga agaaaacccg       60

ggcccg                                                                  66


<210>  3
<211>  69
<212>  DNA
<213>  artificial sequence

<220>
<223>  equine rhinitis A virus E2A

<400>  3
ggcagcggcc agtgcaccaa ctatgcgctg ctgaaactgg cgggcgatgt ggaaagcaac       60

ccgggcccg                                                               69


<210>  4
<211>  75
<212>  DNA
<213>  artificial sequence

<220>
<223>  foot-and-mouth disease virus F2A

<400>  4
ggcagcggcg tgaaacagac cctgaacttt gatctgctga aactggcggg cgatgtggaa       60

agcaacccgg gcccg                                                        75


<210>  5
<211>  90
<212>  DNA
<213>  artificial sequence

<220>
<223>  equine rhinitis B virus 12A

<400>  5
gaagcaactt tgtctaccat tctgtctgag ggtgccacaa atttttcttt gttgaagtta       60

gcaggggatg ttgaacttaa ccccggccca                                        90


<210>  6
<211>  90
<212>  DNA
<213>  artificial sequence

<220>
<223>  Saffold virus 2A

<400>  6
ttcactgatt ttttcaaagc cgttagagac tatcatgctt cttattacaa acagagactt       60

caacatgacg ttgaaacaaa ccctggccct                                        90


<210>  7
<211>  90
<212>  DNA
<213>  artificial sequence

<220>
<223>  Ljungan virus 2A

<400>  7
tactttaata taatgcacag tgatgaaatg gattttgccg gggggaaatt tttgaatcaa       60

tgtggtgatg tggaaactaa cccaggccct                                        90


<210>  8
<211>  90
<212>  DNA
<213>  artificial sequence

<220>
<223>  infectious flacherie virus 2A

<400>  8
ccctcaattg gtaatgtcgc gcggactctg acgagggcgg agattgagga tgaattgatt       60

cgtgcaggaa ttgaatcaaa tcctggacct                                        90


<210>  9
<211>  90
<212>  DNA
<213>  artificial sequence

<220>
<223>  Perina nuda picorna-like virus 2A1

<400>  9
ggacaaagga cgactgaaca gatagttacg gcccaggggt gggttccgga tttgactgtg       60

gatggagatg ttgagtcaaa tcccggaccc                                        90


<210>  10
<211>  90
<212>  DNA
<213>  artificial sequence

<220>
<223>  Perina nuda picorna-like virus 2A2

<400>  10
acgcgtggtg gtttacgacg gcaaaatatt attggtggtg ggcagaagga tttgacacaa       60

gatggtgaca tcgagtcgaa tcctgggccc                                        90


<210>  11
<211>  90
<212>  DNA
<213>  artificial sequence

<220>
<223>  Ectropis obliqua picorna-like virus 2A1

<400>  11
ggacaacgga caactgagca gatcgtgact gcacaaggtt gggccccgga tttgacacag       60

gatggagatg tagagtcaaa ccccggcccc                                        90


<210>  12
<211>  90
<212>  DNA
<213>  artificial sequence

<220>
<223>  Ectropis obliqua picorna-like virus 2A2

<400>  12
acacgtggtg gtttacagcg tcaaaacatt attggtggtg gccaaaggga tctgactcaa       60

gatggcgaca tcgagtcgaa ccccggccca                                        90


<210>  13
<211>  90
<212>  DNA
<213>  artificial sequence

<220>
<223>  Drosophila C virus 2A

<400>  13
caaggcatcg gtaagaagaa tccgaaacag gaagctgcac gtcagatgtt gctcttgtta       60

tcaggagatg ttgagactaa ccctggaccc                                        90


<210>  14
<211>  90
<212>  DNA
<213>  artificial sequence

<220>
<223>  acute bee paralysis virus 2A

<400>  14
actggttttt taaacaagtt atatcattgt ggctcatgga ctgacatatt gttgttgttg       60

tctggagatg tagaaaccaa tccaggacct                                        90


<210>  15
<211>  90
<212>  DNA
<213>  artificial sequence

<220>
<223>  Euprosterna elaeasa virus 2A

<400>  15
cgacgattgc cggagtccgc ccagctcccc caaggggcgg ggcgcggaag tctggtaaca       60

tgtggcgacg tggaggagaa tccagggccc                                        90


<210>  16
<211>  90
<212>  DNA
<213>  artificial sequence

<220>
<223>  Providence virus 2A1

<400>  16
ttggagatga aggagtctaa tagtggttac gtagtcggtg accgggggtc tcttctcact       60

tgtggggacg ttgaatccaa ccctggaccc                                        90


<210>  17
<211>  90
<212>  DNA
<213>  artificial sequence

<220>
<223>  Providence virus 2A3

<400>  17
acgcttatgg ggaacatcat gacacttgca gggtcaggtg gtcggggaag cttgctgacc       60

gcaggcgatg ttgaaaagaa ccctgggccc                                        90


<210>  18
<211>  95
<212>  DNA
<213>  artificial sequence

<220>
<223>  Bombyx mori cypovirus-1 2A

<400>  18
agaacagcgt tcgatttcca gcaggacgtt tttcgctcta attatgacct actaaagttg       60

tgcggtgata tcgagtctaa tcctggacct gttac                                  95


<210>  19
<211>  90
<212>  DNA
<213>  artificial sequence

<220>
<223>  Operophtera brumata cypovirus-18 2A

<400>  19
atccatgcta atgattatca gatggctgtg tttaaatcaa attatgattt gctgaagtta       60

tgcggggacg tggaatcaaa tcctggccct                                        90


<210>  20
<211>  90
<212>  DNA
<213>  artificial sequence

<220>
<223>  new adult diarrhea virus 2A

<400>  20
ttcttcgatt cggtttgggt gtaccacttg gcaaacagct cttgggttcg agatttaact       60

agagaatgca ttgaatctaa ccctggacca                                        90


<210>  21
<211>  21
<212>  PRT
<213>  artificial sequence

<220>
<223>  thosea asigna virus T2A

<400>  21

Gly Ser Gly Glu Gly Arg Gly Ser Leu Leu Thr Cys Gly Asp Val Glu 
1               5                   10                  15      


Glu Asn Pro Gly Pro 
            20      


<210>  22
<211>  22
<212>  PRT
<213>  artificial sequence

<220>
<223>  porcine teschovirus-1 P2A

<400>  22

Gly Ser Gly Ala Thr Asn Phe Ser Leu Leu Lys Gln Ala Gly Asp Val 
1               5                   10                  15      


Glu Glu Asn Pro Gly Pro 
            20          


<210>  23
<211>  23
<212>  PRT
<213>  artificial sequence

<220>
<223>  equine rhinitis A virus E2A

<400>  23

Gly Ser Gly Gln Cys Thr Asn Tyr Ala Leu Leu Lys Leu Ala Gly Asp 
1               5                   10                  15      


Val Glu Ser Asn Pro Gly Pro 
            20              


<210>  24
<211>  25
<212>  PRT
<213>  artificial sequence

<220>
<223>  foot-and-mouth disease virus F2A

<400>  24

Gly Ser Gly Val Lys Gln Thr Leu Asn Phe Asp Leu Leu Lys Leu Ala 
1               5                   10                  15      


Gly Asp Val Glu Ser Asn Pro Gly Pro 
            20                  25  


<210>  25
<211>  30
<212>  PRT
<213>  artificial sequence

<220>
<223>  equine rhinitis B virus 12A

<400>  25

Glu Ala Thr Leu Ser Thr Ile Leu Ser Glu Gly Ala Thr Asn Phe Ser 
1               5                   10                  15      


Leu Leu Lys Leu Ala Gly Asp Val Glu Leu Asn Pro Gly Pro 
            20                  25                  30  


<210>  26
<211>  30
<212>  PRT
<213>  artificial sequence

<220>
<223>  Saffold virus 2A

<400>  26

Phe Thr Asp Phe Phe Lys Ala Val Arg Asp Tyr His Ala Ser Tyr Tyr 
1               5                   10                  15      


Lys Gln Arg Leu Gln His Asp Val Glu Thr Asn Pro Gly Pro 
            20                  25                  30  


<210>  27
<211>  30
<212>  PRT
<213>  artificial sequence

<220>
<223>  Ljungan virus 2A

<400>  27

Tyr Phe Asn Ile Met His Ser Asp Glu Met Asp Phe Ala Gly Gly Lys 
1               5                   10                  15      


Phe Leu Asn Gln Cys Gly Asp Val Glu Thr Asn Pro Gly Pro 
            20                  25                  30  


<210>  28
<211>  30
<212>  PRT
<213>  artificial sequence

<220>
<223>  infectious flacherie virus 2A

<400>  28

Pro Ser Ile Gly Asn Val Ala Arg Thr Leu Thr Arg Ala Glu Ile Glu 
1               5                   10                  15      


Asp Glu Leu Ile Arg Ala Gly Ile Glu Ser Asn Pro Gly Pro 
            20                  25                  30  


<210>  29
<211>  30
<212>  PRT
<213>  artificial sequence

<220>
<223>  Perina nuda picorna-like virus 2A1

<400>  29

Gly Gln Arg Thr Thr Glu Gln Ile Val Thr Ala Gln Gly Trp Val Pro 
1               5                   10                  15      


Asp Leu Thr Val Asp Gly Asp Val Glu Ser Asn Pro Gly Pro 
            20                  25                  30  


<210>  30
<211>  30
<212>  PRT
<213>  artificial sequence

<220>
<223>  Perina nuda picorna-like virus 2A2

<400>  30

Thr Arg Gly Gly Leu Arg Arg Gln Asn Ile Ile Gly Gly Gly Gln Lys 
1               5                   10                  15      


Asp Leu Thr Gln Asp Gly Asp Ile Glu Ser Asn Pro Gly Pro 
            20                  25                  30  


<210>  31
<211>  30
<212>  PRT
<213>  artificial sequence

<220>
<223>  Ectropis obliqua picorna-like virus 2A1

<400>  31

Gly Gln Arg Thr Thr Glu Gln Ile Val Thr Ala Gln Gly Trp Ala Pro 
1               5                   10                  15      


Asp Leu Thr Gln Asp Gly Asp Val Glu Ser Asn Pro Gly Pro 
            20                  25                  30  


<210>  32
<211>  30
<212>  PRT
<213>  artificial sequence

<220>
<223>  Ectropis obliqua picorna-like virus 2A2

<400>  32

Thr Arg Gly Gly Leu Gln Arg Gln Asn Ile Ile Gly Gly Gly Gln Arg 
1               5                   10                  15      


Asp Leu Thr Gln Asp Gly Asp Ile Glu Ser Asn Pro Gly Pro 
            20                  25                  30  


<210>  33
<211>  30
<212>  PRT
<213>  artificial sequence

<220>
<223>  Drosophila C virus 2A

<400>  33

Gln Gly Ile Gly Lys Lys Asn Pro Lys Gln Glu Ala Ala Arg Gln Met 
1               5                   10                  15      


Leu Leu Leu Leu Ser Gly Asp Val Glu Thr Asn Pro Gly Pro 
            20                  25                  30  


<210>  34
<211>  30
<212>  PRT
<213>  artificial sequence

<220>
<223>  acute bee paralysis virus 2A

<400>  34

Thr Gly Phe Leu Asn Lys Leu Tyr His Cys Gly Ser Trp Thr Asp Ile 
1               5                   10                  15      


Leu Leu Leu Leu Ser Gly Asp Val Glu Thr Asn Pro Gly Pro 
            20                  25                  30  


<210>  35
<211>  30
<212>  PRT
<213>  artificial sequence

<220>
<223>  Euprosterna elaeasa virus 2A

<400>  35

Arg Arg Leu Pro Glu Ser Ala Gln Leu Pro Gln Gly Ala Gly Arg Gly 
1               5                   10                  15      


Ser Leu Val Thr Cys Gly Asp Val Glu Glu Asn Pro Gly Pro 
            20                  25                  30  


<210>  36
<211>  30
<212>  PRT
<213>  artificial sequence

<220>
<223>  Providence virus 2A1

<400>  36

Leu Glu Met Lys Glu Ser Asn Ser Gly Tyr Val Val Gly Asp Arg Gly 
1               5                   10                  15      


Ser Leu Leu Thr Cys Gly Asp Val Glu Ser Asn Pro Gly Pro 
            20                  25                  30  


<210>  37
<211>  30
<212>  PRT
<213>  artificial sequence

<220>
<223>  Providence virus 2A3

<400>  37

Thr Leu Met Gly Asn Ile Met Thr Leu Ala Gly Ser Gly Gly Arg Gly 
1               5                   10                  15      


Ser Leu Leu Thr Ala Gly Asp Val Glu Lys Asn Pro Gly Pro 
            20                  25                  30  


<210>  38
<211>  30
<212>  PRT
<213>  artificial sequence

<220>
<223>  Bombyx mori cypovirus-1 2A

<400>  38

Arg Thr Ala Phe Asp Phe Gln Gln Asp Val Phe Arg Ser Asn Tyr Asp 
1               5                   10                  15      


Leu Leu Lys Leu Cys Gly Asp Ile Glu Ser Asn Pro Gly Pro 
            20                  25                  30  


<210>  39
<211>  30
<212>  PRT
<213>  artificial sequence

<220>
<223>  Operophtera brumata cypovirus-18 2A

<400>  39

Ile His Ala Asn Asp Tyr Gln Met Ala Val Phe Lys Ser Asn Tyr Asp 
1               5                   10                  15      


Leu Leu Lys Leu Cys Gly Asp Val Glu Ser Asn Pro Gly Pro 
            20                  25                  30  


<210>  40
<211>  30
<212>  PRT
<213>  artificial sequence

<220>
<223>  new adult diarrhea virus 2A

<400>  40

Phe Phe Asp Ser Val Trp Val Tyr His Leu Ala Asn Ser Ser Trp Val 
1               5                   10                  15      


Arg Asp Leu Thr Arg Glu Cys Ile Glu Ser Asn Pro Gly Pro 
            20                  25                  30  


<210>  41
<211>  8
<212>  PRT
<213>  artificial sequence

<220>
<223>  Consensus motif


<220>
<221>  MISC_FEATURE
<222>  (2)..(2)
<223>  Xaa can be any amino acid

<220>
<221>  MISC_FEATURE
<222>  (4)..(4)
<223>  Xaa can be any amino acid

<400>  41

Asp Xaa Glu Xaa Asn Pro Gly Pro 
1               5               


<210>  42
<211>  280
<212>  DNA
<213>  artificial sequence

<220>
<223>  Rous sarcoma virus U3 (Cullen BR et al. (1985) Mol Cell Biol 
       5(3): 438-447)

<400>  42
agtcccctca ggatatagta gtttcgcttt tgcataggga gggggaaatg tagccttatg       60

caatactctt gtagtcttgc aacatgctta tgtaacgatg agttagcaac atgccttaca      120

aggagagaaa aagcaccgtg catgccgatt ggtggaagta aggtggtacg atcgtgcctt      180

attaggaagg caacagacgg gtctgacatg gattggacga accaccgaat tcgcattgca      240

gagagtattg tatttaagtg cctagctcga tacaataaac                            280


<210>  43
<211>  1503
<212>  DNA
<213>  artificial sequence

<220>
<223>  HIV-1 gag nucleotide sequence from plasmid psPAX2

<400>  43
atgggtgcga gagcgtcagt attaagcggg ggagaattag atcgatggga aaaaattcgg       60

ttaaggccag ggggaaagaa aaaatataaa ttaaaacata tagtatgggc aagcagggag      120

ctagaacgat tcgcagttaa tcctggcctg ttagaaacat cagaaggctg tagacaaata      180

ctgggacagc tacaaccatc ccttcagaca ggatcagaag aacttagatc attatataat      240

acagtagcaa ccctctattg tgtgcatcaa aggatagaga taaaagacac caaggaagct      300

ttagacaaga tagaggaaga gcaaaacaaa agtaagaaaa aagcacagca agcagcagct      360

gacacaggac acagcaatca ggtcagccaa aattacccta tagtgcagaa catccagggg      420

caaatggtac atcaggccat atcacctaga actttaaatg catgggtaaa agtagtagaa      480

gagaaggctt tcagcccaga agtgataccc atgttttcag cattatcaga aggagccacc      540

ccacaagatt taaacaccat gctaaacaca gtggggggac atcaagcagc catgcaaatg      600

ttaaaagaga ccatcaatga ggaagctgca gaatgggata gagtgcatcc agtgcatgca      660

gggcctattg caccaggcca gatgagagaa ccaaggggaa gtgacatagc aggaactact      720

agtacccttc aggaacaaat aggatggatg acacataatc cacctatccc agtaggagaa      780

atctataaaa gatggataat cctgggatta aataaaatag taagaatgta tagccctacc      840

agcattctgg acataagaca aggaccaaag gaacccttta gagactatgt agaccgattc      900

tataaaactc taagagccga gcaagcttca caagaggtaa aaaattggat gacagaaacc      960

ttgttggtcc aaaatgcgaa cccagattgt aagactattt taaaagcatt gggaccagga     1020

gcgacactag aagaaatgat gacagcatgt cagggagtgg ggggacccgg ccataaagca     1080

agagttttgg ctgaagcaat gagccaagta acaaatccag ctaccataat gatacagaaa     1140

ggcaatttta ggaaccaaag aaagactgtt aagtgtttca attgtggcaa agaagggcac     1200

atagccaaaa attgcagggc ccctaggaaa aagggctgtt ggaaatgtgg aaaggaagga     1260

caccaaatga aagattgtac tgagagacag gctaattttt tagggaagat ctggccttcc     1320

cacaagggaa ggccagggaa ttttcttcag agcagaccag agccaacagc cccaccagaa     1380

gagagcttca ggtttgggga agagacaaca actccctctc agaagcagga gccgatagac     1440

aaggaactgt atcctttagc ttccctcaga tcactctttg gcagcgaccc ctcgtcacaa     1500

taa                                                                   1503


<210>  44
<211>  500
<212>  PRT
<213>  artificial sequence

<220>
<223>  HIV-1 gag protein sequence from plasmid psPAX2

<400>  44

Met Gly Ala Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Arg Trp 
1               5                   10                  15      


Glu Lys Ile Arg Leu Arg Pro Gly Gly Lys Lys Lys Tyr Lys Leu Lys 
            20                  25                  30          


His Ile Val Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Val Asn Pro 
        35                  40                  45              


Gly Leu Leu Glu Thr Ser Glu Gly Cys Arg Gln Ile Leu Gly Gln Leu 
    50                  55                  60                  


Gln Pro Ser Leu Gln Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn 
65                  70                  75                  80  


Thr Val Ala Thr Leu Tyr Cys Val His Gln Arg Ile Glu Ile Lys Asp 
                85                  90                  95      


Thr Lys Glu Ala Leu Asp Lys Ile Glu Glu Glu Gln Asn Lys Ser Lys 
            100                 105                 110         


Lys Lys Ala Gln Gln Ala Ala Ala Asp Thr Gly His Ser Asn Gln Val 
        115                 120                 125             


Ser Gln Asn Tyr Pro Ile Val Gln Asn Ile Gln Gly Gln Met Val His 
    130                 135                 140                 


Gln Ala Ile Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu 
145                 150                 155                 160 


Glu Lys Ala Phe Ser Pro Glu Val Ile Pro Met Phe Ser Ala Leu Ser 
                165                 170                 175     


Glu Gly Ala Thr Pro Gln Asp Leu Asn Thr Met Leu Asn Thr Val Gly 
            180                 185                 190         


Gly His Gln Ala Ala Met Gln Met Leu Lys Glu Thr Ile Asn Glu Glu 
        195                 200                 205             


Ala Ala Glu Trp Asp Arg Val His Pro Val His Ala Gly Pro Ile Ala 
    210                 215                 220                 


Pro Gly Gln Met Arg Glu Pro Arg Gly Ser Asp Ile Ala Gly Thr Thr 
225                 230                 235                 240 


Ser Thr Leu Gln Glu Gln Ile Gly Trp Met Thr His Asn Pro Pro Ile 
                245                 250                 255     


Pro Val Gly Glu Ile Tyr Lys Arg Trp Ile Ile Leu Gly Leu Asn Lys 
            260                 265                 270         


Ile Val Arg Met Tyr Ser Pro Thr Ser Ile Leu Asp Ile Arg Gln Gly 
        275                 280                 285             


Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu 
    290                 295                 300                 


Arg Ala Glu Gln Ala Ser Gln Glu Val Lys Asn Trp Met Thr Glu Thr 
305                 310                 315                 320 


Leu Leu Val Gln Asn Ala Asn Pro Asp Cys Lys Thr Ile Leu Lys Ala 
                325                 330                 335     


Leu Gly Pro Gly Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gln Gly 
            340                 345                 350         


Val Gly Gly Pro Gly His Lys Ala Arg Val Leu Ala Glu Ala Met Ser 
        355                 360                 365             


Gln Val Thr Asn Pro Ala Thr Ile Met Ile Gln Lys Gly Asn Phe Arg 
    370                 375                 380                 


Asn Gln Arg Lys Thr Val Lys Cys Phe Asn Cys Gly Lys Glu Gly His 
385                 390                 395                 400 


Ile Ala Lys Asn Cys Arg Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys 
                405                 410                 415     


Gly Lys Glu Gly His Gln Met Lys Asp Cys Thr Glu Arg Gln Ala Asn 
            420                 425                 430         


Phe Leu Gly Lys Ile Trp Pro Ser His Lys Gly Arg Pro Gly Asn Phe 
        435                 440                 445             


Leu Gln Ser Arg Pro Glu Pro Thr Ala Pro Pro Glu Glu Ser Phe Arg 
    450                 455                 460                 


Phe Gly Glu Glu Thr Thr Thr Pro Ser Gln Lys Gln Glu Pro Ile Asp 
465                 470                 475                 480 


Lys Glu Leu Tyr Pro Leu Ala Ser Leu Arg Ser Leu Phe Gly Ser Asp 
                485                 490                 495     


Pro Ser Ser Gln 
            500 


<210>  45
<211>  1503
<212>  DNA
<213>  artificial sequence

<220>
<223>  HIV-1 gag nucleotide sequence from plasmid pNL4-3 (GenBank 
       accession no. AF324493.2)

<400>  45
atgggtgcga gagcgtcggt attaagcggg ggagaattag ataaatggga aaaaattcgg       60

ttaaggccag ggggaaagaa acaatataaa ctaaaacata tagtatgggc aagcagggag      120

ctagaacgat tcgcagttaa tcctggcctt ttagagacat cagaaggctg tagacaaata      180

ctgggacagc tacaaccatc ccttcagaca ggatcagaag aacttagatc attatataat      240

acaatagcag tcctctattg tgtgcatcaa aggatagatg taaaagacac caaggaagcc      300

ttagataaga tagaggaaga gcaaaacaaa agtaagaaaa aggcacagca agcagcagct      360

gacacaggaa acaacagcca ggtcagccaa aattacccta tagtgcagaa cctccagggg      420

caaatggtac atcaggccat atcacctaga actttaaatg catgggtaaa agtagtagaa      480

gagaaggctt tcagcccaga agtaataccc atgttttcag cattatcaga aggagccacc      540

ccacaagatt taaataccat gctaaacaca gtggggggac atcaagcagc catgcaaatg      600

ttaaaagaga ccatcaatga ggaagctgca gaatgggata gattgcatcc agtgcatgca      660

gggcctattg caccaggcca gatgagagaa ccaaggggaa gtgacatagc aggaactact      720

agtacccttc aggaacaaat aggatggatg acacataatc cacctatccc agtaggagaa      780

atctataaaa gatggataat cctgggatta aataaaatag taagaatgta tagccctacc      840

agcattctgg acataagaca aggaccaaag gaacccttta gagactatgt agaccgattc      900

tataaaactc taagagccga gcaagcttca caagaggtaa aaaattggat gacagaaacc      960

ttgttggtcc aaaatgcgaa cccagattgt aagactattt taaaagcatt gggaccagga     1020

gcgacactag aagaaatgat gacagcatgt cagggagtgg ggggacccgg ccataaagca     1080

agagttttgg ctgaagcaat gagccaagta acaaatccag ctaccataat gatacagaaa     1140

ggcaatttta ggaaccaaag aaagactgtt aagtgtttca attgtggcaa agaagggcac     1200

atagccaaaa attgcagggc ccctaggaaa aagggctgtt ggaaatgtgg aaaggaagga     1260

caccaaatga aagattgtac tgagagacag gctaattttt tagggaagat ctggccttcc     1320

cacaagggaa ggccagggaa ttttcttcag agcagaccag agccaacagc cccaccagaa     1380

gagagcttca ggtttgggga agagacaaca actccctctc agaagcagga gccgatagac     1440

aaggaactgt atcctttagc ttccctcaga tcactctttg gcagcgaccc ctcgtcacaa     1500

taa                                                                   1503


<210>  46
<211>  500
<212>  PRT
<213>  artificial sequence

<220>
<223>  HIV-1 gag protein sequence from plasmid pNL4-3 (GenBank accession
       no. AAK08483.1)

<400>  46

Met Gly Ala Arg Ala Ser Val Leu Ser Gly Gly Glu Leu Asp Lys Trp 
1               5                   10                  15      


Glu Lys Ile Arg Leu Arg Pro Gly Gly Lys Lys Gln Tyr Lys Leu Lys 
            20                  25                  30          


His Ile Val Trp Ala Ser Arg Glu Leu Glu Arg Phe Ala Val Asn Pro 
        35                  40                  45              


Gly Leu Leu Glu Thr Ser Glu Gly Cys Arg Gln Ile Leu Gly Gln Leu 
    50                  55                  60                  


Gln Pro Ser Leu Gln Thr Gly Ser Glu Glu Leu Arg Ser Leu Tyr Asn 
65                  70                  75                  80  


Thr Ile Ala Val Leu Tyr Cys Val His Gln Arg Ile Asp Val Lys Asp 
                85                  90                  95      


Thr Lys Glu Ala Leu Asp Lys Ile Glu Glu Glu Gln Asn Lys Ser Lys 
            100                 105                 110         


Lys Lys Ala Gln Gln Ala Ala Ala Asp Thr Gly Asn Asn Ser Gln Val 
        115                 120                 125             


Ser Gln Asn Tyr Pro Ile Val Gln Asn Leu Gln Gly Gln Met Val His 
    130                 135                 140                 


Gln Ala Ile Ser Pro Arg Thr Leu Asn Ala Trp Val Lys Val Val Glu 
145                 150                 155                 160 


Glu Lys Ala Phe Ser Pro Glu Val Ile Pro Met Phe Ser Ala Leu Ser 
                165                 170                 175     


Glu Gly Ala Thr Pro Gln Asp Leu Asn Thr Met Leu Asn Thr Val Gly 
            180                 185                 190         


Gly His Gln Ala Ala Met Gln Met Leu Lys Glu Thr Ile Asn Glu Glu 
        195                 200                 205             


Ala Ala Glu Trp Asp Arg Leu His Pro Val His Ala Gly Pro Ile Ala 
    210                 215                 220                 


Pro Gly Gln Met Arg Glu Pro Arg Gly Ser Asp Ile Ala Gly Thr Thr 
225                 230                 235                 240 


Ser Thr Leu Gln Glu Gln Ile Gly Trp Met Thr His Asn Pro Pro Ile 
                245                 250                 255     


Pro Val Gly Glu Ile Tyr Lys Arg Trp Ile Ile Leu Gly Leu Asn Lys 
            260                 265                 270         


Ile Val Arg Met Tyr Ser Pro Thr Ser Ile Leu Asp Ile Arg Gln Gly 
        275                 280                 285             


Pro Lys Glu Pro Phe Arg Asp Tyr Val Asp Arg Phe Tyr Lys Thr Leu 
    290                 295                 300                 


Arg Ala Glu Gln Ala Ser Gln Glu Val Lys Asn Trp Met Thr Glu Thr 
305                 310                 315                 320 


Leu Leu Val Gln Asn Ala Asn Pro Asp Cys Lys Thr Ile Leu Lys Ala 
                325                 330                 335     


Leu Gly Pro Gly Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gln Gly 
            340                 345                 350         


Val Gly Gly Pro Gly His Lys Ala Arg Val Leu Ala Glu Ala Met Ser 
        355                 360                 365             


Gln Val Thr Asn Pro Ala Thr Ile Met Ile Gln Lys Gly Asn Phe Arg 
    370                 375                 380                 


Asn Gln Arg Lys Thr Val Lys Cys Phe Asn Cys Gly Lys Glu Gly His 
385                 390                 395                 400 


Ile Ala Lys Asn Cys Arg Ala Pro Arg Lys Lys Gly Cys Trp Lys Cys 
                405                 410                 415     


Gly Lys Glu Gly His Gln Met Lys Asp Cys Thr Glu Arg Gln Ala Asn 
            420                 425                 430         


Phe Leu Gly Lys Ile Trp Pro Ser His Lys Gly Arg Pro Gly Asn Phe 
        435                 440                 445             


Leu Gln Ser Arg Pro Glu Pro Thr Ala Pro Pro Glu Glu Ser Phe Arg 
    450                 455                 460                 


Phe Gly Glu Glu Thr Thr Thr Pro Ser Gln Lys Gln Glu Pro Ile Asp 
465                 470                 475                 480 


Lys Glu Leu Tyr Pro Leu Ala Ser Leu Arg Ser Leu Phe Gly Ser Asp 
                485                 490                 495     


Pro Ser Ser Gln 
            500 


<210>  47
<211>  3012
<212>  DNA
<213>  artificial sequence

<220>
<223>  HIV-1 pol nucleotide sequence from plasmid psPAX2

<400>  47
ttttttaggg aagatctggc cttcccacaa gggaaggcca gggaattttc ttcagagcag       60

accagagcca acagccccac cagaagagag cttcaggttt ggggaagaga caacaactcc      120

ctctcagaag caggagccga tagacaagga actgtatcct ttagcttccc tcagatcact      180

ctttggcagc gacccctcgt cacaataaag ataggggggc aattaaagga agctctatta      240

gatacaggag cagatgatac agtattagaa gaaatgaatt tgccaggaag atggaaacca      300

aaaatgatag ggggaattgg aggttttatc aaagtaagac agtatgatca gatactcata      360

gaaatctgcg gacataaagc tataggtaca gtattagtag gacctacacc tgtcaacata      420

attggaagaa atctgttgac tcagattggc tgcactttaa attttcccat tagtcctatt      480

gagactgtac cagtaaaatt aaagccagga atggatggcc caaaagttaa acaatggcca      540

ttgacagaag aaaaaataaa agcattagta gaaatttgta cagaaatgga aaaggaagga      600

aaaatttcaa aaattgggcc tgaaaatcca tacaatactc cagtatttgc cataaagaaa      660

aaagacagta ctaaatggag aaaattagta gatttcagag aacttaataa gagaactcaa      720

gatttctggg aagttcaatt aggaatacca catcctgcag ggttaaaaca gaaaaaatca      780

gtaacagtac tggatgtggg cgatgcatat ttttcagttc ccttagataa agacttcagg      840

aagtatactg catttaccat acctagtata aacaatgaga caccagggat tagatatcag      900

tacaatgtgc ttccacaggg atggaaagga tcaccagcaa tattccagtg tagcatgaca      960

aaaatcttag agccttttag aaaacaaaat ccagacatag tcatctatca atacatggat     1020

gatttgtatg taggatctga cttagaaata gggcagcata gaacaaaaat agaggaactg     1080

agacaacatc tgttgaggtg gggatttacc acaccagaca aaaaacatca gaaagaacct     1140

ccattccttt ggatgggtta tgaactccat cctgataaat ggacagtaca gcctatagtg     1200

ctgccagaaa aggacagctg gactgtcaat gacatacaga aattagtggg aaaattgaat     1260

tgggcaagtc agatttatgc agggattaaa gtaaggcaat tatgtaaact tcttagggga     1320

accaaagcac taacagaagt agtaccacta acagaagaag cagagctaga actggcagaa     1380

aacagggaga ttctaaaaga accggtacat ggagtgtatt atgacccatc aaaagactta     1440

atagcagaaa tacagaagca ggggcaaggc caatggacat atcaaattta tcaagagcca     1500

tttaaaaatc tgaaaacagg aaagtatgca agaatgaagg gtgcccacac taatgatgtg     1560

aaacaattaa cagaggcagt acaaaaaata gccacagaaa gcatagtaat atggggaaag     1620

actcctaaat ttaaattacc catacaaaag gaaacatggg aagcatggtg gacagagtat     1680

tggcaagcca cctggattcc tgagtgggag tttgtcaata cccctccctt agtgaagtta     1740

tggtaccagt tagagaaaga acccataata ggagcagaaa ctttctatgt agatggggca     1800

gccaataggg aaactaaatt aggaaaagca ggatatgtaa ctgacagagg aagacaaaaa     1860

gttgtccccc taacggacac aacaaatcag aagactgagt tacaagcaat tcatctagct     1920

ttgcaggatt cgggattaga agtaaacata gtgacagact cacaatatgc attgggaatc     1980

attcaagcac aaccagataa gagtgaatca gagttagtca gtcaaataat agagcagtta     2040

ataaaaaagg aaaaagtcta cctggcatgg gtaccagcac acaaaggaat tggaggaaat     2100

gaacaagtag ataaattggt cagtgctgga atcaggaaag tactattttt agatggaata     2160

gataaggccc aagaagaaca tgagaaatat cacagtaatt ggagagcaat ggctagtgat     2220

tttaacctac cacctgtagt agcaaaagaa atagtagcca gctgtgataa atgtcagcta     2280

aaaggggaag ccatgcatgg acaagtagac tgtagcccag gaatatggca gctagattgt     2340

acacatttag aaggaaaagt tatcttggta gcagttcatg tagccagtgg atatatagaa     2400

gcagaagtaa ttccagcaga gacagggcaa gaaacagcat acttcctctt aaaattagca     2460

ggaagatggc cagtaaaaac agtacataca gacaatggca gcaatttcac cagtactaca     2520

gttaaggccg cctgttggtg ggcggggatc aagcaggaat ttggcattcc ctacaatccc     2580

caaagtcaag gagtaataga atctatgaat aaagaattaa agaaaattat aggacaggta     2640

agagatcagg ctgaacatct taagacagca gtacaaatgg cagtattcat ccacaatttt     2700

aaaagaaaag gggggattgg ggggtacagt gcaggggaaa gaatagtaga cataatagca     2760

acagacatac aaactaaaga attacaaaaa caaattacaa aaattcaaaa ttttcgggtt     2820

tattacaggg acagcagaga tccagtttgg aaaggaccag caaagctcct ctggaaaggt     2880

gaaggggcag tagtaataca agataatagt gacataaaag tagtgccaag aagaaaagca     2940

aagatcatca gggattatgg aaaacagatg gcaggtgatg attgtgtggc aagtagacag     3000

gatgaggatt aa                                                         3012


<210>  48
<211>  1003
<212>  PRT
<213>  artificial sequence

<220>
<223>  HIV-1 pol protein sequence from psPAX2

<400>  48

Phe Phe Arg Glu Asp Leu Ala Phe Pro Gln Gly Lys Ala Arg Glu Phe 
1               5                   10                  15      


Ser Ser Glu Gln Thr Arg Ala Asn Ser Pro Thr Arg Arg Glu Leu Gln 
            20                  25                  30          


Val Trp Gly Arg Asp Asn Asn Ser Leu Ser Glu Ala Gly Ala Asp Arg 
        35                  40                  45              


Gln Gly Thr Val Ser Phe Ser Phe Pro Gln Ile Thr Leu Trp Gln Arg 
    50                  55                  60                  


Pro Leu Val Thr Ile Lys Ile Gly Gly Gln Leu Lys Glu Ala Leu Leu 
65                  70                  75                  80  


Asp Thr Gly Ala Asp Asp Thr Val Leu Glu Glu Met Asn Leu Pro Gly 
                85                  90                  95      


Arg Trp Lys Pro Lys Met Ile Gly Gly Ile Gly Gly Phe Ile Lys Val 
            100                 105                 110         


Arg Gln Tyr Asp Gln Ile Leu Ile Glu Ile Cys Gly His Lys Ala Ile 
        115                 120                 125             


Gly Thr Val Leu Val Gly Pro Thr Pro Val Asn Ile Ile Gly Arg Asn 
    130                 135                 140                 


Leu Leu Thr Gln Ile Gly Cys Thr Leu Asn Phe Pro Ile Ser Pro Ile 
145                 150                 155                 160 


Glu Thr Val Pro Val Lys Leu Lys Pro Gly Met Asp Gly Pro Lys Val 
                165                 170                 175     


Lys Gln Trp Pro Leu Thr Glu Glu Lys Ile Lys Ala Leu Val Glu Ile 
            180                 185                 190         


Cys Thr Glu Met Glu Lys Glu Gly Lys Ile Ser Lys Ile Gly Pro Glu 
        195                 200                 205             


Asn Pro Tyr Asn Thr Pro Val Phe Ala Ile Lys Lys Lys Asp Ser Thr 
    210                 215                 220                 


Lys Trp Arg Lys Leu Val Asp Phe Arg Glu Leu Asn Lys Arg Thr Gln 
225                 230                 235                 240 


Asp Phe Trp Glu Val Gln Leu Gly Ile Pro His Pro Ala Gly Leu Lys 
                245                 250                 255     


Gln Lys Lys Ser Val Thr Val Leu Asp Val Gly Asp Ala Tyr Phe Ser 
            260                 265                 270         


Val Pro Leu Asp Lys Asp Phe Arg Lys Tyr Thr Ala Phe Thr Ile Pro 
        275                 280                 285             


Ser Ile Asn Asn Glu Thr Pro Gly Ile Arg Tyr Gln Tyr Asn Val Leu 
    290                 295                 300                 


Pro Gln Gly Trp Lys Gly Ser Pro Ala Ile Phe Gln Cys Ser Met Thr 
305                 310                 315                 320 


Lys Ile Leu Glu Pro Phe Arg Lys Gln Asn Pro Asp Ile Val Ile Tyr 
                325                 330                 335     


Gln Tyr Met Asp Asp Leu Tyr Val Gly Ser Asp Leu Glu Ile Gly Gln 
            340                 345                 350         


His Arg Thr Lys Ile Glu Glu Leu Arg Gln His Leu Leu Arg Trp Gly 
        355                 360                 365             


Phe Thr Thr Pro Asp Lys Lys His Gln Lys Glu Pro Pro Phe Leu Trp 
    370                 375                 380                 


Met Gly Tyr Glu Leu His Pro Asp Lys Trp Thr Val Gln Pro Ile Val 
385                 390                 395                 400 


Leu Pro Glu Lys Asp Ser Trp Thr Val Asn Asp Ile Gln Lys Leu Val 
                405                 410                 415     


Gly Lys Leu Asn Trp Ala Ser Gln Ile Tyr Ala Gly Ile Lys Val Arg 
            420                 425                 430         


Gln Leu Cys Lys Leu Leu Arg Gly Thr Lys Ala Leu Thr Glu Val Val 
        435                 440                 445             


Pro Leu Thr Glu Glu Ala Glu Leu Glu Leu Ala Glu Asn Arg Glu Ile 
    450                 455                 460                 


Leu Lys Glu Pro Val His Gly Val Tyr Tyr Asp Pro Ser Lys Asp Leu 
465                 470                 475                 480 


Ile Ala Glu Ile Gln Lys Gln Gly Gln Gly Gln Trp Thr Tyr Gln Ile 
                485                 490                 495     


Tyr Gln Glu Pro Phe Lys Asn Leu Lys Thr Gly Lys Tyr Ala Arg Met 
            500                 505                 510         


Lys Gly Ala His Thr Asn Asp Val Lys Gln Leu Thr Glu Ala Val Gln 
        515                 520                 525             


Lys Ile Ala Thr Glu Ser Ile Val Ile Trp Gly Lys Thr Pro Lys Phe 
    530                 535                 540                 


Lys Leu Pro Ile Gln Lys Glu Thr Trp Glu Ala Trp Trp Thr Glu Tyr 
545                 550                 555                 560 


Trp Gln Ala Thr Trp Ile Pro Glu Trp Glu Phe Val Asn Thr Pro Pro 
                565                 570                 575     


Leu Val Lys Leu Trp Tyr Gln Leu Glu Lys Glu Pro Ile Ile Gly Ala 
            580                 585                 590         


Glu Thr Phe Tyr Val Asp Gly Ala Ala Asn Arg Glu Thr Lys Leu Gly 
        595                 600                 605             


Lys Ala Gly Tyr Val Thr Asp Arg Gly Arg Gln Lys Val Val Pro Leu 
    610                 615                 620                 


Thr Asp Thr Thr Asn Gln Lys Thr Glu Leu Gln Ala Ile His Leu Ala 
625                 630                 635                 640 


Leu Gln Asp Ser Gly Leu Glu Val Asn Ile Val Thr Asp Ser Gln Tyr 
                645                 650                 655     


Ala Leu Gly Ile Ile Gln Ala Gln Pro Asp Lys Ser Glu Ser Glu Leu 
            660                 665                 670         


Val Ser Gln Ile Ile Glu Gln Leu Ile Lys Lys Glu Lys Val Tyr Leu 
        675                 680                 685             


Ala Trp Val Pro Ala His Lys Gly Ile Gly Gly Asn Glu Gln Val Asp 
    690                 695                 700                 


Lys Leu Val Ser Ala Gly Ile Arg Lys Val Leu Phe Leu Asp Gly Ile 
705                 710                 715                 720 


Asp Lys Ala Gln Glu Glu His Glu Lys Tyr His Ser Asn Trp Arg Ala 
                725                 730                 735     


Met Ala Ser Asp Phe Asn Leu Pro Pro Val Val Ala Lys Glu Ile Val 
            740                 745                 750         


Ala Ser Cys Asp Lys Cys Gln Leu Lys Gly Glu Ala Met His Gly Gln 
        755                 760                 765             


Val Asp Cys Ser Pro Gly Ile Trp Gln Leu Asp Cys Thr His Leu Glu 
    770                 775                 780                 


Gly Lys Val Ile Leu Val Ala Val His Val Ala Ser Gly Tyr Ile Glu 
785                 790                 795                 800 


Ala Glu Val Ile Pro Ala Glu Thr Gly Gln Glu Thr Ala Tyr Phe Leu 
                805                 810                 815     


Leu Lys Leu Ala Gly Arg Trp Pro Val Lys Thr Val His Thr Asp Asn 
            820                 825                 830         


Gly Ser Asn Phe Thr Ser Thr Thr Val Lys Ala Ala Cys Trp Trp Ala 
        835                 840                 845             


Gly Ile Lys Gln Glu Phe Gly Ile Pro Tyr Asn Pro Gln Ser Gln Gly 
    850                 855                 860                 


Val Ile Glu Ser Met Asn Lys Glu Leu Lys Lys Ile Ile Gly Gln Val 
865                 870                 875                 880 


Arg Asp Gln Ala Glu His Leu Lys Thr Ala Val Gln Met Ala Val Phe 
                885                 890                 895     


Ile His Asn Phe Lys Arg Lys Gly Gly Ile Gly Gly Tyr Ser Ala Gly 
            900                 905                 910         


Glu Arg Ile Val Asp Ile Ile Ala Thr Asp Ile Gln Thr Lys Glu Leu 
        915                 920                 925             


Gln Lys Gln Ile Thr Lys Ile Gln Asn Phe Arg Val Tyr Tyr Arg Asp 
    930                 935                 940                 


Ser Arg Asp Pro Val Trp Lys Gly Pro Ala Lys Leu Leu Trp Lys Gly 
945                 950                 955                 960 


Glu Gly Ala Val Val Ile Gln Asp Asn Ser Asp Ile Lys Val Val Pro 
                965                 970                 975     


Arg Arg Lys Ala Lys Ile Ile Arg Asp Tyr Gly Lys Gln Met Ala Gly 
            980                 985                 990         


Asp Asp Cys Val Ala Ser Arg Gln  Asp Glu Asp 
        995                 1000             


<210>  49
<211>  3012
<212>  DNA
<213>  artificial sequence

<220>
<223>  HIV-1 pol nucleotide sequence from plasmid pNL4-3 (GenBank 
       accession no. AF324493.2)

<400>  49
ttttttaggg aagatctggc cttcccacaa gggaaggcca gggaattttc ttcagagcag       60

accagagcca acagccccac cagaagagag cttcaggttt ggggaagaga caacaactcc      120

ctctcagaag caggagccga tagacaagga actgtatcct ttagcttccc tcagatcact      180

ctttggcagc gacccctcgt cacaataaag ataggggggc aattaaagga agctctatta      240

gatacaggag cagatgatac agtattagaa gaaatgaatt tgccaggaag atggaaacca      300

aaaatgatag ggggaattgg aggttttatc aaagtaagac agtatgatca gatactcata      360

gaaatctgcg gacataaagc tataggtaca gtattagtag gacctacacc tgtcaacata      420

attggaagaa atctgttgac tcagattggc tgcactttaa attttcccat tagtcctatt      480

gagactgtac cagtaaaatt aaagccagga atggatggcc caaaagttaa acaatggcca      540

ttgacagaag aaaaaataaa agcattagta gaaatttgta cagaaatgga aaaggaagga      600

aaaatttcaa aaattgggcc tgaaaatcca tacaatactc cagtatttgc cataaagaaa      660

aaagacagta ctaaatggag aaaattagta gatttcagag aacttaataa gagaactcaa      720

gatttctggg aagttcaatt aggaatacca catcctgcag ggttaaaaca gaaaaaatca      780

gtaacagtac tggatgtggg cgatgcatat ttttcagttc ccttagataa agacttcagg      840

aagtatactg catttaccat acctagtata aacaatgaga caccagggat tagatatcag      900

tacaatgtgc ttccacaggg atggaaagga tcaccagcaa tattccagtg tagcatgaca      960

aaaatcttag agccttttag aaaacaaaat ccagacatag tcatctatca atacatggat     1020

gatttgtatg taggatctga cttagaaata gggcagcata gaacaaaaat agaggaactg     1080

agacaacatc tgttgaggtg gggatttacc acaccagaca aaaaacatca gaaagaacct     1140

ccattccttt ggatgggtta tgaactccat cctgataaat ggacagtaca gcctatagtg     1200

ctgccagaaa aggacagctg gactgtcaat gacatacaga aattagtggg aaaattgaat     1260

tgggcaagtc agatttatgc agggattaaa gtaaggcaat tatgtaaact tcttagggga     1320

accaaagcac taacagaagt agtaccacta acagaagaag cagagctaga actggcagaa     1380

aacagggaga ttctaaaaga accggtacat ggagtgtatt atgacccatc aaaagactta     1440

atagcagaaa tacagaagca ggggcaaggc caatggacat atcaaattta tcaagagcca     1500

tttaaaaatc tgaaaacagg aaagtatgca agaatgaagg gtgcccacac taatgatgtg     1560

aaacaattaa cagaggcagt acaaaaaata gccacagaaa gcatagtaat atggggaaag     1620

actcctaaat ttaaattacc catacaaaag gaaacatggg aagcatggtg gacagagtat     1680

tggcaagcca cctggattcc tgagtgggag tttgtcaata cccctccctt agtgaagtta     1740

tggtaccagt tagagaaaga acccataata ggagcagaaa ctttctatgt agatggggca     1800

gccaataggg aaactaaatt aggaaaagca ggatatgtaa ctgacagagg aagacaaaaa     1860

gttgtccccc taacggacac aacaaatcag aagactgagt tacaagcaat tcatctagct     1920

ttgcaggatt cgggattaga agtaaacata gtgacagact cacaatatgc attgggaatc     1980

attcaagcac aaccagataa gagtgaatca gagttagtca gtcaaataat agagcagtta     2040

ataaaaaagg aaaaagtcta cctggcatgg gtaccagcac acaaaggaat tggaggaaat     2100

gaacaagtag ataaattggt cagtgctgga atcaggaaag tactattttt agatggaata     2160

gataaggccc aagaagaaca tgagaaatat cacagtaatt ggagagcaat ggctagtgat     2220

tttaacctac cacctgtagt agcaaaagaa atagtagcca gctgtgataa atgtcagcta     2280

aaaggggaag ccatgcatgg acaagtagac tgtagcccag gaatatggca gctagattgt     2340

acacatttag aaggaaaagt tatcttggta gcagttcatg tagccagtgg atatatagaa     2400

gcagaagtaa ttccagcaga gacagggcaa gaaacagcat acttcctctt aaaattagca     2460

ggaagatggc cagtaaaaac agtacataca gacaatggca gcaatttcac cagtactaca     2520

gttaaggccg cctgttggtg ggcggggatc aagcaggaat ttggcattcc ctacaatccc     2580

caaagtcaag gagtaataga atctatgaat aaagaattaa agaaaattat aggacaggta     2640

agagatcagg ctgaacatct taagacagca gtacaaatgg cagtattcat ccacaatttt     2700

aaaagaaaag gggggattgg ggggtacagt gcaggggaaa gaatagtaga cataatagca     2760

acagacatac aaactaaaga attacaaaaa caaattacaa aaattcaaaa ttttcgggtt     2820

tattacaggg acagcagaga tccagtttgg aaaggaccag caaagctcct ctggaaaggt     2880

gaaggggcag tagtaataca agataatagt gacataaaag tagtgccaag aagaaaagca     2940

aagatcatca gggattatgg aaaacagatg gcaggtgatg attgtgtggc aagtagacag     3000

gatgaggatt aa                                                         3012


<210>  50
<211>  1003
<212>  PRT
<213>  artificial sequence

<220>
<223>  HIV-1 pol protein sequence from plasmid pNL4-3 (GenBank accession
       no. AAK08484.2)

<400>  50

Phe Phe Arg Glu Asp Leu Ala Phe Pro Gln Gly Lys Ala Arg Glu Phe 
1               5                   10                  15      


Ser Ser Glu Gln Thr Arg Ala Asn Ser Pro Thr Arg Arg Glu Leu Gln 
            20                  25                  30          


Val Trp Gly Arg Asp Asn Asn Ser Leu Ser Glu Ala Gly Ala Asp Arg 
        35                  40                  45              


Gln Gly Thr Val Ser Phe Ser Phe Pro Gln Ile Thr Leu Trp Gln Arg 
    50                  55                  60                  


Pro Leu Val Thr Ile Lys Ile Gly Gly Gln Leu Lys Glu Ala Leu Leu 
65                  70                  75                  80  


Asp Thr Gly Ala Asp Asp Thr Val Leu Glu Glu Met Asn Leu Pro Gly 
                85                  90                  95      


Arg Trp Lys Pro Lys Met Ile Gly Gly Ile Gly Gly Phe Ile Lys Val 
            100                 105                 110         


Arg Gln Tyr Asp Gln Ile Leu Ile Glu Ile Cys Gly His Lys Ala Ile 
        115                 120                 125             


Gly Thr Val Leu Val Gly Pro Thr Pro Val Asn Ile Ile Gly Arg Asn 
    130                 135                 140                 


Leu Leu Thr Gln Ile Gly Cys Thr Leu Asn Phe Pro Ile Ser Pro Ile 
145                 150                 155                 160 


Glu Thr Val Pro Val Lys Leu Lys Pro Gly Met Asp Gly Pro Lys Val 
                165                 170                 175     


Lys Gln Trp Pro Leu Thr Glu Glu Lys Ile Lys Ala Leu Val Glu Ile 
            180                 185                 190         


Cys Thr Glu Met Glu Lys Glu Gly Lys Ile Ser Lys Ile Gly Pro Glu 
        195                 200                 205             


Asn Pro Tyr Asn Thr Pro Val Phe Ala Ile Lys Lys Lys Asp Ser Thr 
    210                 215                 220                 


Lys Trp Arg Lys Leu Val Asp Phe Arg Glu Leu Asn Lys Arg Thr Gln 
225                 230                 235                 240 


Asp Phe Trp Glu Val Gln Leu Gly Ile Pro His Pro Ala Gly Leu Lys 
                245                 250                 255     


Gln Lys Lys Ser Val Thr Val Leu Asp Val Gly Asp Ala Tyr Phe Ser 
            260                 265                 270         


Val Pro Leu Asp Lys Asp Phe Arg Lys Tyr Thr Ala Phe Thr Ile Pro 
        275                 280                 285             


Ser Ile Asn Asn Glu Thr Pro Gly Ile Arg Tyr Gln Tyr Asn Val Leu 
    290                 295                 300                 


Pro Gln Gly Trp Lys Gly Ser Pro Ala Ile Phe Gln Cys Ser Met Thr 
305                 310                 315                 320 


Lys Ile Leu Glu Pro Phe Arg Lys Gln Asn Pro Asp Ile Val Ile Tyr 
                325                 330                 335     


Gln Tyr Met Asp Asp Leu Tyr Val Gly Ser Asp Leu Glu Ile Gly Gln 
            340                 345                 350         


His Arg Thr Lys Ile Glu Glu Leu Arg Gln His Leu Leu Arg Trp Gly 
        355                 360                 365             


Phe Thr Thr Pro Asp Lys Lys His Gln Lys Glu Pro Pro Phe Leu Trp 
    370                 375                 380                 


Met Gly Tyr Glu Leu His Pro Asp Lys Trp Thr Val Gln Pro Ile Val 
385                 390                 395                 400 


Leu Pro Glu Lys Asp Ser Trp Thr Val Asn Asp Ile Gln Lys Leu Val 
                405                 410                 415     


Gly Lys Leu Asn Trp Ala Ser Gln Ile Tyr Ala Gly Ile Lys Val Arg 
            420                 425                 430         


Gln Leu Cys Lys Leu Leu Arg Gly Thr Lys Ala Leu Thr Glu Val Val 
        435                 440                 445             


Pro Leu Thr Glu Glu Ala Glu Leu Glu Leu Ala Glu Asn Arg Glu Ile 
    450                 455                 460                 


Leu Lys Glu Pro Val His Gly Val Tyr Tyr Asp Pro Ser Lys Asp Leu 
465                 470                 475                 480 


Ile Ala Glu Ile Gln Lys Gln Gly Gln Gly Gln Trp Thr Tyr Gln Ile 
                485                 490                 495     


Tyr Gln Glu Pro Phe Lys Asn Leu Lys Thr Gly Lys Tyr Ala Arg Met 
            500                 505                 510         


Lys Gly Ala His Thr Asn Asp Val Lys Gln Leu Thr Glu Ala Val Gln 
        515                 520                 525             


Lys Ile Ala Thr Glu Ser Ile Val Ile Trp Gly Lys Thr Pro Lys Phe 
    530                 535                 540                 


Lys Leu Pro Ile Gln Lys Glu Thr Trp Glu Ala Trp Trp Thr Glu Tyr 
545                 550                 555                 560 


Trp Gln Ala Thr Trp Ile Pro Glu Trp Glu Phe Val Asn Thr Pro Pro 
                565                 570                 575     


Leu Val Lys Leu Trp Tyr Gln Leu Glu Lys Glu Pro Ile Ile Gly Ala 
            580                 585                 590         


Glu Thr Phe Tyr Val Asp Gly Ala Ala Asn Arg Glu Thr Lys Leu Gly 
        595                 600                 605             


Lys Ala Gly Tyr Val Thr Asp Arg Gly Arg Gln Lys Val Val Pro Leu 
    610                 615                 620                 


Thr Asp Thr Thr Asn Gln Lys Thr Glu Leu Gln Ala Ile His Leu Ala 
625                 630                 635                 640 


Leu Gln Asp Ser Gly Leu Glu Val Asn Ile Val Thr Asp Ser Gln Tyr 
                645                 650                 655     


Ala Leu Gly Ile Ile Gln Ala Gln Pro Asp Lys Ser Glu Ser Glu Leu 
            660                 665                 670         


Val Ser Gln Ile Ile Glu Gln Leu Ile Lys Lys Glu Lys Val Tyr Leu 
        675                 680                 685             


Ala Trp Val Pro Ala His Lys Gly Ile Gly Gly Asn Glu Gln Val Asp 
    690                 695                 700                 


Lys Leu Val Ser Ala Gly Ile Arg Lys Val Leu Phe Leu Asp Gly Ile 
705                 710                 715                 720 


Asp Lys Ala Gln Glu Glu His Glu Lys Tyr His Ser Asn Trp Arg Ala 
                725                 730                 735     


Met Ala Ser Asp Phe Asn Leu Pro Pro Val Val Ala Lys Glu Ile Val 
            740                 745                 750         


Ala Ser Cys Asp Lys Cys Gln Leu Lys Gly Glu Ala Met His Gly Gln 
        755                 760                 765             


Val Asp Cys Ser Pro Gly Ile Trp Gln Leu Asp Cys Thr His Leu Glu 
    770                 775                 780                 


Gly Lys Val Ile Leu Val Ala Val His Val Ala Ser Gly Tyr Ile Glu 
785                 790                 795                 800 


Ala Glu Val Ile Pro Ala Glu Thr Gly Gln Glu Thr Ala Tyr Phe Leu 
                805                 810                 815     


Leu Lys Leu Ala Gly Arg Trp Pro Val Lys Thr Val His Thr Asp Asn 
            820                 825                 830         


Gly Ser Asn Phe Thr Ser Thr Thr Val Lys Ala Ala Cys Trp Trp Ala 
        835                 840                 845             


Gly Ile Lys Gln Glu Phe Gly Ile Pro Tyr Asn Pro Gln Ser Gln Gly 
    850                 855                 860                 


Val Ile Glu Ser Met Asn Lys Glu Leu Lys Lys Ile Ile Gly Gln Val 
865                 870                 875                 880 


Arg Asp Gln Ala Glu His Leu Lys Thr Ala Val Gln Met Ala Val Phe 
                885                 890                 895     


Ile His Asn Phe Lys Arg Lys Gly Gly Ile Gly Gly Tyr Ser Ala Gly 
            900                 905                 910         


Glu Arg Ile Val Asp Ile Ile Ala Thr Asp Ile Gln Thr Lys Glu Leu 
        915                 920                 925             


Gln Lys Gln Ile Thr Lys Ile Gln Asn Phe Arg Val Tyr Tyr Arg Asp 
    930                 935                 940                 


Ser Arg Asp Pro Val Trp Lys Gly Pro Ala Lys Leu Leu Trp Lys Gly 
945                 950                 955                 960 


Glu Gly Ala Val Val Ile Gln Asp Asn Ser Asp Ile Lys Val Val Pro 
                965                 970                 975     


Arg Arg Lys Ala Lys Ile Ile Arg Asp Tyr Gly Lys Gln Met Ala Gly 
            980                 985                 990         


Asp Asp Cys Val Ala Ser Arg Gln  Asp Glu Asp 
        995                 1000             


<210>  51
<211>  261
<212>  DNA
<213>  artificial sequence

<220>
<223>  HIV-1 Tat coding sequence from plasmid pNL4-3 (GenBank accession 
       no. AF324493.2)

<400>  51
atggagccag tagatcctag actagagccc tggaagcatc caggaagtca gcctaaaact       60

gcttgtacca attgctattg taaaaagtgt tgctttcatt gccaagtttg tttcatgaca      120

aaagccttag gcatctccta tggcaggaag aagcggagac agcgacgaag agctcatcag      180

aacagtcaga ctcatcaagc ttctctatca aagcaaccca cctcccaatc ccgaggggac      240

ccgacaggcc cgaaggaata g                                                261


<210>  52
<211>  86
<212>  PRT
<213>  artificial sequence

<220>
<223>  HIV-1 Tat protein sequence from plasmid pNL4-3 (GenBank accession
       no. AAK08486.1)

<400>  52

Met Glu Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser 
1               5                   10                  15      


Gln Pro Lys Thr Ala Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Phe 
            20                  25                  30          


His Cys Gln Val Cys Phe Met Thr Lys Ala Leu Gly Ile Ser Tyr Gly 
        35                  40                  45              


Arg Lys Lys Arg Arg Gln Arg Arg Arg Ala His Gln Asn Ser Gln Thr 
    50                  55                  60                  


His Gln Ala Ser Leu Ser Lys Gln Pro Thr Ser Gln Ser Arg Gly Asp 
65                  70                  75                  80  


Pro Thr Gly Pro Lys Glu 
                85      


<210>  53
<211>  351
<212>  DNA
<213>  artificial sequence

<220>
<223>  HIV-1 Rev coding sequence from plasmid pNL4-3 (GenBank accession 
       no. AF324493.2)

<400>  53
atggcaggaa gaagcggaga cagcgacgaa gagctcatca gaacagtcag actcatcaag       60

cttctctatc aaagcaaccc acctcccaat cccgagggga cccgacaggc ccgaaggaat      120

agaagaagaa ggtggagaga gagacagaga cagatccatt cgattagtga acggatcctt      180

agcacttatc tgggacgatc tgcggagcct gtgcctcttc agctaccacc gcttgagaga      240

cttactcttg attgtaacga ggattgtgga acttctggga cgcagggggt gggaagccct      300

caaatattgg tggaatctcc tacagtattg gagtcaggaa ctaaagaata g               351


<210>  54
<211>  116
<212>  PRT
<213>  artificial sequence

<220>
<223>  HIV-1 Rev protein sequence from plasmid pNL4-3 (GenBank accession
       no. AAK08487.1)

<400>  54

Met Ala Gly Arg Ser Gly Asp Ser Asp Glu Glu Leu Ile Arg Thr Val 
1               5                   10                  15      


Arg Leu Ile Lys Leu Leu Tyr Gln Ser Asn Pro Pro Pro Asn Pro Glu 
            20                  25                  30          


Gly Thr Arg Gln Ala Arg Arg Asn Arg Arg Arg Arg Trp Arg Glu Arg 
        35                  40                  45              


Gln Arg Gln Ile His Ser Ile Ser Glu Arg Ile Leu Ser Thr Tyr Leu 
    50                  55                  60                  


Gly Arg Ser Ala Glu Pro Val Pro Leu Gln Leu Pro Pro Leu Glu Arg 
65                  70                  75                  80  


Leu Thr Leu Asp Cys Asn Glu Asp Cys Gly Thr Ser Gly Thr Gln Gly 
                85                  90                  95      


Val Gly Ser Pro Gln Ile Leu Val Glu Ser Pro Thr Val Leu Glu Ser 
            100                 105                 110         


Gly Thr Lys Glu 
        115     


<210>  55

<400>  55
000

<210>  56

<400>  56
000

<210>  57
<211>  46
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 5'-BsmBI-Aichi68-NP for amplification

<400>  57
catgatcgtc tcagggagca aaagcagggt agataatcac tcacag                      46


<210>  58
<211>  43
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  primer 3'-BsmBI-Aichi68-NP for amplification

<400>  58
catgatcgtc tcgtattagt agaaacaagg gtatttttct tta                         43


